Documentation for multi.py (Bayesian audit support program)

Note: This repo "audit-lab" was created initially 2017-09-04 as a copy of www.github.com/ron-rivest/2017-bayes-audit/2017-code As of 2017-09-04, nothing else has been changed. It is just a copy. This repo was initialized with the python code this README, and some data files. But it does not contain the git history of how it got to this point. If you are interested in that, or more links on Bayesian audits, see the original repo (which is public) www.github.com/ron-rivest/2017-bayes-audit

This repo will be used by students in a Fall 2017 UC Berkeley course taught by Philip Stark.

Copied material starts here.

Documentation for multi.py (Bayesian audit support program)

multi.py is Python3 software (or suite of programs) to support the post-election auditing of elections with multiple contests and multiple separately-managed collections of paper ballots.

The software is designed to be helpful for auditing elections such as the November 2017 Colorado election, which has hundreds of contests spread across 64 counties.

This README file is a design document, not a description of what the code does yet. The code here is still in progress and only partially implements this design.

Overview
Election and audit
Scanning of cast paper ballots
Auditing
Audit workflow
Implementation notes: identifiers, votes, file names, and directory structure
(Pre-election) Election specification.
Reported data (ballot manifests, CVRs and outcomes)
Audit details
- Audit setup
- Dialogue between Audit Central and Collection Managers
Command-line interface
Appendix: File names
Appendix (Possible future work)
- Compression

Overview

The system described here is a collection of Python 3 modules to support post-election audits, especially "risk-limiting" audits of both the Bayesian and frequentist style.

On the one hand, this is an experimental platform designed to facilitate research into post-election audits. It is an "election lab" (a term suggested by Philip Stark) that can be easily extended or configured to run experiments.

On the other hand, we hope that the code will be robust, usable, and scalable enough that it can be adapted or ported for use in real post-election audits.

The code emphasizes the case (as in Colorado 2017) where there is an Audit Central (run by the Secretary of State) coordinating audits all across a state, where the paper ballots are in collections managed by county-level election officials. For contests that span several counties, the audit needs to guide the relevant county-level election officials regarding the random sampling of ballots from their collections, to aggregate the resulting audit data, and to compute whether the desired risk limits have been met.

The current design is "file-based": CSV (comma-separated values) format files are used as the main interface data structure for all data, as it is both human and machine readable, and commonly used in the election community.

The code has the capability of generating large and complex "synthetic" data sets for testing and experimental purposes.

Some of the planned experiments include:

comparing different approaches for choosing Bayesian priors,
comparing frequentist and Bayesian risk-limiting audit methods, and
testing the scalability of the Bayesian approach.

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
elections		elections
experiments/2017-09-17-scalability		experiments/2017-09-17-scalability
test_data		test_data
tests		tests
.coverage		.coverage
.coveralls.yml		.coveralls.yml
.gitignore		.gitignore
.travis.yml		.travis.yml
README.md		README.md
audit.py		audit.py
audit_orders.py		audit_orders.py
cli.py		cli.py
csv_readers.py		csv_readers.py
csv_writers.py		csv_writers.py
election_spec.py		election_spec.py
groups.py		groups.py
ids.py		ids.py
multi.py		multi.py
outcomes.py		outcomes.py
planner.py		planner.py
reported.py		reported.py
risk_bayes.py		risk_bayes.py
risk_bayes_2.py		risk_bayes_2.py
risk_frequentist.py		risk_frequentist.py
saved_state.py		saved_state.py
scalability_test.py		scalability_test.py
snapshot.py		snapshot.py
syn.py		syn.py
syn1.py		syn1.py
syn2.py		syn2.py
test_csv_readers.py		test_csv_readers.py
utils.py		utils.py

Attribute	Value
Election name	Colorado 2017 General Election
Election dirname	CO-2017-11-07
Election date	2017-11-07
Election URL	https://sos.co.gov/election/2017-11-07/

Contest	Contest type	Write-ins	Selections
Denver Prop 1	Plurality	No	Yes	No
Denver Prop 2	Plurality	No	Yes	No
Denver Mayor	Plurality	Qualified	John Smith	Bob Cat	Mary Mee	+Jack Frost
Denver Clerk	Plurality	No	Yet Again	New Guy
Logan Mayor	Plurality	Arbitrary	Susan Hat	Barry Su	Benton Liu
Logan Water	Plurality	No	Yes	No
U.S. President	Plurality	Arbitrary	Don Brown	Larry Pew
U.S. Senate 1	Plurality	Qualified	Deb O'Crat	Rhee Pub	Val Green	+Tom Cruz
U.S. Senate 2	Plurality	Qualified	Term Three	Old Guy	+Hot Stuff
CO Prop A	Plurality	No	Yes	No

Contest group	Contest(s) or group(s)
FEDERAL	U.S. President	U.S. Senate 1	U.S. Senate 2
STATE	CO Prop A
FED STATE	FEDERAL	STATE
DENVER LOCAL	Denver Mayor	Denver Clerk	Denver Prop 1	Denver Prop 2
DENVER	FED STATE	DENVER LOCAL
LOGAN REQ	FED STATE	Logan Mayor
LOGAN POSS	Logan Water

Collection	Manager	CVR type	Required Contests	Possible Contests
DEN-A01	[email protected]	CVR	DENVER	DENVER
DEN-A02	[email protected]	CVR	DENVER	DENVER
LOG-B13	[email protected]	noCVR	LOGAN REQ	LOGAN POSS

Collection	Box	Position	Stamp	Ballot id	Number of ballots	Required Contests	Possible Contests	Comments
LOG-B13	B	1	XY04213	B-0001	1
LOG-B13	B	2	XY04214	B-0002	1
LOG-B13	B	3	XY04215	B-0003	1
LOG-B13	C	1	QE55311	C-0001	3	FEDERAL	FEDERAL
LOG-B13	D	1		D-0001	50
LOG-B13	E	1	FF91320	E-0200	50
LOG-B13	F	1	JS23334	F-0001	1			See Doc. #211

Collection	Scanner	Ballot id	Contest	Selections
DEN-A01	FG231	B-231	Denver Prop 1	Yes
DEN-A01	FG231	B-231	Denver Prop 2
DEN-A01	FG231	B-231	U.S. Senate 1	Rhee Pub	Deb O'Crat
DEN-A01	FG231	B-777	Denver Prop 1	No
DEN-A01	FG231	B-777	Denver Prop 2	Yes
DEN-A01	FG231	B-777	U.S. Senate 1	+Tom Cruz
DEN-A01	FG231	B-888	U.S. Senate 1	-Invalid

Measurement id	Contest	Risk Measurement Method	Risk Limit	Risk Upset Threshold	Sampling Mode	Initial Status
1	Denver Prop 1	Bayes	0.05	0.99	Active	Open
2	Denver Prop 2	Bayes	1.00	1.00	Opportunistic	Open
3	DEN-mayor	Bayes	0.05	0.99	Active	Open
4	LOG-mayor	Bayes	0.05	0.99	Active	Off
5	U.S. Senate 1	Bayes	0.05	0.99	Active	Open
6	Boulder-clerk	Bayes	1.00	0.99	Active	Open
7	Boulder-council	Bayes	1.00	0.99	Active	Open
8	Boulder-council	Frequentist	0.05	1.00	Opportunistic	Open

Filename	Hash
`11-general-2017-09-08.csv`	`ca978112ca1bbdcafac231b39a23dc4da786eff8147c4e72b9807785afee48bb`
`12-contests-2017-09-08.csv`	`3e23e8160039594a33894f6564e1b1348bbd7a0088d42c4acb73eeaed59c009d`
`14-collections-2017-09-08.csv`	`2e7d2c03a9507ae265ecf5b5356885a53393a2029d241394997265a1a25aefc6`
...	...
`audited-votes-LOG-B13-2017-11-22.csv`	`18ac3e7343f016890c510e93f935261169d9e3f565436429830faf0934f4f8e4`
`23-reported-outcomes-2017-11-07.csv`	`252f10c83610ebca1a059c0bae8255eba2f95be4d1d7bcfa89d7248a82d9f111`
...
`12-audit-parameters-collection-2017-11-22.csv`	`3f79bb7b435b05321651daefd374cdc681dc06faa65e374e38337b88ca046dea`

Command	Action
`python3 multi.py --read_election_spec CO-2017-11`	Reads and checks election spec
`python3 multi.py --read_reported CO-2017-11`	Reads and checks reported data
`python3 multi.py --read_seed CO-2017-11`	Reads and checks audit seed
`python3 multi.py --make_audit orders CO-2017-11`	Produces initial audit order files
`python3 multi.py --read_audited CO-2017-11`	Reads and checks audited votes
`python3 multi.py --audit CO-2017-11`	Runs audit
`python3 multi.py --audit --pause CO-2017-11`	Runs audit, pausing after each stage

Collection	Audited so far	Next stage increment request	Estimated total needed
DEN-A01	150	50	300
DEN-A02	150	50	300
LOG-B13	90	30	150

lisajian/audit-lab

Folders and files

Latest commit

History

Repository files navigation

Documentation for multi.py (Bayesian audit support program)

Table of contents

Overview

Election and audit

Scanning of cast paper ballots

Auditing

Audit workflow

Pre-election

Election

Audit

Implementation notes: identifiers, votes, file names, and directory structure

Identifiers

Votes

File formats

Directory structure

(Pre-election) Election specification.

Election specification general file

Contests file

Contest groups file

Collections file

Reported data: (ballot manifests, CVRs, and outcomes)

Reported ballot manifest files

Reported CVRs file

Reported outcomes file

Audit details

Audit setup

Global audit parameters

Contest audit parameters

Collection audit parameters

Audit seed file

Dialogue between Audit Central and Collection Managers

Audit order file

Audited votes

Output file formats

Audit snapshot file

Audit output file(s)

Audit plan file

Command-line interface

Appendix: File names

Appendix (Possible Future Work)

Compression

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages