RunDMCMC

RunDMCMC implements Markov chain Monte Carlo exploration of districting plans: starting from an initial districting plan, it explores the space of nearby plans to give some idea of the degree of gerrymandering. It is a Python rewrite of the chain C++ program (https://github.com/gerrymandr/cfp_mcmc), originally written by Maria Chikina, Alan Frieze and Wesley Pegden for their paper "Assessing significance in a Markov chain without mixing" (http://www.pnas.org/content/114/11/2860).

Installation

Ideally, the following conda command will work:

conda install -c gerrymandr rundmcmc

Should our release system be broken, cloning this repository and manually running setup.py will also work:

git clone https://github.com/gerrymandr/RunDMCMC.git
cd RunDMCMC
python3 setup.py install
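
Whichever route you take, it can be handy to confirm that Python can actually find the package afterwards. The snippet below is a small sketch using only the standard library; the helper name is_installed is ours and is not part of RunDMCMC.

```python
import importlib.util


def is_installed(package_name):
    """Return True if Python can locate a top-level package with this name."""
    # find_spec returns None when no such top-level module can be found.
    return importlib.util.find_spec(package_name) is not None


print("rundmcmc installed:", is_installed("rundmcmc"))
```

If this prints False, the install most likely went into a different Python environment than the one you are running.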

Example usage

Below is an example of using the chain. It uses the v0.1.0 release, which is rough around the edges, but usable.

import json

import geopandas as gp
import networkx.readwrite

from rundmcmc.defaults import BasicChain
from rundmcmc.make_graph import add_data_to_graph, get_assignment_dict
from rundmcmc.partition import Partition
from rundmcmc.scores import (efficiency_gap, final_report, mean_median,
                            mean_thirdian)
from rundmcmc.updaters import cut_edges, votes_updaters


def example_partition():
    df = gp.read_file("./testData/mo_cleaned_vtds.shp")

    with open("./testData/MO_graph.json") as f:
        graph_json = json.load(f)

    graph = networkx.readwrite.json_graph.adjacency_graph(graph_json)

    assignment = get_assignment_dict(df, "GEOID10", "CD")

    add_data_to_graph(df, graph, ['PR_DV08', 'PR_RV08'], id_col='GEOID10')

    updaters = {
        **votes_updaters(['PR_DV08', 'PR_RV08'], election_name='08'),
        'cut_edges': cut_edges
    }
    return Partition(graph, assignment, updaters)


def print_summary(partition, scores):
    print("")
    for name, score in scores.items():
        print(f"{name}: {score(partition, 'PR_DV08%')}")


def main():
    initial_partition = example_partition()

    chain = BasicChain(initial_partition, total_steps=100)

    scores = {
        'Efficiency Gap': efficiency_gap,
        'Mean-Median': mean_median,
        'Mean-Thirdian': mean_thirdian
    }

    for partition in chain:
        print_summary(partition, scores)


if __name__ == "__main__":
    main()

Using in an interactive Python session

Here's how you can use RunDMCMC in an interactive Python session. Navigate to the RunDMCMC/rundmcmc folder in a terminal, then run the ipython or python command to open an interactive session. Alternatively, this should work in the console window in Spyder.

Now we can start playing with Markov chains! First we'll import some things.

from rundmcmc.grid import Grid

The Grid class is a little helper class for playing around with grid examples.

grid = Grid((20,20))    # Make a 20x20 grid
print(grid)

You should see a grid made out of 0s, 1s, 2s, and 3s. Each number is the district that cell belongs to: by default, the Grid is partitioned into four equal quadrants.
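
To make the quadrant idea concrete, here is a minimal sketch of one way such an assignment could be built in plain Python. It is not the Grid class's actual code: the function names quadrant_assignment and render are hypothetical, and the real Grid may number or orient its quadrants differently.

```python
def quadrant_assignment(width, height):
    """Map each (x, y) cell to a district 0-3 based on its quadrant."""
    assignment = {}
    for x in range(width):
        for y in range(height):
            # Add 1 for the right half and 2 for the half with larger y.
            # (This numbering is an assumption, not Grid's documented scheme.)
            assignment[(x, y)] = (x >= width // 2) + 2 * (y >= height // 2)
    return assignment


def render(assignment, width, height):
    """Return the grid as text, one row of district labels per line."""
    return "\n".join(
        "".join(str(assignment[(x, y)]) for x in range(width))
        for y in range(height)
    )


print(render(quadrant_assignment(4, 4), 4, 4))
```

On a 4x4 grid this prints two rows of 0011 followed by two rows of 2233, which is the same four-quadrant pattern at a smaller scale.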

Running a chain

Now we can configure and run a MarkovChain.

from rundmcmc.chain import MarkovChain
from rundmcmc.proposals import propose_random_flip
from rundmcmc.validity import Validator, contiguous
from rundmcmc.accept import always_accept

is_valid = Validator([contiguous])

We'll configure a chain starting with grid, using the regular boundary flip proposal, validating that the districts are connected, and always accepting if the proposal is valid.

chain = MarkovChain(propose_random_flip, is_valid, always_accept, grid, total_steps=1000)

The MarkovChain in RunDMCMC is just a Python generator, so we can visit every state in the chain with a simple for loop.

for partition in chain:
    print(partition)

This should output a bunch of grids like before, but with the districts changing over time.
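
If you are curious how a propose / validate / accept loop fits into a generator, the sketch below shows the general shape. It is an illustration under our own assumptions, not RunDMCMC's implementation: toy_chain and the three helper functions are hypothetical names, the state is a single integer instead of a partition, and we assume invalid proposals are simply redrawn.

```python
import random


def toy_chain(propose, is_valid, accept, initial_state, total_steps):
    """Yield total_steps states, beginning with the initial state."""
    state = initial_state
    yield state
    for _ in range(total_steps - 1):
        # Keep proposing until a candidate passes validation.
        # (Redrawing on invalid proposals is an assumption of this sketch.)
        candidate = propose(state)
        while not is_valid(candidate):
            candidate = propose(state)
        # Move only if the acceptance function agrees; otherwise stay put.
        if accept(candidate):
            state = candidate
        yield state


rng = random.Random(0)  # seeded so the walk is reproducible


def propose_step(state):
    """Propose moving one unit left or right."""
    return state + rng.choice([-1, 1])


def within_bounds(state):
    """Toy validity check: stay between 0 and 10 inclusive."""
    return 0 <= state <= 10


def always_accept_toy(state):
    """Accept every valid proposal."""
    return True


states = list(toy_chain(propose_step, within_bounds, always_accept_toy, 5, total_steps=20))
print(states)
```

Because the chain is a generator, nothing is computed until you iterate over it, and each state is produced only when the loop asks for the next one.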

Making a histogram

Now we can make a histogram! The Grid class comes with a fake 'population' attribute. This attribute can be accessed as grid['population']. It is a dictionary from the districts to their populations. We'll make a histogram of the minimum district population at each step in the chain.

We'll import matplotlib to make the histogram, but feel free to use your favorite alternative.

import matplotlib.pyplot as plt

We can generate the data for our histogram using a simple list comprehension:

data = [min(partition['population'].values()) for partition in chain]

plt.hist(data)
plt.show()

The histogram should pop up in a new window. Yay!
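
If you don't have matplotlib handy, the same tally can be sketched with the standard library. The dictionaries below are made-up stand-ins for what partition['population'] might look like at five steps of a chain; they are not output from a real run.

```python
from collections import Counter

# Hypothetical district -> population dictionaries, one per chain step.
population_by_step = [
    {0: 100, 1: 100, 2: 100, 3: 100},
    {0: 99, 1: 101, 2: 100, 3: 100},
    {0: 98, 1: 102, 2: 100, 3: 100},
    {0: 99, 1: 101, 2: 100, 3: 100},
    {0: 99, 1: 101, 2: 99, 3: 101},
]

# Same list comprehension as above, applied to the stand-in data.
data = [min(populations.values()) for populations in population_by_step]

# Count how often each minimum occurs and draw one '#' per occurrence.
counts = Counter(data)
for value in sorted(counts):
    print(f"{value:>4} | {'#' * counts[value]}")
```

Each printed row is one bar of the histogram: the minimum district population on the left, and one mark per chain step where that minimum occurred.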
