• Skip to main content
  • Skip to secondary menu
  • Skip to primary sidebar
  • Skip to footer
Montreal AI Ethics Institute

Montreal AI Ethics Institute

Democratizing AI ethics literacy

  • Articles
    • Public Policy
    • Privacy & Security
    • Human Rights
      • Ethics
      • JEDI (Justice, Equity, Diversity, Inclusion
    • Climate
    • Design
      • Emerging Technology
    • Application & Adoption
      • Health
      • Education
      • Government
        • Military
        • Public Works
      • Labour
    • Arts & Culture
      • Film & TV
      • Music
      • Pop Culture
      • Digital Art
  • Columns
    • AI Policy Corner
    • Recess
    • Tech Futures
  • The AI Ethics Brief
  • AI Literacy
    • Research Summaries
    • AI Ethics Living Dictionary
    • Learning Community
  • The State of AI Ethics Report
    • Volume 7 (November 2025)
    • Volume 6 (February 2022)
    • Volume 5 (July 2021)
    • Volume 4 (April 2021)
    • Volume 3 (Jan 2021)
    • Volume 2 (Oct 2020)
    • Volume 1 (June 2020)
  • About
    • Our Contributions Policy
    • Our Open Access Policy
    • Contact
    • Donate

Implications of Distance over Redistricting Maps: Central and Outlier Maps

August 6, 2023

🔬 Research Summary by Seyed A. Esmaeili, who received his PhD in Computer Science from the University of Maryland, College Park. He will join the Simons Laufer Mathematical Sciences Institute at Berkeley and the Data Science Institute at the University of Chicago as a Postdoc.

[Original paper by Seyed A. Esmaeili, Darshan Chakrabarti, Hayley Grape, and Brian Brubach]


Overview: In recent years, computational and sampling methods have had a significant impact on redistricting and gerrymandering. In this paper, we introduce a distance measure over redistricting maps and study its implications. First, we identify a central map that mirrors the Kemeny ranking in a scenario where a committee is voting over a collection of redistricting maps to be drawn. Second, we also show how our distance measure can be used to possibly detect gerrymandered maps. Specifically, some redistricting instances known to be gerrymandered are found to be outliers, being in the 99th percentile in terms of their distance from our central maps.  


Introduction

Redistricting is the process of partitioning a state into a collection of districts that satisfy a set of constraints. These constraints, however, are not stringent and allow for an enormous number of valid redistricting maps, hence the possibility of gerrymandering. Despite the fact that gerrymandering dates back at least two centuries, it is only in the last decade that mathematical and computational methods have had a significant impact. In a nutshell, the works of Chikina et al., DeFord et al., and Herschlag et al. introduced sampling methods that have been and continue to be used to detect possible gerrymandering.  Specifically, these sampling methods produce a large ensemble of valid redistricting maps, which can then be used to estimate various informative statistics, such as the election histogram. Therefore, one can, for example, use the election histogram and argue that a given map is gerrymandered because of the rarity of its election outcome.   

This paper introduces a distance measure to analyze an ensemble of redistricting maps. We focus on two major implications of this distance. First, we define a central map that may be considered “most typical” and give a rigorous justification for it by showing that it mirrors the Kemeny ranking in a scenario where a committee is voting over a collection of redistricting maps to be drawn. Second, we show that a by-product of our framework is the ability to detect some gerrymandered instances since they can have very large distances from our central maps compared to the entire ensemble. 

Key Insights

Distance over Redistricting Maps

Since there is a large ensemble of valid redistricting maps, introducing a distance measure over these maps seems like a reasonable idea that can help tame the difficulty in analyzing them. Interestingly, two immediate implications of having a distance over a collection of objects are the existence of a central object which lies at a minimum distance from the collection and an outlier object that lies further away from the collection. But before delving into these implications, we describe our distance measure over redistricting maps.

Given two maps, M1 and M2, we create an adjacency matrix for each by connecting voting blocks (vertices) in the same district by an edge. Each edge has a non-negative weight associated with it that we can choose. The distance is simply the minimum total weight of edges that must be deleted and added to obtain one of the maps starting from the other, as shown in the figure.  

The Medoid Map and its Justification 

As we noted earlier, now that we have defined a distance measure, a ”central” redistricting map must exist that minimizes the distance from the ensemble, which we call the medoid map. Although, at an intuitive level, it seems interesting to compute, it is worthwhile to wonder if the medoid map has deeper utility and a rigorous justification. We find that it actually mirrors the Kemeny ranking.    

More specifically, suppose that we have a committee voting over a collection of maps to select one to be drawn. If we were to weigh the distance from each map by the total number of votes it receives, the medoid map would be the map that minimizes the sum of weighted distances. This definition of the medoid redistricting map is essentially the mirror of the Kemeny ranking if the committee voted on rankings instead. We actually show that, in general, it is computationally hard to obtain the medoid map, but we introduce a heuristic method for finding it. 

Outlier and Gerrymandered Maps

As we noted earlier, the sampling methods of Chikina et al., DeFord et al., and Herschlag et al. have had a major impact by showing that certain maps are gerrymandered since their election outcomes are rare. 

Now that we have a distance measure, we can also detect maps that are outliers in terms of their distance from the ensemble or a central map. The idea of flagging a map as an outlier because of an abnormal measure (abnormal distance) connects to anomaly detection in machine learning. Further, it would be interesting to see if gerrymandered maps are also outliers regarding their distance. We arrive at a very interesting result. Specifically, as the figure below shows, for both Pennsylvania and North Carolina, maps that are considered to be gerrymandered are found to be outliers in terms of distance, whereas remedial maps are much better behaved. At a more precise level, we find these gerrymandered maps to have distances in the 99th percentile across repeated experiments consistently. Although this distance-based method is not likely to be able to detect all instances of gerrymandering, we believe it represents significant progress since it does not rely on election results at all.

Between the lines

The distance measure we introduced is both tractable and has an intuitive edit distance interpretation. However, it is worthwhile to wonder if this distance measure or another possible distance over redistricting maps can be justified in a rigorous manner based on an axiomatic approach. 

We believe that a major contribution of our work is that it has demonstrated that outlier behavior can be detected in some gerrymandered instances without using election results in a rigorous manner by giving the distance percentile of the map. Arguing that a gerrymandered map is an outlier without using election results has a clear advantage. It would be interesting to see if other methods that do not use election results can also be used to detect gerrymandering.

References 

Chikina, M., Frieze, A., and Pegden, W. (2017). Assessing significance in a markov chain without mixing. Proceedings of the National Academy of Sciences, 114(11):2860–2864.

DeFord, D., Duchin, M., and Solomon, J. (2019). Recombination: A family of markov chains for redistricting. arXiv preprint arXiv:1911.05725.

Herschlag, G., Kang, H. S., Luo, J., Graves, C. V., Bangia, S., Ravier, R., and Mattingly, J. C. (2020). Quantifying gerrymandering in north carolina. Statistics and Public Policy, 7(1):30–38

Want quick summaries of the latest research & reporting in AI ethics delivered to your inbox? Subscribe to the AI Ethics Brief. We publish bi-weekly.

Primary Sidebar

🔍 SEARCH

Spotlight

This image is a collage with a colourful Japanese vintage landscape showing a mountain, hills, flowers and other plants and a small stream. There are 3 large black data servers placed in the bottom half of the image, with a cloud of black smoke emitting from them, partly obscuring the scenery.

Tech Futures: Crafting Participatory Tech Futures

A network diagram with lots of little emojis, organised in clusters.

Tech Futures: AI For and Against Knowledge

A brightly coloured illustration which can be viewed in any direction. It has many elements to it working together: men in suits around a table, someone in a data centre, big hands controlling the scenes and holding a phone, people in a production line. Motifs such as network diagrams and melting emojis are placed throughout the busy vignettes.

Tech Futures: The Fossil Fuels Playbook for Big Tech: Part II

A rock embedded with intricate circuit board patterns, held delicately by pale hands drawn in a ghostly style. The contrast between the rough, metallic mineral and the sleek, artificial circuit board illustrates the relationship between raw natural resources and modern technological development. The hands evoke human involvement in the extraction and manufacturing processes.

Tech Futures: The Fossil Fuels Playbook for Big Tech: Part I

Close-up of a cat sleeping on a computer keyboard

Tech Futures: The threat of AI-generated code to the world’s digital infrastructure

related posts

  • Research summary: Artificial Intelligence: The Ambiguous Labor Market Impact of Automating Predictio...

    Research summary: Artificial Intelligence: The Ambiguous Labor Market Impact of Automating Predictio...

  • Prompt Middleware: Helping Non-Experts Engage with Generative AI

    Prompt Middleware: Helping Non-Experts Engage with Generative AI

  • Artificial Intelligence as a Force for Good

    Artificial Intelligence as a Force for Good

  • How Canada can be a global leader in ethical AI

    How Canada can be a global leader in ethical AI

  • Response to Scotland's AI Strategy

    Response to Scotland's AI Strategy

  • AI For Good Global Summit Interview (ITU/UN) 2018

    AI For Good Global Summit Interview (ITU/UN) 2018

  • The Artificiality of AI – Why are We Letting Machines Manage Employees?

    The Artificiality of AI – Why are We Letting Machines Manage Employees?

  • Reflections from Microsoft's Ignite The Tour

    Reflections from Microsoft's Ignite The Tour

  • Montreal AI Symposium Presentation at Polytechnique

    Montreal AI Symposium Presentation at Polytechnique

  • AI Ethics During Warfare: An Evolving Paradox

    AI Ethics During Warfare: An Evolving Paradox

Partners

  •  
    U.S. Artificial Intelligence Safety Institute Consortium (AISIC) at NIST

  • Partnership on AI

  • The LF AI & Data Foundation

  • The AI Alliance

Footer


Articles

Columns

AI Literacy

The State of AI Ethics Report


 

About Us


Founded in 2018, the Montreal AI Ethics Institute (MAIEI) is an international non-profit organization equipping citizens concerned about artificial intelligence and its impact on society to take action.

Contact

Donate


  • © 2025 MONTREAL AI ETHICS INSTITUTE.
  • This work is licensed under a Creative Commons Attribution 4.0 International License.
  • Learn more about our open access policy here.
  • Creative Commons License

    Save hours of work and stay on top of Responsible AI research and reporting with our bi-weekly email newsletter.