Police misconduct settlements

February 22, 2021

This repo contains the data behind the story "Police Misconduct Costs Cities Millions Every Year, But That's Where The Accountability Ends."

Words of caution

As we describe in an accompanying article, this data has major issues. It constitutes our best guess at the amount of money that was paid out as settlements for police misconduct from 2010-19 (or for the range otherwise provided), but different cities have different ways of collecting, storing and categorizing such settlements. As a result, this data should not be compared across cities. We have no way of knowing (or checking) whether the kinds of misconduct this covers are comparable across cities. (For example, city officials in Boston likely interpreted our request more narrowly than those in New York City.) Descriptions of the types of misconduct are included where provided, but are also not comparable across cities. For this reason, while we are making all the data we obtained available, we are not providing it in an easy-to-use, collated data set. This is because we don’t want you to use it this way!

Moreover, while records within the same city might be more comparable across time, comparing them across time is also not encouraged. Data storage may have changed during this period, as have the people who entered the information, and much of that information is subject to human categorization based on judgement calls that cannot be assumed to have stayed constant over time. And even if data collection had stayed the same for the entire period, other factors (such as whether cities are likely to settle in response to police misconduct and, if so, for how much) may have changed such that the settlements themselves are a poor measure of police misconduct. For these reasons, we advise extreme caution when comparing this data across time within a city.

Folder structure

Each city folder contains:

  • an R script that cleans the data
  • an original/ folder that contains the original data as provided by the city
  • an intermediate/ folder, present only if we did any manual data cleaning, that contains the data after the manual cleaning step (e.g. an xlsx sheet into which data from a PDF was copied)
  • a final/ folder that contains the data output by the cleaning script, with columns described by the data dictionary below
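The layout above can be walked programmatically. The following is a minimal sketch, not part of the repo: the function name `list_final_files` and the `repo_root` argument are hypothetical, and it assumes only that each city folder sits directly under the root and contains a `final/` subfolder. It deliberately keeps each city's files separate rather than combining them, in line with the cautions above.

```python
from pathlib import Path


def list_final_files(repo_root):
    """Map each city folder name to the file names in its final/ subfolder.

    Assumption: city folders sit directly under repo_root. Folders without
    a final/ subfolder are skipped.
    """
    root = Path(repo_root)
    found = {}
    for city_dir in sorted(p for p in root.iterdir() if p.is_dir()):
        final_dir = city_dir / "final"
        if final_dir.is_dir():
            # Keep one entry per city; nothing is merged across cities.
            found[city_dir.name] = sorted(
                f.name for f in final_dir.iterdir() if f.is_file()
            )
    return found
```

Calling `list_final_files(".")` from the repo root would return one entry per city folder that has a `final/` subfolder.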

Data dictionary

| Variable name | Definition |
| --- | --- |
| incident_date | Date on which incident took place |
| incident_year | Pulled from incident_date |
| filed_date | Date claim or lawsuit was filed |
| filed_year | Pulled from filed_date |
| closed_date | Date at which settlement was reached OR paid (depending on what was provided) |
| calendar_year | Pulled from settlement date or closed_date |
| city | City name |
| state | State abbreviation |
| amount_awarded | Amount awarded to claimant in the settlement |
| other_expenses | Additional expenses, such as legal fees (e.g. in Charleston, North Charleston), when available |
| total_incurred | Total expenses: amount_awarded + other_expenses |
| collection | Whether money was collected? |
| case_outcome | Case status as of the date the data was collected, e.g. whether a case was settled, went to jury, or is still pending |
| docket_number | Case docket number, when available |
| claim_number | Claim number, when available |
| court | Court in which the settlement was reached, when available |
| plaintiff_name | Name of plaintiff/claimant |
| matter_name | Case name (generally of the form "Plaintiff v Defendant") |
| plaintiff_attorney | Legal representation of plaintiff |
| location | Location at which the incident happened, when available |
| summary_allegations | Description of allegations, sometimes aggregated into categories, sometimes very detailed. We retained as much detail as was available. Multiple allegations are separated by ";" |
| claim_or_lawsuit | Indicator of whether the entry was a claim or a lawsuit, when available |
| defendant | Name of defendant(s), when available. Sometimes a list of police officers was provided separately. |
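The relationship total_incurred = amount_awarded + other_expenses can be checked row by row. This is a hedged sketch rather than the repo's own validation: the helper names are hypothetical, each row is assumed to be a dict of strings (for example from `csv.DictReader`), amounts are assumed to be plain numbers without currency symbols or thousands separators, and a blank or "NA" other_expenses is treated as zero, which is an assumption and not something the data dictionary states.

```python
from decimal import Decimal, InvalidOperation


def to_decimal(value):
    """Parse a money field; blank, "NA", or unparseable values become None."""
    text = (value or "").strip()
    if not text or text.upper() == "NA":
        return None
    try:
        return Decimal(text)
    except InvalidOperation:
        return None


def total_matches(row):
    """Check total_incurred == amount_awarded + other_expenses for one row.

    Returns True or False when the row can be checked, and None when
    amount_awarded or total_incurred is missing.
    """
    awarded = to_decimal(row.get("amount_awarded"))
    total = to_decimal(row.get("total_incurred"))
    if awarded is None or total is None:
        return None
    # Assumption: a missing other_expenses value counts as zero.
    other = to_decimal(row.get("other_expenses")) or Decimal("0")
    return awarded + other == total
```

Rows for which `total_matches` returns `False` are worth inspecting by hand within a single city's file; the check says nothing about comparability across cities.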

FOIA text

This is the text of the FOIA request we sent to each city:

Dear Records Officer,

Pursuant to all laws and traditions governing the release of public records in your jurisdiction, I am requesting records related to any and all civil lawsuits brought forth against the [CITY] Police Department or [CITY] PD law enforcement officials that resulted in a monetary legal settlement between the period of January 1, 2010 and December 31, 2019.

Specifically, I am requesting any and all records concerning each legal settlement, including:

  • Name(s) of plaintiff(s)
  • Name(s) of officer(s) involved
  • Name of court and docket number
  • Date of incident at issue
  • Location of incident at issue
  • Date lawsuit filed
  • Date lawsuit resolved
  • Type of misconduct
  • Summary of allegations
  • Settlement amount
  • Name of plaintiff’s attorney (or pro se status if plaintiff represented him/herself)

Please provide the records in an electronic spreadsheet. In addition, I request you send along any data dictionaries that accompany these records.

Please waive any applicable fees. I am a representative of the news media through FiveThirtyEight and The Marshall Project. Release of this information is in the public interest because it will help the public understand how taxpayer dollars are used to compensate victims of police misconduct.

If my request is denied in whole or part, I ask that you justify all deletions and denials by reference to specific exemptions of the laws and traditions in your state. I also request that you release all segregable portions of the otherwise exempt material. I reserve the right to appeal your decision to withhold any information or to deny any waiver of fees.

As I am making this request as a journalist and this information is of timely value, I would appreciate your communication through telephone or email if you have any questions regarding this request. I can be reached by email at [email] and by phone at [phone].

Thank you for your assistance.

List of cities FOIA'd and what happened with each

| City | Outcome | Time period |
| --- | --- | --- |
| Atlanta, GA | Received data | 2015-2020 |
| Baltimore, MD | Received data that was not responsive to FOIA | 2015-2020 |
| Baton Rouge, LA | Received data | 2010-2019 |
| Birmingham, AL | Did not receive data | |
| Boston, MA | Received data | 2010-2019 |
| Bridgeport, CT | Did not receive data | |
| Buffalo, NY | Did not receive data | |
| Cambridge, MA | Received data | 2010-2019 |
| Charleston, SC | Received data | 2010-2019 |
| Chattanooga, TN | Did not receive data | |
| Chicago, IL | Received data | 2010-2019 |
| Cincinnati, OH | Received data | 2010-2020 |
| Cleveland, OH | Received data | 2010-2020 |
| Columbia, SC | Received data | 2010-2019 |
| Dayton, OH | Did not receive data | |
| Detroit, MI | Received data | 2010-2019 |
| Elizabeth, NJ | Did not receive data | |
| Fort Lauderdale, FL | Received data | 2011-2019 |
| Hartford, CT | Received data, unusable | |
| Indianapolis, IN | Received data | 2010-2019 |
| Jersey City, NJ | Received data, unusable | |
| Kansas City, MO | Received data, unusable | |
| Little Rock, AR | Received data | 2010-2019 |
| Los Angeles, CA | Received data | 2010-2019 |
| Memphis, TN | Received data | 2013-2019 |
| Miami, FL | Received data | 2010-2020 |
| Milwaukee, WI | Received data | 2010-2019 |
| New Haven, CT | Did not receive data | |
| New Orleans, LA | Received data | 2010-2019 |
| New York City, NY | Received data | 2010-2019 |
| Newark, NJ | Did not receive data | |
| Norfolk, VA | Did not receive data | |
| North Charleston, SC | Received data | 2010-2019 |
| Orlando, FL | Received data | 2010-2018 |
| Paterson, NJ | Received data | 2010-2019 |
| Philadelphia, PA | Received data | 2009-2019 |
| Pittsburgh, PA | Did not receive data | |
| Richmond, VA | Received data | 2010-2019 |
| Roanoke, VA | Received data | 2010-2019 |
| Rochester, NY | Did not receive data | |
| San Francisco, CA | Received data | 2010-2019 |
| Shreveport, LA | Received data, unusable | |
| Springfield, MA | Received data | 2006-2020 |
| St Louis, MO | Received data | 2015-2019 |
| Syracuse, NY | Did not receive data | |
| Tuscaloosa, AL | Did not receive data | |
| Washington, DC | Received data | 2010-2019 |
| Waterbury, CT | Received data | 2011-2019 |
| West Palm Beach, FL | Did not receive data | |
| Yonkers, NY | Did not receive data | |