« Summary by Party 3/31/07 | Main | About FEC Data Mart »

Data Issues

Posted on Jun 3, 2007 by Registered CommenterTom in | CommentsPost a Comment

Problems with FEC source data:

  1. Detailed individual contributions to candidates are reported only for contributions over $200. To get a complete picture, quarterly filings must be taken into account.
  2. Committee contributions to candidates are stored in the Committee Contributions table for a total of three sources that have to be combined in order to get contribution totals.
  3. Party identification is not validated and is not coded consistently. I've cleaned the glaring errors.
  4. The Individual and Committee contributions tables each contain State, City, and Zip columns. I normalized the tables by yanking out State and City into a snowflake, only to learn that the data is dirty. The database includes whatever the candidates and committees entered, without data validation. Accordingly, there are invalid zip codes, invalid state codes, and other assorted garbage, cities entered into Street Address column, zips entered into City column, etc. I put City and State back into the tables, and geographic analysis will be approximate.*
Unrelated to FEC data:
  1. State population data comes from the Census Bureau using figures released Dec 2007.

 

* This should help: Zip Codes

PrintView Printer Friendly Version

EmailEmail Article to Friend

Reader Comments

There are no comments for this journal entry. To create a new comment, use the form below.

PostPost a New Comment

Enter your information below to add a new comment.

My response is on my own website »
Author Email (optional):
Author URL (optional):
Post:
 
Some HTML allowed: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <code> <em> <i> <strike> <strong>