Skip to content

cc5tudio/irs

 
 

Repository files navigation

Search all IRS nonprofits' financial docs (990s) by nonprofit name, based on info from:

--bulk.resource.org --http://www.irs.gov/uac/SOI-Tax-Stats-Exempt-Organizations-Business-Master-File-Extract-(EO-BMF) --http://www.irs.gov/uac/SOI-Tax-Stats-Annual-Extract-of-Tax-Exempt-Organization-Financial-Data

By Luke Rosiak, a reporter at the Washington Times (lukerosiak.info)

A working instance of this tool is at http://lukerosiak.info/irs/

Released under the GNU license

SQL for parsed and flattened master EIN table is in sql/masterfile.sql.tar.gz -- to create this I did a bunch of scripting on http://www.irs.gov/pub/irs-soi/eobk13.txt

To parse the excellent index of PDFs from bulk.resouce.org, see management/commands/irs_import.py

Elasticsearch was recently added to this app, so you'll have to set it up or use the older version in this repo, which used postgres full-text search.

Create the PDF table with create.sql, the irs_eins table with masterfile.sql.tar.gz and the structured extracts of some PDFs with masterfile.sql

About

Search all nonprofits' financial form 990s by filer name, dating back to 2002

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors