Where does our nonprofit data come from?
Why are we interested in data?
CauseBot is an artificial intelligence (AI) software company focused on assisting nonprofit organizations with innovative technology solutions. Current machine learning techniques require massive amounts of data and information to create these expert systems. Our solutions will assist nonprofits worldwide generate more capital and grant capital effectively thus increasing overall impact.
What is the data?
Hosted and serviced by Amazon Web Services (AWS), our data is derived from the IRS tax exempt open data project for tax periods from 2010 to 2015. After extract, transform and loading (ETL) the disparate formats from year to year, we contributed our work to open data projects at data.world. In addition to the tax open data project, each tax exempt entity must register with the IRS to obtain an EIN. During this process, the IRS requires the categorization of nonprofit organizations under NTEE and activity classification codes. The availability of this data is also public and hosted by the IRS via the Business Master File.
Below is a brief list of data types made available by the IRS:
- EIN of Grantor
- Form Type
- Organization Name
- Tax Period
- Address of Grantor/Grantee
- City Name
- Purpose Of Grant
- Cash Grant Amount
- Non Cash Assistance Amount
- EIN of Grantee
What is a 990 form?
990 forms are annual filings from tax exempt organizations reporting to the IRS. There are three different tax forms (990, 990-EZ, and 990-N) available to nonprofits based upon their organization type.
How often is the data updated?
Tax exempt organizations file their tax return on the 15th of May each year. Nonprofits may request an extension of three months until August 15th. The quality of our data and frequency of update is dependent on the IRS open data project.
What are NTEE Codes?
The National Taxonomy of Exempt Entities (NTEE) is a system used by the IRS and National Center for Charitable Statistics (NCCS) to place tax exempt organizations into categories and classifications.
Putting it all together
Armed with the knowledge of 990 tax filings and NTEE codes allows for targeted analytics and training for our machine learning technology. Our AI journey will take us into an exploration of the data obtained from the IRS. Continue to check our blog to learn more about this data and our technology innovation.