In addition to the "data sources" sections in each State of the Field topic area – which are overviews of the best data sources for different types of research topics, provided by experts – below are data sources reviewers have used in building the State of the Field.
The Kauffman Foundation creates data overviews that provide guidance on how to use certain datasets. The data overviews serve several purposes:
To suggest overviews on additional data sets, please email firstname.lastname@example.org.
The Annual Survey of Entrepreneurs (ASE) data provides a snapshot of select economic and demographic characteristics of employer firms and business owners in 2014 by the 2-digit 2012 North American Industry Classification System (NAICS), at the national, state, and metropolitan area (top 50) levels, in the U.S. This survey conducted by the U.S. census Bureau is the largest annual survey of entrepreneurs ever done in the United States. It documents the story of American entrepreneurs, providing more frequent and extensive data than previously available. The ASE will supplement the Survey of Business Owners (SBO), conducted every five years.
The BDS provides annual measures of business dynamics (such as job creation and destruction, establishment births and deaths, and firm startups and shutdowns) for the economy and aggregated by establishment and firm characteristics. The BDS is created from the Longitudinal Business Database (LBD), a confidential database available to qualified researchers through secure Census Bureau Research Data Centers. The use of the LBD as its source data permits tracking establishments and firms over time.
The BRDIS is the primary source of information on research and development performed or funded by businesses within the United States. The survey is conducted by the Census Bureau in accordance with an interagency agreement with the National Center for Science and Engineering Statistics. Results are used to assess trends in the performance and funding of business R&D. The annual survey examines a nationally representative sample of companies in manufacturing and nonmanufacturing industries.
The University of Michigan’s HRS is a longitudinal panel study that surveys a representative sample of more than 20,000 Americans over the age of 50 every two years. Since its launch in 1992, the study has collected information about income, work, assets, pension plans, health insurance, disability, physical health and functioning, cognitive functioning, and health care expenditures.
The KFS is a panel study of 4,928 businesses founded in 2004 and tracked over their early years of operation. The survey focuses on the nature of new business formation activity; characteristics of the strategy, offerings, and employment patterns of new businesses; the nature of the financial and organizational arrangements of these businesses; and the characteristics of their founders.
Walls & Associates converts Dun and Bradstreet (D&B) archival establishment data into a time-series database of establishment information, the National Establishment Time-Series (NETS) Database, which provides an annual record for a large part of the U.S. economy that includes establishment job creation and destruction, sales growth performance, survivability of business startups, mobility patterns, changes in primary markets, corporate affiliations that highlight M&A, and historical D&B credit and payment ratings.
There are also four specialized NETS Databases to help researchers focus their NETS database requests, each with a PDF description and an Excel file for more in-depth analysis:
The PSED II offers a nationally-representative database for the United States to offer systematic, reliable, and generalizable data on the business formation process. It includes information on the proportion and characteristics of the adult population attempting to start new businesses, the kinds of activities nascent entrepreneurs undertake during the business start-up process and the proportion and characteristics of the start-up efforts that become infant firms. The PSED II follows a cohort of nascent entrepreneurs for three years beginning in 2005.
Below is the complete list of the data sources referenced in State of the Field and other known entrepreneurship databases. Over time, we hope to include data overviews on all of these sources. Did we miss any important data sources? Contact Alex Krause.
Association of University Technology Managers (AUTM)
BioScan Directory by BioWorld
Business Plan Archive
Business Research & Development and Innovation Survey (BRDIS)
CB Insights Venture Capital Database
Center for Research in Security Prices (CRSP)/Computstat
Center for Venture Research
Chapter 11 Library
Comparative Immigrant Entrepreneurship Project (CIEP)
Comprehensive Australian Study of Entrepreneurial Emergence (CAUSEE)
Connecting Outcome Measures in Entrepreneurship Technology and Science (COMETS)
Contracting & Organizations Research Institute (CORI) K-Base
DowJones Venture Source
Dun & Bradstreet
EDGAR (U.S. Securities and Exchange Commission)
Employer Information Report (EEO-1)
European Patent Office’s Worldwide Patent Statistical Database (PATSTAT)
Eurostat Factors of Business Success
Firm and Industry Evolution and Entrepreneurship (FIVE)
General Social Survey (GSS)
Global Entrepreneurship Monitor (GEM)
Harvard’s Dataverse Patent Network Project
Hoover’s and Business Insight (formerly MarketPlace)
Indiana University Patent Data Concordance
IPO Data by Jay Ritter
Kauffman Firm Survey (KFS)
Kauffman Index of Entrepreneurial Activity
Health and Retirement Study (HRS)
IP Litigation Clearinghouse (IPLC) (Lex Machina)
Kenney-Patton Firm and Management Database of Emerging Growth IPOs
Minority Business Enterprise Program
National Establishment Time Series (NETS)
National Science Foundation Scientists and Engineers Statistical Data System (SESTAT)
National Science Foundation (NSF) Survey of Industrial Research and Development (SIRD)
National Bureau of Economic Research and U.S. Census Bureau's Center for Economic Studies (NBER-CES) Manufacturing Productivity Database
National Bureau of Economic Research (NBER) Patent Citations Data File
OECD-Eurostat Entrepreneurship Indicators Programme (EIP)
Panel Study of Entrepreneurial Dynamics I & II (PSED)
Panel Study of Income Dynamics (PSID)
S&P Capital IQ
SDC Mergers & Acquisition database by Thomson Reuters
Securities and Exchange Commission (SEC) Initial Public Offering (IPO) database
SFINNO - Database of Finnish Innovations
Small Business Administration (SBA)
Stanford Graduate School of Business' (GSB) Center for Entrepreneurial Studies' (CES) Project on Emerging Companies
Survey of Minority-Owned Business Enterprises (SMOBE)
The Indus Entrepreneurs (TiE)
UGA Patent Litigation Datafile by John L. Turner
United Nations Economic Commission for Europe (UNECE) Statistical Business Register (SBR) Survey
U.S. Patent and Trademark Office database
World Management Survey
The Venture Capital and Private Equity (VC-PE) Country Attractiveness IndexVentureXpert (Thomson One)
Do you manage a dataset? Google has new metadata for science datasets to formalize their structure.
Last updated 14 September 2017