The Kauffman Foundation creates data overviews that provide guidance on how to use certain datasets. The data overviews serve several purposes:
To serve as data reference guides for academics, other data providers, or anyone interested in new research developments in entrepreneurship.
To provide short (up to two pages) overviews of data sets in one place that point to lots of relevant information but are highly curated and edited to ensure brevity and clarity.
1. Annual Survey of Entrepreneurs (ASE) Data Overview [PDF]
The Annual Survey of Entrepreneurs (ASE) data provides a snapshot of select economic and demographic characteristics of employer firms and business owners in 2014 by the 2-digit 2012 North American Industry Classification System (NAICS), at the national, state, and metropolitan area (top 50) levels, in the U.S. This survey conducted by the U.S. census Bureau is the largest annual survey of entrepreneurs ever done in the United States. It documents the story of American entrepreneurs, providing more frequent and extensive data than previously available. The ASE will supplement the Survey of Business Owners (SBO), conducted every five years.
2. Business Dynamics Statistics (BDS) Data Overview [PDF]
The BDS provides annual measures of business dynamics (such as job creation and destruction, establishment births and deaths, and firm startups and shutdowns) for the economy and aggregated by establishment and firm characteristics. The BDS is created from the Longitudinal Business Database (LBD), a confidential database available to qualified researchers through secure Census Bureau Research Data Centers. The use of the LBD as its source data permits tracking establishments and firms over time.
3. Business R&D and Innovation Survey (BRDIS) Data [PDF] Overview
The BRDIS is the primary source of information on research and development performed or funded by businesses within the United States. The survey is conducted by the Census Bureau in accordance with an interagency agreement with the National Center for Science and Engineering Statistics. Results are used to assess trends in the performance and funding of business research and development. The annual survey examines a nationally representative sample of companies in manufacturing and non-manufacturing industries.
4. Health and Retirement Study (HRS) Data Overview [PDF]
The University of Michigan’s HRS is a longitudinal panel study that surveys a representative sample of more than 20,000 Americans over the age of 50 every two years. Since its launch in 1992, the study has collected information about income, work, assets, pension plans, health insurance, disability, physical health and functioning, cognitive functioning, and health care expenditures.
5. Kauffman Firm Survey (KFS) Data Overview [PDF]
The KFS is a panel study of 4,928 businesses founded in 2004 and tracked over their early years of operation. The survey focuses on the nature of new business formation activity; characteristics of the strategy, offerings, and employment patterns of new businesses; the nature of the financial and organizational arrangements of these businesses; and the characteristics of their founders.
6. National Establishment Time-Series (NETS) Database Data Overview
Walls & Associates converts Dun and Bradstreet (D&B) archival establishment data into a time-series database of establishment information, the National Establishment Time-Series (NETS) Database, which provides an annual record for a large part of the U.S. economy that includes establishment job creation and destruction, sales growth performance, survivability of business startups, mobility patterns, changes in primary markets, corporate affiliations that highlight M&A, and historical D&B credit and payment ratings.
The Connecting Outcome Measures in Entrepreneurship, Technology, and Science (COMETS) database is the result of close to two decades of research, hard work, and dedication by a community of scholars working to join data about science, technology, and economic outcomes such that links between innovations, people, and businesses can be tracked better. COMETS is a direct outgrowth of the earlier StarTechZD, STAR (Science and Technology Agents of Revolution), and NanoBank database initiatives.
There are also four specialized NETS Databases to help researchers focus their NETS database requests, each with a PDF description and an Excel file for more in-depth analysis:
An expanded Manufacturing Database [ PDF | XSLX ]
An expanded Retail Database [ PDF | XSLX ]
An All Publicly Listed Establishments Database [ PDF | XSLX ]
A High-Technology database based on BLS occupation definitions [ PDF | XSLX ]
7. Panel Study of Entrepreneurial Dynamics II (PSED II) Data Overview
The PSED II offers a nationally-representative database for the United States to offer systematic, reliable, and generalizable data on the business formation process. It includes information on the proportion and characteristics of the adult population attempting to start new businesses, the kinds of activities nascent entrepreneurs undertake during the business start-up process and the proportion and characteristics of the start-up efforts that become infant firms. The PSED II follows a cohort of nascent entrepreneurs for three years beginning in 2005.
Below is the complete list of the data sources referenced in State of the Field and other known entrepreneurship databases. Over time, we hope to include data overviews on all of these sources.
Association of University Technology Managers (AUTM)
BioScan Directory by BioWorld
Business Plan Archive
Business Research & Development and Innovation Survey (BRDIS)
CB Insights Venture Capital Database
Center for Research in Security Prices (CRSP)/Computstat
Center for Venture Research
Chapter 11 Library
Comparative Immigrant Entrepreneurship Project (CIEP)
Comprehensive Australian Study of Entrepreneurial Emergence (CAUSEE)
Connecting Outcome Measures in Entrepreneurship Technology and Science (COMETS)
Contracting & Organizations Research Institute (CORI) K-Base
DowJones Venture Source
Dun & Bradstreet
EDGAR (U.S. Securities and Exchange Commission)
Employer Information Report (EEO-1)
European Patent Office’s Worldwide Patent Statistical Database (PATSTAT)
Eurostat Factors of Business Success
Firm and Industry Evolution and Entrepreneurship (FIVE)
General Social Survey (GSS)
Global Entrepreneurship Monitor (GEM)
Harvard’s Dataverse Patent Network Project
Hoover’s and Business Insight (formerly MarketPlace)
Indiana University Patent Data Concordance
IPO Data by Jay Ritter
Kauffman Firm Survey (KFS)
Kauffman Index of Entrepreneurial Activity
Health and Retirement Study (HRS)
IP Litigation Clearinghouse (IPLC) (Lex Machina)
Kenney-Patton Firm and Management Database of Emerging Growth IPOs
Minority Business Enterprise Program
National Establishment Time Series (NETS)
National Science Foundation Scientists and Engineers Statistical Data System (SESTAT)
National Science Foundation (NSF) Survey of Industrial Research and Development (SIRD)
National Bureau of Economic Research and U.S. Census Bureau's Center for Economic Studies (NBER-CES) Manufacturing Productivity Database
National Bureau of Economic Research (NBER) Patent Citations Data File
OECD-Eurostat Entrepreneurship Indicators Programme (EIP)
Panel Study of Entrepreneurial Dynamics I & II (PSED)
Panel Study of Income Dynamics (PSID)
S&P Capital IQ
SDC Mergers & Acquisition database by Thomson Reuters
Securities and Exchange Commission (SEC) Initial Public Offering (IPO) database
SFINNO - Database of Finnish Innovations
Small Business Administration (SBA)
Stanford Graduate School of Business' (GSB) Center for Entrepreneurial Studies' (CES) Project on Emerging Companies
Survey of Minority-Owned Business Enterprises (SMOBE)
The Indus Entrepreneurs (TiE)
UGA Patent Litigation Datafile by John L. Turner
United Nations Economic Commission for Europe (UNECE) Statistical Business Register (SBR) Survey
U.S. Patent and Trademark Office database
World Management Survey
The Venture Capital and Private Equity (VC-PE) Country Attractiveness Index VentureXpert (Thomson One)
Business Employment Dynamics (BED)
Current Population Survey
National Longitudinal Survey of Youth
Annual Survey of Entrepreneurs (ASE)
Annual Survey of Manufacturers (ASM)
Annual (SAS) and Quarterly (QSS) Services Surveys
Business Dynamics Statistics (BDS)
Characteristics of Business Owners (CBO)
Federal Statistical Research Data Centers (RDC)
Integrated Public Use Microdata Samples (IPUMS)
Longitudinal Business Database (LBD)
Longitudinal Employer-Household Database (LEHD)
Management and Organizational Practices Survey (MOPS)
Survey of Business Owners (SBO)
Survey of Income and Program Participation (SIPP)
Statistics of U.S. Businesses (SUSB)
Survey of Consumer Finances (SCF)
Survey of Small Business Finances (SSBF)
The World Bank Database
Benchmarking Public Procurement
Enabling the Business of Agriculture
Women, Business and the Law
Do you manage a dataset? Google has metadata for science datasets to formalize their structure.