Need help with crunchbase-data?
Click the “chat” button below for chat support from the developer who created it, or find similar developers for support.

About the developer

notpeter
146 Stars 76 Forks Other 14 Commits 0 Opened issues

Description

2015 CrunchBase Data Export as CSV

Services available

!
?

Need anything else?

Contributors list

# 36,148
Node.js
Haxe
man-pag...
Termina...
11 commits

Crunchbase Data As CSV

This data was extracted from the December 4, 2015 Crunchbase Data Export.

This repository includes unofficial CSV exports derived from the individual worksheets from crunchbase_export.xlsx. I previously munged the data by hand with Excel, but have since moved the dirty work to python. Reading the XLSX file is handled with openpyxl while unicodecsv creates the CSVs.

The Excel workbook is transformed as follows:

  • One CSV file per worksheet
  • Skip the analysis page and empty columns
  • Remove redundant reduced precision date columns (month, quarter, year)
  • Remove dates missing a year (year 1000 is just wrong)
  • Remove trailing blank rows

Usage

virtualenv .venv
source .venv/bin/activate
pip install -r requirements.txt
python crunchbase-csv.py crunchbase_export.xlsx

License

Use of this data is governed by the CrunchBase Terms of Service and Licensing Policy.

This data dump for non-commercial use is provided under Creative Commons Attribution-NonCommercial (CC-BY-NC) license. Any commercial use requires a seperate license from CrunchBase.

crunchbase-csv.py is Copyright (c) Peter Tripp and made available under terms of the MIT License

We use cookies. If you continue to browse the site, you agree to the use of cookies. For more information on our use of cookies please see our Privacy Policy.