Need help with sparser?
Click the “chat” button below for chat support from the developer who created it, or find similar developers for support.

About the developer

421 Stars 51 Forks BSD 3-Clause "New" or "Revised" License 285 Commits 3 Opened issues


Sparser: Raw Filtering for Faster Analytics over Raw Data

Services available


Need anything else?

Contributors list


This code base implements Sparser, raw filtering for faster analytics over raw data. Sparser can parse JSON, Avro, and Parquet data up to 22x faster than the state of the art. For more details, check out our paper published at VLDB 2018.

See the

directory for a brief example. To run it:
# update rapidjson submodule
git submodule init
git submodule update
cd demo-repl
./bench /path/to/large/file.json

Then enter

at the

Sparser itself is just a header file and only depends on standard C libraries available on most systems.

We use cookies. If you continue to browse the site, you agree to the use of cookies. For more information on our use of cookies please see our Privacy Policy.