Need help with warc?
Click the “chat” button below for chat support from the developer who created it, or find similar developers for support.

About the developer

218 Stars 109 Forks GNU General Public License v2.0 94 Commits 30 Opened issues


Python library for reading and writing warc files

Services available


Need anything else?

Contributors list

warc: Python library to work with WARC files

.. image:: :alt: build status :target:

WARC (Web ARChive) is a file format for storing web crawls.


library makes it very easy to work with WARC files.::
import warc
f ="test.warc")
for record in f:
    print record['WARC-Target-URI'], record['Content-Length']


The documentation of the warc library is available at


This software is licensed under GPL v2. See LICENSE_ file for details.


We use cookies. If you continue to browse the site, you agree to the use of cookies. For more information on our use of cookies please see our Privacy Policy.