Python library for reading and writing warc files
The developer of this repository has not created any items for sale yet. Need a bug fixed? Help with integration? A different license? Create a request here:
.. image:: https://secure.travis-ci.org/anandology/warc.png?branch=master :alt: build status :target: http://travis-ci.org/anandology/warc
WARC (Web ARChive) is a file format for storing web crawls.
warclibrary makes it very easy to work with WARC files.::
import warc f = warc.open("test.warc") for record in f: print record['WARC-Target-URI'], record['Content-Length']
The documentation of the warc library is available at http://warc.readthedocs.org/.
This software is licensed under GPL v2. See LICENSE_ file for details.
.. LICENSE: http://github.com/internetarchive/warc/blob/master/LICENSE