A new version of our WARC library warcio (1.6.1) is now out with support for the WARC 1.1 standard.
More info: https://github.com/webrecorder/warcio …
Install: pip install -U warcio
#webarchiving
warcio 1.6.1 also simplifies creating WARC files from HTTP traffic to just 4 lines of Python:

    from warcio.capture_http import capture_http
    import requests
    with capture_http('example.warc.gz', warc_version='1.1'):
        requests.get('https://example.com/')
warcio 1.6.1 also supports reading WARCs created by wget with angle brackets wrapped around the WARC-Target-URI (resulting from an unfortunate ambiguity in the spec, see: http://lists.gnu.org/archive/html/bug-wget/2017-11/msg00050.html …)
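The kind of normalization involved can be sketched in a few lines. This is an illustration only, not warcio's actual implementation: the function name strip_uri_brackets is hypothetical, and it assumes the only difference is a single pair of enclosing angle brackets around the URI.

```python
def strip_uri_brackets(target_uri):
    """Hypothetical helper: drop one pair of enclosing angle brackets, if present."""
    if target_uri.startswith('<') and target_uri.endswith('>'):
        return target_uri[1:-1]
    return target_uri


# A wget-style header value and a plain one normalize to the same URI.
wrapped = strip_uri_brackets('<https://example.com/>')
plain = strip_uri_brackets('https://example.com/')
print(wrapped == plain)
```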
6:09 PM - 10 Oct 2018