Should twarc's docs reference http://ndjson.org or http://jsonlines.org for line-oriented-json? Context: https://github.com/DocNow/twarc/issues/173 …
-
-
Replying to @edsu
A superset of these, ORS/CDXJ http://ws-dl.blogspot.com/2015/09/2015-09-10-cdxj-object-resource-stream.html … is relevant and more flexible, but the draft needs more workhttps://github.com/oduwsdl/ORS
1 reply 4 retweets 0 likes -
Replying to @ibnesayeed
That's fine for CDX data, but it doesn't make much sense to call it that for other kinds of JSON does it?
1 reply 0 retweets 0 likes -
Replying to @edsu
Object Resource Stream (ORS) is a more generic term, suitable for any purpose, but CDXJ is geared towards CDX. Blog post explains the diff.
1 reply 0 retweets 0 likes -
Replying to @ibnesayeed
ORS sounds way to generic to me. It loses the fact that it’s really just JSON.
3 replies 0 retweets 0 likes -
Replying to @edsu @ibnesayeed
it's more than that although the blog & github are woefully short on examples. it's: key1 key2 ... {json block} key1 key2 ... {json block}
2 replies 1 retweet 1 like -
Replying to @phonedude_mln @ibnesayeed
Thanks, I missed that detail. I’m sure that makes sense in your CDX use case, but it adds an unnecessary level of complexity for mine.
1 reply 0 retweets 2 likes -
Replying to @edsu @phonedude_mln
Prefix keys outside JSON block are optional, that's why it is a superset of NDJSON (or alike formats).
2 replies 0 retweets 0 likes -
But
@edsu doesn't need the prefix at all (right?), just the line-delimited JSON. w/o the prefix key, it's not ORS, no need for new format.1 reply 0 retweets 0 likes
That's like saying we should call plain JSON "YAML" because YAML is a superset of JSON ;) Using most specific name helps avoid confusion
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.