Most of my data dumps are in NDJSON format. Remember, you can use this script to process NDJSON files extremely fast. If you're working off an SSD or NVMe drive, this script will give a really nice speed boost: github.com/pushshift/Para
#datascience #ndjson #script
