why do PDF readers need multiple seconds to search through something like a thousand pages of formatted text when grep can dig through the equivalent plaintext in ~50ms? is extracting plaintext from PDF that costly?
Replying to @tehjh
is it faster if you run pdftotext [file] | grep [pattern] on the command line?
Replying to @hanno
somewhat - 20s for searching through the SDM in evince, 8s for pdftotext
9:27 AM - 7 Apr 2018
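
For anyone who wants to see where the time goes in the pdftotext-then-grep approach, here is a rough Python sketch that times the extraction and the search separately. It assumes `pdftotext` (from poppler-utils) is on the PATH. The function names `search_text` and `timed_pdf_search` are made up for this example, and Python's `re` module stands in for grep, so pattern syntax may differ slightly from grep's.

```python
import re
import shutil
import subprocess
import time


def search_text(text: str, pattern: str) -> list[str]:
    """Return the lines of `text` that match the regular expression `pattern`."""
    regex = re.compile(pattern)
    return [line for line in text.splitlines() if regex.search(line)]


def timed_pdf_search(pdf_path: str, pattern: str) -> tuple[list[str], float, float]:
    """Extract text with pdftotext, then search it.

    Returns (matching_lines, extraction_seconds, search_seconds).
    """
    # Assumption: the pdftotext binary is installed and on the PATH.
    if shutil.which("pdftotext") is None:
        raise RuntimeError("pdftotext not found on PATH")

    start = time.perf_counter()
    # Passing "-" as the output file makes pdftotext write the text to stdout.
    result = subprocess.run(
        ["pdftotext", pdf_path, "-"],
        capture_output=True,
        encoding="utf-8",
        errors="replace",
        check=True,
    )
    extracted = time.perf_counter()

    matches = search_text(result.stdout, pattern)
    done = time.perf_counter()

    return matches, extracted - start, done - extracted
```

Calling `timed_pdf_search("manual.pdf", "pattern")` with a real file (the file name here is a placeholder) returns the matching lines plus the two durations, which shows whether the extraction step or the search step dominates.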