why do PDF readers need multiple seconds to search through something like a thousand pages of formatted text when grep can dig through the equivalent plaintext in ~50ms? is extracting plaintext from PDF that costly?
-
Show this thread
-
for evince, which uses libpoppler for text search in PDF, callgrind estimates 47% of time spent in pow(), called from cmsBuildSegmentedToneCurve() (56% in/below this function), which is some function for drawing curves? and further up in the call chain, there's Page::display()
2 replies 1 retweet 5 likesShow this thread -
Replying to @tehjh
it likely is actually graytone mapping (tone curves are the gamma curves for displays from the old CRT days) so it's even worse than you thought.. since you're not searching for color
1 reply 1 retweet 1 like
so I guess maybe I should try to see whether I can patch it to not bother with colors (and maybe some other things like that) when rendering pages for search
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.