Conversation

Maybe you could improve the EPUB reading experience by extracting text block layout parameters from the PDFs through computer vision: ie. try to estimate the text block width/height, line height, and font size in the original typesetting. Similar technique could map page numbers.
3
16
Related: while e-book reading software are truly impoverished, PDF software is also almost universally unimaginative and unserious for the task of reading. Would love to see more work there…
4
6
45
Replying to
This is such a fascinating point for me, as often my goal is to get a PDF into something that can flow so that I can actually read it on a smaller device. PDFs are so static, and they also don't work well with text-to-speech systems. I guess it depends on what sort of reading...
2
10
Show replies
Suspect you’d feel differently if you tried to read a dozen pages like this. The epub’s lines are too long, slightly too loose, and have terrible justification leading to irregular word spacing rhythms.
2
6