Talk Announcement – “Open Source OCR and PDF compression at the Internet Archive” by Merlijn Wajer
Join us for a captivating talk by Merlijn Wajer, a passionate advocate for free and open source software, sharing his expertise in diverse technological domains.
Talk: “Open Source OCR and PDF Compression at the Internet Archive”
Discover the inner workings of Archive PDF tools, the software behind the highly compressed PDFs with selectable text layers at the Internet Archive. Merlijn Wajer will delve into the technical intricacies of generating compressed PDFs from digitized content, primarily books. Learn about the development process, technical challenges faced, and the underlying algorithms that make this compression possible. Explore how similar techniques can be applied to personal archives, offering insights into the potential of open source OCR and PDF compression.
Don’t miss out on this enlightening session with Merlijn Wajer!