PDF OCR
Drop a scanned (image-only) PDF and the tool will read the text out of each page. Pages with an existing text layer are skipped automatically.
Drop your PDF here
or click to browse
FAQ
How is this private if the OCR engine is downloaded?
Only the OCR engine code and language data are downloaded — your PDF stays in your browser. The OCR runs locally as WebAssembly.
Why is the first run slow?
Tesseract downloads ~10 MB of language data the first time. Your browser caches it, so subsequent runs are much faster.