Extract Text From Scanned PDFs Privately

Convert scanned PDFs to selectable text using local browser OCR. Your confidential documents never touch a cloud server.

🔎

OCR PDF

Turn a scanned PDF — a picture of text — into real, copyable text.

📥

Drop a scanned PDF here or click to browse

Document language Accuracy

Runs in your browser. The first run downloads language data (~10–20 MB); recognition takes a few seconds per page.

OCR That Never Uploads Your Document

Most online OCR services send your scan to a server to be read. For confidential material — contracts, medical records, legal discovery — that's a non-starter. This tool runs the entire recognition pipeline in your browser with Tesseract.js, so the document is never transmitted anywhere.

How the Local Pipeline Works

Each page is rendered to a high-resolution image in the browser, then a Web Worker running the open-source Tesseract engine recognizes the characters and returns plain text you can copy or download. The only thing downloaded is the engine and its language data, fetched once and cached.