What is Document OCR?
Document OCR turns image-only pages—scanned PDFs, photos of documents, or standalone images—into editable text. The tool sends your file through Dynamic Duniya’s file-tools API with a Tesseract-style language code, then shows extracted text (for .txt) or a download link for a Word document (.docx). It complements PDF-to-Excel and other converters: once a scan has a text layer, downstream tools can work with selectable content.
Supported inputs and limits
The uploader accepts PDF plus common raster formats: JPEG, PNG, TIFF, and BMP. The UI caps uploads at 30 MB. OCR is CPU-heavy; the progress label notes that large files may take on the order of 10–30 seconds. Success depends on Tesseract and related components being installed and healthy on the server—a blue info banner reminds operators to verify that setup if errors appear.
Languages and output
Pick a recognition language from the dropdown (for example English eng, Hindi hin, French fra, Spanish spa, German deu, Arabic ara, Chinese Simplified chi_sim, Japanese jpn, Portuguese por, Russian rus). Choose Plain Text (.txt) to read and copy text in the page, optionally with a download named like extracted-text.txt, or Word Document (.docx) for ocr-result.docx when the pipeline returns a file URL. Word counts may come from server meta or a local estimate.
Privacy
Unlike purely in-browser OCR, your document leaves the device for server processing. Do not upload regulated health, legal, or classified material unless policy allows. Download outputs promptly; temporary URLs can expire depending on configuration.