Document Scanner
Capture or upload a page, fine-tune the corners, get a clean scan.
This tool uses a server-side service for processing; uploaded files or requests are not kept for long-term storage.
About
Document scanning used to mean a flatbed the size of a small printer, plugged into a desktop, taking thirty seconds per page. Phones killed that workflow somewhere around 2015 — modern cameras shoot 12+ megapixels with sensors good enough that the limiting factor for a paper scan is no longer image quality but the geometric and tonal post-processing that turns a snapshot of a page into something that looks scanned. Apps like CamScanner and Adobe Scan do that processing well, but the cost is obvious: accounts, paywalls, cloud archives tied to your identity, and a whole app to install for a one-shot operation. This document scanner does the same job from a single web page — no account, no install — and your photo lives only in memory on our server for the few seconds it takes to process.
The pipeline has four stages: capture, edge detection, perspective correction, and tonal enhancement. Capture is straightforward — point the camera at the document, frame it loosely (the corners do not need to be exactly aligned), tap the shutter. Edge detection finds the four corners of the page by looking for the strongest contrast boundaries that form a quadrilateral; when it is uncertain (busy backgrounds, white-on-white surfaces) you can drag the four corner handles manually. Manual override is fast and exact, and it is the right answer when auto-detect lacks confidence — the rest of the pipeline trusts whichever corners you give it.
Perspective correction is the step that makes the difference between a photo of a page and an actual scan. When you photograph a document on a desk from above, the resulting image has a trapezoidal distortion: the side closer to the camera looks larger than the side further away, even though both sides of the page are the same width in reality. Mathematically, this is a projective transformation between the document plane and the camera sensor plane. Reversing it requires computing a homography — a 3x3 matrix that maps the four detected corners back to a clean rectangle. The output pixel grid is then sampled bicubically through the matrix. This step runs on our server using OpenCV and takes a few hundred milliseconds for a typical full-resolution phone photo.
Tonal enhancement is the last stage and the one that distinguishes a usable scan from a phone photo. Four filter modes cover real scanning. Color keeps the colors but corrects perspective; Magic Color (the default) removes shadows, whitens the paper, lifts contrast and sharpens text — this is the office-scan look. Grayscale converts to a perception-weighted gray, the right choice for printed text where color carries no information. B&W uses an adaptive threshold that walks each pixel and compares it to its 31x31 neighborhood mean; this beats a global threshold because it handles documents with uneven lighting and produces the crisp pure-black-on-pure-white look most professional scanners default to for forms and contracts.
This release is intentionally single-page focused: capture one sheet, optionally drag the corners, scan, download. When you need a multi-page document, scan each page separately and join them with [Images to PDF](/tools/images-to-pdf).
Edge cases worth knowing. Glossy paper under direct overhead light produces glare that no filter can remove because the underlying pixels are saturated white; reposition the page or your light source. Lined notebook paper with strong horizontal lines can confuse auto edge detection; drag the corners manually in that case. Pages with photographs inside can over-threshold in B&W; use Grayscale or Magic Color instead. Curled or folded documents will keep some distortion in the middle because a single photo cannot recover the 3D shape; flatten the page under a book before scanning if absolute fidelity matters.
Mobile considerations are non-negotiable here because most users will reach for their phone first. The interface uses native pointer events so finger drags work identically to mouse drags, the corner handles are sized for touch, and the bottom action bar is fixed so primary actions stay reachable with the thumb. Camera capture defaults to the back camera (`facingMode: environment`). After capture, the photo immediately enters the corner-adjust view with auto-detect on — most documents need zero manual adjustment, but the corners are draggable for the cases that do.
Output is a standards-compliant PNG that opens in Adobe Acrobat, Apple Preview, Google Drive, your file manager and every image viewer. If you prefer pure black/white or grayscale, switch the filter chip before scanning; Magic gives you the office-scan look with sharpened text and a clean white background.
Privacy: capture and corner selection run in your browser. Cropping, perspective correction, deskew and paper whitening run on our server inside an in-memory OpenCV pipeline. Your image is not written to disk and not logged; it lives only for the few hundred milliseconds the request takes, then it is dropped. No account, no archive, no per-user tracking. Open the network tab during a scan and you will see one request for the scan endpoint and zero retained state on our end after the response.
The tool pairs naturally with the rest of Omnvert's PDF and image utilities. Combine multiple scans into one document with [Images to PDF](/tools/images-to-pdf), shrink the file with [PDF Compress](/tools/pdf-compress), add a searchable text layer with [PDF OCR](/tools/pdf-ocr), or sign with [PDF Sign](/tools/pdf-sign). For a different intake — say, a flatbed scan already in PDF form — the [PDF OCR](/tools/pdf-ocr) tool can ingest that directly.
A few practical tips. Diffuse overhead light produces the cleanest results — direct sunlight or a single-source desk lamp creates harsh shadows that survive even tonal correction. Place the page on a surface that contrasts with the paper color so auto-detect has an easy time (white paper on a wood desk works well; white paper on a white tablecloth fights the algorithm). Leave a small margin around the document in the frame so the corners are clearly visible and not cropped. Hold the phone parallel to the page if possible — perspective correction handles modest angles fine, but extreme angles (45° tilt) lose resolution because the far side of the page is much smaller in the source pixels than the near side.
Comparison with traditional flatbed scanners. A flatbed produces marginally better results for absolute fidelity (uniform lighting, perfect flatness, calibrated color), but the workflow gap is enormous: a phone scan takes 5–8 seconds per page; a flatbed takes 20–30 seconds per page from lid-open to scanned image. For the vast majority of everyday scanning — receipts, contracts, ID copies, recipe pages, school handouts — a phone scan is good enough and several times faster. Flatbeds remain the right tool for irreplaceable documents where color accuracy matters.
Use cases
- Scan receipts on the go without installing an app.
- Produce a clean PNG of a single contract page to share or sign.
- Digitize ID cards, licences and invoices without a cloud archive tied to your identity.
- Capture recipe, lesson note or handwritten pages into readable scans.
- Capture a form before filling it digitally for clean output.
- Turn a business card or receipt photo into a scan free of background, shadow and skew.
How it works
- 1Start the camera or upload an existing image (JPG / PNG / WebP, up to 25 MB).
- 2Auto-detect is on in the preview. If it picks the wrong region, drag any of the four corner handles to mark the real page boundary.
- 3Pick a filter chip: Magic (default — colour, paper whitening, sharpening), Color (perspective only), Gray, or B&W.
- 4Hit Scan. The image is sent to our server, where cropping + perspective correction + deskew + paper whitening run in memory.
- 5When processing finishes the result appears on screen. Download the PNG or start another scan.
FAQ
Does my photo or scan get uploaded anywhere?
No. Every step runs in your browser using local JavaScript. No image data leaves your device, nothing is logged, no telemetry tracks the contents. You can verify by opening the network tab during scanning — there are no outbound requests beyond the static page assets.
How does auto-edge detection work?
The scanner runs a fast edge-detection pass on a downscaled copy of the photo, looking for the strongest contrast boundaries that form a quadrilateral. It works well on documents with clear contrast between page and background. When detection is uncertain, the corners default to the photo edges and you drag them to refine — fast and reliable.
Can I use my laptop webcam instead of a phone?
Yes. The camera capture works on any device with a webcam — phone, laptop, tablet. Phone cameras typically produce better scans because their sensors are higher resolution and they default to autofocus, but a laptop webcam works fine for quick captures.
Why does my scan look skewed?
The four corner handles in Edit mode define the document boundary. If they do not match the actual page edges, perspective correction produces a skewed result. Drag the corners to the actual page corners and the output flattens correctly.
Which filter should I use?
For most documents, Magic Color is the right default — it boosts contrast and saturation modestly. For text-heavy printed documents, Grayscale produces clean readable output. For forms and contracts where you want the classic 'office scanner' look, B&W uses adaptive thresholding for crisp pure black on white. Original keeps the photo unchanged, useful when handwritten color annotations matter.
Can I scan more than one page into a single PDF?
Yes. Capture or upload as many pages as you need; each becomes a thumbnail in the page strip. Reorder by dragging, delete what does not look right, and export — the output PDF contains all pages in order.
What page sizes does the PDF export support?
A4 (210x297mm), US Letter (8.5x11in), and Original (preserves each captured page's aspect ratio). A4 is the global standard; Letter is the US default. Use Original for receipts, business cards, or anything that is not standard paper-sized.
How does this compare to CamScanner or Adobe Scan?
Same scanning quality and feature set for the common cases (auto-edge, perspective correction, multi-page, filters, PDF export), with a fundamental privacy difference: this tool runs entirely in your browser, no upload, no account, no install. Commercial apps upload to their cloud by default and have had documented privacy incidents.
Will it work offline?
Once the page is loaded, yes — all processing happens locally with no network calls. You will not be able to load the page again offline unless your browser cached it, but an open tab continues to work without an internet connection.
Can I make the scanned text searchable (OCR)?
Yes — toggle 'Add searchable text layer (OCR)' in the Export step. The scanner runs Tesseract.js entirely in your browser to recognize text in each page and embeds an invisible, selectable text layer in the PDF. The output looks identical visually but is fully searchable in any PDF viewer (Cmd/Ctrl+F to find) and the text is copy-pasteable. First run downloads ~3–5 MB per language; subsequent runs are cached. Supported languages include Turkish, English, German, French, and Spanish.