How Does Scanner Work? | Paper To Pixels

A scanner turns reflected light from a page into pixels, then software saves those pixels as an image or searchable file.

Pressing Scan feels simple, but the machine does several jobs. It lights the page, reads the bounced light, turns that light into electric signals, builds rows of pixels, then saves the result in a file your computer can open.

That is why it can copy receipts, photos, signed forms, and book pages. The scanner is not “reading” meaning at first. It is measuring light and dark areas, plus color values, one thin strip at a time.

How A Scanner Reads A Page

A flatbed scanner starts with the page pressed against glass. Under that glass, a moving carriage travels from one end to the other. The carriage holds a lamp, lenses or mirrors in some models, and a sensor that captures one line of the page at a time.

Sheet-fed scanners do the same job in a different shape. Instead of moving the sensor under a still page, rollers pull the paper past a fixed scan head. Duplex models scan both sides during one pass, so offices use them for stacks of forms.

The Light Source

The light source matters because sensors can only measure what reaches them. Many scanners use LEDs because they warm up fast, last a long time, and give steady light. Older models may use fluorescent lamps, which need more warm-up time and can drift as they age.

Canon’s own scanner mechanism notes describe two common designs: CCD systems that guide reflected light through optics, and CIS systems that place compact sensors close to the page. The difference affects thickness, sharpness, and raised or uneven items.

The Sensor

The sensor converts light into electrical values. A white area reflects more light, so it becomes a brighter pixel. Black ink reflects less light, so it becomes a darker pixel. Color scanning repeats that measurement through red, green, and blue channels.

A CCD scanner usually has more room for optics and handles slight depth better, which helps with books, wrinkled paper, and items that do not sit flat. A CIS scanner is thinner and draws less power, which suits portable units and many all-in-one printers.

How A Scanner Works Inside The Device

The scanner builds the image line by line. Each line is a row of tiny samples. The driver joins those rows into a rectangle. Smooth movement and still paper keep the lines aligned.

Bad alignment creates skew, stretched text, or bands. Dust on the glass creates streaks because the same dirty spot is captured again and again as the scan head moves. A loose lid can let stray light reach the sensor, which may wash out dark areas.

Resolution is measured in pixels per inch, often written as ppi. Federal digitizing rules define ppi as the resolution of a scanner or image, and say modern textual paper records should be digitized at a minimum of 300 ppi when sized to the source page under the Federal record digitizing rule. Home users do not need archival settings for every scan, but the same idea applies: useful detail comes from true optical capture, not inflated numbers.

What Resolution, Color, And File Type Change

Resolution controls how many samples the scanner records per inch. A 300 ppi scan is often enough for receipts, contracts, forms, and typed pages. Photos often benefit from 600 ppi, especially when you plan to crop or print later.

Higher settings are not always better. A 1200 ppi scan of a plain invoice may only create a huge file. The National Archives has also warned that ppi alone does not prove quality; optical capture, signal processing, and handling still matter.

Color mode changes file size and detail. Black-and-white mode works for clean typed text. Grayscale keeps pencil marks, stamps, and shaded areas. Full color is the right pick for photos, artwork, IDs, annotated pages, and anything where color carries meaning.

File type changes how the scan behaves later. PDF is handy for multipage records. JPEG keeps file size small for casual photo sharing, but it can add compression marks. TIFF preserves more image data and is common in archival work. The Library of Congress personal scanning sheet advises cleaning the scanner glass and preparing documents before scanning, which is still one of the simplest ways to improve output.

Scanner Part What It Does What Can Go Wrong
Glass plate Holds the page flat at a fixed distance from the scan head. Dust, fingerprints, and tape marks create streaks or spots.
Light source Shines even light across the page so the sensor can read reflection. Weak or uneven light can cause dull color and shadowed edges.
CCD or CIS sensor Changes reflected light into electrical values for each pixel. Low-grade sensors may lose fine lines, texture, or pale marks.
Lens and mirrors Guide light to the sensor in many CCD designs. Misalignment can soften detail or add distortion.
Carriage motor Moves the scan head under the page at a steady pace. Jerky motion can create bands, skew, or stretched shapes.
Rollers Pull sheets through a document feeder. Worn rollers can cause jams, double feeds, or crooked scans.
Driver software Controls ppi, color mode, crop, file type, and cleanup tools. Poor settings can create huge files, muddy text, or missing edges.
OCR engine Turns scanned letter shapes into selectable text. Smudges, tiny fonts, and crooked pages increase mistakes.

Why Optical Resolution Beats Inflated Numbers

Scanner boxes often show optical and interpolated resolution. Optical resolution is what the hardware can truly capture. Interpolated resolution is software guesswork that adds extra pixels after capture.

For documents, optical quality matters more than a huge setting. Sharp edges help OCR. Even lighting keeps background noise low. Straight feeding keeps letters in their proper places. These traits save more time than chasing the largest ppi number.

Scan Job Good Setting Best File Type
Typed form or contract 300 ppi, grayscale or black-and-white PDF
Receipt 300 ppi, grayscale PDF or JPEG
Family photo 600 ppi, color TIFF or JPEG
Artwork or print 600 ppi or higher, color TIFF
Text for OCR 300 ppi, clean grayscale Searchable PDF

How OCR Changes A Scan Into Text

OCR stands for optical character recognition. It comes after the image is captured. The OCR engine finds letter shapes, compares them with known patterns, then builds words and lines from those matches.

Clean source pages help OCR more than people expect. Straight text, strong contrast, and plain fonts raise accuracy. Curved book pages, coffee stains, glossy paper, and handwritten notes make the software guess.

That is why two scans of the same page can behave differently. A sharp image PDF may still be hard to search if OCR was not run. A searchable PDF has an image layer you see and a text layer the computer can select and copy.

How To Get Cleaner Scans At Home

Small habits make a clear difference. Before scanning, wipe the glass with a lint-free cloth. Line up the page with the corner guides. Close the lid gently so the paper does not slide.

  • Use 300 ppi for most paperwork and 600 ppi for photos.
  • Pick grayscale for faded receipts, pencil, stamps, and low-contrast text.
  • Scan photos in color, even old black-and-white prints, to retain tone.
  • Turn on deskew if pages enter crooked through a feeder.
  • Save multipage documents as PDF so pages stay together.
  • Use TIFF for scans you may edit or archive.

Common Scanner Problems And Fixes

Vertical streaks usually point to dust, glue, or correction fluid on the glass strip used by the feeder. Clean that narrow strip, not just the main flatbed glass. Repeating dots often come from debris on the source page.

Blurry text can mean the page is not flat, the ppi setting is too low, or the scanner is using a light cleanup filter that smears edges. If the file is too large, lower ppi, use grayscale instead of color, or save routine documents as PDF instead of TIFF.

What Happens After You Press Scan

The full process is neat: light hits the page, the sensor measures reflection, electronics convert signals into numbers, software builds rows of pixels, and the driver saves the result. OCR may then add searchable text.

Once you know that chain, scanner settings make more sense. Clean glass, steady paper movement, true optical detail, and the right file type do most of the work. The scanner is a light reader; good scans come from giving that reader a clean page and sensible settings.

References & Sources