pctechguide.com

  • Home
  • Guides
  • Tutorials
  • Articles
  • Reviews
  • Glossary
  • Contact

Scanner File Formats

The format in which a scanned image is saved can have a significant effect on file size – and file size is an important consideration when doing some image and document scanning, since the high resolutions supported by many modern scanners can result in the creation of image files as large as 30MB for an A4 page.

Windows bitmap (BMP) files are the largest, since they store the image in full colour without compression or in 256 colours with simple run-length encoding (RLE) compression. Images to be used as Windows wallpaper have to be saved in BMP format, but for most other cases it can be avoided.

Tagged image file format (TIFF) files are the most flexible, since they can store images in RGB mode for screen display, or CMYK for printing. TIFF also supports LZW compression, which can reduce the file size significantly without any loss of quality. This is based on two techniques introduced by Jacob Ziv and Abraham Lempel in 1977 and subsequently refined by Unisys researcher Terry Welch. LZ77 creates pointers back to repeating data, and LZ78 creates a dictionary of repeating phrases with pointers to those phrases.

CompuServe’s graphics interchange format (GIF) stores images using indexed colour. A total of 256 colours are available in each image, although what these colours are can change from image to image. A table of RGB values for each index colour is stored at the start of the image file. GIFs tend to be smaller than most other file formats because of this decreased colour depth, making them a good choice for use in WWW-published material.

The PC Paintbrush (PCX) format has fallen into disuse, but offers a compressed format at 24-bit colour depth. The JPEG file format uses lossy compression and can achieve small file sizes at 24-bit colour depth. The level of compression can be selected – and hence the amount of data loss – but even at the maximum quality setting JPEG loses some detail and is therefore only really suitable for viewing images on-line. The number of levels of compression available depends on the image editing software being used.

Unless there is a need to preserve colour information from the original document, images stored for subsequent OCR processing are best scanned in greyscale. This uses a third of the space of an RGB colour scan. An alternative is to scan in Line art mode – black and white with no greyscales – but this often loses detail, reducing the accuracy of the subsequent OCR process.

The table below illustrates the relative file sizes that can be achieved by the different file formats in storing a native 1MB image, and also indicates the colour depth supported:

File format Image size No. of colours
BMP – RGB 1MB 16.7 million
BMP -RLE 83KB 256
GIF 31KB 256
JPEG – min. compression 185KB 16.7 million
JPEG – min. progressive compression 150KB 16.7 million
JPEG – max. compression 20KB 16.7 million
JPEG – max. progressive compression 16KB 16.7 million
PCX 189KB 16.7 million
TIFF 1MB 16.7 million
TIFF – LZW compression 83KB 16.7 million
  • Scanner Operation
  • PMT Scanners
  • CCD Scanners
  • CIS Scanners
  • Scan Resolution
  • Scanner Interpolation
  • Color Scanners
  • Bit-Depth Printers
  • Dynamic Range Scanners
  • Scan Modes
  • Scanner File Formats
  • TWAIN Drivers
  • Color Calibration
  • OCR Technology
  • Photo Retouching

Filed Under: Scanners

Latest Articles

What is L2 (Level 2) cache memory?

Most PCs are offered with a Level 2 cache to bridge the processor/memory performance gap. Level 2 cache - also referred to as secondary cache) uses the same control logic as Level 1 cache and is also implemented in SRAM. Level 2 cache typically comes in two sizes, 256KB or 512KB, and … [Read More...]

Defragmenting the Hard Drive with Windows XP

In older file system architectures, if a file could not be stored contiguously, it could not be saved to the disk. Newer architectures intentionally divide files into multiple pieces so as to make more efficient use of disk storage space. Since files are constantly being written, deleted, … [Read More...]

Dye Sublimation Printers

For many years dye-sublimation printers were specialist devices widely used in demanding graphic arts and photographic applications. The advent of digital photography led to the technology entering the mainstream, forming the basis of many of the standalone, portable photo printers that emerged in … [Read More...]

20 Cool Creative Commons Photographs About the Future of AI

AI technology is starting to have a huge impact on our lives. The market value for AI is estimated to have been worth $279.22 billion in 2024 and it … [Read More...]

13 Impressive Stats on the Future of AI

AI technology is starting to become much more important in our everyday lives. Many businesses are using it as well. While he has created a lot of … [Read More...]

Graphic Designers on Reddit Share their Views of AI

There are clearly a lot of positive things about AI. However, it is not a good thing for everyone. One of the things that many people are worried … [Read More...]

Redditors Talk About the Impact of AI on Freelance Writers

AI technology has had a huge impact on our lives. A 2023 survey by Pew Research found that 56% of people use AI at least once a day or once a week. … [Read More...]

11 Most Popular Books on Perl Programming

Perl is not the most popular programming language. It has only one million users, compared to 12 million that use Python. However, it has a lot of … [Read More...]

10 Exceptional Books on ChatGPT that Will Blow Your Mind

ChatGPT is a powerful new AI tool that is taking the world by storm. You are going to find a lot of amazing books that will teach you how to make the … [Read More...]

Guides

  • Computer Communications
  • Mobile Computing
  • PC Components
  • PC Data Storage
  • PC Input-Output
  • PC Multimedia
  • Processors (CPUs)

Recent Posts

AMD 3DNow

With the launch of K6-2, in May 1998, AMD stole something of a march on Intel, whose similar Katmai technology was not due for release until up to a … [Read More...]

Dot Trio Monitors

The vast majority of computer monitors use circular blobs of phosphor and arrange them in triangular … [Read More...]

Correct the 401 Unauthorized Error

The 401 Error is a common HTML Error Code you may encounter while surfing on the Internet. The error is usually shown because the page you are trying … [Read More...]

[footer_backtotop]

Copyright © 2025 About | Privacy | Contact Information | Wrtie For Us | Disclaimer | Copyright License | Authors