pctechguide.com

Scanner File Formats

The format in which a scanned image is saved can have a significant effect on file size. File size is an important consideration in image and document scanning, since the high resolutions supported by many modern scanners can produce image files as large as 30MB for a single A4 page.
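
As a rough check on that figure, the uncompressed size of a scan follows from the page dimensions, the scan resolution and the colour depth. The sketch below is illustrative only: the function name and the 300 dpi resolution are assumptions, not values taken from the text above.

```python
MM_PER_INCH = 25.4

def uncompressed_scan_bytes(width_mm, height_mm, dpi, bits_per_pixel):
    """Estimate the raw size of a scan in bytes (no headers, no row padding)."""
    width_px = round(width_mm / MM_PER_INCH * dpi)
    height_px = round(height_mm / MM_PER_INCH * dpi)
    return width_px * height_px * bits_per_pixel // 8

# An A4 page is 210mm x 297mm; 300 dpi and 24-bit colour are assumed here.
a4_colour = uncompressed_scan_bytes(210, 297, 300, 24)
print(a4_colour)  # 26099520 bytes, roughly 26MB
```

Doubling the resolution quadruples the pixel count, which is why high-resolution scans grow so quickly.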

Windows bitmap (BMP) files are the largest, since they store the image either in full colour without compression or in 256 colours with simple run-length encoding (RLE) compression. Older versions of Windows required wallpaper images to be saved in BMP format, but for most other purposes the format can be avoided.
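
The idea behind run-length encoding can be sketched in a few lines. This is a simplified illustration that stores (run length, value) pairs; it is not the exact byte layout used inside BMP files, and the function names are hypothetical.

```python
def rle_encode(pixels):
    """Collapse runs of identical palette indices into (run_length, value) pairs."""
    runs = []
    for value in pixels:
        # Extend the current run if the value repeats (runs are capped at 255).
        if runs and runs[-1][1] == value and runs[-1][0] < 255:
            runs[-1] = (runs[-1][0] + 1, value)
        else:
            runs.append((1, value))
    return runs

def rle_decode(runs):
    """Expand (run_length, value) pairs back into the original sequence."""
    pixels = []
    for count, value in runs:
        pixels.extend([value] * count)
    return pixels

row = [7, 7, 7, 7, 2, 2, 9]
encoded = rle_encode(row)
print(encoded)  # [(4, 7), (2, 2), (1, 9)]
```

RLE works well on images with large areas of flat colour and poorly on noisy photographic scans, where runs are short.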

Tagged image file format (TIFF) files are the most flexible, since they can store images in RGB mode for screen display or in CMYK for printing. TIFF also supports LZW compression, which can reduce the file size significantly without any loss of quality. LZW derives from two techniques introduced by Abraham Lempel and Jacob Ziv in 1977 and 1978: LZ77 creates pointers back to repeating data, while LZ78 creates a dictionary of repeating phrases with pointers to those phrases. LZW is a refinement of the dictionary-based LZ78 approach, published by Terry Welch in 1984 (the resulting patent was held by Unisys).
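
The dictionary-building idea behind LZW can be illustrated with a minimal sketch. It shows the textbook algorithm only: real TIFF encoders also pack the codes into variable-width bit fields and use reserved control codes, both of which are omitted here, and the function names are hypothetical.

```python
def lzw_compress(data):
    """Return a list of integer codes for the given bytes."""
    dictionary = {bytes([i]): i for i in range(256)}  # all single bytes
    next_code = 256
    current = b""
    codes = []
    for byte in data:
        candidate = current + bytes([byte])
        if candidate in dictionary:
            current = candidate  # keep growing the matched phrase
        else:
            codes.append(dictionary[current])
            dictionary[candidate] = next_code  # learn the new phrase
            next_code += 1
            current = bytes([byte])
    if current:
        codes.append(dictionary[current])
    return codes

def lzw_decompress(codes):
    """Rebuild the original bytes from a list of LZW codes."""
    if not codes:
        return b""
    dictionary = {i: bytes([i]) for i in range(256)}
    next_code = 256
    previous = dictionary[codes[0]]
    output = [previous]
    for code in codes[1:]:
        if code in dictionary:
            entry = dictionary[code]
        elif code == next_code:
            # Phrase was defined by the encoder one step ahead of the decoder.
            entry = previous + previous[:1]
        else:
            raise ValueError("invalid LZW code")
        output.append(entry)
        dictionary[next_code] = previous + entry[:1]
        next_code += 1
        previous = entry
    return b"".join(output)

sample = b"TOBEORNOTTOBEORTOBEORNOT"
codes = lzw_compress(sample)
assert lzw_decompress(codes) == sample  # lossless round trip
```

Because the decoder rebuilds the same dictionary from the code stream, no dictionary needs to be stored in the file, and the original data is recovered exactly.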

CompuServe’s graphics interchange format (GIF) stores images using indexed colour. Up to 256 colours are available in each image, although the choice of colours can differ from image to image. A table of RGB values for each index colour is stored at the start of the image file. GIFs tend to be smaller than most other file formats because of this reduced colour depth, combined with LZW compression of the index data, making them a good choice for web-published material.
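
Indexed colour can be sketched as follows: each distinct RGB value is stored once in a colour table, and every pixel becomes a one-byte index into that table. This is a simplified illustration with a hypothetical function name; it does not reduce an image that has more than 256 colours, which a real GIF encoder would have to do first.

```python
def to_indexed(pixels):
    """Convert a list of (r, g, b) tuples into (palette, indices)."""
    palette = []   # colour table: index -> (r, g, b)
    lookup = {}    # reverse map: (r, g, b) -> index
    indices = []
    for rgb in pixels:
        if rgb not in lookup:
            if len(palette) == 256:
                raise ValueError("more than 256 distinct colours")
            lookup[rgb] = len(palette)
            palette.append(rgb)
        indices.append(lookup[rgb])
    return palette, indices

pixels = [(255, 0, 0), (0, 0, 255), (255, 0, 0)]
palette, indices = to_indexed(pixels)
print(palette)  # [(255, 0, 0), (0, 0, 255)]
print(indices)  # [0, 1, 0]
```

Each pixel now needs one byte instead of three, at the cost of storing the colour table once per image.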

The PC Paintbrush (PCX) format has fallen into disuse, but offers compression at 24-bit colour depth.

The JPEG file format uses lossy compression and can achieve small file sizes at 24-bit colour depth. The level of compression, and hence the amount of data loss, can be selected, but even at the maximum quality setting JPEG loses some detail and is therefore really only suitable for images intended for on-screen viewing. The number of compression levels available depends on the image editing software being used.

Unless there is a need to preserve colour information from the original document, images stored for subsequent OCR processing are best scanned in greyscale. An 8-bit greyscale scan uses a third of the space of a 24-bit RGB colour scan. An alternative is to scan in line art mode (black and white with no greyscales), but this often loses detail, reducing the accuracy of the subsequent OCR process.
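
The difference between the three scan modes can be sketched in code. The luminance weights and the threshold of 128 below are common conventions chosen for illustration; they are assumptions, not values given in the text, and a scanner may use different ones.

```python
def to_greyscale(rgb_pixels):
    """Reduce (r, g, b) pixels to one 8-bit grey value each (assumed weights)."""
    return [round(0.299 * r + 0.587 * g + 0.114 * b) for r, g, b in rgb_pixels]

def to_line_art(grey_pixels, threshold=128):
    """Reduce grey values to 1-bit black (0) or white (1) using a fixed threshold."""
    return [1 if grey >= threshold else 0 for grey in grey_pixels]

rgb = [(255, 255, 255), (0, 0, 0), (128, 128, 128)]
grey = to_greyscale(rgb)
print(grey)               # [255, 0, 128]
print(to_line_art(grey))  # [1, 0, 1]

# Storage per pixel: 3 bytes for RGB, 1 byte for greyscale, 1 bit for line art.
print(len(rgb) * 3, len(grey) * 1)  # 9 3
```

Thresholding discards every intermediate grey level, which is how faint or thin strokes can disappear in line art mode.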

The table below illustrates the relative file sizes that can be achieved by the different file formats in storing a native 1MB image, and also indicates the colour depth supported:

File format                            Image size   No. of colours
BMP – RGB                              1MB          16.7 million
BMP – RLE                              83KB         256
GIF                                    31KB         256
JPEG – min. compression                185KB        16.7 million
JPEG – min. progressive compression    150KB        16.7 million
JPEG – max. compression                20KB         16.7 million
JPEG – max. progressive compression    16KB         16.7 million
PCX                                    189KB        16.7 million
TIFF                                   1MB          16.7 million
TIFF – LZW compression                 83KB         16.7 million
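
The compression ratios implied by these figures can be worked out directly. The short sketch below assumes that 1MB means 1024KB; the variable and function names are illustrative.

```python
NATIVE_KB = 1024  # the native 1MB image, assuming 1MB = 1024KB

# Compressed sizes in KB, taken from the table.
sizes_kb = {
    "BMP - RLE": 83,
    "GIF": 31,
    "JPEG - min. compression": 185,
    "JPEG - min. progressive compression": 150,
    "JPEG - max. compression": 20,
    "JPEG - max. progressive compression": 16,
    "PCX": 189,
    "TIFF - LZW compression": 83,
}

def compression_ratio(size_kb):
    """Return how many times smaller the file is than the native image."""
    return NATIVE_KB / size_kb

for name, size in sizes_kb.items():
    print(f"{name}: {compression_ratio(size):.1f}:1")
```

On these figures the lossless 24-bit options (TIFF with LZW, PCX) reduce the file to between a twelfth and a fifth of its native size, while JPEG at maximum compression reaches 64:1 at the cost of detail.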

Filed Under: Scanners

Copyright © 2025