pctechguide.com

  • Home
  • Guides
  • Tutorials
  • Articles
  • Reviews
  • Glossary
  • Contact

Hard Disk SMART Drives

In 1992, IBM began shipping 3.5-inch hard disk drives that could actually predict their own failure – an industry first. These drives were equipped with Predictive Failure Analysis (PFA), an IBM-developed technology that periodically measures selected drive attributes – things like head-to-disk flying height – and sends a warning message when a predefined threshold is exceeded. Industry acceptance of PFA technology eventually led to SMART (Self-Monitoring, Analysis and Reporting Technology) becoming the industry-standard reliability prediction indicator for both IDE/ATA and SCSI hard disk drives.

There are two kinds of hard disk drive failures: unpredictable and predictable. Unpredictable failures happen quickly, without advance warning. These failures can be caused by static electricity, handling damage, or thermal-related solder problems, and there is nothing that can be done to predict or avoid them. In fact, 60% of drive failures are mechanical, often resulting from the gradual degradation of the drive’s performance. The key vital areas include:

  • Heads/head assembly: crack on head, broken head, head contamination, head resonance, bad connection to electronics module, handling damage
  • Motors/bearings: motor failure, worn bearing, excessive run out, no spin, handling damage
  • Electronic module: circuit/chip failure, bad connection to drive or bus, handling damage
  • Media: scratch, defect, retries, bad servo, ECC corrections, handling damage.

These have been well explored over the years and have led to disk drive designers being able to not only develop more reliable products, but to also apply their knowledge to the prediction of device failures. Through research and monitoring of vital functions, performance thresholds which correlate to imminent failure have be determined, and it is these types of failure that SMART attempts to predict.

Just as hard disk drive architecture varies from one manufacturer to another, so SMART-capable drives use a variety of different techniques to monitor data availability. For example, a SMART drive might monitor the fly height of the head above the magnetic media. If the head starts to fly too high or too low, there’s a good chance the drive could fail. Other drives may monitor additional or different conditions, such as ECC circuitry on the hard drive card or soft error rates. When impending failure is suspected the drives sends an alert through the operating system to an application that displays a warning message.

A head crash is one of the most catastrophic types of hard disk failure and – since the height at which a head flies above the surface of the media has decreased steadily over the years as one of the means to increase areal recording densities, and thereby disk storage capacities – it might reasonably be expected to be an increasingly likely form of failure. Fortunately, this is not the case, since flying height has always been one of the most critical parameters for disk drive reliability and as this has steadily decreased, so the techniques used to predict head crashes have become progressively more sophisticated. Not only are heads flying too low are in danger of crashing, but if the recording head flies higher than intended, even for a short period of time, the magnetic field available may be insufficient to reliably write to the media. This is referred to as a high fly write. External shock, vibration, media defect or contamination may cause this. Soft errors caused by this phenomenon are recoverable, but hard errors are not.

The fly height is controlled by the suspension attached to the slider containing the magnetic recording head and the airbearing of the slider. This aerodynamic system controls the variation in fly height as the slider is positioned over the surface of the media. Traversing the head between the inner and outer radius of the disk causes a two-to-one change in velocity. Prior to current technology in airbearing designs, this change in velocity would have created a two-to-one change in nominal fly height. However, with current day airbearing designs, this variation can be reduced to a fraction of the nominal value and fly heights – the distance between the read/write elements and the magnetic surface – are typically of the order of a few millionths of an inch and as low as 1.2 micro-inches. There are several conditions – for example, altitude, temperature, and contamination – that can create disturbances between the airbearing and the disk surface and potentially change the fly height.

S.M.A.R.T

Thermal monitoring is a more recently introduced aspect of SMART, designed to alert the host to potential damage from the drive operating at too high a temperature. In a hard drive, both electronic and mechanical components – such as actuator bearings, spindle motor and voice coil motor – can be affected by excessive temperatures. Possible causes include a clogged cooling fan, a failed room air conditioner or a cooling system that is simply overextended by too many drives or other components. Many SMART implementations use a thermal sensor to detect the environmental conditions that affect drive reliability – including ambient temperature, rate of cooling airflow, voltage and vibration – and issue a user warning when the temperature exceeds a pre-defined threshold – typically in the range 60-65°C).

The table below identifies a number of other failure conditions, their typical symptoms and causes and the various factors whose monitoring can enable impending failure to be predicted:

Type of Failure Symptom/Cause Predictor
Excessive bad sectors Growing defect list, media defects, handling damage Number of defects, growth rate
Excessive run-out Noisy bearings, motor, handling damage Run-out, bias force diagnostics
Excessive soft errors Crack/broken head, contamination High retries, ECC involves
Motor failure, bearings Drive not ready, no platter spin, handling damage Spin-up retries, spin-up time
Drive not responding, no connect Bad electronics module None, typically catastrophic
Bad servo positioning High servo errors, handling damage Seek errors, calibration retries
Head failure, resonance High soft errors, servo retries, handling damage Read error rate, servo error rate

In its brief history, SMART technology has progressed through three distinct iterations. In its original incarnation SMART provided failure prediction by monitoring certain online hard drive activities. A subsequent version improved failure prediction by adding an automatic off-line read scan to monitor additional operations. The latest SMART technology not only monitors hard drive activities but adds failure prevention by attempting to detect and repair sector errors. Also, whilst earlier versions of the technology only monitored hard drive activity for data that was retrieved by the operating system, this latest SMART tests all data and all sectors of a drive by using off-line data collection to confirm the drive’s health during periods of inactivity.

RAID

Up until the late 1990s, the implementation of RAID had been almost exclusively in the server domain. By then, however, processor speeds had reached the point where the hard disk was often the bottleneck that prevented a system running at its full potential. Aided and abetted by the availability of motherboards that included a RAID controller – by 2000 the deployment of RAID’s striping technique had emerged as a viable solution to this problem on high-end desktop systems.

  • Hard disk (hard drive) construction
  • Hard Disk (hard drive) Operation
  • Hard disk (hard drive) format – the tracks and sectors of the hard disk
  • File systems (FAT, FAT8, FAT16, FAT32 and NTFS) explained
  • Hard Disk (Hard Drive) Performance – transfer rates, latency and seek times
  • Hard Disk AV Capability
  • Hard Disk Capacity
  • Hard Disk Capacity Barriers
  • Hard Disk MR Technology
  • Hard Disk GMR Technology
  • Hard Disk Pixie Dust
  • Hard Disk Longitudinal Recording
  • Hard Disk Perpendicular Recording
  • RAID – Redundant Arrays of Inexpensive Disks
  • Hard Disk SMART Drives
  • Hard Disk MicroDrives
  • Hard Disk OAW Technology
  • Hard Disk PLEDM
  • Hard Disk Millipede
  • Guide to Western Digital’s GreenPower hard drive technology
  • Solid state hard drive (SSD) technology guide

Filed Under: Hard Disks

Latest Articles

What is Spear Phishing? Everything You Need to Know

Last summer, CSO published a very insightful article on the concept of spear phishing. This is a problem that even many experienced cybersecurity experts are still getting used to. Cybersecurity is becoming an increasingly important aspect of running an everyday business. Recent trends indicate … [Read More...]

USB Sound Cards

Swiss semiconductor company Micronas has developed a technology which could render the sound card obsolete on future multimedia PC systems. Its USB audio controller integrates a DSP, DAC, operation amplifier, and a USB controller into … [Read More...]

How to Remove Live Security Platinum

Live Security Platinum is a fake antivirus program.  It's the exact same as several other malware viruses.  We already have a guide on Smart Fortress 2012.  As this is the same virus threat we just point people to use that guide as the removal process is the exact same.  Visit Guide … [Read More...]

Gaming Laptop Security Guide: Protecting Your High-End Hardware Investment in 2025

Since Jacob took over PC Tech Guide, we’ve looked at how tech intersects with personal well-being and digital safety. Gaming laptops are now … [Read More...]

20 Cool Creative Commons Photographs About the Future of AI

AI technology is starting to have a huge impact on our lives. The market value for AI is estimated to have been worth $279.22 billion in 2024 and it … [Read More...]

13 Impressive Stats on the Future of AI

AI technology is starting to become much more important in our everyday lives. Many businesses are using it as well. While he has created a lot of … [Read More...]

Graphic Designers on Reddit Share their Views of AI

There are clearly a lot of positive things about AI. However, it is not a good thing for everyone. One of the things that many people are worried … [Read More...]

Redditors Talk About the Impact of AI on Freelance Writers

AI technology has had a huge impact on our lives. A 2023 survey by Pew Research found that 56% of people use AI at least once a day or once a week. … [Read More...]

11 Most Popular Books on Perl Programming

Perl is not the most popular programming language. It has only one million users, compared to 12 million that use Python. However, it has a lot of … [Read More...]

Guides

  • Computer Communications
  • Mobile Computing
  • PC Components
  • PC Data Storage
  • PC Input-Output
  • PC Multimedia
  • Processors (CPUs)

Recent Posts

Bestadblocker virus removal

Yes - Bestadblocker is malware.  By the name of the program you would think it would at least try and pretend to block ads but it provides in-line … [Read More...]

Using Antivirus Reviews to Protect Your System

If you type the word “antivirus” into a web browser, you'll see a nearly limitless list of products asking for your money in exchange for … [Read More...]

Transferring Data Old Computer To A New Computer

In this particular article, we will discuss a few methods of transferring data from an old computer to a new one. Some of these solutions will … [Read More...]

[footer_backtotop]

Copyright © 2025 About | Privacy | Contact Information | Wrtie For Us | Disclaimer | Copyright License | Authors