Physics of failure


Physics of failure is a technique under the practice of reliability design that leverages the knowledge and understanding of the processes and mechanisms that induce failure to predict reliability and improve product performance.
Other definitions of Physics of Failure include:
The concept of Physics of Failure, also known as Reliability Physics, involves the use of degradation algorithms that describe how physical, chemical, mechanical, thermal, or electrical mechanisms evolve over time and eventually induce failure.
While the concept of Physics of Failure is common in many structural fields, the specific branding evolved from an attempt to better predict the reliability of early generation electronic parts and systems.

The beginning

Within the electronics industry, the major driver for the implementation of Physics of Failure was the poor performance of military weapon systems during World War II. During the subsequent decade, the United States Department of Defense funded an extensive amount of effort to especially improve the reliability of electronics, with the initial efforts focused on after-the-fact or statistical methodology. Unfortunately, the rapid evolution of electronics, with new designs, new materials, and new manufacturing processes, tended to quickly negate approaches and predictions derived from older technology. In addition, the statistical approach tended to lead to expensive and time-consuming testing. The need for different approaches led to the birth of Physics of Failure at the Rome Air Development Center. Under the auspices of the RADC, the first Physics of Failure in Electronics Symposium was held in September 1962. The goal of the program was to relate the fundamental physical and chemical behavior of materials to reliability parameters.

Early history – integrated circuits

The initial focus of physics of failure techniques tended to be limited to degradation mechanisms in integrated circuits. This was primarily because the rapid evolution of the technology created a need to capture and predict performance several generations ahead of existing product.
One of the first major successes under predictive physics of failure was a formula developed by James Black of Motorola to describe the behavior of electromigration. Electromigration occurs when collisions of electrons cause metal atoms in a conductor to dislodge and move downstream of current flow. Black used this knowledge, in combination with experimental findings, to describe the failure rate due to electromigration as
where A is a constant based on the cross-sectional area of the interconnect, J is the current density, Ea is the activation energy, k is the Boltzmann constant, T is the temperature and n is a scaling factor.
Physics of failure is typically designed to predict wearout, or an increasing failure rate, but this initial success by Black focused on predicting behavior during operational life, or a constant failure rate. This is because electromigration in traces can be designed out by following design rules, while electromigration at vias are primarily interfacial effects, which tend to be defect or process-driven.
Leveraging this success, additional physics-of-failure based algorithms have been derived for the three other major degradation mechanisms in modern integrated circuits. More recent work has attempted to aggregate these discrete algorithms into a system-level prediction.
TDDB: τ = τo exp
where τo = 5.4*10-7 exp, G = 120 + 5.8/kT, and εox is the permittivity.
HCI: λHCI = A3 expexp
where λHCI is the failure rate of HCI, A3 is an empirical fitting parameter, β is an empirical fitting parameter, VD is the drain voltage, Ea is the activation energy of HCI, typically −0.2 to −0.1eV, k is Boltzmann's constant, and T is temperature in Kelvin.
NBTI: λ = A εoxm VTμp exp
where A is determined empirically by normalizing the above equation, m = 2.9, VT is the thermal voltage, μP is the surface mobility constant, Ea is the activation energy of NBTI, k i s Boltzmann's constant, and T is the temperature in Kelvin

Next stage – Electronic packaging

The resources and successes with integrated circuits, and a review of some of the drivers of field failures, subsequently motivated the reliability physics community to initiate physics of failure investigations into package-level degradation mechanisms. An extensive amount of work was performed to develop algorithms that could accurately predict the reliability of interconnects. Specific interconnects of interest resided at 1st level, 2nd level, and 3rd level.
Just as integrated circuit community had four major successes with physics of failure at the die-level, the component packaging community had four major successes arise from their work in the 1970s and 1980s. These were
Peck: Predicts time to failure of wire bond / bond pad connections when exposed to elevated temperature / humidity
where A is a constant, RH is the relative humidity, f is a voltage function, Ea is the activation energy, KB is Boltzmann's constant, and T is temperature in Kelvin.
Engelmaier: Predicts time to failure of solder joints exposed to temperature cycling
where εf is a fatigue ductility coefficient, c is a time and temperature dependent constant, F is an empirical constant, LD is the distance from the neutral point, α is the coefficient of thermal expansion, ΔT is the change in temperature, and h is solder joint thickness.
Steinberg: Predicts time to failure of solder joints exposed to vibration
where Z is maximum displacement, PSD is the power spectral density, fn is the natural frequency of the CCA, Q is transmissibility, Zc is the critical displacement, B is the length of PCB edge parallel to component located at the center of the board, c is a component packaging constant, h is PCB thickness, r is a relative position factor, and L is component length.
IPC-TR-579: Predicts time to failure of plated through holes exposed to temperature cycling
where a is coefficient of thermal expansion, T is temperature, E is elastic modules, h is board thickness, d is hole diameter, t is plating thickness, and E and Cu correspond to board and copper properties, respectively, Su being the ultimate tensile strength and Df being ductility of the plated copper, and De is the strain range.
Each of the equations above uses a combination of knowledge of the degradation mechanisms and test experience to develop first-order equations that allow the design or reliability engineer to be able to predict time to failure behavior based on information on the design architecture, materials, and environment.

Recent work

More recent work in the area of physics of failure has been focused on predicting the time to failure of new materials, software programs, using the algorithms for prognostic purposes, and integrating physics of failure predictions into system-level reliability calculations.

Limitations

There are some limitations with the use of physics of failure in design assessments and reliability prediction. The first is physics of failure algorithms typically assume a ‘perfect design’. Attempting to understand the influence of defects can be challenging and often leads to Physics of Failure predictions limited to end of life behavior. In addition, some companies have so many use environments that performing a PoF assessment for each potential combination of temperature / vibration / humidity / power cycling / etc. would be onerous and potentially of limited value.