As the semiconductor industry pushes beyond the 5nm process node toward 3nm and the emerging 2nm "Angstrom era," the fundamental physics of chip manufacturing is undergoing a radical transformation. This transition has triggered what experts describe as an "explosion in defectivity," a phenomenon where the number of potential anomalies detected on a single silicon wafer has scaled from thousands to millions. This massive influx of data has created a critical bottleneck in the fabrication process, necessitating a paradigm shift in how semiconductor manufacturers identify, classify, and mitigate defects.
Prasad Bachiraju, senior director of business development at Onto Innovation, recently highlighted the growing disparity between the sheer volume of detected anomalies and the subset of those that actually impact device performance. The challenge facing modern foundries is no longer just finding defects, but rapidly determining which are "killer defects"—those that cause electrical failure—and which are "nuisance defects" resulting from benign process variations or background noise.
The Scaling Crisis: Why Defects are Proliferating
The move to advanced nodes has introduced complexities that were non-existent a decade ago. At 28nm or 45nm, features were large enough that minor variations in material deposition or lithography were often negligible. However, at the 3nm node and below, where feature sizes are measured in atoms, the margin for error has virtually disappeared.
Several factors contribute to this explosion in defectivity. First, the introduction of Extreme Ultraviolet (EUV) lithography, while revolutionary, has introduced its own set of challenges, including stochastic effects—random variations in photon absorption that can lead to "micro-bridges" or broken lines. Second, the transition from FinFET (Fin Field-Effect Transistor) architectures to GAAFET (Gate-All-Around Field-Effect Transistor) involves more complex vertical stacking and high-aspect-ratio etching. Every additional layer and every new material interface represents a potential site for a defect.
Furthermore, the industry is increasingly utilizing multi-patterning techniques and advanced packaging, such as 2.5D and 3D ICs. These processes require multiple touchpoints where wafers must be inspected. According to industry data, the number of inspection steps in a leading-edge logic process has increased by nearly 40% over the last three process generations.
The Data Deluge and the Nuisance Problem
The primary issue identified by Onto Innovation is the signal-to-noise ratio in modern metrology. As inspection tools become more sensitive to capture smaller defects, they also capture more "noise." This noise can stem from surface roughness, graininess in the metal layers, or subtle variations in the chemical-mechanical planarization (CMP) process that do not actually affect the chip’s functionality.

Bachiraju notes that a modern inspection tool can identify millions of points of interest on a single 300mm wafer. If a fab were to use traditional manual review methods for these millions of points, the time to yield (TTY) would be measured in years rather than months. In a high-volume manufacturing environment, speed is as critical as accuracy. A delay in identifying a systemic defect in the line can lead to the loss of thousands of wafers, representing hundreds of millions of dollars in wasted materials and lost opportunity.
Chronology of Inspection Evolution
The evolution of defect management can be categorized into four distinct eras:
- The Manual Era (Pre-2000s): Operators used high-powered optical microscopes to manually inspect samples of wafers. Defect density was relatively low, and process windows were wide.
- The Automated Optical Era (2000–2010): As nodes shrunk to 90nm and 65nm, Automated Optical Inspection (AOI) tools became standard. These tools used simple algorithms to compare a die against its neighbor (die-to-die) or a golden database (die-to-database).
- The Statistical Era (2010–2018): With the rise of FinFET at 22nm and 16/14nm, the number of defects began to climb. Fabs relied heavily on statistical process control (SPC) to filter out noise, focusing only on the most egregious outliers.
- The AI and Edge Era (2019–Present): At 7nm and below, statistical models alone are insufficient. The industry has moved toward deep learning and artificial intelligence integrated directly into the inspection hardware—often referred to as "inference at the edge."
Technical Solutions: Multi-Modal Imaging and AI at the Edge
To combat the explosion of data, Onto Innovation and other metrology leaders are deploying multi-modal imaging techniques. Instead of relying on a single light source, modern tools use various illumination modes—such as brightfield, darkfield, and oblique lighting—simultaneously or in rapid succession. Each mode reveals different characteristics of the wafer surface.
"By capturing different types of images using different illumination modes at different touchpoints," Bachiraju explains, "we can gather a more holistic view of the defect’s morphology."
Once these images are captured, the data is processed using AI and machine learning algorithms. This is where "edge computing" becomes vital. In a semiconductor fab, the sheer volume of raw image data is too massive to be uploaded to a central cloud or a distant server for analysis without causing significant latency. By performing AI classification at the "edge"—within the inspection tool itself—the system can filter out nuisance errors in real-time.
Machine learning models are trained to recognize the "fingerprints" of specific defect types. For example, a "bridge" defect between two metal lines has a distinct visual signature compared to a harmless speck of surface dust. By automatically classifying millions of detections into "critical" and "non-critical" categories, the system allows engineers to focus their attention on the 1% of defects that actually threaten yield.
Industry Implications and Economic Stakes
The economic stakes of defect management have never been higher. A modern 3nm fab, such as those operated by TSMC in Taiwan or Samsung in South Korea, costs upwards of $20 billion to construct. To achieve a return on this investment, these facilities must reach high-volume production with yields exceeding 80% as quickly as possible.

Industry analysts at Gartner and VLSI Research suggest that for every 1% increase in yield at an advanced node, a foundry can realize an additional $100 million to $150 million in annual revenue. Conversely, a "yield excursion"—a sudden drop in yield due to an undetected defect—can cost a company millions of dollars per day.
The shift toward AI-driven defectivity management is also a response to the global talent shortage. There are simply not enough highly trained yield engineers to manually analyze the data generated by a modern fab. AI acts as a force multiplier, allowing a small team of engineers to manage a facility that produces tens of thousands of wafers per month.
Official Responses and Strategic Shifts
Major players in the semiconductor ecosystem have echoed the sentiments expressed by Onto Innovation. KLA Corporation, the market leader in process control, has increasingly focused on "simultaneous multi-channel" detection. Similarly, Applied Materials and ASML have integrated more sensors and "e-beam" (electron beam) inspection capabilities to supplement optical tools, as e-beam provides the higher resolution necessary to see the smallest sub-surface defects.
In recent earnings calls, executives from leading foundries have emphasized that "metrology and inspection" are no longer seen as overhead costs but as strategic enablers. Without advanced defectivity management, the transition to the next node (2nm) would be economically unfeasible.
Fact-Based Analysis: The Path Forward
The "explosion in defectivity" is not a temporary hurdle but a permanent feature of the nanometer-scale manufacturing landscape. As the industry moves toward High-NA (High Numerical Aperture) EUV and backside power delivery, the complexity of the wafer surface will only increase.
The analysis of current trends suggests three primary directions for the future of defect management:
- Digital Twins of the Process: Fabs will increasingly use "digital twins"—virtual models of the manufacturing process—to predict where defects are likely to occur based on real-time sensor data from the lithography and etch tools.
- In-Situ Metrology: Rather than moving a wafer to a separate inspection station, more sensors will be integrated directly into the processing tools (lithography, deposition, etch) to catch defects the moment they are created.
- Cross-Fab Collaboration: As the industry consolidates, there is a growing trend of "copy-exactly" strategies where defect data is shared across different geographic locations of the same company to ensure that a solution found in a fab in Oregon can be immediately applied to a fab in Ireland or Israel.
In conclusion, the ability to rapidly distinguish between critical and non-critical defects is the new frontier of semiconductor competition. As Prasad Bachiraju and the team at Onto Innovation have noted, the solution lies in the intelligent application of AI at the edge. By turning a "data explosion" into "actionable intelligence," the semiconductor industry can continue its march toward smaller, faster, and more efficient chips, maintaining the pace of Moore’s Law despite the daunting physical challenges of the atomic scale.
