6
Full Wafer Nanotopography Analysis on Rough Surfaces Using Stitched White Light Interferometry Images Dirk Lewke, Martin Schellenberger, Lothar Pfitzner Fraunhofer Institute for Integrated Systems and Device Technology, IISB Erlangen, Germany [email protected] Thomas Fries, Bastian Tröger, Alexander Muehlig Fries Research & Technology GmbH, FRT Bergisch Gladbach, Germany Frank Riedel, Stefan Bauer, Hubert Wihr Siltronic AG Burghausen, Germany AbstractFeature sizes of transistors manufactured on silicon wafers in high volume reached 22 nm and will further decrease in the future. Superior wafer surface quality is mandatory to produce such small feature sizes. One relevant quality parameter is the nanotopography of the wafers surface. Applying dedicated state-of-the-art metrology systems, nanotopography is characterized end-of-line of wafer manufacturing on wafers with specular reflectance. In order to continuously improve wafer quality and identify nanotopography related features early, a measurement system capable of characterizing rough wafer surfaces is required. This paper presents a NT analysis approach using a white light interferometer (WLI) with a field of view of 85 x 85 mm 2 . An optimized stitching algorithm merges 16 individual WLI measurements to a height map of an entire 300 mm wafer. Nanotopography can be extracted at wafer edge exclusion below 1 mm by applying flexible Gaussian high-pass filters. A mathematical analysis of nanotopography characteristics according to SEMI M43 is implemented. Gauge repeatability and reproducibility studies provided standard deviations of less than 1 nm on wafer surfaces as-ground by using threshold height analysis (THA). This proves the measurement system’s capability for nanotopography analysis at early states of wafer manufacturing, which in turn can support improving wafer surface quality. Keywordsnanotopography; wafer manufacturing; stitching; optical measurement; Gaussian filtering I. INTRODUCTION Current device feature sizes in the semiconductor industry reached 22 nm and will further decrease in the near future [1]. To manufacture such small features, the flatness of silicon wafers has to be of very high quality. One approach of characterizing this quality is by determining the so-called nanotopography (NT) [2] of wafers at the end-of-line (EoL) of wafer manufacturing [3]. To analyze NT, the topography of the wafer surface has to be measured and subsequently high-pass filtered in order to unravel only those topography contributions at short spatial wavelengths between 0.2 mm and 20 mm, which typically exhibit amplitudes of few to tens nm [4][5][6]. Most state-of-the-art optical measurement systems analyze NT of surfaces with specular reflectance like those of, e. g., polished wafers [3]. Surface defects contributing to NT, which are generated at early stages of wafer manufacturing, may thus be detected EoL only. For enabling fast feedback loops and process optimization, a monitoring immediately after relevant manufacturing steps is desirable. Benefits are: Identification of critical, NT related features on wafer surfaces at an early stage of wafer manufacturing. Continuous improvement of relevant wafering, grinding, and lapping processes. Fig. 1. The FRT multi sensor metrology tool performing nanotopography analysis. This R&D work has been performed within the framework of the European project SEAL (Contract Number: INFSO-ICT-257379). The consortium acknowledges the European Commission for the support. 243 978-1-4673-5007-5/13/$31.00 ©2013 IEEE ASMC 2013

[IEEE 2013 24th Annual SEMI Advanced Semiconductor Manufacturing Conference (ASMC 2013) - Saratoga Springs, NY (2013.5.14-2013.5.16)] ASMC 2013 SEMI Advanced Semiconductor Manufacturing

  • Upload
    hubert

  • View
    215

  • Download
    3

Embed Size (px)

Citation preview

Full Wafer Nanotopography Analysis on Rough

Surfaces Using Stitched White Light Interferometry

Images

Dirk Lewke, Martin Schellenberger, Lothar Pfitzner

Fraunhofer Institute for Integrated Systems

and Device Technology, IISB

Erlangen, Germany

[email protected]

Thomas Fries, Bastian Tröger, Alexander Muehlig

Fries Research & Technology GmbH, FRT

Bergisch Gladbach, Germany

Frank Riedel, Stefan Bauer, Hubert Wihr

Siltronic AG

Burghausen, Germany

Abstract—Feature sizes of transistors manufactured on silicon

wafers in high volume reached 22 nm and will further decrease in

the future. Superior wafer surface quality is mandatory to

produce such small feature sizes. One relevant quality parameter

is the nanotopography of the wafers surface. Applying dedicated

state-of-the-art metrology systems, nanotopography is

characterized end-of-line of wafer manufacturing on wafers with

specular reflectance. In order to continuously improve wafer

quality and identify nanotopography related features early, a

measurement system capable of characterizing rough wafer

surfaces is required. This paper presents a NT analysis approach

using a white light interferometer (WLI) with a field of view of

85 x 85 mm2. An optimized stitching algorithm merges 16

individual WLI measurements to a height map of an entire

300 mm wafer. Nanotopography can be extracted at wafer edge

exclusion below 1 mm by applying flexible Gaussian high-pass

filters. A mathematical analysis of nanotopography

characteristics according to SEMI M43 is implemented. Gauge

repeatability and reproducibility studies provided standard

deviations of less than 1 nm on wafer surfaces as-ground by using

threshold height analysis (THA). This proves the measurement

system’s capability for nanotopography analysis at early states of

wafer manufacturing, which in turn can support improving

wafer surface quality.

Keywords—nanotopography; wafer manufacturing; stitching;

optical measurement; Gaussian filtering

I. INTRODUCTION

Current device feature sizes in the semiconductor industry reached 22 nm and will further decrease in the near future [1]. To manufacture such small features, the flatness of silicon wafers has to be of very high quality. One approach of characterizing this quality is by determining the so-called nanotopography (NT) [2] of wafers at the end-of-line (EoL) of wafer manufacturing [3]. To analyze NT, the topography of the wafer surface has to be measured and subsequently high-pass

filtered in order to unravel only those topography contributions at short spatial wavelengths between 0.2 mm and 20 mm, which typically exhibit amplitudes of few to tens nm [4][5][6].

Most state-of-the-art optical measurement systems analyze NT of surfaces with specular reflectance like those of, e. g., polished wafers [3]. Surface defects contributing to NT, which are generated at early stages of wafer manufacturing, may thus be detected EoL only. For enabling fast feedback loops and process optimization, a monitoring immediately after relevant manufacturing steps is desirable. Benefits are:

Identification of critical, NT related features on wafer surfaces at an early stage of wafer manufacturing.

Continuous improvement of relevant wafering, grinding, and lapping processes.

Fig. 1. The FRT multi sensor metrology tool performing nanotopography

analysis.

This R&D work has been performed within the framework of the European project SEAL (Contract Number: INFSO-ICT-257379). The

consortium acknowledges the European Commission for the support.

243978-1-4673-5007-5/13/$31.00 ©2013 IEEE ASMC 2013

This paper presents an approach dedicated for analyzing NT on rough surfaces like those of, e. g., ground or lapped wafers. The tool is based on the Fries Research & Technology FRT multi sensor metrology platform and equipped with a white light interferometer (WLI) system allowing for reliable topography measurements on rough surfaces, Fig. 1.

II. APPROACH FOR APPLYING NANOTOPOGRAPHY

ANALYSIS

Typical wafer geometry can essentially be subdivided into a bow, warp, NT, and micro-roughness regime. Micro-roughness is probed at very short spatial wavelength scale down to atomic distances, while bow and warp represent contributions to wafer geometry to be obtained at the opposite end of scale. NT is the non-planar deviation of a wafer surface at spatial wavelengths ranging from 0.2 mm to 20 mm [4][5][6], Fig. 2.

Fig. 2. Spatial wavelength regimes of roughness, nanotopography, warp and

bow of silicon wafers [6].

The research and development work reported here for developing a tool capable of measuring NT of rough wafer surfaces can be split into four steps:

Development of a sensor system based on a white light Michelson interferometer with a field of view (FoV) of 85 x 85 mm

2 in order to measure the topography of a

300 mm wafer surface by 16 individual snapshots across the wafer.

Optimizing stitching algorithms to obtain the entire surface topography map of a 300 mm wafer, compensating height offsets of the individual measurements and residual curvature of sensor images.

Implementing a high pass filter algorithm with a minimum filter edge effect in order to extract topography information at NT related spatial wavelengths scale with an edge exclusion less than one millimeter.

Analyzing and reporting the wafer NT according to SEMI M43 [4].

These four steps are now described in detail.

A. Mapping the wafer

The topography of wafer surfaces is obtained by applying a WLI sensor with 85 x 85 mm2

FoV based on a Michelson-interferometer setup [7]. WLI is used to achieve sufficiently high resolution on rough surfaces. In combination with specialized analysis software, lateral and vertical resolutions of 150 µm and < 10 nm, respectively, are achieved.

The wafer to be analyzed rests on an x-y-scanning stage. The stage positions the wafer beneath the WLI sensor. The orientation of the x-y-coordinate systems of sensor and stage are aligned. While sensor alignment and positioning accuracy of the stage are better than the lateral resolution of the WLI, each individual measurement can be exactly located on the global map. 16 single measurements are necessary to cover the entire 300 mm wafer at 85 x 85 mm2

FoV.

In order to perform an NT analysis according to SEMI M43 [4], one map of the whole wafer is required. Hence, the 16 single measurements have to be stitched together.

B. Merging the single measurements

85 x 85 mm² FoV allows for approximately 10 mm overlap between adjoining measurements. The exact position of each single measurement is well known owing to the sufficiently precise mechanical alignment of stage and WLI sensor. Thus, a compensation for x and y displacement and a rotation is not necessary. However, the height data of adjoining measurements must fit to avoid stitching artifacts which may affect NT analysis. An appropriate stitching algorithm has to compensate for basically three types of height deviations: z-offsets, linearly tilted measurements, and nonlinear deformations of single measurements.

The root cause of z-offsets and tilted measurements are offsets between the z-axes of WLI and stage. These lead to inaccuracies in the µm range, whereas amplitudes of NT features to be quantified are in the nm range. Imperfections in the optical system and nonlinearities of z-scanning of the WLI result in nonlinear deformations of the height data in the nm range.

Common stitching algorithms compensate for a rotation of single measurements but not for a deformation of each single measurement. The optimized stitching algorithm developed in this work is based on analyzing the overlap region between adjoining measurements:

Calculation of the deviation in the overlap region of two adjoining measurements.

A three dimensional polynomial of degree up to 3 is fitted to the deviation describing the height mismatch of adjoining measurements.

This function is subtracted from the height date of the measurement to be stitched. Hence the height data of the measurement to be stitched is modified to optimally fit to adjoining measurements.

The height data along the overlapping area is calculated by differently weighted height data of adjoining measurement areas in order to guarantee a smooth transition between both areas. This is done in order to suppress minor residual deviations in the overlap region.

Stitching starts at the wafer center and continues in circular order.

By applying this polynomial deformation to the individual maps, the global wafer geometry is distorted, however, at

244 ASMC 2013

spatial wavelengths much higher than the upper boundary for NT, cf. Fig. 2. This is why determining bow and warp using this stitching approach is not feasible. Subsequent high pass filtering required for the NT analysis though removes such long wavelength contributions and NT results are not significantly affected.

An entire topography map after applying the stitching algorithm prior to high-pass filtering is displayed in Fig. 3. Mainly wafer bow and warp superimposed to the geometry of the wafer stage as well as to distortions caused by the stitching algorithm contribute to the image.

C. High pass filtering

Prior to NT evaluation the complete wafer map has to be high-pass filtered. This step removes global shape contributions. According to SEMI M78 [8] this is done by applying a Double Gaussian (DG) high-pass filter at filter cut-off wavelength of 20 mm, because it suppresses wafer bow and warp related signal more effectively than Single-Gaussian (SG) high-pass filtering at same cut-off wavelength. Nevertheless, in the semiconductor industry also SG with cut-off wavelength reduced to, e. g., 20/√2 mm is used for NT filtering. At such reduced cut-off longer spatial wavelength topography contributions like bow and warp can be suppressed more effectively than with SG at 20 mm cut-off, Fig. 5. A previous investigation on NT filtering by Riedel et al. [9] revealed that various alternative high-pass filtering options are available when applying EoL NT measurement tools.

Height (µm)

Fig. 3. Full 300 mm wafer topography map after applying stitching

algorithm. Such a surface topography image comprises wafer bow and warp,

stage geometry and distortions induced by the stitching algorithm.

Data treatment of the wafer surface near the edge requires special attention. For processing the near edge regions, a radial extrapolation of the measurement data beyond the physical wafer edge is implemented. Known height data within the wafer are used to fill the area outside the wafer. The approach to calculate the height data of a pixel outside the known wafer area as illustrated in Fig. 4 and might be summarized as reflection and inversion. An extrapolated pixel is calculated using the height data at the wafer edge, ze, and the height data zm of the point at the same distance to the wafer edge as the pixel to be filled along a line through the wafer center. With these two data points and (1), the height data zi of the pixel to be extrapolated is calculated:

Fig. 4. Principle of radial surface extrapolation. Extended matrix is filled by

extrapolation of measured height data of the wafer.

mei zzz 2

This approach allows for filtered wafer maps covering the full wafer without near edge artifacts known from, e. g., shrinking filter approaches or from missing data points [9][10].

The SG high-pass filter is defined as

2

1)(

c

x

c

exf

with

2ln

c denotes the cut-off wavelength of the high-pass filter

[9].

According to SEMI M78, the DG high-pass filter response is defined in the frequency domain [8]:

22

456.2228.1

21

cc

eeG

This filter response is obtained by executing the SG filter twice [9]. The normalized transmission characteristic of both the SG high-pass filter for 20 mm and 20/√2 mm cut-off wavelength, respectively, and the DG high-pass filter for cut-off wavelength of 20 mm are plotted in Fig. 5.

Wafer

Extended matrix

Measured height data z

m

Extrapolated height data z

i

Height data at wafer edge z

e

ze

zm

Wafer radius

Distance to mirror wafer edge

Wafer center

Height

zi

245 ASMC 2013

Fig. 5. Comparison of transmission characteristics of a SG and a DG high

pass filter with 20 mm cut-off wave length (λc) and a SG high pass filter with 20/√2 mm cut-off wave length.

Both filters – the SG high-pass and the DG high-pass – are implemented in the equipment software and can be applied at arbitrarily selected cut-off wavelength to adjust the filter technique to the desired analysis. Furthermore, with these flexible filtering options a fair comparison to results obtained with other measurement equipment is possible.

Fig. 6. Threshold @ % area curves of ground wafers according to SEMI M43

[4]. Diameters of the circular analysis areas are 2 mm and 10 mm. Results of a wafer without dimple (Fig. 7) and with dimple (Fig. 10) are shown in top and

bottom plot, respectively.

D. Nanotopography analysis

The SEMI standard M43 [4] specifies an approach for analyzing and reporting NT. The first step after NT filtering is the determination of peak-to-valley (PV) height variation among the pixels included in a predefined analysis area. The value is assigned to the center of the analysis area. This calculation is performed while floating the analysis area pixel by pixel across the whole area of the wafer. The shape of an analysis area can be circular or square, and the size is typically between 2 x 2 mm

2 and 10 x 10 mm

2 (in case of square

analysis area). In the next step, the threshold curve is generated by plotting % area against threshold. % area is calculated from

the number of analysis areas for which the PV value exceeds the threshold value. % area data are provided as percentage of whole wafer area. Two sample threshold curves are displayed in Fig. 6. In a last step, two parameters are calculated from the threshold curves: % area at a specific threshold, and threshold at a specific % area. This approach is also known as threshold height analysis (THA).

III. DISCUSSION OF THE RESULTS ACHIEVED

A. Comparison of linear and polynomial stitching algorithm

Polynomial stitching does not that significantly deteriorate NT due to residual distortions rather it reduces more effectively stitching artifacts. This has been confirmed by applying either linear stitching or polynomial stitching of degree 3 on the same WLI height measurements. Both wafer maps were SG high-pass filtered at cut-off wavelength 20 mm. Fig. 7 clearly shows differences in residual stitching artifacts while the real wafer NT is maintained.

(a)

Height (nm)

(b)

Fig. 7. Two NT height maps of the same 300 mm wafer as-ground.

Individual images were stitched using a linear fit (a) and a polynomial fit of

degree 3 (b), respectively. Both topography maps were SG high pass filtered

at cut-off wavelength 20 mm. It is evident that polynomial fitting does not

affect the NT signal but leads to considerably less significant stitching artifacts.

B. Adjustable filter characteristics

The filter response of the DG high-pass filter is steeper than the filter response of the SG high pass filter, Fig. 5. Consequently, different wafer surface characteristics can be analyzed by applying either filter, Fig. 8.

In order to emulate an EoL NT measurement according to common industry practice [9], a SG high pass filter with

246 ASMC 2013

14.1 mm cut-off wavelength was applied to a wafer map after polynomial stitching, Fig. 9 (a). The center region of the SG filtered map exhibits NT features qualitatively resembling the wafer’s NT map EoL, Fig. 9 (b).

(a) Height (nm)

(b) Height (nm)

Fig. 8. Two NT height maps of the same 300 mm wafer as-ground.

Individual images were stitched using a polynomial fit of degree 3. (a) NT

map after SG high-pass filtering; (b) NT map after DG high-pass filtering.

Both at filter cut-off wavelength 20 mm. More details of high-spatial-

frequency surface structures are visible when applying DG high pass filtering.

C. Minimized edge exclusion size

The filter algorithms with minimized filter edge effect also allow for analyzing NT features near the wafer edge. Two NT features can be seen in the NT map of a 300 mm wafer as-ground at edge exclusion less than 1 mm, Fig. 10, – a dimple and a circumferential feature close to the wafer edge.

D. Repeatability of nanotopography analysis

A complete gauge repeatability and reproducibility (GRR) study has been performed to determine the NT measurement capability of the measurement system. A cassette comprising 25 wafers with different surface qualities (from as-ground to polished) was measured on three days two times per day with raw data saving. Since the measurement repeatability for all surface types is comparable, the measurement of this limited number per wafer type gives significant results.

GRR figures were determined by reprocessing raw data with appropriate NT evaluation recipes. This allows for comparison of different filter and NT analysis settings. As an example, we applied DG high-pass filtering with 20 mm cut-off and determined THA values for 50 % area. Variability data for the 10 x 10 mm2

window size are shown in Fig. 11,

resulting in = 0.32 nm. GRRs for smaller window sizes are even lower [11].

(a) Height (nm)

(b)

Fig. 9. NT map of a 300 mm wafer as-ground after polynomial stitching and

SG filtering at cut-off wavelength 20/√2 mm (a). For comparison the wafer’s

NT map end-of-line using KLA-Tencor’s WaferSight 2 (b). NT features

obtained end-of-line, e. g., in the wafer center can already be sensed on the

wafer as-ground.

Height (nm)

Fig. 10. NT wafer map of a ground 300 mm wafer with a dimple and NT

features close to the wafer edge. Edge exclusion is < 1 mm.

20 nm

-20 nm

247 ASMC 2013

Repeatability figures for smooth and rough surfaces are in the same order of magnitude. Different products separate well regarding characteristic THA values.

Fig. 11. Variability chart of THA at 50 % area for 10 x 10 mm² analysis

window using DG high pass filtering.

IV. CONCLUSION

Topography maps of 300 mm silicon wafer surfaces after mechanical process steps, i. e., with rough surface have been successfully recorded using white light interferometry. Applying optimized stitching algorithms combined with data extrapolation near the wafer edge allows for generating a complete 300 mm wafer topography map from the 16 individual small maps recorded. It is possible to obtain NT height maps upon high-pass filtering exhibiting only minor impact from residual measurement artifacts. The implementation of NT analysis according to SEMI M43 enables characterization of NT early in wafer manufacturing

stages and identically to established EoL measurement systems. This facilitates continuous improvement of relevant wafering, grinding or lapping processes.

ACKNOWLEDGMENT

This R&D work has been performed within the framework of the European project SEAL (Contract Number: INFSO-ICT-257379). The consortium acknowledges the European Commission for the support. Additionally, the authors acknowledge Dr. Andreas Nutsch for his preliminary research in surface extrapolation combined with NT filtering.

REFERENCES

[1] ITRS – “International Technology Roadmap for Semiconductors”, 2011

[2] Ravi, K. V.; Future Fab Report, Volume 7, Page 207, 1999

[3] Paik, Ungyu; Park, Jea-Gun; “Nanoparticle Engineering for Chemical-Mechanical Planarization”; CRC Press; Boca Raton; 2009

[4] SEMI; “Guide for reporting wafer nanotopography”; SEMI Standard M43-1109; 2009

[5] Riedel, Frank; Gerber, H.-A.; Wagner, P.; „Impact of filtering on nanotopography measurement of 300 mm silicon wafers“;Materials Science in Semiconductor Processing, Volume 5, Issues 4–5, August–October 2002, Pages 465–472

[6] Nutsch, A.; Bucourt, S.; Grandin, T.; Lazareva, I.; Pfitzner L.; “Wave Front Sensor for Highly Accurate Characterization of Flatness on Wafer Surfaces”; AIP Conference Proceedings Vol. 1173; Melville, NY 2009

[7] Hecht, Eugene; “Optics”; Addison-Wesley; 2012

[8] SEMI; “Guide for determining nanotopography of unpatterned silicon wafers for the 130 nm to 22 nm generations in high volume manufacturing”; SEMI Standard M78-1110; 2010

[9] Riedel, F.; Gerber, H.-A.; Wagner, P.; “Double Gaussian Filters”; International Advanced Wafer Geometry Task Force SEMICON Europa, München 2004

[10] Muralikrishnan, Bala; Raja, Jay; “Computational Surface and Roundness Metrology”; Springer; London; 2009

[11] Tröger et al., “Automated Tool for Measuring Nanotopography of 300 mm Wafers at Early Stages of Wafer Production” Proceedings of the 11th ISMTII; Aachen, Germany; 2013; to be published

Thre

sho

ldB

yPe

rce

nt [

nm

] 10x10

mm

2

0

50

100

150

200

18 19 20 21 22 23 24 25 10 11 12 13 14 15 16 17 5 6 7 8 9 1 2 3 4

DDG FG POL PPG

Slot within Type

Std

De

v

0.0

0.1

0.2

0.3

0.4

0.5

18 19 20 21 22 23 24 25 10 11 12 13 14 15 16 17 5 6 7 8 9 1 2 3 4

DDG FG POL PPG

Slot within Type

Data was unbalanced, so a REML Fit was performed.

Response ThresholdByPercent [nm] 10x10 mm2

Type

Slot[Type]

Within

Total

Component

675.2849

978.9186

0.1020

1654.3054

Var Component

40.8

59.2

0.0062

100.0

% of Total 20 40 60 80

25.986

31.288

0.319

40.673

Sqrt(Var Comp)

Variance Components

Variability Chart for ThresholdByPercent [nm] 10x10 mm2

Variability Gage

Process 1 Process 2 Process 3 Process 4

Process 1 Process 2 Process 3 Process 4

248 ASMC 2013