Method for computing partially coherent aerial imagery
A method for simulating aerial images is provided where the integrand of a transmission cross-coefficient (TCC) integral is formed from defocused paraxial pupil transfer functions, and contour integration is performed over the boundary of the intersection of the offset pupil functions and the source function. Preferably, the paraxial pupil functions are approximated by a second order Taylor series expansion. The integrand is preferably parameterized in terms of the angles subtending the arcs of the boundary of the integration region, and the integrand is further approximated by an expansion of analytically integrable terms having an error term that substantially monotically decreases as the number of expansion terms increases. Additional factors such as aberrations and amplitude variations can be included by using functions that are simply multipied with the defocused paraxial pupil functions in the integrand. The integrands provide fast computations of TCC integrals that are accurate to within a desired tolerance.
Latest IBM Patents:
The present invention relates in general to manufacturing processes that require lithography and, in particular, to methods of designing photomasks and optimizing lithographic and etch processes used in microelectronics manufacturing.
BACKGROUND INFORMATIONDuring microelectronics manufacturing, a semiconductor wafer is processed through a series of tools that perform lithographic processing, followed by etch processing, to form features and devices in the substrate of the wafer. Such processing has a broad range of industrial applications, including the manufacture of semiconductors, flat-panel displays, micromachines, and disk heads.
The lithographic process allows for a mask or reticle pattern to be transferred via spatially modulated light (the aerial image) to a photoresist (hereinafter, also referred to interchangeably as resist) film on a substrate. Those segments of the absorbed aerial image, whose energy (so-called actinic energy) exceeds a threshold energy of chemical bonds in the photoactive component (PAC) of the photoresist material, create a latent image in the resist. In some resist systems the latent image is formed directly by the PAC; in others (so-called acid catalyzed photoresists), the photo-chemical interaction first generates acids which react with other photoresist components during a post-exposure bake to form the latent image. In either case, the latent image marks the volume of resist material that either is removed during the development process (in the case of positive photoresist) or remains after development (in the case of negative photoresist) to create a three-dimensional pattern in the resist film. In subsequent etch processing, the resulting resist film pattern is used to transfer the patterned openings in the resist to form an etched pattern in the underlying substrate.
Diffraction, interference and processing effects that occur during the transfer of the image pattern causes the image or pattern formed at the substrate to deviate from the desired (i.e. designed) dimensions and shapes. These deviations depend on the interaction of the pattern configurations with the process conditions, and can affect the yield and performance of the resulting microelectronic devices. Various techniques have been used to compensate for and correct for these deviations. Such techniques include known as optical proximity correction (OPC), for example, by biasing selected mask features to compensate for deviations. Other techniques include using sub-resolution assist features (SRAFs), also known as scattering bars or intensity leveling bars, to improve the uniformity of grating characteristics of the mask, and thereby improve optimization of lithographic process conditions for the mask. Phase shifted mask technology (PSM) has also been used to improve the contrast of image features by destructive interference, and thus improve resolution. These and other various techniques for improving the lithographic process are generally referred to as resolution enhancement techniques (RETs).
Prediction of the partially coherent images resulting from illumination in a modern lithographic scanner is of paramount importance as looming technology nodes stress the use of Resolution Enhancement Technologies (RETs), such as SubResolution Assist Features (SRAFs) and Alternating Phase-Shift Masks (AltPSM). Without fast and accurate simulation, it would be impossible to employ an strong RET solution in a practical setting. The reason for this is because simulations have made the transition from learning/research tool to a major ingredient in the design stage. However, in practice, only a portion of a mask pattern can be simulated at a time, to allow for reasonable computation times.
The ability to accurately predict the resulting aerial image, latent image and/or etched pattern due to the lithographic and etch processes is crucial for ensuring sufficient manufacturing yields and reducing costs of manufacturing. The aerial image of a mask pattern is the distribution of intensity at the plane of the wafer or resist surface. The accurate simulation of the aerial image is key in the design of photomasks, for example, by model-based optical proximity correction (model-based OPC). In model-based OPC, for given lithographic process conditions (e.g. illumination source parameters, numerical aperture (NA) and partial coherence (σ0) of the lithography tool, specified exposure dose and defocus, etc.), and an initial mask design, the resulting image is simulated. In the model-based OPC process, the simulated image is compared to the target (desired) design, and deviations from the desired image are determined. The mask design is iteratively modified until the simulated image agrees with the target design within an acceptable tolerance. Thus the accuracy of the simulated image is crucial in obtaining viable mask designs, and the speed of such calculations impacts the cost of designing the masks.
Simulated images can also be used to improve understanding the interaction of, and for optimizing the lithographic process conditions to maximize resolution and improve yields. For example, for a particular design pattern, a decision to use one type of resolution enhancement technique over another, for example, whether to use altPSM or SRAFs involves an understanding what the best range of process windows will be for a range of altPSM or SRAF processes. A wide range of process conditions must be simulated and corresponding images simulated for each process condition, at a required accuracy. The simulation of a single aerial image using conventional methods, with sufficient accuracy (e.g. within 1%), typically takes hours or even days.
To get an idea of what sort of accuracy is required in simulation, consider that the aerial image simulator can be viewed as a metrology tool. There is an inherent uncertainty in metrology, for example in using metrology techniques such as (scanning electron microscope) SEM, in which a target to be measured is bombarded with electrons, which in turn produce a signal, indicating, for example, line widths in the target. However, there is an inherent uncertainty in these measurements, for example, caused by charge damage to features on the target. The precision to tolerance ratio (P to T ratio) is the ratio of the precision, or accuracy, of the metrology tool to the tolerance specification for the device being measured. The specification for line widths (CD) may be, for example, 90 nm, within 3 sigma. The demand on the line width distribution is such that CD has a mean value of 90 nm, wherein the 3 sigma variation is within 5% of 90 nm (e.g. ±4.5 nm tolerance). A typical spec for P-T ratio is to measure the line within a small fraction of 4.5 nm (e.g. 20% or 0.90 nm or 9.0 Angstroms). A simulation tool should provide numerical accuracy in a similar vein as the metrology tools specifications, e.g. within 0.90 nm. One conventional method of simulating aerial images is to use a gridding algorithm, as in the prior art outlined above, but in order to obtain the precisions required, to obtain the required precision, the smaller grid sizes result in a large number of gridding intervals, which in turn result in impractical computation times. Such gridding methods cannot be used to simulate large portions of a mask in a practical amount of time.
Partially coherent imagery is simulated using what is now called the Hopkins Model (see, H. H. Hopkins, “On the diffraction theory of optical images,” Proc. of the Royal Society of London, Vol. A 217, pp. 408-432 (1953)). The Hopkins model for simulating the aerial image is a method to compute aerial images using the frequency space information of the optical system. In doing this, the calculation can be split into two independent steps. Here, the intensity, defined as the average of the square magnitudes of the electric fields that emanate from each point in the illumination source, is expressed as a quadratic form in the mask spectrum—which is the Fourier transform (in the spatial frequency domain) of the mask transmittance—multiplied by a matrix of complex numbers called Transmission Cross-Coefficients (TCCs), which takes into account the source (illumination shape) and pupil (aberrations, defocus, vector, obliquity) information. The TCCs are autocorrelations of the transfer function of the pupil (henceforth called the “pupil function”) weighted by the spatial Fourier transform of the illumination source (referred to hereinafter as the “source function”). The pupil is the image of the limiting stop, or aperture, in an optical system, but for the present purposes, the entrance/exit pupils represent the input/output planes for the optical system. These autocorrelations are, in general, double integrals over a very complicated region defined by the intersection of the pair of frequency-offset pupils and the source. Because of the complexity of the shape of this region, computation of these TCCs is potentially expensive.
The second step is the computation of the mask spectrum (or, Fourier transform of the mask, referred to hereinafter as the “mask function”). This can be computed analytically for most, if not all, one-dimensional masks in lithography. An analytical computation of the Fourier transform for all two-dimensional shapes can be obtained, for example, by the method disclosed in U.S. patent application Ser. No. 10/353,900, which is commonly assigned to the Assignee of the present application. This analytical calculation, while exact, tends to be expensive compared with the decomposition techniques in the SOCS method, because each edge required a trigonometric function evaluation. Nevertheless, such a calculation leaves little doubt as to the overall accuracy of the computation.
A major simplification is typically made by assuming that the portion of the mask to be simulated is periodic. This is because the spectrum of periodic gratings is nonzero at a discrete number of frequencies, meaning that the frequencies that have nonzero TCCs lie on a regular, discrete grid in spatial frequency space. This periodic assumption is used by all current lithography simulators, since rigorous image computation over an entire (26 MM)2 field is impractical. This, of course, is an approximation in itself, as real circuitry to be manufactured will not necessarily possess this periodic property. Nevertheless, a single local set of shapes in a region of interest, or cell, of any size can be simulated with great accuracy from the periodic assumption. The reason for this is that optical diffraction phenomena have a range of Cλ/(NAσ0), where NA is the numerical aperture, λ is the wavelength, σ0 is the partial coherence (or pupil-fill), and C is a constant equal to about at most 10 (for example, see Wong et al., U.S. Pat. No. 6,223,139). Therefore, there is a finite range of influence, beyond which features have no impact on the desired pattern. Thus, if ones assumes periodicity (or any other boundary condition) for a unit cell of the nominal pattern size plus this influence range, then the simulated image inside the unit cell can be considered exact for all practical purposes.
With the mask periodic and the spectrum discrete, the set of TCCs to be evaluated is therefore discrete and finite in number. In general, they fill a four-dimensional matrix, and it can be shown that the number of TCCs needed for computation varies as the square of the area of the unit cell—in other words, as the fourth power of the length of a nearly square cell. Therefore, even with this simplification, the number of difficult, double integrals that are needed to accurately define the image becomes unmanageable for even moderately large (<10 μm2 area) cells.
In all current lithography simulation software, the TCCs are currently computed by gridding the source and pupil for a numerical integration. This gridding can be sophisticated: for example, in the software SPLAT (see K. K. H. Toh and A. Neureuther, “Identifying and Monitoring Effects of Lens Aberrations in Projection Printing,” SPIE, Vol. 772, pp. 202-209, Optical Microlithography VI: 4-5 Mar. 1987, Santa Clara, Calif.), a fixed grid is used in one direction, while an adaptive grid (it refines until a certain tolerance is reached) is used in another. In order to achieve this, the integration limits of the integration region must be computed. This allows for greater accuracy in computing aberrated images, for example. Unfortunately, because of the adaptive stepping in the integration, the algorithm runs as long as it needs in order to achieve a certain accuracy. This can take a long time, especially with pupils that have large phase variations, such as in large defocus and/or aberrations.
The problem of providing fast simulation has already been attacked though the use of the so-called “Sum of Coherent Systems” (SOCS) method detailed in N. Cobb and A. Zakhor, “Fast low-complexity mask design”, Proc. SPIE, Vol. 3334, pp. 313-327 (1995). This method expresses the intensity as a sum over the squares of convolutions of the mask transmittance function with coherent point spread functions, or kernels; these kernels are inverse Fourier transforms of the eigenfunctions of the TCC matrix. By expressing the mask in terms of a discrete set of elements, these convolutions can be precomputed and stored in a database. This allows for very rapid image computation that is crucial for model-based OPC (MBOPC). There are caveats with this SOCS methodology, however. First of all, the sum used must be finite (and is typically between 8 and 12 terms), whereas the exact expression involves an infinite number of terms; typical accuracy estimates from using the finite series for the aerial image intensity are between 80 and 90%, with a relatively slow rate of convergence. While this may suffice for past generations with a larger error tolerance (that is, the larger allowed linewidth variation could accept larger simulation errors, as noted above), this will not serve the future generations, where the error tolerances are rapidly vanishing.
Second, these kernels, are eigenfunctions of a matrix operator defined by the TCCs. In order to compute these eigenfunctions, the TCCs must be computed. Since the TCCs are independent of the mask function, these TCCs will be computed only once, whereas the intensity is computed thousands of times. This is true if only the mask features (and not the parameters of the projection system) are to be varied during a MBOPC session. It turns out, however, that the task of calibration of a model to data has gotten so difficult that some physical parameters that are known to be in error—such as the partial coherence or focus—are varied as well. The fitting of these physical parameters demands recomputation of the TCCs, so the ability to compute them very quickly is crucial here.
There is a method of computing the TCC matrix rapidly due to Liebchen (U.S. patent application Pub. No. US 2002/0062206 A1). Here, the TCC matrix itself is approximated as a bilinear form ηTAη, where η is a vector of orthonormal Hermite-Gaussian terms, and A is a Hermitian matrix. The benefit of doing this is that the size of the TCC matrix can be reduced substantially, making changes with respect to optical parameters such as the numerical aperture or defocus very fast. While this certainly has resulted in a significant increase in computational speed, it is not obvious what accuracy of the image intensity that this method produces, as not only the pupil and source are evaluated on a grid and numerical integration is necessary on at least one dimension, but the expression of the TCC matrix in this orthogonal basis is only exact when an infinite number of terms are used.
Accuracy is crucial, and although few have ever questioned the half-century-old theory of optical image formation via partially coherent Kohler illumination (despite the scant experimental verifications of the various parametric dependencies, especially those relating to “obliquity factors” which are generally considered to have small impact), numerical accuracy in proprietary software has always been a concern. Many benchmarking projects have been performed, but most have been simple numerical checks between software. Speed is also a major issue, because many thousands of simulations must take place in order to perform a single optimization. Historically, there has always been a tradeoff between speed and accuracy, in that speed requires one to sample less source integration points, or less eigenfunction kernels in the “fast” algorithms used in Model-Based OPC software. On the other hand, analytical solutions, if they exist, solve the speed/accuracy issue, since all the computation time comes from the numerous double integrations required for the image computation. If one were to replace the double integrations (thousands of function evaluations) with a single function evaluation, then the speed of the algorithm would increase many times over, and the accuracy would be to machine precision. The trouble is, of course, finding such a solution.
Rigorous analytical methods of aerial images would be preferred to the numerical gridding methods commonly in use today, but are typically impractical because the resulting integrals cannot directly be evaluated in many cases. However, such analytical methods have been available for computing aerial images in special cases. The simplest case of this was illustrated by Kintner (Kintner, Eric C., “Method for the calculation of partially coherent imagery,” Applied Optics, Vol. 17, No. 17, pp. 2747-2757, Sep. 1, 1978) for simulation of in-focus, partially coherent images of one-dimensional (1D) gratings. By considering all of the possible geometrical configurations of the TCCs in 1D (i.e., along the x-axis), Kintner was able to compute the exact value of the TCCs due to the fact that these values are equal to the area of a region bounded by 3 circles, all of whose centers are collinear. While this was a useful first step, the important cases of more general patterns seen in lithography were untreated, as well as the ability to compute defocus effects.
Going one step further, Subramanian (Subramanian, S., “Rapid calculation of defocused partially coherent images,” Applied Optics, Vol. 20, No. 10, pp. 1854-1857, May 15, 1981) considered the simulation of defocused images of the one-dimensional (1D) gratings. Here, Kintner's work was built upon by finding the integration limits within each geometrical configuration. In the paraxial (i.e., small numerical aperture, with NA≦0.4) approximation, the defocus term could be integrated analytically in one dimension, while numerical integration was used for the other. While this extended the art significantly, it is still limited to one-dimensional (1D) gratings and small values of the numerical aperture.
The method outlined by Liebchen takes the approach of decomposing these TCCs (i.e. representing the TCC matrix as a bilinear Hermite-Gaussian expansion). While Liebchen's method does potentially reduce the number of TCCs needed, it also introduces a gridding; the combination of the decomposition and gridding methodologies introduce potential inaccuracies. Further, it is unclear how large values of defocus or aberrations are treated with any accuracy, as all treatments are Taylor expansions of the phases in the frequency variable. For small aberrations, this is acceptable, but such expansions quickly lose accuracy in the face of even a moderate defocus
Therefore, there is a need to compute the TCCs without recourse to grids or orthogonal basis decompositions so that accuracy is maintained, or at least monitored, while the speed needed to computed moderately large patterns is satisfied.
SUMMARY OF THE INVENTIONIt is an objective of the present invention to provide a method for obtaining solely analytical expressions for all of the TCC integrals. A further objective of the present invention is to provide an approximate analytical representation for the TCCs that is accurate to within a desired small error, for example, within the precision of a given computer, which is typically on the order of about 106 for typical single precision calculations in current machines. It is yet a further objective of the present invention to provide a method for computing aerial images very rapidly compared to conventional numerical integration techniques, and to within a desired accuracy, for example very close to machine precision. That is, a primary objective of the present invention is to eliminate the tradeoff between speed and accuracy in the simulation of aerial images, achieving both simultaneously.
In accordance with the present invention, an analytical solution for the TCC integral has been derived. For the in-focus, aberration-free case, the analytical solution is exact, using only a single arctangent and at most 4 square roots per transmission cross-coefficient (TCC). This solution has been extended to defocus and small aberrations, and has been implemented in computer code.
The invention takes advantage of two facts concerning the TCCs: 1) The integration region of the TCCs is bounded by circular arcs for circular-shaped sources and pupils; and 2) the integrand of the TCC expression can be represented by a simple, linear phase for the case of a defocused pupil in the paraxial approximation. These facts, combined with the application of Stokes' Theorem (a basic theorem of integral calculus) results in a reduction of the double integral to a single integral for any arbitrary 2D pattern.
In a further aspect of the present invention, the TCC integration regions are characterized all in terms of a finite set of all the possible geometrical configurations, based on a rotational alignment of the spatial coordinate axis with the axis of symmetry of the integration region. These regions change shape as the spatial frequencies vary, as do the integration limits over which the integrand is to be evaluated. It turns out that there are 18 distinct geometrical configurations when the double integral is used. Furthermore, when Stokes' Theorem is applied, only the boundary of these geometric configurations are considered, and it turns out that the number of distinct configurations is reduced to 9. Such a reduction simplifies the program logic and speeds up calculations.
In a further aspect of the present invention, all of the integrals over the various arcs that comprise the boundaries of the finite geometrical regions is reduced to the same integral form. Since this particular integral does not have an exact representation in terms of purely analytical functions, the present invention provides an approximation for the integrand that differs from the original function in absolute value by a desired small number, for example, on the order of single precision over all possible values of the integration limits. In accordance with the present invention, this new function is chosen such that the error substantially monotonically decreases as the number of terms used in the approximation increases. Furthermore, the integral of the terms of the approximate function is representable in terms of analytical functions that can be computed with a few arithmetic steps. Therefore, the present invention permits computation of a TCC that is accurate to within a desired precision and can be evaluated rapidly using a finite number of arithmetic operations.
In yet another aspect of the present invention, a computation of the partially coherent, defocused imagery of an arbitrary pattern in the paraxial approximation is provided in which the image intensity is represented purely in terms of analytical functions that is accurate to within a desired precision.
The present invention further provides an expression for nonparaxial defocus imagery, in which an analytical expression for the nonparaxial defocus image is obtained by resealing the coordinates of the paraxial defocused field. Again, this can be achieved to within a desired precision. Therefore, the inventive method can be extended to higher values of the NA with no additional computational effort and only a small, but known, increase in the overall error.
In another aspect of the present invention, aberrations are included by providing an analytical expression for an aberration pupil function by computing a Taylor expansion of the phase difference between the nonparaxial and paraxial phases. Each term in this expansion corresponds to a higher-order correction that is represented as a derivative of the paraxial result. With some additional computation depending on the precision desired, a pupil function including aberrations can be obtained by multiplying the inventive analytical paraxial defocus pupil function by the analytical aberration pupil function obtained using the Taylor expansion. This method is extendable to other phase errors that are described by the aberrations that may be present in a stepper. These aberrations are typically high-order polynomials in the frequency coordinate, and the coefficients are typically small—on the order of a few hundredths of a wavelength. Therefore, Taylor expansion is a natural way to extend this analytic methodology in the case of small aberrations.
In yet another aspect of the present invention, the effects of amplitude variation across the pupil can be included. Such amplitude variations may become significant at higher values of the NA. These amplitude variations, or apodizations, come from energy conservation and vector effects that are not significant in the paraxial approximation. In accordance with the present invention, these apodization effects are also represented by approximated successively more accurately via an expansions, and the resulting amplitude effects are provided simply as multiplicative factors to the inventive paraxial defocused pupil function.
The foregoing has outlined rather broadly the features and technical advantages of the present invention in order that the detailed description of the invention that follows may be better understood. Additional features and advantages of the invention will be described hereinafter which form the subject of the claims of the invention.
BRIEF DESCRIPTION OF THE DRAWINGSFor a more complete understanding of the present invention, and the advantages thereof, reference is now made to the following descriptions taken in conjunction with the accompanying drawings, in which:
In the following description, numerous specific details may be set forth to provide a thorough understanding of the present invention. However, it will be obvious to those skilled in the art that the present invention may be practiced without such specific details. In other instances, well-known features may have been shown in block diagram form in order not to obscure the present invention in unnecessary detail.
Refer now to the drawings wherein depicted elements are not necessarily shown to scale and wherein like or similar elements are designated by the same reference numeral through the several views.
In accordance with the present invention, a fast, accurate analytical method for computing estimated aerial images, computed to within a desired tolerance. In accordance with the present invention, error estimates for the image computations are provided. The method of the present invention can incorporate aberrations.
The source is assumed to be circular, preferably having a step function intensity. Alternatively, more general intensity distributions may be used, such as a Gaussian distribution. In addition, the pupil is assumed to be circular. The lens aberrations are assumed to be small (e.g. tens of milliwaves), or zero.
The aerial image intensity is determined according to the Hopkins model for an image projected by a given mask that is illuminated in a Kohler illumination (projection) system using a given set of illumination conditions. The illumination conditions are characterized by a source shape (S), a pupil function (P) and the mask pattern (M) expressed in spatial frequency coordinates. The Hopkins model assumes linear, shift-invariance of the optical system. This means two things: 1) (linearity) each distinct temporal frequency is unchanged by the system; and 2) (shift-invariance) each off-axis plane wave input into the system does not change the shape of the spectrum, but merely shifts it by some amount that depends on the angle of propagation.
The following discussion is presented in order to provide a thorough understanding of the present invention, with specific examples presented as preferred embodiments.
1. General Theory
Consider a modern microlithographic imaging system 200 (generally known as a Köhler illumination system) as illustrated in
The coherent electric field E at the point x (wherein the bold notation x and the notation {right arrow over (x)} both refer equivalently to a vector quantity, hereinafter) on the wafer or image plane 215, due to the on-axis source point only, is given by the following expression:
where M represents the spatial Fourier transform of the mask transmittance function, or frequency response of the mask 207 (i.e., the mask function), P represents the pupil function, or frequency response of the objective optics 211, NA is the numerical aperture of the objective lens at the exit pupil 217, and λ is the illumination wavelength. R2 represents all of two-dimensional space. The vector quantity u represents a scaled frequency coordinate in which the pupil function P is zero when |u|>1. For an off-axis incident plane wave, the expression for the electric field due to a single source at the off-axis scaled source coordinate σ becomes the following:
The vector quantity NAσ represents the projection of a point on the unit sphere onto the mask plane 215, and in a Kohler illumination system, NA corresponds to the numerical aperture, and a represents a point in the illuminator 201, where the maximum magnitude of the vector σ is equal to the ratio of the NA of the condenser lens to the NA of the projector lens. This ratio is called the partial coherence factor σ0 of the projection system 200.
To simulate the effects of a partially coherent Kohler illumination system on the electric field intensity, the off-axis intensity is averaged over all of the plane waves generated by the illuminator. (Note that the use of partially coherent illumination represents both reality, as no illumination system can be perfectly spatially coherent, and desireability, as the shape of the illumination can be varied to achieve, say, optimum process latitude.) If it is assumed that the intensity is proportional to the square magnitude of the electric field distribution, then the intensity at a point x on the image plane 215 is modeled to take the form
where S is the source distribution for the illumination system. That is, as mentioned above, the intensity at the image plane is an average of the intensities emanating from the fields generated by the various point sources that comprise the illuminator. Substituting Eq. (2) into Eq. (3), the following expression for electric field intensity results:
where the transmission cross-coefficient T (also referred to as the TCC or TCC integral) is given by
In the integrand in Eq. (5), the two pupils functions P(u′+σ), P(u″+σ) will henceforth be referred to as the “offset pupils”. The pupil function and source distribution function has finite support; that is, |u|>1P(u)=0 and |u|>σmaxz,1 S(u)=0, due to the finite extent of the source and pupil apertures. This reduces the infinite integration interval specified in Eq. (5) into a finite interval, as shown in
2. Periodic Objects
As discussed above, in the Hopkins model, a simplifying assumption often made is that the mask 207, or a portion thereof, such as a local set of shapes, referred to as a cell, is considered to be a periodic object. The preferred embodiment of the present invention also makes use of this periodic assumption. Referring to
where the cmn are the mask Fourier coefficients. The indices m and n will represent the various orders, or directions, into which the mask pattern 300 will split incoming light. Of course, the entire spectrum is not used in determining the image, as the objective lens 211 filters out the higher orders. When the mask function M of Eq. (6) is substituted into the image intensity of Eq. (4), a series for the intensity results:
where the TCC integrals are evaluated at discrete lattice points due to the periodic assumption, and can be written as:
where the quantity q=pNA/λ. The discrete lattice points, where the TCC integrals Tm′n′m″n″ are evaluated, are spaced at intervals of Δux=1/qx and Δuy=1/qy. In principle, one could sum Eq. (7) directly, as the quantity Tm′n′m″n″ is bound to vanish when the indices are large enough, making the sum finite. To see how this works, we take a closer look at the definition of the TCC of Eq. (5).
The summation limits in Eq. (7) are now deduced from the bounds of the TCC integrals. Two geometrical conditions must be met simultaneously for nonzero values of the TCCs: 1) the separation between the centers of the two offset pupil functions must be less than twice their radius; and 2) both pupil functions must intersect the source function, which can be seen by reference again to
which provide finite integration limits.
Note that, with this variable change, the intensity now takes the form of a Fourier series, which is expected from the periodic assumption. The conditions in Eqs. (9) and (10), and the change of variable suggested above, lead directly to the following exact finite sum for the intensity:
where
where the function └x┘ is a floor function, representing the largest integer less than x (a real number), and
and θ is the Heaviside step function:
The quantities cm+k,n+l and c*mn are the discrete coefficients corresponding to the mask functions M,M* in the expression for intensity I(x) of Eq. (4). The number of TCCs needed can be deduced from Eqs. (11) and (14); however, it is difficult to get an intuitive feel for how this number behaves as a function of the optical parameters. Eqs. (9) and (10) describe integer lattices bounded by ellipses, and therefore the number of points bounded by the ellipses is roughly equal to the product of the ellipses areas. An approximation to the number of TCC's, T, in the limit of a large unit cell area, is
Note that this number is roughly proportional to the square of the period cell area.
3. Characterization of the Integration Region
Referring again to
-
- 1) the source 404 is completely contained within the pupil intersection 502 to form an integration region 503 as in
FIG. 4A ; - 2) the source 404 protrudes out of one side of the pupil intersection 502 to form integration region 506 as in
FIG. 4B ; - 3) the source 404 protrudes out of both sides of the pupil intersection 502 to form integration region 509 as in
FIG. 4C ; and - 4) the pupil intersection 502 is completely contained within the source 404 to form an integration region 512 as in
FIG. 4D .
- 1) the source 404 is completely contained within the pupil intersection 502 to form an integration region 503 as in
Clearly, the situation for the general two-dimensional (2D) mask is drastically more complicated. The number of distinct geometrical configurations for a 2D mask has not been published prior to the present application; the typical approach to computing the 2D TCC has been to employ the minimum grid that contains the pupils and the source. Enumerating these integration regions is an important aid in determining the proper integration limits for analytical and adaptive 2D numerical integrations. It will be shown below, however, that, taking into account all possible symmetries, there are only 18 distinct geometrical configurations to consider.
In considering the 2D case, note that the coordinate system can always be rotated such that the symmetry axis of the intersection region of the offset pupils is vertical (that is, aligned along one of the source coordinate axes). For example,
Before enumerating the different cases, the desired change of coordinates is stated. Let the tangent of the rotation angle be denoted as
Then the horizontal offsets and the common vertical offset are as follows:
Also, denote u1=max{û′,û″}, u2=min{û′,û″}, and v={circumflex over (v)}.
In the following descriptions of the integration regions of interest for the two-dimensional case, the organization is as follows. The two-dimensional geometric configurations may be categorized by analogy to the four one-dimensional cases. In each of these four categories of geometries, the pupils are moved together vertically, and each change in geometry is noted. Note that one geometric configuration can be derived from more than one logical condition. The cases are illustrated in
3.1) The Source is Completely Contained within the Pupil Intersection
This is achieved when u1−1<−σ0 and u2+1>σ0.
In this first category, the changes in geometry as the pupil intersection region 502 is moved vertically relative to the source function 404 are considered. It is assumed that |u1|≧|u2|; the complementary case results in the mirror image about the vertical. The following quantities are defined:
Then the distinct geometrical cases are determined under the following conditions:
Case 1 (FIG. 6A): |v|<v1.
Case 2 (FIG. 6B): u2<0, v2<v3 and v1<|v|<v2.
u2<0, v3<v2 and v1<|v|<v3.
u2>0, v2<v5 and v1<|v|<v2.
u2>0, v2>v5 and v1<|v|<v5.
Case 3 (FIG. 6C): u2<0, v2<v3 and v2<|v|<v3.
u2>0, v2<v5 and v2<|v|<v5.
Case 4 (FIG. 6D): u2<0, v2<v3, v4<v5, and v3<|v|<v4.
u2<0, v2<v3, v4>v5, and v3<|v|<v5.
u2<0, v2>v3, v4<v5, and v2<|v|<v4.
u2<0, v2>v3, v4>v5, and v2<|v|<v5.
Case 5 (FIG. 6E): u2<0, v4<v5, and v4<|v|<v5.
Case 6 (FIG. 6F): u2<0, v4<v5, and v5<|v|<v6.
v4>v5, and v4<|v|<v6.
Case 9 (FIG. 6I): u2>0, v2>v5 and v5<|v|<v2.
Case 10 (FIG. 6J): u2<0, v2>v3, and v3<|v|<v2.
3.2) The Source Protrudes Out of One Side of the Pupil Intersection
Recall that only the case of |u1|≧|u2| is being considered; the other case follows by symmetry. This is achieved when u1−1>−σ0 and u2+1>σ0, and the distinct geometrical cases are determined under the following conditions:
There are also many cases that revert back to a region defined in subsection 1:
3.3) The Source Protrudes Out of Both Sides of the Pupil Intersection
This is achieved when u1+1>−σ0, and u2+1<σ0, and σ02<u1u2+1, and the distinct geometrical cases are determined under the following conditions:
Case 16 (FIG. 6P): v5<v14,|v|<v5.
v5>v14, |v|<v14.
Case 17 (FIG. 6Q): v5<v14, v5<|v|<v14.
Cases that revert back to regions defined in subsections 1 and 2:
Case 5: v5>v14, v5>v13, v13<|v|<v5.
Case 6: v5<v14, v13<|v|<v6.
v5>v14, v5<v13, v13<|v|<v6.
v5>v14, v5>v13, v5<|v|<v6.
Case 13: v5>v14, v5<v13, v14<|v|<v5.
v5>v14, v5>v13, v14<|v|<v13.
Case 14: v5<v14, v14<|v|<v13.
v5>v14, v5<v13, v5<|v|<v13.
3.4) The Pupil Intersection is Completely Contained within the Source
This is achieved when u1−1>−σ0, u2+1<σ0, and σ02<u1u2+1, and the distinct geometrical cases are determnined under the following conditions:
Case 18 (FIG. 6R): |v|<v5.
Cases that revert back to regions defined in previous subsections:
Case 6: v13<|v|<v6.
Case 14: v14<|v|<v13.
Case 17: v5<|v|<v14.
4. Simplification via Contour Integration
The main result of the previous subsection was to identify all of the possible geometrical forms for the regions of integration for the TCCs, and the conditions under which each particular form would occur.
Consider the case of in-focus imaging, where the image plane is coincident with the focal plane of the projection lens. The pupil function P(u) takes the value 1 when |u|≦1 and zero when |u|>1. Then the in-focus TCCs are simply the areas of their respective regions. Given that each of the bounding contours of these regions is a circular arc, the determination of the areas exactly in terms of analytic functions is straightforward. There would be 18 separate analytical expressions to be evaluated according to the integration regions defined above, which is unwieldy to implement. However, it is possible to simply the 18 analytical expressions into a common form, as disclosed in Gordon (see Gordon, R. L., “Exact Computation of Scalar, 2D Aerial Imagery,” Proc. SPIE, Vol. 4692, pp. 517-528 (2002)).
This is accomplished by using Green's Theorem to convert the area integrals to contour integrals. According to Green's Theorem, the expression of the area A (e.g., the TCC for the in-focus case) of a closed region D can be expressed in terms of its bounding contour:
Because the closed contour consists of nothing more than a set of circular arcs, the parametrization of the above contour integral is achieved using the following parametrization: σx=u0(k)+ρk cos φ, σy=v0(k)+ρk sin φ; the final result takes the form
where the following notation applies:
-
- ρk is the radius, and (u0(k),v0(k)) is the center of the circle containing the kth arc.
- φ2(k) and φ1(k) are the angles subtended by the endpoints of the kth arc at its circle's center measured clockwise with respect to the positive horizontal axis σx. For example,
FIG. 7 illustrates a source function 404 and two offset pupil functions 401, 404 having center coordinates 801, 802 respectively. The intersection of the two pupil functions 502 is the region enclosed by the two circular arcs having endpoints A, B. The integration region 805 is defined by the intersection of the source function 404 and the two pupil functions 401, 402, and is the region enclosed by the arcs having endpoints C, D and B. In this example, there are three circular arcs: 1) the arc having endpoints C, D having radius ρ1 of the source function; 2) the arc having endpoints C, B having radius ρ2 corresponding to the pupil function 402 centered at coordinate 802; and 3) the arc defined by endpoints B, D having radius ρ3 corresponding to the first pupil function 401 centered at 801. The angles corresponding to the endpoints of arc C-D are φ2(1),φ1(1), respectively; for arc C-B the angles are φ1(2),φ2(2), respectively; and for arc C-D the angles are φ1(3),φ2(3), respectively, where all of the angles are measured clockwise with respect to the positive horizontal axis. In addition, sj(k)=sin φj(k), cj(k)=cos φj(k), jε{1,2}, and N is the number of arcs in the contour.
There are two immediate advantages to the contour integration method:
-
- 1) The number of unique geometrical configurations of the integration regions shrinks from 18 to 9, which would simplify the used used to compute TCCs. This number is less because some of the integration regions detailed above have the same type of boundary.
- 2) The final expression for the in-focus TCC takes the form of Eq. (37) which is evaluated with a single arctangent and a few square roots per arc, resulting in a more efficient algorithm.
5. Extension to Defocus
For scalar, moderate-to-high NA systems (i.e., NA between about 0.5 to 0.7), the pupil function of a defocused system (i.e. at a defocus position z along the optical axis) generally takes the form
Generally speaking, one must attack the problem of computing the TCCs with the pupil function in Eq. (38) numerically. However, as previously discussed, numerical computations of TCCs are slow and have errors that are not well understood. In accordance with this invention, the TCCs for defocused systems (i.e. to compute defocused images) are expressed in terms of standard, analytical functions, as described in more detail below. This is to be done without sacrificing accuracy, as other algorithms, such as SOCS or bilinear decomposition, must do.
A standard approximation for the defocused pupil function of Eq. (38) is the quadratic-phase, or paraxial, approximation, which is useful for small values of NA (e.g. less than about 0.5), which leads to the following expression for the defocused, paraxial pupil function:
Substituting PDP of Eq. (39) into the TCC integral of Eq. (5) leads to the following expression for the TCC:
where D is the intersection of the pupils, S is the source, a=−NA2z(u′−u″)/λ, and C is a phase term, where
The TCC integral in Eq. (40) now has the problem of having a nonuniform integrand over the intersection region D∩S. By analogy to the in-focus case discussed above, Stokes' Theorem can be applied to convert the TCC area integral of Eq. 40 into a contour integral. This is shown as follows. Consider the vector-valued function:
∇{circumflex over ( )}F=exp(−i2πa·σ){circumflex over (z)} (41)
where {circumflex over (z)} is a unit vector in the direction of the normal to the σ plane. Stokes Theorem converts the TCC double integral in Eq. (40) to a single, contour integral around the boundary of the integration region:
The idea is to find the value of F, given ∇{circumflex over ( )}F. F can be found exactly because it is a solution of the vector wave equation. That is, F can be written as
where {circumflex over (x)} is a vector in the σx direction and ŷ is a vector in the σy.
The expression in Eq. (43) is then put into the right-hand side of Eq. (42). The reduction of this contour integral into a simple, single integral will depend on the boundary arc. The boundary of the integration region is made up of anywhere between 1 and 4 circular arcs with distinct centers and radii. Actually, there are only two distinct types of arcs:
-
- 1) the boundary of the source function in σ-space is described by σx2+σy2=σ02; and
- 2) the boundary of the pupil function in σ-space is described by (σx−ux)2+(σy−uy)2=1
If these arcs are parameterized by the angle subtended from the center of their respective circles, then it turns out that both types of boundaries lead to the following single integral expression for the TCC, in accordance with the present invention:
where
φ1(j),φ2(j) are the angles corresponding to the endpoints of the arc, as illustrated in
Thus, the TCC in Eq. (44) is now expressed in terms of single integrals, rather than the double integrals of prior art TCC expressions. This integral is therefore more easily computed by numerical methods. However, it would be more desirable to compute these single integrals analytically to improve speed and accuracy. Unfortunately, the integral in Eq. (47) cannot be reduced to an exact analytical form, except for a few isolated special cases. More preferably, the integrand of Eq. (47) is replaced by a different function that differs from the original integrand by at most, some small number ε. This new integrand function will be chosen such that the integral can be expressed in terms of purely analytical functions.
First, note that Eq. (47) can be exactly rewritten in the following form:
Again, the integral on the right in Eq. (48) is not expressible in terms of analytical functions. Note in particular that {square root}{square root over (1−w2)}={square root}{square root over (1−w)}·{square root}{square root over (1+w)}. The first term {square root}{square root over (1+w)} has the property that it has a vertical tangent at w=1, but the integral of {square root}{square root over (1−w)} has a known analytical solution. The second term {square root}{square root over (1+w)} is smoothly varying, but the integral does not have a known analytical solution. Therefore, in accordance with the present invention, an approximation for the integrand {square root}{square root over (1−w2)} is made that preserves the properties of the first term {square root}{square root over (1−w)} (i.e. has a vertical tangent at w=1) and has smoothly varying properties, that is, the error decreases substantially monotonically as the number of terms in the approximation increases. In a preferred embodiment, the following polynomial approximation can be made for the integrand {square root}{square root over (1−w2)} over the interval for w of [0,1] (which covers all possible values of cos β):
For a given finite value of L, the coefficients fm may be determined by performing a curve fit of the polynomial to {square root}{square root over (1−w2)} at L+1 discrete values of w. In a preferred embodiment, for a given number of terms L, the coefficients fm are found by equating values of the left and right sides of Eq. (49) at equal steps over the interval for w of [0,1] and are deduced by solving a linear system of equations. It turns out that when L=4, the error |η|<10−5, which is sufficient for a typical single precision calculation. For example, referring to
Substituting the polynomial expression of Eq. (49) into the integral of Eq. (48) leads to a series of integrals. To evaluate the integral of the first term of the polynomial approximation from Eq. (49) we can make use of the following integral expression:
where
is the Fresnel integral. Although the Fresnel integral is not typically recognized as a standard analytical function, it does, in fact, contain all of the properties of such functions, and, most importantly here, can be computed to within machine precision using a (countably) small number of arithmetic operations, for example, by using a continued fraction expansion that is highly convergent, as is well known in the art.
The integrals from Eq. (48) including the higher order terms in the polynomial from Eq. (49) are computed using the relation
Eqs. (49), (50), and (52) are combined to realize, for L=4, the following approximation for the integral in Eq. (48) which is valid for W∈[0,1]:
where |ε|<10−5 for all values of α and w∈[0,1].
Therefore, the preferred embodiment in accordance with the present invention, the paraxially defocused TCC (i.e. for NA≦0.5) of Eq. (44), i.e.:
can now be computed analytically, within an error of 10−5, where the integrals J are expressed by the following expression, for L=4, as:
where α=2πρj|a|, a=−NA2z(u′−u″)/λ, and β=φ0+φ2(j) and φ0+φ1(j) at the endpoints of the N arcs of the region of integration, which is the intersection of the source and offset pupil functions. Similar analytical expressions for TCC can be derived having greater accuracy by including additional terms in a substantially monotonic approximation of Eq. (48) (e.g. by using larger values of L in Eq. (49) or similar expansions of Eq. (48)). Thus, in accordance with the present invention, these TCCs can be obtained with a relatively small number of arithmetic operations relative to numerical integrations that are used in current simulators. Accordingly, the image intensities can likewise be computed with improved speed and accuracy.
6. Extension to High NA
Eq. (38) dictates the nonparaxial (i.e. where NA is greater than about 0.5) phase behavior of the defocused pupil function PD. Because of the square root in the exponential, a simple, analytical expression, such as that found in Eq. (53), which is the result of a paraxial approximation, is not possible. Nevertheless, there is a simple algebraic transformation of the electric field expression in Eq. (1), that depends only on the spatial (x) coordinate and not the spatial frequency, that corrects for the phase errors introduced by the paraxial approximation (see, for example, Forbes et al., “Algebraic corrections for paraxial wave fields,” J. Opt. Soc. Am. A, Vol. 14, No. 12, pp. 3300-3315, Dec. 1997). By treating this nonparaxial correction as an algebraic transformation that is independent of spatial frequency, the values of the TCCs will not change.
The nonparaxial correction to the paraxial electric field intensity is given by the following transformation: Let Ip(x,z) be the paraxial intensity, at a point x on the image plane, and z is the defocus amount. Then the nonparaxial intensity I(x,z) (i.e. for all NA including NA greater than about 0.7) can be approximated by
The factor 1+NA2g(x,z)2 in square brackets is analogous to an obliquity term that is absent in the paraxial approximation. The denominator on the right-hand side of Eq. (55b) corresponds to a switching function that turns off the correction in the focal plane.
7. Extension to Aberrations
A wave aberration may be represented as a phase term of polynomials, such as Zernike polynomials, across the pupil. For the purposes of lithography modeling in a production semiconductor manufacturing environment, these aberrations are typically small, for example, less than about 1-3% of the illumination wavelength λ. It is convenient to express the aberrations as an exponential PA of a phase term across the pupil as:
where Zq is the qth Zernike polynomial, ζq is the corresponding Zernike coefficient, Q is typically 37 in most lithographic applications, and εw, which is a small, dimensionless number that represents the order of magnitude of the wavefront error (for example, as discussed above, εw may be on the order of 0.01λ). Because these aberrations are considered small compared to the wavelength, the exponential pupil function PA can be Taylor expanded with respect to εw out to, say, O(εw2), to yield:
Typically, aberrations are combined with a defocus term for simulation purposes in order to explore degradation of process window that a particular set of aberrations may cause. The combined pupil function is multiplicative; that is, it takes the form P(u)=PA(u)PD(u). Aberrations in a lens will result in a lack of a single geometrical focus, and increases the size of the diffraction “blur” in the image. This degrades the ability to image critical features over a wide range of focus and dose conditions, which is what we mean by the degradation of the process window. In accordance with the present invention, two methods are presented for analytically computing the aberration pupil function PA.
In a first embodiment, note that the phase term (i.e.
of the paraxial defocus pupil function of Eq. (39) is mathematically equivalent to the fourth Zernike polynomial Z4, more specifically, is a quadratic function of u, and thus can be considered as an effective aberration. That is, the paraxial defocus term can be combined with the (ζ4 aberration coefficient, thus containing the defocus amount z as a component. Substituting the series expansion of Eq. (56a) into the TCC expression in Eq. (5) results in a series of terms involving integrals of the form
For example, if Q=8, then there are 21 such terms. Again, the double integrals defined by Eq. (57) can be simplified by using Stokes' Theorem (see Eq. (42)), into a single integral around the boundary of the TCC integration region:
As in the previous case, the contour is made up of N circular arcs, the pth arc having center (up,vp) and radius ρp. When the path is parametrized as follows:
σx=up+ρp cos φ,
σx=vp+ρp sin φ, (59)
with φ∈[0,2π], and the angle φ of the endpoint is measured relative to the horizontal axis, then the integral in Eq. (58) can be evaluated as a sum over functions defined by a single integral:
It turns out that the integrals in Eq. (60) can be expressed in terms of incomplete beta functions:
where
is the incomplete beta function. This function is well-known in the mathematical literature and algorithms exist that compute its value using only a few arithmetic operations. Such an expansion of the aberration pupil function in terms of incomplete beta functions has been considered previously for one-dimensional (1D) geometries (see, for example, Steel, W. H, “Effects of Small Aberrations on the Images of Partially Coherent Objects,” J. Opt. Soc. Am., Vol. 47, No. 5, pp. 405-413 (1957)), but not for two-dimensional (2D) geometries. In accordance with the present invention, the form of Eq. (61) provides an analytical solution for the aberration pupil function PA for general 2D geometries. This pupil function can be multiplied with the defocused pupil function PD, or for example PDP as provided in Eq. (39) of a preferred embodiment, and the resulting TCC computed, for example as in Eq. (54a), can be used to provide an analytical expression for image intensity that includes aberrations, and that has small error. However, this solution is applicable to a limited range of defocus values, limited by the small value of εw that is assumed.
More preferrably, in a second embodiment for computing aberrated images, the paraxial defocus term PDP is treated separately from the aberration term PA and the argument in the exponential of PDP is not assumed to be small (i.e. for z larger than the wavelength λ). In accordance with the present invention, the aberration pupil function PA is Taylor expanded to 2nd order in εw in accordance with Eq. (56a), but the paraxial defocus pupil function PDP (see Eq. (39)) is kept as is. Substituting the Taylor expansion of PA into the TCC integral of Eq. (5), integrals of the form:
are obtained, where a=−NA2z(u′−u″)/λ.
The integral on the 2nd line of Eq. (63) has the same form as the TCC integral of Eq. (40) and therefore, Ψmn is expressed explicitly in terms of derivatives of the analytical functions of the form previously derived, for example as in Eqs. (54a) and (54b), whose integrals also have known analytical solutions. Therefore, the aberration pupil function PA is now expressed in terms of analytical functions which is then, in accordance with the present invention, multiplied with the paraxial defocused pupil function PDP to obtain an integrand for the TCC integral that includes aberrations.
8. Energy Conservation, Vector Effects, and Air/Resist Interfaces
In the simulation of the very high-NA (NA>0.7) systems in use at the latest technology nodes, other physical effects such as obliquity due to power conservation constraints, vector diffraction effects, and the effects of the resist layer complicate the analytical structure of the Hopkins' TCC integral. These three effects, however, have the common property that they are essentially pupil apodizations; that is, they are amplitude variations across the pupil. The functional forms that describe these variations each would make the TCC integral impossible to simplify in an exact form. For example, the propagation of the light into the resist from air produces the following pupil amplitude function:
where n is the index of refraction of the resist. Nevertheless, as with the aberration term, such apodizations can be considered as corrections to the unapodized image, so that the apodization terms can be Taylor expanded for small spatial frequencies. Thus, in a preferred embodiment of the present invention, an apodization pupil function PAP, which can be expressed as an amplitude factor, would then be multiplied with the paraxial defocused pupil function PD, or PDP to obtain a pupil function that includes apodizations.
9. Implementation in Computer Software
The method of simulating the image intensity in accordance with the present invention may be implemented in machine readable code (i.e. software or computer program product) and performed on a computer, such as, but not limited to, the type illustrated in
More specifically, the conventional method for computing TCCs is illustrated in
A flow chart of an embodiment in accordance with the present invention is illustrated in
Although the present invention and its advantages have been described in detail, it should be understood that various changes, substitutions and alterations can be made herein without departing from the spirit and scope of the invention as defined by the appended claims.
Claims
1. A method of simulating an image of a patterned mask having a mask function in the spatial frequency domain, the image to be formed by a projection system having a defocus amount z along an optical axis, the projection system including pupil optics, the method comprising:
- providing a source function having a center spatial frequency coordinate;
- providing a first paraxial pupil function of the pupil optics at a first offset relative to said center spatial frequency coordinate and providing a second paraxial pupil function of the pupil optics at a second offset relative to said center spatial frequency coordinate;
- forming an integrand comprising a product of functions including said source function, said first paraxial pupil function, and said second paraxial pupil function;
- defining an integration region spanning the intersection of said source function with said first and second paraxial pupil functions, said integration region having a boundary comprising a finite number of arcs;
- integrating said integrand for each of said finite number of arcs to obtain a finite number of contour integrals each corresponding to one of said finite number of arcs, wherein each of said finite number of contour integrals comprises an analytical solution; and
- determining a transmission cross-coefficient (TCC) comprising a sum of said finite number of contour integrals.
2. The method of claim 1 wherein said first and second paraxial pupil functions each has a phase term that is approximated by a second order Taylor expansion.
3. The method of claim 1 wherein each of said arcs are circular arcs having a subtended angle φ relative to the center of the corresponding circle of said arc.
4. The method of claim 3 wherein said integrand is parameterized in terms of a square root of one plus the cosine of said subtended angle φ, and wherein said square root is approximated by an expansion having an error term, wherein said expansion has a finite number of terms L, wherein L is sufficiently large that said error term is smaller than a predetermined error tolerance.
5. The method of claim 4, wherein said expansion has L parameters fm, where m ranges from 1 to L, said parameters determined by a curve fit to the square root of one plus the cosine of said subtended angle φ.
6. The method of claim 5 wherein said parameters fm comprise a polynomial expansion of the form ( 1 + ∑ m = 1 L f m ( cos ϕ ) m ).
7. The method of claim 6 wherein said finite number of terms L is 4.
8. The method of claim 1 wherein the projection system has a numerical aperture (NA) between about 0.5 to 0.7.
9. The method of claim 1 further comprising the step of determining image intensity in accordance with a Hopkins model using said TCC integral.
10. The method of claim 9 wherein the projection system has a numerical aperture (NA) greater than about 0.7, and wherein said image intensity is determined at a coordinate {right arrow over (x)} orthogonal to the optical axis, and said method further comprises dividing said coordinate {right arrow over (x)} by the square root of a nonparaxial correction factor (1+NA2g({right arrow over (x)}, z)2) and dividing said image intensity by the square of said nonparaxial correction factor, wherein g ( x ->, z ) = x -> ( N A z ) ( 1 + 9 λ 4 256 π 8 N A 8 z 4 ) for an illumination comprising a wavelength of λ.
11. The method of claim 1 further comprising determining an aberration pupil function comprising an exponential of a phase term, said phase term expressed by a closed form polynomial series with respect to a deviation εw from a spherical lens, wherein said exponential is Taylor expanded in terms of said deviation εw to a specified order, wherein said step of forming an integrand further comprises multiplying each of said first and second paraxial pupil functions by said aberration pupil function.
12. The method of claim 11 wherein said closed form polynomial series comprises Zernike polynomials.
13. The method of claim 1 further comprising determining an apodization pupil function comprising a factor representing amplitude variations across the pupil, and wherein said step of forming an integrand further comprises multiplying each of said first and second paraxial pupil function by said apodization pupil function.
14. The method of claim 13 wherein said wherein the projection system has a numerical aperture (NA) greater than about 0.7.
15. A computer program product comprising computer readable storage medium having stored therein computer readable instructions executable by the computer for causing a computer to perform method steps for simulating an image of a patterned mask having a mask function in the spatial frequency domain, the image to be formed by a projection system having a defocus amount z along an optical axis, the projection system including pupil optics, the method steps comprising:
- providing a source function having a center spatial frequency coordinate;
- providing a first paraxial pupil function of the pupil optics at a first offset relative to said center spatial frequency coordinate and providing a second paraxial pupil function of the pupil optics at a second offset relative to said center spatial frequency coordinate;
- providing an integrand comprising a product of functions including said source function, said first paraxial pupil function, and said second paraxial pupil function;
- defining an integration region spanning the intersection of said source function with said first and second paraxial pupil functions, said integration region having a boundary comprising a finite number of arcs;
- integrating said integrand for each of said finite number of arcs to obtain a finite number of contour integrals each corresponding to one of said finite number of arcs, wherein each of said finite number of contour integrals comprises an analytical solution; and
- determining a transmission cross-coefficient (TCC) comprising a sum of said finite number of contour integrals.
16. The computer program product of claim 15 wherein said first and second paraxial pupil functions each has a phase term that is approximated by a second order Taylor expansion.
17. The computer program product of claim 15 wherein each of said arcs are circular arcs having a subtended angle φ relative to the center of the corresponding circle of said arc.
18. The computer program product of claim 17 wherein said integrand is parameterized in terms of a square root of one plus the cosine of said subtended angle φ, and wherein said square root is approximated by an expansion having an error term, wherein said expansion has a finite number of terms L, wherein L is sufficiently large that said error term is smaller than a predetermined error tolerance.
19. The computer program product of claim 18, wherein said expansion has L parameters fm, where m ranges from 1 to L, said parameters determined by a curve fit to the square root of one plus the cosine of said subtended angle φ.
20. The computer program product of claim 19 wherein said parameters fm comprise a polynomial expansion of the form ( 1 + ∑ m = 1 L f m ( cos ϕ ) m ).
21. The computer program product of claim 20 wherein said finite number of terms L is 4.
22. The computer program product of claim 15 wherein the projection system has a numerical aperture (NA) between about 0.5 to 0.7.
23. The computer program product of claim 15 wherein said method steps further comprise the step of determining image intensity in accordance with a Hopkins model using said TCC integral.
24. The computer program product of claim 23 wherein the projection system has a numerical aperture (NA) greater than about 0.7, and wherein said image intensity is determined at a coordinate {right arrow over (x)} orthogonal to the optical axis, and said method steps further comprise dividing said coordinate {right arrow over (x)} by the square root of a nonparaxial correction factor (1+NA2g({right arrow over (x)},z)2) and dividing said image intensity by the square of said nonparaxial correction factor, wherein g ( x ->, z ) = x -> ( N A z ) ( 1 + 9 λ 4 256 π 8 N A 8 z 4 ) for an illumination comprising a wavelength of λ.
25. The computer program product of claim 15 wherein said method steps further comprise providing an aberration pupil function comprising an exponential of a phase term, said phase term expressed by a closed form polynomial series with respect to a deviation εw from a spherical lens, wherein said exponential is Taylor expanded in terms of said deviation εw to a specified order, wherein said step of providing an integrand further comprises multiplying each of said first and second paraxial pupil functions by said aberration pupil function.
26. The computer program product of claim 25 wherein said closed form polynomial series comprises Zernike polynomials.
27. The computer program product of claim 15 wherein said method steps further comprise providing an apodization pupil function comprising a factor representing amplitude variations across the pupil, and wherein said step of providing an integrand further comprises multiplying each of said first and second paraxial pupil function by said apodization pupil function.
28. The computer program product of claim 27 wherein said wherein the projection system has a numerical aperture (NA) greater than about 0.7.
Type: Application
Filed: Jul 17, 2003
Publication Date: Jan 20, 2005
Applicant: International Business Machines Corporation (Armonk, NY)
Inventor: Ronald Gordon (Poughkeepsie, NY)
Application Number: 10/621,472