Method and apparatus for optically measuring periodic structures using orthogonal azimuthal sample orientations

Info

Publication number: 20080129986
Type: Application
Filed: Nov 29, 2007
Publication Date: Jun 5, 2008
Inventor: Phillip Walsh (Austin, TX)
Application Number: 11/998,263

Abstract

An optical metrology apparatus for measuring periodic structures using multiple incident azimuthal (phi) and polar (theta) incident angles is described. One embodiment provides the enhanced calculation speed for the special case of phi=90 incidence for 1-D (line and space) structures, which has the incident plane parallel to the grating lines, as opposed to the phi=0 classical mounting, which has incident plane perpendicular to the grating lines. The enhancement reduces the computation time of the phi=90 case to the same order as the corresponding phi=0 case, and in some cases the phi=90 case can be significantly faster. One advantageous configuration consists of two measurements for each sample structure, one perpendicular to the grating lines and one parallel. This provides additional information about the structure, equivalent to two simultaneous angles of incidence, without excessive increase in computation time. Alternately, in cases where the computation for phi=90 is faster than the corresponding phi=0 incidence, it may be advantageous to measure parallel to the grating lines only. In the case where two sets of incident angles are used, the incident light can be polarized to provide a total of four sets of data—Rs0, Rp0, Rs90, Rp90—for each incident polar angle, all from the same structure.

Description

This application claims priority to Provisional Patent Application No. 60/872,010 filed Nov. 30, 2006; the disclosure of which is expressly incorporated herein by reference in its entirety.

TECHNICAL FIELD OF THE INVENTION

This invention relates to an optical metrology apparatus and methods and systems for measuring periodic structures using multiple incident azimuthal (phi) and polar (theta) angles, and particularly to enhanced calculation speed for a special case of phi=90 incidence for 1-D (line and space) gratings, having an incident plane parallel to grating lines. This results in additional datasets to supplement data collected in the phi=0 classical mount configuration without an untenable increase in computation cost.

BACKGROUND OF THE INVENTION

A widely referenced source on a rigorous coupled wave (RCW) algorithm is that of Moharam and Gaylord (M. G. Moharam, E. B. Grann, D. A. Pommet, and T. K. Gaylord, J. Opt. Soc. Am. A, Vol. 12, No. 5, p. 1068 (1995)). A schematic from their paper showing the diffraction problem is reproduced in FIG. 1. In particular, FIG. 1 defines the various incident conditions for the diffraction problem. The plane of incidence is defined by the polar angle, theta, and the azimuthal angle, phi. The azimuthal angle defines the angle the incident plane makes with the plane perpendicular to the grating lines, so that phi=0 corresponds to the classical incidence case. The angle psi defines the direction of the electric field with respect to the plane of incidence, with psi=90 corresponding to s polarized and psi=0 to p polarized incident light.

As shown in FIG. 1, a diffraction grating 100 has a grating region 104 formed in a substrate 102 (the substrate being designated as Region II). The grating region 104 has a height d as indicated in the figure. Region I is comprised of the material above the grating region, in this case in air or vacuum space. As indicated in FIG. 1, the grating region may be formed of alternating grating lines 106 and grating spaces 108. The grating lines 106 may have a width 108. The grating periodicity is characterized by the grating period 10 as indicated.

It will be recognized that a diffraction grating may be formed in other manners than that of FIG. 1 and that FIG. 1 is only one exemplary diffraction grating as known to those skilled in the art. For example, a diffraction grating need not be formed utilizing spaces. FIG. 1B shows one such alternative diffraction grating. As shown in FIG. 1B, the diffraction grating 100 may be comprised of grating lines 106A and 106B, again having a grating region 104 with height d. In this example, grating lines 106A and 106B will be formed in a manner in which the lines have different optical properties. Further, though the examples shown include gratings having two different optical properties within each period of the diffraction grating, it will be recognized that the diffraction grating may comprise three or more different materials within each period. Likewise, though each grating line is shown as a single material, it will be recognized that the grating lines may be formed of multiple layers of the same or different materials. In addition, though the grating lines are shown as being “squared off,” it will be recognized that each line may have sloped sides, curved edges, etc.

With reference again to FIG. 1, an x-y-z coordinate system is shown having a frame of reference in which the x-direction is shown as being perpendicular to the original alignment of the grating lines. The plane of incidence 112 of the incident light is defined by the polar angle 114, theta and the azimuthal angle 116, phi. The electric field 120 has a propagation vector 122 (k) of the incident wave. The unit vectors 124 (t) and 126 (n) are tangent and normal to the plane of incidence, respectively. As mentioned above, the angle 128, psi, defines the direction of the electric field with respect to the plane of incidence.

The RCW method involves the expansion of the field components inside and outside the grating region in terms of generalized Fourier series. The method consists of two major parts: an eigen-problem to determine a general solution inside the grating layer, and a boundary problem to determine the reflected and transmitted diffracted amplitudes along with the specific solution for the fields inside the grating region. The Fourier series are truncated after a finite number of terms. The truncation is usually characterized by the truncation order, N, which means that 2N+1 spatial harmonics are retained in the series (positive and negative terms to ±N, and the 0 term).

Standard methods for solving the eigen-problem, boundary problem, and the various other matrix multiplications and inversions involved are order N³ operations. This means that an increase of the truncation order by a factor of two results in an increase in overall computation time by a factor of approximately 8. The truncation order required for convergence is determined by the specifics of the diffraction problem, and generally increases for larger pitch to incident wavelength ratios and larger optical contrast between grating lines and spaces. The result is that while some diffraction problems are very tenable, others quickly become impractical to solve due to a large computation cost.

In the case of the phi=0 classical mount, the diffraction problem decouples into TE and TM components, which can be solved separately (for the phi=0 mount, TE polarization corresponds to s polarized incident light, and TM polarization corresponds to p polarized incident light). Any arbitrary polarization is decomposed into a combination of the TE and TM problems. In practice, the incident light is often purely TE or TM polarized, and only one case needs to be solved. For given truncation order N and classical mount the eigen-problem is of size 2N+1, and the boundary problem is of size 2(2N+1).

The general case where phi≠0 is known as conical diffraction. In this case, the s and p components are coupled, with a corresponding increase in the amount of computation time. The boundary problem involves 4(2N+1) sized matrices. The eigen-problem has been successfully decoupled into two smaller eigen-problems, each of size 2N+1 (see Moharam and Gaylord 1995 referenced above, or S. Peng and G. M. Morris, J. Opt. Soc. Am. A, Vol. 12, No. 5, p. 1087 (1995)). Therefore, the computation time for the general conical incidence case suffers a factor of 2 increase for the eigen-problem and a factor of 8 increase for the boundary problem compared to the corresponding classical mount case with same polar incident angle, theta.

Analysis of a diffraction grating problem is of particular use to determining the various characteristics of the diffraction grating structure. For example, critical dimensions of a device (such as in semiconductor processing in one exemplary use) may be monitored by evaluating the characteristics of a diffraction grating as is known in the art. By evaluating data from known optical metrology tools using regression and/or library methods, the diffraction analysis may lead to, for example, a determination of the grating line widths, the grating height/depth, the period of the grating, the slopes and profiles of the grating, the material composition of the grating, etc. As known in the art, such grating characteristics may be related to the characteristics of a device that is being analyzed, such as for example but not limited to widths, heights, depths, profiles, etc. of transistors, metallization lines, trenches, dielectric layers, or the like, all as is known to those skilled in the art. Since the regression and/or library methods may require many calculations of diffraction efficiencies, special consideration must be given to computation expense in such applications.

SUMMARY OF THE INVENTION

An optical metrology apparatus for measuring periodic structures using multiple incident azimuthal (phi) and polar (theta) incident angles is described. One embodiment provides the enhanced calculation speed for the special case of phi=90 incidence for 1-D (line and space) structures, which has the incident plane parallel to the grating lines, as opposed to the phi=0 classical mounting, which has incident plane perpendicular to the grating lines. The enhancement reduces the computation time of the phi=90 case to the same order as the corresponding phi=0 case, and in some cases the phi=90 case can be significantly faster. One advantageous configuration consists of two measurements for each sample structure, one perpendicular to the grating lines and one parallel. This provides additional information about the structure, equivalent to two simultaneous angles of incidence, without excessive increase in computation time. Alternately, in cases where the computation for phi=90 is faster than the corresponding phi=0 incidence, it may be advantageous to measure parallel to the grating lines only. In the case where two sets of incident angles are used, the incident light can be polarized to provide a total of four sets of data, namely R_s⁰, R_p⁰, R_s⁹⁰, and R_p⁹⁰, for each incident polar angle, all from the same structure (R_s⁰ being a data set having incident phi=0 and incident polarization normal to the plane of incidence, R_p⁹⁰ being a data set having incident phi=90 and incident polarization within the plane of incidence, etc.).

In one embodiment, the techniques described herein provide an optical metrology apparatus and methods and systems for measuring periodic structures using multiple incident azimuthal (phi) and polar (theta) angles, and particularly provide enhanced calculation speed for a special case of phi=90 incidence for 1-D (line and space) structures, having an incident plane parallel to grating lines.

In one embodiment, a method of reducing an RCW calculation for the phi=90 incidence mount by exploiting the degeneracy in the resulting diffraction problem is provided. The method may include using this calculation in the regression part of an optical grating measurement. Alternately, the method may include using the enhanced speed in the generation of a database library to be used in conjunction with an optical grating measurement. The optical method could be any in existence, such as reflectometry, polarized reflectometry, ellipsometry, polarimetry, etc. and can be broadband or single wavelength.

The method may include illuminating a grating structure with polarized or unpolarized, monochromatic or broadband light. The incident light may be at one or more polar angles, theta, at the phi=0 and phi=90 azimuthal directions for each of the polar angles. The method may further include detecting the response, for a total of up to four datasets per grating sample per incident polar angle, which are then simultaneously analyzed in order to take advantage of the enhanced information content contained in the multiple datasets. The calculation time is reduced compared to conventional RCW formulations due to the reduced calculation requirements for the phi=90 cases.

One or more diffracted orders may be detected along with or instead of the 0th order. Further, when the detected response is reflected or diffracted intensity, one or more of the datasets may be used to normalize the other datasets, making an absolute calibration of the tool unnecessary.

One or more of the datasets may be used to normalize the other datasets, making an absolute calibration of the optical tool unnecessary, and the inverse ratio is substituted in calculations for specific wavelength regions where the denominator of the original ratio is near zero.

One or more of the datasets may be used to normalize the other datasets, making an absolute calibration of the tool unnecessary, and the inverse ratio is substituted in calculations for specific wavelength regions where the denominator of the original ratio is near zero, and a weighting function is used to equalize the contribution to the merit function regardless of reflectance ratio magnitude.

In addition, one or more of the datasets may be used to normalize the other datasets, making an absolute calibration of the optical tool unnecessary, and the data regions where the denominator of the ratio is near zero are dropped from the analysis.

Data collected from the diffracting structure may be normalized by data from a nearby uniform film structure having the same stack layer structure as the diffracting structure.

The angle of incidence may be explicitly varied by changing the polar angle of incidence of the optical plane, or by rotating the optical plane or sample at fixed polar angle to generate phi=0 and phi=90 incident data.

The multiple angle of incidence data may be generated through use of a high numerical aperture optic (so it contains a spread of angles) and selecting specific angles using an aperture stop to allow light incident at only specific angles.

In addition, multiple angles of incidence may be allowed, and the data simultaneously analyzed.

Further, the method may include only measuring and analyzing the phi=90 incidence data.

As described below, other features and variations can be implemented, if desired, and a related method can be utilized, as well.

DESCRIPTION OF THE DRAWINGS

It is noted that the appended drawings illustrate only exemplary embodiments of the invention and are, therefore, not to be considered limiting of its scope, for the invention may admit to other equally effective embodiments.

FIG. 1 shows a schematic diagram of diffraction problem illustrating polar (theta) and azimuthal (phi) incident angles.

FIG. 1B shows an exemplary alternative diffraction grating.

FIGS. 2A and 2B show rotation of the stage/sample/objective by 90 degrees and back for each measurement.

FIGS. 3A-3E illustrate using a high NA objective with an aperture stop that allows light incident at specific angles. Multiple incident angles are achieved by rotating the sample, aperture, or objective. One modification might simultaneously illuminate the grating from both directions (phi=0 and phi=90) as shown in FIG. 3D and FIG. 3E.

FIG. 4 illustrates an apparatus for collecting VUV-Vis reflectance data at multiple incident angles.

DETAILED DESCRIPTION OF THE INVENTION

One way to directly attack the time required for the RCW method is to reduce the number of spatial harmonics involved in the eigen-problem, the boundary problem, or both. This was done for the general case to a large extent in the work of Moharam and Gaylord in their papers: M. G. Moharam, E. B. Grann, D. A. Pommet, and T. K. Gaylord, “Formulation for stable and efficient implementation of the rigorous coupled-wave analysis of binary gratings,” J. Opt. Soc. Am. A 12, 1068-1076 (1995) and M. G. Moharam, D. A. Pommet, E. B. Grann, and T. K. Gaylord, “Stable implementation of the rigorous coupled-wave analysis for surface-relief gratings: enhanced transmittance matrix approach,” J. Opt. Soc. Am. A 12, 1077-1086 (1995). Subsequent modifications to enhance the TM convergence have included those techniques shown in: P. Lalanne and G. M. Morris, “Highly improved convergence of the coupled-wave method for TM polarization,” J. Opt. Soc. Am. A 13, 779-784 (1996); G. Granet and B. Guizal, “Efficient implementation of the coupled-wave method for metallic lamellar gratings in TM polarization,” J. Opt. Soc. Am. A 13, 1019-1023 (1996); and L. Li, “Use of Fourier series in the analysis of discontinuous periodic structures,” J. Opt. Soc. Am. A 13, 1870-1876 (1996). However, for certain incidence conditions where the incident plane wave has an x-periodicity that is the same as or is a multiple of the grating period, the diffraction problem benefits from additional degeneracy, and the coupled equations can be even further reduced.

One case where this occurs is when the plane of incidence is in the phi=90 mount so that the incident wave is constant with x, and the resulting degeneracy can be exploited to reduce the total number of unknowns. This reduces the total number of spatial harmonics required for the conical diffraction case from 4(2N+1) to 4(N+1). The result is a reduction of the computation time required by a factor of approximately 8 compared to the standard phi=90 conical diffraction case. In addition, a small but significant reduction in computation time compared to the classical mount with same theta is achieved. There are still two eigen-problems, but each of size N+1, leading to an overall reduction of the eigen-problem by a factor of approximately 4 compared to the comparable phi=0 case. The boundary problems for the two cases require approximately the same computation time (2(2N+1) vs. 4(N+1) matrix sizes). Therefore, the computation time for the phi=90 case is reduced to the same order as the corresponding phi=0 case. The phi=90 case can sometimes be significantly faster than the phi=0 case, depending on how much influence the eigen-problem has on the overall computation time. The result is that the additional azimuthal angle phi=90 can be added to the data without an excessive increase in computation time. It will be recognized that although the concepts described herein may refer to analysis at particular angles such as phi=0 and phi=90, the concepts are not limited to use of these exact angles. For example, equipment and sample tolerances may result in other angles being actually used as variability from an anticipated angle is to be expected in real world applications. In addition, variations from the angles of best choice may be purposefully allowed beyond such tolerances while still obtaining the benefits of the techniques described herein. 
For example, the optical system and grating may be of such a nature that variations from the most desirable angles will still provide a sufficiently accurate calculation at other angles such that the data collected and the calculations may be effectively similar to the use of phi=0 and phi=90 to the extent that sufficient accuracy for a particular application is obtained. In this manner angles that deviate from the phi=0 and phi=90 may be considered to be effectively phi=0 and phi=90 for the purpose of utilizing a metrology tool implementing the techniques described herein for a given application.
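The matrix sizes quoted above for the phi=0 classical mount, the general conical case, and the reduced phi=90 case can be compared numerically. The following is a minimal sketch, assuming that every eigen-problem and boundary problem costs exactly the cube of its matrix size (the order N³ scaling stated in the background) and ignoring all constant factors; the function names are hypothetical and are not part of the disclosed method.

```python
def matrix_sizes(N):
    """Matrix sizes for truncation order N, as quoted in the text."""
    return {
        "phi=0 classical": {"eigen": [2 * N + 1], "boundary": 2 * (2 * N + 1)},
        "general conical": {"eigen": [2 * N + 1, 2 * N + 1], "boundary": 4 * (2 * N + 1)},
        "phi=90 reduced": {"eigen": [N + 1, N + 1], "boundary": 4 * (N + 1)},
    }


def cubic_cost(sizes):
    """Assumed cost model: each problem costs (matrix size)**3."""
    return {
        "eigen": sum(n ** 3 for n in sizes["eigen"]),
        "boundary": sizes["boundary"] ** 3,
    }


def relative_costs(N):
    """Costs of each case relative to the phi=0 classical mount."""
    costs = {name: cubic_cost(s) for name, s in matrix_sizes(N).items()}
    ref = costs["phi=0 classical"]
    return {
        name: {
            "eigen": c["eigen"] / ref["eigen"],
            "boundary": c["boundary"] / ref["boundary"],
        }
        for name, c in costs.items()
    }


if __name__ == "__main__":
    for name, rel in relative_costs(50).items():
        print(f"{name:16s} eigen x{rel['eigen']:.2f}  boundary x{rel['boundary']:.2f}")
```

Under this simplified model the general conical case costs 2 times (eigen-problem) and 8 times (boundary problem) the classical case, while the reduced phi=90 case has an eigen-problem cost of roughly one quarter of the classical case and a boundary cost that is nearly equal to it, in line with the factors stated above.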

An effective way of increasing the amount of information that can be extracted from a single sample is to collect more data sets. This is often done by using multiple angles of incidence theta or, in the case of grating structures, using multiple azimuthal angles. For instance, T. Novikova, A. De Martino, S. B. Hatit, and B. Drevillon, “Application of Mueller polarimetry in conical diffraction for critical dimension measurements in microelectronics,” Appl. Opt. 45, 3688-3697 (2006) shows a technique to extract more information about grating line-shapes using multiple azimuthal angles in conjunction with Mueller polarimetry.

This is significant since the hardware implementation for multiple phi configurations is considerably simpler than cases where multiple polar angles are used. In the simplest case, a fixed angle theta can be used, while the sample or stage is rotated through the azimuths. Alternately, the optic objective could be rotated. To take advantage of the above mentioned RCW improvements, the system would simply have to rotate the stage/sample/objective by 90 degrees and back for each measurement (FIGS. 2A-2B), generating two datasets for each sample, at orthogonal incident conditions.

As shown in FIG. 2A, a substrate is provided with a diffraction grating 100. A source 206 provides an incident light wave 200 at a polar angle 114 (theta) and an azimuthal angle 116 (phi). In FIG. 2A, phi=0, so no azimuthal angle is indicated. A detector 208 may be provided to detect the diffracted orders of reflected light 202. As used herein, light detected from a diffraction grating may be referred to as reflected light, reflected data, or the like and may include one or both of specular reflection of the zero order light and higher order diffracted light.

Though not shown in the figures provided herein, a computer system, processor, or the like may be coupled to the detector to process the collected data according to the analysis techniques described herein. The rotation of the diffraction grating with reference to the position of FIG. 2A is shown in FIG. 2B. As shown in FIG. 2B, the diffraction grating has been rotated (for example by rotating the stage, the sample, or the incident light or a combination thereof) so as to create an azimuthal angle 116 (phi) that is phi=90.

Another means for collecting data at multiple angles of incidence could utilize the large cone angle inherent in high numerical aperture (NA) normal incident objective systems in conjunction with an aperture stop to allow light incident at specific angles, as shown in FIGS. 3A-3C. One advantage of this configuration is that multiple polar angles can be more easily incorporated, in addition to the orthogonal azimuthal angles. As shown in FIG. 3A, an aperture 300 provides collimated light to an objective 302 which focuses the light on a diffraction grating 100 at a polar angle 114 theta. Different aperture settings are shown between FIGS. 3A and 3B to illustrate utilizing different polar angles theta. As shown in FIGS. 3A and 3B, the incident wave is not rotated from the classical phi=0 orientation. FIG. 3C illustrates a rotation such that phi does not equal zero, for example phi=90.

Another possibility might simultaneously illuminate the sample from both directions (phi=0 and phi=90). This could be implemented using two separate, orthogonal optic planes, or a modified version of the high NA system (FIG. 3D). The configuration of FIG. 3D is similar to that of FIG. 3A except a modified aperture 300A is provided. A top view of the modified aperture 300A is shown in FIG. 3E. As can be seen, the aperture 300A allows for light to pass in multiple directions, more particularly in a manner such that phi=0 and phi=90 may be simultaneously illuminated.

It will be recognized by those in the art that the embodiments of FIGS. 2A-2B and 3A-3D are merely illustrative so as to demonstrate the technique of changing the azimuthal angle. A wide range of optical metrology tool configurations may be utilized to achieve the desired incidence conditions so as to acquire the data sets described herein. These data sets may then be utilized in a manner that yields desired characteristics of a diffraction grating from the reduced computational complexity analysis that is achieved by collecting the rotated data as described in more detail below. It will also be recognized that although the present disclosure generally is described in reference to data collected at two phi angles (phi=0 and 90 angles), the techniques described herein are not limited to such techniques. In particular, the techniques described herein provide a reduced computation technique that may be utilized for measurements collected at a single phi angle. Further, additional phi angles beyond two may also be utilized to collect data while still obtaining the benefits described herein. Thus, the concepts described herein are not merely limited to collection of data at two phi angles.

Note that collecting multiple broadband datasets using different azimuthal incidence conditions is distinct from the old method of “phi scatterometry”, where the entire dataset consists of the diffraction spectrum of a single wavelength as a function of the azimuthal angle, phi.

The light can additionally be polarized parallel and perpendicular to the plane of incidence, so that four simultaneous data sets per incident theta can be obtained for a given 1-D grating structure, with corresponding enhancement in information. Alternately, each of the four configurations can be explored for a particular structure, and the most promising configuration employed in practice for measuring that particular structure.

Further, the additional datasets may enhance the information content to the extent that fewer wavelengths in a broadband system can be used in the analysis, and in this way the measurement can actually be made faster than a single angle of incidence configuration broadband measurement. In other words, the total number of calculations with respect to incident conditions, including wavelength and angle, may be reduced over that required for a single incident condition over many more wavelengths, while still extracting the same information about the grating structure.

Another advantageous configuration is to collect and analyze the ratio of reflected (0 order, for instance) intensity from the phi=0 scan to that of the reflected (0 order) intensity of the phi=90 scan. The intensity ratio is the same as the reflectance ratio, as long as the intensities are measured in quick succession so that there is minimal system drift between the two measurements. In this way the system calibration can be skipped, and the intensity ratio can be analyzed according to the above methods, taking advantage of the reduction in computation expense for the phi=90 case. In some wavelength ranges, the denominator may be close to zero. Those regions need not be analyzed, or the inverse ratio can be analyzed instead. This implementation can be particularly advantageous when using Vacuum Ultra-Violet (VUV) incident light. One implementation of a VUV metrology apparatus is described in U.S. Pat. No. 7,126,131, Broad Band Referencing Reflectometer, by Harrison, the disclosure of which is incorporated herein by reference in its entirety. For such systems, contaminant buildup on calibration standards over time causes difficulties for traditional calibration methods.
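The ratio analysis described above, including the substitution of the inverse ratio where the denominator is near zero, can be sketched as follows. This is an illustrative sketch only: the function name, the relative threshold `floor`, and the choice to flag "near zero" relative to the largest phi=90 intensity are assumptions made for the example and are not specified in this disclosure.

```python
import numpy as np


def intensity_ratio(i_phi0, i_phi90, floor=1e-3):
    """Return (ratio, inverted) for two intensity spectra on the same wavelength grid.

    ratio[k] is I_phi0 / I_phi90 where the phi=90 intensity is usable, and
    the inverse ratio I_phi90 / I_phi0 where the phi=90 intensity is near zero.
    inverted[k] is True wherever the inverse ratio was substituted.
    """
    i0 = np.asarray(i_phi0, dtype=float)
    i90 = np.asarray(i_phi90, dtype=float)
    # Assumed criterion: "near zero" means below floor times the peak phi=90 signal.
    inverted = np.abs(i90) < floor * np.max(np.abs(i90))
    ratio = np.empty_like(i0)
    ratio[~inverted] = i0[~inverted] / i90[~inverted]
    ratio[inverted] = i90[inverted] / i0[inverted]
    return ratio, inverted


if __name__ == "__main__":
    ratio, inverted = intensity_ratio([2.0, 1.0, 4.0], [1.0, 1e-6, 2.0])
    print(ratio, inverted)
```

The same substitution would have to be applied to the calculated reflectance ratio at the flagged wavelengths so that measured and modeled quantities remain comparable. Alternatively, the flagged points could simply be excluded from the analysis by masking with `~inverted`.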

An example of a VUV metrology apparatus configured to collect multiple broadband data sets using different azimuthal incident angles is presented in FIG. 4. The instrument is separated into two environmentally controlled chambers, the instrument chamber and the sample chamber. The instrument chamber houses most of the system optics and is not opened to the atmosphere on a regular basis. The sample chamber houses the sample, the sample focusing optic M-2, the reference focusing optic M-4 and the reference plane mirror M-5. This chamber is opened regularly to facilitate changing samples. The instrument is configured to enable collection of sample and reference data sets. The reference data set can be used to correct for system and/or environmental changes which may occur between calibration and sample measurement times. The system may be configured with multiple sources and spectrometers/detectors that are selected using flip-in mirrors FM-1, FM-2, FM-3 and FM-4.

In operation the VUV data is first obtained by switching flip-in source mirrors FM-1 and FM-3 into the “out” position so as to allow light from the VUV source to be collected, collimated and redirected towards beam splitter element BS by focusing mirror M-1. Light striking the beam splitter is divided into two components, the sample beam and the reference beam, using a balanced Michelson interferometer arrangement. The sample beam is reflected from the beam splitter BS and travels through shutter S-1, aperture A-1 and VUV-transparent window W-1. Aperture A-1 is configured to restrict illumination of the sample to some azimuthal plane(s). Shutter S-2 is closed during this time.

Light entering the sample chamber is focused by focusing optic M-2 onto the sample. Light collected from the sample is collimated and redirected by mirror M-2 back through window W-1, aperture A-1 and beam splitter BS. Light passing through the beam splitter encounters aperture A-2, which is configured to selectively pass some fraction of the collected sample response. Light passing through aperture A-2 is redirected and focused onto the entrance slit of the VUV spectrometer by focusing mirror M-3. Flip-in detector mirrors FM-2 and FM-4 are switched to the “out” position during this time.

Following collection of the sample beam, the reference beam is measured by closing shutter S-1 and opening shutter S-2. Once the reference signal has been recorded, data from other spectral regions can be collected in a similar manner using the appropriate flip-in mirrors.

In totality, the use of VUV incident radiation, large polar incident angle or angles, and multiple azimuthal angles, can greatly enhance sensitivity to grating line shape parameters. An analysis using a series of simulations can be done for any given grating structure in order to determine which combination of incident polar angles, azimuthal angles, and wavelengths yields the most information for smallest computation cost.

For faster measurements, where a smaller amount of information may be desired (e.g. line height and average width only), it may be sufficient to use only the phi=90 incidence case and take advantage of the improved calculation speed over the corresponding classical phi=0 incidence case.

Another technique disclosed herein analyzes the multiple sets using different models. For instance, phi=90 data might be analyzed using a simpler rectangular line shape model with a coarse parameter search to narrow down the average parameter values, and then a more thorough analysis done using another one of the spectra (or all of the spectra together) with a more complicated model to further refine the line shape.

Review of the RCW Method for Conical Incidence

This review follows the notation of the Moharam and Gaylord references cited above. Many publications exist on the basic RCW method, some with notation differences and some with modifications to the formulations/derivation procedures such as that shown in the P. Lalanne and G. M. Morris reference cited above. It should be noted that the phi=90 case reduction described in this disclosure can be applied to any of these formulations, and is not limited to just that of Moharam and Gaylord.

Note that the unreduced eigen-problem matrix and vector indices run from −N to N, with the (−N, −N) matrix element at the top left corner, in order to be consistent with a symmetric diffraction problem with positive and negative orders. When creating a computer algorithm, the indices are labeled from 1 to 2N+1 or from 0 to 2N, depending on the programming language used. It will be recognized that this is a notation preference and has no effect on the outcome. The indices of the reduced matrices will run from 0 to N in either case.
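As a small illustration of the index relabeling for a zero-based language, the following sketch maps a diffraction order i in the range −N to N onto an array index in the range 0 to 2N and back. The helper names are hypothetical and are used here only for illustration.

```python
def order_to_index(i, N):
    """Map diffraction order i (-N..N) to a zero-based array index (0..2N)."""
    if not -N <= i <= N:
        raise ValueError(f"order {i} is outside the truncation range -{N}..{N}")
    return i + N


def index_to_order(k, N):
    """Inverse mapping: zero-based array index (0..2N) to diffraction order."""
    if not 0 <= k <= 2 * N:
        raise ValueError(f"index {k} is outside 0..{2 * N}")
    return k - N


if __name__ == "__main__":
    N = 3
    print([order_to_index(i, N) for i in range(-N, N + 1)])
```

With this convention the (−N, −N) element sits at array position (0, 0), that is, the top left corner, and the zero order sits at index N.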

Following the Moharam and Gaylord references, the RCW method expands the fields in each region of FIG. 1 as a generalized Fourier series:

$\begin{matrix} E_{I} = E_{inc} + \sum_{i} R_{i} \exp [- j (k_{xi} x + k_{y} y - k_{I, zi} z)] & eq . 1 \\ E_{II} = \sum_{i} T_{i} \exp {- j [k_{xi} x + k_{y} y + k_{II, zi} (z - d)]} & eq . 2 \end{matrix}$

in regions I and II, and

$\begin{matrix} E_{g} = \sum_{i} [S_{xi} (z) x + S_{yi} (z) y + S_{zi} (z) z] \exp [- j (k_{xi} x + k_{y} y)] & eq . 3 \\ H_{g} = - {j (\frac{ɛ_{f}}{μ_{f}})}^{1 / 2} \sum_{i} [U_{xi} (z) x + U_{yi} (z) y + U_{zi} (z) z] \exp [- j (k_{xi} x + k_{y} y)] & eq . 4 \end{matrix}$

in the grating region, where

$\begin{matrix} k_{xi} = k_{0} [n_{I} \sin θ \cos φ - i (λ_{0} / Λ)], & eq . 5 \\ k_{y} = k_{0} n_{I} \sin θ \sin φ, & eq . 6 \\ k_{l, zi} = \begin{cases} {[{(k_{0} n_{l})}^{2} - k_{xi}^{2} - k_{y}^{2}]}^{1 / 2}, & {(k_{xi}^{2} + k_{y}^{2})}^{1 / 2} < k_{0} n_{l} \\ - j {[k_{xi}^{2} + k_{y}^{2} - {(k_{0} n_{l})}^{2}]}^{1 / 2}, & {(k_{xi}^{2} + k_{y}^{2})}^{1 / 2} > k_{0} n_{l}, \end{cases} \quad l = I, II, & eq . 7 \end{matrix}$

k₀=(2π/λ₀), λ₀ is the incident wavelength, and Λ is the grating pitch. Note that for a 1-D grating, k_y is constant. In eqs. 3 and 4, ∈_f is the permittivity of free space, and μ_f is the magnetic permeability of free space.
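As an illustration of eqs. 5-7, the wavevector components for all retained orders can be evaluated as in the following sketch. It is written in Python with NumPy and assumes a real (lossless) refractive index for the region considered; the function name and argument names are hypothetical and chosen only for this example.

```python
import numpy as np


def wavevector_components(wavelength, pitch, theta_deg, phi_deg, n_region, N):
    """Evaluate eqs. 5-7 for orders i = -N..N (real n_region assumed)."""
    k0 = 2.0 * np.pi / wavelength
    theta = np.radians(theta_deg)
    phi = np.radians(phi_deg)
    orders = np.arange(-N, N + 1)

    # eq. 5: x-component for each order
    kx = k0 * (n_region * np.sin(theta) * np.cos(phi) - orders * wavelength / pitch)
    # eq. 6: y-component, the same for every order of a 1-D grating
    ky = k0 * n_region * np.sin(theta) * np.sin(phi)

    # eq. 7: real kz for propagating orders, -j * (positive root) for evanescent ones
    arg = (k0 * n_region) ** 2 - kx ** 2 - ky ** 2
    root = np.sqrt(np.abs(arg))
    kz = np.where(arg > 0, root, -1j * root)
    return kx, ky, kz
```

For phi=90 the returned kx array is antisymmetric about the zero order and kz is symmetric, which is the property exploited later in this description.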

In equations 1 and 2, the R_i and T_i are the Fourier coefficients of the electric field in regions I and II, and correspond to the amplitudes of the reflected and transmitted diffraction orders. The diffracted orders can be propagating or evanescent. The corresponding magnetic fields can be obtained from Maxwell's relations

∇×E=−jωμ_fH,

∇×H=jω∈_f∈(x)E, eq. 8

where ω is the angular frequency, and μ is the magnetic permeability. Usually, one assumes μ=μ_f.

The complex permittivity in the grating region is also expanded as a Fourier series, which is

$\begin{matrix} ɛ (x) = \sum_{h} ɛ_{h} \exp (j \frac{2 π h x}{Λ}), \quad ɛ_{0} = n_{r d}^{2} f + n_{g r}^{2} (1 - f), \quad ɛ_{h} = (n_{r d}^{2} - n_{g r}^{2}) \frac{\sin (π hf)}{π h} & eq . 9 \end{matrix}$

for the binary grating structure of the first Moharam and Gaylord reference cited above and shown in FIG. 1. In eq. 9, n_rd and n_gr are the complex indices of refraction for the lines and spaces, respectively, and f is the fraction of the grating period occupied by the lines.

Eqs. 3, 4, 8, and 9 combine to give a set of coupled equations:

$\begin{matrix} [\begin{matrix} \partial S_{y} / \partial (z^{'}) \\ \partial S_{x} / \partial (z^{'}) \\ \partial U_{y} / \partial (z^{'}) \\ \partial U_{x} / \partial (z^{'}) \end{matrix}] = [\begin{matrix} 0 & 0 & K_{y} E^{- 1} K_{x} & I - K_{y} E^{- 1} K_{y} \\ 0 & 0 & K_{x} E^{- 1} K_{x} - I & - K_{x} E^{- 1} K_{y} \\ K_{x} K_{y} & E - K_{y}^{2} & 0 & 0 \\ K_{x}^{2} - E & - K_{x} K_{y} & 0 & 0 \end{matrix}] \times [\begin{matrix} S_{y} \\ S_{x} \\ U_{y} \\ U_{x} \end{matrix}] & eq . 10 \end{matrix}$

where K_x is a diagonal matrix with elements k_xi/k₀, K_y is a diagonal matrix with elements k_y/k₀, E is the permittivity matrix (not to be confused with the electric field), with E_i,j=∈_(i-j), and z′=k₀z.
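The permittivity matrix E can be assembled directly from the harmonics of eq. 9. The following Python/NumPy sketch does this for the binary grating; the function name is hypothetical, and the argument f is taken to be the fraction of the period occupied by the lines.

```python
import numpy as np


def permittivity_matrix(n_rd, n_gr, f, N):
    """Build E with E[i, j] = eps_(i-j) for orders i, j = -N..N (eq. 9)."""
    i = np.arange(-N, N + 1)
    h = i[:, None] - i[None, :]          # harmonic index i - j, from -2N to 2N
    h_safe = np.where(h == 0, 1, h)      # avoid division by zero on the diagonal

    eps_0 = n_rd ** 2 * f + n_gr ** 2 * (1.0 - f)
    eps_h = (n_rd ** 2 - n_gr ** 2) * np.sin(np.pi * h_safe * f) / (np.pi * h_safe)
    return np.where(h == 0, eps_0, eps_h).astype(complex)
```

Because the harmonics of eq. 9 are even in h, the resulting matrix is a symmetric Toeplitz matrix; harmonics up to order 2N are needed to fill a (2N+1)×(2N+1) matrix.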

When a truncation order of N is used, eq. 10 is a system of 4(2N+1)×4(2N+1) coupled equations. The authors of the first Moharam and Gaylord reference cited above further reduce eq. 10 to two 2(2N+1)×2(2N+1) sets of equations:

$\begin{matrix} [\begin{matrix} \partial^{2} S_{y} / \partial {(z^{'})}^{2} \\ \partial^{2} S_{x} / \partial {(z^{'})}^{2} \end{matrix}] = [\begin{matrix} K_{x}^{2} + DE & K_{y} [E^{- 1} K_{x} E - K_{x}] \\ K_{x} [E^{- 1} K_{y} E - K_{y}] & K_{y}^{2} + BE \end{matrix}] [\begin{matrix} S_{y} \\ S_{x} \end{matrix}], or & eq . 11 \\ [\begin{matrix} \partial^{2} U_{y} / \partial {(z^{'})}^{2} \\ \partial^{2} U_{x} / \partial {(z^{'})}^{2} \end{matrix}] = [\begin{matrix} K_{y}^{2} + EB & [K_{x} - {EK}_{x} E^{- 1}] K_{y} \\ [K_{y} - {EK}_{y} E^{- 1}] K_{x} & K_{x}^{2} + ED \end{matrix}] [\begin{matrix} U_{y} \\ U_{x} \end{matrix}], & eq . 12 \end{matrix}$

where B=K_xE⁻¹K_x−I and D=K_yE⁻¹K_y−I.

These last equations are reduced still further into two (2N+1)×(2N+1) sets of equations:

[∂²U_x/∂(z′)²]=[K_y²+A][U_x] eq. 13

and

[∂²S_x/∂(z′)²]=[K_y²+BE][S_x], eq. 14

where A=K_x²−E.

Later, Lalanne and Morris (cited above) were able to improve the convergence of the conical case by replacing the matrix E in the third row, second column of eq. 10 with Einv⁻¹, the inverse of the inverse permittivity matrix Einv, where Einv_i,j=(1/∈)_(i-j):

$\begin{matrix} [\begin{matrix} \partial S_{y} / \partial (z^{'}) \\ \partial S_{x} / \partial (z^{'}) \\ \partial U_{y} / \partial (z^{'}) \\ \partial U_{x} / \partial (z^{'}) \end{matrix}] = [\begin{matrix} 0 & 0 & K_{y} E^{- 1} K_{x} & I - K_{y} E^{- 1} K_{y} \\ 0 & 0 & K_{x} E^{- 1} K_{x} - I & - K_{x} E^{- 1} K_{y} \\ K_{x} K_{y} & {Einv}^{- 1} - K_{y}^{2} & 0 & 0 \\ K_{x}^{2} - E & - K_{x} K_{y} & 0 & 0 \end{matrix}]  [\begin{matrix} S_{y} \\ S_{x} \\ U_{y} \\ U_{x} \end{matrix}] & eq . 15 \end{matrix}$

which leads to

[∂²U_x/∂(z′)²]=[K_y²+A][U_x], eq. 16

and

[∂²S_x/∂(z′)²]=[K_y²+BEinv⁻¹][S_x] eq. 17

in place of eqs. 13 and 14.

The formulation of eqs. 16 and 17 is advantageous; note that E and Einv⁻¹ are not the same matrix once they are truncated. The details can be found in the references cited above from P. Lalanne and G. M. Morris; G. Granet and B. Guizal; and L. Li.

Equations 16 and 17 are solved by finding the eigenvalues and eigenvectors of the matrices [K_y²+A] and [K_y²+BEinv⁻¹], which leads to

$\begin{matrix} U_{xi} (z) = \sum_{m = 1}^{2 N + 1} w_{1, i, m} \{ - c_{1, m}^{+} \exp (- k_{0} q_{1, m} z) + c_{1, m}^{-} \exp [k_{0} q_{1, m} (z - d)] \}, & eq . 18 \\ S_{xi} (z) = \sum_{m = 1}^{2 N + 1} w_{2, i, m} \{ c_{2, m}^{+} \exp (- k_{0} q_{2, m} z) + c_{2, m}^{-} \exp [k_{0} q_{2, m} (z - d)] \}, & eq . 19 \\ S_{yi} (z) = \sum_{m = 1}^{2 N + 1} v_{11, i, m} \{ c_{1, m}^{+} \exp (- k_{0} q_{1, m} z) + c_{1, m}^{-} \exp [k_{0} q_{1, m} (z - d)] \} + \sum_{m = 1}^{2 N + 1} v_{12, i, m} \{ c_{2, m}^{+} \exp (- k_{0} q_{2, m} z) + c_{2, m}^{-} \exp [k_{0} q_{2, m} (z - d)] \}, & eq . 20 \\ U_{yi} (z) = \sum_{m = 1}^{2 N + 1} v_{21, i, m} \{ - c_{1, m}^{+} \exp (- k_{0} q_{1, m} z) + c_{1, m}^{-} \exp [k_{0} q_{1, m} (z - d)] \} + \sum_{m = 1}^{2 N + 1} v_{22, i, m} \{ - c_{2, m}^{+} \exp (- k_{0} q_{2, m} z) + c_{2, m}^{-} \exp [k_{0} q_{2, m} (z - d)] \}, & eq . 21 \\ \text{where} \\ V_{11} = A^{- 1} W_{1} Q_{1}, & eq . 22 \\ V_{12} = (k_{y} / k_{0}) A^{- 1} K_{x} W_{2}, & eq . 23 \\ V_{21} = (k_{y} / k_{0}) B^{- 1} K_{x} E^{- 1} W_{1}, & eq . 24 \\ V_{22} = B^{- 1} W_{2} Q_{2}, & eq . 25 \end{matrix}$

Q₁ and Q₂ are diagonal matrices with elements q_1,m and q_2,m, which are the square roots of the 2N+1 eigenvalues of the matrices [K_y²+A] and [K_y²+BEinv⁻¹], and W₁ and W₂ are the (2N+1)×(2N+1) matrices formed by the corresponding eigenvectors, with elements w_1,i,m and w_2,i,m. Eqs. 16-25 constitute the eigen-problem portion of the RCW method given in the first Moharam and Gaylord reference cited above. It is noted that there are other equivalent formulations of the same eigen-problem that will lead to the same final results.
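The eigen-problem step of eqs. 16 and 17 can be sketched numerically as follows. This Python/NumPy fragment is an illustration only: the function names are hypothetical, the harmonics of 1/∈ used to build Einv are an assumption obtained by applying the form of eq. 9 to the reciprocal permittivity of a binary grating, and the choice of the square root with non-negative real part is a customary convention that is assumed here rather than stated above.

```python
import numpy as np


def binary_toeplitz(c0, amplitude, f, N):
    """Matrix M[i, j] = c_(i-j), with c_0 = c0 and c_h = amplitude*sin(pi h f)/(pi h)."""
    i = np.arange(-N, N + 1)
    h = i[:, None] - i[None, :]
    h_safe = np.where(h == 0, 1, h)
    off = amplitude * np.sin(np.pi * h_safe * f) / (np.pi * h_safe)
    return np.where(h == 0, c0, off).astype(complex)


def conical_eigenproblems(kx_over_k0, ky_over_k0, n_rd, n_gr, f):
    """Solve the two eigen-problems of eqs. 16 and 17 for one binary layer.

    Returns [(M1, q1, W1), (M2, q2, W2)] where M1 = Ky^2 + A and
    M2 = Ky^2 + B Einv^-1, q are square roots of the eigenvalues and
    the columns of W are the eigenvectors.
    """
    kx_over_k0 = np.asarray(kx_over_k0, dtype=complex)
    size = kx_over_k0.size
    N = (size - 1) // 2
    identity = np.eye(size)

    Kx = np.diag(kx_over_k0)
    Ky2 = ky_over_k0 ** 2 * identity
    E = binary_toeplitz(n_rd ** 2 * f + n_gr ** 2 * (1 - f),
                        n_rd ** 2 - n_gr ** 2, f, N)
    # Assumed harmonics of 1/eps for the same binary profile
    Einv = binary_toeplitz(f / n_rd ** 2 + (1 - f) / n_gr ** 2,
                           1 / n_rd ** 2 - 1 / n_gr ** 2, f, N)

    A = Kx @ Kx - E
    B = Kx @ np.linalg.inv(E) @ Kx - identity

    results = []
    for M in (Ky2 + A, Ky2 + B @ np.linalg.inv(Einv)):
        eigenvalues, W = np.linalg.eig(M)
        q = np.sqrt(eigenvalues)
        q = np.where(q.real < 0, -q, q)   # assumed branch: Re(q) >= 0
        results.append((M, q, W))
    return results
```

The two calls to the eigen-solver are the (2N+1)×(2N+1) eigen-problems referred to in the cost discussion of this description.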

The constants c_1,m⁺, c_1,m⁻, c_2,m⁺, c_2,m⁻ are determined by matching the tangential electric and magnetic field components at the two boundary regions of the grating. The first Moharam and Gaylord reference cited above uses a boundary formulation where the field components are rotated into the corresponding diffraction plane, φ_i, for each diffracted order:

sin ψ δ_i0 + R_s,i = cos φ_i S_yi(0) − sin φ_i S_xi(0), eq. 26

j[sin ψ n_I cos θ δ_i0 − (k_I,zi/k₀) R_s,i] = −[cos φ_i U_xi(0) + sin φ_i U_yi(0)], eq. 27

cos ψ cos θ δ_i0 − j[k_I,zi/(k₀n_I²)] R_p,i = cos φ_i S_xi(0) + sin φ_i S_yi(0), eq. 28

−j n_I cos ψ δ_i0 + R_p,i = −[cos φ_i U_yi(0) − sin φ_i U_xi(0)], eq. 29

where

φ_i = tan⁻¹(k_y/k_xi), eq. 30

R_s,i = cos φ_i R_yi − sin φ_i R_xi, eq. 31

R_p,i = (j/k₀)[cos φ_i (k_I,zi R_xi + k_xi R_zi) + sin φ_i (k_y R_zi + k_I,zi R_yi)], eq. 32

at the z=0 boundary, and

cos φ_i S_yi(d) − sin φ_i S_xi(d) = T_s,i, eq. 33

−[cos φ_i U_xi(d) + sin φ_i U_yi(d)] = j(k_II,zi/k₀) T_s,i, eq. 34

−[cos φ_i U_yi(d) − sin φ_i U_xi(d)] = T_p,i, eq. 35

cos φ_i S_xi(d) + sin φ_i S_yi(d) = j(k_II,zi/k₀n_II²) T_p,i, eq. 36

T_s,i = cos φ_i T_yi − sin φ_i T_xi, eq. 37

T_p,i = (−j/k₀)[cos φ_i (k_II,zi T_xi − k_xi T_zi) − sin φ_i (−k_II,zi T_yi + k_y T_zi)] eq. 38

at the z=d boundary. Note that there is one equation for each spatial harmonic retained in the Fourier expansions. R_s,i and R_p,i are the components of the reflected electric and magnetic field amplitudes normal to the diffraction plane, and T_s,i and T_p,i are the corresponding transmitted amplitudes.

In matrix form, eqs. 26-29 are

$\begin{matrix} [\begin{matrix} \sin ψ δ_{i 0} \\ j \sin ψ n_{I} \cos θ δ_{i 0} \\ - j \cos ψ n_{I} δ_{i 0} \\ \cos ψ \cos θ δ_{i 0} \end{matrix}] + [\begin{matrix} I & 0 \\ - {jY}_{I} & 0 \\ 0 & I \\ 0 & - {jZ}_{I} \end{matrix}] [\begin{matrix} R_{s} \\ R_{p} \end{matrix}] = [\begin{matrix} V_{ss} & V_{sp} & V_{ss} X_{1} & V_{sp} X_{2} \\ W_{ss} & W_{sp} & - W_{ss} X_{1} & - W_{sp} X_{2} \\ W_{p s} & W_{pp} & - W_{p s} X_{1} & - W_{pp} X_{2} \\ V_{p s} & V_{pp} & V_{p s} X_{1} & V_{pp} X_{2} \end{matrix}] [\begin{matrix} c_{1}^{+} \\ c_{2}^{+} \\ c_{1}^{-} \\ c_{2}^{-} \end{matrix}] & eq . 39 \end{matrix}$

for the z=0 boundary and eqs. 33-36 are

$\begin{matrix} [\begin{matrix} V_{ss} X_{1} & V_{sp} X_{2} & V_{ss} & V_{sp} \\ W_{ss} X_{1} & W_{sp} X_{2} & - W_{ss} & - W_{sp} \\ W_{p s} X_{1} & W_{pp} X_{2} & - W_{p s} & - W_{pp} \\ V_{p s} X_{1} & V_{pp} X_{2} & V_{p s} & V_{pp} \end{matrix}] [\begin{matrix} c_{1}^{+} \\ c_{2}^{+} \\ c_{1}^{-} \\ c_{2}^{-} \end{matrix}] = [\begin{matrix} I & 0 \\ {jY}_{II} & 0 \\ 0 & I \\ 0 & {jZ}_{II} \end{matrix}] [\begin{matrix} T_{s} \\ T_{p} \end{matrix}] & eq . 40 \end{matrix}$

for the z=d boundary, where

V_ss=F_cV₁₁, W_pp=F_cV₂₂,

W_ss=F_cW₁+F_sV₂₁, V_pp=F_cW₂+F_sV₁₂,

V_sp=F_cV₁₂−F_sW₂, W_ps=F_cV₂₁−F_sW₁,

W_sp=F_sV₂₂, V_ps=F_sV₁₁. eq. 41

Y_I, Y_II, Z_I, and Z_II are diagonal matrices with elements (k_I,zi/k₀), (k_II,zi/k₀), (k_I,zi/k₀n_I²), and (k_II,zi/k₀n_II²), respectively; X₁ and X₂ are diagonal matrices with elements exp(−k₀q_1,md) and exp(−k₀q_2,md), respectively; and F_c and F_s are diagonal matrices with elements cos φ_i and sin φ_i, respectively.

Eqs. 39 and 40 are typically solved by eliminating R_s and R_p from eq. 39 and T_s and T_p from eq. 40, and solving the resulting 4(2N+1) equations for the 4(2N+1) coefficients c_1,m⁺, c_1,m⁻, c_2,m⁺, c_2,m⁻, which can be substituted back into eqs. 39 and 40 to solve for the reflected and transmitted amplitudes.

Alternately, a procedure similar to the partial solution approach given in the second Moharam and Gaylord reference cited above can be used to determine reflected amplitudes only, giving a 2(2N+1)×2(2N+1) system of equations for the c_1,m⁺ and c_2,m⁺ coefficients:

$\begin{matrix} j {(Y_{I})}_{0, 0} \sin ψ δ_{i 0} + j \sin ψ n_{I} \cos θ δ_{i 0} = [{jY}_{I} f_{T} + f_{B}] [\begin{matrix} c_{1}^{+} \\ c_{2}^{+} \end{matrix}] & eq . 42 \\ {(Z_{I})}_{0, 0} \cos ψ n_{I} δ_{i 0} + \cos ψ \cos θ δ_{i 0} = [{jZ}_{I} g_{T} + g_{B}] [\begin{matrix} c_{1}^{+} \\ c_{2}^{+} \end{matrix}] & eq . 43 \end{matrix}$

which are related to the reflected amplitudes:

$\begin{matrix} R_{s} = f_{T} [\begin{matrix} c_{1}^{+} \\ c_{2}^{+} \end{matrix}] - \sin {ψδ}_{i 0}, & eq . 44 \\ R_{p} = g_{T} [\begin{matrix} c_{1}^{+} \\ c_{2}^{+} \end{matrix}] + j \cos ψ n_{I} δ_{i 0}, where & eq . 45 \\ [\begin{matrix} f_{T} \\ f_{B} \\ g_{T} \\ g_{B} \end{matrix}] = [\begin{matrix} V_{ss} & V_{sp} \\ W_{ss} & W_{sp} \\ W_{p s} & W_{pp} \\ V_{p s} & V_{pp} \end{matrix}] + [\begin{matrix} V_{ss} X_{1} & V_{sp} X_{2} \\ - W_{ss} X_{1} & - W_{sp} X_{2} \\ - W_{p s} X_{1} & - W_{pp} X_{2} \\ V_{p s} X_{1} & V_{pp} X_{2} \end{matrix}] \cdot a, & eq . 46 \end{matrix}$

and the matrix a is defined as the top half of

$\begin{matrix} {[\begin{matrix} - V_{ss} & - V_{sp} & I & 0 \\ W_{ss} & W_{sp} & {jY}_{II} & 0 \\ W_{p s} & W_{pp} & 0 & I \\ - V_{p s} & - V_{pp} & 0 & {jZ}_{II} \end{matrix}]}^{- 1} [\begin{matrix} V_{ss} X_{1} & V_{sp} X_{2} \\ W_{ss} X_{1} & W_{sp} X_{2} \\ W_{p s} X_{1} & W_{pp} X_{2} \\ V_{p s} X_{1} & V_{pp} X_{2} \end{matrix}] \equiv [\begin{matrix} a \\ b \end{matrix}] . & eq . 47 \end{matrix}$

Note that (Y_I)_0,0 and (Z_I)_0,0 refer to the center (i=0) elements of the matrices Y_I and Z_I, namely (k_I,z0/k₀) and (k_I,z0/k₀n_I²), respectively.

The boundary matching can be generalized to multiple layers using (for example) the enhanced transmittance matrix approach outlined in the second Moharam and Gaylord reference cited above. Given an L layer stack, where L+1 refers to the substrate, start by setting

$\begin{matrix} [\begin{matrix} f_{L + 1, T} \\ f_{L + 1, B} \\ g_{L + 1, T} \\ g_{L + 1, B} \end{matrix}] = [\begin{matrix} I & 0 \\ {jY}_{II} & 0 \\ 0 & I \\ 0 & {jZ}_{II} \end{matrix}] . & eq . 48 \end{matrix}$

The matrices a_L and b_L are constructed for layer L,

$\begin{matrix} {[\begin{matrix} - V_{ss, L} & - V_{sp, L} & f_{L + 1, T} \\ W_{ss, L} & W_{sp, L} & f_{L + 1, B} \\ W_{p s, L} & W_{pp, L} & g_{L + 1, T} \\ - V_{p s, L} & - V_{pp, L} & g_{L + 1, B} \end{matrix}]}^{- 1} [\begin{matrix} V_{ss, L} X_{1, L} & V_{sp, L} X_{2, L} \\ W_{ss, L} X_{1, L} & W_{sp, L} X_{2, L} \\ W_{p s, L} X_{1, L} & W_{pp, L} X_{2, L} \\ V_{p s, L} X_{1, L} & V_{pp, L} X_{2, L} \end{matrix}] \equiv [\begin{matrix} a_{L} \\ b_{L} \end{matrix}], & eq . 49 \end{matrix}$

where W_L and V_L come from the solution to the eigen-problem for layer L, X_1,L=exp(−k₀q_1,m,Ld_L), and X_2,L=exp(−k₀q_2,m,Ld_L), where d_L is the thickness of layer L. f_L and g_L are then obtained from

$\begin{matrix} [\begin{matrix} f_{L, T} \\ f_{L, B} \\ g_{L, T} \\ g_{L, B} \end{matrix}] = [\begin{matrix} V_{ss, L} & V_{sp, L} \\ W_{ss, L} & W_{sp, L} \\ W_{p s, L} & W_{pp, L} \\ V_{p s, L} & V_{pp, L} \end{matrix}] + [\begin{matrix} V_{ss, L} X_{1, L} & V_{sp, L} X_{2, L} \\ - W_{ss, L} X_{1, L} & - W_{sp, L} X_{2, L} \\ - W_{p s, L} X_{1, L} & - W_{pp, L} X_{2, L} \\ V_{p s, L} X_{1, L} & V_{pp, L} X_{2, L} \end{matrix}] \cdot a_{L} . & eq . 50 \end{matrix}$

f_L and g_L are fed back into eq. 49 along with the solution to the eigen-problem for layer L−1 to find a_L−1 and b_L−1, and so on until at the top layer f_1T, f_1B, g_1T, and g_1B are obtained. These are substituted into eqs. 42-45 in place of f_T, f_B, g_T, and g_B to solve for the coefficients c_1,m⁺ and c_2,m⁺ for the top layer, and finally for the reflection coefficients for the diffracted orders via eqs. 44 and 45.

Eqs. 42 and 43 reduce the boundary problem to a 2(2N+1)×2(2N+1) set of equations.

For large truncation order, the boundary problem can still be dominated by the 4(2N+1)×4(2N+1) matrix inversion in eq. 49, but efficient inversion techniques can be employed since only the top half of the matrix is used.

Therefore, for a given incident polar angle theta, the computational expense incurred by using nonzero azimuthal incidence phi is two (2N+1)×(2N+1) eigen-problems versus one (2N+1)×(2N+1) eigen-problem in the corresponding classical (same theta, phi=0) case, a 2(2N+1)×2(2N+1) linear system of equations for the boundary problem (to solve for reflected amplitudes only) versus a (2N+1)×(2N+1) system of equations in the corresponding classical incidence case, and a 4(2N+1)×4(2N+1) matrix inversion in the boundary problem versus a 2(2N+1)×2(2N+1) matrix inversion in the corresponding classical mount. Since these operations are governed by order n³ operations, the conical mount requires approximately twice the computing time of the corresponding classical mount case for the eigen-problem, and approximately 8 times the computing time for the boundary problem.

Details of the Reduction in RCW Computation Time for the phi=90 Conical Mount

For the purposes of this description, the symmetry properties of the Fourier series are assumed a priori, and not proved. The initial assumptions can be derived through symmetry arguments, or by experimentation with the conventional formulation given above. In particular, for the phi=90 mount and s polarized incident light (psi=90),

E_x,i=E_x,−i eq. 51

E_y,i=−E_y,−i eq. 52

H_x,i=−H_x,−i eq. 53

H_y,i=H_y,−i eq. 54

while for p polarized incident light (psi=0),

E_x,i=−E_x,−i eq. 55

E_y,i=E_y,−i eq. 56

H_x,i=H_x,−i eq. 57

H_y,i=−H_y,−i eq. 58

In equations 51-58, the subscript i refers to the expansion term, which in the incident region corresponds to the diffraction order.

These relationships are valid in all regions of the grating problem, and all of the Fourier expansions can be reduced accordingly. In addition to these relationships, there is a 180 degree phase difference between opposite odd orders, but this can be ignored when not considering interference between multiple gratings.

Also, for phi=90, eq. 5 becomes

k_xi=−ik₀(λ₀/Λ) eq. 59

This gives

k_xi=−k_x−i eq. 60

k_I,zi=k_I,z−i eq. 61

The relations 51-61 show that for the phi=90 incidence case:

- i) The generalized Fourier expansions in eqs. 1-4 become regular Fourier expansions, and
- ii) The Fourier expansions for the fields have either even or odd symmetry, depending on the particular field component.

This means that a complex Fourier series representation is not necessary, and the field components can be expressed as cosine series for even symmetry cases or sine series for odd symmetry cases, although the Fourier coefficients themselves will still in general be complex. In either case, the entire content of the 2N+1 terms of a truncated complex Fourier series is contained in the N+1 terms of a cosine or sine series, depending on the symmetry. The usefulness of this for the grating problem arises from the fact that the fields have this symmetry in every region. When re-expressed as cosine and sine series, all of the information about the grating problem is contained in roughly half the number of terms required for the traditional formulation. This leads to a reduction in computation time by a factor of approximately 8 compared with the usual phi=90 formulation.
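The folding of a symmetric complex Fourier series into a cosine or sine series can be checked numerically. The following Python/NumPy sketch assumes the phi=90 form k_xi=−iK with K=2π/Λ, so that each term of the expansion is c_i exp(−j k_xi x); the function names are illustrative only.

```python
import numpy as np


def full_series(c, K, x):
    """Sum over orders i = -N..N of c_i * exp(-j * k_xi * x), with k_xi = -i*K."""
    N = (len(c) - 1) // 2
    orders = np.arange(-N, N + 1)
    kx = -orders * K
    return np.sum(c[:, None] * np.exp(-1j * kx[:, None] * x[None, :]), axis=0)


def cosine_series(c_half, K, x):
    """Even case (c_i = c_-i): c_0 + 2 * sum_{i>0} c_i cos(i K x)."""
    orders = np.arange(1, len(c_half))
    terms = c_half[1:, None] * np.cos(orders[:, None] * K * x[None, :])
    return c_half[0] + 2.0 * np.sum(terms, axis=0)


def sine_series(c_half, K, x):
    """Odd case (c_i = -c_-i, c_0 = 0): 2j * sum_{i>0} c_i sin(i K x)."""
    orders = np.arange(1, len(c_half))
    terms = c_half[1:, None] * np.sin(orders[:, None] * K * x[None, :])
    return 2j * np.sum(terms, axis=0)
```

In both cases only the N+1 coefficients with i ≥ 0 are needed to reproduce the full 2N+1 term sum, and the coefficients remain complex.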

Each incident polarization case will be treated separately. For s polarized light, incident at polar angle theta and the phi=90 conical plane, eqs. 51-54 give

S_x,i=S_x,−i eq. 62

S_y,i=−S_y,−i eq. 63

U_x,i=−U_x,−i eq. 64

U_y,i=U_y,−i eq. 65

in the grating region.

The reduced eqs. 26-29 and eqs. 33-36 may be derived by substituting the symmetry relations for R_i and T_i into eqs. 1-4 (this requires determining further symmetry relations for R_z,i and T_z,i), reducing the fields everywhere to the appropriate Fourier cosine or sine series, applying the boundary conditions to the tangential components of the fields at z=0 and z=d, and rotating the boundary equations into the diffraction plane. However, the notation is unnecessarily cumbersome, and it is easier to apply eqs. 62-65 directly to eqs. 26-29 and eqs. 33-36 and use

R_s,i=R_s,−i eq. 66

R_p,i=−R_p,−i eq. 67

T_s,i=T_s,−i eq. 68

T_p,i=−T_p,−i eq. 69

to derive the same thing. Eqs. 66-69 can again be verified using the conventional formulation with the phi=90 mount.

In addition, eq. 30 for phi=90 gives

cos φ_i=−cos φ_−i eq. 70

and

sin φ_i=sin φ_−i. eq. 71

Applying the symmetry relations, the i=0 terms in eqs. 26-29 and eqs. 33-36 remain the same, but the nonzero i terms can be combined by adding the i and −i terms of eqs. 26, 27, 33, and 34, and subtracting the −ith from the ith terms in eqs. 28, 29, 35, and 36.

For example, eq. 26 gives

sin ψ+R_s,0=−S_x,0 (0) eq. 72

for i=0, and

R_s,i+R_s,−i=cos φ_iS_y,i(0)+cos φ_−iS_y,−i(0)−sin φ_iS_x,i(0)−sin φ_−iS_x,−i(0),

2R_s,i=2 cos φ_iS_y,i(0)−2 sin φ_iS_x,i(0),

R_s,i=cos φ_iS_y,i(0)−sin φ_iS_x,i(0), eq. 73

which is the same as eq. 26, except that i>0.

Similarly, eq. 28 gives

$\begin{matrix} \cos ψ \cos θ - j [k_{I, z 0} / (k_{0} n_{I}^{2})] R_{p, 0} = S_{y, 0} (0) & eq . 74 \\ \text{for } i = 0, \text{ and} \\ \begin{aligned} - j [k_{I, zi} / (k_{0} n_{I}^{2})] R_{p, i} + j [k_{I, z (- i)} / (k_{0} n_{I}^{2})] R_{p, - i} &= \cos φ_{i} S_{x, i} (0) - \cos φ_{- i} S_{x, - i} (0) + \sin φ_{i} S_{y, i} (0) - \sin φ_{- i} S_{y, - i} (0), \\ - 2 j [k_{I, zi} / (k_{0} n_{I}^{2})] R_{p, i} &= 2 \cos φ_{i} S_{x, i} (0) + 2 \sin φ_{i} S_{y, i} (0), \end{aligned} \\ - j [k_{I, zi} / (k_{0} n_{I}^{2})] R_{p, i} = \cos φ_{i} S_{x, i} (0) + \sin φ_{i} S_{y, i} (0), & eq . 75 \end{matrix}$

which is eq. 28, but with i>0.

The other boundary equations can be similarly reduced, and the form of the boundary problem is the same as the conventional one, except that only the i=0 and i>0 terms occur in the matrix equations. This reduces the matrix boundary problem (eqs. 39 and 40) to 4(N+1)×4(N+1) systems of equations, but leaves the form the same as in eqs. 26-41, as long as it is possible to also reduce the solution in the grating region to determining 4(N+1) coefficients c_1,m⁺, c_1,m⁻, c_2,m⁺, c_2,m⁻ instead of 4(2N+1) coefficients. This is shown to be the case below. Therefore, except for modifying the matrices to consist of N+1 harmonic terms (and therefore N+1 diffraction orders), the boundary problem is the same as previously defined. Now, R_s,i, R_p,i, T_s,i, and T_p,i are the amplitudes of both the +i and −i diffracted orders.

To reduce the eigen-system, apply eqs. 60 and 62-65 directly to equations 15-17, reducing the total number of unknowns from 4(2N+1) to 4(N+1). The eigen-problems specified by eqs. 16 and 17 are each reduced from size (2N+1)×(2N+1) to size (N+1)×(N+1), for a total reduction of a factor of approximately 8 over the previous conical descriptions, and a factor of 4 over the corresponding classical mount eigen-problem.
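The quoted factors follow from simple n³ scaling and can be tabulated with a short sketch. The Python fragment below only counts cubic work for the eigen-problems under the assumption that cost is proportional to the cube of the matrix dimension; it is not a timing measurement, and the function name is illustrative.

```python
def eigenproblem_costs(N):
    """Relative n**3 cost of the eigen-problem step for truncation order N."""
    full = 2 * N + 1      # unreduced matrix dimension
    reduced = N + 1       # dimension after the phi=90 symmetry reduction
    return {
        "classical_phi0": full ** 3,          # one (2N+1) x (2N+1) problem
        "conical_unreduced": 2 * full ** 3,   # two (2N+1) x (2N+1) problems
        "conical_phi90_reduced": 2 * reduced ** 3,  # two (N+1) x (N+1) problems
    }


costs = eigenproblem_costs(50)
ratio_vs_conical = costs["conical_unreduced"] / costs["conical_phi90_reduced"]
ratio_vs_classical = costs["classical_phi0"] / costs["conical_phi90_reduced"]
```

As N grows, the first ratio approaches 8 and the second approaches 4; for finite N both are somewhat smaller because N+1 is slightly more than half of 2N+1.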

Aside from reducing eqs. 16 and 17, reduced matrices for eqs. 22-25 will also need to be found. This will reduce the solution in the grating region to the determination of 4(N+1) coefficients instead of 4(2N+1).

Note that the symmetry of the lamellar grating also implies that the elements of the permittivity matrix satisfy

E_i,j=E_−i,−j. eq. 76

The i th row of equation 16 can be written as

$\begin{matrix} \frac{\partial^{2} U_{xi}}{\partial {(z^{'})}^{2}} = \frac{k_{y}^{2}}{k_{0}^{2}} U_{xi} + \frac{k_{xi}^{2}}{k_{0}^{2}} U_{xi} - \sum_{m = - \infty}^{\infty} E_{i, m} U_{xm} . & eq . 77 \end{matrix}$

Due to eq. 64, eq. 16 obeys the following symmetry condition:

$\begin{matrix} \frac{\partial^{2} U_{xi}}{\partial {(z^{'})}^{2}} = - \frac{\partial^{2} U_{x - i}}{\partial {(z^{'})}^{2}} . & eq . 78 \end{matrix}$

Subtracting the −i th row from the i th row gives

$\begin{matrix} \begin{aligned} \frac{\partial^{2} U_{x 0}}{\partial {(z^{'})}^{2}} &= \frac{k_{y}^{2}}{k_{0}^{2}} U_{x 0} - \sum_{m = - \infty}^{\infty} E_{0, m} U_{xm} \\ &= \frac{k_{y}^{2}}{k_{0}^{2}} U_{x 0} - E_{0, 0} U_{x 0} - \sum_{m = - \infty}^{- 1} E_{0, m} U_{xm} - \sum_{m = 1}^{\infty} E_{0, m} U_{xm} \\ &= \frac{k_{y}^{2}}{k_{0}^{2}} U_{x 0} - E_{0, 0} U_{x 0} + \sum_{m = 1}^{\infty} E_{0, - m} U_{xm} - \sum_{m = 1}^{\infty} E_{0, m} U_{xm}, \end{aligned} \\ \text{so} \\ \frac{\partial^{2} U_{x 0}}{\partial {(z^{'})}^{2}} = \frac{k_{y}^{2}}{k_{0}^{2}} U_{x 0} - \{ E_{0, 0} U_{x 0} + \sum_{m = 1}^{\infty} (E_{0, m} - E_{0, - m}) U_{xm} \} & eq . 79 \\ \text{for } i = 0, \text{ and} \\ \begin{aligned} 2 \frac{\partial^{2} U_{xi}}{\partial {(z^{'})}^{2}} &= 2 \frac{k_{y}^{2}}{k_{0}^{2}} U_{xi} + 2 \frac{k_{xi}^{2}}{k_{0}^{2}} U_{xi} - \sum_{m = - \infty}^{\infty} E_{i, m} U_{xm} + \sum_{m = - \infty}^{\infty} E_{- i, m} U_{xm} \\ &= 2 \frac{k_{y}^{2}}{k_{0}^{2}} U_{xi} + 2 \frac{k_{xi}^{2}}{k_{0}^{2}} U_{xi} - E_{i, 0} U_{x 0} - \sum_{m = - \infty}^{- 1} E_{i, m} U_{xm} - \sum_{m = 1}^{\infty} E_{i, m} U_{xm} + E_{- i, 0} U_{x 0} + \sum_{m = - \infty}^{- 1} E_{- i, m} U_{xm} + \sum_{m = 1}^{\infty} E_{- i, m} U_{xm} \\ &= 2 \frac{k_{y}^{2}}{k_{0}^{2}} U_{xi} + 2 \frac{k_{xi}^{2}}{k_{0}^{2}} U_{xi} - E_{i, 0} U_{x 0} + \sum_{m = 1}^{\infty} E_{i, - m} U_{xm} - \sum_{m = 1}^{\infty} E_{i, m} U_{xm} + E_{- i, 0} U_{x 0} - \sum_{m = 1}^{\infty} E_{- i, - m} U_{xm} + \sum_{m = 1}^{\infty} E_{- i, m} U_{xm} \\ &= 2 \frac{k_{y}^{2}}{k_{0}^{2}} U_{xi} + 2 \frac{k_{xi}^{2}}{k_{0}^{2}} U_{xi} - (E_{i, 0} - E_{- i, 0}) U_{x 0} - \sum_{m = 1}^{\infty} (E_{i, m} + E_{- i, - m} - E_{i, - m} - E_{- i, m}) U_{xm}, \end{aligned} \\ \text{so} \\ \frac{\partial^{2} U_{xi}}{\partial {(z^{'})}^{2}} = \frac{k_{y}^{2}}{k_{0}^{2}} U_{xi} + \frac{k_{xi}^{2}}{k_{0}^{2}} U_{xi} - \{ \frac{1}{2} (E_{i, 0} - E_{- i, 0}) U_{x 0} + \frac{1}{2} \sum_{m = 1}^{\infty} (E_{i, m} + E_{- i, - m} - E_{- i, m} - E_{i, - m}) U_{xm} \} & eq . 80 \end{matrix}$

for i>0. Note that i now runs from 0 to ∞ instead of −∞ to ∞.

The first two terms in eqs. 79 and 80 indicate that the matrices K_y² and K_x² in eq. 16 should simply be replaced by diagonal matrices consisting of the 0 and positive terms of the original matrices. In fact, this will turn out to be the case for K_x and K_y throughout, and the subscripts and superscripts on these matrices distinguishing reduced from unreduced will hereafter be omitted.

The terms

$\begin{matrix} E_{0, 0} U_{x 0} + \sum_{m = 1}^{\infty} (E_{0, m} - E_{0, - m}) U_{xm} & eq . 81 \end{matrix}$

from eq. 79 and

$\begin{matrix} \frac{1}{2} (E_{i, 0} - E_{- i, 0}) U_{x 0} + \frac{1}{2} \sum_{m = 1}^{\infty} (E_{i, m} + E_{- i, - m} - E_{- i, m} - E_{i, - m}) U_{xm} & eq . 82 \end{matrix}$

from eq. 80 are the rows of the reduced matrix that replaces the matrix E in eq. 16:

$\begin{matrix} E_{reduced}^{s} = [\begin{matrix} E_{0, 0} & E_{0, 1} - E_{0, - 1} & E_{0, 2} - E_{0, - 2} & \dots \\ \frac{1}{2} (E_{1, 0} - E_{- 1, 0}) & \frac{1}{2} (E_{1, 1} + E_{- 1, - 1} - E_{- 1, 1} - E_{1, - 1}) & \frac{1}{2} (E_{1, 2} + E_{- 1, - 2} - E_{- 1, 2} - E_{1, - 2}) & \dots \\ \frac{1}{2} (E_{2, 0} - E_{- 2, 0}) & \frac{1}{2} (E_{2, 1} + E_{- 2, - 1} - E_{- 2, 1} - E_{2, - 1}) & \frac{1}{2} (E_{2, 2} + E_{- 2, - 2} - E_{- 2, 2} - E_{2, - 2}) & \dots \\ \vdots & \vdots & \vdots & \ddots \end{matrix}] & eq . 83 \\ \text{with} \\ A_{reduced}^{s} = K_{x}^{2} - E_{reduced}^{s} & eq . 84 \\ \text{and} \\ [\frac{\partial^{2} U_{x}}{\partial {(z^{'})}^{2}}] = [K_{y}^{2} + A_{reduced}^{s}] [U_{x}], & eq . 85 \end{matrix}$

where the superscript s refers to the incident polarization case. All of the vectors in eq. 85 are of size N+1, and the matrices are of size (N+1)×(N+1) for a given truncation order, N.
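The construction of eqs. 83-85 can be written out element by element, as in the following Python/NumPy sketch. The function names are hypothetical; the full matrix E is assumed to be stored with order i at array index i+N, and kx_over_k0 is assumed to hold the 2N+1 unreduced values k_xi/k₀.

```python
import numpy as np


def reduced_E_s(E):
    """Assemble the (N+1) x (N+1) matrix of eq. 83 from the full matrix E."""
    N = (E.shape[0] - 1) // 2

    def e(i, j):
        # Element E_(i,j) addressed by signed harmonic orders
        return E[i + N, j + N]

    R = np.zeros((N + 1, N + 1), dtype=complex)
    R[0, 0] = e(0, 0)
    for m in range(1, N + 1):
        R[0, m] = e(0, m) - e(0, -m)                      # row from eq. 81
    for i in range(1, N + 1):
        R[i, 0] = 0.5 * (e(i, 0) - e(-i, 0))              # rows from eq. 82
        for m in range(1, N + 1):
            R[i, m] = 0.5 * (e(i, m) + e(-i, -m) - e(-i, m) - e(i, -m))
    return R


def reduced_operator_s(E, kx_over_k0, ky_over_k0):
    """Matrix K_y^2 + A_reduced^s of eq. 85, using the 0 and positive orders only."""
    N = (E.shape[0] - 1) // 2
    kx_half = np.asarray(kx_over_k0)[N:]                  # orders 0..N
    return np.diag(ky_over_k0 ** 2 + kx_half ** 2) - reduced_E_s(E)
```

One way to check such a routine is to apply the unreduced matrix to a vector with odd symmetry and compare the non-negative-order rows with the reduced product, which agree when E obeys eq. 76.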

Many of the terms in eq. 83 can be reduced using eq. 76, but it is more useful to assume nothing about the elements of the matrices being reduced. This way, other matrices that may not necessarily obey eq. 76 can be reduced using the same formulas. Along these lines, more general reductions can be formulated, which can be applied to a variety of matrices or even the products of matrices that will be required to find the reduced matrices of eqs. 22-25.

Disregarding the simpler diagonal matrices K_y and K_x, the unreduced equations have the general form

$\begin{matrix} l [P_{i}] = \sum_{m = - \infty}^{\infty} ɛ_{i, m} Q_{m} & eq . 86 \end{matrix}$

where l is a linear operator, such as

$\frac{\partial}{\partial (z^{'})} \text{ or } \frac{\partial^{2}}{\partial {(z^{'})}^{2}},$

and the elements of the vectors P and Q are spatial harmonic coefficients of the Fourier expansions for the corresponding fields. The goal is to find a reduced matrix for ∈ through application of symmetry relations to the vectors P and Q. Without making any assumptions about the elements of the matrix ∈, there are in general four types of reductions:

- 1) Both P and Q are even and the corresponding Fourier series can be reduced to cosine series,
- 2) Both P and Q are odd and the corresponding Fourier expressions can be reduced to sine series,
- 3) P is even and Q is odd,
- 4) P is odd and Q is even.

Note that if P has even or odd symmetry, then

$\frac{\partial P}{\partial (z^{'})} and \frac{\partial^{2} P}{\partial {(z^{'})}^{2}}$

are also even or odd, respectively.

The reduction leading to eqs. 81 and 82 belongs to category 2. The same argument can be applied to eq. 86 to give

$\begin{matrix} l [P_{0}] = ɛ_{0, 0} Q_{0} + \sum_{m = 1}^{\infty} (ɛ_{0, m} - ɛ_{0, - m}) Q_{m}, i = 0, and & eq . 87 \\ l [P_{i}] = \frac{1}{2} (ɛ_{i, 0} - ɛ_{- i, 0}) Q_{0} + \frac{1}{2} \sum_{m = 1}^{\infty} (ɛ_{i, m} + ɛ_{- i, - m} - ɛ_{- i, m} - ɛ_{i, - m}) Q_{m}, i > 0, & eq . 88 \end{matrix}$

for any matrix ∈ and field harmonics P and Q having odd symmetry.

The other 3 cases are developed below.

For case 1, both P and Q have even symmetry. Therefore the i and −i rows can be added together:

$\begin{matrix} \begin{aligned} l [P_{0}] &= \sum_{m = - \infty}^{\infty} ɛ_{0, m} Q_{m} \\ &= ɛ_{0, 0} Q_{0} + \sum_{m = - \infty}^{- 1} ɛ_{0, m} Q_{m} + \sum_{m = 1}^{\infty} ɛ_{0, m} Q_{m} \\ &= ɛ_{0, 0} Q_{0} + \sum_{m = 1}^{\infty} ɛ_{0, - m} Q_{m} + \sum_{m = 1}^{\infty} ɛ_{0, m} Q_{m}, \end{aligned} \\ l [P_{0}] = ɛ_{0, 0} Q_{0} + \sum_{m = 1}^{\infty} (ɛ_{0, m} + ɛ_{0, - m}) Q_{m}, \quad i = 0, & eq . 89 \\ \text{and} \\ \begin{aligned} l [P_{i}] + l [P_{- i}] &= \sum_{m = - \infty}^{\infty} ɛ_{i, m} Q_{m} + \sum_{m = - \infty}^{\infty} ɛ_{- i, m} Q_{m} \\ &= ɛ_{i, 0} Q_{0} + ɛ_{- i, 0} Q_{0} + \sum_{m = - \infty}^{- 1} ɛ_{i, m} Q_{m} + \sum_{m = 1}^{\infty} ɛ_{i, m} Q_{m} + \sum_{m = - \infty}^{- 1} ɛ_{- i, m} Q_{m} + \sum_{m = 1}^{\infty} ɛ_{- i, m} Q_{m} \\ &= (ɛ_{i, 0} + ɛ_{- i, 0}) Q_{0} + \sum_{m = 1}^{\infty} ɛ_{i, - m} Q_{m} + \sum_{m = 1}^{\infty} ɛ_{i, m} Q_{m} + \sum_{m = 1}^{\infty} ɛ_{- i, - m} Q_{m} + \sum_{m = 1}^{\infty} ɛ_{- i, m} Q_{m} \\ &= (ɛ_{i, 0} + ɛ_{- i, 0}) Q_{0} + \sum_{m = 1}^{\infty} (ɛ_{i, m} + ɛ_{i, - m} + ɛ_{- i, m} + ɛ_{- i, - m}) Q_{m} \\ &= 2 l [P_{i}], \end{aligned} \\ \text{giving} \\ l [P_{i}] = \frac{1}{2} (ɛ_{i, 0} + ɛ_{- i, 0}) Q_{0} + \frac{1}{2} \sum_{m = 1}^{\infty} (ɛ_{i, m} + ɛ_{i, - m} + ɛ_{- i, m} + ɛ_{- i, - m}) Q_{m}, \quad i > 0. & eq . 90 \end{matrix}$

Eqs. 89 and 90 define a reduced matrix

$\begin{matrix} ɛ_{reduced} = [\begin{matrix} ɛ_{0, 0} & ɛ_{0, 1} + ɛ_{0, - 1} & ɛ_{0, 2} + ɛ_{0, - 2} & \dots \\ \frac{1}{2} (ɛ_{1, 0} + ɛ_{- 1, 0}) & \frac{1}{2} (ɛ_{1, 1} + ɛ_{- 1, - 1} + ɛ_{1, - 1} + ɛ_{- 1, 1}) & \frac{1}{2} (ɛ_{1, 2} + ɛ_{- 1, - 2} + ɛ_{1, - 2} + ɛ_{- 1, 2}) & \dots \\ \frac{1}{2} (ɛ_{2, 0} + ɛ_{- 2, 0}) & \frac{1}{2} (ɛ_{2, 1} + ɛ_{- 2, - 1} + ɛ_{2, - 1} + ɛ_{- 2, 1}) & \frac{1}{2} (ɛ_{2, 2} + ɛ_{- 2, - 2} + ɛ_{2, - 2} + ɛ_{- 2, 2}) & \dots \\ \vdots & \vdots & \vdots & \ddots \end{matrix}] & eq . 91 \end{matrix}$

For case 3, the i and −i rows are again added:

$\begin{matrix} \begin{matrix} l [P_{0}] = \sum_{m = - \infty}^{\infty} ɛ_{0, m} Q_{m} \\ = ɛ_{0, 0} Q_{0} + \sum_{m = - \infty}^{- 1} ɛ_{0, m} Q_{m} + \sum_{m = 1}^{\infty} ɛ_{0, m} Q_{m} \\ = ɛ_{0, 0} Q_{0} - \sum_{m = 1}^{\infty} ɛ_{0, - m} Q_{m} + \sum_{m = 1}^{\infty} ɛ_{0, m} Q_{m}, \end{matrix} & eq . 92 \\ l [P_{0}] = ɛ_{0, 0} Q_{0} + \sum_{m = 1}^{\infty} (ɛ_{0, m} - ɛ_{0, - m}) Q_{m}, \\ i = 0, \\ and \\ \begin{matrix} l [P_{i}] + l [P_{- i}] = \sum_{m = - \infty}^{\infty} ɛ_{i, m} Q_{m} + \sum_{m = - \infty}^{\infty} ɛ_{- i, m} Q_{m} \\ = ɛ_{i, 0} Q_{0} + ɛ_{- i, 0} Q_{0} + \sum_{m = - \infty}^{- 1} ɛ_{i, m} Q_{m} + \\ \sum_{m = 1}^{\infty} ɛ_{i, m} Q_{m} + \sum_{m = - \infty}^{- 1} ɛ_{- i, m} Q_{m} + \sum_{m = 1}^{\infty} ɛ_{- i, m} Q_{m} \\ = (ɛ_{i, 0} + ɛ_{- i, 0}) Q_{0} - \sum_{m = 1}^{\infty} ɛ_{i, - m} Q_{m} + \sum_{m = 1}^{\infty} ɛ_{i, m} Q_{m} - \\ \sum_{m = 1}^{\infty} ɛ_{- i, - m} Q_{m} + \sum_{m = 1}^{\infty} ɛ_{- i, m} Q_{m} \\ = (ɛ_{i, 0} + ɛ_{- i, 0}) Q_{0} + \sum_{m = 1}^{\infty} (ɛ_{i, m} + ɛ_{- i, m} - ɛ_{i, - m} - ɛ_{- i, - m}) Q_{m} \\ = 2 l [P_{i}], \end{matrix} \\ giving \\ l [P_{i}] = \frac{1}{2} (ɛ_{i, 0} + ɛ_{- i, 0}) Q_{0} + \frac{1}{2} \sum_{m = 1}^{\infty} (ɛ_{i, m} + ɛ_{- i, m} - ɛ_{i, - m} - ɛ_{- i, - m}) Q_{m}, & eq . 93 \\ i > 0. \end{matrix}$

For case 4, subtract the −i th row from the i th row:

$\begin{matrix} \begin{matrix} l [P_{0}] = \sum_{m = - \infty}^{\infty} ɛ_{0, m} Q_{m} \\ = ɛ_{0, 0} Q_{0} + \sum_{m = - \infty}^{- 1} ɛ_{0, m} Q_{m} + \sum_{m = 1}^{\infty} ɛ_{0, m} Q_{m} \\ = ɛ_{0, 0} Q_{0} + \sum_{m = 1}^{\infty} ɛ_{0, - m} Q_{m} + \sum_{m = 1}^{\infty} ɛ_{0, m} Q_{m}, \end{matrix} & eq . 94 \\ l [P_{0}] = ɛ_{0, 0} Q_{0} + \sum_{m = 1}^{\infty} (ɛ_{0, m} + ɛ_{0, - m}) Q_{m}, i = 0, \\ and \\ \begin{matrix} l [P_{i}] - l [P_{- i}] = \sum_{m = - \infty}^{\infty} ɛ_{i, m} Q_{m} - \sum_{m = - \infty}^{\infty} ɛ_{- i, m} Q_{m} \\ = ɛ_{i, 0} Q_{0} - ɛ_{- i, 0} Q_{0} + \sum_{m = - \infty}^{- 1} ɛ_{i, m} Q_{m} + \\ \sum_{m = 1}^{\infty} ɛ_{i, m} Q_{m} - \sum_{m = - \infty}^{- 1} ɛ_{- i, m} Q_{m} - \sum_{m = 1}^{\infty} ɛ_{- i, m} Q_{m} \\ = (ɛ_{i, 0} - ɛ_{- i, 0}) Q_{0} + \sum_{m = 1}^{\infty} ɛ_{i, - m} Q_{m} + \sum_{m = 1}^{\infty} ɛ_{i, m} Q_{m} - \\ \sum_{m = 1}^{\infty} ɛ_{- i, - m} Q_{m} - \sum_{m = 1}^{\infty} ɛ_{- i, m} Q_{m} \\ = (ɛ_{i, 0} - ɛ_{- i, 0}) Q_{0} + \\ \sum_{m = 1}^{\infty} (ɛ_{i, m} + ɛ_{i, - m} - ɛ_{- i, m} - ɛ_{- i, - m}) Q_{m} \\ = 2 l [P_{i}], \end{matrix} \\ giving \\ \begin{matrix} l [P_{i}] = \frac{1}{2} (ɛ_{i, 0} - ɛ_{- i, 0}) Q_{0} + \\ \frac{1}{2} \sum_{m = 1}^{\infty} (ɛ_{i, m} + ɛ_{i, - m} - ɛ_{- i, m} - ɛ_{- i, - m}) Q_{m}, i > 0. \end{matrix} & eq . 95 \end{matrix}$
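The row and column folding used in cases 2 through 4 can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `reduce_matrix`, the nested-list layout (orders −N..N stored at list indices 0..2N), and the sample numbers are hypothetical and do not come from the specification. The two sign arguments select the case: even Q with added rows (case 2), odd Q with added rows (case 3), or even Q with subtracted rows (case 4).

```python
def reduce_matrix(eps, N, col_sign=1, row_sign=1):
    """Fold a (2N+1)x(2N+1) matrix over orders -N..N into (N+1)x(N+1).

    col_sign = +1 when Q is even (Q[-m] == Q[m]), -1 when Q is odd.
    row_sign = +1 to add the i and -i rows, -1 to subtract the -i row.
    (Hypothetical helper; the storage layout is an assumption.)
    """
    def e(i, m):
        return eps[i + N][m + N]  # shift order index to list index

    red = [[0.0] * (N + 1) for _ in range(N + 1)]
    red[0][0] = e(0, 0)
    for m in range(1, N + 1):
        red[0][m] = e(0, m) + col_sign * e(0, -m)
    for i in range(1, N + 1):
        red[i][0] = 0.5 * (e(i, 0) + row_sign * e(-i, 0))
        for m in range(1, N + 1):
            upper = e(i, m) + col_sign * e(i, -m)
            lower = e(-i, m) + col_sign * e(-i, -m)
            red[i][m] = 0.5 * (upper + row_sign * lower)
    return red


def matvec(mat, vec):
    """Plain matrix-vector product on nested lists."""
    return [sum(a * b for a, b in zip(row, vec)) for row in mat]


# Illustrative check for case 2 with N = 2: arbitrary matrix, even Q.
N = 2
eps = [[1.0 + 0.3 * r - 0.2 * c + 0.05 * r * c for c in range(2 * N + 1)]
       for r in range(2 * N + 1)]
q_red = [2.0, -1.0, 0.5]                              # Q_0, Q_1, Q_2
q_full = [q_red[abs(m)] for m in range(-N, N + 1)]    # even extension
p_full = matvec(eps, q_full)                          # orders -N..N
p_red = matvec(reduce_matrix(eps, N, 1, 1), q_red)    # orders 0..N
```

In this sketch `p_red[i]` equals half the sum of the full results for orders i and −i, which is what eqs. 89 and 90 express; passing `col_sign=-1` or `row_sign=-1` gives the case 3 and case 4 forms.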

Application of case 2 with ɛ=E leads directly to eqs. 81 and 82 for E_reduced and leads to the reduced eigenproblem of eqs. 84 and 85. To reduce eq. 17, case 1 can be applied directly to the product BEinv⁻¹, giving

$\begin{matrix} \frac{\partial^{2} S_{x 0}}{\partial {(z^{'})}^{2}} = (\frac{k_{y}^{2}}{k_{0}}) S_{x 0} + {({BEinv}^{- 1})}_{0, 0} S_{x 0} + \sum_{m = 1}^{\infty} [{({BEinv}^{- 1})}_{0, m} + {({BEinv}^{- 1})}_{0, - m}] S_{xm} & eq . 96 \\ \frac{\partial^{2} S_{xi}}{\partial {(z^{'})}^{2}} = (\frac{k_{y}^{2}}{k_{0}}) S_{x, i} + \frac{1}{2} [{({BEinv}^{- 1})}_{i, 0} + {({BEinv}^{- 1})}_{- i, 0}] S_{x 0} + \frac{1}{2} \sum_{m = 1}^{\infty} [{({BEinv}^{- 1})}_{i, m} + {({BEinv}^{- 1})}_{i, - m} + {({BEinv}^{- 1})}_{- i, m} + {({BEinv}^{- 1})}_{- i, - m}] S_{xm} . & eq . 97 \end{matrix}$

This involves (2N+1)×(2N+1) matrix multiplications to find the elements of BEinv⁻¹. A slightly more efficient way to construct the reduced eigen-problem is to reduce the components of the product first, and then multiply the reduced (N+1)×(N+1) matrices together to form B_reduced(Einv⁻¹)_reduced.

To do this one can go back to eq. 15 and apply the appropriate reductions to the third column of the second row and second column of the third row for B and Einv⁻¹, respectively.

For B, explicitly reduce the product K_xE⁻¹K_x:

$\begin{matrix} \frac{\partial S_{xi}}{\partial (z^{'})} = \dots + \frac{k_{xi}}{k_{0}} \sum_{m = - \infty}^{\infty} {(E^{- 1})}_{i, m} \frac{k_{xm}}{k_{0}} U_{ym}, & eq . 98 \end{matrix}$

where the dots replace other terms in eq. 15 that are not relevant for the purpose of finding the reduced matrix.

Adding the i th and −i th rows:

$\begin{matrix} \frac{\partial S_{x 0}}{\partial (z^{'})} = \dots + 0, \quad i = 0, & \text{eq. 99} \\ \text{since } k_{x 0} = 0, \text{ and} \\ \begin{matrix} 2 \frac{\partial S_{xi}}{\partial (z^{'})} = \dots + \frac{k_{xi}}{k_{0}} \sum_{m = - \infty}^{\infty} {(E^{- 1})}_{i, m} \frac{k_{xm}}{k_{0}} U_{ym} + \\ \frac{k_{x - i}}{k_{0}} \sum_{m = - \infty}^{\infty} {(E^{- 1})}_{- i, m} \frac{k_{xm}}{k_{0}} U_{ym} \\ = \dots + \frac{k_{xi}}{k_{0}} [\sum_{m = - \infty}^{- 1} {(E^{- 1})}_{i, m} \frac{k_{xm}}{k_{0}} U_{ym} + \sum_{m = 1}^{\infty} {(E^{- 1})}_{i, m} \frac{k_{xm}}{k_{0}} U_{ym}] + \\ \frac{k_{x - i}}{k_{0}} [\sum_{m = - \infty}^{- 1} {(E^{- 1})}_{- i, m} \frac{k_{xm}}{k_{0}} U_{ym} + \sum_{m = 1}^{\infty} {(E^{- 1})}_{- i, m} \frac{k_{xm}}{k_{0}} U_{ym}] \\ = \dots + \frac{k_{xi}}{k_{0}} [\sum_{m = 1}^{\infty} {(E^{- 1})}_{i, - m} \frac{k_{x - m}}{k_{0}} U_{ym} + \sum_{m = 1}^{\infty} {(E^{- 1})}_{i, m} \frac{k_{xm}}{k_{0}} U_{ym}] + \\ \frac{k_{x - i}}{k_{0}} [\sum_{m = 1}^{\infty} {(E^{- 1})}_{- i, - m} \frac{k_{x - m}}{k_{0}} U_{ym} + \sum_{m = 1}^{\infty} {(E^{- 1})}_{- i, m} \frac{k_{xm}}{k_{0}} U_{ym}] \\ = \dots + \frac{k_{xi}}{k_{0}} [- \sum_{m = 1}^{\infty} {(E^{- 1})}_{i, - m} \frac{k_{xm}}{k_{0}} U_{ym} + \sum_{m = 1}^{\infty} {(E^{- 1})}_{i, m} \frac{k_{xm}}{k_{0}} U_{ym}] - \\ \frac{k_{xi}}{k_{0}} [- \sum_{m = 1}^{\infty} {(E^{- 1})}_{- i, - m} \frac{k_{xm}}{k_{0}} U_{ym} + \sum_{m = 1}^{\infty} {(E^{- 1})}_{- i, m} \frac{k_{xm}}{k_{0}} U_{ym}] \\ = \dots + \frac{k_{xi}}{k_{0}} [- \sum_{m = 1}^{\infty} {(E^{- 1})}_{i, - m} \frac{k_{xm}}{k_{0}} U_{ym} + \sum_{m = 1}^{\infty} {(E^{- 1})}_{i, m} \frac{k_{xm}}{k_{0}} U_{ym} + \\ \sum_{m = 1}^{\infty} {(E^{- 1})}_{- i, - m} \frac{k_{xm}}{k_{0}} U_{ym} - \sum_{m = 1}^{\infty} {(E^{- 1})}_{- i, m} \frac{k_{xm}}{k_{0}} U_{ym}] \\ = \dots + \frac{k_{xi}}{k_{0}} \sum_{m = 1}^{\infty} [{(E^{- 1})}_{i, m} + {(E^{- 1})}_{- i, - m} - {(E^{- 1})}_{i, - m} - {(E^{- 1})}_{- i, m}] \frac{k_{xm}}{k_{0}} U_{ym} \\ = \dots + \frac{k_{xi}}{k_{0}} \sum_{m = 1}^{\infty} [2 {(E^{- 1})}_{i, m} - 2 {(E^{- 1})}_{i, - m}] \frac{k_{xm}}{k_{0}} U_{ym}, \end{matrix} \\ \text{giving} \\ \frac{\partial S_{xi}}{\partial (z^{'})} = \dots + \frac{k_{xi}}{k_{0}} \sum_{m = 1}^{\infty} [{(E^{- 1})}_{i, m} - {(E^{- 1})}_{i, - m}] \frac{k_{xm}}{k_{0}} U_{ym}, \quad i > 0, & \text{eq. 100} \\ \text{or} \\ {(K_{x} E^{- 1} K_{x})}_{reduced}^{s} = [\begin{matrix} 0 & 0 & 0 & \dots \\ 0 & \frac{k_{x 1}}{k_{0}} [{(E^{- 1})}_{1, 1} - {(E^{- 1})}_{1, - 1}] \frac{k_{x 1}}{k_{0}} & \frac{k_{x 1}}{k_{0}} [{(E^{- 1})}_{1, 2} - {(E^{- 1})}_{1, - 2}] \frac{k_{x 2}}{k_{0}} & \dots \\ 0 & \frac{k_{x 2}}{k_{0}} [{(E^{- 1})}_{2, 1} - {(E^{- 1})}_{2, - 1}] \frac{k_{x 1}}{k_{0}} & \frac{k_{x 2}}{k_{0}} [{(E^{- 1})}_{2, 2} - {(E^{- 1})}_{2, - 2}] \frac{k_{x 2}}{k_{0}} & \dots \\ ⋮ & & & ⋱ \end{matrix}] & \text{eq. 101} \end{matrix}$

in explicit form. Then

B_reduced^s=(K_xE⁻¹K_x)_reduced^s−I. eq. 102
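As a rough illustration of eqs. 101 and 102, the reduced product and B_reduced could be assembled as below. The function names, the nested-list storage of E⁻¹ (orders −N..N at list indices 0..2N), and the sample values are assumptions made for this sketch only. The `sign` argument is −1 for the s-polarized form of eq. 101; the p-polarized form of eq. 143 uses +1.

```python
def reduced_kx_einv_kx(einv, kx, k0, N, sign):
    """(K_x E^-1 K_x)_reduced over orders 0..N.

    einv : (2N+1)x(2N+1) nested list over orders -N..N (assumed layout)
    kx   : k_x for orders 0..N, with kx[0] == 0 for phi = 90
    sign : -1 for the s form (eq. 101), +1 for the p form (eq. 143)
    """
    def e(i, m):
        return einv[i + N][m + N]

    red = [[0.0] * (N + 1) for _ in range(N + 1)]
    # Row 0 and column 0 stay zero because k_x0 = 0.
    for i in range(1, N + 1):
        for m in range(1, N + 1):
            red[i][m] = (kx[i] / k0) * (e(i, m) + sign * e(i, -m)) * (kx[m] / k0)
    return red


def b_reduced(kek_reduced):
    """B_reduced = (K_x E^-1 K_x)_reduced - I, returned as a new matrix."""
    return [[v - (1.0 if r == c else 0.0) for c, v in enumerate(row)]
            for r, row in enumerate(kek_reduced)]


# Illustrative numbers only (not from the specification).
N = 2
k0 = 1.0
kx = [0.0, 0.7, 1.4]
einv = [[1.0 / (1.0 + abs(r - c)) + 0.1 * (r - N) * (c - N)
         for c in range(2 * N + 1)] for r in range(2 * N + 1)]
u_red = [1.0, 0.5, -0.25]          # even U_y: orders 0, 1, 2
red_s = reduced_kx_einv_kx(einv, kx, k0, N, sign=-1)
b_s = b_reduced(red_s)
```

Applying `red_s` to the reduced, even U_y vector reproduces the full sum of eq. 98 for orders 0 through N, since k_x changes sign between orders m and −m while U_y does not.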

For Einv⁻¹, use

$\begin{matrix} \frac{\partial U_{yi}}{\partial z^{'}} = \dots + \sum_{m = - \infty}^{\infty} {({Einv}^{- 1})}_{im} S_{xm} & eq . 103 \end{matrix}$

to which case 1 may be directly applied:

$\begin{matrix} \frac{\partial U_{y 0}}{\partial z^{'}} = \dots + {({Einv}^{- 1})}_{0, 0} S_{x 0} + \sum_{m = 1}^{\infty} 2 {({Einv}^{- 1})}_{0, m} S_{xm}, i = 0, & eq . 104 \\ \frac{\partial U_{y i}}{\partial z^{'}} = \dots + {({Einv}^{- 1})}_{i, 0} S_{x 0} + \sum_{m = 1}^{\infty} [{({Einv}^{- 1})}_{i, m} + {({Einv}^{- 1})}_{i, - m}] S_{xm}, i > 0, which implies & eq . 105 \\ {({Einv}^{- 1})}_{reduced}^{s} = [\begin{matrix} {({Einv}^{- 1})}_{0, 0} & 2 {({Einv}^{- 1})}_{0, 1} & 2 {({Einv}^{- 1})}_{0, 2} & \dots \\ {({Einv}^{- 1})}_{1, 0} & {({Einv}^{- 1})}_{1, 1} + {({Einv}^{- 1})}_{1, - 1} & {({Einv}^{- 1})}_{1, 2} + {({Einv}^{- 1})}_{1, - 2} & \dots \\ {({Einv}^{- 1})}_{2, 0} & {({Einv}^{- 1})}_{2, 1} + {({Einv}^{- 1})}_{2, - 1} & {({Einv}^{- 1})}_{2, 2} + {({Einv}^{- 1})}_{2, - 2} & \dots \\ ⋮ & ⋰ \end{matrix}] & eq . 106 \end{matrix}$

where one makes use of the fact that (Einv⁻¹)_i,m=(Einv⁻¹)_−i,−m.

Eq. 17 becomes

[∂²S_x/∂(z′)²]=[K_y²+B_reduced^s(Einv⁻¹)_reduced^s][S_x] eq. 107

In eqs. 85 and 107 the vectors S_x and U_y and the diagonal matrices K_y and K_x are trivially reduced to consist of the zeroth and positive terms of the original vectors/matrices. When truncated at order N, the eigen-problems are of size (N+1)×(N+1) instead of (2N+1)×(2N+1), and require much less computation time to solve.

The solution to the reduced eigen-problems has the same form as eqs. 18-25, but with 4(N+1) coefficients to be determined instead of 4(2N+1). The correct reduced matrices to use in eqs. 22-25 should still be found, so that the reduced form of eq. 15 is satisfied. Here again one could have derived the entire reduced set of eqs. for eq. 15, but it is really only necessary to reduce a few specific terms in order to find A⁻¹, B⁻¹, A⁻¹K_x, and B⁻¹K_xE⁻¹ to use in eqs. 22-25.

Substituting eqs. 18-21 into the second row of eq. 15 gives

W₂Q₂=(K_xE⁻¹K_x−I)V₂₂ eq. 108

and

(K_xE⁻¹K_x−I)V₂₁=K_xE⁻¹K_yW₁. eq. 109

Substituting eq. 25 into eq. 108 gives

W₂Q₂=BV₂₂=BB⁻¹W₂Q₂, eq. 110

which implies that B⁻¹ in eq. 25 should be replaced by the inverse of the reduced matrix B_reduced found earlier.

Eqs. 24 and 109 give

$\begin{matrix} {BV}_{21} = {BB}^{- 1} (\frac{k_{y}}{k_{0}}) K_{x} E^{- 1} W_{1} = K_{x} E^{- 1} K_{y} W_{1}, & eq . 111 \end{matrix}$

which again implies that B⁻¹→(B_reduced)⁻¹ in eq. 24, and K_xE⁻¹ is found by reducing

$\begin{matrix} \frac{\partial S_{x}}{\partial (z^{'})} = \dots - K_{x} E^{- 1} K_{y} U_{x} . & eq . 112 \end{matrix}$

Since S_x is even in x and U_x is odd, the reduced matrix for K_xE⁻¹ can be found by applying case 3 with ɛ=K_xE⁻¹:

$\begin{matrix} \frac{\partial S_{x 0}}{\partial (z^{'})} = \dots - \left\{ {(K_{x} E^{- 1})}_{0, 0} U_{x 0} + \sum_{m = 1}^{\infty} [{(K_{x} E^{- 1})}_{0, m} - {(K_{x} E^{- 1})}_{0, - m}] U_{xm} \right\}, \quad i = 0, \text{ and} & \text{eq. 113} \\ \frac{\partial S_{xi}}{\partial (z^{'})} = \dots - \left\{ \frac{1}{2} [{(K_{x} E^{- 1})}_{i, 0} + {(K_{x} E^{- 1})}_{- i, 0}] U_{x 0} + \frac{1}{2} \sum_{m = 1}^{\infty} [{(K_{x} E^{- 1})}_{i, m} + {(K_{x} E^{- 1})}_{- i, m} - {(K_{x} E^{- 1})}_{i, - m} - {(K_{x} E^{- 1})}_{- i, - m}] U_{xm} \right\}, \quad i > 0. & \text{eq. 114} \end{matrix}$

This gives

$\begin{matrix} {(K_{x} E^{- 1})}_{reduced}^{s} = [\begin{matrix} {(K_{x} E^{- 1})}_{0, 0} & {(K_{x} E^{- 1})}_{0, 1} - {(K_{x} E^{- 1})}_{0, - 1} & \dots \\ \frac{1}{2} [{(K_{x} E^{- 1})}_{1, 0} + {(K_{x} E^{- 1})}_{- 1, 0}] & \frac{1}{2} [{(K_{x} E^{- 1})}_{1, 1} + {(K_{x} E^{- 1})}_{- 1, 1} - {(K_{x} E^{- 1})}_{1, - 1} - {(K_{x} E^{- 1})}_{- 1, - 1}] & \dots \\ \frac{1}{2} [{(K_{x} E^{- 1})}_{2, 0} + {(K_{x} E^{- 1})}_{- 2, 0}] & \frac{1}{2} [{(K_{x} E^{- 1})}_{2, 1} + {(K_{x} E^{- 1})}_{- 2, 1} - {(K_{x} E^{- 1})}_{2, - 1} - {(K_{x} E^{- 1})}_{- 2, - 1}] & \dots \\ ⋮ & & ⋱ \end{matrix}] & \text{eq. 115} \end{matrix}$

Substituting eqs. 18-20, 22, and 23 into the fourth row of eq. 15 gives

W₁Q₁=(K_x²−E)V₁₁=AV₁₁=AA⁻¹W₁Q₁ eq. 116

and

(K_x²−E)V₁₂=AV₁₂=AA⁻¹K_xK_yW₂. eq. 117

Since A is replaced by A_reduced in eqs. 116 and 117, A⁻¹ should be replaced by (A_reduced)⁻¹ in both eqs. 22 and 23. K_x is simply replaced by a diagonal matrix with the (K_x)₀₀, (K_x)₁₁, . . . , (K_x)_NN components of the original K_x matrix, as always.

Therefore eqs. 22-25 are replaced by

V₁₁=(A_reduced^s)⁻¹W₁Q₁, eq. 118

V₁₂=(k_y/k₀)(A_reduced^s)⁻¹K_xW₂, eq. 119

V₂₁=(k_y/k₀)(B_reduced^s)⁻¹(K_xE⁻¹)_reduced^sW₁, eq. 120

V₂₂=(B_reduced^s)⁻¹W₂Q₂, eq. 121

where Q₁, W₁, Q₂, and W₂ are the eigenvalue and eigenvector matrices for the new, reduced eigenproblems of eqs. 85 and 107.

This new, reduced eigen-system, combined with the reduced boundary problem, gives exactly the same diffracted amplitudes and diffraction efficiencies as the old formulation for phi=90 for any given truncation order, N, but with much improved computational efficiency. For a given order, N, the computation time is reduced by a factor of approximately 8 compared to the old formulation.
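The factor of approximately 8 is consistent with the matrix sizes alone if one assumes that dense eigen-solvers and matrix inversions scale roughly as the cube of the matrix dimension. That scaling is an assumption of this sketch, not a statement from the specification, and `size_cost_ratio` is a hypothetical helper name.

```python
def size_cost_ratio(N):
    """Ratio of (2N+1)^3 to (N+1)^3 under an assumed cubic cost model."""
    return (2 * N + 1) ** 3 / (N + 1) ** 3


# The ratio climbs toward 8 as the truncation order grows.
for order in (5, 10, 20, 40):
    print(order, round(size_cost_ratio(order), 2))
```

For small truncation orders the ratio is somewhat below 8, and it approaches 8 as N increases.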

In some cases, the new, reduced phi=90 algorithms can be significantly faster than even the corresponding classical mount problem with the same polar incidence angle. In the theoretical best case limit, the phi=90 case requires about 62.5% of the time of the corresponding classical case. This assumes that the eigen-problem and boundary value problem require equal amounts of time to solve for a given truncation order, N. In practice, this is more or less realized for lower truncation orders. Such a speed advantage can quickly add up when considering the amount of time that may be required to generate a library of several million spectra. In such cases, it may be beneficial to use the phi=90 mount only.
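One way to arrive at the 62.5% figure is sketched below; it is offered as a plausible reading, not as the derivation the author used. Assume cubic cost scaling, assume the classical mount solves one eigen-problem of size (2N+1) while the reduced phi=90 mount solves two of size (N+1) (each costing roughly 1/8 as much), and assume the two boundary problems are of comparable size and cost. With the classical eigen-problem and boundary problem each taking a time T:

$\begin{matrix} \frac{t_{90}}{t_{0}} \approx \frac{2 \cdot \frac{1}{8} T + T}{T + T} = \frac{1.25 T}{2 T} = 0.625 \end{matrix}$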

In the other limiting case where a very large truncation order is required, the computation time is basically dominated by the large matrix inversion in the boundary problem (eq. 47). In this limit, the phi=90 case requires approximately 92.5% of the computation time of the phi=0 case. Steps can be taken to make the matrix inversion more efficient, since only the top half is used, which is of some help.

These estimates ignore the fact that there is a little more overhead when constructing the various matrices for the phi=90 case than with the phi=0 case. In practice, the difference in computation speed ranges from about equal for the phi=90 and phi=0 cases to a 20-30% speed improvement for the phi=90 mount over the corresponding phi=0 case. Either way, the improvement over the old phi=90 formulation is quite significant, and the ideas outlined in the introduction section involving multiple azimuthal datasets can be employed without a disabling increase in computation cost.

To complete the description, the reduced eigen-system is derived for p polarized incident light in the phi=90 conical mount. In this case, the fields satisfy

R_x,i=−R_x,−i eq. 122

R_p,i=R_p,−i eq. 123

T_x,i=−T_x,−i eq. 124

T_p,i=T_p,−i, eq. 125

in regions I and II, and

S_x,i=−S_x,−i eq. 126

S_y,i=S_y,−i eq. 127

U_x,i=U_x,−i eq. 128

U_y,i=−U_y,−i, eq. 129

in the grating region.
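The parity relations of eqs. 122-129 mean that only the zeroth and positive orders need to be stored; the negative orders follow by symmetry. A minimal sketch, assuming a plain Python list holding orders 0..N (the helper name `expand_orders` and the dictionary `P_POL_PARITY` are hypothetical):

```python
# Parity of each quantity for p-polarized incidence at phi = 90,
# taken from eqs. 122-129: +1 is even in the order index, -1 is odd.
P_POL_PARITY = {
    "R_x": -1, "R_p": +1, "T_x": -1, "T_p": +1,
    "S_x": -1, "S_y": +1, "U_x": +1, "U_y": -1,
}


def expand_orders(reduced, parity):
    """Rebuild orders -N..N from orders 0..N using even/odd symmetry."""
    if parity == -1 and reduced[0] != 0:
        # X_0 = -X_0 forces an odd quantity to vanish at order 0.
        raise ValueError("an odd quantity must be zero at order 0")
    n_max = len(reduced) - 1
    return [parity * reduced[-m] if m < 0 else reduced[m]
            for m in range(-n_max, n_max + 1)]


full_even = expand_orders([1.0, 2.0, 3.0], P_POL_PARITY["S_y"])
full_odd = expand_orders([0.0, 2.0, 3.0], P_POL_PARITY["S_x"])
```

The odd case makes explicit that the zeroth-order component of an odd quantity is zero.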

Eqs. 122-129 applied to the boundary problem lead to the same conclusion as in the s-polarized incidence case, except in this case add the i and −i terms for eqs. 28, 29, 35, and 36, and subtract the −ith from the ith terms in eqs. 26, 27, 33, and 34.

Again, the boundary matching at z=0 and z=d leads to eqs. 26-41 for the boundary equations, but with N+1 sized vectors R_s, R_p, T_s, and T_p, so long as it is again possible to reduce the eigen-problem as well.

To do this, start with eq. 16 and apply the case 1 reduction:

$\begin{matrix} \frac{\partial^{2} U_{x 0}}{\partial {(z^{'})}^{2}} = \frac{k_{y}^{2}}{k_{0}^{2}} U_{x 0} - \left\{ E_{0, 0} U_{x 0} + \sum_{m = 1}^{\infty} (E_{0, m} + E_{0, - m}) U_{xm} \right\} & \text{eq. 130} \\ \frac{\partial^{2} U_{xi}}{\partial {(z^{'})}^{2}} = \frac{k_{y}^{2}}{k_{0}^{2}} U_{xi} + \frac{k_{xi}^{2}}{k_{0}^{2}} U_{xi} - \left\{ \frac{1}{2} (E_{i, 0} + E_{- i, 0}) U_{x 0} + \frac{1}{2} \sum_{m = 1}^{\infty} (E_{i, m} + E_{- i, - m} + E_{- i, m} + E_{i, - m}) U_{xm} \right\} & \text{eq. 131} \end{matrix}$

which shows that E_reduced is given by

$\begin{matrix} E_{0, 0} U_{x 0} + \sum_{m = 1}^{\infty} (E_{0, m} + E_{0, - m}) U_{xm}, \quad i = 0, \text{ and} & \text{eq. 132} \\ \frac{1}{2} (E_{i, 0} + E_{- i, 0}) U_{x 0} + \frac{1}{2} \sum_{m = 1}^{\infty} (E_{i, m} + E_{- i, - m} + E_{- i, m} + E_{i, - m}) U_{xm}, \quad i > 0, \text{ or} & \text{eq. 133} \\ E_{reduced}^{p} = [\begin{matrix} E_{0, 0} & E_{0, 1} + E_{0, - 1} & E_{0, 2} + E_{0, - 2} & \dots \\ \frac{1}{2} (E_{1, 0} + E_{- 1, 0}) & \frac{1}{2} (E_{1, 1} + E_{- 1, - 1} + E_{1, - 1} + E_{- 1, 1}) & \frac{1}{2} (E_{1, 2} + E_{- 1, - 2} + E_{1, - 2} + E_{- 1, 2}) & \dots \\ \frac{1}{2} (E_{2, 0} + E_{- 2, 0}) & \frac{1}{2} (E_{2, 1} + E_{- 2, - 1} + E_{2, - 1} + E_{- 2, 1}) & \frac{1}{2} (E_{2, 2} + E_{- 2, - 2} + E_{2, - 2} + E_{- 2, 2}) & \dots \\ ⋮ & & & ⋱ \end{matrix}] & \text{eq. 134} \\ \text{Using} \\ A_{reduced}^{p} = K_{x}^{2} - E_{reduced}^{p}, & \text{eq. 135} \end{matrix}$

eq. 16 becomes

[∂²U_x/∂(z′)²]=[K_y²+A_reduced^p][U_x] eq. 136

where the indices on U_x run from 0 to N, and K_x and K_y are reduced as in the s polarization case.

For eq. 17, case 2 could be applied directly to the product BEinv⁻¹, but a more efficient set of operations is to proceed as in the s polarization case. Einv⁻¹ is reduced by applying case 2 to eq.

$\begin{matrix} \frac{\partial U_{y 0}}{\partial z^{'}} = \dots + {({Einv}^{- 1})}_{0, 0} S_{x 0}, \quad i = 0, & \text{eq. 137} \\ \frac{\partial U_{yi}}{\partial z^{'}} = \dots + \sum_{m = 1}^{\infty} [{({Einv}^{- 1})}_{i, m} - {({Einv}^{- 1})}_{i, - m}] S_{xm}, \quad i > 0, \text{ or} & \text{eq. 138} \\ {({Einv}^{- 1})}_{reduced}^{p} = [\begin{matrix} {({Einv}^{- 1})}_{0, 0} & 0 & 0 & \dots \\ 0 & {({Einv}^{- 1})}_{1, 1} - {({Einv}^{- 1})}_{1, - 1} & {({Einv}^{- 1})}_{1, 2} - {({Einv}^{- 1})}_{1, - 2} & \dots \\ 0 & {({Einv}^{- 1})}_{2, 1} - {({Einv}^{- 1})}_{2, - 1} & {({Einv}^{- 1})}_{2, 2} - {({Einv}^{- 1})}_{2, - 2} & \dots \\ ⋮ & & & ⋱ \end{matrix}] & \text{eq. 139} \end{matrix}$

where the fact that (Einv⁻¹)_i,m=(Einv⁻¹)_−i,−m is utilized.

For B, explicitly reduce the product K_xE⁻¹K_x:

$\begin{matrix} \frac{\partial S_{xi}}{\partial (z^{'})} = \dots + \frac{k_{xi}}{k_{0}} \sum_{m = - \infty}^{\infty} {(E^{- 1})}_{i, m} \frac{k_{xm}}{k_{0}} U_{ym} . & eq . 140 \end{matrix}$

Subtracting the −i th row from the i th row:

$\begin{matrix} \frac{\partial S_{x 0}}{\partial (z^{'})} = \dots + 0, \quad i = 0, \text{ since } k_{x 0} = 0, \text{ and} & \text{eq. 141} \\ \begin{matrix} 2 \frac{\partial S_{xi}}{\partial (z^{'})} = \dots + \frac{k_{xi}}{k_{0}} \sum_{m = - \infty}^{\infty} {(E^{- 1})}_{i, m} \frac{k_{xm}}{k_{0}} U_{ym} - \frac{k_{x - i}}{k_{0}} \sum_{m = - \infty}^{\infty} {(E^{- 1})}_{- i, m} \frac{k_{xm}}{k_{0}} U_{ym} \\ = \dots + \frac{k_{xi}}{k_{0}} [\sum_{m = - \infty}^{- 1} {(E^{- 1})}_{i, m} \frac{k_{xm}}{k_{0}} U_{ym} + \sum_{m = 1}^{\infty} {(E^{- 1})}_{i, m} \frac{k_{xm}}{k_{0}} U_{ym}] - \\ \frac{k_{x - i}}{k_{0}} [\sum_{m = - \infty}^{- 1} {(E^{- 1})}_{- i, m} \frac{k_{xm}}{k_{0}} U_{ym} + \sum_{m = 1}^{\infty} {(E^{- 1})}_{- i, m} \frac{k_{xm}}{k_{0}} U_{ym}] \\ = \dots + \frac{k_{xi}}{k_{0}} [- \sum_{m = 1}^{\infty} {(E^{- 1})}_{i, - m} \frac{k_{x - m}}{k_{0}} U_{ym} + \sum_{m = 1}^{\infty} {(E^{- 1})}_{i, m} \frac{k_{xm}}{k_{0}} U_{ym}] - \\ \frac{k_{x - i}}{k_{0}} [- \sum_{m = 1}^{\infty} {(E^{- 1})}_{- i, - m} \frac{k_{x - m}}{k_{0}} U_{ym} + \sum_{m = 1}^{\infty} {(E^{- 1})}_{- i, m} \frac{k_{xm}}{k_{0}} U_{ym}] \\ = \dots + \frac{k_{xi}}{k_{0}} [\sum_{m = 1}^{\infty} {(E^{- 1})}_{i, - m} \frac{k_{xm}}{k_{0}} U_{ym} + \sum_{m = 1}^{\infty} {(E^{- 1})}_{i, m} \frac{k_{xm}}{k_{0}} U_{ym}] - \\ \frac{k_{x - i}}{k_{0}} [\sum_{m = 1}^{\infty} {(E^{- 1})}_{- i, - m} \frac{k_{xm}}{k_{0}} U_{ym} + \sum_{m = 1}^{\infty} {(E^{- 1})}_{- i, m} \frac{k_{xm}}{k_{0}} U_{ym}] \\ = \dots + \frac{k_{xi}}{k_{0}} [\sum_{m = 1}^{\infty} {(E^{- 1})}_{i, - m} \frac{k_{xm}}{k_{0}} U_{ym} + \sum_{m = 1}^{\infty} {(E^{- 1})}_{i, m} \frac{k_{xm}}{k_{0}} U_{ym}] + \\ \frac{k_{xi}}{k_{0}} [\sum_{m = 1}^{\infty} {(E^{- 1})}_{- i, - m} \frac{k_{xm}}{k_{0}} U_{ym} + \sum_{m = 1}^{\infty} {(E^{- 1})}_{- i, m} \frac{k_{xm}}{k_{0}} U_{ym}] \\ = \dots + \frac{k_{xi}}{k_{0}} [\sum_{m = 1}^{\infty} {(E^{- 1})}_{i, - m} \frac{k_{xm}}{k_{0}} U_{ym} + \sum_{m = 1}^{\infty} {(E^{- 1})}_{i, m} \frac{k_{xm}}{k_{0}} U_{ym} + \\ \sum_{m = 1}^{\infty} {(E^{- 1})}_{- i, - m} \frac{k_{xm}}{k_{0}} U_{ym} + \sum_{m = 1}^{\infty} {(E^{- 1})}_{- i, m} \frac{k_{xm}}{k_{0}} U_{ym}] \\ = \dots + \frac{k_{xi}}{k_{0}} \sum_{m = 1}^{\infty} [{(E^{- 1})}_{i, - m} + {(E^{- 1})}_{i, m} + {(E^{- 1})}_{- i, - m} + {(E^{- 1})}_{- i, m}] \frac{k_{xm}}{k_{0}} U_{ym} \\ = \dots + \frac{k_{xi}}{k_{0}} \sum_{m = 1}^{\infty} [2 {(E^{- 1})}_{i, m} + 2 {(E^{- 1})}_{i, - m}] \frac{k_{xm}}{k_{0}} U_{ym}, \end{matrix} \\ \text{giving} \\ \frac{\partial S_{xi}}{\partial (z^{'})} = \dots + \frac{k_{xi}}{k_{0}} \sum_{m = 1}^{\infty} [{(E^{- 1})}_{i, m} + {(E^{- 1})}_{i, - m}] \frac{k_{xm}}{k_{0}} U_{ym}, \quad i > 0, \text{ which implies} & \text{eq. 142} \\ {(K_{x} E^{- 1} K_{x})}_{reduced}^{p} = [\begin{matrix} 0 & 0 & 0 & \dots \\ 0 & \frac{k_{x 1}}{k_{0}} [{(E^{- 1})}_{1, 1} + {(E^{- 1})}_{1, - 1}] \frac{k_{x 1}}{k_{0}} & \frac{k_{x 1}}{k_{0}} [{(E^{- 1})}_{1, 2} + {(E^{- 1})}_{1, - 2}] \frac{k_{x 2}}{k_{0}} & \dots \\ 0 & \frac{k_{x 2}}{k_{0}} [{(E^{- 1})}_{2, 1} + {(E^{- 1})}_{2, - 1}] \frac{k_{x 1}}{k_{0}} & \frac{k_{x 2}}{k_{0}} [{(E^{- 1})}_{2, 2} + {(E^{- 1})}_{2, - 2}] \frac{k_{x 2}}{k_{0}} & \dots \\ ⋮ & & & ⋱ \end{matrix}] & \text{eq. 143} \end{matrix}$

in explicit form. Then

B_reduced^p=(K_xE⁻¹K_x)_reduced^p−I eq. 144

and eq. 17 becomes

[∂²S_x/∂(z′)²]=[K_y²+B_reduced^p(Einv⁻¹)_reduced^p][S_x], eq. 145

where again the indices run from 0 to N and K_y is reduced as in the s polarization case.

The corresponding equations to replace eqs. 21-25 are found in a similar manner as before. Most of the verification steps are omitted here. A⁻¹ and B⁻¹ are replaced by (A_reduced)⁻¹ and (B_reduced)⁻¹ as before. To find K_xE⁻¹ in eq. 24, use eq. 111 with case 4:

$\begin{matrix} \frac{\partial S_{x 0}}{\partial (z^{'})} = \dots - \left\{ {(K_{x} E^{- 1})}_{0, 0} U_{x 0} + \sum_{m = 1}^{\infty} [{(K_{x} E^{- 1})}_{0, m} + {(K_{x} E^{- 1})}_{0, - m}] U_{xm} \right\}, \quad i = 0, \text{ and} & \text{eq. 146} \\ \frac{\partial S_{xi}}{\partial (z^{'})} = \dots - \left\{ \frac{1}{2} [{(K_{x} E^{- 1})}_{i, 0} - {(K_{x} E^{- 1})}_{- i, 0}] U_{x 0} + \frac{1}{2} \sum_{m = 1}^{\infty} [{(K_{x} E^{- 1})}_{i, m} + {(K_{x} E^{- 1})}_{i, - m} - {(K_{x} E^{- 1})}_{- i, m} - {(K_{x} E^{- 1})}_{- i, - m}] U_{xm} \right\}, \quad i > 0. & \text{eq. 147} \end{matrix}$

This gives

$\begin{matrix} {(K_{x} E^{- 1})}_{reduced}^{p} = [\begin{matrix} {(K_{x} E^{- 1})}_{0, 0} & {(K_{x} E^{- 1})}_{0, 1} + {(K_{x} E^{- 1})}_{0, - 1} & \dots \\ \frac{1}{2} [{(K_{x} E^{- 1})}_{1, 0} - {(K_{x} E^{- 1})}_{- 1, 0}] & \frac{1}{2} [{(K_{x} E^{- 1})}_{1, 1} + {(K_{x} E^{- 1})}_{1, - 1} - {(K_{x} E^{- 1})}_{- 1, 1} - {(K_{x} E^{- 1})}_{- 1, - 1}] & \dots \\ \frac{1}{2} [{(K_{x} E^{- 1})}_{2, 0} - {(K_{x} E^{- 1})}_{- 2, 0}] & \frac{1}{2} [{(K_{x} E^{- 1})}_{2, 1} + {(K_{x} E^{- 1})}_{2, - 1} - {(K_{x} E^{- 1})}_{- 2, 1} - {(K_{x} E^{- 1})}_{- 2, - 1}] & \dots \\ ⋮ & ⋰ \end{matrix}] & eq . 148 \end{matrix}$

Putting all of this together, eqs. 22-25 for p polarized incidence are replaced by

V₁₁=(A_reduced^p)⁻¹W₁Q₁, eq. 149

V₁₂=(k_y/k₀)(A_reduced^p)⁻¹K_xW₂, eq. 150

V₂₁=(k_y/k₀)(B_reduced^p)⁻¹(K_xE⁻¹)_reduced^pW₁, eq. 151

V₂₂=(B_reduced^p)⁻¹W₂Q₂, eq. 152

where Q₁, W₁, Q₂, and W₂ are the eigenvalue and eigenvector matrices for the new, reduced eigen-problems of eqs. 136 and 145. The speed improvement is very similar to the s polarization case.

After solving the reduced boundary problem for the particular s or p incidence case, the diffraction efficiencies can be obtained from

$\begin{matrix} {DE}_{ri} = {| R_{s, i} |}^{2} Re (\frac{k_{I, zi}}{k_{0} n_{I} \cos θ}) + {| R_{p, i} |}^{2} Re (\frac{k_{I, zi} / n_{I}^{2}}{k_{0} n_{I} \cos θ}) . & \text{eq. 153} \end{matrix}$

For i=0, eq. 153 is just the specular reflectance for the given incident condition.
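Eq. 153 translates almost directly into code. The sketch below assumes the amplitude factors are squared magnitudes of complex amplitudes, a real incident-medium index n_I, and theta in radians; the function name and the sample values are illustrative only.

```python
import math


def diffraction_efficiency(R_s, R_p, k_Iz, k0, n_I, theta):
    """Reflected diffraction efficiency of one order, following eq. 153.

    R_s, R_p : complex reflected amplitudes for the order
    k_Iz     : z component of the order's wave vector in region I
               (purely imaginary for an evanescent order)
    n_I      : index of the incident medium, assumed real here
    theta    : polar angle of incidence in radians
    """
    denom = k0 * n_I * math.cos(theta)
    s_term = abs(R_s) ** 2 * complex(k_Iz / denom).real
    p_term = abs(R_p) ** 2 * complex((k_Iz / n_I ** 2) / denom).real
    return s_term + p_term


# Specular order in vacuum: k_Iz0 = k0 * n_I * cos(theta), so both
# geometric factors equal 1 and the result is |R_s|^2 + |R_p|^2.
theta = 0.5
de_specular = diffraction_efficiency(0.6 + 0.0j, 0.0j, math.cos(theta), 1.0, 1.0, theta)

# An evanescent order has imaginary k_Iz and carries no power.
de_evanescent = diffraction_efficiency(0.3 + 0.1j, 0.2j, 0.4j, 1.0, 1.0, theta)
```

Taking the real part is what removes evanescent orders from the energy balance.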

It should be pointed out that the only assumption about the grating permittivity expansion coefficients was the symmetry exploited in Eq. 76. In other words, the specific form of the permittivity Fourier coefficients for a binary grating shown in Eq. 9 was not explicitly used in the above descriptions. The grating can consist of more than 2 different materials with differing optical properties, the only difference being that the permittivity Fourier coefficients are different from the coefficients given for the binary structure in Eq. 9. The grating should still satisfy Eq. 76 where required. Additionally, as with the conventional formulation, profile shapes other than rectangular can be treated using a staircase approximation consisting of multiple rectangular grating layers.

The calculated diffraction efficiencies or amplitudes can be used to compute polarized or unpolarized reflectance data, ellipsometric data, or polarimetric data. During an optical grating measurement, one or more datasets are generated by varying the incident wavelength, polar angle of incidence, theta, and rotating the azimuthal angle of incidence between 0 degrees and 90 degrees. The optical data of the one or more datasets are compared to data generated from a theoretical model of the grating using the above calculation methods. A regression analysis is used to optimize the parameters of the theoretical grating model. The result of the optical measurement is given by the optimized grating parameters. The average of the s and p incident calculations can be used to analyze unpolarized reflectance.

The regression algorithm can be the Simplex or Levenberg-Marquardt algorithms, described in W. H. Press, S. A. Teukolsky, W. T. Vetterling, and B. P. Flannery, Numerical Recipes in C (2nd Edition), Cambridge University Press, Cambridge, 1992, among others, or can even consist of a simple parameter grid search. The model calculation can be performed in real-time (at the time of measurement), using one or multiple CPUs. The theoretical model spectra can also be pre-calculated ahead of time, generating a library database of spectra, from which the calculation result can be rapidly extracted during the measurement. A neural network can be pre-generated, from which the best-fit model can be directly extracted using a fixed number of relatively simple calculation steps during the measurement.
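The library approach can be pictured as a pre-computed table of spectra searched for the closest match. The sketch below is deliberately schematic: `toy_model` is a made-up stand-in for the rigorous coupled wave calculation and has no physical meaning, and the single "width" parameter, the wavelength grid, and the unweighted sum-of-squares distance are all assumptions chosen for illustration.

```python
import math


def toy_model(width, wavelengths):
    """Hypothetical placeholder for the RCW calculation (not physical)."""
    return [0.5 + 0.4 * math.cos(width / w) for w in wavelengths]


def sum_sq(a, b):
    """Unweighted sum of squared differences between two spectra."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def build_library(widths, wavelengths):
    """Pre-calculate one spectrum per candidate parameter value."""
    return {w: toy_model(w, wavelengths) for w in widths}


def library_lookup(library, measured):
    """Return the parameter whose stored spectrum best matches the data."""
    return min(library, key=lambda w: sum_sq(library[w], measured))


wavelengths = [200.0 + 10.0 * k for k in range(30)]
widths = [40.0 + 0.5 * k for k in range(41)]        # 40.0 to 60.0
library = build_library(widths, wavelengths)
measured = toy_model(47.5, wavelengths)              # synthetic "measurement"
best = library_lookup(library, measured)
```

A grid search is the same loop with the model evaluated on demand instead of read from the table; a Simplex or Levenberg-Marquardt routine would replace the exhaustive `min` with an iterative update.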

In many cases, specularly reflected, transmitted, and/or diffracted intensities are detected, and the optical system can be calibrated to give reflectance (OR diffraction efficiency), transmittance, or diffraction efficiency. However, in some cases, particularly for VUV reflectance work, it may be beneficial to normalize some of the intensities with intensities from other structures or from different incidence conditions. These ratios are independent of incident intensity, and a system calibration that involves determining incident intensity may be skipped. The analysis can be done by calculating the corresponding reflectance or diffraction efficiency ratios. For example, a first dataset may be reflected (0 order) intensity I(0) due to unpolarized light incident at phi=0, and the second dataset may be reflected (0 order) intensity I(90) due to unpolarized light incident at phi=90. The incident intensity will typically not change over short time periods, so if the datasets are collected in close succession, the intensity ratio is the same as the reflectance ratio:

$\begin{matrix} \frac{I (0)}{I (90)} = \frac{R (0)}{R (90)} & eq . 154 \end{matrix}$

R(0) and R(90) can be calculated using the conventional phi=0 calculation and new phi=90 calculation presented above. A regression procedure might use the following merit function:

$\begin{matrix} χ^{2} = \sum_{i = 1}^{N} {(\frac{1}{σ_{i}})}^{2} {({(\frac{R (0)}{R (90)})}_{i, measured} - {(\frac{R (0)}{R (90)})}_{i, calculated})}^{2} & eq . 155 \end{matrix}$

where the subscript i refers to incident condition (usually wavelength), σ_i is the estimated uncertainty of the measured reflectance ratio, and N is the total number of data points included for the ratio. The merit function is minimized by the regression procedure, thereby optimizing the grating parameters, which affect the calculated values for both numerator and denominator of the ratio. Note that in this case, the grating parameters are the same for both numerator and denominator.
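A minimal sketch of the merit function of eq. 155 follows, assuming the measured intensities and calculated reflectances are supplied as equal-length lists, one entry per incident condition. The function name and the sample numbers are hypothetical.

```python
def ratio_chi_squared(i0_meas, i90_meas, r0_calc, r90_calc, sigma):
    """Chi-squared between measured and calculated phi=0 / phi=90 ratios.

    i0_meas, i90_meas : measured intensities at phi=0 and phi=90
    r0_calc, r90_calc : calculated reflectances at phi=0 and phi=90
    sigma             : estimated uncertainty of each measured ratio
    """
    total = 0.0
    for i0, i90, r0, r90, s in zip(i0_meas, i90_meas, r0_calc, r90_calc, sigma):
        residual = i0 / i90 - r0 / r90
        total += (residual / s) ** 2
    return total


# Raw intensities scale with the unknown incident intensity, but the
# ratio does not, so no absolute calibration enters the comparison.
i0 = [2.0, 4.0]
i90 = [4.0, 4.0]
sigma = [0.1, 0.1]
chi2_match = ratio_chi_squared(i0, i90, [0.25, 0.5], [0.5, 0.5], sigma)
chi2_off = ratio_chi_squared(i0, i90, [0.3, 0.5], [0.5, 0.5], sigma)
chi2_scaled = ratio_chi_squared([3.0 * v for v in i0], [3.0 * v for v in i90],
                                [0.3, 0.5], [0.5, 0.5], sigma)
```

Scaling both measured intensities by the same factor leaves the result unchanged, which is the property that allows the incident-intensity calibration to be skipped.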

As described above, the analysis of a diffraction grating problem is of particular use in determining the various characteristics of the diffraction grating structure including, for example, the critical dimensions and the composition of a diffraction grating. The analysis techniques described herein are of particular use in reducing the complexity and increasing the speed of such analysis, which is of particular importance in high volume manufacturing processes. It will be recognized that the diffraction problem analysis techniques described herein may be utilized in a wide range of applications where it is desirable to analyze a diffraction grating to obtain any of a wide range of types of characteristics of the grating structure.

Further modifications and alternative embodiments of this invention will be apparent to those skilled in the art in view of this description. It will be recognized, therefore, that the present invention is not limited by these example arrangements. Accordingly, this description is to be construed as illustrative only and is for the purpose of teaching those skilled in the art the manner of carrying out the invention. It is to be understood that the forms of the invention herein shown and described are to be taken as the presently preferred embodiments. Various changes may be made in the implementations and architectures. For example, equivalent elements may be substituted for those illustrated and described herein, and certain features of the invention may be utilized independently of the use of other features, all as would be apparent to one skilled in the art after having the benefit of this description of the invention.

Claims

1. A method of measuring characteristics of a diffraction grating structure, comprising:

providing incident light comprising one or more multiple wavelengths;

providing the incident light in an incident plane of the light that is at a first azimuthal angle of phi=0 with respect to a plane perpendicular to the diffraction grating structure;

providing the incident light in the incident plane of the light that is at a second azimuthal angle of phi=90 with respect to a plane perpendicular to the diffraction grating structure;

detecting light reflected from the diffraction grating structure when the incident light is at the first azimuthal angle of effectively phi=0 and when the incident light is at the second azimuthal angle of effectively phi=90;

utilizing the detected light as part of a diffraction analysis; and

determining at least one characteristic of the diffraction grating structure by exploiting symmetry properties of the diffraction analysis such that the computation time for data at the second azimuthal angle is approximately the same as or is less than the computation time for data at the first azimuthal angle.

2. The method of claim 1, wherein data is only collected at the first azimuthal angle and the second azimuthal angle.

3. The method of claim 1, wherein the diffraction analysis comprises utilizing a rigorous coupled wave analysis.

4. The method of claim 3, wherein the symmetry properties comprise symmetry properties of the Fourier expansions of the rigorous coupled wave (RCW) analysis for the second azimuthal angle, allowing RCW eigen- and boundary problems to be reduced in complexity.

5. The method of claim 4, wherein a second azimuthal angle boundary problem is reduced to a 4(N+1)×4(N+1) system of equations and an eigen-problem is reduced to two (N+1)×(N+1) eigen-systems for a given truncation order, N.

6. The method of claim 2, wherein four data sets are utilized in the diffraction analysis for each polar angle, the data sets being comprised of two different polarizations at each of the first and second azimuthal angles.

7. The method of claim 2, further comprising utilizing multiple polar angles at each of the first and second azimuthal angles.

8. The method of claim 2, wherein the determined characteristic is a geometrical characteristic.

9. The method of claim 8, wherein the geometrical characteristic is a line width, height and/or depth.

10. The method of claim 2, wherein computation time for data at the second azimuthal angle is at least 20% less than the computation time for data at the first azimuthal angle.

11. The method of claim 10, wherein computation time for data at the second azimuthal angle is at least 30% less than the computation time for data at the first azimuthal angle.

12. The method of claim 1, wherein the computation time for data at the second azimuthal angle is significantly reduced compared to a conventional computation at the second azimuthal angle.

13. The method of claim 12, wherein the computation time for data at the second azimuthal angle is reduced by a factor of approximately 8 over a conventional calculation at the second azimuthal angle.

14. The method of claim 1, wherein the incident light comprises multiple wavelengths of light.

15. A method of characterizing a diffraction grating structure, comprising

collecting a first set of reflected data from the grating structure by providing incident light at a first angle of azimuthal incidence with respect to the grating structure;

collecting a second set of reflected data from the grating structure by providing incident light at a second angle of azimuthal incidence with respect to the grating structure, the first and second angles being effectively orthogonal and the second angle of azimuthal incidence being different from zero;

analyzing a combination of at least the first and second set of reflected data; and

utilizing symmetrical characteristics of a diffraction analysis of the second angle of azimuthal incidence reflected data so as to reduce the computation complexity of the analysis of the second set of reflected data during the determination of at least one geometrical characteristic of the grating structure.

16. The method of claim 15, wherein one or more of the sets of data are used to normalize the other set(s) of data so that the optical metrology data comprises ratios of reflected data collected for different incident conditions, avoiding the need to determine incident intensity via an absolute calibration process.

17. The method of claim 16, wherein the optical metrology data comprises a first ratio of at least a portion of the first set of reflected data and at least a portion of the second set of reflected data.

18. The method of claim 17, wherein the diffraction analysis comprises a regression or library lookup procedure that minimizes the difference between a calculated reflectance or diffraction efficiency ratio and a measured intensity ratio.

19. The method of claim 18, wherein an inverse ratio is substituted in specific wavelength regions where the denominator of the first ratio is near zero.

20. The method of claim 19, wherein a weighting function is used to equalize a contribution to a merit function regardless of a reflectance ratio magnitude.

21. The method of claim 18, wherein data regions at which a denominator of the first ratio is near zero are dropped from the diffraction analysis.

22. The method of claim 15, wherein one or more diffracted orders of reflected data are detected along with or instead of the 0th order.

23. The method of claim 15, wherein data is only collected at the first azimuthal angle and the second azimuthal angle.

24. The method of claim 23, wherein the diffraction analysis comprises utilizing a rigorous coupled wave analysis.

25. The method of claim 15, wherein four data sets are utilized in the diffraction analysis for each polar angle, the data sets being comprised of two different polarizations at each of the first and second azimuthal angles.

26. The method of claim 15, further comprising utilizing multiple polar angles at each of the first and second azimuthal angles.

27. The method of claim 15, wherein the diffraction analysis comprises utilizing a rigorous coupled wave analysis.

28. The method of claim 27, wherein the symmetry properties comprise symmetry properties of the Fourier expansions of the rigorous coupled wave (RCW) analysis for the second azimuthal angle, allowing RCW eigen- and boundary problems to be reduced in complexity.

29. The method of claim 28, wherein a second azimuthal angle boundary problem is reduced to a 4(N+1)×4(N+1) system of equations and an eigen-problem is reduced to two (N+1)×(N+1) eigen-systems for a given truncation order, N.

30. An optical metrology system, comprising:

a light source;

a sample having a diffraction grating, the light source providing incident light to the diffraction grating, the plane of incidence of the light with respect to the diffraction grating being changeable with respect to at least an azimuthal rotation;

a detector collecting data from light diffracted by the diffraction grating, the system being configured to collect data from the diffraction grating when the incident light is at a first azimuthal angle of phi=0 and when the incident light is at a second azimuthal angle of phi=90; and

a computing system which utilizes the symmetrical characteristics of a diffraction analysis of the second azimuthal angle so as to reduce the computation complexity of an analysis of the data collected from the diffraction grating at the second azimuthal angle during the determination of at least one characteristic of the diffraction grating.

31. The optical metrology system of claim 30, wherein the sample is rotated to provide the azimuthal rotation.

32. The optical metrology system of claim 30, wherein the plane of incidence of the light is rotated to provide the azimuthal rotation.

33. The optical metrology system of claim 30, wherein the polar angle of the incident light is changeable.

34. The optical metrology system of claim 30, wherein the light source provides at least VUV wavelengths of light.

35. The optical metrology system of claim 30, wherein the incident light is provided through a high numeric aperture optic using an aperture stop to allow incident light at only specific angles.

36. The optical metrology system of claim 35, wherein the high numeric aperture optic is configured to transmit multiple azimuthal angles of incident light simultaneously.

37. A method of measuring characteristics of a diffraction grating structure, comprising:

providing incident light comprising multiple wavelengths;

providing the incident light in an incident plane of the light that is at a first azimuthal angle of effectively phi=90 with respect to a plane perpendicular to the diffraction grating structure;

detecting light reflected or diffracted from the diffraction grating structure when the incident light is at the first azimuthal angle;

utilizing the detected light as part of a diffraction analysis; and

determining at least one characteristic of the diffraction grating structure by exploiting symmetry properties of the diffraction analysis such that the computation time for data is significantly reduced compared to the conventional computation at the first azimuthal angle, and is approximately the same as or is less than the computation time for the comparable classical mount at the first azimuthal angle and a same polar angle.

38. The method of claim 37, wherein the diffraction analysis comprises utilizing a rigorous coupled wave analysis.

39. The method of claim 38, wherein the symmetry properties comprise symmetry properties of the Fourier expansions of the rigorous coupled wave (RCW) analysis for the first azimuthal angle case, allowing the RCW eigen- and boundary problems to be reduced in complexity.

40. The method of claim 39, wherein the first azimuthal angle boundary problem is reduced to a 4(N+1)×4(N+1) system of equations and the eigen-problem is reduced to two (N+1)×(N+1) eigen-systems for a given truncation order, N.

41. The method of claim 37, wherein two data sets are utilized in the diffraction analysis for each polar angle, the data sets being comprised of two different polarizations at the first azimuthal angle.

42. The method of claim 37, further comprising utilizing multiple polar angles at the first azimuthal angle.

43. The method of claim 37, wherein the computation time for data at the first azimuthal angle case is reduced by a factor of approximately 8 over the conventional first azimuthal angle calculation.
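
By way of illustration only, and not as part of any claim, the matrix sizes recited in claims 5, 29 and 40 may be compared against an unreduced calculation with a short sketch. The sketch rests on assumptions that are not recited in the claims: that an unreduced expansion of truncation order N retains the 2N+1 orders from -N to N, so that the corresponding unreduced problems scale with 2N+1 instead of N+1, and that the cost of solving the eigen- and boundary problems scales as the cube of the matrix dimension. The function names are hypothetical. Under these assumptions the estimated speed-up approaches 8 as N grows, which is consistent with the factor of approximately 8 recited in claims 13 and 43.

```python
def harmonics(n, reduced):
    """Number of Fourier harmonics retained for truncation order n."""
    # Assumption: the unreduced expansion keeps orders -n..n (2n+1 terms);
    # the symmetry-reduced phi=90 formulation keeps n+1 independent terms.
    return n + 1 if reduced else 2 * n + 1


def problem_sizes(n, reduced):
    """Matrix dimensions of the two eigen-systems and the boundary system."""
    m = harmonics(n, reduced)
    return {"eigen_count": 2, "eigen": (m, m), "boundary": (4 * m, 4 * m)}


def cubic_cost_ratio(n):
    """Unreduced cost divided by reduced cost.

    Assumption: cost is proportional to (matrix dimension) ** 3, so the
    constant factors cancel and only the ratio of dimensions matters.
    """
    return (harmonics(n, False) / harmonics(n, True)) ** 3


if __name__ == "__main__":
    for n in (5, 10, 20, 50):
        print(n, problem_sizes(n, True), round(cubic_cost_ratio(n), 2))
```

The estimate is only a scaling argument. An actual implementation would also spend time on matrix assembly and on propagation through the layers, so measured ratios may differ from the values printed here.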