Method and system for molecular array scanner calibration
A method and system for calibrating molecular array scanners to a reference molecular array scanner, and for subsequently calibrating the molecular array scanners to maintain a constant signal-intensity-to-label-concentration ratio. In the first step of the two-step calibration method, a reference array coated with the fluorophore or chromophore used to label probe molecules is employed, while in the second step of the two-step method, a reference array coated with a stable dye is employed.
This application claims priority to copending application Ser. No. 10/086,932, filed Feb. 28, 2002, under 35 U.S.C. 120, the entirety of which is incorporated herein by reference.
TECHNICAL FIELD
The present invention is related to molecular array scanners and, in particular, to a method and system for calibrating molecular array scanners.
BACKGROUND OF THE INVENTION
The present invention is related to acquisition of molecular-array data and other types of genetic, biochemical, and chemical data from molecular arrays by molecular array scanners. A general background of molecular-array technology is first provided, in this section, to facilitate discussion of the scanning techniques described in following sections.
Array technologies have gained prominence in biological research and are likely to become important and widely used diagnostic tools in the healthcare industry. Currently, molecular-array techniques are most often used to determine the concentrations of particular nucleic-acid polymers in complex sample solutions. Molecular-array-based analytical techniques are not, however, restricted to analysis of nucleic acid solutions, but may be employed to analyze complex solutions of any type of molecule that can be optically or radiometrically scanned and that can bind with high specificity to complementary molecules synthesized within, or bound to, discrete features on the surface of an array. Because arrays are widely used for analysis of nucleic acid samples, the following background information on arrays is introduced in the context of analysis of nucleic acid solutions following a brief background of nucleic acid chemistry.
Deoxyribonucleic acid (“DNA”) and ribonucleic acid (“RNA”) are linear polymers, each synthesized from four different types of subunit molecules. The subunit molecules for DNA include: (1) deoxy-adenosine, abbreviated “A,” a purine nucleoside; (2) deoxy-thymidine, abbreviated “T,” a pyrimidine nucleoside; (3) deoxy-cytidine, abbreviated “C,” a pyrimidine nucleoside; and (4) deoxy-guanosine, abbreviated “G,” a purine nucleoside. The subunit molecules for RNA include: (1) adenosine, abbreviated “A,” a purine nucleoside; (2) uridine, abbreviated “U,” a pyrimidine nucleoside; (3) cytidine, abbreviated “C,” a pyrimidine nucleoside; and (4) guanosine, abbreviated “G,” a purine nucleoside.
The DNA polymers that contain the organization information for living organisms occur in the nuclei of cells in pairs, forming double-stranded DNA helices. One polymer of the pair is laid out in a 5′ to 3′ direction, and the other polymer of the pair is laid out in a 3′ to 5′ direction. The two DNA polymers in a double-stranded DNA helix are therefore described as being anti-parallel. The two DNA polymers, or strands, within a double-stranded DNA helix are bound to each other through attractive forces including hydrophobic interactions between stacked purine and pyrimidine bases and hydrogen bonding between purine and pyrimidine bases, the attractive forces emphasized by conformational constraints of DNA polymers. Because of a number of chemical and topographic constraints, double-stranded DNA helices are most stable when deoxy-adenylate subunits of one strand hydrogen bond to deoxy-thymidylate subunits of the other strand, and deoxy-guanylate subunits of one strand hydrogen bond to corresponding deoxy-cytidylate subunits of the other strand.
FIGS. 2A-B illustrate the hydrogen bonding between the purine and pyrimidine bases of two anti-parallel DNA strands.
Two DNA strands linked together by hydrogen bonds form the familiar helix structure of a double-stranded DNA helix.
Double-stranded DNA may be denatured, or converted into single-stranded DNA, by changing the ionic strength of the solution containing the double-stranded DNA or by raising the temperature of the solution. Single-stranded DNA polymers may be renatured, or converted back into DNA duplexes, by reversing the denaturing conditions, for example by lowering the temperature of the solution containing complementary single-stranded DNA polymers. During renaturing or hybridization, complementary bases of anti-parallel DNA strands form Watson-Crick (“WC”) base pairs in a cooperative fashion, leading to reannealing of the DNA duplex. Strictly A-T and G-C complementarity between anti-parallel polymers leads to the greatest thermodynamic stability, but partial complementarity including non-WC base pairing may also occur to produce relatively stable associations between partially-complementary polymers. In general, the longer the regions of consecutive WC base pairing between two nucleic acid polymers, the greater the stability of hybridization between the two polymers under renaturing conditions.
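The complementarity rule described above can be illustrated with a short Python sketch. It is an illustration only and assumes DNA strands written 5′ to 3′ over the letters A, T, G, and C, compared at a single full-length anti-parallel alignment with no offset. The names WC_PARTNER, reverse_complement, and longest_wc_run are hypothetical, and the run length is only a crude stand-in for stability, not a thermodynamic model.

```python
# Watson-Crick partners for DNA bases (A pairs with T, G pairs with C).
WC_PARTNER = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(strand):
    """Return the strand, written 5' to 3', that pairs perfectly with `strand`."""
    return "".join(WC_PARTNER[base] for base in reversed(strand))


def longest_wc_run(strand_a, strand_b):
    """Length of the longest run of consecutive WC pairs when two
    equal-length strands (both written 5' to 3') are aligned anti-parallel."""
    if len(strand_a) != len(strand_b):
        raise ValueError("this sketch compares equal-length strands only")
    best = run = 0
    # Reversing strand_b places its 3' end opposite the 5' end of strand_a.
    for a, b in zip(strand_a, reversed(strand_b)):
        run = run + 1 if WC_PARTNER[a] == b else 0
        best = max(best, run)
    return best


perfect = longest_wc_run("ATGC", reverse_complement("ATGC"))  # fully complementary
partial = longest_wc_run("ATGC", "GCTT")                      # one mismatch
```

In this sketch a fully complementary pair yields a run equal to the strand length, while a mismatch breaks the run, mirroring the statement that longer regions of consecutive WC pairing give more stable hybridization.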
The ability to denature and renature double-stranded DNA has led to the development of many extremely powerful and discriminating assay technologies for identifying the presence of DNA and RNA polymers having particular base sequences or containing particular base subsequences within complex mixtures of different nucleic acid polymers, other biopolymers, and inorganic and organic chemical compounds. One such methodology is the array-based hybridization assay.
Once an array has been prepared, the array may be exposed to a sample solution of target DNA or RNA molecules (410-413 in
Array-based hybridization techniques allow extremely complex solutions of DNA molecules to be analyzed in a single experiment. An array may contain from hundreds to tens of thousands of different oligonucleotide probes, allowing for the detection of a subset of complementary sequences from a complex pool of different target DNA or RNA polymers. In order to perform different sets of hybridization analyses, arrays containing different sets of bound oligonucleotides are manufactured by any of a number of complex manufacturing techniques. These techniques generally involve synthesizing the oligonucleotides within corresponding features of the array through a series of complex iterative synthetic steps, or depositing oligonucleotides isolated from biological material.
As pointed out above, array-based assays can involve other types of biopolymers, synthetic polymers, and other types of chemical entities. For example, one might attach protein antibodies to features of the array that would bind to soluble labeled antigens in a sample solution. Many other types of chemical assays may be facilitated by array technologies. For example, polysaccharides, glycoproteins, synthetic copolymers, including block copolymers, biopolymer-like polymers with synthetic or derivatized monomers or monomer linkages, and many other types of chemical or biochemical entities may serve as probe and target molecules for array-based analysis. A fundamental principle upon which arrays are based is that of specific recognition, by probe molecules affixed to the array, of target molecules, whether by sequence-mediated binding affinities, binding affinities based on conformational or topological properties of probe and target molecules, or binding affinities based on spatial distribution of electrical charge on the surfaces of target and probe molecules.
An “array”, unless a contrary intention appears, includes any one, two or three dimensional arrangement of addressable regions bearing a particular chemical moiety or moieties (for example, biopolymers such as polynucleotide sequences) associated with that region. An array is “addressable” in that it has multiple regions of different moieties (for example, different polynucleotide sequences) such that a region (a “feature” or “spot” of the array) at a particular predetermined location (an “address”) on the array will detect a particular target or class of targets (although a feature may incidentally detect non-targets of that feature). Array features are typically, but need not be, separated by intervening spaces. In the case of an array, the “target” will be referenced as a moiety in a mobile phase (typically fluid), to be detected by probes (“target probes”) which are bound to the substrate at the various regions. However, either of the “target” or “target probes” may be the one which is to be evaluated by the other (thus, either one could be an unknown mixture of polynucleotides to be evaluated by binding with the other). An “array layout” refers collectively to one or more characteristics of the features, such as feature positioning, one or more feature dimensions, and the chemical moiety or mixture of moieties at a given feature. “Hybridizing” and “binding”, with respect to polynucleotides, are used interchangeably.
Any given substrate may carry one, two, four or more arrays disposed on a front surface of the substrate. Depending upon the use, any or all of the arrays may be the same or different from one another and each may contain multiple spots or features. A typical array may contain more than ten, more than one hundred, more than one thousand, more than ten thousand, or even more than one hundred thousand features, in an area of less than 20 cm² or even less than 10 cm². For example, features may have widths (that is, diameter, for a round spot) in the range from 10 μm to 1.0 cm. In other embodiments each feature may have a width in the range of 1.0 μm to 1.0 mm, usually 5.0 μm to 500 μm, and more usually 10 μm to 200 μm. Non-round features may have area ranges equivalent to that of circular features with the foregoing width (diameter) ranges. At least some, or all, of the features may be of different compositions (for example, when any repeats of each feature composition are excluded the remaining features may account for at least 5%, 10%, or 20% of the total number of features). Interfeature areas will typically (but not essentially) be present which do not carry any polynucleotide (or other biopolymer of a type of which the features are composed). Such interfeature areas typically will be present where the arrays are formed by processes involving drop deposition of reagents but may not be present when, for example, photolithographic array fabrication processes are used. It will be appreciated though, that the interfeature areas, when present, could be of various sizes and configurations.
The array features can have widths (that is, diameter, for a round spot) in the range from a minimum of about 10 μm to a maximum of about 1.0 cm. In embodiments where very small spot sizes or feature sizes are desired, material can be deposited according to the invention in small spots whose width is in the range about 1.0 μm to 1.0 mm, usually about 5.0 μm to 500 μm, and more usually about 10 μm to 200 μm. Features which are not round may have areas equivalent to the area ranges of round features 16 resulting from the foregoing diameter ranges.
Each array may cover an area of less than 100 cm², or even less than 50, 10 or 1 cm². In many embodiments, the substrate carrying the one or more arrays will be shaped generally as a rectangular solid (although other shapes are possible), having a length of more than 4 mm and less than 1 m, usually more than 4 mm and less than 600 mm, more usually less than 400 mm; a width of more than 4 mm and less than 1 m, usually less than 500 mm and more usually less than 400 mm; and a thickness of more than 0.01 mm and less than 5.0 mm, usually more than 0.1 mm and less than 2 mm and more usually more than 0.2 mm and less than 1 mm. With arrays that are read by detecting fluorescence, the substrate may be of a material that emits low fluorescence upon illumination with the excitation light. Additionally in this situation, the substrate may be relatively transparent to reduce the absorption of the incident illuminating laser light and subsequent heating if the focused laser beam travels too slowly over a region. For example, substrate 10 may transmit at least 20%, or 50% (or even at least 70%, 90%, or 95%), of the illuminating light incident on the front as may be measured across the entire integrated spectrum of such illuminating light or alternatively at 532 nm or 633 nm.
Once the labeled target molecule has been hybridized to the probe on the surface, the array may be scanned by an appropriate technique, such as by optical scanning in cases where the labeling molecule is a fluorophore or by radiometric scanning in cases where the signal is generated through a radioactive decay of labeled target. In the case of optical scanning, more than one fluorophore can be excited, with each different wavelength at which an array is scanned producing a different signal. In optical scanning, it is common to describe the signals produced by scanning in terms of the colors of the wavelengths of light employed for the scan. For example, a red signal is produced by scanning the array with light having a wavelength corresponding to that of visible red light.
Scanning of a feature by an optical scanning device or radiometric scanning device generally produces a scanned image comprising a rectilinear grid of pixels, with each pixel having a corresponding signal intensity. These signal intensities are processed by an array-data-processing program that analyzes data scanned from an array to produce experimental or diagnostic results which are stored in a computer-readable medium, transferred to an intercommunicating entity via electronic signals, printed in a human-readable format, or otherwise made available for further use. Molecular array experiments can indicate precise gene-expression responses of organisms to drugs, other chemical and biological substances, environmental factors, and other effects. Molecular array experiments can also be used to diagnose disease, for gene sequencing, and for analytical chemistry. Processing of molecular array data can produce detailed chemical and biological analyses, disease diagnoses, and other information that can be stored in a computer-readable medium, transferred to an intercommunicating entity via electronic signals, printed in a human-readable format, or otherwise made available for further use.
A scan system causes a light spot from each laser 800a-b to be moved in a regular pattern about the surface of the molecular array. The molecular array is mounted to a stage that can be moved in horizontal and vertical directions to position light from the lasers onto a particular region at the surface of the molecular array, from which region fluorescent emission is passed back to the photodetectors via the optical path described above. An autofocus detector 870 is provided to sense and correct any offset between different regions of the molecular array and the focal plane of the system during scanning. An autofocus system includes detector 870, processor 880, and a motorized adjuster to move the stage in the direction of arrow 896.
The controller 880 receives signals from photodetectors 850a-b, called “channels,” corresponding to the intensity of the green and red fluorescent light emitted by probe labels excited by the laser light. The controller 880 also receives a signal from autofocus offset detector 870 in order to control stage adjustment, provides the control signal to the EOMs 810a-b, and controls the scan system. Controller 880 may also analyze, store, and output data relating to emitted signals received from detectors 850a-b.
Pixel-based signal intensities produced by molecular array scanners often correspond to absolute concentrations of mRNA molecules or other chemical, biological, or pharmaceutical compounds in sample solutions. It is important therefore that the pixel-based signal intensities produced by different molecular array scanners for a given number of fluorophores or chromophores within a region of the surface of a molecular array be identical. Thus, molecular array scanners must be calibrated to a common standard. Unfortunately, methods for precisely calibrating molecular array scanners have been elusive. Designers, manufacturers, and users of molecular array scanners have thus recognized a need for a method for precisely calibrating molecular array scanners to a common standard.
SUMMARY OF THE INVENTION
One embodiment of the present invention provides a two-step molecular array scanner calibration method for calibrating molecular array scanners to a reference molecular array scanner, and for subsequently calibrating the molecular array scanners to maintain a constant signal-intensity-to-label-concentration ratio. In the first step of the two-step calibration method, a reference array coated with the fluorophore or chromophore used to label probe molecules is employed, while in the second step of the two-step method, a reference array coated with a stable dye is employed.
BRIEF DESCRIPTION OF THE DRAWINGS
FIGS. 2A-B illustrate the hydrogen bonding between the purine and pyrimidine bases of two anti-parallel DNA strands.
FIGS. 11A-B illustrate dye and scanner properties that inhibit precise calibration using same-dye and stable-dye reference arrays.
One embodiment of the present invention provides a means for calibrating molecular array scanners to a reference molecular array scanner and subsequently calibrating the molecular array scanners to maintain the initial calibration. This two-step calibration method uses a first reference array coated with the same dye that is subsequently used in probe molecules and scanned by the molecular array scanners during data collection, and uses a second reference array coated with a more stable dye that does not degrade significantly over repeated scans.
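The two-step method can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the implementation of the invention. It assumes that the same-dye signal decays exponentially with scan count (a hypothetical model; the method requires only some function of expected intensity per scan), that scanner response is linear in a single gain parameter, and that the function names, gain parameter, and intensity values are invented for illustration.

```python
import math


def fit_decay(intensities):
    """Fit I(n) = I0 * exp(-k * n) to intensities measured on scans 1..N of
    the same-dye reference array in the reference scanner, using a
    least-squares line through (n, ln I). Exponential decay is an assumption."""
    if len(intensities) < 2:
        raise ValueError("need at least two scans to fit a decay curve")
    scans = range(1, len(intensities) + 1)
    logs = [math.log(i) for i in intensities]
    n_mean = sum(scans) / len(logs)
    log_mean = sum(logs) / len(logs)
    sxx = sum((n - n_mean) ** 2 for n in scans)
    sxy = sum((n - n_mean) * (y - log_mean) for n, y in zip(scans, logs))
    slope = sxy / sxx
    return math.exp(log_mean - slope * n_mean), -slope  # (I0, k)


def expected_intensity(i0, k, scans_so_far):
    """Intensity the reference scanner would be expected to report on the
    next scan, i.e. scan number scans_so_far + 1."""
    return i0 * math.exp(-k * (scans_so_far + 1))


def adjusted_gain(current_gain, measured, target):
    """Rescale a hypothetical gain so that `measured` would become `target`,
    assuming the reported intensity is proportional to the gain."""
    return current_gain * target / measured


# Step 1: initial calibration with the same-dye reference array.
reference_scans = [1000.0, 950.0, 902.5]  # invented reference-scanner readings
i0, k = fit_decay(reference_scans)
target = expected_intensity(i0, k, scans_so_far=len(reference_scans))
gain_after_step1 = adjusted_gain(1.0, measured=800.0, target=target)

# Step 2: maintenance with the stable-dye reference array. The baseline is
# recorded right after step 1; a later rescan that drifts from it is corrected.
stable_baseline = 500.0
later_reading = 480.0
gain_after_step2 = adjusted_gain(gain_after_step1, later_reading, stable_baseline)
```

In the sketch, the same-dye array supplies a target that accounts for dye degradation over earlier scans, while the stable-dye array supplies a fixed baseline against which later drift of the scanner itself can be corrected.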
FIGS. 11A-B illustrate dye and scanner properties that inhibit precise calibration using same-dye and stable-dye reference arrays. It should be noted that the relationships graphed in FIGS. 11A-B, and
In
In order to solve the same-dye reference array and stable-dye reference array problems, described above with reference to
Although the present invention has been described in terms of a particular embodiment, it is not intended that the invention be limited to this embodiment. Modifications within the spirit of the invention will be apparent to those skilled in the art. For example, scanning of reference arrays to produce integrated signals may be accomplished in many different ways. A constant scan pattern may be employed, and signal intensities associated with resulting pixels averaged or combined in more complex ways to produce an aggregate signal intensity. Alternatively, calibration may be iterated over a set of particular pixels. Calibration may be independently carried out for each signal channel using different reference arrays or a single reference array coated with a mixture of dye compounds responsive to the laser light of different wavelengths produced by the lasers within the molecular array scanner. Subsequent, internal calibrations can be carried out using a single stable-dye reference array, or using one stable-dye reference array for each molecular array scanner or for each set of molecular array scanners.
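As one illustration of combining pixel intensities into an aggregate signal intensity, the following Python sketch computes either a plain average or a trimmed average over a rectilinear grid of pixel values. The function name aggregate_intensity, the trim_fraction parameter, and the sample values are hypothetical, and the trimmed mean is only one assumed example of a more complex combination.

```python
def aggregate_intensity(pixels, trim_fraction=0.0):
    """Combine a grid of pixel intensities into one aggregate value.

    With trim_fraction == 0 this is the plain mean. Otherwise that fraction
    of the lowest and of the highest values is discarded before averaging,
    which reduces the influence of outlier pixels such as dust or scratches.
    """
    if not 0.0 <= trim_fraction < 0.5:
        raise ValueError("trim_fraction must be in [0, 0.5)")
    values = sorted(v for row in pixels for v in row)
    if not values:
        raise ValueError("no pixels supplied")
    drop = int(len(values) * trim_fraction)
    kept = values[drop:len(values) - drop]
    return sum(kept) / len(kept)


scan = [[10.0, 12.0],
        [11.0, 99.0]]  # invented values; 99.0 plays the role of an outlier
plain_mean = aggregate_intensity(scan)
trimmed_mean = aggregate_intensity(scan, trim_fraction=0.25)
```

With these sample values the plain mean is pulled upward by the single bright pixel, while the trimmed mean stays close to the typical pixel intensity.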
The foregoing description, for purposes of explanation, used specific nomenclature to provide a thorough understanding of the invention. However, it will be apparent to one skilled in the art that the specific details are not required in order to practice the invention. The foregoing descriptions of specific embodiments of the present invention are presented for purposes of illustration and description. They are not intended to be exhaustive or to limit the invention to the precise forms disclosed. Obviously, many modifications and variations are possible in view of the above teachings. The embodiments are shown and described in order to best explain the principles of the invention and its practical applications, to thereby enable others skilled in the art to best utilize the invention and various embodiments with various modifications as are suited to the particular use contemplated. It is intended that the scope of the invention be defined by the following claims and their equivalents:
Claims
1-10. (canceled)
11. A method for calibrating a first molecular array scanner with a second, reference molecular array scanner, the method comprising:
- calibrating said first molecular array scanner with said second, reference molecular array scanner by using a same-dye reference array that comprises a first dye; and
- maintaining calibration of said first molecular array scanner with said second, reference molecular array scanner by using a stable-dye reference array that comprises a second dye.
12. The method according to claim 11, wherein said first molecular array scanner is calibrated using said stable-dye reference array to maintain a constant signal-intensity-to-dye-concentration ratio.
13. The method according to claim 11, wherein said second dye is more stable than said first dye.
14. The method according to claim 11, wherein said first dye is a fluorophore or chromophore.
15. The method according to claim 11, wherein said same-dye reference array comprises said first dye coated on a surface of a substrate.
16. The method according to claim 11, wherein said second dye is a fluorophore or chromophore.
17. The method according to claim 11, wherein said stable-dye reference array comprises said second dye coated on a surface of a substrate.
18. The method of claim 11 wherein said same-dye reference array is used to initially calibrate the first molecular array scanner with the second, reference molecular array scanner by:
- scanning the same-dye reference array in the second, reference molecular array scanner to determine a measured signal intensity for the same-dye reference array in the reference molecular array scanner;
- calculating an expected intensity for scanning the same-dye reference array in the second, reference molecular array scanner a second time; and
- scanning the same-dye reference array in the first scanner, and adjusting parameters in the first molecular array scanner to produce the expected intensity.
19. The method of claim 18 wherein calculating an expected intensity for scanning the same-dye reference array in the second, reference molecular array scanner a second time further comprises:
- determining a function of expected intensity per scan of the same-dye reference array; and
- selecting the expected intensity corresponding to one more than a number of times that the same-dye reference array has been scanned.
20. The method of claim 11 wherein maintaining the initial calibration using one or more stable-dye reference arrays further includes:
- following initial calibration, scanning a stable-dye reference array with the first molecular array scanner in order to determine a signal-intensity-to-stable-dye-concentration ratio; and
- periodically rescanning the stable-dye reference array with the first molecular array scanner, adjusting the first molecular array scanner to provide the determined signal-intensity-to-stable-dye-concentration ratio.
21. A computer readable medium having recorded thereon signal intensity data, scanned from the surface of a molecular array by a molecular array scanner calibrated to a reference molecular array by the method of claim 11.
22. A system for calibrating a number of molecular array scanners to provide a fixed signal-intensity-to-label-concentration ratio, the system comprising:
- a reference molecular array scanner;
- a same-dye reference array comprising a first dye; and
- a stable-dye reference array comprising a second dye that is different from said first dye.
23. The system according to claim 22, wherein said second dye is more stable than said first dye.
24. The system according to claim 22, wherein said first dye is a fluorophore or chromophore.
25. The system according to claim 22, wherein said same-dye reference array comprises said first dye coated on a surface of a substrate.
26. The system according to claim 22, wherein said second dye is a fluorophore or chromophore.
27. The system according to claim 22, wherein said stable-dye reference array comprises said second dye coated on a surface of a substrate.
28. The system of claim 22 configured so that the same-dye reference array is used to establish an initial calibration of a first molecular array scanner to the reference molecular array scanner by:
- scanning the same-dye reference array in the reference molecular array scanner to determine a measured signal intensity for the same-dye reference array in the reference molecular array scanner;
- calculating an expected intensity for scanning the same-dye reference array in the reference molecular array scanner a second time; and
- adjusting the first molecular array scanner to produce the respective calculated expected intensity.
29. The system of claim 28 wherein calculating the expected intensity for subsequently scanning the same-dye reference array in the first molecular array scanner further comprises:
- determining a function of expected intensity per scan of the same-dye reference array; and
- selecting the expected intensity for the first molecular array scanner corresponding to one more than a number of times that the same-dye reference array has been scanned.
30. The system of claim 22 configured so that one or more stable-dye reference arrays are used to maintain the initial calibration of the first molecular array scanner by:
- scanning a stable-dye reference array with the molecular array scanner in order to determine a signal-intensity-to-stable-dye-concentration ratio; and
- periodically rescanning the stable-dye reference array with the molecular array scanner, adjusting the molecular array scanner to provide the determined signal-intensity-to-stable-dye-concentration ratio.
31. A computer readable medium having recorded thereon signal intensity data, scanned from the surface of a molecular array by a molecular array scanner calibrated to a reference molecular array by the system of claim 22.
Type: Application
Filed: Jul 21, 2005
Publication Date: Jan 26, 2006
Inventors: John Corson (Stanford, CA), Andreas Dorsel (Menlo Park, CA), Russell Parker (San Jose, CA), Andre Chow (Belmont, CA)
Application Number: 11/188,141
International Classification: G06F 19/00 (20060101); G01N 33/20 (20060101);