FINE-TUNED ULTRASPECIFIC NUCLEIC ACID HYBRIDIZATION PROBES

Info

Publication number: 20210230691
Type: Application
Filed: Dec 30, 2020
Publication Date: Jul 29, 2021
Applicant: William Marsh Rice University (Houston, TX)
Inventors: David Yu ZHANG (Houston, TX), Juexiao WANG (Houston, TX), Ruojia WU (Houston, TX)
Application Number: 17/137,853

Abstract

Compositions and methods for highly specific nucleic acid probes and primers are provided. The probe system comprises a complement strand and a protector stand that form a partially double-stranded probe. The reaction standard free energy of hybridization between the probe and target nucleic acid as determined by Expression 1 (ΔG°rxn=ΔG°t-TC−ΔG°nh-PC+(ΔG°v-TC−ΔG°h-PC)) is from about −4 kcal/mol to about +4 kcal/mol. Alternatively, the reaction standard free energy of hybridization between the probe and target nucleic acid is determined by Expression 1 to be within 5 kcal/mol of the standard free energy as determined by Expression 2 (−Rτln(([P]0−[C]0)/[C]0)]), where the [P]0 term of Expression 2 equals the concentration of the protector strand and the [C]0 term of Expression 2 equals the concentration of the complement strand. In addition, a method for on-the-fly fine tuning of a reaction using the present probe is provided.

Description

Description

CROSS REFERENCE TO RELATED APPLICATIONS

The present application is a divisional of U.S. application Ser. No. 15/174,373, filed Jun. 6, 2016, which is a Continuation application of International Application No. PCT/US14/52827, filed Aug. 27, 2014, which claims priority to U.S. Provisional Application No. 61/916,321 filed Dec. 16, 2013, each of which is incorporated herein by reference in its entirety.

BACKGROUND

Small differences in DNA and RNA sequence can lead to big differences in health. For example, a single-base change in a bacterial genome can lead to antibiotic resistance, and a single-base change in a human genome can lead to cancer remission. With the maturation of the genomics field and the accompanying discovery of many nucleic acid biomarker sequences and molecules, there is a strong demand from the biotechnology industry to develop reliable, robust, inexpensive, and precise nucleic acid assays that can discriminate single-base changes. Enzyme-based discrimination methods for nucleic acid sequence differences are difficult to integrate with a wide variety of technologies because enzymes demand specific temperatures and buffer conditions.

Enzyme-free techniques to ensure highly specific hybridization of nucleic acids to their complements has traditionally relied on the optimization of melting temperature, but this is difficult to precisely predict and control. Recently, toehold hybridization probes have been demonstrated in which single-base changes in nucleic acid sequences can be robustly discriminated across a wide range of temperatures and salinities. These probes are designed to react with their intended targets with reaction standard free energy (ΔG°_rxn) close to zero, so that hybridization yield is close to 50% for the intended target. A variant of the target that differs by even a single nucleotide will bind to the probe with significantly less yield (median 2%).

To achieve the ΔG°_rxn≈0 property, these probes balance the binding energies of a target-specific “toehold” region with that of a target-nonhomologous “balance” region. DNA probes have been experimentally demonstrated to function robustly to discriminate DNA targets, and RNA probes have been experimentally demonstrated to function robustly to discriminate RNA targets.

These probes, however, suffer from several limitations. For example, when the probe and the target are of different forms, such as when DNA probes are designed specifically to RNA targets, 2′-O-methyl RNA probes are designed to bind RNA targets, and when LNA probes are designed to specifically bind DNA targets, the differences in hybridization thermodynamics between nucleic acid molecules of different forms result in poor probe design, with either low specificity or low sensitivity. Additionally, the thermodynamic binding strength of individual base pairs/stacks are relatively large, practically precluding fine-tuning of the reaction ΔG°_rxn, which in turn limits the tunability of the tradeoff between probe system specificity and sensitivity. Furthermore, published DNA and RNA hybridization thermodynamic parameters are known to be incomplete and/or inaccurate in certain conditions. An in silico designed probe system may possess a real ΔG°_rxnthat differs significantly from the calculated ΔG°_rxn; without a method of fine-tuning probe performance, iterative trial-and-error must be employed to achieve an optimal probe design with the desired ΔG°_rxn.

SUMMARY

The present disclosure provides, according to certain instances, highly specific nucleic acid hybridization probe systems, which reliably discriminate single-base changes in target nucleic acids. Compared to previous work, the probe systems described in the present disclosure excel in (1) reliably probing DNA, RNA, and modified nucleic acid targets with DNA, RNA, and other nucleic acid probes, and (2) enabling fine-tuning of the tradeoff between sensitivity and specificity. The compositions and methods of the present disclosure may be useful in, among other things, molecular cancer diagnostics, infectious disease diagnostics, food safety diagnostics, and research discovery tools based on DNA and RNA detection and quantification.

In one instance, a composition for selective interaction with a target nucleic acid molecule is provided. The composition comprises a first concentration of a first nucleic acid strand comprising a first region, second region, and third region, and a second concentration of a second nucleic acid strand comprising a fourth region and fifth region. The target nucleic acid comprises a sixth and seventh region of a nucleotide sequence that is at least partially, if not fully, complementary to a nucleotide sequence of the first and second regions, respectively. The first and second concentrations are such that the interaction between the target nucleic acid and the composition possesses a standard free energy (ΔG°_rxn) as determined by Expression 1 [ΔG°_rxn=ΔG°_t-TC−ΔG°_nh-PC+(ΔG°_v-TC−ΔG°_h-PC)] within 5 kcal/mol of a standard free energy as determined by Expression 2 (−Rτln(([P]₀−[C]₀)/[C]₀)]), where the [P]₀term of Expression 2 equals the second concentration, and the [C]₀term of Expression 2 equals the first concentration, R equals the universal gas constant 8.314 J/mol·K, and τ equals the temperature in Kelvin. In this instance, the ΔG°_t-Tcterm of Expression 1 represents the standard free energy of hybridization between the sixth region and the first region; the ΔG°_nh-PCterm of Expression 1 represents the free energy of hybridization between the fifth region and the third region; the ΔG°_v-TCterm of Expression 1 represents the standard free energy of hybridization between the seventh region and the second region; and the ΔG°_h-PCterm of Expression 1 represents the standard free energy of hybridization between the fourth region and the second region. The method of calculating ΔG° values is described in detail later in the description. In certain instances, the concentration of the target nucleic acid is smaller than the first concentration. In certain other instances, the concentration of the target nucleic acid is equal to or greater than the first concentration.

In another instance, the sequences of the first, second, third, fourth, fifth, sixth, and seventh regions are such that the interaction between the target nucleic acid and the composition possesses a standard free energy (ΔG°_rxn) as determined by Expression 1 [ΔG°_rxn=ΔG°_t-TC−ΔG°_nh-PC+(ΔG°_v-TC−ΔG°_h-PC)] of about −4 kcal/mol and +4 kcal/mol, while [ΔG°_t-TC−ΔG°_nh-PC] is not between −1 kcal/mol and +1 kcal/mol. In other instances, the values of ΔG°_t-TCand ΔG°_nh-PCare not within 10% of each other.

In another instance, the target nucleic acid further comprises an eighth region adjacent to the seventh region, such that the eighth region nucleotide sequence is not complementary to the third region nucleotide sequence, with fewer than 50% of the aligned nucleotides paired between the eighth and the third region at equilibrium.

In another instance, a process for creating a nucleic acid probe is provided. The process comprises the following steps: selecting a target nucleotide sequence in a nucleic acid molecule, the target nucleotide sequence comprising a sixth nucleotide subsequence and a seventh nucleotide subsequence; selecting a first nucleotide sequence comprising a first nucleotide subsequence, a second nucleotide subsequence, and a third nucleotide subsequence; and selecting a second nucleotide sequence comprising a fourth nucleotide subsequence and a fifth nucleotide subsequence. In this instance, the steps of selecting the first, second, and target nucleotide sequences are based on the interactions between such possessing a standard free energy from about −4 kcal/mol to about +4 kcal/mol as determined by Expression 1 [ΔG°_rxn=ΔG°_t-TC−ΔG°_nh-PC+(ΔG°_v-TC−ΔG°_h-PC)], wherein the ΔG°_t-TCterm of Expression 1 represents the standard free energy of hybridization between the sixth region and the first region, wherein the ΔG°_nh-PCterm of Expression 1 represents the free energy of hybridization between the fifth region and the third region, wherein the ΔG°_v-TCterm of Expression 1 represents the standard free energy of hybridization between the seventh region and the second region, and wherein the ΔG°_h-PCterm of Expression 1 represents the standard free energy of hybridization between the fourth region and the second region. The process further comprises the step of synthesizing a first nucleotide strand comprising the first nucleotide sequence and a second nucleotide strand comprising the second nucleotide sequence.

In addition to selection of the relevant nucleotide sequences based on Expression 1, the process may alternatively or further comprise selecting the first and second concentrations such that the standard free energy as determined by Expression 2 (−Rτln(([P]₀−[C]₀)/[C]₀)) is within 5 kcal/mol of the standard free energy as determined by Expression 1 (ΔG°_rxn) where the terms [C]₀and [P]₀of Expression 2 represent a predetermined concentration of the first nucleotide strand and the second nucleotide strand, respectively, R equals the universal gas constant 8.314 J/mol·K, and τ equals the temperature in Kelvin. In one instance, if the standard free energy as determined by Expression 1 is not within 5 kcal/mol of the standard free energy as determined by Expression 2, then the predetermined concentration of at least one of the first nucleic acid strand or the second nucleic acid strand may be modified until this condition is met. Alternatively, optimization may occur by repeating the steps of the process and selecting modified nucleotide sequences that meet the desired free energy conditions.

A method for identifying the presence or quantity of a nucleic acid molecule bearing the target nucleotide sequence in a sample is provided. The method comprises applying a probe to a sample possibly comprising a target nucleic acid molecule and operating the hybridization reaction at a temperature from about 4° C. to about 75° C., from about 25° C. to about 70° C., or from about 37° C. to about 65° C., or any temperature range there between, to permit hybridization of the probe to the target nucleic acid molecule, if the target nucleic acid molecule is present in the sample. In this instance, the probe comprises a first nucleic acid strand and a second nucleic acid strand. The first nucleic acid strand comprises a first region, a second region, and a third region, wherein the first region possesses a nucleotide sequence that is complementary to a nucleotide sequence of a sixth region of the target nucleic acid molecule, and wherein the second region possesses a nucleotide sequence that is complementary to a nucleotide sequence of a seventh region of the target nucleic acid molecule. The second nucleic acid strand comprising a fourth region and a fifth region, wherein the fourth region possesses a nucleotide sequence that is complementary to the nucleotide sequence of the second region, and wherein the fifth region possesses a nucleotide sequence that is complementary to the nucleotide sequence of the third region. In one instance, the target nucleic acid molecule is RNA.

A method for selectively amplifying a target nucleic acid sequence from a sample, said method comprising applying the probe as an enzymatic primer to a mixture comprising the sample, a DNA or RNA polymerase, and a mixture of nucleotide triphosphates. In some instances, the mixture further comprises an additional DNA or RNA primer, or an additional enzyme, such as a nicking enzyme, a recombinase, a helicase, a restriction enzyme, a nuclease, or a ligase. In some instances, the combination of the probe and the mixture are allowed to react isothermally for between 1 minute and 72 hours. In some instances, the combination of the probe and the mixture are allowed to react through a number of temperature cycles, varying between 5 and 200 cycles.

The features and advantages of the present disclosure will be readily apparent to those skilled in the art upon a reading of the description of the instances that follows.

DRAWINGS

Some specific example instances of the disclosure may be understood by referring, in part, to the following description and the accompanying drawings.

FIG. 1 provides one embodiment of a suitable nucleic acid probe system 10 for use in the present invention. Probe system 10 comprises a complement strand C (also referred to herein as the “first strand” and a protector strand P (also referred to herein as the “second strand”) designed with respect to a target nucleic acid T (also referred to as “target nucleic acid molecule,” or “target nucleic acid strand”). Complement strand C includes a target-toehold-complementary region 1 (also referred to herein as the “first region”), a target-homologous complementary region 2 (also referred to herein as the “second region”), and a target-nonhomologous-complementary region 3 (also referred to herein as the “third region”). The protector comprises a target-homologous region 4 (also referred to herein as the “fourth region”) and a target-nonhomologous region 5 (also referred to herein as the “fifth region”). The target comprises a target-toehold region 6 (also referred to herein as the “sixth region”) and a target-validation region 7 (also referred to herein as the “seventh region”). In certain embodiments, the target may further comprise a target upstream region 8, and/or additional unnamed upstream and downstream regions. The target homologous region 4 of protector P may differ in sequence from the target validation region 7 of target T, for example in the instance protector P and target T are different types of nucleic acids (e.g., RNA vs. DNA). As used herein, the term “region” when referring to the probe system or target nucleic acid defines a group of contiguous nucleotide bases that act as a functional unit in hybridization and dissociation.

FIG. 2A provides an exemplary probe system 10 and its reaction with target nucleic acid T and FIG. 2B provides and exemplary probe system 10 and its reaction with a variant target V having a single-base difference 12 than target T in the target validation region 7. Referring now to FIG. 2A, probe system 10 is designed such that the standard free energy of the hybridization reaction of probe system 10 with intended target T (ΔG°_rxn) is approximately equal to (−Rτln(([P]₀−[C]₀)/[C]₀)) (Expression 2), and ensures a medium to high yield of complement strand C bound to target T. Referring now to FIG. 2B, probe system 10 reacts with variant target V with a standard free energy ΔG°_vthat is more positive than ΔG°_rxnby ΔΔG°_SNP, (i.e., ΔG°_v=ΔG°_rxn+ΔΔG°_SNP) where ΔΔG°_SNPdenotes the relative thermodynamic penalty of the single base change. This results in probe system 10 having a much lower binding yield for variant target V due to the single base mismatch 12 as compared to the intended target T.

FIG. 3 provides the various standard free energies of the binding region components that are used in the present invention to calculate the reaction standard free energy (ΔG°_rxn).

FIG. 4 provides a distribution of (ΔG°_v-TC−ΔG°_h-PC) values for 46 different 50 nt non-overlapping subsequences of BRAF expressed (exonic) mRNA at different temperatures, assuming that the first nucleic acid molecule and second nucleic acid molecules are both DNA. As can be seen, there is a wide spread, with the largest values over 20 kcal/mol greater than the smallest values. Considering that a 1.4 kcal/mol difference in ΔG°_rxncan lead to a factor of 10 difference in specificity or sensitivity, the results here demonstrate that the ΔG°_v-TC−ΔG°_h-PCterm should be considered in the design of the probes described herein and thereby improves upon prior art design parameters.

FIG. 5 provides the standard free energy contribution of differential label thermodynamics (ΔG°_label=ΔG°_F−ΔG°_FQ). In this instance, the label of protector strand P is a quencher Q that is specific to fluorophore F of complement strand C.

FIG. 6 is a representation of one aspect of the present method for tuning probe system behavior. Specifically, in addition to modulation of reaction standard free energy (ΔG°_rxn) via addition or removal of base stacks (modification of value of Expression 1), modulation of stoichiometry (ratio of the concentrations of P to C) can be utilized to control the tradeoff between specificity and sensitivity of the probe and provides a more effective method of doing so (modifying the value of Expression 2). Here, the target concentration is assumed to be smaller than the first concentration, and the sensitivity is calculated as the equilibrium binding yield of the intended target [TC]/([T]+[TC]), and the specificity is calculated as one minus the binding yield of a target variant 1−[VC]/([V]+[VC])=[V]/([V]+[VC]). In this figure, the variant differs from the target by a single base, and possesses ΔΔG°_SNP=+2 kcal/mol at 37° C. Modulating P to C stoichiometry is also beneficial when ΔG°_rxncannot be accurately calculated in silico by allowing rescue of probe systems with real ΔG°_rxnthat differs by up to 5 kcal/mol from their calculated ΔG°_rxn.

FIGS. 7A, 7B and 7C represent variant probe system designs. FIG. 7A represents a probe system with opposite 5′/3′ orientation. FIG. 7B represents a probe system in which region 1 is embedded within region 2, or in which region 1 exists between regions 2 and 3. FIG. 7C represents a probe system in which regions 2 and 4 are not perfectly complementary, or in which regions 3 and 5 are not perfectly complementary.

FIG. 8 provides a graphical representation of the different desired yield for target binding, and the tradeoff between specificity and sensitivity. If the reaction standard free energy ΔG°_rxnas determined by Expression 1 (or Expression 3 if a label is used) deviates from (Expression 2 or −Rτln(([P]₀−[C]₀)/[C]₀)]) by free energy deviation X, the yield of the target binding will likewise change. For positive values of X, the specificity (against a target variant V) will be improved, but sensitivity (yield) will be reduced. For negative values of X, the sensitivity will be improved, but specificity will be reduced. For particular applications, either specificity or sensitivity may be more important, and the ability to fine-tune thermodynamics via methods presented herein improves upon the prior art.

FIG. 9 provides an exemplary probe of the present disclosure (Example 1; the Protector has a sequence according to SEQ ID NO: 3, the Complement has a sequence according to SEQ ID NO: 4, and the Target has a sequence according to SEQ ID NO: 5) targeting a BRAF expressed mRNA subsequence at nucleotides 11-30 at τ=37° C., [Na⁺]=1M. Based on literature parameters, ΔG°_rxnis calculated to be +0.15 kcal/mol and ([P]₀−[C]₀)/[C]₀=0.78 is recommended to achieve X=0. At ([P]₀−[C]₀)/[C]₀=7.8, X is +1.42 kcal/mol, and at ([P]₀−[C]₀)/[C]₀=0.10, X is −1.27 kcal/mol.

FIG. 10 provides an exemplary probe of the present disclosure (Example 2; the Protector has a sequence according to SEQ ID NO: 6, the Complement has a sequence according to SEQ ID NO: 7, and the Target has a sequence according to SEQ ID NO: 8) targeting a BRAF expressed mRNA subsequence at nucleotides 71-90 at τ=37° C., [Na⁺]=1M. Based on literature parameters, ΔG°_rxnis calculated to be −0.61 kcal/mol and ([P]₀−[C]₀)/[C]₀=2.69 is recommended to achieve X=0.

FIG. 11 provides an exemplary probe of the present disclosure (Example 3; the Protector has a sequence according to SEQ ID NO: 9, the Complement has a sequence according to SEQ ID NO: 10, and the Target has a sequence according to SEQ ID NO: 11) targeting a BRAF expressed mRNA subsequence at nucleotides 131-160 at τ=37° C., [Na⁺]=1M. Based on literature parameters, ΔG°_rxnis calculated to be −1.54 kcal/mol and ([P]₀−[C]₀)/[C]₀=12.14 is recommended to achieve X=0.

FIG. 12 provides an exemplary probe of the present disclosure (Example 4; the Protector has a sequence according to SEQ ID NO: 12, the Complement has a sequence according to SEQ ID NO: 13, and the Target has a sequence according to SEQ ID NO: 14) targeting a BRAF expressed mRNA subsequence at nucleotides 191-220 at τ=52° C., [Na⁺]=1M. Based on literature parameters, ΔG°_rxnis calculated to be −0.46 kcal/mol and ([P]₀−[C]₀)/[C]₀=2.11 is recommended to achieve X=0.

FIG. 13 provides an exemplary probe of the present disclosure (Example 5; the Protector has a sequence according to SEQ ID NO: 15, the Complement has a sequence according to SEQ ID NO: 16, and the Target has a sequence according to SEQ ID NO: 17) targeting a BRAF expressed mRNA subsequence at nucleotides 251-280 at τ=65° C., [Na⁺]=1M. Based on literature parameters, ΔG°_rxnis calculated to be −1.49 kcal/mol and ([P]₀−[C]₀)/[C]₀=11.2 is recommended to achieve X=0.

FIG. 14 provides an exemplary probe of the present disclosure (Example 6; the Protector has a sequence according to SEQ ID NO: 18, the Complement has a sequence according to SEQ ID NO: 19, and the Target has a sequence according to SEQ ID NO: 20) targeting a BRAF expressed mRNA subsequence at nucleotides 311-350 at τ=52° C., [Na⁺]=1M. Based on literature parameters, ΔG°_rxnis calculated to be −0.22 kcal/mol and ([P]₀−[C]₀)/[C]₀=1.43 is recommended to achieve X=0.

FIG. 15 provides an exemplary probe of the present disclosure (Example 7; the Protector has a sequence according to SEQ ID NO: 21, the Complement has a sequence according to SEQ ID NO: 22, and the Target has a sequence according to SEQ ID NO: 23) targeting a BRAF expressed mRNA subsequence at nucleotides 431-460 at τ=65° C., [Na⁺]=1M. Based on literature parameters, ΔG°_rxnis calculated to be +1.03 kcal/mol and ([P]₀−[C]₀)/[C]₀=0.19 is recommended to achieve X=0.

FIG. 16 provides an exemplary probe of the present disclosure (Example 8; the Protector has a sequence according to SEQ ID NO: 24, the Complement has a sequence according to SEQ ID NO: 25, and the Target has a sequence according to SEQ ID NO: 26) with an alternative orientation targeting a BRAF expressed mRNA subsequence at nucleotides 491-520 at τ=37° C., [Na⁺]=1M. Based on literature parameters, ΔG°_rxnis calculated to be −0.27 kcal/mol and ([P]₀−[C]₀)/[C]₀=1.55 is recommended to achieve X=0.

FIG. 17 provides an exemplary probe of the present disclosure (Example 9; the Protector has a sequence according to SEQ ID NO: 27, the Complement has a sequence according to SEQ ID NO: 28, and the Target has a sequence according to SEQ ID NO: 29) with an intentional single nucleotide mismatch in the target-homologous-region (the fourth region) of the protector strand targeting a BRAF expressed mRNA subsequence at nucleotides 551-580 at τ=37° C., [Na⁺]=1M. Based on literature parameters, ΔG°_rxnis calculated to be −0.37 kcal/mol and ([P]₀−[C]₀)/[C]₀=1.82 is recommended to achieve X=0.

FIG. 18 provides an exemplary probe of the present disclosure (Example 10; the Protector has a sequence according to SEQ ID NO: 30, the Complement has a sequence according to SEQ ID NO: 31, and the Target has a sequence according to SEQ ID NO: 32) targeting a BRAF expressed mRNA subsequence at nucleotides 611-630 at τ=25° C., [Na⁺]=1M. Based on literature parameters, ΔG°_rxnis calculated to be −0.66 kcal/mol and ([P]₀−[C]₀)/[C]₀=3.0 is recommended to achieve X=0.

FIG. 19 provides an exemplary probe of the present disclosure (Example 11; the Protector has a sequence according to SEQ ID NO: 33, the Complement has a sequence according to SEQ ID NO: 34, and the Target has a sequence according to SEQ ID NO: 35) targeting a BRAF expressed mRNA subsequence at nucleotides 671-700 at τ=25° C., [Na⁺]=1M, 30% formamide. Based on literature parameters and an assumption that 1% formamide is equivalent to a temperature increase of 0.6° C., ΔG°_rxnis calculated to be 0.32 kcal/mol and ([P]₀−[C]₀)/[C]₀=0.58 is recommended to achieve X=0.

FIG. 20 provides an exemplary probe of the present disclosure (Example 12; the Protector has a sequence according to SEQ ID NO: 36, the Complement has a sequence according to SEQ ID NO: 37, and the Target has a sequence according to SEQ ID NO: 38) targeting a DNA sequence at nucleotides 671-700 at τ=62° C., [Mg²⁺]=3 mM. Based on literature parameters, ΔG°_rxnis calculated to be −3.07 kcal/mol and ([P]₀−[C]₀)/[C]₀=100 is recommended to achieve X=0.

FIG. 21 provides a schematic overview of hotspot multiplexing PCR using probes of the present disclosure as primers. A sample 20 that comprises desired target nucleic acid molecules 22 is mixed with enzyme 23, a forward primer 24, and a reverser primer set 10a-10d. The target nucleic acid molecules 22 comprise single-base mutations residing at loci close to one another (a “hotspot”), and are typically challenging to detect via standard PCR primers. Due to the fact that the present probes possess selectivity to single nucleotide mismatches along the entire length of the primer, the present probes may be uniquely advantageous in hotspot multiplexed PCR primers. Additionally, the use of the present probes as PCR primers, being primarily double-stranded, will suppress the formation of primer dimers, which often limits multiplexed amplification capabilities for PCR. By using different fluorescence channels (1, 2, 3, and 4), each target could be quantitated with very little undesired cross-interaction.

FIG. 22 provides an exemplary hotspot multiplexing PCR reverse primer set using probes of the present disclosure targeting four various NRAS codon 61 mutations. Sequence design and energy calculations are based on the descriptions of design above, and the [P]₀/[C]₀ratio that theoretically achieves X=0 are calculated for each primer system. For NRAS-Q61R c. 182A>G, the Protector has a sequence according to SEQ ID NO: 40, the Complement has a sequence according to SEQ ID NO: 41, and the Target has a sequence according to SEQ ID NO: 39. For NRAS-Q61H c. 183A>T, the Protector has a sequence according to SEQ ID NO: 43, the Complement has a sequence according to SEQ ID NO: 44, and the Target has a sequence according to SEQ ID NO: 42. For NRAS-Q61K c. 181C>A, the Protector has a sequence according to SEQ ID NO: 46, the Complement has a sequence according to SEQ ID NO: 44, and the Target has a sequence according to SEQ ID NO: 47. For NRAS-Q61L c. 182A>T, the Protector has a sequence according to SEQ ID NO: 49, the Complement has a sequence according to SEQ ID NO: 50, and the Target has a sequence according to SEQ ID NO: 48.

FIG. 23 provides fine-tuning of probes directed to a DNA target by modifying the [P]₀/[C]₀ratio. Probes in this figure were designed to bind to the same DNA target with different reaction standard free energies. Each complement strand was modified by a TAMRA fluorophore at 3′ end while each protector strand by an Iowa Black RQ quencher at 5′ end. Hybridization yields were experimentally obtained at different ([P]₀−[C]₀)/[C]₀ratios via fluorescence. The results indicate that each probe shown in this figure is tunable in specificity and sensitivity. All experiments were done with 1X PBS at 25° C. The Target has a sequence according to SEQ ID NO: 51. Probe 1 has a Protector sequence according to SEQ ID NO: 52 and a Complement sequence according to SEQ ID NO: 53. Probe 2 has a Protector sequence according to SEQ ID NO: 54 and a Complement sequence according to SEQ ID NO: 55. Probe 3 has a Protector sequence according to SEQ ID NO: 56 and a Complement sequence according to SEQ ID NO: 57. Probe 4 has a Protector sequence according to SEQ ID NO: 58 and a Complement sequence according to SEQ ID NO: 59.

FIG. 24 provides fine-tuning of probes targeting an RNA sequence (synthetic miR-122) by modifying the[P]₀/[C]₀ratio. The probe design process was similar to that of FIG. 22, except that RNA-DNA binding parameters were used. Experimental procedures were the same as DNA target. The results show that the sensitivity/specificity tradeoff of probes for RNA target are also adjustable. The Target has a sequence according to SEQ ID NO: 60. Probe 1 has a Protector sequence according to SEQ ID NO: 61 and a Complement sequence according to SEQ ID NO: 62. Probe 2 has a Protector sequence according to SEQ ID NO: 63 and a Complement sequence according to SEQ ID NO: 64. Probe 3 has a Protector sequence according to SEQ ID NO: 65 and a Complement sequence according to SEQ ID NO: 66. Probe 4 has a Protector sequence according to SEQ ID NO: 67 and a Complement sequence according to SEQ ID NO: 68. Probe 5 has a Protector sequence according to SEQ ID NO: 69 and a Complement sequence according to SEQ ID NO: 70.

FIG. 25 provides a schematic overview of selectively amplifying a target nucleic acid molecule using probes of the present disclosure as self-reporting primers. The first nucleic acid strand of the primer 10 comprises a fluorophore F on one end, and the second nucleic acid strand of the primer comprises a quencher Q on the other end. During amplification process 30, upon hybridization to the desired target nucleic acid molecule 25, fluorescence signal increases as the second nucleic acid strand of the primer P diffuses away, indicating the formation of amplicon 26.

FIG. 26 provides a schematic overview of amplifying a target nucleic acid molecule using probes of the present disclosure as fluorophore-labeled probes to quantitate the amount of desired amplicons formed through the amplification. A sample that possibly comprises desired target nucleic acid molecule 22 is mixed with enzyme 23, forward primer 27, reverse primer (not shown), and fluorophore-labeled probe 10. The first nucleic acid strand of the probe C is functionalized with a fluorophore F internally and a quencher at the 3′ end, so that the probe is natively dark due to the close proximity of the fluorophore and quencher. The forward primer and the first nucleic acid strand of the probe C hybridize to the desired target nucleotide molecule during the annealing process 31, while the second nucleic acid strand is displaced by the target nucleic acid molecule. During the extension process 32, enzyme 23 with exonuclease activity extends the primer and cleaves the phosphodiester bonds of first nucleic acid strand, resulting the increase of fluorescence signal.

While the present disclosure is susceptible to various modifications and alternative forms, specific example instances have been shown in the figures and are herein described in more detail. It should be understood, however, that the description of specific example instances is not intended to limit the invention to the particular forms disclosed, but on the contrary, this disclosure is to cover all modifications and equivalents as illustrated, in part, by the appended claims.

DESCRIPTION

The nucleic acid probe systems described herein possess provide several advantages over previously described system. First, the methods and compositions described herein provide for more economical DNA probes to assay RNA targets of specific sequence; DNA probes to RNA targets may also exhibit improved specificity because RNA hybridization is generally less specific than DNA hybridization. Additionally, the methods and compositions here allow modified nucleic acid probes, such as those incorporating 2′-O-methyl nucleotides or locked nucleic acid (LNA), to benefit from robust single nucleotide specificity; these modified nucleic acid probes may possess desirable properties such as nuclease resistance. Second, the methods and compositions described herein provide specificity and sensitivity performance which can be finely tuned by modification of the relative concentrations of protector and complement in the probe system. Additionally, the probe systems also possess two other desirable features: the probes described herein are extremely specific and the probes described herein are operable across a wide range of temperature and salt concentrations and are therefore functionally reliable under many different experimental conditions. For example, a single-base change results in binding yields that differ by approximately 30-fold across temperatures from 10° C. to 70° C. Finally, the probes described herein are kinetically fast. For example, the probe of the present disclosure interacts with the target nucleic acid molecule within a factor of 10 of hybridization.

An overview of probe system 10 consistent with the present disclosure reacting with its intended target T is shown in FIGS. 2A and B. In this example, probe system 10 consists of a protector oligonucleotide/strand P and a complement oligonucleotide/strand C with the protector P existing in excess of the complement C. Protector P and complement C can hybridize to form a partially double-stranded complex; this is true regardless of whether protector P and complement C are introduced separately to target T, or pre-reacted to form the complex. Additionally, in some instances, there may be an excess of complement C or protector P such that the excess strand may exist as a single stranded molecule in addition to the partially double stranded complex of protector and complement. In FIG. 2A, the concentration ratio [P]₀/[C]₀is selected so that the reaction between target T and probe system 10 has a reaction standard free energy (ΔG°_rxn) equal to (−Rτln(([P]₀−[C]₀)/[C]₀)]). This results, in some instance, in half of all target molecules T in a sample bound to complement strand C at equilibrium. Referring now to FIG. 2B, a target variant V that differs in sequence from target T in the target-validation 7 or target-toehold 6 regions, potentially by a single base, will bind with more a positive standard free energy (ΔG°_v) and possess significantly lower equilibrium yield (e.g. 2%).

The sequences of protector strand P and complementary strand C are designed based on the sequence of intended target T. Each strand is conceptually divided into a number of non-overlapping regions, as shown in FIG. 1. It is important to note that target-validation region 7 (also referred to herein as the “seventh region”) and target-homologous region 4 (also referred to herein as the “fourth region”), while both are partially or fully complementary to target-homologous-complementary region 2 (also referred to herein as the “second region”), can possess different sequences. For example, an RNA target will have a target-validation region 7 containing uracil whereas a DNA protector P will comprise thymine in target-homologous region 4. As another example, region 4 may be partially mismatched to region 2 at certain positions, whereas region 7 is perfectly matched to region 2. As another example, both region 4 and region 7 may be partially mismatched to region 2, but at different nucleotide bases.

The reaction standard free energy for the probe system without a label is provided by ΔG°_rxn=ΔG°_t-TC−ΔG°_th-PC+(ΔG°_h-PC) which is also referred to herein and in the appended claims as “Expression 1.” The reaction standard free energy for the probe system with a functionalized group or label is provided by ΔG°_rxn=ΔG°_t-TC−ΔG°_nh-PC+(ΔG°_v-TC−ΔG°_h-PC)+ΔG°_labelwhich is referred to herein and in the appended claims as “Expression 3.” It should be understood that all standard free energy terms used herein are evaluated at the temperature and buffer conditions at which the composition is applied to the target nucleic acid molecule.

As shown in FIGS. 3 and 5, Expressions 1 and 3 are comprised of a number of components representing the standard free energy of hybridization between the various regions of the protector/complementary/target nucleic acid strands. As depicted therein, the ΔG°_t-TCterm represents the standard free energy of hybridization between target-toehold region 6 of target nucleic acid T and target-toehold complementary region 1 of complement strand C of probe system 10. These regions can be either partially complementary or fully complementary. In this instance, the term “partially complementary” is defined as having over 60% of the nucleotides in the first region being complementary to the aligned nucleotides of the sixth region. However, it should be understood that the term “partially complementary” with respect to other paired sequences may have a different meaning.

The ΔG°_nh-PCterm represents the standard free energy of hybridization between target-nonhomologous region 5 of protector strand P and target-nonhomologous-complementary region 3 of complement strand C. These regions can be either partially complementary or fully complementary. In this instance, the term “partially complementary” is defined as having over 60% of the nucleotides in the third region being complementary to the aligned nucleotides of the fifth region.

The ΔG°_v-TCterm represents the standard free energy of hybridization between target-validation region 7 of target nucleic acid T and target-homologous-complementary region 2 of complement strand C. These regions can be either partially complementary or fully complementary. In this instance, the term “partially complementary” is defined as having over 60% of the nucleotides in the second region being complementary to the aligned nucleotides of the seventh region.

The ΔG°_h-PCterm represents the standard free energy of hybridization between the target-homologous region 4 of protector strand P and target-homologous-complementary region 2 of complement strand C. These regions can be either partially complementary or fully complementary. In this instance, the term “partially complementary” is defined as having over 60% of the nucleotides in the second region being complementary to the aligned nucleotides of the fourth region.

The term ΔG°_labelequals the standard free energy of a label on the complement strand (ΔG°_F) minus the standard free energy of the interaction between the label and the protector, including any other functionalized groups on the protector. In the example in FIG. 5, the label of the protector strand (Q) is a quencher specific to the fluorophore (F) of the complement strand.

Referring still to FIG. 1, in certain instances, the design of probe system sequences is such that (1) there is little to no secondary structure in target-toehold-complementary region 1, and (2) there is little to no binding between the target-upstream region 8 and target-nonhomologous-complementary region 3. Here, “little to no secondary structure” in the target-toehold-complementary region is defined as fewer than 50% of the nucleotides in the region being in double-stranded state in the evaluated minimum free energy structure, as computed in the operational temperature and salinity conditions. Here, “little to no binding” between the target-upstream region and target-nonhomologous-complementary region 3 is defined as fewer than 50% of the nucleotides in the target-nonhomologous-complementary region 3 being in double-stranded state in the evaluated minimum free energy structure, as computed in the operational temperature and salinity conditions.

In addition to the reaction standard free energy (ΔG°_rxn) as determined, for example, by Expression 1, the present probe design includes consideration of the relative concentrations of the protector and complement strands of the probe. This permits fine tuning of reactions by modifying the ratio of protector strand to complement strand independently of the probe's sequence design. Thus, in one instance, the design of the present nucleic acid hybridization probe system is based on the following:

ΔG°_rxn=ΔG°_t-TC−ΔG°_nh-PC+(ΔG°_v-TC−ΔG°_h-PC)=−Rτln(([P]₀−[C]₀)/[C]₀)+X

- or

ΔG°_rxn=ΔG°_t-TC−ΔG°_nh-PC+(ΔG°_v-TC−ΔG°_h-PC)+ΔG°_label=−Rτln(([P]₀−[C]₀)/[C]₀)+X

- where X is a value between −5 kcal/mol and +5 kcal/mol. The value of X further allows the user to control the tradeoff between high molecular sensitivity and high molecular specificity, with more positive values of X favoring higher specificity.

It should be understood that the values of the ΔG° terms can only be approximately calculated based on currently available literature values, whereas the claimed probes are described and constrained by real ΔG° terms. Based on our experimental studies of ΔG° values, calculations based on currently available parameters and software may differ from real values by up to 3 kcal/mol or 15%, whichever is larger.

In contrast, WO 2012/058488 describes the design of nucleic acid hybridization probes in which the primary design constraint is ΔG°_t-TC≈ΔG°_nh-PC, in the language of the present disclosure, where approximately equal to is defined as within 10% of each other. In one embodiment, the standard free energies ΔG°_t-TCand ΔG°_nh-PCfor the probes of the current invention differ by more than 10% because the desired value of X differs significantly from 0. In another embodiment, the standard free energies ΔG°_t-TCand ΔG°_bh-PCfor the probes of the current invention differ by more than 10% because (ΔG°_v-TC−ΔG°_h-PC) differs significantly from 0. In another embodiment, the standard free energies ΔG°_t-TCand ΔG°_nh-PCfor the probes of the current invention differ by more than 10% because ([P]₀−[C]₀)/[C]₀differs significantly from 1. In another embodiment, the standard free energies ΔG°_t-TCand ΔG°_nh-PCfor the probes of the current invention differ by more than 10% because ΔG°_labeldiffers significantly from 0.

Thus, the present probe system diverges from the prior art in the consideration of the ΔG°_v-TC, ΔG°_h-PC, ΔG°_label, X, and

$(\frac{{[P]}_{0} - {[C]}_{0}}{{[C]}_{0}})$

terms. Negligence of the ΔG°_v-TC, ΔG°_h-PCterms lead to poor probe design in many settings where the nucleotide sequences of region 4 and region 7 are not identical, negligence of the ΔG°_labelterm leads to poor probe design when fluorophore or other labels are used, negligence of the X term precludes different tradeoffs between specificity and sensitivity, and negligence of the stoichiometric ratio term precludes fine-tuning of probe system behavior independent of sequence design and furthermore cause probes to perform poorly in certain stoichiometries of P and C. Each of these will be discussed in more detail below.

First, referring back to FIG. 2A, the target-validation region 7 of the target T and target-homologous region 4 of protector P may differ in sequence and thermodynamic properties for a number of reasons, importantly in instances where T and P are different types of nucleic acids. For example, target T may be an RNA molecule due to scientific/clinical interest, whereas protector P may be a DNA molecule due to economics/synthesis capabilities. As another example, protector P may comprise a modified nucleic acid, such as 2′-O-methyl nucleotides or locked nucleic acid (LNA). As another example, region 4 and region 7 may both be DNA, but differ in nucleotide sequence in order to benefit from increased kinetics or decreased unwanted biological response.

When target-validation region 7 of target T and target-homologous region 4 of protector P differ, then the ΔG°_v-TCand ΔG°_h-PCterms are unequal, and must be considered in the ΔG°_rxndriven probe system design process. The value of ΔG°_v-TC−ΔG°_h-PCcan deviate significantly from zero. Referring now to FIG. 4, the distribution of these values for 46 different non-overlapping subsequences of the BRAF transcript RNA (each 50 nt long) versus homologous DNA sequences in binding to a DNA complement, using RNA-DNA hybridization thermodynamics values given by Sugimoto et al. [7] and DNA-DNA hybridization thermodynamics values given by SantaLucia and Hicks [8] is shown. As can be seen therein, not only does the value of ΔG°_v-TC−ΔG°_h-PCrange from −20 kcal/mol to +20 kcal/mol, the values are also temperature dependent. In contrast, even a 1 kcal/mol difference in ΔG°_rxncan lead to significant changes in sensitivity and/or specificity.

The detection of RNA targets T using DNA probes (P and C) is only one application in which ΔG°_v-TC−ΔG°_h-PCmust be considered. Other variations of the probe system exist where the target-homologous region of protector P differs from the target-validation region of the target T, either because T and P are different types of nucleic acids (RNA, DNA, LNA, PNA, phosphothioate DNA, 2′-methoxy nucleic acids, etc.) or because of small changes in sequence, which will be discussed in further detail herein below.

By ignoring the ΔG°_v-TC−ΔG°_h-PCterm, it must be assumed that the total value of this term is 0 kcal/mol. This assumption is satisfied only when target-homologous region 4 of protector P is of identical character and sequence as target-validation region 7 of target T, such as for applications of DNA targets using DNA protectors and where region 7 and 4 possess identical nucleotide sequence.

Second, many applications of detection or imaging of nucleic acids utilize labels to help visualize the existence or quantity of target nucleic acids. These labels can be organic fluorophores, metallic nanoparticles, or haptens that recruit antibodies. Frequently, these labels can have significant thermodynamic effects, stabilizing or destabilizing nucleic acid hybridization. Proper design of probe systems that utilize labels should account for the differential standard free energies of labels with the protector and with the target as shown in FIG. 5.

Third, as mentioned above, the relative concentrations of protector P and complement C serve is an important tuning parameter for the present probe system that exists independently of the probe system's sequence design. Given that current understanding of DNA and RNA hybridization thermodynamics and label thermodynamics are imperfect, the ability to modulate the performance of a particular probe system after design and synthesis is vitally important for practical applications involving these probe systems.

To understand the role of the relative concentrations of P and C in tuning the performance of the probe system, the equilibrium of the reaction between the target and the probe system should be considered. The overall chemical reaction can be written as the expression below.

T+PCTC+P

Typically, the targets (biological DNA or RNA molecules) are much lower in concentration than the probe components P and PC; the higher concentrations of P and PC aid in driving the reaction to equilibrium quickly. One useful metric for judging the reaction's behavior is the yield or sensitivity of the probe system to target T, which can be expressed as

$(\frac{[TC]}{[T] + [TC]}) .$

When the sensitivity is roughly 50%, that is, when the equilibrium concentration of unbound T is equal to the equilibrium concentration of T bound to C ([T]=[TC]), the fold-change discrimination against a variant target V ([TC]/[VC]) is within a factor of 2 of optimal. The value of the equilibrium constant Keq that enables [T]=[TC] can be analytically solved by the below expression.

$\begin{matrix} K_{eq} = \frac{[TC] [P]}{[T] [PC]} \\ = \frac{[P]}{[PC}} \end{matrix}$

The standard free energy of a reaction can be related to the reaction equilibrium constant by the following expression.

$\begin{matrix} Δ G_{rxn}^{o} = - R_{T} \ln (K_{eq}) \\ = - R_{T} \ln (\frac{[P]}{[PC]}) \\ = - R_{T} \ln (\frac{{[P]}_{0} - {[C]}_{0}}{{[C]}_{0}}) \end{matrix}$

In the above equation, [P]₀denotes the initial concentration of the protector and [C]₀denotes the initial concentration of the complement. Because the target concentration [T] is typically much lower than the concentrations of either protector or probe, the equilibrium concentrations of [P] and [PC] can be approximated as [P]₀−[C]₀and [C]₀, respectively. The term

$(\frac{{[P]}_{0} - {[C]}_{0}}{{[C]}_{0}})$

is scale-invariant, and the concentrations used for [P]₀and [C]₀can therefore be either the high stock concentration added to a sample, or the final concentration achieved after dilution by the sample. Note that [P]₀and [C]₀refer to the total concentrations of P and C, including those present in the partially double-stranded PC species. An alternative method of writing this expression is ([P_free]₀/[PC]₀), where [P_free]₀denotes the initial concentration of free P and [PC]₀denotes the initial concentration of PC.

For use in the present probe system, the concentration for [P]₀may be lower than, the same as, or greater than, but is generally greater than the concentration for [C]₀. For example, the concentration for [P]₀as can be from about 1.01 times to about 10,000 times that of [C]₀, from about 1.1 times to about 1,000 times that of [C]₀, or from about 1.2 times to about 100 times that of [C]₀and including any intermediate range between any of the above provided ranges.

In one instance, probe behavior can be tuned to achieve approximately 50% sensitivity by designing the probe system so that the ΔG°_rxnis close to 0, or from about −5 kcal/mol to about +5 kcal/mol, and then adjusting the [P]₀and [C]₀so that

$Δ G_{rxn}^{o} = R τ \ln (\frac{{[P]}_{0} - {[C]}_{0}}{{[C]}_{0}})$

is satisfied. Importantly, without tuning the probe system via [P]₀and [C]₀, it becomes practically impossible to obtain 50% sensitivity (or any other desired sensitivity), due to the coarse-grain nature of adjusting ΔG°_rxnvia addition or removal of base pairs/stacks.

FIG. 6 demonstrates that a single additional base pair changes the value of ΔG°_rxnby between −0.6 kcal/mol and −2.2 kcal/mol in 37° C., 1M Na⁺.

In the present disclosure, a novel concept of fine-tuning of ΔG°_rxnvia the stoichiometric ratio of protector P to complement C is therefore provided. The accuracy of the stoichiometric ratio between P and C is limited only by the accuracy of liquid handling systems (e.g. pipettor accuracy), and can typically be controlled to within 2%. This 2% accuracy of stoichiometry, in turn, results in the same precision of tuning probe performance −Rτln(1.02)=−0.012 kcal/mol as resolution in ΔG°_rxn. Thus, tuning the thermodynamics via P to C stoichiometry is over a factor of 50 more fine-grained than prior art methods of tuning thermodynamics via additional base pairs (−0.012 kcal/mol vs −0.60 kcal/mol). Tuning of P and C stoichiometry can occur at the design phase, or dynamically as the probe is being iteratively optimized for a particular application.

The experimental results provided in FIGS. 22 and 23 demonstrate the effectiveness of adjusting the ratio ([P]₀_o−[C]₀)/[C]₀in order to tune the specificity/sensitivity tradeoff. FIG. 22 depicts the sequence design of four different probes directed to a DNA target, each designed with a different ΔG°_rxnand the observed yield of the DNA target to each probe for different values of ([P]₀−[C]₀)/[C]₀. FIG. 23 depicts the sequence design of five different probes directed to a RNA target, each designed with a different ΔG°_rxnand the observed yield of the RNA target to each probe for different values of ([P]₀_o−[C]₀)/[C]₀. As taught previously, larger values of ([P]₀−[C]₀)/[C]₀monotonically decrease the yield of the hybridization between the target and the probe.

In another aspect, the present disclosure provides a probe system in which

$Δ G_{rxn}^{o} = - R_{T} \ln (\frac{{[P]}_{0} - {[C]}_{0}}{{[C]}_{0}})$

is not satisfied, but instead provides a slight variation where the values are not equal in order to achieve a different tradeoff between specificity and sensitivity. To this end, the thermodynamic property of the present probe system can be expressed by the following:

$Δ G_{rxn}^{o} = - R_{T} \ln (\frac{{[P]}_{0} - {[C]}_{0}}{{[C]}_{0}}) + X$

where X is the deviation from 0. In one instance, the value of X is from about −5 kcal/mol to about +5 kcal/mol. For positive values of X, the specificity (against a target variant V) will be improved, but sensitivity (yield) will be reduced. For negative values of X, the sensitivity will be improved but specificity will be reduced as demonstrated in FIG. 8. In practice, certain applications (such as those dealing with rare alleles) may require higher specificity at the cost of sensitivity, or vice versa. The present methods to fine-tune thermodynamics are particularly useful for these applications that require intricate control of sensitivity and specificity (see also Variants).

In yet another aspect, the present disclosure provides for minor sequence differences between target-validation and target-homologous regions. The target-validation region (of the target T) and the target-homologous region (of the protector P) are both intended to be complementary to the target-homologous-complementary region (of the complement C). However, there may be cases where it is desirable to have minor sequence modifications in the target-validation and/or in the target-homologous region, so that the target-validation and/or the target-homologous region are only partially complementary to the target-homologous-complementary region. To this end, in the instance that over 60% of the bases in the target-homologous-complementary region are complementary to the target-validation region, and over 60% of the bases in the target-homologous-complementary region are complementary to the target-homologous region, the resulting probes maintain consistency with the principles of probe construction described herein.

In addition, the present disclosure provides a probe system in which the 5′ to 3′ orientations of the protector and complement are reversed with respect to the positions of the nonhomologous and toehold regions as shown in FIGS. 7A, 7B, and 7C. Modern nucleic acid synthesis occurs from the 3′ end to the 5′ end, resulting in truncations and deletions being concentrated at the 5′ end. Consequently, it is expected that the original orientation shown in FIGS. 1-6 would be desirable because truncations on the protector and the complement will tend to balance each other energetically, maintaining the desired ΔG°_rxn. In contrast, in the design orientation shown in FIGS. 7A, 7B, and 7C, truncations in both the protector and the complement will tend to make ΔG°_rxnmore negative, reducing the reliability and specificity of the probe system. These effects are mitigated when the protector and complement oligonucleotides are purified post-synthesis, such as by high pressure liquid chromatography (HPLC) or polyacrylamide gel electrophoresis (PAGE).

In the analysis of reaction standard free energy (ΔG°_rxn) the standard free energy of formation ΔG° of an unstructured oligonucleotide is defined to be 0. The equilibrium constant (K_eq) of the reaction between the target T and the probe system (P and C) can be directly calculated from the reaction's standard free energy ΔG°_rxnvia the following expression:

K_eq=e^−ΔG°^rxn^/Rτ

where R=8.314 J/mol K is the ideal gas constant (alternatively, Boltzmann constant), and τ is the ambient temperature in Kelvin.

In the design of the present probe system, the reaction ΔG°_rxnis broken down into the sum of a number of ΔG° terms denoting the standard free energy of hybridization of various regions of the complement strand to target strand and complement strand to protector strand (e.g. ΔG°_nh-PCdenotes the hybridization of the target-nonhomologous region to the target-nonhomologous-complement region). The values of these terms can be approximately calculated by adding the standard free energies of base stacks as described in more detail herein below, though current literature-provided standard free energy values are incomplete and of limited accuracy. Experimental testing is needed to determine the true values of ΔG°_rxnfor each probe, but the literature-guided values provide a rough (typically within 3 kcal/mol or 15%) estimate of the ΔG°_rxn.

In one instance, the standard free energies of hybridization between regions of the present probe system are calculated based on a base pair stacking approach. In this method, two adjacent base pairs comprise one stack, which has a defined enthalpy (ΔH°) and entropy (ΔS°) value. The standard free energy of each stack (ΔG°) at a particular temperature τ (in Kelvin) can be calculated from the equation ΔG°=ΔH°−τΔS°. The standard free energies of several stacks can be summed to evaluate the standard free energy of a binding region. For example, the standard free energy of a ‘CTC’ region pairing to a ‘GAG’ region is the standard free energy of stack ‘CT/GA’ plus the standard free energy of stack ‘TC/AG’. At 37° C. in 1M Na⁺, the standard free energy of stack ‘CT/GA’ is −1.28 kcal/mol and the standard free energy of stack ‘TC/AG’ is −1.30 kcal/mol, so the standard free energy of ‘CTC’ pairing to ‘GAG’ is −2.58 kcal/mol.

The ΔH° and ΔS° values of DNA-DNA stacks, based on published work by SantaLucia and Hicks are shown in Table 1. The standard enthalpy change and the standard entropy change of RNA-DNA stacks, based on published work by Sugimoto et al., are shown in Table 2. The standard enthalpy change and the standard entropy change of RNA-RNA stacks, based on published work by Turner et al., are shown in Table 3. The values of ΔH° for base stacks are accepted in the literature to be the same regardless of salinity. In contrast, the ΔS° of base stacks are adjusted by 0.368*ln([Na⁺]) cal/mol*K, regardless of nucleotide base identity, due to the electrostatic screening properties of cations. Additionally, divalent cations (such as Mg²⁺) may also be used in the reaction solution; the effects of divalent cations on base pairing thermodynamics are described in the literature, such as by Owczarzy, Biochemistry, 2008. Finally, denaturants such as formamide may be used to facilitate hybridization reactions, particularly for in situ hybridization applications. It has been reported in literature that each percent (%) that water is replaced by formamide effectively increases the temperature by 0.6° C. for purposes of nucleic acid base pairing thermodynamics, see Blake and Delcourt, Nucleic Acids Research, 1996.

TABLE 1 Thermodynamic Parameters for DNA Watson-Crick Pairs in 1M NaCl. Propagation ΔH° ΔS° sequence (kcal mol⁻¹) (e.u.) AA/TT −7.6 −21.3 AT/TA −7.2 −20.4 TA/AT −7.2 −21.3 CA/GT −8.5 −22.7 GT/CA −8.4 −22.4 CT/GA −7.8 −21.0 GA/CT −8.2 −22.2 CG/GC −10.6 −27.2 GC/CG −9.8 −24.4 GG/CC −8.0 −19.9 Initiation +0.2 −5.7

TABLE 2 Thermodynamic Parameters for RNA-DNA Duplex Pairs in 1M NaCl. Sequence ΔH°/kcal mol⁻¹ ΔS°/cal mol⁻¹K⁻¹ rAA −7.8 −21.9 dTT rAC −5.9 −12.3 dTG rAG −9.1 −23.5 dTC rAU −8.3 −23.9 dTA rCA −9.0 −26.1 dGT rCC −9.3 −23.2 dGG rCG −16.3 −47.1 dGC rCU −7.0 −19.7 dGA rGA −5.5 −13.5 dCT rGC −8.0 −17.1 dCG rGG −12.8 −31.9 dCC rGU −7.8 −21.6 dCA rUA −7.8 −23.2 dAT rUC −8.6 −22.9 dAG rUG −10.4 −28.4 dAC rUU −11.5 −36.4 dAA initiation 1.9 −3.9

TABLE 3 Thermodynamic Parameters for RNA-RNA Duplex Pairs in 1M NaCl. Sequence ΔH°/kcal mol⁻¹ ΔS°/cal mol⁻¹K⁻¹ AA/UU −6.6 −18.4 AU/UA −5.7 −15.5 AC/UG −10.2 −26.2 AG/UC −7.6 −19.2 UA/AU −8.1 −22.6 UC/AG −13.3 −35.5 UG/AC −10.5 −27.8 CC/GG −12.2 −29.7 CG/GC −8.0 −19.4 GC/CG −12.2 −29.7 Initiation 0.0 −10.8

In one instance, the reaction standard free energy (ΔG°_rxnfrom Expression 1 or 3) of hybridization for the various regions of the present probe system are calculated as described below.

ΔG°_t-TC(hybridization of target-toehold-region (region 6) to target-toehold-complementary regions (region 1)) is composed by summing the standard free energy of all toehold region nucleic acid stacks, the neighboring stack and an initiation energy penalty (ΔG°_ini), due to the entropic loss of orienting two nucleic acid molecules for hybridization. The value of ΔG°_inican be calculated from ΔH°_iniand ΔS°_inivia ΔG°=ΔH°−τΔS°

For DNA-DNA hybridization as provided in Table 1, ΔH°_ini=0.2 kcal/mol and ΔS°_ini=−5.7 cal/(mol·K). For RNA-DNA hybridization as provided in Table 2, ΔH°_ini=1.9 kcal/mol and ΔS°_ini=−3.9 cal/(mol·K). For RNA-RNA hybridization as provided in Table 3, ΔH°_ini=0.0 kcal/mol and ΔS°_ini=−10.8 cal/(mol·K).

In one instance, the probes described herein have a ΔG°_t-TCfrom about −2 kcal/mol to about −16 kcal/mol, from about −5 kcal/mol to about −13 kcal/mol, or from about −7 kcal/mol to about −10 kcal/mol at operation conditions.

ΔG°_nh-PC(hybridization of target-nonhomologous region 5 of protector P to target-nonhomologous-complementary region 3 of complement C) is composed by summing the standard free energy of all stacks in the non-homologous region, the neighboring stack on the homologous region, and the hybridization initiation energy ΔG°_ini. Each stack standard free energy term and initiation standard free energy term is calculated based on the methods discussed above.

ΔG°_v-TC(hybridization of target-validation region 7 of target T to target-homologous-complementary region 2 of complement C) is equal to the sum of all nucleic acid stacks in the target-validation region. Each standard free energy term is calculated based on the methods discussed herein above. In this instance, the initiation energy ΔG°_iniis not applied in the calculation of this term.

ΔG°_h-PC(hybridization of target-homologous region 4 of protector P to target-homologous-complementary region 2 of complement C) is equal to the sum of all nucleic acid stacks in the target-homologous region. Each standard free energy term is calculated based on the methods discussed herein above. In this instance, the initiation energy ΔG°_iniis not applied in the calculation of this term.

In one instance, the sum of the standard free energy of hybridization between the target-toehold-complementary region (region 1) and the target-toehold region (region 6) and between the target-homologous-complementary region (region 2) and the target-validation region (region 7) (ΔG°_t-TC+ΔG°_v-TC) is more negative than −7 kcal/mol, for example between about −7 kcal/mol and about −70 kcal/mol, between about −7 kcal/mol and about −50 kcal/mol, and between −7 kcal/mol and about −30 kcal/mol. In this instance or other instances, the sum of the standard free energy of hybridization between the target-nonhomologous-complementary region (region 3) and the target-nonhomologous region (region 5) and between the target-homologous region (region 4) and the target-homologous-complementary region (region 2) (ΔG°_nh-PC+ΔG°_h-PC) is more negative than −10 kcal/mol, for example between about −10 kcal/mol and about −70 kcal/mol, between about −10 kcal/mol and about −50 kcal/mol, and between −10 kcal/mol and about −30 kcal/mol.

In addition to enzyme-free nucleic acid detection systems, the probes of the present disclosure are useful in PCR application or other isothermal amplification systems as primer, for example, in hotspot multiplexing PCR reactions. When applying the probes to PCR reactions, undesired amplification can be minimized after careful design and fine-tuning. Therefore, two or more primer systems for non-identical targets can be combined into one solution for hotspot multiplexing PCR. A schematic of hotspot multiplexing PCR is shown in FIG. 21. The sequences of targets in one multiplexing group can be highly similar due to the high specificity characteristic of the primer system. The design process for each primer system in the primer set is the same as that of the probes as described above. The specificity and sensitivity of each primer system could be adjusted according to experimental results. An example of primer systems for hotspot multiplexing PCR is shown in FIG. 22.

In one embodiment, the signal generation method for PCR or other isothermal amplification systems is using fluorophore-modified complement and quencher-modified protector. The protector would detach from the complement as the amplification proceeds, so the fluorescence signal is proportional to the copy number of amplified target. Different targets can be quantitated simultaneously by using spectral non-overlapping fluorophores. A similar signal generation method is using fluorophore-modified complement and quencher-modified protector as self-reporting primers as shown in FIG. 25. Another signal generation method that similar to traditional TaqMan probes is using fluorophore- and quencher-modified complement and non-modified protector as detection probes as shown in FIG. 26. In multiplexing setting, probes that bear different fluorophores can be used for different desired targets. Unlike traditional TaqMan probes that can be applied only when the desired targets are highly different, so that each TaqMan probe only specifically binds to one target and does not interfere with the reaction of other targets, the TaqMan-like probes presented in this disclosure may be uniquely advantageous in distinguishing similar targets.

Each probe system described herein may be comprised of DNA, RNA, or analogs thereof, and/or combinations thereof. In certain instances, a probe system comprises one or more non-natural nucleotides. The incorporation of non-natural nucleotides in the primers can further augment the performance of the probe systems, such as by providing improved per-base binding affinity and increased nuclease resistance.

The probe systems described herein may also be applied in the context of initiating enzymatic reactions; in such uses, the probe systems are referred to as primer systems, though the composition and method of action remains the same. Primer systems as described in this disclosure possess high specificity and capability for fine-tuning of performance, offering advantages to enzymatic assays of nucleic acids.

In certain instances, the primers described herein serve as starting points for polymerase extensions, including but not limited to polymerase chain reaction for replication of DNA templates, transcription for production of RNA from DNA templates, and reverse transcription for production of DNA from RNA templates, isothermal DNA and RNA amplification methods such as Nucleic Acid Sequence Based Amplification (NASBA), Loop mediated isothermal Amplification (LAMP), Helicase-Dependent Amplification (HDA), Recombinase Polymerase Amplification (RPA), isothermal Exponential Amplification Reaction (EXPAR), Nicking Enzyme Amplification Reaction (NEAR), Rolling Circle Amplification (RCA), and Transcription Mediated Amplification (TMA). The high specificity nature of the primers disclosed herein render them suitable for research and clinical applications in which only subsets of nucleic acids with particular sequences are to be extended and amplified.

A “target” for a probe system described herein can be any single-stranded nucleic acid, such as single-stranded DNA and single-stranded RNA, including double-stranded DNA and RNA rendered single-stranded through heat shock, asymmetric amplification, competitive binding, and other methods standard to the art. A “target” for a primer system can be any single-stranded (ss) or double-stranded (ds) nucleic acid, for example, DNA, RNA, or the DNA product of RNA subjected to reverse transcription. In some instances, a target may be a mixture (chimera) of DNA and RNA. In other instances, a target comprises artificial nucleic acid analogs, for example, peptide nucleic acids (Nielsen et al. Science 254(5037): 1497-500 (1991)) or locked nucleic acids (Alexei et al. Tetrahedron 54(14): 3607-30 (1998)). In some instances, a target may be naturally occurring (e.g., genomic DNA) or it may be synthetic (e.g., from a genomic library). As used herein, a “naturally occurring” nucleic acid sequence is a sequence that is present in nucleic acid molecules of organisms or viruses that exist in nature in the absence of human intervention. In some instances, a target is genomic DNA, messenger RNA, ribosomal RNA, micro-RNA, pre-micro-RNA, pro-micro-RNA, long non-coding RNA, small RNA, epigenetically modified DNA, epigenetically modified RNA, viral DNA, viral RNA or piwi-RNA. In certain instances, a target nucleic acid is a nucleic acid that naturally occurs in an organism or virus. In some instances, a target nucleic is the nucleic acid of a pathogenic organism or virus. In certain instances the presence or absence of a target nucleic acid in a subject is indicative that the subject has a disease or disorder or is predisposed to acquire a disease or disorder. In certain instances the presence or absence of a target nucleic acid in a subject is indicative that the subject will respond well or poorly to a treatment, such as a drug, to treat a disease or disorder. In certain instances the presence or absence of a target nucleic acid in a subject is indicative that the subject who has been treated previously for cancer and is in remission may be at risk of cancer recurrence.

The terms “polynucleotide,” “nucleic acid,” “oligonucleotide,” and “nucleic acid molecule” are used interchangeably. They refer to a polymeric form of nucleotides of any length, either deoxyribonucleotides or ribonucleotides, or analogs thereof. Polynucleotides may have any three-dimensional structure, and may perform any function. The following are non-limiting examples of polynucleotides: coding or non-coding regions of a gene or gene fragment, loci (locus) defined from linkage analysis, exons, introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, ribozymes, cDNA, recombinant polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any sequence, isolated RNA of any sequence, nucleic acid probes, and primers. A polynucleotide may comprise modified nucleotides, such as methylated nucleotides and nucleotide analogs. If present, modifications to the nucleotide structure may be imparted before or after assembly of the polymer. A polynucleotide may be further modified, such as by conjugation with a labeling component. The term “recombinant” polynucleotide means a polynucleotide of genomic, cDNA, semi- synthetic, or synthetic origin which either does not occur in nature or is linked to another polynucleotide in a non-natural arrangement. The term “isolated nucleic acid” refers to a polynucleotide of natural or synthetic origin or some combination thereof, which (1) is not associated with the cell in which the “isolated nucleic acid” is found in nature, and/or (2) is operably linked to a polynucleotide to which it is not linked in nature.

A nucleic acid may also encompass single- and double-stranded DNA and RNA, as well as any and all forms of alternative nucleic acid containing modified bases, sugars, and backbones. The term “nucleic acid” thus will be understood to include, but not be limited to, single- or double-stranded DNA or RNA (and forms thereof that can be partially single-stranded or partially double-stranded), cDNA, aptamers, peptide nucleic acids (“PNA”), 2′-5′ DNA (a synthetic material with a shortened backbone that has a base-spacing that matches the A conformation of DNA; 2′-5′ DNA will not normally hybridize with DNA in the B form, but it will hybridize readily with RNA), and locked nucleic acids (“LNA”). Nucleic acid analogues include known analogues of natural nucleotides that have similar or improved binding, hybridization of base -pairing properties. “Analogous” forms of purines and pyrimidines are well known in the art, and include, but are not limited to aziridinylcytosine, 4-acetylcytosine, 5-fluorouracil, 5-bromouracil, 5-carboxymethylaminomethyl-2-thiouracil, 5-carboxymethylaminomethyluracil, inosine, N6-isopentenyladenine, 1-methyladenine, 1-methylpseudouracil, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N.sup.6-methyladenine, 7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueosine, 5-methoxyuracil, 2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid methylester, pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil-5-oxyacetic acid, and 2,6-diaminopurine. DNA backbone analogues provided herein include phosphodiester, phosphorothioate, phosphorodithioate, methylphosphonate, phosphoramidate, alkyl phosphotriester, sulfamate, 3′ -thioacetal, methylene(methylimino), 3′-N-carbamate, morpholino carbamate, and peptide nucleic acids (PNAs), methylphosphonate linkages or alternating methylphosphonate and phosphodiester linkages (Strauss-Soukup, 1997, Biochemistry 36:8692-8698), and benzylphosphonate linkages, as discussed in U.S. Pat. No. 6,664,057; see also OLIGONUCLEOTIDES AND ANALOGUES, A PRACTICAL APPROACH, edited by F. Eckstein, IRL Press at Oxford University Press (1991); Antisense Strategies, Annals of the New York Academy of Sciences, Volume 600, Eds. Baserga and Denhardt (NYAS 1992); Milligan, 1993, J. Med. Chem. 36: 1923-1937; Antisense Research and Applications (1993, CRC Press). The nucleic acids herein can be extracted from cells or synthetically prepared according to any means known to those skilled in the art; for example, the nucleic acids can be chemically synthesized or transcribed or reverse transcribed from cDNA or mRNA, among other sources.

A target nucleic acid utilized herein can be any nucleic acid, for example, human nucleic acids, bacterial nucleic acids, or viral nucleic acids. A target nucleic acid sample or sample comprising a target nucleic acid can be, for example, a nucleic acid sample from one or more biological samples including, but not limited to whole blood, nucleic acids extracted from whole blood, plasma, nucleic acids extracted from plasma, sputum, stool, urine, cheek or nasal swab. cells, tissues, or bodily fluids. Target biological samples can be derived from any source including, but not limited to, eukaryotes, plants, animals, vertebrates, fish, mammals, humans, non-humans, bacteria, microbes, viruses, biological sources, serum, plasma, blood, urine, semen, lymphatic fluid, cerebrospinal fluid, amniotic fluid, biopsies, needle aspiration biopsies, cancers, tumors, tissues, cells, cell lysates, crude cell lysates, tissue lysates, tissue culture cells, buccal swabs, mouthwashes, stool, mummified tissue, forensic sources, autopsies, archeological sources, infections, nosocomial infections, production sources, drug preparations, biological molecule productions, protein preparations, lipid preparations, carbohydrate preparations, inanimate objects, air, soil, sap, metal, fossils, excavated materials, and/or other terrestrial or extra-terrestrial materials and sources. The sample may also contain mixtures of material from one source or different sources. For example, nucleic acids of an infecting bacterium or virus can be amplified along with human nucleic acids when nucleic acids from such infected cells or tissues are amplified using the disclosed methods. Types of useful target samples include eukaryotic samples, plant samples, animal samples, vertebrate samples, fish samples, mammalian samples, human samples, non-human samples, bacterial samples, microbial samples, viral samples, biological samples, serum samples, plasma samples, blood samples, urine samples, semen samples, lymphatic fluid samples, cerebrospinal fluid samples, amniotic fluid samples, biopsy samples, needle aspiration biopsy samples, cancer samples, tumor samples, tissue samples, cell samples, cell lysate samples, crude cell lysate samples, tissue lysate samples, tissue culture cell samples, buccal swab samples, mouthwash samples, stool samples, mummified tissue samples, autopsy samples, archeological samples, infection samples, nosocomial infection samples, production samples, drug preparation samples, biological molecule production samples, protein preparation samples, lipid preparation samples, carbohydrate preparation samples, inanimate object samples, air samples, soil samples, sap samples, metal samples, fossil samples, excavated material samples, and/or other terrestrial or extra-terrestrial samples. In some instances, a target nucleic acids utilized herein comprise repetitive sequence, secondary structure, and/or a high G/C content.

In certain instances, a target nucleic acid molecule of interest is about 19 to about 1,000,000 nucleotides (nt) in length. In some instances, the target is about 19 to about 100, about 100 to about 1000, about 1000 to about 10,000, about 10,000 to about 100,000, or about 100,000 to about 1,000,000 nucleotides in length. In some instances, the target is about 20, about 100, about 200, about 300, about 400, about 500, about 600, about 700, about 800, about 900, about 1,000, about 2,000, about 3,000, about 4,000, about 5,000, about 6,000, about 7,000, about 8,000, about 9000, about 10,000, about 20,000, about 30,000, about 40,000, about 50,000, about 60,000, about 70,000, about 80,000, about 90,000, about 100,000, about 200,000, about 300,000, about 400,000, about 500,000, about 600,000, about 700,000, about 800,000, about 900,000, or about 1,000,000 nucleotides in length. It is to be understood that the target nucleic acid may be provided in the context of a longer nucleic acid (e.g., such as a coding sequence or gene within a chromosome or a chromosome fragment).

In certain instances, a target of interest is linear, while in other instances, a target is circular (e.g., plasmid DNA, mitochondrial DNA, or plastid DNA).

In some instances, provided herein are primer-target systems. A primer-target system comprises one or more nucleic acid targets, a polymerase, and one or more primers (e.g., primer duplex). The term “primer” encompasses any one of the primers or primer systems described herein. In certain instances, the primer-target systems described herein comprise a plurality of different primers. In some instances, a primer-target system can comprise at least two primers, which can be used to identify and, for example amplify, a target nucleic acid molecule. A target nucleic acid molecule may be present amongst a plurality of non-target nucleic acid molecules, for example, as a single copy or in low copy number. Any one of the primer-target systems described herein may comprises conditions similar to those used in nucleic acid amplification or sequencing reactions (e.g., similar reagents, reaction temperature, etc.).

Provided herein are kits comprising (1) at least one complement strand having a target-homologous-complementary region (region 2), a target-nonhomologous-complementary region (region 3), and a target-toehold-complementary region (region 1), and (2) at least one protector strand having a target-homologous region (region 4) and a target-nonhomologous region (region 5). Provided herein are kits comprising at least one primer duplex comprising (1) at least one complement strand having a target-homologous-complementary region, a target-nonhomologous-complementary region, and a target-toehold-complementary region, and (2) at least one protector strand having a target-homologous region and a target-nonhomologous region.

Any one of the kits described herein may further comprise a polymerase, including reverse transcriptase. Any one of the kits provided herein may further comprise one or more agent selected from buffer (e.g., KC1, MgCl2, Tris-HCl), dNTPs (e.g., dATP, dCTP, dGTP, dTTP), and water. Any one of the kits provided herein may comprise protector strand is molar excess of the primer. Any one of the kits provided herein may further comprise instructions or directions for obtaining instructions (e.g., from a website) for using the components of the kits. Any one of the kits provided herein may further comprise at least one reaction tube, well, chamber, or the like.

Unless otherwise indicated, all numbers expressing quantities of ingredients, properties such as molecular weight, reaction conditions, and so forth used in the specification and claims are to be understood as being modified in all instances by the term “about.” Accordingly, unless indicated to the contrary, the numerical parameters set forth in the following specification and attached claims are approximations that may vary depending upon the desired properties sought to be obtained by the present invention. At the very least, and not as an attempt to limit the application of the doctrine of equivalents to the scope of the claims, each numerical parameter should at least be construed in light of the number of reported significant digits and by applying ordinary rounding techniques.

The use of the word “a” or “an” when used in conjunction with the term “comprising” in the claims and/or the specification may mean “one,” but it is also consistent with the meaning of “one or more,” “at least one,” and “one or more than one.” As used herein “another” may mean at least a second or more.

It is contemplated that any instance discussed in this specification can be implemented with respect to any method or composition of the invention, and vice versa. Furthermore, compositions of the invention can be used to achieve the methods of the invention.

Throughout this application, the term “about” is used to indicate that a value includes the inherent variation of error for the device, the method being employed to determine the value, or the variation that exists among the study subjects.

The use of the term “or” in the claims is used to mean “and/or” unless explicitly indicated to refer to alternatives only or the alternatives are mutually exclusive.

As used in this specification and claim(s), the words “comprising” (and any form of comprising, such as “comprise” and “comprises”), “having” (and any form of having, such as “have” and “has”), “including” (and any form of including, such as “includes” and “include”), or “containing” (and any form of containing, such as “contains” and “contain”) are inclusive or open-ended and do not exclude additional, unrecited elements or method steps.

To facilitate a better understanding of the present invention, the following examples of specific instances are given. In no way should the following examples be read to limit or define the entire scope of the invention.

EXAMPLES

Twelve examples of DNA probe systems to an RNA target are shown in FIGS. 9-20.

The following examples demonstrate the design principles, illustrate the mathematics of reaction standard free energy (ΔG°) calculations for the different regions, and exemplify typical probe systems generated in the method described in the present disclosure. These representative examples cover a range of different biological target sequences, are computed for a number of different operation temperatures and salinities. Example 11 furthermore shows the design of a probe intended to operate in a concentration of the denaturant formamide. Also given are the stoichiometric ratios [P]₀/[C]₀needed to satisfy the standard free energy value of Expression 1 being equal to the standard free energy value of Expression 2.

TABLE 4 Standard free energy and stoichiometric data for the probes of Examples 1-12. Ex. ΔG°_h-PC (kcal/mol) ΔG°_nh-PC (kcal/mol) ΔG°_v-TC (kcal/mol) ΔG°_t-TC (kcal/mol) ΔG°_rxn (kcal/mol)

\frac{([P] o - [C] o)}{[C] o}

\frac{[P] o}{[C] o}

1 −24.16 −11.68 −27.60 −8.09 0.15 0.78 1.78 7.8 8.8 0.1 1.10 2 −28.45 −10.02 −32.83 −6.26 −0.61 2.69 3.69 3 −31.15 −7.71 −26.44 −8.61 −1.54 12.14 13.14 4 −17.06 −9.59 −16.75 −10.36 −0.46 2.11 3.11 5 −11.68 −9.20 −13.42 −8.94 −1.49 11.2 12.2 6 −25.43 −5.25 −21.76 −9.12 −0.22 1.43 2.43 7 −12.95 −14.35 −15.94 −10.33 1.03 0.19 1.19 8 −31.64 −11.59 −34.77 −8.73 −0.27 1.55 2.55 9 −22.50 −14.81 −28.20 −9.48 −0.37 1.82 2.82 10 −19.12 −9.45 −19.84 −9.39 −0.66 3.00 4.00 11 −20.56 −11.09 −22.58 −8.75 0.32 0.58 1.58 12 −7.45 −4.43 −7.54 −7.50 −3.07 100.0 101.0

Example 1

Example 1 provides a probe directed to the target nucleic acid BRAF 11-30 as shown in FIG. 9. The following ΔG° values for hybridization of the probe to the target at 37° C., 1M Na⁺ are provided in Table 4: (1) hybridization of target homologous complementary region 2 of complement strand C to target homologous region 4 of protector strand P (ΔG°_h-PC); (2) hybridization of target-nonhomologous-complementary region 3 of complement strand C to target-nonhomologous region 5 of protect strand P (ΔG°_nh-PC); (3) hybridization of target-homologous complementary region 2 of complement strand C to target-validation region 7 of target T (ΔG°_v-TC); (4) hybridization of target-toehold-complementary region 1 of complement strand C to target-toehold region 6 of target T (ΔG°_t-TC); and (5) ΔG°_rxnwhich is ΔG°_t-TC−ΔG°_nh-PC+(ΔG°_v-TC−ΔG°_h-PC) (Expression 1). In addition, the stoichiometric ratio ([P]₀/[C]₀) that allows the value provided by the ΔG°_rxnaccording to Expression 1 to have value identical to that provided by Expression 2 is also provided in Table 4. Finally, the X value provides for the variation in Expression 2 to obtain a value equal to Expression 1 given the corresponding stoichiometric ratios and was 0.00, 1.42, and −1.27 kcal/mol, respectively.

Example 2

Example 2 provides a probe directed to the target nucleic acid BRAF 71-90 as shown in FIG. 10. The following ΔG° values for hybridization of the probe to the target at 37° C., 1M Na⁺ are provided in Table 4: (1) hybridization of target homologous complementary region 2 of complement strand C to target homologous region 4 of protector strand P (ΔG°_h-PC); (2) hybridization of target-nonhomologous-complementary region 3 of complement strand C to target-nonhomologous region 5 of protect strand P (ΔG°_nh-PC); (3) hybridization of target-homologous complementary region 2 of complement strand C to target-validation region 7 of target T (ΔG°_v-TC); (4) hybridization of target-toehold-complementary region 1 of complement strand C to target-toehold region 6 of target T (ΔG°_t-TC); and (5) ΔG°_rxnwhich is ΔG°_t-TC−ΔG°_th-PC+(ΔG°_v-TC−ΔG°_h-PC) (Expression 1). In addition, the stoichiometric ratio ([P]₀/[C]₀) that allows the value provided by the ΔG°_rxnaccording to Expression 1 to have value identical to that provided by Expression 2 is also provided in Table 4.

Example 3

Example 3 provides a probe directed to the target nucleic acid BRAF 131-160 as shown in FIG. 11. The following ΔG° values for hybridization of the probe to the target at 37° C., 1M Na⁺ are provided in Table 4: (1) hybridization of target homologous complementary region 2 of complement strand C to target homologous region 4 of protector strand P (ΔG°_h-PC); (2) hybridization of target-nonhomologous-complementary region 3 of complement strand C to target-nonhomologous region 5 of protect strand P (ΔG°_nh-PC); (3) hybridization of target-homologous complementary region 2 of complement strand C to target-validation region 7 of target T (ΔG°_v-TC); (4) hybridization of target-toehold-complementary region 1 of complement strand C to target-toehold region 6 of target T (ΔG°_t-TC); and (5) ΔG°_rxnwhich is ΔG°_t-TC−ΔG°_nh-PC+(ΔG°_v-TC−ΔG°_h-PC) (Expression 1). In addition, the stoichiometric ratio ([P]₀/[C]₀) that allows the value provided by the ΔG°_rxnaccording to Expression 1 to have value identical to that provided by Expression 2 is also provided in Table 4.

Example 4

Example 4 provides a probe directed to the target nucleic acid BRAF 191-220 as shown in FIG. 12. The following ΔG° values for hybridization of the probe to the target at 52° C., 1M Na⁺ are provided in Table 4: (1) hybridization of target homologous complementary region 2 of complement strand C to target homologous region 4 of protector strand P (ΔG°_h-PC); (2) hybridization of target-nonhomologous-complementary region 3 of complement strand C to target-nonhomologous region 5 of protect strand P (ΔG°_nh-PC); (3) hybridization of target-homologous complementary region 2 of complement strand C to target-validation region 7 of target T (ΔG°_v-TC); (4) hybridization of target-toehold-complementary region 1 of complement strand C to target-toehold region 6 of target T (ΔG°_t-TC); and (5) ΔG°_rxnwhich is ΔG°_t-TC−ΔG°_nh-PC+(ΔG°_v-TC−ΔG°_h-PC) (Expression 1). In addition, the stoichiometric ratio ([P]₀/[C]₀) that allows the value provided by the ΔG°_rxnaccording to Expression 1 to have value identical to that provided by Expression 2 is also provided in Table 4.

Example 5

Example 5 provides a probe directed to the target nucleic acid BRAF 251-280 as shown in FIG. 13. The following ΔG° values for hybridization of the probe to the target at 65° C., 1M Na⁺ are provided in Table 4: (1) hybridization of target homologous complementary region 2 of complement strand C to target homologous region 4 of protector strand P (ΔG°_h-PC); (2) hybridization of target-nonhomologous-complementary region 3 of complement strand C to target-nonhomologous region 5 of protect strand P (ΔG°_nh-PC); (3) hybridization of target-homologous complementary region 2 of complement strand C to target-validation region 7 of target T (ΔG°_v-TC); (4) hybridization of target-toehold-complementary region 1 of complement strand C to target-toehold region 6 of target T (ΔG°_t-TC); and (5) ΔG°_rxnwhich is ΔG°_t-TC−ΔG°_nh-PC+(ΔG°_v-TC−ΔG°_h-PC) (Expression 1). In addition, the stoichiometric ratio ([P]₀/[C]₀) that allows the value provided by the ΔG°_rxnaccording to Expression 1 to have value identical to that provided by Expression 2 is also provided in Table 4.

Example 6

Example 6 provides a probe directed to the target nucleic acid BRAF 311-350 as shown in FIG. 14. The following ΔG° values for hybridization of the probe to the target at 52° C., 1M Na⁺ are provided in Table 4: (1) hybridization of target homologous complementary region 2 of complement strand C to target homologous region 4 of protector strand P (ΔG°_h-PC); (2) hybridization of target-nonhomologous-complementary region 3 of complement strand C to target-nonhomologous region 5 of protect strand P (ΔG°_nh-PC); (3) hybridization of target-homologous complementary region 2 of complement strand C to target-validation region 7 of target T (ΔG°_v-TC); (4) hybridization of target-toehold-complementary region 1 of complement strand C to target-toehold region 6 of target T (ΔG°_t-TC); and (5) ΔG°_rxnwhich is ΔG°_t-TC−ΔG°_nh-PC+(ΔG°_v-TC−ΔG°_h-PC) (Expression 1). In addition, the stoichiometric ratio ([P]₀/[C]₀) that allows the value provided by the ΔG°_rxnaccording to Expression 1 to have value identical to that provided by Expression 2 is also provided in Table 4.

Example 7

Example 7 provides a probe directed to the target nucleic acid BRAF 431-460 as shown in FIG. 15. The following ΔG° values for hybridization of the probe to the target at 65° C., 1M Na⁺ are provided in Table 4: (1) hybridization of target homologous complementary region 2 of complement strand C to target homologous region 4 of protector strand P (ΔG°_h-PC); (2) hybridization of target-nonhomologous-complementary region 3 of complement strand C to target-nonhomologous region 5 of protect strand P (ΔG°_nh-PC); (3) hybridization of target-homologous complementary region 2 of complement strand C to target-validation region 7 of target T (ΔG°_v-TC); (4) hybridization of target-toehold-complementary region 1 of complement strand C to target-toehold region 6 of target T (ΔG°_t-TC); and (5) ΔG°_rxnwhich is ΔG°_t-TC−ΔG°_nh-PC+(ΔG°_v-TC−ΔG°_h-PC) (Expression 1). In addition, the stoichiometric ratio ([P]₀/[C]₀) that allows the value provided by the ΔG°_rxnaccording to Expression 1 to have value identical to that provided by Expression 2 is also provided in Table 4.

Example 8

Example 8 provides a probe directed to the target nucleic acid BRAF 491-520 as shown in FIG. 16. The following ΔG° values for hybridization of the probe to the target at 37° C., 1M Na⁺ are provided in Table 4: (1) hybridization of target homologous complementary region 2 of complement strand C to target homologous region 4 of protector strand P (ΔG°_v-TC); (2) hybridization of target-nonhomologous-complementary region 3 of complement strand C to target-nonhomologous region 5 of protect strand P (ΔG°_nh-PC); (3) hybridization of target-homologous complementary region 2 of complement strand C to target-validation region 7 of target T (ΔG°_v-TC); (4) hybridization of target-toehold-complementary region 1 of complement strand C to target-toehold region 6 of target T (ΔG°_t-TC); and (5) ΔG°_rxnwhich is ΔG°_t-TC−ΔG°_nh-PC+(ΔG°_v-TC−ΔG°_h-PC) (Expression 1). In addition, the stoichiometric ratio ([P]₀/[C]₀) that allows the value provided by the ΔG°_rxnaccording to Expression 1 to have value identical to that provided by Expression 2 is also provided in Table 4.

Example 9

Example 9 provides a probe directed to the target nucleic acid BRAF 551-580 as shown in FIG. 17. The following ΔG° values for hybridization of the probe to the target at 37° C., 1M Na⁺ are provided in Table 4: (1) hybridization of target homologous complementary region 2 of complement strand C to target homologous region 4 of protector strand P (ΔG°_h-PC); (2) hybridization of target-nonhomologous-complementary region 3 of complement strand C to target-nonhomologous region 5 of protect strand P (ΔG°_nh-PC); (3) hybridization of target-homologous complementary region 2 of complement strand C to target-validation region 7 of target T (ΔG°_v-TC); (4) hybridization of target-toehold-complementary region 1 of complement strand C to target-toehold region 6 of target T (ΔG°_t-TC); and (5) ΔG°_rxnwhich is ΔG°_t-TC−ΔG°_nh-PC+(ΔG°_v-TC−ΔG°_h-PC) (Expression 1). In addition, the stoichiometric ratio ([P]₀/[C]₀) that allows the value provided by the ΔG°_rxnaccording to Expression 1 to have value identical to that provided by Expression 2 is also provided in Table 4.

Example 10

Example 10 provides a probe directed to the target nucleic acid BRAF 611-630 as shown in FIG. 18. The following ΔG° values for hybridization of the probe to the target at 25° C., 1M Na⁺ are provided in Table 4: (1) hybridization of target homologous complementary region 2 of complement strand C to target homologous region 4 of protector strand P (ΔG°_h-PC); (2) hybridization of target-nonhomologous-complementary region 3 of complement strand C to target-nonhomologous region 5 of protect strand P (ΔG°_nh-PC); (3) hybridization of target-homologous complementary region 2 of complement strand C to target-validation region 7 of target T (ΔG°_v-TC); (4) hybridization of target-toehold-complementary region 1 of complement strand C to target-toehold region 6 of target T (ΔG°_t-TC); and (5) ΔG°_rxnwhich is ΔG°_t-TC−ΔG°_nh-PC+(ΔG°_v-TC−ΔG°_h-PC) (Expression 1). In addition, the stoichiometric ratio ([P]₀/[C]₀) that allows the value provided by the ΔG°_rxnaccording to Expression 1 to have value identical to that provided by Expression 2 is also provided in Table 4.

Example 11

Example 11 provides a probe directed to the target nucleic acid BRAF 670-700 as shown in FIG. 19. The following ΔG° values for hybridization of the probe to the target at 25° C., 1M Na⁺ in 30% formamide are provided in Table 4: (1) hybridization of target homologous complementary region 2 of complement strand C to target homologous region 4 of protector strand P (ΔG°_h-PC); (2) hybridization of target-nonhomologous-complementary region 3 of complement strand C to target-nonhomologous region 5 of protect strand P (ΔG°_nh-PC); (3) hybridization of target-homologous complementary region 2 of complement strand C to target-validation region 7 of target T (ΔG°_v-TC); (4) hybridization of target-toehold-complementary region 1 of complement strand C to target-toehold region 6 of target T (ΔG°_t-TC); and (5) ΔG°_rxnwhich is ΔG°_t-TC−ΔG°_nh-PC+(ΔG°_v-TC−ΔG°_h-PC) (Expression 1). In addition, the stoichiometric ratio ([P]₀/[C]₀) that allows the value provided by the ΔG°_rxnaccording to Expression 1 to have value identical to that provided by Expression 2 is also provided in Table 4.

Example 12 provides a probe directed to a DNA target nucleic acid as shown in FIG. 20. The following ΔG° values for hybridization of the probe to the target at 62° C., 3 mM Mg²⁺ are provided in Table 4: (1) hybridization of target homologous complementary region 2 of complement strand C to target homologous region 4 of protector strand P (ΔG°_h-PC); (2) hybridization of target-nonhomologous-complementary region 3 of complement strand C to target-nonhomologous region 5 of protect strand P (ΔG°_nh-PC); (3) hybridization of target-homologous complementary region 2 of complement strand C to target-validation region 7 of target T (ΔG°_v-TC); (4) hybridization of target-toehold-complementary region 1 of complement strand C to target-toehold region 6 of target T (ΔG°_t-TC); and (5) ΔG°_rxnwhich is ΔG°_t-TC−ΔG°_nh-PC+(ΔG°_v-TC−ΔG°_h-PC) (Expression 1). In addition, the stoichiometric ratio ([P]₀/[C]₀) that allows the value provided by the ΔG°_rxnaccording to Expression 1 to have value identical to that provided by Expression 2 is also provided in Table 4.

Notwithstanding that the numerical ranges and parameters setting forth the broad scope of the invention are approximations, the numerical values set forth in the specific examples are reported as precisely as possible. Any numerical value, however, inherently contain certain errors necessarily resulting from the standard deviation found in their respective testing measurements as well as experimental error in literature-reported values.

Therefore, the present invention is well adapted to attain the ends and advantages mentioned as well as those that are inherent therein. While numerous changes may be made by those skilled in the art, such changes are encompassed within the spirit of this invention as illustrated, in part, by the appended claims.

Claims

1. A composition for selective interaction with a target nucleic acid molecule, the composition comprising:

a first concentration of a first nucleic acid strand comprising a first region, second region and third region;

a second concentration of a second nucleic acid strand comprising a fourth region and fifth region, wherein the target nucleic acid molecule comprises a sixth region and a seventh region;

wherein the first and second concentrations are such that an interaction between the composition and the target nucleic acid molecule possesses a standard free energy as determined by Expression 1 within 5 kcal/mol of a standard free energy as determined by Expression 2, where the [P]0 term of Expression 2 equals the second concentration and the [C]0 term of Expression 2 equals the first concentration; and

wherein the ΔG°t-TC term of Expression 1 represents the standard free energy of hybridization between the sixth region and the first region, wherein the ΔG°nh-PC term of Expression 1 represents the standard free energy of hybridization between the fifth region and the third region, wherein the ΔG°v-TC term of Expression 1 represents the standard free energy of hybridization between the seventh region and the second region, and wherein the ΔG°h-PC term of Expression 1 represents the standard free energy of hybridization between the fourth region and the second region.

2. The composition of claim 1 further comprising a label conjugated to the first nucleic acid strand and wherein the label is selected from the group consisting of organic fluorophores, haptens, nanoparticles, and radioisotopes.

3. The composition of claim 2 further comprising a label conjugated to the second nucleic acid strand and wherein the label is selected from the group consisting of organic fluorophores, haptens, nanoparticles, and radioisotopes, and wherein Expression 3 is substituted for Expression 1.

4. The composition of claim 1 wherein the values for ΔG°t-TC and ΔG°nh-PC differ by more than 10%, or differ by more than 1 kcal/mol.

5. The composition of claim 1 wherein the value of ΔG°t-TC−ΔG°nh-PC is not between −1 kcal/mol and +1 kcal/mol.

6. The composition of claim 1 wherein ΔG°t-TC is from about −2 kcal/mol to about −16 kcal/mol.

7. The composition of claim 1 wherein the first nucleic acid strand and the second nucleic acid strand form a partially double-stranded nucleic acid probe or primer.

8. The composition of claim 27 further comprising an excess of the second nucleic acid strand.

9. The composition of claim 1 wherein fewer than 50% of the nucleotides in the first region are in a double-stranded state in the evaluated minimum free energy structure, as computed in the operational temperature and salinity conditions.

10. A process comprising the steps of:

selecting a target nucleotide sequence in a nucleic acid molecule, the target nucleotide sequence comprising a sixth nucleotide subsequence, a seventh nucleotide subsequence, and an eighth nucleotide subsequence;

selecting a first nucleotide sequence comprising a first nucleotide subsequence, a second nucleotide subsequence, and a third nucleotide subsequence;

selecting a second nucleotide sequence comprising a fourth nucleotide subsequence and a fifth nucleotide subsequence;

calculating a standard free energy using Expression 1, wherein the ΔG°t-TC term of Expression 1 represents the standard free energy of hybridization between the sixth region and the first region, wherein the ΔG°nh-PC term of Expression 1 represents the free energy of hybridization between the fifth region and the third region, wherein the ΔG°v-TC term of Expression 1 represents the standard free energy of hybridization between the seventh region and the second region, and wherein the ΔG°h-PC term of Expression 1 represents the standard free energy of hybridization between the fourth region and the second region; and

determining if the standard free energy calculated using Expression 1 meets a predetermined condition; and

synthesizing a first nucleic acid strand comprising the first nucleotide sequence and a second nucleic acid strand comprising the second nucleotide sequence if the predetermined condition is met.

11. The process of claim 10 further comprising the step of calculating a standard free energy using Expression 2, wherein the predetermined condition comprises being within 5 kcal/mol of the standard free energy calculated using Expression 2, and wherein the terms [C]0 and [P]0 of Expression 2 represent a predetermined concentration of the first nucleic acid strand and the second nucleic acid strand, respectively.

12. The process of claim 11 wherein in the instance the standard free energy as determined by Expression 1 is not within 5 kcal/mol of the standard free energy as determined by Expression 2, then the process is repeated at least in part until the standard free energy as determined by Expression 1 is within 5 kcal/mol of the standard free energy as determined by Expression 2 such that the process further comprises at least one of the following steps:

(1) selecting a new set of nucleotide sequences and subsequences; and

(2) modifying the predetermined concentration of at least one of the first nucleic acid strand or the second nucleic acid strand.

13. The process of claim 10 wherein the predetermined condition comprises the standard free energy calculated by Expression 1 being from about −4 kcal/mol to about +4 kcal/mol.

14. A composition for selective interaction with a target nucleic acid molecule, the composition comprising:

a first nucleic acid strand comprising a first region, a second region, and a third region, wherein said first region possesses a nucleotide sequence that is complementary to a nucleotide sequence of a sixth region of the target nucleic acid molecule, wherein the second region possesses a nucleotide sequence that is complementary to a nucleotide sequence of a seventh region of the target nucleic acid molecule; and

a second nucleic acid strand comprising a fourth region and a fifth region, wherein the fourth region possesses a nucleotide sequence that is complementary to the nucleotide sequence of the second region, and wherein the fifth region possesses a nucleotide sequence that is complementary to the nucleotide sequence of the third region;

wherein the composition possesses a standard free energy of hybridization with the target nucleic acid molecule from about −4 kcal/mol to about +4 kcal/mol, wherein the standard free energy of hybridization is determined by Expression 1, wherein the ΔG°t-TC term of Expression 1 represents the standard free energy of hybridization between the first region and the sixth region, wherein the ΔG°nh-PC term of Expression 1 represents the free energy of hybridization between the third region and the fifth region, wherein the ΔG°v-TC term of Expression 1 represents the standard free energy of hybridization between the seventh region and the second region, and wherein the ΔG°h-PC term of Expression 1 represents the standard free energy of hybridization between the fourth region and the second region.

15. The composition of claim 14 comprising a first concentration of the first nucleic acid strand and a second concentration of the second nucleic acid strand, wherein the standard free energy of hybridization of the composition with respect to the target nucleic acid molecule as determined by Expression 1 is not from about −4 kcal/mol to about +4 kcal/mol, but instead is within 3 kcal/mol of a standard free energy of hybridization as determined by Expression 2, wherein the terms [C]0 and [P]0 of Expression 2 represent the first concentration and the second concentration, respectively.

16. The composition of claim 14 wherein ΔG°t-TC is from about −5 kcal/mol to about −15 kcal/mol.

17. The composition of claim 14 wherein the first nucleic acid strand and the second nucleic acid strand form a partially double-stranded nucleic acid molecule, and wherein the first region possesses no secondary structure.

18. The composition of claim 14 wherein the sum of the standard free energy of hybridization between the first region and the sixth region and between the second region and the seventh region (ΔG°t-TC+ΔG°v-TC) is more negative than −15 kcal/mol.

19. The composition of claim 14 wherein the sum of the standard free energy of hybridization between the third region and the fifth region and between the between the fourth region and the second region (ΔG°nh-PC+ΔG°h-PC) is more negative than −15 kcal/mol.

20. A method comprising the steps of:

applying a first composition to a sample to determine the presence or quantity of a first target nucleic acid molecule at an operating condition, wherein the first composition is the composition of claim 1; and

wherein the operating condition comprises a temperature between about 4° C. and about 75° C. to permit rapid hybridization of the first composition to the first target nucleic acid molecule.

21. The method of claim 20 further comprising the step of applying denaturants or crowding agents to the sample, wherein denaturants or crowding agents are selected form the group consisting of formamide, ethanol, methanol, Tween, Triton, sodium dodecasulfate, DMSO, polyethylene glycol, and dextran.

22. The method of claim 20 further comprising applying a second composition to the sample to determine the presence or quantity of a second target nucleic acid molecule, wherein the second composition is the composition of claim 1, and wherein the first and second target nucleic acid sequences overlap by at least five nucleotides thereby providing an overlapping region, wherein within the overlapping region, the first target nucleic acid sequences differ from the second target nucleic acid sequence by one or two nucleotides.

23. The method of claim 22 wherein the operation condition is sufficient to amplify the first and second target nucleic acid sequences.

24. The method of claim 22 wherein the operation conditions is sufficient for polymerase chain reaction.