siRNA targeting nuclear receptors

Info

Publication number: 20080076908
Type: Application
Filed: Oct 24, 2007
Publication Date: Mar 27, 2008
Applicant: DHARMACON, INC. (Lafayette, CO)
Inventors: Anastasia Khvorova (Boulder, CO), Angela Reynolds (Conifer, CO), Devin Leake (Denver, CO), William Marshall (Boulder, CO), Steven Read (Denver, CO), Stephen Scaringe (Lafayette, CO)
Application Number: 11/977,347

Abstract

Efficient sequence specific gene silencing is possible through the use of siRNA technology. By selecting particular siRNAs by rational design, one can maximize the generation of an effective gene silencing reagent, as well as methods for silencing genes. Methods, compositions, and kits generated through rational design of siRNAs are disclosed including those directed to nuclear receptors.

Description

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation-in-part of U.S. Ser. No. 10/714,333, filed Nov. 14, 2003, which claims the benefit of U.S. Provisional Application No. 60/426,137, filed Nov. 14, 2002, and also claims the benefit of U.S. Provisional Application No. 60/502,050, filed Sep. 10, 2003; this application is also a continuation-in-part of U.S. Ser. No. 10/940,892, filed Sep. 14, 2004, which is a continuation of PCT Application No. PCT/US04/14885, international filing date May 12, 2004. The disclosures of the priority applications, including the sequence listings and tables submitted in electronic form in lieu of paper, are incorporated by reference into the instant specification.

SEQUENCE LISTING

The sequence listing for this application has been submitted in accordance with 37 CFR § 1.52(e) and 37 CFR § 1.821 on CD-ROM in lieu of paper on a disk containing the sequence listing file entitled “DHARMA_—2100_US76_CRF.txt” created Sep. 27, 2007, 336 kb. Applicants hereby incorporate by reference the sequence listing provided on CD-ROM in lieu of paper into the instant specification.

FIELD OF INVENTION

The present invention relates to RNA interference (“RNAi”).

BACKGROUND OF THE INVENTION

Relatively recently, researchers observed that double stranded RNA (“dsRNA”) could be used to inhibit protein expression. This ability to silence a gene has broad potential for treating human diseases, and many researchers and commercial entities are currently investing considerable resources in developing therapies based on this technology.

Double stranded RNA induced gene silencing can occur on at least three different levels: (i) transcription inactivation, which refers to RNA guided DNA or histone methylation; (ii) siRNA induced mRNA degradation; and (iii) mRNA induced transcriptional attenuation.

It is generally considered that the major mechanism of RNA induced silencing (RNA interference, or RNAi) in mammalian cells is mRNA degradation. Initial attempts to use RNAi in mammalian cells focused on the use of long strands of dsRNA. However, these attempts to induce RNAi met with limited success, due in part to the induction of the interferon response, which results in a general, as opposed to a target-specific, inhibition of protein synthesis. Thus, long dsRNA is not a viable option for RNAi in mammalian systems.

More recently it has been shown that when short (18-30 bp) RNA duplexes are introduced into mammalian cells in culture, sequence-specific inhibition of target mRNA can be realized without inducing an interferon response. Certain of these short dsRNAs, referred to as small inhibitory RNAs (“siRNAs”), can act catalytically at sub-molar concentrations to cleave greater than 95% of the target mRNA in the cell. The mechanisms of siRNA activity, as well as some of its applications, are described in Provost et al. (2002) Ribonuclease Activity and RNA Binding of Recombinant Human Dicer, EMBO J. 21(21): 5864-5874; Tabara et al. (2002) The dsRNA Binding Protein RDE-4 Interacts with RDE-1, DCR-1 and a DexH-box Helicase to Direct RNAi in C. elegans, Cell 109(7):861-71; Ketting et al. (2002) Dicer Functions in RNA Interference and in Synthesis of Small RNA Involved in Developmental Timing in C. elegans; Martinez et al., Single-Stranded Antisense siRNAs Guide Target RNA Cleavage in RNAi, Cell 110(5):563; and Hutvagner & Zamore (2002) A microRNA in a multiple-turnover RNAi enzyme complex, Science 297:2056.

From a mechanistic perspective, long double stranded RNA introduced into plants and invertebrate cells is broken down into siRNA by a Type III endonuclease known as Dicer. Sharp, RNA interference-2001, Genes Dev. 2001, 15:485. Dicer, a ribonuclease-III-like enzyme, processes the dsRNA into 19-23 base pair short interfering RNAs with characteristic two base 3′ overhangs. Bernstein, Caudy, Hammond, & Hannon (2001) Role for a bidentate ribonuclease in the initiation step of RNA interference, Nature 409:363. The siRNAs are then incorporated into an RNA-induced silencing complex (RISC) where one or more helicases unwind the siRNA duplex, enabling the complementary antisense strand to guide target recognition. Nykanen, Haley, & Zamore (2001) ATP requirements and small interfering RNA structure in the RNA interference pathway, Cell 107:309. Upon binding to the appropriate target mRNA, one or more endonucleases within the RISC cleave the target to induce silencing. Elbashir, Lendeckel, & Tuschl (2001) RNA interference is mediated by 21- and 22-nucleotide RNAs, Genes Dev. 15:188, FIG. 1.

The interference effect can be long lasting and may be detectable after many cell divisions. Moreover, RNAi exhibits sequence specificity. Kisielow, M. et al. (2002) Isoform-specific knockdown and expression of adaptor protein ShcA using small interfering RNA, J. Biochem. 363:1-5. Thus, the RNAi machinery can specifically knock down one type of transcript, while not affecting closely related mRNA. These properties make siRNA a potentially valuable tool for inhibiting gene expression and studying gene function and drug target validation. Moreover, siRNAs are potentially useful as therapeutic agents against: (1) diseases that are caused by over-expression or misexpression of genes; and (2) diseases brought about by expression of genes that contain mutations.

Successful siRNA-dependent gene silencing depends on a number of factors. One of the most contentious issues in RNAi is the question of the necessity of siRNA design, i.e., considering the sequence of the siRNA used. Early work in C. elegans and plants circumvented the issue of design by introducing long dsRNA (see, for instance, Fire, A. et al. (1998) Nature 391:806-811). In these organisms, long dsRNA molecules are cleaved into siRNA by Dicer, thus generating a diverse population of duplexes that can potentially cover the entire transcript. While some fraction of these molecules are non-functional (i.e., induce little or no silencing), one or more have the potential to be highly functional, thereby silencing the gene of interest and alleviating the need for siRNA design. Unfortunately, due to the interferon response, this same approach is unavailable for mammalian systems. While the interferon response can be circumvented by bypassing the Dicer cleavage step and directly introducing siRNA, this tactic carries with it the risk that the chosen siRNA sequence may be non-functional or semi-functional.

A number of researchers have expressed the view that siRNA design is not a crucial element of RNAi. On the other hand, others in the field have begun to explore the possibility that RNAi can be made more efficient by paying attention to the design of the siRNA. Unfortunately, none of the reported methods have provided a satisfactory scheme for reliably selecting siRNA with acceptable levels of functionality. Accordingly, there is a need to develop rational criteria by which to select siRNA with an acceptable level of functionality, and to identify siRNA that have this improved level of functionality, as well as to identify siRNAs that are hyperfunctional.

SUMMARY OF THE INVENTION

The present invention is directed to increasing the efficiency of RNAi, particularly in mammalian systems. Accordingly, the present invention provides kits, siRNAs and methods for increasing siRNA efficacy.

According to a first embodiment, the present invention provides a kit for gene silencing, wherein said kit is comprised of a pool of at least two siRNA duplexes, each of which is comprised of a sequence that is complementary to a portion of the sequence of one or more target messenger RNA, and each of which is selected using non-target specific criteria.

According to a second embodiment, the present invention provides a method for selecting an siRNA, said method comprising applying selection criteria to a set of potential siRNA that comprise 18-30 base pairs, wherein said selection criteria are non-target specific criteria, and said set comprises at least two siRNAs and each of said at least two siRNAs contains a sequence that is at least substantially complementary to a target gene; and determining the relative functionality of the at least two siRNAs.

According to a third embodiment, the present invention also provides a method for selecting an siRNA wherein said selection criteria are embodied in a formula comprising:
Formula VIII: (−14)*G₁₃−13*A₁−12*U₇−11*U₂−10*A₁₁−10*U₄−10*C₃−10*C₅−10*C₆−9*A₁₀−9*U₉−9*C₁₈−8*G₁₀−7*U₁−7*U₁₆−7*C₁₇−7*C₁₉+7*U₁₇+8*A₂+8*A₄+8*A₅+8*C₄+9*G₈+10*A₇+10*U₁₈+11*A₁₉+11*C₉+15*G₁+18*A₃+19*U₁₀−Tm−3*(GC_total)−6*(GC_15-19)−30*X; or
Formula X: (−8)*A1+(−1)*A2+(12)*A3+(7)*A4+(18)*A5+(12)*A6+(19)*A7+(6)*A8+(−4)*A9+(−5)*A10+(−2)*A11+(−5)*A12+(17)*A13+(−3)*A14+(4)*A15+(2)*A16+(8)*A17+(11)*A18+(30)*A19+(−13)*U1+(−1)*U2+(2)*U3+(−2)*U4+(−5)*U5+(5)*U6+(−2)*U7+(−10)*U8+(−5)*U9+(15)*U10+(−1)*U11+(0)*U12+(10)*U13+(−9)*U14+(−13)*U15+(−10)*U16+(3)*U17+(9)*U18+(9)*U19+(7)*C1+(3)*C2+(−21)*C3+(5)*C4+(−9)*C5+(−20)*C6+(−18)*C7+(−5)*C8+(5)*C9+(1)*C10+(2)*C11+(−5)*C12+(−3)*C13+(−6)*C14+(−2)*C15+(−5)*C16+(−3)*C17+(−12)*C18+(−18)*C19+(14)*G1+(8)*G2+(7)*G3+(−10)*G4+(−4)*G5+(2)*G6+(1)*G7+(9)*G8+(5)*G9+(−11)*G10+(1)*G11+(9)*G12+(−24)*G13+(18)*G14+(11)*G15+(13)*G16+(−7)*G17+(−9)*G18+(−22)*G19+6*(number of A+U in positions 15-19)−3*(number of G+C in whole siRNA),
wherein position numbering begins at the 5′-most position of a sense strand, and
A₁=1 if A is the base at position 1 of the sense strand, otherwise its value is 0;
A₂=1 if A is the base at position 2 of the sense strand, otherwise its value is 0;
A₃=1 if A is the base at position 3 of the sense strand, otherwise its value is 0;
A₄=1 if A is the base at position 4 of the sense strand, otherwise its value is 0;
A₅=1 if A is the base at position 5 of the sense strand, otherwise its value is 0;
A₆=1 if A is the base at position 6 of the sense strand, otherwise its value is 0;
A₇=1 if A is the base at position 7 of the sense strand, otherwise its value is 0;
A₁₀=1 if A is the base at position 10 of the sense strand, otherwise its value is 0;
A₁₁=1 if A is the base at position 11 of the sense strand, otherwise its value is 0;
A₁₃=1 if A is the base at position 13 of the sense strand, otherwise its value is 0;
A₁₉=1 if A is the base at position 19 of the sense strand, otherwise if another base is present or the sense strand is only 18 base pairs in length, its value is 0;
C₃=1 if C is the base at position 3 of the sense strand, otherwise its value is 0;
C₄=1 if C is the base at position 4 of the sense strand, otherwise its value is 0;
C₅=1 if C is the base at position 5 of the sense strand, otherwise its value is 0;
C₆=1 if C is the base at position 6 of the sense strand, otherwise its value is 0;
C₇=1 if C is the base at position 7 of the sense strand, otherwise its value is 0;
C₉=1 if C is the base at position 9 of the sense strand, otherwise its value is 0;
C₁₇=1 if C is the base at position 17 of the sense strand, otherwise its value is 0;
C₁₈=1 if C is the base at position 18 of the sense strand, otherwise its value is 0;
C₁₉=1 if C is the base at position 19 of the sense strand, otherwise if another base is present or the sense strand is only 18 base pairs in length, its value is 0;
G₁=1 if G is the base at position 1 on the sense strand, otherwise its value is 0;
G₂=1 if G is the base at position 2 of the sense strand, otherwise its value is 0;
G₈=1 if G is the base at position 8 on the sense strand, otherwise its value is 0;
G₁₀=1 if G is the base at position 10 on the sense strand, otherwise its value is 0;
G₁₃=1 if G is the base at position 13 on the sense strand, otherwise its value is 0;
G₁₉=1 if G is the base at position 19 of the sense strand, otherwise if another base is present or the sense strand is only 18 base pairs in length, its value is 0;
U₁=1 if U is the base at position 1 on the sense strand, otherwise its value is 0;
U₂=1 if U is the base at position 2 on the sense strand, otherwise its value is 0;
U₃=1 if U is the base at position 3 on the sense strand, otherwise its value is 0;
U₄=1 if U is the base at position 4 on the sense strand, otherwise its value is 0;
U₇=1 if U is the base at position 7 on the sense strand, otherwise its value is 0;
U₉=1 if U is the base at position 9 on the sense strand, otherwise its value is 0;
U₁₀=1 if U is the base at position 10 on the sense strand, otherwise its value is 0;
U₁₅=1 if U is the base at position 15 on the sense strand, otherwise its value is 0;
U₁₆=1 if U is the base at position 16 on the sense strand, otherwise its value is 0;
U₁₇=1 if U is the base at position 17 on the sense strand, otherwise its value is 0;
U₁₈=1 if U is the base at position 18 on the sense strand, otherwise its value is 0;
GC_15-19=the number of G and C bases within positions 15-19 of the sense strand, or within positions 15-18 if the sense strand is only 18 base pairs in length;
GC_total=the number of G and C bases in the sense strand;
Tm=100 if the siRNA oligo has an internal repeat longer than 4 base pairs, otherwise its value is 0; and
X=the number of times that the same nucleotide repeats four or more times in a row.
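
As an illustration of how Formula VIII might be evaluated for a single sense strand, the following is a minimal Python sketch, not the applicants' implementation. The function and variable names are hypothetical. Because the text does not specify how an "internal repeat" is detected, the Tm term is left as a caller-supplied value (0 or 100), and X is taken to be the number of distinct runs of four or more identical nucleotides; both choices are assumptions.

```python
import re

# Position-specific weights of Formula VIII, keyed by (base, 1-based position
# on the sense strand). Values are transcribed from the formula text.
FORMULA_VIII_WEIGHTS = {
    ("G", 13): -14, ("A", 1): -13, ("U", 7): -12, ("U", 2): -11,
    ("A", 11): -10, ("U", 4): -10, ("C", 3): -10, ("C", 5): -10,
    ("C", 6): -10, ("A", 10): -9, ("U", 9): -9, ("C", 18): -9,
    ("G", 10): -8, ("U", 1): -7, ("U", 16): -7, ("C", 17): -7,
    ("C", 19): -7, ("U", 17): 7, ("A", 2): 8, ("A", 4): 8,
    ("A", 5): 8, ("C", 4): 8, ("G", 8): 9, ("A", 7): 10,
    ("U", 18): 10, ("A", 19): 11, ("C", 9): 11, ("G", 1): 15,
    ("A", 3): 18, ("U", 10): 19,
}


def count_homopolymer_runs(seq, min_len=4):
    """Count runs of min_len or more identical nucleotides (assumed meaning of X)."""
    pattern = r"(.)\1{%d,}" % (min_len - 1)
    return len(re.findall(pattern, seq))


def formula_viii_score(sense, tm_term=0):
    """Score an 18- or 19-nt sense strand (5' to 3') with Formula VIII.

    tm_term is the Tm value from the definitions (0 or 100); detecting the
    internal repeat is left to the caller because the text does not define it.
    """
    sense = sense.upper().replace("T", "U")
    if len(sense) not in (18, 19):
        raise ValueError("this sketch expects an 18- or 19-nt sense strand")
    positional = sum(
        weight
        for (base, pos), weight in FORMULA_VIII_WEIGHTS.items()
        if pos <= len(sense) and sense[pos - 1] == base
    )
    gc_total = sum(base in "GC" for base in sense)
    gc_15_19 = sum(base in "GC" for base in sense[14:19])  # 15-18 for an 18-mer
    x = count_homopolymer_runs(sense)
    return positional - tm_term - 3 * gc_total - 6 * gc_15_19 - 30 * x
```

For the hypothetical 19-mer GAACAGAGCUGAACUAUUA, the positional terms sum to 134, GC_total is 7, and GC_15-19, Tm and X are all 0, so the sketch returns 134 − 21 = 113.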

According to a fourth embodiment, the invention provides a method for developing an algorithm for selecting siRNA, said method comprising: (a) selecting a set of siRNA; (b) measuring gene silencing ability of each siRNA from said set; (c) determining relative functionality of each siRNA; (d) determining improved functionality by the presence or absence of at least one variable selected from the group consisting of the presence or absence of a particular nucleotide at a particular position, the total number of As and Us in positions 15-19, the number of times that the same nucleotide repeats within a given sequence, and the total number of Gs and Cs; and (e) developing an algorithm using the information of step (d).

According to a fifth embodiment, the present invention provides a kit, wherein said kit is comprised of at least two siRNAs, wherein said at least two siRNAs comprise a first optimized siRNA and a second optimized siRNA, wherein said first optimized siRNA and said second optimized siRNA are optimized according to a formula comprising Formula X.
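
By way of illustration only, Formula X can be read as a table of one weight per base per position plus two composition terms. The Python sketch below scores 19-nt sense strands that way and picks the two highest-scoring candidates, as one possible way of choosing a "first" and "second" optimized siRNA. The function names and the example sequences are hypothetical and are not taken from the sequence listing.

```python
# Formula X weights: for each base, the list index is (position - 1)
# on the sense strand. Values are transcribed from the formula text.
FORMULA_X_WEIGHTS = {
    "A": [-8, -1, 12, 7, 18, 12, 19, 6, -4, -5,
          -2, -5, 17, -3, 4, 2, 8, 11, 30],
    "U": [-13, -1, 2, -2, -5, 5, -2, -10, -5, 15,
          -1, 0, 10, -9, -13, -10, 3, 9, 9],
    "C": [7, 3, -21, 5, -9, -20, -18, -5, 5, 1,
          2, -5, -3, -6, -2, -5, -3, -12, -18],
    "G": [14, 8, 7, -10, -4, 2, 1, 9, 5, -11,
          1, 9, -24, 18, 11, 13, -7, -9, -22],
}


def formula_x_score(sense):
    """Score a 19-nt sense strand (5' to 3') with Formula X."""
    sense = sense.upper().replace("T", "U")
    if len(sense) != 19:
        raise ValueError("this sketch expects a 19-nt sense strand")
    positional = sum(FORMULA_X_WEIGHTS[base][i] for i, base in enumerate(sense))
    au_15_19 = sum(base in "AU" for base in sense[14:19])
    gc_total = sum(base in "GC" for base in sense)
    return positional + 6 * au_15_19 - 3 * gc_total


def top_two(candidates):
    """Return the two candidates with the highest Formula X scores."""
    return sorted(candidates, key=formula_x_score, reverse=True)[:2]
```

For the hypothetical strand GAACAGAGCUGAACUAUUA the positional weights sum to 136, five of the bases at positions 15-19 are A or U (+30), and seven bases are G or C (−21), giving a score of 145.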

The present invention also provides a method for identifying a hyperfunctional siRNA, comprising applying selection criteria to a set of potential siRNA that comprise 18-30 base pairs, wherein said selection criteria are non-target specific criteria, and said set comprises at least two siRNAs and each of said at least two siRNAs contains a sequence that is at least substantially complementary to a target gene; determining the relative functionality of the at least two siRNAs and assigning each of the at least two siRNAs a functionality score; and selecting siRNAs from the at least two siRNAs that have a functionality score that reflects greater than 80 percent silencing at a concentration in the picomolar range, wherein said greater than 80 percent silencing endures for greater than 120 hours.
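
The three thresholds recited above (greater than 80 percent silencing, a picomolar-range concentration, and persistence beyond 120 hours) can be expressed as a simple filter. The Python sketch below is illustrative only: the record type, its field names, and the choice of 1 nM as the upper bound of the "picomolar range" are assumptions, not definitions from this specification.

```python
from dataclasses import dataclass

# Assumption: "picomolar range" is read as any concentration below 1 nM.
PICOMOLAR_CEILING_NM = 1.0


@dataclass
class SilencingResult:
    """Hypothetical record of one silencing measurement for one siRNA."""
    name: str
    percent_silencing: float   # silencing relative to control, in percent
    concentration_nm: float    # siRNA concentration, in nanomolar
    duration_hours: float      # how long the silencing level persisted


def is_hyperfunctional(result):
    """Apply the three recited thresholds to a single measurement."""
    return (
        result.percent_silencing > 80
        and result.concentration_nm < PICOMOLAR_CEILING_NM
        and result.duration_hours > 120
    )


def select_hyperfunctional(results):
    """Return the names of all siRNAs that pass the filter."""
    return [r.name for r in results if is_hyperfunctional(r)]
```

A measurement taken at 100 nM would fail this filter regardless of its silencing level, since only sub-nanomolar results are counted under the stated assumption.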

According to a sixth embodiment, the present invention provides a hyperfunctional siRNA that is capable of silencing Bcl2.

According to a seventh embodiment, the present invention provides a method for developing an siRNA algorithm for selecting functional and hyperfunctional siRNAs for a given sequence. The method comprises:

(a) selecting a set of siRNAs;

(b) measuring the gene silencing ability of each siRNA from said set;

(c) determining the relative functionality of each siRNA;

(d) determining the amount of improved functionality by the presence or absence of at least one variable selected from the group consisting of the total GC content, melting temperature of the siRNA, GC content at positions 15-19, the presence or absence of a particular nucleotide at a particular position, relative thermodynamic stability at particular positions in a duplex, and the number of times that the same nucleotide repeats within a given sequence; and

(e) developing an algorithm using the information of step (d).

According to this embodiment, preferably the set of siRNAs comprises at least 90 siRNAs from at least one gene, more preferably at least 180 siRNAs from at least two different genes, and most preferably at least 270 and 360 siRNAs from at least three and four different genes, respectively. Additionally, in step (d) the determination is made with preferably at least two, more preferably at least three, even more preferably at least four, and most preferably all of the variables. The resulting algorithm is not target sequence specific.
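
Steps (a) through (e) can be pictured with a deliberately simple stand-in for step (d): for each base at each position, compare the mean silencing of the siRNAs that carry that base with the mean of those that do not. The Python sketch below does only that. It is an assumed, simplified analysis written for illustration, not the statistical procedure actually used to derive the formulas, and the function name and input layout are hypothetical.

```python
from statistics import mean


def positional_effects(measurements):
    """Estimate the effect of each (base, position) on silencing.

    measurements: list of (sense_sequence, percent_silencing) pairs, with
    all sequences written 5' to 3'. Returns a dict mapping
    (base, 1-based position) to mean silencing with that base minus mean
    silencing without it. Positions beyond the shortest sequence are ignored.
    """
    length = min(len(seq) for seq, _ in measurements)
    effects = {}
    for pos in range(1, length + 1):
        for base in "ACGU":
            with_base = [v for seq, v in measurements if seq[pos - 1] == base]
            without = [v for seq, v in measurements if seq[pos - 1] != base]
            # A difference is only defined when both groups are non-empty.
            if with_base and without:
                effects[(base, pos)] = mean(with_base) - mean(without)
    return effects
```

Positive values mark features associated with stronger silencing in the measured set, and negative values mark features associated with weaker silencing; an algorithm in the sense of step (e) could then weight candidate sequences by such values.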

In another embodiment, the present invention provides rationally designed siRNAs identified using the formulas above.

In yet another embodiment, the present invention is directed to hyperfunctional siRNA.

The ability to use the above algorithms, which are not sequence or species specific, allows for the cost-effective selection of optimized siRNAs for specific target sequences. Accordingly, there will be both greater efficiency and reliability in the use of siRNA technologies.

In various embodiments, siRNAs that target nuclear receptors are provided. In various embodiments, the siRNAs are rationally designed. In various embodiments, the siRNAs are functional or hyperfunctional.

In various embodiments, an siRNA that targets a nuclear receptor is provided, wherein the siRNA is selected from the group consisting of various siRNA sequences targeting nuclear receptors that are disclosed herein. In various embodiments, the siRNA sequence is selected from the group consisting of SEQ ID NO. 438 to SEQ ID NO. 1905.

In various embodiments, siRNA comprising a sense region and an antisense region are provided, said sense region and said antisense region together form a duplex region comprising 18-30 base pairs, and said sense region comprises a sequence that is at least 90% similar to a sequence selected from the group consisting of siRNA sequences targeting nuclear receptors that are disclosed herein. In various embodiments, the siRNA sequence is selected from the group consisting of SEQ ID NO. 438 to SEQ ID NO. 1905.

In various embodiments, an siRNA comprising a sense region and an antisense region is provided, said sense region and said antisense region together form a duplex region comprising 18-30 base pairs, and said sense region comprises a sequence that is identical to a contiguous stretch of at least 18 bases of a sequence selected from the group consisting of SEQ ID NO. 438 to SEQ ID NO. 1905. In various embodiments, the duplex region is 19-30 base pairs, and the sense region comprises a sequence that is identical to a sequence selected from the group consisting of SEQ ID NO. 438 to SEQ ID NO. 1905.
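
The "contiguous stretch of at least 18 bases" condition can be checked mechanically. The following Python sketch tests whether a sense region contains any 18-base window of a reference sequence; the function name is hypothetical and the sequences used to exercise it are invented for illustration rather than taken from SEQ ID NO. 438 to SEQ ID NO. 1905.

```python
def shares_contiguous_stretch(sense, reference, min_len=18):
    """Return True if sense contains any min_len-base window of reference.

    Both sequences are read 5' to 3'; T is treated as U so that DNA-style
    and RNA-style notation compare equal.
    """
    sense = sense.upper().replace("T", "U")
    reference = reference.upper().replace("T", "U")
    windows = (
        reference[i:i + min_len]
        for i in range(len(reference) - min_len + 1)
    )
    return any(window in sense for window in windows)
```

A 19-base reference has two 18-base windows, so a sense region that matches the reference at positions 2-19 passes, while a single mismatch at position 10 breaks both windows and fails.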

In various embodiments, a pool of at least two siRNAs is provided, wherein said pool comprises a first siRNA and a second siRNA, said first siRNA comprising a duplex region of length 18-30 base pairs that has a first sense region that is at least 90% similar to 18 bases of a first sequence selected from the group consisting of SEQ ID NO. 438 to SEQ ID NO. 1905, and said second siRNA comprises a duplex region of length 18-30 base pairs that has a second sense region that is at least 90% similar to 18 bases of a second sequence selected from the group consisting of SEQ ID NO. 438 to SEQ ID NO. 1905, wherein said first sense region and said second sense region are not identical.

In various embodiments, the first sense region comprises a sequence that is identical to at least 18 bases of a sequence selected from the group consisting of SEQ ID NO. 438 to SEQ ID NO. 1905, and said second sense region comprises a sequence that is identical to at least 18 bases of a sequence selected from the group consisting of SEQ ID NO. 438 to SEQ ID NO. 1905. In various embodiments, the duplex of said first siRNA is 19-30 base pairs, and said first sense region comprises a sequence that is at least 90% similar to a sequence selected from the group consisting of SEQ ID NO. 438 to SEQ ID NO. 1905, and said duplex of said second siRNA is 19-30 base pairs and comprises a sequence that is at least 90% similar to a sequence selected from the group consisting of SEQ ID NO. 438 to SEQ ID NO. 1905.

In various embodiments, the duplex of said first siRNA is 19-30 base pairs and said first sense region comprises a sequence that is identical to at least 18 bases of a sequence selected from the group consisting of SEQ ID NO. 438 to SEQ ID NO. 1905, and said duplex of said second siRNA is 19-30 base pairs and said second region comprises a sequence that is identical to a sequence selected from the group consisting of SEQ ID NO. 438 to SEQ ID NO. 1905.

For a better understanding of the present invention together with other and further advantages and embodiments, reference is made to the following description taken in conjunction with the examples, the scope of which is set forth in the appended claims.

BRIEF DESCRIPTION OF THE FIGURES

FIG. 1 shows a model for siRNA-RISC interactions. RISC has the ability to interact with either end of the siRNA or miRNA molecule. Following binding, the duplex is unwound, and the relevant target is identified, cleaved, and released.

FIG. 2 is a representation of the functionality of two hundred and seventy siRNA duplexes that were generated to target human cyclophilin, human diazepam-binding inhibitor (DBI), and firefly luciferase.

FIG. 3a is a representation of the silencing effect of 30 siRNAs in three different cell lines: HEK293, DU145, and HeLa. FIG. 3b shows the frequency of different functional groups (>95% silencing (black), >80% silencing (gray), >50% silencing (dark gray), and <50% silencing (white)) based on GC content. In cases where a given bar is absent from a particular GC percentage, no siRNA were identified for that particular group. FIG. 3c shows the frequency of different functional groups based on melting temperature (Tm).

FIG. 4 is a representation of a statistical analysis that revealed correlations between silencing and five sequence-related properties of siRNA: (A) an A at position 19 of the sense strand, (B) an A at position 3 of the sense strand, (C) a U at position 10 of the sense strand, (D) a base other than G at position 13 of the sense strand, and (E) a base other than C at position 19 of the sense strand. All variables were correlated with siRNA silencing of firefly luciferase and human cyclophilin. siRNAs satisfying the criterion are grouped on the left (Selected) while those that do not, are grouped on the right (Eliminated). Y-axis is “% Silencing of Control.” Each position on the X-axis represents a unique siRNA.

FIGS. 5A and 5B are representations of firefly luciferase and cyclophilin siRNA panels sorted according to functionality and predicted values using Formula VIII. The siRNA found within the circle represent those that have Formula VIII values (SMARTSCORES™, or siRNA rank) above zero. siRNA outside the indicated area have calculated Formula VIII values that are below zero. Y-axis is “Expression (% Control).” Each position on the X-axis represents a unique siRNA.

FIG. 6A is a representation of the average internal stability profile (AISP) derived from 270 siRNAs taken from three separate genes (cyclophilin B, DBI and firefly luciferase). Graphs represent AISP values of highly functional, functional, and non-functional siRNA. FIG. 6B is a comparison between the AISP of naturally derived GFP siRNA (filled squares) and the AISP of siRNA from cyclophilin B, DBI, and luciferase having >90% silencing properties (no fill) for the antisense strand. “DG” is the symbol for ΔG, free energy.

FIG. 7 is a histogram showing the differences in duplex functionality upon introduction of base pair mismatches. The X-axis shows the mismatch introduced in the siRNA and the position it is introduced (e.g., 8C>A reveals that position 8 (which normally has a C) has been changed to an A). The Y-axis is “% Silencing (Normalized to Control).” The samples on the X-axis represent siRNAs at 100 nM and are, reading from left to right: 1A to C, 1A to G, 1A to U; 2A to C, 2A to G, 2A to U; 3A to C, 3A to G, 3A to U; 4G to A, 4G to C; 4G to U; 5U to A, 5U to C, 5U to G; 6U to A, 6U to C, 6U to G; 7G to A, 7G to C, 7G to U; 8C to A, 8C to G, 8C to U; 9G to A, 9G to C, 9G to U; 10C to A, 10C to G, 10C to U; 11G to A, 11G to C, 11G to U; 12G to A, 12G to C, 12G to U; 13A to C, 13A to G, 13A to U; 14G to A, 14G to C, 14G to U; 15G to A, 15G to C, 15G to U; 16A to C, 16A to G, 16A to U; 17G to A, 17G to C, 17G to U; 18U to A, 18U to C, 18U to G; 19U to A, 19U to C, 19U to G; 20 wt; Control.

FIG. 8 is a histogram that shows the effects of 5′ sense and antisense strand modification with 2′-O-methylation on functionality.

FIG. 9 shows a graph of SMARTSCORES™, or siRNA rank, versus RNAi silencing values for more than 360 siRNA directed against 30 different genes. SiRNA to the right of the vertical bar represent those siRNA that have desirable SMARTSCORES™, or siRNA rank.

FIGS. 10A-E compare the RNAi of five different genes (SEAP, DBI, PLK, Firefly Luciferase, and Renilla Luciferase) by varying numbers of randomly selected siRNA and four rationally designed (SMART-selected) siRNA chosen using the algorithm described in Formula VIII. In addition, RNAi induced by a pool of the four SMART-selected siRNA is reported at two different concentrations (100 and 400 nM). FIG. 10F is a comparison between a pool of randomly selected EGFR siRNA (Pool 1) and a pool of SMART-selected EGFR siRNA (Pool 2). Pool 1, S1-S4 and Pool 2 S1-S4 represent the individual members that made up each respective pool. Note that numbers for random siRNAs represent the position of the 5′ end of the sense strand of the duplex. The Y-axis represents the % expression of the control(s). The X-axis is the percent expression of the control.

FIG. 11 shows the Western blot results from cells treated with siRNA directed against twelve different genes involved in the clathrin-dependent endocytosis pathway (CHC, DynII, CALM, CLCa, CLCb, Eps15, Eps15R, Rab5a, Rab5b, Rab5c, β2 subunit of AP-2 and EEA.1). siRNA were selected using Formula VIII. “Pool” represents a mixture of duplexes 1-4. The concentration of each siRNA in the pool is 25 nM, giving a total concentration of 4×25=100 nM.

FIG. 12 is a representation of the gene silencing capabilities of rationally-selected siRNA directed against ten different genes (human and mouse cyclophilin, C-myc, human lamin A/C, QB (ubiquinol-cytochrome c reductase core protein I), MEK1 and MEK2, ATE1 (arginyl-tRNA protein transferase), GAPDH, and Eg5). The Y-axis is the percent expression of the control. Numbers 1, 2, 3 and 4 represent individual rationally selected siRNA. “Pool” represents a mixture of the four individual siRNA.

FIG. 13 is the sequence of the top ten Bcl2 siRNAs as determined by Formula VIII. Sequences are listed 5′ to 3′.

FIG. 14 is the knockdown by the top ten Bcl2 siRNAs at 100 nM concentrations. The Y-axis represents the amount of expression relative to the non-specific (ns) and transfection mixture control.

FIG. 15 represents a functional walk where siRNA beginning on every other base pair of a region of the luciferase gene are tested for the ability to silence the luciferase gene. The Y-axis represents the percent expression relative to a control. The X-axis represents the position of each individual siRNA. Reading from left to right across the X-axis, the position designations are 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, and Plasmid.

FIGS. 16A and 16B are histograms demonstrating the inhibition of target gene expression by pools of 2 (16A) and 3 (16B) siRNA duplexes taken from the walk described in FIG. 15. The Y-axis in each represents the percent expression relative to control. The X-axis in each represents the position of the first siRNA in paired pools, or trios of siRNAs. For instance, the first paired pool contains siRNAs 1 and 3. The second paired pool contains siRNAs 3 and 5. Pool 3 (of paired pools) contains siRNAs 5 and 7, and so on. For each of 16A and 16B, the X-axis from left to right reads 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, and Plasmid.
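
The pooling scheme described for the walk (siRNAs 1 and 3, then 3 and 5, then 5 and 7, and so on) amounts to a sliding window over the list of start positions. The Python sketch below generates such pools for any pool size. The function names are hypothetical, and because the text does not say how the final positions of the walk are pooled, the sketch simply stops at the last complete window; that stopping rule is an assumption.

```python
def walk_positions(first=1, last=89, step=2):
    """Start positions of the siRNAs in the walk: 1, 3, 5, ..., 89."""
    return list(range(first, last + 1, step))


def sliding_pools(positions, pool_size):
    """Group consecutive walk positions into overlapping pools.

    Each pool is identified by the start position of its first member,
    matching the X-axis labelling described for the pooled histograms.
    Only complete windows are returned (assumed stopping rule).
    """
    return [
        tuple(positions[i:i + pool_size])
        for i in range(len(positions) - pool_size + 1)
    ]
```

Calling `sliding_pools(walk_positions(), 2)` yields pairs beginning (1, 3), (3, 5), (5, 7), and a pool size of 3 yields trios beginning (1, 3, 5), (3, 5, 7).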

FIGS. 17A and 17B are histograms demonstrating the inhibition of target gene expression by pools of 4 (17A) and 5 (17B) siRNA duplexes. The Y-axis in each represents the percent expression relative to control. The X-axis in each represents the position of the first siRNA in each pool. For each of 17A and 17B, the X-axis from left to right reads 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, and Plasmid.

FIGS. 18A and 18B are histograms demonstrating the inhibition of target gene expression by siRNAs that are ten (18A) and twenty (18B) base pairs apart. The Y-axis represents the percent expression relative to a control. The X-axis represents the position of the first siRNA in each pool. For each of 18A and 18B, the X-axis from left to right reads 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, and Plasmid.

FIG. 19 shows that pools of siRNAs (dark gray bar) work as well as (or better than) the best siRNA in the pool (light gray bar). The Y-axis represents the percent expression relative to a control. The X-axis represents the position of the first siRNA in each pool. The X-axis from left to right reads 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, and Plasmid.

FIG. 20 shows that the combination of several semi-functional siRNAs (dark gray) results in a significant improvement of gene expression inhibition over individual (semi-functional; light gray) siRNA. The Y-axis represents the percent expression relative to a control.

FIGS. 21A, 21B and 21C show both pools (Library, Lib) and individual siRNAs in inhibition of gene expression of Beta-Galactosidase, Renilla Luciferase and SEAP (alkaline phosphatase). Numbers on the X-axis indicate the position of the 5′-most nucleotide of the sense strand of the duplex. The Y-axis represents the percent expression of each gene relative to a control. Libraries contain 19 nucleotide long siRNAs (not including overhangs) that begin at the following nucleotides: SEAP: Lib 1: 206, 766, 812, 923, Lib 2: 1117, 1280, 1300, 1487, Lib 3: 206, 766, 812, 923, 1117, 1280, 1300, 1487, Lib 4: 206, 812, 1117, 1300, Lib 5: 766, 923, 1280, 1487, Lib 6: 206, 1487; Bgal: Lib 1: 979, 1339, 2029, 2590, Lib 2: 1087, 1783, 2399, 3257, Lib 3: 979, 1783, 2590, 3257, Lib 4: 979, 1087, 1339, 1783, 2029, 2399, 2590, 3257, Lib 5: 979, 1087, 1339, 1783, Lib 6: 2029, 2399, 2590, 3257; Renilla: Lib 1: 174, 300, 432, 568, Lib 2: 592, 633, 729, 867, Lib 3: 174, 300, 432, 568, 592, 633, 729, 867, Lib 4: 174, 432, 592, 729, Lib 5: 300, 568, 633, 867, Lib 6: 592, 568.

FIG. 22 shows the results of an EGFR and TfnR internalization assay when single gene knockdowns are performed. The Y-axis represents percent internalization relative to control.

FIG. 23 shows the results of an EGFR and TfnR internalization assay when multiple genes are knocked down (e.g., Rab5a, b, c). The Y-axis represents the percent internalization relative to control.

FIG. 24 shows the simultaneous knockdown of four different genes. siRNAs directed against G6PD, GAPDH, PLK, and UQC were simultaneously introduced into cells. Twenty-four hours later, cultures were harvested and assayed for mRNA target levels for each of the four genes. A comparison is made between cells transfected with individual siRNAs vs. a pool of siRNAs directed against all four genes.

FIG. 25 shows the functionality of ten siRNAs at 0.3 nM concentrations.

DETAILED DESCRIPTION

Definitions

Unless stated otherwise, the following terms and phrases have the meanings provided below:

Complementary

The term “complementary” refers to the ability of polynucleotides to form base pairs with one another. Base pairs are typically formed by hydrogen bonds between nucleotide units in antiparallel polynucleotide strands. Complementary polynucleotide strands can base pair in the Watson-Crick manner (e.g., A to T, A to U, C to G), or in any other manner that allows for the formation of duplexes. As persons skilled in the art are aware, when using RNA as opposed to DNA, uracil rather than thymine is the base that is considered to be complementary to adenosine. However, when a U is denoted in the context of the present invention, the ability to substitute a T is implied, unless otherwise stated.

Perfect complementarity or 100% complementarity refers to the situation in which each nucleotide unit of one polynucleotide strand can hydrogen bond with a nucleotide unit of a second polynucleotide strand. Less than perfect complementarity refers to the situation in which some, but not all, nucleotide units of two strands can hydrogen bond with each other. For example, for two 20-mers, if only two bases on each strand can hydrogen bond with each other, the polynucleotide strands exhibit 10% complementarity. In the same example, if 18 bases on each strand can hydrogen bond with each other, the polynucleotide strands exhibit 90% complementarity.
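By way of illustration only, the percentage described above can be computed for two strands of equal length aligned in antiparallel orientation. The sketch below is not part of the disclosed methods; the function name `percent_complementarity` is hypothetical, only Watson-Crick pairs are counted, and overhangs, gaps, and non-canonical pairs are assumed absent.

```python
# Watson-Crick partners; T is treated like U, as the definition above allows.
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}


def percent_complementarity(strand_a, strand_b):
    """Percent of positions that can Watson-Crick pair in antiparallel alignment.

    Both strands are written 5'->3' and must have equal length (an assumption
    of this sketch; overhangs and gaps are not handled).
    """
    a = strand_a.upper().replace("T", "U")
    b = strand_b.upper().replace("T", "U")
    if len(a) != len(b) or not a:
        raise ValueError("strands must be non-empty and of equal length")
    # Antiparallel: position i of strand A faces position (n - 1 - i) of strand B.
    paired = sum((x, y) in PAIRS for x, y in zip(a, reversed(b)))
    return 100.0 * paired / len(a)
```

For two 20-mers in which 18 positions pair, the function returns 90.0, matching the example in the text.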

Deoxynucleotide

The term “deoxynucleotide” refers to a nucleotide or polynucleotide lacking a hydroxyl group (OH group) at the 2′ and/or 3′ position of a sugar moiety. Instead, it has a hydrogen bonded to the 2′ and/or 3′ carbon. Within an RNA molecule that comprises one or more deoxynucleotides, “deoxynucleotide” refers to the lack of an OH group at the 2′ position of the sugar moiety, having instead a hydrogen bonded directly to the 2′ carbon.

Deoxyribonucleotide

The terms “deoxyribonucleotide” and “DNA” refer to a nucleotide or polynucleotide comprising at least one sugar moiety that has an H, rather than an OH, at its 2′ and/or 3′ position.

Duplex Region

The phrase “duplex region” refers to the region in two complementary or substantially complementary polynucleotides that form base pairs with one another, either by Watson-Crick base pairing or any other manner that allows for a stabilized duplex between polynucleotide strands that are complementary or substantially complementary. For example, a polynucleotide strand having 21 nucleotide units can base pair with another polynucleotide of 21 nucleotide units, yet only 19 bases on each strand are complementary or substantially complementary, such that the “duplex region” has 19 base pairs. The remaining bases may, for example, exist as 5′ and 3′ overhangs. Further, within the duplex region, 100% complementarity is not required; substantial complementarity is allowable within a duplex region. Substantial complementarity refers to 79% or greater complementarity. For example, a mismatch in a duplex region consisting of 19 base pairs results in 94.7% complementarity, rendering the duplex region substantially complementary.
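As a worked check of the arithmetic above, the following minimal sketch applies the 79% threshold to a duplex region described by its number of paired positions. The names `duplex_complementarity` and `is_substantially_complementary` are hypothetical and are used for illustration only.

```python
SUBSTANTIAL_THRESHOLD = 79.0  # percent, taken from the definition above


def duplex_complementarity(paired, duplex_length):
    """Percent complementarity of a duplex region with `paired` matched positions."""
    if duplex_length <= 0 or not 0 <= paired <= duplex_length:
        raise ValueError("need 0 <= paired <= duplex_length and duplex_length > 0")
    return 100.0 * paired / duplex_length


def is_substantially_complementary(paired, duplex_length):
    """True when the duplex region meets the 79% or greater threshold."""
    return duplex_complementarity(paired, duplex_length) >= SUBSTANTIAL_THRESHOLD
```

Under this reading, a 19 base pair duplex region with one mismatch scores 18/19, or about 94.7%. Three mismatches (16/19, about 84.2%) still qualify, while four mismatches (15/19, about 78.9%) fall just below the threshold.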

Filters

The term “filter” refers to one or more procedures that are performed on sequences that are identified by the algorithm. In some instances, filtering includes in silico procedures where sequences identified by the algorithm can be screened to identify duplexes carrying desirable or undesirable motifs. Sequences carrying such motifs can be selected for, or selected against, to obtain a final set with the preferred properties. In other instances, filtering includes wet lab experiments. For instance, sequences identified by one or more versions of the algorithm can be screened using any one of a number of procedures to identify duplexes that have hyperfunctional traits (e.g., they exhibit a high degree of silencing at subnanomolar concentrations and/or exhibit high degrees of silencing longevity).

Gene Silencing

The phrase “gene silencing” refers to a process by which the expression of a specific gene product is lessened or attenuated. Gene silencing can take place by a variety of pathways. Unless specified otherwise, as used herein, gene silencing refers to decreases in gene product expression that result from RNA interference (RNAi), a defined, though partially characterized, pathway whereby small inhibitory RNAs (siRNAs) act in concert with host proteins (e.g., the RNA induced silencing complex, RISC) to degrade messenger RNA (mRNA) in a sequence-dependent fashion. The level of gene silencing can be measured by a variety of means, including, but not limited to, measurement of transcript levels by Northern Blot Analysis, B-DNA techniques, transcription-sensitive reporter constructs, expression profiling (e.g., DNA chips), and related technologies. Alternatively, the level of silencing can be measured by assessing the level of the protein encoded by a specific gene. This can be accomplished by performing a number of studies including Western Analysis, measuring the levels of expression of a reporter protein that has, e.g., fluorescent properties (e.g., GFP) or enzymatic activity (e.g., alkaline phosphatases), or several other procedures.

miRNA

The term “miRNA” refers to microRNA.

Nucleotide

The term “nucleotide” refers to a ribonucleotide or a deoxyribonucleotide or modified form thereof, as well as an analog thereof. Nucleotides include species that comprise purines, e.g., adenine, hypoxanthine, guanine, and their derivatives and analogs, as well as pyrimidines, e.g., cytosine, uracil, thymine, and their derivatives and analogs.

Nucleotide analogs include nucleotides having modifications in the chemical structure of the base, sugar and/or phosphate, including, but not limited to, 5-position pyrimidine modifications, 8-position purine modifications, modifications at cytosine exocyclic amines, and substitution of 5-bromo-uracil; and 2′-position sugar modifications, including but not limited to, sugar-modified ribonucleotides in which the 2′-OH is replaced by a group such as an H, OR, R, halo, SH, SR, NH₂, NHR, NR₂, or CN, wherein R is an alkyl moiety. Nucleotide analogs are also meant to include nucleotides with bases such as inosine, queuosine, xanthine, sugars such as 2′-methyl ribose, non-natural phosphodiester linkages such as methylphosphonates, phosphorothioates and peptides.

Modified bases refer to nucleotide bases such as, for example, adenine, guanine, cytosine, thymine, uracil, xanthine, inosine, and queuosine that have been modified by the replacement or addition of one or more atoms or groups. Some examples of types of modifications that can comprise nucleotides that are modified with respect to the base moieties include, but are not limited to, alkylated, halogenated, thiolated, aminated, amidated, or acetylated bases, individually or in combination. More specific examples include, for example, 5-propynyluridine, 5-propynylcytidine, 6-methyladenine, 6-methylguanine, N,N-dimethyladenine, 2-propyladenine, 2-propylguanine, 2-aminoadenine, 1-methylinosine, 3-methyluridine, 5-methylcytidine, 5-methyluridine and other nucleotides having a modification at the 5 position, 5-(2-amino)propyl uridine, 5-halocytidine, 5-halouridine, 4-acetylcytidine, 1-methyladenosine, 2-methyladenosine, 3-methylcytidine, 6-methyluridine, 2-methylguanosine, 7-methylguanosine, 2,2-dimethylguanosine, 5-methylaminoethyluridine, 5-methyloxyuridine, deazanucleotides such as 7-deaza-adenosine, 6-azouridine, 6-azocytidine, 6-azothymidine, 5-methyl-2-thiouridine, other thio bases such as 2-thiouridine and 4-thiouridine and 2-thiocytidine, dihydrouridine, pseudouridine, queuosine, archaeosine, naphthyl and substituted naphthyl groups, any O- and N-alkylated purines and pyrimidines such as N6-methyladenosine, 5-methylcarbonylmethyluridine, uridine 5-oxyacetic acid, pyridine-4-one, pyridine-2-one, phenyl and modified phenyl groups such as aminophenol or 2,4,6-trimethoxy benzene, modified cytosines that act as G-clamp nucleotides, 8-substituted adenines and guanines, 5-substituted uracils and thymines, azapyrimidines, carboxyhydroxyalkyl nucleotides, carboxyalkylaminoalkyl nucleotides, and alkylcarbonylalkylated nucleotides.

Modified nucleotides also include those nucleotides that are modified with respect to the sugar moiety, as well as nucleotides having sugars or analogs thereof that are not ribosyl. For example, the sugar moieties may be, or be based on, mannoses, arabinoses, glucopyranoses, galactopyranoses, 4′-thioribose, and other sugars, heterocycles, or carbocycles.

The term nucleotide is also meant to include what are known in the art as universal bases. By way of example, universal bases include but are not limited to 3-nitropyrrole, 5-nitroindole, or nebularine. The term “nucleotide” is also meant to include the N3′ to P5′ phosphoramidate, resulting from the substitution of a ribosyl 3′ oxygen with an amine group.

Further, the term nucleotide also includes those species that have a detectable label, such as for example a radioactive or fluorescent moiety, or mass label attached to the nucleotide.

Off-Target Silencing and Off-Target Interference

The phrases “off-target silencing” and “off-target interference” are defined as degradation of mRNA other than the intended target mRNA due to overlapping and/or partial homology with secondary mRNA messages.

Polynucleotide

The term “polynucleotide” refers to polymers of nucleotides, and includes but is not limited to DNA, RNA, DNA/RNA hybrids including polynucleotide chains of regularly and/or irregularly alternating deoxyribosyl moieties and ribosyl moieties (i.e., wherein alternate nucleotide units have an -OH, then an -H, then an -OH, then an -H, and so on at the 2′ position of a sugar moiety), and modifications of these kinds of polynucleotides, wherein the attachment of various entities or moieties to the nucleotide units at any position are included.

Polyribonucleotide

The term “polyribonucleotide” refers to a polynucleotide comprising two or more modified or unmodified ribonucleotides and/or their analogs. The term “polyribonucleotide” is used interchangeably with the term “oligoribonucleotide.”

Ribonucleotide and Ribonucleic Acid

The term “ribonucleotide” and the phrase “ribonucleic acid” (RNA) refer to a modified or unmodified nucleotide or polynucleotide comprising at least one ribonucleotide unit. A ribonucleotide unit comprises a hydroxyl group attached to the 2′ position of a ribosyl moiety that has a nitrogenous base attached in N-glycosidic linkage at the 1′ position of a ribosyl moiety, and a moiety that either allows for linkage to another nucleotide or precludes linkage.

siRNA

The term “siRNA” refers to small inhibitory RNA duplexes that induce the RNA interference (RNAi) pathway. These molecules can vary in length (generally 18-30 base pairs) and contain varying degrees of complementarity to their target mRNA in the antisense strand. Some, but not all, siRNA have unpaired overhanging bases on the 5′ or 3′ end of the sense strand and/or the antisense strand. The term “siRNA” includes duplexes of two separate strands, as well as single strands that can form hairpin structures comprising a duplex region.

siRNA may be divided into five (5) groups (non-functional, semi-functional, functional, highly functional, and hyper-functional) based on the level or degree of silencing that they induce in cultured cell lines. As used herein, these definitions are based on a set of conditions where the siRNA is transfected into said cell line at a concentration of 100 nM and the level of silencing is tested at a time of roughly 24 hours after transfection, and not exceeding 72 hours after transfection. In this context, “non-functional siRNA” are defined as those siRNA that induce less than 50% (<50%) target silencing. “Semi-functional siRNA” induce 50-79% target silencing. “Functional siRNA” are molecules that induce 80-95% gene silencing. “Highly-functional siRNA” are molecules that induce greater than 95% gene silencing. “Hyperfunctional siRNA” are a special class of molecules. For purposes of this document, hyperfunctional siRNA are defined as those molecules that: (1) induce greater than 95% silencing of a specific target when they are transfected at subnanomolar concentrations (i.e., less than one nanomolar); and/or (2) induce functional (or better) levels of silencing for greater than 96 hours. These relative functionalities (though not intended to be absolutes) may be used to compare siRNAs to a particular target for applications such as functional genomics, target identification and therapeutics.
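By way of illustration only, the groupings defined above can be expressed as a simple classifier. The function names are hypothetical. The sketch assumes that silencing values falling between 79% and 80%, which the definitions leave unassigned, are treated as semi-functional, and that the “and/or” in the hyperfunctional definition means either condition suffices.

```python
def classify_silencing(percent_silencing):
    """Functional class for silencing measured at 100 nM, roughly 24 h after transfection."""
    if not 0 <= percent_silencing <= 100:
        raise ValueError("percent_silencing must lie between 0 and 100")
    if percent_silencing < 50:
        return "non-functional"
    if percent_silencing < 80:  # assumption: the 79-80% gap counts as semi-functional
        return "semi-functional"
    if percent_silencing <= 95:
        return "functional"
    return "highly functional"


def is_hyperfunctional(percent_silencing, concentration_nM, functional_duration_h):
    """Apply the two hyperfunctional conditions; either one is taken as sufficient.

    functional_duration_h is the time, in hours, for which silencing stays at
    functional (80%) or better levels (an input assumed to be measured elsewhere).
    """
    potent = percent_silencing > 95 and concentration_nM < 1.0
    durable = functional_duration_h > 96
    return potent or durable
```

For example, `classify_silencing(85)` returns "functional", and an siRNA giving 97% silencing at 0.5 nM satisfies the first hyperfunctional condition.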

Smartscore™, or siRNA rank

The term “SMARTSCORE™”, or “siRNA rank” refers to a number determined by applying any of the formulas to a given siRNA sequence. The term “SMART-selected” or “rationally selected” or “rational selection” refers to siRNA that have been selected on the basis of their SMARTSCORES™, or siRNA ranking.

Substantially Similar

The phrase “substantially similar” refers to a similarity of at least 90% with respect to the identity of the bases of the sequence.

Target

The term “target” is used in a variety of different forms throughout this document and is defined by the context in which it is used. “Target mRNA” refers to a messenger RNA against which a given siRNA can be directed. “Target sequence” and “target site” refer to a sequence within the mRNA to which the sense strand of an siRNA shows varying degrees of homology and the antisense strand exhibits varying degrees of complementarity. The phrase “siRNA target” can refer to the gene, mRNA, or protein against which an siRNA is directed. Similarly, “target silencing” can refer to the state of a gene, or the corresponding mRNA or protein.

Transfection

The term “transfection” refers to a process by which agents are introduced into a cell. The list of agents that can be transfected is large and includes, but is not limited to, siRNA, sense and/or anti-sense sequences, DNA encoding one or more genes and organized into an expression plasmid, proteins, protein fragments, and more. There are multiple methods for transfecting agents into a cell including, but not limited to, electroporation, calcium phosphate-based transfections, DEAE-dextran-based transfections, lipid-based transfections, molecular conjugate-based transfections (e.g., polylysine-DNA conjugates), microinjection and others.

The present invention is directed to improving the efficiency of gene silencing by siRNA. Through the inclusion of multiple siRNA sequences that are targeted to a particular gene and/or selecting an siRNA sequence based on certain defined criteria, improved efficiency may be achieved.

The present invention will now be described in connection with preferred embodiments. These embodiments are presented in order to aid in an understanding of the present invention and are not intended, and should not be construed, to limit the invention in any way. All alternatives, modifications and equivalents that may become apparent to those of ordinary skill upon reading this disclosure are included within the spirit and scope of the present invention.

Furthermore, this disclosure is not a primer on RNA interference. Basic concepts known to persons skilled in the art have not been set forth in detail.

The present invention is directed to increasing the efficiency of RNAi, particularly in mammalian systems. Accordingly, the present invention provides kits, siRNAs and methods for increasing siRNA efficacy.

According to a first embodiment, the present invention provides a kit for gene silencing, wherein said kit is comprised of a pool of at least two siRNA duplexes, each of which is comprised of a sequence that is complementary to a portion of the sequence of one or more target messenger RNA, and each of which is selected using non-target specific criteria. Each of the at least two siRNA duplexes of the kit complementary to a portion of the sequence of one or more target mRNAs is preferably selected using Formula X.

According to a second embodiment, the present invention provides a method for selecting an siRNA, said method comprising applying selection criteria to a set of potential siRNA that comprise 18-30 base pairs, wherein said selection criteria are non-target specific criteria, and said set comprises at least two siRNAs and each of said at least two siRNAs contains a sequence that is at least substantially complementary to a target gene; and determining the relative functionality of the at least two siRNAs.

In one embodiment, the present invention also provides a method wherein said selection criteria are embodied in a formula comprising:
(−14)*G₁₃−13*A₁−12*U₇−11*U₂−10*A₁₁−10*U₄−10*C₃−10*C₅−10*C₆−9*A₁₀−9*U₉−9*C₁₈−8*G₁₀−7*U₁−7*U₁₆−7*C₁₇−7*C₁₉+7*U₁₇+8*A₂+8*A₄+8*A₅+8*C₄+9*G₈+10*A₇+10*U₁₈+11*A₁₉+11*C₉+15*G₁+18*A₃+19*U₁₀−Tm−3*(GC_total)−6*(GC_15-19)−30*X; or Formula VIII
(−8)*A1+(−1)*A2+(12)*A3+(7)*A4+(18)*A5+(12)*A6+(19)*A7+(6)*A8+(−4)*A9+(−5)*A10+(−2)*A11+(−5)*A12+(17)*A13+(−3)*A14+(4)*A15+(2)*A16+(8)*A17+(11)*A18+(30)*A19+(−13)*U1+(−10)*U2+(2)*U3+(−2)*U4+(−5)*U5+(5)*U6+(−2)*U7+(−10)*U8+(−5)*U9+(15)*U10+(−1)*U11+(0)*U12+(10)*U13+(−9)*U14+(−13)*U15+(−10)*U16+(3)*U17+(9)*U18+(9)*U19+(7)*C1+(3)*C2+(−21)*C3+(5)*C4+(−9)*C5+(−20)*C6+(−18)*C7+(−5)*C8+(5)*C9+(1)*C10+(2)*C11+(−5)*C12+(−3)*C13+(−6)*C14+(−2)*C15+(−5)*C16+(−3)*C17+(−12)*C18+(−18)*C19+(14)*G1+(8)*G2+(7)*G3+(−10)*G4+(−4)*G5+(2)*G6+(1)*G7+(9)*G8+(5)*G9+(−11)*G10+(1)*G11+(9)*G12+(−24)*G13+(18)*G14+(11)*G15+(13)*G16+(−7)*G17+(−9)*G18+(−22)*G19+6*(number of A+U in position 15-19)-3*(number of G+C in whole siRNA), Formula X

wherein position numbering begins at the 5′-most position of a sense strand, and

A₁=1 if A is the base at position 1 of the sense strand, otherwise its value is 0; A₂=1 if A is the base at position 2 of the sense strand, otherwise its value is 0; A₃=1 if A is the base at position 3 of the sense strand, otherwise its value is 0; A₄=1 if A is the base at position 4 of the sense strand, otherwise its value is 0; A₅=1 if A is the base at position 5 of the sense strand, otherwise its value is 0; A₆=1 if A is the base at position 6 of the sense strand, otherwise its value is 0; A₇=1 if A is the base at position 7 of the sense strand, otherwise its value is 0; A₁₀=1 if A is the base at position 10 of the sense strand, otherwise its value is 0; A₁₁=1 if A is the base at position 11 of the sense strand, otherwise its value is 0; A₁₃=1 if A is the base at position 13 of the sense strand, otherwise its value is 0; A₁₉=1 if A is the base at position 19 of the sense strand, otherwise if another base is present or the sense strand is only 18 base pairs in length, its value is 0;

C₃=1 if C is the base at position 3 of the sense strand, otherwise its value is 0; C₄=1 if C is the base at position 4 of the sense strand, otherwise its value is 0; C₅=1 if C is the base at position 5 of the sense strand, otherwise its value is 0; C₆=1 if C is the base at position 6 of the sense strand, otherwise its value is 0; C₇=1 if C is the base at position 7 of the sense strand, otherwise its value is 0; C₉=1 if C is the base at position 9 of the sense strand, otherwise its value is 0; C₁₇=1 if C is the base at position 17 of the sense strand, otherwise its value is 0; C₁₈=1 if C is the base at position 18 of the sense strand, otherwise its value is 0; C₁₉=1 if C is the base at position 19 of the sense strand, otherwise if another base is present or the sense strand is only 18 base pairs in length, its value is 0;

G₁=1 if G is the base at position 1 on the sense strand, otherwise its value is 0; G₂=1 if G is the base at position 2 of the sense strand, otherwise its value is 0; G₈=1 if G is the base at position 8 on the sense strand, otherwise its value is 0; G₁₀=1 if G is the base at position 10 on the sense strand, otherwise its value is 0; G₁₃=1 if G is the base at position 13 on the sense strand, otherwise its value is 0; G₁₉=1 if G is the base at position 19 of the sense strand, otherwise if another base is present or the sense strand is only 18 base pairs in length, its value is 0;

U₁=1 if U is the base at position 1 on the sense strand, otherwise its value is 0; U₂=1 if U is the base at position 2 on the sense strand, otherwise its value is 0; U₃=1 if U is the base at position 3 on the sense strand, otherwise its value is 0; U₄=1 if U is the base at position 4 on the sense strand, otherwise its value is 0; U₇=1 if U is the base at position 7 on the sense strand, otherwise its value is 0; U₉=1 if U is the base at position 9 on the sense strand, otherwise its value is 0; U₁₀=1 if U is the base at position 10 on the sense strand, otherwise its value is 0; U₁₅=1 if U is the base at position 15 on the sense strand, otherwise its value is 0; U₁₆=1 if U is the base at position 16 on the sense strand, otherwise its value is 0; U₁₇=1 if U is the base at position 17 on the sense strand, otherwise its value is 0; U₁₈=1 if U is the base at position 18 on the sense strand, otherwise its value is 0.

GC_15-19=the number of G and C bases within positions 15-19 of the sense strand, or within positions 15-18 if the sense strand is only 18 base pairs in length;

GC_total=the number of G and C bases in the sense strand;

Tm=100 if the siRNA oligo has an internal repeat longer than 4 base pairs, otherwise its value is 0; and

X=the number of times that the same nucleotide repeats four or more times in a row.
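By way of illustration only, Formula VIII and the variable definitions above can be evaluated as follows. This is a minimal sketch rather than the applicants' implementation. The names `FORMULA_VIII_WEIGHTS` and `formula_viii_score` are hypothetical; every position-specific term is read as a coefficient multiplied by its 0/1 indicator; the sense strand is assumed to be 18 or 19 nucleotides long; and, because the detection of an internal repeat is not specified at this point, the Tm condition is supplied by the caller as the flag `has_internal_repeat`.

```python
import re

# Position-specific weights of Formula VIII, keyed by (base, 1-based sense-strand position).
FORMULA_VIII_WEIGHTS = {
    ("G", 13): -14, ("A", 1): -13, ("U", 7): -12, ("U", 2): -11,
    ("A", 11): -10, ("U", 4): -10, ("C", 3): -10, ("C", 5): -10,
    ("C", 6): -10, ("A", 10): -9, ("U", 9): -9, ("C", 18): -9,
    ("G", 10): -8, ("U", 1): -7, ("U", 16): -7, ("C", 17): -7,
    ("C", 19): -7, ("U", 17): 7, ("A", 2): 8, ("A", 4): 8,
    ("A", 5): 8, ("C", 4): 8, ("G", 8): 9, ("A", 7): 10,
    ("U", 18): 10, ("A", 19): 11, ("C", 9): 11, ("G", 1): 15,
    ("A", 3): 18, ("U", 10): 19,
}


def formula_viii_score(sense, has_internal_repeat=False):
    """Evaluate Formula VIII for a sense strand written 5'->3'.

    has_internal_repeat stands in for the Tm condition (Tm=100 when True);
    deciding it is left to the caller, which is an assumption of this sketch.
    """
    seq = sense.upper().replace("T", "U")
    if len(seq) not in (18, 19) or set(seq) - set("ACGU"):
        raise ValueError("expected an 18- or 19-nt sense strand of A, C, G, U")
    # Indicator terms: a weight applies only when that base sits at that position.
    positional = sum(
        FORMULA_VIII_WEIGHTS.get((base, pos), 0)
        for pos, base in enumerate(seq, start=1)
    )
    gc_total = sum(base in "GC" for base in seq)
    # Positions 15-19, or 15-18 when the strand is only 18 nt long.
    gc_15_19 = sum(base in "GC" for base in seq[14:19])
    # X: number of runs in which one nucleotide repeats four or more times in a row.
    x = len(re.findall(r"A{4,}|C{4,}|G{4,}|U{4,}", seq))
    tm = 100 if has_internal_repeat else 0
    return positional - tm - 3 * gc_total - 6 * gc_15_19 - 30 * x
```

As a usage example, the arbitrary test sequence `GAAUACAGCUAAUGCAUUA` (chosen for illustration, not taken from the disclosure) has positional terms summing to 96, six G or C bases in total, one G or C in positions 15-19, and no run of four identical nucleotides, giving 96 − 18 − 6 = 72.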

Any of the methods of selecting siRNA in accordance with the invention can further comprise comparing the internal stability profiles of the siRNAs to be selected, and selecting those siRNAs with the most favorable internal stability profiles. Any of the methods of selecting siRNA can further comprise selecting either for or against sequences that contain motifs that induce cellular stress. Such motifs include, for example, toxicity motifs. Any of the methods of selecting siRNA can further comprise either selecting for or selecting against sequences that comprise stability motifs.

In another embodiment, the present invention provides a method of gene silencing, comprising introducing into a cell at least one siRNA selected according to any of the methods of the present invention. The siRNA can be introduced by allowing passive uptake of siRNA, or through the use of a vector.

According to a third embodiment, the invention provides a method for developing an algorithm for selecting siRNA, said method comprising: (a) selecting a set of siRNA; (b) measuring gene silencing ability of each siRNA from said set; (c) determining relative functionality of each siRNA; (d) determining improved functionality by the presence or absence of at least one variable selected from the group consisting of the presence or absence of a particular nucleotide at a particular position, the total number of As and Us in positions 15-19, the number of times that the same nucleotide repeats within a given sequence, and the total number of Gs and Cs; and (e) developing an algorithm using the information of step (d).

In another embodiment, the invention provides a method for selecting an siRNA with improved functionality, comprising using the above-mentioned algorithm to identify an siRNA of improved functionality.

According to a fourth embodiment, the present invention provides a kit, wherein said kit is comprised of at least two siRNAs, wherein said at least two siRNAs comprise a first optimized siRNA and a second optimized siRNA, wherein said first optimized siRNA and said second optimized siRNA are optimized according to a formula comprising Formula X.
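By way of illustration only, Formula X as set out above can be evaluated for every 19-nucleotide window of a target sequence, and the highest-scoring windows taken as candidates for such a kit. The sketch below is an assumption-laden example, not the applicants' procedure: the names `FORMULA_X_WEIGHTS`, `formula_x_score`, and `top_candidates` are hypothetical, the sense strand is taken to be identical to the target window, and no filtering for overlap, toxicity motifs, or off-target effects is attempted.

```python
# Formula X weights for sense-strand positions 1-19, one row per base.
FORMULA_X_WEIGHTS = {
    "A": [-8, -1, 12, 7, 18, 12, 19, 6, -4, -5, -2, -5, 17, -3, 4, 2, 8, 11, 30],
    "U": [-13, -10, 2, -2, -5, 5, -2, -10, -5, 15, -1, 0, 10, -9, -13, -10, 3, 9, 9],
    "C": [7, 3, -21, 5, -9, -20, -18, -5, 5, 1, 2, -5, -3, -6, -2, -5, -3, -12, -18],
    "G": [14, 8, 7, -10, -4, 2, 1, 9, 5, -11, 1, 9, -24, 18, 11, 13, -7, -9, -22],
}


def formula_x_score(sense):
    """Evaluate Formula X for a 19-nt sense strand written 5'->3'."""
    seq = sense.upper().replace("T", "U")
    if len(seq) != 19 or set(seq) - set("ACGU"):
        raise ValueError("expected a 19-nt sense strand of A, C, G, U")
    positional = sum(FORMULA_X_WEIGHTS[base][i] for i, base in enumerate(seq))
    au_15_19 = sum(base in "AU" for base in seq[14:19])  # A+U in positions 15-19
    gc_total = sum(base in "GC" for base in seq)         # G+C in the whole strand
    return positional + 6 * au_15_19 - 3 * gc_total


def top_candidates(target_mrna, n=2):
    """Score every 19-nt window; return the n best as (1-based start, sense strand, score)."""
    seq = target_mrna.upper().replace("T", "U")
    scored = [
        (start + 1, seq[start:start + 19], formula_x_score(seq[start:start + 19]))
        for start in range(len(seq) - 18)
    ]
    return sorted(scored, key=lambda item: item[2], reverse=True)[:n]
```

For the arbitrary test sequence `GAAUACAGCUAAUGCAUUA`, the positional terms sum to 132, four of positions 15-19 are A or U, and six bases are G or C, so the score is 132 + 24 − 18 = 138. Because adjacent windows overlap, a practical selection would likely add a spacing rule before choosing the members of a pool; none is applied here.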

According to a fifth embodiment, the present invention provides a method for identifying a hyperfunctional siRNA, comprising applying selection criteria to a set of potential siRNA that comprise 18-30 base pairs, wherein said selection criteria are non-target specific criteria, and said set comprises at least two siRNAs and each of said at least two siRNAs contains a sequence that is at least substantially complementary to a target gene; determining the relative functionality of the at least two siRNAs and assigning each of the at least two siRNAs a functionality score; and selecting siRNAs from the at least two siRNAs that have a functionality score that reflects greater than 80 percent silencing at a concentration in the picomolar range, wherein said greater than 80 percent silencing endures for greater than 120 hours.

In other embodiments, the invention provides kits and/or methods wherein the siRNA are comprised of two separate polynucleotide strands; wherein the siRNA are comprised of a single contiguous molecule such as, for example, a unimolecular siRNA (comprising, for example, either a nucleotide or non-nucleotide loop); wherein the siRNA are expressed from one or more vectors; and wherein two or more genes are silenced by a single administration of siRNA.

According to a sixth embodiment, the present invention provides a hyperfunctional siRNA that is capable of silencing Bcl2.

According to a seventh embodiment, the present invention provides a method for developing an siRNA algorithm for selecting functional and hyperfunctional siRNAs for a given sequence. The method comprises:

(a) selecting a set of siRNAs;

(b) measuring the gene silencing ability of each siRNA from said set;

(c) determining the relative functionality of each siRNA;

(d) determining the amount of improved functionality by the presence or absence of at least one variable selected from the group consisting of the total GC content, melting temperature of the siRNA, GC content at positions 15-19, the presence or absence of a particular nucleotide at a particular position, relative thermodynamic stability at particular positions in a duplex, and the number of times that the same nucleotide repeats within a given sequence; and

(e) developing an algorithm using the information of step (d).

According to this embodiment, preferably the set of siRNAs comprises at least 90 siRNAs from at least one gene, more preferably at least 180 siRNAs from at least two different genes, and most preferably at least 270 and 360 siRNAs from at least three and four different genes, respectively. Additionally, in step (d) the determination is made with preferably at least two, more preferably at least three, even more preferably at least four, and most preferably all of the variables. The resulting algorithm is not target sequence specific.
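Steps (a) through (e) can be pictured with a small sketch. The statistical procedure is not specified at this point in the description, so the following is one hypothetical way of approaching step (d) for a single variable, the presence or absence of a particular nucleotide at a particular position: for each base and position, the mean silencing of the siRNAs carrying that base is compared with the mean silencing of those that do not. The function name `positional_effects` is an assumption of this sketch.

```python
from statistics import mean


def positional_effects(measurements):
    """Map (base, 1-based position) to mean silencing with that base minus mean without it.

    measurements maps each sense-strand sequence (all of equal length) to its
    measured percent silencing, i.e. the output of steps (a) through (c).
    """
    sequences = list(measurements)
    length = len(sequences[0])
    if any(len(seq) != length for seq in sequences):
        raise ValueError("all sequences must have the same length")
    effects = {}
    for pos in range(1, length + 1):
        for base in "ACGU":
            with_base = [s for seq, s in measurements.items() if seq[pos - 1] == base]
            without = [s for seq, s in measurements.items() if seq[pos - 1] != base]
            if with_base and without:  # skip features that cannot be compared
                effects[(base, pos)] = mean(with_base) - mean(without)
    return effects
```

With a made-up toy input such as `{"AAG": 90.0, "AUC": 80.0, "GAC": 40.0, "GUG": 30.0}` (three-nucleotide strings and invented values, used only to exercise the code), the entry for an A at position 1 is 85 − 35 = 50. Differences of this kind could then be scaled into position-specific weights in step (e); how the actual weights were derived is not stated in this passage.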

In another embodiment, the present invention provides rationally designed siRNAs identified using the formulas above.

In yet another embodiment, the present invention is directed to hyperfunctional siRNA.

The ability to use the above algorithms, which are not sequence or species specific, allows for the cost-effective selection of optimized siRNAs for specific target sequences. Accordingly, there will be both greater efficiency and reliability in the use of siRNA technologies.

The methods disclosed herein can be used in conjunction with comparing internal stability profiles of selected siRNAs, and designing an siRNA with a desirable internal stability profile; and/or in conjunction with a selection either for or against sequences that contain motifs that induce cellular stress, for example, cellular toxicity.

Any of the methods disclosed herein can be used to silence one or more genes by introducing an siRNA selected, or designed, in accordance with any of the methods disclosed herein. The siRNA(s) can be introduced into the cell by any method known in the art, including passive uptake or through the use of one or more vectors.

Any of the methods and kits disclosed herein can employ either unimolecular siRNAs, siRNAs comprised of two separate polynucleotide strands, or combinations thereof. Any of the methods disclosed herein can be used in gene silencing, where two or more genes are silenced by a single administration of siRNA(s). The siRNA(s) can be directed against two or more target genes, and administered in a single dose or single transfection, as the case may be.

Optimizing siRNA

According to one embodiment, the present invention provides a method for improving the effectiveness of gene silencing for use to silence a particular gene through the selection of an optimal siRNA. An siRNA selected according to this method may be used individually, or in conjunction with the first embodiment, i.e., with one or more other siRNAs, each of which may or may not be selected by these criteria in order to maximize their efficiency.

The degree to which it is possible to select an siRNA for a given mRNA that maximizes these criteria will depend on the sequence of the mRNA itself. However, the selection criteria will be independent of the target sequence. According to this method, an siRNA is selected for a given gene by using a rational design. That said, rational design can be described in a variety of ways. Rational design is, in simplest terms, the application of a proven set of criteria that enhance the probability of identifying a functional or hyperfunctional siRNA. In one method, rationally designed siRNA can be identified by maximizing one or more of the following criteria:

- (1) A low GC content, preferably between about 30-52%.
- (2) At least 2, preferably at least 3 A or U bases at positions 15-19 of the siRNA on the sense strand.
- (3) An A base at position 19 of the sense strand.
- (4) An A base at position 3 of the sense strand.
- (5) A U base at position 10 of the sense strand.
- (6) An A base at position 14 of the sense strand.
- (7) A base other than C at position 19 of the sense strand.
- (8) A base other than G at position 13 of the sense strand.
- (9) A Tm, which refers to the character of the internal repeat that results in inter- or intramolecular structures for one strand of the duplex, that is preferably not stable at greater than 50° C., more preferably not stable at greater than 37° C., even more preferably not stable at greater than 30° C. and most preferably not stable at greater than 20° C.
- (10) A base other than U at position 5 of the sense strand.
- (11) A base other than A at position 11 of the sense strand.
- (12) A base other than an A at position 1 of the sense strand.
- (13) A base other than an A at position 2 of the sense strand.
- (14) An A base at position 4 of the sense strand.
- (15) An A base at position 5 of the sense strand.
- (16) An A base at position 6 of the sense strand.
- (17) An A base at position 7 of the sense strand.
- (18) An A base at position 8 of the sense strand.
- (19) A base other than an A at position 9 of the sense strand.
- (20) A base other than an A at position 10 of the sense strand.
- (21) A base other than an A at position 11 of the sense strand.
- (22) A base other than an A at position 12 of the sense strand.
- (23) An A base at position 13 of the sense strand.
- (24) A base other than an A at position 14 of the sense strand.
- (25) An A base at position 15 of the sense strand.
- (26) An A base at position 16 of the sense strand.
- (27) An A base at position 17 of the sense strand.
- (28) An A base at position 18 of the sense strand.
- (29) A base other than a U at position 1 of the sense strand.
- (30) A base other than a U at position 2 of the sense strand.
- (31) A U base at position 3 of the sense strand.
- (32) A base other than a U at position 4 of the sense strand.
- (33) A base other than a U at position 5 of the sense strand.
- (34) A U base at position 6 of the sense strand.
- (35) A base other than a U at position 7 of the sense strand.
- (36) A base other than a U at position 8 of the sense strand.
- (37) A base other than a U at position 9 of the sense strand.
- (38) A base other than a U at position 11 of the sense strand.
- (39) A U base at position 13 of the sense strand.
- (40) A base other than a U at position 14 of the sense strand.
- (41) A base other than a U at position 15 of the sense strand.
- (42) A base other than a U at position 16 of the sense strand.
- (43) A U base at position 17 of the sense strand.
- (44) A U base at position 18 of the sense strand.
- (45) A U base at position 19 of the sense strand.
- (46) A C base at position 1 of the sense strand.
- (47) A C base at position 2 of the sense strand.
- (48) A base other than a C at position 3 of the sense strand.
- (49) A C base at position 4 of the sense strand.
- (50) A base other than a C at position 5 of the sense strand.
- (51) A base other than a C at position 6 of the sense strand.
- (52) A base other than a C at position 7 of the sense strand.
- (53) A base other than a C at position 8 of the sense strand.
- (54) A C base at position 9 of the sense strand.
- (55) A C base at position 10 of the sense strand.
- (56) A C base at position 11 of the sense strand.
- (57) A base other than a C at position 12 of the sense strand.
- (58) A base other than a C at position 13 of the sense strand.
- (59) A base other than a C at position 14 of the sense strand.
- (60) A base other than a C at position 15 of the sense strand.
- (61) A base other than a C at position 16 of the sense strand.
- (62) A base other than a C at position 17 of the sense strand.
- (63) A base other than a C at position 18 of the sense strand.
- (64) A G base at position 1 of the sense strand.
- (65) A G base at position 2 of the sense strand.
- (66) A G base at position 3 of the sense strand.
- (67) A base other than a G at position 4 of the sense strand.
- (68) A base other than a G at position 5 of the sense strand.
- (69) A G base at position 6 of the sense strand.
- (70) A G base at position 7 of the sense strand.
- (71) A G base at position 8 of the sense strand.
- (72) A G base at position 9 of the sense strand.
- (73) A base other than a G at position 10 of the sense strand.
- (74) A G base at position 11 of the sense strand.
- (75) A G base at position 12 of the sense strand.
- (76) A G base at position 14 of the sense strand.
- (77) A G base at position 15 of the sense strand.
- (78) A G base at position 16 of the sense strand.
- (79) A base other than a G at position 17 of the sense strand.
- (80) A base other than a G at position 18 of the sense strand.
- (81) A base other than a G at position 19 of the sense strand.

The importance of various criteria can vary greatly. For instance, a C base at position 10 of the sense strand makes a minor contribution to duplex functionality. In contrast, the absence of a C at position 3 of the sense strand is very important. Accordingly, preferably an siRNA will satisfy as many of the aforementioned criteria as possible.

With respect to the criteria, GC content, as well as a high number of AU in positions 15-19 of the sense strand, may be important for easement of the unwinding of double stranded siRNA duplex. Duplex unwinding has been shown to be crucial for siRNA functionality in vivo.

With respect to criterion 9, the internal structure is measured in terms of the melting temperature of the single strand of siRNA, which is the temperature at which 50% of the molecules will become denatured. With respect to criteria 2-8 and 10-11, the positions refer to sequence positions on the sense strand, which is the strand that is identical to the mRNA.

In one preferred embodiment, at least criteria 1 and 8 are satisfied. In another preferred embodiment, at least criteria 7 and 8 are satisfied. In still another preferred embodiment, at least criteria 1, 8 and 9 are satisfied.
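As an illustration only, a few of the listed criteria can be checked mechanically for a 19-base sense strand. The following minimal Python sketch covers criteria 1, 2, 7 and 8; the function names and the example sequence are hypothetical and are not taken from the sequence listing.

```python
def gc_percent(sense: str) -> float:
    """Percentage of G and C bases in the sense strand."""
    s = sense.upper()
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def au_count_15_19(sense: str) -> int:
    """Number of A or U bases at positions 15-19 (1-based) of the sense strand."""
    return sum(base in "AU" for base in sense.upper()[14:19])


def check_criteria(sense: str) -> dict:
    """Evaluate criteria 1, 2, 7 and 8 for a 19-base sense strand."""
    s = sense.upper()
    if len(s) != 19:
        raise ValueError("this sketch assumes a 19-base sense strand")
    return {
        "criterion_1_gc_30_to_52": 30.0 <= gc_percent(s) <= 52.0,
        "criterion_2_au_15_19_at_least_2": au_count_15_19(s) >= 2,
        "criterion_7_not_C_at_19": s[18] != "C",
        "criterion_8_not_G_at_13": s[12] != "G",
    }


# Hypothetical example sequence, used only to exercise the checks.
result = check_criteria("GAAAGAGCAUCUACGGUGA")
```

A sequence satisfying the preferred embodiment of criteria 1 and 8 would have both corresponding entries set to True.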

It should be noted that all of the aforementioned criteria regarding sequence position specifics are with respect to the 5′ end of the sense strand. Reference is made to the sense strand, because most databases contain information that describes the mRNA. Because according to the present invention a chain can be from 18 to 30 bases in length, and the aforementioned criteria assume a chain 19 base pairs in length, it is important to keep the aforementioned criteria applicable to the correct bases.

When there are only 18 bases, the base pair that is not present is the base pair that is located at the 3′ end of the sense strand. When there are twenty to thirty bases present, then additional bases are added at the 5′ end of the sense chain and occupy positions −1 to −11. Accordingly, with respect to SEQ. ID NO. 0001 NNANANNNNUCNAANNNNA and SEQ. ID NO. 0028 GUCNNANANNNNUCNAANNNNA, both would have A at position 3, A at position 5, U at position 10, C at position 11, A at position 13, A at position 14 and A at position 19. However, SEQ. ID NO. 0028 would also have C at position −1, U at position −2 and G at position −3.
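The position numbering described above can be sketched in Python as follows. This is a minimal illustration, assuming that there is no position 0 and that bases added at the 5′ end are numbered −1, −2, and so on moving away from position 1; the function name is hypothetical.

```python
def position_map(sense: str) -> dict:
    """Map criteria positions to bases for a sense strand of 18-30 bases.

    Positions 1-19 follow the 19-base convention. Extra 5' bases take
    negative positions (-1 is adjacent to position 1). An 18-base strand
    simply lacks position 19.
    """
    n = len(sense)
    if not 18 <= n <= 30:
        raise ValueError("sense strand must be 18 to 30 bases long")
    extra = max(n - 19, 0)  # number of bases added at the 5' end
    mapping = {}
    for i, base in enumerate(sense):
        pos = i - extra if i < extra else i - extra + 1
        mapping[pos] = base
    return mapping


# SEQ. ID NO. 0028 from the text: three extra bases at the 5' end.
m = position_map("GUCNNANANNNNUCNAANNNNA")
# m[-3] == "G", m[-2] == "U", m[-1] == "C", m[3] == "A", m[19] == "A"
```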

For a 19 base pair siRNA, an optimal sequence of one of the strands may be represented below, where N is any base, A, C, G, or U:

SEQ. ID NO. 0001. NNANANNNNUCNAANNNNA
SEQ. ID NO. 0002. NNANANNNNUGNAANNNNA
SEQ. ID NO. 0003. NNANANNNNUUNAANNNNA
SEQ. ID NO. 0004. NNANANNNNUCNCANNNNA
SEQ. ID NO. 0005. NNANANNNNUGNCANNNNA
SEQ. ID NO. 0006. NNANANNNNUUNCANNNNA
SEQ. ID NO. 0007. NNANANNNNUCNUANNNNA
SEQ. ID NO. 0008. NNANANNNNUGNUANNNNA
SEQ. ID NO. 0009. NNANANNNNUUNUANNNNA
SEQ. ID NO. 0010. NNANCNNNNUCNAANNNNA
SEQ. ID NO. 0011. NNANCNNNNUGNAANNNNA
SEQ. ID NO. 0012. NNANCNNNNUUNAANNNNA
SEQ. ID NO. 0013. NNANCNNNNUCNCANNNNA
SEQ. ID NO. 0014. NNANCNNNNUGNCANNNNA
SEQ. ID NO. 0015. NNANCNNNNUUNCANNNNA
SEQ. ID NO. 0016. NNANCNNNNUCNUANNNNA
SEQ. ID NO. 0017. NNANCNNNNUGNUANNNNA
SEQ. ID NO. 0018. NNANCNNNNUUNUANNNNA
SEQ. ID NO. 0019. NNANGNNNNUCNAANNNNA
SEQ. ID NO. 0020. NNANGNNNNUGNAANNNNA
SEQ. ID NO. 0021. NNANGNNNNUUNAANNNNA
SEQ. ID NO. 0022. NNANGNNNNUCNCANNNNA
SEQ. ID NO. 0023. NNANGNNNNUGNCANNNNA
SEQ. ID NO. 0024. NNANGNNNNUUNCANNNNA
SEQ. ID NO. 0025. NNANGNNNNUCNUANNNNA
SEQ. ID NO. 0026. NNANGNNNNUGNUANNNNA
SEQ. ID NO. 0027. NNANGNNNNNUNUANNNNA
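A candidate sense strand can be compared against one of these templates by treating N as a wildcard. The following Python sketch is one possible way to do so; the function name and the example candidate are hypothetical.

```python
def matches_template(sense: str, template: str) -> bool:
    """Return True if `sense` fits `template`, where N matches any base."""
    s, t = sense.upper(), template.upper()
    if len(s) != len(t):
        return False
    return all(tb == "N" or tb == sb for sb, tb in zip(s, t))


SEQ_ID_0001 = "NNANANNNNUCNAANNNNA"
SEQ_ID_0002 = "NNANANNNNUGNAANNNNA"

# Hypothetical candidate with A at 3, 5, 13, 14, 19, U at 10 and C at 11.
candidate = "GCAUAGGCGUCGAACCGGA"
fits_0001 = matches_template(candidate, SEQ_ID_0001)  # True
fits_0002 = matches_template(candidate, SEQ_ID_0002)  # False: position 11 is C, not G
```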

In one embodiment, the sequence used as an siRNA is selected by choosing the siRNA that scores highest according to one of the following seven algorithms that are represented by Formulas I-VII:
Relative functionality of siRNA=−(GC/3)+(AU_15-19)−(Tm_{20° C.})*3−(G₁₃)*3−(C₁₉)+(A₁₉)*2+(A₃)+(U₁₀)+(A₁₄)−(U₅)−(A₁₁) Formula I
Relative functionality of siRNA=−(GC/3)−(AU_15-19)*3−(G₁₃)*3−(C₁₉)+(A₁₉)*2+(A₃) Formula II
Relative functionality of siRNA=−(GC/3)+(AU_15-19)−(Tm_{20° C.})*3 Formula III
Relative functionality of siRNA=−GC/2+(AU_15-19)/2−(Tm_{20° C.})*2−(G₁₃)*3−(C₁₉)+(A₁₉)*2+(A₃)+(U₁₀)+(A₁₄)−(U₅)−(A₁₁) Formula IV
Relative functionality of siRNA=−(G₁₃)*3−(C₁₉)+(A₁₉)*2+(A₃)+(U₁₀)+(A₁₄)−(U₅)−(A₁₁) Formula V
Relative functionality of siRNA=−(G₁₃)*3−(C₁₉)+(A₁₉)*2+(A₃) Formula VI
Relative functionality of siRNA=−(GC/2)+(AU_15-19)/2−(Tm_{20° C.})*1−(G₁₃)*3−(C₁₉)+(A₁₉)*3+(A₃)*3+(U₁₀)/2+(A₁₄)/2−(U₅)/2−(A₁₁)/2 Formula VII

In Formulas I-VII:

A₁₉=1 if A is the base at position 19 on the sense strand, otherwise its value is 0;

AU_15-19=0-5 depending on the number of A or U bases on the sense strand at positions 15-19;

G₁₃=1 if G is the base at position 13 on the sense strand, otherwise its value is 0;

C₁₉=1 if C is the base at position 19 of the sense strand, otherwise its value is 0;

GC=the number of G and C bases in the entire sense strand;

Tm_{20° C.}=1 if the Tm is greater than 20° C., otherwise its value is 0;

A₃=1 if A is the base at position 3 on the sense strand, otherwise its value is 0;

U₁₀=1 if U is the base at position 10 on the sense strand, otherwise its value is 0;

A₁₄=1 if A is the base at position 14 on the sense strand, otherwise its value is 0;

U₅=1 if U is the base at position 5 on the sense strand, otherwise its value is 0; and

A₁₁=1 if A is the base at position 11 of the sense strand, otherwise its value is 0.

Formulas I-VII provide relative information regarding functionality. When the values for two sequences are compared for a given formula, the relative functionality is ascertained; a higher positive number indicates a greater functionality. For example, in many applications a value of 5 or greater is beneficial.
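By way of illustration, Formula I can be evaluated with a short Python function. This is a minimal sketch, assuming a 19-base sense strand and assuming that the Tm_{20° C.} flag is determined separately and passed in by the caller; the function name and the example sequence are hypothetical.

```python
def formula_i(sense: str, tm_above_20c: bool = False) -> float:
    """Relative functionality of a 19-base sense strand per Formula I."""
    s = sense.upper()
    if len(s) != 19:
        raise ValueError("this sketch assumes a 19-base sense strand")

    def at(base: str, pos: int) -> int:
        """Binary variable: 1 if `base` is at 1-based position `pos`."""
        return int(s[pos - 1] == base)

    gc = sum(b in "GC" for b in s)                # GC: G and C bases in the strand
    au_15_19 = sum(b in "AU" for b in s[14:19])   # AU_15-19: value from 0 to 5
    return (
        -(gc / 3)
        + au_15_19
        - int(tm_above_20c) * 3
        - at("G", 13) * 3
        - at("C", 19)
        + at("A", 19) * 2
        + at("A", 3)
        + at("U", 10)
        + at("A", 14)
        - at("U", 5)
        - at("A", 11)
    )


# Hypothetical example: GC = 12, AU_15-19 = 1, A19, A3, U10 and A14 present.
score = formula_i("GCAUAGGCGUCGAACCGGA")
```

Two candidates for the same target would be compared by evaluating both with the same formula and preferring the higher value.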

Additionally, in many applications, more than one of these formulas would provide useful information as to the relative functionality of potential siRNA sequences. However, it is beneficial to have more than one type of formula, because not every formula will be able to help to differentiate among potential siRNA sequences. For example, in particularly high GC mRNAs, formulas that take that parameter into account would not be useful and application of formulas that lack GC elements (e.g., formulas V and VI) might provide greater insights into duplex functionality. Similarly, formula II might be used in situations where hairpin structures are not observed in duplexes, and formula IV might be applicable for sequences that have higher AU content. Thus, one may consider a particular sequence in light of more than one or even all of these algorithms to obtain the best differentiation among sequences. In some instances, application of a given algorithm may identify an unusually large number of potential siRNA sequences, and in those cases, it may be appropriate to re-analyze that sequence with a second algorithm that is, for instance, more stringent. Alternatively, it is conceivable that analysis of a sequence with a given formula yields no acceptable siRNA sequences (i.e. low SMARTSCORES™, or siRNA ranking). In this instance, it may be appropriate to re-analyze that sequence with a second algorithm that is, for instance, less stringent. In still other instances, analysis of a single sequence with two separate formulas may give rise to conflicting results (i.e. one formula generates a set of siRNA with high SMARTSCORES™, or siRNA ranking, while the other formula identifies a set of siRNA with low SMARTSCORES™, or siRNA ranking). In these instances, it may be necessary to determine which weighted factor(s) (e.g. GC content) are contributing to the discrepancy and to assess the sequence to decide whether these factors should or should not be included. Alternatively, the sequence could be analyzed by a third, fourth, or fifth algorithm to identify a set of rationally designed siRNA.

The above-referenced criteria are particularly advantageous when used in combination with pooling techniques as depicted in Table I:

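TABLE I. FUNCTIONAL PROBABILITY

| CRITERIA | OLIGOS >95% | OLIGOS >80% | OLIGOS <70% | POOLS >95% | POOLS >80% | POOLS <70% |
|---|---|---|---|---|---|---|
| CURRENT | 33.0 | 50.0 | 23.0 | 79.5 | 97.3 | 0.3 |
| NEW | 50.0 | 88.5 | 8.0 | 93.8 | 99.98 | 0.005 |
| (GC) | 28.0 | 58.9 | 36.0 | 72.8 | 97.1 | 1.6 |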

The term “current” used in Table I refers to Tuschl's conventional siRNA parameters (Elbashir, S. M. et al. (2002) “Analysis of gene function in somatic mammalian cells using small interfering RNAs” Methods 26: 199-213). “New” refers to the design parameters described in Formulas I-VII. “GC” refers to criteria that select siRNA solely on the basis of GC content.

As Table I indicates, when more functional siRNA duplexes are chosen, the proportion of siRNAs that produce <70% silencing drops from 23% to 8% and the proportion of siRNA duplexes that produce >80% silencing rises from 50% to 88.5%. Further, of the siRNA duplexes with >80% silencing, a larger portion of these siRNAs actually silence >95% of the target expression (the new criteria increase the portion from 33% to 50%). Applying these new criteria to pooled siRNAs shows that, with pooling, the proportion of pools silencing >95% increases from 79.5% to 93.8%, and pools that silence less than 70% are essentially eliminated.

Table II similarly shows the particularly beneficial results of pooling in combination with the aforementioned criteria. However, Table II, which takes into account each of the aforementioned variables, demonstrates even a greater degree of improvement in functionality.

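TABLE II. FUNCTIONAL PROBABILITY

| | OLIGOS FUNCTIONAL | OLIGOS AVERAGE | OLIGOS NON-FUNCTIONAL | POOLS FUNCTIONAL | POOLS AVERAGE | POOLS NON-FUNCTIONAL |
|---|---|---|---|---|---|---|
| RANDOM | 20 | 40 | 50 | 67 | 97 | 3 |
| CRITERIA 1 | 52 | 99 | 0.1 | 97 | 93 | 0.0040 |
| CRITERIA 4 | 89 | 99 | 0.1 | 99 | 99 | 0.0000 |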

The terms “Functional,” “Average,” and “Non-functional” used in Table II refer to siRNA that exhibit >80%, >50%, and <50% functionality, respectively. Criteria 1 and 4 refer to specific criteria described above.

The above-described algorithms may be used with or without a computer program that allows for the inputting of the sequence of the mRNA and automatically outputs the optimal siRNA. The computer program may, for example, be accessible from a local terminal or personal computer, over an internal network or over the Internet.

In addition to the formulas above, more detailed algorithms may be used for selecting siRNA. Preferably, at least one RNA duplex of 18-30 base pairs is selected such that it is optimized according to a formula selected from:
(−14)*G₁₃−13*A₁−12*U₇−11*U₂−10*A₁₁−10*U₄−10*C₃−10*C₅−10*C₆−9*A₁₀−9*U₉−9*C₁₈−8*G₁₀−7*U₁−7*U₁₆−7*C₁₇−7*C₁₉+7*U₁₇+8*A₂+8*A₄+8*A₅+8*C₄+9*G₈+10*A₇+10*U₁₈+11*A₁₉+11*C₉+15*G₁+18*A₃+19*U₁₀−Tm−3*(GC_total)−6*(GC_15-19)−30*X; Formula VIII
(14.1)*A₃+(14.9)*A₆+(17.6)*A₁₃+(24.7)*A₁₉+(14.2)*U₁₀+(10.5)*C₉+(23.9)*G₁+(16.3)*G₂+(−12.3)*A₁₁+(−19.3)*U₁+(−12.1)*U₂+(−11)*U₃+(−15.2)*U₁₅+(−11.3)*U₁₆+(−11.8)*C₃+(−17.4)*C₆+(−10.5)*C₇+(−13.7)*G₁₃+(−25.9)*G₁₉−Tm−3*(GC_total)−6*(GC_15-19)−30*X; and Formula IX
(−8)*A1+(−1)*A2+(12)*A3+(7)*A4+(18)*A5+(12)*A6+(19)*A7+(6)*A8+(−4)*A9+(−5)*A10+(−2)*A11+(−5)*A12+(17)*A13+(−3)*A14+(4)*A15+(2)*A16+(8)*A17+(11)*A18+(30)*A19+(−13)*U1+(−10)*U2+(2)*U3+(−2)*U4+(−5)*U5+(5)*U6+(−2)*U7+(−10)*U8+(−5)*U9+(15)*U10+(−1)*U11+(0)*U12+(10)*U13+(−9)*U14+(−13)*U15+(−10)*U16+(3)*U17+(9)*U18+(9)*U19+(7)*C1+(3)*C2+(−21)*C3+(5)*C4+(−9)*C5+(−20)*C6+(−18)*C7+(−5)*C8+(5)*C9+(1)*C10+(2)*C11+(−5)*C12+(−3)*C13+(−6)*C14+(−2)*C15+(−5)*C16+(−3)*C17+(−12)*C18+(−18)*C19+(14)*G1+(8)*G2+(7)*G3+(−10)*G4+(−4)*G5+(2)*G6+(1)*G7+(9)*G8+(5)*G9+(−11)*G10+(1)*G11+(9)*G12+(−24)*G13+(18)*G14+(11)*G15+(13)*G16+(−7)*G17+(−9)*G18+(−22)*G19+6*(number of A+U in position 15-19)−3*(number of G+C in whole siRNA). Formula X

wherein

A₁=1 if A is the base at position 1 of the sense strand, otherwise its value is 0; A₂=1 if A is the base at position 2 of the sense strand, otherwise its value is 0; A₃=1 if A is the base at position 3 of the sense strand, otherwise its value is 0; A₄=1 if A is the base at position 4 of the sense strand, otherwise its value is 0; A₅=1 if A is the base at position 5 of the sense strand, otherwise its value is 0; A₆=1 if A is the base at position 6 of the sense strand, otherwise its value is 0; A₇=1 if A is the base at position 7 of the sense strand, otherwise its value is 0; A₁₀=1 if A is the base at position 10 of the sense strand, otherwise its value is 0; A₁₁=1 if A is the base at position 11 of the sense strand, otherwise its value is 0; A₁₃=1 if A is the base at position 13 of the sense strand, otherwise its value is 0; A₁₉=1 if A is the base at position 19 of the sense strand, otherwise if another base is present or the sense strand is only 18 base pairs in length, its value is 0;

C₃=1 if C is the base at position 3 of the sense strand, otherwise its value is 0; C₄=1 if C is the base at position 4 of the sense strand, otherwise its value is 0; C₅=1 if C is the base at position 5 of the sense strand, otherwise its value is 0; C₆=1 if C is the base at position 6 of the sense strand, otherwise its value is 0; C₇=1 if C is the base at position 7 of the sense strand, otherwise its value is 0; C₉=1 if C is the base at position 9 of the sense strand, otherwise its value is 0; C₁₇=1 if C is the base at position 17 of the sense strand, otherwise its value is 0; C₁₈=1 if C is the base at position 18 of the sense strand, otherwise its value is 0; C₁₉=1 if C is the base at position 19 of the sense strand, otherwise if another base is present or the sense strand is only 18 base pairs in length, its value is 0;

G₁=1 if G is the base at position 1 on the sense strand, otherwise its value is 0; G₂=1 if G is the base at position 2 of the sense strand, otherwise its value is 0; G₈=1 if G is the base at position 8 on the sense strand, otherwise its value is 0; G₁₀=1 if G is the base at position 10 on the sense strand, otherwise its value is 0; G₁₃=1 if G is the base at position 13 on the sense strand, otherwise its value is 0; G₁₉=1 if G is the base at position 19 of the sense strand, otherwise if another base is present or the sense strand is only 18 base pairs in length, its value is 0;

U₁=1 if U is the base at position 1 on the sense strand, otherwise its value is 0; U₂=1 if U is the base at position 2 on the sense strand, otherwise its value is 0; U₃=1 if U is the base at position 3 on the sense strand, otherwise its value is 0; U₄=1 if U is the base at position 4 on the sense strand, otherwise its value is 0; U₇=1 if U is the base at position 7 on the sense strand, otherwise its value is 0; U₉=1 if U is the base at position 9 on the sense strand, otherwise its value is 0; U₁₀=1 if U is the base at position 10 on the sense strand, otherwise its value is 0; U₁₅=1 if U is the base at position 15 on the sense strand, otherwise its value is 0; U₁₆=1 if U is the base at position 16 on the sense strand, otherwise its value is 0; U₁₇=1 if U is the base at position 17 on the sense strand, otherwise its value is 0; U₁₈=1 if U is the base at position 18 on the sense strand, otherwise its value is 0;

GC_15-19=the number of G and C bases within positions 15-19 of the sense strand, or within positions 15-18 if the sense strand is only 18 base pairs in length;

GC_total=the number of G and C bases in the sense strand;

Tm=100 if the siRNA oligo has an internal repeat longer than 4 base pairs, otherwise its value is 0; and

X=the number of times that the same nucleotide repeats four or more times in a row.

The above formulas VIII, IX, and X, as well as formulas I-VII, provide methods for selecting siRNA in order to increase the efficiency of gene silencing. A subset of variables of any of the formulas may be used, though when fewer variables are used, the optimization hierarchy becomes less reliable.

With respect to the variables of the above-referenced formulas, a single letter of A or C or G or U followed by a subscript refers to a binary condition. The binary condition is that either the particular base is present at that particular position (wherein the value is “1”) or the base is not present (wherein the value is “0”). Because position 19 is optional, i.e., there might be only 18 base pairs, when there are only 18 base pairs, any base with a subscript of 19 in the formulas above would have a zero value for that parameter. Before or after each variable is a number followed by *, which indicates that the value of the variable is to be multiplied or weighted by that number.
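The binary variables and weights of Formula IX lend themselves to a table-driven evaluation. The Python sketch below is illustrative only: it assumes an 18- or 19-base sense strand, and it assumes that the Tm term and the X term are determined separately and supplied by the caller. The names `FORMULA_IX_WEIGHTS` and `formula_ix` are hypothetical, as is the example sequence.

```python
# Positional weights of Formula IX, keyed by (base, 1-based sense-strand position).
FORMULA_IX_WEIGHTS = {
    ("A", 3): 14.1, ("A", 6): 14.9, ("A", 13): 17.6, ("A", 19): 24.7,
    ("U", 10): 14.2, ("C", 9): 10.5, ("G", 1): 23.9, ("G", 2): 16.3,
    ("A", 11): -12.3, ("U", 1): -19.3, ("U", 2): -12.1, ("U", 3): -11.0,
    ("U", 15): -15.2, ("U", 16): -11.3, ("C", 3): -11.8, ("C", 6): -17.4,
    ("C", 7): -10.5, ("G", 13): -13.7, ("G", 19): -25.9,
}


def formula_ix(sense: str, tm: float = 0.0, x: int = 0) -> float:
    """Score a sense strand of 18 or 19 bases with Formula IX.

    `tm` and `x` are the Tm and X terms, supplied by the caller.
    """
    s = sense.upper()
    if len(s) not in (18, 19):
        raise ValueError("this sketch assumes an 18- or 19-base sense strand")
    # A binary variable contributes its weight only when the base is present;
    # position-19 variables are zero for an 18-base strand.
    positional = sum(
        weight
        for (base, pos), weight in FORMULA_IX_WEIGHTS.items()
        if pos <= len(s) and s[pos - 1] == base
    )
    gc_total = sum(b in "GC" for b in s)
    gc_15_19 = sum(b in "GC" for b in s[14:19])  # positions 15-18 for an 18-mer
    return positional - tm - 3 * gc_total - 6 * gc_15_19 - 30 * x


# Hypothetical example sequence.
score_19 = formula_ix("GCAUAGGCGUCGAACCGGA")
```

Keeping the weights in a dictionary makes it straightforward to substitute the coefficients of another positional formula without changing the scoring logic.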

The numbers preceding the variables A, or G, or C, or U in Formulas VIII, IX, and X (or after the variables in Formula I-VII) were determined by comparing the difference in the frequency of individual bases at different positions in functional siRNA and total siRNA. Specifically, the frequency in which a given base was observed at a particular position in functional groups was compared with the frequency that that same base was observed in the total, randomly selected siRNA set. If the absolute value of the difference between the functional and total values was found to be greater than 6%, that parameter was included in the equation. Thus, for instance, if the frequency of finding a “G” at position 13 (G₁₃) is found to be 6% in a given functional group, and the frequency of G₁₃ in the total population of siRNAs is 20%, the difference between the two values is 6%-20%=−14%. As the absolute value is greater than six (6), this factor (−14) is included in the equation. Thus, in Formula VIII, in cases where the siRNA under study has a G in position 13, the accrued value is (−14)*(1)=−14. In contrast, when a base other than G is found at position 13, the accrued value is (−14)*(0)=0.
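The derivation of positional weights described above can be sketched in Python as follows. This is an illustration under stated assumptions: frequencies are expressed in percent, the inclusion threshold is the 6% named in the text, and the function names and the second example entry (A at position 1) are hypothetical.

```python
from collections import Counter


def positional_frequencies(sequences: list, length: int = 19) -> dict:
    """Percent frequency of each base at each 1-based position in a set."""
    freqs = {}
    for pos in range(1, length + 1):
        counts = Counter(seq[pos - 1] for seq in sequences)
        for base in "ACGU":
            freqs[(base, pos)] = 100.0 * counts[base] / len(sequences)
    return freqs


def derive_weights(functional: dict, total: dict, threshold: float = 6.0) -> dict:
    """Keep (base, position) terms whose frequency difference exceeds the threshold.

    The weight is the functional-group frequency minus the total-set frequency.
    """
    weights = {}
    for key, functional_freq in functional.items():
        diff = functional_freq - total[key]
        if abs(diff) > threshold:
            weights[key] = diff
    return weights


# Worked example from the text: G13 at 6% (functional) versus 20% (total).
# The A1 entry is a made-up value that falls below the threshold.
weights = derive_weights(
    {("G", 13): 6.0, ("A", 1): 22.0},
    {("G", 13): 20.0, ("A", 1): 25.0},
)
```

In this example only the G₁₃ term is retained, with a weight of −14.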

When developing a means to optimize siRNAs, the inventors observed that a bias toward low internal thermodynamic stability of the duplex at the 5′-antisense (AS) end is characteristic of naturally occurring miRNA precursors. The inventors extended this observation to siRNAs for which functionality had been assessed in tissue culture.

With respect to the parameter GC_15-19, a value of 0-5 will be ascribed depending on the number of G or C bases at positions 15 to 19. If there are only 18 base pairs, the value is between 0 and 4.

With respect to the criterion GC_total content, a number from 0-30 will be ascribed, which correlates to the total number of G and C nucleotides on the sense strand, excluding overhangs. Without wishing to be bound by any one theory, it is postulated that the significance of the GC content (as well as AU content at positions 15-19, which is a parameter for formulas III-VII) relates to the easement of the unwinding of a double-stranded siRNA duplex. Duplex unwinding is believed to be crucial for siRNA functionality in vivo and overall low internal stability, especially low internal stability of the first unwound base pair, is believed to be important to maintain sufficient processivity of RISC complex-induced duplex unwinding. If the duplex has 19 base pairs, those at positions 15-19 on the sense strand will unwind first if the molecule exhibits a sufficiently low internal stability at that position. As persons skilled in the art are aware, RISC is a complex of approximately twelve proteins; Dicer is one, but not the only, helicase within this complex. Accordingly, although the GC parameters are believed to relate to activity with Dicer, they are also important for activity with other RISC proteins.

The value of the parameter Tm is 0 when there are no internal repeats longer than (or equal to) four base pairs present in the siRNA duplex; otherwise the value is 1. Thus for example, if the sequence ACGUACGU, or any other four nucleotide (or more) palindrome exists within the structure, the value will be one (1). Alternatively if the structure ACGGACG, or any other 3 nucleotide (or less) palindrome exists, the value will be zero (0).

The variable “X” refers to the number of times that the same nucleotide occurs contiguously in a stretch of four or more units. If there are, for example, four contiguous As in one part of the sequence and elsewhere in the sequence four contiguous Cs, X=2. Further, if there are two separate contiguous stretches of four of the same nucleotides or eight or more of the same nucleotides in a row, then X=2. However, X does not increase for five, six or seven contiguous nucleotides.
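The counting rule for X can be illustrated with a short Python function. This sketch follows a literal reading of the paragraph above: a run of four to seven identical nucleotides contributes 1, and a run of eight or more contributes 2. The function name and example sequences are hypothetical.

```python
from itertools import groupby


def count_x(sequence: str) -> int:
    """Compute X: contributions from runs of four or more identical nucleotides."""
    x = 0
    for _, run in groupby(sequence.upper()):
        run_length = len(list(run))
        if run_length >= 8:
            x += 2  # eight or more in a row counts as two
        elif run_length >= 4:
            x += 1  # five, six or seven in a row do not add more than one
    return x


x_two_runs = count_x("AAAAGCUCCCCGU")  # four As and, elsewhere, four Cs
x_seven = count_x("GAAAAAAAC")         # seven As in a row
x_eight = count_x("AAAAAAAAGC")        # eight As in a row
```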

Again, when applying Formula VIII, Formula IX, or Formula X, to a given mRNA, (the “target RNA” or “target molecule”), one may use a computer program to evaluate the criteria for every sequence of 18-30 base pairs or only sequences of a fixed length, e.g., 19 base pairs. Preferably the computer program is designed such that it provides a report ranking all of the potential siRNAs of 18-30 base pairs according to which sequences generate the highest value. A higher value refers to a more efficient siRNA for a particular target gene. The computer program that may be used may be developed in any computer language that is known to be useful for scoring nucleotide sequences, or it may be developed with the assistance of a commercially available product such as Microsoft's PRODUCT.NET. Additionally, rather than run every sequence through one and/or another formula, one may compare a subset of the sequences, which may be desirable if for example only a subset are available. For instance, it may be desirable to first perform a BLAST (Basic Local Alignment Search Tool) search and to identify sequences that have no homology to other targets. Alternatively, it may be desirable to scan the sequence and to identify regions of moderate GC context, then perform relevant calculations using one of the above-described formulas on these regions. These calculations can be done manually or with the aid of a computer.
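A program of the kind described could, for a fixed length of 19 bases, enumerate every window of the target and sort the windows by score. The Python sketch below is a minimal illustration, not the program referred to in the text. For brevity it uses Formula VI as the scoring function, because that formula depends only on the sequence; any other scoring function could be passed in. The function names and the short target sequence are hypothetical.

```python
def formula_vi(sense: str) -> int:
    """Formula VI: -(G13)*3 - (C19) + (A19)*2 + (A3) for a 19-base sense strand."""
    g13 = int(sense[12] == "G")
    c19 = int(sense[18] == "C")
    a19 = int(sense[18] == "A")
    a3 = int(sense[2] == "A")
    return -3 * g13 - c19 + 2 * a19 + a3


def rank_candidates(mrna: str, score_fn=formula_vi, length: int = 19) -> list:
    """Score every window of `length` bases and return them, best first.

    Each entry is (score, 1-based start position, sense sequence).
    """
    rna = "".join(mrna.split()).upper().replace("T", "U")
    scored = [
        (score_fn(rna[i:i + length]), i + 1, rna[i:i + length])
        for i in range(len(rna) - length + 1)
    ]
    # sorted() is stable, so equal scores keep their order along the target.
    return sorted(scored, key=lambda entry: -entry[0])


# Hypothetical 21-base target, giving three 19-base windows.
report = rank_candidates("GCAUAGGCGUCGAACCGGAUA")
```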

As with Formulas I-VII, either Formula VIII, Formula IX, or Formula X may be used for a given mRNA target sequence. However, it is possible that according to one or the other formula more than one siRNA will have the same value. Accordingly, it is beneficial to have a second formula by which to differentiate sequences. Formulas IX and X were derived in a similar fashion as Formula VIII, yet used a larger data set and thus yield sequences with higher statistical correlations to highly functional duplexes. The sequence that has the highest value ascribed to it may be referred to as a “first optimized duplex.” The sequence that has the second highest value ascribed to it may be referred to as a “second optimized duplex.” Similarly, the sequences that have the third and fourth highest values ascribed to them may be referred to as a third optimized duplex and a fourth optimized duplex, respectively. When more than one sequence has the same value, each of them may, for example, be referred to as first optimized duplex sequences or co-first optimized duplexes. Formula X is similar to Formula IX, yet uses a greater number of variables and for that reason, identifies sequences on the basis of slightly different criteria.
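The naming of first, second, third and fourth optimized duplexes, including ties, can be illustrated with a small grouping routine. The Python sketch below assumes that tied sequences share a rank and that the next distinct value receives the next rank; the text does not specify how ranks are numbered after a tie, so this is one possible reading. The function name, sequence labels and scores are hypothetical.

```python
def optimized_ranks(scores: dict) -> dict:
    """Group sequences by rank: 1 = first optimized duplex(es), 2 = second, ...

    Sequences with equal values share a rank (e.g. co-first optimized duplexes).
    """
    distinct_values = sorted(set(scores.values()), reverse=True)
    rank_of = {value: rank for rank, value in enumerate(distinct_values, start=1)}
    groups = {}
    for sequence, value in scores.items():
        groups.setdefault(rank_of[value], []).append(sequence)
    return groups


# Made-up scores: two sequences tie for the highest value.
groups = optimized_ranks({"seq_a": 34.5, "seq_b": 34.5, "seq_c": 20.0, "seq_d": -3.0})
```

When a tie occurs, applying a second formula to the tied sequences, as suggested above, is one way to separate them.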

It should also be noted that the output of a particular algorithm will depend on several variables including: (1) the size of the database(s) being analyzed by the algorithm, and (2) the number and stringency of the parameters being applied to screen each sequence. Thus, for example, in U.S. patent application Ser. No. 10/714,333, entitled “Functional and Hyperfunctional siRNA,” filed Nov. 14, 2003, Formula VIII was applied to the known human genome (NCBI REFSEQ database) through ENTREZ (EFETCH). As a result of these procedures, roughly 1.6 million siRNA sequences were identified. Application of Formula VIII to the same database in March of 2004 yielded roughly 2.2 million sequences, a difference of approximately 600,000 sequences resulting from the growth of the database over the course of the months that span this period of time. Application of other formulas (e.g., Formula X) that change the emphasis of, include, or eliminate different variables can yield unequal numbers of siRNAs. Alternatively, in cases where application of one formula to one or more genes fails to yield sufficient numbers of siRNAs with scores that would be indicative of strong silencing, said genes can be reassessed with a second algorithm that is, for instance, less stringent. siRNA sequences identified using Formula VIII and Formula X (minus sequences generated by Formula VIII) are contained within the sequence listing. The data included in the sequence listing is described more fully below. The sequences identified by Formula VIII and Formula X that are disclosed in the sequence listing may be used in gene silencing applications.

It should be noted that for Formulas VIII, IX, and X all of the aforementioned criteria are identified as positions on the sense strand when oriented in the 5′ to 3′ direction as they are identified in connection with Formulas I-VII unless otherwise specified.

Formulas I-X may be used to select or to evaluate one, or more than one, siRNA in order to optimize silencing. Preferably, at least two optimized siRNAs that have been selected according to at least one of these formulas are used to silence a gene, more preferably at least three and most preferably at least four. The siRNAs may be used individually or together in a pool or kit. Further, they may be applied to a cell simultaneously or separately. Preferably, the at least two siRNAs are applied simultaneously. Pools are particularly beneficial for many research applications. However, for therapeutics, it may be more desirable to employ a single hyperfunctional siRNA as described elsewhere in this application.

When planning to conduct gene silencing, if it is necessary to choose between two or more siRNAs, one should do so by comparing the relative values obtained when the siRNAs are subjected to one of the formulas above. In general, the higher-scoring siRNA should be used.

Useful applications include, but are not limited to, target validation, gene functional analysis, research and drug discovery, gene therapy and therapeutics. Methods for using siRNA in these applications are well known to persons of skill in the art.

Because the ability of siRNA to function is dependent on the sequence of the RNA and not the species into which it is introduced, the present invention is applicable across a broad range of species, including but not limited to all mammalian species, such as humans, dogs, horses, cats, cows, mice, hamsters, chimpanzees and gorillas, as well as other species and organisms such as bacteria, viruses, insects, plants and C. elegans.

The present invention is also applicable for use for silencing a broad range of genes, including but not limited to the roughly 45,000 genes of a human genome, and has particular relevance in cases where those genes are associated with diseases such as diabetes, Alzheimer's, cancer, as well as all genes in the genomes of the aforementioned organisms.

The siRNA selected according to the aforementioned criteria or one of the aforementioned algorithms are also, for example, useful in the simultaneous screening and functional analysis of multiple genes and gene families using high throughput strategies, as well as in direct gene suppression or silencing.

Development of the Algorithms

To identify siRNA sequence features that promote functionality and to quantify the importance of certain currently accepted conventional factors—such as G/C content and target site accessibility—the inventors synthesized an siRNA panel consisting of 270 siRNAs targeting three genes, Human Cyclophilin, Firefly Luciferase, and Human DBI. In all three cases, siRNAs were directed against specific regions of each gene. For Human Cyclophilin and Firefly Luciferase, ninety siRNAs were directed against a 199 bp segment of each respective mRNA. For DBI, 90 siRNAs were directed against a smaller, 109 base pair region of the mRNA. The sequences to which the siRNAs were directed are provided below.

It should be noted that in certain sequences, “t” is present. This is because many databases contain information in this manner. However, the t denotes a uracil residue in mRNA and siRNA. Any algorithm will, unless otherwise specified, process a t in a sequence as a u.
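In a program, this convention amounts to a normalization step applied before scoring. The Python sketch below is illustrative; the function name is hypothetical, and the removal of whitespace is an added convenience for sequences written in blocks of ten bases.

```python
def to_rna(sequence: str) -> str:
    """Normalize a database-style sequence: drop whitespace, uppercase, t -> U."""
    return "".join(sequence.split()).upper().replace("T", "U")


# First twenty bases of the DBI target region as written in the text.
normalized = to_rna("acgggcaagg ccaagtggga")
```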

Human cyclophilin: 193-390, M60857
SEQ. ID NO. 29: gttccaaaaa cagtggataa ttttgtggcc ttagctacag gagagaaagg atttggctac aaaaacagca aattccatcg tgtaatcaag gacttcatga tccagggcgg agacttcacc aggggagatg gcacaggagg aaagagcatc tacggtgagc gcttccccga tgagaacttc aaactgaagc actacgggcc tggctggg

Firefly luciferase: 1434-1631, U47298 (pGL3, Promega)
SEQ. ID NO. 30: tgaacttccc gccgccgttg ttgttttgga gcacggaaag acgatgacgg aaaaagagat cgtggattac gtcgccagtc aagtaacaac cgcgaaaaag ttgcgcggag gagttgtgtt tgtggacgaa gtaccgaaag gtcttaccgg aaaactcgac gcaagaaaaa tcagagagat cctcataaag gccaagaagg

DBI, NM_020548 (202-310) (every position)
SEQ. ID NO. 0031: acgggcaagg ccaagtggga tgcctggaat gagctgaaag ggacttccaa ggaagatgcc atgaaagctt acatcaacaa agtagaagag ctaaagaaaa aatacggg

A list of the siRNAs appears in Table III (see Examples Section, Example II)

The set of duplexes was analyzed to identify correlations between siRNA functionality and other biophysical or thermodynamic properties. When the siRNA panel was analyzed in functional and non-functional subgroups, certain nucleotides were much more abundant at certain positions in functional or non-functional groups. More specifically, the frequency of each nucleotide at each position in highly functional siRNA duplexes was compared with that of nonfunctional duplexes in order to assess the preference for or against any given nucleotide at every position. These analyses were used to determine important criteria to be included in the siRNA algorithms (Formulas VIII, IX, and X).

The data set was also analyzed for distinguishing biophysical properties of siRNAs in the functional group, such as optimal percent of GC content, propensity for internal structures and regional thermodynamic stability. Of the presented criteria, several are involved in duplex recognition, RISC activation/duplex unwinding, and target cleavage catalysis.

The original data set that was the source of the statistically derived criteria is shown in FIG. 2. Additionally, this figure shows that random selection yields siRNA duplexes with unpredictable and widely varying silencing potencies as measured in tissue culture using HEK293 cells. In the figure, duplexes are plotted such that each x-axis tick-mark represents an individual siRNA, with each subsequent siRNA differing in target position by two nucleotides for Human Cyclophilin B and Firefly Luciferase, and by one nucleotide for Human DBI. Furthermore, the y-axis denotes the level of target expression remaining after transfection of the duplex into cells and subsequent silencing of the target.

siRNA identified and optimized in this document work equally well in a wide range of cell types. FIG. 3a shows the evaluation of thirty siRNAs targeting the DBI gene in three cell lines derived from different tissues. Each DBI siRNA displays very similar functionality in HEK293 (ATCC, CRL-1573, human embryonic kidney), HeLa (ATCC, CCL-2, cervical epithelial adenocarcinoma) and DU145 (HTB-81, prostate) cells as determined by the B-DNA assay. Thus, siRNA functionality is determined by the primary sequence of the siRNA and not by the intracellular environment. Additionally, it should be noted that although the present invention provides for a determination of the functionality of siRNA for a given target, the same siRNA may silence more than one gene. For example, the complementary sequence of the silencing siRNA may be present in more than one gene. Accordingly, in these circumstances, it may be desirable not to use the siRNA with highest SMARTSCORE™, or siRNA ranking. In such circumstances, it may be desirable to use the siRNA with the next highest SMARTSCORE™, or siRNA ranking.

To determine the relevance of G/C content in siRNA function, the G/C content of each duplex in the panel was calculated and the functional classes of siRNAs (<F50, ≧F50, ≧F80, ≧F95 where F refers to the percent gene silencing) were sorted accordingly. The majority of the highly-functional siRNAs (≧F95) fell within the G/C content range of 36%-52% (FIG. 3B). Twice as many non-functional (<F50) duplexes fell within the high G/C content groups (>57% GC content) compared to the 36%-52% group. The group with extremely low GC content (26% or less) contained a higher proportion of non-functional siRNAs and no highly-functional siRNAs. The G/C content range of 30%-52% was therefore selected as Criterion I for siRNA functionality, consistent with the observation that a G/C range 30%-70% promotes efficient RNAi targeting. Application of this criterion alone provided only a marginal increase in the probability of selecting functional siRNAs from the panel: selection of F50 and F95 siRNAs was improved by 3.6% and 2.2%, respectively. The siRNA panel presented here permitted a more systematic analysis and quantification of the importance of this criterion than that used previously.
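The G/C content check described above can be illustrated with a minimal sketch. The function names and the 19-base input below are assumptions made for illustration (the input is simply the first 19 bases of SEQ. ID NO. 29, not a duplex reported as tested); only the 30%-52% window comes from the text.

```python
def gc_percent(sense: str) -> float:
    """Return the G/C content of a sequence as a percentage."""
    # A "t" in a database sequence is processed as a "u".
    seq = sense.lower().replace("t", "u")
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "gc" for base in seq) / len(seq)


def meets_gc_criterion(sense: str, low: float = 30.0, high: float = 52.0) -> bool:
    """True when G/C content falls inside the 30%-52% window of Criterion I."""
    return low <= gc_percent(sense) <= high


if __name__ == "__main__":
    example = "gttccaaaaacagtggata"  # hypothetical input, 7 of 19 bases are G or C
    print(round(gc_percent(example), 1), meets_gc_criterion(example))
```

The window bounds are exposed as parameters so that the broader 30%-70% range mentioned above could be tried without changing the function.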

A relative measure of local internal stability is the A/U base pair (bp) content; therefore, the frequency of A/U bp was determined for each of the five terminal positions of the duplex (5′ sense (S)/5′ antisense (AS)) of all siRNAs in the panel. Duplexes were then categorized by the number of A/U bp in positions 1-5 and 15-19 of the sense strand. The thermodynamic flexibility of the duplex 5′-end (positions 1-5; S) did not appear to correlate appreciably with silencing potency, while that of the 3′-end (positions 15-19; S) correlated with efficient silencing. No duplexes lacking A/U bp in positions 15-19 were functional. The presence of one A/U bp in this region conferred some degree of functionality, but the presence of three or more A/Us was preferable and therefore defined as Criterion II. When applied to the test panel, only a marginal increase in the probability of functional siRNA selection was achieved: a 1.8% and 2.3% increase for F50 and F95 duplexes, respectively (Table IV).
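As a rough sketch of how the A/U count at positions 15-19 of the sense strand might be computed, assuming a 19-base sense strand numbered from its 5′ end (the function names and the example input are illustrative and not part of the original disclosure):

```python
def au_count_positions_15_19(sense: str) -> int:
    """Count A/U bases at positions 15-19 (1-based) of a 19-base sense strand."""
    seq = sense.lower().replace("t", "u")  # treat t as u
    if len(seq) != 19:
        raise ValueError("expected a 19-base sense strand")
    # Positions 15-19 correspond to zero-based indices 14-18.
    return sum(base in "au" for base in seq[14:19])


def meets_au_criterion(sense: str, minimum: int = 3) -> bool:
    """True when at least `minimum` A/U bases occupy positions 15-19."""
    return au_count_positions_15_19(sense) >= minimum


if __name__ == "__main__":
    example = "gttccaaaaacagtggata"  # hypothetical input; positions 15-19 are "ggata"
    print(au_count_positions_15_19(example), meets_au_criterion(example))
```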

The complementary strands of siRNAs that contain internal repeats or palindromes may form internal fold-back structures. These hairpin-like structures exist in equilibrium with the duplexed form, effectively reducing the concentration of functional duplexes. The propensity to form internal hairpins and their relative stability can be estimated by predicted melting temperatures. High T_m reflects a tendency to form hairpin structures. Lower T_m values indicate a lesser tendency to form hairpins. When the functional classes of siRNAs were sorted by T_m (FIG. 3c), the following trends were identified: duplexes lacking stable internal repeats were the most potent silencers (no F95 duplex with predicted hairpin structure T_m>60° C.). In contrast, about 60% of the duplexes in the groups having internal hairpins with calculated T_m values less than 20° C. were F80. Thus, the stability of internal repeats is inversely proportional to the silencing effect and defines Criterion III (predicted hairpin structure T_m≦20° C.).

Sequence-Based Determinants of siRNA Functionality

When the siRNA panel was sorted into functional and non-functional groups, the frequency of a specific nucleotide at each position in a functional siRNA duplex was compared with that of a nonfunctional duplex in order to assess the preference for or against a certain nucleotide. FIG. 4 shows the results of these queries and the subsequent resorting of the data set (from FIG. 2). The data are separated into two sets: those duplexes that meet the criterion (a specific nucleotide in a certain position), grouped on the left (Selected), and those that do not, grouped on the right (Eliminated). The duplexes are further sorted from most functional to least functional, with the y-axis of FIG. 4a-e representing the % expression, i.e., the amount of silencing that is elicited by the duplex (Note: each position on the X-axis represents a different duplex). Statistical analysis revealed correlations between silencing and several sequence-related properties of siRNAs. FIG. 4 and Table IV show quantitative analysis for the following five sequence-related properties of siRNA: (A) an A at position 19 of the sense strand; (B) an A at position 3 of the sense strand; (C) a U at position 10 of the sense strand; (D) a base other than G at position 13 of the sense strand; and (E) a base other than C at position 19 of the sense strand.

When the siRNAs in the panel were evaluated for the presence of an A at position 19 of the sense strand, the percentage of non-functional duplexes decreased from 20% to 11.8%, and the percentage of F95 duplexes increased from 21.7% to 29.4% (Table IV). Thus, the presence of an A in this position defined Criterion IV.

Another sequence-related property correlated with silencing was the presence of an A in position 3 of the sense strand (FIG. 4b). Of the siRNAs with A3, 34.4% were F95, compared with 21.7% randomly selected siRNAs. The presence of a U base in position 10 of the sense strand exhibited an even greater impact (FIG. 4c). Of the duplexes in this group, 41.7% were F95. These properties became criteria V and VI, respectively.

Two negative sequence-related criteria that were identified also appear on FIG. 4. The absence of a G at position 13 of the sense strand conferred a marginal increase in selecting functional duplexes (FIG. 4d). Similarly, lack of a C at position 19 of the sense strand also correlated with functionality (FIG. 4e). Thus, among functional duplexes, position 19 was most likely occupied by A, and rarely occupied by C. These rules were defined as criteria VII and VIII, respectively.
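The five sequence-related properties discussed above lend themselves to a simple positional check. The following is a minimal sketch, assuming a 19-base sense strand numbered from the 5′ end; the function name, the dictionary keys, and the example input are illustrative assumptions, and the sketch only reports which properties hold without assigning any weights or scores.

```python
def sequence_properties(sense: str) -> dict:
    """Report the five sequence-related properties for a 19-base sense strand."""
    seq = sense.lower().replace("t", "u")  # treat t as u
    if len(seq) != 19:
        raise ValueError("expected a 19-base sense strand")

    def base(position: int) -> str:
        # Convert a 1-based position to a zero-based index.
        return seq[position - 1]

    return {
        "A at position 19": base(19) == "a",
        "A at position 3": base(3) == "a",
        "U at position 10": base(10) == "u",
        "base other than G at position 13": base(13) != "g",
        "base other than C at position 19": base(19) != "c",
    }


if __name__ == "__main__":
    example = "gttccaaaaacagtggata"  # hypothetical input
    for name, satisfied in sequence_properties(example).items():
        print(f"{name}: {satisfied}")
```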

Application of each criterion individually provided marginal but statistically significant increases in the probability of selecting a potent siRNA. Although the results were informative, the inventors sought to maximize potency and therefore consider multiple criteria or parameters. Optimization is particularly important when developing therapeutics. Interestingly, the probability of selecting a functional siRNA based on each thermodynamic criterion was 2%-4% higher than random, but 4%-8% higher for the sequence-related determinants. Presumably, these sequence-related increases reflect the complexity of the RNAi mechanism and the multitude of protein-RNA interactions that are involved in RNAi-mediated silencing.

TABLE IV

CRITERION | CLASS | PERCENT FUNCTIONAL | IMPROVEMENT OVER RANDOM (%)
I. 30%-52% G/C Content | <F50 | 16.4 | −3.6
 | ≧F50 | 83.6 | 3.6
 | ≧F80 | 60.4 | 4.3
 | ≧F95 | 23.9 | 2.2
II. At least 3 A/U bases at positions 15-19 of the sense strand | <F50 | 18.2 | −1.8
 | ≧F50 | 81.8 | 1.8
 | ≧F80 | 59.7 | 3.6
 | ≧F95 | 24.0 | 2.3
III. Absence of internal repeats, as measured by Tm of secondary structure ≦20° C. | <F50 | 16.7 | −3.3
 | ≧F50 | 83.3 | 3.3
 | ≧F80 | 61.1 | 5.0
 | ≧F95 | 24.6 | 2.9
IV. An A base at position 19 of the sense strand | <F50 | 11.8 | −8.2
 | ≧F50 | 88.2 | 8.2
 | ≧F80 | 75.0 | 18.9
 | ≧F95 | 29.4 | 7.7
V. An A base at position 3 of the sense strand | <F50 | 17.2 | −2.8
 | ≧F50 | 82.8 | 2.8
 | ≧F80 | 62.5 | 6.4
 | ≧F95 | 34.4 | 12.7
VI. A U base at position 10 of the sense strand | <F50 | 13.9 | −6.1
 | ≧F50 | 86.1 | 6.1
 | ≧F80 | 69.4 | 13.3
 | ≧F95 | 41.7 | 20
VII. A base other than C at position 19 of the sense strand | <F50 | 18.8 | −1.2
 | ≧F50 | 81.2 | 1.2
 | ≧F80 | 59.7 | 3.6
 | ≧F95 | 24.2 | 2.5
VIII. A base other than G at position 13 of the sense strand | <F50 | 15.2 | −4.8
 | ≧F50 | 84.8 | 4.8
 | ≧F80 | 61.4 | 5.3
 | ≧F95 | 26.5 | 4.8

The siRNA Selection Algorithm

In an effort to improve selection further, all identified criteria, including but not limited to those listed in Table IV were combined into the algorithms embodied in Formula VIII, Formula IX, and Formula X. Each siRNA was then assigned a score (referred to as a SMARTSCORE™, or siRNA ranking) according to the values derived from the formulas. Duplexes that scored higher than 0 or −20 (unadjusted), for Formulas VIII and IX, respectively, effectively selected a set of functional siRNAs and excluded all non-functional siRNAs. Conversely, all duplexes scoring lower than 0 and −20 (minus 20) according to formulas VIII and IX, respectively, contained some functional siRNAs but included all non-functional siRNAs. A graphical representation of this selection is shown in FIG. 5. It should be noted that the scores derived from the algorithm can also be provided as “adjusted” scores. To convert Formula VIII unadjusted scores into adjusted scores it is necessary to use the following equation:
(160+unadjusted score)/2.25

When this takes place, an unadjusted score of “0” (zero) is converted to 75. Similarly, unadjusted scores for Formula X can be converted to adjusted scores. In this instance, the following equation is applied:
(228+unadjusted score)/3.56

When these manipulations take place, an unadjusted score of 38 is converted to an adjusted score of 75.
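The two conversions can be written out as a short sketch. The function names are assumptions made for illustration; the constants are taken literally from the two equations above. Evaluated as printed, the Formula X conversion maps 38 to approximately 74.7 (about 75), while the Formula VIII conversion maps 0 to approximately 71.1, so the constants of the first equation may warrant checking against the formulas themselves.

```python
def adjusted_score_formula_viii(unadjusted: float) -> float:
    """Apply (160 + unadjusted score) / 2.25, as printed for Formula VIII."""
    return (160 + unadjusted) / 2.25


def adjusted_score_formula_x(unadjusted: float) -> float:
    """Apply (228 + unadjusted score) / 3.56, as printed for Formula X."""
    return (228 + unadjusted) / 3.56


if __name__ == "__main__":
    print(round(adjusted_score_formula_x(38), 1))    # 74.7
    print(round(adjusted_score_formula_viii(0), 1))  # 71.1
```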

The methods for obtaining the seven criteria embodied in Table IV are illustrative of the results of the process used to develop the information for Formulas VIII, IX, and X. Thus similar techniques were used to establish the other variables and their multipliers. As described above, basic statistical methods were used to determine the relative values for these multipliers.

To determine the value for “Improvement over Random” the difference in the frequency of a given attribute (e.g., GC content, base preference) at a particular position is determined between individual functional groups (e.g., <F50) and the total siRNA population studied (e.g., 270 siRNA molecules selected randomly). Thus, for instance, in Criterion 1 (30%-52% GC content) members of the <F50 group were observed to have GC contents between 30-52% in 16.4% of the cases. In contrast, the total group of 270 siRNAs had GC contents in this range, 20% of the time. Thus for this particular attribute, there is a small negative correlation between 30%-52% GC content and this functional group (i.e., 16.4%-20%=−3.6%). Similarly, for Criterion VI, (a “U” at position 10 of the sense strand), the >F95 group contained a “U” at this position 41.7% of the time. In contrast, the total group of 270 siRNAs had a “U” at this position 21.7% of the time, thus the improvement over random is calculated to be 20% (or 41.7%-21.7%).
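The "Improvement over Random" arithmetic can be sketched as the difference between two frequencies. The helper names and the toy sequences in the usage note are hypothetical; only the subtraction itself (for example, 41.7% − 21.7% = 20%) comes from the text.

```python
from typing import Callable, Sequence


def attribute_frequency(sirnas: Sequence[str], has_attribute: Callable[[str], bool]) -> float:
    """Percentage of sequences in a group that carry the attribute."""
    if not sirnas:
        raise ValueError("empty group")
    return 100.0 * sum(1 for s in sirnas if has_attribute(s)) / len(sirnas)


def improvement_over_random(
    group: Sequence[str],
    population: Sequence[str],
    has_attribute: Callable[[str], bool],
) -> float:
    """Frequency in a functional group minus frequency in the whole population."""
    return attribute_frequency(group, has_attribute) - attribute_frequency(population, has_attribute)


if __name__ == "__main__":
    # Reported frequencies for a U at position 10 in the text.
    print(round(41.7 - 21.7, 1))  # 20.0
```

With actual data, `group` would hold the sense strands of one functional class and `population` the full panel, and `has_attribute` could be, for instance, `lambda s: s[9] == "u"` for a U at position 10.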

Identifying The Average Internal Stability Profile of Strong siRNA

In order to identify an internal stability profile that is characteristic of strong siRNA, 270 different siRNAs derived from the cyclophilin B, the diazepam binding inhibitor (DBI), and the luciferase gene were individually transfected into HEK293 cells and tested for their ability to induce RNAi of the respective gene. Based on their performance in the in vivo assay, the sequences were then subdivided into three groups: (i) >95% silencing; (ii) 80-95% silencing; and (iii) less than 50% silencing. Sequences exhibiting 51-84% silencing were eliminated from further consideration to reduce the difficulties in identifying relevant thermodynamic patterns.

Following the division of siRNA into three groups, a statistical analysis was performed on each member of each group to determine the average internal stability profile (AISP) of the siRNA. To accomplish this the Oligo 5.0 Primer Analysis Software and other related statistical packages (e.g., Excel) were exploited to determine the internal stability of pentamers using the nearest neighbor method described by Freier et al., (1986) Improved free-energy parameters for predictions of RNA duplex stability, Proc Natl. Acad. Sci. USA 83(24): 9373-7. Values for each group at each position were then averaged, and the resulting data were graphed on a linear coordinate system with the Y-axis expressing the ΔG (free energy) values in kcal/mole and the X-axis identifying the position of the base relative to the 5′ end.
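The averaging step can be illustrated with a small sketch. It assumes that per-position ΔG values (in kcal/mol) have already been computed for each siRNA by some external tool; the nearest-neighbor calculation itself is not reproduced here, and the function name and the numbers in the usage note are placeholders rather than measured data.

```python
from statistics import mean
from typing import List, Sequence


def average_internal_stability_profile(profiles: Sequence[Sequence[float]]) -> List[float]:
    """Average per-position free-energy values across the siRNAs of one group."""
    lengths = {len(profile) for profile in profiles}
    if len(lengths) != 1:
        raise ValueError("profiles must be non-empty and of equal length")
    # zip(*profiles) yields one tuple of values per position.
    return [mean(position_values) for position_values in zip(*profiles)]


if __name__ == "__main__":
    # Two made-up three-position profiles, for illustration only.
    toy_profiles = [[-8.0, -7.0, -9.0], [-9.0, -8.0, -7.0]]
    print(average_internal_stability_profile(toy_profiles))
```

The resulting list could then be plotted with position on the X-axis and ΔG on the Y-axis, one curve per functional group.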

The results of the analysis identified multiple key regions in siRNA molecules that were critical for successful gene silencing. At the 3′-most end of the sense strand (5′antisense), highly functional siRNA (>95% gene silencing, see FIG. 6a, >F95) have a low internal stability (AISP of position 19=˜−7.6 kcal/mol). In contrast low-efficiency siRNA (i.e., those exhibiting less than 50% silencing, <F50) display a distinctly different profile, having high ΔG values (˜8.4 kcal/mol) for the same position. Moving in a 5′ (sense strand) direction, the internal stability of highly efficient siRNA rises (position 12=˜8.3 kcal/mole) and then drops again (position 7=˜−7.7 kcal/mol) before leveling off at a value of approximately −8.1 kcal/mol for the 5′ terminus. siRNA with poor silencing capabilities show a distinctly different profile. While the AISP value at position 12 is nearly identical with that of strong siRNAs, the values at positions 7 and 8 rise considerably, peaking at a high of ˜−9.0 kcal/mol. In addition, at the 5′ end of the molecule the AISP profile of strong and weak siRNA differ dramatically. Unlike the relatively strong values exhibited by siRNA in the >95% silencing group, siRNAs that exhibit poor silencing activity have weak AISP values (−7.6, −7.5, and −7.5 kcal/mol for positions 1, 2 and 3 respectively).

Overall the profiles of both strong and weak siRNAs form distinct sinusoidal shapes that are roughly 180° out-of-phase with each other. While these thermodynamic descriptions define the archetypal profile of a strong siRNA, it will likely be the case that neither the ΔG values given for key positions in the profile nor the absolute position of the profile along the Y-axis (i.e., the ΔG-axis) are absolutes. Profiles that are shifted upward or downward (i.e., having, on average, higher or lower values at every position) but retain the relative shape and position of the profile along the X-axis can be foreseen as being equally effective as the model profile described here. Moreover, it is likely that siRNA that have strong or even stronger gene-specific silencing effects might have exaggerated ΔG values (either higher or lower) at key positions. Thus, for instance, it is possible that the 3′-most position of the sense strand (position 19) could have ΔG values of 7.4 kcal/mol or lower and still be a strong siRNA if, for instance, a G-C→G-T/U mismatch were substituted at position 19 and altered duplex stability. Similarly, position 12 and position 7 could have values above 8.3 kcal/mol and below 7.7 kcal/mole, respectively, without abating the silencing effectiveness of the molecule. Thus, for instance, at position 12, a stabilizing chemical modification (e.g., a chemical modification of the 2′ position of the sugar backbone) could be added that increases the average internal stability at that position. Similarly, at position 7, mismatches similar to those described previously could be introduced that would lower the ΔG values at that position.

Lastly, it is important to note that while functional and non-functional siRNA were originally defined as those molecules having specific silencing properties, both broader or more limiting parameters can be used to define these molecules. As used herein, unless otherwise specified, “non-functional siRNA” are defined as those siRNA that induce less than 50% (<50%) target silencing, “semi-functional siRNA” induce 50-79% target silencing, “functional siRNA” are molecules that induce 80-95% gene silencing, and “highly-functional siRNA” are molecules that induce greater than 95% gene silencing. These definitions are not intended to be rigid and can vary depending upon the design and needs of the application. For instance, it is possible that a researcher attempting to map a gene to a chromosome using a functional assay may identify an siRNA that reduces gene activity by only 30%. While this level of gene silencing may be “non-functional” for, e.g., therapeutic needs, it is sufficient for gene mapping purposes and is, under these uses and conditions, “functional.” For these reasons, functional siRNA can be defined as those molecules having greater than 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, or 90% silencing capabilities at 100 nM transfection conditions. Similarly, depending upon the needs of the study and/or application, non-functional and semi-functional siRNA can be defined as having different parameters. For instance, semi-functional siRNA can be defined as being those molecules that induce 20%, 30%, 40%, 50%, 60%, or 70% silencing at 100 nM transfection conditions. Similarly, non-functional siRNA can be defined as being those molecules that silence gene expression by less than 70%, 60%, 50%, 40%, 30%, or less. Nonetheless, unless otherwise stated, the descriptions stated in the “Definitions” section of this text should be applied.
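The default functional classes stated above can be expressed as a simple classifier. This is a minimal sketch with an assumed function name; treating values between 79% and 80% as semi-functional is an assumption made here to close the gap between the stated ranges, and the thresholds could be replaced to reflect the alternative definitions described above.

```python
def classify_silencing(percent_silencing: float) -> str:
    """Map percent target silencing to the default functional class."""
    if not 0.0 <= percent_silencing <= 100.0:
        raise ValueError("percent silencing must lie between 0 and 100")
    if percent_silencing < 50.0:
        return "non-functional"
    if percent_silencing < 80.0:
        return "semi-functional"  # assumption: 79-80% is grouped here
    if percent_silencing <= 95.0:
        return "functional"
    return "highly-functional"


if __name__ == "__main__":
    for value in (30, 65, 90, 97):
        print(value, classify_silencing(value))
```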

Functional attributes can be assigned to each of the key positions in the AISP of strong siRNA. The low 5′ (sense strand) AISP values of strong siRNAs may be necessary for determining which end of the molecule enters the RISC complex. In contrast, the high and low AISP values observed in the central regions of the molecule may be critical for siRNA-target mRNA interactions and product release, respectively.

If the AISP values described above accurately define the thermodynamic parameters of strong siRNA, it would be expected that similar patterns would be observed in strong siRNA isolated from nature. Natural siRNAs exist in a harsh, RNase-rich environment and it can be hypothesized that only those siRNA that exhibit heightened affinity for RISC (i.e., siRNA that exhibit an average internal stability profile similar to those observed in strong siRNA) would survive in an intracellular environment. This hypothesis was tested using GFP-specific siRNA isolated from N. benthamiana. Llave et al. (2002) Endogenous and Silencing-Associated Small RNAs in Plants, The Plant Cell 14, 1605-1619, introduced long double-stranded GFP-encoding RNA into plants and subsequently re-isolated GFP-specific siRNA from the tissues. The AISP of fifty-nine of these GFP-siRNA were determined, averaged, and subsequently plotted alongside the AISP profile obtained from the cyclophilin B/DBI/luciferase siRNA having >90% silencing properties (FIG. 6b). Comparison of the two groups shows that the profiles are nearly identical. This finding validates the information provided by the internal stability profiles and demonstrates that: (1) the profile identified by analysis of the cyclophilin B/DBI/luciferase siRNAs is not gene specific; and (2) AISP values can be used to search for strong siRNAs in a variety of species.

Both chemical modifications and base-pair mismatches can be incorporated into siRNA to alter the duplex's AISP and functionality. For instance, introduction of mismatches at positions 1 or 2 of the sense strand destabilizes the 5′ end of the sense strand and increases the functionality of the molecule (see Luc, FIG. 7). Similarly, addition of 2′-O-methyl groups to positions 1 and 2 of the sense strand can also alter the AISP and, as a result, both increase the functionality of the molecule and eliminate off-target effects that result from sense strand homology with unrelated targets (FIG. 8).

Rationale for Criteria in a Biological Context

The fate of siRNA in the RNAi pathway may be described in 5 major steps: (1) duplex recognition and pre-RISC complex formation; (2) ATP-dependent duplex unwinding/strand selection and RISC activation; (3) mRNA target identification; (4) mRNA cleavage, and (5) product release (FIG. 1). Given the level of nucleic acid-protein interactions at each step, siRNA functionality is likely influenced by specific biophysical and molecular properties that promote efficient interactions within the context of the multi-component complexes. Indeed, the systematic analysis of the siRNA test set identified multiple factors that correlate well with functionality. When combined into a single algorithm, they proved to be very effective in selecting active siRNAs.

The factors described here may also be predictive of key functional associations important for each step in RNAi. For example, the potential formation of internal hairpin structures correlated negatively with siRNA functionality. Complementary strands with stable internal repeats are more likely to exist as stable hairpins thus decreasing the effective concentration of the functional duplex form. This suggests that the duplex is the preferred conformation for initial pre-RISC association. Indeed, although single complementary strands can induce gene silencing, the effective concentration required is at least two orders of magnitude higher than that of the duplex form.

siRNA-pre-RISC complex formation is followed by an ATP-dependent duplex unwinding step and “activation” of the RISC. The siRNA functionality was shown to correlate with overall low internal stability of the duplex and low internal stability of the 3′ sense end (or differential internal stability of the 3′ sense compared to the 5′ sense strand), which may reflect strand selection and entry into the RISC. Overall duplex stability and low internal stability at the 3′ end of the sense strand were also correlated with siRNA functionality. Interestingly, siRNAs with very high and very low overall stability profiles correlate strongly with non-functional duplexes. One interpretation is that high internal stability prevents efficient unwinding while very low stability reduces siRNA target affinity and subsequent mRNA cleavage by the RISC.

Several criteria describe base preferences at specific positions of the sense strand and are even more intriguing when considering their potential mechanistic roles in target recognition and mRNA cleavage. Base preferences for A at position 19 of the sense strand but not C, are particularly interesting because they reflect the same base preferences observed for naturally occurring miRNA precursors. That is, among the reported miRNA precursor sequences 75% contain a U at position 1 which corresponds to an A in position 19 of the sense strand of siRNAs, while G was under-represented in this same position for miRNA precursors. These observations support the hypothesis that both miRNA precursors and siRNA duplexes are processed by very similar if not identical protein machinery. The functional interpretation of the predominance of a U/A base pair is that it promotes flexibility at the 5′antisense ends of both siRNA duplexes and miRNA precursors and facilitates efficient unwinding and selective strand entrance into an activated RISC.

Among the criteria associated with base preferences that are likely to influence mRNA cleavage or possibly product release, the preference for U at position 10 of the sense strand exhibited the greatest impact, enhancing the probability of selecting an F80 sequence by 13.3%. Activated RISC preferentially cleaves target mRNA between nucleotides 10 and 11 relative to the 5′ end of the complementary targeting strand. Therefore, it may be that U, the preferred base for most endoribonucleases, at this position supports more efficient cleavage. Alternatively, a U/A bp between the targeting siRNA strand and its cognate target mRNA may create an optimal conformation for the RISC-associated “slicing” activity.

Post Algorithm Filters

According to another embodiment, the output of any one of the formulas previously listed can be filtered to remove or select for siRNAs containing undesirable or desirable motifs or properties, respectively. In one example, sequences identified by any of the formulas can be filtered to remove any and all sequences that induce toxicity or cellular stress. Introduction of an siRNA containing a toxic motif into a cell can induce cellular stress and/or cell death (apoptosis) which in turn can mislead researchers into associating a particular (e.g., nonessential) gene with, e.g., an essential function. Alternatively, sequences generated by any of the before mentioned formulas can be filtered to identify and retain duplexes that contain toxic motifs. Such duplexes may be valuable from a variety of perspectives including, for instance, uses as therapeutic molecules. A variety of toxic motifs exist and can exert their influence on the cell through RNAi and non-RNAi pathways. Examples of toxic motifs are explained more fully in commonly assigned U.S. Provisional Patent Application Ser. No. 60/538,874, entitled “Identification of Toxic Sequences,” filed Jan. 23, 2004. Briefly, toxic motifs include A/G UUU A/G/U, G/C AAA G/C, and GCCA, or a complement of any of the foregoing.
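A post-algorithm filter for the three motifs listed above could be sketched with regular expressions. This is an illustrative sketch only: the function names are hypothetical, and "a complement of any of the foregoing" is interpreted here, as an assumption, to mean that the motif may appear on the antisense strand (the reverse complement of the sense strand).

```python
import re

# A/G UUU A/G/U, G/C AAA G/C, and GCCA, written as one pattern.
TOXIC_MOTIF = re.compile(r"[AG]UUU[AGU]|[GC]AAA[GC]|GCCA")

# Base-wise RNA complement table.
_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an RNA sequence (upper case, U not T)."""
    return seq.translate(_COMPLEMENT)[::-1]


def has_toxic_motif(sense: str) -> bool:
    """True if a listed motif occurs on the sense strand or its reverse complement."""
    seq = sense.upper().replace("T", "U")  # treat t as u
    return bool(TOXIC_MOTIF.search(seq) or TOXIC_MOTIF.search(reverse_complement(seq)))


def remove_toxic(candidates: list) -> list:
    """Keep only candidate sense strands without a listed motif."""
    return [seq for seq in candidates if not has_toxic_motif(seq)]
```

Inverting the condition in `remove_toxic` would instead retain duplexes that contain the motifs, which corresponds to the alternative use described above.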

In another instance, sequences identified by any of the before mentioned formulas can be filtered to identify duplexes that contain motifs (or general properties) that provide serum stability or induce serum instability. In one envisioned application of siRNA as therapeutic molecules, duplexes targeting disease-associated genes will be introduced into patients intravenously. As the half-life of single and double stranded RNA in serum is short, post-algorithm filters designed to select molecules that contain motifs that enhance duplex stability in the presence of serum and/or (conversely) eliminate duplexes that contain motifs that destabilize siRNA in the presence of serum, would be beneficial.

In another instance, sequences identified by any of the before mentioned formulas can be filtered to identify duplexes that are hyperfunctional. Hyperfunctional sequences are defined as those sequences that (1) induce greater than 95% silencing of a specific target when they are transfected at subnanomolar concentrations (i.e., less than one nanomolar); and/or (2) induce functional (or better) levels of silencing for greater than 96 hours. Filters that identify hyperfunctional molecules can vary widely. In one example, the top ten, twenty, thirty, or forty siRNA can be assessed for the ability to silence a given target at, e.g., concentrations of 1 nM and 0.5 nM to identify hyperfunctional molecules.

Pooling

According to another embodiment, the present invention provides a pool of at least two siRNAs, preferably in the form of a kit or therapeutic reagent, wherein one strand of each of the siRNAs, the sense strand, comprises a sequence that is substantially similar to a sequence within a target mRNA. The opposite strand, the antisense strand, will preferably comprise a sequence that is substantially complementary to that of the target mRNA. More preferably, one strand of each siRNA will comprise a sequence that is identical to a sequence that is contained in the target mRNA. Most preferably, each siRNA will be 19 base pairs in length, and one strand of each of the siRNAs will be 100% complementary to a portion of the target mRNA.

By increasing the number of siRNAs directed to a particular target using a pool or kit, one is able both to increase the likelihood that at least one siRNA with satisfactory functionality will be included, as well as to benefit from additive or synergistic effects. Further, when two or more siRNAs directed against a single gene do not have satisfactory levels of functionality alone, if combined, they may satisfactorily promote degradation of the target messenger RNA and successfully inhibit translation. By including multiple siRNAs in the system, not only is the probability of silencing increased, but the economics of operation are also improved when compared to adding different siRNAs sequentially. This effect is contrary to the conventional wisdom that the concurrent use of multiple siRNA will negatively impact gene silencing (e.g., Holen, T. et al. (2003) Similar behavior of single strand and double strand siRNAs suggests they act through a common RNAi pathway. NAR 31: 2401-2407).

In fact, when two siRNAs were pooled together, 54% of the pools of two siRNAs induced more than 95% gene silencing. Thus, a 2.5-fold increase in the percentage of functionality was achieved by randomly combining two siRNAs. Further, over 84% of pools containing two siRNAs induced more than 80% gene silencing.

More preferably, the kit is comprised of at least three siRNAs, wherein one strand of each siRNA comprises a sequence that is substantially similar to a sequence of the target mRNA and the other strand comprises a sequence that is substantially complementary to the region of the target mRNA. As with the kit that comprises at least two siRNAs, more preferably one strand will comprise a sequence that is identical to a sequence that is contained in the mRNA and another strand that is 100% complementary to a sequence that is contained in the mRNA. During experiments, when three siRNAs were combined together, 60% of the pools induced more than 95% gene silencing and 92% of the pools induced more than 80% gene silencing.

Further, even more preferably, the kit is comprised of at least four siRNAs, wherein one strand of each siRNA comprises a sequence that is substantially similar to a region of the sequence of the target mRNA, and the other strand comprises a sequence that is substantially complementary to the region of the target mRNA. As with the kit or pool that comprises at least two siRNAs, more preferably one strand of each of the siRNA duplexes will comprise a sequence that is identical to a sequence that is contained in the mRNA, and another strand that is 100% complementary to a sequence that is contained in the mRNA.

Additionally, kits and pools with at least five, at least six, and at least seven siRNAs may also be useful with the present invention. For example, pools of five siRNAs induced 95% gene silencing with 77% probability and 80% silencing with 98.8% probability. Thus, pooling of siRNAs together can result in the creation of a target-specific silencing reagent with almost a 99% probability of being functional. The fact that such high levels of success are achievable using such pools of siRNA enables one to dispense with costly and time-consuming target-specific validation procedures.
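As a rough point of comparison for the pool figures quoted above, the following sketch computes the success rate one would expect if each siRNA in a pool acted independently. The single-siRNA rate of 21.6% is an assumption, back-calculated from the stated 2.5-fold increase (54% / 2.5); it is not a figure given in this passage, and the independence model is illustrative only.

```python
def independent_pool_rate(single_rate: float, pool_size: int) -> float:
    """Probability that at least one of pool_size independent siRNAs is functional."""
    return 1.0 - (1.0 - single_rate) ** pool_size


# Assumed single-siRNA rate for >95% silencing, back-calculated as 54% / 2.5.
SINGLE_RATE_95 = 0.54 / 2.5

# Pool rates for >95% silencing as quoted in the text (pool size -> rate).
reported = {2: 0.54, 3: 0.60, 5: 0.77}

for size, rate in reported.items():
    expected = independent_pool_rate(SINGLE_RATE_95, size)
    print(f"pool of {size}: independent model {expected:.1%}, reported {rate:.1%}")
```

Under these assumptions the reported rates are higher than the independent-model rates at each pool size, which is in line with the additive or synergistic effect described in the text.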

For this embodiment, as well as the other aforementioned embodiments, each of the siRNAs within a pool will preferably comprise 18-30 base pairs, more preferably 18-25 base pairs, and most preferably 19 base pairs. Within each siRNA, preferably at least 18 contiguous bases of the antisense strand will be 100% complementary to the target mRNA. More preferably, at least 19 contiguous bases of the antisense strand will be 100% complementary to the target mRNA. Additionally, there may be overhangs on either the sense strand or the antisense strand, and these overhangs may be at either the 5′ end or the 3′ end of either of the strands, for example there may be one or more overhangs of 1-6 bases. When overhangs are present, they are not included in the calculation of the number of base pairs. The two nucleotide 3′ overhangs mimic natural siRNAs and are commonly used but are not essential. Preferably, the overhangs should consist of two nucleotides, most often dTdT or UU at the 3′ end of the sense and antisense strand that are not complementary to the target sequence. The siRNAs may be produced by any method that is now known or that comes to be known for synthesizing double stranded RNA that one skilled in the art would appreciate would be useful in the present invention. Preferably, the siRNAs will be produced by Dharmacon's proprietary ACE® technology. However, other methods for synthesizing siRNAs are well known to persons skilled in the art and include, but are not limited to, any chemical synthesis of RNA oligonucleotides, ligation of shorter oligonucleotides, in vitro transcription of RNA oligonucleotides, the use of vectors for expression within cells, recombinant Dicer products and PCR products.
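The duplex geometry described above (a 19 base pair core, a fully complementary antisense strand, and two-nucleotide 3′ overhangs that are not counted as base pairs) can be sketched as follows. The function names are hypothetical and chosen only for illustration; the example sequence is Cyclo 1 from Table III.

```python
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def antisense_of(sense_core: str) -> str:
    """Return the reverse complement of an RNA sense core, written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(sense_core))


def build_duplex(sense_core: str, overhang: str = "UU") -> tuple[str, str]:
    """Append a 3' overhang to each strand of the duplex.

    The overhang is not part of the base-paired core, so the 18-30 bp
    length check is applied to the core only.
    """
    if not 18 <= len(sense_core) <= 30:
        raise ValueError("duplex core should be 18-30 base pairs")
    return sense_core + overhang, antisense_of(sense_core) + overhang


sense, antisense = build_duplex("GUUCCAAAAACAGUGGAUA")  # Cyclo 1, Table III
print("sense     5'-" + sense + "-3'")
print("antisense 5'-" + antisense + "-3'")
```

Passing `overhang="dTdT"` would give the other commonly used overhang mentioned in the text; the string form of that overhang is a notational assumption of this sketch.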

The siRNA duplexes within the aforementioned pools of siRNAs may correspond to overlapping sequences within a particular mRNA, or non-overlapping sequences of the mRNA. However, preferably they correspond to non-overlapping sequences. Further, each siRNA may be selected randomly, or one or more of the siRNA may be selected according to the criteria discussed above for maximizing the effectiveness of siRNA.

Included in the definition of siRNAs are siRNAs that contain substituted and/or labeled nucleotides that may, for example, be labeled by radioactivity, fluorescence or mass. The most common substitutions are at the 2′ position of the ribose sugar, where moieties such as H (hydrogen), F, NH₃, OCH₃ and other O-alkyl, alkenyl, alkynyl, and orthoesters may be substituted, or in the phosphorous backbone, where sulfur, amines or hydrocarbons may be substituted for the bridging or non-bridging atoms in the phosphodiester bond. Examples of modified siRNAs are explained more fully in commonly assigned U.S. patent application Ser. No. 10/613,077, filed Jul. 1, 2003.

Additionally, as noted above, the cell type into which the siRNA is introduced may affect the ability of the siRNA to enter the cell; however, it does not appear to affect the ability of the siRNA to function once it enters the cell. Methods for introducing double-stranded RNA into various cell types are well known to persons skilled in the art.

As persons skilled in the art are aware, in certain species, the presence of proteins such as RdRP, the RNA-dependent RNA polymerase, may catalytically enhance the activity of the siRNA. For example, RdRP propagates the RNAi effect in C. elegans and other non-mammalian organisms. In fact, in organisms that contain these proteins, the siRNA may be inherited. Two other proteins that are well studied and known to be a part of the machinery are members of the Argonaute family and Dicer, as well as their homologues. There is also initial evidence that the RISC complex might be associated with the ribosome so the more efficiently translated mRNAs will be more susceptible to silencing than others.

Another very important factor in the efficacy of siRNA is mRNA localization. In general, only cytoplasmic mRNAs are considered to be accessible to RNAi to any appreciable degree. However, appropriately designed siRNAs, for example, siRNAs modified with internucleotide linkages or 2′-O-methyl groups, may be able to cause silencing by acting in the nucleus. Examples of these types of modifications are described in commonly assigned U.S. patent application Ser. Nos. 10/431,027 and 10/613,077.

As described above, even when one selects at least two siRNAs at random, the effectiveness of the two may be greater than one would predict based on the effectiveness of two individual siRNAs. This additive or synergistic effect is particularly noticeable as one increases to at least three siRNAs, and even more noticeable as one moves to at least four siRNAs. Surprisingly, the pooling of the non-functional and semi-functional siRNAs, particularly more than five siRNAs, can lead to a silencing mixture that is as effective if not more effective than any one particular functional siRNA.

Within the kits of the present invention, preferably each siRNA will be present in a concentration of between 0.001 and 200 μM, more preferably between 0.01 and 200 nM, and most preferably between 0.1 and 10 nM.

In addition to preferably comprising at least four or five siRNAs, the kits of the present invention will also preferably comprise a buffer to keep the siRNA duplex stable. Persons skilled in the art are aware of buffers suitable for keeping siRNA stable. For example, the buffer may be comprised of 100 mM KCl, 30 mM HEPES-pH 7.5, and 1 mM MgCl₂. Alternatively, kits might contain complementary strands that contain any one of a number of chemical modifications (e.g., a 2′-O-ACE) that protect the agents from degradation by nucleases. In this instance, the user may (or may not) remove the modifying protective group (e.g., deprotect) before annealing the two complementary strands together.

By way of example, the kits may be organized such that pools of siRNA duplexes are provided on an array or microarray of wells or drops for a particular gene set or for unrelated genes. The array may, for example, be in 96 wells, 384 wells or 1284 wells arrayed in a plastic plate or on a glass slide using techniques now known or that come to be known to persons skilled in the art. Within an array, preferably there will be controls such as functional anti-lamin A/C, cyclophilin and two siRNA duplexes that are not specific to the gene of interest.

In order to ensure stability of the siRNA pools prior to usage, they may be retained in lyophilized form at minus twenty degrees (−20° C.) until they are ready for use. Prior to usage, they should be resuspended; however, even once resuspended, for example, in the aforementioned buffer, they should be kept at minus twenty degrees (−20° C.) until used. The aforementioned buffer, prior to use, may be stored at approximately 4° C. or room temperature. Effective temperatures at which to conduct transfections are well known to persons skilled in the art and include, for example, room temperature.

The kits may be applied either in vivo or in vitro. Preferably, the siRNA of the pools or kits is applied to a cell through transfection, employing standard transfection protocols. These methods are well known to persons skilled in the art and include the use of lipid-based carriers, electroporation, cationic carriers, and microinjection. Further, one could apply the present invention by synthesizing equivalent DNA sequences (either as two separate, complementary strands, or as hairpin molecules) instead of siRNA sequences and introducing them into cells through vectors. Once in the cells, the cloned DNA could be transcribed, thereby forcing the cells to generate the siRNA. Examples of vectors suitable for use with the present application include but are not limited to the standard transient expression vectors, adenoviruses, retroviruses, lentivirus-based vectors, as well as other traditional expression vectors. Any vector that has an adequate siRNA expression and processing module may be used. Furthermore, certain chemical modifications to siRNAs, including but not limited to conjugations to other molecules, may be used to facilitate delivery. For certain applications it may be preferable to deliver molecules without transfection by simply formulating them in a physiologically acceptable solution.

This embodiment may be used in connection with any of the aforementioned embodiments. Accordingly, the sequences within any pool may be selected by rational design.

Multigene Silencing

In addition to developing kits that contain multiple siRNA directed against a single gene, another embodiment includes the use of multiple siRNA targeting multiple genes. Multiple genes may be targeted through the use of high- or hyper-functional siRNA. High- or hyper-functional siRNA that exhibit increased potency require lower concentrations to induce desired phenotypic (and thus therapeutic) effects. This circumvents RISC saturation. It therefore stands to reason that if lower concentrations of a single siRNA are needed to knock out or knock down expression of one gene, then the remaining (uncomplexed) RISC will be free and available to interact with siRNA directed against two, three, four, or more genes. Thus in this embodiment, the authors describe the use of highly functional or hyper-functional siRNA to knock out three separate genes. More preferably, such reagents could be combined to knock out four distinct genes. Even more preferably, highly functional or hyper-functional siRNA could be used to knock out five distinct genes. Most preferably, siRNA of this type could be used to knock out or knock down the expression of six or more genes.

Hyperfunctional siRNA

The term hyperfunctional siRNA (hf-siRNA) describes a subset of the siRNA population that induces RNAi in cells at low- or sub-nanomolar concentrations for extended periods of time. These traits, heightened potency and extended longevity of the RNAi phenotype, are highly attractive from a therapeutic standpoint. Agents having higher potency require lesser amounts of the molecule to achieve the desired physiological response, thus reducing the probability of side effects due to “off-target” interference. In addition to the potential therapeutic benefits associated with hyperfunctional siRNA, hf-siRNA are also desirable from an economic perspective. Hyperfunctional siRNA may cost less on a per-treatment basis, thus reducing overall expenditures to both the manufacturer and the consumer.

Identification of hyperfunctional siRNA involves multiple steps that are designed to examine an individual siRNA agent's concentration- and/or longevity-profiles. In one non-limiting example, a population of siRNA directed against a single gene are first analyzed using the previously described algorithm (Formula VIII). Individual siRNA are then introduced into a test cell line and assessed for the ability to degrade the target mRNA. It is important to note that when performing this step it is not necessary to test all of the siRNA. Instead, it is sufficient to test only those siRNA having the highest SMARTSCORES™, or siRNA ranking (i.e., SMARTSCORES™, or siRNA ranking >−10). Subsequently, the gene silencing data is plotted against the SMARTSCORES™, or siRNA rankings (see FIG. 9). siRNA that (1) induce a high degree of gene silencing (i.e., they induce greater than 80% gene knockdown) and (2) have superior SMARTSCORES™ (i.e., a SMARTSCORE™, or siRNA ranking, of >−10, suggesting a desirable average internal stability profile) are selected for further investigations designed to better understand the molecule's potency and longevity. In one, non-limiting study dedicated to understanding a molecule's potency, an siRNA is introduced into one (or more) cell types in increasingly diminishing concentrations (e.g., 3.0→0.3 nM). Subsequently, the level of gene silencing induced by each concentration is examined and siRNA that exhibit hyperfunctional potency (i.e., those that induce 80% silencing or greater at, e.g., picomolar concentrations) are identified. In a second study, the longevity profiles of siRNA having high (>−10) SMARTSCORES™, or siRNA rankings and greater than 80% silencing are examined. In one non-limiting example of how this is achieved, siRNA are introduced into a test cell line and the levels of RNAi are measured over an extended period of time (e.g., 24-168 hrs). 
siRNAs that exhibit strong RNA interference patterns (i.e., >80% interference) for periods of time greater than, e.g., 120 hours, are thus identified. Studies similar to those described above can be performed on any and all of the >10⁶ siRNA included in this document to further define the most functional molecule for any given gene. Molecules possessing one or both properties (extended longevity and heightened potency) are labeled “hyperfunctional siRNA,” and earmarked as candidates for future therapeutic studies.
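The screening steps described above can be sketched as a two-stage filter. This is a minimal illustration, not the procedure actually used: the `Candidate` record, its field names, and the example values are all hypothetical, and the cutoffs (score above −10, more than 80% silencing, more than 120 hours) are simply the example values quoted in the text.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    name: str
    score: float               # Formula VIII score (hypothetical values below)
    screen_silencing: float    # % knockdown in the initial screen
    low_dose_silencing: float  # % knockdown at the lowest concentration tested
    hours_above_80: float      # hours during which interference stayed above 80%


def initial_screen(candidates, min_score=-10.0, min_silencing=80.0):
    """Keep siRNAs with a score above the cutoff and strong initial silencing."""
    return [c for c in candidates
            if c.score > min_score and c.screen_silencing > min_silencing]


def is_hyperfunctional(c, min_silencing=80.0, min_hours=120.0):
    """One or both properties (potency, longevity) qualify a molecule."""
    potent = c.low_dose_silencing >= min_silencing
    long_lived = c.hours_above_80 > min_hours
    return potent or long_lived


# Hypothetical data, for illustration only.
candidates = [
    Candidate("si-A", -2.0, 92.0, 85.0, 144.0),
    Candidate("si-B", -15.0, 95.0, 90.0, 168.0),  # fails the score cutoff
    Candidate("si-C", 4.0, 83.0, 40.0, 72.0),     # passes screen, not hyperfunctional
    Candidate("si-D", 1.0, 60.0, 10.0, 24.0),     # fails the silencing cutoff
]

shortlist = initial_screen(candidates)
hyperfunctional = [c.name for c in shortlist if is_hyperfunctional(c)]
print(hyperfunctional)
```

The cutoffs are exposed as parameters because the selection criteria are described as adjustable for each family of siRNA.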

While the example(s) given above describe one means by which hyperfunctional siRNA can be isolated, neither the assays themselves nor the selection parameters used are rigid and can vary with each family of siRNA. Families of siRNA include siRNAs directed against a single gene, or directed against a related family of genes.

The highest quality siRNA achievable for any given gene may vary considerably. Thus, for example, in the case of one gene (gene X), rigorous studies such as those described above may enable the identification of an siRNA that, at picomolar concentrations, induces 99+% silencing for a period of 10 days. Yet identical studies of a second gene (gene Y) may yield an siRNA that at high nanomolar concentrations (e.g., 100 nM) induces only 75% silencing for a period of 2 days. Both molecules represent the very optimum siRNA for their respective gene targets and therefore are designated “hyperfunctional.” Yet due to a variety of factors including but not limited to target concentration, siRNA stability, cell type, off-target interference, and others, equivalent levels of potency and longevity are not achievable. Thus, for these reasons, the parameters described in the aforementioned assays can vary. While the initial screen selected siRNA that had SMARTSCORES™ above −10 and a gene silencing capability of greater than 80%, selections that have stronger (or weaker) parameters can be implemented. Similarly, in the subsequent studies designed to identify molecules with high potency and longevity, the desired cutoff criteria (i.e., the lowest concentration that induces a desirable level of interference, or the longest period of time that interference can be observed) can vary. The experimentation subsequent to application of the rational criteria of this application is significantly reduced where one is trying to obtain a suitable hyperfunctional siRNA for, for example, therapeutic use. When, for example, the additional experimentation of the type described herein is applied by one skilled in the art with this disclosure in hand, a hyperfunctional siRNA is readily identified.

The siRNA may be introduced into a cell by any method that is now known or that comes to be known and that from reading this disclosure, persons skilled in the art would determine would be useful in connection with the present invention in enabling siRNA to cross the cellular membrane. These methods include, but are not limited to, any manner of transfection, such as, for example, transfection employing DEAE-Dextran, calcium phosphate, cationic lipids/liposomes, micelles, manipulation of pressure, microinjection, electroporation, immunoporation, use of vectors such as viruses, plasmids, cosmids, bacteriophages, cell fusions, and coupling of the polynucleotides to specific conjugates or ligands such as antibodies, antigens, or receptors, passive introduction, adding moieties to the siRNA that facilitate its uptake, and the like.

Having described the invention with a degree of particularity, examples will now be provided. These examples are not intended to and should not be construed to limit the scope of the claims in any way.

EXAMPLES General Techniques and Nomenclatures

siRNA nomenclature. All siRNA duplexes are referred to by sense strand. The first nucleotide of the 5′-end of the sense strand is position 1, which corresponds to position 19 of the antisense strand for a 19-mer. In most cases, to compare results from different experiments, silencing was determined by measuring specific transcript mRNA levels or enzymatic activity associated with specific transcript levels, 24 hours post-transfection, with siRNA concentrations held constant at 100 nM. For all experiments, unless otherwise specified, transfection efficiency was ensured to be over 95%, and no detectable cellular toxicity was observed. The following system of nomenclature was used to compare and report siRNA-silencing functionality: “F” followed by the degree of minimal knockdown. For example, F50 signifies at least 50% knockdown, F80 means at least 80%, and so forth. For this study, all sub-F50 siRNAs were considered non-functional.
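The nomenclature above can be illustrated with a short sketch. The function names are hypothetical, and the set of reporting levels (50, 80, 95) is an assumption chosen to match thresholds quoted elsewhere in this document; the text itself names only F50 and F80 explicitly.

```python
def f_labels(knockdown_percent, levels=(50, 80, 95)):
    """Return every F-label a measurement satisfies ('F80' = at least 80% knockdown)."""
    return [f"F{level}" for level in levels if knockdown_percent >= level]


def is_functional(knockdown_percent):
    """Sub-F50 siRNAs are treated as non-functional."""
    return knockdown_percent >= 50


def antisense_position(sense_position, length=19):
    """Map a sense-strand position to the paired antisense position.

    Position 1 of the sense strand pairs with position `length` of the
    antisense strand (19 for a 19-mer).
    """
    if not 1 <= sense_position <= length:
        raise ValueError("position outside the duplex")
    return length - sense_position + 1


print(f_labels(85))           # labels satisfied by an 85% knockdown
print(is_functional(42))      # a 42% knockdown is sub-F50
print(antisense_position(1))  # sense position 1 on a 19-mer
```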

Cell culture and transfection. 96-well plates are coated with 50 μl of 50 mg/ml poly-L-lysine (Sigma) for 1 hr, and then washed 3× with distilled water before being dried for 20 min. HEK293 cells or HEK293Lucs or any other cell type of interest are released from their solid support by trypsinization and diluted to 3.5×10⁵ cells/ml, followed by the addition of 100 μl of cells/well. Plates are then incubated overnight at 37° C., 5% CO₂. Transfection procedures can vary widely depending on the cell type and transfection reagents. In one non-limiting example, a transfection mixture consisting of 2 ml Opti-MEM I (Gibco-BRL), 80 μl Lipofectamine 2000 (Invitrogen), 15 μl SUPERNasin at 20 U/μl (Ambion), and 1.5 μl of reporter gene plasmid at 1 μg/μl is prepared in 5-ml polystyrene round bottom tubes. One hundred μl of transfection reagent is then combined with 100 μl of siRNAs in polystyrene deep-well titer plates (Beckman) and incubated for 20 to 30 min at room temperature. Five hundred and fifty microliters of Opti-MEM is then added to each well to bring the final siRNA concentration to 100 nM. Plates are then sealed with parafilm and mixed. Media is removed from HEK293 cells and replaced with 95 μl of transfection mixture. Cells are incubated overnight at 37° C., 5% CO₂.
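The volumes in this protocol imply a simple dilution calculation, sketched below. The 750 nM figure is derived here from the stated volumes and the 100 nM final concentration; it is not stated in the protocol, and the function name is hypothetical.

```python
def required_input_nM(final_nM, input_volume_ul, total_volume_ul):
    """Concentration the siRNA input must have to reach final_nM after dilution."""
    return final_nM * total_volume_ul / input_volume_ul


# Volumes from the protocol: transfection reagent + siRNA + Opti-MEM.
reagent_ul, sirna_ul, optimem_ul = 100, 100, 550
total_ul = reagent_ul + sirna_ul + optimem_ul

input_nM = required_input_nM(final_nM=100, input_volume_ul=sirna_ul,
                             total_volume_ul=total_ul)

# Cells seeded per well: 3.5e5 cells/ml dispensed at 100 ul per well.
cells_per_well = 3.5e5 * 100 / 1000

print(f"total volume per well of mix: {total_ul} ul")
print(f"siRNA input concentration:    {input_nM:.0f} nM")
print(f"cells seeded per well:        {cells_per_well:.0f}")
```

On these numbers the 100 μl siRNA input would need to be at roughly 750 nM, and each well is seeded with about 35,000 cells.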

Quantification of gene knockdown. A variety of quantification procedures can be used to measure the level of silencing induced by siRNA or siRNA pools. In one non-limiting example: to measure mRNA levels 24 hrs post-transfection, QuantiGene branched-DNA (bDNA) kits (Bayer) (Wang, et al, Regulation of insulin preRNA splicing by glucose. Proc. Natl. Acad. Sci. USA 1997, 94:4360.) are used according to manufacturer instructions. To measure luciferase activity, media is removed from HEK293 cells 24 hrs post-transfection, and 50 μl of Steady-GLO reagent (Promega) is added. After 5 minutes, plates are analyzed on a plate reader.
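Whichever readout is used, the silencing figure is a comparison of treated signal against a control. A minimal sketch of that calculation follows; the function name, the optional background subtraction, and the example readings are assumptions for illustration and are not taken from the kit instructions.

```python
def percent_knockdown(treated_signal, control_signal, background=0.0):
    """Percent reduction of a signal relative to a non-specific control."""
    if control_signal <= background:
        raise ValueError("control signal must exceed background")
    remaining = (treated_signal - background) / (control_signal - background)
    return 100.0 * (1.0 - remaining)


# Hypothetical plate-reader values (arbitrary luminescence units).
control = 1000.0
treated = 200.0
print(f"{percent_knockdown(treated, control):.1f}% knockdown")
```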

Example I Sequences Used to Develop the Algorithm

Anti-Firefly and anti-Cyclophilin siRNA panels (FIG. 5a, b) were sorted according to the values predicted using Formula VIII. All siRNAs scoring more than 0 (Formula VIII) and more than 20 (Formula IX) are fully functional. All ninety sequences for each gene (and DBI) appear below in Table III.

TABLE III

Cyclo 1     SEQ. ID 0032    GUUCCAAAAACAGUGGAUA
Cyclo 2     SEQ. ID 0033    UCCAAAAACAGUCGAUAAU
Cyclo 3     SEQ. ID 0034    CAAAAACAGUGGAUAAUUU
Cyclo 4     SEQ. ID 0035    AAAACAGUGGAUAAUUUUG
Cyclo 5     SEQ. ID 0036    AACAGUGGAUAAUUUUGUG
Cyclo 6     SEQ. ID 0037    CAGUGGAUAAUUUUGUGGC
Cyclo 7     SEQ. ID 0038    GUGGAUAAUUUUGUGGCCU
Cyclo 8     SEQ. ID 0039    GGAUAAUUUUGUGGCCUUA
Cyclo 9     SEQ. ID 0040    AUAAUUUUGUGGCCUUAGC
Cyclo 10    SEQ. ID 0041    AAUUUUGUGGCCUUAGCUA
Cyclo 11    SEQ. ID 0042    UUUUGUGGCCUUAGCUACA
Cyclo 12    SEQ. ID 0043    UUGUGGCCUUAGCUACAGG
Cyclo 13    SEQ. ID 0044    GUGGCCUUAGCUACAGGAG
Cyclo 14    SEQ. ID 0045    GGCCUUAGCUACAGGAGAG
Cyclo 15    SEQ. ID 0046    CCUUAGCUACAGGAGAGAA
Cyclo 16    SEQ. ID 0047    UUAGCUACAGGAGAGAAAG
Cyclo 17    SEQ. ID 0048    AGCUACAGGAGAGAAAGGA
Cyclo 18    SEQ. ID 0049    CUACAGGAGAGAAAGGAUU
Cyclo 19    SEQ. ID 0050    ACAGGAGAGAAAGGAUUUG
Cyclo 20    SEQ. ID 0051    AGGAGAGAAAGGAUUUGGC
Cyclo 21    SEQ. ID 0052    GAGAGAAAGGAUUUGGCUA
Cyclo 22    SEQ. ID 0053    GAGAAAGGAUUUGGCUACA
Cyclo 23    SEQ. ID 0054    GAAAGGAUUUGGCUACAAA
Cyclo 24    SEQ. ID 0055    AAGGAUUUGGCUACAAAAA
Cyclo 25    SEQ. ID 0056    GGAUUUGGCUACAAAAACA
Cyclo 26    SEQ. ID 0057    AUUUGGCUACAAAAACAGC
Cyclo 27    SEQ. ID 0058    UUGGCUACAAAAACAGCAA
Cyclo 28    SEQ. ID 0059    GGCUACAAAAACAGCAAAU
Cyclo 29    SEQ. ID 0060    CUACAAAAACAGCAAAUUC
Cyclo 30    SEQ. ID 0061    ACAAAAACAGCAAAUUCCA
Cyclo 31    SEQ. ID 0062    AAAAACAGCAAAUUCCAUC
Cyclo 32    SEQ. ID 0063    AAACAGCAAAUUCCAUCGU
Cyclo 33    SEQ. ID 0064    ACAGCAAAUUCCAUCGUGU
Cyclo 34    SEQ. ID 0065    AGCAAAUUCCAUCGUGUAA
Cyclo 35    SEQ. ID 0066    CAAAUUCCAUCGUGUAAUC
Cyclo 36    SEQ. ID 0067    AAUUCCAUCGUGUAAUCAA
Cyclo 37    SEQ. ID 0068    UUCCAUCGUGUAAUCAAGG
Cyclo 38    SEQ. ID 0069    CCAUCGUGUAAUCAAGGAC
Cyclo 39    SEQ. ID 0070    AUCGUGUAAUCAAGGACUU
Cyclo 40    SEQ. ID 0071    CGUGUAAUCAAGGACUUCA
Cyclo 41    SEQ. ID 0072    UGUAAUCAAGGACUUCAUG
Cyclo 42    SEQ. ID 0073    UAAUCAAGGACUUCAUGAU
Cyclo 43    SEQ. ID 0074    AUCAAGGACUUCAUGAUCC
Cyclo 44    SEQ. ID 0075    CAAGGACUUCAUGAUCCAG
Cyclo 45    SEQ. ID 0076    AGGACUUCAUGAUCCAGGG
Cyclo 46    SEQ. ID 0077    GACUUCAUGAUCCAGGGCG
Cyclo 47    SEQ. ID 0076    CUUCAUGAUCCAGGGCGGA
Cyclo 48    SEQ. ID 0079    UCAUGAUCCAGGGCGGAGA
Cyclo 49    SEQ. ID 0080    AUGAUCCAGGGCGGAGACU
Cyclo 50    SEQ. ID 0081    GAUCCAGGGCGGAGACUUC
Cyclo 51    SEQ. ID 0082    UCCAGGGCGGAGACUUCAC
Cyclo 52    SEQ. ID 0083    CAGGGCGGAGACUUCACCA
Cyclo 53    SEQ. ID 0084    GGGCGGAGACUUCACCAGG
Cyclo 54    SEQ. ID 0085    GCGGAGACUUCACCAGGGG
Cyclo 55    SEQ. ID 0086    GGAGACUUCACCAGGGGAG
Cyclo 56    SEQ. ID 0087    AGACUUCACCAGGGGAGAU
Cyclo 57    SEQ. ID 0088    ACUUCACCAGGGGAGAUGG
Cyclo 58    SEQ. ID 0089    UUCACCAGGGGAGAUGGCA
Cyclo 59    SEQ. ID 0090    CACCAGGGGAGAUGGCACA
Cyclo 60    SEQ. ID 0091    CCAGGGGAGAUGGCACAGG
Cyclo 61    SEQ. ID 0092    AGGGGAGAUGGCACAGGAG
Cyclo 62    SEQ. ID 0093    GGGAGAUGGCACAGGAGGA
Cyclo 63    SEQ. ID 0094    GAGAUGGCACAGGAGGAAA
Cyclo 64    SEQ. ID 0095    GAUGGCACAGGAGGAAAGA
Cyclo 65    SEQ. ID 0096    UGGCACACGAGGAAAGAGC
Cyclo 66    SEQ. ID 0097    GCACAGGAGGAAAGAGCAU
Cyclo 67    SEQ. ID 0098    ACAGGAGGAAAGAGCAUCU
Cyclo 68    SEQ. ID 0099    AGGAGGAAAGAGCAUCUAC
Cyclo 69    SEQ. ID 0100    GAGGAAAGAGCAUCUACGG
Cyclo 70    SEQ. ID 0101    GGAAAGAGCAUCUACGGUG
Cyclo 71    SEQ. ID 0102    AAAGAGCAUCUACGGUGAG
Cyclo 72    SEQ. ID 0103    AGAGCAUCUACGGUGAGCG
Cyclo 73    SEQ. ID 0104    AGCAUCUACGGUGAGCGCU
Cyclo 74    SEQ. ID 0105    CAUCUACGGUGAGCGCUUC
Cyclo 75    SEQ. ID 0106    UCUACGGUGAGCGCUUCCC
Cyclo 76    SEQ. ID 0107    UACGGUGAGCGCUUCCCCG
Cyclo 77    SEQ. ID 0108    CGGUGAGCGCUUCCCCGAU
Cyclo 78    SEQ. ID 0109    GUGACCGCUUCCCCGAUGA
Cyclo 79    SEQ. ID 0110    GAGCGCUUCCCCGAUGAGA
Cyclo 80    SEQ. ID 0111    GCGCUUCCCCGAUGAGAAC
Cyclo 81    SEQ. ID 0112    GCUUCCCCGAUGAGAACUU
Cyclo 82    SEQ. ID 0113    UUCCCCGAUGAGAACUUCA
Cyclo 83    SEQ. ID 0114    CCCCGAUGAGAACUUCAAA
Cyclo 84    SEQ. ID 0115    CCGAUGAGAACUUCAAACU
Cyclo 85    SEQ. ID 0116    GAUGAGAACUUCAAACUGA
Cyclo 86    SEQ. ID 0117    UGAGAACUUCAAACUGAAG
Cyclo 87    SEQ. ID 0118    AGAACUUCAAACUGAAGCA
Cyclo 88    SEQ. ID 0119    AACUUCAAACUGAAGCACU
Cyclo 89    SEQ. ID 0120    CUUCAAACUGAAGCACUAC
Cyclo 90    SEQ. ID 0121    UCAAACUGAAGCACUACGG
DB 1        SEQ. ID 0122    ACGGGCAAGGCCAAGUGGG
DB 2        SEQ. ID 0123    CGGGCAAGGCCAAGUGGGA
DB 3        SEQ. ID 0124    GGGCAAGGCCAAGUGGGAU
DB 4        SEQ. ID 0125    GGCAAGGCCAAGUGGGAUG
DB 5        SEQ. ID 0126    GCAAGGCCAAGUGGGAUGC
DB 6        SEQ. ID 0127    CAAGGCCAAGUGGGAUGCC
DB 7        SEQ. ID 0128    AAGGCCAAGUGGGAUGCCU
DB 8        SEQ. ID 0129    AGGCCAAGUGGGAUGCCUG
DB 9        SEQ. ID 0130    GGCCAAGUGGGAUGCCUGG
DB 10       SEQ. ID 0131    GCCAAGUGGGAUGCCUGGA
DB 11       SEQ. ID 0132    CCAAGUGGGAUGCCUGGAA
DB 12       SEQ. ID 0133    CAAGUGGGAUGCCUGGAAU
DB 13       SEQ. ID 0134    AAGUGGGAUGCCUGGAAUG
DB 14       SEQ. ID 0135    AGUGGGAUGCCUGGAAUGA
DB 15       SEQ. ID 0136    GUGGGAUGCCUGGAAUGAG
DB 16       SEQ. ID 0137    UGGGAUGCCUGGAAUGAGC
DB 17       SEQ. ID 0138    GGGAUGCCUGGAAUGAGCU
DB 18       SEQ. ID 0139    GGAUGCCUGGAAUGAGCUG
DB 19       SEQ. ID 0140    GAUGCCUGGAAUGAGCUGA
DB 20       SEQ. ID 0141    AUGCCUGGAAUGAGCUGAA
DB 21       SEQ. ID 0142    UGCCUGGAAUGAGCUGAAA
DB 22       SEQ. ID 0143    GCCUGGAAUGAGCUGAAAG
DB 23       SEQ. ID 0144    CCUGGAAUGAGCUGAAAGG
DB 24       SEQ. ID 0145    CUGGAAUGAGCUGAAAGGG
DB 25       SEQ. ID 0146    UGGAAUGAGCUGAAAGGGA
DB 26       SEQ. ID 0147    GGAAUGAGCUGAAAGGGAC
DB 27       SEQ. ID 0148    GAAUGAGCUGAAAGGGACU
DB 28       SEQ. ID 0149    AAUGAGCUGAAAGGGACUU
DB 29       SEQ. ID 0150    AUGAGCUGAAAGGGACUUC
DB 30       SEQ. ID 0151    UGAGCUGAAAGGGACUUCC
DB 31       SEQ. ID 0152    GAGCUGAAAGGGACUUCCA
DB 32       SEQ. ID 0153    AGCUGAAAGGGACUUCCAA
DB 33       SEQ. ID 0154    GCUGAAAGGGACUUCCAAG
DB 34       SEQ. ID 0155    CUGAAAGGGACUUCCAAGG
DB 35       SEQ. ID 0156    UGAAAGGGACUUCGAAGGA
DB 36       SEQ. ID 0157    GAAAGGGACUUCCAAGGAA
DB 37       SEQ. ID 0158    AAAGGGACUUCCAAGGAAG
DB 38       SEQ. ID 0159    AAGGGACUUCCAAGGAAGA
DB 39       SEQ. ID 0160    AGGGACUUCCAAGGAAGAU
DB 40       SEQ. ID 0161    GGGACUUCCAAGGAAGAUG
DB 41       SEQ. ID 0162    GGACUUCCAAGGAAGAUGC
DB 42       SEQ. ID 0163    GACUUCCAAGGAAGAUGCC
DB 43       SEQ. ID 0164    ACUUCCAAGGAAGAUGCCA
DB 44       SEQ. ID 0165    CUUCCAAGGAAGAUGCCAU
DB 45       SEQ. ID 0166    UUCCAAGGAAGAUGCCAUG
DB 46       SEQ. ID 0167    UCCAAGGAAGAUGCCAUGA
DB 47       SEQ. ID 0168    CCAAGGAAGAUGCCAUGAA
DB 48       SEQ. ID 0169    CAAGGAAGAUGCCAUGAAA
DB 49       SEQ. ID 0170    AAGGAAGAUGCCAUGAAAG
DB 50       SEQ. ID 0171    AGGAAGAUGCCAUGAAAGC
DB 51       SEQ. ID 0172    GGAAGAUGCCAUGAAAGCU
DB 52       SEQ. ID 0173    GAAGAUGCCAUGAAAGCUU
DB 53       SEQ. ID 0174    AAGAUGCCAUGAAAGCUUA
DB 54       SEQ. ID 0175    AGAUGCCAUGAAAGCUUAC
DB 55       SEQ. ID 0176    GAUGCCAUGAAAGCUUACA
DB 56       SEQ. ID 0177    AUGCCAUGAAAGCUUACAU
DB 57       SEQ. ID 0178    UGCCAUGAAAGCUUACAUC
DB 58       SEQ. ID 0179    GCCAUGAAAGCUUACAUCA
DB 59       SEQ. ID 0180    CCAUGAAAGCUUACAUCAA
DB 60       SEQ. ID 0181    CAUGAAAGCUUACAUCAAC
DB 61       SEQ. ID 0182    AUGAAAGCUUACAUCAACA
DB 62       SEQ. ID 0183    UGAAAGCUUACAUCAACAA
DB 63       SEQ. ID 0184    GAAAGCUUACAUCAACAAA
DB 64       SEQ. ID 0185    AAAGCUUACAUCAACAAAG
DB 65       SEQ. ID 0186    AAGCUUACAUCAACAAAGU
DB 66       SEQ. ID 0187    AGCUUACAUCAACAAAGUA
DB 67       SEQ. ID 0188    GCUUACAUCAACAAAGUAG
DB 68       SEQ. ID 0189    CUUACAUCAACAAAGUAGA
DB 69       SEQ. ID 0190    UUACAUCAACAAAGUAGAA
DB 70       SEQ. ID 0191    UACAUCAACAAAGUAGAAG
DB 71       SEQ. ID 0192    ACAUCAACAAAGUAGAAGA
DB 72       SEQ. ID 0193    CAUCAACAAAGUAGAAGAG
DB 73       SEQ. ID 0194    AUCAACAAAGUAGAAGAGC
DB 74       SEQ. ID 0195    UCAACAAAGUAGAAGAGCU
DB 75       SEQ. ID 0196    CAACAAAGUAGAAGAGCUA
DB 76       SEQ. ID 0197    AACAAAGUAGAAGAGCUAA
DB 77       SEQ. ID 0198    ACAAAGUAGAAGAGCUAAA
DB 78       SEQ. ID 0199    CAAAGUAGAAGAGCUAAAG
DB 79       SEQ. ID 0200    AAAGUAGAAGAGCUAAAGA
DB 80       SEQ. ID 0201    AAGUAGAAGAGCUAAAGAA
DB 81       SEQ. ID 0202    AGUAGAAGAGCUAAAGAAA
DB 82       SEQ. ID 0203    GUAGAAGAGCUAAAGAAAA
DB 83       SEQ. ID 0204    UAGAAGAGCUAAAGAAAAA
DB 84       SEQ. ID 0205    AGAAGAGCUAAAGAAAAAA
DB 85       SEQ. ID 0206    GAAGAGCUAAAGAAAAAAU
DB 86       SEQ. ID 0207    AAGAGCUAAAGAAAAAAUA
DB 87       SEQ. ID 0208    AGAGCUAAAGAAAAAAUAC
DB 88       SEQ. ID 0209    GAGCUAAAGAAAAAAUACG
DB 89       SEQ. ID 0210    AGCUAAAGAAAAAAUACGG
DB 90       SEQ. ID 0211    GCUAAAGAAAAAAUACGGG
Luc 1       SEQ. ID 0212    AUCCUCAUAAAGGCCAAGA
Luc 2       SEQ. ID 0213    AGAUCCUCAUAAAGGCCAA
Luc 3       SEQ. ID 0214    AGAGAUCCUCAUAAAGGCC
Luc 4       SEQ. ID 0215    AGAGAGAUCCUCAUAAAGG
Luc 5       SEQ. ID 0216    UCAGAGAGAUCCUCAUAAA
Luc 6       SEQ. ID 0217    AAUCAGAGAGAUCCUCAUA
Luc 7       SEQ. ID 0218    AAAAUCAGAGAGAUCCUCA
Luc 8       SEQ. ID 0219    GAAAAAUCAGAGAGAUCCU
Luc 9       SEQ. ID 0220    AAGAAAAAUCAGAGAGAUC
Luc 10      SEQ. ID 0221    GCAAGAAAAAUCAGAGAGA
Luc 11      SEQ. ID 0222    ACGCAAGAAAAAUCAGAGA
Luc 12      SEQ. ID 0223    CGACGCAAGAAAAAUCAGA
Luc 13      SEQ. ID 0224    CUCGACGCAAGAAAAAUCA
Luc 14      SEQ. ID 0225    AACUCGACGCAAGAAAAAU
Luc 15      SEQ. ID 0226    AAAACUCGACGCAAGAAAA
Luc 16      SEQ. ID 0227    GGAAAACUCGACGCAAGAA
Luc 17      SEQ. ID 0228    CCGGAAAACUCGACGCAAG
Luc 18      SEQ. ID 0229    UACCGGAAAACUCGACGCA
Luc 19      SEQ. ID 0230    CUUACCGGAAAACUCGACG
Luc 20      SEQ. ID 0231    GUCUUACCGGAAAACUCGA
Luc 21      SEQ. ID 0232    AGGUCUUACCGGAAAACUC
Luc 22      SEQ. ID 0233    AAAGCUCUUACCGGAAAAC
Luc 23      SEQ. ID 0234    CGAAAGGUCUUACCGGAAA
Luc 24      SEQ. ID 0235    ACCGAAAGGUCUUACCGGA
Luc 25      SEQ. ID 0236    GUACCGAAAGGUCUUACCG
Luc 26      SEQ. ID 0237    AAGUACCGAAAGGUCUUAC
Luc 27      SEQ. ID 0238    CGAAGUACCGAAAGGUCUU
Luc 28      SEQ. ID 0239    GACGAAGUACCGAAAGGUC
Luc 29      SEQ. ID 0240    UGGACGAAGUACCGAAAGG
Luc 30      SEQ. ID 0241    UGUGGACGAAGUACCGAAA
Luc 31      SEQ. ID 0242    UUUGUGGACGAAGUACCGA
Luc 32      SEQ. ID 0243    UGUUUGUGGACGAAGUACC
Luc 33      SEQ. ID 0244    UGUGUUUGUGGACGAAGUA
Luc 34      SEQ. ID 0245    GUUGUGUUUGUGGACGAAG
Luc 35      SEQ. ID 0246    GAGUUGUGUUUGUGGACGA
Luc 36      SEQ. ID 0247    AGGAGUUGUGUUUGUGGAC
Luc 37      SEQ. ID 0248    GGAGGAGUUGUGUUUGUGG
Luc 38      SEQ. ID 0249    GCGGAGGAGUUGUGUUUGU
Luc 39      SEQ. ID 0250    GCGCGGAGGAGUUGUGUUU
Luc 40      SEQ. ID 0251    UUGCGCGGAGGAGUUGUGU
Luc 41      SEQ. ID 0252    AGUUGCGCGGAGGAGUUGU
Luc 42      SEQ. ID 0253    AAAGUUGCGCGGAGGAGUU
Luc 43      SEQ. ID 0254    AAAAAGUUGCGCGGAGGAG
Luc 44      SEQ. ID 0255    CGAAAAAGUUGCGCGGAGG
Luc 45      SEQ. ID 0256    CGCGAAAAAGUUGCGCGGA
Luc 46      SEQ. ID 0257    ACCGCGAAAAAGUUGCGCG
Luc 47      SEQ. ID 0258    CAACCGCGAAAAAGUUGCG
Luc 48      SEQ. ID 0259    AACAACCGCGAAAAAGUUG
Luc 49      SEQ. ID 0260    GUAACAACCGCGAAAAAGU
Luc 50      SEQ. ID 0261    AAGUAACAACCGCGAAAAA
Luc 51      SEQ. ID 0262    UCAAGUAACAACCGCGAAA
Luc 52      SEQ. ID 0263    AGUCAAGUAACAACCGCGA
Luc 53      SEQ. ID 0264    CCAGUCAAGUAACAACCGC
Luc 54      SEQ. ID 0265    CGCCAGUCAAGUAACAACC
Luc 55      SEQ. ID 0266    GUCGCCAGUCAAGUAACAA
Luc 56      SEQ. ID 0267    ACGUCGCCAGUCAAGUAAC
Luc 57      SEQ. ID 0268    UUACGUCGCCAGUCAAGUA
Luc 58      SEQ. ID 0269    GAUUACGUCGCCAGUCAAG
Luc 59      SEQ. ID 0270    UGGAUUACGUCGCCAGUCA
Luc 60      SEQ. ID 0271    CGUGGAUUACGUCGCCAGU
Luc 61      SEQ. ID 0272    AUCGUGGAUUACGUCGCCA
Luc 62      SEQ. ID 0273    AGAUCGUGGAUUACGUCGC
Luc 63      SEQ. ID 0274    AGAGAUCGUGGAUUACGUC
Luc 64      SEQ. ID 0275    AAAGAGAUCGUGGAUUACG
Luc 65      SEQ. ID 0276    AAAAAGAGAUCGUGGAUUA
Luc 66      SEQ. ID 0277    GGAAAAAGAGAUCGUGGAU
Luc 67      SEQ. ID 0278    ACGGAAAAAGAGAUCGUGG
Luc 68      SEQ. ID 0279    UGACGGAAAAAGAGAUCGU
Luc 69      SEQ. ID 0280    GAUGACGGAAAAAGAGAUC
Luc 70      SEQ. ID 0281    ACGAUGACGGAAAAAGAGA
Luc 71      SEQ. ID 0282    AGACGAUGACGGAAAAAGA
Luc 72      SEQ. ID 0283    AAAGACGAUGACGGAAAAA
Luc 73      SEQ. ID 0284    GGAAAGACGAUGACGGAAA
Luc 74      SEQ. ID 0285    ACGGAAAGACGAUGACGGA
Luc 75      SEQ. ID 0286    GCACGGAAAGACGAUGACG
Luc 76      SEQ. ID 0287    GAGCACGGAAAGACGAUGA
Luc 77      SEQ. ID 0288    UGGAGCACGGAAAGACGAU
Luc 78      SEQ. ID 0289    UUUGGAGCACGGAAAGACG
Luc 79      SEQ. ID 0290    GUUUUGGAGCACGGAAAGA
Luc 80      SEQ. ID 0291    UUGUUUUGGAGCACGGAAA
Luc 81      SEQ. ID 0292    UGUUGUUUUGGAGCACGGA
Luc 82      SEQ. ID 0293    GUUGUUGUUUUGGAGCACG
Luc 83      SEQ. ID 0294    CCGUUGUUGUUUUGGAGCA
Luc 84      SEQ. ID 0295    CGCCGUUGUUGUUUUGGAG
Luc 85      SEQ. ID 0296    GCCGCCGUUGUUGUUUUGG
Luc 86      SEQ. ID 0297    CCGCCGCCGUUGUUGUUUU
Luc 87      SEQ. ID 0298    UCCCGCCGCCGUUGUUGUU
Luc 88      SEQ. ID 0299    CUUCCCGCCGCCGUUGUUG
Luc 89      SEQ. ID 0300    AACUUCCCGCCGCCGUUGU
Luc 90      SEQ. ID 0301    UGAACUUCCCGCCGCCGUU

Example II Validation of the Algorithm Using DBI, Luciferase, PLK, EGFR, and SEAP

The algorithm (Formula VIII) identified siRNAs for five genes: human DBI, firefly luciferase (fLuc), renilla luciferase (rLuc), human PLK, and human secreted alkaline phosphatase (SEAP). Four individual siRNAs were selected on the basis of their SMARTSCORES™ derived by analysis of their sequence using Formula VIII (all of the siRNAs would be selected with Formula IX as well) and analyzed for their ability to silence their targets' expression. In addition to the scoring, a BLAST search was conducted for each siRNA. To minimize the potential for off-target silencing effects, only those target sequences with more than three mismatches against unrelated sequences were selected (Semizarov, et al. (2003) Specificity of short interfering RNA determined through gene expression signatures, Proc. Natl. Acad. Sci. USA, 100:6347). These duplexes were analyzed individually and in pools of 4 and compared with several siRNAs that were randomly selected. The functionality was measured as a percentage of targeted gene knockdown as compared to controls. All siRNAs were transfected as described by the methods above at 100 nM concentration into HEK293 using Lipofectamine 2000. The level of the targeted gene expression was evaluated by bDNA as described above and normalized to the non-specific control. FIG. 10 shows that the siRNAs selected by the algorithm disclosed herein were significantly more potent than randomly selected siRNAs. The algorithm increased the chances of identifying an F50 siRNA from 48% to 91%, and an F80 siRNA from 13% to 57%. In addition, pools of SMART siRNA silence the selected target better than randomly selected pools (see FIG. 10F).
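The specificity criterion above (more than three mismatches against unrelated sequences) can be illustrated with a simple ungapped comparison. This is a stand-in for illustration only: it is not BLAST and does not reproduce BLAST scoring, and the function names and toy transcripts are hypothetical.

```python
def mismatches(a: str, b: str) -> int:
    """Count positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(x != y for x, y in zip(a, b))


def min_mismatches(candidate: str, transcript: str) -> int:
    """Fewest mismatches between the candidate and any ungapped window."""
    n = len(candidate)
    windows = (transcript[i:i + n] for i in range(len(transcript) - n + 1))
    # A transcript shorter than the candidate offers no window at all.
    return min((mismatches(candidate, w) for w in windows), default=n)


def passes_specificity(candidate: str, unrelated: list[str], threshold: int = 3) -> bool:
    """True if every unrelated transcript differs by more than `threshold` bases."""
    return all(min_mismatches(candidate, t) > threshold for t in unrelated)


candidate = "GUUCCAAAAACAGUGGAUA"              # Cyclo 1, Table III
near_match = "CCCAUUCCAAAACCAGUGGAUAGGG"       # toy transcript, 2 mismatches
distant = "C" * 25                             # toy transcript, no close window

print(passes_specificity(candidate, [distant]))
print(passes_specificity(candidate, [distant, near_match]))
```

The first call accepts the candidate because no window in the toy transcript comes within three mismatches; the second rejects it because one toy transcript contains a window with only two.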

Example III Validation of the Algorithm Using Genes Involved in Clathrin-Dependent Endocytosis

Components of the clathrin-mediated endocytosis pathway are key to modulating intracellular signaling and play important roles in disease. Chromosomal rearrangements that result in fusion transcripts between the Mixed-Lineage Leukemia gene (MLL) and CALM (clathrin assembly lymphoid myeloid leukemia gene) are believed to play a role in leukemogenesis. Similarly, disruptions in Rab7 and Rab9, as well as HIP1 (Huntingtin-interacting protein), genes that are believed to be involved in endocytosis, are potentially responsible for ailments resulting in lipid storage and neuronal diseases, respectively. For these reasons, siRNAs directed against clathrin and other genes involved in the clathrin-mediated endocytotic pathway are potentially important research and therapeutic tools.

siRNAs directed against genes involved in the clathrin-mediated endocytosis pathways were selected using Formula VIII. The targeted genes were clathrin heavy chain (CHC, accession # NM_004859), clathrin light chain A (CLCa, NM_001833), clathrin light chain B (CLCb, NM_001834), CALM (U45976), β2 subunit of AP-2 (β2, NM_001282), Eps15 (NM_001981), Eps15R (NM_021235), dynamin II (DYNII, NM_004945), Rab5a (BC001267), Rab5b (NM_002868), Rab5c (AF141304), and EEA.1 (XM_018197).

For each gene, the four siRNA duplexes with the highest scores were selected, and a BLAST search was conducted for each of them using the Human EST database. To minimize the potential for off-target silencing effects, only those sequences with more than three mismatches against unrelated sequences were used. All duplexes were synthesized at Dharmacon, Inc. as 21-mers with 3′-UU overhangs using a modified method of 2′-ACE chemistry (Scaringe (2000) Advanced 5′-silyl-2′-orthoester approach to RNA oligonucleotide synthesis, Methods Enzymol. 317:3), and the antisense strand was chemically phosphorylated to ensure maximized activity.
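As an illustration of the duplex format described above (21-mers with 3′-UU overhangs), the following sketch builds both strands from a 19-nucleotide sense core. It assumes the antisense core is the exact reverse complement of the sense core; the function names are hypothetical and the chemical phosphorylation step is not modeled.

```python
from typing import Tuple

# RNA base-pairing table used to complement a strand.
_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(rna: str) -> str:
    """Return the reverse complement of an RNA sequence, written 5'->3'."""
    return rna.translate(_COMPLEMENT)[::-1]


def make_duplex(sense_core: str, overhang: str = "UU") -> Tuple[str, str]:
    """Return (sense, antisense) 21-mers, each written 5'->3'."""
    core = sense_core.upper()
    if len(core) != 19 or set(core) - set("ACGU"):
        raise ValueError("expected a 19-nt RNA sequence of A, C, G, U")
    return core + overhang, reverse_complement(core) + overhang


# Example with Bcl2 siRNA 1 from this document (sense strand, 5'->3').
sense, antisense = make_duplex("GGGAGAUAGUGAUGAAGUA")
```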

HeLa cells were grown in Dulbecco's modified Eagle's medium (DMEM) containing 10% fetal bovine serum, antibiotics and glutamine. siRNA duplexes were resuspended in 1× siRNA Universal buffer (Dharmacon, Inc.) to 20 μM prior to transfection. HeLa cells in 12-well plates were transfected twice with 4 μl of 20 μM siRNA duplex in 3 μl Lipofectamine 2000 reagent (Invitrogen, Carlsbad, Calif., USA) at 24-hour intervals. For the transfections in which 2 or 3 siRNA duplexes were included, the amount of each duplex was decreased, so that the total amount was the same as in transfections with single siRNAs. Cells were plated into normal culture medium 12 hours prior to experiments, and protein levels were measured 2 or 4 days after the first transfection.

Equal amounts of lysates were resolved by electrophoresis, blotted, and stained with the antibody specific to targeted protein, as well as antibodies specific to unrelated proteins, PP1 phosphatase and Tsg101 (not shown). The cells were lysed in Triton X-100/glycerol solubilization buffer as described previously. Tebar, Bohlander, & Sorkin (1999) Clathrin Assembly Lymphoid Myeloid Leukemia (CALM) Protein: Localization in Endocytic-coated Pits, Interactions with Clathrin, and the Impact of Overexpression on Clathrin-mediated Traffic, Mol. Biol. Cell, 10:2687. Cell lysates were electrophoresed, transferred to nitrocellulose membranes, and Western blotting was performed with several antibodies followed by detection using enhanced chemiluminescence system (Pierce, Inc). Several x-ray films were analyzed to determine the linear range of the chemiluminescence signals, and the quantifications were performed using densitometry and AlphaImager v5.5 software (Alpha Innotech Corporation). In experiments with Eps 15R-targeted siRNAs, cell lysates were subjected to immunoprecipitation with Ab860, and Eps 15R was detected in immunoprecipitates by Western blotting as described above.

The antibodies to assess the levels of each protein by Western blot were obtained from the following sources: monoclonal antibody to clathrin heavy chain (TD.1) was obtained from American Type Culture Collection (Rockville, Md., USA); polyclonal antibody to dynamin II was obtained from Affinity Bioreagents, Inc. (Golden, Colo., USA); monoclonal antibodies to EEA.1 and Rab5a were purchased from BD Transduction Laboratories (Los Angeles, Calif., USA); the monoclonal antibody to Tsg101 was purchased from Santa Cruz Biotechnology, Inc. (Santa Cruz, Calif., USA); the monoclonal antibody to GFP was from ZYMED Laboratories Inc. (South San Francisco, Calif., USA); the rabbit polyclonal antibodies Ab32 specific to α-adaptins and Ab20 to CALM were described previously (Sorkin et al. (1995) Stoichiometric Interaction of the Epidermal Growth Factor Receptor with the Clathrin-associated Protein Complex AP-2, J. Biol. Chem., 270:619); the polyclonal antibodies to clathrin light chains A and B were kindly provided by Dr. F. Brodsky (UCSF); monoclonal antibodies to PP1 (BD Transduction Laboratories) and α-actinin (Chemicon) were kindly provided by Dr. M. Dell'Acqua (University of Colorado); Eps15 Ab577 and Eps15R Ab860 were kindly provided by Dr. P. P. Di Fiore (European Cancer Institute).

FIG. 11 demonstrates the in vivo functionality of 48 individual siRNAs, selected using Formula VIII (most of them will meet the criteria incorporated by Formula IX as well) targeting 12 genes. Various cell lines were transfected with siRNA duplexes (Dup1-4) or pools of siRNA duplexes (Pool), and the cells were lysed 3 days after transfection with the exception of CALM (2 days) and β2 (4 days).

Note a β1-adaptin band (part of AP-1 Golgi adaptor complex) that runs slightly slower than β2 adaptin. CALM has two splice variants, 66 and 72 kD. The full-length Eps15R (a doublet of ˜130 kD) and several truncated spliced forms of ˜100 kD and ˜70 kD were detected in Eps 15R immunoprecipitates (shown by arrows). The cells were lysed 3 days after transfection. Equal amounts of lysates were resolved by electrophoresis and blotted with the antibody specific to a targeted protein (GFP antibody for YFP fusion proteins) and the antibody specific to unrelated proteins PP1 phosphatase or α-actinin, and TSG101. The amount of protein in each specific band was normalized to the amount of non-specific proteins in each lane of the gel. Nearly all of them appear to be functional, which establishes that Formula VIII and IX can be used to predict siRNAs' functionality in general in a genome wide manner.
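The lane normalization described above can be written out as a short calculation. This is a sketch under stated assumptions: band intensities are taken as already-quantified densitometry values within the linear range, and the function names and the percent-knockdown definition (relative to a control lane) are illustrative rather than taken from the original analysis.

```python
def normalized_signal(target_band: float, loading_band: float) -> float:
    """Target band intensity divided by the non-specific (loading) band."""
    if loading_band <= 0:
        raise ValueError("loading-control intensity must be positive")
    return target_band / loading_band


def percent_knockdown(treated_target: float, treated_loading: float,
                      control_target: float, control_loading: float) -> float:
    """Percent reduction of normalized target signal versus the control lane."""
    treated = normalized_signal(treated_target, treated_loading)
    control = normalized_signal(control_target, control_loading)
    if control <= 0:
        raise ValueError("control lane has no target signal")
    return 100.0 * (1.0 - treated / control)
```

For instance, a treated lane with one fifth of the control's normalized signal corresponds to 80% knockdown.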

To generate the fusion of yellow fluorescent protein (YFP) with Rab5b or Rab5c (YFP-Rab5b or YFP-Rab5c), a DNA fragment encoding the full-length human Rab5b or Rab5c was obtained by PCR using Pfu polymerase (Stratagene) with a SacI restriction site introduced into the 5′ end and a KpnI site into the 3′ end and cloned into pEYFP-C1 vector (CLONTECH, Palo Alto, Calif., USA). GFP-CALM and YFP-Rab5a were described previously (Tebar, Bohlander, & Sorkin (1999) Clathrin Assembly Lymphoid Myeloid Leukemia (CALM) Protein: Localization in Endocytic-coated Pits, Interactions with Clathrin, and the Impact of Overexpression on Clathrin-mediated Traffic, Mol. Biol. Cell 10:2687).

Example IV Validation of the Algorithm Using Eg5, GAPDH, ATE1, MEK2, MEK1, QB, Lamin A/C, C-MYC, Human Cyclophilin, and Mouse Cyclophilin

A number of genes have been identified as playing potentially important roles in disease etiology. Expression profiles of normal and diseased kidneys have implicated Edg5 in immunoglobulin A nephropathy, a common renal glomerular disease. Myc1, MEK1/2 and other related kinases have been associated with one or more cancers, while lamins have been implicated in muscular dystrophy and other diseases. For these reasons, siRNA directed against the genes encoding these classes of molecules would be important research and therapeutic tools.

FIG. 12 illustrates four siRNAs targeting 10 different genes (Table V for sequence and accession number information) that were selected according to the Formula VIII and assayed as individuals and pools in HEK293 cells. The level of siRNA induced silencing was measured using the B-DNA assay. These studies demonstrated that thirty-six out of the forty individual SMART-selected siRNA tested are functional (90%) and all 10 pools are fully functional.

Example V Validation of the Algorithm Using Bcl2

Bcl-2 is a ˜25 kD, 205-239 amino acid, anti-apoptotic protein that contains considerable homology with other members of the BCL family including BCLX, MCL1, BAX, BAD, and BIK. The protein exists in at least two forms (Bcl2a, which has a hydrophobic tail for membrane anchorage, and Bcl2b, which lacks the hydrophobic tail) and is predominantly localized to the mitochondrial membrane. While Bcl2 expression is widely distributed, particular interest has focused on the expression of this molecule in B and T cells. Bcl2 expression is down-regulated in normal germinal center B cells yet in a high percentage of follicular lymphomas, Bcl2 expression has been observed to be elevated. Cytological studies have identified a common translocation ((14;18)(q32;q32)) amongst a high percentage (>70%) of these lymphomas. This genetic lesion places the Bcl2 gene in juxtaposition to immunoglobulin heavy chain gene (IgH) encoding sequences and is believed to enforce inappropriate levels of gene expression, and resistance to programmed cell death in the follicle center B cells. In other cases, hypomethylation of the Bcl2 promoter leads to enhanced expression and again, inhibition of apoptosis. In addition to cancer, dysregulated expression of Bcl-2 has been correlated with multiple sclerosis and various neurological diseases.

The correlation between Bcl-2 translocation and cancer makes this gene an attractive target for RNAi. Identification of siRNA directed against the bcl2 transcript (or Bcl2-IgH fusions) would further our understanding of Bcl2 gene function and possibly provide a future therapeutic agent to battle diseases that result from altered expression or function of this gene.

In Silico Identification of Functional siRNA

To identify functional and hyperfunctional siRNA against the Bcl2 gene, the sequence for Bcl-2 was downloaded from the NCBI Unigene database and analyzed using the Formula VIII algorithm. As a result of these procedures, both the sequences and the SMARTSCORES™, or siRNA rankings, of the Bcl2 siRNA were obtained and ranked according to their functionality. Subsequently, these sequences were BLAST'ed (database) to ensure that the selected sequences were specific and contained minimal overlap with unrelated genes. The SMARTSCORES™, or siRNA rankings, for the top 10 Bcl-2 siRNA are identified in FIG. 13.

In Vivo Testing of Bcl-2 SiRNA

Bcl-2 siRNAs having the top ten SMARTSCORES™, or siRNA rankings, were selected and tested in a functional assay to determine silencing efficiency. To accomplish this, each of the ten duplexes was synthesized using 2′-O-ACE chemistry and transfected at 100 nM concentration into cells. Twenty-four hours later, assays were performed on cell extracts to assess the degree of target silencing. Controls used in these experiments included mock-transfected cells and cells that were transfected with a non-specific siRNA duplex.

The results of these experiments are presented below (and in FIG. 14) and show that all ten of the selected siRNA induce 80% or better silencing of the Bcl2 message at 100 nM concentrations. These data verify that the algorithm successfully identified functional Bcl2 siRNA and provide a set of functional agents that can be used in experimental and therapeutic environments.

siRNA 1 | GGGAGAUAGUGAUGAAGUA | SEQ. ID NO. 302
siRNA 2 | GAAGUACAUCCAUUAUAAG | SEQ. ID NO. 303
siRNA 3 | GUACGACAACCGGGAGAUA | SEQ. ID NO. 304
siRNA 4 | AGAUAGUGAUCAAGUACAU | SEQ. ID NO. 305
siRNA 5 | UGAAGACUCUGCUCAGUUU | SEQ. ID NO. 306
siRNA 6 | GCAUGCGGCCUCUGUUUGA | SEQ. ID NO. 307
siRNA 7 | UGCGGCCUCUGUUUGAUUU | SEQ. ID NO. 308
siRNA 8 | GAGAUAGUGAUGAAGUACA | SEQ. ID NO. 309
siRNA 9 | GGAGAUAGUGAUGAAGUAC | SEQ. ID NO. 310
siRNA 10 | GAAGACUCUGCUCAGUUUG | SEQ. ID NO. 311

Bcl2 siRNA: Sense Strand, 5′→3′

Example VI Sequences Selected by the Algorithm

Sequences of the siRNAs selected using Formulas (Algorithms) VIII and IX with their corresponding ranking, which have been evaluated for the silencing activity in vivo in the present study (Formula VIII and IX, respectively) are shown in Table V. It should be noted that the “t” residues in Table V, and elsewhere, when referring to siRNA, should be replaced by “u” residues.

TABLE V
GENE | Name | SEQ. ID No. | FULL SEQUENCE | FORMULA VIII | FORMULA IX
CLTC | NM_004859 | 0312 | GAAAGAATCTGTAGAGAAA | 76 | 94.2
CLTC | NM_004859 | 0313 | GCAATGAGCTGTTTGAAGA | 65 | 39.9
CLTC | NM_004859 | 0314 | TGACAAAGGTGGATAAATT | 57 | 38.2
CLTC | NM_004859 | 0315 | GGAAATGGATCTCTTTGAA | 54 | 49.4
CLTA | NM_001833 | 0316 | GGAAAGTAATGGTCCAACA | 22 | 55.5
CLTA | NM_001833 | 0317 | AGACAGTTATGCAGCTATT | 4 | 22.9
CLTA | NM_001833 | 0318 | CCAATTCTCGGAAGCAAGA | 1 | 17
CLTA | NM_001833 | 0319 | GAAAGTAATGGTCCAACAG | −1 | −13
CLTB | NM_001834 | 0320 | GCGCCAGAGTGAACAAGTA | 17 | 57.5
CLTB | NM_001834 | 0321 | GAAGGTGCCCCAGCTATGT | 15 | −8.6
CLTB | NM_001834 | 0322 | GGAACCAGCGCCAGAGTCA | 13 | 40.5
CLTB | NM_001834 | 0323 | GAGCGAGATTGCAGGCATA | 20 | 61.7
CALM | U45976 | 0324 | GTTAGTATCTGATGACTTG | 36 | −34.6
CALM | U45976 | 0325 | GAAATGGAACCACTAAGAA | 33 | 46.1
CALM | U45976 | 0326 | GGAAATGGAACCACTAAGA | 30 | 61.2
CALM | U45976 | 0327 | CAACTACACTTTCCAATGC | 28 | 6.8
EPS15 | NM_001981 | 0328 | CCACCAAGATTTCATGATA | 48 | 25.2
EPS15 | NM_001981 | 0329 | GATCGGAACTCCAACAAGA | 43 | 49.3
EPS15 | NM_001981 | 0330 | AAACGGAGCTACAGATTAT | 39 | 11.5
EPS15 | NM_001981 | 0331 | CCACACAGCATTCTTGTAA | 33 | −23.6
EPS15R | NM_021235 | 0332 | GAAGTTACCTTGAGCAATC | 48 | 33
EPS15R | NM_021235 | 0333 | GGACTTGGCCGATCCAGAA | 27 | 33
EPS15R | NM_021235 | 0334 | GCACTTGGATCGAGATGAG | 20 | 1.3
EPS15R | NM_021235 | 0335 | CAAAGACCAATTCGCGTTA | 17 | 27.7
DNM2 | NM_004945 | 0336 | CCGAATCAATCGCATCTTC | 6 | −29.6
DNM2 | NM_004945 | 0337 | GACATGATCCTGCAGTTCA | 5 | −14
DNM2 | NM_004945 | 0338 | GAGCGAATCGTCACCACTT | 5 | 24
DNM2 | NM_004945 | 0339 | CCTCCGAGCTGGCGTCTAC | −4 | −63.6
ARF6 | AF93885 | 0340 | TCACATGGTTAACCTCTAA | 27 | −21.1
ARF6 | AF93885 | 0341 | GATGAGGGACGCCATAATC | 7 | −38.4
ARF6 | AF93885 | 0342 | CCTCTAACTACAAATCTTA | 4 | 16.9
ARF6 | AF93885 | 0343 | GGAAGGTGCTATCCAAAAT | 4 | 11.5
RAB5A | BC001267 | 0344 | GCAAGCAAGTCCTAACATT | 40 | 25.1
RAB5A | BC001267 | 0345 | GGAAGAGGAGTAGACCTTA | 17 | 50.1
RAB5A | BC001267 | 0346 | AGGAATCAGTGTTGTAGTA | 16 | 11.5
RAB5A | BC001267 | 0347 | GAAGAGGAGTAGACCTTAC | 12 | 7
RAB5B | NM_002868 | 0348 | CAAAGTCAAGCCTGGTATT | 14 | 18.1
RAB5B | NM_002868 | 0349 | AAAGTCAAGCCTGGTATTA | 6 | −17.8
RAB5B | NM_002868 | 0350 | GCTATGAACGTGAATGATC | 3 | −21.1
RAB5C | NM_002868 | 0351 | CAAGCCTGGTATTACGTTT | −7 | −37.5
RAB5C | AF141304 | 0352 | GGAACAAGATCTGTCAATT | 38 | 51.9
RAB5C | AF141304 | 0353 | GCAATGAACGTGAACGAAA | 29 | 43.7
RAB5C | AF141304 | 0354 | CAATGAACGTGAACGAAAT | 18 | 43.3
RAB5C | AF141304 | 0355 | GGACAGGAGCGGTATCACA | 6 | 18.2
EEA1 | XM_018197 | 0356 | AGACAGAGCTTGAGAATAA | 67 | 64.1
EEA1 | XM_018197 | 0357 | GAGAAGATCTTTATGCAAA | 60 | 48.7
EEA1 | XM_018197 | 0358 | GAAGAGAAATCAGCAGATA | 58 | 45.7
EEA1 | XM_018197 | 0359 | GCAAGTAACTCAACTAACA | 56 | 72.3
AP2B1 | NM_001282 | 0360 | GAGCTAATCTGCCACATTG | 49 | −12.4
AP2B1 | NM_001282 | 0361 | GCAGATGAGTTACTAGAAA | 44 | 48.9
AP2B1 | NM_001282 | 0362 | CAACTTAATTGTCCAGAAA | 41 | 28.2
AP2B1 | NM_001282 | 0363 | CAACACAGGATTCTGATAA | 33 | −5.8
PLK | NM_005030 | 0364 | AGATTGTGCCTAAGTCTCT | −35 | −3.4
PLK | NM_005030 | 0365 | ATGAAGATCTGGAGGTGAA | 0 | −4.3
PLK | NM_005030 | 0366 | TTTGACACTTCTTGCCTAA | −5 | −27.7
PLK | NM_005030 | 0367 | AGATCACCCTCCTTAAATA | 15 | 72.3
GAPDH | NM_002046 | 0368 | CAACGGATTTGGTCGTATT | 27 | −2.8
GAPDH | NM_002046 | 0369 | GAAATCCCATCACCATCTT | 24 | 3.9
GAPDH | NM_002046 | 0370 | GACCTCAACTACATGGTTT | 22 | −22.9
GAPDH | NM_002046 | 0371 | TGGTTTACATGTTCCAATA | 9 | 9.8
c-Myc |  | 0372 | GAAGAAATCGATGTTGTTT | 31 | −11.7
c-Myc |  | 0373 | ACACAAACTTGAACAGCTA | 22 | 51.3
c-Myc |  | 0374 | GGAAGAAATCGATGTTGTT | 18 | 26
c-Myc |  | 0375 | GAAACGACGAGAACAGTTG | 18 | −8.9
MAP2K1 | NM_002755 | 0376 | GCACATGGATGGAGGTTCT | 26 | 16
MAP2K1 | NM_002755 | 0377 | GCAGAGAGAGCAGATTTGA | 16 | 0.4
MAP2K1 | NM_002755 | 0378 | GAGGTTCTCTGGATCAAGT | 14 | 15.5
MAP2K1 | NM_002755 | 0379 | GAGCAGATTTGAAGCAACT | 14 | 18.5
MAP2K2 | NM_030662 | 0380 | CAAAGACGATGACTTCGAA | 37 | 26.4
MAP2K2 | NM_030662 | 0381 | GATCAGCATTTGCATGGAA | 24 | −0.7
MAP2K2 | NM_030662 | 0382 | TCCAGGAGTTTGTCAATAA | 17 | −4.5
MAP2K2 | NM_030662 | 0383 | GGAAGCTGATCCACCTTGA | 16 | 59.2
KNSL1 (EG5) | NM_004523 | 0384 | GCAGAAATCTAAGGATATA | 53 | 35.8
KNSL1 (EG5) | NM_004523 | 0385 | CAACAAGGATGAAGTCTAT | 50 | 18.3
KNSL1 (EG5) | NM_004523 | 0386 | CAGCAGAAATCTAAGGATA | 41 | 32.7
KNSL1 (EG5) | NM_004523 | 0387 | CTAGATGGCTTTCTCAGTA | 39 | 3.9
CyclophilinA | NM_021130 | 0388 | AGACAAGGTCCCAAAGACA | −16 | 58.1
CyclophilinA | NM_021130 | 0389 | GGAATGGCAAGACCAGCAA | −6 | 36
CyclophilinA | NM_021130 | 0390 | AGAATTATTCCAGGGTTTA | −3 | 16.1
CyclophilinA | NM_021130 | 0391 | GCAGACAAGGTCCCAAAGA | 8 | 8.9
LAMIN A/C | NM_170707 | 0392 | AGAAGCAGCTTCAGGATGA | 31 | 38.8
LAMIN A/C | NM_170707 | 0393 | GAGCTTGACTTCCAGAAGA | 33 | 22.4
LAMIN A/C | NM_170707 | 0394 | CCACCGAAGTTCACCCTAA | 21 | 27.5
LAMIN A/C | NM_170707 | 0395 | GAGAAGAGCTCCTCCATCA | 55 | 30.1
CyclophilinB | M60857 | 0396 | GAAAGAGCATCTACGGTGA | 41 | 83.9
CyclophilinB | M60857 | 0397 | GAAAGGATTTGGCTACAAA | 53 | 59.1
CyclophilinB | M60857 | 0398 | ACAGCAAATTCCATCGTGT | −20 | 28.8
CyclophilinB | M60857 | 0399 | GGAAAGACTGTTCCAAAAA | 2 | 27
DBI1 | NM_020548 | 0400 | CAACACGCCTCATCCTCTA | 27 | −7.6
DBI2 | NM_020548 | 0401 | CATGAAAGCTTACATCAAC | 25 | −30.8
DBI3 | NM_020548 | 0402 | AAGATGCCATGAAAGCTTA | 17 | 22
DBI4 | NM_020548 | 0403 | GCACATACCGCCTGAGTCT | 15 | 3.9
rLUC1 |  | 0404 | GATCAAATCTGAAGAAGGA | 57 | 49.2
rLUC2 |  | 0405 | GCCAAGAAGTTTCCTAATA | 50 | 13.7
rLUC3 |  | 0406 | CAGCATATCTTGAACCATT | 41 | −2.2
rLUC4 |  | 0407 | GAACAAAGGAAACGGATGA | 39 | 29.2
SeAP1 | NM_031313 | 0408 | CGGAAACGGTCCAGGCTAT | 6 | 26.9
SeAP2 | NM_031313 | 0409 | GCTTCGAGCAGACATGATA | 4 | −11.2
SeAP3 | NM_031313 | 0410 | CCTACACGGTCCTCCTATA | 4 | 4.9
SeAP4 | NM_031313 | 0411 | GCCAAGAACCTCATCATCT | 1 | −9.9
fLUC1 |  | 0412 | GATATGGGCTGAATACAAA | 54 | 40.4
fLUC2 |  | 0413 | GCACTCTGATTGACAAATA | 47 | 54.7
fLUC3 |  | 0414 | TGAAGTCTCTGATTAAGTA | 46 | 34.5
fLUC4 |  | 0415 | TCAGAGAGATCCTCATAAA | 40 | 11.4
mCyclo_1 | NM_008907 | 0416 | GCAAGAAGATCACCATTTC | 52 | 46.4
mCyclo_2 | NM_008907 | 0417 | GAGAGAAATTTGAGGATGA | 36 | 70.7
mCyclo_3 | NM_008907 | 0418 | GAAAGGATTTGGCTATAAG | 35 | −1.5
mCyclo_4 | NM_008907 | 0419 | GAAAGAAGGCATGAACATT | 27 | 10.3
BCL2_1 | NM_000633 | 0420 | GGGAGATAGTGATGAAGTA | 21 | 72
BCL2_2 | NM_000633 | 0421 | GAAGTACATCCATTATAAG | 1 | 3.3
BCL2_3 | NM_000633 | 0422 | GTACGACAACCGGGAGATA | 1 | 35.9
BCL2_4 | NM_000633 | 0423 | AGATAGTGATGAAGTACAT | −12 | 22.1
BCL2_5 | NM_000633 | 0424 | TGAAGACTCTGCTCAGTTT | 36 | 19.1
BCL2_6 | NM_000633 | 0425 | GCATGCGGCCTCTGTTTGA | 5 | −9.7
QB1 | NM_003365.1 | 0426 | GCACACAGCUUACUACAUC | 52 | −4.8
QB2 | NM_003365.1 | 0427 | GAAAUGCCCUGGUAUCUCA | 49 | 22.1
QB3 | NM_003365.1 | 0428 | GAAGGAACGUGAUGUGAUC | 34 | 22.9
QB4 | NM_003365.1 | 0429 | GCACUACUCCUGUGUGUGA | 28 | 20.4
ATE1-1 | NM_007041 | 0430 | GAACCCAGCUGGAGAACUU | 45 | 15.5
ATE1-2 | NM_007041 | 0431 | GAUAUACAGUGUGAUCUUA | 40 | 12.2
ATE1-3 | NM_007041 | 0432 | GUACUACGAUCCUGAUUAU | 37 | 32.9
ATE1-4 | NM_007041 | 0433 | GUGCCGACCUUUACAAUUU | 35 | 18.2
EGFR-1 | NM_005228 | 0434 | GAAGGAAACTGAATTCAAA | 68 | 79.4
EGFR-1 | NM_005228 | 0435 | GGAAATATGTACTACGAAA | 49 | 49.5
EGFR-1 | NM_005228 | 0436 | CCACAAAGCAGTGAATTTA | 41 | 7.6
EGFR-1 | NM_005228 | 0437 | GTAACAAGCTCACGCAGTT | 40 | 25.9
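The note preceding Table V states that "t" residues should be read as "u" when the sequence refers to siRNA. A minimal sketch of that conversion follows; the function name is hypothetical, and entries already written with U are passed through unchanged.

```python
def table_v_to_rna(seq: str) -> str:
    """Convert a Table V entry (DNA alphabet) to its siRNA (RNA) spelling."""
    cleaned = seq.strip().upper()
    if not cleaned or set(cleaned) - set("ACGTU"):
        raise ValueError("expected a sequence of A, C, G, T or U")
    return cleaned.replace("T", "U")


# BCL2_1 from Table V becomes the siRNA 1 sense strand listed in Example V.
bcl2_1_rna = table_v_to_rna("GGGAGATAGTGATGAAGTA")
```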

Many of the genes to which the described siRNA are directed play critical roles in disease etiology. For this reason, the siRNAs listed in the sequence listing may potentially act as therapeutic agents. A number of prophetic examples follow and should be understood in view of the siRNA that are identified in the sequence listing. To isolate these siRNAs, the appropriate message sequence for each gene is analyzed using one of the aforementioned formulas (preferably Formula VIII) to identify potential siRNA targets. Subsequently, these targets are BLAST'ed to eliminate homology with potential off-targets.

Example VII Evidence for the Benefits of Pooling

Evidence for the benefits of pooling has been demonstrated using the reporter gene luciferase. Ninety siRNA duplexes were synthesized using Dharmacon proprietary ACE® chemistry against one of the standard reporter genes: firefly luciferase. The duplexes were designed to start two base pairs apart and to cover approximately 180 base pairs of the luciferase gene (see sequences in Table III). Subsequently, the siRNA duplexes were co-transfected with a luciferase expression reporter plasmid into HEK293 cells using standard transfection protocols, and luciferase activity was assayed at 24 and 48 hours.
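The tiling design described above (ninety duplexes whose start positions are two bases apart) can be sketched as follows. The 19-nucleotide window length matches the sequences listed in Table III; the function names and the zero-based start offset are illustrative assumptions.

```python
from typing import List, Tuple


def tile_windows(target: str, length: int = 19, step: int = 2,
                 count: int = 90, start: int = 0) -> List[Tuple[int, str]]:
    """Return (start_index, window) pairs for windows spaced `step` bases apart."""
    windows = []
    for k in range(count):
        begin = start + k * step
        end = begin + length
        if end > len(target):
            break  # stop once a window would run past the target
        windows.append((begin, target[begin:end]))
    return windows


def span_covered(length: int = 19, step: int = 2, count: int = 90) -> int:
    """Number of target bases spanned from the first to the last window."""
    return length + step * (count - 1)
```

With these defaults the start positions run over 178 bases and the windows together span 197 bases, which is consistent with the "approximately 180 base pairs" stated in the text.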

Transfection of individual siRNAs showed standard distribution of inhibitory effect. Some duplexes were active, while others were not. FIG. 15 represents a typical screen of ninety siRNA duplexes (SEQ. ID NO. 0032-0120) positioned two base pairs apart. As the figure suggests, the functionality of the siRNA duplex is determined more by a particular sequence of the oligonucleotide than by the relative oligonucleotide position within a gene or excessively sensitive part of the mRNA, which is important for traditional anti-sense technology.

When two continuous oligonucleotides were pooled together, a significant increase in gene silencing activity was observed (see FIGS. 16A and B). A gradual increase in efficacy and the frequency of pools functionality was observed when the number of siRNAs increased to 3 and 4 (FIGS. 16A, 16B, 17A, and 17B). Further, the relative positioning of the oligonucleotides within a pool did not determine whether a particular pool was functional (see FIGS. 18A and 18B, in which 100% of pools of oligonucleotides distanced by 2, 10 and 20 base pairs were functional).

However, relative positioning may nonetheless have an impact. An increased functionality may exist when the siRNA are positioned continuously head to toe (5′ end of one directly adjacent to the 3′ end of the others).

Additionally, siRNA pools that were tested performed at least as well as the best oligonucleotide in the pool, under the experimental conditions whose results are depicted in FIG. 19. Moreover, when previously identified non-functional and marginally functional (semi-functional) siRNA duplexes were pooled together in groups of five at a time, a significant functional cooperative action was observed (see FIG. 20). In fact, pools of semi-active oligonucleotides were 5 to 25 times more functional than the most potent oligonucleotide in the pool. Therefore, pooling several siRNA duplexes together does not interfere with the functionality of the most potent siRNAs within a pool, and pooling provides an unexpected significant increase in overall functionality.

Example VIII Additional Evidence of the Benefits of Pooling

Experiments performed on the following genes also demonstrate the benefits of pooling: β-galactosidase, Renilla luciferase, and secreted alkaline phosphatase (see FIGS. 21A, 21B and 21C). Individual siRNAs and pools of siRNA (described in the legends of FIGS. 21A-C) were transfected into cells and tested for silencing efficiency. Approximately 50% of individual siRNAs designed to silence the above-specified genes were functional, while 100% of the pools that contain the same siRNA duplexes were functional.

Example IX Highly Functional siRNA

Pools of five siRNAs in which each two siRNAs overlap to 10-90% resulted in 98% functional entities (>80% silencing). Pools of siRNAs distributed throughout the mRNA that were evenly spaced, covering an approximate 20-2000 base pair range, were also functional. When the pools of siRNA were positioned continuously head to tail relative to mRNA sequences and mimicked the natural products of Dicer cleaved long double stranded RNA, 98% of the pools evidenced highly functional activity (>95% silencing).

Example X Human Cyclophilin B

Table III above lists the siRNA sequences for the human cyclophilin B protein. A particularly functional siRNA may be selected by applying these sequences to any of Formula I to VII above.

Alternatively, one could pool 2, 3, 4, 5 or more of these sequences to create a kit for silencing a gene. Preferably, within the kit there would be at least one sequence that has a relatively high predicted functionality when any of Formulas I-VII is applied.

Example XI Sample Pools of siRNAs and their Application to Human Disease

The genetic basis behind human disease is well documented and siRNA may be used as both research or diagnostic tools and therapeutic agents, either individually or in pools. Genes involved in signal transduction, the immune response, apoptosis, DNA repair, cell cycle control, and a variety of other physiological functions have clinical relevance and therapeutic agents that can modulate expression of these genes may alleviate some or all of the associated symptoms. In some instances, these genes can be described as a member of a family or class of genes and siRNA (randomly, conventionally, or rationally designed) can be directed against one or multiple members of the family to induce a desired result.

To obtain rationally designed siRNA for each gene, the sequence was analyzed using Formula VIII or Formula X. To confirm the activity of these sequences, the siRNA are introduced into a cell type of choice (e.g., HeLa cells, HEK293 cells) and the levels of the appropriate message are analyzed using one of several art-proven techniques. siRNA having heightened levels of potency can be identified by testing each of the aforementioned duplexes at increasingly limiting concentrations. Similarly, siRNA having increased levels of longevity can be identified by introducing each duplex into cells and testing functionality at 24, 48, 72, 96, 120, 144, 168, and 192 hours after transfection. Agents that induce >95% silencing at sub-nanomolar concentrations and/or induce functional levels of silencing for >96 hours are considered hyperfunctional.
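The hyperfunctionality criterion stated above can be expressed as a simple check over assay records. This is a sketch only: the record layout and names are hypothetical, and the 80% cut-off used for "functional levels of silencing" is an assumed parameter, since this passage does not fix a number for it.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass
class SilencingRecord:
    concentration_nM: float   # siRNA concentration used in the assay
    hours: float              # time after transfection
    percent_silencing: float  # measured knockdown, 0-100


def is_hyperfunctional(records: Iterable[SilencingRecord],
                       functional_threshold: float = 80.0) -> bool:
    """Apply the two criteria: potency at sub-nanomolar dose, or longevity."""
    potent = False
    durable = False
    for r in records:
        # >95% silencing at a sub-nanomolar concentration
        if r.concentration_nM < 1.0 and r.percent_silencing > 95.0:
            potent = True
        # functional silencing persisting beyond 96 hours (threshold assumed)
        if r.hours > 96.0 and r.percent_silencing >= functional_threshold:
            durable = True
    return potent or durable
```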

Example XII Validation of Multigene Knockout Using Rab5 and Eps

Two or more genes having similar, overlapping functions often lead to genetic redundancy. Mutations that knock out only one of, e.g., a pair of such genes (also referred to as homologs) result in little or no phenotype because the remaining intact gene is capable of fulfilling the role of the disrupted counterpart. To fully understand the function of such genes in cellular physiology, it is often necessary to knock out or knock down both homologs simultaneously. Unfortunately, concomitant knockdown of two or more genes is frequently difficult to achieve in higher organisms (e.g., mice); thus it is necessary to introduce new technologies to dissect gene function. One such approach to knocking down multiple genes simultaneously is by using siRNA. For example, FIG. 11 showed that rationally designed siRNA directed against a number of genes involved in the clathrin-mediated endocytosis pathway resulted in significant levels of protein reduction (e.g., >80%). To determine the effects of gene knockdown on clathrin-related endocytosis, internalization assays were performed using epidermal growth factor and transferrin. Specifically, mouse receptor-grade EGF (Collaborative Research Inc.) and iron-saturated human transferrin (Sigma) were iodinated as described previously (Jiang, X., Huang, F., Marusyk, A. & Sorkin, A. (2003) Mol Biol Cell 14, 858-70). HeLa cells grown in 12-well dishes were incubated with ¹²⁵I-EGF (1 ng/ml) or ¹²⁵I-transferrin (1 μg/ml) in binding medium (DMEM, 0.1% bovine serum albumin) at 37° C., and the ratio of internalized to surface radioactivity was determined during a 5-min time course to calculate the specific internalization rate constant ke as described previously (Jiang, X. et al.).
The uptake of radiolabeled transferrin and EGF was measured using short time-course assays to avoid the influence of recycling on the uptake kinetics, and using low ligand concentrations to avoid saturation of the clathrin-dependent pathway (for EGF, Lund, K. A., Opresko, L. K., Starbuck, C., Walsh, B. J. & Wiley, H. S. (1990) J. Biol. Chem. 265, 15713-15723).
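One common way to obtain a rate constant from such a time course is to take the slope of the internalized-to-surface ratio plotted against time. The sketch below does this with an ordinary least-squares fit; treating ke as that slope is an assumption made for illustration, and the cited procedure (Jiang et al.) should be consulted for the method actually used. The function name is hypothetical.

```python
from typing import Sequence


def internalization_rate_constant(times_min: Sequence[float],
                                  internalized: Sequence[float],
                                  surface: Sequence[float]) -> float:
    """Least-squares slope of (internalized / surface) versus time, per minute."""
    if not (len(times_min) == len(internalized) == len(surface)):
        raise ValueError("all inputs must have the same length")
    if len(times_min) < 2:
        raise ValueError("at least two time points are required")
    ratios = [i / s for i, s in zip(internalized, surface)]
    n = len(times_min)
    mean_t = sum(times_min) / n
    mean_r = sum(ratios) / n
    sxx = sum((t - mean_t) ** 2 for t in times_min)
    if sxx == 0:
        raise ValueError("time points must not all be identical")
    sxy = sum((t - mean_t) * (r - mean_r) for t, r in zip(times_min, ratios))
    return sxy / sxx
```

Comparing the value obtained for siRNA-treated cells with that for control cells gives the relative change in internalization rate.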

The effects of knocking down Rab5a, 5b, 5c, Eps, or Eps15R (individually) are shown in FIG. 22 and demonstrate that disruption of single genes has little or no effect on EGF or Tfn internalization. In contrast, simultaneous knockdown of Rab5a, 5b, and 5c, or Eps and Eps15R, leads to a distinct phenotype (note: the total concentration of siRNA in these experiments was the same as in experiments in which a single siRNA was introduced, see FIG. 23). These experiments demonstrate the effectiveness of using rationally designed siRNA to knock down multiple genes and validate the utility of these reagents to override genetic redundancy.

Example XIII Validation of Multigene Targeting Using G6PD, GAPDH, PLK, and UQC

Further demonstration of the ability to knock down expression of multiple genes using rationally designed siRNA was performed using pools of siRNA directed against four separate genes. To achieve this, siRNA were transfected into cells (total siRNA concentration of 100 nM) and assayed twenty-four hours later by B-DNA. Results shown in FIG. 24 show that pools of rationally designed molecules are capable of simultaneously silencing four different genes.

Example XIV Validation of Multigene Knockouts as Demonstrated by Gene Expression Profiling, a Prophetic Example

To further demonstrate the ability to concomitantly knock down the expression of multiple gene targets, single siRNA or siRNA pools directed against a collection of genes (e.g., 4, 8, 16, or 23 different targets) are simultaneously transfected into cells and cultured for twenty-four hours. Subsequently, mRNA is harvested from treated (and untreated) cells and labeled with one of two fluorescent dyes (e.g., a red fluorescent probe for the treated cells and a green fluorescent probe for the control cells). Equivalent amounts of labeled RNA from each sample are then mixed together and hybridized to sequences that have been linked to a solid support (e.g., a slide, “DNA CHIP”). Following hybridization, the slides are washed and analyzed to assess changes in the levels of target genes induced by siRNA.
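A two-color readout of this kind is often summarized per gene as the log2 ratio of treated (red) to control (green) signal. The sketch below illustrates that summary only; the function names and the choice of a log2 threshold for calling a target silenced are assumptions, not part of the prophetic protocol.

```python
import math
from typing import Dict, List, Tuple


def log2_ratio(treated_signal: float, control_signal: float) -> float:
    """log2 of treated over control signal; negative values mean reduction."""
    if treated_signal <= 0 or control_signal <= 0:
        raise ValueError("signals must be positive")
    return math.log2(treated_signal / control_signal)


def silenced_targets(signals: Dict[str, Tuple[float, float]],
                     max_log2: float = -1.0) -> List[str]:
    """Names of genes whose log2 ratio is at or below the assumed threshold."""
    return sorted(gene for gene, (treated, control) in signals.items()
                  if log2_ratio(treated, control) <= max_log2)
```

A log2 ratio of -2, for example, corresponds to the treated signal being one quarter of the control signal.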

Example XV Identifying Hyperfunctional siRNA

Identification of Hyperfunctional Bcl-2 siRNA

The ten rationally designed Bcl2 siRNA (identified in FIGS. 13 and 14) were tested to identify hyperpotent reagents. To accomplish this, each of the ten Bcl-2 siRNA was individually transfected into cells at a concentration of 300 pM (0.3 nM). Twenty-four hours later, transcript levels were assessed by B-DNA assays and compared with relevant controls. As shown in FIG. 25, while the majority of Bcl-2 siRNA failed to induce functional levels of silencing at this concentration, siRNA 1 and 8 induced >80% silencing, and siRNA 6 exhibited greater than 90% silencing at this subnanomolar concentration.

By way of prophetic examples, similar assays could be performed with any of the groups of rationally designed genes described in the Examples. Thus for instance, rationally designed siRNA sequences directed against a gene of interest could be introduced into cells at increasingly limiting concentrations to determine whether any of the duplexes are hyperfunctional.

Example XVI Gene Silencing: Prophetic Example

Below is an example of how one might transfect a cell.

Select a cell line. The selection of a cell line is usually determined by the desired application. The most important feature to RNAi is the level of expression of the gene of interest. It is highly recommended to use cell lines for which siRNA transfection conditions have been specified and validated.

Plate the cells. Approximately 24 hours prior to transfection, plate the cells at the appropriate density so that they will be approximately 70-90% confluent, or approximately 1×10⁵ cells/ml at the time of transfection. Cell densities that are too low may lead to toxicity due to excess exposure and uptake of transfection reagent-siRNA complexes. Cell densities that are too high may lead to low transfection efficiencies and little or no silencing. Incubate the cells overnight. Standard incubation conditions for mammalian cells are 37° C. in 5% CO₂. Other cell types, such as insect cells, require different temperatures and CO₂ concentrations that are readily ascertainable by persons skilled in the art. Use conditions appropriate for the cell type of interest.

siRNA re-suspension. Add 20 μl siRNA universal buffer to each siRNA to generate a final concentration of 50 μM.

siRNA-lipid complex formation. Use RNase-free solutions and tubes. Prepare the mixtures using the volumes listed in Table XI:

TABLE XI

                                                      96-WELL     24-WELL
MIXTURE 1 (TRANSIT-TKO-PLASMID DILUTION MIXTURE)
  Opti-MEM                                            9.3 μl      46.5 μl
  TransIT-TKO (1 μg/μl)                               0.5 μl      2.5 μl
  MIXTURE 1 FINAL VOLUME                              10.0 μl     50.0 μl
MIXTURE 2 (SIRNA DILUTION MIXTURE)
  Opti-MEM                                            9.0 μl      45.0 μl
  siRNA (1 μM)                                        1.0 μl      5.0 μl
  MIXTURE 2 FINAL VOLUME                              10.0 μl     50.0 μl
MIXTURE 3 (SIRNA-TRANSFECTION REAGENT MIXTURE)
  Mixture 1                                           10 μl       50 μl
  Mixture 2                                           10 μl       50 μl
  MIXTURE 3 FINAL VOLUME                              20 μl       100 μl
  Incubate 20 minutes at room temperature
MIXTURE 4 (MEDIA-SIRNA/TRANSFECTION REAGENT MIXTURE)
  Mixture 3                                           20 μl       100 μl
  Complete media                                      80 μl       400 μl
  MIXTURE 4 FINAL VOLUME                              100 μl      500 μl
  Incubate 48 hours at 37° C.
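The final siRNA concentration implied by Table XI can be checked with a short calculation. The sketch below takes the siRNA volume, the 1 μM stock concentration, and the Mixture 4 final volume from the table; the dictionary layout and function name are hypothetical. The resulting concentration is derived here and is not stated in the table itself.

```python
# Values copied from Table XI; the structure and key names are illustrative.
TABLE_XI = {
    "96-well": {"sirna_ul": 1.0, "sirna_stock_um": 1.0, "mixture4_ul": 100.0},
    "24-well": {"sirna_ul": 5.0, "sirna_stock_um": 1.0, "mixture4_ul": 500.0},
}


def final_sirna_nm(plate_format):
    """Final siRNA concentration (nM) in Mixture 4 for a plate format."""
    row = TABLE_XI[plate_format]
    pmol = row["sirna_ul"] * row["sirna_stock_um"]  # uM x ul = pmol
    conc_um = pmol / row["mixture4_ul"]             # pmol / ul = uM
    return conc_um * 1000.0                         # uM -> nM


final_96 = final_sirna_nm("96-well")
final_24 = final_sirna_nm("24-well")
```

Under these inputs both formats give the same final siRNA concentration, about 10 nM.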

Transfection. Create a Mixture 1 by combining the specified amounts of OPTI-MEM serum-free media and transfection reagent in a sterile polystyrene tube. Create a Mixture 2 by combining the specified amounts of each siRNA with OPTI-MEM media in sterile 1 ml tubes. Create a Mixture 3 by combining the specified amounts of Mixture 1 and Mixture 2. Mix gently (do not vortex) and incubate at room temperature for 20 minutes. Create a Mixture 4 by adding the specified amount of Mixture 3 to complete media. Add the appropriate volume to each cell culture well. Incubate the cells with the transfection reagent mixture for 24-72 hours at 37° C. This incubation time is flexible. The ratio of silencing will remain consistent at any point in the time period. Assay for gene silencing using an appropriate detection method such as RT-PCR, Western blot analysis, immunohistochemistry, phenotypic analysis, mass spectrometry, fluorescence, radioactive decay, or any other method that is now known or that comes to be known to persons skilled in the art and that, from reading this disclosure, would be useful with the present invention. The optimal window for observing a knockdown phenotype is related to the mRNA turnover of the gene of interest, although 24-72 hours is standard. Final Volume reflects the amount needed in each well for the desired cell culture format. When adjusting volumes for a Stock Mix, an additional 10% should be used to accommodate variability in pipetting, etc. Duplicate or triplicate assays should be carried out when possible.
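The Stock Mix adjustment described above can be sketched as a simple scaling: multiply each per-well volume by the number of wells and add 10% for pipetting variability. The function name, dictionary keys, and the choice of three wells are hypothetical; the per-well volumes are those of Mixture 2 for the 96-well format in Table XI.

```python
def stock_mix(per_well_ul, n_wells, overage=0.10):
    """Scale per-well volumes to n_wells and add a fractional overage."""
    if n_wells < 1:
        raise ValueError("n_wells must be at least 1")
    factor = n_wells * (1.0 + overage)
    return {name: volume * factor for name, volume in per_well_ul.items()}


# Mixture 2, 96-well format (Table XI): volumes per well in ul.
mixture2_96 = {"opti_mem_ul": 9.0, "sirna_1um_ul": 1.0}

# Triplicate assay of one siRNA with the additional 10%.
triplicate_mix = stock_mix(mixture2_96, n_wells=3)
```

For a triplicate, this gives roughly 29.7 μl of Opti-MEM and 3.3 μl of 1 μM siRNA instead of the unadjusted 27 μl and 3 μl.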

Example XVII siRNAs that Target Nuclear Receptors

siRNAs that target nuclear receptor sequences having the NCBI accession numbers denoted below, and having sequences generated in silico by the algorithms herein, are provided. In various embodiments, the siRNAs are rationally designed. In various embodiments, the siRNAs are functional or hyperfunctional. The siRNAs that have been generated by the algorithms of the present invention include:

SEQ ID NO Name siRNASense Accession 438 AR CCAAAGGGCUAGAAGGCGA NM_000044 439 AR GGAAGUAGGUGGAAGAUUC NM_000044 440 AR ACGCCAAGGAGUUGUGUAA NM_000044 441 AR GAAACUUGGUAAUCUGAAA NM_000044 442 AR CUUCAAGUAUUAAGAGACA NM_000044 443 AR CAAUGAACUGGGAGAGAGA NM_000044 444 AR UUUGGAAGGUGGAGGAUUU NM_000044 445 AR AGAUGAAGCUUCUGGGUGU NM_000044 446 AR AUGCAAAGGUUCUCUGCUA NM_000044 447 AR CCGAGGAGCUUUCCAGAAU NM_000044 448 AR CGGAAAUGAUGGCAGAGAU NM_000044 449 AR GCUGAAGAAACUUGGUAAU NM_000044 450 AR CCGAAGAAGGCCAGUUGUA NM_000044 451 AR CUGAAGGGAAACAGAAGUA NM_000044 452 AR UCAAGGAACUCGAUCGUAU NM_000044 453 AR CCACACAAACGUUUACUUA NM_000044 454 AR GGAAAUGAUGGCAGAGAUC NM_000044 455 AR UGGAGAACCCGCUGGACUA NM_000044 456 AR AGGCAAGAGCACUGAAGAU NM_000044 457 AR GCAUCUGAGUCCAGGGGAA NM_000044 458 AR CAGAAAUGAUUGCACUAUU NM_000044 459 AR GGCAAGAGCACUGAAGAUA NM_000044 460 AR UUAAAGACAUCCUGAGCGA NM_000044 461 AR GUUGAACUCUUCUGAGCAA NM_000044 462 AR CGAGAGAGCUGCAUCAGUU NM_000044 463 AR GCAACUUACACGUGGACGA NM_000044 464 AR AAGAGCCGCUGAAGGGAAA NM_000044 465 AR CAGAAGUACCUGUGCGCCA NM_000044 466 AR UCGGAAAUGUUAUGAAGCA NM_000044 467 AR CCGCUGAGCUUAAAGACAU NM_000044 468 ESR1 CAGUAAAGUCCAUGGAAUA NM_000125 469 ESR1 GAAAGUAUUUGGAGGAAAA NM_000125 470 ESR1 GUGUAGAGCUCUUGUUUUA NM_000125 471 ESR1 GGGAUAAGUUGCUGAUUUU NM_000125 472 ESR1 CCUAAAGCCUGGUGAUUAU NM_000125 473 ESR1 GGUAUUGGGUGUAGGAACA NM_000125 474 ESR1 ACAGAGAGGUCAUUGGUUA NM_000125 475 ESR1 GCAGUAAAGUCCAUGGAAU NM_000125 476 ESR1 GAGAAGUAUUCAAGGACAU NM_000125 477 ESR1 GGAAGCUAGUUAUGUGAAA NM_000125 478 ESR1 AAUGAUGAAAGGUGGGAUA NM_000125 479 ESR1 AGACCGAAGAGGAGGGAGA NM_000125 480 ESR1 GGUCAUGGGUUCCAGUUAA NM_000125 481 ESR1 GGGAAUGGCAAAUAUAUUA NM_000125 482 ESR1 UGAUGAAUCUGCAGGGAGA NM_000125 483 ESR1 GAAAAUGUGUAGAGGGCAU NM_000125 484 ESR1 AAAGCUAGGUCAAGGGUUU NM_000125 485 ESR1 GGGACUUACUGAUAAUUUA NM_000125 486 ESR1 CCUACUACGUGGAGAACGA NM_000125 487 ESR1 UGGACAGGAACCAGGGAAA NM_000125 488 ESR1 ACAUCAGGCACAUGAGUAA NM_000125 489 ESR1 GCAAAGAUUAUGCCUGAAA 
NM_000125 490 ESR1 UGAAAGUGGUACACCUUAA NM_000125 491 ESR1 GGAAGUAUGGCUAUGGAAU NM_000125 492 ESR1 GAACAACAUCAGCAGUAAA NM_000125 493 ESR1 GGACAAGAUCACAGACACU NM_000125 494 ESR1 CUAAGGAAGUGCAGUCUUU NM_000125 495 ESR1 CAAGAGAAGUAUUCAAGGA NM_000125 496 ESR1 CCAGGGUGGCAGAGAAAGA NM_000125 497 ESR1 GCUUAAUUCUGGAGUGUAC NM_000125 498 ESR2 GGAAAUGCGUAGAAGGAAU NM_001437 499 ESR2 CCAAAUGUGUUGUGGCCAA NM_001437 500 ESR2 GCGGAGGCUGCGAGAAAUA NM_001437 501 ESR2 AGGGAAGGCCGGUGUGUUU NM_001437 502 ESR2 GUAACAAGGGCAUGGAACA NM_001437 503 ESR2 CGGCAGACCACAAGCCCAA NM_001437 504 ESR2 CUCAAGACAUGGAUAuAAA NM_001437 505 ESR2 GAUUAUAUUUGUCCAGCUA NM_001437 506 ESR2 CCGACAAGGAGUUGGUACA NM_001437 507 ESR2 AAACAGAGAGACACUGAAA NM_001437 508 ESR2 GGACAGGGAUGAGGGGAAA NM_001437 509 ESR2 GGGCUGAUGUGGCGGUCAA NM_001437 510 ESR2 GGAUGGAGGUGUUAAUGAU NM_001437 511 ESR2 ACGUCAGGCAUGCGAGUAA NM_001437 512 ESR2 GCAAAGAGGGCUCCCAGAA NM_001437 513 ESR2 GUGUGAAGCAAGAUCGCUA NM_001437 514 ESR2 UUCAAGGUUUCGAGAGUUA NM_001437 515 ESR2 GCAUGGAACAUCUGCUCAA NM_001437 516 ESR2 AGUGUUACGAAGUGGGAAU NM_001437 517 ESR2 CAAGAAGAUUCCCGGCUUU NM_001437 518 ESR2 AAGAAGCAUUCAAGGACAU NM_001437 519 ESR2 GAAAUGCGUAGAAGGAAUU NM_001437 520 ESR2 AAGGAAGGUUAGUGGGAAC NM_001437 521 ESR2 CCAAGAAGAUUGCCGGCUU NM_001437 522 ESR2 GUUCAAAGAGGGAUGCUCA NM_001437 523 ESR2 ACACUGAAAAGGAAGGUUA NM_001437 524 ESR2 AGAAAUAACUGCCUCUUGA NM_001437 525 ESR2 GGGGAAAUGCGUAGAAGGA NM_001437 526 ESR2 AAUCAGUGUACAAUCGAUA NM_001437 527 ESR2 GAGAGAGAGAUGUGGGUAC NM_001437 528 ESRRA GGGUGAAGCUGGAGGGCAA NM_0044S1 529 ESRRA CCAAAGGGUUCCUCGGAGA NM_0044S1 530 ESRRA GCAAAGUGCUGGCCCAUUU NM_004451 531 ESRRA CUGCAGAGCAAUAACACUA NM_0044S1 532 ESRRA UGGAGAUGCUCGAGGCCAU NM_004451 533 ESRRA GGCUGGAGCGAGAGGAGUA NM_0044S1 534 ESRRA CUGAGAAGCUCUAUGCCAU NM_0044S1 535 ESRRA GCCACAAGGAAGAGGAGGA NM_004451 536 ESRRA ACUUAGUCCUGGAUGAAGA NM_0044S1 537 ESRRA GGGCCUUGCGGAAGCCAUA NM_0044S1 538 ESRRA GCCCUUGGCUGUACAGAGA NM_004451 539 ESRRA UCGAAGAUGCCGAGGCUGU NM_004451 540 ESRRA 
UGUCAGUACUGCAGAGCGU NM_0044S1 541 ESRRA GGAUGGAGGUGCUGGUGCU NM_004451 542 ESRRA CUGUGGAGCAGCUGCGAGA NM_0044S1 543 ESRRA GCGAGAGGAGUAUGUUCUA NM_004451 544 ESRRA GCAGAGCAAUAACACUAUA NM_004451 545 ESRRA AGGCCCUGCUGGAGUAUGA NM_004451 546 ESRRA GCAUUGAGCCUCUCUACAU NM_004451 547 ESRRA ACGAGGCCCUGCUGGAGUA NM_004451 548 ESRRA UCAUUUGCAUUGGGCAUUA NM_004451 549 ESRRA UGAAUGCACUGGUGUCUCA NM_004451 550 ESRRA GGCGUUGGCUGAGGACUUA NM_004451 551 ESRRA GGGUGGGCAUGCUCAAGGA NM_004451 552 ESRRA GUGGGAGGCAGAAACCUAU NM_004451 553 ESRRA GUGCAAAGCCUUCUUCAAG NM_004451 554 ESRRA GAGCGAGAGGAGUAUGUUC NM_004451 555 ESRRA GAAGCAAUAACUCCAAGCA NM_004451 556 ESRRA GCCAGGUGGUGGGCAUUGA NM_0044S1 557 ESRRA GAGCAUCCCAGGCUUCUCA NM_0044S1 558 ESRRB GCGUCAAACUGCAGGGCAA NM_004452 559 ESRRB CAGAGUGCCUGGAUGGAAA NM_004452 560 ESRRB UGGAGAUGCUGGAGGCCAA NM_0044S2 561 ESRRB ACAAGAAGCUCAAGGUGGA NM_004452 562 ESRRB UGGUGUACGCUGAGGACUA NM_0044S2 563 ESRRB CCAUGUACAUCGAGGAUCU NM_004452 564 ESRRB UGACCAAGAUUGUCUCAUA NM_004452 565 ESRRB CAGAGGUGAUCCAGUGAUU NM_0044S2 566 ESRRB UCCCUGGGCUGGUGAAUAA NM_004452 567 ESRRB CCAUCAAGUGGGAGUACAU NM_004452 568 ESRRB CGUGAAACUGCAGGGCAAA NM_004452 569 ESRRB AGCUCAAGGUGGAGAAGGA NM_0044S2 570 ESRRB GGACAUUGCCUCUGGCUAC NM_004452 571 ESRRB ACGAGGCACUGCAGGACUA NM_004452 572 ESRRB AGGUGGAGAAGGAGGAGUU NM_004452 573 ESRRB CAAGAGCAGCUUAGAGGAU NM_004452 574 ESRRB CAGCACUUCUAUAGCGUCA NM_0044S2 575 ESRRB GAGAAGGAGGAGUUUGUGA NM_004452 576 ESRRB GCAGGUACAAGAAGCUCAA NM_004452 577 ESRRB ACCGAGAGCUUGUGGUCAU NM_0044S2 578 ESRRB GGGCGGAAGUCCUGAUGGU NM_004452 579 ESRRB CAUCGAGGAUCUAGAGGCU NM_004452 580 ESRRB GCACUUCUAUAGCGUCAAA NM_0044S2 581 ESRRB CAAGAUUGUCUCAUACCUA NM_004452 582 ESRRB CAGCAUGUGCAUUUCCUAA NM_0044S2 583 ESRRB GAGGAUCUCCCAAGGAUGA NM_004452 584 ESRRB GAGCUUGGCUUGCAACUCA NM_004452 585 ESRRB CCACCAAGAGGCAGCAUGU NM_0044S2 586 ESRRB AGAGGCAGCAUGUGCAUUU NM_0044S2 587 ESRRB GCACUGCUCAGGCUGGAUA NM_004452 588 ESRRG GGGAAUAGUUUAAGCUUUA NM_001438 589 ESRRG GAACAAUAAUGAAGGACAA NM_001438 590 ESRRG 
GGAUGAUGGUAGAGCAAUA NM_001438 591 ESRRG AAAUACAACUCCUGGGAAA NM_001438 592 ESRRG GAAAUAAACCCAAGAGUGA NM_001438 593 ESRRG GGAGAUAUGUUCAAAGAAU NM_001438 594 ESRRG AAACAAAGAUCGACACAUU NM_001438 595 ESRRG GUUAAGAGGUGUAAUCUAA NM_001438 596 ESRRG CAUCAAACUAGAAGGCAAA NM_001438 597 ESRRG ACAUCAAACUAGAAGGCAA NM_206594 598 ESRRG UGAUAGGAGUUCAGAAAUU NM_001438 599 ESRRG GGUAAAUGCUCAUCUUAAA NM_001438 600 ESRRG GGAAAGUUAUAAAGGUGUA NM_001438 601 ESRRG GGUGUAAUCUAAUGAAGAA NM_001438 602 ESRRG AUGAAGCGCUGCAGGAUUA NM_001438 603 ESRRG GUAAUAAGGUUCUUCCUAA NM_001438 604 ESRRG GAAAGUUUGUGCAGGGUUU NM_001438 605 ESRRG CAGAGUGCUUGGAUGGAAA NM_001438 606 ESRRG GCACUGCACAUGAGAUAUA NM_001438 607 ESRRG AAGAAAUACAAGAGCAUGA NM_206594 608 ESRRG GAGGACAAUUCAAGGCAAU NM_206594 609 ESRRG GUAAAGAAAUACAAGAGCA NM_206594 610 ESRRG GCAAAGUCCCAAUGCACAA NM_001438 611 ESRRG GCCAAGAGCUGACAAUUUA NM_001438 612 ESRRG AGAAGUGACUGUAGUGUGA NM_001438 613 ESRRG CAAGAGUGAUGUCGAAGAA NM_001438 614 ESRRG GGACUAUAUUGUAGAUUGU NM_001438 615 ESRRG CUGCAGAAUGUCAAACAAA NM_001438 616 ESRRG GAAAUGUAGUUCAGCUUUU NM_001438 617 ESRRG UCAACAGACUGCACUGAUA NM_001438 618 HNF4A GGAGAUGACUUGAGCCUUA NM_000457 619 HNF4A CGACAUCACUGGAGCAUAU NM_000457 620 HNF4A UGGACAAAGACAAGAGGAA NM_178849 621 HNF4A CGAAGAAGAUUGCCAGCAU NM_178850 622 HNF4A GCAGGAAGUUAUCUAGCAA NM_000457 623 HNF4A AGGUGAGCUUGGAGGACUA NM_178849 624 HNF4A GAUCGAUGACAAUGAGUAU NM_000457 625 HNF4A GGCAGUGCGUGGUGGACAA NM_000457 626 HNF4A AGGAAGCCGUCCAGAAUGA NM_000457 627 HNF4A UCACCAAGCAGGAAGUUAU NM_000457 628 HNF4A GAAGGAAGCCGUCCAGAAU NM_000457 629 HNF4A GCAGGAAUGGGAAGGAUGA NM_000457 630 HNF4A AGAAGGAAGCCGUCCAGAA NM_178849 631 HNF4A UCAUCAAGCUCUUCGGCAU NM_000457 632 HNF4A GCUUGGAGGACUACAUCAA NM_000457 633 HNF4A AGGUGUUGACGAUGGGCAA NM_178849 634 HNF4A CGGAAGAACCACAUGUACU NM_000457 635 HNF4A GAGCAGAUCCAGUUCAUCA NM_178850 636 HNF4A GGCAGAUGAUCGAGCAGAU NM_000457 637 HNF4A GAUAACAAGACUUUGACUU NM_000457 638 HNF4A GCGUGGUGGACAAAGACAA NM_000457 639 HNF4A GCUCUGAUAACAAGACUUU NM_000457 640 HNF4A 
CCCUGGAAUUUGAGAAUGU NM_000457 641 HNF4A GGGCUGGCAUGAAGAAGGA NM_000457 642 HNF4A AUGUACUCCUGCAGAUUUA NM_000457 643 HNF4A GCAAGGGACAGAUGUGUGA NM_000457 644 HNF4A CCAAGCAGGAAGUUAUCUA NM_000457 645 HNF4A UGACUUGAGCCUUACUUAA NM_000457 646 HNF4A CCAAGAGAUCCAUGGUGUU NM_178850 647 HNF4A GGACAAAGACAAGAGGAAC NM_000457 648 HNF4G GGACAGAGGAACAGGAAAA NM_004133 649 HNF4G AGAAUAAGCACCAGAAGAA NM_004133 650 HNF4G GGCAAAUGAUUGAGCAAAU NM_004133 651 HNF4G GAUAUUGACACUACAGAAA NM_004133 652 HNF4G CAAAAUAGCUGCAAACCAA NM_004133 653 HNF4G GUUCAUUGGUGCUGUUAUA NM_004133 654 HNF4G GUGAAUUACCAUUGGAUGA NM_004133 655 HNF4G CCAGAGACAAGUAUGAAUA NM_004133 656 HNF4G GCAAAUGAUUGAGCAAAUA NM_004133 657 HNF4G GCACUGACAUAAACGUUAA NM_004133 658 HNF4G GCUUGGAGCUACAAAGAGA NM_004133 659 HNF4G GGGCUAAGCGAUCCAGUAA NM_004133 660 HNF4G GGAAUGGGCUAAAUAUAUU NM_004133 661 HNF4G GGGCAAGAACAGUAGAAAA NM_004133 662 HNF4G GGUUAGACCAUUUCAAGAA NM_004133 663 HNF4G GUGAAAAGUUGUUGAUCUU NM_004133 664 HNF4G GGGACAGAGCAACAGGAAA NM_004133 665 HNF4G AAAUGAACGUGACAGAAUA NM_004133 666 HNF4G CAGAAUAAGCACCAGAAGA NM_004133 667 HNF4G GUGGAAUGGGCUAAAUAUA NM_004133 668 HNF4G CCGCAACAGCUGUGAAGUU NM_004133 669 HNF4G GCUAAGCGAUCGAGUAAAA NM_004133 670 HNF4G ACAAUCUACUUCAGGAAAU NM_004133 671 HNF4G AGAUCGGUUUGGAGGACUA NM_004133 672 HNF4G CAGAACGGCACUACAUAAA NM_004133 673 HNF4G GAUCCAUGAUGUAUAAAGA NM_004133 674 HNF4G ACAAAGAGAUGCAUGAUGU NM_004133 675 HNF4G CAGCAUCUCUCCAAACAAA NM_004133 676 HNF4G CAAAGAGAUCCAUGAUGUA NM_004133 677 HNF4G GUGGCAAAUGAUUGAGCAA NM_004133 678 HSAJ2425 GGAAAUACGUGGAGACACU NM_017532 679 HSAJ2425 AAAGGGACAUUUAGAGAUA NM_017532 680 HSAJ2425 CCAGAUAACUACGGCGAUA NM_017532 681 HSAJ2425 CCUGGAAGGUUGAGAAGUU NM_017532 682 HSAJ2425 GCAUUAAUGAGAUCCGAAA NM_017532 683 HSAJ2425 CGAAAGGGACAUUUAGAGA NM_017532 684 HSAJ2425 CGGGUAAACUUGUCCCUUA NM_017532 685 HSAJ2425 GGUUCUGUUUCUAAAGUUA NM_017532 686 HSAJ2425 GGUUAAGGCAGGAUCUGAA NM_017532 687 HSAJ2425 AAAAGAACGUGGAGGUGUA NM_017532 688 HSAJ2425 AGUCAAAGUUUCACAGAUC NM_017532 689 HSAJ2425 
CGAUGCAGCACUUUAAAUA NM_017532 690 HSAJ2425 UGCAACAUCUCUAAGAAUA NM_017532 691 HSAJ2425 GCUUGUACGACAUGAAAGA NM_017532 692 HSAJ2425 UGACCACACAUGACAGAAA NM_017532 693 HSAJ2425 GGUGUAUGAUGAUGACGUA NM_017532 694 HSAJ2425 GAGACACUACGGACAGAAA NM_017532 695 HSAJ2425 GGGUAAACUUGUCCCUUAU NM_017532 696 HSAJ2425 ACGUGGAGGUGUAUGAUGA NM_017532 697 HSAJ2425 GCGAUACCCUUGAUGAAAA NM_017532 698 HSAJ2425 GCUAGUUUGUAGUGACUCA NM_017532 699 HSAJ2425 GGAUCGAGCAGGAUUUCAA NM_017532 700 HSAJ2425 GAGAUAAGGACAAAUUUGA NM_017532 701 HSAJ2425 GCUAGAACAUGCUAUAACA NM_017532 702 HSAJ2425 GUGAAGAGGUGGAUAUGAC NM_017532 703 HSAJ2425 CUGAAUGAAUCUACACCUU NM_017532 704 HSAJ2425 GAGGAAAGAUGGCGUACCU NM_017532 705 HSAJ2425 CAGACUGGCUUGUGGUUCU NM_017532 706 HSAJ2425 ACACAUGACAGAAAUUCAA NM_017532 707 HSAJ2425 GUUGCUAGAACAUGCUAUA NM_017532 708 AAGC2452 GCAGAUGCCACCAGGAGAA NM_032644 709 NR0B1 AUAAAGAGCUGUGGGCAAA NM_000475 710 NR0B1 GAUGAUAUGAUGCUGGAAA NM_000475 711 NR0B1 CACCAUGAGGGAAGAAUAA NM_000475 712 NR0B1 GGCAAAAGAGUGUAAAAUA NM_000475 713 NR0B1 CUUAUGAGCGCGAAGCAAA NM_000475 714 NR0B1 GCAAAUACUCAGUGAACAC NM_000475 715 NR0B1 AUGAUAUGAUGCUGGAAAU NM_000475 716 NR0B1 ACAGAUUCAUCGAACUUAA NM_000475 717 NR0B1 CUCACUAGCUCAAAGCAAA NM_000475 718 NR0B1 GUUCCAUAGUUAAAGAAGA NM_000475 719 NR0B1 UGGAAAUGCUCUGUACAAA NM_000475 720 NR0B1 CUGGAAAUGCUCUGUACAA NM_000475 721 NR0B1 UCACCAUGAGGGAAGAAUA NM_000475 722 NR0B1 GCUCACUAGCUGAAAGCAA NM_000475 723 NR0B1 GOACAGUCAGGAUGGAUGA NM_000475 724 NR0B1 AGACAUUUGCCAACAGGUA NM_000475 725 NR0B1 CAGCAAAUACUCAGUGAAC NM_000475 726 NR0B1 CUGACGAGCGCAAAGCAAA NM_000475 727 NR0B1 AGACGCGGCUGGUGGAUCA NM_000475 728 NR0B1 CAGUCAGCAUGGAUGAUAU NM_000475 729 NR0B1 GGGCCACACAAGUGCAGUA NM_000475 730 NR0B1 GAAAUGCUCUGUACAAAGA NM_000475 731 NR0B1 UGACAGAUUCAUCGAACUU NM_000475 732 NR0B1 GAACUUAAUAGUACCCUUU NM_000475 733 NR0B1 GCAGCAUCCUCUACAGCAU NM_000475 734 NR0B1 GCAGCAUCCUCUACAGCUU NM_000475 735 NR0B1 GCUCAAAGCAAACGCACGU NM_000475 736 NR0B1 GCAGCAUCCUCUACAACAU NM_000475 737 NR0B1 GCUGACGAGCGCAAAGCAA 
NM_000475 738 NR0B1 AGUGAACACACCAGGAUGA NM_000475 739 NR0B2 GGGCAAGCCUGUAUAGACA NM_021969 740 NR0B2 GGAUGAGAAUGAAAGCUUA NM_021969 741 NR0B2 GGAAUAUGCCUGCCUGAAA NM_021969 742 NR0B2 GGGAUGAGAAUGAAAGCUU NM_021969 743 NR0B2 CAGUGGAGGCAGUGGCCAA NM_021969 744 NR0B2 GCAGCAUACUCAAGAAGAU NM_021969 745 NR0B2 GAGCUUAGCCCCAAGGAAU NM_021969 746 NR0B2 CAGCAGCAGUGGAGGCAGU NM_021969 747 NR0B2 AGGAACAGCUCUUCACUCA NM_021969 748 NR0B2 CCCAAGAUGCUGUGACCUU NM_021969 749 NR0B2 AGUGAGAGCAGAUCCCUAA NM_021969 750 NR0B2 CAUUGGAGUUCGUUGGUUU NM_021969 751 NR0B2 CAGCAUACUCAAGAAGAUU NM_021969 752 NR0B2 CUAUCGACUUUAUACAGAA NM_021969 753 NR0B2 CCUAUCGACUUUAUACAGA NM_021969 754 NR0B2 GAAUAUGCCUGCCUGAAAG NM_021969 755 NR0B2 CCAAGAUGCUGUGACCUUU NM_021969 756 NR0B2 GAUGAGAAUGAAAGCUUAG NM_021969 757 NR0B2 CCAGCUAUGUGCACCUCAU NM_021969 758 NR0B2 GAGAGCAGAUCCCUAACCA NM_021969 759 NR0B2 UGAAAGGGACCAUCCUCUU NM_021969 760 NR0B2 UGACAUCGCUGGCCUUCUU NM_021969 761 NR0B2 GACAGCACUUGGCUCCUUA NM_021969 762 NR0B2 GCGGUGCAGUGGCUUCAAU NM_021969 763 NR0B2 GUGCCCAGCAUACUCAAGA NM_021969 764 NR0B2 ACAUUGGACUUCCUUGGUU NM_021969 765 NR0B2 CGUAGCCGCUGCGUAUGUA NM_021969 766 NR0B2 AGACAGCACUUGGCUCCUU NM_021969 767 NR0B2 CGGGAGGCCUUGGAUGUUC NM_021969 768 NR0B2 GAGGCUGGCAGUGCUGAUU NM_021969 769 NR1D1 UGAGAUUGGCAGAGUGAAA NM_021724 770 NR1D1 GGGAAAGGCUCGGGCAAAA NM_021724 771 NR1D1 CCAUGAACCUGGCCAACAA NM_021724 772 NR1D1 GGAAAAGGCGGCUGAGAUU NM_021724 773 NR1D1 GUGAAGGACCAGACAGUGA NM_021724 774 NR1D1 UCACCAAGCUGCUGCUCAA NM_021724 775 NR1D1 GGGCUCUGGUGCUGAAGAA NM_021724 776 NR101 CCGAGAAGCUGCUGUCCUU NM_021724 777 NR1D1 UCACCAAGCUGAAUGGCAU NM_021724 778 NR1D1 GCUGAAUGGCAUGGUGUUA NM_021724 779 NR1D1 GCUGAGAUUGGCAGAGUGA NM_021724 780 NR1D1 AGCAAGAGCACCAGCAACA NM_021724 781 NR1D1 AGCAGAACAUCCAGUACAA NM_021724 782 NR1D1 GCGGGAGGUGGUAGAGUUU NM_021724 783 NR1D1 CGGCAGGGCAACUCAAAGA NM_021724 784 NR1D1 GCUGAGAUGCAGAGUGCCA NM_021724 785 NR1D1 AAAGCAAACCAGAAUCUUA NM_021724 786 NR1D1 AGGAGAUCUGGGAGGAUUU NM_021724 787 NR1D1 CUGAAUCCCUCUAUAGUGA 
NM_021724 788 NR1D1 GCAACAUCACCAAGCUGAA NM_021724 789 NR1D1 CAGGAGAUCUGGGAGGAUU NM_021724 790 NR1D1 GGCGAACGGUGCAGGAGAU NM_021724 791 NR1D1 ACAAAAGGUGUCUGAAGAA NM_021724 792 NR1D1 CAGCAAUGUCGCUUCAAGA NM_021724 793 NR1D1 CAGCAGAACAUCCAGUACA NM_021724 794 NR1D1 AAAGGUGUCUGAAGAAUGA NM_021724 795 NR1D1 GGAGGAUCCAGCAGAACAU NM_021724 796 NR1D1 CAGAACAUCCAGUACAAAA NM_021724 797 NR1D1 AAGGAAAGCAGAAUCUUAU NM_021724 798 NR1D1 CGUCAUAACGAGGCCCUAA NM_021724 799 NR1D2 GCAAGAACAUGGAGCAAUA NM_005126 800 NR1D2 UGAAAGAAGUGGUGGAAUU NM_005126 801 NR1D2 CCAAGAACAUGGAGCAAUA NM_005126 802 NR1D2 GUGCAACACUGGAGGAAGA NM_005126 803 NR1D2 CAGAAUGAGAACAAGAAUA NM_005126 804 NR1D2 AAACAUGAACUGAUGGUAA NM_005126 805 NR1D2 AAUCAAGAGCAGCAAGAAA NM_005126 806 NR1D2 UCAAAGAGUAUGUGAUAGA NM_005126 807 NR1D2 AGAAAUAUAGUGUGGAUGA NM_005126 808 NR1D2 CCUCCAACUUAGUGAUGAA NM_005126 809 NR1D2 CAAGAGAGGAGAACGGAUU NM_005126 810 NR1D2 CAAUGAAGACCAUGAUGAA NM_005126 811 NR1D2 CCAGAGAGGAGAACGGAUU NM_005126 812 NR1D2 CUUGAAGAAUGAUCGAAUA NM_005126 813 NR1D2 GUGAUGAAGAGAUGAGUUU NM_005126 814 NR1D2 AGGAAGAAGUGAUUGGCAU NM_005126 815 NR1D2 UCGGAGAAGUAUUCAACAA NM_005126 816 NR1D2 AGACAUUAGUAGAACAUCA NM_005126 817 NR1D2 CAGUUCAAAGGGAGGAAUA NM_005126 818 NR1D2 CGGAGAAGUAUUCAACAAA NM_005126 819 NR1D2 AGAAGUGCCUGAAGAAUGA NM_005126 820 NR1D2 ACAAGAAGUGCCUGAAGAA NM_005126 821 NR1D2 CCUUUAAAGUUCACCCUUA NM_005126 822 NR1D2 GAAGAAUGAUCGAAUAGAU NM_005126 823 NR1D2 GCAUGGUUCUACUGUGUAA NM_005126 824 NR1D2 GAAGAGAUGAGUUUGUUUA NM_005126 825 NR1D2 GCAAUGAAGACCAUGAUGA NM_005126 826 NR1D2 GCGAAGGCUGUAAGGGUUU NM_005126 827 NR1D2 GUUCAAAGGGAGGAAUAUA NM_005126 828 NR1D2 GGAAAGAAAUAUAGUGUGG NM_005126 829 NR1H2 GAACAGAUCCGGAAGAAGA NM_007121 830 NR1H2 CUAAAGAGGAGGACGAAGA NM_007121 831 NR1H2 GGGCGAGCCUGUAGACCUA NM_007121 832 NR1H2 AAGAGGAACCAGAGCGCAA NM_007121 833 NR1H2 GAAGAAGAAGAUUCGGAAA NM_007121 834 NR1H2 GCUACAACCACGAGACAGA NM_007121 835 NR1H2 GGAAGAAGAAGAUUCGGAA NM_007121 836 NR1H2 AACCAGAGCGCAAGCGAAA NM_007121 837 NR1H2 GAUCAUGCUGCUAGAGACA 
NM_007121 838 NR1H2 AGAAGAAGAUUCGGAAACA NM_007121 839 NR1H2 GGAAACAGCAGCAGCAGGA NM_007121 840 NR1H2 UCAUGAAGCUGGUGAGGCU NM_007121 841 NR1H2 CUGAAGAACAGAUCCGGAA NM_007121 842 NR1H2 GCUAACAGCGGCUCAAGAA NM_007121 843 NR1H2 CCUUGCGGCUCCAGGACAA NM_007121 844 NR1H2 GCGAAGGCUGCAAGGGCUU NM_007121 845 NR1H2 AAGAAGAAGAUUCGGAAAC NM_007121 846 NR1H2 CUCAAGAACUAAUGAUCCA NM_007121 847 NR1H2 AGGCGAGGGUGUCCAGCUA NM_007121 848 NR1H2 CUUUGAGGGUAUUUGAGUA NM_007121 849 NR1H2 GAAGAAGAGAUCCGGAAGA NM_007121 850 NR1H2 GAACCAGAGCGCAAGCGAA NM_007121 851 NR1H2 CUGUAAAGGAGGAGGGUCC NM_007121 852 NR1H2 AAGAACAGAUCCGGAAGAA NM_007121 853 NR1H2 CCAGAGCGCAAGCGAAAGA NM_007121 854 NR1H2 CGGAAGAAGAAGAUUCGGA NM_007121 855 NR1H2 CCCAGAUCCCGAAGAGGAA NM_007121 856 NR1H2 UCAAGAGGCCGCAGGACCA NM_007121 857 NR1H2 GCGCUACAACCACGAGACA NM_007121 858 NR1H2 CUAAGCAAGUGCCUGGUUU NM_007121 859 NR1H3 GACCAGGGCUCCAGAAAGA NM_005693 860 NR1H3 UGAAGAAACUGAAGCGGGA NM_005693 861 NR1H3 GAUAGUUGACUUUGCUAAA NM_005693 862 NR1H3 GGAACAGGCUCAUGCCACA NM_005693 863 NR1H3 UGACUUUGGUAAACAGCUA NM_005693 864 NR1H3 CCUCAAGGAUUUCAGUUAU NM_005693 865 NR1H3 UAAUGAAAGUGGUGAGCCU NM_005693 866 NR1H3 CUCAAGGAUUUCAGUUAUA NM_005693 867 NR1H3 GGUACAACCCUGGGAGUGA NM_005693 868 NR1H3 GAACAGAUCCGCCUGAAGA NM_005693 869 NR1H3 GCAGGAGAUAGUUGACUUU NM_005693 870 NR1H3 GUUAUAACCGGGAAGACUU NM_005693 871 NR1H3 UCAUCAAGGGAGCGCACUA NM_005693 872 NR1H3 GCGAGGGCUGCAAGGGAUU NM_005693 873 NR1H3 AGAUAGUUGACUUUGCUAA NM_005693 874 NR1H3 CUGCCCAGCAACAGUGUAA NM_005693 875 NR1H3 ACAGAGAUCCGUCCACAAA NM_005693 876 NR1H3 AAGAGGAGGAACAGGCUCA NM_005693 877 NR1H3 GGUGAUGCUUCUGGAGACA NM_005693 878 NR1H3 CAACUGGGCAUGAUCGAGA NM_005693 879 NR1H3 CGAUCGAGGUGAUGCUUCU NM_005693 880 NR1H3 UCUCCAGGGCCAUGAAUGA NM_005693 881 NR1H3 CUUCAGAACCCACAGAGAU NM_005693 882 NR1H3 UGACAUUCCUCCUGACUCU NM_005693 883 NR1H3 CGGAACAACUGGGCAUGAU NM_005693 884 NR1H3 GCAGCUGCAUCCUCAGAGA NM_005693 885 NR1H3 GAGUGUCGGCUUCGCAAAU NM_005693 886 NR1H3 CAGAUUGCCCUGCUGAAGA NM_005693 887 NR1H3 CCUCGGGCUUCCACUACAA 
NM_005693 888 NR1H3 UAGUUGACUUUGCUAAACA NM_005693 889 NR1H4 CCAUAAAGAAAGUGCAUUU NM_005123 890 NR1H4 AAACAUAGCUCAAAGUGAA NM_005123 891 NR1H4 CAUCGUAGAAGGAGUGAAA NM_005123 892 NR1H4 CAUAAAGGAUAGAGAGGCA NM_005123 893 NR1H4 GGAAGAAAGAAUUCGAAAU NM_005123 894 NR1H4 CAGCUGAGAUUUUCAAUAA NM_005123 895 NR1H4 CUGAUGUCAUGGAGAGUAA NM_005123 896 NR1H4 UGUCAAGAGUGUCGACUAA NM_005123 897 NR1H4 AAUCUAAGCGACUGAGAAA NM_005123 898 NR1H4 CCAAGGAGGUAGAAGACAU NM_005123 899 NR1H4 GCUGGGAUCUGGAGAGGAA NM_005123 900 NR1H4 CCAGAUAGACAAUACAUAA NM_005123 901 NR1H4 ACAAAGUCAUGCAGGGAGA NM_005123 902 NR1H4 GUAAAUAUUGGGCUAGAUA NM_005123 903 NR1H4 CUAAGGAAAUGCAAAGAGA NM_005123 904 NR1H4 GCAACUGUGUGAUGGAUAU NM_005123 905 NR1H4 CAGAGAUGCCUGUAACAAA NM_005123 906 NR1H4 CCUCAGGAAAUAACAAAUA NM_005123 907 NR1H4 GAAAUUGGAUUCUGAGCAU NM_005123 908 NR1H4 AGAAUUCAGUGCAGAAGAA NM_005123 909 NR1H4 CGAAGAAAGUGUCAAGAGU NM_005123 910 NR1H4 CAAGUGACCUCGACAACAA NM_005123 911 NR1H4 GGUAGAAGACAUCGUAGAA NM_005123 912 NR1H4 GGCAAUAAAGCAAACAUAA NM_005123 913 NR1H4 UAGAAGGAGUGAAAGAAGA NM_005123 914 NR1H4 AGAAGAAUUCAGUGCAGAA NM_005123 915 NR1H4 AAGAAAGUGUCAAGAGUGU NM_005123 916 NR1H4 AUAGAGAGGCAGUAGAGAA NM_005123 917 NR1H4 CAGGAGAAGCAUUACCAAA NM_005123 918 NR1H4 GGACAUUUCCUCAAGAUGA NM_005123 919 NR112 GAGAAAAGCAAGAGAAUAA NM_003889 920 NR112 CAACAAAGGAGGAAGUAUA NM_033013 921 NR112 UCUCAAAGCUAAAGGGUAU NM_022002 922 NR112 CAAAUUGGAUAGAGAAGAA NM_033013 923 NR112 UGAAGAGACCUGAAAGAAA NM_033013 924 NR112 AAACAAAGCAAGUAGAAGA NM_033013 925 NR112 CAGCAUUGACUCAGAUAUA NM_022002 926 NR112 GGGAAGUGCAAAUUGGAUA NM_033013 927 NR112 GGUUGAGGCUGAAGAGUGA NM_033013 928 NR112 GAGAACAUAAUGAGAACAA NM_003889 929 NR112 GGAAGUAUAAGGAGAUCUA NM_003889 930 NR112 UGAAAUGCACUCAGAAUUA NM_003889 931 NR112 GAAGAGACCUGAAAGAAAA NM_003889 932 NR112 GCAAAGAACUUACCACCAA NM_003889 933 NR112 CCUCAGAGCAGCUGCCAUA NM_022002 934 NR112 GGAGAAAUGAUAAGUGACA NM_003889 935 NR112 CCAUGUGGAUAAUCAGAAA NM_003889 936 NR112 CAUCUGACUUGGACUGAAA NM_033013 937 NR112 GAAGAAAAGUGAACGGACA 
NM_003889 938 NR112 GAGGUGAGACCCAAAGAAA NM_003889 939 NR112 GCAUGAAGAAGGAGAUGAU NM_033013 940 NR112 GGGCUGGAAUGCUGGGUAU NM_033013 941 NR112 UCAGAAAUGUGACUGGAAA NM_003889 942 NR112 GAAAUGUAGCGCUGGGUUU NM_033013 943 NR112 CCAAUAUCCUCAUGACAUU NM_003889 944 NR112 GGAUAGAGAAGAAACCAAU NM_003889 945 NR112 GGGAGGUGGUUUCAACAAA NM_003889 946 NR112 AUAAGGAGAUCUAGGUUCA NM_003889 947 NR112 CAAGAGGCCCAGAAGCAAA NM_003889 948 NR112 GAUUAAAGGAUGUACUUCA NM_003889 949 NR113 GGGAAGAUGAGCUGAGGAA NM_005122 950 NR113 CAAUGAAAGACUAAAGCAA NM_005122 951 NR113 CGGCAGAAGCCCUGGCAUU NM_005122 952 NR113 GAAGAUGAGCUGAGGAACU NM_005122 953 NR113 CAGGAGAACAGUCAGCAAA NM_005122 954 NR113 GGAAGUGAGUAAGGAGCAA NM_005122 955 NR113 GCUGAGGAACUGUGUGGUA NM_005122 956 NR113 CAGCAGGAGGGAUGAUAAU NM_005122 957 NR113 GGAGUUACCCAGAGAGAUG NM_005122 958 NR113 GCAGCAGGAGGGAUGAUAA NM_005122 959 NR113 CCAUGGAACACUACGAAAA NM_005122 960 NR113 CUGCAAGUCAUCAAGUUUA NM_005122 961 NR113 GGAAAAUGCUGGGACCAAA NM_005122 962 NR113 CUACACAAUUGAAGAUGGA NM_005122 963 NR113 AGGGAGCAGCUGUGGAAAU NM_005122 964 NR113 GGUACCAAAUCCAGCACAU NM_005122 965 NR113 UGAGUAAGGAGCAAGAAGA NM_005122 966 NR113 UUAAUGAGGCCUACGGGUA NM_005122 967 NR113 CAAGGGAGCAGCUGUGGAA NM_005122 968 NR113 GGAGCAUUAAUGAGGCCUA NM_005122 969 NR113 UCGCAGACAUCAACACUUU NM_005122 970 NR113 CAUCAACACUUUCAUGGUA NM_005122 971 NR113 CACCUGUGCAACUGAGUAA NM_005122 972 NR113 CAGGUGACAUGGUGOCUAA NM_005122 973 NR113 CGCAGUGGUUGCAAUGAAA NM_005122 974 NR113 UUGAACAGUUUGUGCAGUU NM_005122 975 NR113 GCCCAGUGGUUGCAAUGAA NM_005122 976 NR113 GAACAGUUUGUGCAGUUUA NM_005122 977 NR113 AGGAGAACAGUCAGCAAAA NM_005122 978 NR113 CGCAGAGAGAUGAGAUUGA NM_005122 979 NR2C1 CGUAGUAUGUGGAGACAAA NM_003297 980 NR2C1 GGAAAAGGCUUAUGUGGAA NM_003297 981 NR2C1 GGAUCAAAGGAUUGUAUUA NM_003297 982 NR2C1 GGAAGGAAGUGUACACCUA NM_003297 983 NR2C1 GGUGAAAGCUUACUGGAAU NM_003297 984 NR2C1 UCACAAUAGUCUUCAACAA NM_003297 985 NR2C1 AGAUAAUUCUCCAGACCAA NM_003297 986 NR2C1 CAGGAAUGUUCAUGAAUAU NM_003297 987 NR2C1 UGGAACUGAUAGAGAAAUU 
NM_003297 988 NR2C1 CUGCAGUGCUGUAAACUUA NM_003297 989 NR2C1 AGAAGAAAUUGCACAUCAA NM_003297 990 NR2C1 AGUGAAAGUACAAGGUCAA NM_003297 991 NR2C1 GCAGCAAACUGGGCAGAAA NM_003297 992 NR2C1 GGGCAUGGAAGGAAGUGUA NM_003297 993 NR2C1 UGACAGCACUUGAUCAUAA NM_003297 994 NR2C1 CAUGGAACUGAUAGAGAAA NM_003297 995 NR2C1 ACAGAGAUGUAUUGCGUUU NM_003297 996 NR2C1 AGACAAAGCAUCAGGACGU NM_003297 997 NR2C1 GCAUUGAUGGAUACGAAUA NM_003297 998 NR2C1 UGUAACAGAUAGUGAAAGU NM_003297 999 NR2C1 ACAACAGAUGGGAGAGAUU NM_003297 1000 NR2C1 GGUGAUGUUUCAAGGGCAU NM_003297 1001 NR2C1 GGUUACAUCAUUAGCGAAU NM_003297 1002 NR2C1 UCAGGAAUGUUCAUGAAUA NM_003297 1003 NR2C1 UUAAUAAGCACCACCGAAA NM_003297 1004 NR2C1 UUUCCAGGCUCUAGGGCAA NM_003297 1005 NR2C1 AAGAAAACCCAUUGAAGUA NM_003297 1006 NR2C1 AAUUAUUGAUGGAGCACAU NM_003297 1007 NR2C1 ACAGAUAGUGAAAGUACAA NM_003297 1008 NR2C1 CUGCAUUGAUGGAUACGAA NM_003297 1009 NR2C2 AGACAAAGAUGGAGCAAGA NM_003298 1010 NR2C2 CAACAUAACAGAAGAACUU NM_003298 1011 NR2C2 AGAUGGAGUUGCAGGACUA NM_003298 1012 NR2C2 CUGAUGAGCUCCAACAUAA NM_003298 1013 NR2C2 CGGGAGAAACCAAGCAAUU NM_003298 1014 NR2C2 GCGCAAAUCCUGAGGUAAC NM_003298 1015 NR2C2 CCAGCAGACAGGACAGAAA NM_003298 1016 NR2C2 GGAUAAAGCAAGUCAUGGA NM_003298 1017 NR2C2 UGGCGAAGCUGGAUAUAGA NM_003298 1018 NR2C2 AAGCUUGGCAGAUGGGAUA NM_003298 1019 NR2C2 AAAAGGAGUGUGAGGAAAA NM_003298 1020 NR2C2 CUUCAGAAAUCCAGCCAGA NM_003298 1021 NR2C2 AAAGAUGGAGCAAGACAAA NM_003298 1022 NR2C2 CAGGGAUGCUUGUGAACAU NM_003298 1023 NR2C2 CGGAAAGACCUGAGAAGUC NM_003298 1024 NR2C2 GCACUUAAUACCACAGACA NM_003298 1025 NR2C2 CAAAGAUGGAGCAAGACAA NM_003298 1026 NR2C2 AGAACAGCAUCCAGGAAGA NM_003298 1027 NR2C2 CAGAAGACACCUACCGAUU NM_003298 1028 NR2C2 CUGCAGGAGUUCUGUAACA NM_003298 1029 NR2C2 AAGAGAAGAUUGUCACAGA NM_003298 1030 NR2C2 GCACAAGCCAGAUUGAAAA NM_003298 1031 NR2C2 UGAGAUAACUCGGGCAUUU NM_003298 1032 NR2C2 UUGCAAAGGUUUCUUCAAA NM_003298 1033 NR2C2 CCAGUCUGCAAGUGAGAUA NM_003298 1034 NR2C2 UGGAGACAGCAGAGUAUAA NM_003298 1035 NR2C2 GAUGGAGACAGCAGAGUAU NM_003298 1036 NR2C2 AGAUGGAGCAAGACAAACA 
NM_003298 1037 NR2C2 AAGAUAAACUUUCUGGUGA NM_003298 1038 NR2C2 AGCACAAGCCAGAUUGAAA NM_003298 1039 NR2E1 CAAAGGAAUUGAUGGACAA NM_003269 1040 NR2E1 GAUCAAAAGCUGAAACAUA NM_003269 1041 NR2E1 GGAGAUGAAUGAAUGCUAA NM_003269 1042 NR2E1 GGACGUAAUUGCAGAAGGA NM_003269 1043 NR2E1 GAUCGAAGACUGAGUGACA NM_003269 1044 NR2E1 ACUGUUUGUUCUAGGAAUA NM_003269 1045 NR2E1 GUUAGAUGCUACUGAAUUU NM_003269 1046 NR2E1 UGGACAAGACGCACAGAAA NM_003269 1047 NR2E1 AUGUACAAAUCCAGUGAUA NM_003269 1048 NR2E1 CUAGUAAAGCCUUGAAUGA NM_003269 1049 NR2E1 UUAAAUAGCUGCUGUACUU NM_003269 1050 NR2E1 ACAUAGUGCUGAAACCAAA NM_003269 1051 NR2E1 CUAACAAACCCUUCAGGAA NM_003269 1052 NR2E1 GAAGAUGCUUGGAGAGAAC NM_003269 1053 NR2E1 GAGAUGAGUGAUUGAGUGA NM_003269 1054 NR2E1 GUACAAAUCCAGUGAUAUC NM_003269 1055 NR2E1 GAACAAGCCUCAACUAACA NM_003269 1056 NR2E1 AAACAUAAGUAGUGCUUUC NM_003269 1057 NR2E1 CAAAUAUAGCUCUGUGUAU NM_003269 1058 NR2E1 GAGAACAAGCCUCAACUAA NM_003269 1059 NR2E1 GGAAGAUGCUUGGAGAGAA NM_003269 1060 NR2E1 CAAAUAAAGUGGAGAUGAG NM_003269 1061 NR2E1 AAACGAAUAUGGACGUAAU NM_003269 1062 NR2E1 CGCAUCAACUAUAGAAGAA NM_003269 1063 NR2E1 GUAGAUUGCUGUCCCGUUA NM_003269 1064 NR2E1 GUUUGGAAGUCAACAUGAA NM_003269 1065 NR2E1 ACUGAGAAGUUUCCGGAAU NM_003269 1066 NR2E1 GCUAACAGCCUGAGACUCU NM_003269 1067 NR2E1 GGACAGCCAGCAUGAGCAA NM_003269 1068 NR2E1 UGUUCUAGGAAUAGCACAA NM_003269 1069 NR2E3 GUUCAUGGACUGAGGCAAA NM_014249 1070 NR2E3 GCAGAAAUGCCCACCGAAA NM_014249 1071 NR2E3 GGGCUGGACUUGAAAGGAA NM_016346 1072 NR2E3 AGGCAAGACUAAUUGACAA NM_014249 1073 NR2E3 GAAAGGAAGAAGAAGUCUA NM_016346 1074 NR2E3 GCACAGAGAGACAGAGGUU NM_014249 1075 NR2E3 GGAAAACAAUCUACUGAAA NM_014249 1076 NR2E3 AGAGGAUGCUGAUGAGAAU NM_016346 1077 NR2E3 UGAGAAAUGGGGAGAAUAA NM_016346 1078 NR2E3 GGUAAACUUCACAGACAUA NM_014249 1079 NR2E3 GGCCAUGGGUCCAGAGGAU NM_016346 1080 NR2E3 UGGGAAACAUAAAGCAGAA NM_016346 1081 NR2E3 GGGUGGAGGUGAAAUGUUU NM_014249 1082 NR2E3 GCACAAGGGUCUCAGUUCA NM_016346 1083 NR2E3 ACAGAGAGACAGAGGUUCA NM_014249 1084 NR2E3 GAGAAGCUCCUUUGUGAUA NM_014249 1085 NR2E3 
GAGGUAAACUUCACAGACA NM_014249 1086 NR2E3 GGACUUGAAAGGAAGAAGA NM_016346 1087 NR2E3 GGGCAGUGGUUUCACAAAU NM_014249 1088 NR2E3 GGUGUUUGGGUGAAGGUAA NM_016346 1089 NR2E3 GGGUGAAGGUAAGGAAUGA NM_016346 1090 NR2E3 UGACCUCACUGAAGACAAA NM_016346 1091 NR2E3 GGAACAGGCAAGACUAAUU NM_014249 1092 NR2E3 AGAAUAAGCCAGAAAAGUA NM_016346 1093 NR2E3 AGACAGGGCUGGACUUGAA NM_016346 1094 NR2E3 GGUAAGGAAUGAGGGAAGA NM_016346 1095 NR2E3 GGGACCAAGAUGUACAUAA NM_016346 1096 NR2E3 CAGAGGAUGCUGAUGAGAA NM_016346 1097 NR2E3 CAGGAACAGGCAAGACUAA NM_014249 1098 NR2E3 GGACCAAGAUGUAGAUAAG NM_016346 1099 NR2F1 CCAACAACAUUAUGGGCAU NM_005654 1100 NR2F1 AGAAAUACAAUCCGAGCUA NM_005654 1101 NR2F1 UCGAGAGCCUGCAGGAGAA NM_005654 1102 NR2F1 CGGGAAGCACUACGGCCAA NM_005654 1103 NR2F1 GCAAACUGCUGGUGCGACU NM_005654 1104 NR2F1 AAUCCGAGCUACAAAGCAU NM_005654 1105 NR2F1 CGUCCGUUUGGUAGGUAAA NM_005654 1106 NR2F1 CCUCAAGAAGUGCCUCAAA NM_005654 1107 NR2F1 GCAUGCAGCCCAACAACAU NM_005654 1108 NR2F1 GUUAGCAGCUGGCGAGAUC NM_005654 1109 NR2F1 UGAGCACGUUGGCGAGGAA NM_005654 1110 NR2F1 CCACCCAGCAGAAAUACAA NM_005654 1111 NR2F1 CAAAGAUAUGGCAAUGGUA NM_005654 1112 NR2F1 GAAACUCUGAUCCGCGAUA NM_005654 1113 NR2F1 CCAAAGAUAUGGCAAUGGU NM_005654 1114 NR2F1 CGCAGGAACUUAACUUACA NM_005654 1115 NR2F1 UCAAAGCCAUCGUGCUGUU NM_005654 1116 NR2F1 UCUCAUCCGCGAUAUGUUA NM_005654 1117 NR2F1 GCCCAAAGAUAUGGGAAUG NM_005654 1118 NR2F1 GAUAUGGCAAUGGUAGUUA NM_005654 1119 NR2F1 UCAAGAAGUGCCUCAAAGU NM_005654 1120 NR2F1 AUGCAGCCCAACAACAUUA NM_005654 1121 NR2F1 GGAACUUAACUUACACAUG NM_005654 1122 NR2F1 OGAGUACAGGUGOCUCAAA NM_005654 1123 NR2F1 AAAAGAACCUUGUGUCUGU NM_005654 1124 NR2F1 GAGUACAGCUGCCUCAAAG NM_005654 1125 NR2F1 GGCAAGCACUACGGCGAAU NM_005654 1126 NR2F1 CAAUACUGCCGCCUCAAGA NM_005654 1127 NR2F1 GAGCGUCGGCAGGAACUUA NM_005654 1128 NR2F1 CUCAAGAAGUGCCUCAAAG NM_005654 1129 NR2F2 GUGGAAAGGUUGCAGGAAA NM_021005 1130 NR2F2 CCAACAACAUCAUGGGUAU NM_021005 1131 NR2F2 GGAGUGAAACAGAGAAAGA NM_021005 1132 NR2F2 CAGACAAGCCAUCGACAAA NM_021005 1133 NR2F2 AAAGCUGAGCCGAGAGAAA NM_021005 
1134 NR2F2 CCUCAGUCAUAGAGCAAUU NM_021005 1135 NR2F2 UGGAAAGCUUGCAGGAAAA NM_021005 1136 NR2F2 GGAGACAAGUCGAGCGGCA NM_021005 1137 NR2F2 CGGAGGAACGUGAGCUACA NM_021005 1138 NR2F2 CGUAUAUGGCAAUUCAAUA NM_021005 1139 NR2F2 GAAGAGAGGGGAAGAAUUU NM_021005 1140 NR2F2 UGAUGUAGCCCAUGUGGAA NM_021005 1141 NR2F2 CAUCAUGGGUAUCGAGAAC NM_021005 1142 NR2F2 ACUCGUACCUGUCCGGAUA NM_021005 1143 NR2F2 GCUUAAUUUCCUUCUGUUA NM_021005 1144 NR2F2 CCAAGCAGCCGACGAGAUU NM_021005 1145 NR2F2 CCAGACAAGCCAUCGACAA NM_021005 1146 NR2F2 GCGACAAGCAGCAGCAGCA NM_021005 1147 NR2F2 GCCGAGAGAAACAAACAAA NM_021005 1148 NR2F2 AGAAUUUAAUGGACUGUGA NM_021005 1149 NR2F2 GGAUGUUACAAGUUUGCUA NM_021005 1150 NR2F2 AGUCAUAGAGCAAUUGUUU NM_021005 1151 NR2F2 GCCAAUGCAUGCAGCCCAA NM_021005 1152 NR2F2 AAGAAUACGUUAGGAGCCA NM_021005 1153 NR2F2 GCGAGCUGUUUGUGUUGAA NM_021005 1154 NR2F2 UCAUGGGUAUCGAGAACAU NM_021005 1155 NR2F2 GGAGCGAGCUGUUUGUGUU NM_021005 1156 NR2F2 GAGCCGAGAGAAACAAACA NM_021005 1157 NR2F2 AGACUGGUUUGUUUGCUUA NM_021005 1158 NR2F6 GGCCAAUAAUAAAGACAUU NM_005234 1159 NR2F6 ACUCAAGGCCAAUAAUAAA NM_005234 1160 NR2F6 GGGAAGACGCCCAUUGAGA XM_373407 1161 NR2F6 GACUCAAGGCCAAUAAUAA NM_005234 1162 NR2F6 GUUGAGAGCCUGCAGGAGA XM_373407 1163 NR2F6 UUGAGAGCCUGCAGGAGAA NM_005234 1164 NR2F6 ACACGUAACCUAUGUCAGA NM_005234 1165 NR2F6 GAAGACGCCCAUUGAGACA NM_005234 1166 NR2F6 AGUCGAGCGGCAAGCAUUA XM_373407 1167 NR2F6 CCUCUGGACACGUAACCUA NM_005234 1168 NR2F6 UGACUCAAGGCCAAUAAUA NM_005234 1169 NR2F6 GACACUACAUGAUGACUCA NM_005234 1170 NR2F6 AGACAUUUCCUACCUGCAA NM_005234 1171 NR2F6 UUGAGACACUGAUCAGAGA NM_005234 1172 NR2F6 GAUGACUGAAGGCCAAUAA NM_005234 1173 NR2F6 GACAUUUCCUACCUGCAAA NM_005234 1174 NR2F6 AGUGUGAAAUGUUUGUCUU NM_005234 1175 NR2F6 GUCAGACACUACAUGAUGA NM_005234 1176 NR2F6 UGCCAGUACUGCCGUCUCA NM_005234 1177 NR2F6 AUAAAGACAUUUCCUACCU NM_005234 1178 NR2F6 UCAAGGCCAAUAAUAAAGA NM_005234 1179 NR2F6 GUACUGCCGUCUCAAGAAG NM_005234 1180 NR2F6 UCUCCCAGCUGUUCUUCAU NM_005234 1181 NR2F6 UCAAGGCCAUCGCGCUCUU NM_005234 1182 NR2F6 UGGCUUCUCUCCUCAGACU 
NM_005234 1183 NR2F6 CAGCCGGUGUCCGAACUGA NM_005234 1184 NR2F6 CUGGACACGUAACCUAUGU NM_005234 1185 NR2F6 UGGCUUCCCUGGCAUGAUG NM_005234 1186 NR2F6 CAAGGCCAAUAAUAAAGAC NM_005234 1187 NR3C1 AAACAAAAGUGAUGGGAAA NM_000176 1188 NR3C1 GGUAAUUAAGCAAGAGAAA NM_000176 1189 NR3C1 AAGAAAUGCUGAUGGAUAA NM_000176 1190 NR3C1 CAAGAAAGCUGGUAAACUA NM_000176 1191 NR3C1 GAGAGUAGAUGGUGAAAUU NM_000176 1192 NR3C1 ACUUAAAGCUUUUGGAAGA NM_000176 1193 NR3C1 CAUCAAAGAGCUAGGAAAA NM_000176 1194 NR3C1 AAAGAAAUGCUGAUGGAUA NM_000176 1195 NR3C1 UCUCAUAGGUUGCCAAUAA NM_000176 1196 NR3C1 GGUAAAGAGACGAAUGAGA NM_000176 1197 NR3C1 CUGCUUCAGUGGAGAAUUA NM_000176 1198 NR3C1 CCUAUGUGCUGGAAGGAAU NM_000176 1199 NR3C1 GCACUUAGCUAUCAGAAGA NM_000176 1200 NR3C1 GAGUAGAUGGUGAAAUUUA NM_000176 1201 NR3C1 GCUCUGACCCAGUGAGAUU NM_000176 1202 NR3C1 GCAUAGAGGUACCAGCAAU NM_000176 1203 NR3C1 GUGAAAUGGGCAAAGGCAA NM_000176 1204 NR3C1 UAUUGAACCUGAAGUGUUA NM_000176 1205 NR3C1 UGAAAUGGGCAAAGGCAAU NM_000176 1206 NR3C1 UGGAUAAGACCAUGAGUAU NM_000176 1207 NR3C1 CAGCAUGAGACCAGAUGUA NM_000176 1208 NR3C1 UGGGAGAGACAGAAACAAA NM_000176 1209 NR3C1 AUUAAUGAGCAGAGAAUGA NM_000176 1210 NR3C1 ACAUCAAAGAGCUAGGAAA NM_000176 1211 NR3C1 GGAAACAGACUUAAAGCUU NM_000176 1212 NR3C1 GAAGGAAACUCCAGCCAGA NM_000176 1213 NR3C1 GGUUUUAUCAACUGACAAA NM_000176 1214 NR3C1 CAACGGUGGCAAUGUGAAA NM_000176 1215 NR3C1 GCCAAGAGCUAUUUGAUGA NM_000176 1216 NR3C1 AGUGAUUGCAGCAGUGAAA NM_000176 1217 NR3C2 AGUCAAAGAAGUUGGGAAA NM_000901 1218 NR3C2 GAGAAUAACUACAUGGAGA NM_000901 1219 NR3C2 GUAAAUAACCACACUGAAA NM_000901 1220 NR3C2 GAACUGAGGAAGAUGGUAA NM_000901 1221 NR3C2 UAGAAGAAGUGAUGGGUAU NM_000901 1222 NR3C2 GGUAAUAAAUACACAGCAU NM_000901 1223 NR3C2 CCGCAAUGCUGGUGGAGAU NM_000901 1224 NR3C2 CAAGGAAGCAGCAAAGAAA NM_000901 1225 NR3C2 GAAGAAAUGAGGACAAAUU NM_000901 1226 NR3C2 UUGAACAGCUGGUGAAAUU NM_000901 1227 NR3C2 CAGCAAAGCCAUAUUGUAA NM_000901 1228 NR3C2 GCACGAAAGUCAAAGAAGU NM_000901 1229 NR3C2 UGUAAGAGAUGCUGACUAU NM_000901 1230 NR3C2 GCAUGGAGACCGGGAGAUA NM_000901 1231 NR3C2 
GGACAAAUUACAUCAAAGA NM_000901 1232 NR3C2 UGAGGAAGAUGGUAACUAA NM_000901 1233 NR3C2 AGCCAUAUCUAGUCAAUAA NM_000901 1234 NR3C2 AAACUGAGCUGGAAUCUAA NM_000901 1235 NR3C2 CAGCUAAGAUUUAUCAGAA NM_000901 1236 NR3C2 CUACAGGAUUGGUGCUCAA NM_000901 1237 NR3C2 GAAUAGAGGCCAAAUUAAU NM_000901 1238 NR3C2 AGAGAGGACCGAUGAGAAU NM_000901 1239 NR3C2 ACAUCAAAGAACUGAGGAA NM_000901 1240 NR3C2 AGGCAGAUCUUUAAAUACA NM_000901 1241 NR3C2 GCUCAUGUCUAGGAGGAAA NM_000901 1242 NR3C2 CCAGAAGAACUUUGCCUUA NM_000901 1243 NR3C2 CUGCAAAGUUUUCUUCAAA NM_000901 1244 NR3C2 UGGAAUGAAUUUAGGAGCA NM_000901 1245 NR3C2 CUAUUUAUGUGCUGGAAGA NM_000901 1246 NR3C2 CAGCAGAACCAACAAGGAA NM_000901 1247 NR4A1 GGAGAAGGCAGGUGGACAA NM_173158 1248 NR4A1 GGACAGAGCAGCUGCCCAA NM_173157 1249 NR4A1 GUAAAUACAGGAAGAAAGA NM_002135 1250 NR4A1 GGACAAGGGCCCAUGAAAA NM_173158 1251 NR4A1 GCAAGUGGGCGGAGAAGAU NM_002135 1252 NR4A1 CGGCUACACAGGAGAGUUU NM_173158 1253 NR4A1 GGCUUGAGCUGCAGAAUGA NM_173157 1254 NR4A1 GAAGAAGACAAAUGACAGA NM_002135 1255 NR4A1 GGAGAAGAUCCCUGGCUUU NM_173157 1256 NR4A1 GCACCUUCAUGGACGGCUA NM_002135 1257 NR4A1 GCUCAGGCCUGGUGCUACA NM_002135 1258 NR4A1 GCCCAGUGCUGCUGUAAAU NM_002135 1259 NR4A1 AUAGACAUGUAGUUGGAAA NM_002135 1260 NR4A1 CAGGAGAGUUUGACACCUU NM_002135 1261 NR4A1 GAAGGAAGUUGUCCGAACA NM_002135 1262 NR4A1 GCACAUGCGCACUCUCAUA NM_173157 1263 NR4A1 GGGAUGUACAGCAGUUCUA NM_173157 1264 NR4A1 UCGAGGACUUCCAGGUGUA NM_002135 1265 NR4A1 CCACCCAAAUGUUAGAAAA NM_173158 1266 NR4A1 GUUGCUGGCGUGAGAUGAA NM_173158 1267 NR4A1 AAAUAGACAUGUAGUUGGA NM_173157 1268 NR4A1 ACAGGAAGAAAGAGCUUGA NM_173157 1269 NR4A1 CUAGAUAGAUGCCCUGUAU NM_002135 1270 NR4A1 GGGCAUGGACAGAGCAGCU NM_002135 1271 NR4A1 GUACAUAAACUGUCACUCU NM_173157 1272 NR4A1 AGAAGACAAAUGACAGAUU NM_173157 1273 NR4A1 GGAGAUGCCCUGUAUCCAA NM_173157 1274 NR4A1 CAGCGAUGGUGCCUGCUUA NM_173158 1275 NR4A1 CCUUCCACAUGUACAUAAA NM_002135 1276 NR4A1 GGAAGAUGCUGGGGAUGUA NM_002135 1277 NR4A2 CAAAGGAACUGGAAUGAUA NM_006186 1278 NR4A2 GCAAAGGCUUGUAAAUUUA NM_173171 1279 NR4A2 CCAUUAAGGUAGAAGACAU NM_173172 
1280 NR4A2 GGACAAGCGUCGCCGGAAU NM_173172 1281 NR4A2 GGAUUUAGAACAUGGACUA NM_173171 1282 NR4A2 AUACAUAACUCCCUGGAAA NM_173171 1283 NR4A2 GGCAAAGGCUUGUAAAUUU NM_006186 1284 NR4A2 CAGAAGAAAGAUUGCUAUA NM_006186 1285 NR4A2 CCUCCAACUUGCAGAAUAU NM_173173 1286 NR4A2 AGAUAAGUGUGUUUGCAAA NM_173171 1287 NR4A2 UUGAAUGAAUGAAGAGAGA NM_173172 1288 NR4A2 CGAUUAGCAUACAGAAUAU NM_173172 1289 NR4A2 UGGAAGAACUGCAAAACAA NM_006186 1290 NR4A2 CCAAGAGAGUGGAAGAACU NM_173171 1291 NR4A2 CCAGCUUGCUUGUACCAAA NM_173171 1292 NR4A2 GGAAAUAACUGAGCACUUU NM_173172 1293 NR4A2 GGACAAGCAUGUUGAUUCU NM_173172 1294 NR4A2 CAUUAAGGUAGAAGACAUU NM_173171 1295 NR4A2 GAAUAUGAACAUCGACAUU NM_173173 1296 NR4A2 GGGCACAAGUAUUACACAU NM_173171 1297 NR4A2 GGCUAUGGUCACAGAGAGA NM_173171 1298 NR4A2 UUGAAUGAAUGAAGACAGA NM_173173 1299 NR4A2 CCACAAGUAUUGCCCUUUA NM_173171 1300 NR4A2 CGGCAGAGUUGAAUGAAUG NM_006186 1301 NR4A2 GAGAUGACACCCAGCAUAU NM_173172 1302 NR4A2 GCAAUGCGUUCGUGGCUUU NM_006186 1303 NR4A2 GCACAGGCUACGACGUGAA NM_006186 1304 NR4A2 GGAGAUGACACCCAGCAUA NM_173172 1305 NR4A2 UCAAGGAACCCAAGAGAGU NM_006186 1306 NR4A2 UCGAUUAGCAUACAGAAUA NM_173172 1307 NR4A3 GGUGAAAGAUGGAGGAUUA NM_173198 1308 NR4A3 AAACAGAGCCCUAGAGAAA NM_006981 1309 NR4A3 GGGGAAGGGAGGAAAGAUA NM_173198 1310 NR4A3 CAAGAUAGCUUCAGACCAA NM_006981 1311 NR4A3 GAUAGAACGUGGAAUGUUA NM_006981 1312 NR4A3 CCAAAGAAGAUCAGACAUU NM_006981 1313 NR4A3 AUUGAAGGGUGAAGAGUUA NM_006981 1314 NR4A3 CUUGAUUAUUCCAGAGUAA NM_173199 1315 NR4A3 GUAACAGAAUCAAGACUAA NM_006981 1316 NR4A3 GCAAAUGACACGUUAAUAU NM_173200 1317 NR4A3 AGACAUACAGCUCGGAAUA NM_173199 1318 NR4A3 AAAGAGAGUCGAAGAGCUA NM_006981 1319 NR4A3 GUACCUUCGUGGAGGGCUA NM_006981 1320 NR4A3 GAAUCAAGACUAAGACCUA NM_006981 1321 NR4A3 GCACCAUGUUAGACAGUUU NM_173199 1322 NR4A3 CGGGACAGCUCUCUAGAAA NM_006981 1323 NR4A3 UCUCUGAGCUUAUGAGGAA NM_173198 1324 NR4A3 GAGAAAUGCUGUUACUUUU NM_006981 1325 NR4A3 CAAAUAAACCACUGGCUUU NM_173198 1326 NR4A3 UAAAAGACCACCAGAGUAA NM_173198 1327 NR4A3 AGUAAUAGGUCCAGAUAUG NM_173198 1328 NR4A3 GAUAAGAACAUGCAAAUCA 
NM_173198 1329 NR4A3 GGUAACAGAAUCAAGACUA NM_006981 1330 NR4A3 GAAGAUAACCAUGAGUAAA NM_006981 1331 NR4A3 GGACAGCUCUCUAGAAACU NM_173199 1332 NR4A3 GGUCAAACACUGCUGAAGA NM_173200 1333 NR4A3 CCAGAAAACUUGCAGAGUA NM_006981 1334 NR4A3 ACCCAGAGAUCUUGAUUAU NM_006981 1335 NR4A3 UCAAACAGGUGGUAACAGA NM_173198 1336 NR4A3 CGGAAUACACCACGGAGAU NM_173198 1337 NR5A1 UGAAAGACGGUCAGGAGAA NM_004959 1338 NR5A1 CCAAGGAGGUGGCUGUUAA NM_004959 1339 NR5A1 GAUUUGAAGUUCCUGAAUA NM_004959 1340 NR5A1 UCGAAAUGCUGCAAGCCAA NM_004959 1341 NR5A1 CUGAGUACCCGGAGCCUUA NM_004959 1342 NR5A1 GCAUGCAGGCCAAGGAGUA NM_004959 1343 NR5A1 UCGACAAGACGCAGCGCAA NM_004959 1344 NR5A1 AAACAGACAGGGAGAAGUU NM_004959 1345 NR5A1 GAGAAGUGCCCUUAAGGAU NM_004959 1346 NR5A1 CCACAGCCCUGGAAUAAAU NM_004959 1347 NR5A1 GCAGAAGAAGGCACAGAUU NM_004959 1348 NR5A1 CCCUGAAACAGCAGAAGAA NM_004959 1349 NR5A1 GGAGCGAGCUGCUGGUGUU NM_004959 1350 NR5A1 GGAUAACCGAGUUUGCUAA NM_004959 1351 NR5A1 GGGAGAAGUUGAGGAGGUA NM_004959 1352 NR5A1 GGACAGAUUUGAGGACUGA NM_004959 1353 NR5A1 GGAGACUGGUUAGGACAAA NM_004959 1354 NR5A1 GGGACAAAUUCCAGCAGCU NM_004959 1355 NR5A1 ACCAGAAGCAGCUGGGCAA NM_004959 1356 NR5A1 GUGAAAGACGCUCAGGAGA NM_004959 1357 NR5A1 UCUGCAAGGUUGUAGUCAA NM_004959 1358 NR5A1 GGGUGGGAAUGCAAGUGAA NM_004959 1359 NR5A1 AGGAGAACGUUUGGUACAA NM_004959 1360 NR5A1 CGAGUUUGCUAAAUUGAGA NM_004959 1361 NR5A1 GCAACAACCUGCUCAUCGA NM_004959 1362 NR5A1 UAACCGAGUUUGCUAAAUU NM_004959 1363 NR5A1 GGAACAAGUUUGGGCCGAU NM_004959 1364 NR5A1 CCACCUGACCUCUGCAAGA NM_004959 1365 NR5A1 CGAACGUGCCUGAGCUCAU NM_004959 1366 NR5A1 GAUCAGGGAUAGCGCUGUU NM_004959 1367 NR5A2 GCAAACAGCUAAUAGGAAA NM_003822 1368 NR5A2 GGAAAGGAAGAGAGUGAUA NM_003822 1369 NR5A2 UGGUAAAGCUGAACUGAAA NM_003822 1370 NR5A2 CUAAGUAGUUGGAAACAAA NM_003822 1371 NR5A2 GUAUGAAAGUCUUGCCUUA NM_003822 1372 NR5A2 AGAGAAAUUUGGACAGCUA NM_003822 1373 NR5A2 GGAAAUGACUACAAACUUU NM_003822 1374 NR5A2 GGUAGAAGGUGUCCAGGAA NM_003822 1375 NR5A2 GAGAAGUGGUACAUGGAAA NM_003822 1376 NR5A2 GAGAUAAAGUGUGUGGGUA NM_205860 1377 NR5A2 
GGAAAAGACUUGCUUGUAA NM_003822 1378 NR5A2 GGGAUGUGCCGUAUAAUAA NM_003822 1379 NR5A2 CCAAAGAACUGCCUAUAAU NM_003822 1380 NR5A2 CCAAUGUACAAGAGAGACA NM_003822 1381 NR5A2 UCAGAGAACUUAAGGUUGA NM_003822 1382 NR5A2 AAGAAUACCUCUACUACAA NM_003822 1383 NR5A2 CACAGGAGUUAGUGGCAAA NM_003822 1384 NR5A2 GAAAGGAUGAGGUGAUGUA NM_003822 1385 NR5A2 GUAUGCAGGCUGAAGAAUA NM_003822 1386 NR5A2 CCAGAGGAUUCCAUAUAAA NM_003822 1387 NR5A2 ACUUAAGGUUGAUGACCAA NM_205860 1388 NR5A2 GGCUAUUCAUAUAUGGAUA NM_003822 1389 NR5A2 UCAAAUAGUCACAGUUCUA NM_003822 1390 NR5A2 UCAACAACCUCAUGAGUCA NM_003822 1391 NR5A2 GGCCAAUGUACAAGAGAGA NM_205860 1392 NR5A2 GUGAAAGCUGCAAGGGAUU NM_003822 1393 NR5A2 GUGCAGAUGUGGAUCAACA NM_003822 1394 NR5A2 GCAAAUGCUCCAUAGCUAA NM_003822 1395 NR5A2 CAGGAAGUCUUGUUAGUAU NM_003822 1396 NR5A2 GGAAGAAGAACAGGAAGAA NM_003822 1397 NR6A1 CCACAGAAUUGCCAGAUAA NM_033334 1398 NR6A1 GGGAUGAACCGGAAGACUA NM_033334 1399 NR6A1 ACACAGAUUUAGUGAUGAA NM_033334 1400 NR6A1 CAGGAAAGAUGGUGAAUGU NM_033335 1401 NR6A1 GGGAACAGUACAUGGGAAU NM_001489 1402 NR6A1 AAGAAGAAAUCGAAAGGAU NM_001489 1403 NR6A1 AUGAAGAACUAGAGAGAUU NM_033335 1404 NR6A1 GGCAGGAGUUUGAGGAAGA NM_001489 1405 NR6A1 CCUCACAGCUGGAACAAUU NM_033334 1406 NR6A1 GAAUAAGAGCAUUGGGCCA NM_033335 1407 NR6A1 GUUGUUAAUUUGUGGAGUA NM_001489 1408 NR6A1 CCUAAGGGCAGAAGAAUUU NM_001489 1409 NR6A1 CCAGAUAUCGGAAGAAGAA NM_033334 1410 NR6A1 GAAGAAAUCGAAAGGAUCA NM_001489 1411 NR6A1 GAACAUGUCUCUCCAACAA NM_033334 1412 NR6A1 UAAACAGGGUAAAGUGAAU NM_001489 1413 NR6A1 GGAUGAACCGGAAGACUAU NM_033334 1414 NR6A1 AGUUUGAGGAAGAGGCCAA NM_001489 1415 NR6A1 AGAAAUUGCUCUCUGAUGA NM_033334 1416 NR6A1 CCAACAACGUAGUAUUGAA NM_033334 1417 NR6A1 CUGUGGAACUGAAUGGAUU NM_001489 1418 NR6A1 UCUCGGAAGCAGAGGAACA NM_001489 1419 NR6A1 CCGGAAGGCUAUCAGAGAA NM_001489 1420 NR6A1 CAGAGCAAGUUGUUAAUUU NM_001489 1421 NR6A1 GAAGAACUACACAGAUUUA NM_033334 1422 NR6A1 GACAGGCUCUGGAGGGAAA NM_001489 1423 NR6A1 AGAUAUCGGAAGAAGAAAU NM_033335 1424 NR6A1 GCACAGUGACCAUAUUUAA NM_033334 1425 NR6A1 GGGAACAGGGCUUCGGAGA NM_001489 
1426 NR6A1 AUGAAGGGAUGGAGGUGAU NM_033334 1427 PGR UGACUGAGCUGAAGGCAAA NM_000926 1428 PGR AGAUAAAGGAGGAGGAGGA NM_000926 1429 PGR CCUCAGAAGAUUUGUUUAA NM_000926 1430 PGR CGGUGGAGGUUGAGGAGGA NM_000926 1431 PGR GAGAUGAGGUCAAGCUACA NM_000926 1432 PGR GAACAGCGGAUGAAAGAAU NM_000926 1433 PGR ACAUAUUGAUGACCAGAUA NM_000926 1434 PGR CUACUUAUGUGCUGGAAGA NM_000926 1435 PGR GCUGUAAGGUCUUCUUUAA NM_000926 1436 PGR GCACAAUUACCCAAGAUAU NM_000926 1437 PGR GCUUCAAGUUAGCCAAGAA NM_000926 1438 PGR CUAAAUGAACAGCGGAUGA NM_000926 1439 PGR CUUAUGUGCUGGAAGAAAU NM_000926 1440 PGR GAAAUGAUGUCUGAAGUUA NM_000926 1441 PGR GCUGACAAGUCUUAAUCAA NM_000926 1442 PGR UGAUAAAAUCCGCAGAAAA NM_000926 1443 PGR CAUCUGUACUGCUUGAAUA NM_000926 1444 PGR GCCCUAAGCCAGAGAUUCA NM_000926 1445 PGR CACAAUUACCCAAGAUAUU NM_000926 1446 PGR CUCUAAAGAUAAAGGAGGA NM_000926 1447 PGR AAAUGAAAGCCAAGCCCUA NM_000926 1448 PGR AUUCAGUAUUCUUGGAUGA NM_000926 1449 PGR CAAUGGAAGGGCAGCACAA NM_000926 1450 PGR AGUCAUUACCUCAGAAGAU NM_000926 1451 PGR GCUUAAUGGUGUUUGGUCU NM_000926 1452 PGR UGAAGUUAUUGCUGCACAA NM_000926 1453 PGR CCUUGGAGGUCGAAAAUUU NM_000926 1454 PGR CUUAAUGGUGUUUGGUCUA NM_000926 1455 PGR GGUCAAGCUACAUUAGAGA NM_000926 1456 PGR UGGAAGGGCAGCAGAACUA NM_000926 1457 PPARA GGGAAACAUCCAAGAGAUU NM_005036 1458 PPARA CUGAAGAGUUCCUGCAAGA NM_005036 1459 PPARA GUUUAUAACUCGUGAAUUC NM_005036 1460 PPARA AGGAAACCGUUCUGUGAUA NM_005036 1461 PPARA CAGGAGAUCUACAGGGACA NM_005036 1462 PPARA CCAAUGGCAUCCAGAACAA NM_005036 1463 PPARA GGAUAGUUCUGGAAGCUUU NM_005036 1464 PPARA AGGAAAGGCCAGUAACAAU NM_005036 1465 PPARA GCACAAAUAUCCACGACUU NM_005036 1466 PPARA UCACGGAGCUCACGGAAUU NM_005036 1467 PPARA GAAGAGUUCCUGCAAGAAA NM_005036 1468 PPARA GAUCAAAGUGCCAGCAGAU NM_005036 1469 PPARA GACUCAAGCUGGUGUAUGA NM_005036 1470 PPARA GCUGGUAGCGUAUGGAAAU NM_005036 1471 PPARA GGAAAUGGGUUUAUAACUC NM_005036 1472 PPARA GCUUUGGCUUUACGGAAUA NM_005036 1473 PPARA GUGCAGAUGAUGAAGAAGA NM_005036 1474 PPARA GCAGGAGGGUAUUGUACAU NM_005036 1475 PPARA AGAGAAUCUACGAGGCCUA NM_005036 1476 
PPARA AAGCAAAACUGAAAGCAGA NM_005036 1477 PPARA CCAAGAUCUGAGAAAGCAA NM_005036 1478 PPARA UGAGAAAGCAAAACUGAAA NM_005036 1479 PPARA CGGAAUUCGCCAAGGCCAU NM_005036 1480 PPARA AGGCCUACUUGAAGAACUU NM_005036 1481 PPARA GCUGGGAAGUUCAAGAUCA NM_005036 1482 PPARA UCAACAUGAACAAGGUCAA NM_005036 1483 PPARA UGGGAAACAUCCAAGAGAU NM_005036 1484 PPARA CAAUCCACCUUUUGUCAUA NM_005036 1485 PPARA GCGUAUGGAAAUGGGUUUA NM_005036 1486 PPARD GGGAAGAGGAGGAGAAAGA NM_006238 1487 PPARD GCAUGAAGCUGGAGUACGA NM_177435 1488 PPARD AGAAAGAGGAAGUGGCAGA NM_006238 1489 PPARD GCUGCAAGAUUCAGAAGAA NM_177435 1490 PPARD GAGCACAGAGGUAGGAGAA NM_006238 1491 PPARD AGGAAGUGGCAGAGGCAGA NM_006238 1492 PPARD GUGAUAUCAUUGAGCCUAA NM_177435 1493 PPARD GCAAGAUUCAGAAGAAGAA NM_006238 1494 PPARD CGCAAACCGUUCAGUGAUA NM_006238 1495 PPARD GGGACAAGGCAUCGGGCUU NM_006238 1496 PPARD GGAGAGUGUUGUACAGUGU NM_006238 1497 PPARD ACACAUAAGCACUGAAAUC NM_006238 1498 PPARD UGGAAAGCAGGGUCAGAUA NM_006238 1499 PPARD CUACAAUGCCUACCUGAAA NM_177435 1500 PPARD GGAGAAAGAGGAAGUGGCA NM_177435 1501 PPARD CAGAAGAAGAACCGCAACA NM_006238 1502 PPARD GAACAGGACCUCUGCUUUU NM_006238 1503 PPARD GCACAGAGGUAGGAGAACU NM_006238 1504 PPARD GGUCUGGAGUGGUCUGGAA NM_006238 1505 PPARD AGGAGGAGAAAGAGGAAGU NM_006238 1506 PPARD UGUGGAAGCAGUUGGUGAA NM_006238 1507 PPARD AGACAUUGUGGCAGGCAGA NM_177435 1508 PPARD UCUCCUGUCUUCAGAGCAA NM_006238 1509 PPARD AGAGGAGGAGAAAGAGGAA NM_006238 1510 PPARD CCGGGAAGAGGAGGAGAAA NM_006238 1511 PPARD CAAGAGAUGAAGACAGAUG NM_006238 1512 PPARD GUGUGGAAGCAGUUGGUGA NM_177435 1513 PPARD AGAAGGGGCUGGUGUGGAA NM_006238 1514 PPARD ACCAACAGAUGAAGACAGA NM_006238 1515 PPARD CUUGAGCCAUCCAAAGAAA NM_006238 1516 PPARG AGAUAAAGCUUCUGGAUUU NM_015869 1517 PPARG AGGAAAGAGAACAGACAAA NM_005037 1518 PPARG UGAAAGAAGCCGACACUAA NM_138712 1519 PPARG UGAAAGAAGCCAACACUAA NM_138711 1520 PPARG GCAAACAUAUCACAAGAAA NM_015869 1521 PPARG AGACUCAGCUCUACAAUAA NM_005037 1522 PPARG GUACCAAAGUGCAAUCAAA NM_005037 1523 PPARG CCUAAGAAAUUUACUGUGA NM_015869 1524 PPARG GGCCAAGGCUUCAUGACAA 
NM_005037 1525 PPARG GAGUCAGCCUUUAACGAAA NM_138711 1526 PPARG UCAAGAAGACGGAGACAGA NM_005037 1527 PPARG AAUGAUGGGAGAAGAUAAA NM_015869 1528 PPARG GAAAGAAGCCAACACUAAA NM_138711 1529 PPARG GUUCAAUGCACUGGAAUUA NM_005037 1530 PPARG GAAAGAAGCCGACACUAAA NM_138712 1531 PPARG ACAAUAAGCCUCAUGAAGA NM_005037 1532 PPARG CAGAAAUGACCAUGGUUGA NM_005037 1533 PPARG CCAAGUAACUCUCCUCAAA NM_015869 1534 PPARG GAGAUAAAGCUUCUGGAUU NM_005037 1535 PPARG GGGCGAUCUUGACAGGAAA NM_005037 1536 PPARG GUGAAGGAUGCAAGGGUUU NM_015869 1537 PPARG CAAGAGUACCAAAGUGCAA NM_015869 1538 PPARG GGAUUGAUCUUUUGCUAGA NM_015869 1539 PPARG GCAGGAGAUCACAGAGUAU NM_005037 1540 PPARG UGACAAGGGAGUUUCUAAA NM_005037 1541 PPARG ACACUAAACCACGAAUAUA NM_138712 1542 PPARG AGGAGAAGCUGUUGGCGGA NM_138712 1543 PPARG CAACACUAAACCACAAAUA NM_138711 1544 PPARG CAGGAAAGACAACAGACAA NM_005037 1545 PPARG UAAUGAUGGGAGAAGAUAA NM_005037 1546 RARA CCAAGGAGUCUGUGAGAAA NM_000964 1547 RARA GAGCAGUUCUGAAGAGAUA NM_000964 1548 RARA GAACAACAGCUCAGAACAA NM_000964 1549 RARA GCAAAUACACUACGAACAA NM_000964 1550 RARA ACUGAGAUCUACUGGAUAA NM_000964 1551 RARA CCAAGAUGCUAAUGAAGAU NM_000964 1552 RARA AGAAGAAGGAGGUGCCCAA NM_000964 1553 RARA GAAACAAGAAGAAGAAGGA NM_000964 1554 RARA AGGAAAUGUUGGAGAACUC NM_000964 1555 RARA AGGAAUUUGUGCUGUGUAU NM_000964 1556 RARA GCACAUUUAUACUGAAGGA NM_000964 1557 RARA UCAUUAAGACUGUGGAGUU NM_000964 1558 RARA GACCGAAACAAGAAGAAGA NM_000964 1559 RARA CCACCAAGUGCAUCAUUAA NM_000964 1560 RARA GGCAAAUACACUACGAAGA NM_000964 1561 RARA UCAUUGAGAAGGUGCGCAA NM_000964 1562 RARA GUGAGAAACGACCGAAACA NM_000964 1563 RARA GACAAGAACUGCAUCAUCA NM_000964 1564 RARA GAGAUCUACUGGAUAAAGA NM_000964 1565 RARA GCAAAGCGCACCAGGAAAC NM_000964 1566 RARA CCACUGAGAUCUACUGGAU NM_000964 1567 RARA CGCAAAGCGCACCAGGAAA NM_000964 1568 RARA CUGUGAGAAACGACCGAAA NM_000964 1569 RARA GAAACGACCGAAACAAGAA NM_000964 1570 RARA GUAGAUGGGCCGACACACA NM_000964 1571 RARA ACAUUGACCUCUGGGACAA NM_000964 1572 RARA UGGAGGCGCUAAAGGUCUA NM_000964 1573 RARA ACCGAAACAAGAAGAAGAA NM_000964 1574 
RARA UCAACAAGGUGACGCGGAA NM_000964 1575 RARA AGAACUGCAUCAUCAACAA NM_000964 1576 RARB ACACCAAGGUUAUGAAAUA NM_016152 1577 RARB AGGGAAAUUUCAUGGGAUA NM_016152 1578 RARB AGAUAAGGCUGAAGAUAUU NM_016152 1579 RARB UCACAAGCCAUUAGGGAAA NM_016152 1580 RARB CGAGAUAAGAACUGUGUUA NM_000965 1581 RARB CCAAAGAAUCUGUCAGGAA NM_016152 1582 RARB GUAGAGUGGUUAACAGAUA NM_016152 1583 RARB GCAGAAGUAUUCAGAAGAA NM_016152 1584 RARB UGGCAAAGACCCAGUCAAA NM_000965 1585 RARB GCACAGAGAGCUAUGAAAU NM_000965 1586 RARB GGAGAAUUCUGAAGGACAU NM_000965 1587 RARB GGUAAAUACACCACGAAUU NM_000965 1588 RARB AAGAAAUGCUGGAGAAUUC NM_000965 1589 RARB AAGAAUGCACAGAGAGCUA NM_016152 1590 RARB GGACAAAUCAUCAGGGUAC NM_000965 1591 RARB AAGAUAAGGCUGAAGAUAU NM_000965 1592 RARB AGAAUUUGCAGCAGGUAUA NM_000965 1593 RARB GGAAAAUGGUAAAUGAUCA NM_000965 1594 RARB GGAAAUGGAUGACACAGAA NM_016152 1595 RARB CAAAGAAUCUGUCAGGAAU NM_000965 1596 RARB UCACAGAGAAGAUCCGAAA NM_000965 1597 RARB AAGCAAGAAUGCACAGAGA NM_000965 1598 RARB AGUCACCACUCGUGCAAUA NM_016152 1599 RARB GCAAAGACCCAGUCAAAAU NM_000965 1600 RARB UUAUUAAGAUCGUGGAGUU NM_016152 1601 RARB CCGAAAAGCUCACCAGGAA NM_000965 1602 RARB GGAAACAGGACUAUUGACA NM_016152 1603 RARB GGACUAUUGUACAGUAUGA NM_000965 1604 RARB CAAGAAGGACCAAGAAGUU NM_000965 1605 RARB GAACAAGACACCAUGACUU NM_016152 1606 RARG AGAAAGAGGUGAAGGAAGA NM_000966 1607 RARG CCAAGGAAGCUGUGCGAAA NM_000966 1608 RARG CUGAAAUGUUUGAGGAUGA NM_000966 1609 RARG GGAACAAGAAGAAGAAAGA NM_000966 1610 RARG AGAAGAAGAAAGAGGUGAA NM_000966 1611 RARG GGUGAAAGGGACAGAUAGA NM_000966 1612 RARG CGAAAUGACCGGAACAAGA NM_000966 1613 RARG CAGGAACACUAGAGACCAA NM_000966 1614 RARG GCUCAAAGCUGCCUGCCUA NM_000966 1615 RARG AGAAGAAAGAGGUGAAGGA NM_000966 1616 RARG GGUGAAGGAAGAAGGGUCA NM_000966 1617 RARG UAGAAGAGCUCAUCACCAA NM_000966 1618 RARG UCAUCAAGAUCGUGGAGUU NM_000966 1619 RARG GCCGAAGCAUCCAGAAGAA NM_000966 1620 RARG UCAACAAGGUGACCAGGAA NM_000966 1621 RARG AGAGCAAACAGGAACACUA NM_000966 1622 RARG AUAAAGUGAUGGAAACUCA NM_000966 1623 RARG GGAGAACCCUGAAAUGUUU NM_000966 1624 
RARG GAAAUGACCGGAACAAGAA NM_000966 1625 RARG GCACCAGCUCAGAGGAGAU NM_000966 1626 RARG CAAGGAUGCUAAUGAAAAU NM_000966 1627 RARG CAUCAAGAUCGUGGAGUUU NM_000966 1628 RARG CCAAGGAUGCUAAUGAAAA NM_000966 1629 RARG AGCCAUGCUUCGUGUGGAA NM_000966 1630 RARG GAAGAAAGAGGUGAAGGAA NM_000966 1631 RARG CGGAACAAGAAGAAGAAAG NM_000966 1632 RARG UCCGAGAGAUGCUGGAGAA NM_000966 1633 RARG CAACAAGGUGACCAGGAAU NM_000966 1634 RARG CGAAGCAUCCAGAAGAACA NM_000966 1635 RARG CAAGGAAGCUGUGCGAAAU NM_000966 1636 RORA AGGAAGAAAUUGAGAACUA NM_134260 1637 RORA GGAAAGAGUUUAUGUUGUA NM_002943 1638 RORA GAGAAGAGCUGCAGCAGAU NM_002943 1639 RORA AUGCAGAAGUACAGAAACA NM_134260 1640 RORA CCGAGAAGAUGGAAUACUA NM_134262 1641 RORA CAAGAUCUGUGGAGACAAA NM_134260 1642 RORA GCUCAUGGCUGCAAGAAAA NM_002943 1643 RORA GGGUAAAUGUUAUCACCUA NM_002943 1644 RORA AAUGCAAAUUGAUGGGUAA NM_134260 1645 RORA GCACCUGACUGAAGAUGAA NM_134260 1646 RORA GGUAAAUGUUAUCACCUAA NM_134262 1647 RORA GUAACGAAGAAGACACAUA NM_134261 1648 RORA GGAGACAAAUCAUCAGGAA NM_134260 1649 RORA UCACCGAGAAGAUGGAAUA NM_134260 1650 RORA AAUCAAACCAGAACCAAUA NM_002943 1651 RORA GAAGAAUCACCGAGAAGAU NM_002943 1652 RORA GGAGAAGUCAGCAAAGCAA NM_002943 1653 RORA GAAUCAAACCAGAACCAAU NM_134260 1654 RORA UAGAAUGUCUGAAGUACAA NM_002943 1655 RORA CCGUGUACUUUGAUGGGAA NM_134260 1656 RORA GGAAGAUAAGGAAGUACAA NM_134260 1657 RORA AGACAAAUCAUCAGGAAUC NM_134260 1658 RORA CAGCAGAUAACGUGGCAGA NM_134262 1659 RORA CCAAACGCAUUGAUGGAUU NM_134260 1660 RORA CAGCAGAGGUAUCUCAGUA NM_134261 1661 RORA GCAGAUGGGUAACCAGGAA NM_134260 1662 RORA CAGCCAAGAACUUUAUUAA NM_002943 1663 RORA CAGCUUGUAUGCAGAAGUA NM_134260 1664 RORA CGUCAGAAGAACUGUUUGA NM_134260 1665 RORA GAGACAAAUCAUCAGGAAU NM_134260 1666 RORB GGGAGAAGCUGCAGGUAUU NM_006914 1667 RORB GCUUAUAGAACCAAGGAAA NM_006914 1668 RORB GCAGUUAGCACCAGGGAUA NM_006914 1669 RORB GGGAUAACCAUGACUGAAA NM_006914 1670 RORB CAAGCACAUUGGAGAGAAA NM_006914 1671 RORB UGACAUGACUGGAAUCAAA NM_006914 1672 RORB GGAAGUGGUUUUAGUGAGA NM_006914 1673 RORB GAACAGAAACCGUUGCCAA NM_006914 1674 
RORB CUGAUGACCUAGUGAAUGA NM_006914 1675 RORB AGCACAAAUUGAAGUGAUA NM_006914 1676 RORB UGACCAAACAGUAGAUAUU NM_006914 1677 RORB UGAUUUACCUUCUGUGUUA NM_006914 1678 RORB UCAAACAGAUAAAGCAAGA NM_006914 1679 RORB UGCCCAAGUCUGAGGGUUA NM_006914 1680 RORB GUGUACAGCAGCAGCAUUA NM_006914 1681 RORB AGAAUUAGGCCACAGAUAA NM_006914 1682 RORB AGAUAAAGCAAGAACCUAU NM_006914 1683 RORB GCAAAUGUUCAAAGCGUUA NM_006914 1684 RORB GGAAGGAGACAGUAUUUUA NM_006914 1685 RORB GUUCAAAGCCUUAGGUUCU NM_006914 1686 RORB UGACUGAAAUCGACCGAAU NM_006914 1687 RORB GCAGAAGUGUCUUGGCGUA NM_006914 1688 RORB GGGAGCAGCUUCAUGACUA NM_006914 1689 RORB AUGAUGAGACCUUGGCAAA NM_006914 1690 RORB CAAGAGAUGCUGUGAAGUU NM_006914 1691 RORB GCAUCACCAUUAAGACAAA NM_006914 1692 RORB GUCCAGAAGCUUCAGGAAA NM_006914 1693 RORB GAGCCAGAGUGAAGAAGUA NM_006914 1694 RORB GGAGGAGGCAGCAGAACAA NM_006914 1695 RORB GGAGACAACUGUUUAUAGA NM_006914 1696 RORC UAGAACAGCUGCAGUACAA NM_005060 1697 RORC UGGAAAGGCUGCAGAUCUU NM_005060 1698 RORC GGGAGGAAGUGACUGGCUA NM_005060 1699 RORC UCCAAGAAGCAGAGGGACA NM_005060 1700 RORC AGGAAGUCCAUGUGGGAGA NM_005060 1701 RORC GGCCAGGGCUCCAAGAGAA NM_005060 1702 RORC CUGCAAGACUCAUCGCCAA NM_005060 1703 RORC CCUCAUGCCACCUUGAAUA NM_005060 1704 RORC AAGUAGAACAGCUGCAGUA NM_005060 1705 RORC UGACAGAGAUAGAGCACCU NM_005060 1706 RORC UCUCAAAGCAGGAGCAAUG NM_005060 1707 RORC GCCAGGGCUCCAAGAGAAA NM_005060 1708 RORC GCUCCAAGAGAAAAGGAAA NM_005060 1709 RORC AGAGAAAAGGAAAGUAGAA NM_005060 1710 RORC GGACAAGUCGUGUGGGAUC NM_005060 1711 RORC GAAGUGAUCCCUUGCAAAA NM_005060 1712 RORC CCUCAUAUUCCAACAACUU NM_005060 1713 RORC GAGCAGAUACCCUCACCUA NM_005060 1714 RORC GGAAAGUAGAACAGCUGCA NM_005060 1715 RORC UGCAAGUCCUACAGGGAGA NM_005060 1716 RORC AGAAAAGGAAAGUAGAACA NM_005060 1717 RORC GGGAGCUGCUGGCUGCAAA NM_005060 1718 RORC UGUCCAAGAAGCAGAGGGA NM_005060 1719 RORC AAGCAGGAGCAAUGGAAGU NM_005060 1720 RORC CAUCAAUGCCCAUCGGCCA NM_005060 1721 RORC AGGCAAAUACGGUGGCAUG NM_005060 1722 RORC GCAGAUACCCUCACGUACA NM_005060 1723 RORC GGGCCUACAAUGCUGACAA NM_005060 1724 
RORC GUGGAAAGGCUGCAGAUCU NM_005060 1725 RORC GGGAGAAGUCGUGUGGGAU NM_005060 1726 RXRA AGACCUAGGUGGAGGOAAA NM_002957 1727 RXRA GCAUUAGAAUUGUGGAAAA NM_002957 1728 RXRA GAGUAGAGAGGUAGAAUUU NM_002957 1729 RXRA ACGCAUAGCUAAUACUUUA NM_002957 1730 RXRA GGACAGUAGCAUUAGAAUU NM_002957 1731 RXRA GGGACAUGCAGAUGGACAA NM_002957 1732 RXRA CCUCAUGUAUACUUGGAUA NM_002957 1733 RXRA GAGAAAAUGUCUAAAGCAU NM_002957 1734 RXRA GCAUAAAGAGAGUAAAGAU NM_002957 1735 RXRA UGACGGAGCUUGUGUCCAA NM_002957 1736 RXRA UGAGAGAUCCAUAAAGAGA NM_002957 1737 RXRA GAGAGUAAAGAUAAGAGAA NM_002957 1738 RXRA GGAACGAGAAUGAGGUGGA NM_002957 1739 RXRA UCGCAGACAUGGACACCAA NM_002957 1740 RXRA AACCAGACCUGUAGUAGUA NM_002957 1741 RXRA CAACAAGGACUGCCUGAUU NM_002957 1742 RXRA GAUGACAGAUCCAUAAAGA NM_002957 1743 RXRA GCUCAAAUGCCUGGAACAU NM_002957 1744 RXRA GGCGCAAGCUGGUGUGUCA NM_002957 1745 RXRA CCUUGGAGGCCUACUGCAA NM_002957 1746 RXRA AAAGAUAGCACUAACAUCA NM_002957 1747 RXRA CGAACGACCCUGUCACCAA NM_002957 1748 RXRA GCAAGGACCGGAACGAGAA NM_002957 1749 RXRA UAAAGCAUCUGGAAAGGUA NM_002957 1750 RXRA AGCCCAAGACCGAGACCUA NM_002957 1751 RXRA GGCAAGGACCGGAACGAGA NM_002957 1752 RXRA AGAGGAUCCUGGAGGCUGA NM_002957 1753 RXRA GAGAUUGAGGUGAAAGCUU NM_002957 1754 RXRA GCAGGAGGGUGCAGGUACU NM_002957 1755 RXRA UUGAUGGACAGUAGCAUUA NM_002957 1756 RXRB GGACAGAAGCUCAGGCAAA NM_021976 1757 RXRB CAAAUGACCGUGUGACUAA NM_021976 1758 RXRB GGGCAAUCAUUCUGUUUAA NM_021976 1759 RXRB GGAGAAAGUGUAUGCAUCA NM_021976 1760 RXRB GGAUAGAGACUUGCAGUUA NM_021976 1761 RXRB GGGCUUAAUUCGACGCAAU NM_021976 1762 RXRB GGUGGGAGGGGGAGAACAA NM_021976 1763 RXRB CGGUACAGGAGGAGCGUCA NM_021976 1764 RXRB GGCAGAACCAAGAACAUAA NM_021976 1765 RXRB GGGCAGAACCAAGAACAUA NM_021976 1766 RXRB GAGGUGGGUGUUUGGGAAA NM_021976 1767 RXRB CAGAUUAACUCAACAGUGU NM_021976 1768 RXRB CCACAUUGGCGCUGUCAUU NM_021976 1769 RXRB GCACAGUGGACAAGCGCCA NM_021976 1770 RXRB GGGAGAAAGUGUAUGCAUC NM_021976 1771 RXRB UGGAGAAGCGCCAGCGGAA NM_021976 1772 RXRB GGCAAACGGGUAUGUGCAA NM_021976 1773 RXRB GAGCAGAGCGACGGGCUUA NM_021976 1774 
RXRB CAGCUAAUCCUCCGAUCAA NM_021976 1775 RXRB GUGACAUGAGGAUGGACAA NM_021976 1776 RXRB AGAAGUACCCUGAGCAGCA NM_021976 1777 RXRB GCAGAACCAAGAACAUAAA NM_021976 1778 RXRB GUACAGGGCAGAACCAAGA NM_021976 1779 RXRB UCUUCAAGCUCAUUGGUGA NM_021976 1780 RXRB CAGUUAGACUCAAAGAAGU NM_021976 1781 RXRB CCAAAUGACCCUGUGACUA NM_021976 1782 RXRB CCGCAAAGACCUUACAUAC NM_021976 1783 RXRB GCACUGGCAUGAAGAGGGA NM_021976 1784 RXRB CCUGGGACCUCUUCGGUAU NM_021976 1785 RXRB GGGACAGAAGCUCAGGCAA NM_021976 1786 RXRG GAGUAAACAUGUAUGGAAA NM_006917 1787 RXRG GAACAUGAACUGACGAGUA NM_006917 1788 RXRG CGAGAGAGGCGGUAAUAUU NM_006917 1789 RXRG AGACAGAAUCCUAUGGUGA NM_006917 1790 RXRG GGAUUGGAAACAUGAACUA NM_006917 1791 RXRG GCACAGGAGAGGAACAUGA NM_006917 1792 RXRG CGAGUAAACAUGUAUGGAA NM_006917 1793 RXRG AACAUGAACUGACGAGUAA NM_006917 1794 RXRG GAGGCAGAUUCCUGACUAA NM_006917 1795 RXRG UGAAAGACAUGCAGAUGGA NM_006917 1796 RXRG GAGCUGAGAGUGAGGCAGA NM_006917 1797 RXRG GAAGAGGGAAGCUGUGCAA NM_006917 1798 RXRG GGGAAGCUGUGCAAGAAGA NM_006917 1799 RXRG GUGCAAGAAGAAAGACAGA NM_006917 1800 RXRG ACAGAGGAGCCGAGAGCGA NM_006917 1801 RXRG CGGGCAGGGUGGAAUGAAU NM_006917 1802 RXRG UUGAAUGGGCCAAGCGUAU NM_006917 1803 RXRG AAGACAUGGAGAUGGACAA NM_006917 1804 RXRG CAAAUGACCCUGUUACCAA NM_006917 1805 RXRG GGAGAGGAUUGUAGAAGCU NM_006917 1806 RXRG GUGACAUGAAUAUGGAGAA NM_006917 1807 RXRG AGAGGGAAGCUGUGCAAGA NM_006917 1808 RXRG GGAAAUUAUUCUCACUUCA NM_006917 1809 RXRG GAGAGCGAGCUGAGAGUGA NM_006917 1810 RXRG GGGCAGGGUGGAAUGAAUU NM_006917 1811 RXRG UGAACUUGCUGUUGAACCA NM_006917 1812 RXRG GAUAAUAAAGACUGCCUCA NM_006917 1813 RXRG GAUUCUAGAAGCUGAACUU NM_006917 1814 RXRG GGAAGCUGUGCAAGAAGAA NM_006917 1815 RXRG AUAUGGAGAACUCGACAAA NM_006917 1816 THRA GGAUGGAAUUGAAGUGAAU NM_003250 1817 THRA GGACAAGGCAACUGGUUAU NM_003250 1818 THRA GGACAAGACUCCGAAGCUA NM_003250 1819 THRA GGGACAAGGCAACUGGUUA NM_003250 1820 THRA AGAGAGAAGUGCAGAGUUC NM_003250 1821 THRA GGCCCAAGCUGCUGAUGAA NM_003250 1822 THRA GCUACGACCCUGAGAGCGA NM_003250 1823 THRA CUAGUUACCUGGACAAAGA NM_003250 1824 
THRA ACUCGAAGCGGGUGGCCAA NM_003250 1825 THRA CACCAGAUGGAAAGCGAAA NM_003250 1826 THRA UGGAAACAGAGGCGGAAAU NM_003250 1827 THRA CGGCCAAUGUUCCCUGAAA NM_003250 1828 THRA GGAAUUGAAGUGAAUGGAA NM_003250 1829 THRA GUAAGCUGAUUGAGCAGAA NM_003250 1830 THRA UGAAGUGAAUGGAACAGAA NM_003250 1831 THRA COAGGUCACCAGAUGGAAA NM_003250 1832 THRA CGCAGGGCAUGGUGUGAAA NM_003250 1833 THRA AGAAGAGUCAGGAGGCGUA NM_003250 1834 THRA GCACAAUCCAGAAGAACCU NM_003250 1835 THRA ACAAGAUCGAGAAGAGUCA NM_003250 1836 THRA UGGAUGGAAUUGAAGUGAA NM_003250 1837 THRA GGAGAACAGUGCCAGGUCA NM_003250 1838 THRA GGAAACAGAGGCGGAAAUU NM_003250 1839 THRA UGGACAAGAUCGAGAAGAG NM_003250 1840 THRA CGUUCGAGCACUACGUCAA NM_003250 1841 THRA ACAAAGACGAGCAGUGUGU NM_003250 1842 THRA CCAUCUUUGAACUGGGCAA NM_003250 1843 THRA GAACAGAAGCCAAGCAAGG NM_003250 1844 THRA GUUCAUGCUUCUACUGUGA NM_003250 1845 THRA GCUUCUACUGUGACACUUA NM_003250 1846 THRB UGGAAGUGUUCGAGGAUUA NM_000461 1847 THRB UAGAAGAACCAUUCAGAAA NM_000461 1848 THRB CUACAUAGGAAGAGCCAUU NM_000461 1849 THRB UGACAAAGCCACCGGGUAU NM_000461 1850 THRB CCUGUAAAUAUGAAGGAAA NM_000461 1851 THRB GGAAGCUGAUAGAGGAGAA NM_000461 1852 THRB UAGACAAAGUCACGCGAAA NM_000461 1853 THRB GAAGAAAUGUAAAGGGUAC NM_000461 1854 THRB GGAAUGUCGCUUUAAGAAA NM_000461 1855 THRB GGGAAGAGCUGCAGAAGUC NM_000461 1856 THRB AGGAAGAGCCAUUCAGAGA NM_000461 1857 THRB ACUGGAAGCUAGUAGGAAU NM_000461 1858 THRB CCUUACAGCUUGGGACAAA NM_000461 1859 THRB GCCAGAAGACAUUGGACAA NM_000461 1860 THRB UGUAGUGUGUGGUGACAAA NM_000461 1861 THRB CGGAGGAGAAGAAAUGUAA NM_000461 1862 THRB GAUCAACUUUGCAUGAAUA NM_000461 1863 THRB GGCCAAAACUCCUGAUGAA NM_000461 1864 THRB CGAAAUCAGUGCCAGGAAU NM_000461 1865 THRB GGACAAGCACCAAUAGUCA NM_000461 1866 THRB CAGAUUUGGUGCUGGAUGA NM_000461 1867 THRB GAGAAGAAAUGUAAAGGGU NM_000461 1868 THRB GCUAUGACCCAGAAAGUGA NM_000461 1869 THRB GCUAUGACCCGGAAAGUGA NM_000461 1870 THRB CCCAACAGUAUGACAGAAA NM_000461 1871 THRB AGAAGAAAUGUAAAGGGUA NM_000461 1872 THRB GGAUUAGACUGACUGGAUU NM_000461 1873 THRB GAAUGUCGCUUUAAGAAAU NM_000461 1874 
THRB CAGAAUGAUUACUAACCUA NM_000461 1875 THRB CGCAGAAGGUGGAAAGGUU NM_000461 1876 VDR CAUUAUAACUGAAGGAGAA NM_000376 1877 VDR AGAAUGAGAGUGUGAAAUA NM_000376 1878 VDR GGAUGGAGGAGAAGAAUUU NM_000376 1879 VDR CCACUGAUUUGGAGAUAUU NM_000376 1880 VDR GUGCAGAGGAAGCGGGAGA NM_000376 1881 VDR AUGAGGAAGUGCAGAGGAA NM_000376 1882 VDR GGAGAAGGAAGGAGACUCA NM_000376 1883 VDR CAACAGAGAAGGCAGGAAU NM_000376 1884 VDR GAAGUUACAUUGUGAAACU NM_000376 1885 VDR AGUGAAAGCUAGAGAUAUG NM_000376 1886 VDR CAACCAAUGUAGAAAGCUU NM_000376 1887 VDR GGAAUGUGUGGCAGAUUUA NM_000376 1888 VDR GCACUUACUUUGUUUGCAA NM_000376 1889 VDR GCGAAAGGAUGUAAACAGU NM_000376 1890 VDR GCUCGAAGUGUUUGGCAAU NM_000376 1891 VDR UUAGUGAAAGCUAGAGAUA NM_000376 1892 VDR GGAACAGACAGGAGAAAUG NM_000376 1893 VDR AGAUGAUCCUGAAGCGGAA NM_000376 1894 VDR CCAACACACUGCAGACGUA NM_000376 1895 VDR GAACCCACCUGCUGAGAGA NM_000376 1896 VDR UCUAGGAGCUGGGAGGAAA NM_000376 1897 VDR CAAGGGAGGUCAACAGAGA NM_000376 1898 VDR GGGAGAACUUACAUUGUGA NM_000376 1899 VDR UGUGGCAGAUUUAGUGAAA NM_000376 1900 VDR GCAGAUUUAGUGAAAGCUA NM_000376 1901 VDR AGGAAUGUGUGGCAGAUUU NM_000376 1902 VDR UGAAUGAUUUCCAAAGAGA NM_000376 1903 VDR CUAAGUGGCUGCUGACUGA NM_000376 1904 VDR GCUGAAUGAUUUCCAAAGA NM_000376 1905 VDR GAGAGGGUCUGGAGAAGCA NM_000376
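
The entries tabulated above follow a fixed four-field pattern: sequence identifier, gene name, 19-base sense sequence, and NCBI accession number. By way of illustration only, the following Python sketch reads text in that layout into records and flags any sequence containing a character other than A, C, G, or U. The names `Entry`, `parse_entries`, and `invalid_entries` are hypothetical, and the regular expression assumes whitespace-separated fields and accession numbers beginning with NM_ or XM_, as they appear in the listing.

```python
import re
from typing import List, NamedTuple


class Entry(NamedTuple):
    seq_id: int      # SEQ. ID NO.
    gene: str        # gene name, e.g. "VDR"
    sequence: str    # 19-base sense sequence
    accession: str   # NCBI accession number, e.g. "NM_000376"


# Assumed layout: "<SEQ ID> <gene> <19 letters> <NM_/XM_ accession>".
# The sequence field accepts any capital letter so that damaged entries
# are still parsed and can be reported rather than silently skipped.
ENTRY_RE = re.compile(r"\b(\d+)\s+([A-Z0-9]+)\s+([A-Z]{19})\s+([NX]M_\d+)")


def parse_entries(text: str) -> List[Entry]:
    """Parse every four-field entry found in the text."""
    return [Entry(int(m[1]), m[2], m[3], m[4]) for m in ENTRY_RE.finditer(text)]


def invalid_entries(entries: List[Entry]) -> List[Entry]:
    """Return the entries whose sequence contains a non-RNA character."""
    return [e for e in entries if set(e.sequence) - set("ACGU")]


sample = ("1878 VDR GGAUGGAGGAGAAGAAUUU NM_000376 "
          "1879 VDR CCACUGAUUUGGAGAUAUU NM_000376")
for entry in parse_entries(sample):
    print(entry.seq_id, entry.gene, entry.sequence, entry.accession)
```

A check of this kind reports a doubtful base without guessing what the intended base was.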

Thus, consistent with Example XVII, the present invention provides an siRNA that targets a nuclear receptor, wherein the siRNA is selected from the group consisting of SEQ. ID NOs. 438-1905.

In another embodiment, an siRNA is provided, said siRNA comprising a sense region and an antisense region, wherein said sense region and said antisense region are at least 90% complementary, said sense region and said antisense region together form a duplex region comprising 18-30 base pairs, and said sense region comprises a sequence that is at least 90% similar to a sequence selected from the group consisting of: SEQ. ID NOs 438-1905.
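
By way of illustration only, the percentage limitations of the foregoing embodiment can be evaluated as in the following Python sketch. The scoring rule is an assumption made for the example and is not a definition taken from this specification: both percentages are computed by an ungapped, position-by-position comparison of regions of equal length, and complementarity is limited to Watson-Crick pairs. The function names are hypothetical.

```python
# Watson-Crick pairs for RNA; G-U wobble pairs are deliberately not counted.
PAIRS = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna: str) -> str:
    """Return the reverse complement of an RNA sequence written 5'->3'."""
    return "".join(PAIRS[base] for base in reversed(rna))


def percent_similarity(a: str, b: str) -> float:
    """Percentage of positions at which two equal-length sequences agree."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return 100.0 * sum(x == y for x, y in zip(a, b)) / len(a)


def percent_complementarity(sense: str, antisense: str) -> float:
    """Percentage of antisense bases that pair with the sense region."""
    return percent_similarity(reverse_complement(sense), antisense)


listed = "GGAUGGAGGAGAAGAAUUU"          # SEQ. ID NO. 1878 (VDR)
sense = listed
antisense = reverse_complement(sense)   # a fully complementary antisense region

duplex_ok = 18 <= len(sense) <= 30
similar_ok = percent_similarity(sense, listed) >= 90.0
complementary_ok = percent_complementarity(sense, antisense) >= 90.0
print(duplex_ok and similar_ok and complementary_ok)
```

Under this assumed rule a 19-base region tolerates one mismatch (18 of 19 positions) but not two (17 of 19 positions) at the 90% threshold.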

In another embodiment, an siRNA is provided wherein the siRNA comprises a sense region and an antisense region, wherein said sense region and said antisense region are at least 90% complementary, said sense region and said antisense region together form a duplex region comprising 18-30 base pairs, and said sense region comprises a sequence that is identical to a contiguous stretch of at least 18 bases of a sequence selected from the group consisting of: SEQ. ID NOs 438-1905.

In another embodiment, an siRNA is provided wherein the siRNA comprises a sense region and an antisense region, wherein said sense region and said antisense region are at least 90% complementary, said sense region and said antisense region together form a duplex region comprising 19-30 base pairs, and said sense region comprises a sequence that is identical to a contiguous stretch of at least 18 bases of a sequence selected from the group consisting of: SEQ. ID NOs 438-1905.
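
The "contiguous stretch of at least 18 bases" limitation of the two preceding embodiments can be checked as in the following Python sketch, which is offered as an illustration under stated assumptions and not as a prescribed method. It computes the longest run of bases shared by a candidate sense region and a listed sequence; the function names and the example candidates are hypothetical.

```python
def longest_shared_stretch(sense: str, listed: str) -> int:
    """Length of the longest contiguous run of bases present in both sequences."""
    best = 0
    # prev[j] is the length of the shared run ending at sense[i-2] and listed[j-1].
    prev = [0] * (len(listed) + 1)
    for i in range(1, len(sense) + 1):
        cur = [0] * (len(listed) + 1)
        for j in range(1, len(listed) + 1):
            if sense[i - 1] == listed[j - 1]:
                cur[j] = prev[j - 1] + 1
                best = max(best, cur[j])
        prev = cur
    return best


def has_identical_stretch(sense: str, listed: str, min_len: int = 18) -> bool:
    """True if the sense region contains at least min_len contiguous listed bases."""
    return longest_shared_stretch(sense, listed) >= min_len


listed = "GGAUGGAGGAGAAGAAUUU"               # SEQ. ID NO. 1878 (VDR)
candidate = "CA" + listed[1:]                # hypothetical 20-base sense region
interrupted = listed[:9] + "C" + listed[10:] # hypothetical region with a central change

print(has_identical_stretch(candidate, listed))
print(has_identical_stretch(interrupted, listed))
```

A single change near the middle of a 19-base sequence leaves no shared run of 18 bases, whereas a change at either end does.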

In another embodiment, a pool of at least two siRNAs is provided, wherein said pool comprises a first siRNA and a second siRNA, said first siRNA comprises a duplex region of length 18-30 base pairs that has a first sense region that is at least 90% similar to 18 bases of a first sequence selected from the group consisting of: SEQ. ID NOs 438-1905, and said second siRNA comprises a duplex region of length 18-30 base pairs that has a second sense region that is at least 90% similar to 18 bases of a second sequence selected from the group consisting of: SEQ. ID NOs 438-1905, and wherein said first sense region and said second sense region are not identical.

In another embodiment, a pool of at least two siRNAs is provided, wherein said pool comprises a first siRNA and a second siRNA, said first siRNA comprises a duplex region of length 18-30 base pairs that has a first sense region that is identical to at least 18 bases of a sequence selected from the group consisting of: SEQ. ID NOs 438-1905 and wherein the second siRNA comprises a second sense region that comprises a sequence that is identical to at least 18 bases of a sequence selected from the group consisting of: SEQ. ID NOs 438-1905.

In another embodiment, a pool of at least two siRNAs is provided, wherein said pool comprises a first siRNA and a second siRNA, said first siRNA comprises a duplex region of length 19-30 base pairs and has a first sense region comprising a sequence that is at least 90% similar to a sequence selected from the group consisting of: SEQ. ID NOs 438-1905, and said duplex of said second siRNA is 19-30 base pairs and comprises a second sense region that comprises a sequence that is at least 90% similar to a sequence selected from the group consisting of: SEQ. ID NOs 438-1905.

In another embodiment, a pool of at least two siRNAs is provided, wherein said pool comprises a first siRNA and a second siRNA, said first siRNA comprises a duplex region of length 19-30 base pairs and has a first sense region comprising a sequence that is identical to at least 18 bases of a sequence selected from the group consisting of: SEQ. ID NOs 438-1905 and said duplex of said second siRNA is 19-30 base pairs and comprises a second sense region comprising a sequence that is identical to a sequence selected from the group consisting of: SEQ. ID NOs 438-1905.

In each of the aforementioned embodiments, preferably the antisense region is at least 90% complementary to a contiguous stretch of bases of one of the NCBI sequences identified in Example XVII; each of the recited NCBI sequences is incorporated by reference as if set forth fully herein. In some embodiments, the antisense region is 100% complementary to a contiguous stretch of bases of one of the NCBI sequences identified in Example XVII.

Further, in some embodiments that are directed to siRNA duplexes in which the antisense region is 20-30 bases in length, preferably there is a stretch of 19 bases that is at least 90%, more preferably 100% complementary to the recited sequence id number and the entire antisense region is at least 90% and more preferably 100% complementary to a contiguous stretch of bases of one of the NCBI sequences identified in Example XVII.
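The two-part preference for longer antisense regions can be illustrated as follows. This is a sketch under stated assumptions only: complementarity is scored without gaps against equal-length windows, all sequences are written 5' to 3', and the function names and example sequences are hypothetical rather than drawn from the sequence listing or from any NCBI record.

```python
# Illustrative check for antisense regions of 20-30 bases (assumptions above).
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna):
    """Return the reverse complement of an RNA string written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(rna))


def best_complementarity(antisense_part, target):
    """Highest ungapped percent complementarity of antisense_part against
    any equal-length contiguous window of target."""
    site = reverse_complement(antisense_part)  # sequence of a perfect site
    n = len(site)
    best = 0.0
    for i in range(len(target) - n + 1):
        matches = sum(x == y for x, y in zip(site, target[i:i + n]))
        best = max(best, 100.0 * matches / n)
    return best


def check_long_antisense(antisense, listed_sequence, mrna,
                         core=19, threshold=90.0):
    """True if (a) the antisense region is 20-30 bases, (b) some 19-base
    stretch of it meets the threshold against the listed sequence, and
    (c) the entire region meets the threshold against the mRNA."""
    if not 20 <= len(antisense) <= 30:
        return False
    core_ok = any(
        best_complementarity(antisense[i:i + core], listed_sequence) >= threshold
        for i in range(len(antisense) - core + 1)
    )
    whole_ok = best_complementarity(antisense, mrna) >= threshold
    return core_ok and whole_ok


if __name__ == "__main__":
    listed = "GAUCCGAUUAGCUAGGCAU"          # hypothetical 19-mer
    mrna = "AACGU" + listed + "UUGCA"       # hypothetical transcript fragment
    antisense = reverse_complement(mrna[4:25])  # 21 bases spanning the site
    print(check_long_antisense(antisense, listed, mrna))
```

Passing threshold=100.0 corresponds to the more preferred case in which both the 19-base stretch and the entire antisense region are fully complementary.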

While the invention has been described in connection with specific embodiments thereof, it will be understood that it is capable of further modifications and this application is intended to cover any variations, uses, or adaptations of the invention following, in general, the principles of the invention and including such departures from the present disclosure as come within known or customary practice within the art to which the invention pertains and as may be applied to the essential features hereinbefore set forth and as follows in the scope of the appended claims.

Claims

1. An siRNA comprising a sense region and an antisense region, wherein said sense region and said antisense region together form a duplex region, said antisense region and said sense region are each 18-30 nucleotides in length and said antisense region comprises a sequence that is at least 90% complementary to a sequence selected from the group consisting of SEQ. ID NOs. 438-1905.

2. An siRNA comprising a sense region and an antisense region, wherein said sense region and said antisense region together form a duplex region and said sense region and said antisense region are each 18-30 nucleotides in length, and said antisense region comprises a sequence that is 100% complementary to a contiguous stretch of at least 18 bases of a sequence selected from the group consisting of SEQ. ID NOs. 438-1905.

3. The siRNA of claim 2, wherein each of said antisense region and said sense region are 19-30 nucleotides in length, and said antisense region comprises a sequence that is 100% complementary to said sequence selected from the group consisting of: SEQ. ID NOs. 438-1905.

4. A pool of at least two siRNAs, wherein said pool comprises a first siRNA and a second siRNA, said first siRNA comprises a first antisense region and a first sense region that together form a first duplex region and each of said first antisense region and said first sense region are 18-30 nucleotides in length and said first antisense region is at least 90% complementary to 18 bases of a first sequence selected from the group consisting of: SEQ. ID NOs. 438-1905 and said second siRNA comprises a second antisense region and a second sense region that together form a second duplex region and each of said second antisense region and said second sense region are 18-30 nucleotides in length and said second antisense region is at least 90% complementary to 18 bases of a second sequence selected from the group consisting of: SEQ. ID NOs. 438-1905, wherein said first antisense region and said second antisense region are not identical.

5. The pool of claim 4, wherein said first antisense region comprises a sequence that is 100% complementary to at least 18 bases of said first sequence, and said second antisense region comprises a sequence that is 100% complementary to at least 18 bases of said second sequence.

6. The pool of claim 4, wherein said first siRNA is 19-30 nucleotides in length and said first antisense region comprises a sequence that is at least 90% complementary to said first sequence, and said second siRNA is 19-30 nucleotides in length and said second antisense region comprises a sequence that is at least 90% complementary to said second sequence.

7. The pool of claim 4, wherein said first antisense region is 19-30 nucleotides in length and said first antisense region comprises a sequence that is 100% complementary to at least 18 bases of said first sequence, and said second antisense region is 19-30 nucleotides in length and said second antisense region comprises a sequence that is 100% complementary to said second sequence.

8. The siRNA of claim 1, wherein said antisense region and said sense region are each 19-25 nucleotides in length.

9. The pool of claim 4, wherein said first antisense region, said first sense region, said second sense region and said second antisense region are each 19-25 nucleotides in length.