RNA catalyst for cleaving specific RNA sequences
A synthetic RNA catalyst capable of cleaving an RNA substrate, the catalyst comprising a substrate binding portion and a “hairpin” portion. The invention also provides an engineered DNA molecule and a vector, each comprising a DNA sequence coding for an RNA catalyst according to the invention. The invention further comprises host cells transformed with the vectors of the invention which are capable of expressing the RNA catalyst. Finally, the invention provides a method of cleaving an RNA substrate which comprises contacting the substrate with a synthetic RNA catalyst according to the invention.
Latest Patents:
This application is a continuation-in-part of co-pending application Ser. No. 07/577,658 filed Sep. 4, 1990, which was a continuation-in-part of application Ser. No. 07/409,666 filed Sep. 20, 1989, which was a continuation-in-part of application Ser. No. 07/247,100 filed Sep. 20, 1988, now abandoned.
This invention was made in part with Government support under Grant No. DMB 8817576 awarded by the National Science Foundation and Grant No. RO1 AI 29870 awarded by the National Institutes of Health. The Government has certain rights in the invention.
FIELD OF THE INVENTIONThe present invention relates to an RNA catalyst which cleaves specific RNA sequences into a fragment having a 5′ hydroxyl and a fragment having a 2′,3′ cyclic phosphate. The products of the reaction described herein resemble those resulting from the natural hydrolysis of RNA.
BACKGROUND OF THE INVENTIONCertain naturally occurring satellite, virusoid and viroid RNAs possess the property of self-catalyzed cleavage. Self-cleavage has been demonstrated in vitro for avocado sunblotch viroid (ASBV) (Hutchins, C. J., Rathjen, P. D., Forster, A. C. and Symons, R. H. (1986) Nucleic Acids Res., 14: 3627-3640), satellite RNA from tobacco ringspot virus (sTRSV) (Prody, G. A., Bakos, J. T., Buzayan, J. M., Schneider, I. R. and Bruening, G. (1986) Science, 231: 1577-1580; Buzayan, J. M., Gerlach, W. L. and Bruening, G. B. (1986) Proc. Natl. Acad. Sci. U.S.A. 83: 8859-8862) and lucerne transient streak virus (vLTSV) (Forster, A. C. and Symons, R. H. (1987) Cell, 49: 211-220). These self-catalyzed RNA cleavage reactions share a requirement for divalent metal ions and neutral or higher pH and cleave target RNA sequences to give 5′ hydroxyl and 2′,3′-cyclic phosphate termini (Prody, G. A., Bakos, J. T., Buzayan, J. M., Schneider, I. R. and Bruening, G. (1986) Science, 213: 1577-1580; Forster, A. C. and Symons, R. H. (1987) Cell, 49: 211-220; Epstein, L. M. and Gall, J. G. (1987) Cell, 48: 535-543; Buzayan, J. M. Gerlach, W. L., Bruening, G. B., Keese, P. and Gould, A. R. (1986) Virology, 151: 186-199).
A “hammerhead” model has been proposed and accurately describes the catalytic center of (+)sTRSV RNA, the. (+) and (−) strands of ASBV and the (+) and (−) strands of vLTSV (Forster, A. C. and Symons, R. H. (1987) Cell, 49: 211-220). The single exception is (−)sTRSV RNA which does not fit the “hammerhead” model (Forster, A. C. and Symons, R. H. (1987) Cell, 49: 211-220; Buzayan, J. M., Gerlach, W. L. and Bruening, G. (1986) Nature, 323: 349-352; Buzayan, J. M., Hampel, A. and Bruening, G. B. (1986) Nucleic Acids Res., 14: 9729-9743), and the structure of whose catalytic center was unknown prior to the present invention. It is therefore understandable that the primary scientific focus has been on studying the “hammerhead” consensus structure and, as regards sTRSV, on studying the (+) strand.
Intermolecular cleavage of an RNA substrate by an RNA catalyst that fits the “hammerhead” model was first shown in 1987 (Uhlenbeck, O. C. (1987) Nature, 328: 596-600). The RNA catalyst was recovered and reacted with multiple RNA molecules, demonstrating that it was truly catalytic.
Catalytic RNAs designed based on the “hammerhead” motif have been used to cleave specific target sequences by making appropriate base changes in the catalytic RNA to maintain necessary base pairing with the target sequences (Haseloff and Gerlach, Nature, 334, 585 (1988); Walbot and Bruening, Nature, 334, 196 (1988); Uhlenbeck, O. C. (1987) Nature, 328: 596-600; Koizumi, M., Iwai, S. and Ohtsuka, E. (1988) FEBS Lett., 228: 228-230). This has allowed use of the catalytic RNA to cleave specific target sequences and indicates that catalytic RNAs designed according to the “hammerhead” model may possibly cleave specific substrate RNAs in vivo. (see Haseloff and Gerlach, Nature, 334, 585 (1988); Walbot and Bruening, Nature, 334, 196 (1988); Uhlenbeck, O. C. (1987) Nature, 328: 596-600).
However, catalytic RNAs such as those that were designed based on the “hammerhead” model have several limitations which restrict their use in vitro and may forestall their use in vivo. For example, the temperature optimum for the reaction is 50-55° C., which is well above physiological, and the kcat (turnover number) is only 0.5/mm even at 55° C. (Uhlenbeck, O. C. (1987) Nature, 328:596-600; Haseloff and Gerlach, Nature, 334, 585 (1988)). In addition, the Km is 0.6 uM (Uhlenbeck, O. C. (1987) Nature, 328:596-600), meaning that the reaction requires high concentrations of substrate which makes it difficult, if not impossible, for the catalytic RNA to cleave low levels of target RNA substrate such as would be encountered in vivo.
Cech et al. published application WO 88/04300 and U.S. Pat. No. 4,987,071 also report the preparation and use of certain synthetic ribozymes that have several activities, including endoribonuclease activity. The design of these ribozymes is based on the properties of the Tetrahymena ribosomal RNA self-splicing reaction. A temperature optimum of 50° C. is reported (page 39 of WO 88/04300; col. 20, lines 4-5, of U.S. Pat. No. 4,987,071) for the endoribonuclease activity, and the Km and kcat reported for this activity are 0.8 uM and 0.13/minute, respectively (Example VI, last paragraph).
In view of the above, there is a need for an RNA catalyst having a lower temperature optimum, preferably near physiological temperatures, a higher turnover number and a smaller Km and which can be engineered to cut specific target RNA substrates. Accordingly, based on the discovery of a totally different structure disclosed hereinafter, it is an object of the present invention to provide such an RNA catalyst. Other objects and features of the invention will be in part apparent and in part pointed out. The invention, accordingly, comprises the products and methods hereinafter described and their equivalents, the scope of the invention being indicated in the appended claims.
SUMMARY OF THE INVENTION The invention comprises a synthetic RNA catalyst capable of cleaving an RNA substrate which contains the sequence:
5′-F1-CS-F2-3′,
wherein,
-
- CS is a cleavage sequence; and
- F1 and F2 each is a sequence of bases flanking the cleavage sequence.
The catalyst comprises a substrate binding portion and a “hairpin” portion. The substrate binding portion of the catalyst has the sequence:
3′-F4-L1-F3-5′
wherein,
-
- F3 is a sequence of bases selected so that F3 is substantially base paired with F2 when the catalyst is bound to the substrate;
- F4 is a sequence of bases selected so that F4 is substantially base paired with F1 when the catalyst is bound to the substrate;
- the sequences of F3 and F4 being selected so that each contains an adequate number of bases to achieve sufficient binding of the RNA substrate to the RNA catalyst so that cleavage of the substrate can take place; and
L1 is a sequence of bases selected so that L1 does not base pair with CS when the catalyst is bound to the substrate.
The “hairpin” portion is a portion of the catalyst that assumes a hairpin-like configuration when the substrate-catalyst complex is modeled in two dimensions for minimum energy folding. The “hairpin” portion of the catalyst preferably has the sequence:
wherein,
-
- P1 and P4 each is a sequence of bases, the sequences of P1 and P4 being selected so that P1 and P4 are substantially base paired;
- P1 is covalently attached to F4;
- S1 and S2 each is a sequence of bases, the sequences of S1 and S2 being selected so that S1 and S2 are substantially unpaired;
- P2 and P3 each is a sequence of bases, the sequences of P2 and P3 being selected so that P2 and P3 are substantially base paired; and
- L2 is a sequence of unpaired bases.
RNA catalysts according to the invention can cleave substrates of any length or type as long as they contain an appropriate cleavage sequence. In particular, the catalysts can be used to cleave a specific sequence in naturally-occurring RNA having a cleavage sequence, as well as RNAs which have been engineered to contain a cleavage sequence.
The invention further comprises an engineered DNA molecule and a vector, each of which comprises a DNA sequence that codes for an RNA catalyst according to the invention. The invention also comprises a host transformed with the vector, the host being capable of expressing the RNA catalyst. In particular, hosts can be transformed with vectors that, when transcribed, will produce RNA catalysts which can cleave any RNA, native or foreign, found in the host. For example, hosts can be transformed with vectors that, when transcribed, produce RNA catalysts which can regulate the expression of genes by cleaving messenger RNA or which act as anti-viral agents by cleaving viral RNA. Thus, the invention has application in vitro and in vivo in prokaryotes and eukaryotes of plant or animal origin in regulating gene expression and for controlling viral infections.
Finally, the invention includes a method of cleaving an RNA substrate comprising contacting the substrate with an RNA catalyst according to the invention. The reaction is unique because it occurs under physiological conditions, having a temperature optimum near 37° C., with very favorable reaction parameters. The method can be practiced in vitro or in vivo. For instance, the method may be practiced in vivo in host cells that have been transformed with a vector that codes for an RNA catalyst according to the invention.
BRIEF DESCRIPTION OF THE DRAWINGS
FIGS. 11A-C show that there is no effect on catalytic activity when base A at position 49 in the substrate is changed to a G, U or C.
FIGS. 14A-C show three substrates having sequences found in the sequence of the messenger. RNA coding for chloramphenicol acetyl transferase. FIGS. 14A-C also show the separation patterns on acrylamide gels of the reaction products obtained by reacting these substrates with catalytic RNAs designed to base pair with the substrates in the regions flanking the AGUC cleavage sequence.
FIGS. 27D and 27E: Results of Western immunoblot analysis of protein extracts of turnips leaves from control and infected plants.
FIGS. 42A-C: Summary of mutagenesis experiments with the (−)sTRSV RNA substrate-catalyst complex.
The 50-base catalytic RNA and the 14-base substrate RNA are the “minimum size” in the sense that reductions in their length result in a substantial or total loss of catalytic activity as is shown in the Examples below. Thus, this length of (−)sTRSV catalyst sequence is preferred to shorter lengths. Also, substrate RNA having at least the degree of base-pairing with the catalyst exhibited by the 14-base substrate is preferred.
The folding identifies regions of expected base pairing and expected non-base pairing, loops. This model does not preclude higher order interactions occurring between the loops.
FIGS. 14A-C. The sequences of three substrate RNAs having sequences found in the messenger RNA for chloramphenicol acetyl transferase (CAT) are shown. They have 14-, 16- and 18-base long target sites, and the length of the 3′ regions flanking the AGUC cleavage sequence is extended in substrates (B) and (C) as compared to substrate (A). Catalytic RNAs designed to base pair with the substrate RNAs in both the 3′ and 5′ regions flanking the cleavage sequence AGUC were synthesized. The open boxed bases are those which are different from those in the native (−)sTRSV substrate RNA sequence shown in
FIGS. 27D and 27E: Results of Western immunoblot analysis of protein extracts of turnips leaves from plants that were mock infected, infected with pCS101 (wild-type CMV) or infected with pCS101HC7.
FIGS. 42A-C: Summary of mutagenesis experiments with the (−)sTRSV “hairpin” substrate-catalyst complex. Each base or combination of bases enclosed in a circle represents a separate mutational experiment.
An RNA catalyst has been identified comprising an RNA sequence which can be engineered to cut, with great precision, target RNAs having a cleavage sequence. In particular, the invention comprises certain synthetic RNA catalysts capable of cleaving an RNA substrate which contains the target sequence 5′-F1-CS-F2-3′.
“Synthetic RNA catalyst,” as used herein, means a catalyst which is not a naturally-occurring RNA catalyst, although “synthetic catalysts” may be truncated or altered versions of naturally-occurring catalysts. “Synthetic catalysts” are catalysts designed according to the principles set forth herein based on the “hairpin” model to bind to and cleave a selected target sequence in a selected RNA substrate. “Synthetic catalysts” are also sometimes referred to herein as “engineered catalysts.” “Synthetic catalysts” include catalysts synthesized in vitro and catalysts synthesized in vivo. In particular, “synthetic catalysts” include catalysts produced by hosts transformed by a vector comprising a sequence coding for the catalyst.
RNA of any length and type may be used as the substrate as long as it contains the 5′-F1-CS-F2-3′ target sequence. In this formula, CS is the cleavage sequence (i.e., a sequence of bases containing the site at which the catalyst cleaves the substrate). CS is a short sequence of bases which does not base pair with the RNA catalyst, and CS preferably has the sequence 5′-NGUC-3′, wherein N is any base, and the substrate is cleaved by the catalyst between N and G to produce a fragment having an OH at the 5′ end and a fragment having a 2,′3′ cyclic phosphate at the 3′ end.
CS is flanked by two short base sequences F1 and F2 which do base pair with the RNA catalyst. F1 is preferably at least 3 bases in length, most preferably 4 bases in length. F2 is also preferably at least 3 bases in length, most preferably 6 to 12 bases in length.
Catalysts according to the invention comprise a substrate binding portion and a “hairpin” portion. The substrate binding portion of the catalyst has the sequence:
3′F4-L1-F3-5′
wherein,
-
- F3 is a sequence of bases selected so that F3 is substantially base paired with F2 when the catalyst is bound to the substrate;
- F4 is a sequence of bases selected so that F4 is substantially base paired with F1 when the catalyst is bound to the substrate;
- the sequences of F3 and F4 being selected so that each contains an adequate number of bases to achieve sufficient binding of the RNA substrate to the RNA catalyst so that cleavage of the substrate can take place; and
- L1 is a sequence of bases selected so that L1 does not base pair with CS when the catalyst is bound to the substrate.
As used herein, “substantially base paired” means that greater than 65% of the bases of the two RNA sequences in question are base paired, and preferably greater than 75% of the bases are base paired. “Substantially unpaired” means that greater than 65% of the bases of the two sequences in question are not base paired, and preferably greater than 75% of the bases are not paired.
F3 is preferably at least 3 bases in length, most preferably from 6 to 12 bases in length. F4 is preferably from 3 to 5 bases in length, most preferably 4 bases in length.
L1 is a short sequence of bases which preferably has the sequence 5′-AGAA-3′ when CS has the sequence 5′-NGUC-3′. Further, when L1 is 5′-AGAA-3′ and CS is 5′-NGUC-3′, then the first base pair between F1 and F4 adjacent to CS and L1 is preferably G:C or C:G (see
The “hairpin” portion is a portion of the catalyst which folds into a hairpin-like configuration when the substrate-catalyst complex is modeled in two dimensions for minimum energy folding. This is shown in
The “hairpin” portion of the catalyst preferably has the sequence:
wherein,
-
- P1 and P4 each is a sequence of bases, the sequences of P1 and P4 being selected so that P1 and P4 are substantially base paired;
- P1 is covalently attached to F4;
- S1 and S2 each is a sequence of bases, the sequences of S1 and S2 being selected so that S1 and S2 are substantially unpaired;
- P2 and P3 each is a sequence of bases, the sequences of P2 and P3 being selected so that P2 and P3 are substantially base paired; and
- L2 is a sequence of unpaired bases.
“Substantially base paired” and “substantially unpaired” have the same meanings as discussed above.
P1 and P4 each is preferably from 3 to 6 bases in length, and most preferably P1 has the sequence 5′-ACCAG-3′ and P4 has the sequence 5′-CUGGUA-3′. It has been found that the A at the 5′ end of 5′-ACCAG-3′ (underlined) is not base paired to the U at the 3′ end of 5′-CUGGUA-3′ (underlined), and the unpaired A may act as a “hinge” (see Example 32). It is not known yet, but the presence of such a “hinge” may be necessary for catalytic activity.
S1 and S2 each preferably is from 4 to 9 bases in length, and most preferably S1 has the sequence 5′-AGAAACA-3′ and S2 has the sequence 5′-GUAUAUUAC-3′.
P2 and P3 each is preferably from 3 to 9 bases in length, and more preferably P2 has the sequence 5′-CAC-3′ and P3 has the sequence 5′-GUG-3′.
Finally, L2 is preferably at least 3 bases in length and preferably has the sequence 5′-GUU-3′. Further, 5′-S1-P2-L2-3′ preferably has the sequence 5′-AGAAACACACGUU-3′.
The specific preferred sequences set forth above for P1, P2, S1, etc., are from the catalytic sequence of (−)sTRSV RNA.
A preferred catalyst according to the invention contains the sequence:
and active variants thereof, wherein F3, F4 and L1 are as defined above. As used herein “active variants” means catalysts which, although having substitutions, deletions and/or additions of bases as compared to the original sequence, are still capable of cleaving an RNA substrate.
The most preferred sequence for 5′-P2-L2-P3-3′ is 5′-CACGGACUUCGGUCCGUG-3′ [SEQ ID 46] (see Example 32). Accordingly, the most preferred catalyst has the sequence:
wherein F3, F4 and L1 are as defined above and the underlined portion is the preferred sequence for P2-L2-P3.
Another preferred catalyst according to the invention is an autocatalytic catalyst containing the sequence:
wherein, F1, F2, F3, F4, L1, L2, S1, S2, P1, P2, P3 and P4 are as defined above. L3 is a sequence of unpaired bases, and L3 preferably has the sequence 3′-CCUCC-5′. Thus, the molecule contains a substrate portion (5′-F1-CS-F2-3′) and a catalytic portion (5′-F3-L1-F4-P1-S1-P2-L2-P3-S2-P4-3′) covalently linked together by L3 so as to produce a synthetic autocatalytic RNA catalyst.
After being transcribed, this catalyst will spontaneously undergo an intramolecular autocatalytic cleavage at CS. The effect of this intramolecular cleavage is to autocatalytically terminate any RNA transcript in which the sequence is inserted. For instance, an engineered DNA molecule may be prepared which comprises a gene of interest and a DNA sequence coding for the catalyst linked so that, when transcribed, the catalyst will terminate transcription of the messenger RNA coding for the gene product at a desired location.
The invention further provides an engineered DNA molecule and a vector comprising a DNA sequence coding for an RNA catalyst of the invention. Also provided are host cells which have been transformed with the vectors and which are capable of expressing the RNA catalyst. Finally, the invention provides a method of cleaving an RNA substrate which contains the sequence 5′-F1-CS-F2-3′, the method comprising contacting the substrate with a synthetic RNA catalyst according to the invention.
The invention is further described below with particular reference to the catalytic properties of (−)sTRSV RNA and the structure of its catalytic complex, but the invention is not limited to synthetic catalysts based on (−)sTSV RNA. In addition to the particular catalytic sequences shown and described below, other RNA molecules having catalytic activity to cleave an RNA substrate can be readily found by applying the principles set forth in this specification.
For example, RNA sequences having the required structural features for cleaving an RNA substrate can be identified by applying the Wisconsin RNA Folding Program discussed above to (1) known sequences of molecules having catalytic or autocatalytic activity (especially molecules in which the actual location of the catalytic site is unknown), (2) randomly generated sequences having the proper pairing regions and lengths, and (3) randomly modified known catalytic sequences, while looking for known features of a catalytic molecule, such as the “hairpin” configuration of the catalytic complex when modeled in two dimensions. As a specific example, information regarding known autocatalytic cleavage sites can be used to find substrate binding sequences having the properties described above, such as having a substrate binding sequence adjacent a “hairpin” portion. Secondary features, such as the two substantially paired regions in the “hairpin” portion, with an intermediate substantially unpaired region and an appropriate base loop can then be looked for, either by manual examination or with automated computer programs. In order of decreasing preference, the following features are considered important in selecting a catalytic sequence: (1) regions that can base pair (form helices) with the regions of the substrate RNA molecule flanking the cleavage sequence; (2) an unpaired (loop) region opposite the cleavage sequence of the substrate RNA; (3) two substantially base paired regions in a “hairpin” structure near the cleavage sequence; (4) a substantially unpaired region between these two substantially paired regions in the “hairpin” structure; and (5) a loop connecting the two strands of a substantially base paired region to complete the “hairpin” structure. Standard techniques of in vitro RNA synthesis can then be used to prepare actual molecules having the sequence that gives the predicted two-dimensional computer-generated structure for verification of activity and routine testing of variation to determine optimum sequence.
The catalysts of the present invention were developed using a “hairpin” model or motif of RNA catalysis. According to this model, the catalytic complex, when modeled in two dimensions for minimum energy folding, assumes a “hairpin” configuration. This is shown in
The “hairpin” model has proved very useful in designing new catalysts, but it is still only a computer model of the likely secondary structure of catalytic complexes involving catalysts according to the present invention. Also, it is ultimately the tertiary structure of RNA catalysts that determines their activity. For these reasons, all catalysts having the properties described herein are considered to come within the scope of the present invention, even if they do not form a “hairpin” configuration when complexed with the substrate and even if they do not contain a “hairpin” portion. For instance, it may be possible to engineer a catalyst having the properties described herein which does not have a loop L2. Such a catalyst would be considered to be fully equivalent to the catalysts described and claimed herein.
As described in Example 1, a catalytic complex was identified within the (−) strand of tobacco ringspot virus satellite RNA (sTRSV) when the molecule was folded using computer models to determine the minimum energy folding in two-dimensional space. The (−) strand is a 359 base long RNA of defined sequence and is known to have autocatalytic properties (Gerlach, W. L., Buzayan, J. W., Schneider, I. R. and Bruening, G. B. (1986) Virology, 151: 172-185; Buzayan, J. M., Gerlach, W. and Bruening, G. (1986) Nature, 323: 349-352). The (−) strand cleaves at a defined site (ApG) into a cleavage product having an OH at the 5′ end and a 2′,3′ cyclic phosphate at the 3′ end. Up until the present time, however, little work had been done with the (−) strand to find the catalytic complex and to determine the minimum cleavage sequences because it does not fit the consensus “hammerhead” model.
In view of the above and the fact that the catalytic center would contain both the catalytic RNA sequence and the substrate (target) RNA sequence and by studying the results of Example 1, a 50 nucleotide sequence between bases 175 and 224 was picked and a 14 nucleotide sequence between bases 40 and 53 was picked. Using methodologies found in published procedures, a catalytic RNA having a satellite RNA base sequence identical to the base sequence in naturally-occurring (−)sTRSV between bases 175 and 224 was transcribed from chemically synthesized DNA templates using T7 RNA polymerase as described in Example 2. An RNA substrate having a satellite RNA base sequence identical to the base sequence in naturally occurring (−)sTRSV RNA between bases 40 and 53 was also prepared in the same manner. When the newly synthesized RNAs were mixed together under appropriate conditions as described in Example 3, the catalytic RNA cleaved the substrate RNA. As described in Example 4, the first RNA catalyst fitting the “hairpin” motif was discovered when the complex of the 50-base catalytic RNA and the 14-substrate RNA was modeled in two-dimensional space using computer modeling.
The reaction of catalysts fitting the “hairpin” motif with an appropriate substrate proved to be an excellent catalytic reaction under physiological conditions. The reaction of the catalyst and substrate containing the sequences of (−)sTRSV shown in
Catalytic cleavage of the substrate RNA occurs over a broad pH range, preferably 5.5 to 8.0, and in the presence of divalent ions such as Mg++, e.g. from MgCl2. As would be expected for a base catalyzed reaction, the rate of reaction increased with increasing pH. The reaction rate also increased with increasing concentration of divalent cations as shown in Example 8.
The reaction takes place at physiological temperatures, preferably 16° C. to 45° C., with a temperature optimum at 37° C. as described in Example 6. Temperatures above about 45° C. inactivate the reaction. However, the temperature optimum of the reaction is affected by the degree of base pairing between the substrate and catalyst (see Example 18). In particular, the length of the region of the catalyst that base pairs with the 3′ region of the substrate flanking the cleavage sequence can be varied so that an engineered catalyst reacting at a desired temperature can be obtained (see Example 18). Further, a “hairpin” catalyst which is more thermal stable than the native (−)sTRSV catalyst can be prepared by deleting the loop that closes the “hairpin” (Loop III in
The 50 base catalytic RNA configured in the “hairpin” model in
Base changes can be made in the two base paired regions (Helices 3 and 4 in
An active ribozyme is produced when Helix 4 is extended and the sequence of the loop that closes the “hairpin” (Loop III in
However, the simple replacement of the GUU sequence of Loop III with the sequence UUCG gives an inactive ribozyme (see Example 32). It is believed that this shows that the sequence of Loop III has an influence on the stability of Helix 4.
When Loop III is cut between U31 and U32, activity is lost (see
Base changes can also be made in the two regions of the catalytic RNA that base pair with the substrate, as long as substantial base pairing with the substrate in the regions flanking the cleavage sequence is maintained and base pairing with the cleavage sequence is avoided as shown in Examples 9, 16-21 and 32. Indeed, every base pair in Helices 1 and 2 (see
It is the ability to change the base pairs in Helices 1 and 2 that allows the RNA catalyst to be engineered to cut a specific target RNA substrate having a cleavage sequence such as NGUC. This is illustrated in Example 16 where the catalytic RNA was engineered by changing base 214 from a G to a C resulting in a catalytic RNA which failed to react with the substrate RNA developed from natural (−)sTRSV RNA. Activity was restored, however, when the substrate RNA was changed so that it could base pair with the subject engineered catalytic RNA. Also see Examples 9, 17-21 and 32.
A type of “hinge” region, consisting of a single base, seems to be present between Helices 2 and 3 of the (−)sTRSV catalytic RNA (see
Further mutation studies have showed that the substantially unpaired regions between Helices 3 and 4 (Loops II and IV in
The (−)sTRSV catalytic RNA sequence has an 5′-AGAA-3′ sequence opposite the AGUC cleavage sequence of the substrate. As shown in Examples 24 and 32, at least part of this AGAA sequence is invariant. In particular, when the A's at the 3′ and 5′ ends of the sequence (AGAA) were changed, the resulting catalysts were active (see
The target RNA substrate of the “hairpin” catalytic complex shown in
Although there is no base pairing between the NGUC cleavage sequence and the AAGA sequence in the catalyst opposite from the cleavage sequence, the C in NGUC apparently interacts with the A in AAGA opposite to it. In particular, it has been found that the single mutation of C-->A in the cleavage sequence NGUC (position 9 in the substrate sequence—see
The cleavage sequence has four flanking bases at its 5′ end and six at its 3′ end which base pair with the catalytic RNA. As described above, the bases in the flanking regions can be changed (with the possible exception of the G:C base pair in Helix 2 adjacent to NGUC) without affecting the ability of the catalytic RNA to cleave the substrate, as long as sufficient base pairing with the catalyst is maintained in the flanking regions. This would be expected to work on RNA substrate sequences of any length as long as these criteria are met. Indeed, lengthening the 3′ region of the substrate that base pairs with the catalyst has been found to provide a more efficient catalytic reaction. See Examples 18, 21 and 32. However, a smaller 10 base substrate having three flanking bases at its 5′ end and three flanking bases at its 3′ end did not work as well as the 14-base substrate, as described in Example 9.
Using the “hairpin” model as a guide, RNA catalysts can be engineered that base pair with an RNA substrate and mediate a cleavage in the cleavage sequence. In particular, catalytic RNA can be engineered that will cleave any RNA substrate having a cleavage sequence, such as NGUC, and flanking regions with which the catalyst base pairs, so that the catalytic RNA and RNA substrate form a catalytic complex in a “hairpin” motif. To do this, the bases flanking the cleavage sequence must be identified and the catalytic RNA engineered so that it does not pair in two-dimensional space with the cleavage sequence but does pair with adequate numbers of flanking bases upstream and downstream of the cleavage sequence. When designing a synthetic catalyst based on the (−)sTRSV catalyst, the other principles set forth herein regarding conserved or preferred sequences should also be taken into account in designing the synthetic catalyst.
As shown in Examples 17-20 and 29, catalytic RNAs according to the invention can cleave specific viral and messenger RNA sequences. In Example 15, tobacco mosaic virus (TMV) RNA containing the 5′ coding region of the replicase gene was targeted for specific cleavage by an appropriately engineered catalytic RNA. The target sequence contained changes in 8 of the 14 bases of the substrate RNA having a base sequence found within the catalytic complex of (−)sTRSV RNA and was cleaved by the engineered RNA catalyst under conditions near physiological. Catalytic RNAs were also designed and synthesized using the “hairpin” model as a guide which could cleave sequences from messenger RNA coding for chloramphenicol acetyl transferase (Example 18) and from HIV-1 viral RNA (Examples 19, 20 and 29). In particular, a conserved sequence in HIV-1 viral messenger RNAs has been identified, and a “hairpin” catalytic RNA designed which cleaves this sequence (see Example 29).
These examples demonstrate that the system can be used to specifically cleave an RNA sequence for which an appropriately engineered catalytic RNA base pairs at the designated flanking regions. Suitable target RNA substrates include viral, messenger, transfer, ribosomal, nuclear, organellar, other cellular RNA, or any other natural RNA having a cleavage sequence, as well as RNAs which have been engineered to contain an appropriate cleavage sequence.
Catalysts that fit the “hairpin” catalytic RNA model are useful in vivo in prokaryotes or eukaryotes of plant or animal origin for controlling viral infections or for regulating the expression of specific genes. In this case, a cleavage sequence such as NGUC in the virus or complementary to NGUC in the gene would need to be identified along with the flanking sequences immediately upstream and downstream of the cleavage sequence. Normally three to four bases on the 5′ side of the cleavage sequence and enough bases in the order of 6 to 12 on the 3′ side to provide adequate binding of the catalytic RNA and to provide reasonable certainty that the target RNA sequence is unique in the organism are required.
A catalytic RNA is then engineered which does not base pair with the cleavage sequence and which does base pair to the flanking regions on the 5′ and 3′ side of the cleavage sequence. A DNA template corresponding to this catalytic RNA is then synthesized using procedures that are well-known in the art. Such procedures include the phosphoramidite method (see, e.g., Beaucage and Caruthers, Tetrahedron Letters, 22, 1859 (1981); Matteucci and Caruthers, Tetrahedron Letters, 21, 719 (1980); and Matteucci and Caruthers, J. Amer. Chem. Soc., 103, 3185 (1981)) and the phosphotriester approach (see, e.g., Ito et al., Nucleic Acids Res., 10, 1755-69 (1982)).
The invention also includes an engineered DNA molecule and a vector comprising a DNA sequence coding for the desired synthetic RNA catalyst. The vector will have the DNA sequence coding for the desired catalytic RNA operatively linked to appropriate expression control sequences. Methods of effecting this operative linking, either before or after the DNA coding for the catalyst is inserted into the vector, are well known. Expression control sequences include promoters, activators, enhancers, operators, stop signals, cap signals, polyadenylation signals, and other signals involved with the control of transcription.
The vector must contain a promoter and a transcription termination signal, both operatively linked to the synthetic DNA sequence, i.e., the promoter is upstream of the synthetic DNA sequence and the termination signal is downstream from it. The promoter may be any DNA sequence that shows transcriptional activity in the host cell and may be derived from genes encoding homologous or heterologous proteins and either extracellular or intracellular proteins, such as amylase, glycoamylases, proteases, lipases, cellulases, and glycolytic enzymes. Also, a promoter recognized by T7 RNA polymerase may be used if the host is also engineered to contain the gene coding for T7 RNA polymerase.
The promoter may contain upstream or downstream activator and enhancer sequences. An operator sequence may also be included downstream of the promoter, if desired.
Expression control sequences suitable for use in the invention are well known. They include those of the E. coli lac system, the E. coli trp system, the TAC system and the TRC system; the major operator and promoter regions of bacteriophage lambda; the control region of filamentous single-stranded DNA phages; the expression control sequences of other bacteria; promoters derived from genes coding for Saccharomyces cerevisiae TPI, ADH, PGK and alpha-factor; promoters derived from genes coding for the Aspergillus oryzae TAKA amylase and A. niger glycoamylase, neutral alpha-amylase and acid stable alpha-amylase; promoters derived from genes coding for Rhizomucor miehei aspartic proteinase and lipase; mouse mammary tumor promoter; SV40 promoter; the actin promoter; and other sequences known to control the expression of genes of prokaryotic cells, eukaryotic cells, their viruses, or combinations thereof.
The vector must also contain one or more replication systems which allow it to replicate in the host cells. In particular, when the host is a yeast, the vector should contain the yeast 2u replication genes REP1-3 and origin of replication.
The vector should further include one or more restriction enzyme sites for inserting the DNA template sequences into the vector, and preferably contains a DNA sequence coding for a selectable or identifiable phenotypic trait which is manifested when the vector is present in the host cell (“a selection marker”).
Suitable vectors for use in the invention are well known. They include retroviral vectors, vaccinia vectors, pUC (such as pUC8 and pUC4K), pBR (such as pBR322 and pBR328), pTZ (such as pTZ18R), pUR (such as pUR288), phage lambda, YEp (such as YEp24) plasmids, and derivatives of these vectors.
The resulting vector having the engineered DNA sequence that codes for the RNA catalyst is used to transform an appropriate host. This transformation may be performed using methods well known in the art.
Any of a large number of available and wellknown host cells may be used in the practice of this invention. The selection of a particular host is dependent upon a number of factors recognized by the art. These include, for example, compatibility with the chosen expression vector, toxicity to it of the catalytic RNA encoded for by the engineered DNA sequence, rate of transformation, expression characteristics, bio-safety and costs. A balance of these factors must be struck with the understanding that not all hosts may be equally effective for the expression of a particular catalytic RNA.
Within these general guidelines, useful hosts include bacteria (such as E. coli sp.), yeast (such as Saccharomyces sp.) and other fungi, insects, plants, animals (including human), or other hosts known in the art.
As an example of the general genetic engineering techniques that are possible, mammalian vectors can be used to deliver DNA coding for the catalytic RNAs of the invention to animal cells. These vectors should have a suitable DNA replication signal, such as from SV40, a promoter which may or may not be inducible, such as the mouse mammary tumor promoter (which is induced by dexamethasone) or the noninducible SV40 promoter. A multiple cloning site is located after the promoter, and the DNA coding for the engineered catalytic RNA is inserted into this multiple cloning site using standard techniques. If necessary a suitable terminator is inserted. The resulting vector is then put into cells using standard techniques such as direct injection into the nucleus, electroporation, or other well-known transformation techniques. Once the vector is in the cell, the catalytic RNA is expressed directly when noninducible promoters are used, or after addition of the inducer when inducible promoters are used.
Similarly, plant vectors, such as the Ti plasmid or micro-Ti plasmids, can be used to deliver DNA coding for a desired catalytic RNA to plant cells. The Ti plasmids and micro-Ti plasmids may be used as such to transform plant protoplasts using known techniques or may be inserted into Agrobacterium tumefaciens which is then used to transform plant tissue. Once the plasmid is in the cell, the catalytic RNA will be expressed.
Once transformed, a host cell can express (transcribe) the synthetic RNA catalyst. When the DNA coding for the catalyst is transcribed, it produces catalytic RNA which will attack and cleave the target RNA sequence or sequences for which it has been designed, inactivating the RNA. If the RNA is necessary for the life cycle of a virus, the virus will be eliminated and if the RNA is the product of a specific gene, the expression of that gene will thus be regulated. The catalytic RNA can be designed to work in prokaryotes and within the nucleus (without poly(A) tail) or in the cytoplasm of a eukaryotic cell (with polyadenylation signals in place) for plants and animals.
Another potential method of using the catalytic RNAs of the invention is to prepare stable synthetic derivatives of RNA catalysts designed to bind and cleave a specific target RNA and to deliver the modified catalysts directly to the cell or organism of choice. For example, standard methods are available for making phoshporothioate derivatives of DNA which have been shown to be very stable in vivo and to be able to bind to a specific DNA or RNA target in vivo (antisense method). A modification of these procedures can be used to prepare a catalytically active derivative of RNA catalysts prepared according to the invention. This would entail determining which ribonucleotide regions can be altered and then altering them with deoxy, phophorothio, or other modifications which confer stability but do not destroy catalytic activity. This chemically modified catalytic RNA (which may or may not have any remaining RNA bonds) can then be injected or otherwise delivered to an organism to control viruses or gene expression. For instance, one of the catalytic RNAs whose preparation is described in Examples 19-20 having specificity for sequences found within the RNA of the HIV-1 virus that causes AIDS could be chemically modified as described, encapsulated in a liposome coated with monoclonal antibody directed to the CD4 receptors found on cells susceptible to HIV-1, and injected into a host animal.
The “hairpin” catalytic RNA model of the present invention may also be of possible interest to molecular biologists in exploring the genetic blueprints of plants and animals. This would be done by sending randomly constructed DNA reverse transcripts of catalytic RNA into the DNA of the organism and waiting to see which gene or genes were inactivated. Other techniques could be applied to determine where those genes resided on the organism's chromosomes, thereby greatly accelerating gene mapping.
Finally, a synthetic autocatalytic RNA catalyst has been developed. The synthesis of one such catalyst based on the (−)sTRSV RNA catalytic and substrate sequences is described in Example 23. When the DNA coding for this catalyst was transcribed in vitro, autocatalytic cleavage was observed. Further, in Example 26, DNA coding for this catalyst was linked to the gene coding for chloramphenicol acetyl transferase (CAT) in vector pHC-CAT. This vector was then used to transform Escherichia coli. Northern blot analysis of the RNA produced by the E. coli hosts indicated that autocatalytic cleavage occurred in vivo under standard E. coli growth conditions. Similar results were obtained when mammalian and plant cells were transformed with a vector coding for this autocatalytic RNA linked to either the gene coding for CAT (mammalian cells) or to DNA coding for cauliflower mosaic virus (plants) (see Examples 27 and 28). The autocatalytic RNA catalyst has also been used to properly terminate other engineered RNA catalysts after they were transcribed. In this manner, the engineered RNA catalysts were liberated from the RNA transcript and acted in trans in vivo (see Example 31).
EXAMPLESThe following examples further illustrate the invention.
Example 1 The (−) sense sequence of satellite RNA from the budblight strain of tobacco ringspot virus as shown in
The minimum catalytic complex, or active site of the molecule, is identified in
A 50 nucleotide sequence between bases 175 and 224 was picked and a 14 nucleotide sequence between bases 40 and 53 was picked from the catalytic complex identified in Example 1. A catalytic RNA (R51) with the 50 base sequence shown in
Before transcription, a 18mer or 16 mer DNA complement to the promoter for T7 RNA polymerase on the noncoding strand was hybridized by heating an equimolar amount of template DNA with promoter complement to 65° C. for 3 min. then placing in ice. A typical transcription reaction used 8 ng/ul DNA template, 0.5 mM each NTP, 2 mM spermidine, 40 mM Tris pH 7.5, 4% polyethylene glycol 6,000, 6 mM MgCl2, 4 mM NaCl, 10 mM dithiothreitol, 0.01% Triton X-100, 2.4 units/ul RNasin, 1.8 uCi/ul P32 CTP and 3 units/ul T7 RNA polymerase (US Biochemical) and was run at 37° C. for 90 min.
All in vitro transcribed RNAs were isolated on 7M urea, 15-20% acrylamide gels, bands cut out and isolated. All RNAs were sequenced using standard methods (Donis-Keller, H., Maxam, A. M. and Gilbert, W. (1980) Nucleic Acids Res., 4: 2527-2538); a method which also gave the 5′ terminal base. Terminal bases at the 3′ end were determined by ligation of the RNA to 5′ P32 pCp using T4 RNA ligase (BRL methods manual), nuclease T2 digestion, and separation of labelled bases by PEI thin layer chromatography in 0.3M LiC1 with appropriate standards. All RNA sequences corresponded to that expected from the DNA template.
Example 3 The catalytic RNA R51 was added to the substrate RNA S17 at a ratio of 1:30 and the time course of substrate RNA cleavage was studied. The reaction was carried out at 37° C. in 12 mM MgCl2, 40 mM Tris pH 7.5 and 2 mM spermidine over a time period of 150 min and is summarized as follows:
The reaction products were separated on polyacrylamide/urea gels by electrophoresis and bands cut out and counted in a liquid scintillation counter. The results are shown in
As shown in
After the RNA catalyst had been shown to be effective in cleaving the RNA substrate as described in Example 3, minimum energy folding of the 50 base sequence shown in
Various concentrations of substrate S17 were used at constant concentration of catalyst R51 and initial velocities of each reaction determined. The reaction was carried out at 37° C. in 12 mM MgCl2, 40 mM Tris pH 7.5 and 2 mM spermidine. Concentrations were as follows: R51=0.0004 uM and S17=0.125 uM (lane 1), 0.0624 uM (lane 2), 0.0417 uM (lane 3), 0.031 uM (lane 4), 0.021 uM (lane 5), 0.0156 uM (lane 6), 0.0078 uM (lane 7) and 0.0039 uM (lane 8). Each reaction was analyzed on polyacrylamide gels as described in Example 3 with the results shown in
The temperature dependence of the rate of cleavage of substrate RNA S17 by catalytic RNA R51 was tested over a temperature range and the reaction products analyzed on polyacrylamide gels as described in Example 3 with the results shown in
The reaction showed a temperature dependence similar to that which would be expected of a reaction involving base paired RNA molecules. The Arrhenius plot of the data shown in
The rate of cleavage of a constant concentration of substrate RNA S17 at varying concentrations of catalytic RNA R51 was tested and the reaction products analyzed on polyacrylamide gels as described in Example 3 with the results shown in
The effect of Mg++ concentration and pH on the rate of cleavage of RNA substrate S17 by RNA catalyst R51 was determined as shown in the following table:
In the Mg++ studies, the substrate S17 concentration was 0.14 uM and RNA catalyst R51 concentration was 0.0015 uM. The reactions were at 37° C. in 40 mM Tris pH 7.5. In the pH studies, the substrate S17 concentration was 0.062 uM and RNA catalyst R51 concentration was 0.0014 uM. The reactions were at 37° C. in 40 mM Tris for pH 7.0, 7.5, 8.0 and in 40 mM Pipes for pH 5.5, 6.0 and 6.5.
The dependence of the reaction rate on Mg++ and pH are virtually identical with those of catalysts fitting the “hammerhead” model. The reaction rate increases with increasing pH as one would expect for a base catalyzed reaction but the effect is masked by the catalytic activity of the RNA. Hence a 100 fold increase in [OH−] between pH 6.0 and 8.0 resulted in only a 3 fold increases in the reaction rate.
Example 9 A 10 base substrate (S10) was prepared by the methodology of Example 2. When the substrate was mixed with catalytic RNA R51, the reaction is summarized as follows:
The results of rate studies with substrate S10 comparable to those described with S17 in Example 5 showed a Km=0.06 uM and a kcat=0.8/min. These results are shown in
The 3′ terminal base of the catalyst shown in the “hairpin” model of the (−)sTRSV catalytic complex in
An RNA with the same sequence as catalytic RNA R51 was prepared, except that bases. 195-203 were omitted such that base 194 was in effect ligated to base 204. This RNA molecule was prepared as described in Example 2 from an appropriate DNA template containing the complementary sequence. When this RNA was mixed with substrate RNA S17 as described in Example 3, no reaction occurred. These results show that major elements of the “hairpin” structure are required for RNA catalysis to occur and that removal of 9 bases (see
An RNA with the same sequence as R51, except that the bases AAA at positions 203, 202 and 201 were changed to CGU, respectively, was prepared as described in Example 2 using an appropriate DNA primer. When this potential RNA catalyst was mixed with substrate RNA S17 as described in Example 3, no reaction occurred. This shows that the integrity of one or all of these bases (see
A substrate RNA with the base at position 49 in
Another substrate RNA identical to S17 but having “A” replaced by “U” in the AGUC loop was prepared as described in Example 2 (designated “S17(A-->U)”). This substrate RNA, at a concentration of 0.12 uM, was reacted with the catalytic RNA R51, at a concentration of 0.0065 uM, under standard conditions as described in Example 3 for 60 minutes. The results are shown in
Another substrate RNA identical to S17 but having “A” replaced by “C” in the AGUC loop was prepared as described in Example 2 (designated “S17(A-->C)”). This substrate RNA, at a concentration of 0.08 uM, was reacted with the catalytic RNA R51, at a concentration of 0.0065 uM, under standard conditions as described in Example 3 for 60 minutes. The results are shown in
Base changes in the stem regions at the site of binding of the substrate RNA to the catalytic RNA in the “hairpin” configuration can be made as long as the base pairing is maintained. The “C” base at position 50 of the substrate was changed to a “G” using the methods in Example 2. When this substrate RNA was reacted with the catalytic RNA R51, no cleavage of this substrate occurred. However, when a new catalytic RNA, containing a “C” at position 214, rather than the “G” found in R51, was synthesized according to the methods in Example 2 and added to this substrate, full cleavage was seen. The effect of the base change from “C” to “G” in the substrate was to eliminate the base pairing at this position predicted by the “hairpin” model since now a “G” would be across from a “G”. However, when a “G” to “C” base change was made in the catalytic RNA, the base pairing was restored, but in a reverse manner, and the integrity of the helices in the stem regions where the substrate RNA binds to the catalytic RNA was thus conserved restoring catalytic activity (see
An RNA sequence found within the sequence of tobacco mosaic virus was synthesized using the methods described in Example 2. This synthesized target RNA had the sequence 5′gAAACAGUCCCCAAC 3′. A catalytic RNA was synthesized with the sequence 5′-GUUGGGAGAAGUUUACCA-GAGAAACACACGUUGUGGUAUAUUACCUGGUA-31 selected so that base pairing between the substrate and the catalytic RNA is maintained in the “hairpin” configuration (see
Three RNA sequences found within the sequence of the messenger RNA for the enzyme chloramphenicol acetyl transferase (CAT) were synthesized using the methods described in Example 2. The synthesized substrate RNAs had the sequences (A) gUUUCAGUCAGUUGC, (B) gUUUCAGUCAGUUGCUC; and (C) gggUUUCAGUCAGUUGCUCAA (see
Note that the latter two sequences are extensions of the first sequence and that additional bases have been added to the 3′ end in the region that the “hairpin” model predicts will base pair with the catalytic RNA to form Helix 1 (see
Catalytic RNAs corresponding to substrate RNAs (A), (B) and (C) were synthesized according to the methods described in Example 2. Their sequences were designed so that they would base pair with the substrate RNAs in both the 3′ and 5′ regions flanking the AGUC cleavage sequence. In addition, the catalytic RNAs designed to react with substrate RNAs (A) and (B) each contained the vector sequence GA at their 5′ terminus, and the catalytic RNA designed to react with substrate RNA (C) contained the vector sequence GGG at its 5′ terminus. Otherwise, the catalytic RNAs had the same sequence as the (−) sTRSV catalytic RNA sequence shown in
The various substrate and catalytic RNAs were reacted and the reaction products analyzed as described in Example 3. All reaction conditions were as described in Example 3, except for the following. For substrates (A) and (B), reaction conditions were: substrate RNA concentration=0.05 uM; catalytic RNA concentration=0.005 uM; reaction run at 16° C.; and reaction time of 20 minutes. For substrate (C), the reaction conditions were the same as for (A) and (B), except that the reaction time was 40 minutes and temperatures were 20° C., 25° C., 30° C. and 37° C.
Cleavage of all of the substrate RNAs occurred when they were mixed with the corresponding catalytic RNAs as is shown in
An RNA substrate corresponding to part of the sequence of HIV-1, the virus which causes AIDS, was synthesized as described in Example 2. The sequence of this substrate RNA is shown in
A catalytic RNA was synthesized according to the methods of Example 2. Its sequence was designed so that it would base pair with the substrate RNA in both the 3′ and 5′ regions flanking the CGUC cleavage sequence. In addition, the catalytic RNA contained the vector sequence GGG at its 5′ terminus. Otherwise, the catalytic RNA had the same sequence as the (−)sTRSV catalytic RNA sequence shown in
The catalytic RNA and the substrate RNA were reacted and the reaction products were analyzed as described in Example 3. The reaction conditions were as set forth in Example 3, except that the following temperatures were used: 20° C., 25° C., 30° C. and 37° C. Also, the reaction was run for 60 minutes, except for 37° C. which was run for 15 minutes, and the substrate RNA concentration was 50 nM, and the catalytic RNA concentration was 5 nM.
The catalytic RNA cleaved the substrate at the expected position between the “C” and “G” in the CGUC cleavage sequence found in the loop between the two flanking stem regions. Thus, a specific sequence found in the HIV-1 viral RNA that codes for the gag protein can be cleaved with a catalytic RNA according to the invention. The reaction occurred with an RNA substrate having a 16-base target sequence which was longer than S17 in the region at the 3′end which base pairs with the catalyst (i.e., the portion that forms Helix 1 with the catalytic RNA according to the “hairpin” model). Also, the reaction occurred at physiological temperature of 37° C.
Example 20 A substrate RNA having a sequence found in the beginning of the coding region for the regulatory protein tat of HIV-1 virus was synthesized as described in Example 2. The substrate sequence is shown in
A catalytic RNA having a sequence so that it would base pair with the substrate RNA in the two regions flanking the UGUC loop (ie., the regions that forms Helices 1 and 2 with the catalytic RNA according to the “hairpin” model) was also synthesized as described in Example 2. Otherwise, the catalytic RNAs had the same sequence as the (−)sTRSV catalytic RNA sequence shown in
The substrate RNA and catalytic RNA were reacted and the reaction products were analyzed as described in Example 3. Reaction conditions were: 37° C.; reaction times of zero and 15 minutes; the concentration of substrate RNA was 100M; and the concentration of catalytic RNA was 20 nM.
Cleavage occurred as expected between the “U” and the “G” in the UGUC cleavage sequence located between the two stem regions of the substrate. The cleavage site is indicated by the arrow in
Using the methods described in Example 2, a substrate RNA having four non-native bases (UUUU) added to the 3′ end of the sequence of the native (−)sTRSV substrate shown in
The substrate and catalytic RNAs were reacted at standard conditions and the reaction products were analyzed as described in Examples 3 and 5. Catalytic RNA concentration was 0.00033 uM, and substrate RNA concentration was 0.1 uM (Lane 1), 0.05 uM (Lane 2), 0.033 uM (Lane 3), 0.025 uM (Lane 4), 0.016 uM (Lane 5), 0.012 uM (Lane 6), 0.006 uM (Lane 7), and 0.003 uM (Lane 8). The results are shown in
The data show that an extension of the length of the region of base pairing between the substrate and catalyst (i.e., those regions of the catalyst and substrate that form Helix 1 according to the “hairpin” model) can improve the catalytic properties of the reaction. Cleavage of the 18-base RNA substrate occurred at the expected site, but at an increased rate as compared to the cleavage of S17 by R51. The kcat or turnover number of the reaction was 7/minute. This means that each molecule of catalytic RNA cleaved 7 molecules of substrate RNA per minute during the reaction. The kcat for S17 cleavage by R51 was 2.1/minute. The Km of the reaction was the same as for S17 cleavage by R51 (30 nM). This shows that by optimizing the length of the region of the catalyst that base pairs to the substrate in the 3′ region flanking the cleavage sequence (i.e., by optimizing the length of Helix 1 predicted by the “hairpin” model), the catalytic properties of the native reaction can be improved.
Example 22 A series of catalytic RNAs were prepared using the methods described in Example 2 having certain base substitutions as compared to the native (−)sTRSV catalytic RNA sequence shown in
The various catalytic RNAs were reacted with substrate S17 and the reaction products were analyzed as described in Example 3. The results are shown in
As shown in
The results of these mutagenesis studies provide confirmation for the existence of Helices 3 and 4 (see
A synthetic “hairpin” autocatalytic cassette was prepared. The RNA sequence of the cassette is shown in
The cassette was prepared by making an appropriate synthetic DNA template that would yield an RNA with the sequence shown in
After being transcribed, the cassette spontaneously underwent an intramolecular autocatalytic cleavage at the expected site to give the appropriate 3′F and 5′F products (see
A catalytic RNA was prepared as described in Example 2. Its sequence was identical to that of R51, except that the base at position 217 in the AGAA loop was changed from a G to a C. The AGAA loop of the catalyst is opposite the cleavage sequence of the substrate when the substrate and catalyst are complexed (see
Another catalytic RNA was prepared as described in Example 2. Its sequence was also identical to R51, except that the base at position 216 was changed from A to U.
These catalysts were reacted with the RNA substrate S17 under standard conditions as described in Example 3 at 37° C. for 0 and 15 minutes. The control was the reaction of substrate S17 with catalyst R51. The concentration of the three catalytic RNAs was 0.007 uM, and the concentration of substrate RNA was 0.7 uM.
The reaction products were analyzed as described in Example 3, and the results are shown in
A third catalytic RNA was prepared as described in Example 2. Its sequence was identical to R51, except that the base at position 218 was changed from A to C. This catalyst was also reacted with substrate S17 as described above. Cleavage of the substrate with this catalyst was observed, but only at about 47% of the level achieved with R51 (data not shown).
Example 25Substrate RNAs identical to S17, but with one of the bases at positions 46, 47 or 48 (i.e., bases GUC of the cleavage sequence) changed to a different base, were prepared as described in Example 2. When these substrates were reacted with the RNA catalyst R51 under standard conditions for 60 minutes as described in Example 3, no cleavage of the substrates occurred. This shows conservation of the GUC sequence of the cleavage sequence of (−)sTRSV RNA. The results of these experiments combined with the results of Examples 13-15 show that the cleavage sequence of (−)sTRSV RNA is NGUC, where N is any base.
Example 26The vector prepared in Example 23 (hereinafter referred to as “pHC”) containing the “hairpin” autocatalytic cassette was tested for activity in vivo as follows. First, the CAT gene was removed from plasmid pMAMNEO-CAT (purchased from Clontech Inc.) with SmaI and XhoI. It was then ligated using T4 ligase into pHC which had been cut with SmaI and XhoI to produce vector pHC-CAT.
The original vector used in these constructions (pTZ18R; see Example 23) contains an inducible promotor (lacZ) and, as a result of the steps described in Example 23 and immediately above in this example, the CAT gene and the “hairpin” autocatalytic cassette were placed in this inducible region (see
Next, pHC-CAT was transfected into Escherichia coli strain JM109 (widely available from a number of commercial sources and from the American Type Culture Collection) with calcium chloride and heat shock using standard procedures as described in Maniatis et al., Molecular Cloning (1983). Transformed cells containing pHC-CAT were selected on the basis of ampicillin resistance by plating on double concentration YT medium containing 100 ug/ml ampicillin. After selection, the transformed cells were grown overnight in LB broth at 37° C. A fresh culture of these cells was then grown for 5 hours in LB broth at 37° C., after which the cells were induced with 1 mM isopropyl-beta-D-thiogalactoside (IPTG) for one hour to allow expression of the lacZ region, including the CAT gene-cassette RNA transcript.
At the end of this time, RNA was isolated by incubating the cells in 50 mM Tris, pH 8.0, 50 mM ethylenediaminetetraacetic acid (EDTA), 1 mg/ml lysozyme at room temperature for 10 minutes to lyse the cells. The lysate was made to 0.5% sodium dodecyl sulfate (SDS), and then centrifuged to remove cell debris. Phenol was added to the supernatant at a ratio of 1:1, and the supernatant was centrifuged to remove the precipitate. This procedure was repeated, and the resulting aqueous phase was treated with an equal volume of isopropanol at −20° C. for 20 minutes to precipitate the RNA. The precipitate was collected by centrifugation, dried, and redissolved in water.
This isolated RNA was electrophoresed on 1.2% agarose gels containing formaldehyde as described in Current Protocols In Molecular Biology (Greene 1989). After electrophoresis, Northern blots were carried out using published methods (GeneScreen Plus, DuPont, July 1985). Two DNA probes were used for blotting the gels. The CAT probe was prepared by primer extension of the CAT gene using the Klenow fragment of DNA polymerase I and dATP labelled with alpha P32 The “hairpin” autocatalytic cassette probe was prepared by kinasing the DNA complement to the entire “hairpin” autocatalytic cassette RNA sequence shown in
The results of the Northern blot test are shown in
When the “hairpin” autocatalytic cassette probe was used, only the full length CAT-cassette RNA transcript was detected (see
The mobilities of the CAT-cassette RNA and the Cleaved CAT-cassette 5′ fragment on the gel corresponded exactly to the predicted length of the transcripts. The standards run on the gel were E. coli 16S and 23S RNA.
An important control was to determine whether cleavage of the target RNA occurred during isolation of the RNA. Intact, uncleaved “hairpin” autocatalytic RNA prepared and separated on an acrylamide gel as described in Example 23 was isolated from the gel using conditions similar to those described above (i.e., no divalent cation, but in the presence of EDTA and SDS). Only intact, uncleaved RNA was obtained when the catalyst was re-electrophoresed on an acrylamide gel.
Example 27Mammalian expression vector pMHC-CAT containing the “hairpin” autocatalytic cassette linked to the CAT gene was constructed as follows. First, the “hairpin” autocatalytic cassette was removed from the vector pHC (prepared as described in Example 23) by digesting pHC with SmaI/SalI and ligating the resulting fragment to the mammalian vector pMSG (purchased from Pharmacia) at the SmaI/XhoI sites to give pMHC. This operation removed the XhoI site used to clone the “hairpin” autocatalytic cassette insert, but introduced another XhoI site near the 5′end of the insert.
Next, the CAT gene was removed from pMAM-NEO-CAT (Clontech, Palo Alto, Calif.) with SmaI/XhoI and cloned into pMHC at the SmaI/XhoI sites to give the final vector pMHC-CAT which is shown in
The vector pMHC-CAT was co-transfected into Chinese hamster ovary cells (CHO) by the calcium phosphate method along with the vector pMSG-dhfr (a gift from Dr. Ed Prochownick of the University of Michigan, Ann Arbor, Mich.) which contained a mutant dhfr gene. The isolation of this mutant dhfr gene is described in Simonson and Levinson, Proc. Nat'l Acad. Sci. 80, 2495-99 (1983), and pMSG is available from Pharmacia (see above).
Transfectants were selected using gpt medium (Molec. & Cell. Biol., 3, 1421-29 (1983)) which kills non-transfected cells. They were next amplified with methotrexate (which amplifies the mutant dhfr gene and associated DNA) to give CHO cells which contained the “hairpin” autocatalytic cassette linked to the CAT gene stably integrated into the genome. Dexamethasone (1 uM) was added to induce expression of the CAT gene and “hairpin” autocatalytic cassette, and RNA was isolated from the individual cells by the guanidinium isothiocyanate procedure (Current Protocols in Molecular Bioloqy).
Next, a nuclease protection assay was performed to identify the nature of the transcripts and cleavage products, if any, produced by transcription of the “hairpin” autocatalytic cassette-CAT gene fusion. The probe used in the S1 nuclease assay was prepared from the vector pHC19R. This vector was prepared by removing the “hairpin” autocatalytic cassette from pHC with BamHI/SalI and ligating this fragment into the BamHI/SalI sites of the vector PTZ19R (purchased from US Biochemical) as shown in
The probe was then hybridized in buffer (40 mM Pipes, pH6, 100 mM NaCl, 20% formamide, 1 mM ZnSO4) to RNA (5 ug) isolated from CHO cells that had been transfected with vector pMHC-CAT, and the hybridized RNA was digested with S1 nuclease (100 units, 1 hr., 37° C.). S1 nuclease will digest all single-stranded RNA, but not double-stranded RNA. A negative control was RNA from transfected CHO cells which had not been induced. A positive control was RNA isolated from the pHC plasmid which had been transcribed in vitro with T7 RNA polymerase as described in Example 2.
The S1 digests were electrophoresed on 10% acrylamide-7M urea gels. The results are shown in
Another control was uncleaved “hairpin” autocatalytic cassette RNA hybridized to the probe, S1 nuclease digested and electrophoresed under the same conditions. No cleavage products were seen (data not shown). This control shows that cleavage of the “hairpin” autocatalytic cassette RNA did not occur as a result of the analysis conditions and, therefore, that the cleavage products seen in Lane 5 must have been produced as a result of in vivo cleavage of the hybrid CAT-“hairpin” autocatalytic RNA.
Example 28The following example illustrates in vivo activity of the “hairpin” catalytic RNA in plants. The “hairpin” autocatalytic cassette (see Example 23) was ligated to cauliflower mosaic virus (CMV) in a viral vector. Plants were then transformed with the resulting vector and, during replication of the virus, the “hairpin” autocatalytic cassette RNA cleaved the viral RNA intramolecularly. The viral RNA serves as a template for viral replication and for attenuation of the virus. Since cleavage levels of the “hairpin” autocatalytic cassette were about 50-60% in vitro, attenuated viral infection in these plants would be expected if the “hairpin” autocatalytic cassette RNA linked to the CMV RNA did cleave in vivo in the plants, and this was what was observed.
The constructions tested had the “hairpin” autocatalytic cassette in the sense and antisense orientation. They are shown in
These constructions were made by removing the “hairpin” autocatalytic cassette from vector pHC (prepared as described in Example 23) with XhoI/SalI and ligating this fragment into the unique XhoI site of CMV plasmid pCS101 (a gift from Dr. Art Hunt, University of Kentucky). Plasmid pCS101 contains the entire CMV sequence. The XhoI site is located in gene II of CMV and was chosen because DNA can be cloned into this site without subsequently interfering with plant infection. Another plasmid identical to pCS101, except having a pBR322 bacterial replicon, is available from American Type Culture Collection, and can be used in its place as a source of CMV. The “hairpin” autocatalytic cassette could be inserted in either of two orientations (sense or antisense) since XhoI and SalI ends are compatible.
The resulting constructions were grown in XL-1 blue E. coli (Stratagene), and clones of the “hairpin” autocatalytic cassette insert were isolated and identified by cleaving with KpnI and by electrophoresing the insert on 1% agarose gels. The isolated clones were pCS101HC7 and pCS101HC9 which had the “hairpin” autocatalytic cassette in the sense and antisense orientation, respectively. These constructions are shown in
After being grown in E. coli, the constructions were cut with SalI, and were rubbed onto 10 turnip (Brassica campestis) plants (0.5 ug DNA/plant) which were three weeks old. Four groups of plants were treated as follows:
-
- A. Control-mock inoculated (no virus)
- B. Virus control—no “hairpin” autocatalytic cassette insert—pCS101
- C. Virus with the sense “hairpin” autocatalytic cassette—pCS101HC7
- D. Virus with the antisense “hairpin” autocatalytic cassette—pCS101HC9
The plants were checked weekly for viral symptoms. Within six weeks, plants treated with wild-type virus were observed to have viral symptoms, including vein clearing, leaf wrinkling and a yellow mosaic pattern on the leaf. In turnips treated with pCS101HC7, the onset of symptoms was delayed 7-10 days as compared to turnips infected with wild-type virus, and the severity of the symptoms never reached the levels attained with the wild-type virus. The following was observed when the plants were nine weeks old:
These results indicate that a “hairpin” catalytic RNA according to the invention can cleave viral RNA in which it is inserted so that the viral infection is attenuated (Treatment C). Surprisingly, viral attenuation was also obtained with the “hairpin” autocatalytic cassette linked to the viral DNA in the antisense direction (Treatment D). This is believed to be due to the deletion or disabling of the viral construct containing the antisense catalyst, so that viral replication is diminished compared to virus control (Treatment B).
Next, RNA was isolated from the leaves of the nine-week old plants, a probe was prepared and hybridized to the isolated RNA, S1 nuclease digestion was carried out, and the S1 digests separated on 10% acrylamide-7M urea gels and analyzed by autoradiography, all as described in Example 27. The probe used was P32-labelled autocatalytic cassette RNA. The results are shown in
RNA isolated from the leaves of the nine-week old plants was also subjected to Northern blot analysis. Total RNA was prepared by the guanidium thiocyanate method (Chirgwin et al., Biochemistry, 18, 5294-99 (1979)) followed by pelleting through a 5.7 M CsCl step gradient. Then, 2.5 ug of total RNA was denatured at 65° C. for 5 min. with formaldehyde and formamide. The denatured RNA was then electrophoresed on a 1% agarose/formaldehyde gel, and the RNA was transferred to a Duralon-UV nylon filter (Stratagene) in 10× standard sodium citrate (SSC) (0.15 M). After UV crosslinking, the filter was prehybridized in 6×SSC and 0.05× BLOTTO (Sambrook, Pritsch and Maniatis, Molecular Cloning § 1.102 (2nd ed. 1989)) for 3 hrs at 68° C. The “hairpin” autocatalytic cassette fragment was labeled with P32 using the oligolabeling procedure of Feinberg and Vogelstein, Anal. Biochem., 132, 6-9 (1983). Then it was heated to 100° C. for 5 min. and cooled on ice. This probe was added to the prehybridization mix, and incubation was continued at 68° C. overnight. The nylon filter was washed once in 2×SSC, 0.1% SDS for 20 min. at room temperature, followed by washing with 1×SSC, 0.1% SDS for 60 min. at 68° C. and 0.1×SSC, 0.1% SDS for 60 min. at 68° C. The filter was then exposed overnight to Kodak X-OMAT film using two lightning plus screens. The results are shown in
Some reproducible size heterogenity was observed in the S1 fragments and in the Northern blots. This may be due to the rapid degradation by plant nucleases of these RNA fragments which contain either a 2′-3′ cyclic phosphate or a 5′ hydroxyl after ribozyme cleavage. The 5′ hydroxyl could mimic naturally occurring RNA degradation signals. After infecting turnips with pCS101HC7, RNAs of the sizes expected before and after RNA cleavage were detected, but the amount of the 6 Kb fragment detected was somewhat less than the 2 Kb fragment. This could be due to enhanced stability of 2 Kb transcript which contained a cyclic phosphate over the 6 Kb transcript which contained the 5′ hydroxyl.
To determine if the DNA encoding the “hairpin” autocatalytic ribozyme was stable in vivo, total DNA was isolated from the leaves of mock-inoculated turnips, turnips inoculated with wild-type CMV (pCS101) and turnips inoculated with pCS101HC7. DNA was isolated from plants essentially as described in Murray and Thompson, Nucleic Acids Research, 8, 4321-25 (1980). Oligo-nucleotide primers homologous to domains 5′ and 3′ to the XhoI site of the parental CMV clone pCS101 were used as primers to amplify DNA sequences between these domains by polymerase chain reaction (PCR). Primer 1, which includes an EcoRI site, hybridizes 63 bases upstream from the XhoI site and has the following sequence:
-
- 5′-GGAATTCACC CGTCAGTTTT TAATACTGC-3′ [SEQ ID 1]
Primer 2 includes a BamHI site and hybridizes 54 bases downstream from the XhoI site and has the following sequence: - 5′-TGGATCCATT CTAGTATTTTG AGCTTCT-3′ [SEQ ID 2]
The primers were synthesized on an Applied Biosystem 391 PCR-MATE using phosphoramidite chemistry. PCR was performed as described by the vendor of TaQ polymerase (Perkin-Elmer). Briefly, PCR conditions were 94° C. for 1 min., 55° C. for 2 min. and 72° C. for 3 min. for 35 cycles. After PCR amplification, the amplified DNA was size fractionated on a 2% agarose gel and stained with ethidium bromide.
- 5′-GGAATTCACC CGTCAGTTTT TAATACTGC-3′ [SEQ ID 1]
The results are shown in
Finally, protein extracts were prepared from plants that were mock-inoculated, inoculated with wild-type CMV and inoculated with pCS101HC7, and the levels of CMV coat protein in the extracts determined by Western immunoblot. The extracts were prepared by homogenizing turnip leaf tissue in an equal volume of phosphate buffered saline using a mortar and pestle. Samples were boiled for 10 min. and then spun down in a microfuge for 10 min. at 4° C. Protein concentrations of the supernatants were determined by the method of Bradford, Anal. Biochem., 72, 248-54 (1976), and the extracts were diluted with an equal volume of 4× Laemmli sample buffer and denatured at 100° C. for 5 min. (Laemmli, Nature, 227, 680-85 (1970)). The samples were next electrophoresed on 10% SDS-polyacrylamide gels with 5% stacking gel (id.), followed by electrotransfer of the proteins to nitrocellulose at 45 volts, 1.25 hours, in transfer buffer as described in Towbin et al., Proc. Natl. Acad. Sci. USA, 76, 4350-54 (1979). Prestained molecular weight markers (BRL) were used to confirm protein transfer. Nonspecific antibody binding was avoided by using blocking solution (Johnson et al., Gene Anal. Techn., 1, 3-8 (1984)). All antibody incubations were done at 22° C. with gentle agitation. The primary antibody was rabbit antiserum to CMV coat protein (1:1000 dilution in blocking solution) provided by Dr. R. Shepherd, University of Kentucky. An anti-rabbit IgG alkaline phosphatase conjugate (Sigma, St. Louis) was used as the secondary antibody (1:1000 dilution in blocking solution). Visual detection of proteins was accomplished using BCIP/NBT (Sambrook et al., Molecular Cloning: A Laboratory Manual (2nd ed. Cold Spring Harbor Laboratory 1989)).
The results are presented in
The combined results of the pathological observations and the S1 nuclease, Northern blot, PCR amplification and Western immunoblot assays provide conclusive evidence that the “hairpin” catalytic RNA cleaved viral RNA in vivo in plants.
Example 29 A highly conserved sequence in HIV-1 viral RNA has been discovered which has favorable properties as a potential target site for a suitably engineered “hairpin” catalytic RNA. The 16-base sequence is shown in
When various HIV-1 isolates were compared, only two isolates showed variations in the sequence, and the variations were, in each case, only a single base change (see
A “hairpin” catalytic RNA was designed according to the principles set forth herein to cleave the conserved target sequence. The catalytic RNA sequence, designated herein as “RHIV”, is shown in
A substrate RNA containing the conserved 16-base target site sequence shown in
The substrate RNA SHIV and catalytic RNA RHIV were then reacted under standard conditions as described in Example 3 over a period of time from 0 to 280 minutes. The concentration of substrate SHIV was 0.025 uM, and the concentration of RHIV was 0.005 uM. Cleavage of substrate SHIV (which contains the conserved HIV-1 sequence) by the engineered “hairpin” catalytic RNA RHIV was obtained (see
The kinetics of the reaction between SHIV and RHIV were investigated using the methods described in Example S. The results are shown in
The rate of cleavage (turnover number, kcat) was determined to be 1.6/minute (see
Next, the ability of the catalytic RNA RHIV to cleave a long HIV-1 transcript was tested. The RNA used as a target sequence was a 183 nt-long transcript containing 107 nt of HIV-1 sequence. The target transcript was made by T7 RNA polymerase transcription (as described in Example 2) of the HaeII-linearized plasmid pROS. This plasmid was obtained from Dr. John Rossi of the City of Hope Medical Center, Duarte, Calif. Plasmid pROS has the 5′-HindIII fragment of the HIVHXB2 isolate cloned into plasmid pBluescript II (Stratagene, LaJolla, Calif.). The 5′-HindIII fragment of HIVHXB2 contains the sequence (+)77 to (+)635 from the 5′ cap site (sequence obtained from the HIV Sequence Data Base). The 183 nt transcript contains 76 nt of vector sequence at the 5′ end, followed by 107 nt of HIV-1 sequence (the sequence from (+)77 to (+)183 from the 5′ cap site).
This 183 nt transcript (5 nM) was incubated with catalytic RNA RHIV (25 nM) under standard conditions for 60 minutes as described in Example 3. When the products were analyzed on 6% acrylamide-7M urea gels, cleavage was seen to have occurred (see
As a prerequisite for in vivo experiments, it was necessary to develop a system for the proper termination of the “hairpin” catalytic RNA RHIV when it was transcribed in vivo. Such a system was developed by cloning DNA coding for RHIV into the BamHI/MluI sites of the vector pHC (prepared as described in Example 23) to give plasmid pHR (see
Plasmid pHR was transcribed with T7 RNA polymerase and the resulting 101 nt RNA (see
Next, a mammalian expression vector was prepared by removing DNA coding for RHIV and the autocatalytic cassette RNA from pHR with SmaI/SalI (see
Vector pMSGRHIV has been used to transfect human cells which were then infected with HIV-1. Preliminary results show that the amount of tat and gag mRNAs produced in transfected cells was lower than the amount produced in control cells.
Example 31In this example, evidence is presented showing the in vivo down regulation of two genes in mammalian cells by using “hairpin” RNA catalysts in trans. The two genes studied were hypoxanthine guanosine phosphoribosyl transferase (HGPRT) and chloramphenicol acetyl transferase (CAT).
A. In Vivo Inhibition of HGPRT
A 15 nt target sequence in the HGPRT gene of the hamster was identified. The sequence is:
A “hairpin” catalytic RNA was designed according to the principles set forth herein to cleave this target sequence. The catalytic RNA sequence, designated “RHGPT,” is shown in
RHGPT and the target sequence were synthesized as described in Example 2 and tested in vitro as described in Example 3. The results of the test showed that the target sequence was cleaved in vitro by the engineered “hairpin” ribozyme RHGPT at the * in the sequence given above (which is after nt87 in the sequence of the HGPRT gene).
Next, DNA coding for ribozyme RHGPT was cloned into the BamHI/MluI sites of the mammalian expression vector pHC (prepared as described in Example 23). The DNA coding for RHGPT and HC was excised from the resulting plasmid with SmaI/SalI and cloned into SmaI/SalI digested pMSG (Example 27) to give plasmid pMRHPT, a map of which is shown in
Another plasmid, pMR2HPT, was also prepared. This plasmid is identical to pMRHPT, except that a G35-->C mutation in RHGPT was made (numbering according to
Chinese hamster ovary cells (CHO) were cotransfected with one of these plasmids and plasmid pMSG-dhfr. The method of transfection and plasmid pMSG-dhfr are described in Example 27. Transfectants were selected and amplified with methotrexate, also as described in Example 27. A schematic drawing of the selection scheme is presented in
The poison 8-azaguanine, which is allowed into cells by the HGPRT enzyme, was then added to the cells (80 μg/ml) both in the presence and absence of dexamethasone (1 μM) which induces the HGPRT ribozyme. In the absence of dexamethasone, cells with or without DNA coding for RHGPT died as expected. In the presence of dexamethasone, control cells died, but colonies of survivors were seen for cells transfected with pMRHPT. All cells transformed with pMR2HPT (coding for the inactive ribozyme) died.
To quantitate the rate of survival, CHO cells were grown in gpt medium (see Example 27), except untransfected, uninduced CHO cells which were grown in MEM. The concentration of 8-azaquanine used was 80 μg/ml, and the concentration of dexamethasone used was 1 μM. One hundred cells were plated per dish. Colonies of cells were stained with crystal violet. The results are presented below:
*Results were normalized with untransfected, uninduced cells set at 100%. Actual survival rate for these cells was 80%.
The results are consistent with reduced levels of HGPRT due to the cleavage of HGPRT mRNA by the engineered “hairpin” ribozyme RHGPT after induction with dexamethasone. The in vivo activity of the ribozyme is not likely due to antisense effects, since the disabled ribozyme coded for by pMR2HPT was ineffective in increasing resistance to 8-azaquanine. Note that the mutation in pMR2HPT is not in the area of the ribozyme that base pairs to the substrate, so binding to the substrate should occur.
Next, an S1 nuclease assay was performed to observe the in vivo levels of mRNA coding for HGPRT. The S1 nuclease assay was performed as described in Example 27. The 148 nt probe which was used hybridizes to HGPRT mRNA and was prepared by transcribing plasmid pHPTPr as described in Example 27 using P32-labelled CTP. Plasmid pHPTPr contains the antisense HGPRT sequence from nt 20-160 (Konecki et al., Nucleic Acids Res., 10, 6763-75 (1982)) bridging the cleavage sequence at nt 87 cloned into pTZ18R (US Biochemical) between the EcoRI and HindIII sites.
The results are shown in
B. In Vivo Lowering of CAT mRNA Levels.
From a series of in vitro experiments using the techniques described in Examples 2 and 3, the optimum target sequence in the CAT gene was determined to be:
-
- 5-UUUCA*GUCAGUUGCUCAA-3′ [SEQ ID 5]
with cleavage at (*), which is nt 320 of the CAT gene. The “hairpin” catalytic RNA designed to cleave this target sequence, designated “RCAT,” is shown in FIG. 40. RCAT was designed according to the principles set forth herein. It base pairs with the target sequence in the two regions flanking the AGUC cleavage sequence, and the “hairpin” portion of RCAT has the same sequence as the “hairpin” portion of the (−)sTRSV catalytic RNA sequence shown inFIG. 1 . Additional vector bases coding for restriction sites are present at the 3′ and 5′ ends of RCAT.
- 5-UUUCA*GUCAGUUGCUCAA-3′ [SEQ ID 5]
DNA coding for RCAT was ligated to pHC (Example 23) which had been cut with BamHI/MluI. The resulting plasmid was cut with SmaI/SalI to remove the fragment coding for RCAT and HC. This fragment was ligated to pMSG (Example 27) which had been cut with SmaI/SalI to give pMSGRCAT. Next pCAT (Promega) was cut with PvuI/PstI, and the fragment containing the CAT gene under the control of the SV40 promoter and enhancer was isolated on a 1% low-melting agarose gel (Nusieve). Then the fragment was blunt ended with Klenow fragment and ligated into the EcoRI-cut, blunt-ended pMSGRCAT to give the final plasmid pMCATRCAT (see
Plasmid pMCATRCAT was cut with NdeI and then used to transfect CHO cells along with XhoI-cut pMSG-dhfr as described in Example 27. Transfectants were selected and amplified and an S1 nuclease assay performed, all as described in Example 27. The probe used for the S1 nuclease assay was RNA transcribed from the plasmid pCATP. This plasmid contains the sequence of the CAT gene from nt 260-372 (GenBank sequence) bridging the cleavage sequence at nt 320 cloned into the EcoRI/HindIII site of pTZ18R (US Biochemical) in the antisense direction. The probe was prepared as described in Example 27 using 32P-CTP and was a total of 119 nt long.
The results of the S1 nuclease assay are presented in
Additional mutagenesis experiments were performed changing bases in (−)sTRSV RNA and its substrate. All substrate RNAs and catalytic RNAs were prepared as described in Example 2. Mutagenesis was carried out simply by making the required base change in the synthetic DNA template. All catalytic RNAs had additional vector bases GGG at the 5′ end, and all substrate RNAs had additional vector bases GCG at the 51 end. These bases are required for efficient transcription (Milligan et al., Nucleic Acids Res., 15, 8783-98 (1987)), and the C near the 5′ end of all substrates ensured at least one P32-labelled C in the 5′ cleavage fragment. The reference sequences were the unmutated catalytic and substrate sequences (see
Substrate and catalytic RNAs were assayed for catalytic activity as described in Example 3. Generally, the final concentrations of substrate RNA was 0.1 uM, and the final concentration of catalytic RNA was 0.01 uM. Assays were done at 37° C. for times ranging from 15-30 minutes, and a zero-time control was always included. Reaction products were analyzed on 15% acrylamide/7M urea gels, autoradiography performed, and the bands cut from the gels and counted. The control (unmutated ribozyme/substrate) was assayed at the same time as all mutant catalytic RNAs and substrates.
Nucleotide changes made in the native, unmutated sequence of both the catalytic RNA and the substrate showed a range of catalytic effects. The nucleotide changes shown in
In summary, the results show that Helices 2 and 3 (see
The results will now be discussed in detail. First, Table I lists all substrate sequences successfully cleaved by a catalytic RNA designed according to the “hairpin” model so that the bases flanking the N*GUC cleavage sequence in the substrate were base paired to the catalyst. Cleavage occurred at the *. The lower case letters in Table I designate additional vector sequences.
As shown in Table I, every base pair in Helices 1 and 2 can be changed to any other base pair, and the substrate will be cleaved by the ribozyme, except the base pair in Helix 2 adjacent to the N*GUC cleavage sequence (designated by S in
As already demonstrated in Examples 18 and 21, it is possible to adjust the length of Helix 1 to optimize the rate of cleavage. In particular, when the native sequence was extended four base pairs by adding four A:U base pairs to the open end of Helix 1, an increased rate of activity was seen (Example 21). However, when three additional G:C base pairs were added in the present experiments, a large loss of activity occurred. This phenomenon was observed for numerous substrates and ribozymes. Accordingly, Helix 1 has an optimal length for each substrate used.
A type of “hinge” region, consisting of a single A base at position 15, is present between Helices 2 and 3 of the catalytic RNA (see
Proof of the base pair C17:G47 was obtained previously (see Example 22), but the presumptive base pair next to it, C16:G48, could not be shown to exist. Both the catalyst containing the C:C mismatch and the catalyst containing the reverse G:C base pair were inactive (
At the end of Helix 3 is the predicted base pair G19:C45. The catalyst containing the C:C mismatch was inactive (
Base pairing between A18:U46 is shown in
Helix 4 is a shorter helix than predicted by straightforward computer modeling and two-dimensional energy minimization (compare
The following results show that an active ribozyme is produced when Helix 4 is extended and the sequence of Loop III is changed. As shown in
It was concluded from this experiment that Loop III does not have a conserved or invariant base sequence and that Helix 4 can be extended towards loop III by at least four base pairs without loss of activity. The four additional base pairs in Helix 4 should provide helix stabilization of this region. The secondary folding energy of Helix 4 and Loop III in the native structure is +0.6 Kcal/mole, while that of the catalyst having the extended Helix 4 and the Loop III of the sequence UUCG was determined to be −11.1 Kcal/mole (methods described in Example 6). Thus, the presence of the Tinoco et al. hairpin sequence increases the folding energy by 11.7 Kcal/mole.
However, the simple replacement of Loop III with the sequence UUCG (see
When Loop III of the native ribozyme is cut between U31 and U32, activity is lost (see
With the extension of Helix 4 by the Tinoco et al. hairpin sequence, the potential base pair C29:G33 is between the two proven bases of Helix 4 and the four base pairs of the Tinoco sequence. Accordingly, it seemed likely that this base pair existed. However, both the single mutation G33-->C and the double mutation C29:G33-->G29:C33 were inactive (see
The existence of other base pairs in Helix 4 could not be shown. The next potential base pair upstream in Helix 4 was C25:G36. The single mutation C25->G was inactive (
Further mutation studies showed that Loops II and IV are larger than originally predicted by computer modeling and energy minimization (compare
As already shown in Example 12, when bases A22, A23 and A24 in Loop II were mutated to GUC, a totally inactive ribozyme was obtained, indicating that one or all of these bases are essential. The single mutation in Loop II of C25->G was inactive (see
In Loop IV, the single mutation U39->G was active (see
If this stem existed, the A43:U37 alternate base pair of U43:A37 should have been active. Since it was not, it was concluded that a stem in Loop IV does not exist.
In Loop V, the loop formed by the cleavage sequence in the substrate molecule, the bases guc are invariant and are not base paired to the catalyst (see Example 25 and
All of the bases of Loop I in the ribozyme, the loop opposite Loop V, have been mutated. When base A7 was changed to a G or C and when base A10 was changed to a G, the resulting catalysts were active (
Potential base pairs between Loop V in the substrate and the Loop I in the ribozyme were tested. In particular, experiments to determine whether G8 and A9 of Loop I were base paired with the corresponding bases in substrate Loop V were performed. Mutations tested were G8->C:c8->g and A9->U:u7->a. These mutations were in-active (see
While the c9-->a mutation in substrate Loop V was inactive (
A triple mutation comprising this same double mutation plus changing base A20->C in the ribozyme showed the same level of activity as the double mutation (
As various changes could be made in the above-described products and methods without departing from the scope of the invention, it is intended that all matter contained in the above description or shown in accompanying drawings shall be interpreted as illustrative and shall not be interpreted in a limiting sense.
Claims
1. A synthetic RNA catalyst capable of cleaving an RNA substrate which contains the sequence: 5′-F1-CS-F2-3′, wherein,
- CS is a cleavage sequence; and
- F1 and F2 each is a sequence of bases flanking the cleavage sequence;
- the catalyst comprising a substrate binding portion and a “hairpin” portion, the substrate binding portion of the catalyst having the sequence:
- 3′-F4-L1-F3-5′
- wherein,
- F3 is a sequence of bases selected so that F3 is substantially base paired with F2 when the catalyst is bound to the substrate;
- F4 is a sequence of bases selected so that F4 is substantially base paired with F1 when the catalyst is bound to the substrate;
- the sequences of F3 and F4 being selected so that each contains an adequate number of bases to achieve sufficient binding of the RNA substrate to the RNA catalyst so that cleavage of the substrate can take place; and
- L1 is a sequence of bases selected so that L1 does not base pair with CS when the catalyst is bound to the substrate.
2. An RNA catalyst according to claim 1, the “hair-pin” portion of the catalyst having the sequence: wherein,
- P1 and P4 each is a sequence of bases, the sequences of P1 and P4 being selected so that P1 and P4 are substantially base paired;
- P1 is covalently linked to F4;
- S1 and S2 each is a sequence of bases, the sequences of S1 and S2 being selected so that S1 and S2 are substantially unpaired;
- P2 and P3 each is a sequence of bases, the sequences of P2 and P3 being selected so that P2 and P3 are substantially base paired; and
- L2 is a sequence of unpaired bases.
3. An RNA catalyst according to claim 1 or 2 which is capable of cleaving an RNA substrate in which CS has the sequence 5′-NGUC-3′, wherein N is any base and the substrate is cleaved by the catalyst between N and G.
4. An RNA catalyst according to claim 3 wherein L1 has the sequence 3′-AAGA-5′.
5. An RNA catalyst according to claim 1 or 2 wherein F3 is at least 3 bases in length and F4 is from 3 to 5 bases in length, and the catalyst cleaves a substrate wherein F1 and F2 each is at least 3 bases in length.
6. An RNA catalyst according to claim 5 wherein F3 is from 6 to 12 bases in length and F4 is 4 bases in length, and the catalyst cleaves a substrate wherein F1 is 4 bases in length and F2 is from 6 to 12 bases in length.
7. An RNA catalyst according to claim 2 wherein P1 and P4 each is from 3 to 6 bases in length.
8. An RNA catalyst according to claim 7 wherein P1 has the sequence 5′-ACCAG-31 and P4 has the sequence 5′-CUGGUA-3′.
9. An RNA catalyst according to claim 2 wherein S1 and S2 each is from 4 to 9 bases in length.
10. An RNA catalyst according to claim 9 wherein S1 has the sequence 5′-AGAAACA-3′ and S2 has the sequence 5′-GUAUAUUAC-3′.
11. An RNA catalyst according to claim 2 wherein P2 and P3 each is from 3 to 9 bases in length.
12. An RNA catalyst according to claim 11 wherein P2 has the sequence 5′-CAC-3′ and P3 has the sequence 5′-GUG-3′.
13. An RNA catalyst according to claim 2 wherein L2 is at least 3 bases in length.
14. An RNA catalyst according to claim 13 wherein L2 has the sequence 5′-GUU-3′.
15. An RNA catalyst according to claim 2 wherein 5′-S1-P2-L2 has the sequence 5′AGAAACACACGUU-3′.
16. An RNA catalyst according to claim 2 wherein 5′-P2-L2-P3 has the sequence 5′-CACGGACUUCGGUCCGUG-3′ [SEQ ID 46].
17. An RNA catalyst according to claim 1 or 2 which is capable of cleaving an RNA substrate selected from the group consisting of messenger RNA, transfer RNA, ribosomal RNA, viral RNA, nuclear RNA, organellar RNA and other cellular RNA.
18. The catalyst of claim 17 which is capable of cleaving an RNA substrate selected from the group consisting of HIV-1 virus RNA and tobacco mosaic virus RNA.
19. An RNA catalyst according to claim 18 which is capable of cleaving HIV-1 RNAs containing the sequence UGCCCGUCUGUUGUGU.
20. An RNA catalyst according to claim 2 containing the sequence: wherein,
- F1, F2, F3, F4, L1, L2, S1, S2, P1, P2, P3 and P4 are as defined in claims 1 and 2; and
- L3 is a sequence of unpaired bases that covalently links the catalyst portion of the molecule with the substrate portion to produce a synthetic autocatalytic RNA catalyst.
21. An RNA catalyst according to claim 20 wherein CS has the sequence 5′-NGUC-3′, wherein N is any base, and the substrate is cleaved by the catalyst between N and G.
22. An RNA catalyst according to claim 21 wherein L1 has the sequence 3′-AAGA-5′.
23. An RNA catalyst according to claim 22 wherein 5′-P1-S1-P2-L2-P3-S2-P4-3′ has the sequence 5′-ACCAGAGAAACACACGUUGUGGUAUAUUACCUGGUA-3′.
24. An RNA catalyst according to claim 23 wherein L3 has the sequence 3′-CCUCC-5′.
25. A synthetic RNA catalyst which is capable of cleaving an RNA substrate containing the sequence: 5′-F1-CS-F2-3′, the catalyst containing the sequence: 5′-F3-L1-F4-ACCAGAGAAACACACGUUGUGGUAUAUUACCUGGU A-3′, and active variants thereof, wherein,
- CS is a cleavage sequence;
- F1 and F2 each is a sequence of bases flanking the cleavage sequence;
- F3 is a sequence of bases selected so that F3 is substantially base paired with F2 when the catalyst is bound to the substrate;
- F4 is a sequence of bases selected so that F4 is substantially base paired with F1 when the catalyst is bound to the substrate;
- the sequences of F3 and F4 being selected so that each contains an adequate number of bases to achieve sufficient binding of the RNA substrate to the RNA catalyst so that cleavage of the substrate can take place; and
- L1 is a sequence of bases selected so that L1 does not base pair with CS when the catalyst is bound to the substrate.
26. A synthetic RNA catalyst which is capable of cleaving an RNA substrate containing the sequence: 5′-F1-CS-F2-3′, the catalyst containing the sequence: 5′-F3-L1-F4-ACCAGAGAAACACACGGACUUCGGUCC [SEQ ID 47] GUG-GUAUAUUACCUGGUA-3′ wherein,
- CS is a cleavage sequence;
- F1 and F2 each is a sequence of bases flanking the cleavage sequence;
- F3 is a sequence of bases selected so that F3 is substantially base paired with F2 when the catalyst is bound to the substrate;
- F4 is a sequence of bases selected so that F4 is substantially base paired with F1 when the catalyst is bound to the substrate;
- the sequences of F3 and F4 being selected so that each contains an adequate number of bases to achieve sufficient binding of the RNA substrate to the RNA catalyst so that cleavage of the substrate can take place; and
- L1 is a sequence of bases selected so that L1 does not base pair with CS when the catalyst is bound to the substrate.
27. An RNA catalyst according to claim 25 or 26 wherein F3 is at least 3 bases in length and F4 is from 3 to 5 bases in length, and the catalyst cleaves a substrate wherein F1 and F2 each is at least 3 bases in length.
28. An RNA catalyst according to claim 27 wherein F3 is from 6 to 12 bases in length and F4 is 4 bases in length, and the catalyst cleaves a substrate wherein F1 is 4 bases in length and F2 is from 6 to 12 bases in length.
29. An RNA catalyst according to claim 25 or 26 which is capable of cleaving an RNA substrate in which CS has the sequence 5′-NGUC-3′, wherein N is any base and the substrate is cleaved by the catalyst between N and G.
30. AN RNA catalyst according to claim 29 wherein L1 has the sequence 3′-AAGA-5′.
31. An RNA catalyst according to claim 25 or 26 which is capable of cleaving an RNA substrate selected from the group consisting of messenger RNA, transfer RNA, ribosomal RNA, viral RNA, nuclear RNA, organellar RNA and other cellular RNA.
32. An RNA catalyst according to claim 31 which is capable of cleaving an RNA substrate selected from the group consisting of HIV-1 virus RNA and tobacco mosaic virus RNA.
33. An RNA catalyst according to claim 32 which is capable of cleaving HIV-1 RNAs containing the sequence UGCCCGUCUGUUGUGU.
34. An engineered DNA molecule coding for an RNA catalyst according to claim 1, 2, 20, 25 or 26.
35. A vector comprising a DNA sequence coding for an RNA catalyst according to claim 1, 2, 20, 25 or 26, the DNA sequence being operatively linked to expression control sequences.
36. The vector of claim 35 which is capable of self-replication in a host.
37. The vector of claim 35 wherein the RNA catalyst encoded by the vector is capable of cleaving an RNA substrate selected from the group consisting of messenger RNA, transfer RNA, ribosomal RNA, viral RNA, nuclear RNA, organellar RNA and other cellular RNA.
38. The vector of claim 37 wherein the RNA catalyst encoded by the vector is capable of cleaving an RNA substrate selected from the group consisting of HIV-1 virus RNA and tobacco mosaic virus RNA.
39. The vector of claim 38 wherein the RNA catalyst encoded by the vector is capable of cleaving HIV-1 RNAs containing the sequence UGCCCGUCUGUUGUGU.
40. A host cell transformed with a vector according to claim 35 and which is capable of expressing the RNA catalyst.
41. A method of cleaving an RNA substrate which contains the sequence: 5′F1-CS-F2-3′, wherein,
- CS is a cleavage sequence; and
- F1 and F2 each is a sequence of bases flanking the cleavage sequence;
- the method comprising contacting the substrate with a synthetic RNA catalyst comprising a substrate binding portion and a “hairpin” portion, the substrate binding portion of the catalyst having the sequence:
- 3′F4-L1-F3-5′
- wherein,
- F3 is a sequence of bases selected so that F3 is substantially base paired with F2 when the catalyst is bound to the substrate;
- F4 is a sequence of bases selected so that F4 is substantially base paired with F1 when the catalyst is bound to the substrate;
- the sequences of F3 and F4 being selected so that each contains an adequate number of bases to achieve sufficient binding of the RNA substrate to the RNA catalyst so that cleavage of the substrate can take place; and
- L1 is a sequence of bases selected so that L1 does not base pair with CS when the catalyst is bound to the substrate.
42. The method of claim 41 wherein the “hairpin” portion of the catalyst has the sequence: wherein,
- P1 and P4 each is a sequence of bases, the sequences of P1 and P4 being selected so that P1 and P4 are substantially base paired;
- P1 is covalently linked to F4;
- S1 and S2 each is a sequence of bases, the sequences of S1 and S2 being selected so that S1 and S2 are substantially unpaired;
- P2 and P3 each is a sequence of bases, the sequences of P2 and P3 being selected so that P2 and P3 are substantially base paired; and
- L2 is a sequence of unpaired bases.
43. The method of claim 42 wherein the catalyst has the sequence: wherein,
- F1, F2, F3, F4, L1, L2, S1, S2, P1, P2, P3 and P4 are as defined in claims 41 and 42; and
- L3 is a sequence of unpaired bases that covalently links the catalyst portion of the molecule with the substrate portion to produce a synthetic autocatalytic RNA catalyst.
44. A method of cleaving an RNA substrate containing the sequence: 5′-F1-CS-F2-3′, comprising contacting the substrate with a synthetic RNA catalyst containing the sequence: 5′-F3-L1-F4-ACCAGAGAAACACACGUUGUGGUAUAUUACCUGGU A-3′, and active variants thereof, wherein,
- CS is a cleavage sequence;
- F1 and F2 each is a sequence of bases flanking the cleavage sequence;
- F3 is a sequence of bases selected so that F3 is substantially base paired with F2 when the catalyst is bound to the substrate;
- F4 is a sequence of bases selected so that F4 is substantially base paired with F1 when the catalyst is bound to the substrate;
- the sequences of F3 and F4 being selected so that each contains an adequate number of bases to achieve sufficient binding of the RNA substrate to the RNA catalyst so that cleavage of the substrate can take place; and
- L1 is a sequence of bases selected so that L1 does not base pair with CS when the catalyst is bound to the substrate.
45. A method of cleaving an RNA substrate containing the sequence: 5′-F1-CS-F2-3′, comprising contacting the substrate with a synthetic RNA catalyst containing the sequence: 5′-F3-L1-F4-ACCAGAGAAACACACGGACUUCGGUCC [SEQ ID 47] GUGG-UAUAUUACCUGGUA-3′ wherein,
- CS is a cleavage sequence;
- F1 and F2 each is a sequence of bases flanking the cleavage sequence;
- F3 is a sequence of bases selected so that F3 is substantially base paired with F2 when the catalyst is bound to the substrate;
- F4 is a sequence of bases selected so that F4 is substantially base paired with F1 when the catalyst is bound to the substrate;
- the sequences of F3 and F4 being selected so that each contains an adequate number of bases to achieve sufficient binding of the RNA substrate to the RNA catalyst so that cleavage of the substrate can take place; and
- L1 is a sequence of bases selected so that L1 does not base pair with CS when the catalyst is bound to the substrate.
46. The method of claim 41, 42, 43, 44 or 45 wherein the cleavage occurs under physiological conditions.
47. The method of claim 46 wherein the cleavage occurs in vivo in a host cell which has been transformed with a vector comprising a DNA sequence coding for the RNA catalyst, the DNA sequence being operatively linked to expression control sequences.
48. A synthetic RNA transcript comprising an autocatalytic portion which has the formula: wherein,
- CS is a cleavage sequence;
- F1 and F2 each is a sequence of bases flanking the cleavage sequence;
- F3 is a sequence of bases selected so that F3 is substantially base paired with F2;
- F4 is a sequence of bases selected so that F4 is substantially base paired with F1;
- the sequences of F3 and F4 being selected so that each contains an adequate number of bases to achieve sufficient binding with F1 and F2 so that cleavage can take place;
- L1 is a sequence of bases selected so that L1 does not base pair with CS;
- P1 and P4 each is a sequence of bases, the sequences of P1 and P4 being selected so that P1 and P4 are substantially base paired;
- S1 and S2 each is a sequence of bases, the sequences of S1 and S2 being selected so that S1 and S2 are substantially unpaired;
- P2 and P3 each is a sequence of bases, the sequences of P2 and P3 being selected so that P2 and P3 are substantially base paired;
- L2 is a sequence of unpaired bases; and
- L3 is a sequence of unpaired bases.
49. A method of terminating an RNA transcript comprising:
- transforming a host cell with a vector comprising DNA coding for an RNA transcript according to claim 48;
- culturing the host cell so that RNA is transcribed and the autocatalytic portion cleaves the RNA transcript to terminate the transcript.
Type: Application
Filed: Sep 15, 2003
Publication Date: Feb 24, 2005
Applicants: ,
Inventors: Arnold Hampel (DeKalb, IL), Richard Tritz (DeKalb, IL), Margaret Hicks (Antioch, TN)
Application Number: 10/664,835