GENES OF CAROTENOID BIOSYNTHESIS AND METABOLISM AND METHODS OF USE THEREOF

Nucleic acid sequences encoding ∈-cyclase, isopentenyl pyrophosphate isomerase and &bgr;-carotene hydroxylase as well as vectors containing the same and hosts transformed with the vectors. Methods for controlling the ratio of various carotenoids in a host and for the production of novel carotenoid pigments. The present invention also provides a method for screening for eukaryotic genes encoding carotenoid biosynthesis, and for modifying the disclosed enzymes.

Skip to: Description  ·  Claims  · Patent History  ·  Patent History
Description
CROSS-REFERENCE TO RELATED APPLICATIONS

[0001] This application is a continuation-in-part of U.S. application Ser. Nos. 09/088,724, filed Jun. 2, 1998, 09/088,725, filed Jun. 2, 1998, and 08/937,155, filed Sep. 25, 1997, U.S. application Ser. No. 08/937,155 being a divisional of U.S. application Ser. No. 08/624,125, filed Mar. 29, 1996, now U.S. Pat. No. 5,744,341.

BACKGROUND OF THE INVENTION

[0002] 1. Field of the Invention

[0003] The present invention describes nucleic acid sequences for eukaryotic genes encoding ∈ lycopene &egr;-cyclase (also known as ∈-cyclase and ∈ lycopene cyclase), isopentenyl pyrophosphate isomerase (IPP) and &bgr;-carotene hydroxylase as well as vectors containing the same and hosts transformed with said vectors. The present invention also provides methods for augmenting the accumulation of carotenoids, changing the composition of the carotenoids, and producing novel and rare carotenoids. The present invention provides methods for controlling the ratio or relative amounts of various carotenoids in a host. The invention also relates to modified lycopene ∈-cyclase, IPP isomerase and &bgr;-carotene hydroxylase. Additionally, the present invention provides a method for screening for genes and cDNAs encoding enzymes of carotenoid biosynthesis and metabolism.

[0004] 1. Background of the Invention

[0005] Carotenoid pigments with cyclic endgroups are essential components of the photosynthetic apparatus in oxygenic photosynthetic organisms (e.g., cyanobacteria, algae and plants; Goodwin, 1980). The symmetrical bicyclic yellow carotenoid pigment &bgr;-carotene (or, in rare cases, the asymmetrical bicyclic &agr;-carotene) is intimately associated with the photosynthetic reaction centers and plays a vital role in protecting against potentially lethal photooxidative damage (Koyama, 1991). &bgr;-carotene and other carotenoids derived from it or from &agr;-carotene also serve as light-harvesting pigments (Siefermann-Harms, 1987), are involved in the thermal dissipation of excess light energy captured by the light-harvesting antenna (Demmig-Adams & Adams, 1992), provide substrate for the biosynthesis of the plant growth regulator abscisic acid (Rock & Zeevaart, 1991; Parry & Horgan, 1991), and are precursors of vitamin A in human and animal diets (Krinsky, 1987). Plants also exploit carotenoids as coloring agents in flowers and fruits to attract pollinators and agents of seed dispersal (Goodwin, 1980). The color provided by carotenoids is also of agronomic value in a number of important crops. Carotenoids are currently harvested from a variety of organisms, including plants, algae, yeasts, cyanobacteria and bacteria, for use as pigments in food and feed.

[0006] The probable pathway for formation of cyclic carotenoids in plants, algae and cyanobacteria is illustrated in FIG. 1. Two types of cyclic endgroups or rings are commonly found in higher plant carotenoids, these are referred to as the &bgr; (beta) and ∈ (epsilon) rings (FIG. 3). The precursor acyclic endgroup (no ring structure) is referred to as the &PSgr; (psi) endgroup. The &bgr; and &egr; endgroups differ only in the position of the double bond in the ring. Carotenoids with two &bgr; rings are ubiquitous, and those with one &bgr; and one ∈ ring are common, but carotenoids with two ∈ rings are uncommon. &bgr;-carotene (FIG. 1) has two &bgr;-endgroups and is a symmetrical compound that is the precursor of a number of other important plant carotenoids such as zeaxanthin and violaxanthin (FIG. 2).

[0007] Genes encoding enzymes of carotenoid biosynthesis have previously been isolated from a variety of sources including bacteria (Armstrong et al., 1989, Mol. Gen. Genet. 216, 254-268; Misawa et al., 1990, J. Bacteriol., 172, 6704-12), fungi (Schmidhauser et al., 1990, Mol. Cell. Biol. 10, 5064-70), cyanobacteria (Chamovitz et al., 1990, Z. Naturforsch, 45c, 482-86; Cunningham et al., 1994) and higher plants (Bartley et al., Proc. Natl. Acad. Sci USA 88, 6532-36; Martinez-Ferez & Vioque, 1992, Plant Mol. Biol. 18, 981-83). Many of the isolated enzymes show a great diversity in structure, function and inhibitory properties between sources. For example, phytoene desaturases from the cyanobacterium Synechococcus and from higher plants and green algae carry out a two-step desaturation to yield &zgr;-carotene as a reaction product. In plants and cyanobacteria a second enzyme (&zgr;-carotene desaturase), similar in amino acid sequence to the phytoene desaturase, catalyzes two additional desaturations to yield lycopene. In contrast, a single desaturase enzyme from Erwinia herbicola and from other bacteria introduces all four double bonds required to form lycopene. The Erwinia and other bacterial desaturases bear little amino acid sequence similarity to the plant and cyanobacterial desaturase enzymes, and are thought to be of unrelated ancestry. Therefore, even with a gene in hand from one source, it may be difficult to identify a gene encoding an enzyme of similar function in another organism. In particular, the sequence similarity between certain of the prokaryotic and eukaryotic genes encoding enzymes of carotenoid biosynthesis is quite low.

[0008] Further, the mechanism of gene expression in prokaryotes and eukaryotes appears to differ sufficiently such that one cannot expect that an isolated eukaryotic gene will be properly expressed in a prokaryotic host.

[0009] The difficulties in isolating genes encoding enzymes with similar functions is exemplified by recent efforts to isolate the gene encoding the enzyme that catalyzes the formation of &bgr;-carotene from the acyclic precursor lycopene. Although a gene encoding an enzyme with this function had been isolated from a bacterium, it had not been isolated from any photosynthetic procaryote or from any eukaryotic organism. The isolation and characterization of the enzyme catalyzing formation of &bgr;-carotene in the cyanobacterium Synechococcus PCC7942 was described by the present inventors and others (Cunningham et al., 1993 and 1994). The amino acid sequence similarity of the cyanobacterial enzyme to the various bacterial lycopene &bgr;-cyclases is so low (ca. 18-25% overall; Cunningham et al., 1994) that there is much uncertainty as to whether they share a common ancestry or, instead, represent an example of convergent evolution.

[0010] The need remains for the isolation of eukaryotic and prokaryotic genes and cDNAs encoding polypeptides involved in the carotenoid biosynthetic pathway, including those encoding a lycopene ∈-cyclase, IPP isomerase and &bgr;-carotene hydroxylase. There remains a need for methods to enhance the production of carotenoids, to alter the composition of carotenoids, and to reduce or eliminate carotenoid production. There also remains a need in the art for methods for screening for genes and cDNAs encoding enzymes of carotenoid biosynthesis and metabolism.

SUMMARY OF THE INVENTION

[0011] Accordingly, a first object of this invention is to provide purified and/or isolated nucleic acids which encode enzymes involved in carotenoid biosynthesis; in particular, lycopene ∈-cyclase, IPP isomerase and &bgr;-carotene hydroxylase.

[0012] A second object of this invention is to provide purified and/or isolated nucleic acids which encode enzymes which produce novel or uncommon carotenoids.

[0013] A third object of the present invention is to provide vectors containing said genes.

[0014] A fourth object of the present invention is to provide hosts transformed with said vectors.

[0015] Another object of the present invention is to provide hosts which accumulate novel or uncommon carotenoids or which accumulate greater amounts of specific or total carotenoids.

[0016] Another object of the present invention is to provide hosts with inhibited and/or altered carotenoid production.

[0017] Another object of this invention is to secure the expression of eukaryotic carotenoid-related genes in a recombinant prokaryotic host.

[0018] Yet another object of the present invention is to provide a method for screening for eukaryotic and prokaryotic genes and cDNAs which encode enzymes involved in carotenoid biosynthesis and metabolism.

[0019] An additional object of the invention is to provide a method for manipulating carotenoid biosynthesis in photosynthetic organisms by inhibiting the synthesis of certain enzymatic products to cause accumulation of precursor compounds.

[0020] Another object of the invention is to provide modified lycopene ∈-cyclase, IPP isomerase and &bgr;carotene hydroxylase.

[0021] These and other objects of the present invention have been realized by the present inventors as described below.

[0022] A subject of the present invention is an isolated and/or purified nucleic acid sequence which encodes for a protein having lycopene ∈-cyclase, IPP isomerase or &bgr;-carotene hydroxylase enzyme activity and having the amino acid sequence of SEQ ID NOS:2, 4, 14-21, 23 or 25-27.

[0023] The invention also includes vectors which comprise any of the nucleic acid sequences listed above, and host cells transformed with such vectors.

[0024] Another subject of the present invention is a method of producing or enhancing the production of a carotenoid in a host cell, comprising inserting into the host cell a vector comprising a heterologous nucleic acid sequence which encodes for a protein having lycopene ∈-cyclase, IPP isomerase or &bgr;-carotene hydroxylase enzyme activity, wherein the heterologous nucleic acid sequence is operably linked to a promoter; and expressing the heterologous nucleic acid sequence to produce the protein.

[0025] Yet another subject of the present invention is a method of modifying the production of carotenoids in a host cell, the method comprising inserting into the host cell a vector comprising a heterologous nucleic acid sequence which produces an RNA and/or encodes for a protein which modifies lycopene ∈-cyclase, IPP isomerase or &bgr;-carotene hydroxylase enzyme activity, relative to an untransformed host cell, wherein the heterologous nucleic acid sequence is operably linked to a promoter; and expressing the heterologous nucleic acid sequence in the host cell to modify the production of the carotenoids in the host cell, relative to the untransformed host cell.

[0026] The present invention also includes a method of expressing, in a host cell, a heterologous nucleic acid sequence which encodes for a protein having lycopene ∈-cyclase, IPP isomerase or &bgr;-carotene hydroxylase enzyme activity, the method comprising inserting into the host cell a vector comprising the heterologous nucleic acid sequence, wherein the heterologous nucleic acid sequence is operably linked to a promoter; and expressing the heterologous nucleic acid sequence.

[0027] Also included is a method of expressing, in a host cell, a heterologous nucleic acid sequence which encodes for a protein which modifies lycopene ∈-cyclase, IPP isomerase or &bgr;-carotene hydroxylase enzyme activity in the host cell, relative to an untransformed host cell, the method comprising inserting into the host cell a vector comprising the heterologous nucleic acid sequence, wherein the heterologous nucleic acid sequence is operably linked to a promoter; and expressing the heterologous nucleic acid sequence.

[0028] Another subject of the present invention is a method for screening for genes and cDNAs which encode enzymes involved in carotenoid biosynthesis and metabolism.

BRIEF DESCRIPTION OF THE DRAWINGS

[0029] A more complete appreciation of the invention and many of the attendant advantages thereof will be readily obtained as the same becomes better understood by reference to the following detailed description when considered in connection with the accompanying drawings, wherein:

[0030] FIG. 1 is a schematic representation of the putative pathway of &bgr;-carotene biosynthesis in cyanobacteria, algae and plants. The enzymes catalyzing various steps are indicated at the left. Target sites of the bleaching herbicides NFZ and MPTA are also indicated at the left. Abbreviations: DMAPP, dimethylallyl pyrophosphate; FPP, farnesyl pyrophosphate; GGPP, geranylgeranyl pyrophosphate; GPP, geranyl pyrophosphate; IPP, isopentenyl pyrophosphate; LCY, lycopene cyclase; MVA, mevalonic acid; MPTA, 2-(4-methylphenoxy)triethylamine hydrochloride; NFZ, norflurazon; PDS, phytoene desaturase; PSY, phytoene synthase; ZDS, &zgr;-carotene desaturase; PPPP, prephytoene pyrophosphate.

[0031] FIG. 2 depicts possible routes of synthesis of cyclic carotenoids and common plant and algal xanthophylls (oxycarotenolds) from neurosporene. Demonstrated activities of the &bgr;- and ∈-cyclase enzymes of A. thaliana are indicated by bold arrows labelled with &bgr;or ∈ respectively. A bar below the arrow leading to ∈-carotene indicates that the enzymatic activity was examined but no product was detected. The steps marked by an arrow with a dotted line have not been specifically examined. Conventional numbering of the carbon atoms is given for neurosporene and &agr;-carotene. Inverted triangles (▾) mark positions of the double bonds introduced as a consequence of the desaturation reactions.

[0032] FIG. 3 depicts the carotene endgroups which are found in plants.

[0033] FIG. 4 is a DNA sequence and the predicted amino acid sequence of a lycopene ∈-cyclase cDNA isolated from A. thaliana (SEQ ID NOS:1 and 2). These sequences were deposited under Genbank accession number U50738. This cDNA is incorporated into the plasmid pATeps.

[0034] FIG. 5 is a DNA sequence encoding the &bgr;-carotene hydroxylase isolated from A. thaliana (SEQ ID NO:3). This cDNA is incorporated into the plasmid pATOHB.

[0035] FIG. 6 is an alignment of the predicted amino acid sequences of A. thaliana &bgr;-carotene hydroxylase (SEQ ID NO:4) with those of the bacterial &bgr;-carotene hydroxylase enzymes from Alicalgenes sp. (SEQ ID NO:5) (Genbank D58422), Erwinia herbicola Eho10 (SEQ ID NO.: 6) (GenBank M872280), Erwinia uredovora (SEQ ID NO.: 7) (GenBank D90087) and Agrobacterium aurianticum (SEQ ID NO.: 8) (GenBank D58420). A consensus sequence is also shown. All five genes are identical where a capital letter appears in the consensus. A lowercase letter indicates that three of five, including A. thaliana , have the identical residue. TM;

[0036] transmembrane.

[0037] FIG. 7 is a DNA sequence of a cDNA encoding an IPP isomerase isolated from A. thaliana (SEQ ID NO:9). This cDNA is incorporated into the plasmid pATDP5.

[0038] FIG. 8 is a DNA sequence of a second cDNA encoding another IPP isomerase isolated from A. thaliana (SEQ ID NO:10). This cDNA is incorporated into the plasmid pATDP7.

[0039] FIG. 9 is a DNA sequence of a cDNA encoding an IPP isomerase isolated from Haematococcus pluvialis (SEQ ID NO:11). This cDNA is incorporated into the plasmid pBP04.

[0040] FIG. 10 is a DNA sequence of a second cDNA encoding another IPP isomerase isolated from Haematococcus pluvialis (SEQ ID NO:12). This cDNA is incorporated into the plasmid pHP05.

[0041] FIG. 11 is an alignment of the amino acid sequences predicted by IPP isomerase cDNAs isolated from A. thaliana (SEQ ID NO.: 16 and 18), H. pluvialis (SEQ ID NOS.: 14 and 15), Clarkia breweri (SEQ ID NO.: 17) (See, Blanc & Pichersky, Plant Physiol. (1995) 108:855; Genbank accession no. X82627) and Saccharomyces cerevisiae (SEQ ID NO.: 19) (Genbank accession no. J05090).

[0042] FIG. 12 is a DNA sequence of the cDNA encoding an IPP isomerase isolated from Tagetes erecta (marigold; SEQ ID NO:13). This cDNA is incorporated into the plasmid pPMDP1. xxx's denote a region not originally sequenced. FIG. 21A shows the complete marigold sequence.

[0043] FIG. 13 is an alignment of the consensus sequence of four plant &bgr;-cyclases (SEQ ID NO.: 20) with the A. thaliana lycopene ∈-cyclase (SEQ ID NO.: 21). A capital letter in the plant &bgr; consensus is used where all four &bgr;-cyclase genes predict the same amino acid residue in this position. A small letter indicates that an identical residue was found in three of the four. Dashes indicate that the amino acid residue was not conserved and dots in the sequence denote a gap. A consensus for the aligned sequences is given, in capital letters below the alignment, where the &bgr;- and ∈-cyclases have the same amino acid residue. Arrows indicate some of the conserved amino acids that will be used as junction sites for construction of chimeric cyclases with novel enzymatic activities. Several regions of interest including a sequence signature indicative of a dinucleotide-binding motif and two predicted transmembrane (TM) helical regions are indicated below the alignment and are underlined.

[0044] FIG. 14 shows the nucleotide (SEQ ID NO:22) and amino acid sequences (SEQ ID NO:23) of the Adonis palaestina (pheasant's eye) ∈-cyclase cDNA #5.

[0045] FIG. 15A shows the nucleotide (SEQ ID NO:24) and amino acid sequences (SEQ ID NO:25) of a potato ∈-cyclase cDNA. FIG. 15B shows the amino acid sequence (SEQ ID NO:26) of a chimeric lettuce/potato lycopene ∈-cyclase. Amino acids in lower case are from the lettuce cDNA and those in upper case are from the potato cDNA. The product of this chimeric cDNA has ∈-cyclase activity and converts lycopene to the monocyclic &dgr;-carotene.

[0046] FIG. 16 shows a comparison between the amino acid sequences of the Arabidopsis ∈-cyclase (SEQ ID NO:27) and the potato ∈-cyclase (SEQ ID NO:25).

[0047] FIG. 17A shows the nucleotide sequence of the Adonis palaestina Ipi1 (SEQ ID NO:28) and FIG. 17B shows the nucleotide sequence of the Adonis palaestina Ipi2 (SEQ ID NO:29).

[0048] FIG. 18A shows the nucleotide sequence of the Haematoccus pluvialis Ipi1 (SEQ ID NO:11) and FIG. 18B shows the nucleotide sequence of the Haematoccus pluvialis Ipi2 (SEQ ID NO:30).

[0049] FIG. 19A shows the nucleotide sequence of the Lactuca sativa (romaine lettuce) Ipi1 (SEQ ID NO:3 1) and FIG. 19B shows the nucleotide sequence of the Lactuca sativa Ipi2 (SEQ ID NO:32).

[0050] FIG. 20 shows the nucleotide sequence of the Chlamydomonas reinhardtii Ipi1 (SEQ ID NO:33).

[0051] FIG. 21A shows the nucleotide sequence of the Tagetes erecta (marigold) Ipi1 (SEQ ID NO:34) and FIG. 21B shows the nucleotide sequence of the Oryza sativa (rice) Ipi1 (SEQ ID NO:35).

[0052] FIG. 22 shows a amino acid sequence alignment of various plant and green algal isopentenyl isomerases (IPI) (SEQ ID NOS:16, 36-45).

[0053] FIG. 23 shows a comparison between Adonis palaestina ∈-cyclase cDNA #3 and cDNA #5 nucleotide sequences.

[0054] FIG. 24 shows a comparison between Adonis palaestina ∈-cyclase cDNA #3 and cDNA #5 predicted amino acid sequences.

[0055] FIG. 25 shows a sequence alignment of various plant &bgr;- and ∈-cyclases. Those sequences outlined in grey denote identical sequences among the ∈-cyclases. Those sequences outlined in black denote identical sequences among both the &bgr;- and ∈-cyclases.

[0056] FIG. 26 shows a sequence alignment of the plant ∈-cyclases from FIG. 25. Those sequences outlined in black denote identical sequences among the ∈-cyclases.

[0057] FIG. 27 is a dendrogram or “tree” illustrating the degree of amino acid sequence similarity for various lycopene &bgr;- and ∈-cyclases.

[0058] FIG. 28 shows a comparison between Arabidopsis ∈-cyclase and lettuce ∈-cyclase predicted amino acid sequences.

DESCRIPTION OF THE PREFERRED EMBODIMENTS

[0059] The present invention includes an isolated and/or purified nucleic acid sequence which encodes for a protein having lycopene ∈-cyclase, IPP isomerase or &bgr;-carotene hydroxylase enzyme activity and having the amino acid sequence of SEQ ID NOS:2, 4, 14-21, 23 or 25-27. Nucleic acids encoding lycopene ∈-cyclase, &bgr;-carotene hydroxylase and IPP isomerases have been isolated from several genetically distant sources.

[0060] The present inventors have isolated nucleic acids encoding the enzyme IPP isomerase, which catalyzes the reversible conversion of isopentenyl pyrophosphate (IPP) to dimethylallyl pyrophosphate (DMAPP). IPP isomerase cDNAs were isolated from the plants A. thaliana, Tagetes erecta (marigold), Adonis palaestina (pheasant's eye), Lactuca sativa (romaine lettuce) and from the green algae H. pluvialis and Chlamydomonas reinhardtii. Alignments of the amino acid sequences predicted by some of these cDNAs are shown in FIGS. 12 and 22. Plasmids containing some of these cDNAs were deposited with the American Type Culture Collection, 12301 Parklawn Drive, Rockville Md. 20852 on Mar. 4, 1996 under ATCC accession numbers 98000 (pHP05—H. pluvialis); 98001 (pMDP1—marigold); 98002 (pATDP7—A. thaliana ) and 98004 (pHP04—H. pluvialis).

[0061] The present inventors have also isolated nucleic acids encoding the enzyme &bgr;-carotene hydroxylase, which is responsible for hydroxylating the &bgr;-endgroup in carotenoids. The nucleic acid of the present invention is shown in SEQ ID NO:3 and FIG. 5. The full length cDNA product hydroxylates both end groups of &bgr; -carotene as do products of cDNAs which encode proteins truncated by up to 50 amino acids from the N-terminus. Products of genes which encode proteins truncated between about 60-110 amino acids from the N-terminus preferentially hydroxylate only one ring. A plasmid containing this gene was deposited with the American Type Culture Collection, 12301 Parklawn Drive, Rockville Md. 20852 on Mar. 4, 1996 under ATCC accession number 98003 (pATOHB—A. thaliana ).

[0062] The present inventors have also isolated nucleic acids encoding the enzyme lycopene ∈-cyclase, which is responsible for the formation of ∈-endgroups in carotenoids. The A. thaliane ∈-cyclase adds an ∈ ring to only one end of the symmetrical lycopene while the related &bgr;-cyclase adds a ring at both ends. The A. thaliana cDNA of the present invention is shown in FIG. 4 and SEQ ID NO:1. A plasmid containing this gene was deposited with the American Type Culture Collection, 12301 Parklawn Drive, Rockville Md. 20852 on Mar. 4, 1996 under ATCC accession number 98005 (pATeps—A. thaliana).

[0063] In addition, lycopene ∈-cyclases have been identified in lettuce and in Adonis palaestina (cDNA #5) which encode enzymes that convert lycopene to the bicyclic ∈-carotene (&egr;,&egr;-carotene). An additional cDNA from Adonis palaestina (cDNA #3) encodes a lycopene ∈-cyclase which converts lycopene into &dgr;-carotene (∈,&PSgr;-carotene) and differs from the lycopene ∈-cyclase which forms bicyclic ∈-carotene (∈,∈-carotene) by only 5 amino acids. One or more of these amino acids may be modified by alteration of the nucleotide sequence in the #5 cDNA to obtain an enzyme which forms the bicyclic ∈,∈-carotene. The sequences of the Adonis palaestina and Arabidopsis thaliana ∈-cyclases have about 70% nucleotide identity and about 72% amino acid identity.

[0064] Initial experiments by the inventors with chimeric genes indicated that the part of the ∈-cyclase which is responsible for adding 2 ∈ rings to form ∈,∈-carotene is the carboxy terminal portion of the gene. The lettuce ∈-cyclase adds two ∈ rings to form ∈,∈-carotene. A DNA encoding a partial potato ∈-cyclase (missing its amino terminal portion), when combined with an amino terminal region from the lettuce ∈-cyclase gene, produces a monocyclic &dgr;-carotene (∈,&PSgr;-carotene). With the discovery of the differences between the Adonis palaestina clone #3 and clone #5, the specific amino acids responsible for the addition of an extra ∈ ring have been identified (FIG. 24). Specifically, amino acid 55 is Thr in clone #3 and Ser in clone #5, amino acid 210 is Asn in clone #3 and Asp in clone #5, amino acid 231 is Asp in clone #3 and Glu in clone #5, amino acid 352 is Ile in clone #3 and Val in clone #5, and amino acid 524 is Lys in clone #3 and Arg in clone #5. It can be appreciated that these changes are quite conservative, as only one change, at amino acid 210, changes the charge of the protein.

[0065] Thus, it is clear that the nucleic acids of the invention encoding the enzymes as presently disclosed may be altered to increase a particularly desirable property of the enzyme, to change a property of the enzyme, or to diminish an undesirable property of the enzyme. Such modifications can be by deletion, substitution, or insertion of one or more amino acids, and can be performed by routine enzymatic manipulation of the nucleic acid encoding the enzyme (such as by restriction enzyme digestion, removal of nucleotides by mung bean nuclease or Bal31, insertion of nucleotides by Klenow fragment, and by religation of the ends), by site-directed mutagenesis, or may be accidental, such as by low fidelity PCR or those obtained through mutations in hosts that are producers of the enzymes. These techniques as well as other suitable techniques are well known in the art.

[0066] Mutations can be made in the nucleic acids of the invention such that a particular codon is changed to a codon which codes for a different amino acid. Such a mutation is generally made by making the fewest nucleotide changes possible. A substitution mutation of this sort can be made to change an amino acid in the resulting protein in a non-conservative manner (i.e., by changing the codon from an amino acid belonging to a grouping of amino acids having a particular size or characteristic to an amino acid belonging to another grouping) or in a conservative manner (i.e., by changing the codon from an amino acid belonging to a grouping of amino acids having a particular size or characteristic to an amino acid belonging to the same grouping). Such a conservative change generally leads to less change in the structure and function of the resulting protein. A non-conservative change is more likely to alter the structure, activity or function of the resulting protein. The present invention should be considered to include sequences containing conservative changes which do not significantly alter the activity or binding characteristics of the resulting protein.

[0067] The following is one example of various groupings of amino acids: Amino acids with nonpolar R groups: Alanine, Valine, Leucine, Isoleucine, Proline, Phenylalanine, Tryptophan and Methionine. Amino acids with uncharged polar R groups: Glycine, Serine, Threonine, Cysteine, Tyrosine, Asparagine and Glutamine. Amino acids with charged polar R groups (negatively charged at Ph 6.0): Aspartic acid and Glutamic acid. Basic amino acids (positively charged at pH 6.0): Lysine, Arginine and Histidine.

[0068] Another grouping may be those amino acids with phenyl groups: Phenylalanine, Tryptophan and Tyrosine.

[0069] Another grouping may be according to molecular weight (i.e., size of R groups).

[0070] Particularly preferred substitutions are:

[0071] Lys for Arg and vice versa such that a positive charge may be maintained;

[0072] Glu for Asp and vice versa such that a negative charge may be maintained;

[0073] Ser for Thr such that a free -OH can be maintained; and

[0074] Gln for Asn such that a free NH2 can be maintained.

[0075] Amino acid substitutions may also be introduced to substitute an amino acid with a particularly preferable property. For example, a Cys may be introduced to provide a potential site for disulfide bridges with another Cys. A His may be introduced as a particularly “catalytic” site (i.e., His can act as an acid or base and is the most common amino acid in biochemical catalysis). Pro may be introduced because of its particularly planar structure, which induces &bgr;-turns in the protein's structure.

[0076] It is clear that certain modifications of SEQ ID NOS:2, 4, 14-21, 23 or 25-27 can take place without destroying the activity of the enzyme. It is noted especially that truncated versions of the nucleic acids of the invention are functional. For example, several amino acids (from 1 to about 120) can be deleted from the N-terminus of the lycopene ∈-cyclases of the invention, and a functional protein can still be produced. This fact is made especially clear from FIG. 25, which shows a sequence alignment of several plant ∈-cyclases. As can be seen from FIG. 25, there is an enormous amount of sequence disparity between amino acid sequences 2 to about 50-70 (depending on the particular sequence, since gaps are present). There is less, but also a substantial amount of, sequence dissimilarity between about 50-70 to about 90-120 (depending on the particular sequence). Thereafter, the sequences are fairly conserved, except for small pockets of dissimilarity between about 275-295 to about 285-305 (depending on the particular sequence), and between about 395-415 to about 410-430 (depending on the particular sequence).

[0077] The present inventors have found that the amount of the 5′ region present in the nucleic acids of the invention can alter the activity of the enzyme. Instead of diminishing activity, truncating the 5′ region of the nucleic acids of the invention may result in an enzyme with a different specificity. Thus, the present invention relates to nucleic acids and enzymes encoded thereby which are truncated to within 0-50, preferably 0-25, codons of the 5′ initiation codon of their prokaryotic counterparts as determined by alignment maps as discussed below.

[0078] For example, when the cDNA encoding A. thaliana &bgr;-carotene hydroxylase was truncated, the resulting enzyme catalyzed the formation of &bgr;cryptoxanthin as the major product and zeaxanthin as minor product; in contrast to its normal production of zeaxanthin.

[0079] The present invention is intended to include those nucleic acid and amino acid sequences in which substitutions, deletions, additions or other modifications have taken place, as compared to SEQ ID NOS:2, 4, 14-21, 23 or 25-27, without destroying the activity of the enzyme. Preferably, the substitutions, deletions, additions or other modifications take place at the 5′ end, or any other of those positions which already show dissimilarity between any of the presently disclosed amino acid sequences (see also FIG. 25) or other amino acid sequences which are known in the art and which encode the same enzyme (i.e., lycopene ∈-cyclase, IPP isomerase or &bgr;-carotene hydroxylase).

[0080] In each case, nucleic acid and amino acid sequence similarity and identity is measured using sequence analysis software, for example, the Sequence Analysis, Gap, or BestFit software packages of the Genetics Computer Group (University of Wisconsin Biotechnology Center, 1710 University Avenue, Madison, Wis. 53705), MEGAlign (DNAStar, Inc., 1228 S. Park St., Madison, Wis. 53715), or MacVector (Oxford Molecular Group, 2105 S. Bascom Avenue, Suite 200, Campbell, Calif. 95008). Such software uses algorithms to match similar sequences by assigning degrees of identity to various substitutions, deletions, and other modifications, and includes detailed instructions as to useful parameters, etc., such that those of routine skill in the art can easily compare sequence similarities and identities. An example of a useful algorithm in this regard is the algorithm of Needleman and Wunsch, which is used in the Gap program discussed above. This program finds the alignment of two complete sequences that maximizes the number of matches and minimizes the number of gaps. Another useful algorithm is the algorithm of Smith and Waterman, which is used in the BestFit program discussed above. This program creates an optimal alignment of the best segment of similarity between two sequences. Optimal alignments are found by inserting gaps to maximize the number of matches using the local homology algorithm of Smith and Waterman.

[0081] Conservative (i.e. similar) substitutions typically include substitutions within the following groups: glycine and alanine; valine, isoleucine and leucine; aspartic acid, glutamic acid, asparagine and glutamine; serine and threonine; lysine and arginine; and phenylalanine and tyrosine. Substitutions may also be made on the basis of conserved hydrophobicity or hydrophilicity (see Kyte and Doolittle, J. Mol. Biol. 157: 105-132 (1982)), or on the basis of the ability to assume similar polypeptide secondary structure (see Chou and Fasman, Adv. Enzymol. 47: 45-148 (1978)).

[0082] If comparison is made between nucleotide sequences, preferably the length of comparison sequences is at least 50 nucleotides, more preferably at least 60 nucleotides, at least 75 nucleotides or at least 100 nucleotides. It is most preferred if comparison is made between the nucleic acid sequences encoding the enzyme coding regions necessary for enzyme activity. If comparison is made between amino acid sequences, preferably the length of comparison is at least 20 amino acids, more preferably at least 30 amino acids, at least 40 amino acids or at least 50 amino acids. It is most preferred if comparison is made between the amino acid sequences in the enzyme coding regions necessary for enzyme activity.

[0083] It should be appreciated that also within the scope of the present invention are nucleic acid sequences encoding lycopene ∈-cyclases, IPP isomerases and &bgr;-carotene hydroxylases which code for enzymes having the same amino acid sequence as SEQ ID NOS:2, 4, 14-21, 23 or 25-27, but which are degenerate to the nucleic acids specifically disclosed herein.

[0084] The amino acid residues described herein are preferred to be in the “L” isomeric form. However, residues in the “D” isomeric form can be substituted for any L-amino acid residue, as long as the desired functional property of immunoglobulin-binding is retained by the polypeptide.

[0085] In accordance with the present invention there may be employed conventional molecular biology, microbiology, and recombinant DNA techniques within the skill of the art. Such techniques are explained fully in the literature. See, e.g., Sambrook et al, “Molecular Cloning: A Laboratory Manual” (1989); “Current Protocols in Molecular Biology” Volumes I-III [Ausubel, R. M., ed. (1994)]; “Cell Biology: A Laboratory Handbook” Volumes I-III [J. E. Celis, ed. (1994))]; “Current Protocols in Immunology” Volumes I-III [Coligan, J. E., ed. (1994)]; “Oligonucleotide Synthesis” (M. J. Gait ed. 1984); “Nucleic Acid Hybridization” [B. D. Hames & S. J. Higgins eds. (1985)]; “Transcription And Translation” [B. D. Hames & S. J. Higgins, eds. (1984)]; “Animal Cell Culture” [R. I. Freshney, ed. (1986)]; “Immobilized Cells And Enzymes” [IRL Press, (1986)]; B. Perbal, “A Practical Guide To Molecular Cloning” (1984).

[0086] The present invention also includes vectors. Suitable vectors according to the present invention comprise a nucleic acid of the invention encoding an enzyme involved in carotenoid biosynthesis or metabolism and a suitable promoter for the host, and can be constructed using techniques well known in the art (for example Sambrook et al., Molecular Cloning A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 1989; Ausubel et al., Current Protocols in Molecular Biology, Greene Publishing and Wiley Interscience, New York, 1991). Suitable vectors for eukaryotic expression in plants are described in Frey et al., Plant J. (1995) 8(5):693 and Misawa et al, 1994a; incorporated herein by reference. Suitable vectors for prokaryotic expression include pACYC 184, pUC 119, and pBR322 (available from New England BioLabs, Bevery, Mass.) and pTrcHis (Irvitrogen) and pET28 (Novagen) and derivatives thereof. The vectors of the present invention can additionally contain regulatory elements such as promoters, repressors, selectable markers such as antibiotic resistance genes, etc.

[0087] The nucleic acids encoding the carotenoid enzymes as described above, when cloned into a suitable expression vector, can be used to overexpress these enzymes in a plant expression system or to inhibit the expression of these enzymes. For example, a vector containing the gene encoding lycopene ∈-cyclase can be used to increase the amount of &agr;-carotene and carotenoids derived from &agr;-carotene (such as lutein and &agr;-cryptoxanthin) in an organism and thereby alter the nutritional value, pharmacology and visual appearance value of the organism.

[0088] Therefore, the present invention includes a method of producing or enhancing the production of a carotenoid in a host cell, relative to an untransformed host cell, the method comprising inserting into the host cell a vector comprising a heterologous nucleic acid sequence which encodes for a protein having lycopene ∈-cyclase, IPP isomerase or &bgr;-carotene hydroxylase enzyme activity, wherein the heterologous nucleic acid sequence is operably linked to a promoter; and expressing the heterologous nucleic acid sequence to produce the protein.

[0089] The present invention also includes a method of modifying the production of carotenoids in a host cell, the method comprising inserting into the host cell a vector comprising a heterologous nucleic acid sequence which produces an RNA and/or encodes for a protein which modifies lycopene ∈-cyclase, IPP isomerase or &bgr;-carotene hydroxylase enzyme activity, relative to an untransformed host cell, wherein the heterologous nucleic acid sequence is operably linked to a promoter; and expressing the heterologous nucleic acid sequence in the host cell to modify the production of the carotenoids in the host cell, relative to the untransformed host cell.

[0090] The term “modifying the production” means that the amount of carotenoids produced in the host cell can be enhanced, reduced, or left the same, as compared to the untransformed host cell. In accordance with one embodiment of the present invention, the make-up of the carotenoids (i.e., the specific carotenoids produced) is changed vis a vis each other, and this change in make-up may result in either a net gain, net loss, or no net change in the total amount of carotenoids produced in the cell. In accordance with another embodiment of the present invention, the production or the biochemical activity of the carotenoids (or the enzymes which catalyze their formation) is enhanced by the insertion of an enzyme-encoding nucleic acid of the invention. In yet another embodiment of the invention, the production or the biochemical activity of the carotenoids (or the enzymes which catalyze their formation) may be reduced or inhibited by a number of different approaches available to those skilled in the art, including but not limited to such methodologies or approaches as anti-sense (e.g., Gray et al (1992) Plant Mol. Biol. 19:69-87), ribozymes (e.g., Wegener et al (1994) Mol. Gen. Genet. 245:465-470), co-suppression (e.g., Fray and Grierson (1993) Plant Mol. Biol. 22:589-602), targeted disruption of the gene (e.g., Schaefer et al. (1997) Plant J. 11:1195-1206), intracellular antibodies (e.g., Rondon and Marasco (1997) Ann. Rev. Microbiol. 51:257-283) or whatever other approaches rely on the knowledge or availability of the nucleic acid or amino acid sequences of the invention and/or portions thereof, to thereby reduce accumulation of carotenoids with ∈ rings and compounds derived from them (for ∈-cyclase inhibition), or carotenoids with hydroxylated &bgr; rings and compounds derived from them (for &bgr;-hydroxylase inhibition), or, in the case if IPP isomerase, accumulation of any isoprenoid compound.

[0091] Preferably, at least a portion of the nucleic acid sequences used in the methods, vectors and host cells of the invention codes for an enzyme having an amino acid sequence which is at least 85% identical, preferably at least 90%, at least 95% or completely identical to SEQ ID NOS:2, 4, 14-21, 23 or 25-27. Sequence identity is determined as noted above. Preferably, sequence additions, deletions or other modifications are made as indicated above, so as to not affect the function of the particular enzyme.

[0092] In a preferred embodiment, vectors are manufactured which contain a DNA encoding a eukaryotic IPP isomerase upstream of a DNA encoding a second eukaryotic carotenoid enzyme. The inventors have discovered that inclusion of an IPP isomerase gene increases the supply of substrate for the carotenoid pathway; thereby enhancing the production of carotenoid endproducts, as compared to a host cell which is not transformed with such a vector. This is apparent from the much deeper pigmentation in carotenoid-accumulating colonies of E. coli which also contain one of the aforementioned IPP isomerase genes when compared to colonies that lack this additional IPP isomerase gene. Similarly, a vector comprising an IPP isomerase gene can be used to enhance production of any secondary metabolite of dimethylallyl pyrophosphate and/or isopentenyl pyrophosphate (such as isoprenoids, steroids, carotenoids, etc.). The term “isoprenoid” is intended to mean any member of the class of naturally occurring compounds whose carbon skeletons are composed, in part or entirely, of isopentyl C5 units. Preferably, the carbon skeleton is of an essential oil, a fragrance, a rubber, a carotenoid, or a therapeutic compound, such as paclitaxel.

[0093] A vector containing the cDNA encoding a lycopene ∈-cyclase of the invention, preferably the lettuce lycopene ∈-cyclase or Adonis ∈-cyclase #5, can be used to increase the amount of bicyclic ∈-carotene in an organism and thereby alter the nutritional value, pharmacology and visual appearance value of the organism. In addition, the transformed organism can be used in the formulation of therapeutic agents, for example in the treatment of cancer (see Mayne et al (1996) FASEB J. 10:690-701; Tsushima et al (1995) Biol. Pharm. Bull. 18:227-233).

[0094] An antisense strand of a nucleic acid of the invention can be inserted into a vector. For example, the lycopene ∈-cyclase gene can be inserted into a vector and incorporated into the genomic DNA of a host, thereby inhibiting the synthesis of ∈,&bgr;carotenoids (lutein and &agr;-carotene) and enhancing the synthesis of &bgr;, &bgr;-carotenoids (zeaxanthin and &bgr;-carotene).

[0095] The present invention also relates to novel enzymes which are encoded by the amino acid sequences of the invention, or portions thereof.

[0096] The present invention also relates to novel enzymes which can transform known carotenoids into novel or uncommon products. Currently ∈-carotene (see FIG. 2) and &ggr;-carotene are commonly produced only in minor amounts. As described below, an enzyme can be produced which transforms lycopene to &ggr;-carotene and lycopene to ∈-carotene. With these products in hand, bulk synthesis of other carotenoids derived from them are possible. For example, ∈-carotene can be hydroxylated to form lactucaxanthin, an isomer of lutein (one ∈ and one &bgr; ring) and zeaxanthin (two &bgr; rings) where both endgroups are, instead, ∈ rings.

[0097] In addition to novel enzymes produced by truncating the 5′ region of known enzymes, as discussed above, novel enzymes which can participate in the formation of unusual carotenoids can be formed by replacing portions of one gene with an analogous sequence from a structurally related gene. For example, &bgr;cyclase and ∈-cyclase are structurally related (see FIG. 13). By replacing a portion of &bgr;-lycopene cyclase with the analogous portion of ∈-cyclase, an enzyme which produces &ggr;-carotene will be produced (one &bgr; endgroup). Further, by replacing a portion of the lycopene ∈-cyclase with the analogous portion of &bgr;-cyclase, an enzyme which produces ∈-carotene will be produced (with some exceptions, such as the lettuce ∈-cyclase, plant ∈-cyclases normally produce a compound with one ∈-endgroup, &dgr;-carotene). Similarly, &bgr;-hydroxylase could be modified to produce enzymes of novel function by creation of hybrids with ∈-hydroxylase.

[0098] Host systems according to the present invention can comprise any organism that already produces carotenoids or which has been genetically modified to produce carotenoids. The IPP isomerase genes are more broadly applicable for enhancing production of any product dependent on DMAPP and/or IPP as a precursor.

[0099] Organisms which already produce carotenoids include plants, algae, some yeasts, fungi and cyanobacteria and other photosynthetic bacteria. Transformation of these hosts with vectors according to the present invention can be done using standard techniques such as those described in Misawa et al., (1990) supra; Hundle et al., (1993) supra; Hundle et al., (1991) supra; Misawa et al., (1991) supra; Sandmann et al., supra; and Schnurr et al., supra.

[0100] Transgenic organisms can be constructed which include the nucleic acid sequences of the present invention (Bird et al, 1991; Bramley et al, 1992; Misawa et al, 1994a; Misawa et al, 1994b; Cunningham et al, 1993). The incorporation of these sequences can allow the controlling of carotenoid biosynthesis, content, or composition in the host cell. These transgenic systems can be constructed to incorporate sequences which allow for the overexpression of the nucleic acids of the present invention. Transgenic systems can also be constructed containing antisense expression of the nucleic acid sequences of the present invention. Such antisense expression would result in the accumulation of the substrates of the substrates of the enzyme encoded by the sense strand.

[0101] A method for screening for eukaryotic genes which encode enzymes involved in carotenoid biosynthesis comprises transforming a prokaryotic host with a nucleic acid which may contain a eukaryotic or prokaryotic carotenoid biosynthetic gene; culturing said transformed host to obtain colonies; and screening for colonies exhibiting a different color than colonies of the untransformed host.

[0102] Suitable hosts include E. coli , cyanobacteria such as Synechococcus and Synechocystis, alga and plant cells. E. coli are preferred.

[0103] In a preferred embodiment, the above “color complementation” screening protocol can be enhanced by using mutants which are either (1) deficient in at least one carotenoid biosynthetic gene or (2) overexpress at least one carotenoid biosynthetic gene. In either case, such mutants will accumulate carotenoid precursors.

[0104] Prokaryotic and eukaryotic DNA or cDNA libraries can be screened in total for the presence of genes of carotenoid biosynthesis, metabolism and degradation. Preferred organisms to be screened include photosynthetic organisms.

[0105] E. coli can be transformed with these eukaryotic cDNA libraries using conventional methods such as those described in Sambrook et al, 1989 and according to protocols described by the vendors of the cloning vectors.

[0106] For example, the cDNA libraries in bacteriophage vectors such as lambdaZAP (Stratagene) or lambda ZIPLOX (Gibco BRL) can be excised en masse and used to transform E.coli.

[0107] Transformed E. coli can be cultured using conventional techniques. The culture broth preferably contains antibiotics to select and maintain plasmids. Suitable antibiotics include penicillin, ampicillin, chloramphenicol, etc. Culturing is typically conducted at 15-40° C., preferably at room temperature or slightly above (18-28° C.), for 12 hours to 7 days.

[0108] Cultures are plated and the plates are screened visually for colonies with a different color than the colonies of the host E. coli transformed with the empty plasmid cloning vector. For example, E. coli transformed with the plasmid, pAC-BETA (described below), produce yellow colonies that accumulate &bgr;-carotene. After transformation with a cDNA library, colonies which contain a different hue than those formed by E. coli/pAC-BETA would be expected to contain enzymes which modify the structure or accumulation of &bgr;-carotene. Similar E. coli strains can be engineered which accumulate earlier products in carotenoid biosynthesis, such as lycopene, &ggr;-carotene, etc.

[0109] Having generally described this invention, a further understanding can be obtained by reference to certain specific examples which are provided herein for purposes of illustration only and are not intended to be limiting unless otherwise specified.

EXAMPLE

[0110] I. Isolation of &bgr;-carotene hydroxylase

Plasmid Construction

[0111] An 8.6 kb Bg1II fragment containing the carotenoid biosynthetic genes of Erwinia herbicola was first cloned in the BamHI site of plasmid vector pACYC 184 (chloramphenicol resistant), and then a 1.1 kb BamHI fragment containing the E. herbicola &bgr;-carotene hydroxylase (CrtZ) was deleted. E.coli strains containing the resulting plasmid, pAC-BETA, accumulate &bgr;-carotene and form yellow colonies (Cunningham et al., 1994).

[0112] A full length cDNA encoding IPP isomerase of Haematococcus pluvialis (HP04) was first excised with BamHI and KpnI from pBluescript SK-, and then ligated into the corresponding sites of the pTrcHisA vector with high-level expression from the trc promoter (Inivitrogen, Inc.). A fragment containing the IPP isomerase and trc promoter was subsequently excised with EcoRV and KpnI, treated with the Klenow fragment of DNA polymerase to produce blunt ends, and ligated in the Klenow-treated HindIII site of pAC-BETA. E. coli cells transformed with this new plasmid pAC-BETA-04 form orange colonies on LB plates (vs. yellow for those containing pAC-BETA) and cultures accumulate substantially more &bgr;-carotene (ca. two fold) than those that contain pAC-BETA.

Screening of an Arabidopsis cDNA Library

[0113] Several &lgr; cDNA expression libraries of Arabidopsis were obtained from the Arabidopsis Biological Resource Center (Ohio State University, Columbus, Ohio.) (Kieber et al., 1993). The &lgr; cDNA libraries were excised in vivo using Stratagene's ExAssist SOLR system to produce a phagemid cDNA library wherein each phagemid contained also a gene conferring resistance to the antibiotic ampicillin.

[0114] E.coli strain DH10BZIP was chosen as the host cell for the screening and pigment production, although we have also used TOP 10F′ and XL1 Blue for this purpose. DH10B cells were transformed with plasmid pAC-BETA-04 and were plated on LB agar plates containing chloramphenicol at 50 &mgr;g/ml (from United States Biochemical Corporation). The phagemid Arabidopsis cDNA library was then introduced into DH10B cells already containing pAC-BETA-04. Transformed cells containing both pAC-BETA-04 and Arabidopsis cDNA library phagemids were selected on chloramphenicol plus ampicillin (150 &mgr;g/ml) agar plates. Maximum color development occurred after 3 to 7 days incubation at room temperature, and the rare bright yellow colonies were selected from a background of many thousands of orange colonies on each agar plate. Selected colonies were inoculated into 3 ml liquid LB medium containing ampicillin and chloramphenicol, and cultures were incubated at room temperature for 1-2 days, with shaking. Cells were then harvested by centrifugation and extracted with acetone in microfuage tubes. After centrifugation, the pigmented extract was spotted onto silica gel thin-layer chromatography (TLC) plates, and developed with a hexane:ether (1:1, by volume) mobile phases. &bgr;-carotene hydroxylas ∈-encoding cDNAs were identified based on the appearance of a yellow pigment that co-migrated with zeaxanthin on the TLC plates.

Subcloning and Sequencing

[0115] The plasmid containing the &bgr;-carotene hydroxylase cDNA was recovered and analyzed by standard procedures (Sambrook et al., 1989). The Arabidopsis &bgr;-carotene hydroxylase was sequenced completely on both strands on an automatic sequencer (Applied Biosystems, Model 373A, Version 2.0.1S). The cDNA insert of 0.95 kb also was excised and ligated into the a pTrcHis vector. A Bg/II restriction site within the cDNA was used to remove that portion of the cDNA that encodes the predicted polypeptide N terminal sequence region that is not also found in bacterial &bgr;-carotene hydroxylases (FIG. 6). A Bg1II-XhoI fragment was directionally cloned in BamHI-XhoI digested TrcHis vectors.

Pigment Analysis

[0116] A single colony was used to inoculate 50 ml of LB containing ampicillin and chloramphenicol in a 250-ml flask. Cultures were incubated at 28° C. for 36 hours with gentle shaking, and then harvested at 5000 rpm in an SS-34 rotor. The cells were washed once with distilled H2O and resuspended with 0.5 ml of water. The extraction procedures and HPLC were essentially as described previously (Cunningham et al, 1994).

[0117] II. Isolation and biochemical analysis of an Arabidopsis lycopene ∈-cyclase

Plasmid Construction

[0118] Construction of plasmids pAC-LYC, pAC-NEUR, and pAC-ZETA is described in Cunningham et al., (1994). In brief, the appropriate carotenoid biosynthetic genes from Erwinia herbicola, Rhodobacter capsulatus, and Synechococcus sp. strain PCC7942 were cloned in the plasmid vector pACYC184 (New England BioLabs, Beverly, Mass.). Cultures of E. coli containing the plasmids pAC-ZETA, pAC-NEUR, and pAC-LYC, accumulate &zgr;-carotene, neurosporene, and lycopene, respectively. The plasmid pAC-ZETA was constructed as follows: an 8.6-kb Bg1II fragment containing the carotenoid biosynthetic genes of E. herbicola (GenBank M87280; Hundle et al., 1991) was obtained after partial digestion of plasmid pPL376 (Perry et al., 1986; Tuveson et al., 1986) and cloned in the BamHI site of pACYC184 to give the plasmid pAC-EHER. Deletion of adjacent 0.8- and 1.1-kb BamHI-BamHI fragments (deletion Z in Cunningham et al., 1994), and of a 1.1 kB SalI-SalI fragment (deletion X) served to remove most of the coding regions for the E. herbicola &bgr;-carotene hydroxylase (crtZ gene) and zeaxanthin glucosyltransferase (crtX gene), respectively. The resulting plasmid, pAC-BETA, retains functional genes for geranylgeranyl pyrophosphate synthase (crtE), phytoene synthase (crtB), phytoene desaturase (crtI), and lycopene cyclase (crtY). Cells of E. coli containing this plasmid form yellow colonies and accumulate &bgr;carotene. A plasmid containing both the lycopene ∈-and &bgr;cyclase cDNAs of A. thaliana was constructed by excising the ∈-cyclase in clone y2 as a PvuI-PvuII fragment and ligating this piece in the SnaBI site of a plasmid (pSPORT 1 from GIBCO-BRL) that already contained the &bgr;-cyclase (Cunningham et al., 1996).

Organisms and Growth Conditions

[0119] E. coli strains TOP10 and TOP10 F′ (obtained from Invitrogen Corporation, San Diego, Calif.) and XL1-Blue (Stratagene) were grown in Luria-Bertani (LB) medium (Sambrook et al., 1989) at 37° C. in darkness on a platform shaker at 225 cycles per min. Media components were from Difco (yeast extract and tryptone) or Sigma (NaCl). Ampicillin at 150 &mgr;g/mL and/or chloramphenicol at 50 &mgr;g/mL (both from United States Biochemical Corporation) were used, as appropriate, for selection and maintenance of plasmids.

Mass Excision and Color Complementation Screening of an A. thaliana cDNA Library

[0120] A size-fractionated 1-2 kB cDNA library of A. thaliana in lambda ZAPII (Kieber et al., 1993) was obtained from the Arabidopsis Biological Resource Center at The Ohio State University (stock number CD4-14). Other size fractionated libraries were also obtained (stock numbers CD4-13, CD4-15, and CD4-16). An aliquot of each library was treated to cause a mass excision of the cDNAs and thereby produce a phagemid library according to the instructions provided by the supplier of the cloning vector (Stratagene; E. coli strain XL1-Blue and the helper phage R408 were used). The titre of the excised phagemid was determined and the library was introduced into a lycopen-accumulating strain of E. coli TOP10 F′ (this strain contained the plasmid pAC-LYC) by incubation of the phagemid with the E. coli cells for 15 min at 37° C. Cells had been grown overnight at 30° C. in LB medium supplemented with 2% (w/v) maltose and 10 mM MgSO4 (final concentration), and harvested in 1.5 ml microfuge tubes at a setting of 3 on an Eppendorf microfuge (5415C) for 10 min. The pellets were resuspended in 10 mM MgSO4 to a volume equal to one-half that of the initial culture volume. Transformants were spread on large (150 mm diameter) LB agar petri plates containing antibiotics to provide for selection of cDNA clones (ampicillin) and maintenance of pAC-LYC (chloramphenicol). Approximately 10,000 colony forming units were spread on each plate. Petri plates were incubated at 37 C. for 16 hr and then at room temperature for 2 to 7 days to allow maximum color development. Plates were screened visually with the aid of an illuminated 3× magnifier and a low power stage-dissecting microscope for the rare, pale pinkish-yellow to deep-yellow colonies that could be observed in the background of pink colonies. A colony color of yellow or pinkish-yellow was taken as presumptive evidence of a cyclization activity. These yellow colonies were collected with sterile toothpicks and used to inoculate 3 ml of LB medium in culture tubes with overnight growth at 37° C. and shaking at 225 cycles/min. Cultures were split into two aliquots in microfuge tubes and harvested by centrifugation at a setting of 5 in an Eppendorf 5415C microfuge. After discarding the liquid, one pellet was frozen for later purification of plasmid DNA. To the second pellet was added 1.5 ml EtOH, and the pellet was resuspended by vortex mixing, and extraction was allowed to proceed in the dark for 15-30 min with occasional remixing. Insoluble materials were pelleted by centrifugation at maximum speed for 10 min in a microfuge. Absorption spectra of the supernatant fluids were recorded from 350-550 nm with a Perkin Elmer lambda six spectrophotometer.

Analysis of Isolated Clones

[0121] Eight of the yellow colonies contained &bgr;-carotene indicating that a single gene product catalyzes both cyclizations required to form the two &bgr; endgroups of the symmetrical &bgr;-carotene from the symmetrical precursor lycopene. One of the yellow colonies contained a pigment with the spectrum characteristic of &dgr;-carotene, a monocyclic carotenoid with a single ∈ endgroup. Unlike the &bgr; cyclase, this ∈-cyclase appears unable to carry out a second cyclization at the other end of the molecule.

[0122] The observation that ∈-cyclase is unable to form two cyclic ∈-endgroups (e.g. the bicyclic ∈-carotene) illuminates the mechanism by which plants can coordinate and control the flow of substrate into carotenoids derived from &bgr;-carotene versus those derived from &agr;-carotene and also can prevent the formation of carotenoids with two ∈ endgroups.

[0123] The availability of the A. thaliana gene encoding the ∈-cyclase enables the directed manipulation of plant and algal species for modification of carotenoid content and composition. Through inactivation of the ∈-cyclase, whether at the gene level by deletion of the gene or by insertional inactivation or by reduction of the amount of enzyme formed (by such as antisense technology), one may increase the formation of &bgr;-carotene and other pigments derived from it. Since vitamin A is derived only from carotenoids with &bgr; endgroups, an enhancement of the production of &bgr;-carotene versus &agr;-carotene may enhance nutritional value of crop plants. Reduction of carotenoids with ∈-endgroups may also be of value in modifying the color properties of crop plants and specific tissues of these plants. Alternatively, where production of &agr;-carotene, or pigments such as lutein that are derived from &agr;-carotene, is desirable, whether for the color properties, nutritional value or other reason, one may overexpress the ∈-cyclase or express it in specific tissues. Wherever agronomic value of a crop is related to pigmentation provided by carotenoid pigments the directed manipulation of expression of the ∈-cyclase gene and/or production of the enzyme may be of commercial value.

[0124] The predicted amino acid sequence of the A. thaliana ∈-cyclase enzyme was determined. A comparison of the amino acid sequences of the &bgr;- and ∈-cyclase enzymes of Arabidopsis thaliana (FIG. 13) as predicted by the DNA sequence of the respective cDNAs (FIG. 4 for the ∈-cyclase cDNA sequence), indicates that these two enzymes have many regions of sequence similarity, but they are only about 37% identical overall at the amino acid level. The degree of sequence identity at the DNA base level, only about 50%, is sufficiently low such that we and others have been unable to detect this gene by hybridization using the &bgr; cyclase as a probe in DNA gel blot experiments.

REFERENCES

[0125] Each reference cited in this application and/or listed below is hereby incorporated by reference.

[0126] Bird et al, 1991 Biotechnology 9, 635-639.

[0127] Bishop et al., (1995) FEBS Lett. 367, 158-162.

[0128] Bramley, P. M. (1985) Adv. Lipid Res. 21, 243-279.

[0129] Bramley, P. M. (1992) Plant J. 2, 343-349.

[0130] Britton, G. (1988). Biosynthesis of carotenoids. In Plant Pigments, T. W. Goodwin, ed. (London: Academic Press), pp. 133-182.

[0131] Britton, G. (1979) Z. Naturforsch. Section C. Biosci. 34, 979-985.

[0132] Britton, G. (1995) UV/Visible spectroscopy. In Carotenoids, Vol. IB: Spectroscopy, G. Britton, S. Liaaen-Jensen, H. P. Pfander, eds. (Basel: Birkhauser Verlag), pp. 13-62.

[0133] Bouvier et al., (1994) Plant J. 6, 45-54.

[0134] Cunningham et al., (1985) Photochem. Photobiol. 42: 295-307.

[0135] Cunningham et al., (1993) FEBS Lett. 328, 130-138.

[0136] Cunningham et al., (1994) Plant Cell 6,1107-1121.

[0137] Cunningham et al., (1996) Plant Cell 8, 1613-1626.

[0138] Davies, B. H. (1976). Carotenoids. In Chemistry and Biochemistry of Plant Pigments, Vol. 2, T. W. Goodwin, ed (New York: Academic Press), pp. 38-165.

[0139] Del Sal et al., (1988). Nucl. Acids Res. 16, 9878.

[0140] Demmig-Adams & Adams, (1992) Ann. Rev. Plant Physiol. Mol. Biol. 43, 599-626.

[0141] Enzell & Back, (1995) Mass spectrometry. In Carotenoids, Vol. IB: Spectroscopy, G. Britton, S. Liaaen-Jensen, H. P. Pfander, eds. (Basel: Birkhauser Verlag), pp. 261-320.

[0142] Frank & Cogdell (1993) Photochemistry and function of carotenoids in photosynthesis. In Carotenoids in Photosynthesis. A. Young and G. Britton, eds. (London: Chapman and Hall). pp. 253-326.

[0143] Goodwin, T. W. (1980). The Biochemistry of the Carotenoids. 2nd ed, Vol. 1 (London:

[0144] Chapman and Hall.

[0145] Horvath et al., (1972) Phytochem. 11, 183-187.

[0146] Hugueney et al., (1995) Plant J. 8, 417-424.

[0147] Hundle et al., (1991) Photochem. Photobiol. 54, 89-93.

[0148] Jensen & Jensen, (1971) Methods Enzymol. 23, 586-602.

[0149] Kargl & Quackenbush, (1960) Archives Biochem. Biophys. 88, 59-63.

[0150] Kargl et al., (1960) Proc. Am. Hort. Soc. 75, 574-578.

[0151] Kieber et al., (1993) Cell 72, 427-441.

[0152] Koyama, Y. (1991) J. Photochem. Photobiol., B, 9, 265-80.

[0153] Krinsky, N. I. (1987) Medical uses of carotenoids. In Carotenoids, N. I. Krinsky, M. M. Mathews-Roth, and R. F. Taylor, eds. (New York: Plenum), pp. 195-206.

[0154] Kyte & Doolittle, (1982) J. Mol. Biol. 157, 105-132.

[0155] LaRossa & Schloss, (1984) J. Biol. Chem. 259, 8753-8757.

[0156] Misawa et al., (1994a) Plant J. 6, 481-489.

[0157] Misawa et al., (1994b) J. Biochem, Tokyo, 116,980-985.

[0158] Norris et al., (1995) Plant Cell 7, 2139-2149.

[0159] Pecker et al., (1996) Submitted to Plant Mol. Biol.

[0160] Perry et al., (1986) J. Bacteriol. 168, 607-612.

[0161] Persson & Argos, (1994) J. Mol. Biol. 237, 182-192.

[0162] Plumley & Schmidt, (1987) Proc. Nat. Acad. Sci. USA 83, 146-150.

[0163] Plumley & Schmidt, (1995) Plant Cell 7, 689-704.

[0164] Rossmann et al., (1974) Nature 250, 194-199.

[0165] Rock & Zeevaart (1991) Proc. Nat. Acad. Sci. USA 88, 7496-7499.

[0166] Rost et al., (1995) Protein Science 4, 521-533.

[0167] Sambrook et al., (1989) Molecular Cloning: A Laboratory Manual, 2nd edition (Cold Spring Harbor, New York: Cold Spring Harbor Laboratory Press).

[0168] Sancar, A. (1994) Biochemistry 33, 2-9.

[0169] Sander & Schneider, (1991) Proteins 9, 56-68.

[0170] Sandmann, G. (1994) Eur. J. Biochem. 223, 7-24.

[0171] Scolnik & Bartley, (1995) Plant Physiol. 108, 1342.

[0172] Siefermann-Harms, D. (1987) Physiol. Plant. 69, 561-568.

[0173] Spurgeon & Porter, (1980). Biosynthesis of carotenoids. In Biochemistry of Isoprenoid Compounds, J. W. Porter, and S. L. Spurgeon, eds. (New York: Wiley), pp. 1-122.

[0174] Tomes, M. L. (1963) Bot. Gaz. 124, 180-185.

[0175] Tomes, M. L. (1967) Genetics 56, 227-232.

[0176] Tuveson et al., (1986) J. Bacteriol. 170, 4675-4680.

[0177] Van Beeumen et al., (1991) J. Biol. Chem. 266, 12921-12931.

[0178] Weedon & Moss, (1995) Structure and Nomenclature. In Carotenoids, Vol. IB: Spectroscopy, G. Britton, S. Liaaen-Jensen, H. P. Pfander, eds. (Basel: Birkhauser Verlag), pp. 27-70.

[0179] Wierenga et al., (1986) J. Mol. Biol. 187, 101-107.

[0180] Zechmeister, L. (1962) Cis-Trans Isomeric Carotenoids, Vitamins A and Arylpolyenes. Springer-Verlag, Vienna.

[0181] Having now fully described the invention, it will be apparent to one of ordinary skill in the art that many changes and modifications can be made thereto without departing from the spirit or scope of the invention as set forth herein.

Claims

1. A method of producing or enhancing production of a carotenoid in a host cell, relative to an untransformed host cell, the method comprising inserting into the host cell a vector comprising a heterologous nucleic acid sequence which encodes for a protein having ∈-cyclase enzyme activity, wherein the heterologous nucleic acid sequence is operably linked to a promoter; and expressing the heterologous nucleic acid sequence to produce the protein.

2. The method of claim 1, wherein the heterologous nucleic acid sequence has a sequence which encodes an amino acid sequence which is at least 85% identical to SEQ ID NOS:2, 23 or 25-27.

3. The method of claim 1, wherein the heterologous nucleic acid sequence has a sequence which encodes the amino acid sequence of SEQ ID NOS:2, 23 or 25-27.

4. The method of claim 1, wherein the host cell is selected from the group consisting of a bacterial cell, an algal cell, a yeast cell and a plant cell.

5. The method of claim 1, wherein the host cell is a photosynthetic cell.

6. A method of modifying the production of carotenoids in a host cell, the method comprising inserting into the host cell a vector comprising a heterologous nucleic acid sequence which produces an RNA and/or encodes for a protein which modifies ∈-cyclase enzyme activity, relative to an untransformed host cell, wherein the heterologous nucleic acid sequence is operably linked to a promoter; and expressing the heterologous nucleic acid sequence in the host cell to modify the production of the carotenoids in the host cell, relative to the untransformed host cell.

7. The method of claim 6, wherein the heterologous nucleic acid sequence has a sequence which encodes an amino acid sequence which is at least 85% identical to SEQ ID NOS:2, 23 or 25-27.

8. The method of claim 6, wherein the heterologous nucleic acid sequence has a sequence which encodes the amino acid sequence of SEQ ID NOS:2, 23 or 25-27.

9. The method of claim 6, wherein the host cell is selected from the group consisting of a bacterial cell, an algal cell, a yeast cell, and a plant cell.

10. The method of claim 6, wherein the host cell is a photosynthetic cell.

Patent History
Publication number: 20020102631
Type: Application
Filed: Jun 2, 1999
Publication Date: Aug 1, 2002
Patent Grant number: 6642021
Inventors: FRANCIS X. CUNNINGHAM (CHEVY CHASE, MD), ZAIREN SUN (HYATTSVILLE, MD)
Application Number: 09323998