Mammalian guanine nucleotide binding protein with an ADP-rybosylation factor domain

Info

Patent number: 5386021
Type: Grant
Filed: Apr 19, 1993
Date of Patent: Jan 31, 1995
Assignee: The United States of America, as represented by the Department of Health & Human Services (Washington, DC)
Inventors: Joel Moss (Bethesda, MD), Koichi Mishima (Rockville, MD), Maria S. Nightingale (Bethesda, MD), Mikako Tsuchiya (Izumo)
Primary Examiner: Christine M. Nucker
Assistant Examiner: Laurie Scheiner
Law Firm: Knobbe, Martens, Olson & Bear
Application Number: 8/49,473

Abstract

The invention provides a method for detecting the presence of ARD 1 protein in a sample. The method includes the steps of providing labeled or immobilized anti-ARD 1 antibody in a reaction zone, introducing sample into the reaction zone such that ARD 1 protein in the sample, if present, will react with said antibody to form an immunological complex, and detecting the formation of said immunological complex. Cells, nucleotide and amino acid sequences and vectors associated with ARD 1 are also described.

Description

Description

BACKGROUND

Monomeric guanine nucleotide-binding proteins of the ras superfamily of 18-30 kDa function in a variety of cellular processes including signaling, growth, immunity, and protein transport (Barbacid, H. Annu. Rev. Biochem. 56: 779-828 (1987); Bourne, H. R. Cell 53:669-671 (1988); Bourne, et al. Nature (London) 349: 117-127 (1991); Gabig, et al. J. Biol. Chem. 262: 1685-1690 (1987); Goud, et al. Nature (London) 345: 553-556 (1990); Hall, A. Science 249: 635-640 (1990); Knaus, et al. Science 254: 1512-1515 (1991)) . ADP-ribosylation factors (ARFs) constitute one family of the ras superfamily.

ARF was initially identified as a factor required for cholera toxdin-catalyzed ADP-ribosylation of G.sub.sa, the stimulatory guanine nucleotide-binding (G) protein of the adenylylcyclase system (Kahn, et al. J. Biol. Chem. 259: 6228-6234 (1984); Serventi, et al. In: Current Topics in Microbiology and Immunology 175, (Aktories, K. ed) pp. 43-67, Springer-Verlag, Berlin Heidelberg (1992). In the presence of GTP or a nonhydrolyzable GTP analogue, ARF serves as an allosteric activator of cholera toxin ADP-ribosyltransferase (Noda, et al. Biochim. Biophys. Acta 1034: 195-199 (1990); Tsai, et al. J. Biol. Chem. 263: 1768-1772 (1988); Tsai, et al. Proc. Natl. Acad. Sci. (USA) 84: 5139-5142 (1987)). It stimulates the toxin-catalyzed ADP-ribosylation of proteins unrelated to G.sub.sa and simple guanidino compounds such as arginine and agmatine as well as auto-ADP-ribosylation of the toxin A1 protein (Noda, et al. Biochim. Biophys. Acta 1034: 195-199 (1990); Tsai, et al. J. Biol. Chem. 263: 1768-1772 (1988); Tsai, et al. Proc. Natl. Acad. Sci. (USA) 84: 5139-5142 (1987)).

ARFs are evolutionarily well conserved and present in all eukaryotes from Giardia to mammals (Kahn, et al. J. Biol. Chem. 263: 8282-8287 (1988); Murtagh, et al. J. Biol. Chem. 267: 9654-9662 (1992); Tsai, et al. J. Biol. Chem. 266: 8213-8219 (1991); Tsuchiya, et al. Biochemistry 28: 9668-9673 (1989); Tsuchiya, et al. J. Biol. Chem. 266: 2772-2777 (1991)). Immunologically, they have been localized to the Golgi apparatus of several types of cells (Stearns et al. Proc. Natl. Acad. Sci. (USA) 87: 1238-1242 (1990)). ARFs are required for association of nonclathrin coat proteins with intracellular transport vesicles (Serafini, et al. Cell 67: 239-253 (1991)) and also appear to be critical during an early step in endocytosis as well as in nuclear vesicle fusion (Boman, et al. Nature (London) 358: 512-514 (1992); Lenhard, et al. J. Biol. Chem. 267: 13047-13052 (1992)). GTP binding and hydrolysis may be involved in binding of ARF to membranes, and the nonhydrolyzable GTP analogue GTP.sub..gamma. S, but not GTP or GDP, promotes the association of cytosolic ARF with Golgi (Regazzi, et al. Biochem. J. 275: 639-644 (1991)) 1991) or phospholipid membranes (Kahn, et al. J. Biol. Chem. 266: 15595-15597 (1991); Walker, et al. J. Biol. Chem. 267: 3230-3235 (1992)).

By molecular cloning from cDNA and genomic libraries, and PCR amplification of RNA transcripts, six mammalian ARFs, two yeast ARFs, and two Giardia ARFs have been identified (Bobak, et al. Proc. Natl. Acad. Sci. (USA) 86: 6101-6105 (1989); Monaco, et al. Proc. Natl. Acad. Sci. (USA) 87: 2206-2210 (1990); Murtagh, et al. J. Biol. Chem. 267: 9654-9662 (1992); Price, et al. Proc. Natl. Acad. Sci. (USA) 85: 5488-5491 (1988); Sewell, et al. Proc. Natl. Acad. Sci. (USA) 85: 4620-4624 (1988); Stearns, et al. Mol. Cell. Biol. 10: 6690-6699 (1990); Tsuchiya, et al. J. Biol. Chem. 266: 2772-2777 (1991)). Mammalian ARFs fall into three classes based on deduced amino acid sequences, gene structure, phylogenetic analysis, and size (Lee, et al. J. Biol. Chem. 267: 9028-9034 (1992); Tsuchiya, et al. J. Biol. Chem. 266: 2772-2777 (1991)). Class I ARFs are ARFs 1-3; class II includes ARFs 4 and 5; and class III has ARFs 6. Some lipids and/or detergents, e.g., SDS, cardiolipin, dimyristoylphosphatidylcholine (DMPC)/cholate, enhance ARF activities (Bobak, et al. Biochemistry 29: 855-861 (1990); Noda, et al. Biochim. Biophys. Acta 1034: 195-199 (1990); Tsai, et al. J. Biol. Chem. 263: 1768-1772 (1988)). Members of the ARF multigene family, when expressed as recombinant proteins in E. coli, display different phospholipid and detergent requirements (Price, et al. J. Biol. Chem. 267: 17766-17772 (1992)). Following synthesis in E. coli all of these ARFs had enhanced cholera toxin ADP-ribosyltransferase activity in the presence of GTP (Kahn, et al. J. Biol. Chem. 266: 2606-2614 (1991); Price, et al. J. Biol. Chem. 267: 17766-17772 (1992); Weiss, et al. J. Biol. Chem. 264: 21066-21072 (1989)).

In general, differences in the various ARF sequences are concentrated in the amino-terminal regions and the carboxyl portions of the proteins. Only three of 17 amino acids includin Met.sup.1 and Gly.sup.2, in the amino termini are identical among ARFs, and four amino acids in this region of ARFs 1-5 are missing in ARF 6 (Tsuchiya, et al. J. Biol. Chem. 266: 2772-2777 (1991)). It was recently reported (Kahn, et al. J. Biol. Chem. 267: 13039-13046 (1992)) that the amino-terminal regions of ARF proteins form an .alpha.-helix, and that this domain is required for membrane targeting, interaction with lipid, and ARF activity.

Schliefer et al., (J. Biol. Chem. 257: 20-23 (1991)) described a protein distinctly larger than ARF that possessed ARF-like activity. At the time of those studies however, it had not been demonstrated that ARF requires GTP for activity, so functional characterization of the protein did not include assessment of that property.

SUMMARY

The present invention relates to a novel 64 kDa protein styled ARD 1. This protein includes an 18 kDa region that exhibits significant homology to known ADP-ribosylation factors (ARFs), but lacks a 15 amino acid domain previously thought necessary for ARF stimulation. The remaining 46 kDa sequence is apparently unlike any known protein. Both rat and human sequences are specifically disclosed.

One aspect of the present invention, therefore, comprises a polynucleotide encoding ARD 1 (which substantially has the sequence of SEQ ID NO:1 or which otherwise substantially encodes the sequence of SEQ ID NO:2) in isolated or purified form. An isolated polynucleotide is one that has been largely separated from other nucleotides, so that the polynucleotide in any composition of which it is a part is at least 1%, preferably 5% or 10%, and more preferably at least about 20%, 30%, or 50% the particular polynucleotide of interest. Mammalian ARD 1 sequences are of particular interest, and vertebrate sequences such as rat and human sequences are particularly preferred.

The present invention also includes a recombinant DNA sequence in the form of a nucleic acid expression vector, comprising the DNA of SEQ ID NO:1 operably linked to a promoter. The promoter may be a heterologous promoter, for example, and is preferably adapted to direct expression of the polynucleotide in a host cell.

The present invention also includes a cell transfected with the expression vector discussed above. Such a cell is preferably capable of expressing ARD 1 protein. Secretion sequences of the type well known in the art may be included in the expression vector that are adapted to promote secretion of the protein in the cell in question. Although the transfected cell may be procaryotic, it is preferably a eukaryotic cell line. CHO cells, for example, are suitable for use in the present invention. Other mammalian cell lines, including human cell lines, may be used, as well other vertebrate cell lines. Alternatively, insect cell lines may be used for expressing the protein of the present invention.

One preferred embodiment of the present invention is an ARD 1 protein composition, comprising the polypeptide of SEQ ID NO:2 in a concentration of at least about 0.01 .mu.g/g. The composition of claim 3, wherein the polypeptide is in substantially purified form.

Also disclosed is an immunoassay kit, comprising ARD 1 protein as a polypeptide reagent, a reaction unit including a reaction zone in which the polypeptide reagent can interact with antibodies (if any) in the sample directed against ARD 1 to form an immunological complex, and means for detecting the reaction or lack of reaction of the polypeptide with the antibodies. Similarly, the present invention includes an immunoassay kit where anti-ARD 1 antibodies are used as a reagent to react with ARD 1 (if any) in a sample, and the reaction of the antibody and ARD 1 are detected.

Another aspect of the present invention relates to isolated or purified antibody against ARD 1 protein. Both polyclonal antibody and monoclonal antibody against ARD 1 protein are contemplated. Moreover, the monoclonal antibody of the present invention is not necessarily isolated or purified.

A further aspect of the present invention is a method for detecting the presence of ARD 1 protein in a sample, comprising the steps of providing labeled or immobilized anti-ARD 1 antibody in a reaction zone, introducing sample into the reaction zone such that ARD 1 protein in the sample, if present, will react with the antibody to form an immunological complex, and detecting the formation of the immunological complex.

BRIEF DESCRIPTION OF THE FIGURES

FIG. 1 is a table showing a comparison of amino acid sequences of ARD 1 and mammalian ARFs. The deduced amino acid sequence of reported human and bovine ARFs and the ARF domain of human ARD 1 were aligned using PC/Gene CLUSTAL. hARF 1, human ARF (Bobak, et al. Proc. Natl. Acad. Sci. (USA) 86: 6101-6105 (1989)); bARF 2, bovine ARF 2 (Price, et al. Proc. Natl. Acad. Sci. (USA) 85: 5488-5491 (1988); hARF 3, human ARF 3 ((Bobak, et al. Proc. Natl. Acad. Sci. (USA) 86: 6101-6105 (1989)); hARF 4 human ARF 4 (Monaco, et al. Proc. Natl. Acad. Sci. (USA) 87: 2206-2210 (1990)); hARF 5 and hARF 6, human ARF 5 and human ARF 6, respectively (Tsuchiya, et al. J. Biol. Chem. 266: 2772-2777 (1991)); hARD 1, the ARF domain of human ARD 1. Gap penalty and window size were both 10. Asterisks and hyphens indicate amino acids identical to hARF 1 and gaps, respectively.

FIG. 2 is a copy of a Northern blot showing the hybridization of poly(A).sup.+ RNA from rat tissues with ARD 1 coding region cDNA. Poly(A).sup.+ RNA from rat skeletal muscle (SM), testis (Ts), spleen (Sp), kidney (Kd), liver (Lv), lung (Ln), heart (Hr) and brain (Br) was hybridized with human ARD 1 coding region cDNA. Positions (kb) of standards (left and ARD 1 mRNA (3.7, 4.1, right) are indicated.

FIG. 3 is a copy of a Northern blot illustrating the hybridization of poly (A).sup.+ RNA from different species with ARD 1 coding region cDNA. Poly(A).sup.+ RNA from human fibroblasts (FB), HL-60 (HL), and IMR-32 (IM) cells and brain from chicken (Ch), rabbit (Rb), mouse (Mo), and rat (Rt) was hybridized with human ARD 1 coding region cDNA FIG. 3A or human ARF 1 coding region cDNA FIG. 3B as described below. Positions of standards (kb) are indicated.

FIG. 4 is copy of a protein gel illustrating the binding of GTP to recombinant ARD 1. p3 (2 .mu.g), p8 (Mg) and sARF II (0.2 .mu.g) were subjected to 4-20% SDS-polyacrylamide gel electrophoresis, and transferred to nitrocellulose membrane. [.alpha.-.sup.32 P] GTP binding was carried out in the presence of 0.3% Tween 20 as described below.

FIG. 5 is a bar graph illustrating the effects of lipids/detergents on ARF activity. Cholera toxin-catalyzed ADP-ribosylation of agmatine was assayed in the presence of GST-p3 (4 .mu.g), GST-p8 (8 .mu.g), sARF II (2 .mu.g) or no ARF as described below. The lipids/detergents used were 0.003% SDS ( , 1 mg/ml cardiolipin (striped rectangles), 3 mM DMPC/0.4% cholate (dotted rectangles) or 0.4% cholate (hollow rectangles).

FIG. 6 is a line graph showing the effect of the increasing concentration of ARD 1 on cholera toxin-catalyzed ADP-ribosylation. Cholera toxin-catalyzed ADP-ribosylation of agmatine was assayed in the presence of indicated concentrations of recombinant ARD 1; GST-p8 ( , p8 (o), GST-p3 , p3 (.DELTA.).

FIG. 7 is a line graph illustrating the effect of GTP concentration on ARF activity in the presence of 0.3% Tween 20. Cholera toxin-catalyzed ADP-ribosylation of agmatine with 20 .mu.g of GST-p8 (o), 14 .mu.g of p8 ( ), 25 .mu.g of GST-p3 (hollow box), 2 .mu.g of sARF II (.DELTA.) or no ARF ( ) was assayed in the presence of indicated concentration of GTP. The activities without GTP (0.1 mM GDP.beta.S) was 0.51 nmol/h (GST-p8), 0.64 nmol/h (p8), 0.61 nmol/h (GST-p3), 0.70 nmol/h (sARF II) and 0.46 nmol/h (no ARF).

DETAILED DESCRIPTION

The present invention includes the discovery of a newly recognized member of the ADP-ribosylation factor (ARF) super family termed ARD 1. We have isolated the polynucleotide encoding ARD 1 from both human and rat libraries, as described below. In addition, ARD 1 has been transfected into a host cell and expressed. Upon expression, a 64-kDa guanine nucleotide-binding protein containing an 18-kDa, functional ARF domain at its carboxy terminus termed ARD 1 (for ARF domain) was found.

Recombinant human ARD 1, and a recombinant truncated species containing only the ARF domain following expression, activated cholera toxin ADP-ribosyltransferase in a GTP-dependent manner, consistent with the conclusion that 15 amino acids adjacent to the amino terminus of ARF proteins are not required for toxin activation.

The ARD 1 protein of the present invention is useful for stimulation of cholera toxin ADP-ribosyltransferase. Thus, as a reagent for synthetic biochemistry, it can stimulate toxin-catalyzed ribosylation of such molecules as agmatine in a GTP-dependent manner. It can also be used as a reagent in diagnosis of ADP-ribosylation deficiency disease. ARD 1 includes a GTP binding region, and may be used for binding GTP. It is also useful for generation of anti-ARD 1 antibodies, and as a reagent in assays for ARD 1 and for antibodies against ARD 1. Moreover, ARD 1 is a member of the ARF superfamily, and may be used in general in the same way as known ARF proteins.

ARD 1, a new member of the ARF family, was identified by molecular cloning from cDNA libraries and by Rapid Amplification of cDNA Ends (RACE) type PCR. Nucleotide and amino acid sequences of human and rat ARD 1 are, respectively, 92% and 96% identical. In all rat tissues tested, 3.7-kb and 4.1-kb ARD 1 mRNAs were detected. A cDNA probe specific for the ARD 1 coding region hybridized with poly(A).sup.+ RNA of similar sizes from rat, mouse and rabbit brains, and human cultured cells. The specific probe additionally hybridized to a somewhat different size mRNA from chicken brain. Sequence and hybridization data are thus consistent with the conclusion that these ARF-related proteins are highly conserved in eukaryotic cells.

The unique 64-kDa ARD 1 protein is much larger than other monomeric guanine nucleotide-binding proteins and consists of two distinct domains. One domain is a carboxy-terminal ARF domain and the remainder, doesn't have homology with other known ARF domains. In fact, the sequence of the non-ARF domain has no homology to any sequences in the GenBank data base and contains no motif known to be conserved in any guanine nucleotide-binding proteins.

These conserved motifs, denoted GX.sub.1 X.sub.2 X.sub.3 X.sub.4 GK, DX.sub.1 X.sub.2 G, and NKXD using the single letter amino acid code, are conserved in other GTP-binding proteins including heterotrimeric G proteins, and ras, rho, and rab proteins. Unlike rab, ras, rho, and G.sub.60, ARF and ARD 1 have an Aspartic Acid (D) at position X.sub.2 in GX.sub.1 X.sub.2 X.sub.3 X.sub.4 GK. Like G.sub.60 , but unlike the ras super family, X.sub.1 and X.sub.2 in the DX.sub.1 X.sub.2 G sequence of ARD 1 and ARFs are Valine (V) and Glycine (G), respectively. The ARF domain contains several motifs common to all ARFs, such as GLDGAGK, DVGG, and NKQD, which are considered to be responsible for GTP-binding (see FIG. 1). The CAT sequence is, however, represented by DAR. The conserved alanine, based on homology to ras, is thought to be involved in guanine nucleotide-binding. In fact, a recombinant protein, representing the ARF domain of ARD 1 had the ability to bind GTP, while the entire ARD 1 protein, under the same conditions did not bind GTP. The reason is unclear, although it is possible that when bound to a nitrocellulose membrane, the conformation of the larger protein limits accessibility of GTP to the binding site.

It has been suggested that the amino terminus of ARF is critical for its function, since mutant ARF 1 protein lacking the amino-terminal 17 amino acids was unable to stimulate cholera toxin-catalyzed ADP-ribosylation of G.sub.S.alpha. in the presence of DMPC/cholate, although it still possessed GTP.sub..gamma. S binding activity that no longer required DMPC/cholate (Kahn, et al. J. Biol. Chem. 267: 13039-13046 (1992)). Similarly, ARD 1 fusion protein or its ARF domain fusion protein did not have demonstrable ARF activity in the presence of SDS, DMPC/cholate or cardiolipin, all of which are effective in supporting activity of ARFs (Noda, et al. Biochim. Biophys. Acta 1034: 195-199 (1990); Serventi, et al. In: Current Topics in Microbiology and Immunology 175, (Aktories, K. ed) pp. 43-67, Springer-Verlag, Berlin Heidelberg (1992); Tsai, et al. Proc. Natl. Acad. Sci. (USA) 84: 5139-5142 (1987)).

In the presence of Tween 20, however, ARD 1 protein stimulated toxin-catalyzed ADP-ribosylation of agmatine in a GTP-dependent manner. Removal of the non-ARF domain from ARD 1 protein significantly diminished its ARF activity and increased the GTP concentration required for half maximal activity, suggesting some role for the non-ARF domain in its ARF-like activity. Clearly, however, these data are consistent with the notion that toxin activation does not require the amino-terminal 15 amino acids of ARF, although they may contribute to interaction with phospholipids.

As a first step, we isolated cDNA from a human HL-60 lambda library that had homology with known ARF sequences.

Materials

[.alpha.-.sup.32 P]dATP (6,000 Ci/mmol), [.alpha.-thio.sup.35 S]dATP (1,350 Ci/mmol), and [.alpha.-.sup.32 P]GTP (800 Ci/mmol) were purchased from DuPont-New England Nuclear; [adenine-.sup.14 C]NAD (273 Ci/mol) from Amersham (Arlington Heights, Ill.); random-primed DNA labeling kit and GDP.beta.S from Boehringer Mannheim (Indianapolis, IN); human fetal brain cDNA and rat brain cDNA Lambda ZAP library from Stratagene (La Jolla, Calif.); competent E. coli DH5.alpha. (maximal efficiency), terminal deoxynucleotidyl transferase and pSPORT 1 from Life Technology (Gaithersburg, Md.); plasmid purification kit from Qiagen (Chatsworth, Calif.); DNA sequencing kit from United States Biochemical Corp. (Cleveland, Ohio); AMV reverse transcriptase, NotI, SalI, RNase inhibitor (RNasin) and T4 DNA ligase from Promega (Madison, Wis.); Centricon 100 from Amicon (Danvers, Mass.); nylon membrane (Nytran) and nitrocellulose membrane from Schleicher and Schuell (Keene, NH); Thermus aquaticus DNA polymerase from Perkin-Elmer/Cetus (Norwalk, Conn.); glutathione, glutathione-agarose, cardiolipin, DMPC, cholate, and NAD from Sigma (St. Louis, Mo.); and AG1-X2 from Bio-Rad (Richmond, Calif.). Oligonucleotides were synthesized using an Applied Biosystems 380B DNA synthesizer.

Experiment 1 Analysis of ARD 1 cDNA

A cyclic, AMP-differentiated HL-60 Lambda ZAP library (5 .times.10.sup.5 plaques) was screened by plaque hybridization with ARF 2B cDNA (Price, et al. Proc. Natl. Acad. Sci. (USA) 85: 5488-5491 (1988)) and a mixture of oligonucleotides denoted XARFC as described by Tsuchiya et al., (J. Biol. Chem. 266: 2772-2777 (1991)). The only clone (#76) that was positive with the ARF 2B cDNA and negative with oligonucleotides specific for ARFs 1-6 (Tsuchiya et al., J. Biol. Chem. 266: 2772-2777 (1991)) was plaque-purified, the insert was excised in vivo by the method of Short et al.(Nucleic Acids Res. 16: 7583-7600 (1988)). The insert was then purified using Qiagen, and sequenced by the Sanger et al. method (Proc. Natl. Acad. Sci. (USA) 74: 5463-5467 (1977)). The 1660-bp insert (nucleotides 706-2365 in Table I) included an open reading frame (1207-1722) encoding an ARF domain of 172 amino acids.

3 TABLE I
       Nucleotide and Protein Alignment of Human and Rat ARD I
        -22 CTGTGGCGCTTCCCCTGCGAGG  HUMAN 1 MATLVVNKLG AGVDSGRQGS RGTAVVKVLE
      (SEQ ID NO:2) HUMAN 1 ATGGCTACCCTGGTTGTAAACAAGCTCGGA GCGGGAGTAGACAGTGGCCG
      GCAGGGCAGC CGGGGGACAGCTGTAGTGAAGGTGCTAGAG (SEQ ID NO:1) RAT  ------------
      ------------------ -------- --------------------- --------G-----G--------T
      --G--- HUMAN 31 CGVCEDVFSL QGDKVPRLLL CGHTVCHDCL HUMAN 91 TGTGGAGTTTGTGAAG
      ATGTCTTTTCTTTG CAAGGAGACAAAGTTCCCCGTCTTTTGCTT TGTGGCCATACCGTCTGTCATGACTGTC
      TC RAT  -----T-----------------C---C-- --------T--------T--C-----C---
      --------C--A------------------ HUMAN 61 TRLPLHGRAI RCPFDRQVTD LGDSGVWGLK
      HUMAN 181 ACTCGCCTACCTCTTCATGGAAGAGCAATC CGTTGCCCATTTGATCGACAAGTAACAGAC  T
     A
      CGTGATTCAGGTGTCTGGGGATTGAAA RAT  T-----T--G----G--------------A ---------
      --------------G------ --- --A-----------------T------ HUMAN 91 KNFALLELLE
       RLQNGPIGQY GAAEESIGIS HUMAN 271 AAAAATTTTGCTTTATTGGAGCTTTTGGAA CGACTGCAG
      AATGGGCCTATTGGTCAGTAT GGAGCTGCAGAAGAATCCATTGGGATATCT RAT  ------------C--
      ------- ---A--- --TT----A-----A-A------------- ---------------G--C-------
      ---- HUMAN 121 GESIIRCDED EAHLASVYCT VCATHLCSEC HUMAN 361 GGAGAGAGCATCATTC
     G
      TTGTGATGAAGAT GAAGCTCACCTTGCCTCTGTA TATTGCACT GTGTGTGCAACTCATTTGTGCTCTGAG
      TGT RAT  --------T--------C------------ ---------G----A-----G--C------
      --------G-----C--------A------ HUMAN 151 SQVTHSTXTL AKHRRVPLAD KPHEKTMCSQ
       HUMAN 451 TCTCAAGTTACTCATTCTACAAAGACATTA GCAAAGCACAGGCGAGTTCCTCTAGCTGAT
      AAACCTCATGAGAAAACTATGTGCTCTCAG RAT  --------------------------TC-G
      --C-----T------ --G------------ --------------G--C--G HUMAN 181 HQVHAIEFVC
       LEEGCQTSPL MCCVCKEYGK HUMAN 541 CACCAGGTGCATGCCATTGAGTTTGTTTGC TTGGAAGAAG
      GTTGTCAAACTAGCCCACTC ATGTGCTGTGTCTGCAAAGAATATGGAAAA RAT  --------A----- -
      --------------- ----------C------------T--T--- --------------------------
      ---- HUMAN 211 HQGHKHSVLE PEANQIRASI LDMARCIRTF HUMAN 631 CACCAGGGTCACAAG
      CAT TCAGTATTGGAA CCAGAAGCTAATCAGATCCGAGCATCAATT TTAGATATGGCTCACTGCATACGGAC
     C
       RAT  -----------------C------C----G -----------C-----------G------      C
      ------------- ---------A------ HUMAN 241 TEEISDYSRK LVGIVQHIBG GEQIVEDGIG
       HUMAN 721 ACAGAGGAAATCTCAGATTATTCCAGAAAA TTAGTTGGAATTGTGCAGCACATTGAAGGA
      GGAGAACAAATCGTGGAAGATGGAATTGGA RAT  --T-----G---------------------
      --- -----G-----T--------------- -----------A------------------ HUMAN 231 M
     A
       AENARSCIRA YFYDLHETLC HUMAN 811 ATGGCTCACACAGAACATGTACCAGGGACT GCAGAGAATG
      CCCGGTCATGTATTCGAGCT TATTTTTAT GATCTACATGAAACTCTGTCT RAT  -----------G---
      --C--C-----T--- -----------A--A------G-CA----- -------C----T-G-----------T
      --- HUMAN 301 RQEEMALSVV DA HVREKLIW LRQQQEDMTI HUMAN 901 CGTCAAGAAGAAATG
      GCTCTAAGTGTTGTT GATGCTCATGTTCGTGAAAAATTGATTTGG CTCAGGCAGCAACAAGAAGATATGACT
     A
       RAT  --------------------C--------- -----C--------A------C---- ----
      --T--------------G------------ HUMAN 331 LLSEVSAACL HCEKTLQQDD CRVVLAKQEI
       HUMAN 991 TTGTTGTCAGAGGTTTCTGCAGCCTGCCTC CACTGTGAAAAGACTTTGCAGCAGGATGAT
      TGTAGAGTTGTCTTGGCAAAACAGGAAATT RAT  C-C-----CC------- AA-------T---
      --T--------------------------- --C--------------------A-----C HUHAN 361  R
     L
      T QQQFTEVADH IQLQASIPVT HUMAN 1081 ACAAGGTTACTGGAAACATTGCAGAAACAG        A
     G
      CAGCAGCTTTACAGAAGTTGCAGATCAC ATTCAGTTGGATGCCAGCATCCCTGTCACT RAT
      -----A---T-A------C----------- ------------------------------ -----------
      ------T --T--A------ HUMAN 391 FTKDNRVHIG PKMEIRVVTL GLDGAGKTTI HUMAN
      1171 TTTACAAAGGATAATCGAGTTCACATTGGA CCAAAAATGGAAATTCGGGTCGTTACGTTA
      GGATTGGATGGTGCTGGAAAAACTACTATC RAT  -----------C--CA-----T-T------
      --C------- ----A--A--A--C--A--- -----A-----------------------T HUMAN 421 L
     F
       QPIPTIGFNV ETVEYKNLKF HUMAN 1261 TTGTTTAAGTTAAAACAGGATGAATTCATG         T
     C
      CAGCCCATGTTTTAACGTG GAAACTGTAGAATATAAAAATC TAAAATTC RAT  -----C----------
      -A--------T--- -----T--------------------T--- --------G-----C------------
      --- HUMAN 451 TIWDVGGKHK LRPLW KHYYL NTQAVVFVVD HUMAN 1351 ACTATTTGGGATGTA
     G
      GTGGAAAACACAAA TTAAGACCATTGTGGAAACATTATTACCTC AATACTCAAGCTGTTGTGTTTGTTGTAG
      AT RAT  --C-----------G----A---------- --------------------------- ---
      -----A-----------A--------T--C HUMAN 481 SSHRDRISEA HSELAKLLTE KELRDALLLI
       HUMAN 1441 AGCAGTCATAGAGACAGAATTAGTGAAGCA CACAGCGAACTTGCAAAGTTGTTAACGGAA
       AAAGAACTCCGAGATGCTCTGCTCCTGATT RAT  --------C------ ---------------
      --------------G-----------A--- --------T--------CT-A---T---- HUMAN 511   N
     K
      FA LSVEEITELL SLHKLCCGRS HUMAN 1531 TTTGCTAACAAACAGGATGTTGCTGGAGCA       A
     G
      C TGTCTAGAAGAAATCACTGAACTACTC AGTCTCCATAAATTATGCTGTGGCCGTAGC RAT
      --------------------C-----G--- --T--G--T-----------C-----T--- -----------
      --------- C--AA-G--- HUMAN 541 wYIQGCDARS GMGLYEGLDW LSRQLVAAGV HUMAN
      1621 TGGTATATTCAGGGCTGTGATGCTCGAAGT GGTATGGGACTGTATGAAGGGTTGGACTGG       C
     G
      CTCTCAGCAACTTGTAGCTGCTGGAGTA RAT  ------------------------------         G
      ----------C--C --------------- --G--C-----------G-----C-----G HUMAN 571  D
     V
      L HUMAN 1711 TTGGATGTTGCTTGATTTTAAAGGCAGCAG TTGTTTGAAGTTTTGTGGTTAAAAGTAACT
       TTGCACATAAAAAAAAAAAAAAAAAATGCA RAT  ---------------  1801 TCTCAAAAGATGGT
      AATTTAGGATGCATAT ATATATATATATATATAAAGGAATCTTGGA TTGGGAATTCAGTACTTTGCTTTAA
      AAAAA  1891 TTTTGTGGCAGAATTAAATTTCTAATTGAC GCAGATTAGATTGAATTAAATAGAAACTTA
       TTGAATATACATTCTTTTAAAAAGTATATT  1981 TGTTATTTAAGTTTTTCAGATAATATGTGA
      CCAATATACTGGGAAAGAGGTAGTCACAGA GAAAGGGTAAGTGAAGGTTTATTCTTTCAG  2071
      TGAAAAAAGAATAGCCAATTGAGTGCCTAA TGAGACCTCTGTGTGAAGCAAGTGAAGTAT AGCTGCTTCTT
      TTAACCTGCCTTTTCACTG  2161 AATGTTGGCAGCATTTAGTAGTAGAAATGA CAGTTGCTTAATGAAA
      TAGAATCCAAACTA CATATTTGGATAATAGGATTACTTTATGTT  2251 TATGTTCAGAGTTAACAGAAC
      ACCTTTAAT GCTAAGAACTATAAGGTACAGAAAATTAAT ACTTTATATAGTGTTTTATTAACTTTCTCC
      2341 TACAGCATTTTGTATAAAACACAATGAGGG AGTGAAATGTTACCCAATTAGGCTTGTCAG
      GTTAGTAATAAACTGAACAGTAATAAAACT  2431 GTGGAAGTAATTGGATCTGAATTTATGAAA
      GACCCATTTCCAGGACTGAACCTAGGTCAG AGCTCTAAATTGGTCCTTCTATTTTTCAAC  2521
      AAATTTAAAGTAATATTTCTTTCTAATATA ATATTGCATCCTTTGTGGGAATGACTATAG GTAAAATGTAG
      TAAGTAACGCAGAACCAGG  2611 GTTGGCTTTATTTAAAAGCTAGTGACCTAA ATAGAAAGCGAACTTC
      AAGAGAAGTTGTAA GTACAGTGGCAAATGCTTATTACTTACTTC  2701 AAACTGTTTCCCAAAATAAGT
      GCATTTATT TTGACAATAAAACTTAAGGCTGTTCATGAG AAGGCCTTGAAAAGTTACTCTAGAGGAAAA
      2791 ATGTCTAAAGAAAAAAAAAATTCAAAAAGT TTACATTAATTATTCAGTGTTGTGAGTAAA
      TAAAAATGTGTGCTCTTTACTGTTTTTCAT  2881 TTTTAAAGAATATTATTATGGAAGCACGAT
      TTATTTAAATAGGTACATTGAGACTTTTTT TTTTAATGTTCTGATACATTAGGATGAAGT  2971
      TAAATCTTAAATCTTATTAGTTGAATTGTT GTAAGGACAGTGATGTCTGGTAACAAGATG TGACTTTTTGG
      TAGCACTGTTGTGGTTCAT  3061 TCTTTTCAAATCTATTTTTGTTTAAAAACA ATACAAGTTTTAGAAA
      ACAAAGCATTAAAA AAAAAGCCTATCAGTATTATGGGCAATATG  3151 TAAATAAATAAATGTAATATT
      TCATCCTTT ATTTTTCAGGTAAAAGGTCATGCTGTTACA GGTGTAGTTTGTGTGCATAAATAATACTTC
      3241 CGAATTAAATTATTTAATATTTGACTGATT TCAATAACTGTGAAAATAAAAAGGTGTTGT
      ATTGCTTGTGAG

Table I illustrates the Nucleotide and deduced amino acid sequences of human and rat ARD 1 cDNA. The human ARD 1 deduced amino acid sequence is shown in single-letter code above the nucleotide sequence on the respective codon of the human ARD 1 coding region. Differences in rat ARD nucleotide sequences are shown below the human sequence. The rat ARD 1 sequence is available from positions corresponding to 61 to 1726 of human ARD 1. The human nucleotide sequence shown is SEQ ID NO:1 the rat nucleotide sequence shown is SEQ ID NO:3 and the translated human amino acid sequence shown is SEQ ID NO:2.

An oligonucleotide specific for this sequence (J1R) is shown in Table II and was used to screen a human fetal brain cDNA Lambda ZAP library (5.times.10.sup.5 plaques). Among eight positive clones, #7-3 contained nucleotides 7-1826 and clone #7-8 contained nucleotides 726-3225. In this sequence, about 1200 nucleotides preceded the ARF region without a stop codon in the same reading frame.

                                    TABLE II                                
     __________________________________________________________________________
     Oligonucleotides used in the analysis of ARD 1                            
           Sequence                                                            
     Name                                                                      
            ##STR1##                                                           
     __________________________________________________________________________
     JlR   AATGGGCTGCATGAATTCATCCTGTTTTAA             (SEQ ID NO:3)            
           complementary to bases 1270 to 1299 of human ARD 1.                 
     JK721RC                                                                   
           CCTCCTTCAATGTGCTGCACAATTCC                 (SEQ ID NO:4)            
           complementary to bases 757 to 782 of human ARD 1.                   
     JK723RII                                                                  
           TGAGTAACTTGAGAACACTC                       (SEQ ID NO:5)            
           complementary to bases 445 to 464 of human ARD 1.                   
     JK728R                                                                    
           TGCCCTGCCGGCCACTGTCTACTCCCGCTCCGAGCTTGTTTA (SEQ ID NO:6)            
           complementary to bases 17 to 58 of human ARD 1.                     
     JKNOT GACTAGTTCTAGATCGCGAGCGGCCGCCCTTCACCTAGGTCTGTTACTTGTCG               
                                                      (SEQ ID NO:7)            
           complementary to bases 226 to 248 of human ARD 1 with 30 bases      
           added at 5' end to introduce NotI site.                             
     JK8EX GGCCTGGTTCCGCGGATGGCTACCCTGGTTGTA          (SEQ ID NO:8)            
           bases 1 to 18 of human ARD 1 with 15 bases added at 5' end to       
           introduce ligation-independent cloning site.                        
     JK3EX GGCCTGGTTCCGCGGATGGAAATTCGGGTC             (SEQ ID NO:9)            
           bases 1207 to 1221 of human ARD 1 with 15 bases added at 5' end to  
           introduce ligation-independent cloning site.                        
     JKEXR CTGCGCCTCG CTCCTCAAGCAACATCCAA             (SEQ ID NO:10)           
           complementary to bases 1711 to 1725 of human ARD I with 14 bases    
           added at 5' end to introduce ligation-independent cloning site.     
     JK5EXR                                                                    
           CTGCGCCTCGCTCCTTTTGGTCCAATGTG              (SEQ ID NO:ll)           
           complementary to bases 1192 to 1206 of human ARD 1 with 14 bases    
           added at 5' end to introduce ligation-independent cloning           
     __________________________________________________________________________
           site.

To further characterize the 5' -terminus of this cDNA, 5' -RACE was carried with the poly(A).sup.+ RNA from IMR-32 human neuroblastoma cells.

Experiment 2 5' -RACE with IMR-32 Poly(A).sup.+ RNA

Poly(A).sup.+ RNA (5 .mu.g) from IMR-32 human neuroblastoma cells was reverse-transcribed with the primer JK721RC, as shown in Table II (100 ng) by AMV reverse transcriptase (20 units) in 50 mM Tris-HCl , pH 8.3, 7 mM MgCl.sub.2, 40 mM KCl, 1 mM dithiothreitol, dNTPs (1 mM each), bovine serum albumin (0.1 mg/ml), RNasin, 20 units (total volume 20 .mu.l ) at 42.degree. C. for 3 hours. TE (10 mM Tris-HCl, pH 7.5, 1 mM EDTA), 1 ml, was added and the mixture was concentrated (Centricon 100). After addition of 2 ml of 0.2.times.TE, it was concentrated again to 50 .mu.l.

The reverse transcribed products (46 .mu.l ) were tailed with terminal deoxynucleotidyl transferase (30 units) at 37.degree. C. for 5 minutes in 0.1M potassium cacodylate, pH 7.2, 2 mM CaCl.sub.2, 0.2 mM dithiothreitol, 0.2 mM dATP followed by heat inactivation of the enzyme (65.degree. C., 5 min). The preparation (20 .mu.l ) was subjected to PCR (40 cycles: 95.degree. C., 30 seconds; 50.degree. C., 30 seconds; 72.degree. C., 60 seconds) with 200 ng each of primers SALAD and SALADTT (Table III) and 400 ng of primer JK723RII (Table II) in 10 mM Tris-HCl, pH 8.3, 50 mM KCl, 1.5 mM MgCl.sub.2, 0.01% gelatin, dNTP, 200 .mu.M each, 0.1% Tween, Taq DNA polymerase, 2.5 units (total volume, 100 .mu.l).

TE (1 ml) was added to the PCR products and the mixture was concentrated (Centricon 100). TE (2 ml) was added followed by concentration to 30 .mu.l and transfer of a sample (1 .mu.l ) to a second PCR amplification (40 cycles: 95.degree. C., 30 seconds; 50.degree. C., 30 seconds; 72.degree. C., 30 seconds) with 400 ng each of primers SALAD (Table III) and JKNOT (Table II). Preparations were digested with NotI and SalI, ligated into plasmid pSPORT1, and used to transform competent DH5.alpha. cells, which were then grown on LB/agar plates containing ampicillin, 100 .mu.g/ml.

                                    TABLE III                               
     __________________________________________________________________________
     Oligonucleotides used in the analysis of ARD 1                            
     Name     Sequence, 5'-3'                                                  
     __________________________________________________________________________
     RA4CF    ATGGGCCTCACCAT                                                   
              (SEQ ID NO:12)                                                   
              Bases 1 to 14 of rat ARF 4.                                      
     HARF4-codAR                                                               
              TCTCACTGATGGCCATAGCA                                             
              (SEQ ID NO:13)                                                   
              Complementary to bases 396 to 415 of rat and human ARF 4         
              cDNA.                                                            
     HRA4CR   TCATTTGACAGCCA                                                   
              (SEQ ID NO:14)                                                   
              Complementary to bases 514 to 527 of rat and human ARF 4         
              cDNA.                                                            
     REKNOT   GACTAGTTCTACATCGCGAGCGGCCGCCCTGGATATCTAACCAAGGACAT               
              (SEQ ID NO.:34)                                                  
              Bases 552 to 571 of rat ARF 4 with 40 bases added at 5'          
               ##STR2##                                                        
     RDKlCF   TTGATAGAATTGGTCTAGGCTTGTTACAAC                                   
              (SEQ ID NO:15)                                                   
              Bases 574 to 603 of rat ARF 4 CDNA (just after stop codon).      
     RDKlR    GTTGTAACAAGCCTAGACCAATTCTATCAA                                   
              (SEQ ID NO:16)                                                   
              Complementary to RDK1CF                                          
     RDK3R    GGCTAAACAGCAACATTGTTCTTGGTAAACAATAATTGGCAACAAAAC                 
              (SEQ ID NO:17)                                                   
              Complementary to bases 677 to 724 of rat ARF 4 CDNA (after       
              first polyadenylation signal).                                   
     RDK4R    TCAGTGAGTTCCAAGGGGGTAACTTTAAAACATTATTGGTGTGGGCTC                 
              (SEQ ID NO:18)                                                   
              Complementary to bases 855 to 902 of rat ARF 4 CDNA (after       
              second polyadenylation signal).                                  
     RAKRIIa  TGGAATCGGAACTTCCAGATCCTCATCGTCCGAGTCCGATTCACTCTG                 
              (SEQ ID NO:19)                                                   
              Complementary to bases 127 to 174 of rat RII.alpha.CDNA          
              (regulatory subunit of cAMP-dependent protein kinase) (30).      
     SALADTT  CTCGTGGACGATGTTGCTGTCGACCCACGCGTCCG(T)20                         
              (SEQ ID NO:20)                                                   
               ##STR3##                                                        
     SALAD    CTCGTGGACGATGTTGCTGTCGACCCACGCGTCCG                              
              (SEQ ID NO:21)                                                   
     R2SCR    TTTGTACAAGATCGTCGTTTTGCCAGCTGCATCTAAGCC                          
              (SEQ ID NO:22)                                                   
              Used for screening.                                              
     RDK5NOT  GACTAGTTCTAGATCGCGAGCGGCCGCCACCACCGCTATGGGC                      
              (SEQ ID NO:23)                                                   
               ##STR4##                                                        
              site.                                                            
     RDKSAL1  CTCGTGGACGATGTGCTGGTCGACAGCTGCCCAAACCGTCTCAG                     
              (SEQ ID NO:24)                                                   
              Complementary to bases 638 to 655 with 26 bases added at S'      
               ##STR5##                                                        
     RDKSAL2  CTCGTGGACGATGTGCTGGTCGACGTTAACACTCAAAACAGATTT                    
              (SEQ ID NO:25)                                                   
              Complementary to bases 833 to 850 with 27 bases added at 5'      
               ##STR6##                                                        
     RDKSAL3  CTCGTGGACGATGTGCTGGTCGACTCGAAAAATCATTTTATTAGGAATAATTCCA          
              (SEQ ID NO:26)                                                   
              Complementary to bases 1362 to 1389 with 27 baseo added at       
               ##STR7##                                                        
     __________________________________________________________________________

Following transfection of the reverse transcribed, PCR amplified sequences into competent DH5.alpha. cells, we isolated clones corresponding to ARD 1.

Experiment 3 Isolation of ARD 1 from Human and Rat

Colonies were screened with probe JK728R (Table II) that had been labeled with [.alpha.-.sup.32 P]dATP and terminal deoxynucleotidyl transferase. Positive clones (33) were selected and grown in LB containing ampicillin, 100 .mu.g/ml. Plasmid DNA was purified and digested with Sal I and Not I. Longer inserts (.about.350 bp) from seven clones were sequenced. A consensus sequence of the 5' -terminal sequences of four are shown in Table I; three clones had shorter inserts.

Four clones with the longest inserts had an initiation codon ATG accompanied by A at -3 and G at position 4. This sequence is believed favorable for translation initiation (Kozak, et al. J. Cell. Biol. 108: 229-241 (1989)). The putative open reading frame of this gene, termed ARD 1, consisted of 1722 nucleotides encoding a protein of 574 amino acids with an ARF related domain at the carboxyl terminus. We anticipated that human and rat ARD 1 were likely similar, so we isolated clones corresponding to rat ARD 1.

A rat brain Lambda ZAP II library (6.times.10.sup.5 plaques) was screened with an oligonucleotide R2SCR (TABLE III) that yielded clone 2.sup.a containing an insert that corresponded to nucleotides 61-1973 of human ARD 1. A comparison of the rat and human sequences is provided in Table I.

Nucleotide and deduced amine acid sequences of ARD 1 coding regions from rat and human are 92% and 98% identical, respectively, without any gaps. The nucleotide sequence of the ARF domain of human ARD 1 is 60-66% identical to those of the mammalian ARFs; the deduced amino acid sequences are 55-60% identical or 69-72% similar including conservative placements (Table IV).

                TABLE IV                                                    
     ______________________________________                                    
     Comparison of nucleotide and deduced amino acid sequences                 
     of the ARF domain of human ARD 1 (172 amino acids) to those               
     of mammalian ARFs                                                         
     Mammalian    Percentage identity to hARD 1                                
     ARF          Nucleotides                                                  
                             Amino acids                                       
     ______________________________________                                    
     hARF 1       62          59 (71)*                                         
     hARF 2       66         59 (72)                                           
     hARF 3       65         60 (72)                                           
     hARF 4       66         59 (72)                                           
     hARF 5       62         56 (72)                                           
     hARF 6       60         55 (69)                                           
     ______________________________________                                    
      *identical plus conservative replacements                                
      Programs used were PC/Gene FASTSCAN for nucleotide sequence and PC/Gene  
      PALIGN (open gap cost, 1; unit gap cost, 1) for amino acid sequence

Some of the regions common to ARFs thus far identified that are believed to be involved in guanine nucleotide binding and TP hydrolysis are also conserved in ARD 1 (FIG. 1), i.e., GLDGAGK (411-417), DVGG (454-457), and NKQD (513-516); CAT, however, is missing and replaced with DAR (547-549 ) .

To analyze the expression of ARD 1, we probed Northern blots of RNA from various tissues.

Experiment 4 Northern Analysis of ARD 1

Poly(A).sup.+ RNA was isolated from total RNA (Chomyczynski, et al. Anal. Biochem. 162: 156-159 (1987)) using oligo(dT) chromatography (Chirgwin, et al. Biochemistry 18: 5294-5299 (1979)). Poly(A).sup.+ RNA (5 .mu.g) was then fractionated by electrophoresis in 1.2% agarose/formaldehyde gels and transferred to Nytran.

Prehybridization and hybridization were carried out at 42.degree. C. in hybridizaton buffer of 5.times. SSC (1.times.=0.15M NaCl, 0.015M sodium citrate, pH 7.0), 5.times. Denhardt's solution (1.times.=0.02% Ficoll, 0.02% polyvinylpyrrolidone, 0.02% bovine serum albumin), 10 mM Tris-HCl, pH 7.4, 0.1% SDS, 10% dextran sulfate, denatured salmon sperm DNA, 0.1 mg/ml, 40% formamide. Filters were washed at 55.degree. C. once with 2.times. SSC, 0.5% SDS and twice with 0.5.times. SSC, 0.5% SDS. Filters were exposed to Kodak XAR film at -80.degree. C. with an intensifying screen. Human ARF 1 coding region cDNA was prepared as described by Bobak et al. (Proc. Natl. Acad. Sci. (USA) 86: 6101-6105 (1989)). Human ARD 1 coding region cDNA was generated by PCR as described above.

An ARD 1 coding region cDNA clone hybridized with 4.2-kb and 3.7-kb mRNAs from all rat tissues examined (FIG. 2). This probe also hybridized with bands of similar size in poly(A).sup.+ RNA from mouse and rabbit brain and human fibroblasts. Poly(A).sup.+ RNA from IMR-32 cells, however, hybridized very weakly while RNA from undifferentiated HL-60 cells did not hybridize detectably. However, all samples of poly(A).sup.+ RNA hybridized essentially equally well with a human ARF 1 coding region cDNA (FIG. 3).

To assess the ability of ARD 1 to enhance cholera toxin ADP-ribosyltransferase, recombinant GST-ARD 1 fusion proteins were prepared.

Experiment 5 Expression of the Fusion Proteins

Three ARD 1 fusion proteins with glutathione S-transferase (GST) were synthesized employing a ligation-independent cloning method (Haun et al. Gene 112: 39-43 (1992)), using clone #7-3 and oligonucleotide primers as indicated in Table II. For GST-p8 (containing the entire sequence from Met.sup.1 to Ala .sup.574), oligonucleotides JK8EX and JKEXR (Table II) were used. For GST-p3 (the ARF domain from Met.sup.403 to Ala.sup.574), oligonucleotides JK3EX and JKEXR (Table II) were used. For GST-p5 (the non-ARF sequence from Met.sup.1 to Lys.sup.402) oligonucleotides JK8EX and JK5EXR (Table II) were used. The fusion proteins were purified with glutathione-agarose by well known methods as described in Smith et al. (Gene 67: 31-40 (1988)). To test the ability of each fusion protien to bind GTP we performed the following experiments.

Experiment 6 GTP-binding Assay with Recombinant ARD1

Fragment p3 (2.mu.g), p8 (4 .mu.g), and sARF II (0.2 .mu.g) were subjected to electrophoresis in 4-20% polyacrylamide gels with SDS and transferred to nitrocellulose. The membrane was incubated in 50 mM Tris-HCl, pH 7.5, 150 mM NAc1, 2 mM dithiothreitol, 2.5 mM EDTA, soybean trypsin inhibitor, 10 .mu.g/ml, 0.5 mM PMSF, 0.3% bovine serum albumin, 0.#% Tween 20 (binding buffer) at room temperature for 2 h, transferred to fresh binding buffer containing 8 mM MgCl.sub.2 and [.alpha.-.sup.32 P]GTP (800 Ci/mmol), 1 .mu.Ci/ml, for 2 h, washed three times with binding buffer for 5 min, briefly dried, and exposed to Kodak XAR film at -80.degree. C. overnight with intensifying screen.

The affinity of GST-p3 for GTP was apparently lower than that of GST-p8, p8 or sARF II. GTP concentrations required for half-maximal activation were less than 10 .mu.M with GST-p8, p8, and sARF II and .about.50 .mu.M with GST-p3.

Through the use of specific primers, as discussed above, GST-p8 contained the entire ARD 1 protein, GST-p3 contained the carboxy-terminal ARF domain, and GST-p5 contained the non-ARF domain. The recombinant ARF domain of ARD 1 bound GTP after SDS-PAGE and transfer to nitrocellulose membrane, whereas p8, which contained the entire sequence of ARD 1, exhibited no detectable binding (FIG. 4). It is possible that the longer p8 protein was unable to bind GTP due to a conformational change when attached to the nitrocellulose membrane. The shorter GST-p3 protein, however, was able to bind GTP even after attachment to the nitrocellulose membrane.

As shown in FIG. 5, ARD 1 fusion proteins did not stimulate ADP-ribosylation by cholera toxin in the presence of SDS, DMPC/cholate, or cardiolipin, which to differing degrees enhanced the activity of sARF II. In the presence of 0.3% Tween 20, however, recombinant ARD 1, p8, and GST-p8 increased the toxin ADP-ribosyltransferase in a dose-dependent manner (FIG. 6).

Experiment 7 NAD/Agmatine ADP-ribosylation Assay

Stimulation of cholera toxin-catalyzed ADP-ribosylation of agmatine was assayed to evaluate ARF activity of the recombinant proteins. Reaction mixtures contained 50 mM potassium phosphate (pH 7.5), 4 mM MgCl.sub.2, 30 mM dithiothreitol, ovalbumin (0.3 mg/ml), 0.2 mM [adenine-.sup.14 C]NAD (0.05 .mu.Ci), 20 mM agmatine, 0.3% Tween 20, cholera toxin (0.5 .mu.g), 1 mM GTP or 0.1 mM GDP.beta.S, and the indicated amount of sARF II or recombinant protein (total volume 200 .mu.l ). After incubation at 30.degree. C. for 1 hour, duplicate samples (80 .mu.l ) were transferred to columns of AG1-X2, equilibrated with water and eluted five times with five 1 ml washes of water. The eluate, containing [.sup.14 C]ADP-ribosylagmatine, was collected for radioassay.

In the presence of 0.3% Tween 20, recombinant ARD 1, p8, and GST-p8, increased the toxin ADP-ribosyltransferase in a dose-dependent manner (FIG. 6). The activity of GST-p3 was clearly less, whereas p3 had little if any activity. Maximal activation seemed to be similar; half-maximal activation was achieved with 0.5 .mu.M p8, 0.75 .mu.M GST-p8 and 2 .mu.M GST-3 (FIG. 6). sARF II also stimulated the toxin transferase activity in Tween 20 (FIG. 7), although to a lesser extent than it did in the presence of SDS, DMPC/cholate, or cardiolipin (FIG. 5). GST-p5, p5, or GST did not enhance cholera toxin activity (data not shown). Activity of the recombinant ARD 1 proteins was dependent on GTP (FIG. 7), as is the case with the ARFs.

Monoclonal antibodies against ARD 1 could be useful for the reasons cited above and can be produced by the following method.

Example 8 Production of Monoclonal Antibodies

Monoclonal antibodies to ARD 1 are generated using conventional techniques, such as those disclosed in Basic Methods in Molecular Biology 351 (Davis, et al., 1986). Briefly, ten mice are each innoculated intraperitoneally with 500 .mu.g of human ARD 1 antigen in a 1:1 emulsion with complete Freund's adjuvant. Thereafter, the mice are boosted with three additional injections at 15 day intervals of the same amount of antigen in a 1:1 emulsion with incomplete Freund's adjuvant. Three days following the third booster injection, the mice are sacrificed and the spleens are harvested. The spleen cells are separated and fused with myeloma cells according to the method of Kohler and Milstein by combining the spleen cells with mouse myeloma cells in a ratio of 6:1, centrifuging at 4000.times.g for 10 minutes, loosening the pellet with 1 ml 50% PEG, adding 9 ml DMEM, centrifuging again at 600.times.g for 10 minutes, and growing the cells in HAT in a 96 well tissue culture plate. Positive clones are expanded and the supernatant is screened against ARD 1.

Once antibodies against ARD 1 have been produced they can be used in assays such as an ELISA, as discussed below.

Example 9 ELISA for Antibodies Against ARD 1

A sample from either an organism or a hybridoma is screened for the presence of ARD 1 using a conventional ELISA. Briefly, 200 .mu.l of purified ARD 1 is added to a well of a microtiter plate and is permitted to bind to the plastic plate for 48 hours at 4.degree. C. The well is washed with 0.15M NaCl containing 0.05% Tween 20. Sample is mixed with PBS containing 0.05% Tween 20 and BSA at 0.1mg/ml, and aliquots are added to the treated plastic well. Plates are incubated for 1 hr at room temperature, washed, and 200.mu.l of 1:400 goat-antimouse IgG conjugated to alkaline phosphatase is added. After a 1 hour incubation, the well is again washed, after which 200 .mu.l of p-nitrophenyl phosphate is added. Color development takes about one hour; development of color indicates a positive assay.

From the above data it is tempting to speculate that ARD 1 is related to a new family of larger ARF proteins and representing larger GTP-binding proteins that contain a domain with ARF structure and function as well as another domain whose function is at present unknown.

It can be appreciated that the previous experiments are only illustrative of specific embodiements of the present invention. This invention should not be limited to those embodiments disclosed by the aforementioned experiments, but only by the following claims.

  __________________________________________________________________________
     SEQUENCE LISTING                                                          
     (1) GENERAL INFORMATION:                                                  
     (iii) NUMBER OF SEQUENCES: 34                                             
     (2) INFORMATION FOR SEQ ID NO:1:                                          
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 3312 base pairs                                               
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
      (ix) FEATURE:                                                            
     (A) NAME/KEY: CDS                                                         
     (B) LOCATION: 1..1725                                                     
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                                   
     ATGGCTACCCTGGTTGTAAACAAGCTCGGAGCGGGAGTAGACAGTGGC48                        
     MetAlaThrLeuValValAsnLysLeuGlyAlaGlyValAspSerGly                          
     151015                                                                    
     CGGCAGGGCAGCCGGGGGACAGCTGTAGTGAAGGTGCTAGAGTGTGGA96                        
     ArgGlnGlySerArgGlyThrAlaValValLysValLeuGluCysGl y                         
     202530                                                                    
     GTTTGTGAAGATGTCTTTTCTTTGCAAGGAGACAAAGTTCCCCGTCTT144                       
     ValCysGluAspValPheSerLeuGlnGlyAspLysValProArgLeu                          
      354045                                                                   
     TTGCTTTGTGGCCATACCGTCTGTCATGACTGTCTCACTCGCCTACCT192                       
     LeuLeuCysGlyHisThrValCysHisAspCysLeuThrArgLeuPro                          
     50 5560                                                                   
     CTTCATGGAAGAGCAATCCGTTGCCCATTTGATCGACAAGTAACAGAC240                       
     LeuHisGlyArgAlaIleArgCysProPheAspArgGlnValThrAsp                          
     65 707580                                                                 
     CTAGGTGATTCAGGTGTCTGGGGATTGAAAAAAAATTTTGCTTTATTG288                       
     LeuGlyAspSerGlyValTrpGlyLeuLysLysAsnPheAlaLeuLeu                          
      859095                                                                   
     GAGCTTTTGGAACGACTGCAGAATGGGCCTATTGGTCAGTATGGAGCT336                       
     GluLeuLeuGluArgLeuGlnAsnGlyProIleGlyGlnTyrGlyAla                          
      100105110                                                                
     GCAGAAGAATCCATTGGGATATCTGGAGAGAGCATCATTCGTTGTGAT384                       
     AlaGluGluSerIleGlyIleSerGlyGluSerIleIleArgCysAsp                          
     115 120125                                                                
     GAAGATGAAGCTCACCTTGCCTCTGTATATTGCACTGTGTGTGCAACT432                       
     GluAspGluAlaHisLeuAlaSerValTyrCysThrValCysAlaThr                          
     130 135140                                                                
     CATTTGTGCTCTGAGTGTTCTCAAGTTACTCATTCTACAAAGACATTA480                       
     HisLeuCysSerGluCysSerGlnValThrHisSerThrLysThrLeu                          
     145150 155160                                                             
     GCAAAGCACAGGCGAGTTCCTCTAGCTGATAAACCTCATGAGAAAACT528                       
     AlaLysHisArgArgValProLeuAlaAspLysProHisGluLysThr                          
     165 170175                                                                
     ATGTGCTCTCAGCACCAGGTGCATGCCATTGAGTTTGTTTGCTTGGAA576                       
     MetCysSerGlnHisGlnValHisAlaIleGluPheValCysLeuGlu                          
     180 185190                                                                
     GAAGGTTGTCAAACTAGCCCACTCATGTGCTGTGTCTGCAAAGAATAT624                       
     GluGlyCysGlnThrSerProLeuMetCysCysValCysLysGluTyr                          
     19520 0205                                                                
     GGAAAACACCAGGGTCACAAGCATTCAGTATTGGAACCAGAAGCTAAT672                       
     GlyLysHisGlnGlyHisLysHisSerValLeuGluProGluAlaAsn                          
     210215 220                                                                
     CAGATCCGAGCATCAATTTTAGATATGGCTCACTGCATACGGACCTTC720                       
     GlnIleArgAlaSerIleLeuAspMetAlaHisCysIleArgThrPhe                          
     225230235 240                                                             
     ACAGAGGAAATCTCAGATTATTCCAGAAAATTAGTTGGAATTGTGCAG768                       
     ThrGluGluIleSerAspTyrSerArgLysLeuValGlyIleValGln                          
     245250 255                                                                
     CACATTGAAGGAGGAGAACAAATCGTGGAAGATGGAATTGGAATGGCT816                       
     HisIleGluGlyGlyGluGlnIleValGluAspGlyIleGlyMetAla                          
     260265 270                                                                
     CACACAGAACATGTACCAGGGACTGCAGAGAATGCCCGGTCATGTATT864                       
     HisThrGluHisValProGlyThrAlaGluAsnAlaArgSerCysIle                          
     275280 285                                                                
     CGAGCTTATTTTTATGATCTACATGAAACTCTGTGTCGTCAAGAAGAA912                       
     ArgAlaTyrPheTyrAspLeuHisGluThrLeuCysArgGlnGluGlu                          
     290295300                                                                 
     AT GGCTCTAAGTGTTGTTGATGCTCATGTTCGTGAAAAATTGATTTGG960                      
     MetAlaLeuSerValValAspAlaHisValArgGluLysLeuIleTrp                          
     305310315320                                                              
     CTCAGGCAGCAACAAGAAGATATGACTATTTTGTTGTCAGAGGTTTCT1008                      
     LeuArgGlnGlnGlnGluAspMetThrIleLeuLeuSerGluValSer                          
     32533033 5                                                                
     GCAGCCTGCCTCCACTGTGAAAAGACTTTGCAGCAGGATGATTGTAGA1056                      
     AlaAlaCysLeuHisCysGluLysThrLeuGlnGlnAspAspCysArg                          
     340345350                                                                 
      GTTGTCTTGGCAAAACAGGAAATTACAAGGTTACTGGAAACATTGCAG1104                     
     ValValLeuAlaLysGlnGluIleThrArgLeuLeuGluThrLeuGln                          
     355360365                                                                 
     AAACA GCAGCAGCAGTTTACAGAAGTTGCAGATCACATTCAGTTGGAT1152                     
     LysGlnGlnGlnGlnPheThrGluValAlaAspHisIleGlnLeuAsp                          
     370375380                                                                 
     GCCAGCATCCCTG TCACTTTTACAAAGGATAATCGAGTTCACATTGGA1200                     
     AlaSerIleProValThrPheThrLysAspAsnArgValHisIleGly                          
     385390395400                                                              
     CCAAAAATG GAAATTCGGGTCGTTACGTTAGGATTGGATGGTGCTGGA1248                     
     ProLysMetGluIleArgValValThrLeuGlyLeuAspGlyAlaGly                          
     405410415                                                                 
     AAAACTACT ATCTTGTTTAAGTTAAAACAGGATGAATTCATGCAGCCC1296                     
     LysThrThrIleLeuPheLysLeuLysGlnAspGluPheMetGlnPro                          
     420425430                                                                 
     ATTCCAACAAT TGGTTTTAACGTGGAAACTGTAGAATATAAAAATCTA1344                     
     IleProThrIleGlyPheAsnValGluThrValGluTyrLysAsnLeu                          
     435440445                                                                 
     AAATTCACTATTTGGG ATGTAGGTGGAAAACACAAATTAAGACCATTG1392                     
     LysPheThrIleTrpAspValGlyGlyLysHisLysLeuArgProLeu                          
     450455460                                                                 
     TGGAAACATTATTACCTCAATACT CAAGCTGTTGTGTTTGTTGTAGAT1440                     
     TrpLysHisTyrTyrLeuAsnThrGlnAlaValValPheValValAsp                          
     465470475480                                                              
     AGCAGTCATAGAGACAGAATT AGTGAAGCACACAGCGAACTTGCAAAG1488                     
     SerSerHisArgAspArgIleSerGluAlaHisSerGluLeuAlaLys                          
     485490495                                                                 
     TTGTTAACGGAAAAAGAACT CCGAGATGCTCTGCTCCTGATTTTTGCT1536                     
     LeuLeuThrGluLysGluLeuArgAspAlaLeuLeuLeuIlePheAla                          
     500505510                                                                 
     AACAAACAGGATGTTGCTGGAG CACTGTCAGTAGAAGAAATCACTGAA1584                     
     AsnLysGlnAspValAlaGlyAlaLeuSerValGluGluIleThrGlu                          
     515520525                                                                 
     CTACTCAGTCTCCATAAATTATGCTGT GGCCGTAGCTGGTATATTCAG1632                     
     LeuLeuSerLeuHisLysLeuCysCysGlyArgSerTrpTyrIleGln                          
     530535540                                                                 
     GGCTGTGATGCTCGAAGTGGTATGGGACTGTATGAA GGGTTGGACTGG1680                     
     GlyCysAspAlaArgSerGlyMetGlyLeuTyrGluGlyLeuAspTrp                          
     545550555560                                                              
     CTCTCACGGCAACTTGTAGCTGCTGGAGTATT GGATGTTGCTTGATTTTAAA1732                 
     LeuSerArgGlnLeuValAlaAlaGlyValLeuAspValAla                                
     565570575                                                                 
     GGCAGCAGTTGTTTGAAGTTTTGTGGTTAAAAGTAACTTTGCACA TAAAAAAAAAAAAAA1792         
     AAAATGCATCTCAAAAGATGGTAATTTAGGATGCATATATATATATATATATATAAAGGA1852          
     ATCTTGGATTGGGAATTCAGTACTTTGCTTTAAAAAAATTTTGTGGCAGAATTAAATTTC1912          
     TAATTGACGCAGATTAGATTGA ATTAAATAGAAACTTATTGAATATACATTCTTTTAAAA1972         
     AGTATATTTGTTATTTAAGTTTTTCAGATAATATGTGACCAATATACTGGGAAAGAGGTA2032          
     GTCACAGAGAAAGGGTAAGTGAAGGTTTATTCTTTCAGTGAAAAAAGAATAGCCAATTGA2092          
      GTGCCTAATGAGACCTCTGTGTGAAGCAAGTGAAGTATAGCTGCTTCTTTTAACCTGCCT2152         
     TTTCACTGAATGTTGGCAGCATTTAGTAGTAGAAATGACAGTTGCTTAATGAAATAGAAT2212          
     CCAAACTACATATTTGGATAATAGGATTACTTTATGTTTATGTTC AGAGTTAACAGAACA2272         
     CCTTTAATGCTAAGAACTATAAGGTACAGAAAATTAATACTTTATATAGTGTTTTATTAA2332          
     CTTTCTCCTACAGCATTTTGTATAAAACACAATGAGGGAGTGAAATGTTACCCAATTAGG2392          
     CTTGTCAGGTTAGTAATAAACT GAACAGTAATAAAACTGTGGAAGTAATTGGATCTGAAT2452         
     TTATGAAAGACCCATTTCCAGGACTGAACCTAGGTCAGAGCTCTAAATTGGTCCTTCTAT2512          
     TTTTCAACAAATTTAAAGTAATATTTCTTTCTAATATAATATTGCATCCTTTGTGGGAAT2572          
      GACTATAGGTAAAATGTAGTAAGTAACGCAGAACCAGGGTTGGCTTTATTTAAAAGCTAG2632         
     TGACCTAAATAGAAAGCGAACTTCAAGAGAAGTTGTAAGTACAGTGGCAAATGCTTATTA2692          
     CTTACTTCAAACTGTTTCCCAAAATAAGTGCATTTATTTTGACAA TAAAACTTAAGGCTG2752         
     TTCATGAGAAGGCCTTGAAAAGTTACTCTAGAGGAAAAATGTCTAAAGAAAAAAAAAATT2812          
     CAAAAAGTTTACATTAATTATTCAGTGTTGTGAGTAAATAAAAATGTGTGCTCTTTACTG2872          
     TTTTTCATTTTTAAAGAATATT ATTATGGAAGCACGATTTATTTAAATAGGTACATTGAG2932         
     ACTTTTTTTTTTAATGTTCTGATACATTAGGATGAAGTTAAATCTTAAATCTTATTAGTT2992          
     GAATTGTTGTAAGGACAGTGATGTCTGGTAACAAGATGTGACTTTTTGGTAGCACTGTTG3052          
      TGGTTCATTCTTTTCAAATCTATTTTTGTTTAAAAACAATACAAGTTTTAGAAAACAAAG3112         
     CATTAAAAAAAAAGCCTATCAGTATTATGGGCAATATGTAAATAAATAAATGTAATATTT3172          
     CATCCTTTATTTTTCAGGTAAAAGGTCATGCTGTTACAGGTGTAG TTTGTGTGCATAAAT3232         
     AATACTTCCGAATTAAATTATTTAATATTTGACTGATTTCAATAACTGTGAAAATAAAAA3292          
     GGTGTTGTATTGCTTGTGAG3312                                                  
     (2) INFORMATION FOR SEQ ID NO:2:                                          
     (i) SEQUENCE CHARACTERISTICS:                                             
      (A) LENGTH: 574 amino acids                                              
     (B) TYPE: amino acid                                                      
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                                   
     MetAlaThrLeuValValAsnLysLeuGlyAlaGlyValAspSerGly                          
     1510 15                                                                   
     ArgGlnGlySerArgGlyThrAlaValValLysValLeuGluCysGly                          
     202530                                                                    
     ValCysGluAspValPheSerLeuGlnGlyAspLysValProArgLeu                          
      354045                                                                   
     LeuLeuCysGlyHisThrValCysHisAspCysLeuThrArgLeuPro                          
     505560                                                                    
     LeuHisGlyArgAlaIleArgCys ProPheAspArgGlnValThrAsp                         
     65707580                                                                  
     LeuGlyAspSerGlyValTrpGlyLeuLysLysAsnPheAlaLeuLeu                          
     85 9095                                                                   
     GluLeuLeuGluArgLeuGlnAsnGlyProIleGlyGlnTyrGlyAla                          
     100105110                                                                 
     AlaGluGluSerIleGlyIleSerGlyGluSerIleIl eArgCysAsp                         
     115120125                                                                 
     GluAspGluAlaHisLeuAlaSerValTyrCysThrValCysAlaThr                          
     130135140                                                                 
     HisLeuCysSer GluCysSerGlnValThrHisSerThrLysThrLeu                         
     145150155160                                                              
     AlaLysHisArgArgValProLeuAlaAspLysProHisGluLysThr                          
     165 170175                                                                
     MetCysSerGlnHisGlnValHisAlaIleGluPheValCysLeuGlu                          
     180185190                                                                 
     GluGlyCysGlnThrSerProLeuMet CysCysValCysLysGluTyr                         
     195200205                                                                 
     GlyLysHisGlnGlyHisLysHisSerValLeuGluProGluAlaAsn                          
     210215220                                                                 
     G lnIleArgAlaSerIleLeuAspMetAlaHisCysIleArgThrPhe                         
     225230235240                                                              
     ThrGluGluIleSerAspTyrSerArgLysLeuValGlyIleValGln                          
      245250255                                                                
     HisIleGluGlyGlyGluGlnIleValGluAspGlyIleGlyMetAla                          
     260265270                                                                 
     HisThrGluHisVal ProGlyThrAlaGluAsnAlaArgSerCysIle                         
     275280285                                                                 
     ArgAlaTyrPheTyrAspLeuHisGluThrLeuCysArgGlnGluGlu                          
     290295 300                                                                
     MetAlaLeuSerValValAspAlaHisValArgGluLysLeuIleTrp                          
     305310315320                                                              
     LeuArgGlnGlnGlnGluAspMetThrIleLeuLeuSerGlu ValSer                         
     325330335                                                                 
     AlaAlaCysLeuHisCysGluLysThrLeuGlnGlnAspAspCysArg                          
     340345350                                                                 
     ValV alLeuAlaLysGlnGluIleThrArgLeuLeuGluThrLeuGln                         
     355360365                                                                 
     LysGlnGlnGlnGlnPheThrGluValAlaAspHisIleGlnLeuAsp                          
     370 375380                                                                
     AlaSerIleProValThrPheThrLysAspAsnArgValHisIleGly                          
     385390395400                                                              
     ProLysMetGluIleArgValValThrLeu GlyLeuAspGlyAlaGly                         
     405410415                                                                 
     LysThrThrIleLeuPheLysLeuLysGlnAspGluPheMetGlnPro                          
     420425 430                                                                
     IleProThrIleGlyPheAsnValGluThrValGluTyrLysAsnLeu                          
     435440445                                                                 
     LysPheThrIleTrpAspValGlyGlyLysHisLysLeuArgProLeu                          
     45 0455460                                                                
     TrpLysHisTyrTyrLeuAsnThrGlnAlaValValPheValValAsp                          
     465470475480                                                              
     SerSerHisArgAspArgI leSerGluAlaHisSerGluLeuAlaLys                         
     485490495                                                                 
     LeuLeuThrGluLysGluLeuArgAspAlaLeuLeuLeuIlePheAla                          
     5005 05510                                                                
     AsnLysGlnAspValAlaGlyAlaLeuSerValGluGluIleThrGlu                          
     515520525                                                                 
     LeuLeuSerLeuHisLysLeuCysCysGlyArgSerTrpTyr IleGln                         
     530535540                                                                 
     GlyCysAspAlaArgSerGlyMetGlyLeuTyrGluGlyLeuAspTrp                          
     545550555560                                                              
     LeuSerAr gGlnLeuValAlaAlaGlyValLeuAspValAla                               
     565570                                                                    
     (2) INFORMATION FOR SEQ ID NO:3:                                          
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 30 base pairs                                                 
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
      (iii) HYPOTHETICAL: NO                                                   
     (iv) ANTI-SENSE: NO                                                       
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                                   
     AATGGGCTGCATGAATTCATCCTGTTTTAA30                                          
     (2) INFORMATION FOR SEQ ID NO:4:                                          
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 26 base pairs                                                 
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
      (D) TOPOLOGY: linear                                                     
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:                                   
     CCTCCTTCAATGTGCTGCACAATTCC26                                              
     (2) INFORMATION FOR SEQ ID NO:5:                                          
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 20 base pairs                                                 
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:                                   
     TGAGTAACTTGAGAACACTC20                                                    
     (2) INFORMATION FOR SEQ ID NO:6:                                          
     (i) SEQUENCE CHARACTERISTICS:                                             
      (A) LENGTH: 42 base pairs                                                
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:                                   
     TGCCCTGCCGGCCACTGTCTACTCCCGCTCCGAGCTTGTTTA42                              
     (2 ) INFORMATION FOR SEQ ID NO:7:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 53 base pairs                                                 
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:                                   
     GACTAGTTCTAGATCGCGAGCGGCCGCCCTTCACCTAGGTCTGTTAC TTGTCG53                  
     (2) INFORMATION FOR SEQ ID NO:8:                                          
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 33 base pairs                                                 
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:                                   
     GGCCTGGTTCCGCGGATGGCTA CCCTGGTTGTA33                                      
     (2) INFORMATION FOR SEQ ID NO:9:                                          
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 30 base pairs                                                 
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:                                   
     GGCCTGGTTCCGCGGATGGAAATTCGGGTC30                                          
     (2) INFORMATION FOR SEQ ID NO:10:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 29 base pairs                                                 
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
      (iv) ANTI-SENSE: NO                                                      
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:                                  
     CTGCGCCTCGCTCCTCAAGCAACATCCAA29                                           
     (2) INFORMATION FOR SEQ ID NO:11:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 29 base pairs                                                 
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
      (ii) MOLECULE TYPE: cDNA                                                 
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:                                  
     CTGCGCCTCGCTCCTTTTGGTCCAATGTG29                                           
     (2) INFORMATION FOR SEQ ID NO:12:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 14 base pairs                                                 
     (B) TYPE: nucleic acid                                                    
      (C) STRANDEDNESS: single                                                 
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:                                  
     ATGGGCCTCACCAT14                                                          
     (2) INFORMATION FOR SEQ ID NO:13:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 20 base pairs                                                 
     (B) TYPE: nucleic acid                                                    
      (C) STRANDEDNESS: single                                                 
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:                                  
     TCTCACTGATGGCCATAGCA20                                                    
     (2) INFORMATION FOR SEQ ID NO:14:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 14 base pairs                                                 
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:                                  
     TCATTTGACAGCCA14                                                          
     (2) INFORMATION FOR SEQ ID NO:15:                                         
      (i) SEQUENCE CHARACTERISTICS:                                            
     (A) LENGTH: 30 base pairs                                                 
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:                                  
     TTGATAGAATTGGTCTAGGCTTGTTACAAC 30                                         
     (2) INFORMATION FOR SEQ ID NO:16:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 30 base pairs                                                 
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:                                  
     GTTGTAACAAGCCTAGACCAATTCTATCAA 30                                         
     (2) INFORMATION FOR SEQ ID NO:17:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 48 base pairs                                                 
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:                                  
     GGCTAAA CAGCAACATTGTTCTTGGTAAACAATAATTGGCAACAAAAC48                       
     (2) INFORMATION FOR SEQ ID NO:18:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 48 base pairs                                                 
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:                                  
     TCAGTGAGTTCCAAGGGGGTAACTTTAAAACATTATTGGTGTGGGCTC48                        
     (2) INFORMATION FOR SEQ ID NO:19:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 48 base pairs                                                 
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:                                  
     TGGAATCGGAACTTCCAGATCCTCATCGTCCGAGTCCGATTCACTCTG48                        
     (2) INFORMATION FOR SEQ ID NO:20:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 36 base pairs                                                 
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
      (D) TOPOLOGY: linear                                                     
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:                                  
     CTCGTGGACGATGTTGCTGTCGACCCACGCGTCCGT36                                    
     (2) INFORMATION FOR SEQ ID NO:21:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 35 base pairs                                                 
      (B) TYPE: nucleic acid                                                   
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:                                  
     CTCGTGGACGATGTTGCTGTCGACCCACGCGTCCG35                                     
     (2) INFORMATION FOR SEQ ID NO:22:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
      (A) LENGTH: 39 base pairs                                                
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:                                  
     TTTGTACAAGATCGTCGTTTTGCCAGCTGCATCTAAGCC39                                 
     (2) INFORMATION FOR SEQ ID NO:23:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 43 base pairs                                                 
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:                                  
     GACTAGTTCTAGATCGCGAGCGGCCGCCACCACCGCTATGG GC43                            
     (2) INFORMATION FOR SEQ ID NO:24:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 44 base pairs                                                 
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:                                  
     CTCGTGGACGATGTGCT GGTCGACAGCTGCCCAAACCGTCTCAG44                           
     (2) INFORMATION FOR SEQ ID NO:25:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 45 base pairs                                                 
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (xi ) SEQUENCE DESCRIPTION: SEQ ID NO:25:                                 
     CTCGTGGACGATGTGCTGGTCGACGTTAACACTCAAAACAGATTT45                           
     (2) INFORMATION FOR SEQ ID NO:26:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 55 base pairs                                                 
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (ii i) HYPOTHETICAL: NO                                                   
     (iv) ANTI-SENSE: NO                                                       
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:                                  
     CTCGTGGACGATGTGCTGGTCGACTCGAAAAATCATTTTATTAGGAATAATTCCA55                 
     (2) INFORMATION FOR SEQ ID NO:27:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 181 amino acids                                               
     (B) TYPE: amino acid                                                      
     (C) STRANDEDNESS: single                                                  
     ( D) TOPOLOGY: linear                                                     
     (ii) MOLECULE TYPE: peptide                                               
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (v) FRAGMENT TYPE: N-terminal                                             
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:                                  
     MetGlyAsnIlePheAlaAsnLeuPheLysGlyLeuPheGlyLysLys                          
     1510 15                                                                   
     GluMetArgIleLeuMetValGlyLeuAspAlaAlaGlyLysThrThr                          
     202530                                                                    
     IleLeuTyrLysLeuLysLeuGlyGluIleValThrTh rIleProThr                         
     354045                                                                    
     IleGlyPheAsnValGluThrValGluTyrLysAsnIleSerPheThr                          
     505560                                                                    
      ValTrpAspValGlyGlyGlnAspLysIleArgProLeuTrpArgHis                         
     65707580                                                                  
     TyrPheGlnAsnThrGlnGlyLeuIlePheValValAspSerAs nAsp                         
     859095                                                                    
     ArgGluArgValAsnGluAlaArgGluGluLeuMetArgMetLeuAla                          
     100105 110                                                                
     GluAspGluLeuArgAspAlaValLeuLeuValPheAlaAsnLysGln                          
     115120125                                                                 
     AspLeuProAsnAlaMetAsnAlaAlaGluIleThrAspLysLeu Gly                         
     130135140                                                                 
     LeuHisSerLeuArgHisArgAsnTrpTyrIleGlnAlaThrCysAla                          
     145150155160                                                              
      ThrSerGlyAspGlyLeuTyrGluGlyLeuAspTrpLeuSerAsnGln                         
     165170175                                                                 
     LeuArgAsnGlnLys                                                           
     180                                                                       
     (2) INFORMATION FOR SEQ ID NO:28:                                         
      (i) SEQUENCE CHARACTERISTICS:                                            
     (A) LENGTH: 181 amino acids                                               
     (B) TYPE: amino acid                                                      
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: peptide                                               
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (v) FRAGMENT TYPE: N-terminal                                             
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:28:                                  
     MetGlyAsnValPheGluLysLeuPheLysSe rLeuPheGlyLysLys                         
     151015                                                                    
     GluMetArgIleLeuMetValGlyLeuAspAlaAlaGlyLysThrThr                          
     2025 30                                                                   
     IleLeuTyrLysLeuLysLeuGlyGluIleValThrThrIleProThr                          
     354045                                                                    
     IleGlyPheAsnValGluThrValGluTyrLysAs nIleSerPheThr                         
     505560                                                                    
     ValTrpAspValGlyGlyGlnAspLysIleArgProLeuTrpArgHis                          
     657075 80                                                                 
     TyrPheGlnAsnThrGlnGlyLeuIlePheValValAspSerAsnAsp                          
     859095                                                                    
     ArgGluArgValAsnGluAlaArgGluGluLeuTh rArgMetLeuAla                         
     100105110                                                                 
     GluAspGluLeuArgAspAlaValLeuLeuValPheValAsnLysGln                          
     115120 125                                                                
     AspLeuProAsnAlaMetAsnAlaAlaGluIleThrAspLysLeuGly                          
     130135140                                                                 
     LeuHisSerLeuArgGlnArgAsnTrpTyrIleGlnAlaThrCys Ala                         
     145150155160                                                              
     ThrSerGlyAspGlyLeuTyrGluGlyLeuAspTrpLeuSerAsnGln                          
     165170 175                                                                
     LeuLysAsnGlnLys                                                           
     180                                                                       
     (2) INFORMATION FOR SEQ ID NO:29:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 181 amino acids                                               
     (B) TYPE: amino acid                                                      
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: peptide                                               
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
      (v) FRAGMENT TYPE: N-terminal                                            
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:29:                                  
     MetGlyAsnIlePheGlyLysLeuLeuLysSerLeuIleGlyLysLys                          
     151015                                                                    
     GluMetArgIleLeuMetValGl yLeuAspAlaAlaGlyLysThrThr                         
     202530                                                                    
     IleLeuTyrLysLeuLysLeuGlyGluIleValThrThrIleProThr                          
     35 4045                                                                   
     IleGlyPheAsnValGluThrValGluTyrLysAsnIleSerPheThr                          
     505560                                                                    
     ValTrpAspValGlyGlyGlnAspLysIleArgPr oLeuTrpArgHis                         
     65707580                                                                  
     TyrPheGlnAsnThrGlnGlyLeuIlePheValValAspSerAsnAsp                          
     85 9095                                                                   
     ArgGluArgValAsnGluAlaArgGluGluLeuMetArgMetLeuAla                          
     100105110                                                                 
     GluAspGluLeuArgAspAlaValLeuL euValPheAlaAsnLysGln                         
     115120125                                                                 
     AspLeuProAsnAlaMetAsnAlaAlaGluIleThrAspLysLeuGly                          
     130135 140                                                                
     LeuHisSerLeuArgHisArgAsnTrpTyrIleGlnAlaThrCysAla                          
     145150155160                                                              
     ThrSerGlyAspGlyLeuTyrGluGlyLeuAs pTrpLeuAlaAsnGln                         
     165170175                                                                 
     LeuLysAsnLysLys                                                           
     180                                                                       
     (2) INFORMATION FOR SEQ ID NO:30:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 180 amino acids                                               
     (B) TYPE: amino acid                                                      
      (C) STRANDEDNESS: single                                                 
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: peptide                                               
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (v) FRAGMENT TYPE: N-terminal                                             
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:30:                                  
     MetGlyLeuThrIleSerSerLeuPheSerArgLeuPheGlyLysLys                          
     15 1015                                                                   
     GlnMetArgIleLeuMetValGlyLeuAspAlaAlaGlyLysThrThr                          
     202530                                                                    
     IleLeuTyrLysLeuLy sLeuGlyGluIleValThrThrIleProThr                         
     354045                                                                    
     IleGlyPheAsnValGluThrValGluTyrLysAsnIleCysPheThr                          
     50 5560                                                                   
     ValTrpAspValGlyGlyGlnAspArgIleArgProLeuTrpLysHis                          
     65707580                                                                  
     TyrPheGlnAsnThrGlnGlyLe uIlePheValValAspSerAsnAsp                         
     859095                                                                    
     ArgGluArgIleGlnGluValAlaAspGluLeuGlnLysMetLeuLeu                          
     100 105110                                                                
     ValAspGluLeuArgAspAlaValLeuLeuLeuPheAlaAsnLysGln                          
     115120125                                                                 
     AspLeuProAsnAlaMetAlaIle SerGluMetThrAspLysLeuGly                         
     130135140                                                                 
     LeuGlnSerLeuArgAsnArgThrTrpTyrValGlnAlaThrCysAla                          
     145150 155160                                                             
     ThrGlnGlyThrGlyLeuTyrGluGlyLeuAspTrpLeuSerAsnGlu                          
     165170175                                                                 
     LeuSerLysArg                                                              
      180                                                                      
     (2) INFORMATION FOR SEQ ID NO:31:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 180 amino acids                                               
     (B) TYPE: amino acid                                                      
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: peptide                                               
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (v) FRAGMENT TYPE: N-terminal                                             
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:31:                                  
     MetGlyLeuThrVa lSerAlaLeuPheSerArgIlePheGlyLysLys                         
     151015                                                                    
     GlnMetArgIleLeuMetValGlyLeuAspAlaAlaGlyLysThrThr                          
      202530                                                                   
     IleLeuTyrLysLeuLysLeuGlyGluIleValThrThrIleProThr                          
     354045                                                                    
     IleGlyPheAsnValGl uThrValGluTyrLysAsnIleCysPheThr                         
     505560                                                                    
     ValTrpAspValGlyGlyGlnAspLysIleArgProLeuTrpArgHis                          
     6570 7580                                                                 
     TyrPheGlnAsnThrGlnGlyLeuIlePheValValAspSerAsnAsp                          
     859095                                                                    
     ArgGluArgValGlnGl uSerAlaAspGluLeuGlnLysMetLeuGln                         
     100105110                                                                 
     GluAspGluLeuArgAspAlaValLeuLeuValPheAlaAsnLysGln                          
     115 120125                                                                
     AspMetProAsnAlaMetProValSerGluLeuThrAspLysLeuGly                          
     130135140                                                                 
     LeuGlnHisLeuArgSerArgArgTrp TyrValGlnAlaThrCysAla                         
     145150155160                                                              
     ThrGlnGlyThrGlyLeuTyrAspGlyLeuAspTrpLeuSerHisGlu                          
     165 170175                                                                
     LeuSerLysArg                                                              
     180                                                                       
     (2) INFORMATION FOR SEQ ID NO:32:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 179 amino acids                                               
     (B) TYPE: amino acid                                                      
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: peptide                                               
     (iii ) HYPOTHETICAL: NO                                                   
     (iv) ANTI-SENSE: NO                                                       
     (v) FRAGMENT TYPE: N-terminal                                             
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:32:                                  
     MetGlyLysValLeuSerLysLeuPheLysGlyIlePheGlyAsnLys                          
     151015                                                                    
     GluMetAr gIleLeuMetLeuGlyLeuAspAlaAlaGlyLysThrThr                         
     202530                                                                    
     IleLeuTyrLysLeuLysLeuGlyGlnSerValThrThrIleProThr                          
      354045                                                                   
     ValGlyPheAsnValGluThrValThrTyrLysAsnValLysPheAsn                          
     505560                                                                    
     ValTrpAspValGlyGlyGl nAspLysIleArgProLeuTrpArgHis                         
     65707580                                                                  
     TyrTyrThrGlyThrGlnGlyLeuIlePheValValAspCysAlaAsp                          
      859095                                                                   
     ArgAspArgIleAspGluAlaArgGlnGluLeuHisArgIleIleAsn                          
     100105110                                                                 
     AspArgGluMetA rgAspAlaIleIleLeuIlePheAlaAsnLysGln                         
     115120125                                                                 
     AspLeuProAspAlaMetLysProHisGluIleGlnGluLysLeuGly                          
     130 135140                                                                
     LeuThrArgIleArgAspArgAsnTrpTyrValGlnProSerCysAla                          
     145150155160                                                              
     ThrSerGlyAspGlyLe uTyrGluGlyLeuThrTrpLeuThrSerAsn                         
     165170175                                                                 
     TyrLysSer                                                                 
     (2) INFORMATION FOR SEQ ID NO:33:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 187 amino acids                                               
     (B) TYPE: amino acid                                                      
      (C) STRANDEDNESS: single                                                 
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: peptide                                               
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (v) FRAGMENT TYPE: N-terminal                                             
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:33:                                  
     MetGlyAsnIlePheAlaAsnLeuPheLysGlyLeuPheGlyLysLys                          
     15 1015                                                                   
     GluIleArgValValThrLeuGlyLeuAspGlyAlaGlyLysThrThr                          
     202530                                                                    
     IlePheTyrLysLeuGlnAspGly GluPheMetGlnProIleProThr                         
     354045                                                                    
     IleGlyPheAsnValGluThrValGluTyrLysAsnLeuLysPheThr                          
     5055 60                                                                   
     IleTrpAspValGlyGlyLysHisLysLeuArgProLeuTrpLysHis                          
     65707580                                                                  
     TyrTyrLeuAsnThrGlnAlaValValPhe ValValAspSerSerHis                         
     859095                                                                    
     ArgAspArgIleSerGluAlaHisSerGluLeuAlaLysLeuLeuThr                          
     10010 5110                                                                
     GluLysGluLeuArgAspAlaLeuLeuLeuIlePheAlaAsnLysGln                          
     115120125                                                                 
     AspValAlaGlyAlaLeuSerValGluGluIl eThrGluLeuLeuSer                         
     130135140                                                                 
     LeuHisLysLeuCysCysGlyArgSerTrpTyrIleGlnGlyCysAsp                          
     145150155 160                                                             
     AlaArgSerGlyMetGlyLeuTyrGluGlyLeuAspTrpLeuSerArg                          
     165170175                                                                 
     GlnLeuValAlaAlaGlyValLeuAspVal Ala                                        
     180185                                                                    
     (2) INFORMATION FOR SEQ ID NO:34:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 50 base pairs                                                 
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:34:                                  
      GACTAGTTCTAGATCGCGAGCGGCCGCCCTGGATATCTAACCAAGGACAT50

Claims

1. An isolated polynucleotide encoding the ARD1 protein, said polynucleotide having the nucleotide sequence of SEQ ID NO:1.

2. A nucleic acid expression vector, said expression vector having the nucleotide sequence of SEQ ID NO:1 operably linked to a promoter.

3. A cell transfected with the expression vector of claim 2.

4. An isolated polynucleotide encoding the ARD 1 protein, said ARD1 protein having the amino acid sequence of SEQ ID NO:2.

5. The isolated polynucleotide of claim 4, wherein said polynucleotide is operably linked to a heterologous promoter.