Morphogenic protein screening method

Info

Patent number: 5741641
Type: Grant
Filed: May 26, 1995
Date of Patent: Apr 21, 1998
Assignee: Creative BioMolecules, Inc. (Hopkinton, MA)
Inventors: John E. Smart (Weston, MA), Hermann Oppermann (Medway, MA), Engin Ozkaynak (Milford, MA), Thangavel Kuberasampath (Medway, MA), David C. Rueger (Hopkinton, MA), Roy H. L. Pang (Etna, NH), Charles M. Cohen (Medway, MA)
Primary Examiner: David Guzo
Assistant Examiner: Robert Schwartzman
Law Firm: Testa, Hurwitz & Thibeault, LLP
Application Number: 8/451,953

Abstract

Disclosed is a method of screening candidate compounds for the ability to modulate the level of morphogenic protein in mammalian system. The method includes determining a parameter indicative of the level of production of a morphogenic in a cell culture known to produce the morphogen, incubating a candidate compound with the culture for a time sufficient to allow the compound to affect the production of the morphogenic protein, and then assaying the culture again to detect a change in the level of morphogenic protein production.

Description

Description

BACKGROUND OF THE INVENTION

Cell differentiation is the central characteristic of morphogenesis which initiates in the embryo, and continues to various degrees throughout the life of an organism in adult tissue repair and regeneration mechanisms. Members of the TGF-.beta. superfamily include subfamilies of highly-related genes that now are suspected to play important roles in cell differentiation and morphogenesis during development and/or during adult life. For example, the Drosophila decapentaplegic gene product (DPP) has been implicated in formation of the dorsal-ventral axis in fruit flies; activins induce mesoderm and anterior structure formation in mammals; Mullerian inhibiting substance (MIS) may be required for male sex development in mammals; growth/differentiation factor-1 (GDF-1) has been implicated in nerve development and maintenance; other morphogenic proteins (BMP-2, -3, -4 and OP-1) induce bone formation.

The development and study of a bone induction model system has identified the developmental cascade of bone differentiation as consisting of chemotaxis of mesenchymal cells, proliferation of these progenitor cells, differentiation of cartilage, ossification and hypertrophy of this cartilaginous tissue, vascular invasion, bone formation, remodeling, and finally, marrow differentiation (Reddi (1981) Collagen Rel. Res. 1:209-226). This bone model system, which is studied in adult mammals, recapitulates the cascade of bone differentiation events that occur in formation of bone in the developing fetus. In other studies, the epithelium of the urinary bladder has been shown to induce new bone formation. Huggins (1931, Arch. Surg. 22:377-408) showed that new bone formation could be induced by surgical transplantation of urinary bladder epithelium onto the parietal fascia. Urist (1965, Science 150:893-899) demonstrated that implantation of demineralized bone segments resulted in endochondral bone formation. The latter study and observation suggested the existence of an osteogenic protein and that bovine diaphyseal bone was a source of enriched preparations of osteogenic protein (Sampath et al., J. Biol. Chem. 265:13198-13205, 1990; Urist, ibid; Reddi et al., Proc. Nat. Aca. Sci. 69:1601-1605, 1972; Sampath et al., Proc. Natl. Acad. Sci. 80:6591-6595, 1983). Proteins capable of inducing endochondral bone formation in mammals when implanted in association with a matrix now have been identified in a number of different mammalian species, as have the genes encoding these proteins, (see, for example, U.S. Pat. No. 4,968,590; U.S. Ser. No. 315,342 filed Feb. 23, 1989, U.S. Pat. No. 5,011,691; and U.S. Ser. No. 599,543, filed Oct. 18, 1990, abandoned). Human OP-1 DNA has been cloned from various cDNA and genomic libraries using a consensus probe (Ozkaynak et al., EMBO J. 9:2085-2093, 1990). Purified human recombinant OP-1, expressed in mammalian cells, has been shown to induce new bone formation in vivo. Like other members of the TGF-.beta. superfamily, OP-1 is produced as a precursor, glycosylated, processed and secreted as a mature dimer. Mature OP-1 is cleaved at a maturation site following a sequence with the pattern of RXXR (Panganiban et al., Mol. Cell. Biol. 10:2669-2677, 1990).

The degree of morphogenesis in adult tissue varies among different tissues and depends on, among other factors, the degree of cell turnover in a given tissue. On this basis, tissues can be divided into three broad categories: 1) tissues with static cell populations such as nerve and skeletal muscle where there is little or no cell division and most of the cells formed during development persist throughout adult life and, therefore, possess little or no ability for normal regeneration after injury; 2) tissues containing conditionally renewing populations such as liver where there is generally little cell division but, in response to an appropriate stimulus or injury, cells can divide to produce daughters of the same differentiated cell type; and 3) tissues with permanently renewing populations including blood, bone, testes, and stratified squamous epithelia which are characterized by rapid and continuous cell turnover in the adult. Here, the terminally differentiated cells have a short life span and are replaced through proliferation of a distinct subpopulation of cells, known as stem or progenitor cells.

It is an object of this invention to provide a method of screening compounds which, when administered to a given tissue from a given organism, cause an alteration in the level of morphogenic protein ("morphogen") produced by the tissue. Such compounds, when administered systemically, will result in altered systemic or local levels of morphogenic activity. This morphogenic activity includes the ability to induce proliferation and sequential differentiation of progenitor cells, and the ability to support and maintain the differentiated phenotype or sequence of phenotypes through the progression of events that results in the formation of normal adult tissue (including organ regeneration). Thus, broadly, the invention provides a key to development of additional modalities of therapies involving modulation of morphogenic protein production in animals or adult mammals, e.g., humans, and consequent correction of conditions involving pathologic alteration of the balance of tissue cell turnover. Another object of the invention is to provide methodologies for identifying or selecting a combination of compound(s) which may increase a progenitor cell population in a mammal, stimulate progenitor cells to differentiate in vivo or in vitro, maintain the differentiated phenotype or sequence of phenotypes of a tissue, induce tissue-specific growth in vivo, or replace diseased or damaged tissues or organs in vivo. Another object of the invention is to determine the tissue(s) or organ(s) of origin of a given morphogen. Another object of the invention is to determine the specific cell type(s) within the tissue(s) or organ(s) of origin, or cell line(s) derived from the tissue(s), or organ(s) of origin, that is responsible for the synthesis and production of a given morphogen. These and other objects and features of the invention will be apparent from the description, drawing, and claims which follow.

SUMMARY OF THE INVENTION

The invention features a method of screening candidate compounds for the ability to modulate the effective local or systemic concentration or level of morphogenic protein in an organism. The method is practiced by incubating one or more candidate compound(s) with cells from a test tissue type of an organism known to produce a given morphogen for a time sufficient to allow the compound(s) to affect the production, i.e., expression and/or secretion, of morphogen by the cells; and then assaying cells and the medium conditioned by the cells for a change in a parameter indicative of the level of production of the morphogenic protein. The procedure may be used to identify compounds showing promise as drugs for human use capable of increasing or decreasing morphogen production in vivo, thereby to correct or alleviate a diseased condition.

In a related aspect, the invention features a method of screening tissue(s) of an organism to assess whether or at what level cells of the tissue(s) produce a particular morphogen, thereby to determine a tissue(s) of origin of the morphogen. This permits selection of the tissue cell type to be used in the screening. As used herein, "tissue" refers to a group of cells which are naturally found associated, including an organ.

As an example of tissue(s) or organ(s) which produce high levels of morphogen relative to the level produced by other types of tissues, it has been discovered that OP-1, first found in bone tissue is produced at relatively high levels in cells derived from renal, e.g., kidney or bladder, or adrenal tissue; that GDF-1 is produced at relatively high levels in cells derived from nerve, e.g., brain tissue; that DPP is produced at relatively high levels in cells derived from one of the following drosophila tissues: dorsal ectoderm, epithelial imaginal disc, visceral mesoderm, or gut endoderm; that Vgr-1 is produced at relatively high levels in cells derived from mouse lung tissue; and that Vgl is produced at relatively high levels in cells derived from xenopus fetal endoderm tissue. In addition, BMP3 and CBMP2B transcripts have been identified in abundance in lung tissue. As used herein, "derived" means the cells are the cultured tissue itself, or are a cell line whose parent cells are the tissue itself.

Preferred methods for determining the level of or a change in the level of a morphogen in a cultured cell include using an antibody specific for the morphogen, e.g., in an immunoassay such as an ELISA or radioimmunoassay; and determining the level of nucleic acid, most particularly mRNA, encoding the morphogen using a nucleic acid probe that hybridizes under stringent conditions with the morphogen RNA, such as in an RNA dot blot analysis. Where a change in the presence and/or concentration of morphogen is being determined, it will be necessary to measure and compare the levels of morphogen in the presence and absence of the candidate compound. The nucleic acid probe may be a nucleotide sequence encoding the morphogen or a fragment large enough to hybridize specifically only to RNA encoding a specific morphogen under stringent conditions. As used herein, "stringent conditions" are defined as conditions in which non-specific hybrids will be eluted but at which specific hybrids will be maintained, i.e., incubation at 0.1.times. (15 mM NaCl, 5 mM Na citrate) at 50.degree. C. for 15 minutes.

Examples of morphogens whose levels may be determined according to the invention include OP-1, OP-2, GDF-1, Vgr-1, DPP, 60A, CBMP2A, CBMP2B, BMP 2, 3, 4, 5, 6, or Vgl. Thus, if an immunoassay is used to indicate the presence and/or concentration of a morphogen, an antibody specific for one of these morphogens only, and which will not detect the presence of other morphogens, will be used. Similarly, if nucleic acid hybridization is used to indicate the level of RNA encoding the morphogen, a nucleotide probe specific for one of these morphogens only will be used under hybridization conditions such that the probe should not be capable of hybridizing with RNA encoding a different morphogen. A morphogen includes an active C-terminal core region, which includes at least six cysteine residues, and a region N-terminal to the C-terminal region that is relatively non-homologous to the equivalent N-terminal regions of other morphogens. In addition, the 3' noncoding region of the mRNA is unique to each morphogen. Thus, a nucleic acid probe encoding all or a portion of the sequences N-terminal to the C-terminal core region of a morphogen, or encoding all or a portion of the sequences C-terminal to or 3' to the core region of a morphogen may be used as a probe which detects mRNA encoding that morphogen only.

"Morphogenic proteins" or "morphogens", as used herein, include naturally-occurring osteogenic proteins capable of inducing the full developmental cascade of bone formation, as well as polypeptide chains not normally associated with bone or bone formation, but sharing substantial sequence homology with osteogenic proteins. Such proteins, as well as DNA sequences encoding them, have been isolated and characterized for a number of different species. See, for example, U.S. Pat. No. 4,968,590 and U.S. Pat. No. 5,011,691, U.S. application Ser. Nos. 1989; 422,699, filed Oct. 17, 1989, abandoned, and 600,024 and 599,543, both filed Oct. 18, 1990 abandoned; Sampath et al., (1990) J. Biol. Chem. 265:13198-13205; Ozkaynak et al. (1990) EMBO J. 9:2085-2093; and Lee, Proc. Nat. Aca. Sci. 88:42504254 (1991), all of which are hereby incorporated by reference. Many of these proteins subsequently were discovered to have utility beyond bone morphogenesis. See, e.g., U.S. Ser. No. 667,274 filed March 11, 1991, abandoned. The mature forms of morphogens share substantial amino acid sequence homology, especially in the C-terminal core regions of the proteins. In particular, most of the proteins share a seven-cysteine skeleton in this region, in addition to other apparently required amino acids. Table II, infra, shows the amino acid sequence homologies for nine morphogens over the carboxy terminal 102 amino acids.

Among the morphogens useful in this invention are proteins originally identified as osteogenic proteins, such as the OP-1, OP-2 and CBMP2 proteins, as well as amino acid sequence-related proteins such as DPP (from Drosophila), Vgl (from Xenopus), Vgr-1 (from mouse, see U.S. Pat. No. 5,011,691 to Oppermann et al.), GDF-1 (from mouse, see Lee (1991) PNAS 88:4250-4254), all of which are presented in Table II and Seq. ID Nos.5-14), and the recently identified 60A protein (from Drosophila, Seq. ID No. 24, see Wharton et al. (1991) PNAS 88:9214-9218.) The members of this family, which include members of the TGF-.beta. super-family of proteins, share substantial amino acid sequence homology in their C-terminal regions. The proteins are translated as a precursor, having an N-terminal signal peptide sequence, typically less than about 30 residues, followed by a "pro" domain that is cleaved to yield the mature sequence. The signal peptide is cleaved rapidly upon translation, at a cleavage site that can be predicted in a given sequence using the method of Von Heijne ((1986) Nucleic Acids Research 14:4683-4691.) Table I, below, describes the various morphogens identified to date, including their nomenclature as used herein, their Seq. ID references, and publication sources for the amino acid sequences for the full length proteins not included in the Seq. Listing. The disclosure of these publications is incorporated herein by reference.

TABLE I

"OP-1" refers generically to the group of morphogenically active proteins expressed from part or all of a DNA sequence encoding OP-1 protein, including allelic and species variants thereof, e.g., human OP-1 ("hOP-1", Seq. ID No. 5, mature protein amino acid sequence), or mouse OP-1 ("mOP-1", Seq. ID No. 6, mature protein amino acid sequence.) The conserved seven cysteine skeleton is defined by residues 38 to 139 of Seq. ID Nos. 5 and 6. The cDNA sequences and the amino acids encoding the full length proteins are provided in Seq. Id Nos. 16 and 17 (hOP1) and Seq. ID Nos. 18 and 19 (mOP1.) The mature proteins are defined by residues 293-431 (hOP1) and 292-430 (mOP1). The "pro" regions of the proteins, cleaved to yield the mature, morphogenically active proteins are defined essentially by residues 30-292 (hOP1) and residues 30-291 (mOP1).

"OP-2" refers generically to the group of active proteins expressed from part or all of a DNA sequence encoding OP-2 protein, including allelic and species variants thereof, e.g., human OP-2 ("hOP-2", Seq. ID No. 7, mature protein amino acid sequence) or mouse OP-2 ("mOP-2", Seq. ID No. 8, mature protein amino acid sequence). The conserved seven cysteine skeleton is defined by residues 38 to 139 of Seq. ID Nos. 7 and 8. The cDNA sequences and the amino acids encoding the full length proteins are provided in Seq. ID Nos. 20 and 21 (hOP2) and Seq. ID Nos. 22 and 23 (mOP2.) The mature proteins are defined essentially by residues 264-402 (hOP2) and 261-399 (mOP2). The "pro" regions of the proteins, cleaved to yield the mature, morphogenically active proteins likely are defined essentially by residues 18-263 (hOP2) and residues 18-260 (mOP2). (Another cleavage site also occurs 21 residues upstream for both OP-2 proteins.)

"CBMP2" refers generically to the morphogenically active proteins expressed from a part or all of a DNA sequence encoding the CBMP2 proteins, including allelic and species variants thereof, e.g., human CBMP2A ("CBMP2A(fx)", Seq ID No. 9) or human CBMP2B DNA ("CBMP2B(fx)", Seq. ID No. 10). The amino acid sequence for the full length proteins, referred to in the literature as BMP2A and BMP2B, or BMP2 and BMP4, appear in Wozney, et al. (1988) Science 242:1528-1534. The pro domain for BMP2 (BMP2A) likely includes residues 25-248 or 25-282; the mature protein, residues 249-396 or 283-396. The pro domain for BMP4 (BMP2B) likely includes residues 25-256 or 25-292; the mature protein, residues 257-408 or 293-408.

"DPP(fx)" refers to protein sequences encoded by the Drosophila DPP gene and defining the conserved seven cysteine skeleton (Seq. ID No. 11). The amino acid sequence for the full length protein appears in Padgett, et al (1987) Nature 325:81-84. The pro domain likely extends from the signal peptide cleavage site to residue 456; the mature protein likely is defined by residues 457-588.

"Vgl(fx)" refers to protein sequences encoded by the Xenopus Vgl gene and defining the conserved seven cysteine skeleton (Seq. ID No. 12). The amino acid sequence for the full length protein appears in Weeks (1987) Cell 51:861-867. The pro domain likely extends from the signal peptide cleavage site to residue 246; the mature protein likely is defined by residues 247-360.

"Vgr-1(fx)" refers to protein sequences encoded by the murine Vgr-1 gene and defining the conserved seven cysteine skeleton (Seq. ID No. 13). The amino acid sequence for the full length protein appears in Lyons, et al, (1989) PNAS 86:4554-4558. The pro domain likely extends from the signal peptide cleavage site to residue 299; the mature protein likely is defined by residues 300-438.

"GDF-1(fx)" refers to protein sequences encoded by the human GDF-1 gene and defining the conserved seven cysteine skeleton (Seq. ID No. 14). The cDNA and encoded amino sequence for the full length protein is provided in Seq. ID. No. 32. The pro domain likely extends from the signal peptide cleavage site to residue 214; the mature protein likely is defined by residues 215-372.

"60A" refers generically to the morphogenically active proteins expressed from part or all of a DNA sequence (from the Drosophila 60A gene) encoding the 60A proteins (see Seq. ID No. 24 wherein the CDNA and encoded amino acid sequence for the full length protein is provided). "60A(fx)" refers to the protein sequences defining the conserved seven cysteine skeleton (residues 354 to 455 of Seq. ID No. 24.) The pro domain likely extends from the signal peptide cleavage site to residue 324; the mature protein likely is defined by residues 325-455.

"BMP3(fx)" refers to protein sequences encoded by the human BMP3 gene and defining the conserved seven cysteine skeleton (Seq. ID No. 26). The amino acid sequence for the full length protein appears in Wozney et al. (1988) Science 242:1528-1534. The pro domain likely extends from the signal peptide cleavage site to residue 290; the mature protein likely is defined by residues 291-472.

"BMP5(fx)" refers to protein sequences encoded by the human BMP5 gene and defining the conserved seven cysteine skeleton (Seq. ID No. 27). The amino acid sequence for the full length protein appears in Celeste, et al. (1991) PNAS 87:9843-9847. The pro domain likely extends from the signal peptide cleavage site to residue 316; the mature protein likely is defined by residues 317-454.

"BMP6(fx)" refers to protein sequences encoded by the human BMP6 gene and defining the conserved seven cysteine skeleton (Seq. ID No. 28). The amino acid sequence for the full length protein appear sin Celeste, et al. (1990) PNAS 87:9843-5847. The pro domain likely includes extends from the signal peptide cleavage site to residue 374; the mature sequence likely includes residues 375-513.

The OP-2 proteins have an additional cysteine residue in this region (e.g., see residue 41 of Seq. ID Nos. 7 and 8), in addition to the conserved cysteine skeleton in common with the other proteins in this family. The GDF-1 protein has a four amino acid insert within the conserved skeleton (residues 44-47 of Seq. ID No. 14) but this insert likely does not interfere with the relationship of the cysteines in the folded structure. In addition, the CBMP2 proteins are missing one amino acid residue within the cysteine skeleton.

The morphogens are inactive when reduced, but are active as oxidized homodimers and when oxidized in combination with other morphogens of this invention. Thus, as defined herein, a morphogen is a dimeric protein comprising a pair of polypeptide chains, wherein each polypeptide chain comprises at least the C-terminal six cysteine skeleton defined by residues 43-139 of Seq. ID No. 5, including functionally equivalent arrangements of these cysteines (e.g., amino acid insertions or deletions which alter the linear arrangement of the cysteines in the sequence but not their relationship in the folded structure), such that, when the polypeptide chains are folded, the dimeric protein species comprising the pair of polypeptide chains has the appropriate three-dimensional structure, including the appropriate intra- and inter-chain disulfide bonds such that the protein is capable of acting as a morphogen as defined herein. Specifically, the morphogens generally are capable of the following biological functions in a morphogenically permissive environment: stimulating proliferation of progenitor cells; stimulating the differentiation of progenitor cells; stimulating the proliferation of differentiated cells; and supporting the growth and maintenance of differentiated cells, including the "redifferentiation" of transformed cells. In addition, it is also anticipated that these morphogens are capable of inducing redifferentiation of committed cells under appropriate environmental conditions.

Morphogens useful in this invention comprise one of two species of generic amino acid sequences: Generic Sequence 1 (Seq. ID No. 1) or Generic Sequence 2 (Seq. ID No. 2); where each Xaa indicates one of the 20 naturally-occurring L-isomer, .alpha.-amino acids or a derivative thereof. Generic Sequence 1 comprises the conserved six cysteine skeleton and Generic Sequence 2 comprises the conserved six cysteine skeleton plus the additional cysteine identified in OP-2 (see residue 36, Seq. ID No. 2). In another preferred aspect, these sequences further comprise the following additional sequence at their N-terminus: ##STR1##

Preferred amino acid sequences within the foregoing generic sequences include: Generic Sequence 3 (Seq. ID No. 3), Generic Sequence 4 (Seq. ID No. 4), Generic Sequence 5 (Seq. ID No. 30) and Generic Sequence 6 (Seq. ID No. 31), listed below. These Generic Sequences accommodate the homologies shared among the various preferred members of this morphogen family identified in Table II, as well as the amino acid sequence variation among them. Specifically, Generic Sequences 3 and 4 are composite amino acid sequences of the following proteins presented in Table II and identified in Seq. ID Nos. 5-14: human OP-1 (hOP-1, Seq. ID Nos. 5 and 16-17), mouse OP-1 (mOP-1, Seq. ID Nos. 6 and 18-19), human and mouse OP-2 (Seq. ID Nos. 7, 8, and 20-22), CBMP2A (Seq. ID No. 9), CBMP2B (Seq. ID No. 10), DPP (from Drosophila, Seq. ID No. 11), Vgl, (from Xenopus, Seq. ID No. 12), Vgr-1 (from mouse, Seq. ID No. 13), and GDF-1 (from mouse, Seq. ID No. 14.) The generic sequences include both the amino acid identity shared by the sequences in Table II, as well as alternative residues for the variable positions within the sequence. Note that these generic sequences allow for an additional cysteine at position 41 or 46 in Generic Sequences 3 or 4, respectively, providing an appropriate cysteine skeleton where inter- or intramolecular disulfide bonds can form, and contain certain critical amino acids which influence the tertiary structure of the proteins. ##STR2## wherein each Xaa is independently selected from a group of one or more specified amino acids defined as follows: "Res." means "residue" and Xaa at res.4=(Ser, Asp or Glu); Xaa at res.6 =(Arg, Gln, Ser or Lys); Xaa at res.7=(Asp or Glu); Xaa at res.8=(Leu or Val); Xaa at res.11=(Gln, Leu, Asp, His or Asn); Xaa at res.12=(Asp, Arg or Asn); Xaa at res.14=(Ile or Val); Xaa at res.15=(Ile or Val); Xaa at res.18=(Glu, Gln, Leu, Lys, Pro or Arg); Xaa at res.20=(Tyr or Phe); Xaa at res.21=(Ala, Ser, Asp, Met, His, Leu or Gln); Xaa at res.23=(Tyr, Asn or Phe); Xaa at res.26=(Glu, His, Tyr, Asp or Gln); Xaa at res.28=(Glu, Lys, Asp or Gln); Xaa at res.30=(Ala, Ser, Pro or Gln); Xaa at res.31=(Phe, Leu or Tyr); Xaa at res.33=(Leu or Val); Xaa at res.34=(Asn, Asp, Ala or Thr); Xaa at res.35=(Ser, Asp, Glu, Leu or Ala); Xaa at res.36=(Tyr, Cys, His, Ser or Ile); Xaa at res.37=(Met, Phe, Gly or Leu); Xaa at res.38=(Asn or Ser); Xaa at res.39=(Ala, Ser or Gly); Xaa at res.40=(Thr, Leu or Ser); Xaa at res.44=(Ile or Val); Xaa at res.45=(Val or Leu); Xaa at res.46=(Gln or Arg); Xaa at res.47=(Thr, Ala or Ser); Xaa at res.49=(Val or Met); Xaa at res.50=(His or Asn); Xaa at res.51=(Phe, Leu, Asn, Ser, Ala or Val); Xaa at res.52=(Ile, Met, Asn, Ala or Val); Xaa at res.53=(Asn, Lys, Ala or Glu); Xaa at res.54=(Pro or Ser); Xaa at res.55=(Glu, Asp, Asn, or Gly); Xaa at res.56=(Thr, Ala, Val, Lys, Asp, Tyr, Ser or Ala); Xaa at res.57=(Val, Ala or Ile); Xaa at res.58=(Pro or Asp); Xaa at res.59=(Lys or Leu); Xaa at res.60=(Pro or Ala); Xaa at res.63=(Ala or Val); Xaa at res.65=(Thr or Ala); Xaa at res.66=(Gln, Lys, Arg or Glu); Xaa at res.67=(Leu, Met or Val); Xaa at res.68=(Asn, Ser or Asp); Xaa at res.69=(Ala, Pro or Ser); Xaa at res.70=(Ile, Thr or Val); Xaa at res.71=(Ser or Ala); Xaa at res.72=(Val or Met); Xaa at res.74=(Tyr or Phe); Xaa at res.75=(Phe, Tyr or Leu); Xaa at res.76=(Asp or Asn); Xaa at res.77=(Asp, Glu, Asn or Ser); Xaa at res.78=(Ser, Gln, Asn or Tyr); Xaa at res.79=(Ser, Asn, Asp or Glu); Xaa at res.80=(Asn, Thr or Lys); Xaa at res.82=(Ile or Val); Xaa at res.84=(Lys or Arg); Xaa at res.85=(Lys, Asn, Gln or His); Xaa at res.86=(Tyr or His); Xaa at res.87=(Arg, Gln or Glu); Xaa at res.88=(Asn, Glu or Asp); Xaa at res.90=(Val, Thr or Ala); Xaa at res.92=(Arg, Lys, Val, Asp or Glu); Xaa at res.93=(Ala, Gly or Glu); and Xaa at res.97=(His or Arg); ##STR3## wherein each Xaa is independently selected from a group of one or more specified amino acids as defined by the following: "Res." means "residue" and Xaa at res.2=(Lys or Arg); Xaa at res.3=(Lys or Arg); Xaa at res.4=(His or Arg); Xaa at res.5=(Glu, Ser, His, Gly, Arg or Pro); Xaa at res.9=(Ser, Asp or Glu); Xaa at res.11=(Arg, Gln, Ser or Lys); Xaa at res.12=(Asp or Glu); Xaa at res.13=(Leu or Val); Xaa at res.16=(Gln, Leu, Asp, His or Asn); Xaa at res.17=(Asp, Arg, or Asn); Xaa at res.19=(Ile or Val); Xaa at res.20=(Ile or Val); Xaa at res.23=(Glu, Gln, Leu, Lys, Pro or Arg); Xaa at res.25=(Tyr or Phe); Xaa at res.26=(Ala, Ser, Asp, Met, His, Leu, or Gln); Xaa at res.28=(Tyr, Asn or Phe); Xaa at res.31=(Glu, His, Tyr, Asp or Gln); Xaa at res.33=Glu, Lys, Asp or Gln); Xaa at res.35=(Ala, Ser or Pro); Xaa at res.36=(Phe, Leu or Tyr); Xaa at res.38=(Leu or Val); Xaa at res.39=(Asn, Asp, Ala or Thr); Xaa at res.40=(Ser, Asp, Glu, Leu or Ala); Xaa at res.41=(Tyr, Cys, His, Ser or Ile); Xaa at res.42=(Met, Phe, Gly or Leu); Xaa at res.44=(Ala, Ser or Gly); Xaa at res.45=(Thr, Leu or Ser); Xaa at res.49=(Ile or Val); Xaa at res.50=(Val or Leu); Xaa at res.51=(Gln or Arg); Xaa at res.52=(Thr, Ala or Ser); Xaa at res.54=(Val or Met); Xaa at res.55=(His or Asn); Xaa at res.56=(Phe, Leu, Asn, Ser, Ala or Val); Xaa at res.57=(Ile, Met, Asn, Ala or Val); Xaa at res.58=(Asn, Lys, Ala or Glu); Xaa at res.59=(Pro or Ser); Xaa at res.60=(Glu, Asp, or Gly); Xaa at res.61=(Thr, Ala, Val, Lys, Asp, Tyr, Ser or Ala); Xaa at res.62=(Val, Ala or Ile); Xaa at res.63=(Pro or Asp); Xaa at res.64=(Lys or Leu); Xaa at res.65=(Pro or Ala); Xaa at res.68=(Ala or Val); Xaa at res.70=(Thr or Ala); Xaa at res.71=(Gln, Lys, Arg or Glu); Xaa at res.72=(Leu, Met or Val); Xaa at res.73=(Asn, Ser or Asp); Xaa at res.74=(Ala, Pro or Ser); Xaa at res.75=(Ile, Thr or Val); Xaa at res.76=(Ser or Ala); Xaa at res.77=(Val or Met); Xaa at res.79=(Tyr or Phe); Xaa at res.80=(Phe, Tyr or Leu); Xaa at res.81=(Asp or Asn); Xaa at res.82=(Asp, Glu, Asn or Ser); Xaa at res.83=(Ser, Gln, Asn or Tyr); Xaa at res.84=(Ser, Asn, Asp or Glu); Xaa at res.85=(Asn, Thr or Lys); Xaa at res.87=(Ile or Val); Xaa at res.89=(Lys or Arg); Xaa at res.90=(Lys, Asn, Gln or His); Xaa at res.91=(Tyr or His); Xaa at res.92=(Arg, Gln or Glu); Xaa at res.93=(Asn, Glu or Asp); Xaa at res.95=(Val, Thr or Ala); Xaa at res.97=(Arg, Lys, Val, Asp or Glu); Xaa at res.98=(Ala, Gly or Glu); and Xaa at res.102=(His or Arg).

Similarly, Generic Sequence 5 (Seq. ID No. 30) and Generic Sequence 6 (Seq. ID No. 31) accommodate the homologies shared among all the morphogen protein family members identified in Table II. Specifically, Generic Sequences 5 and 6 are composite amino acid sequences of human OP-1 (hOP-1, Seq. ID Nos. 5 and 16-17), mouse OP-1 (mOP-1, Seq. ID Nos. 6 and 18-19), human and mouse OP-2 (Seq. ID Nos. 7, 8, and 20-22), CBMP2A (Seq. ID No. 9), CBMP2B (Seq. ID No. 10), DPP (from Drosophila, Seq. ID No. 11), Vgl, (from Xenopus, Seq. ID No. 12), Vgr-1 (from mouse, Seq. ID No. 13), and GDF-1 (from mouse, Seq. ID No. 14), human BMP3 (Seq. ID No. 26), human BMP5 (Seq. ID No. 27), human BMP6 (Seq. ID No. 28) and 60(A) (from Drosophila, Seq. ID Nos. 24-25). The generic sequences include both the amino acid identity shared by these sequences in the C-terminal domain, defined by the six and seven cysteine skeletons (Generic Sequences 5 and 6, respectively), as well as alternative residues for the variable positions within the sequence. As for Generic Sequences 3 and 4, Generic Sequences 5 and 6 allow for an additional cysteine at position 41 (Generic Sequence 5) or position 46 (Generic Sequence 6), providing an appropriate cysteine skeleton where inter- or intramolecular disulfide bonds can form, and containing certain critical amino acids which influence the tertiary structure of the proteins. ##STR4## wherein each Xaa is independently selected from a group of one or more specified amino acids defined as follows: "Res." means "residue" and Xaa at res.2=(Tyr or Lys); Xaa at res.3=Val or Ile); Xaa at res.4=(Ser, Asp or Glu); Xaa at res.6=(Arg, Gln, Ser, Lys or Ala); Xaa at res.7=(Asp, Glu or Lys); Xaa at res.8=(Leu, Val or Ile); Xaa at res.11=(Gln, Leu, Asp, His, Asn or Ser); Xaa at res.12=(Asp, Arg, Asn or Glu); Xaa at res.14=(Ile or Val); Xaa at res.15=(Ile or Val); Xaa at res.16 (Ala or Ser); Xaa at res.18=(Glu, Gln, Leu, Lys, Pro or Arg); Xaa at res.19=(Gly or Ser); Xaa at res.20=(Tyr or Phe); Xaa at res.21=(Ala, Ser, Asp, Met, His, Gln, Leu or Gly); Xaa at res.23=(Tyr, Asn or Phe); Xaa at res.26=(Glu, His, Tyr, Asp, Gln or Ser); Xaa at res.28=(Glu, Lys, Asp, Gln or Ala); Xaa at res.30=(Ala, Ser, Pro, Gln or Asn); Xaa at res.31=(Phe, Leu or Tyr); Xaa at res.33=(Leu, Val or Met); Xaa at res.34=(Asn, Asp, Ala, Thr or Pro); Xaa at res.35=(Ser, Asp, Glu, Leu, Ala or Lys); Xaa at res.36=(Tyr, Cys, His, Ser or Ile); Xaa at res.37=(Met, Phe, Gly or Leu); Xaa at res.38=(Asn, Ser or Lys); Xaa at res.39=(Ala, Ser, Gly or Pro); Xaa at res.40=(Thr, Leu or Ser); Xaa at res.44=(Ile, Val or Thr); Xaa at res.45=(Val, Leu or Ile); Xaa at res.46=(Gln or Arg); Xaa at res.47=(Thr, Ala or Ser); Xaa at res.48=(Leu or Ile); Xaa at res.49=(Val or Met); Xaa at res.50=(His, Asn or Arg); Xaa at res.51=(Phe, Leu, Asn, Ser, Ala or Val); Xaa at res.52=(Ile, Met, Asn, Ala, Val or Leu); Xaa at res.53=(Asn, Lys, Ala, Glu, Gly or Phe); Xaa at res.54=(Pro, Ser or Val); Xaa at res.55=(Glu, Asp, Asn, Gly, Val or Lys); Xaa at res.56=(Thr, Ala, Val, Lys, Asp, Tyr, Ser, Ala, Pro or His); Xaa at res.57=(Val, Ala or Ile); Xaa at res.58=(Pro or Asp); Xaa at res.59=(Lys, Leu or Glu); Xaa at res.60=(Pro or Ala); Xaa at res.63=(Ala or Val); Xaa at res.65=(Thr, Ala or Glu); Xaa at res.66=(Gln, Lys, Arg or Glu); Xaa at res.67=(Leu, Met or Val); Xaa at res.68=(Asn, Ser, Asp or Gly); Xaa at res.69=(Ala, Pro or Ser); Xaa at res.70=(Ile, Thr, Val or Leu); Xaa at res.71=(Ser, Ala or Pro); Xaa at res.72=(Val, Met or Ile); Xaa at res.74=(Tyr or Phe); Xaa at res.75=(Phe, Tyr, Leu or His); Xaa at res.76=(Asp, Asn or Leu); Xaa at res.77=(Asp, Glu, Asn or Ser); Xaa at res.78=(Ser, Gln, Asn, Tyr or Asp); Xaa at res.79=(Ser, Asn, Asp, Glu or Lys); Xaa at res.80=(Asn, Thr or Lys); Xaa at res.82=(Ile, Val or Asn); Xaa at res.84=(Lys or Arg); Xaa at res.85=(Lys, Asn, Gln, His or Val); Xaa at res.86=(Tyr or His); Xaa at res.87=(Arg, Gln, Glu or Pro); Xaa at res.88=(Asn, Glu or Asp); Xaa at res.90=(Val, Thr, Ala or Ile); Xaa at res.92=(Arg, Lys, Val, Asp or Glu); Xaa at res.93=(Ala, Gly, Glu or Ser); Xaa at res.95=(Gly or Ala) and Xaa at res.97=(His or Arg). ##STR5## wherein each Xaa is independently selected from a group of one or more specified amino acids as defined by the following: "Res." means "residue" and Xaa at res.2=(Lys, Arg, Ala or Gln); Xaa at res.3=(Lys, Arg or Met); Xaa at res.4=(His, Arg or Gln); Xaa at res.5=(Glu, Ser, His, Gly, Arg, Pro, Thr, or Tyr); Xaa at res.7=(Tyr or Lys); Xaa at res.8=(Val or Ile); Xaa at res.9=(Ser, Asp or Glu); Xaa at res.11=(Arg, Gln, Ser, Lys or Ala); Xaa at res.12=(Asp, Glu, or Lys); Xaa at res.13=(Leu, Val or Ile); Xaa at res.16=(Gln, Leu, Asp, His, Asn or Ser); Xaa at res.17=(Asp, Arg, Asn or Glu); Xaa at res.19=(Ile or Val); Xaa at res.20=(Ile or Val); Xaa at res.21=(Ala or Ser); Xaa at res.23=(Glu, Gln, Leu, Lys, Pro or Arg); Xaa at res.24=(Gly or Ser); Xaa at res.25=(Tyr or Phe); Xaa at res.26=(Ala, Ser, Asp, Met, His, Gln, Leu, or Gly); Xaa at res.28=(Tyr, Asn or Phe); Xaa at res.31=(Glu, His, Tyr, Asp, Gln or Ser); Xaa at res.33=Glu, Lys, Asp, Gln or Ala); Xaa at res.35=(Ala, Ser, Pro, Gln or Asn); Xaa at res.36=(Phe, Leu or Tyr); Xaa at res.38=(Leu, Val or Met); Xaa at res.39=(Asn, Asp, Ala, Thr or Pro); Xaa at res.40=(Ser, Asp, Glu, Leu, Ala or Lys); Xaa at res.41=(Tyr, Cys, His, Ser or Ile); Xaa at res.42=(Met, Phe, Gly or Leu); Xaa at res.43=(Asn, Ser or Lys); Xaa at res.44=(Ala, Ser, Gly or Pro); Xaa at res.45=(Thr, Leu or Ser); Xaa at res.49=(Ile, Val or Thr); Xaa at res.50=(Val, Leu or Ile); Xaa at res.51=(Gln or Arg); Xaa at res.52=(Thr, Ala or Ser); Xaa at res.53=(Leu or Ile); Xaa at res.54=(Val or Met); Xaa at res.55=(His, Asn or Arg); Xaa at res.56=(Phe, Leu, Asn, Ser, Ala or Val); Xaa at res.57=(Ile, Met, Asn, Ala, Val or Leu); Xaa at res.58=(Asn, Lys, Ala, Glu, Gly or Phe); Xaa at res.59=(Pro, Ser or Val); Xaa at res.60=(Glu, Asp, Gly, Val or Lys); Xaa at res.61=(Thr, Ala, Val, Lys, Asp, Tyr, Ser, Ala, Pro or His); Xaa at res.62=(Val, Ala or Ile); Xaa at res.63=(Pro or Asp); Xaa at res.64=(Lys, Leu or Glu); Xaa at res.65=(Pro or Ala); Xaa at res.68=(Ala or Val); Xaa at res.70=(Thr, Ala or Glu); Xaa at res.71=(Gln, Lys, Arg or Glu); Xaa at res.72=(Leu, Met or Val); Xaa at res.73=(Asn, Ser, Asp or Gly); Xaa at res.74=(Ala, Pro or Ser); Xaa at res.75=(Ile, Thr, Val or Leu); Xaa at res.76=(Ser, Ala or Pro); Xaa at res.77=(Val, Met or Ile); Xaa at res.79=(Tyr or Phe); Xaa at res.80=(Phe, Tyr, Leu or His); Xaa at res.81=(Asp, Asn or Leu); Xaa at res.82=(Asp, Glu, Asn or Ser); Xaa at res.83=(Ser, Gln, Asn, Tyr or Asp); Xaa at res.84=(Ser, Asn, Asp, Glu or Lys); Xaa at res.85=(Asn, Thr or Lys); Xaa at res.87=(Ile, Val or Asn); Xaa at res.89=(Lys or Arg); Xaa at res.90=(Lys, Asn, Gln, His or Val); Xaa at res.91=(Tyr or His); Xaa at res.92=(Arg, Gln, Glu or Pro); Xaa at res.93=(Asn, Glu or Asp); Xaa at res.95=(Val, Thr, Ala or Ile); Xaa at res.97=(Arg, Lys, Val, Asp or Glu); Xaa at res.98=(Ala, Gly, Glu or Ser); Xaa at res.100=(Gly or Ala); and Xaa at res.102=(His or Arg).

Particularly useful sequences for use as morphogens in this invention include the C-terminal domains, e.g., the C-terminal 96-102 amino acid residues of Vgl, Vgr-1, DPP, OP-1, OP-2, CBMP-2A, CBMP-2B, GDF-1 (see Table II, below, and Seq. ID Nos. 5-14), as well as proteins comprising the C-terminal domains of 60A, BMP3, BMP5 and BMP6 (see Seq. ID Nos. 24-28), all of which include at least the conserved six or seven cysteine skeleton. In addition, biosynthetic constructs designed from the generic sequences, such as COP-1, 3-5, 7, 16, disclosed in U.S. Pat. No. 5,011,691, also are useful. Other sequences include the inhibins/activin proteins (see, for example, U.S. Pat. Nos. 4,968,590 and 5,011,691). Accordingly, other useful sequences are those sharing at least 70% amino acid sequence homology or "similarity", and preferably 80% homology or similarity with any of the sequences above. These are anticipated to include allelic and species variants and mutants, and biosynthetic muteins, as well as novel members of this morphogenic family of proteins. Particularly envisioned in the family of related proteins are those proteins exhibiting morphogenic activity and wherein the amino acid changes from the preferred sequences include conservative changes, e.g., those as defined by Dayoff et al., Atlas of Protein Sequence and Structure; vol. 5, Suppl. 3, pp. 345-362, (M. O. Dayoff, ed., Nat'l BioMed. Research Fdn., Washington, D.C. 1979). As used herein, potentially useful sequences are aligned with a known morphogen sequence using the method of Needleman et al. ((1970) J. Mol. Biol. 48:443-453) and identities calculated by the Align program (DNAstar, Inc.). "Homology" or "similarity" as used herein includes allowed conservative changes as defined by Dayoff et al.

Morphogen sequences which are detectable according to the methods of the invention include but are not limited to those having greater than 60% identity, preferably greater than 65% identity, with the amino acid sequence defining the conserved six cysteine skeleton of hOP1 (e.g., residues 43-139 of Seq. ID No. 5). These most preferred sequences include both allelic and species variants of the OP-1 and OP-2 proteins, including the Drosophila 60A protein. Accordingly, morphogens which are detectable according to the invention include active proteins comprising species of polypeptide chains having the generic amino acid sequence herein referred to as "OPX", which accommodates the homologies between the various identified species of OP1 and OP2 (Seq. ID No. 29).

The morphogens detectable in the methods of this invention include proteins comprising any of the polypeptide chains described above, whether isolated from naturally-occurring sources, or produced by recombinant DNA or other synthetic techniques, and includes allelic and species variants of these proteins, naturally-occurring or biosynthetic mutants thereof, chimeric variants containing a domain(s) or region(s) of one family member functionally arranged with another domain(s) or regions(s) of a second family member, as well as various truncated and fusion constructs. Deletion or insertion or addition mutants also are envisioned to be active, including those which may alter the conserved C-terminal cysteine skeleton, provided that the alteration does not functionally disrupt the relationship of these cysteines in the folded structure. Accordingly, such active forms are considered the equivalent of the specifically described constructs disclosed herein. The proteins may include forms having varying glycosylation patterns, varying N-termini, a family of related proteins having regions of amino acid sequence homology, and active truncated or mutated forms of native or biosynthetic proteins, produced by expression of recombinant DNA in host cells.

The morphogenic proteins can be expressed from intact or truncated cDNA or from synthetic DNAs in procaryotic or eucaryotic host cells, and purified, cleaved, refolded, and dimerized to form morphogenically active compositions. Currently preferred host cells include E. coli or mammalian cells, such as CHO, COS or BSC cells. A detailed description of the morphogens detectable according to the methods of this invention is disclosed in copending U.S. patent application Ser. Nos. 752,764, filed Aug. 30, 1991, abandoned, and 667,274, filed Mar. 11, 1991, abandoned, the disclosure of which are incorporated herein by reference.

The screening method of the invention provides a simple method of determining a change in the level of morphogenic protein as a result of exposure of cultured cells to one or more compound(s). The level of a morphogenic protein in a given cell culture, or a change in that level resulting from exposure to one or more compound(s) indicates that direct application of the compound modulates the level of the morphogen expressed by the cultured cells. If, for example, a compound upregulated the production of OP-1 by a kidney cell line, it would then be desirable to test systemic administration of this compound in an animal model to determine if it upregulated the production of OP-1 in vivo. If this compound did upregulate the endogenous circulating levels of OP-1, it would be consistent with administration of the compound systemically for the purpose of correcting bone metabolism diseases such as osteoporosis. The level of morphogen in the body may be a result of a wide range of physical conditions, e.g., tissue degeneration such as occurs in diseases including arthritis, emphysema, osteoporosis, kidney diseases, lung diseases, cardiomyopathy, and cirrhosis of the liver. The level of morphogens in the body may also occur as a result of the normal process of aging. A compound selected by the screening method of the invention as, for example, one which increases the level of morphogen in a tissue, may be consistent with the administration of the compound systemically or locally to a tissue for the purpose of preventing some form of tissue degeneration or for restoring the degenerated tissue to its normal healthy level.

Other advantages of the invention include determining the tissue or tissues of origin of a given morphogen in order to administer a compound aimed at modulating the systemic level of morphogen for treatment of a disease or condition in which the level of morphogen production has become altered.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows the fragments of OP-1, used as probes in Northern hybridizations useful in the processes of the invention.

FIG. 2 shows results of Northern blot analysis of RNA using different OP-1-specific probes.

FIG. 3 shows results of Northern blot analysis of RNA from different cells types probed with an OP-1 probe.

DETAILED DESCRIPTION

The invention is based on the discovery of a family of structurally related morphogenic proteins (BMPs), also called osteogenic proteins (OPs), and more particularly that various of these proteins play an important role, not only in embryogenesis, but also in tissue and organ maintenance and repair in juvenile and adult mammals. Morphogenic proteins which have been identified include BMP 2, 3, 4, 5, 6, OP-1 and OP-2 (murine and human), Vgr-1, Vgl, DPP, GDF-1, CMBP-2A, CMBP-2B, 60A, and the inhibin/activin class of proteins. Other recombinant proteins include COP1, COP3, COP4, COP5, COP7, and COP16. While, as explained herein, the morphogen have significant homologies and similarities in structure, it is hypothesized that variants within the morphogenic protein genes may have specific roles in specific tissue involving, for example, stimulation of progenitor cell multiplication, tissue specific or tissue preferred phenotype maintenance, and/or stimulation or modulation of the rate of differentiation, growth or replication of tissue cells characterized by high turnover. The effect on the long-term physiology, maintenance and repair of particular tissues by particular species of the morphogens is currently unknown in any significant detail. However, methods useful in determining which particular tissues express which particular morphogen(s), and for finding changes which stimulate or depress morphogen expression in vivo, would enable discovery and development of strategies for therapeutic treatment of a large number of diseased states, and provide drugs designed to implement the strategy.

This invention provides such methods and, more specifically, two generic processes for obtaining data which ultimately will permit determination of structure/activity relationships of specific naturally occurring mammalian morphogens and drugs capable of modulating their production. For example, using the assay of the invention, it has been determined that OP-1, first found in bone and demonstrated to be osteoinductive, is synthesized primarily in kidney, bladder, and adrenal tissue. This surprising discovery, coupled with the observation that patients with kidney disease often express loss of bone mass, suggests that the bone loss in these patients may be due to pathologic depression of OP-1 synthesis in kidney, and suggests that administration of OP-1 systemically or stimulation of OP-1 expression and secretion by the kidney may arrest bone loss, or effect remineralization through increased bone formation (i.e., osteogenesis).

There are two fundamental aspects of the invention. One aspect involves an assay to determine tissues and cell types capable of synthesis and secretion of the morphogens; the other involves the use of the identified cell types configured in a screening system to find substances useful therapeutically to modulate, i.e., stimulate or depress, morphogen expression and/or secretion.

The assay to determine the tissue of origin of a given morphogen involves screening a plurality (i.e., two or more) different tissues by determining a parameter indicative of production of a morphogen in the tissue, and comparing the parameters. The tissue(s) of origin will, of course, be the tissue that produces that morphogen.

The other assay of the invention involves screening candidate compounds for their ability to modulate the effective systemic or local concentration of a morphogen by incubating the compound with a cell culture that produces the morphogen, and assaying the culture for a parameter indicative of a change in the production level of the morphogen. Useful candidate compounds then may be tested for in vivo efficacy in a suitable animal model. These compounds then may be used in vivo to modulate effective morphogen concentrating in the disease treatment.

1. Morphogen Tissue Distribution

Morphogens are broadly distributed in developing and adult tissue. For example, DPP and 60A are expressed in both embryonic and developing Drosophila tissue. Vgl has been identified in Xenopus embryonic tissue. Vgr-1 transcripts have been identified in a variety of murine tissues, including embryonic and developing brain, lung, liver, kidney and calvaria (dermal bone) tissue. In addition, both CBMP2B and CBMP3 have been identified in lung tissue. Recently, Vgr-1 transcripts also have been identified in adult murine lung, kidney, heart, and brain tissue, with particularly high levels in the lung (see infra). GDF-1 has been identified in human adult cerebellum and in fetal brain tissue. In addition, recent Northern blot analyses indicate that OP-1 is encoded by multiple transcripts in different tissues. This potential alternative splicing is consistent with the hypothesis that the longer transcripts may encoded additional proteins (e.g., bicistronic mRNA) and each form may be tissue or developmentally related.

OP-1 and the CBMP2 proteins, both first identified as bone morphogens, have been identified in mouse and human placenta, hippocampus, calvaria and osteosarcoma tissue as determined by identification of OP-1 and CMBP2-specific sequences in CDNA libraries constructed from these tissues (see U.S. Ser. No. 422,699, abandoned, incorporated herein by reference). Additionally, the OP-1 protein is present in a variety of embryonic and developing tissues including kidney, liver, heart and brain as determined by Western blot analysis and immunolocalization (see infra). OP-1-specific transcripts also have been identified in both embryonic and developing tissues, most abundantly in developing kidney, bladder, adrenal and (see infra). OP-1 also has been identified as a mesoderm inducing factor present during embryogenesis. Moreover, OP-1 has been shown to be associated with satellite cells in the muscle and associated with potential pluripotential stem cells in bone marrow following damage to adult murine endochondral bone, indicating its morphogenic role in tissue repair and regeneration. In addition, a novel protein GDF-1 comprising a 7 cysteine skeleton, has been identified in neural tissue (Lee, 1991, Proc. Nat. Aca. Sci. 88:4250-4254).

Knowledge of the tissue distribution of a given morphogen may be useful in choosing a cell type for screening according to the invention, or for targeting that cell type or tissue type for treatment. The proteins (or their mRNA transcripts) are readily identified in different tissues using standard methodologies and minor modifications thereof in tissues where expression may be low. For example, protein distribution may be determined using standard Western blot analysis or immunocytochemical techniques, and antibodies specific to the morphogen or morphogens of interest. Similarly, the distribution of morphogen transcripts may be determined using standard Northern hybridization protocols and a transcript-specific probe and hybridization conditions.

2. Useful Morphogens

As defined herein a protein is morphogenic if it is capable of inducing the developmental cascade of cellular and molecular events that culminate in the formation of new, organ-specific tissue and comprises at least the conserved C-terminal six cysteine skeleton or its functional equivalent (see supra). Specifically, the morphogens generally are capable of all of the following biological functions in a morphogenically permissive environment: stimulating proliferation of progenitor cells; stimulating the differentiation of progenitor cells; stimulating the proliferation of differentiated cells; and supporting the growth and maintenance of differentiated cells, including the "redifferentiation" of transformed cells. Details of how the morphogens detectable according to the methods of this invention first were identified, as well as a description on how to make, use and test them for morphogenic activity are disclosed in U.S. Ser. No. 667,274, filed Mar. 11, 1991, abandoned and U.S. Ser. No. 752,764, filed Aug. 30, 1991, abandoned, the disclosures of which are hereby incorporated by reference. As disclosed therein, the morphogens may be purified from naturally-sourced material or recombinantly produced from procaryotic or eucaryotic host cells, using the genetic sequences disclosed therein. Alternatively, novel morphogenic sequences may be identified following the procedures disclosed therein.

Particularly useful proteins include those which comprise the naturally derived sequences disclosed in Table II. Other useful sequences include biosynthetic constructs such as those disclosed in U.S. Pat. 5,011,691, the disclosure of which is incorporated herein by reference (e.g., COP-1, COP-3, COP-4, COP-5, COP-7, and COP-16).

Accordingly, the morphogens detectable according to the methods and compositions of this invention also may be described by morphogenically active proteins having amino acid sequences sharing 70% or, preferably, 80% homology (similarity) with any of the sequences described above, where "homology" is as defined herein above.

The morphogens detectable according to the method of this invention also can be described by any of the 6 generic sequences described herein (Generic Sequences 1, 2, 3, 4, 5 and 6). Generic sequences 1 and 2 also may include, at their N-terminus, the sequence ##STR6##

Table II, set forth below, compares the amino acid sequences of the active regions of native proteins that have been identified as morphogens, including human OP-1 (hOP-1, Seq. ID Nos. 5 and 16-17), mouse OP-1 (mOP-1, Seq. ID Nos. 6 and 18-19), human and mouse OP-2 (Seq. ID Nos. 7, 8, and 20-23), CBMP2A (Seq. ID No. 9), CBMP2B (Seq. ID No. 10), BMP3 (Seq. ID No. 26), DPP (from Drosophila, Seq. ID No. 11), Vgl, (from Xenopus, Seq. ID No. 12), Vgr-1 (from mouse, Seq. ID No. 13), GDF-1 (from mouse, Seq. ID Nos. 14, 32 and 33), 60A protein (from Drosophila, Seq. ID Nos. 24 and 25), BMP5 (Seq. ID No. 27) and BMP6 (Seq. ID No. 28). The sequences are aligned essentially following the method of Needleman et al. (1970) J. Mol. Biol., 48:443-453, calculated using the Align Program (DNAstar, Inc.) In the table, three dots indicates that the amino acid in that position is the same as the amino acid in hOP-1. Three dashes indicates that no amino acid is present in that position, and are included for purposes of illustrating homologies. For example, amino acid residue 60 of CBMP-2A and CBMP-2B is "missing". Of course, both these amino acid sequences in this region comprise Asn-Ser (residues 58, 59), with CBMP-2A then comprising Lys and Ile, whereas CBMP-2B comprises Ser and Ile.

                                    TABLE II                                
     __________________________________________________________________________
     SEQ                                                                       
     ID No.                                                                    
     __________________________________________________________________________
     hOP-1                                                                     
          5   Cys                                                              
                 Lys                                                           
                    Lys                                                        
                       His                                                     
                          Glu                                                  
                             Leu                                               
                                Tyr                                            
                                   Val                                         
                                      Ser                                      
                                         Phe                                   
                                            Arg Asp                            
                                                   Leu                         
                                                      Gly                      
                                                         Trp                   
     mOP-1                                                                     
          6   -- -- -- -- -- -- -- -- -- -- --  -- -- -- --                    
     hOP-2                                                                     
          7   -- Arg                                                           
                    Arg                                                        
                       -- -- -- -- -- -- -- Gln -- -- -- --                    
     mOP-2                                                                     
          8   -- Arg                                                           
                    Arg                                                        
                       -- -- -- -- -- Ser                                      
                                         -- --  -- -- -- --                    
     DPP  11  -- Arg                                                           
                    Arg                                                        
                       -- Ser                                                  
                             -- -- -- Asp                                      
                                         -- Ser -- Val                         
                                                      -- --                    
     Vgl  12  -- -- Lys                                                        
                       Arg                                                     
                          His                                                  
                             -- -- -- Glu                                      
                                         -- Lys -- Val                         
                                                      -- --                    
     Vgr-1                                                                     
          13  -- -- -- -- Gly                                                  
                             -- -- -- -- -- Gln -- Val                         
                                                      -- --                    
     CBMP-2A                                                                   
          9   -- -- Arg                                                        
                       -- Pro                                                  
                             -- -- -- Asp                                      
                                         -- Ser -- Val                         
                                                      -- --                    
     CBMP-2B                                                                   
          10  -- Arg                                                           
                    Arg                                                        
                       -- Ser                                                  
                             -- -- -- Asp                                      
                                         -- Ser -- Val                         
                                                      -- --                    
     BMP3 26  -- Ala                                                           
                    Arg                                                        
                       Arg                                                     
                          Tyr                                                  
                             -- Lys                                            
                                   -- Asp                                      
                                         -- Ala -- Ile                         
                                                      -- --                    
     GDF-1                                                                     
          14  -- Arg                                                           
                    Ala                                                        
                       Arg                                                     
                          Arg                                                  
                             -- -- -- -- -- --  Glu                            
                                                   Val                         
                                                      -- --                    
     60A  25  -- Gln                                                           
                    Met                                                        
                       Glu                                                     
                          Thr                                                  
                             -- -- -- Asp                                      
                                         -- Lys -- -- -- --                    
     BMP5 27  -- -- -- -- -- -- -- -- -- -- --  -- -- -- --                    
     BMP6 28  -- Arg                                                           
                    -- -- -- -- -- -- -- -- Gln -- -- -- --                    
              1           5              10              15                    
     __________________________________________________________________________
     hOP-1                                                                     
          5   Gln                                                              
                 Asp                                                           
                    Trp                                                        
                       Ile                                                     
                          Ile                                                  
                             Ala                                               
                                Pro                                            
                                   Glu                                         
                                      Gly                                      
                                         Try                                   
                                            Ala Ala                            
                                                   Tyr                         
                                                      Tyr                      
                                                         Cys                   
     mOP-1                                                                     
          6   -- -- -- -- -- -- -- -- -- -- --  -- -- -- --                    
     hOP-2                                                                     
          7   Leu                                                              
                 -- -- Val                                                     
                          -- -- -- Gln                                         
                                      -- -- Ser -- -- -- --                    
     mOP-2                                                                     
          8   Leu                                                              
                 -- -- Val                                                     
                          -- -- -- Gln                                         
                                      -- -- Ser -- -- -- --                    
     DPP  11  Asp                                                              
                 -- -- -- Val                                                  
                             -- -- Leu                                         
                                      -- -- Asp -- -- -- --                    
     Vgl  12  -- Asn                                                           
                    -- Val                                                     
                          -- -- -- Gln                                         
                                      -- -- Met -- Asn                         
                                                      -- --                    
     Vgr-1                                                                     
          13  -- -- -- -- -- -- -- Lys                                         
                                      -- -- --  -- Asn                         
                                                      -- --                    
     CBMP-2A                                                                   
          9   Asn                                                              
                 -- -- -- Val                                                  
                             -- -- Pro                                         
                                      -- -- His -- Phe                         
                                                      -- --                    
     CBMP-2B                                                                   
          10  Asn                                                              
                 -- -- -- Val                                                  
                             -- -- Pro                                         
                                      -- -- Gln -- Phe                         
                                                      -- --                    
     BMP3 26  Ser                                                              
                 Glu                                                           
                    -- -- -- Ser                                               
                                -- Lys                                         
                                      Ser                                      
                                         Phe                                   
                                            Asp -- -- -- --                    
     GDF-1                                                                     
          14  His                                                              
                 Arg                                                           
                    -- Val                                                     
                          -- -- -- Arg                                         
                                      -- Phe                                   
                                            Leu -- Asn                         
                                                      -- --                    
     60A  25  His                                                              
                 -- -- -- -- -- -- -- -- -- Gly -- Phe                         
                                                      -- --                    
     BMP5 27  -- -- -- -- -- -- -- -- -- -- --  -- Phe                         
                                                      -- --                    
     BMP6 28  -- -- -- -- -- -- -- Lys                                         
                                      -- -- --  -- Asn                         
                                                      -- --                    
                          20             25              30                    
     __________________________________________________________________________
     hOP-1                                                                     
          5   Glu                                                              
                 Gly                                                           
                    Glu                                                        
                       Cys                                                     
                          Ala                                                  
                             Phe                                               
                                Pro                                            
                                   Leu                                         
                                      Asn                                      
                                         Ser                                   
                                            Tyr Met                            
                                                   Asn                         
                                                      Ala                      
                                                         Thr                   
     mOP-1                                                                     
          6   -- -- -- -- -- -- -- -- -- -- --  -- -- -- --                    
     hOP-2                                                                     
          7   -- -- -- -- Ser                                                  
                             -- -- -- Asp                                      
                                         -- Cys -- -- -- --                    
     mOP-2                                                                     
          8   -- -- -- -- -- -- -- -- Asp                                      
                                         -- Cys -- -- -- --                    
     DPP  11  His                                                              
                 -- Lys                                                        
                       -- Pro                                                  
                             -- -- -- Ala                                      
                                         Asp                                   
                                            His Phe                            
                                                   -- Ser                      
                                                         --                    
     Vgl  12  Tyr                                                              
                 -- -- -- Pro                                                  
                             Tyr                                               
                                -- -- Thr                                      
                                         Glu                                   
                                            Ile Leu                            
                                                   -- Gly                      
                                                         Ser                   
     Vgr-1                                                                     
          13  Asp                                                              
                 -- -- -- Ser                                                  
                             -- -- -- -- Ala                                   
                                            His -- -- -- --                    
     CBMP-2A                                                                   
          9   His                                                              
                 -- Glu                                                        
                       -- Pro                                                  
                             -- -- -- Ala                                      
                                         Asp                                   
                                            His Leu                            
                                                   -- Ser                      
                                                         --                    
     CBMP-2B                                                                   
          10  His                                                              
                 -- Asp                                                        
                       -- Pro                                                  
                             -- -- -- Ala                                      
                                         Asp                                   
                                            His Leu                            
                                                   -- Ser                      
                                                         --                    
     BMP3 26  Ser                                                              
                 -- Ala                                                        
                       -- Gln                                                  
                             -- -- Met                                         
                                      Pro                                      
                                         Lys                                   
                                            Ser Leu                            
                                                   Lys                         
                                                      Pro                      
                                                         Ser                   
     GDF-1                                                                     
          14  Gln                                                              
                 -- Gln                                                        
                       -- -- Leu                                               
                                -- Val                                         
                                      Ala                                      
                                         Leu                                   
                                            Ser Gly                            
                                                   Ser**                       
                                                      -- Leu                   
     60A  25  Ser                                                              
                 -- -- -- Asn                                                  
                             -- -- -- -- Ala                                   
                                            His -- -- -- --                    
     BMP5 27  Asp                                                              
                 -- -- -- Ser                                                  
                             -- -- -- -- Ala                                   
                                            His Met                            
                                                   -- -- --                    
     BMP6 28  Asp                                                              
                 -- -- -- Ser                                                  
                             -- -- -- -- Ala                                   
                                            His Met                            
                                                   -- -- --                    
                          35             40              45                    
     __________________________________________________________________________
     hOP-1                                                                     
          5   Asn                                                              
                 His                                                           
                    Ala                                                        
                       Ile                                                     
                          Val                                                  
                             Gln                                               
                                Thr                                            
                                   Leu                                         
                                      Val                                      
                                         His                                   
                                            Phe Ile                            
                                                   Asn                         
                                                      Pro                      
                                                         Glu                   
     mOP-1                                                                     
          6   -- -- -- -- -- -- -- -- -- -- --  -- -- -- Asp                   
     hOP-2                                                                     
          7   -- -- -- -- Leu                                                  
                             -- Ser                                            
                                   -- -- His                                   
                                            Leu Met                            
                                                   Lys                         
                                                      -- Asn                   
     mOP-2                                                                     
          8   -- -- -- -- Leu                                                  
                             -- Ser                                            
                                   -- -- His                                   
                                            Leu Met                            
                                                   Lys                         
                                                      -- Asp                   
     DPP  11  -- -- -- Val                                                     
                          -- -- -- -- -- Asn                                   
                                            Asn Asn                            
                                                   -- -- Gly                   
     Vgl  12  -- -- -- -- Leu                                                  
                             -- -- -- -- -- Ser -- Glu                         
                                                      -- --                    
     Vgr-1                                                                     
          13  -- -- -- -- -- -- -- -- -- -- Val Met                            
                                                   -- -- --                    
     CBMP-2A                                                                   
          9   -- -- -- -- -- -- -- -- -- Asn                                   
                                            Ser Val                            
                                                   -- Ser                      
                                                         --                    
     CBMP-2B                                                                   
          10  -- -- -- -- -- -- -- -- -- Asn                                   
                                            Ser Val                            
                                                   -- Ser                      
                                                         --                    
     BMP3 26  -- -- -- Thr                                                     
                          Ile                                                  
                             -- Ser                                            
                                   Ile                                         
                                      -- Arg                                   
                                            Ala**                              
                                                Gly                            
                                                   Val                         
                                                      Val                      
                                                         Pro                   
     GDF-1                                                                     
          14  -- -- -- Val                                                     
                          Leu                                                  
                             Arg                                               
                                Ala                                            
                                   -- Met                                      
                                         -- Ala Ala                            
                                                   Ala                         
                                                      -- Gly                   
     60A  25  -- -- -- -- -- -- -- -- -- -- Leu Leu                            
                                                   Glu                         
                                                      -- Lys                   
     BMP5 27  -- -- -- -- -- -- -- -- -- -- Leu Met                            
                                                   Phe                         
                                                      -- Asp                   
     BMP6 28  -- -- -- -- -- -- -- -- -- -- Leu Met                            
                                                   -- -- --                    
                          50             55              60                    
     __________________________________________________________________________
     hOP-1                                                                     
          5   Thr                                                              
                 Val                                                           
                    Pro                                                        
                       Lys                                                     
                          Pro                                                  
                             Cys                                               
                                Cys                                            
                                   Ala                                         
                                      Pro                                      
                                         Thr                                   
                                            Gln Leu                            
                                                   Asn                         
                                                      Ala                      
                                                         Ile                   
     mOP-1                                                                     
          6   -- -- -- -- -- -- -- -- -- -- --  -- -- -- --                    
     hOP-2                                                                     
          7   Ala                                                              
                 -- -- -- Ala                                                  
                             -- -- -- -- -- Lys -- Ser                         
                                                      -- Thr                   
     mOP-2                                                                     
          8   Val                                                              
                 -- -- -- Ala                                                  
                             -- -- -- -- -- Lys -- Ser                         
                                                      -- Thr                   
     DPP  11  Lys                                                              
                 -- -- -- Ala                                                  
                             -- -- Val                                         
                                      -- -- --  -- Asp                         
                                                      Ser                      
                                                         Val                   
     Vgl  12  Asp                                                              
                 Ile                                                           
                    -- Leu                                                     
                          -- -- -- Val                                         
                                      -- -- Lys Met                            
                                                   Ser                         
                                                      Pro                      
                                                         --                    
     Vgr-1                                                                     
          13  Tyr                                                              
                 -- -- -- -- -- -- -- -- -- Lys Val                            
                                                   -- -- --                    
     CBMP-2A                                                                   
          9   Lys                                                              
                 Ile                                                           
                    -- -- Ala                                                  
                             -- -- Val                                         
                                      -- -- Glu -- Ser                         
                                                      -- --                    
     CBMP-2B                                                                   
          10  Ser                                                              
                 Ile                                                           
                    -- -- Ala                                                  
                             -- -- Val                                         
                                      -- -- Glu -- Ser                         
                                                      -- --                    
     BMP3 26  Gly                                                              
                 Ile                                                           
                    -- Glu                                                     
                          -- -- -- Val                                         
                                      -- Glu                                   
                                            Lys Met                            
                                                   Ser                         
                                                      Ser                      
                                                         Leu                   
     GDF-1                                                                     
          14  Ala                                                              
                 Ala                                                           
                    Asp                                                        
                       Leu                                                     
                          -- -- -- Val                                         
                                      -- Ala                                   
                                            Arg -- Ser                         
                                                      Pro                      
                                                         --                    
     60A  25  Lys                                                              
                 -- -- -- -- -- -- -- -- -- Arg -- Gly                         
                                                      -- Leu                   
     BMP5 27  His                                                              
                 -- -- -- -- -- -- -- -- -- Lys -- -- -- --                    
     BMP6 28  Tyr                                                              
                 -- -- -- -- -- -- -- -- -- Lys -- -- -- --                    
                          65             70              75                    
     __________________________________________________________________________
     hOP-1                                                                     
          5   Ser                                                              
                 Val                                                           
                    Leu                                                        
                       Tyr                                                     
                          Phe                                                  
                             Asp                                               
                                Asp                                            
                                   Ser                                         
                                      Ser                                      
                                         Asn                                   
                                            Val Ile                            
                                                   Leu                         
                                                      Lys                      
                                                         Lys                   
     mOP-1                                                                     
          6   -- -- -- -- -- -- -- -- -- -- --  -- -- -- --                    
     hOP-2                                                                     
          7   -- -- -- -- Tyr                                                  
                             -- Ser                                            
                                   -- Asn                                      
                                         -- --  -- -- Arg                      
                                                         --                    
     mOP-2                                                                     
          8   -- -- -- -- -- -- Ser                                            
                                   -- Asn                                      
                                         -- --  -- -- Arg                      
                                                         --                    
     DPP  11  Ala                                                              
                 Met                                                           
                    -- -- Leu                                                  
                             Asn                                               
                                -- Gln                                         
                                      -- Thr                                   
                                            --  Val                            
                                                   -- -- Asn                   
     Vgl  12  -- Met                                                           
                    -- Phe                                                     
                          Tyr                                                  
                             -- Asn                                            
                                   Asn                                         
                                      Asp                                      
                                         -- --  Val                            
                                                   -- Arg                      
                                                         His                   
     Vgr-1                                                                     
          13  -- -- -- -- -- -- -- Asn                                         
                                      -- -- --  -- -- -- --                    
     CBMP-2A                                                                   
          9   -- Met                                                           
                    -- -- Leu                                                  
                             -- Glu                                            
                                   Asn                                         
                                      Glu                                      
                                         Lys                                   
                                            --  Val                            
                                                   -- -- Asn                   
     CBMP-2B                                                                   
          10  -- Met                                                           
                    -- -- Leu                                                  
                             -- Glu                                            
                                   Tyr                                         
                                      Asp                                      
                                         Lys                                   
                                            --  Val                            
                                                   -- -- Asn                   
     BMP3 26  -- Ile                                                           
                    -- Phe                                                     
                          Tyr                                                  
                             -- Glu                                            
                                   Asn                                         
                                      Lys                                      
                                         -- --  Val                            
                                                   -- -- Val                   
     GDF-1                                                                     
          14  -- -- -- Phe                                                     
                          -- -- Asn                                            
                                   -- Asp                                      
                                         -- --  Val                            
                                                   -- Arg                      
                                                         Gln                   
     60A  25  Pro                                                              
                 -- -- -- His                                                  
                             Leu                                               
                                Asn                                            
                                   Asp                                         
                                      Glu                                      
                                         -- --  Asn                            
                                                   -- -- --                    
     BMP5 27  -- -- -- -- -- -- -- -- -- -- --  -- -- -- --                    
     BMP6 28  -- -- -- -- -- -- -- Asn                                         
                                      -- -- --  -- -- -- --                    
                          80             85              90                    
     __________________________________________________________________________
     hOP-1                                                                     
          5   Tyr                                                              
                 Arg                                                           
                    Asn                                                        
                       Met                                                     
                          Val                                                  
                             Val                                               
                                Arg                                            
                                   Ala                                         
                                      Cys                                      
                                         Gly                                   
                                            Cys His                            
     mOP-1                                                                     
          6   -- -- -- -- -- -- -- -- -- -- --  --                             
     hOP-2                                                                     
          7   His                                                              
                 -- -- -- -- -- Lys                                            
                                   -- -- -- --  --                             
     mOP-2                                                                     
          8   His                                                              
                 -- -- -- -- -- Lys                                            
                                   -- -- -- --  --                             
     DPP  11  -- Gln                                                           
                    Glu                                                        
                       -- Thr                                                  
                             -- Val                                            
                                   Gly                                         
                                      -- -- --  Arg                            
     Vgl  12  -- Glu                                                           
                    -- -- Ala                                                  
                             -- Asp                                            
                                   Glu                                         
                                      -- -- --  Arg                            
     Vgr-1                                                                     
          13  -- -- -- -- -- -- -- -- -- -- --  --                             
     CBMP-2A                                                                   
          9   -- Gln                                                           
                    Asp                                                        
                       -- -- -- Glu                                            
                                   Gly                                         
                                      -- -- --  Arg                            
     CBMP-2B                                                                   
          10  -- Gln                                                           
                    Glu                                                        
                       -- -- -- Glu                                            
                                   Gly                                         
                                      -- -- --  Arg                            
     BMP3 26  -- Pro                                                           
                    -- -- Thr                                                  
                             -- Glu                                            
                                   Ser                                         
                                      -- Ala                                   
                                            --  Arg                            
     GDF-1                                                                     
          14  -- Glu                                                           
                    Asp                                                        
                       -- -- -- Asp                                            
                                   Glu                                         
                                      -- -- --  Arg                            
     60A  25  -- -- -- -- Ile                                                  
                             -- Lys                                            
                                   Ser                                         
                                      -- -- --  --                             
     BMP5 27  -- -- -- -- -- -- -- Ser                                         
                                      -- -- --  --                             
     BMP6 28  -- -- Trp                                                        
                       -- -- -- -- -- -- -- --  --                             
                          95             100    102                            
     __________________________________________________________________________
      Between residues 56 and 57 of BMP3 (Seq. ID No. 26) is a Val residue;    
      between residues 43 and 44 of GDF1 (Seq. ID No. 14) lies the amino acid  
      sequence Gly--Gly--Pro--Pro.

As is apparent from the foregoing amino acid sequence comparisons, significant amino acid changes can be made within the generic sequences while retaining the morphogenic activity. For example, while the GDF-1 protein sequence depicted in Table II shares only about 50% amino acid identity with the hOP1 sequence described therein, the GDF-1 sequence shares greater than 70% amino acid sequence homology (or "similarity") with the hOP1 sequence, where "homology" or "similarity" includes allowed conservative amino acid changes within the sequence as defined by Dayoff, et al., Atlas of Protein Sequence and Structure vol.5, supp.3, pp.345-362, (M. O. Dayoff, ed., Nat'l BioMed. Res. Fd'n, Washington D.C. 1979.)

The currently most preferred protein sequences detectable as morphogens in this invention include those having greater than 60% identity, preferably greater than 65% identity, with the amino acid sequence defining the conserved six cysteine skeleton of hOP1 (e.g., residues 43-139 of Seq. ID No. 5). These most preferred sequences include both allelic and species variants of the OP-1 and OP-2 proteins, including the Drosophila 60A protein. Accordingly, in still another preferred aspect, the invention includes detection of morphogens comprising species of polypeptide chains having the generic amino acid sequence referred to herein as "OPX", which defines the seven cysteine skeleton and accommodates the identities between the various identified mouse and human OP1 and OP2 proteins. OPX is presented in Seq. ID No. 29. As described therein, each Xaa at a given position independently is selected from the residues occurring at the corresponding position in the C-terminal sequence of mouse or human OP1 or OP2 (see Seq. ID Nos. 5-8 and/or Seq. ID Nos. 16-23).

The following sets forth various procedures for evaluating the in vivo morphogenic utility of the morphogens and morphogenic compositions of this invention. The proteins and compositions ay be injected or surgically implanted in a mammal, following any of a number of procedures well known in the art. For example, surgical implant bioassays may be performed essentially following the procedure of Sampath et al. (1983) PNAS 80:6591:6595.

Histological Evaluation

Histological sectioning and staining is preferred to determine the extent of morphogenesis in vivo, particularly in tissue repair procedures. Excised implants are fixed in Bouins Solution, embedded in paraffin, and cut into 6-8 .mu.m sections. Staining with toluidine blue or hemotoxylin/eosin demonstrates clearly the ultimate development of the new tissue. Twelve day implants are usually sufficient to determine whether the implants contain newly induced tissue.

Successful implants exhibit a controlled progression through the stages of induced tissue development allowing one to identify and follow the tissue-specific events that occur. For example, in endochondral bone formation the stages include: (1) leukocytes on day one; (2) mesenchymal cell migration and proliferation on days two and three; (3) chondrocyte appearance on days five and six; (4) cartilage matrix formation on day seven; (5) cartilage calcification on day eight; (6) vascular invasion, appearance of osteoblasts, and formation of new bone on days nine and ten; (7) appearance of osteoblastic and bone remodeling and dissolution of the implanted matrix on days twelve to eighteen; and (8) hematopoietic bone marrow differentiation in the ossicle on day twenty-one.

Biological Markers

In addition to histological evaluation, biological markers may be used as a marker for tissue morphogenesis. Useful markers include tissue-specific enzymes whose activities may be assayed (e.g., spectrophotometrically) after homogenization of the implant. These assays may be useful for quantitation and for obtaining an estimate of tissue formation quickly after the implants are removed from the animal. For example, alkaline phosphatase activity may be used as a marker for osteogenesis.

Incorporation of systemically provided morphogens may be followed using tagged morphogens (e.g., radioactively labelled) and determining their localization in new tissue, and/or by monitoring their disappearance from the circulatory system using a standard pulse-chase labeling protocol. The morphogen also may be provided with a tissue-specific molecular tag, whose uptake may be monitored and correlated with the concentration of morphogen provided. As an example, ovary removal in female rats results in reduced bone alkaline phosphatase activity, rendering the rats predisposed to osteoporosis. If the female rats now are provided with a morphogen, e.g., OP-1, a reduction in the systemic concentration of calcium (CA.sup.2 +) is seen, which correlates with the presence of the provided morphogen and can be shown to correspond to increased alkaline phosphatase activity.

The invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. The present embodiments are therefore to be considered in all respects as illustrative and not restrictive, the scope of the invention being indicated by the appended claims rather than by the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein.

3. Tissue-Specific Expression of OP-1

Once a morphogen is identified in a tissue, its level may be determined either at the protein or nucleic acid level. By comparing the levels of production of a given morphogen among different tissues, it is possible to determine the tissue(s) of origin of that morphogen. The level of production of the morphogen OP-1 in different tissues is one example of a morphogen having a tissue of origin, i.e., the kidney, which contains a cell type that can also be used as the cell type which is used to screen, according to the invention, different compounds for their potential effects on morphogen (OP-1) production.

The level of OP-1 varies among different tissue types. In order to screen compounds for their effect on the production of OP-1 by a given cell type, it may be desirable to determine which tissues produce levels of OP-1 which are sufficiently high to show a potential decrease and sufficiently low to show a potential increase in production. Different tissues may be screened at the RNA level as follows.

Any probe capable of hybridizing specifically to a transcript, and distinguishing the transcript of interest from other, related transcripts may be used. Because the morphogens to be detected in the methods of this invention share such high sequence homology in their C-terminal domain, the tissue distribution of a specific morphogen transcript may best be determined using a probe specific for the "pro" region of the immature protein and/or the N-terminal heterogeneous region of the mature protein. Another useful probe sequence is the 3' non-coding region immediately following the stop codon. These portions of the sequence vary substantially among the morphogens of this invention, and accordingly, are specific for each protein. For example, a particularly useful Vgr-1-specific probe sequence is the PvuII-SacI fragment, a 265 bp fragment encoding both a portion of the pro region and the N-terminus of the mature sequence. Similarly, particularly useful mOP-1-specific probe sequences are the BstXI-BglI fragment, a 0.68 kb sequence that covers approximately two-thirds of the mOP1 pro region; a StuI-StuI fragment, a 0.2 kb sequence immediately upstream of the 7-cysteine domain, and an EarI-PstI fragment, a 0.3 kb fragment containing the 3,untranslated sequence. Similar approaches may be used, for example, with hOP-1 (SEQ. ID NO.16) or human or mouse OP-2 (SEQ. ID NOS.20 and 22).

Using morphogen-specific oligonucleotides probes, morphogen transcripts can be identified in mammalian tissues, using standard methodologies well known to those having ordinary skill in the art. Briefly, total RNA from mouse embryos and organs from post-natal animals is prepared using the acid guanidine thiocyanate-phenol-chloroform method (Chomczynski et al., Anal. Biochem. 162:156-159, 1987). The RNA may be dissolved in TES buffer (10 mM Tris-HCl, 1 mM EDTA, 0.1% SDS, pH 7.5) and treated with Proteinase K (approx. 1.5 mg per g tissue sample) at 45.degree. C. for 1 hr. Poly(A).sup.+ RNA selection on oligo(dT)-cellulose (Type 7, Pharmacia LKB Biotechnology Inc., Piscataway, N.J.) may be done in a batch procedure by mixing 0.1 g oligo(dT)-cellulose with 11 ml RNA solution (from 1 g tissue) in TES buffer and 0.5M NaCl). Thereafter the oligo(dT) cellulose is washed in binding buffer (0.5M NaCl, 10 mM Tris-HCl, 1 mM EDTA, pH 7.5) and poly(A).sup.+ RNA is eluted with water. Poly(A).sup.+ RNA (5 or 15 .mu.g/lane) is fractionated on 1 or 1.2% agarose-formaldehyde gels (Selden, in Current Protocols in Molecular Biology, Ausubel et al. eds., pp. 1-4, 8, 9, Greene Publishing and Wiley-Interscience, New York, 1991). 1 .mu.l of 400 .mu.g/ml ethidium bromide is added to each sample prior to heat denaturation (Rosen et al., Focus 12:23-24, 1990). Following electrophoresis, the gels are photographed and the RNA is blotted overnight onto Nytran nitrocellulose membranes (Schleicher & Schuell Inc., Keene, N.H.) with 10.times.SSC. The membranes are baked at 80.degree. C. for 30-60 min. and irradiated with UV light (1 mW/cm.sup.2 for 25 sec.). The Northern hybridization conditions may be as previously described (Ozkaynak et al., EMBO J. 9:2085-2093, 1990). For re-use, the filters may be deprobed in 1 mM Tris-HCl, 1 mM EDTA, 0.1% SDS, pH 7.5, at 90.degree.-95.degree. C. and exposed to film to assure complete removal of previous hybridization signals.

One probe which may be used to screen for transcripts encoding a morphogen includes a portion of or the complete OP-1 cDNA, which may be used to detect the presence of OP-1 mRNA or mRNAs of related morphogens. The sequence of the murine cDNA gene is set forth in SEQ ID NO:14.

OP-1 mRNA expression was analyzed in 17 day mouse embryos and 3 day post-natal mice by sequentially hybridizing filters with various probes. Probes from regions other than the highly conserved 7-cysteine domain were selected because this region is highly variable among members of the TGF-.beta. superfamily. FIG. 1 shows the fragments of OP-1, used as probes in the Northern hybridizations. The solid box indicates the putative signal peptide and the hatched box corresponds to the TGF-.beta.-like domain that contains the seven cysteine residues. Asterisks indicate the potential N-glycosylation sites. The arrow marks the location of the cleavage site for OP-1 maturation. Three solid bars below the diagram indicate the OP-1 specific fragments used in making .sup.32 P-labeled probes (0.68 kb BstXI - BglI fragment, 0.20 kb StuI - StuI fragment and 0.34 kb EarI - PstI non-coding fragment).

Hybridization with a probe that covers approximately two thirds of the pro region (the 0.68 kb BstXI-BglI fragment), reveals a 4 kb message and 3 messages at 1.8 kb, 2.2 kb and 2.4 kb (FIG. 2B and D, and FIG. 3). In the Northern hybridization of FIG. 2, equal amounts (15 .mu.g) of poly(A).sup.+ RNA were loaded into each lane, electrophoresed on a 1% agarose-formaldehyde gel, blotted and hybridized. A 0.24-9.49 kb RNA ladder (Bethesda Research Labs, Inc.) was used as size standard. The same filter was used for sequential hybridizations with labeled probes specific for OP-1 (Panels B and D), Vgr-1 (Panel C), and EF-Tu (Panel A). Panel A: the EF-Tu specific probe (a control) was the 0.4 kb HindIII-SacI fragment (part of the coding region), the SacI site used belonged to the vector; Panel B: the OP-1 specific probe was the 0.68 kb BstXI-BglI fragment (two thirds of the pro region and upstream sequences of the mature domain, not including any sequences from the 7-cysteine domain); Panel C: the Vgr-1 specific probe was the 0.26 kb PvuII-SacI fragment (part of the pro region and the amino-terminal sequences of the mature domain, including the first cysteine) (Lyons et al., 1989, Proc. Nat. Aca. Sci. 86:4554, hereby incorporated by reference). Panel D: the OP-1 (3' flanking) specific probe was the 0.34 kb EarI-PstI fragment (3' untranslated sequences immediately following the sequences encoding OP-1).

In FIG. 3, the tissues to be used for RNA preparation were obtained from two week old mice (Panel A) or 5 week old mice (Panel B), with the exception of poly A+ RNA which was obtained from kidney adrenal gland of two week old mice (Panel B). Equal amounts of poly A+ RNA (15 .mu.g for Panel A and 5 .mu.g for Panel B) were loaded into each well. After electrophoresis (1.2% agarose-formaldehyde gels) and blotting, RNA was hybridized to the OP-1 specific 3' flanking probe described in the legend of FIG. 2 (Panel D). The 0.24-9.5 kb RNA ladder was used as size standard. The arrowheads indicate the OP-1 specific messages. The lower section of Panels A and B show the hybridization pattern obtained with the EF-Tu specific probe (a control).

Although the size of the Vgr-1 specific message is close to the 4 kb OP-1 species (FIG. 2 Panel C), the OP-1 4 kb mRNA is somewhat larger. To further rule out cross-hybridization with a non-OP-1 message, the 0.2 kb StuI--StuI fragment which represents the gene specific sequences immediately upstream of those encoding the 7-cysteine domain was used. This probe gave a hybridization pattern similar to the one shown in FIG. 2 Panel B (data not shown). A third probe, the 0.34 kb EarI-PstI fragment containing 3' untranslated sequences, also confirmed the pattern (FIG. 2 Panel D). Thus, the same four OP-1 specific messages were observed with three distinct probes.

The appearance of a new 4 kb OP-1 mRNA species was initially interpreted as cross hybridization of the OP-1 probe with Vgr-1 mRNA, which is approximately this size (FIG. 2 Panel C). However, the 4 kb message was detected with three different OP-1 specific probes, including one specific to the 3' untranslated region, and moreover it was separated from Vgr-1 message on the basis of size. Most likely, therefore, the 4 kb mRNA (and the three species of 1.8 kb, 2.2 kb and 2.4 kb) results from alternative splicing of OP-1 transcripts. The 4 kb OP-1 mRNA could also represent a bicistronic mRNA. The 4 kb message is a minor species in kidney, while it is more prominent in adrenal tissue.

The level of OP-1 expression was compared in different tissues using poly(A).sup.+ RNA prepared from brain, spleen, lung, kidney and adrenal gland, heart, and liver of 13 day post-natal mice. The RNA was analyzed on Northern blots by hybridization to various probes (FIG. 3. Equal amounts of mRNA, as judged by optical density, were fractionated on agarose formaldehyde gels. Ethidium bromide staining of the gels revealed some residual ribosomal RNA in addition to the mRNA and provided another assurance that the mRNA was not degraded and that there was not significant quantitative or qualitative variation in the preparation. As control for mRNA recovery, EF-Tu (translational elongation factor) mRNA was probed (assuming uniform expression of EF-Tu in most tissues). A great variation in the level of OP-1 expression was observed in spleen, lung, kidney and adrenal tissues whereas EF-Tu mRNA levels appeared relatively constant in these tissues (FIG. 3 Panel A). The highest level of OP-1 mRNA was found in the kidneys. Uniformly lower levels of EF-Tu mRNA were found in brain, heart and liver (FIG. 3 Panel A). Additional analysis of OP-1 mRNA showed the presence of significant amounts of OP-1 mRNA in the bladder (data not shown). In summary, next to kidney, bladder and adrenal tissue, brain tissue contained the highest levels of OP-1 RNA, whereas heart and liver did not give detectable signals.

OP-1 mRNA patterns display qualitative changes in the various tissues. Of the four messages found in brain, the 2.2 kb message is most abundant whereas in lung and spleen the 1.8 kb message predominates. Levels of the 1.8-2.4 kb in the kidney OP-1 mRNA are approximately two times higher in 3 day post-natal mice than in 17 day embryos, perhaps reflecting phases in bone and/or kidney development. mRNA was also prepared from carefully separated renal and adrenal tissues of 5 week old mice. Northern blot analysis (FIG. 3, Panel B) revealed that the high levels of 2.2 kb mRNA were derived from renal tissue whereas the 4 kb mRNA was more prominent in adrenal tissue.

The detection of OP-1 message primarily in the kidney but also in bladder links OP-1 expression specifically with the urinary tract. Interestingly, the related Vgr-1 is also expressed at significant levels in kidney although its main site of expression in lung.

Once the tissue-specific expression of a given morphogen is known, cell types known to exist in that tissue or cell lines derived from that tissue can be screened, in a similar manner, to identify the cell type within that tissue that is actually responsible for the tissue specific synthesis and secretion of the morphogen. Once a cell type which produces the morphogen in an amount sufficient to detect increases or decreases in the production level of the morphogen upon exposure to a compound is identified, it may be used in tissue culture assay to rapidly screen for the ability of compound to upregulate or down regulate the synthesis and secretion of the morphogen. The level of morphogen production by the chosen cell type is determined with and without incubating the cell in culture with the compound, in order to assess the effects of the compound on the cell's ability to synthesize or secrete the morphogen. This can be accomplished by detection of the level of production of the morphogen either at the protein or mRNA level.

4. Growth of Cells in Culture

Cell cultures derived from kidney, adrenals, urinary bladder, brain, or other organs, may be prepared as described widely in the literature. For example, kidneys may be explanted from neonatal, new born, young or adult rodents (mouse or rat) and used in organ culture as whole or sliced (1-4 mm) tissues. Primary tissue cultures and established cell lines, also derived from kidney, adrenals, urinary, bladder, brain, or other tissues may be established in multiwell plates (6 well, 24 well, or 96 well) according to conventional cell culture techniques, and are cultured in the absence or presence of serum for a period of time (1-7 days). Cells may be cultured, for example, in Dulbecco's Modified Eagle medium (Gibco, Long Island, N.Y.) containing serum (e.g., fetal calf serum at 1%-10%, Gibco) or in serum-deprived medium, as desired, or in defined medium (e.g., containing insulin, transferrin, glucose, albumin, or other growth factors).

Samples for testing the level of morphogen production include culture supernatants or cell lysates, collected periodically and evaluated for OP-1 production by immunoblot analysis of a portion of the cell culture itself, collected periodically and used to prepare polyA+ RNA for RNA analysis (Sambrook et al., eds., Molecular Cloning, 1989, Cold Spring Harbor Press, Cold Spring Harbor, N.Y.). To monitor de novo OP-1 synthesis, some cultures are labeled with .sup.35 S-methionine/.sup.35 S-cysteine mixture for 6-24 hours and then evaluated for morphogen production by conventional immunoprecipitation methods (Sambrook et al., eds., Molecular Cloning, 1989, Cold Spring Harbor Press, Cold Spring Harbor, N.Y.). Alternatively, the production of morphogen or determination of the level of morphogen production may be ascertained using a simple assay for a parameter of cell growth, e.g., cellular proliferation or death. For example, where a morphogen is produced by a cultured cell line, the addition of antibody specific for the morphogen may result in relief from morphogen inhibition of cell growth. Thus, measurement of cellular proliferation can be used as an indication of morphogen production by a tissue.

5. Determination of Level of Morphogenic Protein

In order to quantitate the production of a morphogenic protein by a cell type, an immunoassay may be performed to detect the morphogen using a polyclonal or monoclonal antibody specific for that morphogen. For example, OP-1 may be detected using a polyclonal antibody specific for OP-1 in an ELISA, as follows.

1 .mu.g/100 ul of affinity-purified polyclonal rabbit IgG specific for OP-1 is added to each well of a 96-well plate and incubated at 37.degree. C. for an hour. The wells are washed four times with 0.16M sodium borate buffer with 0.15M NaCl (BSB), pH 8.2, containing 0.1% Tween 20. To minimize non-specific binding, the wells are blocked by filling completely with 1% bovine serum albumin (BSA) in BSB for 1 hour at 37.degree. C. The wells are then washed four times with BSB containing 0.1% Tween 20. A 100 ul aliquot of an appropriate dilution of each of the test samples of cell culture supernatant is added to each well in triplicate and incubated at 37.degree. C. for 30 min. After incubation, 100 ul biotinylated rabbit anti-OP-1 serum (stock solution is about 1 mg/ml and diluted 1:400 in BSB containing 1% BSA before use) is added to each well and incubated at 37.degree. C. for 30 min. The wells are then washed four times with BSB containing 0.1% Tween 20. 100 ul strepavidin-alkaline (Southern Biotechnology Associates, Inc. Birmingham, Ala., diluted 1:2000 in BSB containing 0.1% Tween 20 before use) is added to each well and incubated at 37.degree. C. for 30 min. The plates are washed four times with 0.5M Tris buffered Saline (TBS), pH 7.2. 50 ul substrate (ELISA Amplification System Kit, Life Technologies, Inc., Bethesda, Md.) are added to each well incubated at room temperature for 15 min. Then, 50 ul amplifier (from the same amplification system kit) is added and incubated for another 15 min at room temperature. The reaction is stopped by the addition of 50 ul 0.3M sulphuric acid. The OD at 490 nm of the solution in each well is recorded. To quantitate OP-1 in culture media, a OP-1 standard curve is performed in parallel with the test samples.

6. Preparation of Polyclonal Antibody

Polyclonal antibody is prepared as follows. Each rabbit is given a primary immunization of 100 ug/500 ul E. coli-produced OP-1 monomer (amino acids 328-431 of SEQ. ID NO:11) in 0.1% SDS mixed with 500 ul Complete Freund's Adjuvant. The antigen is injected subcutaneously at multiple sites on the back and flanks of the animal. The rabbit is boosted after a month in the same manner using incomplete Freund's Adjuvant. Test bleeds are taken from the ear vein seven days later. Two additional boosts and test bleeds are performed at monthly intervals until antibody against OP-1 is detected in the serum using an ELISA assay. Then, the rabbit is boosted monthly with 100 ug of antigen and bled (15 ml per bleed) at days seven and ten after boosting.

7. Preparation of Monoclonal Antibody and Neutralizing Monoclonal Antibody

Monoclonal antibody specific for a given morphogen may be prepared as follows. A mouse is given two injections of E. coli produced OP-1 monomer (amino acids 328-431 in SEQ ID NO:11). The first injection contains 100 ug of OP-1 in complete Freund's adjuvant and is given subcutaneously. The second injection contains 50 ug of OP-1 in incomplete adjuvant and is given intraperitoneally. The mouse then receives a total of 230 ug of OP-1 (amino acids 307-431 of SEQ ID NO:11) in four intraperitoneal injections at various times over an eight month period. One week prior to fusion, The mouse is boosted intraperitoneally with 100 ug of OP-1 (15-139) and 30 ug of the N-terminal peptide (Ser293-Asn3O9-Cys) conjugated through the added cys residue to bovine serum albumin with SMCC crosslinking agent. This boost is repeated five days (IP), four days (IP), three days (IP) and one day (IV) prior to fusion. The mouse spleen cells are then fused to myeloma (e.g., 653) cells at a ratio of 1:1 using PEG 1500 (Boehringer Mannheim), and the cell fusion is plated and screened for OP-1-specific antibodies using OP-1 (307-431) as antigen. The cell fusion and monoclonal screening are according to procedures widely available in the art. The neutralizing monoclonal is identified by its ability to block the biological activity of OP-1 when added to a cellular assay which responds biologically to added OP-1.

8. Identification of OP-1 Producing Cell Line Which Displays OP-1 Surface Receptors

During the process of routinely testing the effects of increasing concentrations of OP-1 and TGF-.beta. on the proliferation of various cell lines, a cell line was identified which, surprising, appears not only to synthesize and secrete OP-1, but also to display cell surface receptors to which the secreted OP-1 binds and acts to inhibit proliferation of the cells. This cell line was identified after the following observations. Addition of increasing concentrations of OP-1 or TGF-.beta. failed to increase or decrease the relatively low basal rate of proliferation of the cells. However, addition of a monoclonal antibody, which neutralizes the activity of Op-1, resulted in a large increase in the proliferation of the cells. In addition, simultaneous addition of the same quantity of OP-1 neutralizing monoclonal to a fixed amount of OP-1 resulted in an increase in proliferation which was intermediate between the low basal level observed with OP-1 alone and the high level observed with the monoclonal alone. This cell line, which is an epithelial cell line that was derived from a bladder cell carcinoma, may be used in an assay of the invention. The parameter to be tested according to the invention is cellular proliferation. Thus, a compound(s) that increases or decreases the level of OP-1 production may be tested on this cell line as follows.

9. Assay for Identifying Drugs Which Affect OP-1 Synthesis

A simple medium flux screening assay can be configured in a standard 24 or 96 well microtiter dishe, in which each well contains a constant number of a cell line having the characteristics described above. Increasing concentrations of an OP-1 neutralizing monoclonal antibody is added from left to right across the dish. A constant amount of different test substances is added from top to bottom on the dish. An increase in the synthesis and secretion of OP-1 (over its constitutive (non-induced) level) will be indicated by an increase in the amount of OP-1 neutralizing antibody required to release the cells from the antimitogenic activity of OP-1. A decrease in the synthesis and secretion of OP-1 (below its constitutive (repressed) level) will be indicated by the observation that decreased concentrations of the OP-1 neutralizing monoclonal antibody will be required to release the cells from the antimitogenic activity of OP-1. One of the major advantages of this assay is that the end point, i.e., the dilution of antibody which has an effect on cell proliferation, is a measure of mitosis, or an increase in the number of cells per well. Because several convenient and rapid assays exist for quantitating cell numbers, this assay is faster and requires significantly fewer steps to perform.

The assay may be performed as follows. After addition of appropriate concentrations of the OP-1 neutralizing monoclonal antibody and test substances to the wells containing the cells, the dishes are placed in an incubator at 37.degree. C. for a period of 1-3 days. After completion of incubation/growth period, the dishes are removed and the cells in the individual wells are washed and stained with a vital stain, such as crystal violet. Washing and staining procedures are well-known in the art. The cells are then lysed and the stain dissolved in a constant amount of a solvent, such as ethanol. Quantitations of the dissolved stain, which is readily performed on an automated plate vendor, allows for direct quantitation of the number of cells in each well.

The above-described assay has the advantages of being rapid and easy to perform because it requires few steps. Another advantage is intrinsic to the assay; drugs which are screened according to this procedure that result in cell death (i.e., cytotoxic substances) are immediately, identifiable without the need of operator observation. In addition, although drugs that stop the growth of the cells (i.e., cytostatic substances) are scored as positive due to failure to see increases in cell numbers, they are automatically scored as suspect due to the failure of the highest concentrations of OP-1 neutralizing monoclonal antibody to release the cells from the antimitogenic activity of OP-1.

10. Candidate Drugs to Screen

The screening methods of the invention is used to test compounds for their effect on the production of morphogenic protein by a given cell type. Examples of compounds which may be screened include but are not limited to chemicals, biological response modifiers (e.g., lymphokines, cytokines, hormones, or vitamins), plant extracts, microbial broths and extracts medium conditioned by eukaryotic cells, body fluids, or tissue extracts.

The invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. The present embodiments are therefore to be considered in all respects as illustrative and not restrictive, the scope of the invention being indicated by the appended claims rather than by the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein.

  __________________________________________________________________________
     SEQUENCE LISTING                                                          
     (1) GENERAL INFORMATION:                                                  
     (iii) NUMBER OF SEQUENCES: 33                                             
     (2) INFORMATION FOR SEQ ID NO:1:                                          
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 97 amino acids                                                
     (B) TYPE: amino acid                                                      
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (ix) FEATURE:                                                             
     (A) NAME/KEY: Protein                                                     
     (B) LOCATION: 1..97                                                       
     (D) OTHER INFORMATION: /label=GENERIC-SEQ-1                               
     /note= "EACH XAA INDICATES ONE OF THE 20 NATURALLY                        
     OCCURRING L- ISOMER, ALPHA-AMINO ACIDS, OR A DERIVATIVE                   
     THEREOF"                                                                  
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                                   
     XaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaa                          
     151015                                                                    
     XaaXaaXaaXaaXaaXaaXaaXaaCysXaaXaaXaaCysXaaXaaXaa                          
     202530                                                                    
     XaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaa                          
     354045                                                                    
     XaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaCysCysXaaXaa                          
     505560                                                                    
     XaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaa                          
     65707580                                                                  
     XaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaCysXaaCys                          
     859095                                                                    
     Xaa                                                                       
     (2) INFORMATION FOR SEQ ID NO:2:                                          
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 97 amino acids                                                
     (B) TYPE: amino acid                                                      
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (ix) FEATURE:                                                             
     (A) NAME/KEY: Protein                                                     
     (B) LOCATION: 1..97                                                       
     (D) OTHER INFORMATION: /label=GENERIC-SEQ-2                               
     /note= "EACH XAA INDICATES ONE OF THE 20 NATURALLY                        
     OCCURRING L- ISOMER, ALPHA-AMINO ACIDS, OR A DERIVATIVE                   
     THEREOF"                                                                  
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                                   
     XaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaa                          
     151015                                                                    
     XaaXaaXaaXaaXaaXaaXaaXaaCysXaaXaaXaaCysXaaXaaXaa                          
     202530                                                                    
     XaaXaaXaaCysXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaa                          
     354045                                                                    
     XaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaCysCysXaaXaa                          
     505560                                                                    
     XaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaa                          
     65707580                                                                  
     XaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaCysXaaCys                          
     859095                                                                    
     Xaa                                                                       
     (2) INFORMATION FOR SEQ ID NO:3:                                          
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 97 amino acids                                                
     (B) TYPE: amino acid                                                      
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (ix) FEATURE:                                                             
     (A) NAME/KEY: Protein                                                     
     (B) LOCATION: 1..97                                                       
     (D) OTHER INFORMATION: /label=GENERIC-SEQ-3                               
     /note= "WHEREIN EACH XAA IS INDEPENDENTLY SELECTED FROM A                 
     GROUP OF ONE OR MORE SPECIFIED AMINO ACIDS AS DEFINED IN                  
     THE SPECIFICATION "                                                       
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                                   
     LeuTyrValXaaPheXaaXaaXaaGlyTrpXaaXaaTrpXaaXaaAla                          
     151015                                                                    
     ProXaaGlyXaaXaaAlaXaaTyrCysXaaGlyXaaCysXaaXaaPro                          
     202530                                                                    
     XaaXaaXaaXaaXaaXaaXaaXaaAsnHisAlaXaaXaaXaaXaaLeu                          
     354045                                                                    
     XaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaCysCysXaaPro                          
     505560                                                                    
     XaaXaaXaaXaaXaaXaaXaaXaaLeuXaaXaaXaaXaaXaaXaaXaa                          
     65707580                                                                  
     ValXaaLeuXaaXaaXaaXaaXaaMetXaaValXaaXaaCysGlyCys                          
     859095                                                                    
     Xaa                                                                       
     (2) INFORMATION FOR SEQ ID NO:4:                                          
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 102 amino acids                                               
     (B) TYPE: amino acid                                                      
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (ix) FEATURE:                                                             
     (A) NAME/KEY: Protein                                                     
     (B) LOCATION: 1..102                                                      
     (D) OTHER INFORMATION: /label=GENERIC-SEQ-4                               
     /note= "WHEREIN EACH XAA IS INDEPENDENTLY SELECTED FROM A                 
     GROUP OF ONE OR MORE SPECIFIED AMINO ACIDS AS DEFINED IN                  
     THE SPECIFICATION"                                                        
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:                                   
     CysXaaXaaXaaXaaLeuTyrValXaaPheXaaXaaXaaGlyTrpXaa                          
     151015                                                                    
     XaaTrpXaaXaaAlaProXaaGlyXaaXaaAlaXaaTyrCysXaaGly                          
     202530                                                                    
     XaaCysXaaXaaProXaaXaaXaaXaaXaaXaaXaaXaaAsnHisAla                          
     354045                                                                    
     XaaXaaXaaXaaLeuXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaa                          
     505560                                                                    
     XaaCysCysXaaProXaaXaaXaaXaaXaaXaaXaaXaaLeuXaaXaa                          
     65707580                                                                  
     XaaXaaXaaXaaXaaValXaaLeuXaaXaaXaaXaaXaaMetXaaVal                          
     859095                                                                    
     XaaXaaCysGlyCysXaa                                                        
     100                                                                       
     (2) INFORMATION FOR SEQ ID NO:5:                                          
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 139 amino acids                                               
     (B) TYPE: amino acid                                                      
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (ix) FEATURE:                                                             
     (A) NAME/KEY: Protein                                                     
     (B) LOCATION: 1..139                                                      
     (D) OTHER INFORMATION: /note= "HOP-1 (MATURE FORM)"                       
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:                                   
     SerThrGlySerLysGlnArgSerGlnAsnArgSerLysThrProLys                          
     151015                                                                    
     AsnGlnGluAlaLeuArgMetAlaAsnValAlaGluAsnSerSerSer                          
     202530                                                                    
     AspGlnArgGlnAlaCysLysLysHisGluLeuTyrValSerPheArg                          
     354045                                                                    
     AspLeuGlyTrpGlnAspTrpIleIleAlaProGluGlyTyrAlaAla                          
     505560                                                                    
     TyrTyrCysGluGlyGluCysAlaPheProLeuAsnSerTyrMetAsn                          
     65707580                                                                  
     AlaThrAsnHisAlaIleValGlnThrLeuValHisPheIleAsnPro                          
     859095                                                                    
     GluThrValProLysProCysCysAlaProThrGlnLeuAsnAlaIle                          
     100105110                                                                 
     SerValLeuTyrPheAspAspSerSerAsnValIleLeuLysLysTyr                          
     115120125                                                                 
     ArgAsnMetValValArgAlaCysGlyCysHis                                         
     130135                                                                    
     (2) INFORMATION FOR SEQ ID NO:6:                                          
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 139 amino acids                                               
     (B) TYPE: amino acid                                                      
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (ix) FEATURE:                                                             
     (A) NAME/KEY: Protein                                                     
     (B) LOCATION: 1..139                                                      
     (D) OTHER INFORMATION: /note= "MOP-1 (MATURE FORM)"                       
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:                                   
     SerThrGlyGlyLysGlnArgSerGlnAsnArgSerLysThrProLys                          
     151015                                                                    
     AsnGlnGluAlaLeuArgMetAlaSerValAlaGluAsnSerSerSer                          
     202530                                                                    
     AspGlnArgGlnAlaCysLysLysHisGluLeuTyrValSerPheArg                          
     354045                                                                    
     AspLeuGlyTrpGlnAspTrpIleIleAlaProGluGlyTyrAlaAla                          
     505560                                                                    
     TyrTyrCysGluGlyGluCysAlaPheProLeuAsnSerTyrMetAsn                          
     65707580                                                                  
     AlaThrAsnHisAlaIleValGlnThrLeuValHisPheIleAsnPro                          
     859095                                                                    
     AspThrValProLysProCysCysAlaProThrGlnLeuAsnAlaIle                          
     100105110                                                                 
     SerValLeuTyrPheAspAspSerSerAsnValIleLeuLysLysTyr                          
     115120125                                                                 
     ArgAsnMetValValArgAlaCysGlyCysHis                                         
     130135                                                                    
     (2) INFORMATION FOR SEQ ID NO:7:                                          
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 139 amino acids                                               
     (B) TYPE: amino acid                                                      
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (ix) FEATURE:                                                             
     (A) NAME/KEY: Protein                                                     
     (B) LOCATION: 1..139                                                      
     (D) OTHER INFORMATION: /note= "HOP-2 (MATURE FORM)"                       
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:                                   
     AlaValArgProLeuArgArgArgGlnProLysLysSerAsnGluLeu                          
     151015                                                                    
     ProGlnAlaAsnArgLeuProGlyIlePheAspAspValHisGlySer                          
     202530                                                                    
     HisGlyArgGlnValCysArgArgHisGluLeuTyrValSerPheGln                          
     354045                                                                    
     AspLeuGlyTrpLeuAspTrpValIleAlaProGlnGlyTyrSerAla                          
     505560                                                                    
     TyrTyrCysGluGlyGluCysSerPheProLeuAspSerCysMetAsn                          
     65707580                                                                  
     AlaThrAsnHisAlaIleLeuGlnSerLeuValHisLeuMetLysPro                          
     859095                                                                    
     AsnAlaValProLysAlaCysCysAlaProThrLysLeuSerAlaThr                          
     100105110                                                                 
     SerValLeuTyrTyrAspSerSerAsnAsnValIleLeuArgLysHis                          
     115120125                                                                 
     ArgAsnMetValValLysAlaCysGlyCysHis                                         
     130135                                                                    
     (2) INFORMATION FOR SEQ ID NO:8:                                          
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 139 amino acids                                               
     (B) TYPE: amino acid                                                      
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (ix) FEATURE:                                                             
     (A) NAME/KEY: Protein                                                     
     (B) LOCATION: 1..139                                                      
     (D) OTHER INFORMATION: /note= "MOP-2 (MATURE FORM)"                       
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:                                   
     AlaAlaArgProLeuLysArgArgGlnProLysLysThrAsnGluLeu                          
     151015                                                                    
     ProHisProAsnLysLeuProGlyIlePheAspAspGlyHisGlySer                          
     202530                                                                    
     ArgGlyArgGluValCysArgArgHisGluLeuTyrValSerPheArg                          
     354045                                                                    
     AspLeuGlyTrpLeuAspTrpValIleAlaProGlnGlyTyrSerAla                          
     505560                                                                    
     TyrTyrCysGluGlyGluCysAlaPheProLeuAspSerCysMetAsn                          
     65707580                                                                  
     AlaThrAsnHisAlaIleLeuGlnSerLeuValHisLeuMetLysPro                          
     859095                                                                    
     AspValValProLysAlaCysCysAlaProThrLysLeuSerAlaThr                          
     100105110                                                                 
     SerValLeuTyrTyrAspSerSerAsnAsnValIleLeuArgLysHis                          
     115120125                                                                 
     ArgAsnMetValValLysAlaCysGlyCysHis                                         
     130135                                                                    
     (2) INFORMATION FOR SEQ ID NO:9:                                          
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 101 amino acids                                               
     (B) TYPE: amino acid                                                      
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (ix) FEATURE:                                                             
     (A) NAME/KEY: Protein                                                     
     (B) LOCATION: 1..101                                                      
     (D) OTHER INFORMATION: /note= "CBMP-2A(FX)"                               
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:                                   
     CysLysArgHisProLeuTyrValAspPheSerAspValGlyTrpAsn                          
     151015                                                                    
     AspTrpIleValAlaProProGlyTyrHisAlaPheTyrCysHisGly                          
     202530                                                                    
     GluCysProPheProLeuAlaAspHisLeuAsnSerThrAsnHisAla                          
     354045                                                                    
     IleValGlnThrLeuValAsnSerValAsnSerLysIleProLysAla                          
     505560                                                                    
     CysCysValProThrGluLeuSerAlaIleSerMetLeuTyrLeuAsp                          
     65707580                                                                  
     GluAsnGluLysValValLeuLysAsnTyrGlnAspMetValValGlu                          
     859095                                                                    
     GlyCysGlyCysArg                                                           
     100                                                                       
     (2) INFORMATION FOR SEQ ID NO:10:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 101 amino acids                                               
     (B) TYPE: amino acid                                                      
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (ix) FEATURE:                                                             
     (A) NAME/KEY: Protein                                                     
     (B) LOCATION: 1..101                                                      
     (D) OTHER INFORMATION: /note= "CBMP-2B(FX)"                               
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:                                  
     CysArgArgHisSerLeuTyrValAspPheSerAspValGlyTrpAsn                          
     151015                                                                    
     AspTrpIleValAlaProProGlyTyrGlnAlaPheTyrCysHisGly                          
     202530                                                                    
     AspCysProPheProLeuAlaAspHisLeuAsnSerThrAsnHisAla                          
     354045                                                                    
     IleValGlnThrLeuValAsnSerValAsnSerSerIleProLysAla                          
     505560                                                                    
     CysCysValProThrGluLeuSerAlaIleSerMetLeuTyrLeuAsp                          
     65707580                                                                  
     GluTyrAspLysValValLeuLysAsnTyrGlnGluMetValValGlu                          
     859095                                                                    
     GlyCysGlyCysArg                                                           
     100                                                                       
     (2) INFORMATION FOR SEQ ID NO:11:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 102 amino acids                                               
     (B) TYPE: amino acid                                                      
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (ix) FEATURE:                                                             
     (A) NAME/KEY: Protein                                                     
     (B) LOCATION: 1..102                                                      
     (D) OTHER INFORMATION: /note= "DPP(FX)"                                   
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:                                  
     CysArgArgHisSerLeuTyrValAspPheSerAspValGlyTrpAsp                          
     151015                                                                    
     AspTrpIleValAlaProLeuGlyTyrAspAlaTyrTyrCysHisGly                          
     202530                                                                    
     LysCysProPheProLeuAlaAspHisPheAsnSerThrAsnHisAla                          
     354045                                                                    
     ValValGlnThrLeuValAsnAsnAsnAsnProGlyLysValProLys                          
     505560                                                                    
     AlaCysCysValProThrGlnLeuAspSerValAlaMetLeuTyrLeu                          
     65707580                                                                  
     AsnAspGlnSerThrValValLeuLysAsnTyrGlnGluMetThrVal                          
     859095                                                                    
     ValGlyCysGlyCysArg                                                        
     100                                                                       
     (2) INFORMATION FOR SEQ ID NO:12:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 102 amino acids                                               
     (B) TYPE: amino acid                                                      
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (ix) FEATURE:                                                             
     (A) NAME/KEY: Protein                                                     
     (B) LOCATION: 1..102                                                      
     (D) OTHER INFORMATION: /note= "VGL(FX)"                                   
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:                                  
     CysLysLysArgHisLeuTyrValGluPheLysAspValGlyTrpGln                          
     151015                                                                    
     AsnTrpValIleAlaProGlnGlyTyrMetAlaAsnTyrCysTyrGly                          
     202530                                                                    
     GluCysProTyrProLeuThrGluIleLeuAsnGlySerAsnHisAla                          
     354045                                                                    
     IleLeuGlnThrLeuValHisSerIleGluProGluAspIleProLeu                          
     505560                                                                    
     ProCysCysValProThrLysMetSerProIleSerMetLeuPheTyr                          
     65707580                                                                  
     AspAsnAsnAspAsnValValLeuArgHisTyrGluAsnMetAlaVal                          
     859095                                                                    
     AspGluCysGlyCysArg                                                        
     100                                                                       
     (2) INFORMATION FOR SEQ ID NO:13:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 102 amino acids                                               
     (B) TYPE: amino acid                                                      
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (ix) FEATURE:                                                             
     (A) NAME/KEY: Protein                                                     
     (B) LOCATION: 1..102                                                      
     (D) OTHER INFORMATION: /note= "VGR-1(FX)"                                 
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:                                  
     CysLysLysHisGluLeuTyrValSerPheGlnAspValGlyTrpGln                          
     151015                                                                    
     AspTrpIleIleAlaProLysGlyTyrAlaAlaAsnTyrCysAspGly                          
     202530                                                                    
     GluCysSerPheProLeuAsnAlaHisMetAsnAlaThrAsnHisAla                          
     354045                                                                    
     IleValGlnThrLeuValHisValMetAsnProGluTyrValProLys                          
     505560                                                                    
     ProCysCysAlaProThrLysValAsnAlaIleSerValLeuTyrPhe                          
     65707580                                                                  
     AspAspAsnSerAsnValIleLeuLysLysTyrArgAsnMetValVal                          
     859095                                                                    
     ArgAlaCysGlyCysHis                                                        
     100                                                                       
     (2) INFORMATION FOR SEQ ID NO:14:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 106 amino acids                                               
     (B) TYPE: amino acid                                                      
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (ix) FEATURE:                                                             
     (A) NAME/KEY: Protein                                                     
     (B) LOCATION: 1..106                                                      
     (D) OTHER INFORMATION: /note= "GDF-1 (FX)"                                
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:                                  
     CysArgAlaArgArgLeuTyrValSerPheArgGluValGlyTrpHis                          
     151015                                                                    
     ArgTrpValIleAlaProArgGlyPheLeuAlaAsnTyrCysGlnGly                          
     202530                                                                    
     GlnCysAlaLeuProValAlaLeuSerGlySerGlyGlyProProAla                          
     354045                                                                    
     LeuAsnHisAlaValLeuArgAlaLeuMetHisAlaAlaAlaProGly                          
     505560                                                                    
     AlaAlaAspLeuProCysCysValProAlaArgLeuSerProIleSer                          
     65707580                                                                  
     ValLeuPhePheAspAsnSerAspAsnValValLeuArgGlnTyrGlu                          
     859095                                                                    
     AspMetValValAspGluCysGlyCysArg                                            
     100105                                                                    
     (2) INFORMATION FOR SEQ ID NO:15:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 5 amino acids                                                 
     (B) TYPE: amino acid                                                      
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: peptide                                               
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:                                  
     CysXaaXaaXaaXaa                                                           
     15                                                                        
     (2) INFORMATION FOR SEQ ID NO:16:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 1822 base pairs                                               
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (ix) FEATURE:                                                             
     (A) NAME/KEY: CDS                                                         
     (B) LOCATION: 49..1341                                                    
     (D) OTHER INFORMATION: /product="HOP-1"                                   
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:                                  
     GGTGCGGGCCCGGAGCCCGGAGCCCGGGTAGCGCGTAGAGCCGGCGCGATGCACGTG57               
     MetHisVal                                                                 
     CGCTCACTGCGAGCTGCGGCGCCGCACAGCTTCGTGGCGCTCTGGGCA105                       
     ArgSerLeuArgAlaAlaAlaProHisSerPheValAlaLeuTrpAla                          
     51015                                                                     
     CCCCTGTTCCTGCTGCGCTCCGCCCTGGCCGACTTCAGCCTGGACAAC153                       
     ProLeuPheLeuLeuArgSerAlaLeuAlaAspPheSerLeuAspAsn                          
     20253035                                                                  
     GAGGTGCACTCGAGCTTCATCCACCGGCGCCTCCGCAGCCAGGAGCGG201                       
     GluValHisSerSerPheIleHisArgArgLeuArgSerGlnGluArg                          
     404550                                                                    
     CGGGAGATGCAGCGCGAGATCCTCTCCATTTTGGGCTTGCCCCACCGC249                       
     ArgGluMetGlnArgGluIleLeuSerIleLeuGlyLeuProHisArg                          
     556065                                                                    
     CCGCGCCCGCACCTCCAGGGCAAGCACAACTCGGCACCCATGTTCATG297                       
     ProArgProHisLeuGlnGlyLysHisAsnSerAlaProMetPheMet                          
     707580                                                                    
     CTGGACCTGTACAACGCCATGGCGGTGGAGGAGGGCGGCGGGCCCGGC345                       
     LeuAspLeuTyrAsnAlaMetAlaValGluGluGlyGlyGlyProGly                          
     859095                                                                    
     GGCCAGGGCTTCTCCTACCCCTACAAGGCCGTCTTCAGTACCCAGGGC393                       
     GlyGlnGlyPheSerTyrProTyrLysAlaValPheSerThrGlnGly                          
     100105110115                                                              
     CCCCCTCTGGCCAGCCTGCAAGATAGCCATTTCCTCACCGACGCCGAC441                       
     ProProLeuAlaSerLeuGlnAspSerHisPheLeuThrAspAlaAsp                          
     120125130                                                                 
     ATGGTCATGAGCTTCGTCAACCTCGTGGAACATGACAAGGAATTCTTC489                       
     MetValMetSerPheValAsnLeuValGluHisAspLysGluPhePhe                          
     135140145                                                                 
     CACCCACGCTACCACCATCGAGAGTTCCGGTTTGATCTTTCCAAGATC537                       
     HisProArgTyrHisHisArgGluPheArgPheAspLeuSerLysIle                          
     150155160                                                                 
     CCAGAAGGGGAAGCTGTCACGGCAGCCGAATTCCGGATCTACAAGGAC585                       
     ProGluGlyGluAlaValThrAlaAlaGluPheArgIleTyrLysAsp                          
     165170175                                                                 
     TACATCCGGGAACGCTTCGACAATGAGACGTTCCGGATCAGCGTTTAT633                       
     TyrIleArgGluArgPheAspAsnGluThrPheArgIleSerValTyr                          
     180185190195                                                              
     CAGGTGCTCCAGGAGCACTTGGGCAGGGAATCGGATCTCTTCCTGCTC681                       
     GlnValLeuGlnGluHisLeuGlyArgGluSerAspLeuPheLeuLeu                          
     200205210                                                                 
     GACAGCCGTACCCTCTGGGCCTCGGAGGAGGGCTGGCTGGTGTTTGAC729                       
     AspSerArgThrLeuTrpAlaSerGluGluGlyTrpLeuValPheAsp                          
     215220225                                                                 
     ATCACAGCCACCAGCAACCACTGGGTGGTCAATCCGCGGCACAACCTG777                       
     IleThrAlaThrSerAsnHisTrpValValAsnProArgHisAsnLeu                          
     230235240                                                                 
     GGCCTGCAGCTCTCGGTGGAGACGCTGGATGGGCAGAGCATCAACCCC825                       
     GlyLeuGlnLeuSerValGluThrLeuAspGlyGlnSerIleAsnPro                          
     245250255                                                                 
     AAGTTGGCGGGCCTGATTGGGCGGCACGGGCCCCAGAACAAGCAGCCC873                       
     LysLeuAlaGlyLeuIleGlyArgHisGlyProGlnAsnLysGlnPro                          
     260265270275                                                              
     TTCATGGTGGCTTTCTTCAAGGCCACGGAGGTCCACTTCCGCAGCATC921                       
     PheMetValAlaPhePheLysAlaThrGluValHisPheArgSerIle                          
     280285290                                                                 
     CGGTCCACGGGGAGCAAACAGCGCAGCCAGAACCGCTCCAAGACGCCC969                       
     ArgSerThrGlySerLysGlnArgSerGlnAsnArgSerLysThrPro                          
     295300305                                                                 
     AAGAACCAGGAAGCCCTGCGGATGGCCAACGTGGCAGAGAACAGCAGC1017                      
     LysAsnGlnGluAlaLeuArgMetAlaAsnValAlaGluAsnSerSer                          
     310315320                                                                 
     AGCGACCAGAGGCAGGCCTGTAAGAAGCACGAGCTGTATGTCAGCTTC1065                      
     SerAspGlnArgGlnAlaCysLysLysHisGluLeuTyrValSerPhe                          
     325330335                                                                 
     CGAGACCTGGGCTGGCAGGACTGGATCATCGCGCCTGAAGGCTACGCC1113                      
     ArgAspLeuGlyTrpGlnAspTrpIleIleAlaProGluGlyTyrAla                          
     340345350355                                                              
     GCCTACTACTGTGAGGGGGAGTGTGCCTTCCCTCTGAACTCCTACATG1161                      
     AlaTyrTyrCysGluGlyGluCysAlaPheProLeuAsnSerTyrMet                          
     360365370                                                                 
     AACGCCACCAACCACGCCATCGTGCAGACGCTGGTCCACTTCATCAAC1209                      
     AsnAlaThrAsnHisAlaIleValGlnThrLeuValHisPheIleAsn                          
     375380385                                                                 
     CCGGAAACGGTGCCCAAGCCCTGCTGTGCGCCCACGCAGCTCAATGCC1257                      
     ProGluThrValProLysProCysCysAlaProThrGlnLeuAsnAla                          
     390395400                                                                 
     ATCTCCGTCCTCTACTTCGATGACAGCTCCAACGTCATCCTGAAGAAA1305                      
     IleSerValLeuTyrPheAspAspSerSerAsnValIleLeuLysLys                          
     405410415                                                                 
     TACAGAAACATGGTGGTCCGGGCCTGTGGCTGCCACTAGCTCCTCC1351                        
     TyrArgAsnMetValValArgAlaCysGlyCysHis                                      
     420425430                                                                 
     GAGAATTCAGACCCTTTGGGGCCAAGTTTTTCTGGATCCTCCATTGCTCGCCTTGGCCAG1411          
     GAACCAGCAGACCAACTGCCTTTTGTGAGACCTTCCCCTCCCTATCCCCAACTTTAAAGG1471          
     TGTGAGAGTATTAGGAAACATGAGCAGCATATGGCTTTTGATCAGTTTTTCAGTGGCAGC1531          
     ATCCAATGAACAAGATCCTACAAGCTGTGCAGGCAAAACCTAGCAGGAAAAAAAAACAAC1591          
     GCATAAAGAAAAATGGCCGGGCCAGGTCATTGGCTGGGAAGTCTCAGCCATGCACGGACT1651          
     CGTTTCCAGAGGTAATTATGAGCGCCTACCAGCCAGGCCACCCAGCCGTGGGAGGAAGGG1711          
     GGCGTGGCAAGGGGTGGGCACATTGGTGTCTGTGCGAAAGGAAAATTGACCCGGAAGTTC1771          
     CTGTAATAAATGTCACAATAAAACGAATGAATGAAAAAAAAAAAAAAAAAA1822                   
     (2) INFORMATION FOR SEQ ID NO:17:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 431 amino acids                                               
     (B) TYPE: amino acid                                                      
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:                                  
     MetHisValArgSerLeuArgAlaAlaAlaProHisSerPheValAla                          
     151015                                                                    
     LeuTrpAlaProLeuPheLeuLeuArgSerAlaLeuAlaAspPheSer                          
     202530                                                                    
     LeuAspAsnGluValHisSerSerPheIleHisArgArgLeuArgSer                          
     354045                                                                    
     GlnGluArgArgGluMetGlnArgGluIleLeuSerIleLeuGlyLeu                          
     505560                                                                    
     ProHisArgProArgProHisLeuGlnGlyLysHisAsnSerAlaPro                          
     65707580                                                                  
     MetPheMetLeuAspLeuTyrAsnAlaMetAlaValGluGluGlyGly                          
     859095                                                                    
     GlyProGlyGlyGlnGlyPheSerTyrProTyrLysAlaValPheSer                          
     100105110                                                                 
     ThrGlnGlyProProLeuAlaSerLeuGlnAspSerHisPheLeuThr                          
     115120125                                                                 
     AspAlaAspMetValMetSerPheValAsnLeuValGluHisAspLys                          
     130135140                                                                 
     GluPhePheHisProArgTyrHisHisArgGluPheArgPheAspLeu                          
     145150155160                                                              
     SerLysIleProGluGlyGluAlaValThrAlaAlaGluPheArgIle                          
     165170175                                                                 
     TyrLysAspTyrIleArgGluArgPheAspAsnGluThrPheArgIle                          
     180185190                                                                 
     SerValTyrGlnValLeuGlnGluHisLeuGlyArgGluSerAspLeu                          
     195200205                                                                 
     PheLeuLeuAspSerArgThrLeuTrpAlaSerGluGluGlyTrpLeu                          
     210215220                                                                 
     ValPheAspIleThrAlaThrSerAsnHisTrpValValAsnProArg                          
     225230235240                                                              
     HisAsnLeuGlyLeuGlnLeuSerValGluThrLeuAspGlyGlnSer                          
     245250255                                                                 
     IleAsnProLysLeuAlaGlyLeuIleGlyArgHisGlyProGlnAsn                          
     260265270                                                                 
     LysGlnProPheMetValAlaPhePheLysAlaThrGluValHisPhe                          
     275280285                                                                 
     ArgSerIleArgSerThrGlySerLysGlnArgSerGlnAsnArgSer                          
     290295300                                                                 
     LysThrProLysAsnGlnGluAlaLeuArgMetAlaAsnValAlaGlu                          
     305310315320                                                              
     AsnSerSerSerAspGlnArgGlnAlaCysLysLysHisGluLeuTyr                          
     325330335                                                                 
     ValSerPheArgAspLeuGlyTrpGlnAspTrpIleIleAlaProGlu                          
     340345350                                                                 
     GlyTyrAlaAlaTyrTyrCysGluGlyGluCysAlaPheProLeuAsn                          
     355360365                                                                 
     SerTyrMetAsnAlaThrAsnHisAlaIleValGlnThrLeuValHis                          
     370375380                                                                 
     PheIleAsnProGluThrValProLysProCysCysAlaProThrGln                          
     385390395400                                                              
     LeuAsnAlaIleSerValLeuTyrPheAspAspSerSerAsnValIle                          
     405410415                                                                 
     LeuLysLysTyrArgAsnMetValValArgAlaCysGlyCysHis                             
     420425430                                                                 
     (2) INFORMATION FOR SEQ ID NO:18:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 1873 base pairs                                               
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (ix) FEATURE:                                                             
     (A) NAME/KEY: CDS                                                         
     (B) LOCATION: 104..1393                                                   
     (D) OTHER INFORMATION: /product="MOP1 (CDNA)"                             
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:                                  
     CTGCAGCAAGTGACCTCGGGTCGTGGACCGCTGCCCTGCCCCCTCCGCTGCCACCTGGGG60            
     CGGCGCGGGCCCGGTGCCCCGGATCGCGCGTAGAGCCGGCGCGATGCACGTGCGC115                
     MetHisValArg                                                              
     1                                                                         
     TCGCTGCGCGCTGCGGCGCCACACAGCTTCGTGGCGCTCTGGGCGCCT163                       
     SerLeuArgAlaAlaAlaProHisSerPheValAlaLeuTrpAlaPro                          
     5101520                                                                   
     CTGTTCTTGCTGCGCTCCGCCCTGGCCGATTTCAGCCTGGACAACGAG211                       
     LeuPheLeuLeuArgSerAlaLeuAlaAspPheSerLeuAspAsnGlu                          
     253035                                                                    
     GTGCACTCCAGCTTCATCCACCGGCGCCTCCGCAGCCAGGAGCGGCGG259                       
     ValHisSerSerPheIleHisArgArgLeuArgSerGlnGluArgArg                          
     404550                                                                    
     GAGATGCAGCGGGAGATCCTGTCCATCTTAGGGTTGCCCCATCGCCCG307                       
     GluMetGlnArgGluIleLeuSerIleLeuGlyLeuProHisArgPro                          
     556065                                                                    
     CGCCCGCACCTCCAGGGAAAGCATAATTCGGCGCCCATGTTCATGTTG355                       
     ArgProHisLeuGlnGlyLysHisAsnSerAlaProMetPheMetLeu                          
     707580                                                                    
     GACCTGTACAACGCCATGGCGGTGGAGGAGAGCGGGCCGGACGGACAG403                       
     AspLeuTyrAsnAlaMetAlaValGluGluSerGlyProAspGlyGln                          
     859095100                                                                 
     GGCTTCTCCTACCCCTACAAGGCCGTCTTCAGTACCCAGGGCCCCCCT451                       
     GlyPheSerTyrProTyrLysAlaValPheSerThrGlnGlyProPro                          
     105110115                                                                 
     TTAGCCAGCCTGCAGGACAGCCATTTCCTCACTGACGCCGACATGGTC499                       
     LeuAlaSerLeuGlnAspSerHisPheLeuThrAspAlaAspMetVal                          
     120125130                                                                 
     ATGAGCTTCGTCAACCTAGTGGAACATGACAAAGAATTCTTCCACCCT547                       
     MetSerPheValAsnLeuValGluHisAspLysGluPhePheHisPro                          
     135140145                                                                 
     CGATACCACCATCGGGAGTTCCGGTTTGATCTTTCCAAGATCCCCGAG595                       
     ArgTyrHisHisArgGluPheArgPheAspLeuSerLysIleProGlu                          
     150155160                                                                 
     GGCGAACGGGTGACCGCAGCCGAATTCAGGATCTATAAGGACTACATC643                       
     GlyGluArgValThrAlaAlaGluPheArgIleTyrLysAspTyrIle                          
     165170175180                                                              
     CGGGAGCGATTTGACAACGAGACCTTCCAGATCACAGTCTATCAGGTG691                       
     ArgGluArgPheAspAsnGluThrPheGlnIleThrValTyrGlnVal                          
     185190195                                                                 
     CTCCAGGAGCACTCAGGCAGGGAGTCGGACCTCTTCTTGCTGGACAGC739                       
     LeuGlnGluHisSerGlyArgGluSerAspLeuPheLeuLeuAspSer                          
     200205210                                                                 
     CGCACCATCTGGGCTTCTGAGGAGGGCTGGTTGGTGTTTGATATCACA787                       
     ArgThrIleTrpAlaSerGluGluGlyTrpLeuValPheAspIleThr                          
     215220225                                                                 
     GCCACCAGCAACCACTGGGTGGTCAACCCTCGGCACAACCTGGGCTTA835                       
     AlaThrSerAsnHisTrpValValAsnProArgHisAsnLeuGlyLeu                          
     230235240                                                                 
     CAGCTCTCTGTGGAGACCCTGGATGGGCAGAGCATCAACCCCAAGTTG883                       
     GlnLeuSerValGluThrLeuAspGlyGlnSerIleAsnProLysLeu                          
     245250255260                                                              
     GCAGGCCTGATTGGACGGCATGGACCCCAGAACAAGCAACCCTTCATG931                       
     AlaGlyLeuIleGlyArgHisGlyProGlnAsnLysGlnProPheMet                          
     265270275                                                                 
     GTGGCCTTCTTCAAGGCCACGGAAGTCCATCTCCGTAGTATCCGGTCC979                       
     ValAlaPhePheLysAlaThrGluValHisLeuArgSerIleArgSer                          
     280285290                                                                 
     ACGGGGGGCAAGCAGCGCAGCCAGAATCGCTCCAAGACGCCAAAGAAC1027                      
     ThrGlyGlyLysGlnArgSerGlnAsnArgSerLysThrProLysAsn                          
     295300305                                                                 
     CAAGAGGCCCTGAGGATGGCCAGTGTGGCAGAAAACAGCAGCAGTGAC1075                      
     GlnGluAlaLeuArgMetAlaSerValAlaGluAsnSerSerSerAsp                          
     310315320                                                                 
     CAGAGGCAGGCCTGCAAGAAACATGAGCTGTACGTCAGCTTCCGAGAC1123                      
     GlnArgGlnAlaCysLysLysHisGluLeuTyrValSerPheArgAsp                          
     325330335340                                                              
     CTTGGCTGGCAGGACTGGATCATTGCACCTGAAGGCTATGCTGCCTAC1171                      
     LeuGlyTrpGlnAspTrpIleIleAlaProGluGlyTyrAlaAlaTyr                          
     345350355                                                                 
     TACTGTGAGGGAGAGTGCGCCTTCCCTCTGAACTCCTACATGAACGCC1219                      
     TyrCysGluGlyGluCysAlaPheProLeuAsnSerTyrMetAsnAla                          
     360365370                                                                 
     ACCAACCACGCCATCGTCCAGACACTGGTTCACTTCATCAACCCAGAC1267                      
     ThrAsnHisAlaIleValGlnThrLeuValHisPheIleAsnProAsp                          
     375380385                                                                 
     ACAGTACCCAAGCCCTGCTGTGCGCCCACCCAGCTCAACGCCATCTCT1315                      
     ThrValProLysProCysCysAlaProThrGlnLeuAsnAlaIleSer                          
     390395400                                                                 
     GTCCTCTACTTCGACGACAGCTCTAATGTCATCCTGAAGAAGTACAGA1363                      
     ValLeuTyrPheAspAspSerSerAsnValIleLeuLysLysTyrArg                          
     405410415420                                                              
     AACATGGTGGTCCGGGCCTGTGGCTGCCACTAGCTCTTCCTGAGACCCTG1413                    
     AsnMetValValArgAlaCysGlyCysHis                                            
     425430                                                                    
     ACCTTTGCGGGGCCACACCTTTCCAAATCTTCGATGTCTCACCATCTAAGTCTCTCACTG1473          
     CCCACCTTGGCGAGGAGAACAGACCAACCTCTCCTGAGCCTTCCCTCACCTCCCAACCGG1533          
     AAGCATGTAAGGGTTCCAGAAACCTGAGCGTGCAGCAGCTGATGAGCGCCCTTTCCTTCT1593          
     GGCACGTGACGGACAAGATCCTACCAGCTACCACAGCAAACGCCTAAGAGCAGGAAAAAT1653          
     GTCTGCCAGGAAAGTGTCCAGTGTCCACATGGCCCCTGGCGCTCTGAGTCTTTGAGGAGT1713          
     AATCGCAAGCCTCGTTCAGCTGCAGCAGAAGGAAGGGCTTAGCCAGGGTGGGCGCTGGCG1773          
     TCTGTGTTGAAGGGAAACCAAGCAGAAGCCACTGTAATGATATGTCACAATAAAACCCAT1833          
     GAATGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGAATTC1873                              
     (2) INFORMATION FOR SEQ ID NO:19:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 430 amino acids                                               
     (B) TYPE: amino acid                                                      
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:                                  
     MetHisValArgSerLeuArgAlaAlaAlaProHisSerPheValAla                          
     151015                                                                    
     LeuTrpAlaProLeuPheLeuLeuArgSerAlaLeuAlaAspPheSer                          
     202530                                                                    
     LeuAspAsnGluValHisSerSerPheIleHisArgArgLeuArgSer                          
     354045                                                                    
     GlnGluArgArgGluMetGlnArgGluIleLeuSerIleLeuGlyLeu                          
     505560                                                                    
     ProHisArgProArgProHisLeuGlnGlyLysHisAsnSerAlaPro                          
     65707580                                                                  
     MetPheMetLeuAspLeuTyrAsnAlaMetAlaValGluGluSerGly                          
     859095                                                                    
     ProAspGlyGlnGlyPheSerTyrProTyrLysAlaValPheSerThr                          
     100105110                                                                 
     GlnGlyProProLeuAlaSerLeuGlnAspSerHisPheLeuThrAsp                          
     115120125                                                                 
     AlaAspMetValMetSerPheValAsnLeuValGluHisAspLysGlu                          
     130135140                                                                 
     PhePheHisProArgTyrHisHisArgGluPheArgPheAspLeuSer                          
     145150155160                                                              
     LysIleProGluGlyGluArgValThrAlaAlaGluPheArgIleTyr                          
     165170175                                                                 
     LysAspTyrIleArgGluArgPheAspAsnGluThrPheGlnIleThr                          
     180185190                                                                 
     ValTyrGlnValLeuGlnGluHisSerGlyArgGluSerAspLeuPhe                          
     195200205                                                                 
     LeuLeuAspSerArgThrIleTrpAlaSerGluGluGlyTrpLeuVal                          
     210215220                                                                 
     PheAspIleThrAlaThrSerAsnHisTrpValValAsnProArgHis                          
     225230235240                                                              
     AsnLeuGlyLeuGlnLeuSerValGluThrLeuAspGlyGlnSerIle                          
     245250255                                                                 
     AsnProLysLeuAlaGlyLeuIleGlyArgHisGlyProGlnAsnLys                          
     260265270                                                                 
     GlnProPheMetValAlaPhePheLysAlaThrGluValHisLeuArg                          
     275280285                                                                 
     SerIleArgSerThrGlyGlyLysGlnArgSerGlnAsnArgSerLys                          
     290295300                                                                 
     ThrProLysAsnGlnGluAlaLeuArgMetAlaSerValAlaGluAsn                          
     305310315320                                                              
     SerSerSerAspGlnArgGlnAlaCysLysLysHisGluLeuTyrVal                          
     325330335                                                                 
     SerPheArgAspLeuGlyTrpGlnAspTrpIleIleAlaProGluGly                          
     340345350                                                                 
     TyrAlaAlaTyrTyrCysGluGlyGluCysAlaPheProLeuAsnSer                          
     355360365                                                                 
     TyrMetAsnAlaThrAsnHisAlaIleValGlnThrLeuValHisPhe                          
     370375380                                                                 
     IleAsnProAspThrValProLysProCysCysAlaProThrGlnLeu                          
     385390395400                                                              
     AsnAlaIleSerValLeuTyrPheAspAspSerSerAsnValIleLeu                          
     405410415                                                                 
     LysLysTyrArgAsnMetValValArgAlaCysGlyCysHis                                
     420425430                                                                 
     (2) INFORMATION FOR SEQ ID NO:20:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 1723 base pairs                                               
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (ix) FEATURE:                                                             
     (A) NAME/KEY: CDS                                                         
     (B) LOCATION: 490..1695                                                   
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:                                  
     GGCGCCGGCAGAGCAGGAGTGGCTGGAGGAGCTGTGGTTGGAGCAGGAGGTGGCACGGCA60            
     GGGCTGGAGGGCTCCCTATGAGTGGCGGAGACGGCCCAGGAGGCGCTGGAGCAACAGCTC120           
     CCACACCGCACCAAGCGGTGGCTGCAGGAGCTCGCCCATCGCCCCTGCGCTGCTCGGACC180           
     GCGGCCACAGCCGGACTGGCGGGTACGGCGGCGACAGAGGCATTGGCCGAGAGTCCCAGT240           
     CCGCAGAGTAGCCCCGGCCTCGAGGCGGTGGCGTCCCGGTCCTCTCCGTCCAGGAGCCAG300           
     GACAGGTGTCGCGCGGCGGGGCTCCAGGGACCGCGCCTGAGGCCGGCTGCCCGCCCGTCC360           
     CGCCCCGCCCCGCCGCCCGCCGCCCGCCGAGCCCAGCCTCCTTGCCGTCGGGGCGTCCCC420           
     AGGCCCTGGGTCGGCCGCGGAGCCGATGCGCGCCCGCTGAGCGCCCCAGCTGAGCGCCCC480           
     CGGCCTGCCATGACCGCGCTCCCCGGCCCGCTCTGGCTCCTGGGCCTG528                       
     MetThrAlaLeuProGlyProLeuTrpLeuLeuGlyLeu                                   
     1510                                                                      
     GCGCTATGCGCGCTGGGCGGGGGCGGCCCCGGCCTGCGACCCCCGCCC576                       
     AlaLeuCysAlaLeuGlyGlyGlyGlyProGlyLeuArgProProPro                          
     152025                                                                    
     GGCTGTCCCCAGCGACGTCTGGGCGCGCGCGAGCGCCGGGACGTGCAG624                       
     GlyCysProGlnArgArgLeuGlyAlaArgGluArgArgAspValGln                          
     30354045                                                                  
     CGCGAGATCCTGGCGGTGCTCGGGCTGCCTGGGCGGCCCCGGCCCCGC672                       
     ArgGluIleLeuAlaValLeuGlyLeuProGlyArgProArgProArg                          
     505560                                                                    
     GCGCCACCCGCCGCCTCCCGGCTGCCCGCGTCCGCGCCGCTCTTCATG720                       
     AlaProProAlaAlaSerArgLeuProAlaSerAlaProLeuPheMet                          
     657075                                                                    
     CTGGACCTGTACCACGCCATGGCCGGCGACGACGACGAGGACGGCGCG768                       
     LeuAspLeuTyrHisAlaMetAlaGlyAspAspAspGluAspGlyAla                          
     808590                                                                    
     CCCGCGGAGCGGCGCCTGGGCCGCGCCGACCTGGTCATGAGCTTCGTT816                       
     ProAlaGluArgArgLeuGlyArgAlaAspLeuValMetSerPheVal                          
     95100105                                                                  
     AACATGGTGGAGCGAGACCGTGCCCTGGGCCACCAGGAGCCCCATTGG864                       
     AsnMetValGluArgAspArgAlaLeuGlyHisGlnGluProHisTrp                          
     110115120125                                                              
     AAGGAGTTCCGCTTTGACCTGACCCAGATCCCGGCTGGGGAGGCGGTC912                       
     LysGluPheArgPheAspLeuThrGlnIleProAlaGlyGluAlaVal                          
     130135140                                                                 
     ACAGCTGCGGAGTTCCGGATTTACAAGGTGCCCAGCATCCACCTGCTC960                       
     ThrAlaAlaGluPheArgIleTyrLysValProSerIleHisLeuLeu                          
     145150155                                                                 
     AACAGGACCCTCCACGTCAGCATGTTCCAGGTGGTCCAGGAGCAGTCC1008                      
     AsnArgThrLeuHisValSerMetPheGlnValValGlnGluGlnSer                          
     160165170                                                                 
     AACAGGGAGTCTGACTTGTTCTTTTTGGATCTTCAGACGCTCCGAGCT1056                      
     AsnArgGluSerAspLeuPhePheLeuAspLeuGlnThrLeuArgAla                          
     175180185                                                                 
     GGAGACGAGGGCTGGCTGGTGCTGGATGTCACAGCAGCCAGTGACTGC1104                      
     GlyAspGluGlyTrpLeuValLeuAspValThrAlaAlaSerAspCys                          
     190195200205                                                              
     TGGTTGCTGAAGCGTCACAAGGACCTGGGACTCCGCCTCTATGTGGAG1152                      
     TrpLeuLeuLysArgHisLysAspLeuGlyLeuArgLeuTyrValGlu                          
     210215220                                                                 
     ACTGAGGACGGGCACAGCGTGGATCCTGGCCTGGCCGGCCTGCTGGGT1200                      
     ThrGluAspGlyHisSerValAspProGlyLeuAlaGlyLeuLeuGly                          
     225230235                                                                 
     CAACGGGCCCCACGCTCCCAACAGCCTTTCGTGGTCACTTTCTTCAGG1248                      
     GlnArgAlaProArgSerGlnGlnProPheValValThrPhePheArg                          
     240245250                                                                 
     GCCAGTCCGAGTCCCATCCGCACCCCTCGGGCAGTGAGGCCACTGAGG1296                      
     AlaSerProSerProIleArgThrProArgAlaValArgProLeuArg                          
     255260265                                                                 
     AGGAGGCAGCCGAAGAAAAGCAACGAGCTGCCGCAGGCCAACCGACTC1344                      
     ArgArgGlnProLysLysSerAsnGluLeuProGlnAlaAsnArgLeu                          
     270275280285                                                              
     CCAGGGATCTTTGATGACGTCCACGGCTCCCACGGCCGGCAGGTCTGC1392                      
     ProGlyIlePheAspAspValHisGlySerHisGlyArgGlnValCys                          
     290295300                                                                 
     CGTCGGCACGAGCTCTACGTCAGCTTCCAGGACCTCGGCTGGCTGGAC1440                      
     ArgArgHisGluLeuTyrValSerPheGlnAspLeuGlyTrpLeuAsp                          
     305310315                                                                 
     TGGGTCATCGCTCCCCAAGGCTACTCGGCCTATTACTGTGAGGGGGAG1488                      
     TrpValIleAlaProGlnGlyTyrSerAlaTyrTyrCysGluGlyGlu                          
     320325330                                                                 
     TGCTCCTTCCCACTGGACTCCTGCATGAATGCCACCAACCACGCCATC1536                      
     CysSerPheProLeuAspSerCysMetAsnAlaThrAsnHisAlaIle                          
     335340345                                                                 
     CTGCAGTCCCTGGTGCACCTGATGAAGCCAAACGCAGTCCCCAAGGCG1584                      
     LeuGlnSerLeuValHisLeuMetLysProAsnAlaValProLysAla                          
     350355360365                                                              
     TGCTGTGCACCCACCAAGCTGAGCGCCACCTCTGTGCTCTACTATGAC1632                      
     CysCysAlaProThrLysLeuSerAlaThrSerValLeuTyrTyrAsp                          
     370375380                                                                 
     AGCAGCAACAACGTCATCCTGCGCAAACACCGCAACATGGTGGTCAAG1680                      
     SerSerAsnAsnValIleLeuArgLysHisArgAsnMetValValLys                          
     385390395                                                                 
     GCCTGCGGCTGCCACTGAGTCAGCCCGCCCAGCCCTACTGCAG1723                           
     AlaCysGlyCysHis                                                           
     400                                                                       
     (2) INFORMATION FOR SEQ ID NO:21:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 402 amino acids                                               
     (B) TYPE: amino acid                                                      
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:                                  
     MetThrAlaLeuProGlyProLeuTrpLeuLeuGlyLeuAlaLeuCys                          
     151015                                                                    
     AlaLeuGlyGlyGlyGlyProGlyLeuArgProProProGlyCysPro                          
     202530                                                                    
     GlnArgArgLeuGlyAlaArgGluArgArgAspValGlnArgGluIle                          
     354045                                                                    
     LeuAlaValLeuGlyLeuProGlyArgProArgProArgAlaProPro                          
     505560                                                                    
     AlaAlaSerArgLeuProAlaSerAlaProLeuPheMetLeuAspLeu                          
     65707580                                                                  
     TyrHisAlaMetAlaGlyAspAspAspGluAspGlyAlaProAlaGlu                          
     859095                                                                    
     ArgArgLeuGlyArgAlaAspLeuValMetSerPheValAsnMetVal                          
     100105110                                                                 
     GluArgAspArgAlaLeuGlyHisGlnGluProHisTrpLysGluPhe                          
     115120125                                                                 
     ArgPheAspLeuThrGlnIleProAlaGlyGluAlaValThrAlaAla                          
     130135140                                                                 
     GluPheArgIleTyrLysValProSerIleHisLeuLeuAsnArgThr                          
     145150155160                                                              
     LeuHisValSerMetPheGlnValValGlnGluGlnSerAsnArgGlu                          
     165170175                                                                 
     SerAspLeuPhePheLeuAspLeuGlnThrLeuArgAlaGlyAspGlu                          
     180185190                                                                 
     GlyTrpLeuValLeuAspValThrAlaAlaSerAspCysTrpLeuLeu                          
     195200205                                                                 
     LysArgHisLysAspLeuGlyLeuArgLeuTyrValGluThrGluAsp                          
     210215220                                                                 
     GlyHisSerValAspProGlyLeuAlaGlyLeuLeuGlyGlnArgAla                          
     225230235240                                                              
     ProArgSerGlnGlnProPheValValThrPhePheArgAlaSerPro                          
     245250255                                                                 
     SerProIleArgThrProArgAlaValArgProLeuArgArgArgGln                          
     260265270                                                                 
     ProLysLysSerAsnGluLeuProGlnAlaAsnArgLeuProGlyIle                          
     275280285                                                                 
     PheAspAspValHisGlySerHisGlyArgGlnValCysArgArgHis                          
     290295300                                                                 
     GluLeuTyrValSerPheGlnAspLeuGlyTrpLeuAspTrpValIle                          
     305310315320                                                              
     AlaProGlnGlyTyrSerAlaTyrTyrCysGluGlyGluCysSerPhe                          
     325330335                                                                 
     ProLeuAspSerCysMetAsnAlaThrAsnHisAlaIleLeuGlnSer                          
     340345350                                                                 
     LeuValHisLeuMetLysProAsnAlaValProLysAlaCysCysAla                          
     355360365                                                                 
     ProThrLysLeuSerAlaThrSerValLeuTyrTyrAspSerSerAsn                          
     370375380                                                                 
     AsnValIleLeuArgLysHisArgAsnMetValValLysAlaCysGly                          
     385390395400                                                              
     CysHis                                                                    
     (2) INFORMATION FOR SEQ ID NO:22:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 1926 base pairs                                               
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (ix) FEATURE:                                                             
     (A) NAME/KEY: CDS                                                         
     (B) LOCATION: 93..1289                                                    
     (D) OTHER INFORMATION: /product="MOP2 CDNA"                               
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:                                  
     GCCAGGCACAGGTGCGCCGTCTGGTCCTCCCCGTCTGGCGTCAGCCGAGCCCGACCAGCT60            
     ACCAGTGGATGCGCGCCGGCTGAAAGTCCGAGATGGCTATGCGTCCCGGGCCA113                  
     MetAlaMetArgProGlyPro                                                     
     15                                                                        
     CTCTGGCTATTGGGCCTTGCTCTGTGCGCGCTGGGAGGCGGCCACGGT161                       
     LeuTrpLeuLeuGlyLeuAlaLeuCysAlaLeuGlyGlyGlyHisGly                          
     101520                                                                    
     CCGCGTCCCCCGCACACCTGTCCCCAGCGTCGCCTGGGAGCGCGCGAG209                       
     ProArgProProHisThrCysProGlnArgArgLeuGlyAlaArgGlu                          
     253035                                                                    
     CGCCGCGACATGCAGCGTGAAATCCTGGCGGTGCTCGGGCTACCGGGA257                       
     ArgArgAspMetGlnArgGluIleLeuAlaValLeuGlyLeuProGly                          
     40455055                                                                  
     CGGCCCCGACCCCGTGCACAACCCGCGGCTGCCCGGCAGCCAGCGTCC305                       
     ArgProArgProArgAlaGlnProAlaAlaAlaArgGlnProAlaSer                          
     606570                                                                    
     GCGCCCCTCTTCATGTTGGACCTATACCACGCCATGACCGATGACGAC353                       
     AlaProLeuPheMetLeuAspLeuTyrHisAlaMetThrAspAspAsp                          
     758085                                                                    
     GACGGCGGGCCACCACAGGCTCACTTAGGCCGTGCCGACCTGGTCATG401                       
     AspGlyGlyProProGlnAlaHisLeuGlyArgAlaAspLeuValMet                          
     9095100                                                                   
     AGCTTCGTCAACATGGTGGAACGCGACCGTACCCTGGGCTACCAGGAG449                       
     SerPheValAsnMetValGluArgAspArgThrLeuGlyTyrGlnGlu                          
     105110115                                                                 
     CCACACTGGAAGGAATTCCACTTTGACCTAACCCAGATCCCTGCTGGG497                       
     ProHisTrpLysGluPheHisPheAspLeuThrGlnIleProAlaGly                          
     120125130135                                                              
     GAGGCTGTCACAGCTGCTGAGTTCCGGATCTACAAAGAACCCAGCACC545                       
     GluAlaValThrAlaAlaGluPheArgIleTyrLysGluProSerThr                          
     140145150                                                                 
     CACCCGCTCAACACAACCCTCCACATCAGCATGTTCGAAGTGGTCCAA593                       
     HisProLeuAsnThrThrLeuHisIleSerMetPheGluValValGln                          
     155160165                                                                 
     GAGCACTCCAACAGGGAGTCTGACTTGTTCTTTTTGGATCTTCAGACG641                       
     GluHisSerAsnArgGluSerAspLeuPhePheLeuAspLeuGlnThr                          
     170175180                                                                 
     CTCCGATCTGGGGACGAGGGCTGGCTGGTGCTGGACATCACAGCAGCC689                       
     LeuArgSerGlyAspGluGlyTrpLeuValLeuAspIleThrAlaAla                          
     185190195                                                                 
     AGTGACCGATGGCTGCTGAACCATCACAAGGACCTGGGACTCCGCCTC737                       
     SerAspArgTrpLeuLeuAsnHisHisLysAspLeuGlyLeuArgLeu                          
     200205210215                                                              
     TATGTGGAAACCGCGGATGGGCACAGCATGGATCCTGGCCTGGCTGGT785                       
     TyrValGluThrAlaAspGlyHisSerMetAspProGlyLeuAlaGly                          
     220225230                                                                 
     CTGCTTGGACGACAAGCACCACGCTCCAGACAGCCTTTCATGGTAACC833                       
     LeuLeuGlyArgGlnAlaProArgSerArgGlnProPheMetValThr                          
     235240245                                                                 
     TTCTTCAGGGCCAGCCAGAGTCCTGTGCGGGCCCCTCGGGCAGCGAGA881                       
     PhePheArgAlaSerGlnSerProValArgAlaProArgAlaAlaArg                          
     250255260                                                                 
     CCACTGAAGAGGAGGCAGCCAAAGAAAACGAACGAGCTTCCGCACCCC929                       
     ProLeuLysArgArgGlnProLysLysThrAsnGluLeuProHisPro                          
     265270275                                                                 
     AACAAACTCCCAGGGATCTTTGATGATGGCCACGGTTCCCGCGGCAGA977                       
     AsnLysLeuProGlyIlePheAspAspGlyHisGlySerArgGlyArg                          
     280285290295                                                              
     GAGGTTTGCCGCAGGCATGAGCTCTACGTCAGCTTCCGTGACCTTGGC1025                      
     GluValCysArgArgHisGluLeuTyrValSerPheArgAspLeuGly                          
     300305310                                                                 
     TGGCTGGACTGGGTCATCGCCCCCCAGGGCTACTCTGCCTATTACTGT1073                      
     TrpLeuAspTrpValIleAlaProGlnGlyTyrSerAlaTyrTyrCys                          
     315320325                                                                 
     GAGGGGGAGTGTGCTTTCCCACTGGACTCCTGTATGAACGCCACCAAC1121                      
     GluGlyGluCysAlaPheProLeuAspSerCysMetAsnAlaThrAsn                          
     330335340                                                                 
     CATGCCATCTTGCAGTCTCTGGTGCACCTGATGAAGCCAGATGTTGTC1169                      
     HisAlaIleLeuGlnSerLeuValHisLeuMetLysProAspValVal                          
     345350355                                                                 
     CCCAAGGCATGCTGTGCACCCACCAAACTGAGTGCCACCTCTGTGCTG1217                      
     ProLysAlaCysCysAlaProThrLysLeuSerAlaThrSerValLeu                          
     360365370375                                                              
     TACTATGACAGCAGCAACAATGTCATCCTGCGTAAACACCGTAACATG1265                      
     TyrTyrAspSerSerAsnAsnValIleLeuArgLysHisArgAsnMet                          
     380385390                                                                 
     GTGGTCAAGGCCTGTGGCTGCCACTGAGGCCCCGCCCAGCATCCTGCTTCTACT1319                
     ValValLysAlaCysGlyCysHis                                                  
     395                                                                       
     ACCTTACCATCTGGCCGGGCCCCTCTCCAGAGGCAGAAACCCTTCTATGTTATCATAGCT1379          
     CAGACAGGGGCAATGGGAGGCCCTTCACTTCCCCTGGCCACTTCCTGCTAAAATTCTGGT1439          
     CTTTCCCAGTTCCTCTGTCCTTCATGGGGTTTCGGGGCTATCACCCCGCCCTCTCCATCC1499          
     TCCTACCCCAAGCATAGACTGAATGCACACAGCATCCCAGAGCTATGCTAACTGAGAGGT1559          
     CTGGGGTCAGCACTGAAGGCCCACATGAGGAAGACTGATCCTTGGCCATCCTCAGCCCAC1619          
     AATGGCAAATTCTGGATGGTCTAAGAAGGCCGTGGAATTCTAAACTAGATGATCTGGGCT1679          
     CTCTGCACCATTCATTGTGGCAGTTGGGACATTTTTAGGTATAACAGACACATACACTTA1739          
     GATCAATGCATCGCTGTACTCCTTGAAATCAGAGCTAGCTTGTTAGAAAAAGAATCAGAG1799          
     CCAGGTATAGCGGTGCATGTCATTAATCCCAGCGCTAAAGAGACAGAGACAGGAGAATCT1859          
     CTGTGAGTTCAAGGCCACATAGAAAGAGCCTGTCTCGGGAGCAGGAAAAAAAAAAAAAAC1919          
     GGAATTC1926                                                               
     (2) INFORMATION FOR SEQ ID NO:23:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 399 amino acids                                               
     (B) TYPE: amino acid                                                      
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:                                  
     MetAlaMetArgProGlyProLeuTrpLeuLeuGlyLeuAlaLeuCys                          
     151015                                                                    
     AlaLeuGlyGlyGlyHisGlyProArgProProHisThrCysProGln                          
     202530                                                                    
     ArgArgLeuGlyAlaArgGluArgArgAspMetGlnArgGluIleLeu                          
     354045                                                                    
     AlaValLeuGlyLeuProGlyArgProArgProArgAlaGlnProAla                          
     505560                                                                    
     AlaAlaArgGlnProAlaSerAlaProLeuPheMetLeuAspLeuTyr                          
     65707580                                                                  
     HisAlaMetThrAspAspAspAspGlyGlyProProGlnAlaHisLeu                          
     859095                                                                    
     GlyArgAlaAspLeuValMetSerPheValAsnMetValGluArgAsp                          
     100105110                                                                 
     ArgThrLeuGlyTyrGlnGluProHisTrpLysGluPheHisPheAsp                          
     115120125                                                                 
     LeuThrGlnIleProAlaGlyGluAlaValThrAlaAlaGluPheArg                          
     130135140                                                                 
     IleTyrLysGluProSerThrHisProLeuAsnThrThrLeuHisIle                          
     145150155160                                                              
     SerMetPheGluValValGlnGluHisSerAsnArgGluSerAspLeu                          
     165170175                                                                 
     PhePheLeuAspLeuGlnThrLeuArgSerGlyAspGluGlyTrpLeu                          
     180185190                                                                 
     ValLeuAspIleThrAlaAlaSerAspArgTrpLeuLeuAsnHisHis                          
     195200205                                                                 
     LysAspLeuGlyLeuArgLeuTyrValGluThrAlaAspGlyHisSer                          
     210215220                                                                 
     MetAspProGlyLeuAlaGlyLeuLeuGlyArgGlnAlaProArgSer                          
     225230235240                                                              
     ArgGlnProPheMetValThrPhePheArgAlaSerGlnSerProVal                          
     245250255                                                                 
     ArgAlaProArgAlaAlaArgProLeuLysArgArgGlnProLysLys                          
     260265270                                                                 
     ThrAsnGluLeuProHisProAsnLysLeuProGlyIlePheAspAsp                          
     275280285                                                                 
     GlyHisGlySerArgGlyArgGluValCysArgArgHisGluLeuTyr                          
     290295300                                                                 
     ValSerPheArgAspLeuGlyTrpLeuAspTrpValIleAlaProGln                          
     305310315320                                                              
     GlyTyrSerAlaTyrTyrCysGluGlyGluCysAlaPheProLeuAsp                          
     325330335                                                                 
     SerCysMetAsnAlaThrAsnHisAlaIleLeuGlnSerLeuValHis                          
     340345350                                                                 
     LeuMetLysProAspValValProLysAlaCysCysAlaProThrLys                          
     355360365                                                                 
     LeuSerAlaThrSerValLeuTyrTyrAspSerSerAsnAsnValIle                          
     370375380                                                                 
     LeuArgLysHisArgAsnMetValValLysAlaCysGlyCysHis                             
     385390395                                                                 
     (2) INFORMATION FOR SEQ ID NO:24:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 1368 base pairs                                               
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (ix) FEATURE:                                                             
     (A) NAME/KEY: CDS                                                         
     (B) LOCATION: 1..1365                                                     
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:                                  
     ATGTCGGGACTGCGAAACACCTCGGAGGCCGTTGCAGTGCTCGCCTCC48                        
     MetSerGlyLeuArgAsnThrSerGluAlaValAlaValLeuAlaSer                          
     151015                                                                    
     CTGGGACTCGGAATGGTTCTGCTCATGTTCGTGGCGACCACGCCGCCG96                        
     LeuGlyLeuGlyMetValLeuLeuMetPheValAlaThrThrProPro                          
     202530                                                                    
     GCCGTTGAGGCCACCCAGTCGGGGATTTACATAGACAACGGCAAGGAC144                       
     AlaValGluAlaThrGlnSerGlyIleTyrIleAspAsnGlyLysAsp                          
     354045                                                                    
     CAGACGATCATGCACAGAGTGCTGAGCGAGGACGACAAGCTGGACGTC192                       
     GlnThrIleMetHisArgValLeuSerGluAspAspLysLeuAspVal                          
     505560                                                                    
     TCGTACGAGATCCTCGAGTTCCTGGGCATCGCCGAACGGCCGACGCAC240                       
     SerTyrGluIleLeuGluPheLeuGlyIleAlaGluArgProThrHis                          
     65707580                                                                  
     CTGAGCAGCCACCAGTTGTCGCTGAGGAAGTCGGCTCCCAAGTTCCTG288                       
     LeuSerSerHisGlnLeuSerLeuArgLysSerAlaProLysPheLeu                          
     859095                                                                    
     CTGGACGTCTACCACCGCATCACGGCGGAGGAGGGTCTCAGCGATCAG336                       
     LeuAspValTyrHisArgIleThrAlaGluGluGlyLeuSerAspGln                          
     100105110                                                                 
     GATGAGGACGACGACTACGAACGCGGCCATCGGTCCAGGAGGAGCGCC384                       
     AspGluAspAspAspTyrGluArgGlyHisArgSerArgArgSerAla                          
     115120125                                                                 
     GACCTCGAGGAGGATGAGGGCGAGCAGCAGAAGAACTTCATCACCGAC432                       
     AspLeuGluGluAspGluGlyGluGlnGlnLysAsnPheIleThrAsp                          
     130135140                                                                 
     CTGGACAAGCGGGCCATCGACGAGAGCGACATCATCATGACCTTCCTG480                       
     LeuAspLysArgAlaIleAspGluSerAspIleIleMetThrPheLeu                          
     145150155160                                                              
     AACAAGCGCCACCACAATGTGGACGAACTGCGTCACGAGCACGGCCGT528                       
     AsnLysArgHisHisAsnValAspGluLeuArgHisGluHisGlyArg                          
     165170175                                                                 
     CGCCTGTGGTTCGACGTCTCCAACGTGCCCAACGACAACTACCTGGTG576                       
     ArgLeuTrpPheAspValSerAsnValProAsnAspAsnTyrLeuVal                          
     180185190                                                                 
     ATGGCCGAGCTGCGCATCTATCAGAACGCCAACGAGGGCAAGTGGCTG624                       
     MetAlaGluLeuArgIleTyrGlnAsnAlaAsnGluGlyLysTrpLeu                          
     195200205                                                                 
     ACCGCCAACAGGGAGTTCACCATCACGGTATACGCCATTGGCACCGGC672                       
     ThrAlaAsnArgGluPheThrIleThrValTyrAlaIleGlyThrGly                          
     210215220                                                                 
     ACGCTGGGCCAGCACACCATGGAGCCGCTGTCCTCGGTGAACACCACC720                       
     ThrLeuGlyGlnHisThrMetGluProLeuSerSerValAsnThrThr                          
     225230235240                                                              
     GGGGACTACGTGGGCTGGTTGGAGCTCAACGTGACCGAGGGCCTGCAC768                       
     GlyAspTyrValGlyTrpLeuGluLeuAsnValThrGluGlyLeuHis                          
     245250255                                                                 
     GAGTGGCTGGTCAAGTCGAAGGACAATCATGGCATCTACATTGGAGCA816                       
     GluTrpLeuValLysSerLysAspAsnHisGlyIleTyrIleGlyAla                          
     260265270                                                                 
     CACGCTGTCAACCGACCCGACCGCGAGGTGAAGCTGGACGACATTGGA864                       
     HisAlaValAsnArgProAspArgGluValLysLeuAspAspIleGly                          
     275280285                                                                 
     CTGATCCACCGCAAGGTGGACGACGAGTTCCAGCCCTTCATGATCGGC912                       
     LeuIleHisArgLysValAspAspGluPheGlnProPheMetIleGly                          
     290295300                                                                 
     TTCTTCCGCGGACCGGAGCTGATCAAGGCGACGGCCCACAGCAGCCAC960                       
     PhePheArgGlyProGluLeuIleLysAlaThrAlaHisSerSerHis                          
     305310315320                                                              
     CACAGGAGCAAGCGAAGCGCCAGCCATCCACGCAAGCGCAAGAAGTCG1008                      
     HisArgSerLysArgSerAlaSerHisProArgLysArgLysLysSer                          
     325330335                                                                 
     GTGTCGCCCAACAACGTGCCGCTGCTGGAACCGATGGAGAGCACGCGC1056                      
     ValSerProAsnAsnValProLeuLeuGluProMetGluSerThrArg                          
     340345350                                                                 
     AGCTGCCAGATGCAGACCCTGTACATAGACTTCAAGGATCTGGGCTGG1104                      
     SerCysGlnMetGlnThrLeuTyrIleAspPheLysAspLeuGlyTrp                          
     355360365                                                                 
     CATGACTGGATCATCGCACCAGAGGGCTATGGCGCCTTCTACTGCAGC1152                      
     HisAspTrpIleIleAlaProGluGlyTyrGlyAlaPheTyrCysSer                          
     370375380                                                                 
     GGCGAGTGCAATTTCCCGCTCAATGCGCACATGAACGCCACGAACCAT1200                      
     GlyGluCysAsnPheProLeuAsnAlaHisMetAsnAlaThrAsnHis                          
     385390395400                                                              
     GCGATCGTCCAGACCCTGGTCCACCTGCTGGAGCCCAAGAAGGTGCCC1248                      
     AlaIleValGlnThrLeuValHisLeuLeuGluProLysLysValPro                          
     405410415                                                                 
     AAGCCCTGCTGCGCTCCGACCAGGCTGGGAGCACTACCCGTTCTGTAC1296                      
     LysProCysCysAlaProThrArgLeuGlyAlaLeuProValLeuTyr                          
     420425430                                                                 
     CACCTGAACGACGAGAATGTGAACCTGAAAAAGTATAGAAACATGATT1344                      
     HisLeuAsnAspGluAsnValAsnLeuLysLysTyrArgAsnMetIle                          
     435440445                                                                 
     GTGAAATCCTGCGGGTGCCATTGA1368                                              
     ValLysSerCysGlyCysHis                                                     
     450455                                                                    
     (2) INFORMATION FOR SEQ ID NO:25:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 455 amino acids                                               
     (B) TYPE: amino acid                                                      
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:                                  
     MetSerGlyLeuArgAsnThrSerGluAlaValAlaValLeuAlaSer                          
     151015                                                                    
     LeuGlyLeuGlyMetValLeuLeuMetPheValAlaThrThrProPro                          
     202530                                                                    
     AlaValGluAlaThrGlnSerGlyIleTyrIleAspAsnGlyLysAsp                          
     354045                                                                    
     GlnThrIleMetHisArgValLeuSerGluAspAspLysLeuAspVal                          
     505560                                                                    
     SerTyrGluIleLeuGluPheLeuGlyIleAlaGluArgProThrHis                          
     65707580                                                                  
     LeuSerSerHisGlnLeuSerLeuArgLysSerAlaProLysPheLeu                          
     859095                                                                    
     LeuAspValTyrHisArgIleThrAlaGluGluGlyLeuSerAspGln                          
     100105110                                                                 
     AspGluAspAspAspTyrGluArgGlyHisArgSerArgArgSerAla                          
     115120125                                                                 
     AspLeuGluGluAspGluGlyGluGlnGlnLysAsnPheIleThrAsp                          
     130135140                                                                 
     LeuAspLysArgAlaIleAspGluSerAspIleIleMetThrPheLeu                          
     145150155160                                                              
     AsnLysArgHisHisAsnValAspGluLeuArgHisGluHisGlyArg                          
     165170175                                                                 
     ArgLeuTrpPheAspValSerAsnValProAsnAspAsnTyrLeuVal                          
     180185190                                                                 
     MetAlaGluLeuArgIleTyrGlnAsnAlaAsnGluGlyLysTrpLeu                          
     195200205                                                                 
     ThrAlaAsnArgGluPheThrIleThrValTyrAlaIleGlyThrGly                          
     210215220                                                                 
     ThrLeuGlyGlnHisThrMetGluProLeuSerSerValAsnThrThr                          
     225230235240                                                              
     GlyAspTyrValGlyTrpLeuGluLeuAsnValThrGluGlyLeuHis                          
     245250255                                                                 
     GluTrpLeuValLysSerLysAspAsnHisGlyIleTyrIleGlyAla                          
     260265270                                                                 
     HisAlaValAsnArgProAspArgGluValLysLeuAspAspIleGly                          
     275280285                                                                 
     LeuIleHisArgLysValAspAspGluPheGlnProPheMetIleGly                          
     290295300                                                                 
     PhePheArgGlyProGluLeuIleLysAlaThrAlaHisSerSerHis                          
     305310315320                                                              
     HisArgSerLysArgSerAlaSerHisProArgLysArgLysLysSer                          
     325330335                                                                 
     ValSerProAsnAsnValProLeuLeuGluProMetGluSerThrArg                          
     340345350                                                                 
     SerCysGlnMetGlnThrLeuTyrIleAspPheLysAspLeuGlyTrp                          
     355360365                                                                 
     HisAspTrpIleIleAlaProGluGlyTyrGlyAlaPheTyrCysSer                          
     370375380                                                                 
     GlyGluCysAsnPheProLeuAsnAlaHisMetAsnAlaThrAsnHis                          
     385390395400                                                              
     AlaIleValGlnThrLeuValHisLeuLeuGluProLysLysValPro                          
     405410415                                                                 
     LysProCysCysAlaProThrArgLeuGlyAlaLeuProValLeuTyr                          
     420425430                                                                 
     HisLeuAsnAspGluAsnValAsnLeuLysLysTyrArgAsnMetIle                          
     435440445                                                                 
     ValLysSerCysGlyCysHis                                                     
     450455                                                                    
     (2) INFORMATION FOR SEQ ID NO:26:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 104 amino acids                                               
     (B) TYPE: amino acid                                                      
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (ix) FEATURE:                                                             
     (A) NAME/KEY: Protein                                                     
     (B) LOCATION: 1..104                                                      
     (D) OTHER INFORMATION: /label=BMP3                                        
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:                                  
     CysAlaArgArgTyrLeuLysValAspPheAlaAspIleGlyTrpSer                          
     151015                                                                    
     GluTrpIleIleSerProLysSerPheAspAlaTyrTyrCysSerGly                          
     202530                                                                    
     AlaCysGlnPheProMetProLysSerLeuLysProSerAsnHisAla                          
     354045                                                                    
     ThrIleGlnSerIleValAlaArgAlaValGlyValValProGlyIle                          
     505560                                                                    
     ProGluProCysCysValProGluLysMetSerSerLeuSerIleLeu                          
     65707580                                                                  
     PhePheAspGluAsnLysAsnValValLeuLysValTyrProAsnMet                          
     859095                                                                    
     ThrValGluSerCysAlaCysArg                                                  
     100                                                                       
     (2) INFORMATION FOR SEQ ID NO:27:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 102 amino acids                                               
     (B) TYPE: amino acid                                                      
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (ix) FEATURE:                                                             
     (A) NAME/KEY: Protein                                                     
     (B) LOCATION: 1..102                                                      
     (D) OTHER INFORMATION: /label=BMP5                                        
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:                                  
     CysLysLysHisGluLeuTyrValSerPheArgAspLeuGlyTrpGln                          
     151015                                                                    
     AspTrpIleIleAlaProGluGlyTyrAlaAlaPheTyrCysAspGly                          
     202530                                                                    
     GluCysSerPheProLeuAsnAlaHisMetAsnAlaThrAsnHisAla                          
     354045                                                                    
     IleValGlnThrLeuValHisLeuMetPheProAspHisValProLys                          
     505560                                                                    
     ProCysCysAlaProThrLysLeuAsnAlaIleSerValLeuTyrPhe                          
     65707580                                                                  
     AspAspSerSerAsnValIleLeuLysLysTyrArgAsnMetValVal                          
     859095                                                                    
     ArgSerCysGlyCysHis                                                        
     100                                                                       
     (2) INFORMATION FOR SEQ ID NO:28:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 102 amino acids                                               
     (B) TYPE: amino acid                                                      
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (ix) FEATURE:                                                             
     (A) NAME/KEY: Protein                                                     
     (B) LOCATION: 1..102                                                      
     (D) OTHER INFORMATION: /label=BMP6                                        
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:28:                                  
     CysArgLysHisGluLeuTyrValSerPheGlnAspLeuGlyTrpGln                          
     151015                                                                    
     AspTrpIleIleAlaProLysGlyTyrAlaAlaAsnTyrCysAspGly                          
     202530                                                                    
     GluCysSerPheProLeuAsnAlaHisMetAsnAlaThrAsnHisAla                          
     354045                                                                    
     IleValGlnThrLeuValHisLeuMetAsnProGluTyrValProLys                          
     505560                                                                    
     ProCysCysAlaProThrLysLeuAsnAlaIleSerValLeuTyrPhe                          
     65707580                                                                  
     AspAspAsnSerAsnValIleLeuLysLysTyrArgTrpMetValVal                          
     859095                                                                    
     ArgAlaCysGlyCysHis                                                        
     100                                                                       
     (2) INFORMATION FOR SEQ ID NO:29:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 102 amino acids                                               
     (B) TYPE: amino acid                                                      
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (ix) FEATURE:                                                             
     (A) NAME/KEY: Protein                                                     
     (B) LOCATION: 1..102                                                      
     (D) OTHER INFORMATION: /label=OPX                                         
     /note= "WHEREIN XAA AT EACH POS'N IS INDEPENDENTLY                        
     SELECTED FROM THE RESIDUES OCCURING AT THE CORRESPONDING                  
     POS'N IN THE C-TERMINAL SEQUENCE OF MOUSE OR HUMAN OP1                    
     OR OP2 (SEQ. ID NOS. 5,6,7&8 OR 16,18, 20&22"                             
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:29:                                  
     CysXaaXaaHisGluLeuTyrValXaaPheXaaAspLeuGlyTrpXaa                          
     151015                                                                    
     AspTrpXaaIleAlaProXaaGlyTyrXaaAlaTyrTyrCysGluGly                          
     202530                                                                    
     GluCysXaaPheProLeuXaaSerXaaMetAsnAlaThrAsnHisAla                          
     354045                                                                    
     IleXaaGlnXaaLeuValHisXaaXaaXaaProXaaXaaValProLys                          
     505560                                                                    
     XaaCysCysAlaProThrXaaLeuXaaAlaXaaSerValLeuTyrXaa                          
     65707580                                                                  
     AspXaaSerXaaAsnValXaaLeuXaaLysXaaArgAsnMetValVal                          
     859095                                                                    
     XaaAlaCysGlyCysHis                                                        
     100                                                                       
     (2) INFORMATION FOR SEQ ID NO:30:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 97 amino acids                                                
     (B) TYPE: amino acid                                                      
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (ix) FEATURE:                                                             
     (A) NAME/KEY: Protein                                                     
     (B) LOCATION: 1..97                                                       
     (D) OTHER INFORMATION: /label=GENERIC-SEQ-5                               
     /note= "WHEREIN EACH XAA IS INDEPENDENTLY SELECTED FROM A                 
     GROUP OF ONE OR MORE SPECIFIED AMINO ACIDS AS DEFINED IN                  
     THE SPECIFICATION"                                                        
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:30:                                  
     LeuXaaXaaXaaPheXaaXaaXaaGlyTrpXaaXaaTrpXaaXaaXaa                          
     151015                                                                    
     ProXaaXaaXaaXaaAlaXaaTyrCysXaaGlyXaaCysXaaXaaPro                          
     202530                                                                    
     XaaXaaXaaXaaXaaXaaXaaXaaAsnHisAlaXaaXaaXaaXaaXaa                          
     354045                                                                    
     XaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaCysCysXaaPro                          
     505560                                                                    
     XaaXaaXaaXaaXaaXaaXaaXaaLeuXaaXaaXaaXaaXaaXaaXaa                          
     65707580                                                                  
     ValXaaLeuXaaXaaXaaXaaXaaMetXaaValXaaXaaCysXaaCys                          
     859095                                                                    
     Xaa                                                                       
     (2) INFORMATION FOR SEQ ID NO:31:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 102 amino acids                                               
     (B) TYPE: amino acid                                                      
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (ix) FEATURE:                                                             
     (A) NAME/KEY: Protein                                                     
     (B) LOCATION: 1..102                                                      
     (D) OTHER INFORMATION: /label=GENERIC-SEQ-6                               
     /note= "WHEREIN EACH XAA IS INDEPENDENTLY SELECTED FROM A                 
     GROUP OF ONE OR MORE SPECIFIED AMINO ACIDS AS DEFINED IN                  
     THE SPECIFICATION"                                                        
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:31:                                  
     CysXaaXaaXaaXaaLeuXaaXaaXaaPheXaaXaaXaaGlyTrpXaa                          
     151015                                                                    
     XaaTrpXaaXaaXaaProXaaXaaXaaXaaAlaXaaTyrCysXaaGly                          
     202530                                                                    
     XaaCysXaaXaaProXaaXaaXaaXaaXaaXaaXaaXaaAsnHisAla                          
     354045                                                                    
     XaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaaXaa                          
     505560                                                                    
     XaaCysCysXaaProXaaXaaXaaXaaXaaXaaXaaXaaLeuXaaXaa                          
     65707580                                                                  
     XaaXaaXaaXaaXaaValXaaLeuXaaXaaXaaXaaXaaMetXaaVal                          
     859095                                                                    
     XaaXaaCysXaaCysXaa                                                        
     100                                                                       
     (2) INFORMATION FOR SEQ ID NO:32:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 1247 base pairs                                               
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (ix) FEATURE:                                                             
     (A) NAME/KEY: CDS                                                         
     (B) LOCATION: 84..1199                                                    
     (D) OTHER INFORMATION: /product="GDF-1"                                   
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:32:                                  
     GGGGACACCGGCCCCGCCCTCAGCCCACTGGTCCCGGGCCGCCGCGGACCCTGCGCACTC60            
     TCTGGTCATCGCCTGGGAGGAAGATGCCACCGCCGCAGCAAGGTCCCTGC110                     
     MetProProProGlnGlnGlyProCys                                               
     15                                                                        
     GGCCACCACCTCCTCCTCCTCCTGGCCCTGCTGCTGCCCTCGCTGCCC158                       
     GlyHisHisLeuLeuLeuLeuLeuAlaLeuLeuLeuProSerLeuPro                          
     10152025                                                                  
     CTGACCCGCGCCCCCGTGCCCCCAGGCCCAGCCGCCGCCCTGCTCCAG206                       
     LeuThrArgAlaProValProProGlyProAlaAlaAlaLeuLeuGln                          
     303540                                                                    
     GCTCTAGGACTGCGCGATGAGCCCCAGGGTGCCCCCAGGCTCCGGCCG254                       
     AlaLeuGlyLeuArgAspGluProGlnGlyAlaProArgLeuArgPro                          
     455055                                                                    
     GTTCCCCCGGTCATGTGGCGCCTGTTTCGACGCCGGGACCCCCAGGAG302                       
     ValProProValMetTrpArgLeuPheArgArgArgAspProGlnGlu                          
     606570                                                                    
     ACCAGGTCTGGCTCGCGGCGGACGTCCCCAGGGGTCACCCTGCAACCG350                       
     ThrArgSerGlySerArgArgThrSerProGlyValThrLeuGlnPro                          
     758085                                                                    
     TGCCACGTGGAGGAGCTGGGGGTCGCCGGAAACATCGTGCGCCACATC398                       
     CysHisValGluGluLeuGlyValAlaGlyAsnIleValArgHisIle                          
     9095100105                                                                
     CCGGACCGCGGTGCGCCCACCCGGGCCTCGGAGCCTGTCTCGGCCGCG446                       
     ProAspArgGlyAlaProThrArgAlaSerGluProValSerAlaAla                          
     110115120                                                                 
     GGGCATTGCCCTGAGTGGACAGTCGTCTTCGACCTGTCGGCTGTGGAA494                       
     GlyHisCysProGluTrpThrValValPheAspLeuSerAlaValGlu                          
     125130135                                                                 
     CCCGCTGAGCGCCCGAGCCGGGCCCGCCTGGAGCTGCGTTTCGCGGCG542                       
     ProAlaGluArgProSerArgAlaArgLeuGluLeuArgPheAlaAla                          
     140145150                                                                 
     GCGGCGGCGGCAGCCCCGGAGGGCGGCTGGGAGCTGAGCGTGGCGCAA590                       
     AlaAlaAlaAlaAlaProGluGlyGlyTrpGluLeuSerValAlaGln                          
     155160165                                                                 
     GCGGGCCAGGGCGCGGGCGCGGACCCCGGGCCGGTGCTGCTCCGCCAG638                       
     AlaGlyGlnGlyAlaGlyAlaAspProGlyProValLeuLeuArgGln                          
     170175180185                                                              
     TTGGTGCCCGCCCTGGGGCCGCCAGTGCGCGCGGAGCTGCTGGGCGCC686                       
     LeuValProAlaLeuGlyProProValArgAlaGluLeuLeuGlyAla                          
     190195200                                                                 
     GCTTGGGCTCGCAACGCCTCATGGCCGCGCAGCCTCCGCCTGGCGCTG734                       
     AlaTrpAlaArgAsnAlaSerTrpProArgSerLeuArgLeuAlaLeu                          
     205210215                                                                 
     GCGCTACGCCCCCGGGCCCCTGCCGCCTGCGCGCGCCTGGCCGAGGCC782                       
     AlaLeuArgProArgAlaProAlaAlaCysAlaArgLeuAlaGluAla                          
     220225230                                                                 
     TCGCTGCTGCTGGTGACCCTCGACCCGCGCCTGTGCCACCCCCTGGCC830                       
     SerLeuLeuLeuValThrLeuAspProArgLeuCysHisProLeuAla                          
     235240245                                                                 
     CGGCCGCGGCGCGACGCCGAACCCGTGTTGGGCGGCGGCCCCGGGGGC878                       
     ArgProArgArgAspAlaGluProValLeuGlyGlyGlyProGlyGly                          
     250255260265                                                              
     GCTTGTCGCGCGCGGCGGCTGTACGTGAGCTTCCGCGAGGTGGGCTGG926                       
     AlaCysArgAlaArgArgLeuTyrValSerPheArgGluValGlyTrp                          
     270275280                                                                 
     CACCGCTGGGTCATCGCGCCGCGCGGCTTCCTGGCCAACTACTGCCAG974                       
     HisArgTrpValIleAlaProArgGlyPheLeuAlaAsnTyrCysGln                          
     285290295                                                                 
     GGTCAGTGCGCGCTGCCCGTCGCGCTGTCGGGGTCCGGGGGGCCGCCG1022                      
     GlyGlnCysAlaLeuProValAlaLeuSerGlySerGlyGlyProPro                          
     300305310                                                                 
     GCGCTCAACCACGCTGTGCTGCGCGCGCTCATGCACGCGGCCGCCCCG1070                      
     AlaLeuAsnHisAlaValLeuArgAlaLeuMetHisAlaAlaAlaPro                          
     315320325                                                                 
     GGAGCCGCCGACCTGCCCTGCTGCGTGCCCGCGCGCCTGTCGCCCATC1118                      
     GlyAlaAlaAspLeuProCysCysValProAlaArgLeuSerProIle                          
     330335340345                                                              
     TCCGTGCTCTTCTTTGACAACAGCGACAACGTGGTGCTGCGGCAGTAT1166                      
     SerValLeuPhePheAspAsnSerAspAsnValValLeuArgGlnTyr                          
     350355360                                                                 
     GAGGACATGGTGGTGGACGAGTGCGGCTGCCGCTAACCCGGGGCGGGCAGGGA1219                 
     GluAspMetValValAspGluCysGlyCysArg                                         
     365370                                                                    
     CCCGGGCCCAACAATAAATGCCGCGTGG1247                                          
     (2) INFORMATION FOR SEQ ID NO:33:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 372 amino acids                                               
     (B) TYPE: amino acid                                                      
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:33:                                  
     MetProProProGlnGlnGlyProCysGlyHisHisLeuLeuLeuLeu                          
     151015                                                                    
     LeuAlaLeuLeuLeuProSerLeuProLeuThrArgAlaProValPro                          
     202530                                                                    
     ProGlyProAlaAlaAlaLeuLeuGlnAlaLeuGlyLeuArgAspGlu                          
     354045                                                                    
     ProGlnGlyAlaProArgLeuArgProValProProValMetTrpArg                          
     505560                                                                    
     LeuPheArgArgArgAspProGlnGluThrArgSerGlySerArgArg                          
     65707580                                                                  
     ThrSerProGlyValThrLeuGlnProCysHisValGluGluLeuGly                          
     859095                                                                    
     ValAlaGlyAsnIleValArgHisIleProAspArgGlyAlaProThr                          
     100105110                                                                 
     ArgAlaSerGluProValSerAlaAlaGlyHisCysProGluTrpThr                          
     115120125                                                                 
     ValValPheAspLeuSerAlaValGluProAlaGluArgProSerArg                          
     130135140                                                                 
     AlaArgLeuGluLeuArgPheAlaAlaAlaAlaAlaAlaAlaProGlu                          
     145150155160                                                              
     GlyGlyTrpGluLeuSerValAlaGlnAlaGlyGlnGlyAlaGlyAla                          
     165170175                                                                 
     AspProGlyProValLeuLeuArgGlnLeuValProAlaLeuGlyPro                          
     180185190                                                                 
     ProValArgAlaGluLeuLeuGlyAlaAlaTrpAlaArgAsnAlaSer                          
     195200205                                                                 
     TrpProArgSerLeuArgLeuAlaLeuAlaLeuArgProArgAlaPro                          
     210215220                                                                 
     AlaAlaCysAlaArgLeuAlaGluAlaSerLeuLeuLeuValThrLeu                          
     225230235240                                                              
     AspProArgLeuCysHisProLeuAlaArgProArgArgAspAlaGlu                          
     245250255                                                                 
     ProValLeuGlyGlyGlyProGlyGlyAlaCysArgAlaArgArgLeu                          
     260265270                                                                 
     TyrValSerPheArgGluValGlyTrpHisArgTrpValIleAlaPro                          
     275280285                                                                 
     ArgGlyPheLeuAlaAsnTyrCysGlnGlyGlnCysAlaLeuProVal                          
     290295300                                                                 
     AlaLeuSerGlySerGlyGlyProProAlaLeuAsnHisAlaValLeu                          
     305310315320                                                              
     ArgAlaLeuMetHisAlaAlaAlaProGlyAlaAlaAspLeuProCys                          
     325330335                                                                 
     CysValProAlaArgLeuSerProIleSerValLeuPhePheAspAsn                          
     340345350                                                                 
     SerAspAsnValValLeuArgGlnTyrGluAspMetValValAspGlu                          
     355360365                                                                 
     CysGlyCysArg                                                              
     370                                                                       
     __________________________________________________________________________

Claims

1. A method for identifying a tissue source of epithelial cells in which expression of a cellular gene can be modulated by a compound, said cellular gene encoding a protein comprising a naturally-occurring polypeptide, the amino acid sequence of said polypeptide comprising

(I) a sequence having at least 70% amino acid sequence homology with the C-terminal seven-cysteine domain of human OP-1, residues 38-139 of SEQ ID No:5, or

(II) a sequence defined by generic sequence 6, SEQ ID No:31

(a) incubating at least two preparations comprising epithelial cells derived from at least two different tissues of an organism with said compound for a time sufficient to allow said compound to modulate expression of said cellular gene, wherein one of said at least two preparations is derived from brain or renal tissue, and wherein said compound has been shown to modulate expression of said cellular gene in epithelial cells which express said cellular gene; and

(b) assaying said at least two preparations of epithelial cells for the presence of or the amount of said protein expressed by said cellular gene, wherein a change in the level of said protein relative to the level thereof in said at least two preparations of epithelial cells in the absence of said compound identifies a tissue source of epithelial cells in which expression of said cellular gene can be modulated by said compound.

2. The method of claim 1 wherein the presence of or the amount of said protein is assayed with an antibody reactive with said naturally-occurring polypeptide.

3. The method of claim 1 wherein the presence of or the amount of said protein is assayed by measuring cellular proliferation in cells which are sensitive to the concentration of said protein expressed by said cellular gene.

4. A method for identifying a tissue source of epithelial cells in which expression of a cellular gene can be modulated by a compound said cellular gene encoding a naturally-occurring polypeptide, said polypeptide being selected from the group consisting of polypeptides depicted in Table II and naturally-occurring variants thereof, provided that any said variant when dimerized, forms a homodimer protein having the property of inducing a developmental cascade of tissue-specific morphogenesis culminating in the formation of a mammalian body tissue, said method comprising the steps of:

(a) incubating at least two preparations comprising epithelial cells derived from at least two different tissues of an organism with said compound for a time sufficient to allow said compound to modulate expression of said cellular gene, wherein one of said at least two preparations is derived from brain or renal tissue, and wherein said compound has been shown to modulate expression of said cellular gene in epithelial cells which express said cellular gene; and

(b) assaying said at least two preparations of epithelial cells for the presence of or the amount of said naturally-occurring polypeptide expressed by said cellular gene, wherein a change in the level of said polypeptide relative to the level thereof in said at least two preparations of epithelial cells in the absence of said compound identifies a tissue source of epithelial cells in which expression of said cellular gene can be modulated by said compound.

5. The method of claim 4 wherein the presence of or the amount of said naturally-occurring polypeptide is determined using a nucleic acid probe that hybridizes, under stringent conditions, with RNA transcribed from said cellular gene.

6. The method of claim 5 wherein said nucleic acid probe hybridizes with an RNA transcript having a nucleic acid sequence encoding an amino acid sequence comprising at least residue 1 to residue 38 of human OP-1, SEQ ID No:5.

7. The method of claim 5 wherein said nucleic acid probe hybridizes with an RNA transcript having a nucleic acid sequence which corresponds to an untranslated non-coding sequence downstream from the 3' terminus of a nucleic acid sequence encoding human OP-1, SEQ ID No:16.

8. A method for identifying a tissue source of epithelial cells in which expression of a cellular gene can be modulated by a compound, said cellular gene encoding a protein comprising a naturally-occurring polypeptide, the amino acid sequence of said polypeptide comprising

(I) a sequence having at least 70% amino acid sequence homology with the C-terminal seven-cysteine domain of human OP-1, residues 38-139 of SEQ ID No:5; or,

(II) a sequence defined by generic sequence 6, SEQ ID No:31, or,

(III) an amino acid sequence selected from the group consisting of amino acid sequences depicted in Table II and naturally-occurring variants thereof,

(a) incubating a preparation comprising epithelial cells derived from a tissue of an organism with said compound for a time sufficient to allow said compound to modulate expression of said cellular gene, wherein said compound has been shown to modulate expression of said cellular gene in epithelial cells which express said cellular gene; and,

(b) assaying said preparation of epithelial cells for the presence of or the amount of said protein expressed by said cellular gene, wherein a change in the level of said protein relative to the level thereof in said preparations of epithelial cells in the absence of said compound identifies a tissue source of epithelial cells in which expression of said cellular gene can be modulated by said compound.