LL-1 tumor specific genes

Info

Patent number: 5811519
Type: Grant
Filed: Jan 27, 1997
Date of Patent: Sep 22, 1998
Assignee: Ludwig Institute for Cancer Research (New York, NY)
Inventors: Bernard Lethe (Brussels), Sophie Lucas (Brussels), Charles De Smet (Brussels), Daniele Godelaine (Brussels), Thierry Boon-Falleur (Brussels)
Primary Examiner: Sheela Huff
Law Firm: Wolf, Greenfield & Sacks, P.C.
Application Number: 8/791,495

Abstract

The invention describes the LL-1 tumor specific gene family, including fragments and biologically functional variants thereof. Also included are polypeptides and fragments thereof encoded by such genes, and antibodies relating thereto. Methods and products also are provided for diagnosing and treating conditions characterized by expression of a LL-1 gene product.

Description

Description

FIELD OF THE INVENTION

This invention relates to nucleic acid molecules and encoded polypeptides which are expressed preferentially in tumors. The nucleic acid molecules and encoded polypeptides are useful in, inter alia, diagnostic and therapeutic contexts.

BACKGROUND OF THE INVENTION

The phenotypic changes which distinguish a tumor cell from its normal counterpart are often the result of one or more changes to the genome of the cell. The genes which are expressed in tumor cells, but not in normal counterparts, can be termed "tumor specific" genes. These tumor specific genes are markers for the tumor phenotype. The expression of tumor specific genes can also be an essential event in the process of tumorigenesis.

Typically, the host recognizes as foreign the tumor specific genes which are not expressed in normal non-tumorigenic cells. Thus, the expression of tumor specific genes can provoke an immune response against the tumor cells by the host. Tumor specific genes can also be expressed in normal cells within certain tissues without provoking an immune response. In such tissues, expression of the gene and/or presentation of an ordinarily immunologically recognizable fragment of the protein product on the cell surface may not provoke an immune response because the immune system does not "see" the cells inside these immunologically privileged tissues. Examples of immunologically privileged tissues include brain and testis

The discovery of tumor specific expression of a gene provides a means of identifying a cell as a tumor cell. Diagnostic compounds can be based on the tumor specific gene, and used to determine the presence and location of tumor cells. Further, when the tumor specific gene is essential for an aspect of the tumor phenotype (e.g., unregulated growth or metastasis), the tumor specific gene can be used to provide therapeutics such as antisense nucleic acids which can reduce or substantially eliminate expression of that gene, thereby reducing or substantially eliminating the phenotypic aspect which depends on the expression of the particular tumor specific gene.

As previously noted, the polypeptide products of tumor specific genes can be the targets for host immune surveillance and provoke selection and expansion of one or more clones of cytotoxic T lymphocytes specific for the tumor specific gene product. Examples of this phenomenon include proteins and fragments thereof encoded by the MAGE family of genes, the tyrosinase gene, the Melan-A gene, the BAGE gene, the GAGE gene, the RAGE family of genes, the PRAME gene and the brain glycogen phosphorylase gene, as are detailed below. Thus, tumor specific expression of genes suggests that such genes can encode proteins which will be recognized by the immune system as foreign and thus provide a target for tumor rejection. Such genes encode "tumor rejection antigen precursors", or TRAPs, which may be used to generate therapeutics for enhancement of the immune system response to tumors expressing such genes and proteins.

The process by which the mammalian immune system recognizes and reacts to foreign or alien materials is a complex one. An important facet of the system is the T cell response. This response requires that T cells recognize and interact with complexes of cell surface molecules, referred to as human leukocyte antigens ("HLA"), or major histocompatibility complexes ("MHCs"), and peptides. The peptides are derived from larger molecules which are processed by the cells which also present the HLA/MHC molecule. See in this regard Male et al., Advanced Immunology (J. P. Lipincott Company, 1987), especially chapters 6-10. The interaction of T cells and complexes of HLA/peptide is restricted, requiring a T cell specific for a particular combination of an HLA molecule and a peptide. If a specific T cell is not present, there is no T cell response even if its partner complex is present. Similarly, there is no response if the specific complex is absent, but the T cell is present. The mechanism is involved in the immune system's response to foreign materials, in autoimmune pathologies, and in responses to cellular abnormalities. Much work has focused on the mechanisms by which proteins are processed into the HLA binding peptides. See, in this regard, Barinaga, Science 257: 880, 1992; Fremont et al., Science 257: 919, 1992; Matsumura et al., Science 257: 927, 1992; Latron et al., Science 257: 964, 1992.

The mechanism by which T cells recognize cellular abnormalities has also been implicated in cancer. For example, in PCT application PCT/US92/04354, filed May 22, 1992, published on Nov. 26, 1992, and incorporated by reference, a family of genes is disclosed, which are processed into peptides which, in turn, are expressed on cell surfaces, which can lead to lysis of the tumor cells by specific CTLs. The genes are said to code for "tumor rejection antigen precursors" or "TRAP" molecules, and the peptides derived therefrom are referred to as "tumor rejection antigens" or "TRAs". See Traversari et al., J Exp. Med. 176:1453-1457, 1992; van der Bruggen et al., Science 254: 1643,1991; De Plaen et al., Immunogenetics 40:360-369, 1994 for further information on this family of genes. Also, see U.S. patent application Ser. No. 807,043, filed Dec. 12, 1991, now U.S. Pat. No. 5,342,774.

In U.S. patent application Ser. No. 938,334, now U.S. Pat. No. 5,405,940, the disclosure of which is incorporated by reference, nonapeptides are taught which are presented by the HLA-A1 molecule. The reference teaches that given the known specificity of particular peptides for particular HLA molecules, one should expect a particular peptide to bind one HLA molecule, but not others. This is important, because different individuals possess different HLA phenotypes. As a result, while identification of a particular peptide as being a partner for a specific HLA molecule has diagnostic and therapeutic ramifications, these are only relevant for individuals with that particular HLA phenotype. There is a need for further work in the area, because cellular abnormalities are not restricted to one particular HLA phenotype, and targeted therapy requires some knowledge of the phenotype of the abnormal cells at issue.

In U.S. patent application Ser. No. 008,446, filed Jan. 22, 1993 and incorporated by reference, the fact that the MAGE-1 expression product is processed to a second TRA is disclosed. This second TRA is presented by HLA-Cw16 molecules, also known as HLA-C*1601. The disclosure shows that a given TRAP can yield a plurality of TRAs.

In U.S. patent application Ser. No. 994,928, filed Dec. 22, 1992, and incorporated by reference herein, tyrosinase is described as a tumor rejection antigen precursor. This reference discloses that a molecule which is produced by some normal cells (e.g., melanocytes), is processed in tumor cells to yield a tumor rejection antigen that is presented by HLA-A2 molecules.

In U.S. patent application Ser. No. 08/032,978, filed Mar. 18, 1993, and incorporated herein by reference in its entirety, a second TRA, not derived from tyrosinase is taught to be presented by HLA-A2 molecules. The TRA is derived from a TRAP, but is coded for by a known MAGE gene. This disclosure shows that a particular HLA molecule may present TRAs derived from different sources.

In U.S. patent application Ser. No. 079,110, filed Jun. 17, 1993 and entitled "Isolated Nucleic Acid Molecules Coding For BAGE Tumor Rejection Antigen Precursors" and Ser. No. 196,630, filed Feb. 15, 1994, and entitled "Isolated Peptides Which form Complexes with MHC Molecule HLA-C-Clone 10 and Uses Thereof" the entire disclosures of which are incorporated herein by reference, an unrelated tumor rejection antigen precursor, the so-called "BAGE" precursor, is described. TRAs are derived from the TRAP and also are described. They form complexes with MHC molecule HLA-C-Clone 10.

In U.S. patent application Ser. No. 096,039, filed Jul. 22, 1993 and entitled "Isolated Nucleic Acid Molecules Coding for GAGE Tumor Rejection Antigen Precursors" and Ser. No. 250,162, filed May 27, 1994 and entitled "Method for Diagnosing a Disorder by Determining Expression of GAGE Tumor Rejection Antigen Precursors", the entire disclosures of which are incorporated herein by reference, another unrelated tumor rejection antigen precursor, the so-called "GAGE" precursor, is described. The GAGE precursor is not related to the BAGE or the MAGE family.

In U.S. patent application Ser. No. 08/408,015, filed Mar. 21, 1995, and entitled "RAGE Tumor Rejection Antigen Precursors", incorporated herein by reference in its entirety, another TRAP is taught which is not derived from any of the foregoing genes. The TRAP is referred to as RAGE. In U.S. patent application Ser. No. 08/530,015, filed Sep. 20, 1995, and entitled "Isolated RAGE-1 Derived Peptides Which Complex with HLA-B7 Molecules and Uses Thereof", also incorporated by reference, the TRA derived form one member of the RAGE family of genes is taught to be presented by HLA-B7 molecules. This disclosure shows that additional TRAPs and TRAs can be derived from different sources.

In U.S. patent application Ser. No. 08/253,503, filed Jun. 3, 1994, and entitled "Isolated Nucleic Acid Molecule Which Codes for a Tumor Rejection Antigen Precursor Which is Processed to an Antigen Presented by HLA-B44", incorporated herein by reference in its entirety, another TRAP is taught which is not derived from any of the foregoing genes. The gene encoding the TRAP is referred to as MUM-1. A tumor rejection antigen, LB-33B, is described in the application.

In U.S. patent application Ser. No. 08/373,636, filed Jan. 17, 1995, and entitled "Isolated Nucleic Acid Molecule Which Codes for a Tumor Rejection Antigen Precursor Which is Processed to Antigens Presented by HLA Molecules and Uses Thereof", incorporated herein by reference in its entirety, other TRAPs are taught which are derived from LB33 and presented by HLA-B13, HLA-Cw6, HLA-A28 and HLA-A24.

In PCT publication WO96/10577, published Apr. 11, 1996, and entitled "Isolated Nucleic Acid Molecule Coding for a Tumor Rejection Antigen Precursor DAGE and Uses Thereof", incorporated herein by reference in its entirety, another TRAP is taught which is not derived from any of the foregoing genes. The TRAP was referred to as DAGE, but is now referred to as PRAME. A tumor rejection antigen is described in the application which is presented by HLA-A24.

In U.S. patent application Ser. No. 08/487,135, filed Jun. 7, 1995, and entitled "Isolated Nucleic Acid Molecule, Peptides Which Form Complexes with MHC Molecule HLA-A2 and Uses Thereof", incorporated herein by reference in its entirety, another TRAP is taught which is not derived from any of the foregoing genes. The TRAP is referred to as NAG. Various TRAs derived from NAG and presented by HLA-A2 are taught in this application.

In U.S. patent application Ser. No. 08/403,388, filed Mar. 14, 1995, and entitled "Isolated Nucleic Acid Molecules Which Are Members of the MAGE-Xp Family and Uses Thereof", incorporated herein by reference in its entirety, three TRAPs are taught which are not derived from any of the foregoing genes. These TRAPs are referred to as MAGE-Xp2, MAGE-Xp3 and MAGE-Xp4.

The work which is presented by the papers, patents and patent applications described above deal, for the most part, with the MAGE family of genes, the BAGE gene, the GAGE gene and the RAGE family of genes. It now has been discovered that additional genes similarly are expressed in a tumor specific pattern.

These genes which are believed to encode tumor rejection antigen precursors are referred to as LL-1 tumor specific genes. They do not show homology to the MAGE family of genes, to the BAGE gene, the GAGE gene, the RAGE family of genes, the LB33/MUM-1 gene, the PRAME gene, the NAG gene or the MAGE-Xp family of genes. Thus the invention relates to the genes expressed specifically in certain tumor cells, tumor rejection antigen precursors encoded by such genes, as well as related molecules and applications of these various entities.

The invention is elaborated upon further in the disclosure which follows.

SUMMARY OF THE INVENTION

The invention provides isolated nucleic acid molecules, unique fragments of those molecules, expression vectors containing the foregoing, and host cells transfected with those molecules. The invention also provides isolated polypeptides and agents which bind such polypeptides, including antibodies. Kits for detecting the presence of a LL-1 tumor specific polypeptide precursor additionally are provided. The foregoing can be used in the diagnosis or treatment of conditions characterized by the expression of a LL-1 tumor-specific polypeptide or precursor thereof.

According to one aspect of the invention, an isolated nucleic acid molecule is provided. The molecule hybridizes under stringent conditions to a molecule selected from the group consisting of the nucleic acid sequence of SEQ ID NO:4, the nucleic acid sequence of SEQ ID NO:6 and the nucleic acid sequence of SEQ ID NO:8. The isolated nucleic acid molecule is a LL-1 tumor specific polypeptide precursor and codes for a LL-1 tumor specific polypeptide. The invention further embraces nucleic acid molecules that differ from the foregoing isolated nucleic acid molecules in codon sequence to the degeneracy of the genetic code. The invention also embraces complements of the foregoing nucleic acids.

In preferred embodiments, the isolated nucleic acid molecule comprises a molecule selected from the group consisting of the nucleic acid sequence of SEQ ID NO:4, the nucleic acid sequence of SEQ ID NO:6 and the nucleic acid sequence of SEQ ID NO:8. More preferably, the isolated nucleic acid molecule comprises a molecule selected from the group consisting of the coding region of the nucleic acid sequence of SEQ ID. NO:4, the coding region of the nucleic acid sequence of SEQ ID NO:6 and the coding region of the nucleic acid sequence of SEQ ID NO:8.

According to another aspect of the invention, an isolated nucleic acid molecule is provided which comprises a molecule selected from the group consisting of a unique fragment of nucleotides 1-993 of SEQ ID NO:4 between 12 and 992 nucleotides in length, a unique fragment of nucleotides 1-746 of SEQ ID NO:6 between 12 and 745 nucleotides in length, a unique fragment of nucleotides 1-744 of SEQ ID NO:8 between 12 and 743 nucleotides in length, and complements thereof. In preferred embodiments, the unique fragment is at least 14, 15, 16, 17, 18, 20 or 22 contiguous nucleotides of nucleotides 1-993 of SEQ ID NO:4, nucleotides 1-746 SEQ ID NO:6 or nucleotides 1-744 SEQ ID NO:8, or complements thereof. In another embodiment, the isolated nucleic acid molecule consists of between 12 and 32 contiguous nucleotides of nucleotides 1-993 of SEQ ID NO:4, nucleotides 1-746 of SEQ ID NO:6 or nucleotides 1-744 of SEQ ID NO:8, or complements of such nucleic acid molecules.

According to another aspect of the invention, the invention involves expression vectors, and host cells transformed or transfected with such expression vectors, comprising the nucleic acid molecules described above. The expression vectors optionally include a nucleic acid molecule which codes for a HLA molecule. Of course, an HLA-encoding nucleic acid molecule can also be contained in a separate expression vector.

According to another aspect of the invention, an isolated polypeptide encoded by a nucleic acid molecule which hybridizes under stringent conditions to a molecule selected from the group consisting of the nucleic acid sequence of SEQ ID NO:4, the nucleic acid sequence of SEQ ID NO:6 and the nucleic acid sequence of SEQ ID NO:8, nucleic acid molecules which vary from the foregoing according to the degeneracy of the genetic code, and complements of any of the foregoing nucleic acid molecules.

According to yet another aspect of the invention, an isolated polypeptide is provided which comprises a molecule selected from the group consisting of a unique fragment of SEQ ID NO:5 between 9 and 209 amino acids in length, a unique fragment of SEQ ID NO:7 between 9 and 179 amino acids in length and a unique fragment of SEQ ID NO:9 between 9 and 179 amino acids in length. Preferably, the unique fragment of the isolated polypeptide binds to a polypeptide-binding agent. In other preferred embodiments, the unique fragment of the isolated polypeptide binds to an antibody or a cytotoxic T lymphocyte.

As used herein, a "polypeptide-binding agent" includes antibodies, cytotoxic T lymphocytes, polypeptides including HLA molecules, fragments of such polypeptides, nucleic acids, peptides of degenerate peptide libraries, combinatorial peptide libraries and phage display libraries (in solution or in immobilized form), phages bearing peptides as part of a phage display library, molecules from libraries synthesized of peptoids and/or non-peptide synthetic moieties.

The invention also provides isolated polypeptides which selectively bind a LL-1 protein or fragments thereof. Isolated binding polypeptides include antibodies and fragments of antibodies (e.g., Fab, F(ab).sub.2, Fd and antibody fragments which include a CDR III region which binds selectively to the LL-1 proteins of the invention). The isolated binding polypeptides include monoclonal antibodies.

The invention in another aspect involves a kit for detecting the presence of the expression of a LL-1 tumor specific polypeptide precursor. Such kits employ two or more of the above-described molecules isolated in separate containers and packaged in a single package. In one such kit, a pair of isolated nucleic acid molecules is provided, each of the pair consisting essentially of a molecule selected from the group consisting of a 12-32 nucleotide contiguous segment of SEQ ID NO:4 and complements thereof, a 12-32 nucleotide contiguous segment of SEQ ID NO:6 and complements thereof, and a 12-32 nucleotide contiguous segment of SEQ ID NO:8 and complements thereof, and wherein the contiguous segments are nonoverlapping. Preferably, the pair of isolated nucleic acid molecules is constructed and arranged to selectively amplify an isolated nucleic acid molecule which hybridizes under stringent conditions to a molecule selected from the group consisting of the nucleic acid sequence of SEQ ID NO:4, the nucleic acid sequence of SEQ ID NO:6, the nucleic acid sequence of SEQ ID NO:8, nucleic acid molecules which differ from the above in codon sequence due to the degeneracy of the genetic code and complements thereof. In certain embodiments, the pair of isolated nucleic acid molecules is PCR primers. Preferably one of the primers is a contiguous segment of SEQ ID NO:4 and another of the primers is a complement of another contiguous segment of SEQ ID NO:4. In other preferred embodiments, one of the primers is a contiguous segment of SEQ ID NO:6 and another of the primers is the complement of another contiguous segment of SEQ ID NO:6. In still other preferred embodiments, one of the primers is a contiguous segment of SEQ ID NO:8 and another of the primers is the complement of another contiguous segment of SEQ ID NO:8.

According to still another aspect of the invention, a method for diagnosing a disorder characterized by the expression of a LL-1 tumor specific polypeptide coded for by a LL-1 tumor specific polypeptide precursor nucleic acid molecule is provided. The method involves contacting a biological sample isolated from a subject with an agent that is specific for the nucleic acid molecule or an expression product thereof. In certain embodiments, the nucleic acid molecule hybridizes under stringent conditions to a molecule selected from the group consisting of the nucleic acid sequence of SEQ ID NO:4, the nucleic acid sequence of SEQ ID NO:6 and the nucleic acid sequence of SEQ ID NO:8, and which codes for a tumor specific polypeptide. In other embodiments, the agent is a binding agent which selectively binds to a LL-1 tumor specific polypeptide, such as an antibody, cytotoxic T lymphocyte, polypeptide, and the like. The method further involves determining the interaction between the agent and the nucleic acid molecule or expression product thereof as a determination of the disorder. In preferred embodiments, the agent is a DNA molecule comprising SEQ ID NO:4, SEQ ID NO:6 or SEQ ID NO:8, or a unique fragment thereof. In certain embodiments, the interaction between the agent and the nucleic acid molecule is determined by amplifying at least a portion of the nucleic acid molecule.

According to another aspect of the invention, a method for treating a subject with a disorder characterized by expression of an LL-1 tumor specific polypeptide is provided. A method involves administering to the subject an amount of an agent, which agent enriches selectively in the subject the presence of complexes of an HLA molecule and a tumor rejection antigen which is derived from a LL-1 tumor specific polypeptide coded for by one of the foregoing nucleic acid molecules.

These and other objects of the invention will be described in further detail in connection with the detailed description of the invention.

BRIEF DESCRIPTION OF THE FIGURES

FIG. 1 depicts the nucleotide sequences of LL-1 clones 2, 3 and 4.

BRIEF DESCRIPTION OF THE SEQUENCES

SEQ ID NO:1 is the nucleotide sequence of LL-1 clone 1.

SEQ ID NO:2 is the nucleotide sequence of primer SL25.

SEQ ID NO:3 is the nucleotide sequence of primer BLE56.

SEQ ID NO:4 is the nucleotide sequence of LL-1.1 clone 2.

SEQ ID NO:5 is the amino acid sequence of the translation product of LL-1.1 clone 2.

SEQ ID NO:6 is the nucleotide sequence of LL-1.1 clone 4.

SEQ ID NO:7 is the amino acid sequence of the translation product of LL-1.1 clone 4.

SEQ ID NO:8 is the nucleotide sequence of LL-1.2 clone 3.

SEQ ID NO:9 is the amino acid sequence of the translation product of LL-1.2 clone 3.

SEQ ID NO:10 is the nucleotide sequence of primer BLE70.

SEQ ID NO:11 is the nucleotide sequence of primer BLE71.

SEQ ID NO:12 is the nucleotide sequence of primer BLE72.

SEQ ID NO:13 is the nucleotide sequence of primer BLE73.

SEQ ID NO:14 is the nucleotide sequence of primer BLE74.

DETAILED DESCRIPTION OF THE INVENTION

The examples which follow show the isolation of nucleic acid molecules which code for polypeptides and are expressed preferentially in tumor samples and tumor-derived cell lines. These isolated nucleic acid molecules, however, are not homologous with any of the previously disclosed coding sequences described in the references set forth supra. Hence, one aspect of the invention is an isolated nucleic acid molecule which includes all or a unique portion of the nucleotide sequence set forth in SEQ ID NO:4, SEQ ID NO:6 or SEQ ID NO:8. These sequences are not MAGE, BAGE, GAGE, RAGE, LB33/MUM-1, PRAME, NAG or MAGE-Xp sequences, as will be seen by comparing them to the sequence of any of the genes described in the references.

The invention thus involves LL-1 genes, polypeptides encoded by those genes, functional modifications and variants of the foregoing, useful fragments of the foregoing, as well as therapeutics and diagnostics related thereto.

Also a part of the invention are those nucleic acid sequences which also code for a LL-1 tumor specific polypeptide and which hybridize to a nucleic acid molecule consisting of the nucleotide sequence set forth in SEQ ID NO:4, SEQ ID NO:6 or SEQ ID NO:8, under stringent conditions. Such nucleic acids are termed tumor specific polypeptide precursors, and may be DNA, RNA, or composed of mixed deoxyribonucleotides and ribonucleotides. The tumor specific polypeptide precursors can also incorporate synthetic non-natural nucleotides. A tumor specific nucleic acid or polypeptide is a nucleic acid or polypeptide expressed preferentially in tumor cells. Various methods for determining the expression of a nucleic acid and/or a polypeptide in normal and tumor cells are known to those of skill in the art and are described further below. As used herein, tumor specific polypeptides include proteins, protein fragments, and peptides. In particular, tumor specific polypeptides include TRAPs and TRAs.

The term "stringent conditions" as used herein refers to parameters with which the art is familiar. More specifically, stringent conditions, as used herein, refers to hybridization at 65.degree. C. in hybridization buffer (3.5.times.SSC, 0.02% Ficoll, 0.02% polyvinyl pyrolidone, 0.02% Bovine Serum Albumin, 25 mM NaH.sub.2 PO.sub.4 (pH 7), 0.5% SDS, 2 mM EDTA). SSC is 0.15M sodium chloride/0.15M sodium citrate, pH 7; SDS is sodium dodecyl sulphate; and EDTA is ethylenediaminetetracetic acid. After hybridization, the membrane upon which the nucleic acid is transferred is washed at 2.times.SSC at room temperature and then at 0.1.times.SSC/0.1.times.SDS at 65.degree. C. SSC is 0.15M sodium chloride/0.15M sodium citrate, pH 7; SDS is sodium dodecyl sulphate; and EDTA is ethylenediamine tetraacetic acid.

There are other conditions, reagents, and so forth which can be used, which result in the same degree of stringency (see, e.g. Molecular Cloning: A Laboratory Manual, J. Sambrook, et al., eds., Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989, or Current Protocols in Molecular Biology, F. M. Ausubel, et al., eds., John Wiley & Sons, Inc., New York). The skilled artisan will be familiar with such conditions, and thus they are not given here. It will be understood, however, that the skilled artisan will be able to manipulate the conditions in a manner to permit the clear identification of homologs and alleles of LL-1 nucleic acid molecules of the invention. The skilled artisan also is familiar with the methodology for screening cells, preferably cancer cells, and libraries for expression of such molecules which then are routinely isolated, followed by isolation of the pertinent nucleic acid and sequencing.

In general homologs and alleles typically will share at least 40% nucleotide identity and/or at least 50% amino acid identity to the coding region of SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8, in some instances will share at least 50% nucleotide identity and/or at least 65% amino acid identity and in still other instances will share at least 60% nucleotide identity and/or at least 75% amino acid identity. Watson-Crick complements of the foregoing nucleic acids also are embraced by the invention.

The nucleic acids disclosed herein are useful for determining the expression of LL-1 genes according to standard hybridization procedures. The nucleic acids also can be used to express LL-1 polypeptides in vitro or in vivo. The nucleic acids also can be used to prepare fragments of LL-1 polypeptides useful for e.g., preparation of antibodies. Many other uses will be apparent to the skilled artisan.

In screening for LL-1 family members, a Southern blot may be performed using the foregoing conditions, together with a radioactive probe. After washing the membrane to which the nucleic acid is finally transferred, the membrane can be placed against x-ray film to detect the radioactive signal.

The invention also includes degenerate nucleic acids which include alternative codons to those present in the native materials. For example, serine residues are encoded by the codons TCA, AGT, TCC, TCG, TCT and AGC. Each of the six codons is equivalent for the purposes of encoding a serine residue. Thus, it will be apparent to one of ordinary skill in the art that any of the serine-encoding nucleotide triplets may be employed to direct the protein synthesis apparatus, in vitro or in vivo, to incorporate a serine residue. Similarly, nucleotide sequence triplets which encode other amino acid residues include, but are not limited to: CCA, CCC, CCG and CCT (proline codons); CGA, CGC, CGG, CGT, AGA and AGG (arginine codons); ACA, ACC, ACG and ACT (threonine codons); AAC and AAT (asparagine codons); and ATA, ATC and ATT (isoleucine codons). Other amino acid residues may be encoded similarly by multiple nucleotide sequences. Thus, the invention embraces degenerate nucleic acids that differ from the biologically isolated nucleic acids in codon sequence due to the degeneracy of the genetic code.

The invention also provides isolated unique fragments of SEQ ID NO:4, SEQ ID NO:6 or SEQ ID NO:8, or complements of SEQ ID NO:4, SEQ ID NO:6 or SEQ ID NO:8. A unique fragment is one that is a `signature` for the larger nucleic acid. It, for example, is long enough to assure that its precise sequence is not found in molecules outside of the LL-1 family as defined by claim 1. Unique fragments can be used as probes in Southern blot assays to identify family members or can be used in amplification assays such as those employing PCR. As known to those skilled in the art, large probes such as 200 nucleotides or more are preferred for certain uses such as Southern blots, while smaller fragments will be preferred for uses such as PCR. Unique fragments also can be used to produce fusion proteins for generating antibodies or for generating immunoassay components. Unique fragments further can be used as antisense molecules to inhibit the expression of the LL-1 proteins of the invention, particularly for therapeutic purposes as described in greater detail below.

As will be recognized by those skilled in the art, the size of the unique fragment will depend upon its conservancy in the genetic code. Thus, some regions of SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8, and their complements, will require longer segments to be unique while others will require only short segments, typically between 12 and 32 nucleotides (e.g. 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31 and 32 nucleotides long). Virtually any segment of SEQ ID NO:4, SEQ ID NO:6 or SEQ ID NO:8, or their complements, that is 18 or more nucleotides in length will be unique. Those skilled in the art are well versed in methods for selecting such sequences, typically on the basis of the ability of the unique fragment to selectively distinguish the sequence of interest from non-family members. A comparison of the sequence of the fragment to those on known data bases typically is all that is necessary, although in vitro confirmatory hybridization and sequencing analysis may be performed.

For any pair of PCR primers constructed and arranged to selectively amplify, for example, a LL-1.1 nucleic acid,a LL-1.1 specific primer may be used. Such a primer is a contiguous stretch of LL-1.1 which hybridizes selectively to LL-1.1 and not other LL-1 nucleic acids. Such a specific primer would fully hybridize to a contiguous stretch of nucleotides only in LL-1.1, but would hybridize at most only in part to LL-1 genes that do not share the nucleotides to which the LL-1.1 specific primer binds. For efficient PCR priming and LL-1.1 identification, the LL-1.1 specific primer should be constructed and arranged so it does not hybridize efficiently at its 3' end to LL-1 genes other than LL-1.1. The kinetics of hybridization then will strongly favor hybridization at the 5' end. In this instance, 3' initiated PCR extension will occur only when both the 5' and 3' ends hybridize to the nucleic acid. Primers for selective amplification of LL-1.1 clone 2 and/or LL-1.1 clone 4 can be selected from portions of LL-1.1 which share lesser homology with LL-1.2 (see FIG. 1; compare SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8). In such cases, the LL-1.1 specific primers can be designed to prime DNA synthesis on either strand of the LL-1.1 gene, described herein as the antisense or the sense strands. Preferably the area of non-identity is at least one to four nucleotides in length and forms the 3' end of the LL-1.1 specific primer. Such a primer would be perfectly complementary and contiguous with its complement in LL-1.1. The 3' end of the primer would hybridize to its complement in the antisense strand and initiate extension. In LL-1 genes other than LL-1.1, the lack of nucleotide sequence identity would substantially eliminate hybridization of the 3' end of the LL-1 specific primer to the antisense strand 5' of the insert. The mismatch generated at the 3' end of the primer when hybridized to LL-1 genes, other than LL-1.1, would preclude efficient amplification of those genes. Exemplary primers include BLE72 (SEQ ID NO:12) which spans nucleotides 265-283 of SEQ ID NO:6. Other primers which contain nucleotide sequences not found in LL-1.2 can be prepared by one of skill in the art by comparison of the sequences of SEQ ID NO:4 or SEQ ID NO:6 with SEQ ID NO:8. Portions of SEQ ID NO:6 identical to SEQ ID NO:4 would serve equally well as LL-1.1 specific primers. Other exemplary primers can differ from the above by addition or deletion of 1, 2, 3, 4, 5, or more nucleotides from the 5' end of the primer.

Similarly, one of ordinary skill in the art can select primers from the nucleotide sequence of SEQ ID NO:8 for selective amplification of LL-1.2 mRNA sequences. For example, exemplary primers specific for LL-1.2 include BLE73 (SEQ ID NO:13), and BLE74 (SEQ ID NO:14) which is a sense primer located in SEQ ID NO:8 at nucleotides 262-281 (homologous to the position of BLE72 in SEQ ID NO:6, e.g., nucleotides 264-283). As demonstrated in the Examples below, primer pairs specific to LL-1.1 or LL-1.2 can be used to distinguish the expression of the genes in cells and tissues. Other exemplary primers can differ from the above by addition or deletion of 1, 2, 3, 4, 5, or more nucleotides from the 5' end of the primers above. One of ordinary skill in the art can determine with no more than routine experimentation the preferred primers for selective amplification of particular LL-1 clones.

In certain cases, the primers chosen to distinguish the LL-1 clones provide amplified products which are readily distinguishable by molecular size. For example, LL-1 primers can be chosen which initiate extension on opposite sides of the splice site by hybridizing to sequences which are identical or nearly so and which hybridize 5' of the 5' splice site and 3' of the 3' splice site. Because LL-1.1 clone 2 mRNA contains a portion of the gene which is spliced out in formation of LL-1.1 clone 4 mRNA (see FIG. 1; i.e., nucleotides 469-697 of SEQ ID NO:4), amplification products derived from LL-1.1 clone 2 using such primers will be longer than amplification products derived from LL-1.1 clone 4 (by about 229 base pairs). This difference may be distinguished readily using standard methods in the art including agarose and acrylamide gel electrophoresis.

Additional methods which can distinguish nucleotide sequences of substantial homology, such as ligase chain reaction ("LCR") and other methods, will be apparent to skilled artisans.

As used herein with respect to nucleic acids, the term "isolated" means: (i) amplified in vitro by, for example, polymerase chain reaction (PCR); (ii) recombinantly produced by cloning; (iii) purified, as by cleavage and gel separation; or (iv) synthesized by, for example, chemical synthesis. An isolated nucleic acid is one which is readily manipulable by recombinant DNA techniques well known in the art. Thus, a nucleotide sequence contained in a vector in which 5' and 3' restriction sites are known or for which polymerase chain reaction (PCR) primer sequences have been disclosed is considered isolated but a nucleic acid sequence existing in its native state in its natural host is not. An isolated nucleic acid may be substantially purified, but need not be. For example, a nucleic acid that is isolated within a cloning or expression vector is not pure in that it may comprise only a tiny percentage of the material in the cell in which it resides. Such a nucleic acid is isolated, however, as the term is used herein because it is readily manipulable by standard techniques known to those of ordinary skill in the art.

The invention also provides isolated polypeptides which include unique fragments of SEQ ID NO:5, SEQ ID NO:7 and/or SEQ ID NO:9. Such polypeptides are useful, for example, alone or as fusion proteins to generate antibodies, as a components of an immunoassay, or for determining the LL-1 protein binding specificity of HLA molecules and/or CTL clones.

A unique fragment of an LL-1 protein, in general, has the features and characteristics of unique fragments as discussed above in connection with nucleic acids. As will be recognized by those skilled in the art, the size of the unique fragment will depend upon factors such as whether the fragment constitutes a portion of a conserved protein domain. Thus, some regions of SEQ ID NO:5, SEQ ID NO:7 and SEQ ID NO:9, will require longer segments to be unique while others will require only short segments, typically between 5 and 12 amino acids (e.g. 5, 6, 7, 8, 9, 10, 11 and 12 amino acids long). Virtually any segment of SEQ ID NO:5, SEQ ID NO:7 or SEQ ID NO:9, that is 10 or more amino acids in length will be unique.

Unique fragments of a polypeptide preferably are those fragments which retain a distinct functional capability of the polypeptide. Functional capabilities which can be retained in a unique fragment of a polypeptide include interaction with antibodies, interaction with other polypeptides or fragments thereof, selective binding of nucleic acids, and enzymatic activity. A tumor rejection antigen is an example of a unique fragment of a tumor specific polypeptide which retains the functional capability of HLA binding and interaction with cytotoxic T lymphocytes. Tumor rejection antigens presented by HLA class I molecules typically are 9 amino acids in length, although peptides of 8, 9 and 10 and more amino acids also retain the capability to interact with HLA and cytotoxic T lymphocyte to an extent effective to provoke a cytotoxic T lymphocyte response (see, e.g., Van den Eynde & Brichard, Curr. Opin. Immnunol. 7:674-681, 1995; Coulie et al., Stem Cells 13:393-403, 1995).

Those skilled in the art are well versed in methods for selecting unique amino acid sequences, typically on the basis of the ability of the unique fragment to selectively distinguish the sequence of interest from non-family members. A comparison of the sequence of the fragment to those on known data bases typically is all that is necessary.

The skilled artisan will also realize that conservative amino acid substitutions may be made in LL-1 polypeptides to provide functionally active homologs of the foregoing polypeptides, i.e, the homologs retain the functional capabilities of the LL-1 polypeptides. As used herein, a "conservative amino acid substitution" refers to an amino acid substitution which does not alter the relative charge or size characteristics of the protein in which the amino acid substitution is made. Conservative substitutions of amino acids include substitutions made amongst amino acids within the following groups: (a) MILV; (b) FYW; (c) KRH; (d) AG; (e) ST; (f) QN; and (g) ED.

Functionally equivalent variants of LL-1 polypeptides, i.e., variants of LL-1 polypeptides which retain the function of the natural LL-1 polypeptides, can be prepared according to methods for altering polypeptide sequence known to one of ordinary skill in the art such as are found in references which compile such methods, e.g. Molecular Cloning: A Laboratory Manual, J. Sambrook, et al., eds., Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989, or Current Protocols in Molecular Biology, F. M. Ausubel, et al., eds., John Wiley & Sons, Inc., New York. Exemplary functionally equivalent variants of the LL-1 polypeptides include conservative amino acid substitutions of SEQ ID NO:5, SEQ ID NO:7 and SEQ ID NO:9. Conservative amino-acid substitutions in the amino acid sequence of LL-1 polypeptides to produce functionally equivalent variants of LL-1 polypeptides typically are made by alteration of the nucleic acid encoding LL-1.1 polypeptides (SEQ ID NO:4, SEQ ID NO:6), and alteration of the nucleic acid encoding LL-1.2 polypeptides (SEQ ID NO:8). Such substitutions can be made by a variety of methods known to one of ordinary skill in the art. For example, amino acid substitutions may be made by PCR-directed mutation, site-directed mutagenesis according to the method of Kunkel (Kunkel, Proc. Nat. Acad. Sci. U.S.A. 82: 488-492, 1985), or by chemical synthesis of a gene encoding a LL-1 polypeptide. Where amino acid substitutions are made to a small unique fragment of a LL-1 polypeptide, such as a 9 amino acid peptide, the substitutions can be made by directly synthesizing the peptide. The activity of functionally equivalent fragments of LL-1 polypeptides can be tested by cloning the gene encoding the altered LL-1 polypeptide into a bacterial or mammalian expression vector, introducing the vector into an appropriate host cell, expressing the altered LL-1 polypeptide, and testing for a functional capability of the LL-1 polypeptides as disclosed herein.

As mentioned above, the invention embraces antisense oligonucleotides that selectively bind to a nucleic acid molecule encoding an LL-1 protein, to decrease transcription and/or translation of LL-1 genes. This is desirable in virtually any medical condition wherein a reduction in LL-1 gene product expression is desirable, including to reduce any aspect of a tumor cell phenotype attributable to LL-1 gene expression. Antisense molecules, in this maimer, can be used to slow down or arrest such aspects of a tumor cell phenotype.

As used herein, the term "antisense oligonucleotide" or "antisense" describes an oligonucleotide that is an oligoribonucleotide, oligodeoxyribonucleotide, modified oligoribonucleotide, or modified oligodeoxyribonucleotide which hybridizes under physiological conditions to DNA comprising a particular gene or to an mRNA transcript of that gene and, thereby, inhibits the transcription of that gene and/or the translation of that mRNA. The antisense molecules are designed so as to interfere with transcription or translation of a target gene upon hybridization with the target gene. Those skilled in the art will recognize that the exact length of the antisense oligonucleotide and its degree of complementarity with its target will depend upon the specific target selected, including the sequence of the target and the particular bases which comprise that sequence. It is preferred that the antisense oligonucleotide be constructed and arranged so as to bind selectively with the target under physiological conditions, i.e., to hybridize substantially more to the target sequence than to any other sequence in the target cell under physiological conditions. Based upon SEQ ID NO:4, SEQ ID NO:6 and/or SEQ ID NO:8, or upon allelic or homologous genomic and/or DNA sequences, one of skill in the art can easily choose and synthesize any of a number of appropriate antisense molecules for use in accordance with the present invention. In order to be sufficiently selective and potent for inhibition, such antisense oligonucleotides should comprise at least 7 (Wagner et al., Nature Biotechnology 14:840-844, 1996) and, more preferably, at least 15 consecutive bases which are complementary to the target. Most preferably, the antisense oligonucleotides comprise a complementary sequence of 20-30 bases. Although oligonucleotides may be chosen which are antisense to any region of the gene or mRNA transcripts, in preferred embodiments the antisense oligonucleotides correspond to N-terminal or 5' upstream sites such as translation initiation, transcription initiation or promoter sites. In addition, 3'-untranslated regions may be targeted. Targeting to mRNA splicing sites has also been used in the art but may be less preferred if alternative mRNA splicing occurs. In addition, the antisense is targeted, preferably, to sites in which mRNA secondary structure is not expected (see, e.g., Sainio et al., Cell Mol. Neurobiol. 14(5):439-457, 1994) and at which proteins are not expected to bind. Finally, although, SEQ ID NOs:4, 6, and 8 disclose cDNA sequences, one of ordinary skill in the art may easily derive the genomic DNA corresponding to the cDNAs of SEQ ID NOs:4, 6, and 8. Thus, the present invention also provides for antisense oligonucleotides which are complementary to the genomic DNA corresponding to SEQ ID NOs:4, 6, and 8. Similarly, antisense to allelic or homologous DNAs and genomic DNAs are enabled without undue experimentation.

In one set of embodiments, the antisense oligonucleotides of the invention may be composed of "natural" deoxyribonucleotides, ribonucleotides, or any combination thereof. That is, the 5' end of one native nucleotide and the 3' end of another native nucleotide may be covalently linked, as in natural systems, via a phosphodiester internucleoside linkage. These oligonucleotides may be prepared by art recognized methods which may be carried out manually or by an automated synthesizer. They also may be produced recombinantly by vectors.

In preferred embodiments, however, the antisense oligonucleotides of the invention also may include "modified" oligonucleotides. That is, the oligonucleotides may be modified in a number of ways which do not prevent them from hybridizing to their target but which enhance their stability or targeting or which otherwise enhance their therapeutic effectiveness.

The term "modified oligonucleotide" as used herein describes an oligonucleotide in which (1) at least two of its nucleotides are covalently linked via a synthetic internucleoside linkage (i.e., a linkage other than a phosphodiester linkage between the 5' end of one nucleotide and the 3' end of another nucleotide) and/or (2) a chemical group not normally associated with nucleic acids has been covalently attached to the oligonucleotide. Preferred synthetic internucleoside linkages are phosphorothioates, alkylphosphonates, phosphorodithioates, phosphate esters, alkylphosphonothioates, phosphoramidates, carbamates, carbonates, phosphate triesters, acetamidates, peptides, and carboxymethyl esters.

The term "modified oligonucleotide" also encompasses oligonucleotides with a covalently modified base and/or sugar. For example, modified oligonucleotides include oligonucleotides having backbone sugars which are covalently attached to low molecular weight organic groups other than a hydroxyl group at the 3' position and other than a phosphate group at the 5' position. Thus modified oligonucleotides may include a 2'-O-alkylated ribose group. In addition, modified oligonucleotides may include sugars such as arabinose instead of ribose. Modified oligonucleotides also can include base analogs such as C-5 propyne modified bases (Wagner et al., Nature Biotechnology 14:840-844, 1996). The present invention, thus, contemplates pharmaceutical preparations containing modified antisense molecules that are complementary to and hybridizable with, under physiological conditions, nucleic acids encoding LL-1 proteins, together with pharmaceutically acceptable carriers.

Antisense oligonucleotides may be administered as part of a pharmaceutical composition. Such a pharmaceutical composition may include the antisense oligonucleotides in combination with any standard physiologically and/or pharmaceutically acceptable carriers which are known in the art. The compositions should be sterile and contain a therapeutically effective amount of the antisense oligonucleotides in a unit of weight or volume suitable for administration to a patient. The term "pharmaceutically acceptable" means a non-toxic material that does not interfere with the effectiveness of the biological activity of the active ingredients. The term "physiologically acceptable" refers to a non-toxic material that is compatible with a biological system such as a cell, cell culture, tissue, or organism. The characteristics of the carrier will depend on the route of administration. Physiologically and pharmaceutically acceptable carriers include diluents, fillers, salts, buffers, stabilizers, solubilizers, and other materials which are well known in the art.

It will also be seen from the examples that the invention embraces the use of the LL-1 sequences in expression vectors, as well as to transfect host cells and cell lines, be these prokaryotic (e.g., E. coli), or eukaryotic (e.g., CHO cells, COS cells, yeast expression systems and recombinant baculovirus expression in insect cells). Especially useful are mammalian cells such as mouse, hamster, pig, goat, primate, etc. They may be of a wide variety of tissue types, including mast cells, fibroblasts, oocytes and lymphocytes, and they may be primary cells or cell lines. Specific examples include dendritic cells, U293 cells, peripheral blood leucocytes, bone marrow stem cells and embryonic stem cells. The expression vectors require that the pertinent sequence, i.e., those nucleic acids described supra, be operably linked to a promoter. As it is believed that a human HLA class I molecule presents a tumor rejection antigen derived from these genes, the expression vector may also include a nucleic acid sequence coding for the HLA molecule that presents any particular tumor rejection antigen derived from these genes and polypeptides. Alternatively, the nucleic acid sequence coding for such a HLA molecule can be contained within a separate expression vector. In a situation where the vector contains both coding sequences, the single vector can be used to transfect a cell which does not normally express either one. Where the coding sequences for the tumor rejection antigen precursor and the HLA molecule which presents it are contained on separate expression vectors, the expression vectors can be cotransfected. The tumor rejection antigen precursor coding sequence may be used alone, when, e.g. the host cell already expresses a HLA molecule which presents a LL-1 TRA. Of course, there is no limit on the particular host cell which can be used. As the vectors which contain the two coding sequences may be used in any antigen-presenting cells if desired, and the gene for tumor rejection antigen precursor can be used in host cells which do not express a HLA molecule which presents a LL-1 TRA. Further, cell-free transcription systems may be used in lieu of cells.

The skilled artisan can determine which HLA molecule binds to tumor rejection antigens derived from LL-1.1 and/or LL-1.2 tumor rejection antigen precursors by, e.g., experiments utilizing antibodies to block specifically individual HLA class I molecules. For example, antibodies which bind selectively to HLA-A2 will prevent efficient presentation of TRAs specifically presented by HLA-A2. Thus, if TRAs derived from LL-1.1 are presented by HLA-A2, then the inclusion of anti-HLA-A2 antibodies in an in vitro assay will block the presentation of the LL-1.1 TRA. An assay for determining the nature of the HLA molecule is found in U.S. patent application Ser. No. 08/530,569. Briefly, in determining the HLA molecule type, inhibition experiments were carried out where the production of tumor necrosis factor (TNF) by cytotoxic T lymphocyte (CTL) clone 263/17 was tested in the presence of monoclonal antibodies directed against HLA molecules or against CD4/CD8 accessory molecules. Four monoclonal antibodies were found to inhibit the production of TNF by CTL 263/17: monoclonal antibody W6/32, which is directed against all HLA class I molecules (Parham et al., J Immunol. 123:342, 1979), antibody B1.23.2 which recognizes HLA-B and C molecules (Rebai et al., Tissue Antigens 22:107, 1983), antibody ME-1 which specifically recognizes HLA-B7 (Ellis et al., Hum. Immunol. 5:49, 1982) and antibody B9.4.1 against CD8. No inhibition was found with antibodies directed against HLA Class II DR molecules (L243: Lampson et al., J Inmunol. 125:293, 1980), against HLA-A3 (GAPA 3: Berger et al., Hybridoma 1:87, 1982) or against CD4 (13B.8.82). The conclusion was that CTL 263/17 was of the CD8 type, and recognized an antigen presented by HLA-B7. Similar experiments using widely available anti-HLA antibodies can be performed to determine the nature of a HLA molecule.

As used herein, a "vector" may be any of a number of nucleic acids into which a desired sequence may be inserted by restriction and ligation for transport between different genetic environments or for expression in a host cell. Vectors are typically composed of DNA although RNA vectors are also available. Vectors include, but are not limited to, plasmids and phagemids. A cloning vector is one which is able to replicate in a host cell, and which is further characterized by one or more endonuclease restriction sites at which the vector may be cut in a determinable fashion and into which a desired DNA sequence may be ligated such that the new recombinant vector retains its ability to replicate in the host cell. In the case of plasmids, replication of the desired sequence may occur many times as the plasmid increases in copy number within the host bacterium or just a single time per host before the host reproduces by mitosis. In the case of phage, replication may occur actively during a lytic phase or passively during a lysogenic phase. An expression vector is one into which a desired DNA sequence may be inserted by restriction and ligation such that it is operably joined to regulatory sequences and may be expressed as an RNA transcript. Vectors may further contain one or more marker sequences suitable for use in the identification of cells which have or have not been transformed or transfected with the vector. Markers include, for example, genes encoding proteins which increase or decrease either resistance or sensitivity to antibiotics or other compounds, genes which encode enzymes whose activities are detectable by standard assays known in the art (e.g. .beta.-galactosidase or alkaline phosphatase), and genes which visibly affect the phenotype of transformed or transfected cells, hosts, colonies or plaques. Preferred vectors are those capable of autonomous replication and expression of the structural gene products present in the DNA segments to which they are operably joined.

As used herein, a coding sequence and regulatory sequences are said to be "operably" joined when they are covalently linked in such a way as to place the expression or transcription of the coding sequence under the influence or control of the regulatory sequences. If it is desired that the coding sequences be translated into a functional protein, two DNA sequences are said to be operably joined if induction of a promoter in the 5' regulatory sequences results in the transcription of the coding sequence and if the nature of the linkage between the two DNA sequences does not (1) result in the introduction of a frame-shift mutation, (2) interfere with the ability of the promoter region to direct the transcription of the coding sequences, or (3) interfere with the ability of the corresponding RNA transcript to be translated into a protein. Thus, a promoter region would be operably joined to a coding sequence if the promoter region were capable of effecting transcription of that DNA sequence such that the resulting transcript might be translated into the desired protein or polypeptide.

The precise nature of the regulatory sequences needed for gene expression may vary between species or cell types, but shall in general include, as necessary, 5' non-transcribing and 5' non-translating sequences involved with the initiation of transcription and translation respectively, such as a TATA box, capping sequence, CAAT sequence, and the like. Especially, such 5' non-transcribing regulatory sequences will include a promoter region which includes a promoter sequence for transcriptional control of the operably joined gene. Regulatory sequences may also include enhancer sequences or upstream activator sequences as desired. The vectors of the invention may optionally include 5' leader or signal sequences, 5' or 3'. The choice and design of an appropriate vector is within the ability and discretion of one of ordinary skill in the art.

Expression vectors containing all the necessary elements for expression are commercially available and known to those skilled in the art. See Molecular Cloning: A Laboratory Manual, J. Sambrook, et al., eds., Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989. Cells are genetically engineered by the introduction into the cells of heterologous DNA (RNA) encoding the LL-1 tumor specific polypeptide or fragment or variant thereof. That heterologous DNA (RNA) is placed under operable control of transcriptional elements to permit the expression of the heterologous DNA in the host cell.

Preferred systems for mRNA expression in mammalian cells are those such as pRc/CMV (available from Invitrogen, San Diego, Calif.) that contain a selectable marker such as a gene that confers G418 resistance (which facilitates the selection of stably transfected cell lines) and the human cytomegalovirus (CMV) enhancer-promoter sequences. Additionally, suitable for expression in primate or canine cell lines is the pCEP4 vector (Invitrogen), which contains an Epstein Barr virus (EBV) origin of replication, facilitating the maintenance of plasmid as a multicopy extrachromosomal element. Another expression vector is the pEF-BOS plasmid containing the promoter of polypeptide Elongation Factor 1.alpha., which stimulates efficiently transcription in vitro. The plasmid is described by Mishizuma and Nagata (Nuc. Acids Res. 18:5322, 1990), and its use in transfection experiments is disclosed by, for example, Demoulin (Mol. Cell. Biol. 16:4710-4716, 1996). Still another preferred expression vector is an adenovirus, described by Stratford-Perricaudet, which is defective for E1 and E3 proteins (J. Clin. Invest. 90:626-630, 1992). The use of the adenovirus as an Adeno.P1A recombinant is disclosed by Warnier et al., in intradermal ionjection in mice for immunization against P1A (Int. J. Cancer, 67:303-310, 1996).

The invention also embraces so-called expression kits, which allow the artisan to prepare a desired expression vector or vectors. Such expression kits include at least separate portions of each of the previously discussed coding sequences. Other components may be added, as desired, as long as the previously mentioned sequences, which are required, are included.

The invention also involves agents which bind to LL-1 polypeptides and in certain embodiments preferably to unique fragments of the LL-1 polypeptides. Such binding partners can be used in screening assays to detect the presence or absence of the LL-1 polypeptide and in purification protocols to isolate LL-1 polypeptides. Likewise, such binding partners can be used to selectively target drugs, toxins or other molecules to tumor cells which present LL-1 tumor specific polypeptides. In this manner, tumor cells which express LL-1 polypeptides can be treated with cytotoxic compounds.

The invention, therefore, involves antibodies or fragments of antibodies having the ability to selectively bind to LL-1 polypeptides, and preferably to unique fragments thereof. Antibodies include polyclonal and monoclonal antibodies, prepared according to conventional methodology.

Significantly, as is well-known in the art, only a small portion of an antibody molecule, the paratope, is involved in the binding of the antibody to its epitope (see, in general, Clark, W. R. (1986) The Experimental Foundations of Modern Immunology Wiley & Sons, Inc., New York; Roitt, I. (1991) Essential Immunology, 7th Ed., Blackwell Scientific Publications, Oxford). The pFc' and Fc regions, for example, are effectors of the complement cascade but are not involved in antigen binding. An antibody from which the pFc' region has been enzymatically cleaved, or which has been produced without the pFc' region, designated an F(ab').sub.2 fragment, retains both of the antigen binding sites of an intact antibody. Similarly, an antibody from which the Fc region has been enzymatically cleaved, or which has been produced without the Fc region, designated an Fab fragment, retains one of the antigen binding sites of an intact antibody molecule. Proceeding further, Fab fragments consist of a covalently bound antibody light chain and a portion of the antibody heavy chain denoted Fd. The Fd fragments are the major determinant of antibody specificity (a single Fd fragment may be associated with up to ten different light chains without altering antibody specificity) and Fd fragments retain epitope-binding ability in isolation.

Within the antigen-binding portion of an antibody, as is well-known in the art, there are complementarity determining regions (CDRs), which directly interact with the epitope of the antigen, and framework regions (FRs), which maintain the tertiary structure of the paratope (see, in general, Clark, 1986; Roitt, 1991). In both the heavy chain Fd fragment and the light chain of IgG immunoglobulins, there are four framework regions (FR1 through FR4) separated respectively by three complementarity determining regions (CDR1 through CDR3). The CDRs, and in particular the CDR3 regions, and more particularly the heavy chain CDR3, are largely responsible for antibody specificity.

It is now well-established in the art that the non-CDR regions of a mammalian antibody may be replaced with similar regions of conspecific or heterospecific antibodies while retaining the epitopic specificity of the original antibody. This is most clearly manifested in the development and use of "humanized" antibodies in which non-human CDRs are covalently joined to human FR and/or Fc/pFc' regions to produce a functional antibody. Thus, for example, PCT International Publication Number WO 92/04381 teaches the production and use of humanized murine RSV antibodies in which at least a portion of the murine FR regions have been replaced by FR regions of human origin. Such antibodies, including fragments of intact antibodies with antigen-binding ability, are often referred to as "chimeric" antibodies.

Thus, as will be apparent to one of ordinary skill in the art, the present invention also provides for F(ab').sub.2, Fab, Fv and Fd fragments; chimeric antibodies in which the Fc and/or FR and/or CDR1 and/or CDR2 and/or light chain CDR3 regions have been replaced by homologous human or non-human sequences; chimeric F(ab').sub.2 fragment antibodies in which the FR and/or CDR1 and/or CDR2 and/or light chain CDR3 regions have been replaced by homologous human or non-human sequences; chimeric Fab fragment antibodies in which the FR and/or CDR1 and/or CDR2 and/or light chain CDR3 regions have been replaced by homologous human or non-human sequences; and chimeric Fd fragment antibodies in which the FR and/or CDR1 and/or CDR2 regions have been replaced by homologous human or non-human sequences. The present invention also includes so-called single chain antibodies. Thus, the invention involves polypeptides of numerous size and type that bind specifically to LL-1 polypeptides. These polypeptides may be derived also from sources other than antibody technology. For example, such polypeptide binding agents can be provided by degenerate peptide libraries which can be readily prepared in solution, in immobilized form or as phage display libraries. Combinatorial libraries also can be synthesized of peptides containing one or more amino acids. Libraries further can be synthesized of peptoids and non-peptide synthetic moieties.

Phage display can be particularly effective in identifying binding peptides useful according to the invention. Briefly, one prepares a phage library (using e.g. m13, fd, or lambda phage), displaying inserts from 4 to about 80 amino acid residues using conventional procedures. The inserts may represent a completely degenerate or biased array. One then can select phage-bearing inserts which bind to a LL-1 polypeptide. This process can be repeated through several cycles of reselection of phage that bind to the LL-1 polypeptide. Repeated rounds lead to enrichment of phage bearing particular sequences. DNA sequence analysis can be conducted to identify the sequences of the expressed polypeptides. The minimal linear portion of the sequence that binds to the LL-1 polypeptide can be determined. One can repeat the procedure using a biased library containing inserts containing part or all of the minimal linear portion plus one or more additional degenerate residues upstream or downstream thereof. Thus, the LL-1 polypeptides of the invention can be used to screen peptide libraries, including phage display libraries, to identify and select peptide binding partners of the LL-1 polypeptides of the invention. Such molecules can be used, as described, for screening assays, for diagnostic assays, for purification protocols or for targeting drugs, toxins and/or labeling agents (e.g. radioisotopes, fluorescent molecules, etc.) to cells which present LL-1 polypeptides on the cell surface. Such binding agent molecules can also be prepared to bind complexes of an LL-1 polypeptide and an HLA molecule by selecting the binding agent using such complexes. Drug molecules that would disable or destroy tumor cells which express such complexes or LL-1 polypeptides are known to those skilled in the art and are commercially available. For example, the immunotoxin art provides examples of toxins which are effective when delivered to a cell by an antibody or fragment thereof. Examples of toxins include ribosome-damaging toxins derived from plants or bacterial such as ricin, abrin, saporin, Pseudomonas endotoxin, diphtheria toxin, A chain toxins, blocked ricin, etc.

The invention as described herein has a number of uses, some of which are described herein. First, the invention permits the artisan to diagnose a disorder characterized by expression of the TRAP. These methods involve determining expression of the TRAP gene, and/or TRAs derived therefrom. In the former situation, such determinations can be carried out via any standard nucleic acid determination assay, including the polymerase chain reaction as exemplified in the examples below, or assaying with labeled hybridization probes.

The isolation of the TRAP gene also makes it possible to isolate the TRAP molecule itself, especially TRAP molecules containing the amino acid sequences coded for by SEQ ID NO:4, SEQ ID NO:6 or SEQ ID NO:8. A variety of methodologies well-known to the skilled practitioner can be utilized to obtain isolated TRAP molecules. The protein may be purified from cells which naturally produce the protein. Alternatively, an expression vector may be introduced into cells to cause production of the protein. In another method, mRNA transcripts may be microinjected or otherwise introduced into cells to cause production of the encoded protein. Translation of mRNA in cell-free extracts such as the reticulocyte lysate system also may be used to produce protein. Those skilled in the art also can readily follow known methods for isolating proteins in order to obtain isolated TRAPs. These include, but are not limited to, immunochromotography, HPLC, size-exclusion chromatography, ion-exchange chromatography and immune-affinity chromatography.

These isolated molecules when processed and presented as the TRA, or as complexes of TRA and HLA, such as HLA-A1, HLA-A2, or HLA-B7, may be combined with materials such as adjuvants to produce vaccines useful in treating disorders characterized by expression of the TRAP molecule. When "disorder" is used herein, it refers to any pathological condition where the tumor rejection antigen precursor is expressed. An example of such a disorder is cancer, melanoma in particular.

In addition, vaccines can be prepared from cells which present the TRA/HLA complexes on their surface, such as non-proliferative cancer cells, non-proliferative transfectants, etcetera. In all cases where cells are used as a vaccine, these can be cells transfected with coding sequences for one or both of the components necessary to provoke a CTL response, or be cells which already express both molecules without the need for transfection.

Therapeutic approaches based upon the disclosure are premised on a response by a subject's immune system, leading to lysis of TRA presenting cells, such as HLA-B7 cells. One such approach is the administration of autologous CTLs specific to the complex to a subject with abnormal cells of the phenotype at issue. It is within the skill of the artisan to develop such CTLs in vitro. Generally, a sample of cells taken from a subject, such as blood cells, are contacted with a cell presenting the complex and capable of provoking CTLs to proliferate. The target cell can be a transfectant, such as a COS cell of the type described supra. These transfectants present the desired complex of their surface and, when combined with a CTL of interest, stimulate its proliferation. COS cells, such as those used herein are widely available, as are other suitable host cells. Specific production of a CTL is well known to one of ordinary skill in the art. The clonally expanded autologous CTLs then are administered to the subject. Other CTLs specific to LL-1.1 and/or LL-1.2 may be isolated and administered by similar methods.

To detail a therapeutic methodology, referred to as adoptive transfer (Greenberg, J. Immunol. 136(5): 1917, 1986; Riddel et al., Science 257: 238, 1992; Lynch et al, Eur. J. Immunol. 21: 1403-1410, 1991; Kast et al., Cell 59: 603-614, 1989), cells presenting the desired complex are combined with CTLs leading to proliferation of the CTLs specific thereto. The proliferated CTLs are then administered to a subject with a cellular abnormality which is characterized by certain of the abnormal cells presenting the particular complex. The CTLs then lyse the abnormal cells, thereby achieving the desired therapeutic goal.

The foregoing therapy assumes that at least some of the subject's abnormal cells present the relevant HLA/TRA complex. This can be determined very easily, as the art is very familiar with methods for identifying cells which present a particular HLA molecule, as well as how to identify cells expressing DNA of the pertinent sequences, in this case a LL-1 sequence. Once cells presenting the relevant complex are identified via the foregoing screening methodology, they can be combined with a sample from a patient, where the sample contains CTLs. If the complex presenting cells are lysed by the mixed CTL sample, then it can be assumed that a LL-1 derived TRA is being presented, and the subject is an appropriate candidate for the therapeutic approaches set forth supra.

Adoptive transfer is not the only form of therapy that is available in accordance with the invention. CTLs can also be provoked in vivo, using a number of approaches. One approach is the use of non-proliferative cells expressing the complex. The cells used in this approach may be those that normally express the complex, such as irradiated tumor cells or cells transfected with one or both of the genes necessary for presentation of the complex. Chen et al., Proc. Natl. Acad. Sci. USA 88: 110-114 (1991) exemplifies this approach, showing the use of transfected cells expressing HPV E7 peptides in a therapeutic regime. Various cell types may be used. Similarly, vectors carrying one or both of the genes of interest may be used. Viral or bacterial vectors are especially preferred. For example, nucleic acids which encode a LL-1 TRA may be operably linked to promoter and enhancer sequences which direct expression of the LL-1 TRA in certain tissues or cell types. The nucleic acid may be incorporated into an expression vector. Expression vectors may be unmodified extrachromosomal nucleic acids, plasmids or viral genomes constructed or modified to enable insertion of exogenous nucleic acids, such as those encoding LL-1 TRAs. Nucleic acids encoding a LL-1 TRA also may be inserted into a retroviral genome, thereby facilitating integration of the nucleic acid into the genome of the target tissue or cell type. In these systems, the gene of interest is carried by a microorganism, e.g., a Vaccinia virus, retrovirus or the bacteria BCG, and the materials defacto "infect" host cells. The cells which result present the complex of interest, and are recognized by autologous CTLs, which then proliferate.

A similar effect can be achieved by combining a TRAP or a stimulatory fragment thereof with an adjuvant to facilitate incorporation into HLA presenting cells in vivo. The TRAP is processed to yield the peptide partner of the HLA molecule while the TRA is presented without the need for further processing. Generally, subjects can receive an intradermal injection of an effective amount of LL-1 TRAP, and/or TRAs derived therefrom. Initial doses can be followed by booster doses, following immunization protocols standard in the art.

As part of the immunization protocols, substances which potentiate the immune response may be administered with nucleic acid or peptide components of a cancer vaccine. Such immune response potentiating compound may be classified as either adjuvants or cytokines. Adjuvants may enhance the immunological response by providing a reservoir of antigen (extracellularly or within macrophages), activating macrophages and stimulating specific sets of lymphocytes. Adjuvants of many kinds are well known in the art; specific examples include MPL (SmithKline Beecham), a congener obtained after purification and acid hydrolysis of Salmonella minnesota Re 595 lipopolysaccharide, QS21 (SmithKline Beecham), a pure QA-21 saponin purified from Quillja saponaria extract, and various water-in-oil emulsions prepared from biodegradable oils such as squalene and/or tocopherol. Cytokines are also useful in vaccination protocols as a result of lymphocyte stimulatory properties. Many cytokines useful for such purposes will be known to one of ordinary skill in the art, including interleukin-12 (IL-12) which has been shown to enhance the protective effects of vaccines (Science 268: 1432-1434, 1995).

When administered, the therapeutic compositions of the present invention are administered in pharmaceutically acceptable preparations. Such preparations may routinely contain pharmaceutically acceptable concentrations of salt, buffering agents, preservatives, compatible carriers, supplementary immune potentiating agents such as adjuvants and cytokines and optionally other therapeutic agents.

The therapeutics of the invention can be administered by any conventional route, including injection or by gradual infusion over time. The administration may, for example, be oral, intravenous, intraperitoneal, intramuscular, intracavity, subcutaneous, or transdermal. When antibodies are used therapeutically, a preferred route of administration is by pulmonary aerosol. Techniques for preparing aerosol delivery systems containing antibodies are well known to those of skill in the art. Generally, such systems should utilize components which will not significantly impair the biological properties of the antibodies, such as the paratope binding capacity (see, for example, Sciarra and Cutie, "Aerosols," in Remington's Pharmaceutical Sciences, 18th edition, 1990, pp 1694-1712). Those of skill in the art can readily determine the various parameters and conditions for producing antibody aerosols without resort to undue experimentation. When using antisense preparations of the invention, slow intravenous administration is preferred.

Preparations for parenteral administration include sterile aqueous or non-aqueous solutions, suspensions, and emulsions. Examples of non-aqueous solvents are propylene glycol, polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such as ethyl oleate. Aqueous carriers include water, alcoholic/aqueous solutions, emulsions or suspensions, including saline and buffered media. Parenteral vehicles include sodium chloride solution, Ringer's dextrose, dextrose and sodium chloride, lactated Ringer's or fixed oils. Intravenous vehicles include fluid and nutrient replenishers, electrolyte replenishers (such as those based on Ringer's dextrose), and the like. Preservatives and other additives may also be present such as, for example, antimicrobials, anti-oxidants, chelating agents, and inert gases and the like.

The invention also contemplates gene therapy. The procedure for performing ex vivo gene therapy is outlined in U.S. Pat. No. 5,399,346 and in exhibits submitted in the file history of that patent, all of which are publicly available documents. In general, it involves introduction in vitro of a functional copy of a gene into a cell(s) of a subject which contains a defective copy of the gene, and returning the genetically engineered cell(s) to the subject. The functional copy of the gene is under operable control of regulatory elements which permit expression of the gene in the genetically engineered cell(s). Numerous transfection and transduction techniques as well as appropriate expression vectors are well known to those of ordinary skill in the art, some of which are described in PCT application WO95/00654. In vivo gene therapy using vectors such as adenovirus also is contemplated according to the invention.

The preparations of the invention are administered in effective amounts. An effective amount is that amount of a pharmaceutical preparation that alone, or together with further doses, stimulates the desired response. In the case of treating cancer, the desired response is inhibiting the progression of the cancer. This may involve only slowing the progression of the disease temporarily, although more preferably, it involves halting the progression of the disease permanently. This can be monitored by routine methods or can be monitored according to diagnostic methods of the invention discussed herein.

Where it is desired to stimulate an immune response using a therapeutic composition of the invention, this may involve the stimulation of a humoral antibody response resulting in an increase in antibody titer in serum, a clonal expansion of cytotoxic lymphocytes, or some other desirable immunologic response. It is believed that doses of immunogens ranging from one nanogram/kilogram to 100 milligrams/kilogram, depending upon the mode of administration, would be effective. The preferred range is believed to be between 500 nanograms and 500 micrograms per kilogram. The absolute amount will depend upon a variety of factors, including the material selected for administration, whether the administration is in single or multiple doses, and individual patient parameters including age, physical condition, size, weight, and the stage of the disease. These factors are well known to those of ordinary skill in the art and can be addressed with no more than routine experimentation.

EXAMPLES Example 1

Isolation of a sequence specifically expressed by melanoma cell line LB373-MEL.

Specific cDNA fragments of melanoma cell line LB373-MEL were enriched by subtraction of cDNA fragments found in LB879 normal skin cells, according to the representational difference analysis method (RDA) described for DNA by Hubank and Schatz (NucL. Acids Res. 22: 5640-5648, 1994).

Briefly, cellular cDNAs obtained by reverse transcription of poly-A RNA of both LB373-MEL cells and LB879 normal skin cells primed with oligo-dT were digested by restriction enzyme DpnII. The DpnII fragments of cDNAs of each origin (LB373-MEL or LB879 normal skin cells) were ligated with the same set of adapters, divided in several groups and separately amplified by PCR. The PCR products originating from the same sample were pooled and digested again by DpnII. The DpnII fragments from the LB373-MEL cell line (the "tester" cDNA) were ligated with a new adapter set and hybridized with an excess of DpnII DNA fragments derived from the normal skin (the "driver" cDNA). The hybridization mixture was then submitted to PCR amplification using the new adaptor set. Only those DpnII fragments derived from the tester DNA but not present in the driver DNA were expected to be amplified exponentially because they carry primer-complementary sequences at both ends. These tester-specific amplification products were then cloned.

Thirty melanoma-cell specific cDNA clones obtained by this enrichment procedure were sequenced and compared with sequences compiled in databases. Some of the cDNA clones corresponded to known genes with ubiquitous expression, some of the cDNA clones corresponded to tumor specific genes (MAGE-3, MAGE-10, PRAME--formerly known as DAGE) and some of the cDNA clones were unknown. Among the six unknown clones, one melanoma-specific cDNA, named LL-1 (SEQ ID NO:1), appeared to have tumor specific expression as determined by RT-PCR. The LL-1 clone was sequenced and determined to be 217 base pairs long.

To determine the pattern of expression of LL-1, RT-PCR of samples from various tumor and normal tissues was performed. Total RNA of normal tissue and tumor samples was converted to cDNA. An amount of DNA corresponding to 50 ng of total RNA was then amplified by thirty-two cycles (denature at 94.degree. C. for 60 seconds, anneal at 58.degree. C. for 60 seconds, and extension at 72.degree. C. for 90 seconds) followed by a final extension step of 10 minutes at 73.degree. C., using primers SL25 (SEQ ID NO:2) and BLE56 (SEQ ID NO:3), described in Table 1, with 0.5U DYNAZYME.TM. in a buffer, provided by the supplier (Finnzyme, Finland), containing 10 mM TRIS (pH8.8), 50 mM KCl and 1.5 mM MgCl.sub.2. The total volume of the reaction mixture was 25 .mu.l. Ten microlitres were separated by electrophoresis on agarose gels. A fragment of the expected size was generated from cDNA of the parental LB373-MEL cell line and testis. No PCR product was obtained from normal skin cDNA starting materials, nor from a panel of cDNAs from eight other normal tissues. A faint signal was observed in one normal uterus sample.

Example 2

Isolation of complete LL-1 cDNA clones from melanoma cell line LB373-MEL.

To identify full-length LL-1 cDNA clones, we used the 137 bp PCR product amplified with primers SL25 (SEQ ID NO:2) and BLE56 (SEQ ID NO:3), as a probe to screen 75,000 clones of a cDNA library of LB373-MEL cells. The cDNA library was hybridized with the radiolabeled probe and washed according to standard protocols using 0.4X SSC at 63.degree. C. Th reduced stringency washing conditions were selected to maximize detection of related cDNA clones. DNA from 25 colonies hybridized to the LL-1 probe (0.03% of the total number of colonies), some of which DNAs were isolated and sequenced.

Two clones had sequence identity with the cDNA clone originally isolated from LB373-MEL cells (see Example 1, "clone 1" in FIG. 1). One of the clones contained a sequence that was identical to the 217 base pairs of clone 1. This cDNA, referred to as clone 2 (SEQ ID NO:4, FIG. 1) was determined to be about 993 nucleotides long excluding the poly A tail. Another hybridizing clone contained a sequence identical to the last 82 base pairs of clone 1. This second cDNA, referred to as clone 3 (SEQ ID NO:8, FIG. 1), was determined to be about 744 nucleotides in length excluding the poly A tail. The sequences of clone 2 and clone 3 were 94% identical. Most of the differences in nucleotide sequence between the two clones were located in the central region of the cDNAs. The gene encoding cDNA clone 2 (SEQ ID NO:4) is referred to hereinafter as LL-1.1; the gene encoding cDNA clone 3 (SEQ ID NO:8) is referred to hereinafter as LL-1.2. Analysis of the genomic fragment corresponding to gene LL-1.2 indicated that the region encompassing nucleotides 469-697 of clone 2 in FIG. 1 is an intron which was not spliced out during the formation of the LL-1.1 clone 2 mRNA.

In all clones, the longest open reading frame (ORF) is believed to begin at the same first ATG in a good transcription initiating context (gccATGc) according to Kozak (J. Biol. Chem. 266: 19867-19870, 1991). The size of the product of translation of completely spliced sequences of clone 4 (SEQ ID NO:6) and clone 3 (SEQ ID NO:8) derived from genes LL-1.1 and LL-1.2 respectively are believed to be in good agreement with the corresponding ORF (about 19-20 kD for 180 amino acids). The sequence of the putative protein encoded by LL-1.2 is referred to as SEQ ID NO:9. The translation product of partially spliced messenger RNAs from LL-1.1 has an apparent mass of about 25 kDa as determined by SDS-PAGE analysis of protein prepared by in vitro translation.

Example 3

Expression of LL-1 genes in normal tissue and tumor samples.

To determine the tissue specificity of expression of both LL-1 clones by PCR, we used two primers which correspond to sequences where LL-1.1 and LL-1.2 are identical. Primers BLE70 (SEQ ID NO:10) and BLE71 (SEQ ID NO:11), encompassing the main ORF, repeatedly provided two signals of nearly 600 base pairs and 850 base pairs which correspond to the fragment sizes of 614 and 842 base pairs expected for the spliced (LL-1.2) and unspliced (LL-1.1) LL-1 cDNAs.

A. Normal Tissues

RT-PCR with primers BLE70 and BLE71 was performed as described in Example 1, except that 20 cycles (denature at 95.degree. C. for 30 seconds, anneal at 60.degree. C. for 1 minute and extension at 70.degree. C. for 3 minutes) were performed followed by 10 cycles with an extension time of 10 minutes at 70.degree. C. and 15 minutes at 72.degree. C. for the final extension. In addition, the buffer used was 50 mM Tris-HCl, pH9.2 (25.degree. C.), 16 mM (NH.sub.4).sub.2 SO.sub.4, 2.25 mM MgCl.sub.2, 2% (v/v) DMSO and 0.1% (v/v) TWEEN.TM. 20 (buffer 3 from Expand Long template of Boehringer). Analysis of amplification products was performed by agarose gel electrophoresis of 10 .mu.l of the 25 .mu.l total volume. Samples of various tissue origins were tested again with primers BLE71 and either BLE72 (SEQ ID NO:12, specific for LL-1.1) or BLE73 (SEQ ID NO:13, specific for LL-1.2), for 30 cycles with an annealing step at 62.degree. C. for 1 minute and an extension step at 72.degree. C. for 2 minutes using 0.5U Dynazyme according to manufacturers instructions. The results are indicated in Table I, with increasing numbers of plus signs indicating higher levels of RNA expression in a particular tissue. Samples which did not yield PCR amplification products were retested by using the residual 15 .mu.l of negative PCR reactions in a reamplification reaction of 5 or 6 cycles. The results of any reamplification reactions are reported as -, (.+-.) or (+) in Table I.

No signal was observed on amplification of genomic DNA using standard PCR conditions with primers BLE70 and BLE71 after 30 and 33 cycles with an annealing step of 60.degree. C. Twenty-five normal tissues of 17 different histological types were analyzed with the primers indicated above. After 30 cycles, only the testis and one out of two uterus samples were positive (Table I). Investigating the expression of LL-1 subtypes, it was observed that both uterus samples expressed a low level of LL-1.1 mRNA, but remained clearly negative for LL-1.2 mRNA. Moreover, three endometrium and two myometrium RNA samples remained negative for both genes. On the other hand, both testis samples showed LL-1.1 and LL-1.2 expression at a level similar to expression in LB373-MEL cells. Control amplifications were performed using .beta.-actin specific primers. All of these cDNA samples strongly expressed .beta.-actin as judged by the signal obtained after a PCR amplification for 21 cycles. From a panel of 6 other samples of normal tissues already typed negative with primers SL25 and BLE56, all were also found negative using LL-1.1 specific BLE72-BLE71 primers. Three of these samples, however, were found positive for LL-1.2 expression, but below the threshold of 1% of the expression found in LB373-MEL cells. Those normal samples which exhibited a low level of LL-1.2 expression are the skin, the lung, and full-term placenta (Table I).

                                    TABLE I                                 
     __________________________________________________________________________
     Expression of genes LL-1.1 & LL-1.2 in normal tissue (RT-PCR)             
                  Detection of LL-1                                            
                                 Detection of LL-1.1                           
                                                Detect. LL-1.2                 
                  partially spliced                                            
                          spliced                                              
                                 partially spliced                             
                                         spliced                               
                                                spliced                        
                  BLE70/BLE71                                                  
                          BLE70/BLE71                                          
                                 BLE72/BLE71                                   
                                         BLE72/BLE71                           
                                                BLE73/BLE71                    
     Tissue sample                                                             
            code  842 nt  614 nt 627 nt  399 nt 274 nt                         
     __________________________________________________________________________
     brain  JNO10 -       -                                                    
     retina SH8-5 -       -                                                    
     PBL    LB33  -       -                                                    
     skin   LB243 -       -      -       -      (.+-.)                         
     breast LB520 -       -                                                    
     breast LB673 -       -                                                    
     heart  LB1266                                                             
                  -       -      -       -      -                              
     lung   LB264 -       -      -       -      (.+-.)                         
     liver  LB898 -       -      -       -      -                              
     kidney BA25  -       -                                                    
     kidney BA4                  -       -      -                              
     adrenals                                                                  
            LB535 -       -      -       -      -                              
     adrenals                                                                  
            LB538 -       (.+-.)                                               
     testis LB882 +       ++     +       +++    +++++                          
     testis HM31 1A5                                                           
                  .+-.    +      +       ++     ++++                           
     prostate                                                                  
            CLO9  -       -                                                    
     prostate                                                                  
            HM88  -       -      -       -      -                              
     ovary  LB1266                                                             
                  -       -      -       -      -                              
     placenta                                                                  
            LB695                -       -      (.+-.)                         
     placenta                                                                  
            LB692 -       (+)                                                  
     uterus LB181 +       +      ++      ++     -                              
     uterus LB1022                                                             
                  -       (+)    .+-.    .+-.   -                              
     endometrium                                                               
            LB1031                                                             
                  -       -      -       -      -                              
     endometrium                                                               
            LB1053                                                             
                  -       -      -       -      -                              
     endometrium                                                               
            LB1081                                                             
                  -       -      -       -      -                              
     myometrium                                                                
            LB1031                                                             
                  -       -      -       -      -                              
     myometrium                                                                
            LB1032                                                             
                  -       -      -       -      -                              
     controls                                                                  
     LB373-MEL                                                                 
            100%  +       ++                                                   
     LB373-MEL                                                                 
            33%                  +       ++     ++++                           
     LB373-MEL                                                                 
            3.7%                 .+-.    +      +++                            
     __________________________________________________________________________

B. Tumor Samples

Total RNA of tumor samples of the origins indicated in Table II was used in RT-PCR reactions with LL-1 specific primers (BLE70 and BLE71) as described for normal tissues. LL-1 positive samples were retested by amplification using primers BLE71 and either BLE72 (specific for LL-1.1 transcript) or BLE73 (specific for LL-1.2 transcripts) for 30 cycles as described above with an annealing step at 62.degree. C. for one minute and an extension step at 72.degree. C. for 2 minutes using 0.5U DYNAZYME.TM. according to the manufacturers' instructions. The amount of RNA and efficiency of cDNA synthesis previously were controlled for by parallel PCR reactions with a set of .beta.-actin specific primers.

Since two normal uterus samples showed a basal expression of gene LL-1.1, we stressed our attention on tumor samples derived from this organ. Surprisingly, among 8 tumors tested (4 tumors of the cervix and 4 tumors of the myometrium), none appear to be positive for LL-1 with primers BLE70-BLE71after 30 and 33 cycles, in spite of a confirmed good .beta.-actin expression (Table II).

As indicated in Table II, no expression of genes of the LL-1 family was detected in colon, kidney, thyroid and brain cancers, nor in leukemias, as assessed by RT-PCR with primers BLE70-BLE7 1. Expression of LL-1 family genes in breast cancer was not rare, but was faint. The expression in melanomas, NSCLC, sarcomas, head and neck, prostate and bladder tumors was stronger, relatively more frequent and correlated with the expression of other genes known to encode antigenic peptides as shown in Table III. The same cDNA samples were tested for expression of a panel of TRAPs including MAGE-1, -2, -3, -4, -6 and -12, BAGE, PRAME, GAGE-1&2, -3, -4, -5 and -6, and RAGE. Based on results demonstrating a link between the demethylation of the MAGE-1 promoter and MAGE-1 gene activation, it was determined that the expression of LL-1 paralleled that of other TRAPs. A correlation of the expression of LL-1 with activation of MAGE-1 or other TRAPS is given in Table III.

LL-1.1 and LL-1.2 each accounted for 75% of LL-1 positive tumor samples. Thus, as demonstrated in Table II both LL-1 family genes were expressed independently of each other. Sarcomas of various histological types preferentially expressed high levels of LL-1.2 RNA, independent of the expression of known tumor specific antigens encoding genes

Using the cDNAs of eight samples which express LL-1.1 and LL-1.2 genes simultaneously or LL-1.2 alone as templates for RT-PCR reactions, we amplified LL-1.2 sequence with primers BLE73 and BLE71 for use as starting material in sequencing reactions. Primer BLE56 (SEQ ID NO:3) was used to prime sequencing reactions. We observed a unique sequence that was identical to the corresponding region of clone 3 derived from LB373-MEL cells, demonstrating both the specificity of the PCR reactions and the absence of polymorphism in this region of the LL-1.2 gene (see FIG. 1).

                                    TABLE II                                
     __________________________________________________________________________
     Expression of genes LL-1.1 & LL-1.2 in tumors (RT-PCR)                    
             LL-1 Positive   LL-1.1 LL-1.2                                     
     Sample  Number                                                            
                  BLE70-BLE71                                                  
                         (%) BLE72-BLE71                                       
                                    BLE73-BLE71                                
     __________________________________________________________________________
     COLON   9    0          ND     ND                                         
     LEUKEMIA                                                                  
             17   0       0% ND     ND                                         
     B-LYMPHOMA                                                                
             6    1          1      1                                          
     MELANOMA                                                                  
             21   7      33% 6      5                                          
     HEAD & NECK                                                               
             15   4      27% 4      3                                          
     LUNG    15   5      33% 5      3                                          
     KIDNEY  10   0          ND     ND                                         
     SARCOMA 19   9      47% 4/8    6                                          
     BREAST  12   4          2      3                                          
     UTERUS  8    0          0      0                                          
     cervix       0/4                                                          
     corpus       0/4                                                          
     BLADDER 15   5      33% 4      5                                          
     BRAIN   4    0          ND     ND                                         
     THYROID 3    0          ND     ND                                         
     PROSTATE                                                                  
             12   4          3      3                                          
     TOTAL   166                                                               
     Positive     39         29/38  29/39                                      
     Positive (%)        23% 76%    74%                                        
     __________________________________________________________________________

                                    TABLE III                               
     __________________________________________________________________________
     Expression of gene LL-1 by RT-PCR BLE70-BLE71 (30 cycles)                 
     __________________________________________________________________________
                              among tumor samples                              
                              .gtoreq.1 other TRAP pos.                        
     Sample  Number                                                            
                 Positive                                                      
                     (%)                                                       
                        Mage-1 pos.                                            
                              (MAGE-1 neg.)                                    
                                        TRAPs neg.                             
     __________________________________________________________________________
     COLONS  9   0      --    0/5       0/4                                    
     LEUKEMIAS                                                                 
             17  0    0%                                                       
                        0/1   0/9       0/7                                    
     B-LYMPHOMAS                                                               
             6   1      --    1/3       0/3                                    
     MELANOMAS                                                                 
             21  7   33%                                                       
                        3/5   4/9       0/7                                    
     HEAD & NECK                                                               
             15  4   27%                                                       
                        3/4   1/6       0/5                                    
     LUNG    15  5   33%                                                       
                        3/5   2/5       0/5                                    
     nsclc (AC)  3/8    2/4   1/2       0/2                                    
     nsclc (epid.)                                                             
                 1/6    --    1/3       0/3                                    
     other       1/1    1/1   --        --                                     
     KIDNEY  10  0      0/2   0/2       0/6                                    
     SARCOMAS                                                                  
             19  9   47%                                                       
                        2/2   4/8       3/9                                    
     BREAST  12  4   33%                                                       
                        2.+-./5                                                
                              2.+-./3   0/4                                    
     UTERUS  8   0      --    0/3       0/5                                    
     cervix      0/4    --    0/2       0/2                                    
     corpus (benign)                                                           
                 0/4    --    0/1       0/3                                    
     BLADDER 15  5   33%                                                       
                        3/4   1/5       1/6                                    
     BRAIN   4   0   -- --    0/2       0/2                                    
     THYROID 3   0   -- --    0/2       0/1                                    
     PROSTATE                                                                  
             12  4   33%                                                       
                        1/3   1/1       2/8                                    
     __________________________________________________________________________
                              MAGE-1 neg.                                      
                        Mage-1 pos.                                            
                              .gtoreq.1 other TRAP pos.                        
                                        TRAPs neg.                             
     __________________________________________________________________________
     TOTAL   166        31    63        72                                     
     % of all samples   19%   38%       43%                                    
     Positive    39     17    16         6                                     
     Positive (%)    23%                                                       
                        53%   23%       11%                                    
     __________________________________________________________________________

Example 4

Northern blot on total RNA.

Various tumor cell lines and samples positive for LL-1 by RT-PCR were assayed by Northern blotting in order to determine the length of the messenger RNA. A normal lung sample was used as a negative control.

Total RNA (10 .mu.g) from normal testis, normal and tumoral lung from the same patient (LB 264), from two melanoma cell lines (LB373 and LB24) and one sarcoma cell line (LB188) were separated by electrophoresis in a denaturating 1.3% agarose gel, blotted overnight against Hybond C filters (Amersham), using the turbo-blotting system from Schleicher & Schuell (Keene, N.H.). RNA was fixed on the filter by UV autocrosslinking at 254 nm (Stratalinker, Stratagene, La Jolla, Calif.), and hybridized with 5.times.10.sup.6 CPM of a PCR probe of 842 base pairs in a 5 ml Dextran sulfate/SDS/NaCl solution. The probe was obtained by amplification of clone 2 with primers BLE70 and BLE71 in the presence of labeled dCTP. Specific activity of the probe was determined after purification by Chromaspin X (Clontech, Palo Alto, Calif.). After overnight hybridization at 60.degree. C, the filter was washed in successive baths of 2X SSC at increasing temperatures up to 60.degree. C. The washed filter was exposed to X-ray film to visualize the hybridization signal as an autoradiogram.

Two clear signals of approximately 750 and 1000 nucleotides in length were observed on the autoradiogram. These sizes are in good agreement with the length of cDNA clones 2, 3, and 4 which are about 993, 744, and 746 nucleotides respectively, without the poly-A tail. Accordingly, cDNA clones 2, 3, and 4 are believed to be nearly complete. These signals were barely visible for the testis, and absent for the normal lung sample. Ethidium bromide staining revealed no significant quantative difference between the six samples, and the 28S rRNA was undegraded in all samples.

Example 5

Identification of the portion of LL-1 encoding a tumor rejection antigen.

In a first method, available CTL clones directed against antigens presented by autologous tumor cells shown to express one or both of the LL-1 genes are screened for specificity against COS cells transfected with LL-1 genes and autologous HLA alleles as described by Brichard et al. (Eur. J. Immunol. 26:224-230, 1996). CTL recognition of LL-1 is determined by measuring release of TNF from the cytolytic T lymphocyte or by .sup.51 Cr release assay (Herin et al., Int. J Cancer 39:390-396, 1987). If a CTL clone specifically recognizes a transfected COS cell, the shorter fragments of the coding sequences are tested to identify the region of the gene that encodes the peptide. Fragments of LL-1.1 and LL-1.2 are prepared by exonuclease III digestion or other standard molecular biology methods. Synthetic peptides are prepared to confirm the exact sequence of the antigen.

Alternatively, CTL clones are generated by stimulating the peripheral blood lymphocytes (PBLs) of a patient with autologous normal cells transfected with DNA clones encoding LL-1.1 or LL-1.2 polypeptides (e.g. SEQ ID NOs: 4, 6 and 8) or with irradiated PBLs loaded with synthetic peptides corresponding to the putative proteins and matching the consensus for the appropriate HLA class I molecule to localize the antigenic peptide within the LL-1.1 or LL-1.2 clones (see, e.g., van der Bruggen et al., Eur. J. Immunol.24:3038-3043, 1994; MAGE3 peptides presented by HLA.A2).

Optionally, shorter fragments of LL-1.1 or LL-1.2 cDNAs are generated by PCR. Shorter fragments are used to provoke TNF release or .sup.51 Cr release as above.

Synthetic peptides corresponding to portions of the shortest fragment of LL-1.1 and/or LL-1.2 which provokes TNF release are prepared. Progressively shorter peptides are synthesized to determine the optimal LL-1 tumor rejection antigen peptides for a given HLA molecule.

Other aspects of the invention will be clear to the skilled artisan and need not be repeated here. All patents, published patent applications and literature cited herein are incorporated by reference in their entirety.

While the invention has been described with respect to certain embodiments, it should be appreciated that many modifications and changes may be made by those of ordinary skill in the art without departing from the spirit of the invention. It is intended that such modification, changes and equivalents fall within the scope of the following claims.

A sequence listing is presented followed by what is claimed.

  __________________________________________________________________________
     SEQUENCE LISTING                                                          
     (1) GENERAL INFORMATION:                                                  
     (iii) NUMBER OF SEQUENCES: 14                                             
     (2) INFORMATION FOR SEQ ID NO:1:                                          
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 217 base pairs                                                
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: double                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (v) FRAGMENT TYPE: internal                                               
     (vi) ORIGINAL SOURCE:                                                     
     (A) ORGANISM: Homo sapiens                                                
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                                   
     GATCTCAGAACACCCAAACACAAGGTCTCAGAACAGAGACCTGGTACACCAGGCCCGCCG60            
     CCACCCGAGGGAGCCCAGGGAGATGGGTGCAGAGGTGTCGCCTTTAATGTGATGTTCTCT120           
     GCCCCTCACATTTAGCCGACTGACTGCTGCAGACCACCGCCAACTGCAGCTCTCCATCAG180           
     CTCCTGTCTCCAGCAGCTTTCCCTGTTGATGTGGATC217                                  
     (2) INFORMATION FOR SEQ ID NO:2:                                          
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 18 nucleotides                                                
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (vi) ORIGINAL SOURCE:                                                     
     (A) ORGANISM: Homo sapiens                                                
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                                   
     AGATGGGTGCAGAGGTGT18                                                      
     (2) INFORMATION FOR SEQ ID NO:3:                                          
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 19 nucleotides                                                
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: YES                                                      
     (vi) ORIGINAL SOURCE:                                                     
     (A) ORGANISM: Homo sapiens                                                
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                                   
     GATCCACATCAACAGGGAA19                                                     
     (2) INFORMATION FOR SEQ ID NO:4:                                          
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 1002 base pairs                                               
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: double                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (vi) ORIGINAL SOURCE:                                                     
     (A) ORGANISM: Homo sapiens                                                
     (ix) FEATURE:                                                             
     (A) NAME/KEY: CDS                                                         
     (B) LOCATION: 65..697                                                     
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:                                   
     TCTGCCTCCGCATCCTCGTGGGCCCTGACCTTCTCTCTGAGAGCCGGGCAGAGGCTCCGG60            
     AGCCATGCAGGCCGAAGGCCAGGGCACAGGGGGTTCGACGGGCGATGCT109                      
     MetGlnAlaGluGlyGlnGlyThrGlyGlySerThrGlyAspAla                             
     151015                                                                    
     GATGGCCCAGGAGGCCCTGGCATTCCTGATGGCCCAGGGGGCAATGCT157                       
     AspGlyProGlyGlyProGlyIleProAspGlyProGlyGlyAsnAla                          
     202530                                                                    
     GGCGGCCCAGGAGAGGCGGGTGCCACGGGCGGCAGAGGTCCCCGGGGC205                       
     GlyGlyProGlyGluAlaGlyAlaThrGlyGlyArgGlyProArgGly                          
     354045                                                                    
     GCAGGGGCAGCAAGGGCCTCGGGGCCGAGAGGAGGCGCCCCGCGGGGT253                       
     AlaGlyAlaAlaArgAlaSerGlyProArgGlyGlyAlaProArgGly                          
     505560                                                                    
     CCGCATGGCGGTGCCGCTTCTGCGCAGGATGGAAGGTGCCCCTGCGGG301                       
     ProHisGlyGlyAlaAlaSerAlaGlnAspGlyArgCysProCysGly                          
     657075                                                                    
     GCCAGGAGGCCGGACAGCCGCCTGCTTCAGTTGCACATCACGATGCCT349                       
     AlaArgArgProAspSerArgLeuLeuGlnLeuHisIleThrMetPro                          
     80859095                                                                  
     TTCTCGTCGCCCATGGAAGCGGAGCTGGTCCGCAGGATCCTGTCCCGG397                       
     PheSerSerProMetGluAlaGluLeuValArgArgIleLeuSerArg                          
     100105110                                                                 
     GATGCCGCACCTCTCCCCCGACCAGGGGCGGTTCTGAAGGACTTCACC445                       
     AspAlaAlaProLeuProArgProGlyAlaValLeuLysAspPheThr                          
     115120125                                                                 
     GTGTCCGGCAACCTACTGTTTATGTCAGTTCGGGACCAGGACAGGGAA493                       
     ValSerGlyAsnLeuLeuPheMetSerValArgAspGlnAspArgGlu                          
     130135140                                                                 
     GGCGCTGGGCGGATGAGGGTGGTGGGTTGGGGGCTGGGATCCGCCTCC541                       
     GlyAlaGlyArgMetArgValValGlyTrpGlyLeuGlySerAlaSer                          
     145150155                                                                 
     CCGGAGGGGCAGAAAGCTAGAGATCTCAGAACACCCAAACACAAGGTC589                       
     ProGluGlyGlnLysAlaArgAspLeuArgThrProLysHisLysVal                          
     160165170175                                                              
     TCAGAACAGAGACCTGGTACACCAGGCCCGCCGCCACCCGAGGGAGCC637                       
     SerGluGlnArgProGlyThrProGlyProProProProGluGlyAla                          
     180185190                                                                 
     CAGGGAGATGGGTGCAGAGGTGTCGCCTTTAATGTGATGTTCTCTGCC685                       
     GlnGlyAspGlyCysArgGlyValAlaPheAsnValMetPheSerAla                          
     195200205                                                                 
     CCTCACATTTAGCCGACTGACTGCTGCAGACCACCGCCAACTGCAGCTC734                      
     ProHisIle                                                                 
     210                                                                       
     TCCATCAGCTCCTGTCTCCAGCAGCTTTCCCTGTTGATGTGGATCACGCAGTGCTTTCTG794           
     CCCGTGTTTTTGGCTCAGGCTCCCTCAGGGCAGAGGCGCTAAGCCCAGCCTGGCGCCCCT854           
     TCCTAGGTCATGCCTCCTCCCCTAGGGAATGGTCCCAGCACGAGTGGCCAGTTCATTGTG914           
     GGGGCCTGATTGTTTGTCGCTGGAGGAGGACGGCTTACATGTTTGTTTCTGTAGAAAATA974           
     AAGCTGAGCTACGATTCCGAAAAAAAAA1002                                          
     (2) INFORMATION FOR SEQ ID NO:5:                                          
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 210 amino acids                                               
     (B) TYPE: amino acid                                                      
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:                                   
     MetGlnAlaGluGlyGlnGlyThrGlyGlySerThrGlyAspAlaAsp                          
     151015                                                                    
     GlyProGlyGlyProGlyIleProAspGlyProGlyGlyAsnAlaGly                          
     202530                                                                    
     GlyProGlyGluAlaGlyAlaThrGlyGlyArgGlyProArgGlyAla                          
     354045                                                                    
     GlyAlaAlaArgAlaSerGlyProArgGlyGlyAlaProArgGlyPro                          
     505560                                                                    
     HisGlyGlyAlaAlaSerAlaGlnAspGlyArgCysProCysGlyAla                          
     65707580                                                                  
     ArgArgProAspSerArgLeuLeuGlnLeuHisIleThrMetProPhe                          
     859095                                                                    
     SerSerProMetGluAlaGluLeuValArgArgIleLeuSerArgAsp                          
     100105110                                                                 
     AlaAlaProLeuProArgProGlyAlaValLeuLysAspPheThrVal                          
     115120125                                                                 
     SerGlyAsnLeuLeuPheMetSerValArgAspGlnAspArgGluGly                          
     130135140                                                                 
     AlaGlyArgMetArgValValGlyTrpGlyLeuGlySerAlaSerPro                          
     145150155160                                                              
     GluGlyGlnLysAlaArgAspLeuArgThrProLysHisLysValSer                          
     165170175                                                                 
     GluGlnArgProGlyThrProGlyProProProProGluGlyAlaGln                          
     180185190                                                                 
     GlyAspGlyCysArgGlyValAlaPheAsnValMetPheSerAlaPro                          
     195200205                                                                 
     HisIle                                                                    
     210                                                                       
     (2) INFORMATION FOR SEQ ID NO:6:                                          
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 755 base pairs                                                
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: double                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (vi) ORIGINAL SOURCE:                                                     
     (A) ORGANISM: Homo sapiens                                                
     (ix) FEATURE:                                                             
     (A) NAME/KEY: CDS                                                         
     (B) LOCATION: 53..595                                                     
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:                                   
     TCCTCGTGGGCCCTGACCTTCTCTCTGAGAGCCGGGCAGAGGCTCCGGAGCCATG55                 
     Met                                                                       
     CAGGCCGAAGGCCAGGGCACAGGGGGTTCGACGGGCGATGCTGATGGC103                       
     GlnAlaGluGlyGlnGlyThrGlyGlySerThrGlyAspAlaAspGly                          
     51015                                                                     
     CCAGGAGGCCCTGGCATTCCTGATGGCCCAGGGGGCAATGCTGGCGGC151                       
     ProGlyGlyProGlyIleProAspGlyProGlyGlyAsnAlaGlyGly                          
     202530                                                                    
     CCAGGAGAGGCGGGTGCCACGGGCGGCAGAGGTCCCCGGGGCGCAGGG199                       
     ProGlyGluAlaGlyAlaThrGlyGlyArgGlyProArgGlyAlaGly                          
     354045                                                                    
     GCAGCAAGGGCCTCGGGGCCGAGAGGAGGCGCCCCGCGGGGTCCGCAT247                       
     AlaAlaArgAlaSerGlyProArgGlyGlyAlaProArgGlyProHis                          
     50556065                                                                  
     GGCGGTGCCGCTTCTGCGCAGGATGGAAGGTGCCCCTGCGGGGCCAGG295                       
     GlyGlyAlaAlaSerAlaGlnAspGlyArgCysProCysGlyAlaArg                          
     707580                                                                    
     AGGCCGGACAGCCGCCTGCTTCAGTTGCACATCACGATGCCTTTCTCG343                       
     ArgProAspSerArgLeuLeuGlnLeuHisIleThrMetProPheSer                          
     859095                                                                    
     TCGCCCATGGAAGCGGAGCTGGTCCGCAGGATCCTGTCCCGGGATGCC391                       
     SerProMetGluAlaGluLeuValArgArgIleLeuSerArgAspAla                          
     100105110                                                                 
     GCACCTCTCCCCCGACCAGGGGCGGTTCTGAAGGACTTCACCGTGTCC439                       
     AlaProLeuProArgProGlyAlaValLeuLysAspPheThrValSer                          
     115120125                                                                 
     GGCAACCTACTGTTTATCCGACTGACTGCTGCAGACCACCGCCAACTG487                       
     GlyAsnLeuLeuPheIleArgLeuThrAlaAlaAspHisArgGlnLeu                          
     130135140145                                                              
     CAGCTCTCCATCAGCTCCTGTCTCCAGCAGCTTTCCCTGTTGATGTGG535                       
     GlnLeuSerIleSerSerCysLeuGlnGlnLeuSerLeuLeuMetTrp                          
     150155160                                                                 
     ATCACGCAGTGCTTTCTGCCCGTGTTTTTGGCTCAGGCTCCCTCAGGG583                       
     IleThrGlnCysPheLeuProValPheLeuAlaGlnAlaProSerGly                          
     165170175                                                                 
     CAGAGGCGCTAAGCCCAGCCTGGCGCCCCTTCCTAGGTCATGCCTCCTC632                      
     GlnArgArg                                                                 
     180                                                                       
     CCCTAGGGAATGGTCCCAGCACGAGTGGCCAGTTCATTGTGGGGGCCTGATTGTTTGTCG692           
     CTGGAGGAGGACGGCTTACATGTTTGTTTCTGTAGAAAATAAAGCTGAGCTACGAAAAAA752           
     AAA755                                                                    
     (2) INFORMATION FOR SEQ ID NO:7:                                          
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 180 amino acids                                               
     (B) TYPE: amino acid                                                      
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:                                   
     MetGlnAlaGluGlyGlnGlyThrGlyGlySerThrGlyAspAlaAsp                          
     151015                                                                    
     GlyProGlyGlyProGlyIleProAspGlyProGlyGlyAsnAlaGly                          
     202530                                                                    
     GlyProGlyGluAlaGlyAlaThrGlyGlyArgGlyProArgGlyAla                          
     354045                                                                    
     GlyAlaAlaArgAlaSerGlyProArgGlyGlyAlaProArgGlyPro                          
     505560                                                                    
     HisGlyGlyAlaAlaSerAlaGlnAspGlyArgCysProCysGlyAla                          
     65707580                                                                  
     ArgArgProAspSerArgLeuLeuGlnLeuHisIleThrMetProPhe                          
     859095                                                                    
     SerSerProMetGluAlaGluLeuValArgArgIleLeuSerArgAsp                          
     100105110                                                                 
     AlaAlaProLeuProArgProGlyAlaValLeuLysAspPheThrVal                          
     115120125                                                                 
     SerGlyAsnLeuLeuPheIleArgLeuThrAlaAlaAspHisArgGln                          
     130135140                                                                 
     LeuGlnLeuSerIleSerSerCysLeuGlnGlnLeuSerLeuLeuMet                          
     145150155160                                                              
     TrpIleThrGlnCysPheLeuProValPheLeuAlaGlnAlaProSer                          
     165170175                                                                 
     GlyGlnArgArg                                                              
     180                                                                       
     (2) INFORMATION FOR SEQ ID NO:8:                                          
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 755 base pairs                                                
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: double                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (vi) ORIGINAL SOURCE:                                                     
     (A) ORGANISM: Homo sapiens                                                
     (ix) FEATURE:                                                             
     (A) NAME/KEY: CDS                                                         
     (B) LOCATION: 51..593                                                     
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:                                   
     CTCGTGGGCCCTGACCTTCTCTCTGAGAGCCGGGCAGAGGCTCCGGAGCCATGCAG56                
     MetGln                                                                    
     1                                                                         
     GCCGAAGGCCGGGGCACAGGGGGTTCGACGGGCGATGCTGATGGCCCA104                       
     AlaGluGlyArgGlyThrGlyGlySerThrGlyAspAlaAspGlyPro                          
     51015                                                                     
     GGAGGCCCTGGCATTCCTGATGGCCCAGGGGGCAATGCTGGCGGCCCA152                       
     GlyGlyProGlyIleProAspGlyProGlyGlyAsnAlaGlyGlyPro                          
     202530                                                                    
     GGAGAGGCGGGTGCCACGGGCGGCAGAGGTCCCCGGGGCGCAGGGGCA200                       
     GlyGluAlaGlyAlaThrGlyGlyArgGlyProArgGlyAlaGlyAla                          
     35404550                                                                  
     GCAAGGGCCTCGGGGCCGGGAGGAGGCGCCCCGCGGGGTCCGCATGGC248                       
     AlaArgAlaSerGlyProGlyGlyGlyAlaProArgGlyProHisGly                          
     556065                                                                    
     GGCGCGGCTTCAGGGCTGAATGGATGCTGCAGATGCGGGGCCAGGGGG296                       
     GlyAlaAlaSerGlyLeuAsnGlyCysCysArgCysGlyAlaArgGly                          
     707580                                                                    
     CCGGAGAGCCGCCTGCTTGAGTTCTACCTCGCCATGCCTTTCGCGACA344                       
     ProGluSerArgLeuLeuGluPheTyrLeuAlaMetProPheAlaThr                          
     859095                                                                    
     CCCATGGAAGCAGAGCTGGCCCGCAGGAGCCTGGCCCAGGATGCCCCA392                       
     ProMetGluAlaGluLeuAlaArgArgSerLeuAlaGlnAspAlaPro                          
     100105110                                                                 
     CCGCTTCCCGTGCCAGGGGTGCTTCTGAAGGAGTTCACTGTGTCCGGC440                       
     ProLeuProValProGlyValLeuLeuLysGluPheThrValSerGly                          
     115120125130                                                              
     AACATACTGACTATCCGACTGACTGCTGCAGACCACCGCCAACTGCAG488                       
     AsnIleLeuThrIleArgLeuThrAlaAlaAspHisArgGlnLeuGln                          
     135140145                                                                 
     CTCTCCATCAGCTCCTGTCTCCAGCAGCTTTCCCTGTTGATGTGGATC536                       
     LeuSerIleSerSerCysLeuGlnGlnLeuSerLeuLeuMetTrpIle                          
     150155160                                                                 
     ACGCAGTGCTTTCTGCCCGTGTTTTTGGCTCAGCCTCCCTCAGGGCAG584                       
     ThrGlnCysPheLeuProValPheLeuAlaGlnProProSerGlyGln                          
     165170175                                                                 
     AGGCGCTAAGCCCAGCCTGGCGCCCCTTCCTAGGTCATGCCTCCTCCCCTAGGGAA640               
     ArgArg                                                                    
     180                                                                       
     TGGTCCCAGCACGAGTGGCCAGTTCATTGTGGGGGCCTGATTGTTTGTCGCTGGAGGAGG700           
     ACGGCTTACATGTTTGTTTCTGTAGAAAATAAAACTGAGCTACGAAAAAAAAAAA755                
     (2) INFORMATION FOR SEQ ID NO:9:                                          
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 180 amino acids                                               
     (B) TYPE: amino acid                                                      
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: protein                                               
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:                                   
     MetGlnAlaGluGlyArgGlyThrGlyGlySerThrGlyAspAlaAsp                          
     151015                                                                    
     GlyProGlyGlyProGlyIleProAspGlyProGlyGlyAsnAlaGly                          
     202530                                                                    
     GlyProGlyGluAlaGlyAlaThrGlyGlyArgGlyProArgGlyAla                          
     354045                                                                    
     GlyAlaAlaArgAlaSerGlyProGlyGlyGlyAlaProArgGlyPro                          
     505560                                                                    
     HisGlyGlyAlaAlaSerGlyLeuAsnGlyCysCysArgCysGlyAla                          
     65707580                                                                  
     ArgGlyProGluSerArgLeuLeuGluPheTyrLeuAlaMetProPhe                          
     859095                                                                    
     AlaThrProMetGluAlaGluLeuAlaArgArgSerLeuAlaGlnAsp                          
     100105110                                                                 
     AlaProProLeuProValProGlyValLeuLeuLysGluPheThrVal                          
     115120125                                                                 
     SerGlyAsnIleLeuThrIleArgLeuThrAlaAlaAspHisArgGln                          
     130135140                                                                 
     LeuGlnLeuSerIleSerSerCysLeuGlnGlnLeuSerLeuLeuMet                          
     145150155160                                                              
     TrpIleThrGlnCysPheLeuProValPheLeuAlaGlnProProSer                          
     165170175                                                                 
     GlyGlnArgArg                                                              
     180                                                                       
     (2) INFORMATION FOR SEQ ID NO:10:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 18 nucleotides                                                
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (vi) ORIGINAL SOURCE:                                                     
     (A) ORGANISM: Homo sapiens                                                
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:                                  
     GCCATGCAGGCCGAAGGC18                                                      
     (2) INFORMATION FOR SEQ ID NO:11:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 19 nucleotides                                                
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: YES                                                      
     (vi) ORIGINAL SOURCE:                                                     
     (A) ORGANISM: Homo sapiens                                                
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:                                  
     CTGGCCACTCGTGCTGGGA19                                                     
     (2) INFORMATION FOR SEQ ID NO:12:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 18 nucleotides                                                
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (vi) ORIGINAL SOURCE:                                                     
     (A) ORGANISM: Homo sapiens                                                
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:                                  
     GCAGGATGGAAGGTGCCC18                                                      
     (2) INFORMATION FOR SEQ ID NO:13:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 17 nucleotides                                                
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (vi) ORIGINAL SOURCE:                                                     
     (A) ORGANISM: Homo sapiens                                                
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:                                  
     CCCCACCGCTTCCCGTG17                                                       
     (2) INFORMATION FOR SEQ ID NO:14:                                         
     (i) SEQUENCE CHARACTERISTICS:                                             
     (A) LENGTH: 20 nucleotides                                                
     (B) TYPE: nucleic acid                                                    
     (C) STRANDEDNESS: single                                                  
     (D) TOPOLOGY: linear                                                      
     (ii) MOLECULE TYPE: cDNA                                                  
     (iii) HYPOTHETICAL: NO                                                    
     (iv) ANTI-SENSE: NO                                                       
     (vi) ORIGINAL SOURCE:                                                     
     (A) ORGANISM: Homo sapiens                                                
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:                                  
     GGCTGAATGGATGCTGCAGA20                                                    
     __________________________________________________________________________

Claims

1. An isolated polypeptide encoded by an isolated nucleic arid molecule selected from the group consisting of (a) SEQ ID NO:4 and allelic variants thereof, and (b) SEQ ID NO:6 and allelic variants thereof.

2. An isolated polypeptide selected from the group consisting of:

(a) a fragment of SEQ ID NO:5 between 5 and 209 consecutive amino acids in length, and

(b) a fragment of SEQ ID NO:7 between 5 and 179 consecutive amino acids in length.

3. The isolated polypeptide of claim 1, wherein the isolated nucleic acid molecule comprises the nucleic acid sequence of SEQ ID NO:4.

4. The isolated polypeptide of claim 1, wherein the isolated nucleic acid molecule comprises the nucleic acid sequence of SEQ ID NO:6.

5. The isolated polypeptide of claim 2, wherein the isolated polypeptide comprises a fragment of SEQ ID NO:5 between 9 and 209 consecutive amino acids in length.

6. The isolated polypeptide of claim 2, wherein the isolated polypeptide comprises a fragment of SEQ ID NO:7 between 9 and 179 consecutive amino acids in length.

7. The isolated polypeptide of claim 2, wherein the fragment binds to a polypeptide-binding agent.

8. The isolated polypeptide of claim 2, wherein the fragment binds to an antibody or a cytotoxic T lymphocyte.

9. The isolated polypeptide of claim 8, wherein the fragment binds to an antibody.

10. The isolated polypeptide of claim 8, wherein the fragment binds to a cytotoxic T lymphocyte.

11. The isolated polypeptide of claim 2, wherein the isolated polypeptide comprises a fragment of SEQ ID NO:5 between 10 and 209 consecutive amino acids in length.

12. The isolated polypeptide of claim 2, wherein the isolated polypeptide comprises a fragment of SEQ ID NO:7 between 10 and 179 consecutive amino acids in length.

13. An isolated polypeptide comprising an amino acid sequence selected from the group consisting of(a) SEQ ID NO:5 and allelic variants thereof, and (b) SEQ ID NO:7 and allelic variants thereof.