Streptococcus pneumoniae antigens and vaccines

The present invention relates to novel vaccines for the prevention or attenuation of infection by Streptococcus pneumoniae. The invention further relates to isolated nucleic acid molecules encoding antigenic polypeptides of Streptococcus pneumoniae. Antigenic polypeptides are also provided, as are vectors, host cells and recombinant methods for producing the same. The invention additionally relates to diagnostic methods for detecting Streptococcus nucleic acids, polypeptides and antibodies in a biological sample.

Skip to: Description  ·  Claims  ·  References Cited  · Patent History  ·  Patent History
Description
FIELD OF THE INVENTION

The present invention relates to novel Streptococcus pneumoniae antigens for the detection of Streptococcus and for the prevention or attenuation of disease caused by Streptococcus. The invention further relates to isolated nucleic acid molecules encoding antigenic polypeptides of S. pneumoniae. Antigenic polypeptides are also provided, as are vectors, host cells and recombinant methods for producing the same. The invention additionally relates to diagnostic methods for detecting Streptococcus gene expression.

BACKGROUND OF THE INVENTION

Streptococcus pneumoniae has been one of the most extensively studied microorganisms since its first isolation in 1881. It was the object of many investigations that led to important scientific discoveries. In 1928, Griffith observed that when heat-killed encapsulated pneumococci and live strains constitutively lacking any capsule were concomitantly injected into mice, the nonencapsulated could be converted into encapsulated pneumococci with the same capsular type as the heat-killed strain. Years later, the nature of this "transforming principle," or carrier of genetic information, was shown to be DNA. (Avery, O. T., et al., J. Exp. Med., 79:137-157 (1944)).

In spite of the vast number of publications on S. pneumoniae many questions about its virulence are still unanswered, and this pathogen remains a major causative agent of serious human disease, especially community-acquired pneumonia. (Johnston, R. B., et al., Rev. Infect. Dis. 13(Suppl. 6):S509-517 (1991)). In addition, in developing countries, the pneumococcus is responsible for the death of a large number of children under the age of 5 years from pneumococcal pneumonia. The incidence of pneumococcal disease is highest in infants under 2 years of age and in people over 60 years of age. Pneumococci are the second most frequent cause (after Haemophilus influenzae type b) of bacterial meningitis and otitis media in children. With the recent introduction of conjugate vaccines for H. influenzae type b, pneumococcal meningitis is likely to become increasingly prominent. S. pneumoniae is the most important etiologic agent of community-acquired pneumonia in adults and is the second most common cause of bacterial meningitis behind Neisseria meningitidis.

The antibiotic generally prescribed to treat S. pneumoniae is benzylpenicillin, although resistance to this and to other antibiotics is found occasionally. Pneumococcal resistance to penicillin results from mutations in its penicillin-binding proteins. In uncomplicated pneumococcal pneumonia caused by a sensitive strain, treatment with penicillin is usually successful unless started too late. Erythromycin or clindamycin can be used to treat pneumonia in patients hypersensitive to penicillin, but resistant strains to these drugs exist. Broad spectrum antibiotics (e.g., the tetracyclines) may also be effective, although tetracycline-resistant strains are not rare. In spite of the availability of antibiotics, the mortality of pneumococcal bacteremia in the last four decades has remained stable between 25 and 29%. (Gillespie, S. H., et al., J. Med. Microbiol. 28:237-248 (1989).

S. pneumoniae is carried in the upper respiratory tract by many healthy individuals. It has been suggested that attachment of pneumococci is mediated by a disaccharide receptor on fibronectin, present on human pharyngeal epithelial cells. (Anderson, B. J., et al., J. Immunol. 142:2464-2468 (1989). The mechanisms by which pneumococci translocate from the nasopharynx to the lung, thereby causing pneumonia, or migrate to the blood, giving rise to bacteremia or septicemia, are poorly understood. (Johnston, R. B., et al., Rev. Infect. Dis. 13(Suppl. 6):S509-517 (1991).

Various proteins have been suggested to be involved in the pathogenicity of S. pneumoniae, however, only a few of them have actually been confirmed as virulence factors. Pneumococci produce an IgAl protease that might interfere with host defense at mucosal surfaces. (Komfield, S. J., et al., Rev. Inf Dis. 3:521-534 (1981). S. pneumoniae also produces neuraminidase, an enzyme that may facilitate attachment to epithelial cells by cleaving sialic acid from the host glycolipids and gangliosides. Partially purified neuraminidase was observed to induce meningitis-like symptoms in mice; however, the reliability of this finding has been questioned because the neuraminidase preparations used were probably contaminated with cell wall products. Other pneumococcal proteins besides neuraminidase are involved in the adhesion of pneumococci to epithelial and endothelial cells. These pneumococcal proteins have as yet not been identified. Recently, Cundell et al., reported that peptide permeases can modulate pneumococcal adherence to epithelial and endothelial cells. It was, however, unclear whether these permeases function directly as adhesions or whether they enhance adherence by modulating the expression of pneumococcal adhesions. (DeVelasco, E. A., et al., Micro. Rev. 59:591-603 (1995). A better understanding of the virulence factors determining its pathogenicity will need to be developed to cope with the devastating effects of pneumococcal disease in humans.

Ironically, despite the prominent role of S. pneumoniae in the discovery of DNA, little is known about the molecular genetics of the organism. The S. pneumoniae genome consists of one circular, covalently closed, double-stranded DNA and a collection of so-called variable accessory elements, such as prophages, plasmids, transposons and the like. Most physical characteristics and almost all of the genes of S. pneumoniae are unknown. Among the few that have been identified, most have not been physically mapped or characterized in detail. Only a few genes of this organism have been sequenced. (See, for instance current versions of GENBANK and other nucleic acid databases, and references that relate to the genome of S. pneumoniae such as those set out elsewhere herein.) Identification of in vivo-expressed, and broadly protective, antigens of S. pneumoniae has remained elusive.

SUMMARY OF THE INVENTION

The present invention provides isolated nucleic acid molecules comprising polynucleotides encoding the S. pneumoniae polypeptides described in Table 1 and having the amino acid sequences shown as SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, and so on through SEQ ID NO:226. Thus, one aspect of the invention provides isolated nucleic acid molecules comprising polynucleotides having a nucleotide sequence selected from the group consisting of: (a) a nucleotide sequence encoding any of the amino acid sequences of the polypeptides shown in Table 1; and (b) a nucleotide sequence complementary to any of the nucleotide sequences in (a).

Further embodiments of the invention include isolated nucleic acid molecules that comprise a polynucleotide having a nucleotide sequence at least 90% identical, and more preferably at least 95%, 96%, 97%, 98% or 99% identical, to any of the nucleotide sequences in (a) or (b) above, or a polynucleotide which hybridizes under stringent hybridization conditions to a polynucleotide in (a) or (b) above. This polynucleotide which hybridizes does not hybridize under stringent hybridization conditions to a polynucleotide having a nucleotide sequence consisting of only A residues or of only T residues. Additional nucleic acid embodiments of the invention relate to isolated nucleic acid molecules comprising polynucleotides which encode the amino acid sequences of epitope-bearing portions of an S. pneumoniae polypeptide having an amino acid sequence in (a) above.

The present invention also relates to recombinant vectors, which include the isolated nucleic acid molecules of the present invention, and to host cells containing the recombinant vectors, as well as to methods of making such vectors and host cells and for using these vectors for the production of S. pneumoniae polypeptides or peptides by recombinant techniques.

The invention further provides isolated S. pneumoniae polypeptides having an amino acid sequence selected from the group consisting of an amino acid sequence of any of the polypeptides described in Table 1.

The polypeptides of the present invention also include polypeptides having an amino acid sequence with at least 70% similarity, and more preferably at least 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% similarity to those described in Table 1, as well as polypeptides having an amino acid sequence at least 70% identical, more preferably at least 75% identical, and still more preferably 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identical to those above; as well as isolated nucleic acid molecules encoding such polypeptides.

The present invention further provides a vaccine, preferably a multi-component vaccine comprising one or more of the S. pneumoniae polynucleotides or polypeptides described in Table 1, or fragments thereof, together with a pharmaceutically acceptable diluent, carrier, or excipient, wherein the S. pneumoniae polypeptide(s) are present in an amount effective to elicit an immune response to members of the Streptococcus genus in an animal. The S. pneumoniae polypeptides of the present invention may further be combined with one or more immunogens of one or more other streptococcal or non-streptococcal organisms to produce a multi-component vaccine intended to elicit an immunological response against members of the Streptococcus genus and, optionally, one or more non-streptococcal organisms.

The vaccines of the present invention can be administered in a DNA form, e.g., "naked" DNA, wherein the DNA encodes one or more streptococcal polypeptides and, optionally, one or more polypeptides of a non-streptococcal organism. The DNA encoding one or more polypeptides may be constructed such that these polypeptides are expressed fusion proteins.

The vaccines of the present invention may also be administered as a component of a genetically engineered organism. Thus, a genetically engineered organism which expresses one or more S. pneumoniae polypeptides may be administered to an animal. For example, such a genetically engineered organism may contain one or more S. pneumoniae polypeptides of the present invention intracellularly, on its cell surface, or in its periplasmic space. Further, such a genetically engineered organism may secrete one or more S. pneumoniae polypeptides.

The vaccines of the present invention may be co-administered to an animal with an immune system modulator (e.g., CD86 and GM-CSF).

The invention also provides a method of inducing an immunological response in an animal to one or more members of the Streptococcus genus, preferably one or more isolates of the S. pneumoniae genus, comprising administering to the animal a vaccine as described above.

The invention further provides a method of inducing a protective immune response in an animal, sufficient to prevent or attenuate an infection by members of the Streptococcus genus, preferrably at least S. pneumoniae, comprising administering to the animal a composition comprising one or more of the polynucleotides or polypeptides described in Table 1, or fragments thereof. Further, these polypeptides, or fragments thereof, may be conjugated to another immunogen and/or administered in admixture with an adjuvant.

The invention further relates to antibodies elicited in an animal by the administration of one or more S. pneumoniae polypeptides of the present invention and to methods for producing such antibodies.

The invention also provides diagnostic methods for detecting the expression of genes of members of the Streptococcus genus in an animal. One such method involves assaying for the expression of a gene encoding S. pneumoniae peptides in a sample from an animal. This expression may be assayed either directly (e.g., by assaying polypeptide levels using antibodies elicited in response to amino acid sequences described in Table 1) or indirectly (e.g., by assaying for antibodies having specificity for amino acid sequences described in Table 1). An example of such a method involves the use of the polymerase chain reaction (PCR) to amplify and detect Streptococcus nucleic acid sequences.

The present invention also relates to nucleic acid probes having all or part of a nucleotide sequence described in Table 1 (shown as SEQ ID NO: 1, SEQ ID NO:3, SEQ ID NO:5, and so on through SEQ ID NO:225) which are capable of hybridizing under stringent conditions to Streptococcus nucleic acids. The invention further relates to a method of detecting one or more Streptococcus nucleic acids in a biological sample obtained from an animal, said one or more nucleic acids encoding Streptococcus polypeptides, comprising: (a) contacting the sample with one or more of the above-described nucleic acid probes, under conditions such that hybridization occurs, and (b) detecting hybridization of said one or more probes to the Streptococcus nucleic acid present in the biological sample.

The invention also includes immunoassays, including an immunoassay for detecting Streptococcus, preferrably at least isolates of the S. pneumoniae genus, comprising incubation of a sample (which is suspected of being infected with Streptococcus) with a probe antibody directed against an antigen/epitope of S. pneumoniae, to be detected under conditions allowing the formation of an antigen-antibody complex; and detecting the antigen-antibody complex which contains the probe antibody. An immunoassay for the detection of antibodies which are directed against a Streptococcus antigen comprising the incubation of a sample (containing antibodies from a mammal suspected of being infected with Streptococcus) with a probe polypeptide including an epitope of S. pneumoniae, under conditions that allow the formation of antigen-antibody complexes which contain the probe epitope containing antigen.

Some aspects of the invention pertaining to kits are those for: investigating samples for the presence of polynucleotides derived from Streptococcus which comprise a polynucleotide probe including a nucleotide sequence selected from Table 1 or a fragment thereof of approximately 15 or more nucleotides, in an appropriate container; analyzing the samples for the presence of antibodies directed against a Streptococcus antigen made up of a polypeptide which contains a S. pneumoniae epitope present in the polypeptide, in a suitable container; and analyzing samples for the presence of Streptococcus antigens made up of an anti-S. pneumoniae antibody, in a suitable container.

DETAILED DESCRIPTION

The present invention relates to recombinant antigenic S. pneumoniae polypeptides and fragments thereof. The invention also relates to methods for using these polypeptides to produce immunological responses and to confer immunological protection to disease caused by members of the genus Streptococcus, at least isolates of the S. pneumoniae genus. The invention further relates to nucleic acid sequences which encode antigenic S. pneumoniae polypeptides and to methods for detecting S. pneumoniae nucleic acids and polypeptides in biological samples. The invention also relates to S. pneumoniae-specific antibodies and methods for detecting such antibodies produced in a host animal.

DEFINITIONS

The following definitions are provided to clarify the subject matter which the inventors consider to be the present invention.

As used herein, the phrase "pathogenic agent" means an agent which causes a disease state or affliction in an animal. Included within this definition, for examples, are bacteria, protozoans, fungi, viruses and metazoan parasites which either produce a disease state or render an animal infected with such an organism susceptible to a disease state (e.g., a secondary infection). Further included are species and strains of the genus Streptococcus which produce disease states in animals.

As used herein, the term "organism" means any living biological system, including viruses, regardless of whether it is a pathogenic agent.

As used herein, the term "Streptococcus" means any species or strain of bacteria which is members of the genus Streptococcus. Such species and strains are known to those of skill in the art, and include those that are pathogenic and those that are not.

As used herein, the phrase "one or more S. pneumoniae polypeptides of the present invention" means polypeptides comprising the amino acid sequence of one or more of the S. pneumoniae polypeptides described in Table 1 and disclosed as SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, and so on through SEQ ID NO:226. These polypeptides may be expressed as fusion proteins wherein the S. pneumoniae polypeptides of the present invention are linked to additional amino acid sequences which may be of streptococcal or non-streptococcal origin. This phrase further includes polypeptide comprising fragments of the S. pneumoniae polypeptides of the present invention.

Additional definitions are provided throughout the specification.

EXPLANATION OF TABLE 1

Table 1, below, provides information describing 113 open reading frames (ORFs) which encode potentially antigenic polypeptides of S. pneumoniae of the present invention. The table lists the ORF identifier which consists of the letters SP, which denote S. pneumoniae, followed immediately by a three digit numeric code, which arbitrarily number the potentially antigenic polypeptides of S. pneumoniae of the present invention and the nucleotide or amino acid sequence of each ORF and encoded polypeptide. The table further correlates the ORF identifier with a sequence identification number (SEQ ID NO:). The actual nucleotide or amino acid sequence of each ORF identifier is also shown in the Sequence Listing under the corresponding SEQ ID NO.

Thus, for example, the designation "SP126" refers to both the nucleotide and amino acid sequences of S. pneumoniae polypeptide number 126 of the present invention. Further, "SP126" correlates with the nucleotide sequence shown as SEQ ID NO:223 and with the amino acid sequence shown as SEQ ID NO:224 as is described in Table 1.

The open reading frame within each "ORF" begins with the second nucleotide shown. Thus, the first codon for each nucleotide sequence shown is bases 2-4, the second 5-7, the third 8-10, and so on.

EXPLANATION OF TABLE 2

Table 2 lists the antigenic epitopes present in each of the S. pneumoniae polypeptides described in Table 1 as predicted by the inventors. Each S. pneumoniae polypeptide shown in Table 1 has one or more antigenic epitopes described in Table 2. It will be appreciated that depending on the analytical criteria used to predict antigenic determinants, the exact address of the determinant may vary slightly. The exact location of the antigenic determinant may shift by about 1 to 5 residues, more likely 1 to 2 residues, depending on the criteria used. Thus, the first antigenic determinant described in Table 2, "Lys-1 to Ile-10" of SP001, represents a peptide comprising the lysine at position 1 in SEQ ID NO:2 through and including the isoleucine at position 10 in SEQ ID NO:2, but may include more or fewer residues than those 10. It will also be appreciated that, generally speaking, amino acids can be added to either terminus of a peptide or polypeptide containing an antigenic epitope without affecting its activity, whereas removing residues from a peptide or polypeptide containing only the antigenic determinant is much more likely to destroy activity. It will be appreciated that the residues and locations shown described in Table 2 correspond to the amino acid sequences for each ORF shown in Table 1 and in the Sequence Listing.

EXPLANATION OF TABLE 3

Table 3 shows PCR primers designed by the inventors for the amplification of polynucleotides encoding polypeptides of the present invention according to the method of Example 1. PCR primer design is routine in the art and those shown in Table 3 are provided merely for the convenience of the skilled artisan. It will be appreciated that others can be used with equal success.

For each primer, the table lists the corresponding ORF designation from Table 1 followed by either an "A" or a "B". The "A" primers are the 5' primers and the "B" primers 3'. A restriction enzyme site was built into each primer to allow ease of cloning. The restriction enzyme which will recognize and cleave a sequence within each primer is shown in Table 3, as well, under the heading "RE" for restriction enzyme. Finally the sequence identifier is shown in Table 3 for each primer for easy correlation with the Sequence Listing.

SELECTION OF NUCLEIC ACID SEQUENCES ENCODING ANTIGENIC S. PNEUMONIAE POLYPEPTIDES

The present invention provides a select number of ORFs from those presented in the fragments of the S. pneumoniae genome which may prove useful for the generation of a protective immune response. The sequenced S. pneumoniae genomic DNA was obtained from a sub-cultured isolate of S. pneumoniae Strain 7/87 14.8.91, which has been deposited at the American Type Culture Collection, as a convenience to those of skill in the art. The S. pneumoniae isolate was deposited on Oct. 10, 1996 at the ATCC, 10801 University Blvd., Manassas, Va. 20110-2209, U.S.A., and given accession number 55840. A genomic library constructed from DNA isolated from the S. pneumoniae isolate was also deposited at the ATCC on Oct. 11, 1996 and given ATCC Deposit No. 97755. A more complete listing of the sequence obtained from the S. pneumoniae genome may be found in co-pending U.S. Provisional application Ser. No. 60/029,960, filed Oct. 31, 1996, incorporated herein by reference in its entirety. Some ORFs contained in the subset of fragments of the S. pneumoniae genome disclosed herein were derived through the use of a number of screening criteria detailed below.

The selected ORFs do not consist of complete ORPFs. Although a polypeptide representing a complete ORF may be the closest approximation of a protein native to an organism, it is not always preferred to express a complete ORF in a heterologous system. It may be challenging to express and purify a highly hydrophobic protein by common laboratory methods. Thus, the polypeptide vaccine candidates described herein may have been modified slightly to simplify the production of recombinant protein. For example, nucleotide sequences which encode highly hydrophobic domains, such as those found at the amino terminal signal sequence, have been excluded from some constructs used for in vitro expression of the polypeptides. Furthermore, any highly hydrophobic amino acid sequences occurring at the carboxy terminus have also been excluded from the recombinant expression constructs. Thus, in one embodiment, a polypeptide which represents a truncated or modified ORF may be used as an antigen.

While numerous methods are known in the art for selecting potentially immunogenic polypeptides, many of the ORFs disclosed herein were selected on the basis of screening all theoretical S. pneumoniae ORFs for several aspects of potential immunogenicity. One set of selection criteria are as follows:

1. Type I signal sequence: An amino terminal type I signal sequence generally directs a nascent protein across the plasma and outer membranes to the exterior of the bacterial cell. Experimental evidence obtained from studies with Escherichia coli suggests that the typical type I signal sequence consists of the following biochemical and physical attributes (Izard, J. W. and Kendall, D. A. Mol. Microbiol. 13:765-773 (1994)). The length of the type I signal sequence is approximately 15 to 25 primarily hydrophobic amino acid residues with a net positive charge in the extreme amino terminus. In addition, the central region of the signal sequence adopts an alpha-helical conformation in a hydrophobic environment. Finally, the region surrounding the actual site of cleavage is ideally six residues long, with small side-chain amino acids in the -1 and -3 positions.

2. Type IV signal sequence: The type IV signal sequence is an example of the several types of functional signal sequences which exist in addition to the type I signal sequence detailed above. Although functionally related, the type IV signal sequence possesses a unique set of biochemical and physical attributes (Strom, M. S. and Lory, S., J. Bacteriol. 174:7345-7351 (1992)). These are typically six to eight amino acids with a net basic charge followed by an additional sixteen to thirty primarily hydrophobic residues. The cleavage site of a type IV signal sequence is typically after the initial six to eight amino acids at the extreme amino terminus. In addition, type IV signal sequences generally contain a phenylalanine residue at the +1 site relative to the cleavage site.

3. Lipoprotein: Studies of the cleavage sites of twenty-six bacterial lipoprotein precursors has allowed the definition of a consensus amino acid sequence for lipoprotein cleavage. Nearly three-fourths of the bacterial lipoprotein precursors examined contained the sequence L-(A,S)-(G,A)-C at positions -3 to +1, relative to the point of cleavage (Hayashi, S. and Wu, H. C., J. Bioenerg. Biomembr. 22:451-471 (1990)).

4. LPXTG motif: It has been experimentally determined that most anchored proteins found on the surface of gram-positive bacteria possess a highly conserved carboxy terminal sequence. More than fifty such proteins from organisms such as S. pyogenes, S. mutans, E. faecalis, S. pneumoniae, and others, have been identified based on their extracellular location and carboxy terminal amino acid sequence (Fischetti, V. A., ASM News 62:405-410 (1996)). The conserved region consists of six charged amino acids at the extreme carboxy terminus coupled to 15-20 hydrophobic amino acids presumed to function as a transmembrane domain. Immediately adjacent to the transmembrane domain is a six amino acid sequence conserved in nearly all proteins examined. The amino acid sequence of this region is L-P-X-T-G-X, where X is any amino acid.

An algorithm for selecting antigenic and immunogenic S. pneumoniae polypeptides including the foregoing criteria was developed. Use of the algorithm by the inventors to select immunologically useful S. pneumoniae polypeptides resulted in the selection of a number of the disclosed ORFs. Polypeptides comprising the polypeptides identified in this group may be produced by techniques standard in the art and as further described herein.

NUCLEIC ACID MOLECULES

The present invention provides isolated nucleic acid molecules comprising polynucleotides encoding the S. pneumoniae polypeptides having the amino acid sequences described in Table 1 and shown as SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, and so on through SEQ ID NO:226, which were determined by sequencing the genome of S. pneumoniae and selected as putative immunogens.

Unless otherwise indicated, all nucleotide sequences determined by sequencing a DNA molecule herein were determined using an automated DNA sequencer (such as the Model 373 from Applied Biosystems, Inc.), and all amino acid sequences of polypeptides encoded by DNA molecules determined herein were predicted by translation of DNA sequences determined as above. Therefore, as is known in the art for any DNA sequence determined by this automated approach, any nucleotide sequence determined herein may contain some errors. Nucleotide sequences determined by automation are typically at least about 90% identical, more typically at least about 95% to at least about 99.9% identical to the actual nucleotide sequence of the sequenced DNA molecule. The actual sequence can be more precisely determined by other approaches including manual DNA sequencing methods well known in the art. As is also known in the art, a single insertion or deletion in a determined nucleotide sequence compared to the actual sequence will cause a frame shift in translation of the nucleotide sequence such that the predicted amino acid sequence encoded by a determined nucleotide sequence will be completely different from the amino acid sequence actually encoded by the sequenced DNA molecule, beginning at the point of such an insertion or deletion.

Unless otherwise indicated, each "nucleotide sequence" set forth herein is presented as a sequence of deoxyribonucleotides (abbreviated A, G , C and T). However, by "nucleotide sequence" of a nucleic acid molecule or polynucleotide is intended, for a DNA molecule or polynucleotide, a sequence of deoxyribonucleotides, and for an RNA molecule or polynucleotide, the corresponding sequence of ribonucleotides (A, G, C and U), where each thymidine deoxyribonucleotide (T) in the specified deoxyribonucleotide sequence is replaced by the ribonucleotide uridine (U). For instance, reference to an RNA molecule having a sequence described in Table 1 set forth using deoxyribonucleotide abbreviations is intended to indicate an RNA molecule having a sequence in which each deoxyribonucleotide A, G or C described in Table 1 has been replaced by the corresponding ribonucleotide A, G or C, and each deoxyribonucleotide T has been replaced by a ribonucleotide U.

Nucleic acid molecules of the present invention may be in the form of RNA, such as MRNA, or in the form of DNA, including, for instance, cDNA and genomic DNA obtained by cloning or produced synthetically. The DNA may be double-stranded or single-stranded. Single-stranded DNA or RNA may be the coding strand, also known as the sense strand, or it may be the non-coding strand, also referred to as the anti-sense strand.

By "isolated" nucleic acid molecule(s) is intended a nucleic acid molecule, DNA or RNA, which has been removed from its native environment. For example, recombinant DNA molecules contained in a vector are considered isolated for the purposes of the present invention. Further examples of isolated DNA molecules include recombinant DNA molecules maintained in heterologous host cells or purified (partially or substantially) DNA molecules in solution. Isolated RNA molecules include in vivo or in vitro RNA transcripts of the DNA molecules of the present invention. Isolated nucleic acid molecules according to the present invention further include such molecules produced synthetically.

Isolated nucleic acid molecules of the present invention include DNA molecules comprising a nucleotide sequence described in Table 1 and shown as SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, and so on through SEQ ID NO:225; DNA molecules comprising the coding sequences for the polypeptides described in Table 1 and shown as SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, and so on through SEQ ID NO:226; and DNA molecules which comprise sequences substantially different from those described above but which, due to the degeneracy of the genetic code, still encode the S. pneumoniae polypeptides described in Table 1. Of course, the genetic code is well known in the art. Thus, it would be routine for one skilled in the art to generate such degenerate variants.

The invention also provides nucleic acid molecules having sequences complementary to any one of those described in Table 1. Such isolated molecules, particularly DNA molecules, are useful as probes for detecting expression of Streptococcal genes, for instance, by Northern blot analysis or the polymerase chain reaction (PCR).

The present invention is further directed to fragments of the isolated nucleic acid molecules described herein. By a fragment of an isolated nucleic acid molecule having a nucleotide sequence described in Table 1, is intended fragments at least about 15 nt, and more preferably at least about 17 nt, still more preferably at least about 20 nt, and even more preferably, at least about 25 nt in length which are useful as diagnostic probes and primers as discussed herein. Of course, larger fragments 50-100 nt in length are also useful according to the present invention as are fragments corresponding to most, if not all, of a nucleotide sequence described in Table 1. By a fragment at least 20 nt in length, for example, is intended fragments which include 20 or more contiguous bases of a nucleotide sequence as described in Table 1. Since the nucleotide sequences identified in Table 1 are provided as SEQ ID NO: 1, SEQ ID NO:3, SEQ ID NO:5, and so on through SEQ ID NO:225, generating such DNA fragments would be routine to the skilled artisan. For example, such fragments could be generated synthetically.

Preferred nucleic acid fragments of the present invention also include nucleic acid molecules comprising nucleotide sequences encoding epitope-bearing portions of the S. pneumoniae polypeptides identified in Table 1. Such nucleic acid fragments of the present invention include, for example, nucleotide sequences encoding polypeptide fragments comprising from about the amino terminal residue to about the carboxy terminal residue of each fragment shown in Table 2. The above referred to polypeptide fragments are antigenic regions of the S. pneumoniae polypeptides identified in Table 1.

In another aspect, the invention provides isolated nucleic acid molecules comprising polynucleotides which hybridize under stringent hybridization conditions to a portion of a polynucleotide in a nucleic acid molecule of the invention described above, for instance, a nucleic acid sequence identified in Table 1. By "stringent hybridization conditions" is intended overnight incubation at 42.degree. C. in a solution comprising: 50% formamide, 5.times.SSC (150 mM NaCl, 15 nM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5.times.Denhardt's solution, 10% dextran sulfate, and 20 g/ml denatured, sheared salmon sperm DNA, followed by washing the filters in 0.1.times.SSC at about 65.degree. C.

By polynucleotides which hybridize to a "portion" of a polynucleotide is intended polynucleotides (either DNA or RNA) which hybridize to at least about 15 nucleotides (nt), and more preferably at least about 17 nt, still more preferably at least about 20 nt, and even more preferably about 25-70 nt of the reference polynucleotide. These are useful as diagnostic probes and primers as discussed above and in more detail below.

Of course, polynucleotides hybridizing to a larger portion of the reference polynucleotide, for instance, a portion 50-100 nt in length, or even to the entire length of the reference polynucleotide, are also useful as probes according to the present invention, as are polynucleotides corresponding to most, if not all, of a nucleotide sequence as identified in Table 1. By a portion of a polynucleotide of "at least 20 nt in length," for example, is intended 20 or more contiguous nucleotides from the nucleotide sequence of the reference polynucleotide (e.g., a nucleotide sequences as described in Table 1). As noted above, such portions are useful diagnostically either as probes according to conventional DNA hybridization techniques or as primers for amplification of a target sequence by PCR, as described in the literature (for instance, in Molecular Cloning, A Laboratory Manual, 2nd. edition, Sambrook, J., Fritsch, E. F. and Maniatis, T., eds., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989), the entire disclosure of which is hereby incorporated herein by reference).

Since nucleic acid sequences encoding the S. pneumoniae polypeptides of the present invention are identified in Table 1 and provided as SEQ ID NO: 1, SEQ ID NO:3, SEQ ID NO:5, and so on through SEQ ID NO:225, generating polynucleotides which hybridize to portions of these sequences would be routine to the skilled artisan. For example, the hybridizing polynucleotides of the present invention could be generated synthetically according to known techniques.

As indicated, nucleic acid molecules of the present invention which encode S. pneumoniae polypeptides of the present invention may include, but are not limited to those encoding the amino acid sequences of the polypeptides by themselves; and additional coding sequences which code for additional amino acids, such as those which provide additional functionalities. Thus, the sequences encoding these polypeptides may be fused to a marker sequence, such as a sequence encoding a peptide which facilitates purification of the fused polypeptide. In certain preferred embodiments of this aspect of the invention, the marker amino acid sequence is a hexa-histidine peptide, such as the tag provided in a pQE vector (Qiagen, Inc.), among others, many of which are commercially available. As described by Gentz and colleagues (Proc. Natl. Acad. Sci. USA 86:821-824 (1989)), for instance, hexa-histidine provides for convenient purification of the resulting fusion protein.

Thus, the present invention also includes genetic fusions wherein the S. pneumoniae nucleic acid sequences coding sequences identified in Table 1 are linked to additional nucleic acid sequences to produce fusion proteins. These fusion proteins may include epitopes of streptococcal or non-streptococcal origin designed to produce proteins having enhanced immunogenicity. Further, the fusion proteins of the present invention may contain antigenic determinants known to provide helper T-cell stimulation, peptides encoding sites for post-translational modifications which enhance immunogenicity (e.g., acylation), peptides which facilitate purification (e.g., histidine "tag"), or amino acid sequences which target the fusion protein to a desired location (e.g., a heterologous leader sequence).

In all cases of bacterial expression, an N-terminal methionine residues is added. In many cases, however, the N-terminal methionine residues is cleaved off post-translationally. Thus, the invention includes polypeptides shown in Table 1 with, and without an N-terminal methionine.

The present invention thus includes nucleic acid molecules and sequences which encode fusion proteins comprising one or more S. pneumoniae polypeptides of the present invention fused to an amino acid sequence which allows for post-translational modification to enhance immunogenicity. This post-translational modification may occur either in vitro or when the fusion protein is expressed in vivo in a host cell. An example of such a modification is the introduction of an amino acid sequence which results in the attachment of a lipid moiety.

Thus, as indicated above, the present invention includes genetic fusions wherein a S. pneumoniae nucleic acid sequence identified in Table 1 is linked to a nucleotide sequence encoding another amino acid sequence. These other amino acid sequences may be of streptococcal origin (e.g., another sequence selected from Table 1) or non-streptococcal origin.

The present invention further relates to variants of the nucleic acid molecules of the present invention, which encode portions, analogs or derivatives of the S. pneumoniae polypeptides described in Table 1. Variants may occur naturally, such as a natural allelic variant. By an "allelic variant" is intended one of several alternate forms of a gene occupying a given locus on a chromosome of an organism (Genes II, Lewin, B., ed., John Wiley & Sons, New York (1985)). Non-naturally occurring variants may be produced using art-known mutagenesis techniques.

Such variants include those produced by nucleotide substitutions, deletions or additions. The substitutions, deletions or additions may involve one or more nucleotides. These variants may be altered in coding regions, non-coding regions, or both. Alterations in the coding regions may produce conservative or non-conservative amino acid substitutions, deletions or additions. Especially preferred among these are silent substitutions, additions and deletions, which do not alter the properties and activities of the S. pneumoniae polypeptides disclosed herein or portions thereof. Silent substitution are most likely to be made in non-epitopic regions. Guidance regarding those regions containing epitopes is provided herein, for example, in Table 2. Also especially preferred in this regard are conservative substitutions.

Further embodiments of the invention include isolated nucleic acid molecules comprising a polynucleotide having a nucleotide sequence at least 90% identical, and more preferably at least 95%, 96%, 97%, 98% or 99% identical to: (a) a nucleotide sequence encoding any of the amino acid sequences of the polypeptides identified in Table 1; and (b) a nucleotide sequence complementary to any of the nucleotide sequences in (a) above.

By a polynucleotide having a nucleotide sequence at least, for example, 95% "identical" to a reference nucleotide sequence encoding a S. pneumoniae polypeptide described in Table 1, is intended that the nucleotide sequence of the polynucleotide is identical to the reference sequence except that the polynucleotide sequence may include up to five point mutations per each 100 nucleotides of the reference nucleotide sequence encoding the subject S. pneumoniae polypeptide. In other words, to obtain a polynucleotide having a nucleotide sequence at least 95% identical to a reference nucleotide sequence, up to 5% of the nucleotides in the reference sequence may be deleted or substituted with another nucleotide, or a number of nucleotides up to 5% of the total nucleotides in the reference sequence may be inserted into the reference sequence. These mutations of the reference sequence may occur at the 5' or 3' terminal positions of the reference nucleotide sequence or anywhere between those terminal positions, interspersed either individually among nucleotides in the reference sequence or in one or more contiguous groups within the reference sequence.

Certain nucleotides within some of the nucleic acid sequences shown in Table 1 were ambiguous upon sequencing. Completely unknown sequences are shown as an "N". Other unresolved nucleotides are known to be either a purine, shown as "R", or a pyrimidine, shown as "Y". Accordingly, when determining identity between two nucleotide sequences, identity is met where any nucleotide, including an "R", "Y" or "N", is found in a test sequence and at the corresponding position in the reference sequence (from Table 1). Likewise, an A, G or "R" in a test sequence is identical to an "R" in the reference sequence; and a T, C or "Y" in a test sequence is identical to a "Y" in the reference sequence.

As a practical matter, whether any particular nucleic acid molecule is at least 90%, 95%, 96%, 97%, 98% or 99% identical to, for instance, a nucleotide sequence described in Table 1 can be determined conventionally using known computer programs such as the Bestfit program (Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group, University Research Park, 575 Science Drive, Madison, Wis. 53711). Bestfit uses the local homology algorithm of Smith and Waterman (Advances in Applied Mathematics 2:482-489 (1981)), to find the best segment of homology between two sequences. When using Bestfit or any other sequence alignment program to determine whether a particular sequence is, for instance, 95% identical to a reference sequence according to the present invention, the parameters are set, of course, such that the percentage of identity is calculated over the full length of the reference nucleotide sequence and that gaps in homology of up to 5% of the total number of nucleotides in the reference sequence are allowed.

The present application is directed to nucleic acid molecules at least 90%, 95%, 96%, 97%, 98% or 99% identical to a nucleic acid sequences described in Table 1. One of skill in the art would still know how to use the nucleic acid molecule, for instance, as a hybridization probe or a polymerase chain reaction (PCR) primer. Uses of the nucleic acid molecules of the present invention include, inter alia, (1) isolating Streptococcal genes or allelic variants thereof from either a genomic or cDNA library and (2) Northern Blot or PCR analysis for detecting Streptococcal mRNA expression.

Of course, due to the degeneracy of the genetic code, one of ordinary skill in the art will immediately recognize that a large number of nucleic acid molecules having a sequence at least 90%, 95%, 96%, 97%, 98%, or 99% identical to a nucleic acid sequence identified in Table 1 will encode the same polypeptide. In fact, since degenerate variants of these nucleotide sequences all encode the same polypeptide, this will be clear to the skilled artisan even without performing the above described comparison assay.

It will be further recognized in the art that, for such nucleic acid molecules that are not degenerate variants, a reasonable number will also encode proteins having antigenic epitopes of the S. pneumoniae polypeptides of the present invention. This is because the skilled artisan is fully aware of amino acid substitutions that are either less likely or not likely to significantly effect the antigenicity of a polypeptide (e.g., replacement of an amino acid in a region which is not believed to form an antigenic epitope). For example, since antigenic epitopes have been identified which contain as few as six amino acids (see Harlow, et al., Antibodies: A Laboratory Manual, 2nd Ed.; Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1988), page 76), in instances where a polypeptide has multiple antigenic epitopes the alteration of several amino acid residues would often not be expected to eliminate all of the antigenic epitopes of that polypeptide. This is especially so when the alterations are in regions believed to not constitute antigenic epitopes.

VECTORS AND HOST CELLS

The present invention also relates to vectors which include the isolated DNA molecules of the present invention, host cells which are genetically engineered with the recombinant vectors, and the production of S. pneumoniae polypeptides or fragments thereof by recombinant techniques.

Recombinant constructs may be introduced into host cells using well known techniques such as infection, transduction, transfection, transvection, electroporation and transformation. The vector may be, for example, a phage, plasmid, viral or retroviral vector. Retroviral vectors may be replication competent or replication defective. In the latter case, viral propagation generally will occur only in complementing host cells.

The polynucleotides may be joined to a vector containing a selectable marker for propagation in a host. Generally, a plasmid vector is introduced in a precipitate, such as a calcium phosphate precipitate, or in a complex with a charged lipid. If the vector is a virus, it may be packaged in vitro using an appropriate packaging cell line and then transduced into host cells.

Preferred are vectors comprising cis-acting control regions to the polynucleotide of interest. Appropriate trans-acting factors may be supplied by the host, supplied by a complementing vector or supplied by the vector itself upon introduction into the host.

In certain preferred embodiments in this regard, the vectors provide for specific expression, which may be inducible and/or cell type-specific. Particularly preferred among such vectors are those inducible by environmental factors that are easy to manipulate, such as temperature and nutrient additives.

Expression vectors useful in the present invention include chromosomal-, episomal- and virus-derived vectors, e.g., vectors derived from bacterial plasmids, bacteriophage, yeast episomes, yeast chromosomal elements, viruses such as baculoviruses, papova viruses, vaccinia viruses, adenoviruses, fowl pox viruses, pseudorabies viruses and retroviruses, and vectors derived from combinations thereof, such as cosmids and phagemids.

The DNA insert should be operatively linked to an appropriate promoter, such as the phage lambda PL promoter, the E. coli lac, trp and tac promoters, the SV40 early and late promoters and promoters of retroviral LTRs, to name a few. Other suitable promoters will be known to the skilled artisan. The expression constructs will further contain sites for transcription initiation, termination and, in the transcribed region, a ribosome binding site for translation. The coding portion of the mature transcripts expressed by the constructs will preferably include a translation initiating site at the beginning and a termination codon (UAA, UGA or UAG) appropriately positioned at the end of the polypeptide to be translated.

As indicated, the expression vectors will preferably include at least one selectable marker. Such markers include dihydrofolate reductase or neomycin resistance for eukaryotic cell culture and tetracycline or ampicillin resistance genes for culturing in E. coli and other bacteria. Representative examples of appropriate hosts include, but are not limited to, bacterial cells, such as E. coli, Streptomyces and Salmonella typhimurium cells; fungal cells, such as yeast cells; insect cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS and Bowes melanoma cells; and plant cells. Appropriate culture mediums and conditions for the above-described host cells are known in the art.

Among vectors preferred for use in bacteria include pQE70, pQE60 and pQE-9, available from Qiagen; pBS vectors, Phagescript vectors, Bluescript vectors, pNH8A, pNH16a, pNH18A, pNH46A available from Stratagene; pET series of vectors available from Novagen; and ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 available from Pharmacia. Among preferred eukaryotic vectors are pWLNEO, pSV2CAT, pOG44, pXT1 and pSG available from Stratagene; and pSVK3, pBPV, pMSG and pSVL available from Pharmacia. Other suitable vectors will be readily apparent to the skilled artisan.

Among known bacterial promoters suitable for use in the present invention include the E. coli laci and lacZ promoters, the T3 and T7 promoters, the gpt promoter, the lambda PR and PL promoters and the trp promoter. Suitable eukaryotic promoters include the CMV immediate early promoter, the HSV thymidine kinase promoter, the early and late SV40 promoters, the promoters of retroviral LTRs, such as those of the Rous sarcoma virus (RSV), and metallothionein promoters, such as the mouse metallothionein-I promoter.

Introduction of the construct into the host cell can be effected by calcium phosphate transfection, DEAE-dextran mediated transfection, cationic lipid-mediated transfection, electroporation, transduction, infection or other methods. Such methods are described in many standard laboratory manuals (for example, Davis, et al, Basic Methods In Molecular Biology (1986)).

Transcription of DNA encoding the polypeptides of the present invention by higher eukaryotes may be increased by inserting an enhancer sequence into the vector. Enhancers are cis-acting elements of DNA, usually about from 10 to 300 bp that act to increase transcriptional activity of a promoter in a given host cell-type. Examples of enhancers include the SV40 enhancer, which is located on the late side of the replication origin at bp 100 to 270, the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the replication origin, and adenovirus enhancers.

For secretion of the translated polypeptide into the lumen of the endoplasmic reticulum, into the periplasmic space or into the extracellular environment, appropriate secretion signals may be incorporated into the expressed polypeptide. The signals may be endogenous to the polypeptide or they may be heterologous signals.

The polypeptide may be expressed in a modified form, such as a fusion protein, and may include not only secretion signals, but also additional heterologous functional regions. For instance, a region of additional amino acids, particularly charged amino acids, may be added to the N-terminus of the polypeptide to improve stability and persistence in the host cell, during purification, or during subsequent handling and storage. Also, peptide moieties may be added to the polypeptide to facilitate purification. Such regions may be removed prior to final preparation of the polypeptide. The addition of peptide moieties to polypeptides to engender secretion or excretion, to improve stability and to facilitate purification, among others, are familiar and routine techniques in the art. A preferred fusion protein comprises a heterologous region from immunoglobulin that is useful to solubilize proteins. For example, EP-A-O 464 533 (Canadian counterpart 2045869) discloses fusion proteins comprising various portions of constant region of immunoglobin molecules together with another human protein or part thereof. In many cases, the Fc part in a fusion protein is thoroughly advantageous for use in therapy and diagnosis and thus results, for example, in improved pharmacokinetic properties (EP-A 0232 262). On the other hand, for some uses it would be desirable to be able to delete the Fc part after the fusion protein has been expressed, detected and purified in the advantageous manner described. This is the case when Fc portion proves to be a hindrance to use in therapy and diagnosis, for example when the fusion protein is to be used as antigen for immunizations. In drug discovery, for example, human proteins, such as, hIL5-receptor has been fused with Fc portions for the purpose of high-throughput screening assays to identify antagonists of hIL-5. See Bennett, D. et al., J. Molec. Recogn. 8:52-58 (1995) and Johanson, K. et al., J. Biol. Chem. 270 (16):9459-9471 (1995).

The S. pneumoniae polypeptides can be recovered and purified from recombinant cell cultures by well-known methods including ammonium sulfate or ethanol precipitation, acid extraction, anion or cation exchange chromatography, phosphocellulose chromatography, hydrophobic interaction chromatography, affinity chromatography, hydroxylapatite chromatography, lectin chromatography and high performance liquid chromatography ("HPLC") is employed for purification. Polypeptides of the present invention include naturally purified products, products of chemical synthetic procedures, and products produced by recombinant techniques from a prokaryotic or eukaryotic host, including, for example, bacterial, yeast, higher plant, insect and mammalian cells.

POLYPEPTIDES AND FRAGMENTS

The invention further provides isolated polypeptides having the amino acid sequences described in Table 1, and shown as SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, and so on through SEQ ID NO:226, and peptides or polypeptides comprising portions of the above polypeptides. The terms "peptide" and "oligopeptide" are considered synonymous (as is commonly recognized) and each term can be used interchangeably as the context requires to indicate a chain of at least two amino acids coupled by peptidyl linkages. The word "polypeptide" is used herein for chains containing more than ten amino acid residues. All oligopeptide and polypeptide formulas or sequences herein are written from left to right and in the direction from amino terminus to carboxy terminus.

Some amino acid sequences of the S. pneumoniae polypeptides described in Table 1 can be varied without significantly effecting the antigenicity of the polypeptides. If such differences in sequence are contemplated, it should be remembered that there will be critical areas on the polypeptide which determine antigenicity. In general, it is possible to replace residues which do not form part of an antigenic epitope without significantly effecting the antigenicity of a polypeptide. Guidance for such alterations is given in Table 2 wherein epitopes for each polypeptide is delineated.

The polypeptides of the present invention are preferably provided in an isolated form. By "isolated polypeptide" is intended a polypeptide removed from its native environment. Thus, a polypeptide produced and/or contained within a recombinant host cell is considered isolated for purposes of the present invention. Also intended as an "isolated polypeptide" is a polypeptide that has been purified, partially or substantially, from a recombinant host cell. For example, recombinantly produced versions of the S. pneumoniae polypeptides described in Table 1 can be substantially purified by the one-step method described by Smith and Johnson (Gene 67:31-40 (1988)).

The polypeptides of the present invention include: (a) an amino acid sequence of any of the polypeptides described in Table 1; and (b) an amino acid sequence of an epitope-bearing portion of any one of the polypeptides of (a); as well as polypeptides with at least 70% similarity, and more preferably at least 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% similarity to those described in (a) or (b) above, as well as polypeptides having an amino acid sequence at least 70% identical, more preferably at least 75% identical, and still more preferably 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identical to those above.

By "% similarity" for two polypeptides is intended a similarity score produced by comparing the amino acid sequences of the two polypeptides using the Bestfit program (Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group, University Research Park, 575 Science Drive, Madison, Wis. 53711) and the default settings for determining similarity. Bestfit uses the local homology algorithm of Smith and Waterman (Advances in Applied Mathematics 2:482489 (1981)) to find the best segment of similarity between two sequences.

By a polypeptide having an amino acid sequence at least, for example, 95% "identical" to a reference amino acid sequence of a S. pneumoniae polypeptide is intended that the amino acid sequence of the polypeptide is identical to the reference sequence except that the polypeptide sequence may include up to five amino acid alterations per each 100 amino acids of the reference amino acid sequence. In other words, to obtain a polypeptide having an amino acid sequence at least 95% identical to a reference amino acid sequence, up to 5% of the amino acid residues in the reference sequence may be deleted or substituted with another amino acid, or a number of amino acids up to 5% of the total amino acid residues in the reference sequence may be inserted into the reference sequence. These alterations of the reference sequence may occur at the amino or carboxy terminal positions of the reference amino acid sequence or anywhere between those terminal positions, interspersed either individually among residues in the reference sequence or in one or more contiguous groups within the reference sequence.

The amino acid sequences shown in Table 1 may have on or more "X" residues. "X" represents unknown. Thus, for purposes of defining identity, if any amino acid is present at the same position in a reference amino acid sequence (shown in Table 1) where an X is shown, the two sequences are identical at that position.

As a practical matter, whether any particular polypeptide is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identical to, for instance, an amino acid sequence shown in Table 1, can be determined conventionally using known computer programs such the Bestfit program (Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group, University Research Park, 575 Science Drive, Madison, Wis. 53711). When using Bestfit or any other sequence alignment program to determine whether a particular sequence is, for instance, 95% identical to a reference sequence according to the present invention, the parameters are set, of course, such that the percentage of identity is calculated over the fall length of the reference amino acid sequence and that gaps in homology of up to 5% of the total number of amino acid residues in the reference sequence are allowed.

As described below, the polypeptides of the present invention can also be used to raise polyclonal and monoclonal antibodies, which are useful in assays for detecting Streptococcal protein expression.

In another aspect, the invention provides peptides and polypeptides comprising epitope-bearing portions of the S. pneumoniae polypeptides of the invention. These epitopes are immunogenic or antigenic epitopes of the polypeptides of the invention. An "immunogenic epitope" is defined as a part of a protein that elicits an antibody response when the whole protein or polypeptide is the immunogen. These immunogenic epitopes are believed to be confined to a few loci on the molecule. On the other hand, a region of a protein molecule to which an antibody can bind is defined as an "antigenic determinant" or "antigenic epitope." The number of immunogenic epitopes of a protein generally is less than the number of antigenic epitopes (Geysen, et al, Proc. Natl. Acad. Sci. USA 81:3998-4002 (1983)). Predicted antigenic epitopes are shown in Table 2, below.

As to the selection of peptides or polypeptides bearing an antigenic epitope (i.e., that contain a region of a protein molecule to which an antibody can bind), it is well known in that art that relatively short synthetic peptides that mimic part of a protein sequence are routinely capable of eliciting an antiserum that reacts with the partially mimicked protein (for instance, Sutcliffe, J., et al., Science 219:660-666 (1983)). Peptides capable of eliciting protein-reactive sera are frequently represented in the primary sequence of a protein, can be characterized by a set of simple chemical rules, and are confined neither to immunodominant regions of intact proteins (i.e., immunogenic epitopes) nor to the amino or carboxyl terminals. Peptides that are extremely hydrophobic and those of six or fewer residues generally are ineffective at inducing antibodies that bind to the mimicked protein; longer, peptides, especially those containing proline residues, usually are effective (Sutcliffe, et al., supra, p. 661). For instance, 18 of 20 peptides designed according to these guidelines, containing 8-39 residues covering 75% of the sequence of the influenza virus hemagglutinin HA1 polypeptide chain, induced antibodies that reacted with the HA1 protein or intact virus; and 12/12 peptides from the MuLV polymerase and 18/18 from the rabies glycoprotein induced antibodies that precipitated the respective proteins.

Antigenic epitope-bearing peptides and polypeptides of the invention are therefore useful to raise antibodies, including monoclonal antibodies, that bind specifically to a polypeptide of the invention. Thus, a high proportion of hybridomas obtained by fusion of spleen cells from donors immunized with an antigen epitope-bearing peptide generally secrete antibody reactive with the native protein (Sutcliffe, et al., supra, p. 663). The antibodies raised by antigenic epitope-bearing peptides or polypeptides are useful to detect the mimicked protein, and antibodies to different peptides may be used for tracking the fate of various regions of a protein precursor which undergoes post-translational processing. The peptides and anti-peptide antibodies may be used in a variety of qualitative or quantitative assays for the mimicked protein, for instance in competition assays since it has been shown that even short peptides (e.g., about 9 amino acids) can bind and displace the larger peptides in immunoprecipitation assays (for instance, Wilson, et al., Cell 37:767-778 (1984) p. 777). The anti-peptide antibodies of the invention also are useful for purification of the mimicked protein, for instance, by adsorption chromatography using methods well known in the art.

Antigenic epitope-bearing peptides and polypeptides of the invention designed according to the above guidelines preferably contain a sequence of at least seven, more preferably at least nine and most preferably between about 15 to about 30 amino acids contained within the amino acid sequence of a polypeptide of the invention. However, peptides or polypeptides comprising a larger portion of an amino acid sequence of a polypeptide of the invention, containing about 30 to about 50 amino acids, or any length up to and including the entire amino acid sequence of a polypeptide of the invention, also are considered epitope-bearing peptides or polypeptides of the invention and also are useful for inducing antibodies that react with the mimicked protein. Preferably, the amino acid sequence of the epitope-bearing peptide is selected to provide substantial solubility in aqueous solvents (i.e., the sequence includes relatively hydrophilic residues and highly hydrophobic sequences are preferably avoided); and sequences containing proline residues are particularly preferred.

Non-limiting examples of antigenic polypeptides or peptides that can be used to generate Streptococcal-specific antibodies include portions of the amino acid sequences identified in Table 1. More specifically, Table 2 discloses antigenic fragments of polypeptides of the present invention, which antigenic fragments comprise amino acid sequences from about the first amino acid residues indicated to about the last amino acid residue indicated for each fragment. The polypeptide fragments disclosed in Table 2 are believed to be antigenic regions of the S. pneumoniae polypeptides described in Table 1. Thus the invention further includes isolated peptides and polypeptides comprising an amino acid sequence of an epitope shown in Table 2 and polynucleotides encoding said polypeptides.

The epitope-bearing peptides and polypeptides of the invention may be produced by any conventional means for making peptides or polypeptides including recombinant means using nucleic acid molecules of the invention. For instance, an epitope-bearing amino acid sequence of the present invention may be fused to a larger polypeptide which acts as a carrier during recombinant production and purification, as well as during immunization to produce anti-peptide antibodies. Epitope-bearing peptides also may be synthesized using known methods of chemical synthesis. For instance, Houghten has described a simple method for synthesis of large numbers of peptides, such as 10-20 mg of 248 different 13 residue peptides representing single amino acid variants of a segment of the HA1 polypeptide which were prepared and characterized (by ELISA-type binding studies) in less than four weeks (Houghten, R. A. Proc. Natl. Acad. Sci. USA 82:5131-5135 (1985)). This "Simultaneous Multiple Peptide Synthesis (SMPS)" process is further described in U.S. Pat. No. 4,631,211 to Houghten and coworkers (1986). In this procedure the individual resins for the solid-phase synthesis of various peptides are contained in separate solvent-permeable packets, enabling the optimal use of the many identical repetitive steps involved in solid-phase methods. A completely manual procedure allows 500-1000 or more syntheses to be conducted simultaneously (Houghten, et al., supra, p. 5134).

Epitope-bearing peptides and polypeptides of the invention are used to induce antibodies according to methods well known in the art (for instance, Sutcliffe, et al., supra; Wilson, et al., supra; Chow, M., et al., Proc. Natl. Acad. Sci. USA 82:910-914; and Bittle, F. J., et al., J. Gen. Virol. 66:2347-2354 (1985)). Generally, animals may be immunized with free peptide; however, anti-peptide antibody titer may be boosted by coupling of the peptide to a macromolecular carrier, such as keyhole limpet hemacyanin (KLH) or tetanus toxoid. For instance, peptides containing cysteine may be coupled to carrier using a linker such as m-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS), while other peptides may be coupled to carrier using a more general linking agent such as glutaraldehyde. Animals such as rabbits, rats and mice are immunized with either free or carrier-coupled peptides, for instance, by intraperitoneal and/or intradermal injection of emulsions containing about 100 .mu.g peptide or carrier protein and Freund's adjuvant. Several booster injections may be needed, for instance, at intervals of about two weeks, to provide a useful titer of anti-peptide antibody which can be detected, for example, by ELISA assay using free peptide adsorbed to a solid surface. The titer of anti-peptide antibodies in serum from an immunized animal may be increased by selection of anti-peptide antibodies, for instance, by adsorption to the peptide on a solid support and elution of the selected antibodies according to methods well known in the art.

Immunogenic epitope-bearing peptides of the invention, i.e., those parts of a protein that elicit an antibody response when the whole protein is the immunogen, are identified according to methods known in the art. For instance, Geysen, et al., supra, discloses a procedure for rapid concurrent synthesis on solid supports of hundreds of peptides of sufficient purity to react in an enzyme-linked immunosorbent assay. Interaction of synthesized peptides with antibodies is then easily detected without removing them from the support. In this manner a peptide bearing an immunogenic epitope of a desired protein may be identified routinely by one of ordinary skill in the art. For instance, the immunologically important epitope in the coat protein of foot-and-mouth disease virus was located by Geysen et al. supra with a resolution of seven amino acids by synthesis of an overlapping set of all 208 possible hexapeptides covering the entire 213 amino acid sequence of the protein. Then, a complete replacement set of peptides in which all 20 amino acids were substituted in turn at every position within the epitope were synthesized, and the particular amino acids conferring specificity for the reaction with antibody were determined. Thus, peptide analogs of the epitope-bearing peptides of the invention can be made routinely by this method. U.S. Pat. No. 4,708,781 to Geysen (1987) further describes this method of identifying a peptide bearing an immunogenic epitope of a desired protein.

Further still, U.S. Pat. No. 5,194,392, to Geysen (1990), describes a general method of detecting or determining the sequence of monomers (amino acids or other compounds) which is a topological equivalent of the epitope (i.e., a "mimotope") which is complementary to a particular paratope (antigen binding site) of an antibody of interest. More generally, U.S. Pat. No. 4,433,092, also to Geysen (1989), describes a method of detecting or determining a sequence of monomers which is a topographical equivalent of a ligand which is complementary to the ligand binding site of a particular receptor of interest. Similarly, U.S. Pat. No. 5,480,971 to Houghten, R. A. et al. (1996) discloses linear C.sub.1 -C.sub.7 -alkyl peralkylated oligopeptides and sets and libraries of such peptides, as well as methods for using such oligopeptide sets and libraries for determining the sequence of a peralkylated oligopeptide that preferentially binds to an acceptor molecule of interest. Thus, non-peptide analogs of the epitope-bearing peptides of the invention also can be made routinely by these methods.

The entire disclosure of each document cited in this section on "Polypeptides and Fragments" is hereby incorporated herein by reference.

As one of skill in the art will appreciate, the polypeptides of the present invention and the epitope-bearing fragments thereof described above can be combined with parts of the constant domain of immunoglobulins (IgG), resulting in chimeric polypeptides. These fusion proteins facilitate purification and show an increased half-life in vivo. This has been shown, e.g., for chimeric proteins consisting of the first two domains of the human CD4-polypeptide and various domains of the constant regions of the heavy or light chains of mammalian immunoglobulins (EPA 0,394,827; Traunecker et al., Nature 331:84-86 (1988)). Fusion proteins that have a disulfide-linked dimeric structure due to the IgG part can also be more efficient in binding and neutralizing other molecules than a monomeric S. pneumoniae polypeptide or fragment thereof alone (Fountoulakis et al., J. Biochem. 270:3958-3964 (1995)).

DIAGNOSTIC ASSAYS

The present invention further relates to a method for assaying for Streptococcal infection in an animal via detecting the expression of genes encoding Streptococcal polypeptides (e.g., the polypeptides described Table 1). This method comprises analyzing tissue or body fluid from the animal for Streptococcus-specific antibodies or Streptococcal nucleic acids or proteins. Analysis of nucleic acid specific to Streptococcus can be done by PCR or hybridization techniques using nucleic acid sequences of the present invention as either hybridization probes or primers (cf Molecular Cloning: A Laboratory Manual, second edition, edited by Sambrook, Fritsch, & Maniatis, Cold Spring Harbor Laboratory, 1989; Eremeeva et al., J. Clin. Microbiol. 32:803-810 (1994) which describes differentiation among spotted fever group Rickettsiae species by analysis of restriction fragment length polymorphism of PCR-amplified DNA). Methods for detecting B. burgdorferi nucleic acids via PCR are described, for example, in Chen et al, J. Clin. Microbiol 32:589-595 (1994).

Where diagnosis of a disease state related to infection with Streptococcus has already been made, the present invention is useful for monitoring progression or regression of the disease state whereby patients exhibiting enhanced Streptococcus gene expression will experience a worse clinical outcome relative to patients expressing these gene(s) at a lower level.

By "assaying for Streptococcal infection in an animal via detection of genes encoding Streptococcal polypeptides" is intended qualitatively or quantitatively measuring or estimating the level of one or more Streptococcus polypeptides or the level of nucleic acid encoding Streptococcus polypeptides in a first biological sample either directly (e.g., by determining or estimating absolute protein level or nucleic level) or relatively (e.g., by comparing to the Streptococcus polypeptide level or mRNA level in a second biological sample). The Streptococcus polypeptide level or nucleic acid level in the second sample used for a relative comparison may be undetectable if obtained from an animal which is not infected with Streptococcus. When monitoring the progression or regression of a disease state, the Streptococcus polypeptide level or nucleic acid level may be compared to a second sample obtained from either an animal infected with Streptococcus or the same animal from which the first sample was obtained but taken from that animal at a different time than the first. As will be appreciated in the art, once a standard Streptococcus polypeptide level or nucleic acid level which corresponds to a particular stage of a Streptococcus infection is known, it can be used repeatedly as a standard for comparison.

By "biological sample" is intended any biological sample obtained from an animal, cell line, tissue culture, or other source which contains Streptococcus polypeptide, MRNA, or DNA. Biological samples include body fluids (such as plasma and synovial fluid) which contain Streptococcus polypeptides, and muscle, skin, and cartilage tissues. Methods for obtaining tissue biopsies and body fluids are well known in the art.

The present invention is useful for detecting diseases related to Streptococcus infections in animals. Preferred animals include monkeys, apes, cats, dogs, cows, pigs, mice, horses, rabbits and humans. Particularly preferred are humans.

Total RNA can be isolated from a biological sample using any suitable technique such as the single-step guanidinium-thiocyanate-phenol-chloroform method described in Chomczynski and Sacchi, Anal. Biochem. 162:156-159 (1987). mRNA encoding Streptococcus polypeptides having sufficient homology to the nucleic acid sequences identified in Table 1 to allow for hybridization between complementary sequences are then assayed using any appropriate method. These include Northern blot analysis, S1 nuclease mapping, the polymerase chain reaction (PCR), reverse transcription in combination with the polymerase chain reaction (RT-PCR), and reverse transcription in combination with the ligase chain reaction (RT-LCR).

Northern blot analysis can be performed as described in Harada et al., Cell 63:303-312 (1990). Briefly, total RNA is prepared from a biological sample as described above. For the Northern blot, the RNA is denatured in an appropriate buffer (such as glyoxal/dimethyl sulfoxide/sodium phosphate buffer), subjected to agarose gel electrophoresis, and transferred onto a nitrocellulose filter. After the RNAs have been linked to the filter by a UV linker, the filter is prehybridized in a solution containing formamide, SSC, Denhardt's solution, denatured salmon sperm, SDS, and sodium phosphate buffer. A S. pnuemoniae polypeptide DNA sequence shown in Table 1 labeled according to any appropriate method (such as the .sup.32 P-multiprimed DNA labeling system (Amersham)) is used as probe. After hybridization overnight, the filter is washed and exposed to x-ray film. DNA for use as probe according to the present invention is described in the sections above and will preferably at least 15 bp in length.

S1 mapping can be performed as described in Fujita et al., Cell 49:357-367 (1987). To prepare probe DNA for use in S1 mapping, the sense strand of an above-described S. pnuemoniae DNA sequence of the present invention is used as a template to synthesize labeled antisense DNA. The antisense DNA can then be digested using an appropriate restriction endonuclease to generate further DNA probes of a desired length. Such antisense probes are useful for visualizing protected bands corresponding to the target mRNA (i.e., mRNA encoding Streptococcus polypeptides).

Preferably, levels of mRNA encoding Streptococcus polypeptides are assayed using the RT-PCR method described in Makino et al., Technique 2:295-301 (1990). By this method, the radioactivities of the "amplicons" in the polyacrylamide gel bands are linearly related to the initial concentration of the target mRNA. Briefly, this method involves adding total RNA isolated from a biological sample in a reaction mixture containing a RT primer and appropriate buffer. After incubating for primer annealing, the mixture can be supplemented with a RT buffer, dNTPs, DTT, RNase inhibitor and reverse transcriptase. After incubation to achieve reverse transcription of the RNA, the RT products are then subject to PCR using labeled primers. Alternatively, rather than labeling the primers, a labeled DNTP can be included in the PCR reaction mixture. PCR amplification can be performed in a DNA thermal cycler according to conventional techniques. After a suitable number of rounds to achieve amplification, the PCR reaction mixture is electrophoresed on a polyacrylamide gel. After drying the gel, the radioactivity of the appropriate bands (corresponding to the mRNA encoding the Streptococcus polypeptides)) is quantified using an imaging analyzer. RT and PCR reaction ingredients and conditions, reagent and gel concentrations, and labeling methods are well known in the art. Variations on the RT-PCR method will be apparent to the skilled artisan.

Assaying Streptococcus polypeptide levels in a biological sample can occur using any art-known method. Preferred for assaying Streptococcus polypeptide levels in a biological sample are antibody-based techniques. For example, Streptococcus polypeptide expression in tissues can be studied with classical immunohistological methods. In these, the specific recognition is provided by the primary antibody (polyclonal or monoclonal) but the secondary detection system can utilize fluorescent, enzyme, or other conjugated secondary antibodies. As a result, an immunohistological staining of tissue section for pathological examination is obtained. Tissues can also be extracted, e.g., with urea and neutral detergent, for the liberation of Streptococcus polypeptides for Western-blot or dot/slot assay (Jalkanen, M., et al., J. Cell. Biol. 101:976-985 (1985); Jalkanen, M., et al., J. Cell . Biol. 105:3087-3096 (1987)). In this technique, which is based on the use of cationic solid phases, quantitation of a Streptococcus polypeptide can be accomplished using an isolated Streptococcus polypeptide as a standard. This technique can also be applied to body fluids.

Other antibody-based methods useful for detecting Streptococcus polypeptide gene expression include immunoassays, such as the enzyme linked immunosorbent assay (ELISA) and the radioimmunoassay (RIA). For example, a Streptococcus polypeptide-specific monoclonal antibodies can be used both as an immunoabsorbent and as an enzyme-labeled probe to detect and quantify a Streptococcus polypeptide. The amount of a Streptococcus polypeptide present in the sample can be calculated by reference to the amount present in a standard preparation using a linear regression computer algorithm. Such an ELISA for detecting a tumor antigen is described in Iacobelli et al., Breast Cancer Research and Treatment 11:19-30 (1988). In another ELISA assay, two distinct specific monoclonal antibodies can be used to detect Streptococcus polypeptides in a body fluid. In this assay, one of the antibodies is used as the immunoabsorbent and the other as the enzyme-labeled probe.

The above techniques may be conducted essentially as a "one-step" or "two-step" assay. The "one-step" assay involves contacting the Streptococcus polypeptide with immobilized antibody and, without washing, contacting the mixture with the labeled antibody. The "two-step" assay involves washing before contacting the mixture with the labeled antibody. Other conventional methods may also be employed as suitable. It is usually desirable to immobilize one component of the assay system on a support, thereby allowing other components of the system to be brought into contact with the component and readily removed from the sample.

Streptococcus polypeptide-specific antibodies for use in the present invention can be raised against an intact S. pneumoize polypeptide of the present invention or fragment thereof. These polypeptides and fragments may be administered to an animal (e.g., rabbit or mouse) either with a carrier protein (e.g., albumin) or, if long enough (e.g., at least about 25 amino acids), without a carrier.

As used herein, the term "antibody" (Ab) or "monoclonal antibody" (Mab) is meant to include intact molecules as well as antibody fragments (such as, for example, Fab and F(ab').sub.2 fragments) which are capable of specifically binding to a Streptococcus polypeptide. Fab and F(ab').sub.2 fragments lack the Fc fragment of intact antibody, clear more rapidly from the circulation, and may have less non-specific tissue binding of an intact antibody (Wahl et al., J. Nucl. Med. 24:316-325 (1983)). Thus, these fragments are preferred.

The antibodies of the present invention may be prepared by any of a variety of methods. For example, the S. pneumoniae polypeptides identified in Table 1, or fragments thereof, can be administered to an animal in order to induce the production of sera containing polyclonal antibodies. In a preferred method, a preparation of a S. pneumoniae polypeptide of the present invention is prepared and purified to render it substantially free of natural contaminants. Such a preparation is then introduced into an animal in order to produce polyclonal antisera of high specific activity.

In the most preferred method, the antibodies of the present invention are monoclonal antibodies. Such monoclonal antibodies can be prepared using hybridoma technology (Kohler et al., Nature 256:495 (1975); Kohler et al., Eur. J. Immunol. 6:511 (1976); Kohler et al., Eur. J. Immunol. 6:292 (1976); Hammerling et al., In: Monoclonal Antibodies and T-Cell Hybridomas, Elsevier, N.Y., (1981) pp. 563-681 ). In general, such procedures involve immunizing an animal (preferably a mouse) with a S. pneumoniae polypeptide antigen of the present invention. Suitable cells can be recognized by their capacity to bind anti-Streptococcus polypeptide antibody. Such cells may be cultured in any suitable tissue culture medium; however, it is preferable to culture cells in Earle's modified Eagle's medium supplemented with 10% fetal bovine serum (inactivated at about 56.degree. C.), and supplemented with about 10 g/l of nonessential amino acids, about 1,000 U/ml of penicillin, and about 100 .mu.g/ml of streptomycin. The splenocytes of such mice are extracted and fused with a suitable myeloma cell line. Any suitable myeloma cell line may be employed in accordance with the present invention; however, it is preferable to employ the parent myeloma cell line (SP.sub.2 O), available from the American Type Culture Collection, Rockville, Md. After fusion, the resulting hybridoma cells are selectively maintained in HAT medium, and then cloned by limiting dilution as described by Wands et al. (Gastroenterology 80:225-232 (1981)). The hybridoma cells obtained through such a selection are then assayed to identify clones which secrete antibodies capable of binding the Streptococcus polypeptide antigen administered to immunized animal.

Alternatively, additional antibodies capable of binding to Streptococcus polypeptide antigens may be produced in a two-step procedure through the use of anti-idiotypic antibodies. Such a method makes use of the fact that antibodies are themselves antigens, and that, therefore, it is possible to obtain an antibody which binds to a second antibody. In accordance with this method, Streptococcus polypeptide-specific antibodies are used to immunize an animal, preferably a mouse. The splenocytes of such an animal are then used to produce hybridoma cells, and the hybridoma cells are screened to identify clones which produce an antibody whose ability to bind to the Streptococcus polypeptide-specific antibody can be blocked by a Streptococcus polypeptide antigen. Such antibodies comprise anti-idiotypic antibodies to the Streptococcus polypeptide-specific antibody and can be used to immunize an animal to induce formation of further Streptococcus polypeptide-specific antibodies.

It will be appreciated that Fab and F(ab').sub.2 and other fragments of the antibodies of the present invention may be used according to the methods disclosed herein. Such fragments are typically produced by proteolytic cleavage, using enzymes such as papain (to produce Fab fragments) or pepsin (to produce F(ab').sub.2 fragments). Alternatively, Streptococcus polypeptide-binding fragments can be produced through the application of recombinant DNA technology or through synthetic chemistry.

Of special interest to the present invention are antibodies to Streptococcus polypeptide antigens which are produced in humans, or are "humanized" (i.e., non-immunogenic in a human) by recombinant or other technology. Humanized antibodies may be produced, for example by replacing an immunogenic portion of an antibody with a corresponding, but non-immunogenic portion (i.e., chimeric antibodies) (Robinson, R. R. et al., International Patent Publication PCT/US86/02269; Akira, K. et al., European Patent Application 184,187; Taniguchi, M., European Patent Application 171,496; Morrison, S. L. et al., European Patent Application 173,494; Neuberger, M. S. et al., PCT Application WO 86/01533; Cabilly, S. et al., European Patent Application 125,023; Better, M. et al., Science 240:1041-1043 (1988); Liu, A. Y. et al., Proc. Natl. Acad. Sci. USA 84:3439-3443 (1987); Liu, A. Y. et al., J. Immunol. 139:3521-3526 (1987); Sun, L. K. et al., Proc. Natl. Acad. Sci. USA 84:214-218 (1987); Nishimura, Y. et al., Canc. Res. 47:999-1005 (1987); Wood, C. R. et al., Nature 314:446-449 (1985)); Shaw et al., J. Natl. Cancer Inst. 80:1553-1559 (1988). General reviews of "humanized" chimeric antibodies are provided by Morrison, S. L. (Science, 229:1202-1207 (1985)) and by Oi, V. T. et al., BioTechniques 4:214 (1986)). Suitable "humanized" antibodies can be alternatively produced by CDR or CEA substitution (Jones, P. T. et al., Nature 321:552-525 (1986); Verhoeyan et al., Science 239:1534 (1988); Beidler, C. B. et al., J. Immunol. 141:4053-4060 (1988)).

Suitable enzyme labels include, for example, those from the oxidase group, which catalyze the production of hydrogen peroxide by reacting with substrate. Glucose oxidase is particularly preferred as it has good stability and its substrate (glucose) is readily available. Activity of an oxidase label may be assayed by measuring the concentration of hydrogen peroxide formed by the enzyme-labeled antibody/substrate reaction. Besides enzymes, other suitable labels include radioisotopes, such as iodine (.sup.125 I, .sup.121 I ), carbon (.sup.14 C), sulphur (.sup.35 S), tritium (.sup.3 H), indium (.sup.112 In), and technetium (.sup.99m Tc), and fluorescent labels, such as fluorescein and rhodamine, and biotin.

Further suitable labels for the Streptococcus polypeptide-specific antibodies of the present invention are provided below. Examples of suitable enzyme labels include malate dehydrogenase, staphylococcal nuclease, delta-5-steroid isomerase, yeast-alcohol dehydrogenase, alpha-glycerol phosphate dehydrogenase, triose phosphate isomerase, peroxidase, alkaline phosphatase, asparaginase, glucose oxidase, beta-galactosidase, ribonuclease, urease, catalase, glucose-6-phosphate dehydrogenase, glucoamylase, and acetylcholine esterase.

Examples of suitable radioisotopic labels include .sup.3 H, .sup.111 In, .sup.125 I, .sup.131 I, .sup.32 P, .sup.35 S, .sup.14 C, .sup.51 Cr, .sup.57 To, .sup.58 Co, .sup.59 Fe, .sup.75 Se, .sup.152 Eu, .sup.90 Y, .sup.67 Cu, .sup.217 Ci, .sup.211 At, .sup.212 Pb, .sup.47 Sc, .sup.109 Pd, etc. .sup.111 In is a preferred isotope where in vivo imaging is used since its avoids the problem of dehalogenation of the .sup.125 I or .sup.131 I-labeled monoclonal antibody by the liver. In addition, this radionucleotide has a more favorable gamma emission energy for imaging (Perkins et al., Eur. J. Nucl. Med. 10:296-301 (1985); Carasquillo et al., J. NucL Med. 28:281-287 (1987)). For example, .sup.111 In coupled to monoclonal antibodies with 1-(P-isothiocyanatobenzyl)-DPTA has shown little uptake in non-tumorous tissues, particularly the liver, and therefore enhances specificity of tumor localization (Esteban et al., J. Nucl. Med. 28:861-870 (1987)).

Examples of suitable non-radioactive isotopic labels include .sup.157 Gd, .sup.55 Mn, .sup.162 Dy, .sup.52 Tr, and .sup.56 Fe.

Examples of suitable fluorescent labels include an .sup.152 Eu label, a fluorescein label, an isothiocyanate label, a rhodamine label, a phycoerythrin label, a phycocyanin label, an allophycocyanin label, an o-phthaldehyde label, and a fluorescamine label.

Examples of suitable toxin labels include diphtheria toxin, ricin, and cholera toxin.

Examples of chemiluminescent labels include a luminal label, an isoluminal label, an aromatic acridinium ester label, an imidazole label, an acridinium salt label, an oxalate ester label, a luciferin label, a luciferase label, and an aequorin label.

Examples of nuclear magnetic resonance contrasting agents include heavy metal nuclei such as Gd, Mn, and iron.

Typical techniques for binding the above-described labels to antibodies are provided by Kennedy et al., Clin. Chim. Acta 70:1-31 (1976), and Schurs et al., Clin. Chim. Acta 81:1-40 (1977). Coupling techniques mentioned in the latter are the glutaraldehyde method, the periodate method, the dimaleimide method, the m-maleimidobenzyl-N-hydroxy-succinimide ester method, all of which methods are incorporated by reference herein.

In a related aspect, the invention includes a diagnostic kit for use in screening serum containing antibodies specific against S. pneumoniae infection. Such a kit may include an isolated S. pneumoniae antigen comprising an epitope which is specifically immunoreactive with at least one anti-S. pneumoniae antibody. Such a kit also includes means for detecting the binding of said antibody to the antigen. In specific embodiments, the kit may include a recombinantly produced or chemically synthesized peptide or polypeptide antigen. The peptide or polypeptide antigen may be attached to a solid support.

In a more specific embodiment, the detecting means of the above-described kit includes a solid support to which said peptide or polypeptide antigen is attached. Such a kit may also include a non-attached reporter-labelled anti-human antibody. In this embodiment, binding of the antibody to the S. pneumoniae antigen can be detected by binding of the reporter labelled antibody to the anti-S. pneumoniae antibody.

In a related aspect, the invention includes a method of detecting S. pneumoniae infection in a subject. This detection method includes reacting a body fluid, preferrably serum, from the subject with an isolated S. pneumoniae antigen, and examining the antigen for the presence of bound antibody. In a specific embodiment, the method includes a polypeptide antigen attached to a solid support, and serum is reacted with the support. Subsequently, the support is reacted with a reporter-labeled anti-human antibody. The support is then examined for the presence of reporter-labeled antibody.

The solid surface reagent employed in the above assays and kits is prepared by known techniques for attaching protein material to solid support material, such as polymeric beads, dip sticks, 96-well plates or filter material. These attachment methods generally include non-specific adsorption of the protein to the support or covalent attachment of the protein, typically through a free amine group, to a chemically reactive group on the solid support, such as an activated carboxyl, hydroxyl, or aldehyde group. Alternatively, streptavidin coated plates can be used in conjunction with biotinylated antigen(s).

THERAPEUTICS AND MODES OF ADMINISTRATION

The present invention also provides vaccines comprising one or more polypeptides of the present invention. Heterogeneity in the composition of a vaccine may be provided by combining S. pneumoniae polypeptides of the present invention. Multi-component vaccines of this type are desirable because they are likely to be more effective in eliciting protective immune responses against multiple species and strains of the Streptococcus genus than single polypeptide vaccines. Thus, as discussed in detail below, a multi-component vaccine of the present invention may contain one or more, preferably 2 to about 20, more preferably 2 to about 15, and most preferably 3 to about 8, of the S. pneumoniae polypeptides identified in Table 1, or fragments thereof.

Multi-component vaccines are known in the art to elicit antibody production to numerous immunogenic components. Decker, M. and Edwards, K., J. Infect. Dis. 174:S270-275 (1996). In addition, a hepatitis B, diphtheria, tetanus, pertussis tetravalent vaccine has recently been demonstrated to elicit protective levels of antibodies in human infants against all four pathogenic agents. Aristegui, J. et al., Vaccine 15:7-9 (1997).

The present invention thus also includes multi-component vaccines. These vaccines comprise more than one polypeptide, immunogen or antigen. An example of such a multi-component vaccine would be a vaccine comprising more than one of the S. pneumoniae polypeptides described in Table 1. A second example is a vaccine comprising one or more, for example 2 to 10, of the S. pneumoniae polypeptides identified in Table 1 and one or more, for example 2 to 10, additional polypeptides of either streptococcal or non-streptococcal origin. Thus, a multi-component vaccine which confers protective immunity to both a Streptococcal infection and infection by another pathogenic agent is also within the scope of the invention.

As indicated above, the vaccines of the present invention are expected to elicit a protective immune response against infections caused by species and strains of Streptococcus other than strain of S. pneumoniae deposited with that ATCC.

Further within the scope of the invention are whole cell and whole viral vaccines. Such vaccines may be produced recombinantly and involve the expression of one or more of the S. pneumoniae polypeptides described in Table 1. For example, the S. pneumoniae polypeptides of the present invention may be either secreted or localized intracellular, on the cell surface, or in the periplasmic space. Further, when a recombinant virus is used, the S. pneumoniae polypeptides of the present invention may, for example, be localized in the viral envelope, on the surface of the capsid, or internally within the capsid. Whole cells vaccines which employ cells expressing heterologous proteins are known in the art. See, e.g., Robinson, K. et al., Nature Biotech. 15:653-657 (1997); Sirard, J. et al., Infect. Immun. 65:2029-2033 (1997); Chabalgoity, J. et al., Infect. Immun. 65:2402-2412 (1997). These cells may be administered live or may be killed prior to administration. Chabalgoity, J. et al., supra, for example, report the successful use in mice of a live attenuated Salmonella vaccine strain which expresses a portion of a platyhelminth fatty acid-binding protein as a fusion protein on its cells surface.

A multi-component vaccine can also be prepared using techniques known in the art by combining one or more S. pneumoniae polypeptides of the present invention, or fragments thereof, with additional non-streptococcal components (e.g., diphtheria toxin or tetanus toxin, and/or other compounds known to elicit an immune response). Such vaccines are useful for eliciting protective immune responses to both members of the Streptococcus genus and non-streptococcal pathogenic agents.

The vaccines of the present invention also include DNA vaccines. DNA vaccines are currently being developed for a number of infectious diseases. Boyer, J et al., Nat. Med. 3:526-532 (1997); reviewed in Spier, R., Vaccine 14:1285-1288 (1996). Such DNA vaccines contain a nucleotide sequence encoding one or more S. pneumoniae polypeptides of the present invention oriented in a manner that allows for expression of the subject polypeptide. The direct administration of plasmid DNA encoding B. burgdorgeri OspA has been shown to elicit protective immunity in mice against borrelial challenge. Luke, C. et al., J. Infect. Dis. 175:91-97 (1997).

The present invention also relates to the administration of a vaccine which is co-administered with a molecule capable of modulating immune responses. Kim, J. et al., Nature Biotech. 15:641-646 (1997), for example, report the enhancement of immune responses produced by DNA immunizations when DNA sequences encoding molecules which stimulate the immune response are co-administered. In a similar fashion, the vaccines of the present invention may be co-administered with either nucleic acids encoding immune modulators or the immune modulators themselves. These immune modulators include granulocyte macrophage colony stimulating factor (GM-CSF) and CD86.

The vaccines of the present invention may be used to confer resistance to streptococcal infection by either passive or active immunization. When the vaccines of the present invention are used to confer resistance to streptococcal infection through active immunization, a vaccine of the present invention is administered to an animal to elicit a protective immune response which either prevents or attenuates a streptococcal infection. When the vaccines of the present invention are used to confer resistance to streptococcal infection through passive immunization, the vaccine is provided to a host animal (e.g., human, dog, or mouse), and the antisera elicited by this antisera is recovered and directly provided to a recipient suspected of having an infection caused by a member of the Streptococcus genus.

The ability to label antibodies, or fragments of antibodies, with toxin molecules provides an additional method for treating streptococcal infections when passive immunization is conducted. In this embodiment, antibodies, or fragments of antibodies, capable of recognizing the S. pneumoniae polypeptides disclosed herein, or fragments thereof, as well as other Streptococcus proteins, are labeled with toxin molecules prior to their administration to the patient. When such toxin derivatized antibodies bind to Streptococcus cells, toxin moieties will be localized to these cells and will cause their death.

The present invention thus concerns and provides a means for preventing or attenuating a streptococcal infection resulting from organisms which have antigens that are recognized and bound by antisera produced in response to the polypeptides of the present invention. As used herein, a vaccine is said to prevent or attenuate a disease if its administration to an animal results either in the total or partial attenuation (i.e., suppression) of a symptom or condition of the disease, or in the total or partial immunity of the animal to the disease.

The administration of the vaccine (or the antisera which it elicits) may be for either a "prophylactic" or "therapeutic" purpose. When provided prophylactically, the compound(s) are provided in advance of any symptoms of streptococcal infection. The prophylactic administration of the compound(s) serves to prevent or attenuate any subsequent infection. When provided therapeutically, the compound(s) is provided upon or after the detection of symptoms which indicate that an animal may be infected with a member of the Streptococcus genus. The therapeutic administration of the compound(s) serves to attenuate any actual infection. Thus, the S. pneumoniae polypeptides, and fragments thereof, of the present invention may be provided either prior to the onset of infection (so as to prevent or attenuate an anticipated infection) or after the initiation of an actual infection.

The polypeptides of the invention, whether encoding a portion of a native protein or a functional derivative thereof, may be administered in pure form or may be coupled to a macromolecular carrier. Example of such carriers are proteins and carbohydrates. Suitable proteins which may act as macromolecular carrier for enhancing the immunogenicity of the polypeptides of the present invention include keyhole limpet hemacyanin (KLH) tetanus toxoid, pertussis toxin, bovine serum albumin, and ovalbumin. Methods for coupling the polypeptides of the present invention to such macromolecular carriers are disclosed in Harlow et al., Antibodies: A Laboratory Manual, 2nd Ed.; Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1988), the entire disclosure of which is incorporated by reference herein.

A composition is said to be "pharmacologically acceptable" if its administration can be tolerated by a recipient animal and is otherwise suitable for administration to that animal. Such an agent is said to be administered in a "therapeutically effective amount" if the amount administered is physiologically significant. An agent is physiologically significant if its presence results in a detectable change in the physiology of a recipient patient.

While in all instances the vaccine of the present invention is administered as a pharmacologically acceptable compound, one skilled in the art would recognize that the composition of a pharmacologically acceptable compound varies with the animal to which it is administered. For example, a vaccine intended for human use will generally not be co-administered with Freund's adjuvant. Further, the level of purity of the S. pneumoniae polypeptides of the present invention will normally be higher when administered to a human than when administered to a non-human animal.

As would be understood by one of ordinary skill in the art, when the vaccine of the present invention is provided to an animal, it may be in a composition which may contain salts, buffers, adjuvants, or other substances which are desirable for improving the efficacy of the composition. Adjuvants are substances that can be used to specifically augment a specific immune response. These substances generally perform two functions: (1) they protect the antigen(s) from being rapidly catabolized after administration and (2) they nonspecifically stimulate immune responses.

Normally, the adjuvant and the composition are mixed prior to presentation to the immune system, or presented separately, but into the same site of the animal being immunized. Adjuvants can be loosely divided into several groups based upon their composition. These groups include oil adjuvants (for example, Freund's complete and incomplete), mineral salts (for example, AlK(SO.sub.4).sub.2, AlNa(SO.sub.4).sub.2, AlNH.sub.4 (SO.sub.4), silica, kaolin, and carbon), polynucleotides (for example, poly IC and poly AU acids), and certain natural substances (for example, wax D from Mycobacterium tuberculosis, as well as substances found in Corynebacterium parvum, or Bordetella pertussis, and members of the genus Brucella. Other substances useful as adjuvants are the saponins such as, for example, Quil A. (Superfos A/S, Denmark). Preferred adjuvants for use in the present invention include aluminum salts, such as AlK(SO.sub.4).sub.2, AlNa(SO.sub.4).sub.2, and AlNH.sub.4 (SO.sub.4). Examples of materials suitable for use in vaccine compositions are provided in Remington's Pharmaceutical Sciences (Osol, A, Ed, Mack Publishing Co, Easton, Pa., pp. 1324-1341 (1980), which reference is incorporated herein by reference).

The therapeutic compositions of the present invention can be administered parenterally by injection, rapid infusion, nasopharyngeal absorption (intranasopharangeally), dermoabsorption, or orally. The compositions may alternatively be administered intramuscularly, or intravenously. Compositions for parenteral administration include sterile aqueous or non-aqueous solutions, suspensions, and emulsions. Examples of non-aqueous solvents are propylene glycol, polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such as ethyl oleate. Carriers or occlusive dressings can be used to increase skin permeability and enhance antigen absorption. Liquid dosage forms for oral administration may generally comprise a liposome solution containing the liquid dosage form. Suitable forms for suspending liposomes include emulsions, suspensions, solutions, syrups, and elixirs containing inert diluents commonly used in the art, such as purified water. Besides the inert diluents, such compositions can also include adjuvants, wetting agents, emulsifying and suspending agents, or sweetening, flavoring, or perfuming agents.

Therapeutic compositions of the present invention can also be administered in encapsulated form. For example, intranasal immunization of mice against Bordetella pertussis infection using vaccines encapsulated in biodegradable microsphere composed of poly(DL-lactide-co-glycolide) has been shown to stimulate protective immune responses. Shahin, R. et al., Infect. Immun. 63:1195-1200 (1995). Similarly, orally administered encapsulated Salmonella typhimurium antigens have also been shown to elicit protective immunity in mice. Allaoui-Attarki, K. et al., Infect. Immun. 65:853-857 (1997). Encapsulated vaccines of the present invention can be administered by a variety of routes including those involving contacting the vaccine with mucous membranes (e.g., intranasally, intracolonicly, intraduodenally).

Many different techniques exist for the timing of the immunizations when a multiple administration regimen is utilized. It is possible to use the compositions of the invention more than once to increase the levels and diversities of expression of the immunoglobulin repertoire expressed by the immunized animal. Typically, if multiple immunizations are given, they will be given one to two months apart.

According to the present invention, an "effective amount" of a therapeutic composition is one which is sufficient to achieve a desired biological effect. Generally, the dosage needed to provide an effective amount of the composition will vary depending upon such factors as the animal's or human's age, condition, sex, and extent of disease, if any, and other variables which can be adjusted by one of ordinary skill in the art.

The antigenic preparations of the invention can be administered by either single or multiple dosages of an effective amount. Effective amounts of the compositions of the invention can vary from 0.01-1,000 .mu.g/ml per dose, more preferably 0.1-500 .mu.g/ml per dose, and most preferably 10-300 .mu.g/ml per dose.

Having now generally described the invention, the same will be more readily understood through reference to the following example which is provided by way of illustration, and is not intended to be limiting of the present invention, unless specified.

EXAMPLES EXAMPLE 1 Expression and Purification of S. pneumoniae Polypeptides in E. coli

The bacterial expression vector pQE10 (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, Calif., 91311) is used in this example for cloning of the nucleotide sequences shown in Table 1 and for expressing the polypeptides identified in Table 1. The components of the pQE10 plasmid are arranged such that the inserted DNA sequence encoding a polypeptide of the present invention expresses the polypeptide with the six His residues (i.e., a "6.times.His tag")) covalently linked to the amino terminus.

The DNA sequences encoding the desired portions of the polypeptides of Table 1 are amplified using PCR oligonucleotide primers from either a DNA library constructed from S. pnuemoniae, such as the one deposited by the inventors at the ATCC for convenience, ATCC Deposit No. 97755, or from DNA isolated from the same organism such as the S. pneumoniae strain deposited with the ATCC as Deposit No. 55840. A list of PCR primers which can be used for this purpose is provided in Table 3, below. The PCR primers anneal to the nucleotide sequences encoding both the amino terminal and carboxy terminal amino acid sequences of the desired portion of the polypeptides of Table 1. Additional nucleotides containing restriction sites to facilitate cloning in the pQE10 vector were added to the 5' and 3' primer sequences, respectively. Such restriction sites are listed in Table 3 for each primer. In each case, the primer comprises, from the 5' end, 4 random nucleotides to prevent "breathing" during the annealing process, a restriction site (shown in Table 3), and approximately 15 nucleotides of S. pneumoniae ORF sequence (the complete sequence of each cloning primer is shown as SEQ ID NO:227 through SEQ ID NO:452).

For cloning the polypeptides of Table 1, the 5' and 3' primers were selected to amplify their respective nucleotide coding sequences. One of ordinary skill in the art would appreciate that the point in the protein coding sequence where the 5' primer begins may be varied to amplify a DNA segment encoding any desired portion of the complete amino acid sequences described in Table 1. Similarly, one of ordinary skill in the art would further appreciate that the point in the protein coding sequence where the 3' primer begins may also be varied to amplify a DNA segment encoding any desired portion of the complete amino acid sequences described in Table 1.

The amplified DNA fragment and the pQE10 vector are digested with the appropriate restriction enzyme(s) and the digested DNAs are then ligated together. The ligation mixture is transformed into competent E. coli cells using standard procedures such as those described in Sambrook et al., Molecular Cloning: a Laboratory Manual, 2nd Ed.; Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989). Transformants are identified by their ability to grow under selective pressure on LB plates. Plasmid DNA is isolated from resistant colonies and the identity of the cloned DNA confirmed by restriction analysis, PCR and DNA sequencing.

Clones containing the desired constructs are grown overnight ("O/N") in liquid culture under selection. The O/N culture is used to inoculate a large culture, at a dilution of approximately 1:25 to 1:250. The cells are grown to an optical density at 600 nm ("OD600") of between 0.4 and 0.6. Isopropyl-b-D-thiogalactopyranoside ("IPTG") is then added to a final concentration of 1 mM to induce transcription from the lac repressor sensitive promoter, by inactivating the lacI repressor. Cells subsequently are incubated further for 3 to 4 hours. Cells are then harvested by centrifugation.

The cells are stirred for 3-4 hours at 4 C in 6M guanidine-HCl, pH 8. The cell debris is removed by centrifugation, and the supernatant containing the protein of interest is loaded onto a nickel-nitrilo-tri-acetic acid ("NiNTA") affinity resin column (available from QIAGEN, Inc., supra). Proteins with a 6.times.His tag bind to the NI-NTA resin with high affinity and can be purified in a simple one-step procedure (for details see: The QIAexpressionist, 1995, QIAGEN, Inc., supra). Briefly, the supernatant is loaded onto the column in 6 M guanidine-HCl, pH8, the column is first washed with 10 volumes of 6 M guanidine-HCl, pH8, then washed with 10 volumes of 6 M guanidine-HCl pH6, and finally the polypeptide is eluted with 6 M guanidine-HCl, pH 5.0.

The purified protein is then renatured by dialyzing it against phosphate-buffered saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM NaCl. Alternatively, the protein can be successfully refolded while immobilized on the Ni-NTA column. The recommended conditions are as follows: renature using a linear 6M-1M urea gradient in 500 mM NaCl, 20% glycerol, 20 mM Tris/HCl pH7.4, containing protease inhibitors. The renaturation should be performed over a period of 1.5 hours or more. After renaturation the proteins can be eluted by the addition of 250 mM immidazole. Imidazole is removed by a final dialyzing step against PBS or 50 mM sodium acetate pH6 buffer plus 200 mM NaCl. The purified protein is stored at 4.degree. C. or frozen at -80.degree. C.

The DNA sequences encoding the amino acid sequences of Table 1 may also be cloned and expressed as fusion proteins by a protocol similar to that described directly above, wherein the pET-32b(+) vector (Novagen, 601 Science Drive, Madison, Wis. 53711) is preferentially used in place of pQE10.

Each of the polynucleotides shown in Table 1, was successfully amplified and subcloned into pQE10 as described above using the PCR primers shown in Table 3. These pQE10 plasmids containing the DNAs of Table 1, except SP023, SP042, SP054, SP063, SP081, SP092, SP114, SP122, SP123, SP126, and SP127, were deposited with the ATCC as a pooled deposit as a convenience to those of skill in the art. This pooled deposit was desposited on Oct. 16, 1997 and given ATCC Deposit No. 209369. Those of ordinary skill in the art appreciate that isolating an individual plasmid from the pooled deposit is trivial provided the information and reagents described herein. Each of the deposited clones is capable of expressing its encoded S. pneumoniae polypeptide.

EXAMPLE 2 Immunization and Detection of Immune Responses

Methods

Growth of bacterial innoculum, immunization of Mice and Challenge with S pneumoniae.

Propagation and storage of, and challenge by S. pneumoniae are preformed essentially as described in Aaberge, I. S. et al., Virulence of Streptococcus pneumoniae in mice: a standardized method for preparation and frozen storage of the experimental bacterial inoculum, Microbial Pathogenesis, 18:141 (1995), incorporated herein by reference.

Briefly, Todd Hewitt (TH) broth (Difco laboratories, Detroit, Mich.) with 17% FCS, and horse blood agar plates are used for culturing the bacteria. Both broth and blood plates are incubated at 37.degree. C. in a 5% CO.sub.2 atmosphere. Blood plates are incubated for 18 hr. The culture broth is regularly 10-fold serially diluted in TH broth kept at room temperature and bacterial suspensions are kept at room temperature until challenge of mice.

For active immunizations C3H/HeJ mice (The Jackson Laboratory, Bar Harbor, Me.) are injected intraperitoneally (i.p.) at week 0 with 20 g of recombinant streptococcal protein, or phosphate-buffered saline (PBS), emulsified with complete Freund's adjuvant (CFA), given a similar booster immunization in incomplete Freund's adjuvant (IFA) at week 4, and challenged at week 6. For challenge S. pneumoniae are diluted in TH broth from exponentially-growing cultures and mice are injected subcutaneously (s.c.) at the base of the tail with 0.1 ml of these dilutions (serial dilutions are used to find medium infectious dose). Streptococci used for challenge are passaged fewer than six times in vitro. To assess infection, blood samples are obtained from the distal part of the lateral femoral vein into heparinized capillary tubes. A 25 ul blood sample is serially 10-fold diluted in TH broth, and 25 ul of diluted and undiluted blood is plated onto blood agar plates. The plates are incubated for 18 hr. and colonies are counted.

Other methods are known in the art, for example, see Langermann, S. et al., J. Exp. Med., 180:2277 (1994), incorporated herein by reference.

Immunoassays

Several immunoassay formats are used to quantify levels of streptococcal-specific antibodies (ELISA and immunoblot), and to evaluate the functional properties of these antibodies (growth inhibition assay). The ELISA and immunoblot assays are also used to detect and quantify antibodies elicited in response to streptococcal infection that react with specific streptococcal antigens. Where antibodies to certain streptococcal antigens are elicited by infection this is taken as evidence that the streptococcal proteins in question are expressed in vivo. Absence of infection-derived antibodies (seroconversion) following streptococcal challenge is evidence that infection is prevented or suppressed. The immunoblot assay is also used to ascertain whether antibodies raised against recombinant streptococcal antigens recognize a protein of similar size in extracts of whole streptococci. Where the natural protein is of similar, or identical, size in the immunoblot assay to the recombinant version of the same protein, this is taken as evidence that the recombinant protein is the product of a full-length clone of the respective gene.

Enzyme-Linked Immunosorbant Assay (ELISA).

The ELISA is used to quantify levels of antibodies reactive with streptococcus antigens elicited in response to immunization with these streptococcal antigens. Wells of 96 well microtiter plates (Immunlon 4, Dynatech, Chantilly, Va., or equivalent) are coated with antigen by incubating 50 1 of 1 g/mil protein antigen solution in a suitable buffer, typically 0.1 M sodium carbonate buffer at pH 9.6. After decanting unbound antigen, additional binding sites are blocked by incubating 100 1 of 3% nonfat milk in wash buffer (PBS, 0.2% Tween 20, pH 7.4). After washing, duplicate serial two-fold dilutions of sera in PBS, Tween 20, 1% fetal bovine serum, are incubated for 1 hr, removed, wells are washed three times, and incubated with horseradish peroxidase-conjugated goat anti-mouse IgG. After three washes, bound antibodies are detected with H.sub.2 O.sub.2 and 2,2'-azino-di-(3-ethylbenzthiazoline sulfonate) (Schwan, T. G., et al., Proc. Natl. Acad. Sci. USA 92:2909-2913 (1985)) (ABTS.RTM., Kirkegaard & Perry Labs., Gaithersburg, Md.) and A.sub.405 is quantified with a Molecular Devices, Corp. (Menlo Park, Calif.) VmaxTm plate reader. IgG levels twice the background level in serum from naive mice are assigned the minimum titer of 1:100.

Sodiumdodecylsulfate-Polyacrylamide Gel Electrophoresis (SDS-PAGE) and Immunoblotting

Using a single well format, total streptococcal protein extracts or recombinant streptococcal antigen are boiled in SDS/2-ME sample buffer before electrophoresis through 3% acrylamide stacking gels, and resolving gels of higher acrylamide concentration, typically 10-15% acrylamide monomer. Gels are electro-blotted to nitrocellulose membranes and lanes are probed with dilutions of antibody to be tested for reactivity with specific streptococcal antigens, followed by the appropriate secondary antibody-enzyme (horseradish peroxidase) conjugate. When it is desirable to confirm that the protein had transferred following electro-blotting, membranes are stained with Ponceau S. Immunoblot signals from bound antibodies are detected on x-ray film as chemiluminescence using ECL.TM. reagents (Amersham Corp., Arlington Heights, Ill.).

EXAMPLE 3 Detection of Streptococcus mRNA expression

Northern blot analysis is carried out using methods described by, among others, Sambrook et al., supra. to detect the expression of the S. pneumoniae nucleotide sequences of the present invention in animal tissues. A cDNA probe containing an entire nucleotide sequence shown in Table 1 is labeled with .sup.32 P using the rediprime.TM. DNA labeling system (Amersham Life Science), according to manufacturer's instructions. After labeling, the probe is purified using a CHROMA SPIN-100 .TM. column (Clontech Laboratories, Inc.), according to manufacturer's protocol number PT1200-1. The purified labeled probe is then used to detect the expression of Streptococcus MRNA in an animal tissue sample.

Animal tissues, such as blood or spinal fluid, are examined with the labeled probe using ExpressHyb.TM. hybridization solution (Clontech) according to manufacturer's protocol number PT1190-1. Following hybridization and washing, the blots are mounted and exposed to film at -70.degree. C. overnight, and films developed according to standard procedures.

It will be clear that the invention may be practiced otherwise than as particularly described in the foregoing description and examples.

Numerous modifications and variations of the present invention are possible in light of the above teachings and, therefore, are within the scope of the appended claims.

The entire disclosure of all publications (including patents, patent applications, journal articles, laboratory manuals, books, or other documents) cited herein are hereby incorporated by reference.

                                    TABLE 1                                 

     __________________________________________________________________________

     SP001  nucleotide  (SEQ  ID  NO:1)                                        

     TAAAATCTACGACAATAAAAATCAACTCATTGCTGACTTGGGTTCTGAACGCCGCGTCAATGCCCAAGC     

     TAATGATATTCCCACAGATTTGGTTAAGGCAATCGTTTCTATCGAAGACCATCGCTTCTTCGACCACAG     

     GGGGATTGATACCATCCGTATCCTGGGAGCTTTCTTGCGCAATCTGCAAAGCAATTCCCTCCAAGGTGG     

     ATCAACTCTCACCCAACAGTTGATTAAGTTGACTTACTTTTCAACTTCGACTTCCGACCAGACTATTTC     

     TCGTAAGGCTCAGGAAGCTTGGTTAGCGATTCAGTTAGAACAAAAAGCAACCAAGCAAGAAATCTTGAC     

     CTACTATATAAATAAGGTCTACATGTCTAATGGGAACTATGGAATGCAGACAGCAGCTCAAAACTACTA     

     TGGTAAAGACCTCAATAATTTAAGTTTACCTCAGTTAGCCTTGCTGGCTGGAATGCCTCAGGCACCAAA     

     CCAATATGACCCCTATTCACATCCAGAAGCAGCCCAAGACCGCCGAAACTTGGTCTTATCTGAAATGAA     

     AAATCAAGGCTACATCTCTGCTGAACAGTATGAGAAAGCAGTCAATACACCAATTACTGATGGACTACA     

     AAGTCTCAAATCAGCAAGTAATTACCCTGCTTACATGGATAATTACCTCAAGGAAGTCATCAATCAAGT     

     TGAAGAAGAAACAGGCTATAACCTACTCACAACTGGGATGGATGTCTACACAAATGTAGACCAAGAAGC     

     TCAAAAACATCTGTGGGATATTTACAATACAGACGAATACGTTGCCTATCCAGACGATGAATTGCAAGT     

     CGCTTCTACCATTGTTGATGTTTCTAACGGTAAAGTCATTGCCCAGCTAGGAGCACGCCATCAGTCAAG     

     TAATGTTTCCTTCGGAATTAACCAAGCAGTAGAAACAAACCGCGACTGGGGATCAACTATGAAACCGAT     

     CACAGACTATGCTCCTGCCTTGGAGTACGGTGTCTACGATTCAACTGCTACTATCGTTCACGATGAGCC     

     CTATAACTACCCTGGGACAAATACTCCTGTTTATAACTGGGATAGGGGCTACTTTGGCAACATCACCTT     

     GCAATACGCCCTGCAACAATCGCGAAACGTCCCAGCCGTGGAAACTCTAAACAAGGTCGGACTCAACCG     

     CGCCAAGACTTTCCTAAATGGTCTAGGAATCGACTACCCAAGTATTCACTACTCAAATGCCATTTCAAG     

     TAACACAACCGAATCAGACAAAAAATATGGAGCAAGTAGTGAAAAGATGGCTGCTGCTTACGCTGCCTT     

     TGCAAATGGTGGAACTTACTATAAACCAATGTATATCCATAAAGTCGTCTTTAGTGATGGGAGTGAAAA     

     AGAGTTCTCTAATGTCGGAACTCGTGCCATGAAGGAAACGACAGCCTATATGATGACCGACATGATGAA     

     AACAGTCTTGACTTATGGAACTGGACGAAATGCCTATCTTGCTTGGCTCCCTCAGGCTGGTAAAACAGG     

     AACCTCTAACTATACAGACGAGGAAATTGAAAACCACATCAAGACCTCTCAATTTGTAGCACCTGATGA     

     ACTATTTGCTGGCTATACGCGTAAATATTCAATGGCTGTATGGACAGGCTATTCTAACCGTCTGACACC     

     ACTTGTAGGCAATGGCCTTACGGTCGCTGCCAAAGTTTACCGCTCTATGATGACCTACCTGTCTGAAGG     

     AAGCAATCCAGAAGATTGGAATATACCAGAGGGGCTCTACAGAAATGGAGAATTCGTATTTAAAAATGG     

     TGCTCGTTCTACGTGGAACTCACCTGCTCCACAACAACCCCCATCAACTGAAAGTTCAAGCTCATCATC     

     AGATAGTTCAACTTCACAGTCTAGCTCAACCACTCCAAGCACAAATAATAGTACGACTACCAATCCTAA     

     CAATAATACGCAACAATCAAATACAACCCCTGATCAACAAAATCAGAATCCTCAACCAGCACAACCA       

     SP001  AMINO  ACID  (SEQ  ID  NO:2)                                       

     KIYDNKNQLIADLGSERRVNAQANDIPTDLVKAIVSIEDHRFFDHRGIDTIRILGAFLRNLQSNSLQGG     

     STLTQQLIKLTYFSTSTSDQTISRKAQEAWLAIQLEQKATKQEILTYYINKVYMSNGNYGMQTAAQNYY     

     GKDLNNLSLPQLALLAGMPQAPNQYDPYSHPEAAQDRRNLVLSEMKNQGYISAEQYEKAVNTPITDGLQ     

     SLKSASNYPAYMDNYLKEVINQVEEETGYNLLTTGMDVYTNVDQEAQKHLWDIYNTDEYVAYPDDELQV     

     ASTIVDVSNGKVIAQLGARHQSSNVSFGINQAVETNRDWGSTMKPITDYAPALEYGVYDSTATIVHDEP     

     YNYPGTNTPVYNWDRGYFGNITLQYALQQSRNVPAVETLNKVGLNRAKTFLNGLGIDYPSIHYSNAISS     

     NTTESDKKYGASSEKMAAAYAAFANGGTYYKPMYIHKVVFSDGSEKEFSNVGTRAMKETTAYMMTDMMK     

     TVLTYGTGRNAYLAWLPQAGKTGTSNYTDEEIENHIKTSQFVAPDELFAGYTRKYSMAVWTGYSNRLTP     

     LVGNGLTVAAKVYRSMMTYLSEGSNPEDWNIPEGLYRNGEFVFKNGARSTWNSPAPQQPPSTESSSSSS     

     DSSTSQSSSTTPSTNNSTTTNPNNNTQQSNTTPDQQNQNPQPAQP                             

     SP004  nucleotide  (SEQ  ID  NO:3)                                        

     AAATTACAATACGGACTATGAATTGACCTCTGGAGAAAAATTACCTCTTCCTAAAGAGATTTCAGGTTA     

     CACTTATATTGGATATATCAAAGAGGGAAAAACGACTTCTGAGTCTGAAGTAAGTAATCAAAAGAGTTC     

     AGTTGCCACTCCTACAAAACAACAAAAGGTGGATTATAATGTTACACCGAATTTTGTAGACCATCCATC     

     AACAGTACAAGCTATTCAGGAACAAACACCTGTTTCTTCAACTAAGCCGACAGAAGTTCAAGTAGTTGA     

     AAAACCTTTCTCTACTGAATTAATCAATCCAAGAAAAGAAGAGAAACAATCTTCAGATTCTCAAGAACA     

     ATTAGCCGAACATAAGAATCTAGAAACGAAGAAAGAGGAGAAGATTTCTCCAAAAGAAAAGACTGGGGT     

     AAATACATTAAATCCACAGGATGAAGTTTTATCAGGTCAATTGAACAAACCTGAACTCTTATATCGTGA     

     GGAAACTATGGAGACAAAAATAGATTTTCAAGAAGAAATTCAAGAAAATCCTGATTTAGCTGAAGGAAC     

     TGTAAGAGTAAAACAAGAAGGTAAATTAGGTAAGAAAGTTGAAATCGTCAGAATATTCTCTGTAAACAA     

     GGAAGAAGTTTCGCGAGAAATTGTTTCAACTTCAACGACTGCGCCTAGTCCAAGAATAGTCGAAAAAGG     

     TACTAAAAAAACTCAAGTTATAAAGGAACAACCTGAGACTGGTGTAGAACATAAGGACGTACAGTCTGG     

     AGCTATTGTTGAACCCGCAATTCAGCCTGAGTTGCCCGAAGCTGTAGTAAGTGACAAAGGCGAACCAGA     

     AGTTCAACCTACATTACCCGAAGCAGTTGTGACCGACAAAGGTGAGACTGAGGTTCAACCAGAGTCGCC     

     AGATACTGTGGTAAGTGATAAAGGTGAACCAGAGCAGGTAGCACCGCTTCCAGAATATAAGGGTAATAT     

     TGAGCAAGTAAAACCTGAAACTCCGGTTGAGAAGACCAAAGAACAAGGTCCAGAAAAAACTGAAGAAGT     

     TCCAGTAAAACCAACAGAAGAAACACCAGTAAATCCAAATGAAGGTACTACAGAAGGAACCTCAATTCA     

     AGAAGCAGAAAATCCAGTTCAACCTGCAGAAGAATCAACAACGAATTCAGAGAAAGTATCACCAGATAC     

     ATCTAGCAAAAATACTGGGGAAGTGTCCAGTAATCCTAGTGATTCGACAACCTCAGTTGGAGAATCAAA     

     TAAACCAGAACATAATGACTCTAAAAATGAAAATTCAGAAAAAACTGTAGAAGAAGTTCCAGTAAATCC     

     AAATGAAGGCACAGTAGAAGGTACCTCAAATCAAGAAACAGAAAAACCAGTTCAACCTGCAGAAGAAAC     

     ACAAACAAACTCTGGGAAAATAGCTAACGAAAATACTGGAGAAGTATCCAATAAACCTAGTGATTCAAA     

     ACCACCAGTTGAAGAATCAAATCAACCAGAAAAAAACGGAACTGCAACAAAACCAGAAAATTCAGGTAA     

     TACAACATCAGAGAATGGACAAACAGAACCAGAACCATCAAACGGAAATTCAACTGAGGATGTTTCAAC     

     CGAATCAAACACATCCAATTCAAATGGAAACGAAGAAATTAAACAAGAAAATGAACTAGACCCTGATAA     

     AAAGGTAGAAGAACCAGAGAAAACACTTGAATTAAGAAATGTTTCCGACCTAGAGTTA                

     SP004  amino  acid  (SEQ  ID  NO:4)                                       

     NYNTDYELTSGEKLPLPKEISGYTYIGYIKEGKTTSESEVSNQKSSVATPTKQQKVDYNVTPNFVDHPS     

     TVQAIQEQTPVSSTKPTEVQVVEKPFSTELINPRKEEKQSSDSQEQLAEHKNLETKKEEKISPKEKTGV     

     NTLNPQDEVLSGQLNKPELLYREETMETKIDFQEEIQENPDLAEGTVRVKQEGKLGKKVEIVRIFSVNK     

     EEVSREIVSTSTTAPSPRIVEKGTKKTQVIKEQPETGVEHKDVQSGAIVEPAIQPELPEAVVSDKGEPE     

     VQPTLPEAVVTDKGETEVQPESPDTVVSDKGEPEQVAPLPEYKGNIEQVKPETPVEKTKEQGPEKTEEV     

     PVKPTEETPVNPNEGTTEGTSIQEAENPVQPAEESTTNSEKVSPDTSSKNTGEVSSNPSDSTTSVGESN     

     KPEHNDSKNENSEKTVEEVPVNPNEGTVEGTSNQETEKPVQPAEETQTNSGKIANENTGEVSNKPSDSK     

     PPVEESNQPEKNGTATKPENSGNTTSENGQTEPEPSNGNSTEDVSTESNTSNSNGNEEIKQENELDPDK     

     KVEEPEKTLELRNVSDLEL                                                       

     SP006  nucleotide  (SEQ  ID  NO:5)                                        

     TGAGAATCAAGCTACACCCAAAGAGACTAGCGCTCAAAAGACAATCGTCCTTGCTACAGCTGGCGACGT     

     GCCACCATTTGACTACGAAGACAAGGGCAATCTGACAGGCTTTGATATCGAAGTTTTAAAGGCAGTAGA     

     TGAAAAACTCAGCGACTACGAGATTCAATTCCAAAGAACCGCCTGGGAGAGCATCTTCCCAGGACTTGA     

     TTCTGGTCACTATCAGGCTGCGGCCAATAACTTGAGTTACACAAAAGAGCGTGCTGAAAAATACCTTTA     

     CTCGCTTCCAATTTCCAACAATCCCCTCGTCCTTGTCAGCAACAAGAAAAATCCTTTGACTTCTCTTGA     

     CCAGATCGCTGGTAAAACAACACAAGAGGATACCGGAACTTCTAACGCTCAATTCATCAATAACTGGAA     

     TCAGAAACACACTGATAATCCCGCTACAATTAATTTTTCTGGTGAGGATATTGGTAAACGAATCCTAGA     

     CCTTGCTAACGGAGAGTTTGATTTCCTAGTTTTTGACAAGGTATCCGTTCAAAAGATTATCAAGGACCG     

     TGGTTTAGACCTCTCAGTCGTTGATTTACCTTCTGCAGATAGCCCCAGCAATTATATCATTTTCTCAAG     

     CGACCAAAAAGAGTTTAAAGAGCAATTTGATAAAGCGCTCAAAGAACTCTATCAAGACGGAACCCTTGA     

     AAAACTCAGCAATACCTATCTAGGTGGTTCTTACCTCCCAGATCAATCTCAGTTACAA                

     SP006  amino  acid  (SEQ  ID  NO:6)                                       

     ENQATPKETSAQKTIVLATAGDVPPFDYEDKGNLTGFDIEVLKAVDEKLSDYEIQFQRTAWESIFPGLD     

     SGHYQAAANNLSYTKERAEKYLYSLPISNNPLVLVSNKKNPLTSLDQIAGKTTQEDTGTSNAQFINNWN     

     QKHTDNPATINFSGEDIGKRILDLANGEFDFLVFDKVSVQKIIKDRGLDLSVVDLPSADSPSNYIIFSS     

     DQKEFKEQFDKALKELYQDGTLEKLSNTYLGGSYLPDQSQLQ                                

     SP007  nucleotide  (SEQ  ID  NO:7)                                        

     TGGTAACCGCTCTTCTCGTAACGCAGCTTCATCTTCTGATGTGAAGACAAAAGCAGCAATCGTCACTGA     

     TACTGGTGGTGTTGATGACAAATCATTCAACCAATCAGCTTGGGAAGGTTTGCAGGCTTGGGGTAAAGA     

     ACACAATCTTTCAAAAGATAACGGTTTCACTTACTTCCAATCAACAAGTGAAGCTGACTACGCTAACAA     

     CTTGCAACAAGCGGCTGGAAGTTACAACCTAATCTTCGGTGTTGGTTTTGCCCTTAATAATGCAGTTAA     

     AGATGCAGCAAAAGAACACACTGACTTGAACTATGTCTTGATTGATGATGTGATTAAAGACCAAAAGAA     

     TGTTGCGAGCGTAACTTTCGCTGATAATGAGTCAGGTTACCTTGCAGGTGTGGCTGCAGCAAAAACAAC     

     TAAGACAAAACAAGTTGGTTTTGTAGGTGGTATCGAATCTGAAGTTATCTCTCGTTTTGAAGCAGGATT     

     CAAGGCTGGTGTTGCGTCAGTAGACCCATCTATCAAAGTCCAAGTTGACTACGCTGGTTCATTTGGTGA     

     TGCGGCTAAAGGTAAAACAATTGCAGCCGCACAATACGCAGCCGGTGCAGATATTGTTTACCAAGTAGC     

     TGGTGGTACAGGTGCAGGTGTCTTTGCAGAGGCAAAATCTCTCAACGAAAGCCGTCCTGAAAATGAAAA     

     AGTTTGGGTTATCGGTGTTGATCGTGACCAAGAAGCAGAAGGTAAATACACTTCTAAAGATGGCAAAGA     

     ATCAAACTTTGTTCTTGTATCTACTTTGAAACAAGTTGGTACAACTGTAAAAGATATTTCTAACAAGGC     

     AGAAAGAGGAGAATTCCCTGGCGGTCAAGTGATCGTTTACTCATTGAAGGATAAAGGGGTTGACTTGGC     

     AGTAACAAACCTTTCAGAAGAAGGTAAAAAAGCTGTCGAAGATGCAAAAGCTAAAATCCTTGATGGAAG     

     CGTAAAAGTTCCTGAAAAA                                                       

     SP007  amino  acid  (SEQ  ID  NO:8)                                       

     GNRSSRNAASSSDVKTKAAIVTDTGGVDDKSFNQSAWEGLQAWGKEHNLSKDNGFTYFQSTSEADYANN     

     LQQAAGSYNLIFGVGFALNNAVKDAAKEHTDLNYVLIDDVIKDQKNVASVTFADNESGYLAGVAAAKTT     

     KTKQVGFVGGIESEVISRFEAGFKAGVASVDPSIKVQVDYAGSFGDAAKGKTIAAAQYAAGADIVYQVA     

     GGTGAGVFAEAKSLNESRPENEKVWVIGVDRDQEAEGKYTSKDGKESNFVLVSTLKQVGTTVKDISNKA     

     ERGEFPGGQVIVYSLKDKGVDLAVTNLSEEGKKAVEDAKAKILDGSVKVPEK                      

     SP008  nucleotide  (SEQ  ID  NO:9)                                        

     TGTGGAAATTTGACAGGTAACAGCAAAAAAGCTGCTGATTCAGGTGACAAACCTCTTATCAAAATGTAC     

     CAAATCGGTGACAAACCAGACAACTTGGATGAATTGTTAGCAAATGCCAACAAAATCATTGAAGAAAAA     

     GTTGGTGCCAAATTGGATATCCAATACCTTGGCTGGGGTGACTATGGTAAGAAAATGTCAGTTATCACA     

     TCATCTGGTGAAAACTATGATATTGCCTTTGCAGATAACTATATTGTAAATGCTCAAAAAGGTGCTTAC     

     GCTGACTTGACAGAATTGTACAAAAAAGAAGGTAAAGACCTTTACAAAGCACTTGACCCAGCTTACATC     

     AAGGGTAATACTGTAAATGGTAAGATTTACGCTGTTCCAGTTGCAGCCAACGTTGCATCATCTCAAAAC     

     TTTGCCTTCAACGGAACTCTCCTTGCTAAATATGGTATCGATATTTCAGGTGTTACTTCTTACGAAACT     

     CTTGAGCCAGTCTTGAAACAAATCAAAGAAAAAGCTCCAGACGTAGTACCATTTGCTATTGGTAAAGTT     

     TTCATCCCATCTCATAATTTTGACTACCCAGTAGCAAACGGTCTTCCATTCGTTATCGACCTTGAAGGC     

     GATACTACTAAAGTTGTAAACCGTTACGAAGTGCCTCGTTTCAAAGAACACTTGAAGACTCTTCACAAA     

     TTCTATGAAGCTGGCTACATTCCAAAAGACGTCGCAACAAGCGATACTTCCTTTGACCTTCAACAAGAT     

     ACTTGGTTCGTTCGTGAAGAAACAGTAGGACCAGCTGACTACGGTAACAGCTTGCTTTCACGTGTTGCC     

     AACAAAGATATCCAAATCAAACCAATTACTAACTTCATCAAGNAAAACCAAACAACACAAGTTGCTAAC     

     TTTGTCATCTCAAACAACTCTAAGAACAAAGAAAAATCAATGGAAATCTTGAACCTCTTGAATACGAAC     

     CCAGAACTCTTGAACGGTCTTGTTTACGGTCCAGAAGGCAAGAACTGGGAAAAAATTGAAGGTAAAGAA     

     AACCGTGTTCGCGTTCTTGATGGCTACAAAGGAAACACTCACATGGGTGGATGGAACACTGGTAACAAC     

     TGGATCCTTTACATCAACGAAAACGTTACAGACCAACAAATCGAAAATTCTAAGAAAGAATTGGCAGAA     

     GCTAAAGAATCTCCAGCGCTTGGATTTATCTTCAATACTGACAATGTGAAATCTGAAATCTCAGCTATT     

     GCTAACACAATGCAACAATTTGATACAGCTATCAACACTGGTACTGTAGACCCAGATAAAGCGATTCCA     

     GAATTGATGGAAAAATTGAAATCTGAAGGTGCCTACGAAAAAGTATTGAACGAAATGCAAAAACAATAC     

     GATGAATTCTTGAAAAACAAAAAA                                                  

     SP008  amino  acid  (SEQ  ID  NO:10)                                      

     CGNLTGNSKKAADSGDKPVIKMYQIGDKPDNLDELLANANKIIEEKVGAKLDIQYLGWGDYGKKMSVIT     

     SSGENYDIAFADNYIVNAQKGAYADLTELYKKEGKDLYKALDPAYIKGNTVNGKIYAVPVAANVASSQN     

     FAFNGTLLAKYGIDISGVTSYETLEPVLKQIKEKAPDVVPFAIGKVFIPSDNFDYPVANGLPFVIDLEG     

     DTTKVVNRYEVPRFKEHLKTLHKFYEAGYIPKDVATSDTSFDLQQDTWFVREETVGPADYGNSLLSRVA     

     NKDIQIKPITNFIKXNQTTQVANFVISNNSKNKEKSMEILNLLNTNPELLNGLVYGPEGKNWEKIEGKE     

     NRVRVLDGYKGNTHMGGWNTGNNWILYINENVTDQQIENSKKELAEAKESPALGFIFNTDNVKSEISAI     

     ANTMQQFDTAINTGTVDPDKAIPELMEKLKSEGAYEKVLNEMQKQYDEFLKNKK                    

     SP009  nucleotide  (SEQ  ID  NO:11)                                       

     TGGTCAAGGAACTGCTTCTAAAGACAACAAAGAGGCAGAACTTAAGAAGGTTGACTTTATCCTAGACTG     

     GACACCAAATACCAACCACACAGGGCTTTATGTTGCCAAGGAAAAAGGTTATTTCAAAGAAGCTGGAGT     

     GGATGTTGATTTGAAATTGCCACCAGAAGAAAGTTCTTCTGACTTGGTTATCAACGGAAAGGCACCATT     

     TGCAGTCTATTTCCAAGACTACATGGCTAAGAAATTGGAAAAAGGAGCAGGAATCACTGCCGTTGCAGC     

     TATTGTTGAACACAATACATCAGGAATCATCTCTCGTAAATCTGATAATGTAAGCAGTCCAAAAGACTT     

     GGTTGGTAAGAAATATGGGACATGGAATGACCCAACTGAACTTGCTATGTTGAAAACCTTGGTAGAATC     

     TCAAGGTGGAGACTTTGAGAAGGTTGAAAAAGTACCAAATAACGACTCAAACTCAATCACACCGATTGC     

     CAATGGCGTCTTTGATACTGCTTGGATTTACTACGGTTGGGATGGTATCCTTGCTAAATCTCAAGGTGT     

     AGATGCTAACTTCATGTACTTGAAAGACTATGTCAAGGAGTTTGACTACTATTCACCAGTTATCATCGC     

     AAACAACGACTATCTGAAAGATAACAAAGAAGAAGCTCGCAAAGTCATCCAAGCCATCAAAAAAGGCTA     

     CCAATATGCCATGGAACATCCAGAAGAAGCTGCAGATATTCTCATCAAGAATGCACCTGAACTCAAGGA     

     AAAACGTGACTTTGTCATCGAATCTCAAAAATACTTGTCAAAAGAATACGCAAGCGACAAGGAAAAATG     

     GGGTCAATTTGACGCAGCTCGCTGGAATGCTTTCTACAAATGGGATAAAGAAAATGGTATCCTTAAAGA     

     AGACTTGACAGACAAAGGCTTCACCAACGAATTTGTGAAA                                  

     SP009  amino  acid  (SEQ  ID  NO:12)                                      

     GQGTASKDNKEAELKKVDFILDWTPNTNHTGLYVAKEKGYFKEAGVDVDLKLPPEESSSDLVINGKAPF     

     AVYFQDYMAKKLEKGAGITAVAAIVEHNTSGIISRKSDNVSSPKDLVGKKYGTWNDPTELAMLKTLVES     

     QGGDFEKVEKVPNNDSNSITPIANGVFDTAWIYYGWDGILAKSQGVDANFMYLKDYVKEFDYYSPVIIA     

     NNDYLKDNKEEARKVIQAIKKGYQYAMEHPEEAADILIKNAPELKEKRDFVIESQKYLSKEYASDKEKW     

     GQFDAARWNAFYKWDKENGILKEDLTDKGFTNEFVK                                      

     SP010  nucleotide  (SEQ  ID  NO:13)                                       

     TAGCTCAGGTGGAAACGCTGGTTCATCCTCTGGAAAAACAACTGCCAAAGCTCGCACTATCGATGAAAT     

     CAAAAAAAGCGGTGAACTGCGAATCGCCGTGTTTGGAGATAAAAAACCGTTTGGCTACGTTGACAATGA     

     TGGTTCTACCAAGGTACGCTACGATATTGAACTAGGGAACCAACTAGCTCAAGACCTTGGTGTCAAGGT     

     TAAATACATTTCAGTCGATGCTGCCAACCGTGCGGAATACTTGATTTCAAACAAGGTAGATATTACTCT     

     TGCTAACTTTACAGTAACTGACGAACGTAAGAAACAAGTTGATTTTGCCCTTCCATATATGAAAGTTTC     

     TCTGGGTGTCGTATCACCTAAGACTGGTCTCATTACAGACGTCAAACAACTTGAAGGTAAAACCTTAAT     

     TGTCACAAAAGGAACGACTGCTGAGACTTATTTTGAAAAGAATCATCCAGAAATCAAACTCCAAAAATA     

     CGACCAATACAGTGACTCTTACCAAGCTCTTCTTGACGGACGTGGAGATGCCTTTTCAACTGACAATAC     

     GGAAGTTCTAGCTTGGGCGCTTGAAAATAAAGGATTTGAAGTAGGAATTACTTCCCTCGGTGATCCCGA     

     TACCATTGCGGCAGCAGTTCAAAAAGGCAACCAAGAATTGCTAGACTTCATCAATAAAGATATTGAAAA     

     ATTAGGCAAGGAAAACTTCTTCCACAAGGCCTATGAAAAGACACTTCACCCAACCTACGGTGACGCTGC     

     TAAAGCAGATGACCTGGTTGTTGAAGGTGGAAAAGTTGAT                                  

     SP010  amino  acid  (SEQ  ID  NO:14)                                      

     SSGGNAGSSSGKTTAKARTIDEIKKSGELRIAVFGDKKPFGYVDNDGSTKVRYDIELGNQLAQDLGVKV     

     KYISVDAANRAEYLISNKVDITLANFTVTDERKKQVDFALPYMKVSLGVVSPKTGLITDVKQLEGKTLI     

     VTKGTTAETYFEKNHPEIKLQKYDQYSDSYQALLDGRGDAFSTDNTEVLAWALENKGFEVGITSLGDPD     

     TIAAAVQKGNQELLDFINKDIEKLGKENFFHKAYEKTLHPTYGDAAKADDLVVEGGKVD               

     SP011  nucleotide  (SEQ  ID  NO:15)                                       

     CTCCAACTATGGTAAATCTGCGGATGGCACAGTGACCATCGAGTATTTCAACCAGAAAAAAGAAATGAC     

     CAAAACCTTGGAAGAAATCACTCGTGATTTTGAGAAGGAAAACCCTAAGATCAAGGTCAAAGTCGTCAA     

     TGTACCAAATGCTGGTGAAGTATTGAAGACACGCGTTCTCGCAGGAGATGTGCCTGATGTGGTCAATAT     

     TTACCCACAGTCCATCGAACTGCAAGAATGGGCAAAAGCAGGTGTTTTTGAAGATTTGAGCAACAAAGA     

     CTACCTGAAACGCGTGAAAAATGGCTACGCTGAAAAATATGCTGTAAACGAAAAAGTTTACAACGTTCC     

     TTTTACAGCTAATGCTTATGGAATTTACTACAACAAAGATAAATTCGAAGAACTGGGCTTGAAGGTTCC     

     TGAAACCTGGGATGAATTTGAACAGTTAGTCAAAGATATCGTTGCTAAAGGACAAACACCATTTGGAAT     

     TGCAGGTGCAGATGCTTGGACACTCAATGGTTACAATCAATTAGCCTTTGCGACAGCAACAGGTGGAGG     

     AAAAGAAGCAAATCAATACCTTCGTTATTCTCAACCAAATGCCATTAAATTGTCGGATCCGATTATGAA     

     AGATGATATCAAGGTCATGGACATCCTTCGCATCAATGGATCTAAGCAAAAGAACTGGGAAGGTGCTGG     

     CTATACCGATGTTATCGGAGCCTTCGCACGTGGGGATGTCCTCATGACACCAAATGGGTCTTGGGCGAT     

     CACAGCGATTAATGAACAAAAACCGAACTTTAAGATTGGGACCTTCATGATTCCAGGAAAAGAAAAAGG     

     ACAAAGCTTAACCGTTGGTGCGGGAGACTTGGCATGGTCTATCTCAGCCACCACCAAACATCCAAAAGA     

     AGCCAATGCCTTTGTGGAATATATGACCCGTCCAGAAGTCATGCAAAAATACTACGATGTGGACGGATC     

     TCCAACAGCGATCGAAGGGGTCAAACAAGCAGGAGAAGATTCACCGCTTGCTGGTATGACCGAATATGC     

     CTTTACGGATCGTCACTTGGTCTGGTTGCAACAATACTGGACCAGTGAAGCAGACTTCCATACCTTGAC     

     CATGAACTATGTCTTGACCGGTGATAAACAAGGCATGGTCAATGATTTGAATGCCTTCTTTAACCCGAT     

     GAAAGCGGATGTGGAT                                                          

     SP011  amino  acid  (SEQ  ID  NO:16)                                      

     SNYGKSADGTVTIEYFNQKKEMTKTLEEITRDFEKENPKIKVKVVNVPNAGEVLKTRVLAGDVPDVVNI     

     YPQSIELQEWAKAGVFEDLSNKDYLKRVKNGYAEKYAVNEKVYVVPFTANAYGIYYNKDKFEELGLKVP     

     ETWDEFEQLVKDIVAKGQTPFGIAGADAWTLNGYNQLAFATATGGGKEANQYLRYSQPNAIKLSDPIMK     

     DDIKVMDILRINGSKQKNWEGAGYTDVIGAFARGDVLMTPNGSWAITAINEQKPNFKIGTFMIPGKEKG     

     QSLTVGAGDLAWSISATTKHPKEANAFVEYMTRPEVMQKYYDVDGSPTAIEGVKQAGEDSPLAGMTEYA     

     FTDRHLVWLQQYWTSEADFHTLTMYYVLTGDKQGMVNDLNAFFNPMKADVD                       

     SP012  nucleotide  (SEQ  ID  NO:17)                                       

     TGGGAAAAATTCTAGCGAAACTAGTGGAGATAATTGGTCAAAGTACCAGTCTAACAAGTCTATTACTAT     

     TGGATTTGATAGTACTTTTGTTCCAATGGGATTTGCTCAGAAAGATGGTTCTTATGCAGGATTTGATAT     

     TGATTTAGCTACAGCTGTTTTTGAAAAATACGGAATCACGGTAAATTGGCAACCGATTGATTGGGATTT     

     GAAAGAAGCTGAATTGACAAAAGGAACGATTGATCTGATTTGGAATGGCTATTCCGCTACAGACGAACG     

     CCGTGAAAAGGTGGCTTTCAGTAACTCATATATGAAGAATGAGCAGGTATTGGTTACGAAGAAATCATC     

     TGGTATCACGACTGCAAAGGATATGACTGGAAAGACATTAGGAGCTCAAGCTGGTTCATCTGGTTATGC     

     GGACTTTGAAGCAAATCCAGAAATTTTGAAGAATATTGTCGCTAATAAGGAAGCGAATCAATACCAAAC     

     CTTTAATGAAGCCTTGATTGATTTGAAAAACGATCGAATTGATGGTCTATTGATTGACCGTGTCTATGC     

     AAACTATTATTTAGAAGCAGAAGGTGTTTTAAACGATTATAATGTCTTTACAGTTGGACTAGAAACAGA     

     AGCTTTTGCGGTTGGAGCCCGTAAGGAAGATACAAACTTGGTTAAGAAGATAAATGAAGCTTTTTCTAG     

     TCTTTACAAGGACGGCAAGTTCCAAGAAATCAGCCAAAAATGGTTTGGAGAAGATGTAGCAACCAAAGA     

     AGTAAAAGAAGGACAG                                                          

     SP012  nucleotide  (SEQ  ID  NO:18)                                       

     GKNSSETSGDNWSKYQSNKSITIGFDSTFVPMGFAQKDGSYAGFDIDLATAVFEKYGITVNWQPIDWDL     

     KEAELTKGTIDLIWNGYSATDERREKVAFSNSYMKNEQVLVTKKSSGITTAKDMTGKTLGAQAGSSGYA     

     DFEANPEILKNIVANKEANQYQTFNEALIDLKNDRIDGLLIDRVYANYYLEAEGVLNDYNVFTVGLETE     

     AFAVGARKEDTNLVKKINEAFSSLYKDGKFQEISQKWFGEDVATKEVKEGQ                       

     SP013  nucleotide  (SEQ  ID  NO:19)                                       

     TGCTAGCGGAAAAAAAGATACAACTTCTGGTCAAAAACTAAAAGTTGTTGCTACAAACTCAATCATCGC     

     TGATATTACTAAAAATATTGCTGGTGACAAAATTGACCTTCATAGTATCGTTCCGATTGGGCAAGACCC     

     ACACGAATACGAACCACTTCCTGAAGACGTTAAGAAAACTTCTGAGGCTAATTTGATTTTCTATAACGG     

     TATCAACCTTGAAACAGGTGGCAATGCTTGGTTTACAAAATTGGTAGAAAATGCCAAGAAAACTGAAAA     

     CAAAGACTACTTCGCAGTCAGCGACGGCGTTGATGTTATCTACCTTGAAGGTCAAAATGAAAAAGGAAA     

     AGAAGACCCACACGCTTGGCTTAACCTTGAAAACGGTATTATTTTTGCTAAAAATATCGCCAAACAATT     

     GAGCGCCAAAGACCCTAACAATAAAGAATTCTATGAAAAAAATCTCAAAGAATATACTGATAAGTTAGA     

     CAAACTTGATAAAGAAAGTAAGGATAAATTTAATAAGATCCCTGCTGAAAAGAAACTCATTGTAACCAG     

     CGAAGGAGCATTCAAATACTTCTCTAAAGCCTATGGTGTCCCAAGTGCTTACATCTGGGAAATCAATAC     

     TGAAGAAGAAGGAACTCCTGAACAAATCAAGACCTTGGTTGAAAAACTTCGCCAAACAAAAGTTCCATC     

     ACTCTTTGTAGAATCAAGTGTGGATGACCGTCCAATGAAAACTGTTTCTCAAGACACAAACATCCCAAT     

     CTACGCTCAAATCTTTACTGACTCTATCGCAGAACAAGGTAAAGAAGGCGACAGCTACTACAGCATGAT     

     GAAATACAACCTTGACAAGATTGCTGAAGGATTGGCAAAA                                  

     SP013  amino  acid  (SEQ  ID  NO:20)                                      

     ASGKKDTTSGQKLKVVATNSIIADITKNIAGDKIDLHSIVPIGQDPHEYEPLPEDVKKTSEANLIFYNG     

     INLETGGNAWFTKLVENAKKTENKDYFAVSDGVDVIYLEGQNEKGKEDPHAWLNLENGIIFAKNIAKQL     

     SAKDPNNKEFYEKNLKEYTDKLDKLDKESKDKFNKIPAEKKLIVTSEGAFKYFSKAYGVPSAYIWEINT     

     EEEGTPEQIKTLVEKLRQTKVPSLFVESSVDDRPMKTVSQDTNIPIYAQIFTDSIAEQGKEGDSYYSMM     

     KYNLDKIAEGLAK                                                             

     SP014  nucleotide  (SEQ  ID  NO:21)                                       

     TGGCTCAAAAAATACAGCTTCAAGTCCAGATTATAAGTTGGAAGGTGTAACATTCCCGCTTCAAGAAAA     

     GAAAACATTGAAGTTTATGACAGCCAGTTCACCGTTATCTCCTAAAGACCCAAATGAAAAGTTAATTTT     

     GCAACGTTTGGAGAAGGAAACTGGCGTTCATATTGACTGGACCAACTACCAATCCGACTTTGCAGAAAA     

     ACGTAACTTGGATATTTCTAGTGGTGATTTACCAGATGCTATCCACAACGACGGAGCTTCAGATGTGGA     

     CTTGATGAACTGGGCTAAAAAAGGTGTTATTATTCCAGTTGAAGATTTGATTGATAAATACATGCCAAA     

     TCTTAAGAAAATTTTGGATGAGAAACCAGAGTACAAGGCCTTGATGACAGCACCTGATGGGCACATTTA     

     CTCATTTCCATGGATTGAAGAGCTTGGAGATGGTAAAGAGTCTATTCACAGTGTCAACGATATGGCTTG     

     GATTAACAAAGATTGGCTTAAGAAACTTGGTCTTGAAATGCCAAAAACTACTGATGAITTGATTAAAGT     

     CCTAGAAGCTTTCAAAAACGGGGATCCAAATGGAAATGGAGAGGCTGATGAAATTCCATTTTCATTTAT     

     TAGTGGTAACGGAAACGAAGATTTTAAATTCCTATTTGCTGCATTTGGTATAGGGGATAACGATGATCA     

     TTTAGTAGTAGGAAATGATGGCAAAGTTGACTTCACAGCAGATAACGATAACTATAAAGAAGGTGTCAA     

     ATTTATCCGTCAATTGCAAGAAAAAGGCCTGATTGATAAAGAAGCTTTCGAACATGATTGGAATAGTTA     

     CATTGCTAAAGGTCATGATCAGAAATTTGGTGTTTACTTTACATGGGATAAGAATAATGTTACTGGAAG     

     TAACGAAAGTTATGATGTTTTACCAGTACTTGCTGGACCAAGTGGTCAAAAACACGTAGCTCGTACAAA     

     CGGTATGGGATTTGCACGTGACAAGATGGTTATTACCAGTGTAAACAAAAACCTAGAATTGACAGCTAA     

     ATGGATTGATGCACAATACGCTCCACTCCAATCTGTGCAAAATAACTGGGGAACTTACGGAGATGACAA     

     ACAACAAAACATCTTTGAATTGGATCAAGCGTCAAATAGTCTAAAACACTTACCACTAAACGGAACTGC     

     ACCAGCAGAACTTCGTCAAAAGACTGAAGTAGGAGGACCACTAGCTATCCTAGATTCATACTATGGTAA     

     AGTAACAACCATGCCTGATGATGCCAAATGGCGTTTGGATCTTATCAAAGAATATTATGTTCCTTACAT     

     GAGCAATGTCAATAACTATCCAAGAGTCTTTATGACACAGGAAGATTTGGACAAGATTGCCCATATCGA     

     AGCAGATATGAATGACTATATCTACCGTAAACGTGCTGAATGGATTGTAAATGGCAATATTGATACTGA     

     GTGGGATGATTACAAGAAAGAACTTGAAAAATACGGACTTTCTGATTACCTCGCTATTAAACAAAAATA     

     CTACGACCAATACCAAGCAAACAAAAAC                                              

     SP014  amino  acid  (SEQ  ID  NO:22)                                      

     GSKNTASSPDYKLEGVTFPLQEKKTLKFMTASSPLSPKDPNEKLILQRLEKETGVHIDWTNYQSDFAEK     

     RNLDISSGDLPDAIHNDGASDVDLMNWAKKGVIIPVEDLIDKYMPNLKKILDEKPEYKALMTAPDGHIY     

     SFPWIEELGDGKESIHSVNDMAWINKDWLKKLGLEMPKTTDDLIKVLEAFKNGDPNGNGEADEIPFSFI     

     SGNGNEDFKFLFAAFGIGDNDDHLVVGNDGKVDFTADNDNYKEGVKFIRQLQEKGLIDKEAFEHDWNSY     

     IAKGHDQKFGVYFTWDKNNVTGSNESYDVLPVLAGPSGQKHVARTNGMGFARDKMVITSVNKNLELTAK     

     WIDAQYAPLQSVQNNWGTYGDDKQQNIFELDQASNSLKHLPLNGTAPAELRQKTEVGGPLAILDSYYGK     

     VTTMPDDAKWRLDLIKEYYVPYMSNNNNYPRVFMTQEDLDKIAHIEADMMDYIYRKPAEWIVNGNIDTE     

     WDDYKKELEKYGLSDYLAIKQKYYDQYQANKN                                          

     SP015  nucleotide  (SEQ  ID  NO:23)                                       

     TAGTACAAACTCAAGCACTAGTCAGACAGAGACCAGTAGCTCTGCTCCAACAGAGGTAACCATTAAAAG     

     TTCACTGGACGAGGTCAAACTTTCCAAAGTTCCTGAAAAGATTGTGACCTTTGACCTCGGCGCTGCGGA     

     TACTATTCGCGCTTTAGGATTTGAAAAAAATATCGTCGGAATGCCTACAAAAACTGTTCCGACTTATCT     

     AAAAGACCTAGTGGGAACTGTCAAAAATGTTGGTTCTATGAAAGAACCTGATTTAGAAGCTATCGCCGC     

     CCTTGAGCCTGATTTGATTATCGCTTCGCCACGTACACAAAAATTCGTAGACAAATTCAAAGAAATCGC     

     CCCAACCGTTCTCTTCCAAGCAAGCAAGGACGACTACTGGACTTCTACCAAGGCTAATATCGAATCCTT     

     AGCAAGTGCCTTCGGCGAAACTGGTACACAGAAAGCCAAGGAAGAATTGACCAAGCTAGACAAGAGCAT     

     CCAAGAAGTCGCTACTAAAAATGAAAGCTCTGACAAAAAAGCCCTTGCGATCCTCCTTAATGAAGGAAA     

     AATGGCAGCCTTTGGTGCCAAATCTCGTTTCTCTTTCTTGTACCAAACCTTGAAATTCAAACCAACTGA     

     TACAAAATTTGAAGACTCACGCCACGGACAAGAAGTCAGCTTTGAAAGTGTCAAAGAAATCAACCCTGA     

     CATCCTCTTTGTCATCAACCGTACCCTTGCCATCGGTGGGGACAACTCTAGCAACGACGGTGTCCTAGA     

     AAATGCCCTTATCGCTGAAACACCTGCTGCTAAAAATGGTAAGATTATCCAACTAACACCAGACCTCTG     

     GTATCTAAGCGGAGGCGGACTTGAATCAACAAAACTCATGATTGAAGACATACAAAAAGCTTTGAAA       

     SP015  amino  acid  (SEQ  ID  NO:24)                                      

     STNSSTSQTETSSSAPTEVTIKSSLDEVKLSKVPEKIVTFDLGAADTIPALGFEKNIVGMPTKTVPTYL     

     KDLVGTVKNVGSMKEPDLEAIAALEPDLIIASPRTQKFVDKFKEIAPTVLFQASKDDYWTSTKANIESL     

     ASAFGETGTQKAKEELTKLDKSIQEVATKNESSDKKALAILLNEGKNAAFGAKSRFSFLYQTLKFKPTD     

     TKFEDSRHGQEVSFESVKEINPDILFVINRTLAIGGDNSSNDGVLENALIAETPAAKNGKIIQLTPDLW     

     YLSGGGLESTKLMIEDIQKALK                                                    

     SP016  nucleotide  (SEQ  ID  NO:25)                                       

     TGGCAATTCTGGCGGAAGTAAAGATGCTGCCAAATCAGGTGGTGACGGTGCCAAAACAGAAATCACTTG     

     GTGGGCATTCCCAGTATTTACCCAAGAAAAAACTGGTGACGGTGTTGGAACTTATGAAAAATCAATCAT     

     CGAAGCGTTTGAAAAAGCAAACCCAGATATAAAAGTGAAATTGGAAACCATCGACTTCAAGTCAGGTCC     

     TGAAAAAATCACAACAGCCATCGAAGCAGGAACAGCTCCAGACGTACTCTTTGATGCACCAGGACGTAT     

     CATCCAATACGGTAAAAACGGTAAATTGGCTGAGTTGAATGACCTCTTCACAGATGAATTTGTTAAAGA     

     TGTCAACAATGAAAACATCGTACAAGCAAGTAAAGCTGGAGACAAGGCTTATATGTATCCGATTAGTTC     

     TGCCCCATTCTACATGGCAATGAACAAGAAAATGTTAGAAGATGCTGGAGTAGCAAACCTTGTAAAAGA     

     AGGTTGGACAACTGATGATTTTGAAAAAGTATTGAAAGCACTTAAAGACAAGGGTTACACACCAGGTTC     

     ATTGTTCAGTTCTGGTCAAGGGGGAGACCAAGGAACACGTGCCTTTATCTCTAACCTTTATAGCGGTTC     

     TGTAACAGATGAAAAAGTTAGCAAATATACAACTGATGATCCTAAATTCGTCAAAGGTCTTGAAAAAGC     

     AACTAGCTGGATTAAAGACAATTTGATCAATAATGGTTCACAATTTGACGGTGGGGCAGATATCCAAAA     

     CTTTGCCAACGGTCAAACATCTTACACAATCCTTTGGGCACCAGCTCAAAATGGTATCCAAGCTAAACT     

     TTTAGAAGCAAGTAAGGTAGAAGTGGTAGAAGTACCATTCCCATCAGACGAAGGTAAGCCAGCTCTTGA     

     GTACCTTGTAAACGGGTTTGCAGTATTCAACAATAAAGACGACAAGAAAGTCGCTGCATCTAAGAAATT     

     CATCCAGTTTATCGCAGATGACAAGGAGTGGGGACCTAAAGACGTAGTTCGTACAGGTGCTTTCCCAGT     

     CCGTACTTCATTTGGAAAACTTTATGAAGACAAACGCATGGAAACAATCAGCGGCTGGACTCAATACTA     

     CTCACCATACTACAACACTATTGATGGATTTGCTGAAATGAGAACACTTTGGTTCCCAATGTTGCAATC     

     TGTATCAAATGGTGACGAAAAACCAGCAGATGCTTTGAAAGCCTTCACTGAAAAAGCGAACGAAACAAT     

     CAAAAAAGCTATGAAACAA                                                       

     SP016  amino  acid  (SEQ  ID  NO:26)                                      

     GNSGGSKDAAKSGGDGAKTEITWWAFPVFTQEKTGDGVGTYEKSIIEAFEKANPDIKVKLETIDFKSGP     

     EKITTAIEAGTAPDVLFDAPGRIIQYGKNGKLAELNDLFTDEFVKDVNNENIVQASKAGDKAYMYPISS     

     APFYMAMNKKMLEDAGVANLVKEGWTTDDFEKVLKALKDKGYTPGSLFSSGQGGDQGTRAFISNLYSGS     

     VTDEKVSKYTTDDPKFVKGLEKATSWIKDNLINNGSQFDGGADIQNFANGQTSYTILWAPAQNGIQAKL     

     LEASKVEVVEVPFPSDEGKPALEYLVNGFAVFNNKDDKKVAASKKFIQFIADDKEWGPKDVVRTGAFPV     

     RTSFGKLYEDKRMETISGWTQYYSPYYNTIDGFAEMRTLWFPMLQSVSNGDEKPADALKAFTEKANETI     

     KKAMKQ                                                                    

     SP017  nucleotide  (SEQ  ID  NO:27)                                       

     TTCACAAGAAAAAACAAAAAATGAAGATGGAGAAACTAAGACAGAACAGACAGCCAAAGCTGATGGAAC     

     AGTCGGTAGTAAGTCTCAAGGAGCTGCCCAGAAGAAAGCAGAAGTGGTCAATAAAGGTGATTACTACAG     

     CATTCAAGGGAAATACGATGAAATCATCGTAGCCAACAAACACTATCCATTGTCTAAAGACTATAATCC     

     AGGGGAAAATCCAACAGCCAAGGCAGAGTTGGTCAAACTCATCAAAGCGATGCAAGAGGCAGGTTTCCC     

     TATTAGTGATCATTACAGTGGTTTTAGAAGTTATGAAACTCAGACCAAGCTCTATCAAGATTATGTCAA     

     CCAAGATGGAAAGGCAGCAGCTGACCGTTACTCTGCCCGTCCTGGCTATAGCGAACACCAGACAGGCTT     

     GGCCTTTGATGTGATTGGGACTGATGGTGATTTGGTGACAGAAGAAAAAGCAGCCCAATGGCTCTTGGA     

     TCATGCAGCTGATTATGGCTTTGTTGTCCGTTATCTCAAAGGCAAGGAAAAGGAAACAGGCTATATGGC     

     TGAAGAATGGCACCTGCGTTATGTAGGAAAAGAAGCTAAAGAAATTGCTGCAAGTGGTCTCAGTTTGGA     

     AGAATACTATGGCTTTGAAGGCGGAGACTACGTCGAT                                     

     SP017  amino  acid  (SEQ  ID  NO:28)                                      

     SQEKTKNEDGETKTEQTAKADGTVGSKSQGAAQKKAEVVNKGDYYSIQGKYDEIIVANKHYPLSKDYNP     

     GENPTAKAELVKLIKAMQEAGFPISDHYSGFRSYETQTKLYQDYVNQDGKAAADRYSARPGYSEHQTGL     

     AFDVIGTDGDLVTEEKAAQWLLDHAADYGFVVRYLKGKEKETGYMAEEWHLRYVGKEAKEIAASGLSLE     

     EYYGFEGGDYVD                                                              

     SP019  nucleotide  (SEQ  ID  NO:29)                                       

     GAAAGGTCTGTGGTCAAATAATCTTACCTGCGGTTATGATGAAAAAATAATCTTGGAAAATATAAATAT     

     AAAAATACCTGAAGAAAAAATATCAGTTATTATTGGGTCAAATGGTTGTGGGAAATCAACACTCATTAA     

     AACCTTGTCTCGACTTATAAAGCCATTAGAGGGAGAAGTATTGCTTGATAATAAATCAATTAATTCTTA     

     TAAAGAAAAAGATTTAGCAAAACACATAGCTATATTACCTCAATCTCCAATAATCCCTGAATCAATAAC     

     AGTAGCTGATCTTGTAAGCCGTGGTCGTTTCCCCTACAGAAAGCCTTTTAAGAGTCTTGGAAAAGATGA     

     CCTTGAAATAATAAACAGATCAATGGTTAAGGCCAATGTTGAAGATCTAGCAAATAACCTAGTTGAAGA     

     ACTTTCTGGGGGTCAAAGGCAAAGAGTATGGATAGCTCTAGCCCTAGCCCAAGATACAAGTATCCTACT     

     TTTAGATGAGCCAACTACTTACTTGGATATCTCATATCAAATAGAACTATTAGACCTCTTGACTGATCT     

     AAACCAAAAATATAAGACAACCATTTGCATGATTTTGCACGATATAAATCTAACAGCAAGATACGCTGA     

     TTACCTATTTGCAATTAAAGAAGGTAAACTTGTTGCAGAGGGAAAGCCTGAAGATATACTAAATGATAA     

     ACTAGTTAAAGATATCTTTAATCTTGAAGCAAAAATTATACGTGACCCTATTTCCAATTCGCCTCTAAT     

     GATTCCTATTGGCAAGCACCATGTTAACTCT                                           

     SP019  amino  acid  (SEQ  ID  NO:30)                                      

     KGLWSNNLTCGYDEKIILENINIKIPEEKISVIIGSNGCGKSTLIKTLSRLIKPLEGEVLLDNKSINSY     

     KEKDLAKHIAILPQSPIIPESITVADLVSRGRFPYRKPFKSLGKDDLEIINRSMVKANVEDLANNLVEE     

     LSGGQRQRVWIALALAQDTSILLLDEPTTYLDISYQIELLDLLTDLNQKYKTTICMILHDINLTARYAD     

     YLFAIKEGKLVAEGKPEDILNDKLVKDIFNLEAKIIRDPISNSPLMIPTGKHHVS                   

     SP020  nucleotide  (SEQ  ID  NO:31)                                       

     AAACTCAGAAAAGAAAGCAGACAATGCAACAACTATCAAAATCGCAACTGTTAACCGTAGCGGTTCTGA     

     AGAAAAACGTTGGGACAAAATCCAAGAATTGGTTAAAAAAGACGGAATTACCTTGGAATTTACAGAGTT     

     CACAGACTACTCACAACCAAACAAAGCAACTGCTGATGGCGAAGTAGATTTGAACGCTTTCCAACACTA     

     TAACTTCTTGAACAACTGGAACAAAGAAAACGGAAAAGACCTTGTAGCGATTGCAGATACTTACATCTC     

     TCCAATCCGCCTTTACTCAGGTTTGAATGGAAGTGCCAACAAGTACACTAAAGTAGAAGACATCCCAGC     

     AAACGGAGAAATCGCTGTACCGAATGACGCTACAAACGAAAGCCGTGCGCTTTATTTGCTTCAATCAGC     

     TGGCTTGATTAAATTGGATGTTTCTGGAACTGCTCTTGCAACAGTTGCCAACATCAAAGAAAATCCAAA     

     GAACTTGAAAATCACTGAATTGGACGCTAGCCAAACAGCTCGTTCATTGTCATCAGTTGACGCTGCCGT     

     TGTAAACAATACCTTCGTTACAGAAGCAAAATTGGACTACAAGAAATCACTTTTCAAAGAACAAGCTGA     

     TGAAAACTCAAAACAATGGTACAACATCATTGTTGCAAAAAAAGATTGGGAAACATCACCTAAGGCTGA     

     TGCTATCAAGAAAGTAATCGCAGCTTACCACACAGATGACGTGAAAAAAGTTATCGAAGAATCATCAGA     

     TGGTTTGGATCAACCAGTTTGG                                                    

     SP020  amino  acid  (SEQ  ID  NO:32)                                      

     NSEKKADNATTIKIATVNRSGSEEKRWDKIQELVKKDGITLEFTEFTDYSQPNKATADGEVDLNAFQHY     

     NFLNNWNKENGKDLVAIADTYISPIRLYSGLNGSANKYTKVEDIPANGEIAVPNDATNESRALYLLQSA     

     GLIKLDVSGTALATVANIKENPKNLKITELDASQTARSLSSVDAAVVNNTFVTEAKLDYKKSLFKEQAD     

     ENSKQWYNIIVAKKDWETSPKADAIKKVIAAYHTDDVKKVIEESSDGLDQPVW                     

     SP021  nucleotide  (SEQ  ID  NO:33)                                       

     TTCGAAAGGGTCAGAAGGTGCAGACCTTATCAGCATGAAAGGGGATGTCATTACAGAACATCAATTTTA     

     TGAGCAAGTGAAAAGCAACCCTTCAGCCCAACAAGTCTTGTTAAATATGACCATCCAAAAAGTTTTTGA     

     AAAACAATATGGCTCAGAGCTTGATGATAAAGAGGTTGATGATACTATTGCCGAAGAAAAAAAACAATA     

     TGGCGAAAACTACCAACGTGTCTTGTCACAAGCAGGTATGACTCTTGAAACACGTAAAGCTCAAATTCG     

     TACAAGTAAATTAGTTGAGTTGGCAGTTAAGAAGGTAGCAGAAGCTGAATTGACAGATGAAGCCTATAA     

     GAAAGCCTTTGATGAGTACACTCCAGATGTAACGGCTCAAATCATCCGTCTTAATAATGAAGATAAGGC     

     CAAAGAAGTTCTCGAAAAAGCCAAGGCAGAAGGTGCTGATTTTGCTCAATTAGCCAAAGATAATTCAAC     

     TGATGAAAAAACAAAAGAAAATGGTGGAGAAATTACCTTTGATTCTGCTTCAACAGAAGTACCTGGAGC     

     AAGTCCAAAAAAGCCGCTTTTCGCTTTTAGATGTGGGATGGTGTTTCTGGATGTGGATTACAGCAACTG     

     GGGCACACCAAGCCTACAG                                                       

     SP021  amino  acid  (SEQ  ID  NO:34)                                      

     SKGSEGADLISMKGDVITEHQFYEQVKSNPSAQQVLLNMTIQKVFEKQYGSELDDKEVDDTIAEEKKQY     

     GENYQRVLSQAGMTLETRKAQIRTSKLVELAVKKVAEAELTDEAYKKAFDEYTPDVTAQIIRLNNEDKA     

     KEVLEKAKAEGADFAQLAKDNSTDEKTKENGGEITFDSASTEVPGASPKKPLFAFRCGMVFLDVDYSNW     

     GTPSLQ                                                                    

     SP022  nucleotide  (SEQ  ID  NO:35)                                       

     GGGGATGGCAGCTTTTAAAAATCCTAACAATCAATACAAAGCTATTACAATTGCTCAAACTCTAGGTGA     

     TGATGCTTCTTCAGAGGAATTGGCTGGTAGATATGGTTCTGCTGTTCAGTGTACAGAAGTGACTGCCTC     

     AAACCTTTCAACAGTTAAAACTAAAGCTACGGTTGTAGAAAAACCACTGAAAGATTTTAGAGCGTCTAC     

     GTCTGATCAGTCTGGTTGGGTGGAATCTAATGGTAAATGGTATTTCTATGAGTCTGGTGATGTGAAGAC     

     AGGTTGGGTGAAAACAGATGGTAAATGGTACTATTTGAATGACTTAGGTGTCATGCAGACTGGATTTGT     

     AAAATTTTCTGGTAGCTGGTATTACTTGAGCAATTCAGGTGCTATGTTTACAGGCTGGGGAACAGATGG     

     TAGCAGATGGTTCTACTTTGACGGCTCAGGAGCTATGAAGACAGGCTGGTACAAGGAAAATGGCACTTG     

     GTATTACCTTGACGAAGCAGGTATCATGAAGACAGGTTGGTTTAAAGTCGGACCACACTGGTACTATGC     

     CTACGGTTCAGGAGCTTTGGCTGTGAGCACAACAACACCAGATGGTTACCGTGTAAATGGTAATGGTGA     

     ATGGGTAAAC                                                                

     SP022  amino  acid  (SEQ  ID  NO:36)                                      

     GMAAFKNPNNQYKAITIAQTLGDDASSEELAGRYGSAVQCTEVTASNLSTVKTKATVVEKPLKDFRAST     

     SDQSGWVESNGKWYFYESGDVKTGWVKTDGKWYYLNDLGVMQTGFVKFSGSWYYLSNSGAMFTGWGTDG     

     SRWFYFDGSGANKTGWYKENGTWYYLDEAGIMKTGWFKVGPHWYYAYGSGALAVSTTTPDGYRVNGNGE     

     WVN                                                                       

     SP023  nucleotide  (SEQ  ID  NO:37)                                       

     AGACGAGCAAAAAATTAAGCAAGCAGAAGCGGAAGTTGAGAGTAAACAAGCTGAGGCTACAAGGTTAAA     

     AAAAATCAAGACAGATCGTGAAGAAGCAGAAGAAGAAGCTAAACGAAGAGCAGATGCTAAAGAGCAAGG     

     TAAACCAAAGGGGCGGGCAAAACGAGGAGTTCCTGGAGAGCTAGCAACACCTGATAAAAAAGAAAATGA     

     TGCGAAGTCTTCAGATTCTAGCGTAGGTGAAGAAACTCTTCCAAGCCCATCCCTGAAACCAGAAAAAAA     

     GGTAGCAGAAGCTGAGAAGAAGGTTGAAGAAGCTAAGAAAAAAGCCGAGGATCAAAAAGAAGAAGATCG     

     CCGTAACTACCCAACCAATACTTACAAAACGCTTGAACTTGAAATTGCTGAGTCCGATGTGGAAGTTAA     

     AAAAGCGGAGCTTGAACTAGTAAAAGAGGAAGCTAAGGAACCTCGAAACGAGGAAAAAGTTAAGCAAGC     

     AAAAGCGGAAGTTGAGAGTAAAAAAGCTGAGGCTACAAGGTTAGAAAAAATCAAGACAGATCGTAAAAA     

     AGCAGAAGAAGAAGCTAAACGAAAAGCAGCAGAAGAAGATAAAGTTAAAGAAAAACCAGCTGAACAACC     

     ACAACCAGCGCCGGCTCCAAAAGCAGAAAAACCAGCTCCAGCTCCAAAACCAGAGAATCCAGCTGAACA     

     ACCAAAAGCAGAAAAACCAGCTGATCAACAAGCTGAAGAAGACTATGCTCGTAGATCAGAAGAAGAATA     

     TAATCGCTTGACTCAACAGCAACCGCCAAAAA&TGAAAAACCAGCACAACCATCTACTCCAAAAACAGG     

     CTGGAAACAAGAAAACGGTATGTGGTACTTCTACAATACTGATGGTTCAATGGCGACAGGATGGCTCCA     

     AAACAATGGCTCATGGTACTACCTCAACAGCAATGGCGCTATGGCGACAGGATGGCTCCAAAACAATGG     

     TTCATGGTACTATCTAAACGCTAATGGTTCAATGGCAACAGGATGGCTCCAAAACAATGGTTCATGGTA     

     CTACCTAAACGCTAATGGTTCAATGGCGACAGGATGGCTCCAATACAATGGCTCATGGTACTACCTAAA     

     CGCTAATGGTTCAATGGCGACAGGATGGCTCCAATACAATGGCTCATGGTACTACCTAAACGCTAATGG     

     TGATATGGCGACAGGTTGGGTGAAAGATGGAGATACCTGGTACTATCTTGAAGCATCAGGTGCTATGAA     

     AGCAAGCCAATGGTTCAAAGTATCAGATAAATGGTACTATGTCAATGGCTCAGGTGCCCTTGCAGTCAA     

     CACAACTGTAGATGGCTATGGAGTCAATGCCAATGGTGAATGGGTAAAC                         

     SP023  amino  acid  (SEQ  ID  NO:38)                                      

     DEQKIKQAEAEVESKQAEATRLKKIKTDREEAEEEAKRRADAKEQGKPKGRAKRGVPGELATPDKKEND     

     AKSSDSSVGEETLPSPSLKPEKKVAEAEKKVEEAKKKAEDQKEEDRRNYPTNTYKTLELEIAESDVEVK     

     KAELELVKEEAKEPRNEEKVKQAKAEVESKKAEATRLEKIKTDRKKAEEEAKRKAAEEDKVKEKPAEQP     

     QPAPAPKAEKPAPAPKPENPAEQPKAEKPADQQAEEDYARRSEEEYNRLTQQQPPKTEKPAQPSTPKTG     

     WKQENGMWYFYNTDGSNATGWLQNNGSWYYLNSNGAMATGWLQNNGSWYYLNANGSNATGWLQNNGSWY     

     YLNANGSMATGWLQYNGSWYYLNANGSNATGWLQYNGSWYYLNANGDMATGWVKDGDTWYYLEASGAMK     

     ASQWFKVSDKWYYVNGSGALAVNTTVDGYGVNANGEWVN                                   

     SP025  nucleotide  (SEQ  ID  NO:39)                                       

     CTGTGGTGAGGAAGAAACTAAAAAGACTCAAGCAGCACAACAGCCAAAACAACAAACGACTGTACAACA     

     AATTGCTGTTGGAAAAGATGCTCCAGACTTCACATTGCAATCCATGGATGGCAAAGAAGTTAAGTTATC     

     TGATTTTAAGGGTAAAAAGGTTTACTTGAAGTTTTGGGCTTCATGGTGTGGTCCATGCAAGAAAAGTAT     

     GCCAGAGTTGATGGAACTAGCGGCGAAACCAGATCGTGATTTCGAAATTCTTACTGTCATTGCACCAGG     

     AATTCAAGGTGAAAAAACTGTTGAGCAATTCCCACAATGGTTCCAGGAACAAGGATATAAGGATATCCC     

     AGTTCTTTATGATACCAAAGCAACCACTTCCAAGCTTATCAAATTCGAAGCATTCCTACAGAATATT       

     SP025  amino  acid  (SEQ  ID  NO:40)                                      

     CGEEETKKTQAAQQPKQQTTVQQIAVGKDAPDFTLQSMDGKEVKLSDFKGKKVYLKFWASWCGPCKKSM     

     PELMELAAKPDRDFEILTVIAPGIQGEKTVEQFPQWFQEQGYKDIPVLYDTKATTSKLIKFEAFLQNI      

     SP026  nucleotide  (SEQ  ID  NO:41)                                       

     GACTTTTAACAATAAAACTATTGAAGAGTTGCACAATCTCCTTGTCTCTAAGGAAATTTCTGCAACAGA     

     ATTGACCCAAGCAACACTTGAAAATATCAAGTCTCGTGAGGAAGCCCTCAATTCATTTGTCACCATCGC     

     TGAGGAGCAAGCTCTTGTTCAAGCTAAAGCCATTGATGAAGCtGGAATTGATGCTGACAATGTCCTTTC     

     AGGAATTCCACTTGCTGTTAAGGATAACATCTCTACAGACGGTATTCTCACAACTGCTGCCTCAAAAAT     

     GCTCTACAACTATGAGCCAATCTTTGATGCGACagCTgTTGCCAATGCAAAAACCAAGGGCATGATTGT     

     CGTTGGAAAGACCAACATGGACGAATTTGCTATGGGTGGTTCAGGtGAAACTTCACACTACGGAGCAAC     

     TAAAAACGCTTGGAACCACAGCAAGGTTCCTGGTGGGTCATCAAGTGGTTCTGCCGCAGCTGTAGCCTC     

     AGGACAAGTTCGCTTGTCACTTGGTTCTGATACTGGTGGTTCCATCCGCCAACCTGCTGCCTTCAACGG     

     AATCGTTGGTCTCAAACCAACCTACGGAACAGTTTCACGTTTCGGTCTCATTGCCTTTGGTAGCTCATT     

     AGACCAGATTGGACCTTTTGCTCCTACTGTTAAGGAAAATGCCCTCTTGCTCAACGCTATTGCCAGCGA     

     AGATGCTAAAGACTCTACTTCTGCTCCTGTCCGCATCGCCGACTTTACTTCAAAAATCGGCCAAGACAT     

     CAAGGGTATGAAAATCGCTTTGCCTAAGGAATACCTAGGCGAAGGAATTGATCCAGAGGTTAAGGAAAC     

     AATCTTAAACGCGGCCAAACACTTTGAAAAATTGGGTGCTATCGTCGAAGAAGTCAGCCTTCCTCACTC     

     TAAATACGGTGTTGCCGTTTATTACATCATCGCTTCATCAGAAGCTTCATCAAACTTGCAACGCTTCGA     

     CGGTATCCGTTACGGCTATCGCGCAGAAGATGCAACCAACCTTGATGAAATCTATGTAAACAGCCGAAG     

     CCAAGGTTTTGGTGAAGAGGTAAAACGTCGTATCATGCTGGGTACTTTCAGTCTTTCATCAGGTTACTA     

     TGATGCCTACTACAAAAAGGCTGGTCAAGTCCGTACCCTCATCATTCAAGATTTCGAAAAAGTCTTCGC     

     GGATTACGATTTGATTTTGGGTCCAACTGCTCCAAGTGTTGCCTATGACTTGGATTCTCTCAACCATGA     

     CCCAGTTGCCATGTACTTAGCCGACCTATTGACCATACCTGTAAACTTGGCAGGACTGCCTGGAATTTC     

     GATTCCTGCTGGATTCTCTCAAGGTCTACCTGTCGGACTCCAATTGATTGGTCCCAAGTACTCTGAGGA     

     AACCATTTACCAAGCTGCTGCTGCTTTTGAAGCAACAACAGACTACCACAAACAACAACCCGTGATTTT     

     TGGAGGTGACAAC                                                             

     SP026  amino  acid  (SEQ  ID  NO:42)                                      

     TFNNKTIEELHNLLVSKEISATELTQATLENIKSREEALNSFVTIAEEQALVQAKAIDEAGIDADNVLS     

     GIPLAVKDNISTDGILTTAASKMLYNYEPIFDATAVANAKTKGMIVVGKTNMDEFAMGGSGETSHYGAT     

     KNAWNHSKVPGGSSSGSAAAVASGQVRLSLGSDTGGSIRQPAAFNGIVGLKPTYGTVSRFGLIAFGSSL     

     DQIGPFAPTVKENALLLNAIASEDAKDSTSAPVRIADFTSKIGQDIKGMKIALPKEYLGEGIDPEVKET     

     ILNAAKHFEKLGAIVEEVSLPHSKYGVAVYYIIASSEASSNLQRFDGIRYGYRAEDATNLDEIYVNSRS     

     QGFGEEVKRRIMLGTFSLSSGYYDAYYKKAGQVRTLIIQDFEKVFADYDLILGPTAPSVAYDLDSLNHD     

     PVAMYLADLLTIPVNLAGLPGISIPAGFSQGLPVGLQLIGPKYSEETIYQAAAAFEATTDYHKQQPVIF     

     GGDN                                                                      

     SP030  nucleotide  (SEQ  ID  NO:43)                                       

     CTTTACAGGTAAACAACTACAAGTCGGCGACAAGGCGCTTGATTTTTCTCTTACTACAACAGATCTTTC     

     TAAAAAATCTCTGGCTGATTTTGATGGCAAGAAAAAAGTCTTGAGTGTCGTTCCTTCTATCGATACAGG     

     CATCTGCTCAACTCAAACACGTCGTTTTAATGAAGAATTGGCTGGACTGGACAACACGGTCGTATTGAC     

     TGTTTCAATGGACCTACCTTTTGCTCAAAAACGTTGGTGCGGTGCTGAAGGCCTTGACAATGCCATTAT     

     GCTTTCAGACTACTTTGACCATTCTTTCGGGCGCGATTATGCCCTCTTGATCAACGAATGGCACCTATT     

     AGCACGCGCAGTCTTTGTCCTCGATACTGACAATACGATTCGCTACGTTGAATACGTGGATAATATCAA     

     TTCTGAGCCAAACTTCGAA                                                       

     SP030  amino  acid  (SEQ  ID  NO:44)                                      

     FTGKQLQVGDKALDFSLTTTDLSKKSLADFDGKKKVLSVVPSIDTGICSTQTRRFNEELAGLDNTVVLT     

     VSMDLPFAQKRWCGAEGLDNAIMLSDYFDHSFGRDYALLINEWHLLARAVFVLDTDNTIRYVEYVDNIN     

     SEPNFE                                                                    

     SP031  nucleotide  (SEQ  ID  NO:45)                                       

     CCAGGCTGATACAAGTATCGCAGACATTCAAAAAAGAGGCGAACTGGTTGTCGGTGTCAAACAAGACGT     

     TCCCAATTTTGGTTACAAnGATCCCAAGACCGGTACTTATTCTGGTATCGAAaCCGACTTGGCCAAGAT     

     GGTAGCTGATGAACTCAAGGTCAAGATTCGCTATGTGCCGGTTACAGCACAAACCCGCGGCCCCCTTCT     

     AGACAATGAACAGGTCGATATGGATATCGCGACCTTTACCATCACGGACGAACGCAAAAAACTCTACAA     

     CTTTACCAGTCCCTACTACACAGACGCTTCTGGATTTTTGGTCAATAAATCTGCCAAAATCAAAAAGAT     

     TGAGGACCTAAACGGCAAAACCATCGGAGTCGCCCAAGGTTCTATCACCCAACGCCTGATTACTGAACT     

     GGGTAAAAAGAAAGGTCTGAAGTTTAAATTCGTCGAACTTGGTTCCTACCCAGAATTGATTACTTCCCT     

     GCACGCTCATCGTATCGATACCTTTTCCGTTGACCGCTCTATTCTATCTGGCTACACTAGTAAACGGAC     

     AGCACTACTAGATGATAGTTTCAAGCCATCTGACTACGGTATTGTTACCAAGAAATCAAATACAGAGCT     

     CAACGACTATCTTGATAACTTGGTTACTAAATGGAGCAAGGATGGTAGTTTGCAGAAACTTTATGACCG     

     TTACAAGCTCAAACCATCTAGCCATACTGCAGAT                                        

     SP031  amino  acid  (SEQ  ID  NO:46)                                      

     QADTSIADIQKRGELVVGVKQDVPNFGYXDPKTGTYSGIETDLAKMVADELKVKIRYVPVTAQTRGPLL     

     DNEQVDMDIATFTITDERKKLYNFTSPYYTDASGFLVNKSAKIKKIEDLNGKTIGVAQGSITQRLITEL     

     GKKKGLKFKFVELGSYPELITSLHAHRIDTFSVDRSILSGYTSKRTALLDDSFKPSDYGIVTKKSNTEL     

     NDYLDNLVTKWSKDGSLQKLYDRYKLKPSSHTAD                                        

     SP032  nucleotide  (SEQ  ID  NO:47)                                       

     GTCTGTATCATTTGAAAACAAAGAAACAAACCGTGGTGTCTTgACTTTCACTATCTCTCAAGACCAAAT     

     CAAACCAGAATTGGACCGTGTCTTCAAGtCAGTGAAGAAATCTCTTAATGTTCCAGGTTTCCGTAAAGG     

     TCACCTTCCACGCCCTATCTTCGACCAAAAATTTGGTGAAGAAGCTCTTTATCAAGATGCAATGAACGC     

     ACTTTTGCCAAACGCTTATGAAGCAGCTGTAAAAGAAGCTGGTCTTGAAGTGGTTGCCCAACCAAAAAT     

     TGACGTAACTTCAATGGAAAAAGGTCAAGACTGGGTTATCACTGCTGAAGTCGTTACAAAACCTGAAGT     

     AAAATTGGGTGACTACAAAAACCTTGAAGTATCAGTTGATGTAGAAAAAGAAGTAACTGACGCTGATGT     

     CGAAGAGCGTATCGAACGCGAACGCAACAACCTGGCTGAATTGGTTATCAAGGAAGCTGCTGCTGAAAA     

     CGGCGACACTGTTGTGATCGACTTCGTTGGTTCTATCGACGGTGTTGAATTTGACGGTGGAAAAGGTGA     

     AAACTTCTCACTTGGACTTGGTTCAGGTCAATTCATCCCTGGTTTCGAAGACCAATTGGTAGGTCACTC     

     AGCTGGCGAAACCGTTGATGTTATCGTAACATTCCCAGAAGACTACCAAGCAGAAGACCTTGCAGGTAA     

     AGAAGCTAAATTCGTGACAACTATCCACGAAGTAAAAGCTAAAGAAGTTCCGGCTCTTGACGATGAACT     

     TGCAAAAGACATTGATGAAGAAGTTGAAACACTTGCTGACTTGAAAGAAAAATACAGCAAAGAATTGGC     

     TGCTGCTAAAGAAGAAGCTTACAAAGATGCAGTTGAAGGTGCAGCAATTGATACAGCTGTAGAAAATGC     

     TGAAATCGTAGAACTTCCAGAAGAAATGATCCATGAAGAAGTTCACCGTTCAGTAAATGAATTCCTTGG     

     GAATTTGCAACGTCAAGGGATCAACCCTGACATGTACTTCCAAATCACTGGAACTACTCAAGAAGACCT     

     TCACAACCAATACCAAGCAGAAGCTGAGTCACGTACTAAGACTAACCTTGTTATCGAAGCAGTTGCCAA     

     AGCTGAAGGATTTGATGCTTCAGAAGAAGAAATCCAAAAAGAAGTTGAGCAATTGGCAGCAGACTACAA     

     CATGGAAGTTGCACAAGTTCAAAACTTGCTTTCAGCTGACATGTTGAAACATGATATCACTATCAAAAA     

     AGCTGTTGAATTGATCACAAGCACAGCAACAGTAAAA                                     

     SP032  amino  acid  (SEQ  ID  NO:48)                                      

     SVSFENKETNRGVLTFTISQDQIKPELDRVFKSVKKSLNVPGFRKGHLPRPIFDQKFGEEALYQDAMNA     

     LLPNAYEAAVKEAGLEVVAQPKIDVTSMEKGQDWVITAEVVTKPEVKLGDYKNLEVSVDVEKEVTDADV     

     EERIERERNNLAELVIKEAAAENGDTVVIDFVGSIDGVEFDGGKGENFSLGLGSGQFIPGFEDQLVGHS     

     AGETVDVIVTFPEDYQAEDLAGKEAKFVTTIHEVKAKEVPALDDELAKDIDEEVETLADLKEKYSKELA     

     AAKEEAYKDAVEGAAIDTAVENAEIVELPEEMIHEEVHRSVNEFLGNLQRQGINPDMYFQITGTTQEDL     

     HNQYQAEAESRTKTNLVIEAVAKAEGFDASEEEIQKEVEQLAADYNMEVAQVQNLLSADMLKHDITIKK     

     AVELITSTATVK                                                              

     SP033  nucleotide  (SEQ  ID  NO:49)                                       

     TGGTCAAAAGGAAAGTCAGACAGGAAAGGGGATGAAAATTGTGACCAGTTTTTATCCTATCTACGCTAT     

     GGTTAAGGAAGTATCTGGTGACTTGAATGATGTTCGGATGATTCAGTCAAGTAGTGGTATTCACTCCTT     

     TGAACCTTCGGCAAATGATATCGCAGCCATCTATGATGCAGATGTCTTTGTTTACCATTCTCATACACT     

     CGAATCTTGGGCAGGAAGTCTGGATCCAAATCTAAAAAAATCCAAAGTGAAGGTCTTAGAGGCTTCTGA     

     GGGAATGACCTTGGAACGTGTCCCTGGACTAGAGGATGTGGAAGCAGGGGATGGAGTTGATGAAAAAAC     

     GCTCTATGACCCTCACACATGGCTAGATCCTGAAAAAGCTGGAGAAGAAGCCCAAATTATCGCTGATAA     

     ACTTTCAGAGGTGGATAGTGAGCATAAAGAGACTTATCAAAAAAATGCGCAACCTTTATCAAAAAAGCT     

     CAGGAAT                                                                   

     SP033  amino  acid  (SEQ  ID  NO:50)                                      

     GQKESQTGKGMKIVTSFYPIYAMVKEVSGDLNDVRMIQSSSGIHSFEPSANDIAAIYDADVFVYHSHTL     

     ESWAGSLDPNLKKSKVKVLEASEGMTLERVPGLEDVEAGDGVDEKTLYDPHTWLDPEKAGEEAQIIADK     

     LSEVDSEHKETYQKNAQPLSKKLRN                                                 

     SP034  nucleotide  (SEQ  ID  NO:51)                                       

     GAAGGATAGATATATTTTAGCATTTGAGACATCCTGTGATGAGACCAGTGTCGCCGTCTTGAAAAACGA     

     CGATGAGCTCTTGTCCAATGTCATTGCTAGTCAAATTGAGAGTCACAAACGTTTTGGTGGCGTAGTGCC     

     CGAAGTAGCCAGTCGTCACCATGTCGAGGTCATTACAGCCTGTATCGAGGAGGCATTGGCAGAAGCAGG     

     GATTACCGAAGAGGACGTGACAGCTGTTGCGGTTACCTACGGACCAGGCTTGGTCGGAGCCTTGCTAGT     

     TGGTTTGTCAGCTGCCAAGGCCTTTGCTTGGGCTCACGGACTTCCACTGATTCCTGTTAATCACATGGC     

     TGGGCACCTCATGGCAGCTCAGAGTGTGGAGCCTTTGGAGTTTCCCTTGCTAGCCCTCTTGGTCAGCGG     

     CGGACACACAGAGTTGGTTTATGTTTCGGAGGCAGGAGATTATAAGATTGTTGGGGAAACCCGTGATGA     

     TGCGGTTGGTGAGGCTTATGATAAGGTCGGCCGTGTCATGGGCTTGACCTATCCTGCAGGTCGTGAGAT     

     TGACGAGCTGGCTCATCAGGGGCAGGATATTTATGATTTCCCCCGTGCCATGATTAAGGAAGATAATCT     

     GGAGTTCTCCTTCTCAGGTTTGAAATCTGCCTTTATCAATCTTCATCACAATGCCGAGCAAAAGGGAGA     

     AAGCCTGTCTACAGAAGATTTGTGTGCTTCCTTCCAAGCAGCAGTTATGGACATTCTCATGGCAAAAAC     

     CAAGAAGGCTTTGGAGAAATATCCTGTTAAAATCCTAGTTGTGGCAGGTGGTGTGGCAGCCAATAAAGG     

     TCTCAGAGAACGCCTAGCAGCCGAAATCACAGATGTCAAGGTTATCATCCCCCCTCTGCGACTCTGCGG     

     AGACAATGCAGGTATGATTGCCTATGCCAGCGTCAGCNAGTGGAACAAAGAAAACTTCGCAGGCTGGGA     

     CCTCAATGCCAAACCAAGTCTTGCCTTTGATACCATGGAA                                  

     SP034  amino  acid  (SEQ  ID  NO:52)                                      

     KDRYILAFETSCDETSVAVLKNDDELLSNVIASQIESHKRFGGVVPEVASRHHVEVITACIEEALAEAG     

     ITEEDVTAVAVTYGPGLVGALLVGLSAAKAFAWAHGLPLIPVNHMAGHLMAAQSVEPLEFPLLALLVSG     

     GHTELVYVSEAGDYKIVGETRDDAVGEAYDKVGRVMGLTYPAGREIDELAHQGQDIYDFPRAMIKEDNL     

     EFSFSGLKSAFINLHHNAEQKGESLSTEDLCASFQAAVMDILMAKTKKALEKYPVKILVVAGGVAANKG     

     LRERLAAEITDVKVIIPPLRLCGDNAGMIAYASVSXWNKENFAGWDLNAKPSLAFDTME               

     SP035  nucleotide  (SEQ  ID  NO:53)                                       

     GGTAGTTAAAGTTGGTATTAACGGTTTCGGACGTATCGGTCGTCTTGCTTTCCGTCGTATCCAAAACGT     

     AGAAGGTGTTGAAGTTACACGCATCAACGACCTTACAGATCCAGTTATGCTTGCACACTTGTTGAAATA     

     CGACACAACTCAAGGTCGTTTCGACGGTACTGTTGAAGTTAAAGAAGGTGGATTTGAAGTTAACGGTAA     

     ATTCATCAAAGTTTCTGCTGAACGTGATCCAGAACAAATCGACTGGGCTACTGACGGTGTAGAAATCGT     

     TCTTGAAGCTACTGGTTTCTTTGCTAAGAAAGAAGCAGCTGAAAAACACCTTAAAGGTGGAGCTAAAAA     

     AGTTGTTATCACTGCTCCTGGTGGAAACGACGTTAAAACAGTTGTATTCAACACTAACCACGACGTTCT     

     TGACGGTACTGAAACAGTTATCTCAGGTGCTTCATGTACTACAAACTGCTTGGCTCCAATGGCTAAAGC     

     TCTTCAAGACAACTTTGGTGTTGTTGAAGGATTGATGACTACTATCCACGCTTACACTGGTGACCAAAT     

     GATCCTTGACGGACCACACCGTGGTGGTGACCTTCGCCGTGCTCGCGCTGGTGCTGCAAACATCGTTCC     

     TAACTCAACTGGTGCTGCAAAAGCTATCGGTCTTGTAATCCCAGAATTGAATGGTAAACTTGACGGATC     

     TGCACAACGCGTTCCAACTCCAACTGGATCAGTTACTGAATTGGTAGCAGTTCTTGAAAAGAACGTTAC     

     TGTTGATGAAGTGAACGCAGCTATGAAAGCAGCTTCAAACGAATCATACGGTTACACAGAAGATCCAAT     

     CGTATCTTCAGATATCGTAGGTATGTCTTACGGTTCATTGTTTGACGCAACTCAAACTAAAGTTCTTGA     

     CGTTGACGGTAAACAATTGGTTAAAGTTGTATCATGGTACGACAACGAAATGTCATACACTGCACAACT     

     TGTTCGTACTCTTGGAATACTTCGCAAAAATTGC                                        

     SP035  amino  acid  (SEQ  ID  NO:54)                                      

     VVKVGINGFGRIGRLAFRRIQNVEGVEVTRINDLTDPVMLAHLLKYDTTQGRFDGTVEVKEGGFEVNGK     

     FIKVSAERDPEQIDWATDGVEIVLEATGFFAKKEAAEKHLKGGAKKVVITAPGGNDVKTVVFNTNHDVL     

     DGTETVISGASCTTNCLAPMAKALQDNFGVVEGLMTTIHAYTGDQMILDGPHRGGDLRRARAGAANIVP     

     NSTGAAKAIGLVIPELNGKLDGSAQRVPTPTGSVTELVAVLEKNVTVDEVNAAMKAASNESYGYTEDPI     

     VSSDIVGMSYGSLFDATQTKVLDVDGKQLVKVVSWYDNEMSYTAQLVRTLGILRKNC                 

     SP036  nucleotide  (SEQ  ID  NO:55)                                       

     TTCTTACGAGTTGGGACTGTATCAAGCTAGAACGGTTAAGGAAAATAATCGTGTTTCCTATATAGATGG     

     AAAACAAGCGACGCAAAAAACGGAGAATTTGACTCCTGATGAGGTTAGCAAGCGTGAAGGAATCAATGC     

     TGAGCAAATCGTCATCAAGATAACAGACCAAGGCTATGTCACTTCACATGGCGACCACTATCATTATTA     

     CAATGGTAAGGTTCCTTATGACGCTATCATCAGTGAAGAATTACTCATGAAAGATCCAAACTATAAGCT     

     AAAAGATGAGGATATTGTTAATGAGGTCAAGGGTGGATATGTTATCAAGGTAGATGGAAAATACTATGT     

     TTACCTTAAGGATGCTGCCCACGCGGATAACGTCCGTACAAAAGAGGAAATCAATCGACAAAAACAAGA     

     GCATAGTCAACATCGTGAAGGTGGAACTCCAAGAAACGATGGTGCTGTTGCCTTGGCACGTTCGCAAGG     

     ACGCTATACTACAGATGATGGTTATATCTTTAATGCTTCTGATATCATAGAGGATACTGGTGATGCTTA     

     TATCGTTCCTCATGGAGATCATTACCATTACATTCCTAAGAATGAGTTATCAGCTAGCGAGTTGGCTGC     

     TGCAGAAGCCTTCCTATCTGGTCGAGGAAATCTGTCAAATTCAAGAACCTATCGCCGACAAAATAGCGA     

     TAACACTTCAAGAACAAACTGGGTACCTTCTGTAAGCAATCCAGGAACTACAAATACTAACACAAGCAA     

     CAACAGCAACACTAACAGTCAAGCAAGTCAAAGTAATGACATTGATAGTCTCTTGAAACAGCTCTACAA     

     ACTGCCTTTGAGTCAACGACATGTAGAATCTGATGGCCTTGTCTTTGATCCAGCACAAATCACAAGTCG     

     AACAGCTAGAGGTGTTGCAGTGCCACACGGAGATCATTACCACTTCATCCCTTACTCTCAAATGTCTGA     

     ATTGGAAGAACGAATCGCTCGTATTATTCCCCTTCGTTATCGTTCAAACCATTGGGTACCAGATTCAAG     

     GCCAGAACAACCAAGTCCACAACCGACTCCGGAACCTAGTCCAGGCCCGCAACCTGCACCAAATCTTAA     

     AATAGACTCAAATTCTTCTTTGGTTAGTCAGCTGGTACGAAAAGTTGGGGAAGGATATGTATTCGAAGA     

     AAAGGGCATCTCTCGTTATGTCTTTGCGAAAGATTTACCATCTGAAACTGTTAAAAATCTTGAAAGCAA     

     GTTATCAAAACAAGAGAGTGTTTCACACACTTTAACTGCTAAAAAAGAAAATGTTGCTCCTCGTGACCA     

     AGAATTTTATGATAAAGCATATAATCTGTTAACTGAGGCTCATAAAGCCTTGTTTGNAAATAAGGGTCG     

     TAATTCTGATTTCCAAGCCTTAGACAAATTATTAGAACGCTTGAATGATGAATCGACTAATAAAGAAAA     

     ATTGGTAGATGATTTATTGGCATTCCTAGCACCAATTACCCATCCAGAGCGACTTGGCAAACCAAATTC     

     TCAAATTGAGTATACTGAAGACGAAGTTCGTATTGCTCAATTAGCTGATAAGTATACAACGTCAGATGG     

     TTACATTTTTGATGAACATGATATAATCAGTGATGAAGGAGATGCATATGTAACGCCTCATATGGGCCA     

     TAGTCACTGGATTGGAAAAGATAGCCTTTCTGATAAGGAAAAAGTTGCAGCTCAAGCCTATACTAAAGA     

     AAAAGGTATCCTACCTCCATCTCCAGACGCAGATGTTAAAGCAAATCCAACTGGAGATAGTGCAGCAGC     

     TATTTACAATCGTGTGAAAGGGGAAAAACGAATTCCACTCGTTCGACTTCCATATATGGTTGAGCATAC     

     AGTTGAGGTTAAAAACGGTAATTTGATTATTCCTCATAAGGATCATTACCATAATATTAAATTTGCTTG     

     GTTTGATGATCACACATACAAAGCTCCAAATGGCTATACCTTGGAAGATTTGTTTGCGACGATTAAGTA     

     CTACGTAGAACACCCTGACGAACGTCCACATTCTAATGATGGATGGGGCAATGCCAGTGAGCATGTGTT     

     AGGCAAGAAAGACCACAGTGAAGATCCAAATAAGAACTTCAAAGCGGATGAAGAGCCAGTAGAGGAAAC     

     ACCTGCTGAGCCAGAAGTCCCTCAAGTAGAGACTGAAAAAGTAGAAGCCCAACTCAAAGAAGCAGAAGT     

     TTTGCTTGCGAAAGTAACGGATTCTAGTCTGAAAGCCAATGCAACAGAAACTCTAGCTGGTTTACGAAA     

     TAATTTGACTCTTCAAATTATGGATAACAATAGTATCATGGCAGAAGCAGAAAAATTACTTGCGTTGTT     

     AAAAGGAAGTAATCCTTCATCTGTAAGTAAGGAAAAAATAAAC                               

     SP036  amino  acid  (SEQ  ID  NO:56)                                      

     SYELGLYQARTVKENNRVSYIDGKQATQKTENLTPDEVSKREGINAEQIVIKITDQGYVTSHGDHYHYY     

     NGKVPYDAIISEELLMKDPNYKLKDEDIVNEVKGGYVIKVDGKYYVYLKDAAHADMVRTKEEINRQKQE     

     HSQHREGGTPRNDGAVALARSQGRYTTDDGYIFNASDIIEDTGDAYIVPHGDHYHYIPKNELSASELAA     

     AEAFLSGRGNLSNSRTYRRQNSDNTSRTNWVPSVSNPGTTNTNTSNNSNTNSQASQSNDIDSLLKQLYK     

     LPLSQRHVESDGLVFDPAQITSRTARGVAVPHGDHYHFIPYSQMSELEERIARIIPLRYRSNHWVPDSR     

     PEQPSPQPTPEPSPGPQPAPNLKIDSNSSLVSQLVRKVGEGYVFEEKGISRYVFAKDLPSETVKNLESK     

     LSKQESVSHTLTAKKENVAPRDQEFYDKAYNLLTEAHKALFXNKGRNSDFQALDKLLERLNDESTNKEK     

     LVDDLLAFLAPITHPERLGKPNSQIEYTEDEVRIAQLADKYTTSDGYIFDEHDIISDEGDAYVTPHMGH     

     SHWIGKDSLSDKEKVAAQAYTKEKGILPPSPDADVKANPTGDSAAAIYNRVKGEKRIPLVRLPYHVEHT     

     VEVKNGNLIIPHKDHYHNIKFAWFDDHTYKAPNGYTLEDLFATIKYYVEHPDERPHSNDGWGNASEHVL     

     GKKDHSEDPNKNFKADEEPVEETPAEPEVPQVETEKVEAQLKEAEVLLAKVTDSSLKANATETLAGLRN     

     NLTLQIMDNNSIMAEAEKLLALLKGSNPSSVSKEKIN                                     

     SP038  nucleotide  (SEQ  ID  NO:57)                                       

     TACTGAGATGCATCATAATCTAGGAGCTGAAAAGCGTTCAGCAGTGGCTACTACTATCGATAGTTTTAA     

     GGAGCGAAGTCAAAAAGTCAGAGCACTATCTGATCCAAATGTGCGTTTTGTTCCCTTCTTTGGCTCTAG     

     TGAATGGCTTCGTTTTGACGGTGCTCATTCTGCGGTATTAGCTGAGAAATACAATCGTTCCTACCGTCC     

     TTATCTTTTAGGACAGGGGGGAGCTGCATCGCTTAACCAATATTTTGGAATGCAACAGATGTTACCACA     

     GCTGGAGAATAAACAAGTTGTGTATGTTATCTCACCTCAGTGGTTCAGTAAAAATGGCTATGATCCAGC     

     AGCCTTCCAGCAGTATTTTAATGGAGACCAGTTGACTAGTTTTCTGAAACATCAATCTGGGGATCAGGC     

     TAGTCAATATGCAGCGACTCGCTTACTGCAACAGTTCCCAAACGTAGCTATGAAGGACCTGGTTCAGAA     

     GTTGGCAAGTAAAGAAGAATTGTCGACAGCAGACAATGAAATGATTGAATTATTGGCTCGTTTTAATGA     

     ACGCCAAGCTTCCTTTTTTGGTCAGTTTTCGGTTAGAGGCTATGTTAACTACGATAAGCATGTAGCTAA     

     GTATTTAAAAATCTTGCCAGACCAGTTTTCTTATCAGGCAATAGAAGATGTTGTCAAAGCAGATGCTGA     

     AAAAAATACTTCCAATAATGAGATGGGAATGGAAAATTATTTCTATAATGAGCAGATCAAGAAGGATTT     

     GAAGAAATTAAAGGATTCTCAGAAAAGCTTTACCTATCTCAAGTCGCCAGAGTATAATGNNTTGCAGTT     

     GGTTTTAACACAGTTTTCTAAATCTAAGGTAAACCCGATTTTTATCATTCCACCTGTTAATAAAAAATG     

     GATGNACTATGCTGGTCTACGAGAGGATATGTACCAACAAACGGTGCAGAAGATTCGCTACCAGTTAGA     

     AAGTCAAGGTTTTACCAATATAGCAGATTTTTCTAAGGACGGCGGGGAGCCTTTCTTTATGAAGGACAC     

     CATTCACCTTGGTTGGTTGGGTTGGTTGGCTTTTGACAAGGCAGTTGATCCTTTCCTATCCAATCCCAC     

     ACCAGCTCCGACTTACCATCTGAATGAGCGCTTTTTCAGCAAAGATTGGGCGACTTATGATGGAGATGT     

     CAAAGAA                                                                   

     SP038  amino  acid  (SEQ  ID  NO:58)                                      

     TEMHHNLGAEKRSAVATTIDSFKERSQKVRALSDPNVRFVPFFGSSEWLRFDGAHSAVLAEKYNRSYRP     

     YLLGQGGAASLNQYFGMQQMLPQLENKQVVYVISPQWFSKNGYDPAAFQQYFNGDQLTSFLKHQSGDQA     

     SQYAATRLLQQFPNVAMKDLVQKLASKEELSTADNEMIELLARFNERQASFFGQFSVRGYVNYDKHVAK     

     YLKILPDQFSYQAIEDVVKADAEKNTSNNEMGMENYFYNEQIKKDLKKLKDSQKSFTYLKSPEYNXLQL     

     VLTQFSKSKVNPIFIIPPVNKKWMXYAGLREDMYQQTVQKIRYQLESQGFTNIADFSKDGGEPFFMKDT     

     IHLGWLGWLAFDKAVDPFLSNPTPAPTYHLNERFFSKDWATYDGDVKE                          

     SP039  nucleotide  (SEQ  ID  NO:59)                                       

     GGTTTTGAGAAAGTATTTGCAGGGGGCCCTGATTGAGTCGATTGAGCAAGTGGAAAATGACCGTATTGT     

     GGAAATTACAGTTTCCAATAAAAACGAGATTGGAGACCATATCCAGGCTACCTTGATTATCGAAATTAT     

     GGGGAAACACAGTAATATTCTACTGGTCGATAAAAGCAGTCATAAAATCCTCGAAGTTATCAAACACGT     

     CGGCTTTTCACAAAATAGCTACCGCACCTTACTTCCAGGATCGACCTATATCGCTCCGCCAAGTACAAA     

     ATCTCTCAATCCTTTTACTATCAAGGATGAAAAGCTCTTTGAAATCCTGCAAACCCAAGAACTAACAGC     

     AAAAAATCTTCAAAGCCTCTTTCAAGGTCTGGGACGCGATACGGCAAATGAATTGGAAAGGATACTGGT     

     TAGTGAAAAACTTTCCGCTTTCCGAAATTTTTTCAATCAAGAAACCAAGCCATGCTTGACTGAGACTTC     

     CTTCAGTCCAGTTCCTTTTGCAAATCAGGTGGGAGAGCCTTTTGCAAATCTTTCTGATTTGTTGGACAC     

     CTACTATAAGGATAAGGCTGAGCGCGACCGCGTCAAACAGCAGGCCAGTGAACTGATTCGTCGTGTTGA     

     AAATGAACTTCAGAAAAACCGACACAAACTCAAAAAACAGGAAAAAGAGTTACTGGCGACAGACAACGC     

     TGAAGAATTTCGTCAAAAAGGAGAATTGCTGACAACCTTCCTCCACCAAGTGCCTAACGACCAAGACCA     

     GGTTATCCTAGACAACTACTATACCAACCAACCTATCATGATTGCGCTTGATAAGGCTCTGACTCCCAA     

     CCAGAATGCCCAACGCTATTTTAAACGGTATCAGAAACTCAAAGAAGCTGTCAAATACTTGACTGATTT     

     GATTGAAGAAACCAAAGCCACTATTCTCTATCTGGAAAGTGTAGAAACCGTCCTCAACCAAGCTGGACT     

     GGAAGAAATCGCTGAAATCCGTGAAGAATTGATTCAAACAGGTTTTATCCGCAGAAGACAACGGGAGAA     

     AATCCAGAAACGCAAAAAACTAGAACAATATCTAGCAAGCGATGGCAAAACCATCATCTATGTCGGACG     

     AAACAATCTTCAAAATGAGGAATTGACCTTTAAAATGGCCCGCAAGGAGGAACTTTGGTTCCATGCTAA     

     GGACATTCCTGGAAGCCATGTTGTCATCTCAGGAAATCTTGACCCATCTGATGCAGTCAAGACAGACGC     

     AGCAGAGTTAGCTGCCTACTTCTCTCAAGGGCGCCTGTCGAATCTGGTGCAGGTAGATATGATTGAAGT     

     CAAAAACTCAATAAACCAACTGGTGGAAAACCCGGCTTTGTCACTTACACAGGACAAAAGACCCTCCG      

     CGTCACACCAGACTCCAAAAAAATTGCATCCATGAAAAAATCC                               

     SP039  amino  acid  (SEQ  ID  NO:60)                                      

     VLRKYLQGALIESIEQVENDRIVEITVSNKNEIGDHIQATLIIEIMGKHSNILLVDKSSHKILEVIKHV     

     GFSQNSYRTLLPGSTYIAPPSTKSLNPFTIKDEKLFEILQTQELTAKNLQSLFQGLGRDTANELERILV     

     SEKLSAFRNFFNQETKPCLTETSFSPVPFANQVGEPFANLSDLLDTYYKDKAERDRVKQQASELIRRVE     

     NELQKNRHKLKKQEKELLATDNAEEFRQKGELLTTFLHQVPNDQDQVILDNYYTNQPIMIALDKALTPN     

     QNAQRYFKRYQKLKEAVKYLTDLIEETKATILYLESVETVLNQAGLEEIAEIREELIQTGFIRRRQREK     

     IQKRKKLEQYLASDGKTIIYVGRNNLQNEELTFKNARKEELWFHAKDIPGSHVVISGNLDPSDAVKTDA     

     AELAAYFSQGRLSNLVQVDMIEVKKLNKPTGGKPGFVTYTGQKTLRVTPDSKKIASMKKS              

     SP040  nucleotide  (SEQ  ID  NO:61)                                       

     GACAACATTTACTATCCATACAGTAGAGTCAGCACCAGCAGAAGTGAAAGAAATTCTTGAAACAGTAGA     

     AAAAGACAACAATGGCTATATTCCCAACCTAATCGGTCTCTTGGCCAATGCCCCGACTGTTTTAGAAGC     

     CTACCAAATTGTCTCATCTATCCACCGTCGCAACAGCCTGACACCCGTTGAGCGTGAAGTGGTGCAAAT     

     CACGGCAGCCGTGACCAATGGTTGTGCCTTCTGTGTCGCAGGTCACACAGCCTTTTCCATCAAACAAAT     

     CCAGATGAATGATGACTTGATTCAAGCTCTTCGCAATCGTACTCCAATTGAAACAGATCCTAAATTGGA     

     TACCCTAGCTAAGTTTACCTTGGCAGTTATCAATACCAAGGGTCGTGTAGGAGATGAAGCCTTGTCTGA     

     GTTTTTAGAAGCTGGCTACACTCAACAAAATGCCTTGGATGTGGTTTTTGGTGTCAGCCTAGCAATCCT     

     CTGTAACTATGCCAACAACTTAGCTAATACACCAATTAATCCAGAATTGCAACCTTATGCC             

     SP040  amino  acid  (SEQ  ID  NO:62)                                      

     TTFTIHTVESAPAEVKEILETVEKDNNGYIPNLIGLLANAPTVLEAYQIVSSIHRRNSLTPVEREVVQI     

     TAAVTNGCAFCVAGHTAFSIKQIQMNDDLIQALRNRTPIETDPKLDTLAKFTLAVINTKGRVGDEALSE     

     FLEAGYTQQNALDVVFGVSLAILCNYANNLANTPINPELQPYA                               

     SP041  nucleotide  (SEQ  ID  NO:63)                                       

     GGCTAAGGAAAGAGTGGATGTACTAGCTTATAAACAGGGGTTGTTTGAAACGAGAGAGCAGGCCAAGCG     

     AGGTGTGATGGCTGGCCTAGTCGTAGCAGTCCTTAATGGAGAACGGTTTGACAAGCCAGGAGAGAAAAT     

     TCCAGATGACACCGAATTAAAACTCAAGGGGGAGAAACTCAAGTATGTCAGCCGTGGTGGTTTGAAACT     

     GGAAAAGGCCTTGCAGGTCTTTGATTTGTCGGTGGATGGCGCGACTACGATTGATATCGGGGCCTCTAC     

     TGGAGGTTTTACCGATGTCATGCTACAGAATAGTGCCAAGTTGGTCTTTGCAGTCGATGTTGGTACCAA     

     TCAGTTGGCTTGGAAATTACGCCAAGACCCACGAGTTGTCAGCATGGAGCAGTTCAATTTCCGCTATGC     

     TGAAAAGACTGATTTCGAGCAGGAGCCGAGCTTTGCCAGTATTGATGTGAGTTTCATTTCCCTTAGTCT     

     GATTTTGCCAGCCTTGCACCGTGTCTTGGCTGATCAAGGTCAGGTGGTAGCACTTGTCAAACCTCAGTT     

     TGAGGCAGGACGTGAGCAGATTGGGAAAAATGGAATTATTCGAGATGCTAAGGTTCATCAGAATGTCCT     

     TGAATCTGTAACAGCTATGGCAGTAGAGGTAGGTTTTTCAGTCCTTGGCTTGGACTTTTCTCCCATCCA     

     AGGTGGACATGGAAATATTGAATTTTTAGCGTATTTGAAAAAAGAAAAGTCAGCAAGCAATCAGATTCT     

     TGCTGAGATTAAAGAAGCAGTAGAGAGGGCGCATAGTCAATTTAAAAATGAA                      

     SP041  amino  acid  (SEQ  ID  NO:64)                                      

     AKERVDVLAYKQGLFETREQAKRGVMAGLVVAVLNGERFDKPGEKIPDDTELKLKGEKLKYVSRGGLKL     

     EKALQVFDLSVDGATTIDIGASTGGFTDVMLQNSAKLVFAVDVGTNQLAWKLRQDPRVVSMEQFNFRYA     

     EKTDFEQEPSFASIDVSFISLSLILPALHRVLADQGQVVALVKPQFEAGREQIGKNGIIRDAKVHQNVL     

     ESVTAMAVEVGFSVLGLDFSPIQGGHGNIEFLAYLKKEKSASNQILAEIKEAVERAHSQFKNE           

     SP042  nucleotide  (SEQ  ID  NO:65)                                       

     TTGTTCCTATGAACTTGGTCGTCACCAAGCTGGTCAGGTTAAGAAAGAGTCTAATCGAGTTTCTTATAT     

     AGATGGTGATCAGGCTGGTCAAAAGGCAGAAAACTTGACACCAGATGAAGTCAGTAAGAGGGAGGGGAT     

     CAACGCCGAACAAATNGTNATCAAGATTACGGATCAAGGTTATGTGACCTCTCATGGAGACCATTATCA     

     TTACTATAATGGCAAGGTTCCTTATGATGCCATCATCAGTGAAGAGCTCCTCATGAAAGATCCGAATTA     

     TCAGTTGAAGGATTCAGACATTGTCAATGAAATCAAGGGTGGTTATGTCATTAAGGTAAACGGTAAATA     

     CTATGTNTACCTTAAGGATGCAGCTCATGCGGATAATATTCGGACAAAAGAAGAGATTAAACGTCAGAA     

     GCAGGAACGCAGTCATAATCATAACTCAAGAGCAGATAATGCTGTTGCTGCAGCCAGAGCCCAAGGACG     

     TTATACAACGGATGATGGGTATATCTTCAATGCATCTGATATCATTGAGGACACGGGTGATGCTTATAT     

     CGTTCCTCACGGCGACCATTACCATTACATTCCTAAGAATGAGTTATCAGCTAGCGAGTTAGCTGCTGC     

     AGAAGCCTATTGGAATGGGAAGCAGGGATCTCGTCCTTCTTCAAGTTCTAGTTATAATGCAAATCCAGC     

     TCAACCAAGATTGTCAGAGAACCACAATCTGACTGTCACTCCAACTTATCATCAAAATCAAGGGGAAAA     

     CATTTCAAGCCTTTTACGTGAATTGTATGCTAAACCCTTATCAGAACGCCATGTGGAATCTGATGGCCT     

     TATTTTCGACCCAGCGCAAATCACAAGTCGAACCGCCAGAGGTGTAGCTGTCCCTCATGGTAACCATTA     

     CCACTTTATCCCTTATGAACAAATGTCTGAATTGGAAAAACGAATTGCTCGTATTATTCCCCTTCGTTA     

     TCGTTCAAACCATTGGGTACCAGATTCAAGACCAGAACAACCAAGTCCACAATCGACTCCGGAACCTAG     

     TCCAAGTCCGCAACCTGCACCAAATCCTCAACCAGCTCCAAGCAATCCAATTGATGAGAAATTGGTCAA     

     AGAAGCTGTTCGAAAAGTAGGCGATGGTTATGTCTTTGAGGAGAATGGAGTTTCTCGTTATATCCCAGC     

     CAAGGATCTTTCAGCAGAAACAGCAGCAGGCATTGATAGCAAACTGGCCAAGCAGGAAAGTTTATCTCA     

     TAAGCTAGGAGCTAAGAAAACTGACCTCCCATCTAGTGATCGAGAATTTTACAATAAGGCTTATGACTT     

     ACTAGCAAGAATTCACCAAGATTTACTTGATAATAAAGGTCGACAAGTTGATTTTGAGGCTTTGGATAA     

     CCTGTTGGAACGACTCAAGGATGTCNCAAGTGATAAAGTCAAGTTAGTGGANGATATTCTTGCCTTCTT     

     AGCTCCGATTCGTCATCCAGAACGTTTAGGAAAACCAAATGCGCAAATTACCTACACTGATGATGAGAT     

     TCAAGTAGCCAAGTTGGCAGGCAAGTACACAACAGAAGACGGTTATATCTTTGATCCTCGTGATATAAC     

     CAGTGATGAGGGGGATGCCTATGTAACTCCACATATGACCCATAGCCACTGGATTAAAAAAGATAGTTT     

     GTCTGAAGCTGAGAGAGCGGCAGCCCAGGCTTATGCTAAAGAGAAAGGTTTGACCCCTCCTTCGACAGA     

     CCATCAGGATTCAGGAAATACTGAGGCAAAAGGAGCAGAAGCTATCTACAACCGCGTGAAAGCAGCTAA     

     GAAGGTGCCACTTGATCGTATGCCTTACAATCTTCAATATACTGTAGAAGTCAAAAACGGTAGTTTAAT     

     CATACCTCATTATGACCATTACCATAACATCAAATTTGAGTGGTTTGACGAAGGCCTTTATGAGGCACC     

     TAAGGGGTATACTCTTGAGGATCTTTTGGCGACTGTCAAGTACTATGTCGAACATCCAAACGAACGTCC     

     GCATTCAGATAATGGTTTTGGTAACGCTAGCGACCATGTTCAAAGAAACAAAAATGGTCAAGCTGATAC     

     CAATCAAACGGAAAAACCAAGCGAGGAGAAACCTCAGACAGAAAAACCTGAGGAAGAAACCCCTCGAGA     

     AGAGAAACCGCAAAGCGAGAAACCAGAGTCTCCAAAACCAACAGAGGAACCAGAAGAATCACCAGAGGA     

     ATCAGAAGAACCTCAGGTCGAGACTGAAAAGGTTGAAGAAAAACTGAGAGAGGCTGAAGATTTACTTGG     

     AAAAATCCAGGAT                                                             

     SP042  amino  acid  (SEQ  ID  NO:66)                                      

     CSYELGRHQAGQVKKESNRVSYIDGDQAGQKAENLTPDEVSKREGINAEQXVIKITDQGYVTSHGDHYH     

     YYNGKVPYDAIISEELLMKDPNYQLKDSDIVNEIKGGYVIKVNGKYYVYLKDAAHADNIRTKEEIKRQK     

     QERSHNHNSRADNAVAAARAQGRYTTDDGYIFNASDIIEDTGDAYIVPHGDHYHYIPKNELSASELAAA     

     EAYWNGKQGSRPSSSSSYNANPAQPRLSENHNLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGL     

     IFDPAQITSRTARGVAVPHGNHYHFIPYEQMSELEKRIARIIPLRYRSNHWVPDSRPEQPSPQSTPEPS     

     PSPQPAPNPQPAPSNPIDEKLVKEAVRKVGDGYVFEENGVSRYIPAKDLSAETAAGIDSKLAKQESLSH     

     KLGAKKTDLPSSDREFYNKAYDLLARIHQDLLDNKGRQVDFEALDNLLERLKDVXSDKVKLVXDILAFL     

     APIRHPERLGKPNAQITYTDDEIQVAKLAGKYTTEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSL     

     SEAERAAAQAYAKEKGLTPPSTDHQDSGNTEAKGAEAIYNRVKAAKKVPLDRMPYNLQYTVEVKNGSLI     

     IPHYDHYHNIKFEWFDEGLYEAPKGYTLEDLLATVKYYVEHPNERPHSDNGFGNASDHVQRNKNGQADT     

     NQTEKPSEEKPQTEKPEEETPREEKPQSEKPESPKPTEEPEESPEESEEPQVETEKVEEKLREAEDLLG     

     KIQD                                                                      

     SP043  nucleotide  (SEQ  ID  NO:67)                                       

     TTATAAGGGTGAATTAGAAAAAGGATACCAATTTGATGGTTGGGAAATTTCTGGTTTCGAAGGTAAAAA     

     AGACGCTGGCTATGTTATTAATCTATCAAAAGATACCTTTATAAAACCTGTATTCAAGAAAATAGAGGA     

     GAAAAAGGAGGAAGAAAATAAACCTACTTTTGATGTATCGAAAAAGAAAGATAACCCACAAGTAAACCA     

     TAGTCAATTAAATGAAAGTCACAGAAAAGAGGATTTACAAAGAGAAGAGCATTCACAAAAATCTGATTC     

     AACTAAGGATGTTACAGCTACAGTTCTTGATAAAAACAATATCAGTAGTAAATCAACTACTAACAATCC     

     TAATAAG                                                                   

     SP043  amino  acid  (SEQ  ID  NO:68)                                      

     YKGELEKGYQFDGWEISGFEGKKDAGYVINLSKDTFIKPVFKKIEEKKEEENKPTFDVSKKKDNPQVNH     

     SQLNESHRKEDLQREEHSQKSDSTKDVTATVLDKNNISSKSTTNNPNK                          

     SP044  nucleotide  (SEQ  ID  NO:69)                                       

     GAATGTTCAGGCTCAAGAAAGTTCAGGAAATAAAATCCACTTTATCAATGTTCAAGAAGGTGGCAGTGA     

     TGCGATTATTCTTGAAAGCAATGGACATTTTGCCATGGTGGATACAGGAGAAGATTATGATTTCCCAGA     

     TGGAAGTGATTCTCGCTATCCATGGAGAGAAGGAATTGAAACGTCTTATAAGCATGTTCTAACAGACCG     

     TGTCTTTCGTCGTTTGAAGGAATTGGGTGTCCAAAAACTTGATTTTATTTTGGTGACCCATACCCACAG     

     TGATCATATTGGAAATGTTGATGAATTACTGTCTACCTATCCAGTTGACCGAGTCTATCTTAAGAAATA     

     TAGTGATAGTCGTATTACTAATTCTGAACGTCTATGGGATAATCTGTATGGCTATGATAAGGTTTTACA     

     GACTGCTGCAGAAAAAGGTGTTTCAGTTATTCAAAATATCACACAAGGGGATGCTCATTTTCAGTTTGG     

     GGACATGGATATTCAGCTCTATAATTATGAAAATGAAACTGATTCATCGGGTGAATTAAAGAAAATTTG     

     GGATGACAATTCCAATTCCTTGATTAGCGTGGTGAAAGTCAATGGCAAGAAAATTTACCTTGGGGGCGA     

     TTTAGATAATGTTCATGGAGCAGAAGACAAGTATGGTCCTCTCATTGGAAAAGTTGATTTGATGAAGTT     

     TAATCATCACCATGATACCAACAAATCAAATACCAAGGATTTCATTAAAAATTTGAGTCCGAGTTTGAT     

     TGTTCAAACTTCGGATAGTCTACCTTGGAAAAATGGTGTTGATAGTGAGTATGTTAATTGGCTCAAAGA     

     ACGAGGAATTGAGAGAATCAACGCAGCCAGCAAAGACTATGATGCAACAGTTTTTGATATTCGAAAAGA     

     CGGTTTTGTCAATATTTCAACATCCTACAAGCCGATTCCAAGTTTTCAAGCTGGTTGGCATAAGAGTGC     

     ATATGGGAACTGGTGGTATCAAGCGCCTGATTCTACAGGAGAGTATGCTGTCGGTTGGAATGAAATCGA     

     AGGTGAATGGTATTACTTTAACCAAACGGGTATCTTGTTACAGAATCAATGGAAAAAATGGAACAATCA     

     TTGGTTCTATTTGACAGACTCTGGTGCTTCTGCTAAAAATTGGAAGAAAATCGCTGGAATCTGGTATTA     

     TTTTAACAAAGAAAACCAGATGGAAATTGGTTGGATTCAAGATAAAGAGCAGTGGTATTATTTGGATGT     

     TGATGGTTCTATGAAGACAGGATGGCTTCAATATATGGGGCAATGGTATTACTTTGCTCCATCAGGGGA     

     SP044  amino  acid  (SEQ  ID  NO:70)                                      

     NVQAQESSGNKIHFINVQEGGSDAIILESNGHFAMVDTGEDYDFPDGSDSRYPWREGIETSYKHVLTDR     

     VFRRLKELGVQKLDFILVTHTHSDHIGNVDELLSTYPVDRVYLKKYSDSRITNSERLWDNLYGYDKVLQ     

     TAAEKGVSVIQNITQGDAHFQFGDMDIQLYNYENETDSSGELKKIWDDNSNSLISVVKVNGKKIYLGGD     

     LDNVHGAEDKYGPLIGKVDLMKFNHHHDTNKSNTKDFIKNLSPSLIVQTSDSLPWKNGVDSEYVNWLKE     

     RGIERINAASKDYDATVFDIRKDGFVNISTSYKPIPSFQAGWHKSAYGNWWYQAPDSTGEYAVGWNEIE     

     GEWYYFNQTGILLQNQWKKWNNHWFYLTDSGASAKNWKKIAGIWYYFNKENQMEIGWIQDKEQWYYLDV     

     DGSMKTGWLQYMGQWYYFAPSGE                                                   

     SP045  nucleotide  (SEQ  ID  NO:71)                                       

     CTTGGGTGTAACCCATATCCAGCTCCTTCCAGTCTTGTCTTACTACTTTGTCAATGAATTGAAAAACCA     

     TGAACGCTTGTCTGACTACGCTTCAAGCAACAGCAACTACAACTGGGGATATGACCCTCAAAACTACTT     

     CTCCTTGACTGGTATGTACTCAAGCGATCCTAAGAATCCAGAAAAACGAATCGCAGAATTTAAAAACCT     

     CATCAACGAAATCCACAAACGTGGTATGGGAGCTATCCTAGATGTCGTTTATAACCACACAGCCAAAGT     

     CGATCTCTTTGAAGATTTGGAACCAAACTACTACCACTTTATGGATGCCGATGGCACACCTCGAACTAG     

     CTTTGGTGGTGGACGCTTGGGGACAACCCACCATATGACCAAACGGCTCCTAATTGACTCTATCAAATA     

     CCTAGTTGATACCTACAAAGTGGATGGCTTCCGTTTCGATATGATGGGAGACCATGACGCCGCTTCTAT     

     CGAAGAAGCTTACAAGGCTGCACGCGCCCTCAATCCAAACCTCATCATGCTTGGTGAAGGTTGGAGAAC     

     CTATGCCGGTGATGAAAACATGCCTACTAAAGCTGCTGACCAAGATTGGATGAAACATACCGATACTGT     

     CGCTGTCTTTTCAGATGACATCCGTAACAACCTCAAATCTGGTTATCCAAACGAAGGTCAACCTGCCTT     

     TATCACAGGTGGCAAGCGTGATGTCAACACCATCTTTAAAAATCTCATTGCTCAACCAACTAACTTTGA     

     AGCTGACAGCCCTGGAGATGTCATCCAATACATCGCAGCCCATGATAACTTGACCCTCTTTGACATCAT     

     TGCCCAGTCTATCAAAAAAGACCCAAGCAAGGCTGAGAACTATGCTGAAATCCACCGTCGTTTACGACT     

     TGGAAATCTCATGGTCTTGACAGCTCAAGGAACTCCATTTATCCACTCCGGTCAGGAATATGGACGTAC     

     TAAACAATTCCGTGACCCAGCCTACAAGACTCCAGTAGCAGAGGATAAGGTTCCAAACAAATCTCACTT     

     GTTGCGTGATAAGGACGGCAACCCATTTGACTATCCTTACTTCATCCATGACTCTTACGATTCTAGTGA     

     TGCAGTCAACAAGTTTGACTGGACTAAGGCTACAGATGGTAAAGCTTATCCTGAAAATGTCAAGAGCCG     

     TGACTATATGAAAGGTTTGATTGCCCTTCGTCAATCTACAGATGCCTTCCGACTTAAGAGTCTTCAAGA     

     TATCAAAGACCGTGTCCACCTCATCACTGTCCCAGGCCAAAATGGTGTGGAAAAAGAGGATGTAGTGAT     

     TGGCTACCAAATCACTGCTCCAAACGGCGATATCTACGCAGTCTTTGTCAATGCGGATGAAAAAGCTCG     

     CGAATTTAATTTGGGAACTGCCTTTGCACATCTAAGAAATGCGGAAGTTTTGGCAGATGAAAACCAAGC     

     AGGACCAGTCGGAATTGCCAACCCGAAAGGACTTGAATGGACTGAAAAAGGCTTGAAATTGAATGCCCT     

     TACAGCTACTGTTCTTCGAGTCTCTCAAAATGGAACTAGCCATGAGTCAACTGCAGAAGAGAAACCAGA     

     CTCAACCCCTTCCAAGCCTGAACATCAAAATGAAGCTTCTCACCCTGCACATCAAGACCCAGCTCCAGA     

     AGCTAGACCTGATTCTACTAAACCAGATGCCAAAGTAGCTGATGCGGAAAATAAACCTAGCCAAGCTAC     

     AGCTGATTCACAAGCTGAACAACCAGCACAAGAAGCACAAGCATCATCTGTAAAAGAAGCGGTTCGAAA     

     CGAATCGGTAGAAAACTCTAGCAAGGAAAATATACCTGCAACCCCAGATAAACAAGCTGAA             

     SP045  nucleotide  (SEQ  ID  NO:72)                                       

     LGVTHIQLLPVLSYYFVNELKNHERLSDYASSNSNYNWGYDPQNYFSLTGMYSSDPKNPEKRIAEFKNL     

     INEIHKRGMGAILDVVYNHTAKVDLFEDLEPNYYHFMDADGTPRTSFGGGRLGTTHHMTKRLLIDSIKY     

     LVDTYKVDGFRFDMMGDHDAASIEEAYKAARALNPNLIMLGEGWRTYAGDENMPTKAADQDWMKHTDTV     

     AVFSDDIRNNLKSGYPNEGQPAFITGGKRDVNTIFKNLIAQPTNFEADSPGDVIQYIAAHDNLTLFDII     

     AQSIKKDPSKAENYAEIHRRLRLGNLMVLTAQGTPFIHSGQEYGRTKQFRDPAYKTPVAEDKVPNKSHL     

     LRDKDGNPFDYPYFIHDSYDSSDAVNKFDWTKATDGKAYPENVKSRDYMKGLIALRQSTDAFRLKSLQD     

     IKDRVHLITVPGQNGVEKEDVVIGYQITAPNGDIYAVFVNADEKAREFNLGTAFAHLRNAEVLADENQA     

     GPVGIANPKGLEWTEKGLKLNALTATVLRVSQNGTSHESTAEEKPDSTPSKPEHQNEASHPAHQDPAPE     

     ARPDSTKPDAKVADAENKPSQATADSQAEQPAQEAQASSVKEAVRNESVENSSKENIPATPDKQAE        

     SP046  nucleotide  (SEQ  ID  NO:73)                                       

     TAGTGATGGTACTTGGCAAGGAAAACAGTATCTGAAAGAAGATGGCAGTCAAGCAGCAAATGAGTGGGT     

     TTTNGATACTCATTATCAATCTTGGTTCTATATAAAAGCAGATGCTAACTATGCTGAAAATGAATGGCT     

     AAAGCAAGGTGACGACTATTTTTACCTCAAATCTGGTGGCTATATGGCCAAATCAGAATGGGTAGAAGA     

     CAAGGGAGCCTTTTATTATCTTGACCAAGATGGAAAGATGAAAAGAAATGCTTGGGTAGGAACTTCCTA     

     TGTTGGTGCAACAGGTGCCAAAGTAATAGAAGACTGGGTCTATGATTCTCAATACGATGCTTGGTTTTA     

     TATCAAAGCAGATGGACAGCACGCAGAGAAAGAATGGCTCCAAATTAAAGGGAAGGACTATTATTTCAA     

     ATCCGGTGGTTATCTACTGACAAGTCAGTGGATTAATCAAGCTTATGTGAATGCTAGTGGTGCCAAAGT     

     ACAGCAAGGTTGGCTTTTTGACAAACAATACCAATCTTGGTTTTACATCAAAGAAAATGGAAACTATGC     

     TGATAAAGAATGGATTTTCGAGAATGGTCACTATTATTATCTAAAATCCGGTGGCTACATGGCAGCCAA     

     TGAATGGATTTGGGATAAGGAATCTTGGTTTTATCTCAAATTTGATGGGAAAATGGCTGAAAAAGAATG     

     GGTCTACGATTCTCATAGTCAAGCTTGGTACTACTTCAAATCCGGTGGTTACATGACAGCCAATGAATG     

     GATTTGGGATAAGGAATCTTGGTTTTACCTCAAATCTGATGGGAAAATAGCTGAAAAAGAATGGGTCTA     

     CGATTCTCATAGTCAAGCTTGGTACTACTTCAAATCTGGTGGCTACATGGCGAAAAATGAGACAGTAGA     

     TGGTTATCAGCTTGGAAGCGATGGTAAATGGCTTGGAGGAAAAACTACAAATGAAAATGCTGCTTACTA     

     TCAAGTAGTGCCTGTTACAGCCAATGTTTATGATTCAGATGGTGAAAAGCTTTCCTATATATCGCAAGG     

     TAGTGTCGTATGGCTAGATAAGGATAGAAAAAGTGATGACAAGCGCTTGGCTATTACTATTTCTGGTTT     

     GTCAGGCTATATGAAAACAGAAGATTTACAAGCGCTAGATGCTAGTAAGGACTTTATCCCTTATTATGA     

     GAGTGATGGCCACCGTTTTTATCACTATGTGGCTCAGAATGCTAGTATCCCAGTAGCTTCTCATCTTTC     

     TGATATGGAAGTAGGCAAGAAATATTATTCGGCAGATGGCCTGCATTTTGATGGTTTTAAGCTTGAGAA     

     TCCCTTCCTTTTCAAAGATTTAACAGAGGCTACAAACTACAGTGCTGAAGAATTGGATAAGGTATTTAG     

     TTTGCTAAACATTAACAATAGCCTTTTGGAGAACAAGGGCGCTACTTTTAAGGAAGCCGAAGAACATTA     

     CCATATCAATGCTCTTTATCTCCTTGCCCATAGTGCCCTAGAAAGTAACTGGGGAAGAAGTAAAATTGC     

     CAAAGATAAGAATAATTTCTTTGGCATTACAGCCTATGATACGACCCCTTACCTTTCTGCTAAGACATT     

     TGATGATGTGGATAAGGGAATTTTAGGTGCAACCAAGTGGATTAAGGAAAATTATATCGATAGGGGAAG     

     AACTTTCCTTGGAAACAAGGCTTCTGGTATGAATGTGGAATATGCTTCAGACCCTTATTGGGGCGAAAA     

     AATTGCTAGTGTGATGATGAAAATCAATGAGAAGCTAGGTGGCAAAGAT                         

     SP046  amino  acid  (SEQ  ID  NO:74)                                      

     SDGTWQGKQYLKEDGSQAANEWVXDTHYQSWFYIKADANYAENEWLKQGDDYFYLKSGGYMAKSEWVED     

     KGAFYYLDQDGKMKRNAWVGTSYVGATGAKVIEDWVYDSQYDAWFYIKADGQHAEKEWLQIKGKDYYFK     

     SGGYLLTSQWINQAYVNASGAKVQQGWLFDKQYQSWFYIKENGNYADKEWIFENGHYYYLKSGGYMAAN     

     EWIWDKESWFYLKFDGKMAEKEWVYDSHSQAWYYFKSGGYMTANEWIWDKESWFYLKSDGKIAEKEWVY     

     DSHSQAWYYFKSGGYMAKNETVDGYQLGSDGKWLGGKTTNENAAYYQVVPVTANNYDSDGEKLSYISQG     

     SVVWLDKDRKSDDKRLAITISGLSGYMKTEDLQALDASKDFIPYYESDGHRFYHYVAQNASIPVASHLS     

     DMEVGKKYYSADGLHFDGFKLENPFLFKDLTEATNYSAEELDKVFSLLNINNSLLENKGATFKEAEEHY     

     HINALYLLAHSALESNWGRSKIAKDKNNFFGITAYDTTPYLSAKTFDDVDKGILGATKWIKENYIDRGR     

     TFLGNKASGMNVEYASDPYWGEKIASVMMKINEKLGGKD                                   

     SP048  nucleotide  (SEQ  ID  NO:75)                                       

     TGGGATTCAATATGTCAGAGATGATACTAGAGATAAAGAAGAGGGAATAGAGTATGATGACGCTGACAA     

     TGGGGATATTATTGTAAAAGTAGCGACTAAACCTAAGGTAGTAACCAAGAAAATTTCAAGTACGCGAAT     

     TCGTTATGAAAAAGATGAAACAAAAGACCGTAGTGAAAATCCTGTTACAATTGATGGAGAGGATGGCTA     

     TGTAACTACGACAAGGACCTACGATGTTAATCCAGAGACTGGTTATGTTACCGAACAGGTTACTGTTGA     

     TAGAAAAGAAGCCACGGATACAGTTATCAAAGTTCCAGCTAAAAGCAAGGTTGAAGAAGTTCTTGTTCC     

     ATTTGCTACTAAATATGAAGCAGACAATGACCTTTCTGCAGGACAGGAGCAAGAGATTACTCTAGGAAA     

     GAATGGGAAAACAGTTACAACGATAACTTATAATGTAGATGGAAAGAGTGGACAAGTAACTGAGAGTAC     

     TTTAAGTCAAAAAAAAGACTCtCAAACAAGAGTTGTTAAAAAAAGaACCArkCCCCAAGTTCTTGTCCA     

     AGAAATTCCAATCGAAACAGAATATCTCGATGGCCCaACTCTTGATAAAAGTCAAGAAGTAGAAGAAGT     

     AGGAGAAATTGGTAAATTACTCTTACTACAATCTATACTGGTAGATGAACGTGATGGAACAATTGAAGA     

     AACTACTTCTCGTCAAATTACTAAAGAGATGGTAAAAAGACGTATAAGGAGAGGGACGAGAGAACCTGA     

     AAAAGTTGTTGTTCCTGAGCAATCATCTATTCCTTCGTATCCTGTATCTGTTACATCTAACCAAGGAAC     

     AGATGTAGCAGTAGAACCAGCTAAAGCAGTTGCTCCAACAACAGACTGGAAACAAGAAAATGGTATGTG     

     GTATTTTTATAATACTGATGGTTCCATGGCAACAGGTTGGGTACAAGTTAATAGTTCATGGTACTACCT     

     CAACAGCAACGGTTCTATGAAAGTCAATCAATGGTTCCAAGTTGGTGGTAAATGGTATTATGTAAATAC     

     ATCGGGTGAGTTAGCGGTCAATACAAGTATAGATGGCTATAGAGTCAATGATAATGGTGAATGGGTGCG     

     T                                                                         

     SP048  amino  acid  (SEQ  ID  NO:76)                                      

     GIQYVRDDTRDKEEGIEYDDADNGDIIVKVATKPKVVTKKISSTRIRYEKDETKDRSENPVTIDGEDGY     

     VTTTRTYDVNPETGYVTEQVTVDRKEATDTVIKVPAKSKVEEVLVPFATKYEADNDLSAGQEQEITLGK     

     NGKTVTTITYNVDGKSGQVTESTLSQKKDSQTRVVKKRTXPQVLVQEIPIETEYLDGPTLDKSQEVEEV     

     GEIGKLLLLQSILVDERDGTIEETTSRQITKEMVKRRIRRGTREPEKVVVPEQSSIPSYPVSVTSNQGT     

     DVAVEPAKAVAPTTDWKQENGMWYFYNTDGSMATGWVQVNSSWYYLNSNGSMKVNQWFQVGGKWYYVNT     

     SGELAVNTSIDGYRVNDNGEWVR                                                   

     SP049  nucleotide  (SEQ  ID  NO:77)                                       

     GGATAATAGAGAAGCATTAAAAACCTTTATGACGGGTGAAAATTTTTATCTCCAACATTATCTAGGAGC     

     ACATAGGGAAGAACTAAATGGAGAGCATGGCTATACCTTCCGTGTTTGGGCACCTAATGCTCAGGCTGT     

     TCACTTGGTTGGTGATTTTACCAACTGGATTGAAAATCAGATTCCAATGGTAAGAAATGATTTTGGGGT     

     CTGGGAAGTCTTTACCAATATGGCTCAAGAAGGGCATATTTACAAATATCATGTCACACGTCAAAATGG     

     TCATCAACTGATGAAGATTGACCCTTTTGCTGTCAGGTATGAGGCTCGTCCAGGAACAGGGGCAATCGT     

     AACAGAGCTTCCTGAGAAGAAATGGAAGGATGGACTTTGGCTGCCACGAAGAAAACGTTGGGGCTTTGA     

     AGAGCGTCCTGTCAATATTTATGAAGTTCACGCTGGATCATGGAAAAGAAATTCTGATGGCAGTCCTTA     

     TAGTTTTGCCCAGCTCAAGGATGAACTCATTCCTTATCTCGTTGAAATGAACTATACTCATATTGAGTT     

     TATGCCCTTGATGTCCCATCCTTTGGGCTTGAGTTGGGGGTATCAGCTTATGGGTTACTTCGCTTTAGA     

     GCATGCTTATGGCCGACCAGAGGAGTTTCAAGATTTTGTC                                  

     SP049  amino  acid  (SEQ  ID  NO:78)                                      

     DNREALKTFMTGENFYLQHYLGAHREELNGEHGYTFRVWAPNAQAVHLVGDFTNWIENQIPMVRNDFGV     

     WEVFTNMAQEGHIYKYHVTRQNGHQLMKIDPFAVRYEARPGTGAIVTELPEKKWKDGLWLARRKRWGFE     

     ERPVNIYEVHAGSWKRNSDGSPYSFAQLKDELIPYLVEMNYTHIEFMPLMSHPLGLSWGYQLMGYFALE     

     HAYGRPEEFQDFV                                                             

     SP050  nucleotide  (SEQ  ID  NO:79)                                       

     AGATTTTGTCGAGGAGTGTCATACCCATAATATTGGGGTTATTGTGGACTGGGTACCAGNTCACTTTAC     

     CATCAACGATGATGCCTTAGCCTATTATGATGGGACACCGACTTTTGAATACCAAGACCATAATAAGGC     

     TCATAACCATGGTTGGGGTGCCCTTAATTTTGACCTTGGAAAAAATGAAGTCCAGTCCTTCTTAATTTC     

     TTGCATTAAGCATTGGATTGATGTCTATCATTTGGATGGTATTCGTGTGGATGCTGTTAGCAACATGCT     

     CTATTTGGACTATGATGATGCTCCATGGACACCTAATAAAGATGGCGGAAATCTCAACTATGAAGGTTA     

     TTATTTCCTTCAGCGCTTGAATGAGGTTATTAAGTTAGAATATCCAGATGTGATGATGATTGCAGAAGA     

     AAGTTCGTCTGCGATCAAGATTACGGGAATGAAAGAGATTGGTGGTCTAGGATTTGACTACAAATGGAA     

     CATGGGCTGGATGAATGATATCCTCCGTTTCTACGAAGAAGATCCGATCTATCGTAAATATGACTTTAA     

     CCTGGTGACTTTCAGCTTTATGTATGTTTNCAAGGAGAATTATCTCTTGCCATTCTCGCACGATGAAGT     

     GGTTCATGGCAAGAAGAGTATGATGCATAAGATGTGGGGAGATCGTTACAATCAATTCGCAGGCTTGCG     

     CAATCTCTATACGTACCAAATTTGTCACCCTGGTAAGAAATTGCTCTTCATGGGTAGCGAATACGGTCA     

     ATTCCTAGAATGGAAATCTGAAGAACAGTTGGAATGGTCTAACCTAGAAGACCCAATGAATGCTAAGAT     

     GAAGTATTTCGCTTCTCAGCTAAACCAGTTTTACAAAGATCATCGCTGTCTGTGGGAAATTGATACCAG     

     CTATGATGGTATTGAAATCATTGATGCGGATAATCGAGACCAGAGTGTTCTTTCCTTTATTCGTAAGGG     

     TAAAAAGGGA                                                                

     SP050  amino  acid  (SEQ  ID  NO:80)                                      

     DFVEECHTHNIGVIVDWVPXHFTINDDALAYYDGTPTFEYQDHNKAHNHGWGALNFDLGKNEVQSFLIS     

     CIKHWIDVYHLDGIRVDAVSNMLYLDYDDAPWTPNKDGGNLMYEGYYFLQRLNEVIKLEYPDVMMIAEE     

     SSSAIKITGMKEIGGLGFDYKWNMGWMNDILRFYEEDPIYRKYDFNLVTFSFMYVXKENYLLPFSHDEV     

     VHGKKSMMHKMWGDRYNQFAGLRNLYTYQICHPGKKLLFMGSEYGQFLEWKSEEQLEWSNLEDPMNAKM     

     KYFASQLNQFYKDHRCLWEIDTSYDGIEIIDADNRDQSVLSFIRKGKKG                         

     SP051  nucleotide  (SEQ  ID  NO:81)                                       

     ATCTGTAGTTTATGCGGATGAAACACTTATTACTCATACTGCTGAGAAACCTAAAGAGGAAAAAATGAT     

     AGTAGAAGAAAAGGCTGATAAAGCTTTGGAAACTAAAAATATAGTTGAAAGGACAGAACAAAGTGAACC     

     TAGTTCAACTGAGGCTATTGCATCTGAGNAGAAAGAAGATGAAGCCGTAACTCCAAAAGAGGAAAAAGT     

     GTCTGCTAAACCGGAAGAAAAAGCTCCAAGGATAGAATCACAAGCTTCAAATCAAGAAAAACCGCTCAA     

     GGAAGATGCTAAAGCTGTAACAAATGAAGAAGTGAATCAAATGATTGAAGACAGGAAAGTGGATTTTAA     

     TCAAAATTGGTACTTTAAACTCAATGCAAATTCTAAGGAAGCCATTAAACCTGATGCAGACGTATCTAC     

     GTGGAAAAAATTAGATTTACCGTATGACTGGAGTATCTTTAACGATTTCGATCATGAATCTCCTGCACA     

     AAATGAAGGTGGACAGCTCAACGGTGGGGAAGCTTGGTATCGCAAGACTTTCAAACTAGATGAAAAAGA     

     CCTCAAGAAAAATGTTCGCCTTACTTTTGATGGCGTCTACATGGATTCTCAAGTTTATGTCAATGGTCA     

     GTTAGTGGGGCATTATCCAAATGGTTATAACCAGTTCTCATATGATATCACCAAATACCTTCAAAAAGA     

     TGGTCGTGAGAATGTGATTGCTGTCCATGCAGTCAACAAACAGCCAAGTAGCCGTTGGTATTCAGGAAG     

     TGGTATCTATCGTGATGTGACTTTACAAGTGACAGATAAGGTGCATGTTGAGAAAAATGGGACAACTAT     

     TTTAACACCAAAACTTGAAGAACAACAACATGGCAAGGTTGAAACTCATGTGACCAGCAAAATCGTCAA     

     TACGGACGACAAAGACCATGAACTTGTAGCCGAATATCAAATCGTTGAACGAGGTGGTCATGCTGTAAC     

     AGGCTTAGTTCGTACAGCGAGTCGTACCTTAAAAGCACATGAATCAACAAGCCTAGATGCGATTTTAGA     

     AGTTGAAAGACCAAAACTCTGGACTGTTTTAAATGACAAACCTGCCTTGTACGAATTGATTACGCGTGT     

     TTACCGTGACGGTCAATTGGTTGATGCTAAGAAGGATTTGTTTGGTTACCGTTACTATCACTGGACTCC     

     AAATGAAGGTTTCTCTTTGAATGGTGAACGTATTAAATTCCATGGAGTATCCTTGCACCACGACCATGG     

     GGCGCTTGGAGCAGAAGAAAACTATAAAGCAGAATATCGCCGTCTCAAACAAATGAAGGAGATGGGAGT     

     TAACTCCATCCGTACAACCCACAACCCTGCTAGTGAGCAAACCTTGCAAATCGCAGCAGAACTAGGTTT     

     ACTCGTTCAGGAAGAGGCCTTTGATACGTGGTATGGTGGCAAGAAACCTTATGACTATGGACGTTTCTT     

     TGAAAAAGATGCCACTCACCCAGAAGCTCGAAAAGGTGAAAAATGGTCTGATTTTGACCTACGTACCAT     

     GGTCGAAAGAGGCAAAAACAACCCTGCTATCTTCATGTGGTCAATTGGTAATGAAATAGGTGAAGCTAA     

     TGGTGATGCCCACTCTTTAGCAACTGTTAAACGTTTGGTTAAGGTTATCAAGGATGTTGATAAGACTCG     

     CTATGTTACCATGGGAGCAGATAAATTCCGTTTCGGTAATGGTAGCGGAGGGCATGAGAAAATTGCTGA     

     TGAACTCGATGCTGTTGGATTTAACTATTCTGAAGATAATTACAAAGCCCTTAGAGCTAAGCATCCAAA     

     ATGGTTGATTTATGGATCAGAAACATCTTCAGCTACCCGTACACGTGGAAGTTACTATCGCCCTGAACG     

     TGAATTGAAACATAGCAATGGACCTGAGCGTAATTATGAACAGTCAGATTATGGAAATGATCGTGTGGG     

     TTGGGGGAAAACAGCAACCGCTTCATGGACTTTTGACCGTGACAACGCTGGCTATGCTGGACAGTTTAT     

     CTGGACAGGTACGGACTATATTGGTGAACCTACACCATGGCACAACCAAAATCAAACTCCTGTTAAGAG     

     CTCTTACTTTGGTATCGTAGATACAGCCGGCATTCCAAAACATGACTTCTATCTCTACCAAAGC          

     SP051  amino  acid  (SEQ  ID  NO:82)                                      

     SVVYADETLITHTAEKPKEEKMIVEEKADKALETKNIVERTEQSEPSSTEAIASEXKEDEAVTPKEEKV     

     SAKPEEKAPRIESQASNQEKPLKEDAKAVTNEEVNQMIEDRKVDFNQNWYFKLNANSKEAIKPDADVST     

     WKKLDLPYDWSIFNDFDHESPAQNEGGQLNGGEAWYRKTFKLDEKDLKKNVRLTFDGVYMDSQVYVNGQ     

     LVGHYPNGYNQFSYDITKYLQKDGRENVIAVHAVNKQPSSRWYSGSGIYRDVTLQVTDKVHVEKNGTTI     

     LTPKLEEQQHGKVETHVTSKIVNTDDKDHELVAEYQIVERGGHAVTGLVRTASRTLKAHESTSLDAILE     

     VERPKLWTVLNDKPALYELITRVYRDGQLVDAKKDLFGYRYYHWTPNEGFSLNGERIKFHGVSLHHDHG     

     ALGAEENYKAEYRRLKQMKEMGVNSIRTTHNPASEQTLQIAAELGLLVQEEAFDTWYGGKKPYDYGRFF     

     EKDATHPEARKGEKWSDFDLRTMVERGKNNPAIFMWSIGNEIGEANGDAHSLATVKRLVKVIKDVDKTR     

     YVTMGADKFRFGNGSGGHEKIADELDAVGFNYSEDNYKALRAKHPKWLIYGSETSSATRTRGSYYRPER     

     ELKHSNGPERNYEQSDYGNDRVGWGKTATASWTFDRDNAGYAGQFIWTGTDYIGEPTPWHNQNQTPVKS     

     SYFGIVDTAGIPKHDFYLYQS                                                     

     SP052  nucleotide  (SEQ  ID  NO:83)                                       

     TTACTTTGGTATCGTAGATACAGCCGGCATTCCAAAACATGACTTCTATCTCTACCAAAGCCAATGGGT     

     TTCTGTTAAGAAGAAACCGATGGTACACCTTCTTCCTCACTGGAACTGGGAAAACAAAGAATTAGCATC     

     CAAAGTAGCTGACTCAGAAGGTAAGATTCCAGTTCGTGCTTATTCGAATGCTTCTAGTGTAGAATTGTT     

     CTTGAATGGAAAATCTCTTGGTCTTAAGACTTTCAATAAAAAACAAACCAGCGATGGGCGGACTTACCA     

     AGAAGGTGCAAATGCTAATGAACTTTATCTTGAATGGAAAGTTGCCTATCAACCAGGTACCTTGGAAGC     

     AATTGCTCGTGATGAATCTGGCAAGGAAATTGCTCGAGATAAGATTACGACTGCTGGTAAGCCAGCGGC     

     AGTTCGTCTTATTAAGGAAGACCATGCGATTGCAGCAGATGGAAAAGACTTGACTTACATCTACTATGA     

     AATTGTTGACAGCCAGGGGAATGTGGTTCCAACTGCTAATAATCTGGTTCGCTTCCAATTGCATGGCCA     

     AGGTCAACTGGTCGGTGTAGATAACGGAGAACAAGCCAGCCGTGAACGCTATAAGGCGCAAGCAGATGG     

     TTCTTGGATTCGTAAAGCATTTAATGGTAAAGGTGTTGCCATTGTCAAATCAACTGAACAAGCAGGGAA     

     ATTCACCCTGACTGCCCACTCTGATCTCTTGAAATCGAACCAAGTCACTGTCTTTACTGGTAAGAAAGA     

     AGGACAAGAGAAGACTGTTTTGGGGACAGAAGTGCCAAAAGTACAGACCATTATTGGAGAGGCACCTGA     

     AATGCCTACCACTGTTCCGTTTGTATACAGTGATGGTAGCCGTGCAGAACGTCCTGTAACCTGGTCTTC     

     AGTAGATGTGAGCAAGCCTGGTATTGTAACGGTGAAAGGTATGGCTGACGGACGAGAAGTAGAAGCTCG     

     TGTAGAAGTGATTGCTCTTAAATCAGAGCTACCAGTTGTGAAACGTATTGCTCCAAATACTGACTTGAA     

     TTCTGTAGACAAATCTGTTTCCTATGTTTTGATTGATGGAAGTGTTGAAGAGTATGAAGTGGACAAGTG     

     GGAGATTGCCGAAGAAGATAAAGCTAAGTTAGCAATTCCAGGTTCTCGTATTCAAGCGACCGGTTATTT     

     AGAAGGTCAACCAATTCATGCAACCCTTGTGGTAGAAGAAGGCAATCCTGCGGCACCTGCAGTACCAAC     

     TGTAACGGTTGGTGGTGAGGCAGTAACAGGTCTTACTAGTCAAAAACCAATGCAATACCGCACTCTTGC     

     TTATGGAGCTAAGTTGCCAGAAGTCACAGCAAGTGCTAAAAATGCAGCTGTTACAGTTCTTCAAGCAAG     

     CGCAGCAAACGGCATGCGTGCGAGCATCTTTATTCAGCCTAAAGATGGTGGCCCTCTTCAAACCTATGC     

     AATTCAATTCCTTGAAGAAGCGCCAAAAATTGCTCACTTGAGCTTGCAAGTGGAAAAAGCTGACAGTCT     

     CAAAGAAGACCAAACTGTCAAATTGTCGGTTCGAGCTCACTATCAAGATGGAACGCAAGCTGTATTACC     

     AGCTGATAAAGTAACCTTCTCTACAAGTGGTGAAGGGGAAGTCGCAATTCGTAAAGGAATGCTTGAGTT     

     GCATAAGCCAGGAGCAGTCACTCTGAACGCTGAATATGAGGGAGCTAAAGACCAAGTTGAACTCACTAT     

     CCAAGCCAATACTGAGAAGAAGATTGCGCAATCCATCCGTCCTGTAAATGTAGTGACAGATTTGCATCA     

     GGAACCAAGTCTTCCAGCAACAGTAACAGTTGAGTATGACAAAGGTTTCCCTAAAACTCATAAAGTCAC     

     TTGGCAAGCTATTCCGAAAGAAAAACTAGACTCCTATCAAACATTTGAAGTACTAGGTAAAGTTGAAGG     

     AATTGACCTTGAAGCGCGTGCAAAAGTCTCTGTAGAAGGTATCGTTTCAGTTGAAGAAGTCAGTGTGAC     

     AACTCCAATCGCAGAAGCACCACAATTACCAGAAAGTGTTCGGACATATGATTCAAATGGTCACGTTTC     

     ATCAGCTAAGGTTGCATGGGATGCGATTCGTCCAGAGCAATACGCTAAGGAAGGTGTCTTTACAGTTAA     

     TGGTCGCTTAGAAGGTACGCAATTAACA                                              

     SP052  amino  acid  (SEQ  ID  NO:84)                                      

     YFGIVDTAGIPKHDFYLYQSQWVSVKKKPMVHLLPHWWWENKELASKVADSEGKIPVRAYSNASSVELF     

     LNGKSLGLKTFNKKQTSDGRTYQEGANANELYLEWKVAYQPGTLEAIARDESGKEIARDKITTAGKPAA     

     VRLIKEDHAIAADGKDLTYIYYEIVDSQGNVVPTANNLVRFQLHGQGQLVGVDNGEQASRERYKAQADG     

     SWIRKAFNGKGVAIVKSTEQAGKFTLTAHSDLLKSNQVTVFTGKKEGQEKTVLGTEVPKVQTIIGEAPE     

     MPTTVPFVYSDGSRAERPVTWSSVDVSKPGIVTVKGMADGREVEARVEVIALKSELPVVKRIAPNTDLN     

     SVDKSVSYVLIDGSVEEYEVDKWEIAEEDKAKLAIPGSRIQATGYLEGQPIHATLVVEEGNPAAPAVPT     

     VTVGGEAVTGLTSQKPMQYRTLAYGAKLPEVTASAKNAAVTVLQASAANGMRASIFIQPKDGGPLQTYA     

     IQFLEEAPKIAHLSLQVEKADSLKEDQTVKLSVRAHYQDGTQAVLPADKVTFSTSGEGEVAIRKGMLEL     

     HKPGAVTLNAEYEGAKDQVELTIQANTEKKIAQSIRPVNVVTDLHQEPSLPATVTVEYDKGFPKTHKVT     

     WQAIPKEKLDSYQTFEVLGKVEGIDLEARAKVSVEGIVSVEEVSVTTPIAEAPQLPESVRTYDSNGHVS     

     SAKVAWDAIRPEQYAKEGVFTVNGRLEGTQLT                                          

     SP053  nucleotide  (SEQ  ID  NO:85)                                       

     AGCTAAGGTTGCATGGGATGCGATTCGTCCAGAGCAATACGCTAAGGAAGGTGTCTTTACAGTTAATGG     

     TCGCTTAGAAGGTACGCAATTAACAACTAAACTTCATGTTCGCGTATCTGCTCAAACTGAGCAAGGTGC     

     AAACATTTCTGACCAATGGACCGGTTCAGAATTGCCACTTGCCTTTGCTTCAGACTCAAATCCAAGCGA     

     CCCAGTTTCAAATGTTAATGACAAGCTCATTTCCTACAATAACCAACCAGCCAATCGTTGGACAAACTG     

     GAATCGTACTAATCCAGAAGCTTCAGTCGGTGTTCTGTTTGGAGATTCAGGTATCTTGAGCAAACGCTC     

     CGTTGATAATCTAAGTGTCGGATTCCATGAAGACCATGGAGTTGGTGTACCGAAGTCTTATGTGATTGA     

     GTATTATGTTGGTAAGACTGTCCCAACAGCTCCTAAAAACCCTAGTTTTGTTGGTAATGAGGACCATGT     

     CTTTAATGATTCTGCCAACTGGAAACCAGTTACTAATCTAAAAGCCCCTGCTCAACTCAAGGCTGGAGA     

     AATGAACCACTTTAGCTTTGATAAAGTTGAAACCTATGCTGTTCGTATTCGCATGGTTAAAGCAGATAA     

     CAAGCGTGGAACGTCTATCACAGAGGTACAAATCTTTGCGAAACAAGTTGCGGCAGCCAAGCAAGGACA     

     AACAAGAATCCAAGTTGACGGCAAAGACTTAGCAAACTTCAACCCTGATTTGACAGACTACTACCTTGA     

     GTCTGTAGATGGAAAAGTTCCGGCAGTCACAGCAAGTGTTAGCAACAATGGTCTCGCTACCGTCGTTCC     

     AAGCGTTCGTGAAGGTGAGCCAGTTCGTGTCATCGCGAAAGCTGAAAATGGCGACATCTTAGGAGAATA     

     CCGTCTGCACTTCACTAAGGATAAGAGCTTACTTTCTCATAAACCAGTTGCTGCGGTTAAACAAGCTCG     

     CTTGCTACAAGTAGGTCAAGCACTTGAATTGCCGACTAAGGTTCCAGTTTACTTCACAGGTAAAGACGG     

     CTACGAAACAAAAGACCTGACAGTTGAATGGGAAGAAGTTCCAGCGGAAAATCTGACAAAAGCAGGTCA     

     ATTTACTGTTCGAGGCCGTGTCCTTGGTAGTAACCTTGTTGCTGAGATCACTGTACGAGTGACAGACAA     

     ACTTGGTGAGACTCTTTCAGATAACCCTAACTATGATGAAAACAGTAACCAGGCCTTTGCTTCAGCAAC     

     CAATGATATTGACAAAAACTCTCATGACCGCGTTGACTATCTCAATGACGGAGATCATTCAGAAAATCG     

     TCGTTGGACAAACTGGTCACCAACACCATCTTCTAATCCAGAAGTATCAGCGGGTGTGATTTTCCGTGA     

     AAATGGTAAGATTGTAGAACGGACTGTTACACAAGGAAAAGTTCAGTTCTTTGCAGATAGTGGTACGGA     

     TGCACCATCTAAACTCGTTTTAGAACGCTATGTCGGTCCAGAGTTTGAAGTGCCAACCTACTATTCAAA     

     CTACCAAGCCTACGACGCAGACCATCCATTCAACAATCCAGAAAATTGGGAAGCTGTTCCTTATCGTGC     

     GGATAAAGACATTGCAGCTGGTGATGAAATCAACGTAACATTTAAAGCTATCAAAGCCAAAGCTATGAG     

     ATGGCGTATGGAGCGTAAAGCAGATAAGAGCGGTGTTGCGATGATTGAGATGACCTTCCTTGCACCAAG     

     TGAATTGCCTCAAGAAAGCACTCAATCAAAGATTCTTGTAGATGGAAAAGAACTTGCTGATTTCGCTGA     

     AAATCGTCAAGACTATCAAATTACCTATAAAGGTCAACGGCCAAAAGTCTCAGTTGAAGAAAACAATCA     

     AGTAGCTTCAACTGTGGTAGATAGTGGAGAAGATAGCTTTCCAGTACTTGTTCGCCTCGTTTCAGAAAG     

     TGGAAAACAAGTCAAGGAATACCGTATCCACTTGACTAAGGAAAAACCAGTTTCTGAGAAGACAGTTGC     

     TGCTGTACAAGAAGATCTTCCAAAAATCGAATTTGTTGAAAAAGATTTGGCATACAAGACAGTTGAGAA     

     AAAAGATTCAACACTGTATCTAGGTGAAACTCGTGTAGAACAAGAAGGAAAAGTTGGAAAAGAACGTAT     

     CTTTACAGCGATTAATCCTGATGGAAGTAAGGAAGAAAAACTCCGTGAAGTGGTAGAAGTTCCGACAGA     

     CCGCATCGTCTTGGTTGGAACCAAACCAGTAGCTCAAGAAGCTAAAAAACCACAAGTGTCAGAAAAAGC     

     AGATACAAAACCAATTGATTCAAGTGAAGCTAGTCAAACTAATAAAGCCCAG                      

     SP053  amino  acid  (SEQ  ID  NO:86)                                      

     AKVAWDAIRPEQYAKEGVFTVNGRLEGTQLTTKLHVRVSAQTEQGANISDQWTGSELPLAFASDSNPSD     

     PVSNVNDKLISYNNQPANRWTNWNRTNPEASVGVLFGDSGILSKRSVDNLSVGFHEDHGVGVPKSYVIE     

     YYVGKTVPTAPKNPSFVGNEDHVFNDSAWWKPVTNLKAPAQLKAGEMNHFSFDKVETYAVRIRMVKADN     

     KRGTSITEVQIFAKQVAAAKQGQTRIQVDGKDLANFNPDLTDYYLESVDGKVPAVTASVSNNGLATVVP     

     SVREGEPVRVIAKAENGDILGEYRLHFTKDKSLLSHKPVAAVKQARLLQVGQALELPTKVPVYFTGKDG     

     YETKDLTVEWEEVPAENLTKAGQFTVRGRVLGSNLVAEITVRVTDKLGETLSDNPNYDENSNQAFASAT     

     NDIDKNSHDRVDYLNDGDHSENRRWTNWSPTPSSNPEVSAGVIFRENGKIVERTVTQGKVQFFADSGTD     

     APSKLVLERYVGPEFEVPTYYSNYQAYDADHPFNNPENWEAVPYRADKDIAAGDEINVTFKAIKAKAMR     

     WRMERKADKSGVAMIEMTFLAPSELPQESTQSKILVDGKELADFAENRQDYQITYKGQRPKVSVEENNQ     

     VASTVVDSGEDSFPVLVRLVSESGKQVKEYRIHLTKEKPVSEKTVAAVQEDLPKIEFVEKDLAYKPVEK     

     KDSTLYLGETRVEQEGKVGKERIFTAINPDGSKEEKLREVVEVPTDRIVLVGTKPVAQEAKKPQVSEKA     

     DTKPIDSSEASQTNKAQ                                                         

     SP054  nucleotide  (SEQ  ID  NO:87)                                       

     CTATCACTATGTAAATAAAGAGATTATTTCACAAGAAGCTAAAGATTTAATTCAGACAGGAAAGCCTGA     

     CAGGAATGAAGTTGTATATGGTTTGGTGTATCAAAAAGATCAGTTGCCTCAAACAGGGACAGAA          

     SP054  amino  acid  (SEQ  ID  NO:88)                                      

     YHYVNKEIISQEAKDLIQTGKPDRNEVVYGLVYQKDQLPQTGTE                              

     SP055  nucleotide  (SEQ  ID  NO:89)                                       

     TGAGACTCCTCAATCAATAACAAATCAGGAGCAAGCTAGGACAGAAAACCAAGTAGTAGAGACAGAGGA     

     AGCTCCAAAAGAAGAAGCACCTAAAACAGAAGAAAGTCCAAAGGAAGAACCAAAATCGGAGGTAAAACC     

     TACTGACGACACCCTTCCTAAAGTAGAAGAGGGGAAAGAAGATTCAGCAGAACCAGCTCCAGTTGAAGA     

     AGTAGGTGGAGAAGTTGAGTCAAAACCAGAGGAAAAAGTAGCAGTTAAGCCAGAAAGTCAACCATCAGA     

     CAAACCAGCTGAGGAATCAAAAGTTGAACAAGCAGGTGAACCAGTCGCGCCAAGAGAAGACGAAAAGGC     

     ACCAGTCGAGCCAGAAAAGCAACCAGAAGCTCCTGAAGAAGAGAAGGCTGTAGAGGAAACACCGAAACA     

     AGAAGAGTCAACTCCAGATACCAAGGCTGAAGAAACTGTAGAACCAAAAGAGGAGACTGTTAATCAATC     

     TATTGAACAACCAAAAGTTGAAACGCCTGCTGTAGAAAAACAAACAGAACCAACAGAGGAACCAAAAGT     

     TGAACAAGCAGGTGAACCAGTCGCGCCAAGAGAAGACGAACAGGCACCAACGGCACCAGTTGAGCCAGA     

     AAAGCAACCAGAAGTTCCTGAAGAAGAGAAGGCTGTAGAGGAAACACCGAAACCAGAAGATAAAATAAA     

     GGGTATTGGTACTAAAGAACCAGTTGATAAAAGTGAGTTAAATAATCAAATTGATAAAGCTAGTTCAGT     

     TTCTCCTACTGATTAT                                                          

     SP055  amino  acid  (SEQ  ID  NO:90)                                      

     ETPQSITNQEQARTENQVVETEEAPKEEAPKTEESPKEEPKSEVKPTDDTLPKVEEGKEDSAEPAPVEE     

     VGGEVESKPEEKVAVKPESQPSDKPAEESKVEQAGEPVAPREDEKAPVEPEKQPEAPEEEKAVEETPKQ     

     EESTPDTKAEETVEPKEETVNQSIEQPKVETPAVEKQTEPTEEPKVEQAGEPVAPREDEQAPTAPVEPE     

     KQPEVPEEEKAVEETPKPEDKIKGIGTKEPVDKSELNNQIDKASSVSPTDY                       

     SP056  nucleotide  (SEQ  ID  NO:91)                                       

     GGATGCTCAAGAAACTGCGGGAGTTCACTATAAATATGTGGCAGATTCAGAGCTATCATCAGAAGAAAA     

     GAAGCAGCTTGTCTATGATATTCCGACATACGTGGAGAATGATGATGAAACTTATTATCTTGTTTATAA     

     GTTAAATTCTCAAAATCAACTGGCGGAATTGCCAAATACTGGAAGCAAGAATGAGAGGCAA             

     SP056  amino  acid  (SEQ  ID  NO:92)                                      

     DAQETAGVHYKYVADSELSSEEKKQLVYDIPTYVENDDETYYLVYKLNSQNQLAELPNTGSKNERQ        

     SP057  nucleotide  (SEQ  ID  NO:93)                                       

     CGACAAAGGTGAGACTGAGGTTCAACCAGAGTCGCCAGATACTGTGGTAAGTGATAAAGGTGAACCAGA     

     GCAGGTAGCACCGCTTCCAGAATATAAGGGTAATATTGAGCAAGTAAAACCTGAAACTCCGGTTGAGAA     

     GACCAAAGAACAAGGTCCAGAAAAAACTGAAGAAGTTCCAGTAAAACCAACAGAAGAAACACCAGTAAA     

     TCCAAATGAAGGTACTACAGAAGGAACCTCAATTCAAGAAGCAGAAAATCCAGTTCAACCTGCAGAAGA     

     ATCAACAACGAATTCAGAGAAAGTATCACCAGATACATCTAGCAAAAATACTGGGGAAGTGTCCAGTAA     

     TCCTAGTGATTCGACAACCTCAGTTGGAGAATCAAATAAACCAGAACATAATGACTCTAAAAATGAAAA     

     TTCAGAAAAAACTGTAGAAGAAGTTCCAGTAAATCCAAATGAAGGCACAGTAGAAGGTACCTCAAATCA     

     AGAAACAGAAAAACCAGTTCAACCTGCAGAAGAAACACAAACAAACTCTGGGAAAATAGCTAACGAAAA     

     TACTGGAGAAGTATCCAATAAACCTAGTGATTCAAAACCACCAGTTGAAGAATCAAATCAACCAGAAAA     

     AAACGGAACTGCAACAAAACCAGAAAATTCAGGTAATACAACATCAGAGAATGGACAAACAGAACCAGA     

     ACCATCAAACGGAAATTCAACTGAGGATGTTTCAACCGAATCAAACACATCCAATTCAAATGGAAACGA     

     AGAAATTAAACAAGAAAATGAACTAGACCCTGATAAAAAGGTAGAAGAACCAGAGAAAACACTTGAATT     

     AAGAAAT                                                                   

     SP057  amino  acid  (SEQIDNO:94)                                          

     DKGETEVQPESPDTVVSDKGEPEQVAPLPEYKGNIEQVKPETPVEKTKEQGPEKTEEVPVKPTEETPVN     

     PNEGTTEGTSIQEAENPVQPAEESTTNSEKVSPDTSSKNTGEVSSNPSDSTTSVGESNKPEHNDSKNEN     

     SEKTVEEVPVNPNEGTVEGTSNQETEKPVQPAEETQTNSGKIANENTGEVSNKPSDSKPPVEESNQPEK     

     NGTATKPENSGNTTSENGQTEPEPSNGNSTEDVSTESNTSNSNGNEEIKQENELDPDKKVEEPEKTLEL     

     RN                                                                        

     SP058  nucleotide  (SEQ  ID  NO:95)                                       

     AAATCAATTGGTAGCACAAGATCCAAAAGCACAAGATAGCACTAAACTGACTGCTGAAAAATCAACTGT     

     TAAAGCACCTGCTCAAAGAGTAGATGTAAAAGATATAACTCATTTAACAGATGAAGAAAAAGTTAAGGT     

     TGCTATTTTACAAGCAAATGGTTCAGCATTAGACGGAGCGACAATCAATGTAGCTGGAGATGGTACAGC     

     AACAATCACATTCCCAGATGGTTCAGTAGTGACGATTCTAGGAAAAGATACAGTTCAACAATCTGCGAA     

     AGGTGAATCTGTAACTCAAGAAGCTACACCAGAGTATAAGCTAGAAAATACACCAGGTGGAGATAAGGG     

     AGGCAATACTGGAAGCTCAGATGCTAATGCGAATGAAGGCGGTGGTAGCCAGGCGGGTGGATCAGCTCA     

     CACAGGTTCACAAAACTCAGCTCAATCACAAGCTTCTAAGCAATTAGCTACTGAAAAAGAATCAGCTAA     

     AAATGCCATTGAAAAAGCAGCCAAGGACAAGCAGGATGAAATCAAAGGCGCACCGCTTTCTGATAAAGA     

     AAAAGCAGAACTTTTAGCAAGAGTGGAAGCAGAAAAACAAGCAGCTCTCAAAGAGATTGAAAATGCGAA     

     AACTATGGAAGATGTGAAGGAAGCAGAAACGATTGGAGTGCAAGCCATTGCCATGGTTACAGTTCCTAA     

     GAGACCAGTGGCTCCTAAT                                                       

     SP058  amino  acid  (SEQIDNO:96)                                          

     NQLVAQDPKAQDSTKLTAEKSTVKAPAQRVDVKDITHLTDEEKVKVAILQANGSALDGATINVAGDGTA     

     TITFPDGSVVTILGKDTVQQSAKGESVTQEATPEYKLENTPGGDKGGNTGSSDANANEGGGSQAGGSAH     

     TGSQNSAQSQASKQLATEKESAKNAIEKAAKDKQDEIKGAPLSDKEKAELLARVEAEKQAALKEIENAK     

     TMEDVKEAETIGVQAIAMVTVPKRPVAPN                                             

     SP059  nucleotide  (SEQ  ID  NO:97)                                       

     CAAACAGTCAGCTTCAGGAACGATTGAGGTGATTTCACGAGAAAATGGCTCTGGGACACGGGGTGCCTT     

     CACAGAAATCACAGGGATTCTCAAAAAAGACGGTGATAAAAAAATTGACAACACTGCCAAAACAGCTGT     

     GATTCAAAATAGTACAGAAGGTGTTCTCTCAGCAGTTCAAGGGAATGCTAATGCTATCGGCTACATCTC     

     CTTGGGATCTTTAACGAAATCTGTCAAGGCTTTAGAGATTGATGGTGTCAAGGCTAGTCGAGACACAGT     

     TTTAGATGGTGAATACCCTCTTCAACGTCCCTTCAACATTGTTTGGTCTTCTAATCTTTCCAAGCTAGG     

     TCAAGATTTTATCAGCTTTATCCACTCCAAACAAGGTCAACAAGTGGTCACAGATAATAAATTTATTGA     

     AGCTAAAACCGAAACCACGGAATATACAAGCCAACACTTATCAGGCAAGTTGTCTGTTGTAGGTTCCAC     

     TTCAGTATCTTCTTTAATGGAAAAATTAGCAGAAGCTTATAAAAAAGAAAATCCAGAAGTTACGATTGA     

     TATTACCTCTAATGGGTCTTCAGCAGGTATTACCGCTGTTAAGGAGAAAACCGCTGATATTGGTATGGT     

     TTCTAGGGAATTAACTCCTGAAGAAGGTAAGAGTCTCACCCATGATGCTATTGCTTTAGACGGTATTGC     

     TGTTGTGGTCAATAATGACAATAAGGCAAGCCAAGTCAGTATGGCTGAACTTGCAGACGTTTTTAGTGG     

     CAAATTAACCACCTGGGACAAGATTAAA                                              

     BP059  amino  acid  (SEQ  ID  NO:98)                                      

     KQSASGTIEVISRENGSGTRGAFTEITGILKKDGDKKIDNTAKTAVIQNSTEGVLSAVQGNANAIGYIS     

     LGSLTKSVKALEIDGVKASRDTVLDGEYPLQRPFNIVWSSNLSKLGQDFISFIHSKQGQQVVTDNKFIE     

     AKTETTEYTSQHLSGKLSVVGSTSVSSLMEKLAEAYKKENPEVTIDITSNGSSAGITAVKEKTADIGMV     

     SRELTPEEGKSLTHDAIALDGIAVVVNNDNKASQVSMAELADVFSGKLTTWDKIK                   

     SP060  nucleotide  (SEQ  ID  NO:99)                                       

     ATTCGATGATGCGGATGAAAAGATGACCCGTGATGAAATTGCCTATATGCTGACAAATAGTGAAGAAAC     

     ATTGGATGCTGATGAGATTGAGATGCTACAAGGTGTCTTTTCGCTCGATGAACTGATGGCACGAGAGGT     

     TATGGTTCCTCGAACGGATGCCTTTATGGTGGATATTCAGGATGATAGTCAAGCCATTATCCAAAGTAT     

     TTTAAAACAAAATTATTCTCGTATCCCGGTTTATGATGGGGATAAGGACAATGTAATTGGAATCATTCA     

     CACCAAGAGTCTCCTTAAGGCAGGCTTTGTGGACGGTTTTGACAATATTGTTTGGAAGAGAATTTTACA     

     AGATCCACTTTTTGTACCTGAAACTATTTTTGTGGATGACTTGCTAAAAGAACTGCGAAATACCCAAAG     

     ACAAATG                                                                   

     SP060  amino  acid  (SEQ  ID  NO:100)                                     

     FDDADEKMTRDEIAYMLTNSEETLDADEIEMLQGVFSLDELMAREVMVPRTDAFMVDIQDDSQAIIQSI     

     LKQNYSRIPVYDGDKDNVIGIIHTKSLLKAGFVDGFDNIVWKRILQDPLFVPETIFVDDLLKELRNTQR     

     QM                                                                        

     SP062  nucleotide  (SEQ  ID  NO:101)                                      

     GGAGAGTCGATCAAAAGTAGATGAAGCTGTGTCTAAGTTTGAAAAGGACTCATCTTCTTCGTCAAGTTC     

     AGACTCTTCCACTAAACCGGAAGCTTCAGATACAGCGAAGCCAAACAAGCCGACAGAACCAGGAGAAAA     

     GGTAGCAGAAGCTAAGAAGAAGGTTGAAGAAGCTGAGAAAAAAGCCAAGGATCAAAAAGAAGAAGATCG     

     TCGTAACTACCCAACCATTACTTACAAAACGCTTGAACTTGAAATTGCTGAGTCCGATGTGGAAGTTAA     

     AAAAGCGGAGCTTGAACTAGTAAAAGTGAAAGCTAACGAACCTCGAGACGAGCAA                   

     SP062  amino  acid  (SEQ  ID  NO:102)                                     

     ESRSKVDEAVSKFEKDSSSSSSSDSSTKPEASDTAKPNKPTEPGEKVAEAKKKVEEAEKKAKDQKEEDR     

     RNYPTITYKTLELEIAESDVEVKKAELELVKVKANEPRDEQ                                 

     SP063  nucleotide  (SEQ  ID  NO:103)                                      

     ATGGACAACAGGAAACTGGGACGAGGTTATATCTGGTAAGATTGACAAGTACAAAGATCCAGATATTCC     

     AACAGTTGAATCACAAGAAGTTACGTCAGACTCTAGTGATAAAGAAATAACGGTAAGGTATGACCGTTT     

     ATCAACACCAGAAAAACCAATCCCACAACCAAATCCAGAGCATCCAAGTGTTCCGACACCAAACCCAGA     

     ACTACCAAATCAAGAGACTCCAACACCAGATAAACCAACTCCAGAACCAGGTACTCCAAAAACTGAAAC     

     TCCAGTGAATCCAGACCCAGAAGTTCCGACTTATGAGACAGGTAAGAGAGAGGAATTGCCAAACACAGG     

     TACAGAAGCTAAT                                                             

     SP063  amino  acid  (SEQ  ID  NO:104)                                     

     WTTGNWDEVISGKIDKYKDPDIPTVESQEVTSDSSDKEITVRYDRLSTPEKPIPQPNPEHPSVPTPNPE     

     LPNQETPTPDKPTPEPGTPKTETPVNPDPEVPTYETGKREELPNTGTEAN                        

     SP064  nucleotide  (SEQ  ID  NO:105)                                      

     CGATGGGCTCAATCCAACCCCAGGTCAAGTCTTACCTGAAGAGACATCGGGAACGAAAGAGGGTGACTT     

     ATCAGAAAAACCAGGAGACACCGTTCTCACTCAAGCGAAACCTGAGGGCGTTACTGGAAATACGAATTC     

     ACTTCCGACACCTACAGAAAGAACTGAAGTGAGCGAGGAAACAAGCCCTTCTAGTCTGGATACACTTTT     

     TGAAAAAGATGAAGAAGCTCAAAAAAATCCAGAGCTAACAGATGTCTTAAAAGAAACTGTAGATACAGC     

     TGATGTGGATGGGACACAAGCAAGTCCAGCAGAAACTACTCCTGAACAAGTAAAAGGTGGAGTGAAAGA     

     AAATACAAAAGACAGCATCGATGTTCCTGCTGCTTATCTTGAAAAAGCTGAAGGGAAAGGTCCTTTCAC     

     TGCCGGTGTAAACCAAGTAATTCCTTATGAACTATTCGCTGGTGATGGTATGTTAACTCGTCTATTACT     

     AAAAGCTTCGGATAATGCTCCTTGGTCTGACAATGGTACTGCTAAAAATCCTGCTTTACCTCCTCTTGA     

     AGGATTAACAAAAGGGAAATACTTCTATGAAGTAGACTTAAATGGCAATACTGTTGGTAAACAAGGTCA     

     AGCTTTAATTGATCAACTTCGCGCTAATGGTACTCAAACTTATAAAGCTACTGTTAAAGTTTACGGAAA     

     TAAAGACGGTAAAGCTGACTTGACTAATCTAGTTGCTACTAAAAATGTAGACATCAACATCAATGGATT     

     AGTTGCTAAAGAAACAGTTCAAAAAGCCGTTGCAGACAACGTTAAAGACAGTATCGATGTTCCAGCAGC     

     CTACCTAGAAAAAGCCAAGGGTGAAGGTCCATTCACAGCAGGTGTCAACCATGTGATTCCATACGAACT     

     CTTCGCAGGTGATGGCATGTTGACTCGTCTCTTGCTCAAGGCATCTGACAAGGCACCATGGTCAGATAA     

     CGGCGACGCTAAAAACCCAGCCCTATCTCCACTAGGCGAAAACGTGAAGACCAAAGGTCAATACTTCTA     

     TCAANTAGCCTTGGACGGAAATGTAGCTGGCAAAGAAAAACAAGCGCTCATTGACCAGTTCCGAGCAAA     

     NGGTACTCAAACTTACAGCGCTACAGTCAATGTCTATGGTAACAAAGACGGTAAACCAGACTTGGACAA     

     CATCGTAGCAACTAAAAAAGTCACTATTAACATAAACGGTTTAATTTCTAAAGAAACAGTTCAAAAAGC     

     CGTTGCAGACAACGTTAANGACAGTATCGATGTTCCAGCAGCCTACCTAGAAAAAGCCAAGGGTGAAGG     

     TCCATTCACAGCAGGTGTCAACCATGTGATTCCATACGAACTCTTCGCAGGTGATGGTATGTTGACTCG     

     TCTCTTGCTCAAGGCATCTGACAAGGCACCATGGTCAGATAACGGNGACGCTAAAAACCCAGCNCTATC     

     TCCACTAGGTGAAAACGTGAAGACCAAAGGTCAATACTTCTATCAANTAGCCTTGGACGGAAATGTAGC     

     TGGCAAAGAAAAACAAGCGCTCATTGACCAGTTCCGAGCAAACGGTACTCAAACTTACAGCGCTACAGT     

     CAATGTCTATGGTAACAAAGACGGTAAACCAGACTTGGACAACATCGTAGCAACTAAAAAAGTCACTAT     

     TAAGATAAATGTTAAAGAAACATCAGACACAGCAAATGGTTCATTATCACCTTCTAACTCTGGTTCTGG     

     CGTGACTCCGATGAATCACAATCATGCTACAGGTACTACAGATAGCATGCCTGCTGACACCATGACAAG     

     TTCTACCAACACGATGGCAGGTGAAAACATGGCTGCTTCTGCTAACAAGATGTCTGATACGATGATGTC     

     AGAGGATAAAGCTATG                                                          

     SP064  amino  acid  (SEQIDNO:106)                                         

     DGLNPTPGQVLPEETSGTKEGDLSEKPGDTVLTQAKPEGVTGNTNSLPTPTERTEVSEETSPSSLDTLF     

     EKDEEAQKNPELTDVLKETVDTADVDGTQASPAETTPEQVKGGVKENTKDSIDVPAAYLEKAEGKGPFT     

     AGVNQVIPYELFAGDGMLTRLLLKASDNAPWSDNGTAKNPALPPLEGLTKGKYFYEVDLNGNTVGKQGQ     

     ALIDQLRANGTQTYKATVKVYGNKDGKADLTNLVATKNVDININGLVAKETVQKAVADNVKDSIDVPAA     

     YLEKAKGEGPFTAGVNHVIPYELFAGDGMLTRLLLKASDKAPWSDNGDAKNPALSPLGENVKTKGQYFY     

     QXALDGNVAGKEKQALIDQFRAXGTQTYSATVNVYGNKDGKPDLDNIVATKKVTININGLISKETVQKA     

     VADNVXDSIDVPAAYLEKAKGEGPFTAGVNHVIPYELFAGDGMLTRLLLKASDKAPWSDNGDAKNPALS     

     PLGENVKTKGQYFYQXALDGNVAGKEKQALIDQFRANGTQTYSATVVVYGNKDGKPDLDNIVATKKVTI     

     KINVKETSDTANGSLSPSNSGSGVTPMNHNHATGTTDSMPADTMTSSTNTMAGENAAASANKMSDTMMS     

     EDKAM                                                                     

     SP065  nucleotide  (SEQ  ID  NO:107)                                      

     TTCCAATCAAAAACAGGCAGATGGTAAACTCAATATCGTGACAACCTTTTACCCTGTCTATGArTTTAC     

     CAAGCAAGTCGCAGGAGATACGGCTAATGTAGAACTCCTAATCGGTGCTGGGACAGAACCTCATGAATA     

     CGAACCATCTGCCAAGGCAGTTGCCAAAATCCAAGATGCAGATACCTTCGTTTATGAAAATGAAAACAT     

     GGAAACATGGGTACCTAAATTGCTAGATACCTTGGATAAGAAAAAAGTGAAAACCATCAAGGCGACAGG     

     CGATATGTTGCTCTTGCCAGGTGGCGAGGAAGAAGAGGGAGACCATGACCATGGAGAAGAAGGTCATCA     

     CCATGAGTTTGACCCCCATGTTTGGTTATCACCAGTTCGTGCCATtAAACTAGTAGAGCACCATCCGCG     

     ACACTTGTCAGCAGATTATCCTGATAAAAAAGAGACCTTTGAGAAGAATGCAGCTGCCTATATCGAAAA     

     ATTGCAAGCCTTGGATAAGGCTTACGCAGAAGGTTTGTCTCAAGCAAAACAAAAGAGCTTTGTGACTCA     

     ACACGCAgCCTTTAACTaTCTTGCCTTGGACTATGGGACTC                                 

     SP065  amino  acid  (SEQ  ID  NO:106)                                     

     SNQKQADGKLNIVTTFYPVYEFTKQVAGDTANVELLIGAGTEPHEYEPSAKAVAKIQDADTFVYENENM     

     ETWVPKLLDTLDKKKVKTIKATGDMLLLPGGEEEEGDHDHGEEGHHHEFDPHVWLSPVRAIKLVEHHPR     

     HLSADYPDKKETFEKNAAAYIEKLQALDKAYAEGLSQAKQKSFVTQHAAFNYLALDYGT               

     SP067  nucleotide  (SEQ  ID  NO:109)                                      

     TATCACAGGATCGAACGGTAAGACAACCACAACGACTATGATTGGGGAAGTTTTGACTGCTGCTGGCCA     

     ACATGGTCTTTTATCAGGGAATATCGGCTATCCAGCTAGTCAGGTTGCTCAAATAGCATCAGATAAGGA     

     CACGCTTGTTATGGAACTTTCTTCTTTCCAACTCATGGGTGTTCAAGAATTCCATCCAGAGATTGCGGT     

     TATTACCAACCTCATGCCAACTCATATCGACTACCATGGGTCATTTTCGGAATATGTAGCAGCCAAGTG     

     GAATATCCAGAACAAGATGACAGCAGCTGATTTCCTTGTCTTGAACTTTAATCAAGACTTGGCAAAAGA     

     CTTGACTTCCAAGACAGAAGCCACTGTTGTACCATTTTCAACACTTGAAAAGGTTGATGGAGCTTATCT     

     GGAAGATGGTCAACTCTACTTCCGTGGTGAAGTAGTCATGGCAGCGAATGAAATCGGTGTTCCAGGTAG     

     CCACAATGTGGAAAATGCCCTTGCGACTATTGCTGTAGCCAAGCTTCGTGATGTGGACAATCAAACCAT     

     CAAGGAAACTCTTTCAGCCTTCGGTGGTGTCAAACACCGTCTCCAGTTTGTGGATGACATCAAGGGTGT     

     TAAATTCTATAACGACAGTAAATCAACTAATATCTTGGCTACTCAAAAAGCCTTGTCAGGATTTGACAA     

     CAGCAAGGTCGTCTTGATTGCAGGTGGTTTGGACCGTGGCAATGAGTTTGACGAATTGGTGCCAGACAT     

     TACTGGACTCAAGAAGATGGTCATCCTGGGTCAATCTGCAGAACGTGTCAAACGGGCAGCAGACAAGGC     

     TGGTGTCGCTTATGTGGAGGCGACAGATATTGCAGATGCGACCCGCAAGGCCTATGAGCTTGCGACTCA     

     AGGAGATGTGGTTCTTCTTAGTCCTGCCAATGCTAGCTGGGATATGTATGCTAACTTTGAAGTACGTGG     

     CGACCTCTTTATCGACACAGTAGCGGAGTTAAAAGAA                                     

     SP067  amino  acid  (SEQ  ID  NO:110)                                     

     GITGSNGKTTTTTMIGEVLTAAGQHGLLSGNIGYPASQVAQIASDKDTLVMELSSFQLMGVQEFHPEIA     

     VITNLMPTHIDYHGSFSEYVAAKWNIQNKMTAADFLVLNFNQDLAKDLTSKTEATVVPFSTLEKVDGAY     

     LEDGQLYFRGEVVMAANEIGVPGSHNVENALATIAVAKLRDVDNQTIKETLSAFGGVKHRLQFVDDIKG     

     VKFYNDSKSTNILATQKALSGFDNSKVVLIAGGLDRGNEFDELVPDITGLKKMVILGQSAERVKRAADK     

     AGVAYVEATDIADATRKAYELATQGDVVLLSPANASWDMYANFEVRGDLFIDTVAELKE               

     SP068  nucleotide  (SEQ  ID  NO:111)                                      

     AAGTTCATCGAAGATGGTTGGGAAGTCCACTATATCGGGGACAAGTGTGGTATCGAACACCAAGAAATC     

     CTTAAGTCAGGTTTGGATGTCACCTTCCATTCTATTGCGACTGGAAAATTGCGTCGCTATTTCTCTTGG     

     CAAAATATGCTGGACGTCTTCAAAGTTGGTTGGGGAATTGTCCAATCGCTCTTTATCATGTTGCGACTG     

     CGTCCACAGACCCTTTTTTCAAAGGGGGGCTTTGTCTCAGTACCGCCTGTTATCGCTGCGCGTGTGTCA     

     GGAGTGCCTGTCTTTATTCACGAATCTGACCTGTCTATGGGCTTGGCCAATAAAATCGCCTATAAATTT     

     GCGACTAAGATGTATTCAACCTTTGAACAAGCTTCGAGTTTGGCTAAGGTTGAGCATGTGGGAGCGG       

     SP068  amino  acid  (SEQ  ID  NO:112)                                     

     SSSKMVGKSTISGTSVVSNTKKSLSQVWMSPSILLRLENCVAISLGKICWTSSKLVGELSNRSLSCCDC     

     VHRPFFQRGALSQYRLLSLRVCQECLSLFTNLTCLWAWPIKSPINLRLRCIQPLNKLRVWLRLSMWER      

     SP069  nucleotide  (SEQ  ID  NO:113)                                      

     ATCGCTAGCTAGTGAAATGCAAGAAAGTACACGTAAATTCAAGGTTACTGCTGACCTAACAGATGCCGG     

     TGTTGGAACGATTGAAGTTCCTTTGAGCATTGAAGATTTACCCAATGGGCTGACCGCTGTGGCGACTCC     

     GCAAAAAATTACAGTCAAGATTGGTAAGAAGGCTCAGAAGGATAAGGTAAAGATTGTACCAGAGATTGA     

     CCCTAGTCAAATTGATAGTCGGGTACAAATTGAAAATGTCATGGTGTCAGATAAAGAAGTGTCTATTAC     

     GAGTGACCAAGAGACATTGGATAGAATTGATAAGATTATCGCTGTTTTGCCAACTAGCGAACGTATAAC     

     AGGTAATTACAGTGGTTCAGTACCTTTGCAGGCAATCGACCGCAATGGTGTTGTCTTACCGGCAGTTAT     

     CACTCCGTTTGATACAATAATGAAGGTGACTACAAAACCAGTAGCACCAAGTTCAAGCACATCAAATTC     

     AAGTACAAGCAGTTCATCGGAGACATCTTCGTCAACGAAAGCAACTAGTTCAAAAACGAAT             

     SP069  amino  acid  (SEQ  ID  NO:114)                                     

     SLASEMQESTRKFKVTADLTDAGVGTIEVPLSIEDLPNGLTAVATPQKITVKIGKKAQKDKVKIVPEID     

     PSQIDSRVQIENVNVSDKEVSITSDQETLDRIDKIIAVLPTSERITGNYSGSVPLQAIDRNGVVLPAVI     

     TPFDTIMKVTTKPVAPSSSTSNSSTSSSSETSSSTKATSSKTN                               

     SP070  nucleotide  (SEQ  ID  NO:115)                                      

     GCACCAGATGGGGCACAAGGTTCAGGGATCAGATGTTGAAAAGTACTACTTTACCCAACGCGGTCTTGA     

     GCAGGCAGGAATTACCATTCTTCCTTTTGATGAAAAAAATCTAGACGGTGATATGGAAATTATCGCTGG     

     AAATGCCTTTCGTCCAGATAACAACGTCGAAATTGCCTATGCGGACCAAAATGGTATCAGCTACAAACG     

     TTACCATGAGTTTCTAGGTAGCTTTATGCGTGACTTTGTTAGCATGGGAGTAGCAGGAGCACATGGAAA     

     AACTTCAACGACAGGTATGTTGTCTCATGTCTTGTCTCACATTACAGATACCAGCTTCTTGATTGGAGA     

     TGGGACAGGTCGTGGTTCGGCCAATGCCAAATATTTTGTCTTTGAATCTGACGAATATGAGCGTCACTT     

     CATGCCTTACCACCCAGAATACTCTATTATCACCAACATTGACTTTGACCATCCAGATTATTTCACAAG     

     TCTCGAGGATGTTTTTAATGCCTTTAACGACTATGCCAAACAAATCACCAAGGGTCTTTTTGTCTATGG     

     TGAAGATGCTGAATTGCGTAAGATTACGTCTGATGCACCAATTTATTATTATGGTTTTGAAGCTGAAGG     

     CAATGACTTTGTAGCTAGTGATCTTCTTCGTTCAATAACTGGTTCAACCTTCACCGTTCATTTCCGTGG     

     ACAAAACTTGGGGCAATTCCACATTCCAACCTTTGGTCGTCACAATATCATGAATGCGACAGCCGTTAT     

     TGGTCTTCTTTACACAGCAGGATTTGATTTGAACTTGGTGCGTGAGCACTTGAAAACATTTGCCGGTGT     

     TAAACGTCGTTTCACTGAGAAAATTGTCAATGATACAGTGATTATCGATGACTTTGCCCACCATCCAAC     

     AGAAATTATTGCGACCTTGGATGCGGCTCGTCAGAAATACCCAAGCAAGGAAATTGTAGCAGTCTTTCA     

     ACCGCATACCTTTACAAGAACCATTGCCTTGTTGGACGACTTTGCCCATGCTTTAAACCAAGCAGATGC     

     TGTTTATCTAGCGCAAATTTATGGCTCGGCTCGTGAAGTAGATCATGGTGACGTTAAGGTAGAAGACCT     

     AGCCAACAAAATCAACAAAAAACACCAAGTGATTACTGTTGAAAATGTTTCTCCACTCCTAGACCATGA     

     CAATGCTGTTTACGTCTTTATGGGAGCAGGAGACATCCAAACCTATGAATACTCATTTGAGCGTCTCTT     

     GTCTAACTTGACAAGCAATGTTCAA                                                 

     SP070  amino  acid  (SEQ  ID  NO:116)                                     

     HQMGHKVQGSDVEKYYFTQRGLEQAGITILPFDEKNLDGDMEIIAGNAFRPDNNVEIAYADQNGISYKR     

     YHEFLGSFMRDFVSMGVAGAHGKTSTTGMLSHVLSHITDTSFLIGDGTGRGSANAKYFVFESDEYERHF     

     MPYHPEYSIITNIDFDHPDYFTSLEDVFNAFNDYAKQITKGLFVYGEDAELRKITSDAPIYYYGFEAEG     

     NDFVASDLLRSITGSTFTVHFRGQNLGQFHIPTFGRHNIMNATAVIGLLYTAGFDLNLVREHLKTFAGV     

     KRRFTEKIVNDTVIIDDFAHHPTEIIATLDAARQKYPSKEIVAVFQPHTFTRTIALLDDFAHALNQADA     

     VYLAQIYGSAREVDHGDVKVEDLANKINKKHQVITVENVSPLLDHDNAVYVFMGAGDIQTYEYSFERLL     

     SNLTSNVQ                                                                  

     SP071  nucleotide  (SEQ  ID  NO:117)                                      

     TTTTAACCCAACTGTTGGTACTTTCCTTTTTACTGCAGGATTGAGCTTGTTAGTTTTATTGGTTTCTAA     

     AAGGGAAAATGGAAAGAAACGACTTGTTCATTTTCTGCTGTTGACTAGCATGGGAGTTCAATTGTTGCC     

     GGCCAGTGCTTTTGGGTTGACCAGCCAGATTTTATCTGCCTATAATAGTCAGCTTTCTATCGGAGTCGG     

     GGAACATTTACCAGAGCCTCTGAAAATCGAAGGTTATCAATATATTGGTTATATCAAAACTAAGAAACA     

     GGATAATACAGAGCTTTCAAGGACAGTTGATGGGAAATACTCTGCTCAAAGAGATAGTCAACCAAACTC     

     TACAAAAACATCAGATGTAGTTCATTCAGCTGATTTAGAATGGAACCAAGGACAGGGGAAGGTTAGTTT     

     ACAAGGTGAAGCATCAGGGGATGATGGACTTTCAGAAAAATCTTCTATAGCAGCAGACAATCTATCTTC     

     TAATGATTCATTCGCAAGTCAAGTTGAGCAGAATCCGGATCACAAAGGAGAATCTGTAGTTCGACCAAC     

     AGTQCCAGAACAAGGAAATCCTGTGTCTGCTACAACGGTGCAGAGTGCGGAAGAGGAAGTATTGGCGAC     

     GACAAATGATCGACCAGAGTATAAACTTCCATTGGAAACCAAAGGCACGCAAGAACCCGGTCATGAGGG     

     TGAAGCCGCAGTCCGTGAAGACTTACCAGTCTACACTAAGCCACTAGAAACCAAAGGTACACAAGGACC     

     CGGACATGAAGGTGAAGCTGCAGTTCGCGAGGAAGAACCAGCTTACACAGAACCGTTAGCAACGAAAGG     

     CACGCAAGAGCCAGGTCATGAGGGCAAAGCTACAGTCCGCGAAGAGACTCTAGAGTACACGGAACCGGT     

     AGCGACAAAAGGCACACAAGAACCCGAACATGAGGGCGAaCGGSCAGTAGAAGAAGAACTTCCGGCTTT     

     AGAGGTCACTACACGAAATAGAACGGAAATCCAGAATATTCCTTATACAACAGAAGAAATTCAGGATCC     

     AACACTTCTGAAAAATCGTCGTAAGATTGAACGACAAGGGCAAGCAGGGACACGTACAATTCAATATGA     

     AGACTACATCGTAAATGGTAATGTCGTAGAAACTAAAGAAGTGTCACGAACTGAAGTAGCTCCGGTCAA     

     CGAAGTCGTTAAAGTAGGAACACTTGTGAAAGTTAAACCTACAGTAGAAATTACAAACTTAACAAAAGT     

     TGAGAACAAAAAATCTATAACTGTAAGTTATAACTTAATAGACACTACCTCAGCATATGTTTCTGCAAA     

     AACGCAAGTTTTCCATGGAGACAAGCTAGTTAAAGAGGTGGATATAGAAAATCCTGCCAAAGAGCAAGT     

     AATATCAGGTTTAGATTACTACACACCGTATACAGTTAAAACACACCTAACTTATAATTTGGGTGAAAA     

     TAATGAGGAAAATACTGAAACATCAACTCAAGATTTCCAATTAGAGTATAAGAAAATAGAGATTAAAGA     

     TATTGATTCAGTAGAATTATACGGTAAAGAAAATGATCGTTATCGTAGATATTTAAGTCTAAGTGAAGC     

     GCCGACTGATACGGCTAAATACTTTGTAAAAGTGAAATCAGATCGCTTCAAAGAAATGTACCTACCTGT     

     AAAATCTATTACAGAAAATACGGATGGAACGTATAAAGTGACGGTAGCCGTTGATCAACTTGTCGAAGA     

     AGGTACAGACGGTTACAAAGATGATTACACATTTACTGTAGCTAAATCTAAAGCAGAGCAACCAGGAGT     

     TTACACATCCTTTAAACAGCTGGTAACAGCCATGCAAAGCAATCTGTCTGGTGTCTATACATTGGCTTC     

     AGATATGACCGCAGATGAGGTGAGCTTAGGCGATAAGCAGACAAGTTATCTCACAGGTGCATTTACAGG     

     GAGCTTGATCGGTTCTGATCGAACAAAATCGTATGCCATTTATGATTTGAAGAAACCATTATTTGATAC     

     ATTAAATGGTGCTACAGTTAGAGATTTGGATATTAAAACTGTTTCTGCTGATAGTAAAGAAAATGTCGC     

     AGCGCTGGCGAAGGCAGCGAATAGCGCGAATATTAATAATGTTGCAGTAGAAGGAAAAATCTCAGGTGC     

     GAAATCTGTTGCGGGATTAGTAGCGAGCGCAACAAATACAGTGATAGAAAACAGCTCGTTTACAGGGAA     

     ACTTATCGCAAATCACCAGGACAGTAATAAAAATGATACTGGAGGAATAGTAGGTAATATAACAGGAAA     

     TAGTTCGAGAGTTAATAAAGTTAGGGTAGATGCCTTAATCTCTACTAATGCACGCAATAATAACCAAAC     

     AGCTGGAGGGATAGTAGGTAGATTAGAAAATGGTGCATTGATATCTAATTCGGTTGCTACTGGAGAAAT     

     ACGAAATGGTCAAGGATATTCTAGAGTCGGAGGAATAGTAGGATCTACGTGGCAAAACGGTCGAGTAAA     

     TAATGTTGTGAGTAACGTAGATGTTGGAGATGGTTATGTTATCACCGGTGATCAATACGCAGCAGCAGA     

     TGTGAAAAATGCAAGTACATCAGTTGATAATAGAAAAGCAGACAGATTCGCTACAAAATTATCAAAAGA     

     CCAAATAGACGCGAAAGTTGCTGATTATGGAATCACAGTAACTCTTGATGATACTGGGCAAGATTTAAA     

     ACGTAATCTAAGAGAAGTTGATTATACAAGACTAAATAAAGCAGAAGCTGAAAGAAAAGTAGCTTATAG     

     CAACATAGAAAAACTGATGCCATTCTACAATAAAGACCTAGTAGTTCACTATGGTAACAAAGTAGCGAC     

     AACAGATAAACTTTACACTACAGAATTGTTAGATGTTGTGCCGATGAAAGATGATGAAGTAGTAACGGA     

     TATTAATAATAAGAAAAATTCAATAAATAAAGTTATGTTACATTTCAAAGATAATACAGTAGAATACCT     

     AGATGTAACATTCAAAGAAAACTTCATAAACAGTCAAGTAATCGAATACAATGTTACAGGAAAAGAATA     

     TATATTCACACCAGAAGCATTTGTTTCAGACTATACAGCGATAACGAATAACGTACTAAGCGACTTGCA     

     AAATGTAACACTTAAC                                                          

     SP071  amino  acid  (SEQ  ID  NO:118)                                     

     FNPTVGTFLFTAGLSLLVLLVSKRENGKKRLVHFLLLTSMGVQLLPASAFGLTSQILSAYNSQLSIGVG     

     EHLPEPLKIEGYQYIGYIKTKKQDNTELSRTVDGKYSAQRDSQPNSTKTSDVVHSADLEWNQGQGKVSL     

     QGEASGDDGLSEKSSIAADNLSSNDSFASQVEQNPDHKGESVVRPTVPEQGNPVSATTVQSAEEEVLAT     

     TNDRPEYKLPLETKGTQEPGHEGEAAVREDLPVYTKPLETKGTQGPGHEGEAAVREEEPAYTEPLATKG     

     TQEPGHEGKATVREETLEYTEPVATKGTQEPEHEGERXVEEELPALEVTTRNRTEIQNIPYTTEEIQDP     

     TLLKNRRKIERQGQAGTRTIQYEDYIVNGNVVETKEVSRTEVAPVNEVVKVGTLVKVKPTVEITNLTKV     

     ENKKSITVSYNLIDTTSAYVSAKTQVFHGDKLVKEVDIENPAKEQVISGLDYYTPYTVKTHLTYNLGEN     

     NEENTETSTQDFQLEYKKIEIKDIDSVELYGKENDRYRRYLSLSEAPTDTAKYFVKVKSDRFKEMYLPV     

     KSITENTDGTYKVTVAVDQLVEEGTDGYKDDYTFTVAKSKAEQPGVYTSFKQLVTAMQSNLSGVYTLAS     

     DMTADEVSLGDKQTSYLTGAFTGSLIGSDGTKSYAIYDLKKPLFDTLNGATVRDLDIKTVSADSKENVA     

     ALAKAANSANINNVAVEGKISGAKSVAGLVASATNTVIENSSFTGKLIANHQDSNKNDTGGIVGNITGN     

     SSRVNKVRVDALISTNARNNNQTAGGIVGRLENGALISNSVATGEIRNGQGYSRVGGIVGSTWQNGRVN     

     NVVSNVDVGDGYVITGDQYAAADVKNASTSVDNRKADRFATKLSKDQIDAKVADYGITVTLDDTGQDLK     

     RNLREVDYTRLNKAEAERKVAYSNIEKLMPFYNKDLVVHYGNKVATTDKLYTTELLDVVPMKDDEVVTD     

     INNKKNSINKVMLHFKDNTVEYLDVTFKENFINSQVIEYNVTGKEYIFTPEAFVSDYTAITNNVLSDLQ     

     NVTLN                                                                     

     SP072  nucleotide  (SEQ  ID  NO:119)                                      

     TTTTAACCCAACTGTTGGTACTTTCCTTTTTACTGCAGGATTGAGCTTGTTAGTTTTATTGGTTTCTAA     

     AAGGGAAAATGGAAAGAAACGACTTGTTCATTTTCTGCTGTTGACTAGCATGGGAGTTCAATTGTTGCC     

     GGCCAGTGCTTTTGGGTTGACCAGCCAGATTTTATCTGCCTATAATAGTCAGCTTTCTATCGGAGTCGG     

     GGAACATTTACCAGAGCCTCTGAAAATCGAAGGTTATCAATATATTGGTTATATCAAAACTAAGAAACA     

     GGATAATACAGAGCTTTCAAGGACAGTTGATGGGAAATACTCTGCTCAAAGAGATAGTCAACCAAACTC     

     TACAAAAACATCAGATGTAGTTCATTCAGCTGATTTAGAATGGAACCAAGGACAGGGGAAGGTTAGTTT     

     ACAAGGTGAAGCATCAGGGGATGATGGACTTTCAGAAAAATCTTCTATAGCAGCAGACAATCTATCTTC     

     TAATGATTCATTCGCAAGTCAAGTTGAGCAGAATCCGGATCACAAAGGAGAATCTGTAGTTCGACCAAC     

     AGTGCCAGAACAAGGAAATCCTGTGTCTGCTACAACGGTGCAGAGTGCGGAAGAGGAAGTATTGGCGAC     

     GACAAATGATCGACCAGAGTATAAACTTCCATTGGAAACCAAAGGCACGCAAGAACCCGGTCATGAGGG     

     TGAAGCCGCAGTCCGTGAAGACTTACCAGTCTACACTAAGCCACTAGAAACCAAAGGTACACAAGGACC     

     CGGACATGAAGGTGAAGCTGCAGTTCGCGAGGAAGAACCAGCTTACACAGAACCGTTAGCAACGAAAGG     

     CACGCAAGAGCCAGGTCATGAGGGCAAAGCTACAGTCCGCGAAGAGACTCTAGAGTACACGGAACCGGT     

     AGCGACAAAAGGCACACAAGAACCCGAACATGAGGGCGAaCGGsCAGTAGAAGAAGAACTTCCGGCTTT     

     AGAGGTCACTACACGAAATAGAACGGAAATCCAGAATATTCCTTATACAACAGAAGAAATTCAGGATCC     

     AACACTTCTGAAAAATCGTCGTAAGATTGAACGACAAGGGCAAGCAGGGACACGTACAATTCAATATGA     

     AGACTACATCGTAAATGGTAATGTCGTAGAAACTAAAGAAGTGTCACGAACTGAAGTAGCTCCGGTCAA     

     CGAAGTCGTTAAAGTAGGAACACTTGTGAAAGTTAAACCTACAGTAGAAATTACAAACTTAACAAAAGT     

     TGAGAACAAAAAATCTATAACTGTAAGTTATAACTTAATAGACACTACCTCAGCATATGTTTCTGCAAA     

     AACGCAAGTTTTCCATGGAGACAAGCTAGTTAAAGAGGTGGATATAGAAAATCCTGCCAAAGAGCAAGT     

     AATATCAGGTTTAGATTACTACACACCGTATACAGTTAAAACACACCTAACTTATAATTTGGGTGAAAA     

     TAATGAGGAAAATACTGAAACATCAACTCAAGATTTCCAATTAGAGTATAAGAAAATAGAGATTAAAGA     

     TATTGATTCAGTAGAATTATACGGTAAAGAAAATGATCGTTATCGTAGA                         

     SP072  amino  acid  (SEQ  ID  NO:120)                                     

     FNPTVGTFLFTAGLSLLVLLVSKRENGKKRLVHFLLLTSMGVQLLPASAFGLTSQILSAYNSQLSIGVG     

     EHLPEPLKIEGYQYIGYIKTKKQDNTELSRTVDGKYSAQRDSQPNSTKTSDVVHSADLEWNQGQGKVSL     

     QGEASGDDGLSEKSSIAADNLSSNDSFASQVEQNPDHKGESVVRPTVPEQGNPVSATTVQSAEEEVLAT     

     TNDRPEYKLPLETKGTQEPGHEGEAAVREDLPVYTKPLETKGTQGPGHEGEAAVREEEPAYTEPLATKG     

     TQEPGHEGKATVREETLEYTEPVATKGTQEPEHEGERXVEEELPALEVTTRNRTEIQNIPYTTEEIQDP     

     TLLKNRRKIERQGQAGTRTIQYEDYIVNGNVVETKEVSRTEVAPVNEVVKVGTLVKVKPTVEITNLTKV     

     ENKKSITVSYNLIDTTSAYVSAKTQVFHGDKLVKEVDIENPAKEQVTSGLDYYTPYTVKTHLTYNLGEN     

     NEENTETSTQDFQLEYKKIEIKDIDSVELYGKENDRYRR                                   

     SP073  nucleotide  (SEQ  ID  NO:121)                                      

     TCGTAGATATTTAAGTCTAAGTGAAGCGCCGACTGATACGGCTAAATACTTTGTAAAAGTGAAATCAGA     

     TCGCTTCAAAGAAATGTACCTACCTGTAAAATCTATTACAGAAAATACGGATGGAACGTATAAAGTGAC     

     GGTAGCCGTTGATCAACTTGTCGAAGAAGGTACAGACGGTTACAAAGATGATTACACATTTACTGTAGC     

     TAAATCTAAAGCAGAGCAACCAGGAGTTTACACATCCTTTAAACAGCTGGTAACAGCCATGCAAAGCAA     

     TCTGTCTGGTGTCTATACATTGGCTTCAGATATGACCGCAGATGAGGTGAGCTTAGGCGATAAGCAGAC     

     AAGTTATCTCACAGGTGCATTTACAGGGAGCTTGATCGGTTCTGATGGAACAAAATCGTATGCCATTTA     

     TGATTTGAAGAAACCATTATTTGATACATTAAATGGTGCTACAGTTAGAGATTTGGATATTAAAACTGT     

     TTCTGCTGATAGTAAAGAAAATGTCGCAGCGCTGGCGAAGGCAGCGAATAGCGCGAATATTAATAATGT     

     TGCAGTAGAAGGAAAAATCTCAGGTGCGAAATCTGTTGCGGGATTAGTAGCGAGCGCAACAAATACAGT     

     GATAGAAAACAGCTCGTTTACAGGGAAACTTATCGCAAATCACCAGGACAGTAATAAAAATGATACTGG     

     AGGAATAGTAGGTAATATAACAGGAAATAGTTCGAGAGTTAATAAAGTTAGGGTAGATGCCTTAATCTC     

     TACTAATGCACGCAATAATAACCAAACAGCTGGAGGGATAGTAGGTAGATTAGAAAATGGTGCATTGAT     

     ATCTAATTCGGTTGCTACTGGAGAAATACGAAATGGTCAAGGATATTCTAGAGTCGGAGGAATAGTAGG     

     ATCTACGTGGCAAAACGGTCGAGTAAATAATGTTGTGAGTAACGTAGATGTTGGAGATGGTTATGTTAT     

     CACCGGTGATCAATACGCAGCAGCAGATGTGAAAAATGCAAGTACATCAGTTGATAATAGAAAAGCAGA     

     CAGATTCGCTACAAAATTATCAAAAGACCAAATAGACGCGAAAGTTGCTGATTATGGAATCACAGTAAC     

     TCTTGATGATACTGGGCAAGATTTAAAACGTAATCTAAGAGAAGTTGATTATACAAGACTAAATAAAGC     

     AGAAGCTGAAAGAAAAGTAGCTTATAGCAACATAGAAAAACTGATGCCATTCTACAATAAAGACCTAGT     

     AGTTCACTATGGTAACAAAGTAGCGACAACAGATAAACTTTACACTACAGAATTGTTAGATGTTGTGCC     

     GATGAAAGATGATGAAGTAGTAACGGATATTAATAATAAGAAAAATTCAATAAATAAAGTTATGTTACA     

     TTTCAAAGATAATACAGTAGAATACCTAGATGTAACATTCAAAGAAAACTTCATAAACAGTCAAGTAAT     

     CGAATACAATGTTACAGGAAAAGAATATATATTCACACCAGAAGCATTTGTTTCAGACTATACAGCGAT     

     AACGAATAACGTACTAAGCGACTTGCAAAATGTAACACTTAAC                               

     SP073  amino  acid  (SEQ  ID  NO:122)                                     

     RRYLSLSEAPTDTAKYFVKVKSDRFKEMYLPVKSITENTDGTYKVTVAVDQLVEEGTDGYKDDYTFTVA     

     KSKAEQPGVYTSFKQLVTAMQSNLSGVYTLASDMTADEVSLGDKQTSYLTGAFTGSLIGSDGTKSYAIY     

     DLKKPLFDTLNGATVRDLDIKTVSADSKENVAALAKAANSANINNVAVEGKISGAKSVAGLVASATNTV     

     IENSSFTGKLIANHQDSNKNDTGGIVGNITGNSSRVNKVRVDALISTNARNNNQTAGGIVGRLENGALI     

     SNSVATGEIRNGQGYSRVGGIVGSTWQNGRVNVVVSNVDVGDGYVITGDQYAAADVKNASTSVDNRKAD     

     RFATKLSKDQIDAKVADYGITVTLDDTGQDLKRNLREVDYTRLNKAEAERKVAYSNIEKLMPFYNKDLV     

     VHYGNKVATTDKLYTTELLDVVPMKDDEVVTDINNKKNSINKVMLHFKDNTVEYLDVTFKENFINSQVI     

     EYNVTGKEYIFTPEAFVSDYTAITNNVLSDLQNVTLN                                     

     SP074  nucleotide  (SEQ  ID  NO:123)                                      

     CTTTGGTTTTGAAGGAAGTAAGCGTGGACAATTTGCTGTAGAAGGAATCAATCAACTTCGTGAGCATGT     

     AGACACTCTATTGATTATCTCAAACAACAATTTGCTTGAAATTGTTGATAAGAAAACACCGCTTTTGGA     

     GGCTCTTAGCGAAGCGGATAACGTTCTTCGTCAAGGTGTTCAAGGGATTACCGATTTGATTACCAATCC     

     AGGATTGATTAACCTTGACTTTGCCGATGTGAAAACGGTAATGGCAAACAAAGGGAATGCTCTTATGGG     

     TATTGGTATCGGTAGTGGAGAAGAACGTGTGGTAGAAGCGGCACGTAAGGCAATCTATTCACCACTTCT     

     TGAAACAACTATTGACGGTGCTGAGGATGTTATCGTCAACGTTACTGGTGGTCTTGACTTAACCTTGAT     

     TGAGGCAGAAGAGGCTTCACAAATTGTGAACCAGGCAGCAGGTCAAGGAGTGAACATCTGGCTCGGTAC     

     TTCAATTGATGAAAGTATGCGTGATGAAATTCGTGTAACAGTTGTTGCAACGGGTGTTCGTCAAGACCG     

     CGTAGAAAAGGTTGTGGCTCCACAAGCTAGATCTGCTACTAACTACCGTGAGACAGTGAAACCAGCTCA     

     TTCACATGGCTTTGATCGTCATTTTGATATGGCAGAAACAGTTGAATTGCCAAAACAAAATCCACGTCG     

     TTTGGAACCAACTCAGGCATCTGCTTTTGGTGATTGGGATCTTCGCCGTGAATCGATTGTTCGTACAAC     

     AGATTCAGTCGTTTCTCCAGTCGAGCGCTTTGAAGCCCCAATTTCACAAGATGAAGATGAATTGGATAC     

     ACCTCCATTTTTCAAAAATCGT                                                    

     SP074  amino  acid  (SEQ  ID  NO:124)                                     

     FGFEGSKRGQFAVEGINQLREHVDTLLIISNNNLLEIVDKKTPLLEALSEADNVLRQGVQGITDLITNP     

     GLINLDFADVKTVNANKGNALMGIGIGSGEERVVEAARKAIYSPLLETTIDGAEDVIVNVTGGLDLTLI     

     EAEEASQIVNQAAGQGVNIWLGTSIDESMRDEIRVTVVATGVRQDRVEKVVAPQARSATNYRETVKPAH     

     SHGFDRHFDMAETVELPKQNPRRLEPTQASAFGDWDLRRESIVRTTDSVVSPVERFEAPISQDEDELDT     

     PPFFKNR                                                                   

     SP075  nucleotide  (SEQ  ID  NO:125)                                      

     CTACTACCTCTCGAGAGAAAGTGACCTAGAGGTGACCGTTTTTGACCATGAGCAAGGTCAAGCCACCAA     

     GGCCGCAGCAGGAATTATCAGTCCTTGGTTTTCCAAACGCCGTAATAAAGCCTGGTACAAGATGGCGCG     

     CTTGGGGGCTGATTTTTATGTGGATTTATTAGCTGATTTAGAGAAATCAGGACAAGAAATCGACTTTTA     

     CCAGCGTTCGGGAGTCTTTCTCTTGAAAAAGGATGAATCCAATTTGGAAGAACTTTATCAACTGGCCCT     

     CCAGCGCAGAGAAGAATCTCCCTTGATAGGGCAATTAGCCATTCTGAACCAAGCCTCAGCTAATGAATT     

     ATTCCCTGGTTTGCAGGGATTTGACCGCCTGCTCTATGCTTCTGGTGGAGCGAGAGTAGATGGCCAACT     

     TTTAGTGACTCGTTTGCTGGAAGTCAGTCATGTCAAGCTGGTCAAAGAAAAAGTGACTCTGACACCGTT     

     AGCATCAGGCTACCAGATTGGTGAAGAGGAGTTTGAGCAGGTTATTTTGGCGACGGGAGCTTGGTTGGG     

     GGACATGTTAGAGCCTTTAGGTTATGAAGTGGATGTCCGTCCTCAAAAAGGACAACTACGAGATTATCA     

     GCTTGCCCAAGACATGGAAGATTACCCTGTTGTCATGCCAGAAGGGGAGTGGGATTTGATTCCCTTTGC     

     AGGTGGGAAATTATCCTTAGGCGCTACCCACGAAAATGACATGGGATTTGATTTGACGGTAGATGAAAC     

     CTTGCTCCAACAAATGGAGGAGGCCACCTTGACTCACTATCTGATTTTGGCTGAAGCTACTTCAAAATC     

     TGAGCGTGTTGGAATCCGTGCCTACACCAGTGATTTCTCTCCTTTCTTTGGGCAGGTGCCTGACTTAAC     

     TGGTGTCTATGCAGCCAGTGGACTAGGTTCATCAGGCCTCACAACTGGTCCTATCATTGGTTACCATCT     

     AGCCCAACTGATCCAAGACAAGGAGTTGACCTTGGACCCTCTAAATTACCCAATTGAAAACTATGTCAA     

     ACGAGTAAAAAGCGAA                                                          

     SP075  amino  acid  (SEQ  ID  NO:126)                                     

     YYLSRESDLEVTVFDHEQGQATKAAAGIISPWFSKRRNKAWYKMARLGADFYVDLLADLEKSGQEIDFY     

     QRSGVFLLKKDESNLEELYQLALQRREESPLIGQLAILNQASANELFPGLQGFDRLLYASGGARVDGQL     

     LVTRLLEVSHVKLVKEKVTLTPLASGYQIGEEEFEQVILATGAWLGDMLEPLGYEVDVRPQKGQLRDYQ     

     LAQDMEDYPVVMPEGEWDLIPFAGGKLSLGATHENDMGFDLTVDETLLQQMEEATLTHYLILAEATSKS     

     ERVGIRAYTSDFSPFFGQVPDLTGVYAASGLGSSGLTTGPIIGYHLAQLIQDKELTLDPLNYPIENYVK     

     RVKSE                                                                     

     SP076  nucleotide  (SEQ  ID  NO:127)                                      

     TAAGGTCAAAAGTCAGACCGCTAAGAAAGTGCTAGAAAAGATTGGAGCTGACTCGGTTATCTCGCCAGA     

     GTATGAAATGGGGCAGTCTCTAGCACAGACCATTCTTTTCCATAATAGTGTTGATGTCTTTCAGTTGGA     

     TAAAAATGTGTCTATCGTGGAGATGAAAATTCCTCAGTCTTGGGCAGGTCAAAGTCTGAGTAAATTAGA     

     CCTCCGTGGCAAATACAATCTGAATATTTTGGGTTTCCGAGAGCAGGAAAATTCCCCATTGGATGTTGA     

     ATTTGGACCAGATGACCTCTTGAAAGCAGATACCTATATTTTGGCAGTCATCAACAACCAGTATTTGGA     

     TACCCTA                                                                   

     SP076  amino  acid  (SEQ  ID  NO:128)                                     

     KVKSQTAKKVLEKIGADSVISPEYEMGQSLAQTILFHNSVDVFQLDKNVSIVEMKIPQSWAGQSLSKLD     

     LRGKYNLNILGFREQENSPLDVEFGPDDLLKADTYILAVINNQYLDTL                          

     SP077  nucleotide  (SEQ  ID  NO:129)                                      

     TGACGGGTCTCAGGATCAGACTCAGGAAATCGCTGAGTGTTTAGCTAGCAAGTATCCTAATATCGTTAG     

     AGCCATCTATCAGGAAAATAAATGCCATGGCGGTGCGGTCAATCGTGGCTTGGTAGAGGCTTCTGGGCG     

     CTATTTTAAAGTAGTTGACAGTGATGACTGGGTGGATCCTCGTGCCTACTTGAAAATTCTTGAAACTTG     

     CAGGAACTTGAGAGCAAAGGTCAAGAGGTGGATGTCTTTG                                  

     SP077  amino  acid  (SEQ  ID  NO:130)                                     

     DGSQDQTQEIAECLASKYPNIVRAIYQENKCHGGAVNRGLVEASGRYFKVVDSDDWVDPRAYLKILETC     

     RNLRAKVKRWMSL                                                             

     SP078  nucleotide  (SEQ  ID  NO:131)                                      

     TAGAGGCTTTGCCAAATGGTGGGAAGGGCACGAGCGTCGAAAAGAGGAACGCTTTGTCAAACAAGAAGA     

     AAAAGCTCGCCAAAAGGCTGAGAAAGAGGCTAGATTAGAACAAGAAGAGACTGAAAAAGCCTTACTCGA     

     TTTGCCTCCTGTTGATATGGAAACGGGTGAAATTCTGACAGAGGAAGCTGTTCAAAATCTTCCACCTAT     

     TCCAGAAGAAAAGTGGGTGGAACCAGAAATCATCCTGCCTCAAGCTGAACTTAAATTCCCTGAACAGGA     

     AGATGACTCAGATGACGAAGATGTTCAGGTCGATTTTTCAGCCAAAGAAGCCCTTGAATACAAACTTCC     

     AAGCTTACAACTCTTTGCACCAGATAAACCAAAAGATCAGTCTAAAGAGAAGAAAATTGTCAGAGAAAA     

     TATCAAAATCTTAGAAGCAACCTTTGCTAGCTTTGGTATTAAGGTAACAGTTGAACGGGCCGAAATTGG     

     GCCATCAGTGACCAAGTATGAAGTCAAGCCGGCTGTTGGTGTAAGGGTCAACCGCATTTCCAATCTATC     

     AGATGACCTCGCTCTAGCCTTGGCTGCCAAAGATGTCCGGATTGAAGCACCAATCCCTGGGAAATCCCT     

     AATCGGAATTGAAGTGCCCAACTCCGATATTGCCACTGTATCTTTCCGAGAACTATGGGAACAATCGCA     

     AACGAAAGCAGAAAATTTCTTGGAAATTCCTTTAGGGAAGGCTGTTAATGGAACCGCAAGAGCTTTTGA     

     CCTTTCTAAAATGCCCCACTTGCTAGTTGCAGGTTCAACGGGTTCAGGGAAGTCAGTAGCAGTTAACGG     

     CATTATTGCTAGCATTCTCATGAAGGCGAGACCAGATCAAGTTAAATTTATGATGGTCGATCCCAAGAT     

     GGTTGAGTTATCTGTTTACAATGATATTCCCCACCTCTTGATTCCAGTCGTGACCAATCCACGCAAAGC     

     CAGCAAGGCTCTGCAAAAGGTTGTGGATGAAATGGAAAACCGTTATGAACTCTTTGCCAAGGTGGGAGT     

     TCGGAATATTGCAGGTTTTAATGCCAAGGTAGAAGAGTTCAATTCCCAGTCTGAGTACAAGCAAATTCC     

     GCTACCATTCATTGTCGTGATTGTGGATGAGTTGGCTGACCTCATGATGGTGGCCAGCAAGGAAGTGGA     

     AGATGCTATCATCCGTCTTGGGCAGAAGGCGCGTGCTGCAGGTATCCACATGATTCTTGCAACTCAGCG     

     TCCATCTGTTGATGTCATCTCTGGTTTGATTAAGGCCAATGTTCCATCTCGTGTAGCATTTGCGGTTTC     

     ATCAGGAACAGACTCCCGTACGATTTTGGATGAAAATGGAGCAGAAAAACTTCTTGGTCGAGGAGACAT     

     GCTCTTTAAACCGATTGATGAAAATCATCCAGTTCGTCTCCAAGGCTCCTTTATCTCGGATGACGATGT     

     TGAGCGCATTGTGAACTTCATCAAGACTCAGGCAGATGCAGACTACGATGAGAGTTTTGATCCAGGTGA     

     GGTTTCTGAAAATGAAGGAGAATTTTCGGATGGAGATGCTGGTGGTGATCCGCTTTTTGAAGAAGCTAA     

     GTCTTTGGTTATCGAAACACAGAAAGCCAGTGCGTCTATGATTCAGCGTCGTTTATCAGTTGGATTTAA     

     CCGTGCGACCCGTCTCATGGAAGAACTGGAGATAGCAGGTGTCATCGGTCCAGCTGAAGGTACCAAACC     

     TCGAAAAGTGTTACAACAA                                                       

     SP07B  amino  acid  (SEQ  ID  NO:132)                                     

     RGFAKWWEGHERRKEERFVKQEEKARQKAEKEARLEQEETEKALLDLPPVDMETGEILTEEAVQNLPPI     

     PEEKWVEPEIILPQAELKFPEQEDDSDDEDVQVDFSAKEALEYKLPSLQLFAPDKPKDQSKEKKIVREN     

     IKILEATFASFGIKVTVERAEIGPSVTKYEVKPAVGVRVNRISNLSDDLALALAAKDVRIEAPIPGKSL     

     IGIEVPNSDIATVSFRELWEQSQTKAENFLEIPLGKAVNGTARAFDLSKMPHLLVAGSTGSGKSVAVNG     

     IIASILMKARPDQVKFMMPVDPKMVELSVYNDIPHLLIPWTNPRKASKALQKVVDEMENRYELFAKVGV     

     PNIAGFNAKVEEFNSQSEYKQIPLPFIVVIVDELADLMMVASKEVEDAIIRLGQKARAAGIHMILATQR     

     PSVDVISGLIKAAVPSRVAFAVSSGTDSRTILDENGAEKLLGRGDMLFKPIDENHPVRLQGSFISDDDV     

     ERIVNFIKTQADADYDESFDPGEVSENEGEFSDGDAGGDPLFEEAKSLVIETQKASASMIQRRLSVGFN     

     RATRLMEELEIAGVIGPAEGTKPRKVLQQ                                             

     SP079  nucleotide  (SEQ  ID  NO:133)                                      

     TCAAAAAGAGAAGGAAAACTTGGTTATTGCTGGGAAAATAGGTCCAGAACCAGAAATTTTGGCCAATAT     

     GTATAAGTTGCTGATTGAAGAAAATACCAGCATGACTGCGACTGTTAAACCGAATTTTGGGAAGACAAG     

     CTTCCTTTATGAAGCTCTGAAAAAAGGCGATATTGACATCTATCCTGAATTTACTGGTACGGTGACTGA     

     AAGTTTGCTTCAACCATCACCCAAGGTGAGTCATGAACCAGAACAGGTTTATCAGGTGGCGCGTGATGG     

     CATTGCTAAGCAGGATCATCTAGCCTATCTCAAACCCATGTCTTATCAAAACACCTATGCTGTAGCTGT     

     TCCGAAAAAGATTGCTCAAGAATATGGCTTGAAGACCATTTCAGACTTGAAAAAAGTGGAAGGGCAGTT     

     GAAGGCAGGTTTTACACTCGAGTTTAACGACCGTGAAGATGGAAATAAGGGCTTGCAATCAATGTATGG     

     TCTCAATCTCAATGTAGCGACCATTGAGCCAGCCCTTCGCTATCAGGCTATTCAGTCAGGGGATATTCA     

     AATCACGGATGCCTATTCGACTGATGCGGAATTGGAGCGTTATGATTTACAGGTCTTGGAAGATGACAA     

     GCAACTCTTCCCACCTTATCAAGGGGCTCCACTCATGAAAGAAGCTCTTCTCAAGAAACACCCAGAGTT     

     GGAAAGAGTTCTTAATACATTGGCTGGTAAGATTACAGAAAGCCAGATGAGCCAGCTCAACTACCAAGT     

     CGGTGTTGAAGGCAAGTCAGCAAAGCAAGTAGCCAAGGAGTTTCTCCAAGAACAAGGTTTGTTGAAGAA     

     A                                                                         

     SP079  amino  acid  (SEQ  ID  NO:134)                                     

     QKEKENLVIAGKIGPEPEILANMYKLLIEENTSMTATVKPNFGKTSFLYEALKKGDIDIYPEFTGTVTE     

     SLLQPSPKVSHEPEQVYQVARDGIAKQDHLAYLKPMSYQNTYAVAVPKKIAQEYGLKTISDLKKVEGQL     

     KAGFTLEFNDREDGNKGLQSMYGLNLNVATIEPALRYQAIQSGDIQITDAYSTDAELERYDLQVLEDDK     

     QLFPPYQGAPLMKEALLKKHPELERVLNTLAGKITESQMSQLNYQVGVEGKSAKQVAKEFLQEQGLLKK     

     SP080  nucleotide  (SEQ  ID  NO:135)                                      

     ACGTTCTATTGAGGACCACTTTGATTCAAACTTCGAATTGGAATATAACCTCAAAGAAAAAGGGAAAAC     

     AGATCTTTTGAAGCTAGTTGATAAAACAACTGACATGCGTCTGCATTTTATCCGCCAAACTCATCCACG     

     CGGTCTCGGAGATGCTGTTTTGCAAGCCAAGGCTTTCGTCGGAAATGAACCTTTTGTCGTTATGCTTGG     

     TGATGACTTGATGGATATCACAGACGAAAAGGCTGTTCCACTTACCAAACAACTCATGGATGACTACGA     

     GCGTACCCACGCGTCTACTATCGCTGTCATGCCAGTCCCTCATGACGAAGTATCTGCTTACGGGGTTAT     

     TGCTCCGCAAGGCGAAGGAAAAGATGGTCTTTACAGTGTTGAAACCTTTGTTGAAAAACCAGCTCCAGA     

     GGACGCTCCTAGCGACCTTGCTATTATCGGACGCTACCTCCTCACGCCTGAAATTTTTGAGATTCTCGA     

     AAAGCAAGCTCCAGGTGCAGGAAATGAAATTCAGCTGACAGATGCAATCGACACCCTCAATAAAACACA     

     ACGTGTATTTGCTCGTGAGTTCAAAGGGGCTCGTTACGATGTCGGAGACAAGTTTGGCTTCATGAAAAC     

     ATCCATCGACTACGCCCTCAAACACCCACAAGTCAAAGATGATTTGAAGAATTACCTCATCCAACTTGG     

     AAAAGAATTGACTGAGAAGGAA                                                    

     SP080  amino  acid  (SEQ  ID  NO:136)                                     

     RSIEDHFDSNFELEYNLKEKGKTDLLKLVDKTTDMRLHFIRQTHPRGLGDAVLQAKAFVGNEPFVVMLG     

     DDLMDITDEKAVPLTKQLMDDYERTHASTIAVMPVPHDEVSAYGVIAPQGEGKDGLYSVETFVEKPAPE     

     DAPSDLAIIGRYLLTPEIFEILEKQAPGAGNEIQLTDAIDTLNKTQRVFAREFKGARYDVGDKFGFMKT     

     SIDYALKHPQVKDDLKNYLIQLGKELTEKE                                            

     SP081  nucleotide  (SEQ  ID  NO:137)                                      

     CGCTCAAAATACCAGAGGTGTTCAGCTAATCGAGCACGTTTCTCCTCAAATGTTGAAAGCCCAATTGGA     

     GAGTGTCTTTTCTGATATTCCACCTCAGGCTGTAAAAACTGGAATGTTGGCTACTACTGAAATCATGGA     

     AATCATCCAACCCTATCTTAAAAAACTGGATTGTCCCTATGTCCTTGATCCTGTTATGGTTGCTACAAG     

     TGGAGATGCCTTGATTGACTCAAATGCTAGAGACTATCTCAAAACAAACTTACTACCTCTAGCAACTAT     

     TATTACGCCAAATCTTCCTGAAGCAGAAGAGATTGTTGGTTTTTCAATCCATGACCCCGAAGACATGCA     

     GCGTGCTGGTCGCCTGATTTTAAAAGAATTTGGTCCTCAGTCTGTGGTTATCAAAGGCGGACATCTCAA     

     AGGTGGTGCTAAAGATTTCCTCTTTACCAAGAATGAACAATTTGTCTGGGAAAGCCCACGAATTCAAAC     

     CTGTCACACCCATGGTACT                                                       

     SP081  amino  acid  (SEQ  ID  NO:138)                                     

     AQNTRGVQLIEHVSPQMLKAQLESVFSDIPPQAVKTGMLATTEIMEIIQPYLKKLDCPYVLDPVMVATS     

     GDALIDSNARDYLKTNLLPLATIITPNLPEAEEIVGFSIHDPEDMQRAGRLILKEFGPQSVVIKGGHLK     

     GGAKDFLFTKNEQFVWESPRIQTCHTHGT                                             

     SP082  nucleotide  (SEQ  ID  NO:139)                                      

     AATTGTACAATTAGAAAAAGATAGCAAATCAGACAAAGAACAAGTTGATAAACTATTTGAATCATTTGA     

     TGCATCTTCAGATGAATCTATTTCTAAATTAAAAGAACTATCTGAAACTTCACTTAAAACCGATGCAGG     

     TAAAGACTATCTTAATAACAAAGTCAAAGAATCATCTAAAGCAATTGTAGATTTTCATTTGCAAAAAGG     

     TTTGGCTTATGATGTTAAAGATTCAGATGACAAATTTAAAGATAAAGCAACTCTTGAAACAAATGTAAA     

     AGAAATTACAAAACAAATTGATTTTATCAAAAAAGTTGATGAAACTTTTAAACAAGAGAATTTGGAAGA     

     AACTCTTAAATCTCTAAATGATCTTGTTGATAAATATCAAAAACAAATCGAACTTTTGAAGAAAGAAGA     

     AGAAAAAGCTGCTGAAAAAGCTGCTGAAAAAGCAAAGGAATCTTCTAGTCAAAGTAATTCTTCTGGTAG     

     TGCTTCTAATGAGTCTTATAATGGATCTTCCAATTCAAATGTAGATTATAGTTCATCTGAACAAACTAA     

     TGGATATTCAAATAATTATGGCGGTCAAGATTATTCTGGTTCAGGAGATAGTTCAACAAATGGTGGATC     

     ATCAGAACAATATTCATCTAGCAATTCAAACAGCGGAGCAAATAATGTCTACAGATATAAAGGCACTGG     

     TGCTGACGGCTATCAAAGATACTACTACAAAGATCATAATAATGGAGATGTGTATGATGACGATGGAAA     

     TTACCTTGGGAACTTTGGTGGCGGCATTGCAGAACCTAGTCAACGC                            

     SP082  amino  acid  (SEQ  ID  NO:140)                                     

     IVQLEKDSKSDKEQVDKLFESFDASSDESISKLKELSETSLKTDAGKDYLNNKVKESSKAIVDFHLQKG     

     LAYDVKDSDDKFKDKATLETNVKEITKQIDFIKKVDETFKQENLEETLKSLNDLVDKYQKQIELLKKEE     

     EKAAEKAAEKAKESSSQSNSSGSASNESYNGSSNSNVDYSSSEQTNGYSNNYGGQDYSGSGDSSTNGGS     

     SEQYSSSNSNSGAVVVYRYKGTGADGYQRYYYKDHNNGDVYDDDGNYLGNFGGGIAEPSQR             

     SP083  nucleotide  (SEQ  ID  NO:141)                                      

     TCTGACCAAGCAAAAAGAAGCAGTCAATGACAAAGGAAAAGCAGCTGTTGTTAAGGTGGTGGAAAGCCA     

     GGCAGAACTTTATAGCTTAGAAAAGAATGAAGATGCTAGCCTAAGAAAGTTACAAGCAGATGGACGCAT     

     CACGGAAGAACAGGCTAAAGCTTATAAAGAATACAATGATAAAAATGGAGGAGCAAATCGTAAAGTCAA     

     TGAT                                                                      

     SP083  amino  acid  (SEQ  ID  NO:142)                                     

     LTKQKEAVNDKGKAAVVKVVESQAELYSLEKNEDASLRKLQADGRITEEQAKAYKEYNDKNGGANRKVN     

     D                                                                         

     SP084  nucleotide  (SEQ  ID  NO:143)                                      

     GTCCGGCTCTGTCCAGTCCACTTTTTCAGCGGTAGAGGAACAGATTTTCTTTATGGAGTTTGAAGAACT     

     CTATCGGGAAACCCAAAAACGCAGTGTAGCCAGTCAGCAAAAGACTAGTCTGAACTTAGATGGGCAGAC     

     GCTTAGCAATGGCAGTCAAAAGTTGCCAGTCCCTAAAGGAATTCAGGCCCCATCAGGCCAAAGTATTAC     

     ATTTGACCGAGCTGGGGGCAATTCGTCCCTGGCTAAGGTTGAATTTCAGACCAGTAAAGGAGCGATTCG     

     CTATCAATTATATCTAGGAAATGGAAAAATTAAACGCATTAAGGAAACAAAAAAT                   

     SP084  amino  acid  (SEQ  ID  NO:144)                                     

     SGSVQSTFSAVEEQIFFMEFEELYRETQKRSVASQQKTSLNLDGQTLSNGSQKLPVPKGIQAPSGQSIT     

     FDRAGGNSSLAKVEFQTSKGAIRYQLYLGNGKIKRIKETKN                                 

     SP085  nucleotide  (SEQ  ID  NO:145)                                      

     GGGACAAATTCAAAAAAATAGGCAAGAGGAAGCAAAAATCTTGCAAAAGGAAGAAGTCTTGAGGGTAGC     

     TAAGATGGCCCTGCAGACGGGGCAAAATCAGGTAAGCATCAACGGAGTTGAGATTCAGGTATTTTCTAG     

     TGAAAAAGGATTGGAGGTCTACCATGGTTCAGAACAGTTGTTGGCAATCAAAGAGCCA                

     SP085  amino  acid  (SEQ  ID  NO:146)                                     

     GQIQKNRQEEAKILQKEEVLRVAKMALQTGQNQVSINGVEIQVFSSEKGLEVYHGSEQLLAIKEP         

     SP086  nucleotide  (SEQ  ID  NO:147)                                      

     TCGCTACCAGCAACAAAGCGAGCAAAAGGAGTGGCTCTTGTTTGTGGACCAACTTGAGGTAGAATTAGA     

     CCGTTCGCAGTTCGAAAAAGTAGAAGGCAATCGCCTATACATGAAGCAAGATGGCAAGGACATCGCCAT     

     CGGTAAGTCAAAGTCAGATGATTTCCGTAAAACGAATGCTCGTGGTCGAGGTTATCAGCCTATGGTTTA     

     TGGACTCAAATCTGTACGGATTACAGAGGACAATCAACTGGTTCGCTTTCATTTCCAGTTCCAAAAAGG     

     CTTAGAAAGGGAGTTCATCTATCGTGTGGAAAAAGAAAAAAGT                               

     SP086  amino  acid  (SEQ  ID  NO:148)                                     

     RYQQQSEQKEWLLFVDQLEVELDRSQFEKVEGNRLYMKQDGKDIAIGKSKSDDFRKTNARGRGYQPMVY     

     GLKSVRITEDNQLVRFHFQFQKGLEREFIYRVEKEKS                                     

     SP087  nucleotide  (SEQ  ID  NO:149)                                      

     GAACCGACAAGTCGCCCACTATCAAGACTATGCTTTGAATAAAGAAAAATTGGTTGCTTTTGCTATGGC     

     TAAACGAACCAAAGATAAGGTTGAGCAAGAAAGTGGGGAACAGTTTTTTAATCTAGGTCAGGTAAGCTA     

     TCAAAACAAGAAAACTGGCTTAGTGACGAGGGTTCGTACGGATAAGAGCCAATATGAGTTTCTGTTTCC     

     TTCAGTCAAAATCAAAGAAGAGAAAAGAGATAAAAAGGAAGAGGTAGCGACCGATTCAAGCGAAAAAGT     

     GGAGAAGAAAAAATCAGAAGAGAAGCCTGAAAAGAAAGAGAATTCA                            

     SP087  amino  acid  (SEQ  ID  NO:150)                                     

     NRQVAHYQDYALNKEKLVAFAMAKRTKDKVEQESGEQFFNLGQVSYQNKKTGLVTRVRTDKSQYEFLFP     

     SVKIKEEKRDKKEEVATDSSEKVEKKKSEEKPEKKENS                                    

     SP088  nucleotide  (SEQ  ID  NO:151)                                      

     GGTTGTCGGCTGGCAATATATCCCGTTTCCATCTAAAGGTAGTACAATTGGTCCTTACCCAAATGGTAT     

     CAGATTAGAAGGTTTTCCAAAGTCAGAGTGGTACTACTTCGATAAAAATGGAGTGCTACAAGAGTTTGT     

     TGGTTGGAAAACATTAGAGATTAAAACTAAAGACAGTGTTGGAAGAAAGTACGGGGAAAAACGTGAAGA     

     TTCAGAAGATAAAGAAGAGAAGCGTTATTATACGAACTATTACTTTAATCAAAATCATTCTTTAGAGAC     

     AGGTTGGCTTTATGATCAGTCTAACTGGTATTATCTAGCTAAGACGGAAATTAATGGAGAAAACTACCT     

     TGGTGGTGAAAGACGTGCGGGGTGGATAAACGATGATTCGACTTGGTACTACCTAGATCCAACAACTGG     

     TATTATGCAAACAGGTTGGCAATATCTAGGTAATAAGTGGTACTACCTCCGTTCCTCAGGAGCAATGGC     

     CACTGGCTGGTATCAGGAAGGTACCACTTGGTATTATTTAGACCACCCAAATGGCGATATGAAAACAGG     

     TTGGCAAAACCTTGGGAACAAATGGTACTATCTCCGTTCATCAGGAGCTATGGCAACTGGTTGGTATCA     

     AGATGGTTCAACTTGGTACTACCTAAATGCAGGTAATGGAGACATGAAGACAGGTTGGTTCCAGGTCAA     

     TGGCAACTGGTACTATGCTTATAGCTCAGGTGCTTTGGCAGTGAATACGACCGTAGATGGCTATTCTGT     

     CAACTATAATGGCGAATGGGTTCGG                                                 

     SP088  amino  acid  (SEQ  ID  NO:152)                                     

     VVGWQYIPFPSKGSTIGPYPNGIRLEGFPKSEWYYFDKNGVLQEFVGWKTLEIKTKDSVGRKYGEKRED     

     SEDKEEKRYYTNYYFNQNHSLETGWLYDQSNWYYLAKTEINGENYLGGERRAGWINDDSTWYYLDPTTG     

     IMQTGWQYLGNKWYYLRSSGAMATGWYQEGTTWYYLDHPNGDMKTGWQNLGNKWYYLRSSGAMATGWYQ     

     DGSTWYYLNAGNGDMKTGWFQVNGNWYYAYSSGALAVNTTVDGYSVYYNGEWVR                    

     SP089  nucleotide  (SEQ  ID  NO:153)                                      

     GGCCAAATCAGAATGGGTAGAAGACAAGGGAGCCTTTTATTATCTTGACCAAGATGGAAAGATGAAAAG     

     AAATGCTTGGGTAGGAACTTCCTATGTTGGTGCAACAGGTGCCAAAGTAATAGAAGACTGGGTCTATGA     

     TTCTCAATACGATGCTTGGTTTTATATCAAAGCAGATGGACAGCACGCAGAGAAAGAATGGCTCCAAAT     

     TAAAGGGAAGGACTATTATTTCAAATCCGGTGGTTATCTACTGACAAGTCAGTGGATTAATCAAGCTTA     

     TGTGAATGCTAGTGGTGCCAAAGTACAGCAAGGTTGGCTTTTTGACAAACAATACCAATCTTGGTTTTA     

     CATCAAAGAAAATGGAAACTATGCTGATAAAGAATGGATTTTCGAGAATGGTCACTATTATTATCTAAA     

     ATCCGGTGGCTACATGGCAGCCAATGAATGGATTTGGGATAAGGAATCTTGGTTTTATCTCAAATTTGA     

     TGGGAAAATGGCTGAAAAAGAATGGGTCTACGATTCTCATAGTCAAGCTTGGTACTACTTCAAATCCGG     

     TGGTTACATGACAGCCAATGAATGGATTTGGGATAAGGAATCTTGGTTTTATCTCAAATCTGATGGGAA     

     AATAGCTGAAAAAGAATGGGTCTACGATTCTCATAGTCAAGCTTGGTACTACTTCAAATCCGGTGGTTA     

     CATGACAGCCAATGAATGGATTTGGGATAAGGAATCTTGGTTTTACCTCAAATCTGATGGGAAAATAGC     

     TGAAAAAGAATGGGTCTACGATTCTCATAGTCAAGCTTGGTACTACTTCAAATCTGGTGGCTACATGGC     

     GAAAAATGAGACAGTAGATGGTTATCAGCTTGGAAGCGATGGTAAATGGCTTGGAGGAAAAACTACAAA     

     TGAAAATGCTGCTTACTATCAAGTAGTGCCTGTTACAGCCAATGTTTATGATTCAGATGGTGAAAAGCT     

     TTCCTATATATCGCAAGGTAGTGTCGTATGGCTAGATAAGGATAGAAAAAGTGATGACAAGCGCTTGGC     

     TATTACTATTTCTGGTTTGTCAGGCTATATGAAAACAGAAGATTTACAAGCGCTAGATGCTAGTAAGGA     

     CTTTATCCCTTATTATGAGAGTGATGGCCACCGTTTTTATCACTATGTGGCTCAGAATGCTAGTATCCC     

     AGTAGCTTCTCATCTTTCTGATATGGAAGTAGGCAAGAAATATTATTCGGCAGATGGCCTGCATTTTGA     

     TGGTTTTAAGCTTGAGAATCCCTTCCTTTTCAAAGATTTAACAGAGGCTACAAACTACAGTGCTGAAGA     

     ATTGGATAAGGTATTTAGTTTGCTAAACATTAACAATAGCCTTTTGGAGAACAAGGGCGCTACTTTTAA     

     GGAAGCCGAAGAACATTACCATATCAATGCTCTTTATCTCCTTGCCCATAGTGCCCTAGAAAGTAACTG     

     GGGAAGAAGTAAAATTGCCAAAGATAAGAATAATTTCTTTGGCATTACAGCCTATGATACGACCCCTTA     

     CCTTTCTGCTAAGACATTTGATGATGTGGATAAGGGAATTTTAGGTGCAACCAAGTGGATTAAGGAAAA     

     TTATATCGATAGGGGAAGAACTTTCCTTGGAAACAAGGCTTCTGGTATGAATGTGGAATATGCTTCAGA     

     CCCTTATTGGGGCGAAAAAATTGCTAGTGTGATGATGAAAATCAATGAGAAG                      

     SP089  amino  acid  (SEQ  ID  NO:154)                                     

     AKSEWVEDKGAFYYLDQDGKMKRNAWVGTSYVGATGAKVIEDWVYDSQYDAWFYIKADGQHAEKEWLQI     

     KGKDYYFKSGGYLLTSQWINQAYVNASGAKVQQGWLFDKQYQSWFYIKENGNYADKEWIFENGHYYYLK     

     SGGYMAANEWIWDKESWFYLKFDGKMAEKEWVYDSHSQAWYYFKSGGYMTANEWIWDKESWFYLKSDGK     

     IAEKEWVYDSHSQAWYYFKSGGYMTANEWIWDKESWFYLKSDGKIAEKEWVYDSHSQAWYYFKSGGYMA     

     KNETVDGYQLGSDGKWLGGKTTNENAAYYQVVPVTANVYDSDGEKLSYISQGSVVWLDKDRKSDDKRLA     

     ITISGLSGYMKTEDLQALDASKDFIPYYESDGHRFYHYVAQNASIPVASHLSDMEVGKKYYSADGLHFD     

     GFKLENPFLFKDLTEATNYSAEELDKVFSLLNINNSLLENKGATFKEAEEHYHINALYLLAHSALESNW     

     GRSKIAKDKNNFFGITAYDTTPYLSAKTFDDVDKGILGATKWIKENYIDRGRTFLGNKASGMNVEYASD     

     PYWGEKIASVMMKINEK                                                         

     SP090  nucleotide  (SEQ  ID  NO:155)                                      

     ATTTGCAGATGATTCTGAAGGATGGCAGTTTGTCCAAGAAAATGGTAGAACCTACTACAAAAAGGGGGA     

     TCTAAAAGAAACCTACTGGAGAGTGATAGATGGGAAGTACTATTATTTTGATCCTTTATCCGGAGAGAT     

     GGTTGTCGGCTGGCAATATATACCTGCTCCACACAAGGGGGTTACGATTGGTCCTTCTCCAAGAATAGA     

     GATTGCTCTTAGACCAGATTGGTTTTATTTTGGTCAAGATGGTGTATTACAAGAATTTGTTGGCAAGCA     

     AGTTTTAGAAGCAAAAACTGCTACGAATACCAACAAACATCATGGGGAAGAATATGATAGCCAAGCAGA     

     GAAACGAGTCTATTATTTTGAAGATCAGCGTAGTTATCATACTTTAAAAACTGGTTGGATTTATGAAGA     

     GGGTCATTGGTATTATTTACAGAAGGATGGTGGCTTTGATTCGCGCATCAACAGATTGACGGTTGGAGA     

     GCTAGCACGTGGTTGGGTTAAGGATTACCCTCTTACGTATGATGAAGAGAAGCTAAAAGCAGCTCCATG     

     GTACTATCTAAATCCAGCAACTGGCATTATGCAAACAGGTTGGCAATATCTAGGTAATAGATGGTACTA     

     CCTCCATTCGTCAGGAGCTATGGCAACTGGCTGGTATAAGGAAGGCTCAACTTGGTACTATCTAGATGC     

     TGAAAATGGTGATATGAGAACTGGCTGGCAAAACCTTGGGAACAAATGGTACTATCTCCGTTCATCAGG     

     AGCTATGGCAACTGGTTGGTATCAGGAAAGTTCGACTTGGTACTATCTAAATGCAAGTAATGGAGATAT     

     GAAAACAGGCTGGTTCCAAGTCAATGGTAACTGGTACTATGCCTATGATTCAGGTGCTTTAGCTGTTAA     

     TACCACAGTAGGTGGTTACTACTTAAACTATAATGGTGAATGGGTTAAG                         

     SP090  amino  acid  (SEQ  ID  NO:156)                                     

     VFADDSEGWQFVQENGRTYYKKGDLKETYWRVIDGKYYYFDPLSGEMVVGWQYIPAPHKGVTIGPSPRI     

     EIALRPDWFYFGQDGVLQEFVGKQVLEAKTATNTNKHHGEEYDSQAEKRVYYFEDQRSYHTLKTGWIYE     

     EGHWYYLQKDGGFDSRINRLTVGELARGWVKDYPLTYDEEKLKAAPWYYLNPATGIMQTGWQYLGNRWY     

     YLHSSGAMATGWYKEGSTWYYLDAENGDMRTGWQNLGNKWYYLRSSGAMATGWYQESSTWYYLNASNGD     

     MKTGWFQVNGNWYYAYDSGALAVNTTVGGYYLNYNGEWVK                                  

     SP091  nucleotide  (SEQ  ID  NO:157)                                      

     TGTCGCTGCAAATGAAACTGAAGTAGCAAAAACTTCGCAGGATACAACGACAGCTTCAAGTAGTTCAGA     

     GCAAAATCAGTCTTCTAATAAAACGCAAACGAGCGCAGAAGTACAGACTAATGCTGCTGCCCACTGGGA     

     TGGGGATTATTATGTAAAGGATGATGGTTCTAAAGCTCAAAGTGAATGGATTTTTGACAACTACTATAA     

     GGCTTGGTTTTATATTAATTCAGATGGTCGTTACTCGCAGAATGAATGGCATGGAAATTACTACCTGAA     

     ATCAGGTGGATATATGGCCCAAAACGAGTGGATCTATGACAGTAATTACAAGAGTTGGTTTTATCTCAA     

     GTCAGATGGGGCTTATGCTCATCAAGAATGGCAATTGATTGGAAATAAGTGGTACTACTTCAAGAAGTG     

     GGGTTACATGGCTAAAAGCCAATGGCAAGGAAGTTATTTCTTGAATGGTCAAGGAGCTATGATGCAAAA     

     TGAATGGCTSCTATGATCCAGCCTATTCTGCTTATTTTTATCTAAAATCCGATGGAACTTATGCTAACC     

     AAGAGTGGCAAAAAGTGGGCGGCAAATGGTACTATTTCAAGAAGTGGGGCTATATGGCTCGGAATGAGT     

     GGCAAGGCAACTACTATTTGACTGGAAGTGGTGCCATGGCGACTGACGAAGTGATTATGGATGGTACTC     

     GCTATATCTTTGCGGCCTCTGGTGAGCTCAAAGAAAAAAAAGATTTGAATGTCGGCTGGGTTCACAGAG     

     ATGGTAAGCGCTATTTCTTTAATAATAGAGAAGAACAAGTGGGAACCGAACATGCTAAGAAAGTCATTG     

     ATATTAGTGAGCACAATGGTCGTATCAATGATTGGAAAAAGGTTATTGATGAGAACGAAGTGGATGGTG     

     TCATTGTTCGTCTAGGTTATAGCGGTAAAGAAGACAAGGAATTGGCGCATAACATTAAGGAGTTAAACC     

     GTCTGGGAATTCCTTATGGTGTCTATCTCTATACCTATGCTGAAAATGAGACCGATGCTGAGAGTGACG     

     CTAAACAGACCATTGAACTTATAAAGAAATACAATATGAACCTGTCTTACCCTATCTATTATGATGTTG     

     AGAATTGGGAATATGTAAATAAGAGCAAGAGAGCTCCAAGTGATACAGGCACTTGGGTTAAAATCATCA     

     ACAAGTACATGGACACGATGAAGCAGGCGGGTTATCAAAATGTGTATGTCTATAGCTATCGTAGTTTAT     

     TACAGACGCGTTTAAAACACCCAGATATTTTAAAACATGTAAACTGGGTAGCGGCCTATACGAATGCTT     

     TAGAATGGGAAAACCCTCATTATTCAGGAAAAAAAGGTTGGCAATATACCTCTTCTGAATACATGAAAG     

     GAATCCAAGGGCGCGTAGATGTCAGCGTTTGGTAT                                       

     SP091  amino  acid  (SEQ  ID  NO:158)                                     

     VAANETEVAKTSQDTTTASSSSEQNQSSNKTQTSAEVQTNAAAHWDGDYYVKDDGSKAQSEWIFDNYYK     

     AWFYINSDGRYSQNEWHGNYYLKSGGYMAQNEWIYDSNYKSWFYLKSDGAYAHQEWQLIGNKWYYFKKW     

     GYMAKSQWQGSYFLNGQGAMMQNEWLYDPAYSAYFYLKSDGTYANQEWQKVGGKWYYFKKWGYMARNEW     

     QGNYYLTGSGAMATDEVIMDGTRYIFAASGELKEKKDLNVGWVHRDGKRYFFNNREEQVGTEHAKKVID     

     ISEHNGRINDWKKVIDENEVDGVIVRLGYSGKEDKELAHNIKELNRLGIPYGVYLYTYAENETDAESDA     

     KQTIELIKKYNMNLSYPIYYDVENWEYVNKSKRAPSDTGTWVKIINKYMDTMKQAGYQNVYVYSYRSLL     

     QTRLKHPDILKHVNWVAAYTNALEWENPHYSGKKGWQYTSSEYMKGIQGRVDVSVWY                 

     SP092  nucleotide  (SEQ  ID  NO:159)                                      

     TACGTCTCAGCCTACTTTTGTAAGAGCAGAAGAATCTCCACAAGTTGTCGAAAAATCTTCATTAGAGAA     

     GAAATATGAGGAAGCAAAAGCAAAAGCTGATACTGCCAAGAAAGATTACGAAACGGCTAAAAAGAAAGC     

     AGAAGACGCTCAGAAAAAGTATGAAGATGATCAGAAGAGAACTGAGGAGAAAGCTCGAAAAGAAGCAGA     

     AGCATCTCAAAAATTGAATGATGTGGCGCTTGTTGTTCAAAATGCATATAAAGAGTACCGAGAAGTTCA     

     AAATCAACGTAGTAAATATAAATCTGACGCTGAATATCAGAAAAAATTAACAGAGGTCGACTCTAAAAT     

     AGAGAAGGCTAGGAAAGAGCAACAGGACTTGCAAAATAAATTTAATGAAGTAAGAGCAGTTGTAGTTCC     

     TGAACCAAATGCGTTGGCTGAGACTAAGAAAAAAGCAGAAGAAGCTAAAGCAGAAGAAAAAGTAGCTAA     

     GAGAAAATATGATTATGCAACTCTAAAGGTAGCACTAGCGAAGAAAGAAGTAGAGGCTAAGGAACTTGA     

     AATTGAAAAACTTCAATATGAAATTTCTACTTTGGAACAAGAAGTTGCTACTGCTCAACATCAAGTAGA     

     TAATTTGAAAAAACTTCTTGCTGGTGCGGATCCTGATGATGGCACAGAAGTTATAGAAGCTAAATTAAA     

     AAAAGGAGAAGCTGAGCTAAACGCTAAACAAGCTGAGTTAGCAAAAAAACAAACAGAACTTGAAAAACT     

     TCTTGACAGCCTTGATCCTGAAGGTAAGACTCAGGATGAATTAGATAAAGAAGCAGAAGAAGCTGAGTT     

     GGATAAAAAAGCTGATGAACTTCAAAATAAAGTTGCTGATTTAGAAAAAGAAATTAGTAACCTTGAAAT     

     ATTACTTGGAGGGGCTGATNCTGAAGATGATACTGCTGCTCTTCAAAATAAATTAGCTACTAAAAAAGC     

     TGAATTGGAAAAAACTCAAAAAGAATTAGATGCAGCTCTTAATGAGTTAGGCCCTGATGGAGATGAAGA     

     AGAAACTCCAGCGCCGGCTCCTCAACCAGAGCAACCAGCTCCTGCACCAAAACCAGAGCAACCAGCTCC     

     AGCTCCAAAACCAGAGCAACCAGCTCCTGCACCAAAACCAGAGCAACCAGCTCCAGCTCCAAAACCAGA     

     GCAACCAGCTCCAGCTCCAAAACCAGAGCAACCAGCTAAGCCGGAGAAACCAGCTGAAGAGCCTACTCA     

     ACCAGAAAAACCAGCCACTCCAAAAACAGGCTGGAAACAAGAAAACGGTATGTGGTATTTCTACAATAC     

     TGATGGTTCAATGGCAATAGGTTGGCTCCAAAACAACGGTTCATGGTACTACCTAAACGCTAACGGCGC     

     TATGGCAACAGGTTGGGTGAAAGATGGAGATACCTGGTACTATCTTGAAGCATCAGGTGCTATGAAAGC     

     AAGCCAATGGTTCAAAGTATCAGATAAATGGTACTATGTCAACAGCAATGGCGCTATGGCGACAGGCTG     

     GCTCCAATACAATGGCTCATGGTACTACCTCAACGCTAATGGTGATATGGCGACAGGATGGCTCCAATA     

     CAACGGTTCATGGTATTACCTCAACGCTAATGGTGATATGGCGACAGGATGGGCTAAAGTCAACGGTTC     

     ATGGTACTACCTAAACGCTAACGGTGCTATGGCTACAGGTTGGGCTAAAGTCAACGGTTCATGGTACTA     

     CCTAAACGCTAACGGTTCAATGGCAACAGGTTGGGTGAAAGATGGAGATACCTGGTACTATCTTGAAGC     

     ATCAGGTGCTATGAAAGCAAGCCAATGGTTCAAAGTATCAGATAAATGGTACTATGTCAATGGCTTAGG     

     TGCCCTTGCAGTCAACACAACTGTAGATGGCTATAAAGTCAATGCCAATGGTGAATGGGTT             

     SP092  amino  acid  (SEQ  ID  NO:160)                                     

     TSQPTFVRAEESPQVVEKSSLEKKYEEAKAKADTAKKDYETAKKKAEDAQKKYEDDQKRTEEKARKEAE     

     ASQKLNDVALVVQNAYKEYREVQNQRSKYKSDAEYQKKLTEVDSKIEKARKEQQDLQNKFNEVRAVVVP     

     EPNALAETKKKAEEAKAEEKVAKRKYDYATLKVALAKKEVEAKELEIEKLQYEISTLEQEVATAQHQVD     

     NLKKLLAGADPDDGTEVIEAKLKKGEAELNAKQAELAKKQTELEKLLDSLDPEGKTQDELDKEAEEAEL     

     DKKADELQNKVADLEKEISNLEILLGGADXEDDTAALQNKLATKKAELEKTQKELDAALNELGPDGDEE     

     ETPAPAPQPEQPAPAPKPEQPAPAPKPEQPAPAPKPEQPAPAPKPEQPAPAPKPEQPAKPEKPAEEPTQ     

     PEKPATPKTGWKQENGMWYFYNTDGSMAIGWLQNNGSWYYLNANGANATGWVKDGDTWYYLEASGAMKA     

     SQWFKVSDKWYYVNSNGAMATGWLQYNGSWYYLNANGDNATGWLQYNGSWYYLNANGDMATGWAKVNGS     

     WYYLNANGANATGWAKVNGSWYYLNANGSNATGWVKDGDTWYYLEASGAMKASQWFKVSDKWYYVNGLG     

     ALAVNTTVDGYKVNANGEWV                                                      

     P093                                         nucleotide  (SEQ  ID         

                                                NO:161)                        

     TGGACAGGTGAAAGGTCATGCTACATTTGTGAAATCCATGACAACTGAAATGTACCAAGAACAACAGAA     

     CCATTCTCTCGCCTACAATCAACGCTTGGNTTCGCAAAATCGCATTGTAGATCCTTTTTTGGCGGAGGG     

     ATATGAGGTCAATTACCAAGTGTCTGACGACCCTGATGCAGTCTATGGTTACTTGTCTATTCCAAGTTT     

     GGAAATCATGGAGCCGGTTTATTTGGGAGCAGATTATCATCATTTAGGGATGGGCTTGGCTCATGTGGA     

     TGGTACACCGCTGCCTCTGGATGGTACAGGGATTCGCTCAGTGATTGCTGGGCACCGTGCAGAGCCAAG     

     CCATGTCTTTTTCCGCCATTTGGATCAGCTAAAAGTTGGAGATGCTCTTTATTATGATAATGGCCAGGA     

     AATTGTAGAATATCAGATGATGGACACAGAGATTATTTTACCGTCGGAATGGGAAAAATTAGAATCGGT     

     TAGCTCTAAAAATATCATGACCTTGATAACCTGCGATCCGATTCCTACCTTTAATAAACGCTTATTAGT     

     GAATTTTGAACGAGTCGCTGTTTATCAAAAATCAGATCCACAAACAGCTGCAGTTGCGAGGGTTGCTTT     

     TACGAAAGAAGGACAATCTGTATCGCGTGTTGCAACCTCTCAATGGTTG                         

     SP093  amino  acid  (SEQ  ID  NO:162)                                     

     GQVKGHATFVKSMTTEMYQEQQNHSLAYNQRLXSQNRIVDPFLAEGYEVNYQVSDDPDAVYGYLSTPSL     

     EIMEPVYLGADYHHLGMGLAHVDGTPLPLDGTGIRSVIAGHRAEPSHVFFRHLDQLKVGDALYYDNGQE     

     IVEYQMMDTEIILPSEWEKLESVSSKNIMTLITCDPIPTFNKRLLVNFERVAVYQKSDPQTAAVARVAF     

     TKEGQSVSRVATSQWL                                                          

     SP094  nucleotide  (SEQ  ID  NO:163)                                      

     GATTGCTCCTTTGAAGGATTTGAGAGAAACCATGTTGGAAATTGCTTCTGGTGCTCAAAATCTTCGTGC     

     CAAGGAAGTTGGTGCCTATGAACTGAGAGAAGTAACTCGCCAATTTAATGCTATGTTGGATCAGATTGA     

     TCAGTTGATGGTAGCTATTCGTAGCCAGGAAGAAACGACCCGTCAGTACCAACTTCAAGCCCTTTCGAG     

     CCAGATTAATCCACATTTCCTCTATAACACTTTGGACACCATCATCTGGATGGCTGAATTTCATGATAG     

     TCAGCGAGTGGTGCAGGTGACCAAGTCCTTGGCAACCTATTTCCGCTTGGCGCTCAATCAAGGCAAGGA     

     CTTGATTTGTCTCTCTGACGAAATCAATCATGTCCGCCAGTATCTCTTTATCCAGAAACAACGCTATGG     

     AGATAAGCTGGAATACGAAATTAATGAAAATGTTGCCTTTGATAATTTAGTCTTACCCAAGCTGGTCCT     

     ACAACCCCTTGTAGAAAATGCTCTTTACCATGGCATTAAGGAAAAGGAAGGTCAGGGCCATATTAAACT     

     TTCTGTCCAGAAACAGGATTCGGGATTGGTCATCCGTATTGAGGATGATGGCGTTGGCTTCCAAGATGC     

     TGGTGATAGTAGTCAAAGTCAACTCAAACGTGGGGGAGTTGGTCTTCAAAATGTCGATCAACGGCTCAA     

     ACTTCATTTTGGAGCCAATTACCATATGAAGATTGATTCTAGACCCCAAAAAGGGACGAAAGTTGAAAT     

     ATATATAAATAGAATAGAAACTAGC                                                 

     SP094  amino  acid  (SEQ  ID  NO:164)                                     

     IAPLKDLRETMLEIASGAQNLRAKEVGAYELREVTRQFNAMLDQIDQLMVAIRSQEETTRQYQLQALSS     

     QINPHFLYNTLDTIIWMAEFHDSQRVVQVTKSLATYFRLALNQGKDLICLSDEINHVRQYLFIQKQRYG     

     DKLEYEINENVAFDNLVLPKLVLQPLVENALYHGIKEKEGQGHIKLSVQKQDSGLVIRIEDDGVGFQDA     

     GDSSQSQLKRGGVGLQNVDQRLKLHFGANYHMKIDSRPQKGTKVEIYINRIETS                    

     SP095  nucleotide  (SEQ  ID  NO:165)                                      

     TAGGTCATATGGGACTTTTTTTCTACAACAAAATAGGCTCCATAATATCTATAAGGGATTTACCCACTA     

     CAAATATTATAGAGCCGAAAATTCACATCTAATATATGCAGACTACTTTGAAATGAAATTAAAAAAATT     

     ATTAAAGGATGACACAAAAGTTTTTGAAAAATCTACATTCAAATTTGTAGAAGGATATAAAATATACCT     

     GACAGAATCTAAAGAATCTGGAATTAAACAAATGGACAATGTCATAAAATATTTTGAGTTTATTGAATC     

     TAAAAGTATTGCTTTATATTTTCAAAAACGATTAAATGAGCTGATAGAT                         

     SP095  amino  acid  (SEQ  ID  NO:166)                                     

     RSYGTFFLQQNRLHNIYKGFTHYKYYRAENSHLIYADYFEMKLKKLLKDDTKVFEKSTFKFVEGYKIYL     

     TESKESGIKQMDNVIKYFEFIESKSIALYFQKRLNELID                                   

     SP096  nucleotide  (SEQ  ID  NO:167)                                      

     CAACGTTGAGAATTATTTGCGAATGTGTTTGGATAGCATTCAGAATCAGACGTATCAAAATTTTGAGTG     

     TTTATTAATCAATGATGGCTCTCCAGATCATTCATCCAAAATATGTGAAGAATTTGTAGAGAAAGATTC     

     TCGTTTCAAATATTTTGAGAAAGCAAACGGCGGTCTTTCATCAGCTCGTAACCTAGGTATTGAATGTTC     

     GGGGGGGGGCGTACATTACTTTTGTAGACTC                                           

     SP096  amino  acid  (SEQ  ID  NO:168)                                     

     NVENYLRMCLDSIQNQTYQNFECLLINDGSPDHSSKICEEFVEKDSRFKYFEKANGGLSSARNLGIECS     

     GGGVHYFCRL                                                                

     SP097  nucleotide  (SEQ  ID  NO:169)                                      

     CTACTATCAATCAAGTTCTTCAGCCATTGAGGCCACCATTGAGGGCAACAGCCAAACGACCATCAGCCA     

     GACTAGCCACTTTATTCAGTCTTATATCAAAAAACTAGAAACCACCTCGACTGGTTTGACCCAGCAGAC     

     GGATGTTCTGGCCTATGCTGAGAATCCCAGTCAAGACAAGGTCGAGGGAATCCGAGATTTGTTTTTGAC     

     CATCTTGAAGTCAGATAAGGACTTGAAAACTGTTGTGCTGGTGACCAAATCTGGTCAGGTCATTTCTAC     

     AGATGACAGTGTGCAGATGAAAACTTCCTCTGATATGATGGCTGAGGATTGGTACCAAAAGGCCATTCA     

     TCAGGGAGCTATGCCTGTTTTGACTCCAGCTCGTAAATCAGATAGTCAGTGGGTCATTTCTGTCACTCA     

     AGAACTTGTTGATGCAAAGGGAGCCAATCTTGGTGTGCTTCGTTTGGATATTTCTTATGAAACTCTGGA     

     AGCCTATCTCAATCAACTCCAGTTGGGGCAGCAGGGCTTTGCCTTCATTATCAATGAAAACCATGAATT     

     TGTCTACCATCCTCAACACACAGTTTATAGTTCGTCTAGCAAAATGGAGGCTATGAAACCCTACATCGA     

     TACAGGTCAGGGTTATACTCCTGGTCACAAATCCTACGTCAGTCAAGAGAAGATTGCAGGAACTGATTG     

     GACGGTGCTTGGCGTGTCATCATTGGAAAAGTTAGACCAGGTTCGGAGTCAG                      

     SP097  amino  acid  (SEQ  ID  NO:170)                                     

     YYQSSSSAIEATIEGNSQTTISQTSHFIQSYIKKLETTSTGLTQQTDVLAYAENPSQDKVEGIRDLFLT     

     ILKSDKDLKTVVLVTKSGQVISTDDSVQMKTSSDMMAEDWYQKAIHQGAMPVLTPARKSDSQWVISVTQ     

     ELVDAKGANLGVLRLDISYETLEAYLNQLQLGQQGFAFIINENHEFVYHPQHTVYSSSSKMEAMKPYID     

     TGQGYTPGHKSYVSQEKIAGTDWTVLGVSSLEKLDQVRSQ                                  

     SP098  nucleotide  (SEQ  ID  NO:171)                                      

     GACAAAAACATTAAAACGTCCTGAGGTTTTATCACCTGCAGGGACTTTAGAGAAGCTAAAGGTAGCTGT     

     TCAGTATGGAGCAGATGCTGTCTTTATCGGTGGTCAGGCCTATGGTCTTCGTAGCCGTGCGGGAAACTT     

     TACTTTCGAACAGATGGAAGAAGGCGTGCAGTTTGCGGCCAAGTATGGTGCCAAGGTCTATGTAGCGGC     

     TAATATGGTTATGCACGAAGGAAATGAAGCTGGTGCTGGTGAGTGGTTCCGTAAACTGCGTGATATCGG     

     GATTGCAGCAGTTATCGTATCTGACCCAGCCTTGATTATGATTGCAGTGACTGAAGCACCAGGCCTTGA     

     AATCCACCTTTCTACCCAAGCCAGTGCCACTAACTATGAAACCCTTGAGTTCTGGAAAGAGCTAGGCTT     

     GACTCGTGTCGTTTTAGCGCGTGAGGTTTCAATGGAAGAATTAGCTGAGATCCGCAAACGTACAGATGT     

     TGAAATTGAAGCCTTTGTCCATGGAGCTATGTGTATTTCATACTCTGGACGTTGTACTCTTTCAAACCA     

     CATGAGTATGCGTGATGCCAACCGTGGTGGATGTTCTCAGTCATGCCGTTGGAAATACGACCTTTACGA     

     TATGCCATTTGGGAAAGAACGTAAGAGTTTGCAGGGTGAGATTCCAGAAGAATTTTCAATGTCAGCCGT     

     TGACATGTCTATGATTGACCANATTCCAGATATGATTGAAAATGGTGTGGACAGTCTAAAAATCGAAGG     

     ACGTATGNAGTCTATTCACTANGTATCAACAGTAACCAACTGCTACAAGGCGGCTGTGGATGCCTATCT     

     TGAAAGTCCTGAAAAGTTTGAAGCTATCAAACAAGACTTGGTGGACGAGATGTGGAAGGTTGCCCAACG     

     TGAACTGGCTACAGGATTTTACTATGGTACACCATCTGAAAATGAGCAGTTGTTTGGTGCTCGTCGTAA     

     AATCCCTGAGTACAAGTTTGTCGCTGAAGTGGTTTCTTATGATGATGCGGCACAAACAGCAACTATTCG     

     TCAACGAAACGTCATTAACGAAGGGGACCAAGTTGAGTTTTATGGTCCAGGTTTCCGTCATTTTGAAAC     

     CTATATTGAAGATTTGCATGATGCTAAAGGCAATAAAATCGACCGCGCTCCAAATCCAATGGAACTATT     

     GACTATTAAAGTCCCACAACCTGTTCAATCAGGAGACATGGTTCGAGCTCTTAAAGAGGGGCTTATCAA     

     TCTTTATAAGGAAGATGGAACCAGCGTCACAGTTCGTGCT                                  

     SP098  amino  acid  (SEQ  ID  NO:172)                                     

     TKTLKRPEVLSPAGTLEKLKVAVQYGADAVFIGGQAYGLRSRAGNFTFEQMEEGVQFAAKYGAKVYVAA     

     NMVMHEGNEAGAGEWFRKLRDIGIAAVIVSDPALIMIAVTEAPGLEIHLSTQASATNYETLEFWKELGL     

     TRVVLAREVSMEELAEIRKRTDVEIEAFVHGAMCISYSGRCTLSNHMSMRDANRGGCSQSCRWKYDLYD     

     MPFGKERKSLQGEIPEEFSMSAVDMSMIDXIPDMIENGVDSLKIEGRMXSIHXVSTVTNCYKAAVDAYL     

     ESPEKFEAIKQDLVDEMWKVAQRELATGFYYGTPSENEQLFGARRKIPEYKFVAEVVSYDDAAQTATIR     

     QRNVINEGDQVEFYGPGFRHFETYIEDLHDAKGNKIDRAPNPMELLTIKVPQPVQSGDMVRALKEGLIN     

     LYKEDGTSVTVRA                                                             

     SP099  nucleotide  (SEQ  ID  NO:173)                                      

     TTCTCAGGAGACCTTTAAAAATATCACCAATAGCTTCTCCATGCAAATCAATCGTCGCGTCAACCAAGG     

     AACGCCTCGTGGTGCTGGGAATATCAAGGGTGAAGACATCAAAAAAATCACCGAAAACAAGGCCATTGA     

     GTCTTATGTCAAACGTATCAACGCTATCGGAGATTTGACTGGATATGACCTGATTGAAACGCCAGAAAC     

     CAAGAAGAATCTCACTGCTGATCGTGCCAAGCGTTTTGGAAGTAGCTTGATGATTACAGGTGTCAATGA     

     CTCCTCTAAAGAAGACAAGTTTGTCTCTGGTTCTTATAAACTAGTCGAAGGAGAGCACTTAACCAACGA     

     CGACAAGGATAAAATCCTCTTGCACAAGGACTTGGCAGCCAAACACGGCTGGAAAGTAGGGGACAAGGT     

     TAAACTGGACTCTAATATCTACGATGCAGATAATGAAAAAGGAGCCAAGGAAACAGTTGAAGTGACAAT     

     CAAGGGACTCTTTGATGGTCATAATAAGTCAGCAGTAACCTACTCACAAGAACTTTACGAAAACACAGC     

     TATTACAGACATTCACACTGCTGCAAAACTTTATGGATACACAGAAGACACAGCCATTTATGGGGACGC     

     AACCTTCTTTGTAACAGCAGACAAGAACTTGGATGATGTTATGAAAGAGTTGAATGGCATCAGTGGTAT     

     CAACTGGAAGAGCTACACACTCGTCAAGAGCTCCTCTAACTACCCAGCTCTTGAGCAATCTATCTCTGG     

     TATGTACAAGATGGCCAAC                                                       

     SP099  amino  acid  (SEQ  ID  NO:174)                                     

     SQETFKNITNSFSMQINRRVNQGTPRGAGNIKGEDIKKITENKAIESYVKRINAIGDLTGYDLIETPET     

     KKNLTADRAKRFGSSLMITGVNDSSKEDKFVSGSYKLVEGEHLTNDDKDKILLHKDLAAKHGWKVGDKV     

     KLDSNIYDADNEKGAKETVEVTIKGLFDGHNKSAVTYSQELYENTAITDIHTAAKLYGYTEDTAIYGDA     

     TFRVTADKNLDDVMKELNGISGIMWKSYTLVKSSSNYPALEQSISGMYKMAN                      

     SP100  nucleotide  (SEQ  ID  NO:175)                                      

     AGTAAATGCGCAATCAAATTCATTAATATTAATAGATGAACCTGAAATCTCACTTCATCCGAGTGCAAT     

     CTATAAATTTAAAGAGTTTTTACTTCAAGAGTGTTTAAATAAAAAACATCAAATTATTATCACTACACA     

     TTCTACACAACTTATAAAAGATTTTCCTAGAGAAGCCGTGAAACTTTTAGTGAAAAACGGAGAAAAGGT     

     AGATGTTATTGAAAATATTGATTATCAGGATGCATTTTTTGAATTAGGTGATGTGTATCATTCTAGGAA     

     GATGATTTATGTTGAAGATAGACTAGCTAAATATATTCTAGAGTTTGTTATCACTCATTCAGGTAGTGA     

     GAATCTTAAACAGAATTTAGTAGTGAGATATATTCCTGGTGGAGCAAATCAAATAATTTGTAATAATAT     

     TTTAAACTCATCGTATTTAGATTCCGATAACCATTATTTTTGGCTTGATGGAGATCAAAACACTAATGT     

     TAGTGAATCAAATAATTTAATGAACTATCTTGAAAATGGTGTTGTTATATCAGATAAAATTCCTGAATC     

     AGATAATAAAAATCTTGATGATATTATAAAATTGATAANGGGATGTCCAATTAAATTTAATGTTTCAGG     

     TAATAAAGGGCAAAAAAATAATATTGAATTAATTGCGAAACAAAGAAGCTTTATAGATTATTGGGCTAA     

     ATAC                                                                      

     SP100  amino  acid  (SEQ  ID  NO:176)                                     

     VNAQSNSLILIDEPEISLHPSAIYKFKEFLLQECLNKKHQIIITTHSTQLIKDFPREAVKLLVKNGEKV     

     DVIENIDYQDAFFELGDVYHSRKMIYVEDRLAKYILEFVITHSGSENLKQNLVVRYIPGGANQIICNNI     

     LNSSYLDSDNHYFWLDGDQNTNVSESNNLMNYLENGVVISDKIPESDNKNLDDIIKLIXGCPIKFNVSG     

     NKGQKNNIELIAKQRSFIDYWAKY                                                  

     SP101  nucleotide  (SEQ  ID  NO:177)                                      

     TTACCGCGTTCATCAAGATGTCAAACAAGTCATGACCTATCAACCCATGGTGCGAGAAATATTGAGTGA     

     ACAAGACACCCCAGCAAACGAAGAGCTTGTGCTTGCTATGATTTATACTGAAACAAAAGGAAAAGAAGG     

     CGATGTTATGCAGTCTAGTGAGTCTGCAAGTGGTTCCACCAACACCATCAATGATAATGCCTCTAGCAT     

     TCGGCAAGGCATTCAAACTCTGACAGGCAATCTCTATCTGGCGCAGAAGAAGGGGGTAGATATCTGGAC     

     AGCTGTTCAAGCCTATAATTTTGGACCTGCCTATATCGATTTTATCGCCCAAAATGGCAAGGAAAATAC     

     CCTGGCTCTAGCCAAACAGTACTCTCGTGAGACTGTTGCCCCCTTGCTTGGTAATAGGACTGGAAAGAC     

     TTATAGTTATATTCACCCCATTTCCATTTTTCACGGTGCTGAACTCTATGTAAATGGAGGAAACTATTA     

     TTATTCTAGACAGGTACGACTTAACCTTTACATCATCAAATGTTTCACTCTCTTTTCAACATCTGGC       

     SP101  amino  acid  (SEQ  ID  NO:178)                                     

     YRVHQDVKQVMTYQPMVREILSEQDTPANEELVLAMIYTETKGKEGDVMQSSESASGSTNTINDNASSI     

     RQGIQTLTGNLYLAQKKGVDIWTAVQAYNFGPAYIDFIAQNGKENTLALAKQYSRETVAPLLGNRTGKT     

     YSYIHPISIFHGAELYVNGGNYYYSRQVRLNLYIIKCFTLFSTSG                             

     SP102  nucleotide  (SEQ  ID  NO:179)                                      

     GTGGATGGGCTTTAACTATCTTCGTATTCGCCGTGCGGCTAAAATTGTGGACAATGAGGAGTTTGAAGC     

     CTTGATTCGTACGGGTCAATTGATTGATTTGCGCGACCCAGCAGAATTCCACAGAAAACATATCCTTGG     

     TGCACGCAATATTCCTTCAAGTCAGTTGAAAACTAGTCTTGCAGCCCTTCGTAAAGATAAACCTGTCCT     

     TCTCTACGAAAACCAACGTGCGCAACGAGTTACAAATGCAGCTCTTTACTTGAAAAAACAAGGTTTTTC     

     TGAGATTTATATCCTTTCTTATGGCTTGGATTCTTGGAAAGGGAAAGTGAAGACTAGC                

     SP102  amino  acid  (SEQ  ID  NO:180)                                     

     WMGFNYLRIRRAAKIVDNEEFEALIRTGQLIDLRDPAEFHRKHILGARNIPSSQLKTSLAALRKDKPVL     

     LYENQRAQRVTNAALYLKKQGFSEIYILSYGLDSWKGKVKTS                                

     SP103  nucleotide  (SEQ  ID  NO:181)                                      

     ACTAAACCAGCATCGTTCGCAGGAAAATAAGGACAATAATCGTGTCTCTTATGTGGATGGCAGCCAGTC     

     AAGTCAGAAAAGTGAAAACTTGACACCAGACCAGGTTAGCCAGAAAGAAGGAATTCAGGCTGAGCAAAT     

     TGTAATCAAAATTACAGATCAGGGCTATGTAACGTCACACGGTGACCACTATCATTACTATAATGGGAA     

     AGTTCCTTATGATGCCCTCTTTAGTGAAGAACTCTTGATGAAGGATCCAAACTATCAACTTAAAGACGC     

     TGATATTGTCAATGAAGTCAAGGGTGGTTATATCATCAAGGTCGATGGAAAATATTATGTCTACCTGAA     

     AGATGCAGCTCATGCTGATAATGTTCGAACTAAAGATGAAATCAATCGTCAAAAACAAGAACATGTCAA     

     AGATAATGAGAAGGTTAACTCTAATGTTGCTGTAGCAAGGTCTCAGGGACGATATACGACAAATGATGG     

     TTATGTCTTTAATCCAGCTGATATTATCGAAGATACGGGTAATGCTTATATCGTTCCTCATGGAGGTCA     

     CTATCACTACATTCCCAAAAGCGATTTATCTGCTAGTGAATTAGCAGCAGCTAAAGCACATCTGGCTGG     

     AAAAAATATGCAACCGAGTCAGTTAAGCTATTCTTCAACAGCTAGTGACAATAACACGCAATCTGTAGC     

     AAAAGGATCAACTAGCAAGCCAGCAAATAAATCTGAAAATCTCCAGAGTCTTTTGAAGGAACTCTATGA     

     TTCACCTAGCGCCCAACGTTACAGTGAATCAGATGGCCTGGTCTTTGACCCTGCTAAGATTATCAGTCG     

     TACACCAAATGGAGTTGCGATTCCGCATGGCGACCATTACCACTTTATTCCTTACAGCAAGCTTTCTGC     

     CTTAGAAGAAAAGATTGCCAGAATGGTGCCTATCAGTGGAACTGGTTCTACAGTTTCTACAAATGCAAA     

     ACCTAATGAAGTAGTGTCTAGTCTAGGCAGTCTTTCAAGCAATCCTTCTTCTTTAACGACAAGTAAGGA     

     GCTCTCTTCAGCATCTGATGGTTATATTTTTAATCCAAAAGATATCGTTGAAGAAACGGCTACAGCTTA     

     TATTGTAAGACATGGTGATCATTTCCATTACATTCCAAAATCAAATCAAATTGGGCAACCGACTCTTCC     

     AAACAATAGTCTAGCAACACCTTCTCCATCTCTTCCAATCAATCCAGGAACTTCACATGAGAAACATGA     

     AGAAGATGGATACGGATTTGATGCTAATCGTATTATCGCTGAAGATGAATCAGGTTTTGTCATGAGTCA     

     CGGAGACCACAATCATTATTTCTTCAAGAAG                                           

     SP103  amino  acid  (SEQ  ID  NO:162)                                     

     LNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGK     

     VPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVYLKDAAHADNVRTKDEINRQKQEHVK     

     DNEKVNSNVAVARSQGRYTTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPKSDLSASELAAAKAHLAG     

     KNMQPSQLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR     

     TPNGVAIPHGDHYHFIPYSKLSALEEKIARMVPISGTGSTVSTNAKPNEVVSSLGSLSSNPSSLTTSKE     

     LSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPINPGTSHEKHE     

     EDGYGFDANRIIAEDESGFVMSHGDHNHYFFKK                                         

     SP105  nucleotide  (SEQ  ID  NO:183)                                      

     TGACTACCTTGAAATCCCACTTTACAGCTATCTTGGTGGATTCAACACTAAAGTTCTTCCAACTCCAAT     

     GATGAACATCATCAACGGTGGTTCTCACTCTGACGCTCCAATCGCTTTCCAAGAGTTCATGATCTTGCC     

     AGTTGGTGCGCCAACATTTAAAGAAGCCCTTCGTTACGGTGCTGAAATCTTCCACGCTCTTAAGAAAAT     

     CCTTAAATCACGTGGTTTGGAAACTGCCGTAGGTGACGAAGGTGGATTCGCTCCTCGTTTCGAAGGAAC     

     TGAAGATGGTGTTGAAACTATCCTTGCTGCGATTGAAGCTGCTGGATATGTACCAGGTAAAGACGTATT     

     TATCGGATTTGACTGTGCTTCATCAGAATTCTACGATAAAGAACGTAAAGTTTACGACTACACTAAATT     

     TGAAGGTGAAGGTGCTGCTGTTCGTACATCTGCAGAACAAATCGACTACCTTGAAGAATTGGTTAACAA     

     ATACCCAATCATCACTATTGAAGATGGTATGGATGAAAACGACTGGGATGGTTGGAAAGCTCTTACTGA     

     ACGTCTTGGTAAGAAAGTACAACTTGTTGGTGACGACTTCTTCGTAACAAACACTGACTACCTTGCACG     

     TGGTATCCAAGAAGGTGCTGCTAACTCAATCCTTATCAAAGTTAACCAAATCGGTACTCTTACTGAAAC     

     TTTTGAAGCTATCGAAATGGCTAAAGAAGCTGGTTACACTGCTGTTGTATCACACCGTTCAGGTGAAAC     

     TGAAGATTCAACAATCGCTGATATTGCAGTTGCAACTAACGCAGGACAAATCAAGACTGGTTCACTTTC     

     ACGTACAGACCGCATCGCTAAATACAACCAATTGCTTCGTATCGAAGACCAACTTGGTGAAGTAGCTGA     

     ATATCGTGGATTGAAATCATTCTACAACCTTAAAAAA                                     

     SP105  amino  acid  (SEQ  ID  NO:184)                                     

     DYLEIPLYSYLGGFNTKVLPTPMMNIINGGSHSDAPIAFQEFMILPVGAPTFKEALRYGAETFHALKKI     

     LKSRGLETAVGDEGGFAPRFEGTEDGVETILAAIEAAGYVPGKDVFIGFDCASSEFYDKERKVYDYTKF     

     EGEGAAVRTSAEQIDYLEELVNKYPIITIEDGMDENDWDGWKALTERLGKKVQLVGDDFFVTNTDYLAR     

     GIQEGAANSILIKVNQIGTLTETFEAIEMAKEAGYTAVVSHRSGETEDSTIADIAVATNAGQIKTGSLS     

     RTDRIAKYNQLLRIEDQLGEVAEYRGLKSFYNLKK                                       

     SP106  nucleotide  (SEQ  ID  NO:185)                                      

     TCGTATCTTTTTTTGGAGCAATGTTCGCGTAGAAGGACATTCCATGGATCCGACCCTAGCGGATGGCGA     

     AATTCTCTTCGTTGTAAAACACCTTCCTATTGACCGTTTTGATATCGTGGTGGCCCATGAGGAAGATGG     

     CAATAAGGACATCGTCAAGCGCGTGATTGGAATGCCTGGCGACACCATTCGTTACGAAAATGATAAACT     

     CTACATCAATGACAAAGAAACGGACGAGCCTTATCTAGCAGACTATATCAAACGCTTCAAGGATGACAA     

     ACTCCAAAGCACTTACTCAGGCAAGGGCTTTGAAGGAAATAAAGGAACTTTCTTTAGAAGTATCGCTCA     

     AAAAGCTCAAGCCTTCACAGTTGATGTCAACTACAACACCAACTTTAGCTTTACTGTTCCAGAAGGAGA     

     ATACCTTCTCCTCGGAGATGACCGCTTGGTTTCGAGCGACAGCCGCCACGTAGGTACCTTCAAAGCAAA     

     AGATATCACAGGGGAAGCTAAATTCCGCTTATGGCCAATCACCCGTATCGGAACATTT                

     SP106  amino  acid  (SEQ  ID  NO:186)                                     

     RIFFWSNVRVEGHSMDPTLADGEILFVVKHLPIDRFDIVVAHEEDGNKDIVKRVIGMPGDTIRYENDKL     

     YINDKETDEPYLADYIKRFKDDKLQSTYSGKGFEGNKGTFFRSIAQKAQAFTVDVNYNTNFSFTVPEGE     

     YLLLGDDRLVSSDSRHVGTFKAKDITGEAKFRLWPITRIGTF                                

     SP107  nucleotide  (SEQ  ID  NO:187)                                      

     GGACTCTCTCAAAGATGTGAAAGCAAATGCTAGCGACAGCAAGCCTGCACAGGACAAGAAGGATGCAAA     

     ACAAGGAACGGAAGATAGTAAGGATTCAGATAAGATGACTGAAACAAACTCAGTTCCGGCAGGAGTGAT     

     TGTGGTCAGTCTACTTGCCCTCCTAGGCGTGATTGCCTTCTGGCTGATTCGCCGTAAGAAAGAGTCAGA     

     AATCCAGCAATTAAGCACGGAATTGATCAAGGTTCTAGGACAGCTAGATGCAGAAAAAGCGGATAAAAA     

     AGTCCTTGCCAAAGCCCAAAACCTTCTCCAAGAAACCCTTGATTTCGTGAAAGAAGAAAATGGCTCAGC     

     AGAGACAGAAACTAAACTAGTAGAGGAGCTTAAAGCAATCCTTGACAAACTCAAG                   

     SP107  amino  acid  (SEQ  ID  NO:188)                                     

     DSLKDVKANASDSKPAQDKKDAKQGTEDSKDSDKMTETNSVPAGVIVVSLLALLGVIAFWLIRRKKESE     

     IQQLSTELIKVLGQLDAEKADKKVLAKAQNLLQETLDFVKEENGSAETETKLVEELKAILDKLK          

     SP108  nucleotide  (SEQ  ID  NO:189)                                      

     CAAGAAATCCTATCATCTCTTCCAGAAGCAAACAGAGACGAGGGGAATTCAGACTCAGTTGATTGAAGA     

     ATCGCTTAGTCAGCAGACTATAATCCAGTCCTTCAATGCTCAAACAGAATTTATCCAAAGATTGCGTGA     

     GGCTCATGACAACTACTCAGGCTATTCTCAGTCAGCCATCTTTTATTCTTCAACGGTCAATCCTTCGAC     

     TCGCTTTGTAAATGCACTCATTTATGCCCTTTTAGCTGGAGTAGGAGCTTATCGTATCATGATGGGTTC     

     AGCCTTGACCGTCGGTCGTTTAGTGACTTTTTTGAACTATGTTCAGCAATACACCAAGCCCTTTAACGA     

     TATTTCTTCAGTGCTAGCTGAGTTGCAAAGTGCTCTGGCTTGCGTAGAGCGTATCTATGGAGTCTTAGA     

     TAGCCCTGAAGTGGCTGAAACAGGTAAGGAAGTCTTGACGACCAGTGACCAAGTTAAGGGAGCTATTTC     

     CTTTAAACATGTCTCTTTTGGCTACCATCCTGAAAAAATTTTGATTAAGGACTTGTCTATCGATATTCC     

     AGCTGGTAGTAAGGTAGCCATCGTTGGTCCGACAGGTGCTGGAAAATCAACTCTTATCAATCTCCTTAT     

     GCGTTTTTATCCCATTAGCTCGGGAGATATCTTGCTGGATGGGCAATCCATTTATGATTATACACGAGT     

     ATCATTGAGACAGCAGTTTGGTATGGTGCTTCAAGAAACCTGGCTCACACAAGGGACCATTCATGATAA     

     TATTGCCTTTGGCAATCCTGAAGCCAGTCGAGAGCAAGTAATTGCTGCTGCCAAAGCAGCTAATGCAGA     

     CTTTTTCATCCAACAGTTGCCACAGGGATACGATACCAAGTTGGAAAATGCTGGAGAATCTCTCTCTGT     

     CGGCCAAGCTCAGCTCTTGACCATAGCCCGAGTCTTTCTGGCTATTCCAAAGATTCTTATCTTAGACGA     

     GGCAACTTCTTCCATTGATACACGGACAGAAGTGCTGGTACAGGATGCCTTTGCAAAACTCATGAAGGG     

     CCGCACAAGTTTCATCATTGCTCACCGTTTGTCAACCATTCAGGATGCGGATTTAATTCTTGTCTTAGT     

     AGATGGTGATATTGTTGAATATGGTAACCATCAAGAACTCATGGATAGAAAGGGTAAGTATTACCAAAT     

     GCAAAAAGCTGCGGCTTTTAGTTCTGA                                               

     A                                                                         

     SP108  amino  acid  (SEQ  ID  NO:190)                                     

     KKSYHLFQKQTETRGIQTQLIEESLSQQTIIQSFNAQTEFIQRLREAHDNYSGYSQSAIFYSSTVNPST     

     RFVNALIYALLAGVGAYRIMMGSALTVGRLVTFLNYVQQYTKPFNDISSVLAELQSALACVERIYGVLD     

     SPEVAETGKEVLTTSDQVKGAISFKHVSFGYHPEKILIKDLSIDIPAGSKVAIVGPTGAGKSTLINLLM     

     RFYPISSGDILLDGQSIYDYTRVSLRQQFGMVLQETWLTQGTIHDNIAFGNPEASREQVIAAAKAANAD     

     FFIQQLPQGYDTKLENAGESLSVGQAQLLTIARVFLAIPKILILDEATSSIDTRTEVLVQDAFAKLMKG     

     RTSFIIAHRLSTIQDADLILVLVDGDIVEYGNHQELMDRKGKYYQMQKAAAFSSE                   

     SP109  nucleotide  (SEQ  ID  NO:191)                                      

     ACGAAATGCAGGGCAGACAGATGCCTCGCAAATTGAAAAGGCGGCAGTTAGCCAAGGAGGAAAAGCAGT     

     GAAAAAAACAGAAATTAGTAAAGACGCAGACTTGCACGAAATTTATCTAGCTGGAGGTTGTTTCTGGGG     

     AGTGGAGGAATATTTCTCACGTGTTCCCGGGGTGACGGATGCCGTTTCAGGCTATGCAAATGGTAGAGG     

     AGAAACAACCAAGTACGAATTGATTAACCAAACAGGTCATGCAGAAACCGTCCATGTCACCTATGATGC     

     CAAGCAAATTTCTCTCAAGGAAATCCTGCTTCACTATTTCCGCATTATCAATCCAACCAGCAAAAATAA     

     ACAAGGAAATGATGTGGGGACCCAGTACCGTACTGGTGTTTATTACACAGATGACAAGGATTTGGAAGT     

     GATTAACCAAGTCTTTGATGAGGTGGCTAAGAAATACGATCAACCTCTAGCAGTTGAAAAGGAAAACTT     

     GAAGAATTTTGTGGTGGCTGAGGATTACCATCAAGACTATCTCAAGAAAAATCCAAATGGCTACTGCCA     

     TATCAATGTTAATCAGGCGGCCTATCCTGTCATTGATGCCAGCAAATATCCAAAACCAAGTGATGAGGA     

     ATTGAAAAAGACCCTGTCACCTGAGGAGTATGCAGTTACCCAGGAAAATCAAACAGAACGAGCTTTCTC     

     AAACCGTTACTGGGATAAATTTGAATCCGGTATCTATGTGGATATAGCAACTGGGGAACCTCTCTTTTC     

     ATCAAAAGACAAATTTGAGTCTGGTTGTGGCTGGCCTAGTTTTACCCAACCCATCAGTCCAGATGTTGT     

     CACCTACAAGGAAGATAAGTCCTACAATATGACGCGTATGGAAGTGCGGAGCCGAGTAGGAGATTCTCA     

     CCTTGGGCATGTCTTTACGGATGGTCCACAGGACAAGGGCGGCTTACGTTACTGTATCAATAGCCTCTC     

     TATCCGCTTTATTCCCAAAGACCAAATGGAAGAAAAAGGCTACGCTTATTTACTAGATTATGTTGAT       

     SP109  amino  acid  (SEQ  ID  NO:192)                                     

     RNAGQTDASQIEKAAVSQGGKAVKKTEISKDADLHEIYLAGGCFWGVEEYFSRVPGVTDAVSGYANGRG     

     ETTKYELINQTGHAETVHVTYDAKQISLKEILLHYFRIINPTSKNKQGNDVGTQYRTGVYYTDDKDLEV     

     INQVFDEVAKKYDQPLAVEKENLKNFVVAEDYHQDYLKKNPNGYCHINVNQAAYPVIDASKYPKPSDEE     

     LKKTLSPEEYAVTQENQTERAFSNRYWDKFESGIYVDIATGEPLFSSKDKFESGCGWPSFTQPISPDVV     

     TYKEDKSYNMTRMEVRSRVGDSHLGHVFTDGPQDKGGLRYCINSLSIRFIPKDQMEEKGYAYLLDYVD      

     BP110  nucleotide  (SEQ  ID  NO:193)                                      

     TGTATAGTTTTTAGCGCTTGTTCTTCTAATTCTGNTAAAAATGAAGAAAATACTTCTAAAGAGCATGCG     

     CCTGATAAAATAGTTTTAGATCATGCTTTCGGTCAAACTATATTAGATAAAAAACCTGAAAGAGTTGCA     

     ACTATTGCTTGGGGAAATCATGATGTAGCATTAGCTTTAGGAATAGTTCCTGTTGGATTTTCAAAAGCA     

     AATTACGGTGTAAGTGCTGATAAAGGAGTTTTACCATGGACAGAAGAAAAAATCAAAGAACTAAATGGT     

     AAAGCTAACCTATTTGACGATTTGGATGGACTTAACTTTGAAGCAATATCAAATTCTAAACCAGATGTT     

     ATCTTAGCAGGTTATTCTGGTATAACTAAAGAAGATTATGACACTCTATCA                       

     SP110  amino  acid  (SEQ  ID  NO:194)                                     

     CIVFSACSSNSXKNEENTSKEHAPDKIVLDHAFGQTILDKKPERVATIAWGNHDVALALGIVPVGFSKA     

     NYGVSADKGVLPWTEEKIKELNGKANLFDDLDGLNFEAISNSKPDVILAGYSGITKEDYDTLS           

     SP111  nucleotide  (SEQ  ID  NO:195)                                      

     GTGTGTCGAGCATATTCTGAAGCAAACCTATCAAAATATAGAAATTATTTTAGTTGATGACGGTTCTAC     

     GGATAATTCTGGGGAAATTTGTGATGCTTTTATGATGCAAGATAATCGTGTGCGAGTATTGCATCAAGA     

     AAATAAGGGGGGGGCAGCACAAGCTAAAAATATGGGGATTAGTGTAGCTAAGGGAGAGTACATCACGAT     

     TGTTGATTCAGATGATATCGTAAAAGAAAATATGATTGAAACTCTTTATCAGCAAGTCCAAGAAAAGGA     

     TGCAGATGTTGTTATAGGGAATTACTATAATTATGACGAAAGTGACGGGAATTTTTATTTTTATGTAAC     

     AGGGCAAGATTTTTGCGTCGAAGAATTAGCTATACAAGAAATTATGAACCGTCAAGCAGGAGATTGGAA     

     ATTCAATAGCTCGGCCTTTATATTGCCGACATTTAAGTTGATTAAAAAAGAATTATTCAATGAAGTTCA     

     CTTTTCAAATGGTCGCCGCTTTGATGATGAAGCAACTATGCATCGCTTTTATCTTTTAGCCTCTAAAAT     

     CGTCTTTATAAACGATAATCTCTATCTGTATAGAAGACGTTCAGGAAGCATCATGAGAACGGAATTTGA     

     TCTTTCCTGGGCAAGAGATATTGTTGAAGTGTTTTCTAAGAAAATATCGGATTGTGTCTTGGCTGGTTT     

     GGATGTCTCCGTTCTGCGTATTCGATTTGTCAATCTTTTAAAAGATTATAAGCAAACTTTAGAATACCA     

     TCAATTAACAGATACTGAGGAATATAAAGATATTTGTTTCAGATTAAAGTTGTTTTTTGATGCAGAACA     

     AAGAAATGGTAAAAGT                                                          

     SP111  amino  acid  (SEQ  ID  NO:196)                                     

     CVEHILKQTYQNIEIILVDDGSTDNSGEICDAFMMQDNRVRVLHQENKGGAAQAKNMGISVAKGEYITI     

     VDSDDIVKENMIETLYQQVQEKDADVVIGNYYNYDESDGNFYFYVTGQDFCVEELAIQEIMNRQAGDWK     

     FNSSAFILPTFKLIKKELFNEVHFSNGRRFDDEATMHRFYLLASKIVFINDNLYLYRRRSGSIMRTEFD     

     LSWARDIVEVFSKKISDCVLAGLDVSVLRIRFVNLLKDYKQTLEYHQLTDTEEYKDICFRLKLFFDAEQ     

     RNGKS                                                                     

     SP0112  nucleotide  (SEQ  ID  NO:197)                                     

     GTGTTTGGATAGCATTCAGAATCAGACGTATCAAAATTTTGAGTGTTTATTAATCAATGATGGCTCTCC     

     AGATCATTCATCCAAAATATGTGAAGAATTTGTAGAGAAAGATTCTCGTTTCAAATATTTTGAGAAAGC     

     AAACGGCGGTCTTTCATCAGCTCGTAACCTAGGTATTGAATGTTCGGGGGGGGCGTACATTACTTTTGT     

     AGACTCTGATGATTGGTTGGAACATGATGCTTTAGACCGATTATATGGTGCTTTGAAAAAGGAAAACGC     

     AGATATTAGTATCGGGCGTTATAATTCTTATGATGAAACACGCTATGTGTATATGACTTATGTTACGGA     

     TCCAGATGATTCTCTAGAAGTGATAGAAGGTAAAGCAATTATGGATAGGGAAGGTGTCGAAGAAGTCAG     

     AAATGGGAACTGGACTGTAGCTGTCTTGAAGTTATTCAAGAGAGAGTTACTACAAGATTTACCATTTCC     

     TATAGGAAAAATTGCAGAGGATACTTACTGGACATGGAAGGTACTTCTAAGAGCTTCGAGGATAGTCTA     

     TTTGAATCGTTGTGTTTACTGGTACCGTGTTGGTTTATCTGATACTTTATCGAATACATGGAGTGAAAA     

     GCGTATGTATGATGAAATTGGGGCTAGGGAAGAAAAGATAGCTATTTTAGCAAGTTCAGACTATGACTT     

     GACCAATCATATTTTGATTTATAAAAATAGATTACAAAGAGTGATAGCAAAATTAGAAGAACAAAATAT     

     GCAGTTCACAGAGATTTACAGAAGAATGATGGAAAAATTGTCTTTACTTCCG                      

     SP0112  amino  acid  (SEQ  ID  NO:198)                                    

     CLDSIQNQTYQNFECLLINDGSPDHSSKICEEFVEKDSRFKYFEKANGGLSSARNLGIECSGGAYITFV     

     DSDDWLEHDALDRLYGALKKENADISIGRYNSYDETRYVYMTYVTDPDDSLEVIEGKAIMDREGVEEVR     

     NGNWTVAVLKLFKRELLQDLPFPIGKIAEDTYWTWKVLLRASRIVYLNRCVYWYRVGLSDTLSNTWSEK     

     RMYDEIGAREEKIAILASSDYDLTNHILIYKNRLQRVIAKLEEQNMQFTEIYRRMMEKLSLLP           

     SP113  nucleotide  (SEQ  ID  NO:199)                                      

     GTGCCTAGATAGTATTATTACTCAAACATATAAAAATATTGAGATTGTTGTCGTTAATGATGGTTCTAC     

     GGATGCTTCAGGTGAAATTTGTAAAGAATTTTCAGAAATGGATCACCGAATTCTCTATATAGAACAAGA     

     AAATGCTGGTCTTTCTGCCGCACGAAACACCGGTCTGAATAATATGTCCGGAAATTATGTGACCTTTGT     

     GGACTCGGATGATTGGATTGAGCAAGATTATGTAGAAACTCTATATAAAAAAATAGTAGAGTATCAGGC     

     TGATATTGCAGTTGGTAATTATTATTCTTTCAACGAAAGTGAAGGAATGTTCTACTTTCATATATTGGG     

     AGACTCCTATTATGAGAAAGTATATGATAATGTTTCTATCTTTGAGAACTTGTATGAAACTCAAGAAAT     

     GAAGAGTTTTGCTTTGATATCTGCTTGGGGTAAACTCTATAAGGCAAGATTGTTTGAGCAGTTGCGCTT     

     TGACATAGGTAAATTAGGAGAAGATGGTTACCTCAATCAAAAGGTATATTTATTATCAGAAAAGGTAAT     

     TTATTTAAATAAAAGTCTTTATGCTTATCGGATTAGAAAAGGTAGTTTATCAAGAGTTTGGACAGAAAA     

     GTGGATGCACGCTTTAGTTGATGCTATGTCTGAACGTATTACGCTACTAGCTAATATGGGTTATCCTCT     

     AGAGAAACACTTGGCAGTTTATCGTCAGATGTTGGAAGTCAGTCTCGCCAACGGTCAAGCTAGTGGTTT     

     ATCTGACACAGCAACGTATAAAGAGTTTGAAATGAAACAAAGGCTTTTAAATCAGCTATCGAGACAAGA     

     GGAAAGTGAAAAGAAAGCCATTGTCCTCGCAGCAAACTATGGCTATGTAGACCAAGTTTTAACGACAAT     

     CAAGTCTATTTGTTATCATAATCGTTCGATTCGTTTTTATCTGATTCATAGCGATTTTCCAAATGAATG     

     GATTAAGCAATTAAATAAGCGCTTAGAGAAGTTTGACTCAGAAATTATTAATTGTCGGGTAACTTCTGA     

     GCAAATTTCATGTTATAAATCGGATATTAGTTACACAGTCTTTTTACGCTATTTCATAGCTGATTTCGT     

     GCAAGAAGACAAGGCCCTCTACTTGGACTGTGATCTAGTTGTAACGAAAAATCTGGATGACTTGTTTGC     

     TACAGACTTACAAGATTATCCTTTGGCTGCTGTTAGAGATTTTGGGGGCAGAGCTTATTTTGGTCAAGA     

     AATCTTTAATGCCGGTGTTCTCTTGGTAAACAATGCTTTTTGGAAAAAAGAGAATATGACCCAAAAATT     

     AATTGATGTAACCAATGAATGGCATGATAAGGTGGATCAGGCAGATCAGAGCATCTTGAATATGCTTTT     

     TGAACATAAATGGTTGGAATTGGACTTTGATTATAATCATATTGTCATTCATAAACAGTTTGCTGATTA     

     TCAATTGCCTGAGGGTCAGGATTATCCTGCTATTATTCACTATCTTTCTCATCGGAAACCGTGGAAAGA     

     TTTGGCGGCCCAAACCTATCGTGAAGTTTGGTGGTACTATCATGGGCTTGAATGGACAGAATTGGGACA     

     AAACCATCATTTACATCCATTACAAAGATCTCACATCTATCCAATAAAGGAACCTTTCACTTGTCTAAT     

     CTATACTGCCTCAGACCATATTGAACAAATTGAGACATTGGTTCAATCCTTGCCTGATATTCAGTTTAA     

     GATAGCAGCTAGAGTAATAGTTAGTGATCGATTGGCTCAGATGACAATTTATCCAAACGTGACTATATT     

     TAACGGAATTCACTATTTGGTAGATGTCGATAATGAATTGGTAGAAACCAGTCAAGTACTTTTAGATAT     

     TAATCATGGCGAAAAGACAGAAGAAATTCTCGATCAATTTGCTAATCTTGGCAAGCCTATCTTATCCTT     

     TGAAAATACTAAAACCTATGAAGTAGGTCAGGAGGCATATGCTGTTGACCAAGTTCAAGCAATGATTGA     

     AAAATTGAGAGAAATAAGCAAA                                                    

     SP113  amino  acid  (SEQ  ID  NO:200)                                     

     CLDSIITQTYKNIEIVVVNDGSTDASGEICKEFSEMDHRILYIEQENAGLSAARNTGLNNMSGNYVTFV     

     DSDDWIEQDYVETLYKKIVEYQADIAVGNYYSFNESEGMFYFHILGDSYYEKVYDNVSIFENLYETQEM     

     KSFALISAWGKLYKARLFEQLRFDIGKLGEDGYLNQKVYLLSEKVIYLNKSLYAYRIRKGSLSRVWTEK     

     WMHALVDAMSERITLLANMGYPLEKHLAVYRQMLEVSLANGQASGLSDTATYKEFEMKQRLLNQLSRQE     

     ESEKKAIVLAANYGYVDQVLTTIKSICYHNRSIRFYLIHSDFPNEWIKQLNKRLEKFDSEIINCRVTSE     

     QISCYKSDISYTVFLRYFIADFVQEDKALYLDCDLVVTKNLDDLFATDLQDYPLAAVRDFGGRAYFGQE     

     IFNAGVLLVNNAFWKKENMTQKLIDVTNEWHDKVDQADQSILNMLFEHKWLELDFDYNHIVIHKQFADY     

     QLPEGQDYPAIIHYLSHRKPWKDLAAQTYREVWWYYHGLEWTELGQNHHLHPLQRSHIYPIKEPFTCLI     

     YTASDHIEQIETLVQSLPDIQFKIAARVIVSDRLAQMTIYPNVTIFNGIHYLVDVDNELVETSQVLLDI     

     NHGEKTEEILDQFANLGKPILSFENTKTYEVGQEAYAVDQVQAMIEKLREISK                     

     SP114  nucleotide  (SEQ  ID  NO:201)                                      

     CATTCAGAAGCAGACCTATCAAAATCTGGAAATTATTCTTGTTGATGATGGTGCAACAGATGAAAGTGG     

     TCGCTTGTGTGATTCAATCGCTGAACAAGATGACAGGGTGTCAGTGCTTCATAAAAAGAACGAAGGATT     

     GTCGCAAGCACGAAATGATGGGATGAAGCAGGCTCACGGGGATTATCTGATTTTTATTGACTCAGATGA     

     TTATATCCATCCAGAAATGATTCAGAGCTTATATGAGCAATTAGTTCAAGAAGATGCGGATGTTTCGAG     

     CTGTGGTGTCATGAATGTCTATGCTAATGATGAAAGCCCACAGTCAGCCAATCAGGATGACTATTTTGT     

     CTGTGATTCTCAAACATTTCTAAAGGAATACCTCATAGGTGAAAAAATACCTGGGACGATTTGCAATAA     

     GCTAATCAAGAGACAGATTGCAACTGCCCTATCCTTTCCTAAGGGGTTGATTTACGAAGATGCCTATTA     

     CCATTTTGATTTAATCAAGTTGGCCAAGAAGTATGTGGTTAATACTAAACCCTATTATTACTATTTCCA     

     TAGAGGGGATAGTATTACGACCAAACCCTATGCAGAGAAGGATTTAGCCTATATTGATATCTACCAAAA     

     GTTTTATAATGAAGTTGTGAAAAACTATCCTGACTTGAAAGAGGTCGCTTTTTTCAGATTGGCCTATGC     

     CCACTTCTTTATTCTGGATAAGATGTTGCTAGATGATCAGTATAAACAGTTTGAAGCCTATTCTCAGAT     

     TCATCGTTTTTTAAAAGGCCATGCCTTTGCTATTTCTAGGAATCCAATTTTCCGTAAGGGGAGAAGAAT     

     TAGTGCTTTGGCCCTATTCATAAATATTTCCTTATATCGATTCTTATTACTGAAAAATATTGAAAAATC     

     TAAAAAATTACAT                                                             

     SP114  amino  acid  (SEQ  ID  NO:202)                                     

     IQKQTYQNLEIILVDDGATDESGRLCDSIAEQDDRVSVLHKKNEGLSQARNDGMKQAHGDYLIFIDSDD     

     YIHPEMIQSLYEQLVQEDADVSSCGVMMVYANDESPQSANQDDYFVCDSQTFLKEYLIGEKIPGTICNK     

     LIKRQIATALSFPKGLIYEDAYYHFDLIKLAKKYVVNTKPYYYYFHRGDSITTKPYAEKDLAYIDIYQK     

     FYNEVVKNYPDLKEVAFFRLAYAHFFILDKMLLDDQYKQFEAYSQIHRFLKGHAFAISRNPIFRKGRRI     

     SALALFINISLYRFLLLKNIEKSKKLH                                               

     SP115  nucleotide  (SEQ  ID  NO:203)                                      

     TAAGGCTGATAATCGTGTTCAAATGAGAACGACGATTAATAATGAATCGCCATTGTTGCTTTCTCCGTT     

     GTATGGCAATGATAATGGTAACGGATTATGGTGGGGGAACACATTGAAGGGAGCATGGGAAGCTATTCC     

     TGAAGATGTAAAGCCATATGCAGCGATTGAACTTCATCCTGCAAAAGTCTGTAAACCAACAAGTTGTAT     

     TCCACGAGATACGAAAGAATTGAGAGAATGGTATGTCAAGATGTTGGAGGAAGCTCAAAGTCTAAACAT     

     TCCAGTTTTCTTGGTTATTATGTCGGCTGGAGAGCGTAATACAGTTCCTCCAGAGTGGTTAGATGAACA     

     ATTCCAAAAGTATAGTGTGTTAAAAGGTGTTTTAAATATTGAGAATTATTGGATTTACAATAACCAGTT     

     AGCTCCGCATAGTGCTAAATATTTGGAAGTTTGTGCCAAATATGGAGCGCATTTTATCTGGCATGATCA     

     TGAAAAATGGTTCTGGGAAACTATTATGAATGATCCGACATTCTTTGAAGCGAGTCAAAAATATCATAA     

     AAATTTGGTGTTGGCAACTAAAAATACGCCAATAAGAGATGATGCGGGTACAGATTCTATCGTTAGTGG     

     ATTTTGGTTGAGTGGCTTATGTGATAACTGGGGCTCATCAACAGATACATGGAAATGGTGGGAAAAACA     

     TTATACAAACACATTTGAAACTGGAAGAGCTAGGGATATGAGATCCTATGCATCGGAACCAGAATCAAT     

     GATTGCTATGGAAATGATGAATGTATATACTGGGGGAGGCACAGTTTATAATTTCGAATGTGCCGCGTA     

     TACATTTATGACAAATGATGTACCAACTCCAGCATTTACTAAAGGTATTATTCCTTTCTTTAGACATGC     

     TATACAAAATCCAGCTCCAAGTAAGGAAGAAGTTGTAAATAGAACAAAAGCTGTATTTTGGAATGGAGA     

     AGGTAGGATTAGTTCATTAAACGGATTTTATCAAGGACTTTATTCGAATGATGAAACAATGCCTTTATA     

     TAATAATGGGAGATATCATATTCTTCCTGTAATACATGAGAAAATTGATAAGGAAAAGATTTCATCTAT     

     ATTCCCTAATGCAAAAATTTTGACTAAAAATAGTGAGGAATTGTCTAGTAAAGTCAACTATTTAAACTC     

     GCTTTATCCAAAACTTTATGAAGGAGATGGGTATGCTCAGCGTGTAGGTAATTCCTGGTATATTTATAA     

     TAGTAATGCTAATATCAATAAAAATCAGCAAGTAATGTTGCCTATGTATACTAATAATACAAAGTCGTT     

     ATCGTTAGATTTGACGCCACATACTTACGCTGTTGTTAAAGAAAATCCAAATAATTTACATATTTTATT     

     GAATAATTACAGGACAGATAAGACAGCTATGTGGGCATTATCAGGAAATTTTGATGCATCAAAAAGTTG     

     GAAGAAAGAAGAATTAGAGTTAGCGAACTGGATAAGCAAAAATTATTCCATCAATCCTGTAGATAATGA     

     CTTTAGGACAACAACACTTACATTAAAAGGGCATACTGGTCATAAACCTCAGATAAATATAAGTGGCGA     

     TAAAAATCATTATACTTATACAGAAAATTGGGATGAGAATACCCATGTTTATACCATTACGGTTAATCA     

     TAATGGAATGGTAGAGATGTCTATAAATACTGAGGGGACAGGTCCAGTCTCTTTCCCAACACCAGATAA     

     ATTTAATGATGGTAATTTGAATATAGCATATGCAAAACCAACAACACAAAGTTCTGTAGATTACAATGG     

     AGACCCTAATAGAGCTGTGGATGGTAACAGAAATGGTAATTTTAACTCTGGTTCGGTAACACACACTAG     

     GGCAGATAATCCCTCTTGGTGGGAAGTCGATTTGAAAAAAATGGATAAAGTTGGGCTTGTTAAAATTTA     

     TAATCGCACAGATGCTGAGACTCAACGTCTATCTAATTTT                                  

     SP115  amino  acid  (SEQ  ID  NO:204)                                     

     KADNRVQMRTTINNESPLLLSPLYGNDNGNGLWWGNTLKGAWEAIPEDVKPYAAIELHPAKVCKPTSCI     

     PRDTKELREWYVKMLEEAQSLNIPVFLVIMSAGERNTVPPEWLDEQFQKYSVLKGVLNIENYWIYNNQL     

     APHSAKYLEVCAKYGAHFIWHDHEKWFWETIMNDPTFFEASQKYHKNLVLATKNTPIRDDAGTDSIVSG     

     FWLSGLCDNWGSSTDTWKWWEKHYTNTFETGRARDMRSYASEPESMIAMEMMNVYTGGGTVYNFECAAY     

     TFMTNDVPTPAFTKGIIPFFRHAIQNPAPSKEEVVNRTKAVFWNGEGRISSLNGFYQGLYSNDETMPLY     

     NNGRYHILPVIHEKIDKEKISSIFPNAKILTKNSEELSSKVVYLNSLYPKLYEGDGYAQRVGNSWYIYN     

     SNANINKNQQVMLPMYTNNTKSLSLDLTPHTYAVVKENPNNLHILLNNYRTDKTAMWALSGNFDASKSW     

     KKEELELANWISKNYSINPVDNDFRTTTLTLKGHTGHKPQINISGDKNHYTYTENWDENTHVYTITVNH     

     NGMVEMSINTEGTGPVSFPTPDKFNDGNLNIAYAKPTTQSSVDYNGDPNRAVDGNRNGNFNSGSVTHTR     

     ADNPSWWEVDLKKMDKVGLVKIYNRTDAETQRLSNF                                      

     SP117  nucleotide  (SEQ  ID  NO:205)                                      

     CTGTGGCAATCAGTCAGCTGCTTCCAAACAGTCAGCTTCAGGAACGATTGAGGTGATTTCACGAGAAAA     

     TGGCTCTGGGACACGGGGTGCCTTCACAGAAATCACAGGGATTCTCAAAAAAGACGGTGATAAAAAAAT     

     TGACAACACTGCCAAAACAGCTGTGATTCAAAATAGTACAGAAGGTGTTCTCTCAGCAGTTCAAGGGAA     

     TGCTAATGCTATCGGCTACATCTCCTTGGGATCTTTAACGAAATCTGTCAAGGCTTTAGAGATTGATGG     

     TGTCAAGGCTAGTCGAGACACAGTTTTAGATGGTGAATACCCTCTTCAACGTCCCTTCAACATTGTTTG     

     GTCTTCTAATCTTTCCAAGCTAGGTCAAGATTTTATCAGCTTTATCCACTCCAAACAAGGTCAACAAGT     

     GGTCACAGATAATAAATTTATTGAAGCTAAAACCGAAACCACGGAATATACAAGCCAACACTTATCAGG     

     CAAGTTGTCTGTTGTAGGTTCCACTTCAGTATCTTCTTTAATGGAAAAATTAGCAGAAGCTTATAAAAA     

     AGAAAATCCAGAAGTTACGATTGATATTACCTCTAATGGGTCTTCAGCAGGTATTACCGCTGTTAAGGA     

     GAAAACCGCTGATATTGGTATGGTTTCTAGGGAATTAACTCCTGAAGAAGGTAAGAGTCTCACCCATGA     

     TGCTATTGCTTTAGACGGTATTGCTGTTGTGGTCAATAATGACAATAAGGCAAGCCAAGTCAGTATGGC     

     TGAACTTGCAGACGTTTTTAGTGGCAAATTAACCACCTGGGACAAGATTAAA                      

     SP117  amino  acid  (SEQ  ID  NO:206)                                     

     CGNQSAASKQSASGTIEVISRENGSGTRGAFTEITGILKKDGDKKIDNTAKTAVIQNSTEGVLSAVQGN     

     ANAIGYISLGSLTKSVKALEIDGVKASRDTVLDGEYPLQRPFNIVWSSNLSKLGQDFISFIHSKQGQQV     

     VTDNKFIEAKTETTEYTSQHLSGKLSVVGSTSVSSLMEKLAEAYKKENPEVTIDITSNGSSAGITAVKE     

     KTADIGMVSRELTPEEGKSLTHDAIALDGIAVVVNNDNKASQVSMAELADVFSGKLTTWDKIK           

     SP118  nucleotide  (SEQ  ID  NO:207)                                      

     TTGTCAACAACAACATGCTACTTCTGAGGGGACGAATCAAAGGCAAAGCAGTTCAGCGAAAGTTCCATG     

     GAAAGCTTCATACACCAACCTAAACAACCAGGTAAGTACAGAAGAGGTCAAATCTCTCTTATCAGCTCA     

     CTTGGATCCAAATAGTGTTGATGCATTTTTTAATCTCGTTAATGACTATAATACCATTGTCGGCTCAAC     

     TGGCTTATCAGGAGATTTCACTTCCTTTACTCACACCGAATACGATGTTGAGAAAATCAGTCATCTCTG     

     GAATCAAAAGAAGGGCGATTTTGTTGGGACCAACTGCCGTATCAATAGTTATTGTCTTTTGAAAAATTC     

     AGTCACCATTCCAAAGCTTGAAAAGAATGACCAGTTGCTTTTCCTAGATAATGATGCGATTGATAAAGG     

     AAAGGTCTTTGATTCACAAGATAAGGAAGAGTTTGATATTCTATTTTCGAGAGTTCCAACTGAGTCAAC     

     TACAGATGTCAAGGTTCACGCTGAAAAGATGGAAGCATTCTTCTCACAATTTCAATTCAATGAAAAAGC     

     TCGAATGCTGTCTGTAGTCTTGCACGACAATTTGGATGGCGAGTATCTGTTTGTAGGCCACGTTGGGGT     

     CTTAGTACCTGCTGATGACGGTTTCTTATTTGTAGAGAAATTGACTTTCGAAGAGCCCTACCAAGCGAT     

     TAAATTTGCTAGTAAGGAAGATTGCTACAAGTATTTGGGCACCAAGTATGCGGATTATACAGGCGAGGG     

     ACTGGCTAAGCCTTTTATCATGGATAATGATAAGTGGGTTAAACTT                            

     SP118  amino  acid  (SEQ  ID  NO:206)                                     

     CQQQHATSEGTNQRQSSSAKVPWKASYTNLNNQVSTEEVKSLLSAHLDPNSVDAFFNLVNDYNTIVGST     

     GLSGDFTSFTHTEYDVEKISHLWNQKKGDFVGTNCRINSYCLLKNSVTIPKLEKNDQLLFLDNDAIDKG     

     KVFDSQDKEEFDILFSRVPTESTTDVKVHAEKMEAFFSQFQFNEKARMLSVVLHDNLDGEYLFVGHVGV     

     LVPADDGFLFVEKLTFEEPYQAIKFASKEDCYKYLGTKYADYTGEGLAKPFIMDNDKWVKL             

     SP119  nucleotide  (SEQ  ID  NO:209)                                      

     TTGTTCAGGCAAGTCCGTGACTAGTGAACACCAAACGAAAGATGAAATGAAGACGGAGCAGACAGCTAG     

     TAAAACAAGCGCAGCTAAAGGGAAAGAGGTGGCTGATTTTGAATTGATGGGAGTAGATGGCAAGACCTA     

     CCGTTTATCTGATTACAAGGGCAAGAAAGTCTATCTCAAATTCTGGGCTTCTTGGTGTTCCATCTGTCT     

     GGCTAGTCTTCCAGATACGGATGAGATTGCTAAAGAAGCTGGTGATGACTATGTGGTCTTGACAGTAGT     

     GTCACCAGGACATAAGGGAGAGCAATCTGAAGCGGACTTTAAGAATTGGTATAAGGGATTGGATTATAA     

     AAATCTCCCAGTCCTAGTTGACCCATCAGGCAAACTTTTGGAAACTTATGGTGTCCGTTCTTACCCAAC     

     CCAAGCCTTTATAGACAAAGAAGGCAAGCTGGTCAAAACACATCCAGGATTCATGGAAAAAGATGCAAT     

     TTTGCAAACTTTGAAGGAATTAGCC                                                 

     SP119  amino  acid  (SEQ  ID  NO:210)                                     

     CSGKSVTSEHQTKDEMKTEQTASKTSAAKGKEVADFELMGVDGKTYRLSDYKGKKVYLKFWASWCSICL     

     ASLPDTDEIAKEAGDDYVVLTVVSPGHKGEQSEADFKNWYKGLDYKNLPVLVDPSGKLLETYGVRSYPT     

     QAFIDKEGKLVKTHPGFMEKDAILQTLKELA                                           

     SP120  nucleotide  (SEQ  ID  NO:211)                                      

     CTCGCAAATTGAAAAGGCGGCAGTTAGCCAAGGAGGAAAAGCAGTGAAAAAAACAGAAATTAGTAAAGA     

     CGCAGACTTGCACGAAATTTATCTAGCTGGAGGTTGTTTCTGGGGAGTGGAGGAATATTTCTCACGTGT     

     TCCCGGGGTGACGGATGCCGTTTCAGGCTATGCAAATGGTAGAGGAGAAACAACCAAGTACGAATTGAT     

     TAACCAAACAGGTCATGCAGAAACCGTCCATGTCACCTATGATGCCAAGCAAATTTCTCTCAAGGAAAT     

     CCTGCTTCACTATTTCCGCATTATCAATCCAACCAGCAAAAATAAACAAGGAAATGATGTGGGGACCCA     

     GTACCGTACTGGTGTTTATTACACAGATGACAAGGATTTGGAAGTGATTAACCAAGTCTTTGATGAGGT     

     GGCTAAGAAATACGATCAACCTCTAGCAGTTGAAAAGGAAAACTTGAAGAATTTTGTGGTGGCTGAGGA     

     TTACCATCAAGACTATCTCAAGAAAAATCCAAATGGCTACTGCCATATCAATGTTAATCAGGCGGCCTA     

     TCCTGTCATTGATGCCAGCAAATATCCAAAACCAAGTGATGAGGAATTGAAAAAGACCCTGTCACCTGA     

     GGAGTATGCAGTTACCCAGGAAAATCAAACAGAACGAGCTTTCTCAAACCGTTACTGGGATAAATTTGA     

     ATCCGGTATCTATGTGGATATAGCAACTGGGGAACCTCTCTTTTCATCAAAAGACAAATTTGAGTCTGG     

     TTGTGGCTGGCCTAGTTTTACCCAACCCATCAGTCCAGATGTTGTCACCTACAAGGAAGATAAGTCCTA     

     CAATATGACGCGTATGGAAGTGCGGAGCCGAGTAGGAGATTCTCACCTTGGGCATGTCTTTACGGATGG     

     TCCACAGGACAAGGGCGGCTTACGTTACTGTATCAATAGCCTCTCTATCCGCTTTATTCCCAAAGACCA     

     AATGGAAGAAAAAGGTACGCTTATTTAC                                              

     SP120  amino  acid  (SEQ  ID  NO:212)                                     

     SQIEKAAVSQGGKAVKKTEISKDADLHEIYLAGGCFWGVEEYFSRVPGVTDAVSGYANGRGETTKYELI     

     NQTGHAETVHVTYDAKQISLKEILLHYFRIINPTSKNKQGNDVGTQYRTGVYYTDDKDLEVINQVFDEV     

     AKKYDQPLAVEKENLKNFVVAEDYHQDYLKKNPNGYCHINVNQAAYPVIDASKYPKPSDEELKKTLSPE     

     EYAVTQENQTERAFSNRYWDKFESGIYVDIATGEPLFSSKDKFESGCGWPSFTQPISPDVVTYKEDKSY     

     NMTRMEVRSRVGDSHLGHVFTDGPQDKGGLRYCINSLSIRFIPKDQMEEKGTLIY                   

     SP121  nucleotide  (SEQ  ID  NO:213)                                      

     TTGTCAGTCAGGTTCTAATGGTTCTCAGTCTGCTGTGGATGCTATCAAACAAAAAGGGAAATTAGTTGT     

     GGCAACCAGTCCTGACTATGCACCCTTTGAATTTCAATCATTGGTTGATGGAAAGAACCAGGTAGTCGG     

     TGCAGACATCGACATGGCTCAGGCTATCGCTGATGAACTTGGGGTTAAGTTGGAAATCTCAAGCATGAG     

     TTTTGACAATGTTTTGACCAGTCTTCAAACTGGTAAGGCTGACCTAGCAGTTGCAGGAATTAGTGCTAC     

     TGACGAGAGAAAAGAAGTCTTTGATTTTTCAATCCCATACTATGAAAACAAGATTAGTTTCTTGGTTCG     

     TAAGGCTGATGTGGAAAAATACAAGGATTTAACTAGCCTAGAAAGTGCTAATATTGCAGCCCAAAAAGG     

     GACTGTTCCAGAATCAATGGTCAAGGAACAATTGCCAAAAGTTCAATTAACTTCCCTAACTAATATGGG     

     TGAAGCAGTCAATGAATTGCAGGCTGGAAAAATAGATGCTGTTCATATGGATGAGCCTGTTGCACTTAG     

     TTATGCTGCTAAAAACGCTGGCTTAGCTGTCGCAACTGTCAGCTTGAAGATGAAGGACGGCGACGCCAA     

     TGCC                                                                      

     SP121  amino  acid  (SEQ  ID  NO:214)                                     

     CQSGSNGSQSAVDAIKQKGKLVVATSPDYAPFEFQSLVDGKNQVVGADIDMAQAIADELGVKLEISSMS     

     FDNVLTSLQTGKADLAVAGISATDERKEVFDFSIPYYENKISFLVRKADVEKYKDLTSLESANIAAQKG     

     TVPESMVKEQLPKVQLTSLTNMGEAVNELQAGKIDAVHMDEPVALSYAAKNAGLAVATVSLKMKDGDAN     

     A                                                                         

     BP122  nucleotide  (SEQ  ID  NO:215)                                      

     GGAAACTTCACAGGATTTTAAAGAGAAGAAAACAGCAGTCATTAAGGAAAAAGAAGTTGTTAGTAAAAA     

     TCCTGTGATAGACAATAACACTAGCAATGAAGAAGCAAAAATCAAAGAAGAAAATTCCAATAAATCCCA     

     AGGAGATTATACGGACTCATTTGTGAATAAAAACACAGAAAATCCCAAAAAAGAAGATAAAGTTGTCTA     

     TATTGCTGAATTTAAAGATAAAGAATCTGGAGAAAAAGCAATCAAGGAACTATCCAGTCTTAAGAATAC     

     AAAAGTTTTATATACTTATGATAGAATTTTTAACGGTAGTGCCATAGAAACAACTCCAGATAACTTGGA     

     CAAAATTAAACAAATAGAAGGTATTTCATCGGTTGAAAGGGCACAAAAAGTCCAACCCATGATGAATCA     

     TGCCAGAAAGGAAATTGGAGTTGAGGAAGCTATTGATTACCTAAAGTCTATCAATGCTCCGTTTGGGAA     

     AAATTTTGATGGTAGAGGTATGGTCATTTCAAATATCGATACTGGAACAGATTATAGACATAAGGCTAT     

     GAGAATCGATGATGATGCCAAAGCCTCAATGAGATTTAAAAAAGAAGACTTAAAAGGCACTGATAAAAA     

     TTATTGGTTGAGTGATAAAATCCCTCATGCGTTCAATTATTATAATGGTGGCAAAATCACTGTAGAAAA     

     ATATGATGATGGAAGGGATTATTTTGACCCACATGGGATGCATATTGCAGGGATTCTTGCTGGAAATGA     

     TACTGAACAAGACATCAAAAACTTTAACGGCATAGATGGAATTGCACCTAATGCACAAATTTTCTCTTA     

     CAAAATGTATTCTGACGCAGGATCTGGGTTTGCGGGTGATGAAACAATGTTTCATGCTATTGAAGATTC     

     TATCAAACACAACGTTGATGTTGTTTCGGTATCATCTGGTTTTACAGGAACAGGTCTTGTAGGTGAGAA     

     ATATTGGCAAGCTATTCGGGCATTAAGAAAAGCAGGCATTCCAATGGTTGTCGCTACGGGTAACTATGC     

     GACTTCTGCTTCAAGTTCTTCATGGGATTTAGTAGCAAATAATCATCTGAAAATGACCGACACTGGAAA     

     TGTAACACGAACTGCAGCACATGAAGATGCGATAGCGGTCGCTTCTGCTAAAAATCAAACAGTTGAGTT     

     TGATAAAGTTAACATAGGTGGAGAAAGTTTTAAATACAGAAATATAGGGGCCTTTTTCGATAAGAGTAA     

     AATCACAACAAATGAAGATGGAACAAAAGCTCCTAGTAAATTAAAATTTGTATATATAGGCAAGGGGCA     

     AGACCAAGATTTGATAGGTTTGGATCTTAGGGGCAAAATTGCAGTAATGGATAGAATTTATACAAAGGA     

     TTTAAAAAATGCTTTTAAAAAAGCTATGGATAAGGGTGCACGCGCCATTATGGTTGTAAATACTGTAAA     

     TTACTACAATAGAGATAATTGGACAGAGCTTCCAGCTATGGGATATGAAGCGGATGAAGGTACTAAAAG     

     TCAAGTGTTTTCAATTTCAGGAGATGATGGTGTAAAGCTATGGAACATGATTAATCCTGATAAAAAAAC     

     TGAAGTCAAAAGAAATAATAAAGAAGATTTTAAAGATAAATTGGAGCAATACTATCCAATTGATATGGA     

     AAGTTTTAATTCCAACAAACCGAATGTAGGTGACGAAAAAGAGATTGACTTTAAGTTTGCACCTGACAC     

     AGACAAAGAACTCTATAAAGAAGATATCATCGTTCCAGCAGGATCTACATCTTGGGGGCCAAGAATAGA     

     TTTACTTTTAAAACCCGATGTTTCAGCACCTGGTAAAAATATTAAATCCACGCTTAATGTTATTAATGG     

     CAAATCAACTTATGGCTATATGTCAGGAACTAGTATGGCGACTCCAATCGTGGCAGCTTCTACTGTTTT     

     GATTAGACCGAAATTAAAGGAAATGCTTGAAAGACCTGTATTGAAAAATCTTAAGGGAGATGACAAAAT     

     AGATCTTACAAGTCTTACAAAAATTGCCCTACAAAATACTGCGCGACCTATGATGGATGCAACTTCTTG     

     GAAAGAAAAAAGTCAATACTTTGCATCACCTAGACAACAGGGAGCAGGCCTAATTAATGTGGCCAATGC     

     TTTGAGAAATGAAGTTGTAGCAACTTTCAAAAACACTGATTCTAAAGGTTTGGTAAACTCATATGGTTC     

     CATTTCTCTTAAAGAAATAAAAGGTGATAAAAAATACTTTACAATCAAGCTTCACAATACATCAAACAG     

     ACCTTTGACTTTTAAAGTTTCAGCATCAGCGATAACTACAGATTCTCTAACTGACAGATTAAAACTTGA     

     TGAAACATATAAAGATGAAAAATCTCCAGATGGTAACCAAATTGTTCCAGAAATTCACCCAGAAAAAGT     

     CAAAGGAGCAAATATCACATTTGAGCATGATACTTTCACTATAGGCGCAAATTCTAGCTTTGATTTGAA     

     TGCGGTTATAAATGTTGGAGAGGCCAAAAACAAAAATAAATTTGTAGAATCATTTATTCATTTTGAGTC     

     AGTGGAAGCGATGGAAGCTCTAAACTCCAGCGGGAAGAAAATAAACTTCCAACCTTCTTTGTCGATGCC     

     TCTAATGGGATTTGCTGGGAATTGGAACCACGAACCAATCCTTGATAAATGGGCTTGGGAAGAAGGGTC     

     AAGATCAAAAACACTGGGAGGTTATGATGATGATGGTAAACCGAAAATTCCAGGAACCTTAAATAAGGG     

     AATTGGTGGAGAACATGGTATAGATAAATTTAATCCAGCAGGAGTTATACAAAATAGAAAAGATAAAAA     

     TACAACATCCCTGGATCAAAATCCAGAATTATTTGCTTTCAATAACGAAGGGATCAACGCTCCATCATC     

     AAGTGGTTCTAAGATTGCTAACATTTATCCTTTAGATTCAAATGGAAATCCTCAAGATGCTCAACTTGA     

     AAGAGGATTAACACCTTCTCCACTTGTATTAAGAAGTGCAGAAGAAGGATTGATT                   

     SP122  amino  acid  (SEQ  ID  NO:216)                                     

     ETSQDFKEKKTAVIKEKEVVSKNPVIDNNTSNEEAKIKEENSNKSQGDYTDSFVNKNTENPKKEDKVVY     

     IAEFKDKESGEKAIKELSSLKNTKVLYTYDRIFNGSAIETTPDNLDKIKQIEGISSVERAQKVQPMMNH     

     ARKEIGVEEAIDYLKSINAPFGKNFDGRGMVISNIDTGTDYRHKAMRIDDDAKASMRFKKEDLKGTDKN     

     YWLSDKIPHAFNYYNGGKITVEKYDDGRDYFDPHGMHIAGILAGNDTEQDIKNFNGIDGIAPNAQIFSY     

     KMYSDAGSGFAGDETMFHAIEDSIKHMVDVVSVSSGFTGTGLVGEKYWQAIRALRKAGIPMVVATGNYA     

     TSASSSSWDLVANNHLKMTDTGNVTRTAAHEDAIAVASAKNQTVEFDKVNIGGESFKYRNIGAFFDKSK     

     ITTNEDGTKAPSKLKFVYIGKGQDQDLIGLDLRGKIAVMDRIYTKDLKNAFKKAMDKGARAIMVVNTVN     

     YYNRDNWTELPAMGYEADEGTKSQVFSISGDDGVKLWNMINPDKKTEVKRNNKEDFKDKLEQYYPIDME     

     SFNSNKPNVGDEKEIDFKFAPDTDKELYKEDIIVPAGSTSWGPRIDLLLKPDVSAPGKNIKSTLNVING     

     KSTYGYMSGTSMATPIVAASTVLIRPKLKEMLERPVLKNLKGDDKIDLTSLTKIALQNTARPMMDATSW     

     KEKSQYFASPRQQGAGLINVANALRNEVVATFKNTDSKGLVNSYGSISLKEIKGDKKYFTIKLHNTSNR     

     PLTFKVSASAITTDSLTDRLKLDETYKDEKSPDGKQIVPEIHPEKVKGANITFEHDTFTIGANSSFDLN     

     AVINVGEAKNKNKFVESFIHFESVEAMEALNSSGKKINFQPSLSMPLMGFAGNWNHEPILDKWAWEEGS     

     RSKTLGGYDDDGKPKIPGTLNKGIGGEHGIDKFNPAGVIQNRKDKNTTSLDQNPELFAFNNEGINAPSS     

     SGSKIANIYPLDSNGNPQDAQLERGLTPSPLVLRSAEEGLI                                 

     SP123  nucleotide  (SEQ  ID  NO:217)                                      

     TGTGGTCGAAGTTGAGACTCCTCAATCAATAACAAATCAGGAGCAAGCTAGGACAGAAAACCAAGTAGT     

     AGAGACAGAGGAAGCTCCAAAAGAAGAAGCACCTAAAACAGAAGAAAGTCCAAAGGAAGAACCAAAATC     

     GGAGGTAAAACCTACTGACGACACCCTTCCTAAAGTAGAAGAGGGGAAAGAAGATTCAGCAGAACCAGC     

     TCCAGTTGAAGAAGTAGGTGGAGAAGTTGAGTCAAAACCAGAGGAAAAAGTAGCAGTTAAGCCAGAAAG     

     TCAACCATCAGACAAACCAGCTGAGGAATCAAAAGTTGAACAAGCAGGTGAACCAGTCGCGCCAAGAGA     

     AGACGAAAAGGCACCAGTCGAGCCAGAAAAGCAACCAGAAGCTCCTGAAGAAGAGAAGGCTGTAGAGGA     

     AACACCGAAACAAGAAGAGTCAACTCCAGATACCAAGGCTGAAGAAACTGTAGAACCAAAAGAGGAGAC     

     TGTTAATCAATCTATTGAACAACCAAAAGTTGAAACGCCTGCTGTAGAAAAACAAACAGAACCAACAGA     

     GGAACCAAAAGTTGAACAAGCAGGTGAACCAGTCGCGCCAAGAGAAGACGAACAGGCACCAACGGCACC     

     AGTTGAGCCAGAAAAGCAACCAGAAGTTCCTGAAGAAGAGAAGGCTGTAGAGGAAACACCGAAACCAGA     

     AGATAAAATAAAGGGTATTGGTACTAAAGAACCAGTTGATAAAAGTGAGTTAAATAATCAAATTGATAA     

     AGCTAGTTCAGTTTCTCCTACTGATTATTCTACAGCAAGTTACAATGCTCTTGGACCTGTTTTAGAAAC     

     TGCAAAAGGTGTCTATGCTTCAGAGCCTGTAAAACAGCCTGAGGTAAATAGCGAGACAAATAAACTTAA     

     AACGGCTATTGACGCTCTAAACGTTGATAAAACTGAATTAAACAATACGATTGCAGATGCAAAAACAAA     

     GGTAAAAGAACATTACAGTGATAGAAGTTGGCAAAACCTCCAAACTGAAGTTACAAAGGCTGAAAAAGT     

     TGCAGCTAATACAGATGCTAAACAAAGTGAAGTTAACGAAGCTGTTGAAAAATTAACTGCAACTATTGA     

     AAAATTGGTTGAATTATCTGAAAAGCCAATATTAACATTGACTAGTACCGATAAGAAAATATTGGAACG     

     TGAAGCTGTTGCTAAGTATACTCTAGAAAATCAAAACAAAACAAAAATCAAATCAATCACAGCTGAATT     

     GAAAAAAGGAGAAGAAGTTATTAATACTGTAGTCCTTACAGATGACAAGGTAACAACAGAAACTATAAG     

     CGCTGCATTTAAGAACCTAGAGTACTACAAAGAATACACCCTATCTACAACTATGATTTACGACAGAGG     

     TAACGGTGAAGAAACTGAAACTCTAGAAAATCAAAATATTCAATTAGATCTTAAAAAAGTTGAGCTTAA     

     AAATATTAAACGTACAGATTTAATCAAATACGAAAATGGAAAAGAAACTAATGAATCACTGATAACAAC     

     TATTCCTGATGATAAGAGCAATTATTATTTAAAAATAACTTCAAATAATCAGAAAACTACATTACTAGC     

     TGTTAAAAATATAGAAGAAACTACGGTTAACGGAACACCTGTATATAAAGTTACAGCAATCGCAGACAA     

     TTTAGTCTCTAGAACTGCTGATAATAAATTTGAAGAAGAA                                  

     SP123  amino  acid  (SEQ  ID  NO:218)                                     

     VVEVETPQSITNQEQARTENQVVETEEAPKEEAPKTEESPKEEPKSEVKPTDDTLPKVEEGKEDSAEPA     

     PVEEVGGEVESKPEEKVAVKPESQPSDKPAEESKVEQAGEPVAPREDEKAPVEPEKQPEAPEEEKAVEE     

     TPKQEESTPDTKAEETVEPKEETVNQSIEQPKVETPAVEKQTEPTEEPKVEQAGEPVAPREDEQAPTAP     

     VEPEKQPEVPEEEKAVEETPKPEDKIKGIGTKEPVDKSELNNQIDKASSVSPTDYSTASYNALGPVLET     

     AKGVYASEPVKQPEVNSETNKLKTAIDALNVDKTELNNTIADAKTKVKEHYSDRSWQNLQTEVTKAEKV     

     AANTDAKQSEVNEAVEKLTATIEKLVELSEKPILTLTSTDKKILEREAVAKYTLENQNKTKIKSITAEL     

     KKGEEVINTVVLTDDKVTTETISAAFKNLEYYKEYTLSTTMIYDRGNGEETETLENQNIQLDLKKVELK     

     NIKRTDLIKYENGKETNESLITTIPDDKSNYYLKITSNNQKTTLLAVKNIEETTVNGTPVYKVTAIADN     

     LVSRTADNKFEEE                                                             

     SP124  amino  acid  (SEQ  ID  NO:219)                                     

     AACACCTGTATATAAAGTTACAGCAATCGCAGACAATTTAGTCTCTAGAACTGCTGATAATAAATTTGA     

     AGAAGAATACGTTCACTATATTGAAAAACCTAAAGTCCACGAAGATAATGTATATTATAATTTCAAAGA     

     ATTAGTGGAAGCTATTCAAAACGATCCTTCAAAAGAATATCGTCTGGGACAATCAATGAGCGCTAGAAA     

     TGTTGTTCCTAATGGAAAATCATATATCACTAAAGAATTCACAGGAAAACTTTTAAGTTCTGAAGGAAA     

     ACAATTTGCTATTACTGAATTGGAACATCCATTATTTAATGTGATAACAAACGCAACGATAAATAATGT     

     GAATTTTGAAAATGTAGAGATAGAACGTTCTGGTCAAGATAATATTGCATCATTAGCCAATACTATGAA     

     AGGTTCTTCAGTTATTACAAATGTCAAAATTACAGGCACACTTTCAGGTCGTAATAATGTTGCTGGATT     

     TGTAAATAATATGAATGATGGAACTCGTATTGAAAATGTTGCTTTCTTTGGCAAACTACACTCTACAAG     

     TGGAAATGGCTCTCATACAGGGGGAATTGCAGGTACAAACTATAGAGGAATTGTTAGAAAAGCATATGT     

     TGATGCTACTATTACAGGAAACAAAACACGCGCCAGCTTGTTAGTTCCTAAAGTAGATTATGGATTAAC     

     TCTAGACCATCTTATTGGTACAAAAGCTCTCCTAACTGAGTCGGTTGTAAAAGGTAAAATAGATGTTTC     

     AAATCCAGTAGAAGTTGGAGCAATAGCAAGTAAGACTTGGCCTGTAGGTACGGTAAGTAATTCTGTCAG     

     CTATGCTAAGATTATCCGTGGAGAGGAGTTATTCGGCTCTAACGACGTTGATGATTCTGATTATGCTAG     

     TGCTCATATAAAAGATTTATATGCGGTAGAGGGATATTCGTCAGGTAATAGATCATTTAGGAAATCTAA     

     AACATTTACTAAATTAACTAAAGAACAAGCTGATGCTAAAGTTACTACTTTCAATATTACTGCTGATAA     

     ATTAGAAAGTGATCTATCTCCTCTTGCAAAACTTAATGAAGAAAAAGCCTATTCTAGTATTCAAGATTA     

     TAACGCTGAATATAACCAAGCCTATAAAAATCTTGAAAAATTAATACCATTCTACAATAAAGATTATAT     

     TGTATATCAAGGTAATAAATTAAATAAAGAACACCATCTAAATACTAAAGAAGTTCTTTCTGTTACCGC     

     GATGAACAACAATGAGTTTATCACAAACCTAGATGAAGCTAATAAAATTATTGTTCACTATGCGGACGG     

     TACAAAAGATTACTTTAACTTGTCTTCTAGCAGTGAAGGTTTAAGTAATGTAAAAGAATATACTATAAC     

     TGACTTAGGAATTAAATATACACCTAATATCGTTCAAAAAGATAACACTACTCTTGTTAATGATATAAA     

     ATCTATTTTAGAATCAGTAGAGCTTCAGTCTCAAACGATGTATCAGCATCTAAATCGATTAGGTGACTA     

     TAGAGTTAATGCAATCAAAGATTTATATTTAGAAGAAAGCTTCACAGATGTTAAAGAAAACTTAACAAA     

     CCTAATCACAAAATTAGTTCAAAACGAAGAACATCAACTAAATGATTCTCCAGCTGCTCGTCAAATGAT     

     TCGTGATAAAGTCGAGAAAAACAAAGCAGCTTTATTACTAGGTTTAACTTACCTAAATCGTTACTATGG     

     AGTTAAATTTGGTGATGTTAATATTAAAGAATTAATGCTATTCAAACCAGATTTCTATGGTGAAAAAGT     

     TAGCGTATTAGACAGATTAATTGAAATCGGTTCTAAAGAGAACAACATTAAAGGTTCACGTACATTCGA     

     CGCATTCGGTCAAGTA                                                          

     SP124  amino  acid  (SEQ  ID  NO:220)                                     

     TPVYKVTAIADNLVSRTADNKFEEEYVHYIEKPKVHEDNVYYNFKELVEAIQNDPSKEYRLGQSMSARN     

     VVPNGKSYITKEFTGKLLSSEGKQFAITELEHPLFNVITNATINNVNFENVEIERSGQDNIASLANTMK     

     GSSVITNVKITGTLSGRNNVAGFVNNMNDGTRIENVAFFGKLHSTSGNGSHTGGIAGTNYRGIVRKAYV     

     DATITGNKTRASLLVPKVDYGLTLDHLIGTKALLTESVVKGKIDVSNPVEVGAIASKTWPVGTVSNSVS     

     YAKIIRGEELFGSNDVDDSDYASAHIKDLYAVEGYSSGNRSFRKSKTFTKLTKEQADAKVTTFNITADK     

     LESDLSPLAKLNEEKAYSSIQDYNAEYNQAYKNLEKLIPFYNKDYIVYQGNKLNKEHHLNTKEVLSVTA     

     MNNNEFITNLDEANKIIVHYADGTKDYFNLSSSSEGLSNVKEYTITDLGIKYTPNIVQKDNTTLVNDIK     

     SILESVELQSQTMYQHLNRLGDYRVNAIKDLYLEESFTDVKENLTNLITKLVQNEEHQLNDSPAARQMI     

     RDKVEKNKAALLLGLTYLNRYYGVKFGDVNIKELMLFKPDFYGEKVSVLDRLIEIGSKENNIKGSRTFD     

     AFGQV                                                                     

     SP125  nucleotide  (SEQ  ID  NO:221)                                      

     ATTAGACAGATTAATTGAAATCGGTTCTAAAGAGAACAACATTAAAGGTTCACGTACATTCGACGCATT     

     CGGTCAAGTATTGGCTAAATATACTAAATCAGGTAATTTAGATGCATTTTTAAATTATAATAGACAATT     

     GTTCACAAATATAGACAATATGAACGATTGGTTTATTGATGCTACAGAAGACCATGTCTACATCGCAGA     

     ACGCGCTTCTGAGGTCGAAGAAATTAAAAATTCTAAACATCGTGCATTCGATAATTTAAAACGAAGTCA     

     CCTTAGAAATACTATACTCCCACTACTGAATATTGATAAAGCACATCTTTATTTAATTTCAAATTATAA     

     TGCAATTGCCTTTGGTAGTGCAGAGCGATTAGGTAAAAAATCATTAGAAGATATTAAAGATATCGTTAA     

     CAAAGCTGCAGATGGTTATAGAAACTATTATGATTTCTGGTATCGTCTAGCGTCTGATAACGTTAAACA     

     ACGACTACTAAGAGATGCTGTTATTCCTATTTGGGAAGGTTATAACGCTCCTGGTGGATGGGTTGAAAA     

     ATATGGCCGCTATAATACCGACAAAGTATATACTCCTCTTAGAGAATTCTTTGGTCCTATGGATAAGTA     

     TTATAATTATAATGGAACAGGAGCTTATGCTGCTATATATCCTAACTCTGATGATATTAGAACTGATGT     

     AAAATATGTTCATTTAGAAATGGTTGGTGAATACGGTATTTCAGTTTACACACATGAAACAACACACGT     

     CAACGACCGTGCGATTTACTTAGGTGGCTTTGGACACCGTGAAGGTACTGATGCTGAAGCATATGCTCA     

     GGGTATGCTACAAACTCCTGTTACTGGTAGTGGATTTGATGAGTTTGGTTCTTTAGGTATTAATATGGT     

     ATTTAAACGCAAAAATGATGGGAATCAGTGGTATATTACAGATCCAAAAACTCTAAAAACACGAGAAGA     

     TATTAATAGATATATGAAGGGTTATAATGACACTTTAACTCTTCTTGATGAAATTGAGGCTGAATCTGT     

     GATTTCTCAACAAAATAAAGATTTAAATAGTGCATGGTTCAAAAAAATAGATAGAGAATACCGTGATAA     

     CAATAAATTAAATCAATGGGATAAAATTCGAAATCTAAGTCAAGAAGAGAAAAATGAATTAAATATTCA     

     ATCTGTTAATGATTTAGTTGATCAACAATTAATGACTAATCGCAATCCAGGTAATGGTATCTATAAACC     

     CGAAGCAATTAGCTATAACGATCAATCACCTTATGTAGGTGTTAGAATGATGACCGGTATCTACGGAGG     

     TAATACTAGTAAAGGTGCTCCTGGAGCTGTTTCATTCAAACATAATGCTTTTAGATTATGGGGTTACTA     

     CGGATACGAAAATGGGTTCTTAGGTTATGCTTCAAATAAATATAAACAACAATCTAAAACAGATGGTGA     

     GTCTGTTCTAAGTGATGAATATATTATCAAGAAAATATCTAACAATACATTTAATACTATTGAAGAATT     

     TAAAAAAGCTTACTTCAAAGAAGTTAAAGATAAAGCAACGAAAGGATTAACAACATTCGAAGTAAATGG     

     TTCTTCCGTTTCATCATACGATGATTTACTGACATTGTTTAAAGAAGCTGTTAAAAAAGATGCCGAAAC     

     TCTTAAACAAGAAGCAAACGGTAATAAAACAGTATCTATGAATAATACAGTTAAATTAAAAGAAGCTGT     

     TTATAAGAAACTTCTTCAACAAACAAATAGCTTTAAAACTTCAATCTTTAAA                      

     SP125  amino  acid  (SEQ  ID  NO:222)                                     

     LDRLIEIGSKENNIKGSRTFDAFGQVLAKYTKSGNLDAFLNYNRQLFTNIDNMNDWFIDATEDHVYIAE     

     RASEVEEIKNSKHRAFDNLKRSHLRNTILPLLNIDKAHLYLISNYNAIAFGSAERLGKKSLEDIKDIVN     

     KAADGYRNYYDFWYRLASDNVKQRLLRDAVIPIWEGYNAPGGWVEKYGRYNTDKVYTPLREFFGPMDKY     

     YNYNGTGAYAAIYPNSDDIRTDVKYVHLEMVGEYGISVYTHETTHVNDRAIYLGGFGHREGTDAEAYAQ     

     GMLQTPVTGSGFDEFGSLGINMVFKRKNDGNQWYITDPKTLKTREDINRYMKGYNDTLTLLDEIEAESV     

     ISQQNKDLNSAWFKKIDREYRDNNKLNQWDKIRNLSQEEKNELNIQSVNDLVDQQLMTNRNPGNGIYKP     

     EAISYNDQSPYVGVRMMTGIYGGNTSKGAPGAVSFKHNAFRLWGYYGYENGFLGYASNKYKQQSKTDGE     

     SVLSDEYIIKKISNNTFNTIEEFKKAYFKEVKDKATKGLTTFEVNGSSVSSYDDLLTLFKEAVKKDAET     

     LKQEANGNKTVSMNNTVKLKEAVYKKLLQQTNSFKTSIFK                                  

     SP126  nucleotide  (SEQ  ID  NO:223)                                      

     TAAGACAGATGAACGGAGCAAGGTGTTTGACTTTTCCATTCCCTACTATACTGCAAAAAATAAACTCAT     

     TGTCAAAAAATCTGACTTGACTACTTATCAGTCTGTAAACGACTTGGCGCAGAAAAAGGTTGGAGCGCA     

     GAAAGGTTCGATTCAAGAGACGATGGCGAAAGATTTGCTACAAAATTCTTCCCTCGTATCTCTGCCTAA     

     AAATGGGAATTTAATCACAGATTTAAAATCAGGACAAGTGGATGCCGTTATCTTTGAAGAACCTGTTTC     

     CAAGGGATTTGTGGAAAATAATCCTGATTTAGCAATCGCAGACCTCAATTTTGAAAAAGAGCAAGATGA     

     TTCCTACGCGGTAGCCATgAAAAAAGATAGCAAGAAATTGAAGAGGCAGTTCGATAAAACCATTCAAAA     

     GTTGAAGGAGTCTGGGGAATTAGACAAACTCATTGAGGAAGCCTTA                            

     SP126  amino  acid  (SEQ  ID  NO:224)                                     

     KTDERSKVFDFSIPYYTAKNKLIVKKSDLTTYQSVNDLAQKKVGAQKGSIQETMAKDLLQNSSLVSLPK     

     NGNLITDLKSGQVDAVIFEEPVSKGFVENNPDLAIADLNFEKEQDDSYAVAMKKDSKKLKRQFDKTIQK     

     LKESGELDKLIEEAL                                                           

     SP127  nucleotide  (SEQ  ID  NO:225)                                      

     CTGTGAGAATCAAGCTACACCCAAAGAGACTAGCGCTCAAAAGACAATCGTCCTTGCTACAGCTGGCGA     

     CGTGCCACCATTTGACTACGAAGACAAGGGCAATCTGACAGGCTTTGATATCGAAGTTTTAAAGGCAGT     

     AGATGAAAAACTCAGCGACTACGAGATTCAATTCCAAAGAACCGCCTGGGAGAGCATCTTCCCAGGACT     

     TGATTCTGGTCACTATCAGGCTGCGGCCAATAACTTGAGTTACACAAAAGAGCGTGCTGAAAAATACCT     

     TTACTCGCTTCCAATTTCCAACAATCCCCTCGTCCTTGTCAGCAACAAGAAAAATCCTTTGACTTCTCT     

     TGACCAGATCGCTGGTAAAACAACACAAGAGGATACCGGAACTTCTAACGCTCAATTCATCAATAACTG     

     GAATCAGAAACACACTGATAATCCCGCTACAATTAATTTTTCTGGTGAGGATATTGGTAAACGAATCCT     

     AGACCTTGCTAACGGAGAGTTTGATTTCCTAGTTTTTGACAAGGTATCCGTTCAAAAGATTATCAAGGA     

     CCGTGGTTTAGACCTCTCAGTCGTTGATTTACCTTCTGCAGATAGCCCCAGCAATTATATCATTTTCTC     

     AAGCGACCAAAAAGAGTTTAAAGAGCAATTTGATAAAGCGCTCAAAGAACTCTATCAAGACGGAACCCT     

     TGAAAAACTCAGCAATACCTATCTAGGTGGTTCTTACCTCCCAGATCAATCTCAGTTACAA             

     SP127  amino  acid  (SEQ  ID  NO:226)                                     

     CENQATPKETSAQKTIVLATAGDVPPFDYEDKGNLTGFDIEVLKAVDEKLSDYEIQFQRTAWESIFPGL     

     DSGHYQAAANNLSYTKERAEKYLYSLPISNNPLVLVSNKKNPLTSLDQIAGKTTQEDTGTSNAQFINNW     

     NQKHTDNPATINFSGEDIGKRILDLANGEFDFLVFDKVSVQKIIKDRGLDLSVVDLPSADSPSNYIIFS     

     SDQKEFKEQFDKALKELYQDGTLEKLSNTYLGGSYLPDQSQLQ                               

                TABLE 2                                                     

     ______________________________________                                    

     S. pneumoniae Antigenic Epitopes                                          

     ______________________________________                                    

     SP001                                                                     

     Lys-1 to Ile-10; Leu-13 to Lys-32; Arg-41 to Ile-51; Ser-85 to Glu-97;    

     Ala-159 to His-168; Val-309 to Thr-318; Val-341 to Asn-352; Asn-415 to    

     Met-430; Phe-454 to Asn-464; Ser-573 to Gly-591; Asn-597 to Thr-641;      

     and Asn-644 to Ala-664.                                                   

     SP004                                                                     

     Thr-9 to Thr-24; Ile-29 to Ala-48; Thr-49 to Val-56; Val-286 to Val-      

     312;                                                                      

     Pro-316 to Glu-344; Val-345 to Ile-367; Gln-368 to Val-399; Ser-400 to    

     Glu-431; Asn-436 to Ala-457; Ile-467 to Ala-498; and Thr-499 to Glu-      

     540.                                                                      

     SP006                                                                     

     Glu-1 to Lys-13; Pro-24 to Gly-36; Val-104 to Thr-112; Ala-118 to Asn-    

     130; Trp-137 to Ala-146; Ser-151 to Ile-159; Ile-181 to Leu-188; and      

     Pro-194 to Tyr-202.                                                       

     SP007                                                                     

     Gly-1 to Asn-7; Tyr-24 to Gln-34; His-47 to Phe-55; Ser-60 to Ala-67;     

     Ala-122 to Leu-129; Leu-221 to Lys-230; Val-236 to Phe-256; and           

     Asp-271 to Gly-283; and Leu-291 to Asp-297.                               

     SP008                                                                     

     Leu-4 to Lys-17; Gln-24 to Leu-32; Asp-60 to Ser-66; Ser-70 to Asp-76;    

     Ala-276 to Lys-283; Asn-304 to Lys-311; and Thr-429 to Pro-437.           

     SP009                                                                     

     Thr-4 to Glu-11; Leu-50 to Asp-60; Ile-102 to Trp-123; and Ser-138 to     

     Ile-157.                                                                  

     SP010                                                                     

     Phe-34 to Gly-41; Asp-44 to Lys-50; Leu-172 to Val-186; Leu-191 to Val-   

     198; Ser-202 to Ile-209; and Val-213 to Leu-221.                          

     SP011                                                                     

     Asn-2 to Thr-10; Asp-87 to Ala-102; Tyr-125 to Glu-132; Thr-181 to Tyr-   

     189; Arg-217 to Thr-232; Asn-257 to Lys-264; Pro-271 to Ser-278; Tyr-     

     317 to Ala-325; Glu-327 to Pro-337; and Thr-374 to Val-381.               

     SP012                                                                     

     Gly-1 to Lys-19; Phe-34 to Tyr-41; Leu-109 to Lys-126; and Leu-231 to     

     Glu-247.                                                                  

     SP013                                                                     

     Ala-1 to Lys-12; Ile-42 to Pro-53; Leu-138 to Lys-146; Ile-205 to Lys-    

     217; Ser-235 to Ile-251; and Ser-261 to Tyr-272.                          

     SP014                                                                     

     Gly-1 to Val-16; Leu-35 to Leu-44; Asp-73 to Asp-81; Ile-83 to Asp-92;    

     Glu-145 to Ile-153; Phe-188 to Asn-196; Ser-208 to Phe-215; Ile-224 to    

     Leu-231; and Asn-235 to Ala-243.                                          

     SP015                                                                     

     Ser-1 to Pro-16; Asn-78 to Glu-88; Ala-100 to Val-108; Ala-122 to Thr-    

     129; Thr-131 to Ser-137; Leu-201 to Ser-220; and Gly-242 to Val-251.      

     SP016                                                                     

     Gly-1 to Glu-20; Thr-30 to Val-38; Gln-94 to Asn-105; Lys-173 to Pro-     

     182; Gly-189 to Arg-197; Ser-207 to Val-224; Pro-288 to Leu-298; Ala-     

     327 to Ala-342; and Ser-391 to Ala-402.                                   

     SP017                                                                     

     Ser-1 to Thr-12; Ala-36 to Tyr-45; Gln-48 to Ile-54; Lys-59 to Lys-76;    

     Tyr-113 to Leu-138; and Phe-212 to Asp-219.                               

     SP019                                                                     

     Val-97 to Glu-117; Asp-163 to Leu-169; Thr-182 to Thr-191; and Lys-241    

     to Ser-250.                                                               

     SP020                                                                     

     Asn-18 to Lys-25; Thr-47 to Glu-60; Trp-75 to Val-84; Gly-102 to Val-     

     110; Pro-122 to Ala-131; and Glu-250 to Pro-258.                          

     SP021                                                                     

     Serl to Asp-8; Val-44 to Asp-54; Ala-117 to Val-125; Thr-165 to Thr-      

     173; and Glu-180 to Pro-189.                                              

     SP022                                                                     

     Phe-5 to Lys-13; Thr-20 to Ser-36; Glu-59 to Lys-81; Tyr-85 to Gly-93;    

     Trp-94 to Trp-101; and Thr-195 to Trp-208.                                

     SP023                                                                     

     Gln-45 to Glu-59; Asp-69 to Pro-85; Lys-111 to Asn-121; Pro-218 to Ala-   

     228; and Glu-250 to Asn-281.                                              

     SP025                                                                     

     Gln-14 to Thr-20; Gly-27 to Phe-33; Gly-63 to Glu-71; and Ile-93 to       

     Phe-102.                                                                  

     SP028                                                                     

     Asp-171 to Pro-179; Tyr-340 to Glu-350; Pro-455 to Tyr-463; and           

     Asp-474 to Pro-480.                                                       

     SP030                                                                     

     Leu-22 to Leu-37; Trp-81 to Ala-90; Phe-101 to Ala-106; Thr-124 to Tyr-   

     130; and Asn-138 to Glu-144.                                              

     SP031                                                                     

     Asp-8 to Val-16; Gly-27 to Thr-35; Gly-178 to Asp-195; Thr-200 to         

     Asp209; Trp-218 to Leu-224; and Lys-226 to Asp-241.                       

     SP032                                                                     

     Ser-9 to Asp-28; Phe-31 to Val-40; Gly-42 to Arg-50; Ile-52 to Leu-60;    

     Asp-174 to Phe-186; Leu-324 to Met-333; and Thr-340 to Asn-347.           

     SP033                                                                     

     Gln-2 to Ile-13; Phe-46 to Ile-53; and Asp-104 to Thr-121.                

     SP034                                                                     

     Glu-36 to Gly-43; Ala-188 to Asp-196; Trp-313 to Gly-320; and Leu-323     

     to Leu-329.                                                               

     SP035                                                                     

     Arg-19 to Asp-36; Asp-47 to Val-57; Asn-134 to Thr-143; Asp-187 to        

     Arg-196; and Glu-222 to Ser-230.                                          

     SP036                                                                     

     Arg-10 to Arg-17; Lys-29 to Ser-39; Ser-140 to Ala-153; Arg-158 to Tyr-   

     169; Asp-175 to Ala-183; Gly-216 to Asn-236; Ala-261 to Leu-270; Arg-     

     282 to Phe-291; and Thr-297 to Ala-305; Pro-342 to Gln-362; Phe-455 to    

     Asp-463; His-497 to Thr-511; Ala-521 to Gly-529; Ile-537 to Val-546;      

     Ile-556 to Ala-568; Pro-581 to Ser-595; Glu-670 to Ala-685; Ser-696 to    

     Ala-705 and Leu-782 to Ser-791.                                           

     SP038                                                                     

     Glu-61 to Pro-69; Phe-107 to Ala-115; Leu-130 to Tyr-141; Ala-229 to      

     Glu-237; Ser-282 to Asn-287; Ala-330 to Glu-338; and Tyr-387 to Glu-      

     393.                                                                      

     SP039                                                                     

     Ser-28 to Asp-35; Pro-88 to Pro-96; Leu-125 to Arg-135; Phe-149 to Leu-   

     157; Gln-246 to Val-254; Ala-357 to Thr-362; Gly-402 to Lys-411; and      

     Leu-440 to Pro-448.                                                       

     SP040                                                                     

     Thr-21 to Ile-30; His-54 to Gln-68; Arg-103 to Leu-117; and Thr-127 to    

     Leu-136.                                                                  

     SP041                                                                     

     Gly-36 to Asp-49; Leu-121 to Val-128; and Ala-186 to Ile-196.             

     SP042                                                                     

     Gly-11 to Arg-19; Ile-23 to Lys-31; His-145 to Asn-151; Gln-159 to Asp-   

     166; Ile-175 to Asp-181; Gly-213 to Tyr-225; Ile-283 to Val-291; Pro-     

     329 to Glu-364; Arg-372 to Ser-386; Thr-421 to Phe-430; Leu-445 to Val-   

     453; Ile-486 to Ala-497; Asp-524 to Ala-535; His-662 to Gly-674; and      

     His-679 to Gln-702.                                                       

     SP043                                                                     

     Lys-2 to Asp-12; Val-58 to Asn-68; Ser-87 to Asp-95; and Asp-102 to       

     Lys-117.                                                                  

     SP044                                                                     

     Gln-3 to Lys-11; Asp-37 to Tyr-52; Glu-171 to Leu-191; His-234 to Asn-    

     247; and Asn-283 to Ala-291.                                              

     SP045                                                                     

     Tyr-52 to Ile-63; Asp-212 to Gln-227; Ser-315 to Thr-332; Leu-345 to      

     Phe-354; Asp-362 to Val-370; Thr-518 to Asn-539; Ala-545 to Lys-559;      

     and Val-601 to Pro-610.                                                   

     SP046                                                                     

     Gln-9 to Ala-18; Glu-179 to Lys-186; Lys-264 to Glu-271; Gly-304 to       

     Glu-17; Ser-503 to Asn-511; Asn-546 to Thr-553; and Asn-584 to            

     Asp-591.                                                                  

     SP048                                                                     

     Tyr-4 to Asp-25; Lys-33 to Val-70; Asp-151 to Thr-170; Asp-222 to         

     Val-257; Thr-290 to Phe-301; and Gly-357 to Val-367.                      

     SP049                                                                     

     Ala-23 to Arg-37; Tyr-85 to Gln-95; Glu-106 to Ile-118; Arg-131 to        

     ILE-144; Gly-150 to Ser-162; and Ala-209 to Asp-218.                      

     SP050                                                                     

     Asp-95 to Glu-113; Gly-220 to Gly-228; Asn-284 to Glu-295; Thr-298        

     to Val-315.                                                               

     SP051                                                                     

     Lys-16 to Glu-50; Lys-57 to Asn-104; Ser-158 to Trp-173; Asp-265          

     to Pro-279; Val-368 to Tyr-386; Glu-420 to Ile-454; Pro-476 to            

     Ile-516; Phe-561 to Gly-581; Thr-606 to Gly-664; and Glu-676 to           

     Val-696.                                                                  

     SP052                                                                     

     Asn-41 to Tyr-60; Phe-80 to Glu-103; Ala-117 to Val-139; Ile-142 to       

     Leu-155; Val-190 to Lys-212; Glu-276 to Phe-283; Arg-290 to Ser-299;      

     Leu-328 to Val-351; Gly-358 to Thr-388; Glu-472 to Ala-483; Val-533       

     to Asn-561; Asp-595 to Val-606; Glu-609 to Val-620; Glu-672 to Ser-       

     691.                                                                      

     SP053                                                                     

     Ala-62 to Val-101; Thr-147 to Leu-174; Lys-204 to Val-216; Gln-228        

     to Val-262; Ser-277 to Gly-297; Thr-341 to Glyn-368; Thr-385 to Ala-      

     409; Thr-414 to Ser-453; Asn-461 to Leu-490; Glu-576 to Thr-625;          

     Gly-630 to Arg-639; and Asp-720 to Leu-740.                               

     SP054                                                                     

     Glu-7 to Val-28; and Tyr-33 to Glu-44.                                    

     SP055                                                                     

     Pro-3 to Val-18; Thr-21 to Lys-53; Val-84 to Lys-99; Ile-162 to Val-      

     172; and Val-204 to Ser-241.                                              

     SP056                                                                     

     Val-34 to Tyr-41; Leu-47 to Glu-55; and Pro-57 to Gln-66.                 

     SP057                                                                     

     Asp-1 to Val-25; Pro-29 to Ile-80; Asn-96 to Val-145; and Pro-150 to      

     Glu-172.                                                                  

     SP059                                                                     

     Ala-64 to Thr-70; Leu-82 to His-138; and Val-228 to Asn-236.              

     SP059                                                                     

     Val-10 to Thr-24; Ser-76 to Pro-102; Ser-109 to Ile-119; Ser-124 to       

     Val-130; Thr-186 to Ile-194; and Asn-234 to Ser-243.                      

     SP060                                                                     

     Leu-70 to Arg-76; and Val-79 to Ile-88.                                   

     SP062                                                                     

     Glu-14 to Lys-28; Ser-32 to Lys-46; and Glu-66 to Thr-74.                 

     SP063                                                                     

     Ile-10 to Val-25; Val-30 to Thr-40; Asp-44 to Pro-54; Asn-57 to Val-      

     63; Pro-71 to Val-100; and Thr-105 to Thr-116.                            

     SP064                                                                     

     Pro-12 to Leu-32; Val-40 to Leu-68; Asp-95 to Ala-125; Ser-164 to         

     Glu-184; Ser-314 to Glu-346; Asn-382 to Val-393; Leu-463 to Gln-498;      

     Asn-534 to Lys-548; and Lys-557 to Gly-605.                               

     SP065                                                                     

     Asn-2 to Ile-12; Ala-39 to Thr-61; and His-135 to Ala-155.                

     SP067                                                                     

     Gly-1 to Thr-13; Asp-203 to Asn-218; and Gly-240 to Asp-253.              

     SP068                                                                     

     Ser-2 to Ser-12; Val-17 to Gln-26; and Lys-54 to Cys-67.                  

     SP069                                                                     

     Ser-32 to Thr-41; Pro-66 to Glu-80; Thr-110 to Val-122; and Val-147       

     to Thr-180.                                                               

     SP070                                                                     

     Lys-6 to Tyr-16; Gln-19 to Ile-27; Arg-50 to Ala-58; Leu-112 to Val-      

     128; Ile-151 to Asn-167; Leu-305 to Phe-321.                              

     SP071                                                                     

     Gln-92 to Asn-158; Gln-171 to Gln-188; Val-204 to Val-240; Thr-247 to     

     Ala-273; Glu-279 to Thr-338; Pro-345 to Glu-368; Asn-483 to Lys-539;      

     Val-552 to Ala-568; Glu-575 to Ser-591; Ser-621 to Gly-640; Gln-742       

     to Gly-758.                                                               

     SP072                                                                     

     Val-68 to Tyr-81; Tyr-86 to Val-121; Leu-127 to Gly-140; Gly-144 to       

     Ala-155; Gln-168 to Val-185; Asp-210 to Try-241; Glu-246 to Thr-269;      

     Lys-275 to Tyr-295; Gly-303 to Pro-320; Arg-327 to Ile-335; Thr-338       

     to Thr-364; Tyr-478 to Phe-495; and Tyr-499 to Arg-521.                   

     SP073                                                                     

     Glu-37 to Val-45; Glu-55 to Val-68; Thr-104 to Thr-119; Ile-127 to        

     Tyr-135; Asn-220 to Ile-232; Thr-237 to Ala-250; Ser-253 to Ala-263;      

     Glu-284 to Ile-297; and Met-438 to Asn-455.                               

     SP074                                                                     

     Gly-2 to Ala-12; Gly-96 to Ile-110; and Thr-220 to Phe-239.               

     SP075                                                                     

     Phe-33 to Tyr-42; Gln-93 to Gly-102; and Val-196 to Asp-211.              

     SP076                                                                     

     Ser-64 to Leu-76; and Phe-81 to Ala-101.                                  

     BP077                                                                     

     Asp-1 to Glu-12; Tyr-26 to Val-36; and Val-51 to Try-62.                  

     SP078                                                                     

     Ala-193 to Ile-208; Tyr-266 to Asn-275; Glu-356 to Leu-369; Ala-411       

     to Gly-422; Ser-437 to Pro-464; Thr-492 to Glu-534; and Glu-571 to        

     Gln-508.                                                                  

     SP079                                                                     

     Gly-11 to Leu-20; Lys-39 to Leu-48; Leu-72 to Val-85; Asn-147 to Ser-     

     158; Ile-178 to Asp-187; Tyr-189 to Gln-201; and Leu-203 to Ala-216       

     SP080                                                                     

     Ser-2 to Glu-12; Gln-42 to Ala-51; Ala-116 to Ser-127; Phe-131 to         

     Asp-143; and Ile-159 to Ile-171.                                          

     SP081                                                                     

     Gln-2 to Leu-9; Gln-49 to Cys-57; Ile-108 to Val-131; Gly-134 to Leu-     

     145; and Trp-154 to Cys-162.                                              

     SP082                                                                     

     Ile-101 to Ser-187; Gly-191 to Asn-221; Arg-225 to Arg-236; Tyr-239       

     to Leu-255; and Gly-259 to Arg-268.                                       

     SP083                                                                     

     Ser-28 to Asp-70.                                                         

     SP084                                                                     

     Leu-42 to Gln-66; Thr-69 to Lys-81; Glu-83 to Arg-92; and Gly-98 to       

     Asn-110.                                                                  

     SP085                                                                     

     Gln-2 to Val-22; and Ser-45 to Glu-51.                                    

     SP086                                                                     

     Leu-18 to Gln-65; and Lys-72 to Val-83.                                   

     SP087                                                                     

     Ser-45 to Leu-53; and Thr-55 to Gln-63                                    

     SP088                                                                     

     Pro-8 to Ile-16; Leu-25 to Trp-33; Tyr-35 to Gln-43; Leu-51 to Val-59;    

     Val-59 to Arg-67; Thr-55 to Tyr-63; Asn-85 to Gly-93; Thr-107 to          

     Leu-115;                                                                  

     Leu-115 to Trp-123; Ala-121 to Thr-129; Tyr-153 to Ala-161; His-176 to    

     Gly-184; Tyr-194 to Ala-202; Ala-217 to Gly-225; and Asn-85 to Gly-93.    

     SP089                                                                     

     Trp-43 to Ala-51; Gln-68 to Phe-76; Val-93 to Gln-101; Phe-106 to         

     Phe-114; Lys-117 to Lys-125; Trp-148 to Phe-156; Glu-168 to Gln-176;      

     Ile-193 to Tyr-201; Lys-203 to Lys-211; Glu-212 to Gln-220; Ile-237 to    

     Tyr-245; Lys-247 to Lys-255; Glu-256 to Gln-264; Met-275 to Gly-283;      

     Lys-286 to Gly-294; Trp-292 to Glu-300; Asp-289 to Thr-297; Tyr-315 to    

     Ser-323; Asp-334 to Lys-342; Pro-371 to Arg-379; Arg-485 to Asn-493;      

     Lys-527 to Arg-535; Phe-537 to Met-545; and Tyr-549 to Glu-557.           

     SP090                                                                     

     Phe-2 to Gln-10; Gln-13 to Lys-21; Tyr-19 to Glu-27; Tyr-39 to Met-47;    

     Pro-65 to Leu-73; Tyr-121 to His-129; Lys-147 to Ile-155; Gly-161 to      

     Lys-169; Gly-218 to Trp-226; Asp-230 to Thr-238; Tyr-249 to Ala-257;      

     and Ala-272 to Gly-280.                                                   

     SP091                                                                     

     Ser-19 to Ser-27; Asn-25 to Thr-33; Val-51 to Gln-59; Asn-75 to Asn-83;   

     Ile-103 to Trp-111; Tyr-113 to Ala-121; Leu-175 to Asn-183; Glu-185 to    

     Trp-193; Ala-203 to Tyr-211; Val-250 to Phe-258; Asn-260 to Thr-268;      

     Ser-278 to Asp-286; Tyr-305 to Leu-313; Asn-316 to Gly-324; Asn-374 to    

     Asp-382; Asn-441 to Gly-449; and Ser-454 to Gln-462.                      

     SP092                                                                     

     Arg-95 to Glu-103; Ala-216 to Val-224; Leu-338 to Glu-346; Pro-350 to     

     Ala-358; Pro-359 to Ala-367; Pro-368 to Ala-376; Pro-377 to Ala-385;      

     Pro-386 to Ala-394; Pro-395 to Ala-403; Pro-350 to Ala-358; Gln-414 to    

     Lys-422; Pro-421 to Asn-429; Trp-465 to Tyr-473; Phe-487 to Tyr-495;      

     Asn-517 to Gly-525; Trp-586 to Tyr-594; Phe-608 to Tyr-616; and           

     Asp-630 to Gly-638.                                                       

     SP093                                                                     

     Gln-30 to Ile-38; Gln-52 to Val-60; Ala-108 to His-116; Tyr-133 to        

     Glu-141; Tyr-192 to Ala-200; and Phe-207 to Ser-215.                      

     SP094                                                                     

     Ala-87 to Val-95; Leu-110 to Cys-118; Gln-133 to Leu-141; Ser-185 to      

     Leu-193; Ile-195 to Gly-203; Asp-206 to Gln-214; Ser-211 to Gly-219;      

     Ile-241 to Thr-249.                                                       

     SP095                                                                     

     Arg-1 to Gln-9; Phe-7 to Asn-15; Thr-21 to Asn-30; Leu-46 to Phe-54;      

     and Ser-72 to Met-80.                                                     

     SP096                                                                     

     Gly-29 to Ile-37; Glu-52 to Ser-60; and Leu-64 to Gly-72.                 

     SP097                                                                     

     Ala-11 to Thr-19; Glu-53 to Glu-61; Ser-91 to Lys-99; Thr-123 to          

     Gln-131; and Gly-209 to Lys-217.                                          

     SP098                                                                     

     Thr-3 to Ser-11; Gly-38 to Phe-46; Tyr-175 to Asn-183; Met-187 to         

     Cys-195; Gln-197 to Leu-205; Tyr-307 to Gln-315; Gly-318 to Tyr-326;      

     Asn-348 to Val-356; Lys-377 to Pro-385; and Leu-415 to Val-423.           

     SP099                                                                     

     Arg-19 to Gly-27; Asp-76 to Ser-84; Val-90 to Lys-98; Phe-165 to          

     Val-173; Leu-237 to Pro-245.                                              

     SP100                                                                     

     His-111 to Gln-119; Ser-141 to His-149; Asp-154 to Ser-162; Gln-158       

     to Gln-166; Asp-154 to Gln-166; Lys-180 to Gln-188; and Ser-206 to        

     Gln-214.                                                                  

     SP101                                                                     

     Glu-23 to Glu-31; Glu-40 to Val-48; Gln-50 to Ser-58; Thr-61 to           

     Ile-69; Leu-82 to Ile-90; Ala-108 to Leu-116; Gln-121 to Pro-129;         

     and Leu-130 to Thr-138.                                                   

     SP102                                                                     

     Asp-32 to His-40; Arg-48 to Lys-56; and Asp-102 to Thr-110.               

     SP103                                                                     

     Arg-5 to Gln-13; Gln-22 to Leu-30; Arg-151 to Gln-159; Arg-167 to         

     Gln-175; Pro-189 to Glu-197; Gly-207 to Leu-215; Ser-219 to Gln-227;      

     Ser-233 to Ser-241; Pro-255 to Asp-264; Lys-272 to 6Gly-280; Ser-318      

     to Val-326; Thr-341 to Asp-351; Asn-356 to Thr-364; Val-370 to            

     Tyr-378;                                                                  

     Ile-379 to Gln-387; and Met-435 to Tyr-443.                               

     SP105                                                                     

     Asn-28 to Pro-36; Thr-77 to Phe-85; Arg-88 to Val-96; Gly-107 to          

     Phe-115; Asp-169 to Asp-177; His-248 to Ser-256; and Ser-274 to           

     Ala-282.                                                                  

     SP106                                                                     

     Val-10 to Thr-18; Ile-62 to Tyr-70; Ile-71 to Pro-79; Lys-86 to           

     Gln-94; Lys-100 to Thr-108; Phe-132 to Leu-140; and Asp-145 to            

     Arg-153.                                                                  

     SP107                                                                     

     Asp-33 to Val-41; and Arg-63 to Gln-71.                                   

     SP108                                                                     

     Lys-9 to Gln-17; Leu-44 to Ser-52; Ser-63 to Phe-71; Tyr-109 to           

     Ser-117; Ile-183 to Ile-191; Pro-194 to Leu-202; Gly-257 to Gln-265;      

     Ala-323 to Thr-331; and Leu-381 to Tyr-389.                               

     SP109                                                                     

     Asn-2 to Gln-10; Ala-65 to Lys-73; Leu-76 to Glu-84; Thr-111 to           

     Asp-119; Gln-116 to Tyr-124; Tyr-130 to Val-138; Asp-173 to Gly-181;      

     Asp-196 to Ser-204; Asn-231 to Ser-239; Phe-252 to Ser-260; Phe-270 to    

     Tyr-278; Val-291 to His-299; Asp-306 to Leu-314; and Pro-327 to           

     Gly-335.                                                                  

     SP110                                                                     

     Ser-8 to Glu-16; Ile-37 to Val-45; Ala-107 to Val-115; and Gly-122        

     to Thr-130.                                                               

     SP111                                                                     

     Asp-19 to Glu-28; Leu-43 to Ala-51; Asn-102 to Phe-110; Gln-133 to        

     Ser-141; Phe-162 to Asp-170; Tyr-194 to Met-202; and Asp-273 to           

     Ser-281.                                                                  

     SP112                                                                     

     Asp-3 to Gln-11; Gly-21 to Ile-29; Ala-46 to Arg-54; Arg-98 to            

     Arg-106; Thr-114 to Val-122; Gln-133 to Asn-141; and Leu-223 to           

     Thr-231                                                                   

     SP113                                                                     

     Asn-19 to Gly-27; Arg-54 to Ser-62; Val-69 to Gln-77; Ser-117 to          

     Asn-125; Gly-164 to Leu-172; Tyr-193 to Ser-201; Cys-303 to Phe-311;      

     His-315 to Ile-323; Arg-341 to Cys-349; Ile-347 to Ser-355; Arg-403       

     to Phe-411; Gln-484 to Pro-492; Ser-499 to Leu-507; Ile-541 to            

     Thr-549                                                                   

     Asn-622 to Ile-630; and Glu-645 to Gly-653.                               

     SP114                                                                     

     Gly-17 to Leu-25; His-40 to Gln-48; Arg-49 to Arg-57; Ile-65 to           

     Pro-73;                                                                   

     Asn-101 to Asp-111; Gly-128 to Cys-136; Phe-183 to Thr-191; and           

     Pro-268 to Ile-276.                                                       

     SP0115                                                                    

     Met-8 to Ser-16; Tyr-24 to Leu-32; Cys-68 to Leu-76; Ser-100 to           

     Pro-108; Thr-193 to Thr-201; Gly-238 to Pro-250; Thr-280 to Phe-288;      

     Pro-303 to Asn-312; Trp-319 to Leu-328; Leu-335 to Leu-344; Lys-395       

     to Ala-403; Asn-416 to Gln-424; Tyr-430 to Ser-438; Val-448 to            

     Leu-456; Leu-460 to Thr-468; Pro-502 to Thr-510; Lys-515 to               

     Ile-524; Gln-523 to His-532; Tyr-535 to Thr-543; Ser-559 to               

     Pro-567; Thr-572 to Asn-580;                                              

     Val-594 to Arg-602; Arg-603 to Asn-611; Thr-620 to Trp-628; and           

     Tyr-644 to Arg-653.                                                       

     BP117                                                                     

     Ala-6 to Gly-14; Ile-19 to Thr-27; Thr-99 to Leu-107; Ser-117 to          

     Asp-125; His-131 to Val-139; Ile-193 to Gly-201; and Val-241 to           

     Gln-249.                                                                  

     SP118                                                                     

     Ser-8 to Trp-23; His-46 to Ala-54; Asn-93 to Gly-101; Val-100 to          

     Ser-108; Arg-155 to Asp-163; and His-192 to Leu-200.                      

     SP119                                                                     

     Tyr-46 to Lys-54; Ser-93 to Ser-101; Trp-108 to Asn-116; Val-121 to       

     Glu-129; and Tyr-131 to Gln-139.                                          

     SP120                                                                     

     Ala-57 to Lys-65; Leu-68 to Glu-76; Thr-103 to Tyr-116; Tyr-122 to        

     Val-130; His-163 to Gly-173; Asp-188 to Ser-196; Ser-222 to Ser-231;      

     Phe-244 to Ser-252; Pro-262 to Tyr-270; Val-283 to His-291; and           

     Asp-298 to Leu-306.                                                       

     SP121                                                                     

     Ser-3 to Ala-11; Asp-13 to Leu-21; Ser-36 to Val-44; and Gln-136 to       

     Met-144.                                                                  

     SP122                                                                     

     Asn-28 to Lys-36; Glu-39 to Thr-50; Val-54 to Lys-62; Asn-106 to          

     Leu-114; Phe-159 to Gly-167; Asn-172 to Arg-180; Glu-199 to Asn-207;      

     Lys-230 to His-241; Asn-252 to Gly-263; Met-278 to Ala-287; Thr-346       

     to Asp-354; Lys-362 to Thr-370; Asp-392 to Asn-405; Asp-411 to            

     Ala-424; Gly-434 to Gly-443; Tyr-484 to Glu-492; Ile-511 to               

     Leu-519; Asn-524 to Asp-538; Glu-552 to Ile-567; Val-605 to Lys-613;      

     Phe-697 to Ala-705; Phe-722 to Leu-730; Leu-753 to Leu-761; Asp-787       

     to Gln-795; Leu-858 to Asn-866; Ala-892 to Thr-901; Gly-903 to            

     Ile-913; Ile-921 to Asn-931; Asn-938 to Pro-951; Gly-960 to               

     Lys-970; Leu-977 to Asp-985; and Leu-988 to Pro-996.                      

     SP123                                                                     

     Val-4 to Asn-12; Glu-47 to Leu-55; Lys-89 to Glu-100; Ser-165 to          

     Thr-173; Lys-234 to Val-242; Ser-258 to Ser-266; Glu-284 to Asn-292;      

     Tyr-327 to Leu-335; Tyr-457 to Thr-465; Tyr-493 to Glu-501; Thr-506       

     to Tyr-514; Lys-517 to Thr-525; Asn-532 to Gly-540; and Arg-556 to        

     Glu-564.                                                                  

     SP124                                                                     

     rg-16 to Glu-24; Gln-52 to Arg-60; Asn-69 to Tyr-77; Glu-121 to           

     Asn-129; Ala-134 to Val-142; Thr-151 to Ala-159; Asn-164 to Glu-172;      

     His-181 to His-189; Thr-210 to Ala-218; Ser-244 to Val-252; Phe-287       

     to Tyr-297; Ser-312 to Thr-323; His-433 to Tyr-441; Ser-445 to            

     Asn-453;                                                                  

     Asn-469 to Thr-477; Asn-501 to Asn-509; Gln-536 to Ala-547; and           

     Gln-608 to Asp-621.                                                       

     SP125                                                                     

     Ser-9 to Asp-21; Ala-28 to Leu-36; Asn-49 to Phe-57; Val-137 to           

     Arg-145; Asn-155 to Leu-163; Glu-183 to Asp-191; Gly-202 to Tyr-210;      

     Pro-221 to Asp-229; Phe-263 to Ala-271; Phe-300 to Gln-308; Asp-313       

     to Glu-321; Asn-324 to Asp-332; Ile-346 to Asn-354; Asp-362 to            

     Lys-370; Met-402 to Gly-410; Gly-437 to Gly-445; Ser-471 to Glu-483;      

     Gly-529 to Asp-537; Gln-555 to Val-563; and Leu-579 to Lys-587.           

     SP126                                                                     

     Leu-22 to Thr-30; Val-65 to Leu-73; and Thr-75 to Asp-83.                 

     SP127                                                                     

     Glu-2 to Ala-12; Asp-28 to Thr-36; Val-105 to Thr-113; Lys-121 to         

     Thr-129; Trp-138 to Pro-146; Ser-152 to Ile-160; Lys-180 to Asp-188;      

     Leu-194 to Asn-202; and Gly-228 to Thr-236.                               

     ______________________________________                                    

                                    TABLE 3                                 

     __________________________________________________________________________

     S. pneumoniae ORF Cloning Primers                                         

     Primer                                                                    

     Name                                                                      

         SEQ ID                                                                

             Sequence                       RE                                 

     __________________________________________________________________________

     SP001A                                                                    

         NO:227                                                                

             GACTGGATCCTAAAATCTACGACAATAAAAATC                                 

                                          Bam HI                               

     SP001B                                                                    

         NO:228                                                                

             CTGAGTCGACTGGTTGTGCTGGTTGAG   Sal I                               

     SP004A                                                                    

         NO:229                                                                

             GTCAGGATCCAAATTACAATACGGACTATG                                    

                                           Bam HI                              

     SP004B                                                                    

         NO:230                                                                

             CAGTGTCGACTAACTCTAGGTCGGAAAC  Sal I                               

     SP006A                                                                    

         NO:231                                                                

             GACTGGATCCTGAGAATCAAGCTACACCCAAAGAG                               

                                          Bam HI                               

     SP006B                                                                    

         NO:232                                                                

             AGTCAAGCTTTTGTAACTGAGATTGATCTGG                                   

                                          Hind III                             

     SP007A                                                                    

         NO:233                                                                

             GACTGGATCCTGGTAACCGCTCTTCTCGTAACGCAGC                             

                                          Bam HI                               

     SP007B                                                                    

         NO:234                                                                

             AGTCAAGCTTTTTCAGGAACTTTTACGCTTCC                                  

                                          Hind III                             

     SP008A                                                                    

         NO:235                                                                

             AGTCAGATCTTGTGGAAATTTGACAGGTAACAGCAAAAAAGCTGC                     

                                          Bgl II                               

     SP008B                                                                    

         NO:236                                                                

             ACTGAAGCTTTTTTGTTTTTCAAGAATTCATCG                                 

                                          Hind III                             

     SP009A                                                                    

         NO:237                                                                

             GACTGGATCCTGGTCAAGGAACTGCTTCTAAAGAC                               

                                          Bam HI                               

     SP009B                                                                    

         NO:238                                                                

             AGTCAAGCTTTCACAAATTCGTTGGTGAAGCC                                  

                                          Hind III                             

     SP010A                                                                    

         NO:239                                                                

             GACTGGATCCTAGCTCAGGTGGAAACGCTGGTTCATCC                            

                                          Bam HI                               

     SP010B                                                                    

         NO:240                                                                

             AGTCAAGCTTATCAACTTTTCCACCTTCAACAACC                               

                                          Hind III                             

     SP011A                                                                    

         NO:241                                                                

             GTCAAGATCTCTCCAACTATGGTAAATCTGCGGATGG                             

                                          Bgl II                               

     SP011B                                                                    

         NO:242                                                                

             AGTCCTGCAGATCCACATCCGCTTTCATCGGGTTAAAGAAGG                        

                                          Pst I                                

     SP012A                                                                    

         NO:243                                                                

             GACTGGATCCTGGGAAAAATTCTAGCGAAACTAGTGG                             

                                          Bam HI                               

     SP012B                                                                    

         NO:244                                                                

             GTCACTGCAGCTGTCCTTCTTTTACTTCTTTGGTTGC                             

                                          Pst I                                

     SP013A                                                                    

         NO:245                                                                

             GACTGGATCCTGCTAGCGGAAAAAAAGATACAACTTCTGG                          

                                          Bam HI                               

     SP013B                                                                    

         NO:246                                                                

             CTGAAAGCTTTTTTGCCAATCCTTCAGCAATCTTGTC                             

                                          Hind III                             

     SP014A                                                                    

         NO:247                                                                

             GACTAGATCTTGGCTCAAAAAATACAGCTTCAAGTCC                             

                                          Bgl II                               

     SP014B                                                                    

         NO:248                                                                

             AGTCCTGCAGGTTTTTGTTTGCTTGGTATTGGTCG                               

                                          Pst I                                

     SP015A                                                                    

         NO:249                                                                

             GACTGGATCCTAGTACAAACTCAAGCACTAGTCAGACAGAG                         

                                          Bam HI                               

     SP015B                                                                    

         NO:250                                                                

             CAGTCTGCAGTTTCAAAGCTTTTTGTATGTCTTC                                

                                          Pst I                                

     SP016A                                                                    

         NO:251                                                                

             GACTGGATCCTGGCAATTCTGGCGGAAGTAAAGATGC                             

                                          Bam HI                               

     SP016B                                                                    

         NO:252                                                                

             AGTCAAGCTTGTTTCATAGCTTTTTTGATTGTTTCG                              

                                          Hind III                             

     SP017A                                                                    

         NO:253                                                                

             GACTGGATCCTTCACAAGAAAAAACAAAAAATGAAGATGG                          

                                          Bam HI                               

     SP017B                                                                    

         NO:254                                                                

             AGTCAAGCTTATCGACGTAGTCTCCGCCTTC                                   

                                          Hind III                             

     SP019A                                                                    

         NO:255                                                                

             GACTGGATCCGAAAGGTCTGTGGTCAAATAATCTTACC                            

                                          Bam HI                               

     SP019B                                                                    

         NO:256                                                                

             AGTCAAGCTTAGAGTTAACATGGTGCTTGCCAATAGG                             

                                          Hind III                             

     SP020A                                                                    

         NO:257                                                                

             GACTGGATCCAAACTCAGAAAAGAAAGCAGACAATGC                             

                                          Bam HI                               

     SP020B                                                                    

         NO:258                                                                

             AGTCAAGCTTCCAAACTGGTTGATCCAAACCATCTG                              

                                          Hind III                             

     SP021A                                                                    

         NO:259                                                                

             GACTGGATCCTTCGAAAGGGTCAGAAGGTGCAGACC                              

                                          Bam HI                               

     SP021B                                                                    

         NO:260                                                                

             AGTCAAGCTTCTGTAGGCTTGGTGTGCCCCAGTTGC                              

                                          Hind III                             

     SP022A                                                                    

         NO:261                                                                

             CTGAGGATCCGGGGATGGCAGCTTTTAAAAATC                                 

                                          Bam HI                               

     SP022B                                                                    

         NO:262                                                                

             CAGTAAGCTTGTTTACCCATTCACCATTACC                                   

                                          Hind III                             

     SP023A                                                                    

         NO:263                                                                

             CAGTGGATCCAGACGAGCAAAAAATTAAG                                     

                                           Bam HI                              

     SP023B                                                                    

         NO:264                                                                

             TCAGAAGCTTGTTTACCCATTCACCATT  Hind III                            

     SP025A                                                                    

         NO:265                                                                

             GACTGGATCCCTGTGGTGAGGAAGAAACTAAAAAG                               

                                          Bam HI                               

     SP025B                                                                    

         NO:266                                                                

             CTGAGTCGACAATATTCTGTAGGAATGCTTCGAATTTG                            

                                          Sal I                                

     SP028A                                                                    

         NO:267                                                                

             CTGAGGATCCGACTTTTAACAATAAAACTATTGAAGAG                            

                                          Bam HI                               

     SP028B                                                                    

         NO:268                                                                

             GTCACTGCAGGTTGTCACCTCCAAAAATCACGG                                 

                                          Pst I                                

     SP030A                                                                    

         NO:269                                                                

             GACTGGATCCCTTTACAGGTAAACAACTACAAGTCGG                             

                                          Bam HI                               

     SP030B                                                                    

         NO:270                                                                

             CAGTAAGCTTTTCGAAGTTTGGCTCAGAATTG                                  

                                          Hind III                             

     SP031A                                                                    

         NO:271                                                                

             GACTGGATCCCCAGGCTGATACAAGTATCGCA                                  

                                          Bam HI                               

     SP031B                                                                    

         NO:272                                                                

             CAGTAAGCTTATCTGCAGTATGGCTAGATGG                                   

                                          Hind III                             

     SP032A                                                                    

         NO:273                                                                

             GACTGGATCCGTCTGTATCATTTGAAAACAAAGAAAC                             

                                          Bam HI                               

     SP032B                                                                    

         NO:274                                                                

             CAGTCTGCAGTTTTACTGTTGCTGTGCTTGTG                                  

                                          Pst I                                

     SP033A                                                                    

         NO:275                                                                

             ACTGAGATCTTGGTCAAAAGGAAAGTCAGACAGGAAAGG                           

                                          Bgl II                               

     SP033B                                                                    

         NO:276                                                                

             CAGTAAGCTTATTCCTGAGCTTTTTTGATAAAGGTTGCGCA                         

                                          Hind III                             

     SP034A                                                                    

         NO:277                                                                

             ACTGGGATCCGAAGGATAGATATATTTTAGCATTTGAGAC                          

                                          Bam HI                               

     SP034B                                                                    

         NO:278                                                                

             AGTCAAGCTTCCATGGTATCAAAGGCAAGACTTGG                               

                                          Hind III                             

     SP035A                                                                    

         NO:279                                                                

             GTCAGGATCCGGTAGTTAAAGTTGGTATTAACGG                                

                                          Bam HI                               

     SP035B                                                                    

         NO:280                                                                

             AGTCAAGCTTGCAATTTTTGCGAAGTATTCCAAGAG                              

                                          Hind III                             

     SP036A                                                                    

         NO:281                                                                

             AGTCGGATCCTTCTTACGAGTTGGGACTGTATCAAGC                             

                                          Bam HI                               

     SP036B                                                                    

         NO:282                                                                

             AGTCAAGCTTGTTTATTTTTTCCTTACTTACAGATGAAGG                          

                                          Hind III                             

     SP038A                                                                    

         NO:283                                                                

             AGTCGGATCCTACTGAGATGCATCATAATCTAGGAGC                             

                                          Bam HI                               

     SP038B                                                                    

         NO:284                                                                

             TCAGCTCGAGTTCTTTGACATCTCCATCATAAGTCGC                             

                                          Xho I                                

     SP039A                                                                    

         NO:285                                                                

             GACTGGATCCGGTTTTGAGAAAGTATTTGCAGGGG                               

                                          Bam HI                               

     SP039B                                                                    

         NO:286                                                                

             CAGTAAGCTTGGATTTTTTCATGGATGCAATTTTTTTGG                           

                                          Hind III                             

     SP040A                                                                    

         NO:287                                                                

             GACTGGATCCGACAACATTTACTATCCATACAGTAGAGTCAGC                       

                                          Bam HI                               

     SP040B                                                                    

         NO:288                                                                

             GACTAAGCTTGGCATAAGGTTGCAATTCTGGATTAATTGG                          

                                          Hind III                             

     SP041A                                                                    

         NO:289                                                                

             GACTGGATCCGGCTAAGGAAAGAGTGGATG                                    

                                          Bam HI                               

     SP041B                                                                    

         NO:290                                                                

             GACTAAGCTTTTCATTTTTAAATTGACTATGCGCCCG                             

                                          Hind III                             

     SP042A                                                                    

         NO:291                                                                

             GACTGGATCCTTGTTCCTATGAACTTGGTCGTCACC                              

                                          Bam HI                               

     SP042B                                                                    

         NO:292                                                                

             CATGAAGCTTATCCTGGATTTTTCCAAGTAAATCT                               

                                          Hind III                             

     SP043A                                                                    

         NO:293                                                                

             GACTGGATCCTTATAAGGGTGAATTAGAAAAAGG                                

                                          Bam HI                               

     SP043B                                                                    

         NO:294                                                                

             GACTAAGCTTCTTATTAGGATTGTTAGTAGTTG                                 

                                          Hind III                             

     SP044A                                                                    

         NO:295                                                                

             GACTGGATCCGAATGTTCAGGCTCAAGAAAGTTCAGG                             

                                          Bam HI                               

     SP044B                                                                    

         NO:296                                                                

             GACTAAGCTTTTCCCCTGATGGAGCAAAGTAATACC                              

                                          Hind III                             

     SP045A                                                                    

         NO:297                                                                

             GACTGGATCCCTTGGGTGTAACCCATATCCAGCTCCTTCC                          

                                          Bam HI                               

     SP045B                                                                    

         NO:298                                                                

             GACTGTCGACTTCAGCTTGTTTATCTGGGGTTGC                                

                                          Sal I                                

     SP046A                                                                    

         NO:299                                                                

             GACTGGATCCTAGTGATGGTACTTGGCAAGGAAAACAG                            

                                          Bam HI                               

     SP046B                                                                    

         NO:300                                                                

             ACTGCTGCAGATCTTTGCCACCTAGCTTCTCATTG                               

                                          Pst I                                

     SP048A                                                                    

         NO:301                                                                

             GTCAGGATCCTGGGATTCAATATGTCAGAGATGATACTAG                          

                                          Bam HI                               

     SP048B                                                                    

         NO:302                                                                

             CTAGAAGCTTACGCACCCATTCACCATTATCATTG                               

                                          Hind III                             

     SP049A                                                                    

         NO:303                                                                

             GTCAGGATCCGGATAATAGAGAAGCATTAAAAACC                               

                                          Bam HI                               

     SP049B                                                                    

         NO:304                                                                

             AGTCAAGCTTGACAAAATCTTGAAACTCCTCTGGTC                              

                                          Hind III                             

     SP050A                                                                    

         NO:305                                                                

             GTCAGGATCCAGATTTTGTCGAGGAGTGTCATACC                               

                                          Bam HI                               

     SP050B                                                                    

         NO:306                                                                

             AGTCAAGCTTTCCCTTTTTACCCTTACGAATCCAGG                              

                                          Hind III                             

     SP051A                                                                    

         NO:307                                                                

             GACTGGATCCATCTGTAGTTTATGCGGATGAAACACTTATTAC                       

                                          Bam HI                               

     SP051B                                                                    

         NO:308                                                                

             GACTGTCGACGCTTTGGTAGAGATAGAAGTCATG                                

                                          Sal I                                

     SP052A                                                                    

         NO:309                                                                

             GACTGGATCCTTACTTTGGTATCGTAGATACAGCCGGC                            

                                          Bam HI                               

     SP052B                                                                    

         NO:310                                                                

             AGTCAAGCTTTGTTAATTGCGTACCTTCTAAGCGACC                             

                                          Hind III                             

     SP053A                                                                    

         NO:311                                                                

             GACTGGATCCAGCTAAGGTTGCATGGGATGCGATTCG                             

                                          Bam HI                               

     SP053B                                                                    

         NO:312                                                                

             GACTGTCGACCTGGGCTTTATTAGTTTGACTAGC                                

                                          Sal I                                

     SP054A                                                                    

         NO:313                                                                

             CAGTGGATCCCTATCACTATGTAAATAAAGAGA                                 

                                          Bam HI                               

     SP054B                                                                    

         NO:314                                                                

             ACTGAAGCTTTTCTGTCCCTGTTTGAGGCA                                    

                                          Hind III                             

     SP055A                                                                    

         NO:315                                                                

             CAGTGGATCCTGAGACTCCTCAATCAATAACAAA                                

                                          Bam HI                               

     SP055B                                                                    

         NO:316                                                                

             ACGTAAGCTTATAATCAGTAGGAGAAACTGAACT                                

                                          Hind III                             

     SP056A                                                                    

         NO:317                                                                

             CAGTGGATCCGGATGCTCAAGAAACTGCGG                                    

                                          Bam HI                               

     SP056B                                                                    

         NO:318                                                                

             GACTAAGCTTTTGCCTCTCATTCTTGCTTCC                                   

                                          Hind III                             

     SP057A                                                                    

         NO:319                                                                

             CAGTGGATCCCGACAAAGGTGAGACTGAG                                     

                                           Bam HI                              

     SP057B                                                                    

         NO:320                                                                

             ACGTAAGCTTATTTCTTAATTCAAGTGTTTTCTCTG                              

                                          Hind III                             

     SP058A                                                                    

         NO:321                                                                

             GACTGGATCCAAATCAATTGGTAGCACAAGATCC                                

                                          Bam HI                               

     SP058B                                                                    

         NO:322                                                                

             CAGTGTCGACATTAGGAGCCACTGGTCTC                                     

                                           Sal I                               

     SP059A                                                                    

         NO:323                                                                

             CAGTGGATCCCAAACAGTCAGCTTCAGGAAC                                   

                                          Bam HI                               

     SP059B                                                                    

         NO:324                                                                

             GACTCTGCAGTTTAATCTTGTCCCAGGTGG                                    

                                          Pst I                                

     SP060A                                                                    

         NO:325                                                                

             GACTGGATCCATTCGATGATGCGGATGAAAAG                                  

                                          Bam HI                               

     SP060B                                                                    

         NO:326                                                                

             GACTAAGCTTCATTTGTCTTTGGGTATTTCGCA                                 

                                          Hind III                             

     SP062A                                                                    

         NO:327                                                                

             CAGTGGATCCGGAGAGTCGATCAAAAGTAG                                    

                                          Bam HI                               

     SP062B                                                                    

         NO:328                                                                

             GTCACTGCAGTTGCTCGTCTCGAGGTTC  Pst I                               

     SP063A                                                                    

         NO:329                                                                

             CAGTGGATCCATGGACAACAGGAAACTGGGAC                                  

                                          Bam HI                               

     SP063B                                                                    

         NO:330                                                                

             CAGTAAGCTTATTAGCTTCTGTACCTGTGTTTG                                 

                                          Hind III                             

     SP064A                                                                    

         NO:331                                                                

             GACTGGATCCCGATGGGCTCAATCCAACCCCAGGTCAAGTC                         

                                          Bam HI                               

     SP064B                                                                    

         NO:332                                                                

             GACTCTGCAGCATAGCTTTATCCTCTGACATCATCGTATC                          

                                          Pst I                                

     SP065A                                                                    

         NO:333                                                                

             GACTGGATCCTTCCAATCAAAAACAGGCAGATGG                                

                                          Bam HI                               

     SP065B                                                                    

         NO:334                                                                

             GACTAAGCTTGAGTCCCATAGTCCAAGGCA                                    

                                          Hind III                             

     SP067A                                                                    

         NO:335                                                                

             AGTCGGATCCTATCACAGGATCGAACGGTAAGACAACC                            

                                          Bam HI                               

     SP067B                                                                    

         NO:336                                                                

             ACTGGTCGACTTCTTTTAACTCCGCTACTGTGTC                                

                                          Sal I                                

     SP068A                                                                    

         NO:337                                                                

             CAGTGGATCCAAGTTCATCGAAGATGGTTGGGAAGTCC                            

                                          Bam HI                               

     SP068B                                                                    

         NO:338                                                                

             GATCGTCGACCCGCTCCCACATGCTCAACCTT                                  

                                          Sal I                                

     SP069A                                                                    

         NO:339                                                                

             TGACGGATCCATCGCTAGCTAGTGAAATGCAAGAAAG                             

                                          Bam HI                               

     SP069B                                                                    

         NO:340                                                                

             TGACAAGCTTATTCGTTTTTGAACTAGTTGCTTTCGT                             

                                          Hind III                             

     SP070A                                                                    

         NO:341                                                                

             GACTGGATCCGCACCAGATGGGGCACAAGGTTCAGGG                             

                                          Bam HI                               

     SP070B                                                                    

         NO:342                                                                

             TGACAAGCTTAACTTGTAACGAACAGTTCAATCTG                               

                                          Hind III                             

     SP071A                                                                    

         NO:343                                                                

             GACTAGATCTTTTTAACCCAACTGTTGGTACTTTCC                              

                                          Bgl II                               

     SP071B                                                                    

         NO:344                                                                

             TGACAAGCTTGTTAGGTGTTACATTTTGACCGTC                                

                                          Hind III                             

     SP072A                                                                    

         NO:345                                                                

             ACTGAGATCTTTTTAACCCAACTGTTGGTACTTTC                               

                                          Bgl II                               

     SP072B                                                                    

         NO:346                                                                

             GACTAAGCTTTCTACGATAACGATCATTTTCTTTACC                             

                                          Hind III                             

     SP073A                                                                    

         NO:347                                                                

             GACTGTCGACTCGTAGATATTTAAGTCTAAGTGAAGCG                            

                                          Sal I                                

     SP073B                                                                    

         NO:348                                                                

             AGTCAAGCTTGTTAGGTGTTACATTTTGCAAGTC                                

                                          Hind III                             

     SP074A                                                                    

         NO:349                                                                

             GACTGGATCCCTTTGGTTTTGAAGGAAGTAAG                                  

                                          Bam HI                               

     SP074B                                                                    

         NO:350                                                                

             TGACCTGCAGACGATTTTTGAAAAATGGAGGTGTATC                             

                                          Pst I                                

     SP075A                                                                    

         NO:351                                                                

             CAGTGGATCCCTACTACCTCTCGAGAGAAAG                                   

                                          Bam HI                               

     SP075B                                                                    

         NO:352                                                                

             ACTGAAGCTTTTCGCTTTTTACTCGTTTGACA                                  

                                          Hind III                             

     SP076A                                                                    

         NO:353                                                                

             CAGTGGATCCTAAGGTCAAAAGTCAGACCGCTAAGAAAGTGC                        

                                          Bam HI                               

     SP076B                                                                    

         NO:354                                                                

             CAGTAAGCTTTAGGGTATCCAAATACTGGTTGTTGATG                            

                                          Hind III                             

     SP077A                                                                    

         NO:355                                                                

             TGACAGATCTTGACGGGTCTCAGGATCAGACTCAGG                              

                                          Bgl II                               

     SP077B                                                                    

         NO:356                                                                

             TGACAAGCTTCAAAGACATCCACCTCTTGACCTTTG                              

                                          Hind III                             

     SP078A                                                                    

         NO:357                                                                

             GACTGGATCCTAGAGGCTTTGCCAAATGGTGGGAAGGG                            

                                          Bam HI                               

     SP078B                                                                    

         NO:358                                                                

             GTCAGTCGACTTGTTGTAACACTTTTCGAGGTTTGGTACC                          

                                          Sal I                                

     SP079A                                                                    

         NO:359                                                                

             CAGTGGATCCTCAAAAAGAGAAGGAAAACTTGG                                 

                                          Bam HI                               

     SP079B                                                                    

         NO:360                                                                

             CAGTCTGCAGTTTCTTCAACAAACCTTGTTCTTG                                

                                          Pst I                                

     SP080A                                                                    

         NO:361                                                                

             CAGTGGATCCACGTTCTATTGAGGACCACTT                                   

                                          Bam HI                               

     SP080B                                                                    

         NO:362                                                                

             CAGTAAGCTTTTCCTTCTCAGTCAATTCTTTTCC                                

                                          Hind III                             

     SP081A                                                                    

         NO:363                                                                

             GACTGGATCCCGCTCAAAATACCAGAGGTGTTCAG                               

                                          Bam HI                               

     SP081B                                                                    

         NO:364                                                                

             GACTAAGCTTAGTACCATGGGTGTGACAGGTTTGAA                              

                                          Hind III                             

     SP082A                                                                    

         NO:365                                                                

             CTGAGGATCCAATTGTACAATTAGAAAAAGATAGC                               

                                          Bam HI                               

     SP082B                                                                    

         NO:366                                                                

             TGACAAGCTTGCGTTGACTAGGTTCTGCAATGCC                                

                                          Hind III                             

     SP083A                                                                    

         NO:367                                                                

             GACTGGATCCTCTGACCAAGCAAAAAGAAGCAGTCAATGA                          

                                          Bam HI                               

     SP083B                                                                    

         NO:368                                                                

             TCAGCAGCTGATCATTGACTTTACGATTTGCTCC                                

                                          Bgl II                               

     SP084A                                                                    

         NO:369                                                                

             GACTGGATCCGTCCGGCTCTGTCCAGTCCACTTTTTCAGCG                         

                                          Bam HI                               

     SP084B                                                                    

         NO:370                                                                

             TCAGAAGCTTATTTTTTGTTTCCTTAATGCGTT                                 

                                          Hind III                             

     SP085A                                                                    

         NO:371                                                                

             GACTGGATCCGGGACAAATTCAAAAAAATAGGCAAGAGG                           

                                          Bam HI                               

     SP085B                                                                    

         NO:372                                                                

             GTCAAAGCTTTGGCTCTTTGATTGCCAACAACTG                                

                                          Hind III                             

     SP086A                                                                    

         NO:373                                                                

             GACTGGATCCTCGCTACCAGCAACAAAGCGAGCAAAAGG                           

                                          Bam HI                               

     SP086B                                                                    

         NO:374                                                                

             GACTAAGCTTACTTTTTTCTTTTTCCACACGA                                  

                                          Hind III                             

     SP087A                                                                    

         NO:375                                                                

             CAGTGGATCCGAACCGACAAGTCGCCCACTATCAAGACT                           

                                          Bam HI                               

     SP087B                                                                    

         NO:376                                                                

             CTGAAAGCTTTGAATTCTCTTTCTTTTCAGGCT                                 

                                          Hind III                             

     SP088A                                                                    

         NO:377                                                                

             TCGAGGATCCGGTTGTCGGCTGGCAATATATCCCGT                              

                                          Bam HI                               

     SP088B                                                                    

         NO:378                                                                

             CAGTAAGCTTCCGAACCCATTCGCCATTATAGTTGAC                             

                                          Hind III                             

     SP089A                                                                    

         NO:379                                                                

             AGTCGGATCCGGCCAAATCAGAATGGGTAGAAGAC                               

                                          Bam HI                               

     SP089B                                                                    

         NO:380                                                                

             TGACCTGCAGCTTCTCATTGATTTTCATCATCAC                                

                                          Pst I                                

     SP090A                                                                    

         NO:381                                                                

             GACTGGATCCATTTGCAGATGATTCTGAAGGATGG                               

                                          Bam HI                               

     SP090B                                                                    

         NO:382                                                                

             TCAGCTGCAGCTTAACCCATTCACCATTCTAGTTTAAG                            

                                          Pst I                                

     SP091A                                                                    

         NO:383                                                                

             GACTGGATCCTGTCGCTGCAAATGAAACTGAAGTAGC                             

                                          Bam HI                               

     SP091B                                                                    

         NO:384                                                                

             GACTAAGCTTATACCAAACGCTGACATCTACGCG                                

                                          Hind III                             

     SP092A                                                                    

         NO:385                                                                

             AGTCAGATCTTACGTCTCAGCCTACTTTTGTAAGAGC                             

                                          Bgl II                               

     SP092B                                                                    

         NO:386                                                                

             GACTAAGCTTAACCCATTCACCATTGGCATTGAC                                

                                          Hind III                             

     SP093A                                                                    

         NO:387                                                                

             CAGTGGATCCTGGACAGGTGAAAGGTCATGCTACATTTGTG                         

                                          Bam HI                               

     SP093B                                                                    

         NO:388                                                                

             GACTAAGCTTCAACCATTGAGACCTTGCAACAC                                 

                                          Hind III                             

     SP094A                                                                    

         NO:389                                                                

             GTCAGGATCCGATTGCTCCTTTGAAGGATTTGAGAGAAACC                         

                                          Bam HI                               

     SP094B                                                                    

         NO:390                                                                

             GACTAAGCTTCGATCAAAGATAAGATAAATATATATAAAGT                         

                                          Hind III                             

     SP095A                                                                    

         NO:391                                                                

             GACTGGATCCTAGGTCATATGGGACTTTTTTTCTACAACAAAATAGG                   

                                          Bam HI                               

     SP095B                                                                    

         NO:392                                                                

             TGACAAGCTTATCTATCAGCTCATTTAATCGTTTTTG                             

                                          Hind III                             

     SP096A                                                                    

         NO:393                                                                

             CTGAGGATCCCAACGTTGAGAATTATTTGCGAATG                               

                                          Bam HI                               

     SP096B                                                                    

         NO:394                                                                

             TGACAAGCTTGAGTCTACAAAAGTAATGTAC                                   

                                          Hind III                             

     SP097A                                                                    

         NO:395                                                                

             GTCAGGATCCCTACTATCAATCAAGTTCTTCAGCC                               

                                          Bam HI                               

     SP097B                                                                    

         NO:396                                                                

             TGACAAGCTTGACTGAGGCTTGGACCAGATTGAAAAG                             

                                          Hind III                             

     SP098A                                                                    

         NO:397                                                                

             GACTGGATCCGACAAAAACATTAAAACGTCCTGAGG                              

                                          Bam HI                               

     SP098B                                                                    

         NO:398                                                                

             GACTAAGCTTAGCACGAACTGTGACGCTGGTTCC                                

                                          Hind III                             

     SP099A                                                                    

         NO:399                                                                

             GACTGGATCCTTCTCAGGAGACCTTTAAAAATATC                               

                                          Bam HI                               

     SP099B                                                                    

         NO:400                                                                

             GACTAAGCTTGTTGGCCATCTTGTACATACC                                   

                                          Hind III                             

     SP100A                                                                    

         NO:401                                                                

             GACTGGATCCAGTAAATGCGCAATCAAATTC                                   

                                          Bam HI                               

     SP100B                                                                    

         NO:402                                                                

             AGTCCTGCAGGTATTTAGCCCAATAATCTATAAAGCT                             

                                          Pst I                                

     SP101A                                                                    

         NO:403                                                                

             CAGTGGATCCTTACCGCGTTCATCAAGATGTC                                  

                                          Bam HI                               

     SP101B                                                                    

         NO:404                                                                

             GACTAAGCTTGCCAGATGTTGAAAAGAGAGTG                                  

                                          Hind III                             

     SP102A                                                                    

         NO:405                                                                

             GACTGGATCCGTGGATGGGCTTTAACTATCTTCGTATTCG                          

                                          Bam HI                               

     SP102B                                                                    

         NO:406                                                                

             AGTCAAGCTTGCTAGTCTTCACTTTCCCTTTCC                                 

                                          Hind III                             

     SP103A                                                                    

         NO:407                                                                

             GACTGTCGACACTAAACCAGCATCGTTCGCAGGA                                

                                          Sal I                                

     SP103B                                                                    

         NO:408                                                                

             CTGACTGCAGCTTCTTGAAGAAATAATGATTGTGG                               

                                          Pst I                                

     SP105A                                                                    

         NO:409                                                                

             CAGTGGATCCTGACTACCTTGAAATCCCACTT                                  

                                          Bam HI                               

     SP105B                                                                    

         NO:410                                                                

             CAGTAAGCTTTTTTTTAAGGTTGTAGAATGATTTCAATC                           

                                          Hind III                             

     SP106A                                                                    

         NO:411                                                                

             CAGTGTCGACTCGTATCTTTTTTTGGAGCAATGTT                               

                                          Sal I                                

     SP106B                                                                    

         NO:412                                                                

             GACTAAGCTTAAATGTTCCGATACGGGTGATTG                                 

                                          Hind III                             

     SP107A                                                                    

         NO:413                                                                

             CAGTGGATCCGGACTCTCTCAAAGATGTGAAAG                                 

                                          Bam HI                               

     SP107B                                                                    

         NO:414                                                                

             GACTAAGCTTCTTGAGTTTGTCAAGGATTGCTTT                                

                                          Hind III                             

     SP108A                                                                    

         NO:415                                                                

             CAGTGGATCCCAAGAAATCCTATCATCTCTTCCAGAAG                            

                                          Bam HI                               

     SP108B                                                                    

         NO:416                                                                

             GACTAAGCTTTTCAGAACTAAAAGCCGCAGCTT                                 

                                          Hind III                             

     SP109A                                                                    

         NO:417                                                                

             GACTGGATCCACGAAATGCAGGGCAGACAG                                    

                                           Bam HI                              

     SP109B                                                                    

         NO:418                                                                

             CAGTAAGCTTATCAACATAATCTAGTAAATAAGCGT                              

                                          Hind III                             

     SP110A                                                                    

         NO:419                                                                

             CAGTGGATCCTGTATAGTTTTTAGCGCTTGTTCTTC                              

                                          Bam HI                               

     SP110B                                                                    

         NO:420                                                                

             GTCAAAGCTTTGATAGAGTGTCATAATCTTCTTTAG                              

                                          Hind III                             

     SP111A                                                                    

         NO:421                                                                

             GACTGGATCCGTGTGTCGAGCATATTCTGAAG                                  

                                          Bam HI                               

     SP111B                                                                    

         NO:422                                                                

             CAGTAAGCTTACTTTTACCATTTCTTTGTTCTGCATC                             

                                          Hind III                             

     SP112A                                                                    

         NO:423                                                                

             GACTGTCGACGTGTTTGGATAGCATTCAGAATCAGACG                            

                                          Sal I                                

     SP112B                                                                    

         NO:424                                                                

             CAGTAAGCTTCGGAAGTAAAGACAATTTTTCC                                  

                                          Hind III                             

     SP113A                                                                    

         NO:425                                                                

             CAGTGGATCCGTGCCTAGATAGTATTATTACTCAAAC                             

                                          Bam HI                               

     SP113B                                                                    

         NO:426                                                                

             GACTAAGCTTTTTGCTTATTTCTCTCAATTTTTC                                

                                          Hind III                             

     SP114A                                                                    

         NO:427                                                                

             CAGTGGATCCCATTCAGAAGCAGACCTATCAAAATC                              

                                          Bam HI                               

     SP114B                                                                    

         NO:428                                                                

             ACTGAAGCTTATGTAATTTTTTAGATTTTTCAATATTTTTCAG                       

                                          Hind III                             

     SP115A                                                                    

         NO:429                                                                

             AGTCGGATCCTAAGGCTGATAATCGTGTTCAAATG                               

                                          Bam HI                               

     SP115B                                                                    

         NO:430                                                                

             GACTAAGCTTAAAATTAGATAGACGTTGAGT                                   

                                          Hind III                             

     SP117A                                                                    

         NO:431                                                                

             AGTCGGATCCCTGTGGCAATCAGTCAGCTGCTTCC                               

                                          Bam HI                               

     SP117B                                                                    

         NO:432                                                                

             GACTGTCGACTTTAATCTTGTCCCAGGTGGTTAATTTGCC                          

                                          Sal I                                

     SP118A                                                                    

         NO:433                                                                

             ACTGGTCGACTTGTCAACAACAACATGCTACTTCTGAG                            

                                          Sal I                                

     SP118B                                                                    

         NO:434                                                                

             GACTCTGCAGAAGTTTAACCCACTTATCATTATCC                               

                                          Pst I                                

     SP119A                                                                    

         NO:435                                                                

             ACTGGGATCCTTGTTCAGGCAAGTCCGTGACTAGTGAAC                           

                                          Bam HI                               

     SP119B                                                                    

         NO:436                                                                

             GACTAAGCTTGGCTAATTCCTTCAAAGTTTGCA                                 

                                          Hind III                             

     SP120A                                                                    

         NO:437                                                                

             AGTCGGATCCCTCGCAAATTGAAAAGGCGGCAGTTAGCC                           

                                          Bam HI                               

     SP120B                                                                    

         NO:438                                                                

             GACTAAGCTTGTAAATAAGCGTACCTTTTTCTTCC                               

                                          Hind III                             

     SP121A                                                                    

         NO:439                                                                

             TCAGGGATCCTTGTCAGTCAGGTTCTAATGGTTCTCAG                            

                                          Bam HI                               

     SP121B                                                                    

         NO:440                                                                

             AGTCAAGCTTGGCATTGGCGTCGCCGTCCTTC                                  

                                          Hind III                             

     SP122A                                                                    

         NO:441                                                                

             GACTGGATCCGGAAACTTCACAGGATTTTAAAGAGAAG                            

                                          Bam HI                               

     SP122B                                                                    

         NO:442                                                                

             GACTGTCGACAATCAATCCTTCTTCTGCACTTCT                                

                                          Sal I                                

     SP123A                                                                    

         NO:443                                                                

             CAGTGGATCCTGTGGTCGAAGTTGAGACTCCTCAATC                             

                                          Bam HI                               

     SP123B                                                                    

         NO:444                                                                

             GACTAAGCTTTTCTTCAAATTTATTATCAGC                                   

                                          Hind III                             

     SP124A                                                                    

         NO:445                                                                

             AGTCGGATCCAACACCTGTATATAAAGTTACAGCAATCG                           

                                          Bam HI                               

     SP124B                                                                    

         NO:446                                                                

             GACTGTCGACTACTTGACCGAATGCGTCGAATGTACG                             

                                          Sal I                                

     SP125A                                                                    

         NO:447                                                                

             CTGAGGATCCATTAGACAGATTAATTGAAATCGG                                

                                          Bam HI                               

     SP125B                                                                    

         NO:448                                                                

             GACTGTCGACTTTAAAGATTGAAGTTTTAAAGCT                                

                                          Sal I                                

     SP126A                                                                    

         NO:449                                                                

             TGACGGATCCTAAGACAGATGAACGGAGCAAGGTG                               

                                          Bam HI                               

     SP126B                                                                    

         NO:450                                                                

             CTGAAAGCTTTAAGGCTTCCTCAATGAGTTTGTCT                               

                                          Hind III                             

     SP127A                                                                    

         NO:451                                                                

             GACTGGATCCCTGTGAGAATCAAGCTACACCCA                                 

                                          Bam HI                               

     SP127B                                                                    

         NO:452                                                                

             CTGAAAGCTTTTGTAACTGAGATTGATCTGGGAG                                

                                          Hind III                             

     __________________________________________________________________________

  __________________________________________________________________________

     #             SEQUENCE LISTING                                            

     - (1) GENERAL INFORMATION:                                                

     -    (iii) NUMBER OF SEQUENCES: 452                                       

     - (2) INFORMATION FOR SEQ ID NO: 1:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1999 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #1:   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TAAAATCTAC GACAATAAAA ATCAACTCAT TGCTGACTTG GGTTCTGAAC GC - #CGCGTCAA   

       60                                                                      

     - TGCCCAAGCT AATGATATTC CCACAGATTT GGTTAAGGCA ATCGTTTCTA TC - #GAAGACCA   

      120                                                                      

     - TCGCTTCTTC GACCACAGGG GGATTGATAC CATCCGTATC CTGGGAGCTT TC - #TTGCGCAA   

      180                                                                      

     - TCTGCAAAGC AATTCCCTCC AAGGTGGATC AACTCTCACC CAACAGTTGA TT - #AAGTTGAC   

      240                                                                      

     - TTACTTTTCA ACTTCGACTT CCGACCAGAC TATTTCTCGT AAGGCTCAGG AA - #GCTTGGTT   

      300                                                                      

     - AGCGATTCAG TTAGAACAAA AAGCAACCAA GCAAGAAATC TTGACCTACT AT - #ATAAATAA   

      360                                                                      

     - GGTCTACATG TCTAATGGGA ACTATGGAAT GCAGACAGCA GCTCAAAACT AC - #TATGGTAA   

      420                                                                      

     - AGACCTCAAT AATTTAAGTT TACCTCAGTT AGCCTTGCTG GCTGGAATGC CT - #CAGGCACC   

      480                                                                      

     - AAACCAATAT GACCCCTATT CACATCCAGA AGCAGCCCAA GACCGCCGAA AC - #TTGGTCTT   

      540                                                                      

     - ATCTGAAATG AAAAATCAAG GCTACATCTC TGCTGAACAG TATGAGAAAG CA - #GTCAATAC   

      600                                                                      

     - ACCAATTACT GATGGACTAC AAAGTCTCAA ATCAGCAAGT AATTACCCTG CT - #TACATGGA   

      660                                                                      

     - TAATTACCTC AAGGAAGTCA TCAATCAAGT TGAAGAAGAA ACAGGCTATA AC - #CTACTCAC   

      720                                                                      

     - AACTGGGATG GATGTCTACA CAAATGTAGA CCAAGAAGCT CAAAAACATC TG - #TGGGATAT   

      780                                                                      

     - TTACAATACA GACGAATACG TTGCCTATCC AGACGATGAA TTGCAAGTCG CT - #TCTACCAT   

      840                                                                      

     - TGTTGATGTT TCTAACGGTA AAGTCATTGC CCAGCTAGGA GCACGCCATC AG - #TCAAGTAA   

      900                                                                      

     - TGTTTCCTTC GGAATTAACC AAGCAGTAGA AACAAACCGC GACTGGGGAT CA - #ACTATGAA   

      960                                                                      

     - ACCGATCACA GACTATGCTC CTGCCTTGGA GTACGGTGTC TACGATTCAA CT - #GCTACTAT   

     1020                                                                      

     - CGTTCACGAT GAGCCCTATA ACTACCCTGG GACAAATACT CCTGTTTATA AC - #TGGGATAG   

     1080                                                                      

     - GGGCTACTTT GGCAACATCA CCTTGCAATA CGCCCTGCAA CAATCGCGAA AC - #GTCCCAGC   

     1140                                                                      

     - CGTGGAAACT CTAAACAAGG TCGGACTCAA CCGCGCCAAG ACTTTCCTAA AT - #GGTCTAGG   

     1200                                                                      

     - AATCGACTAC CCAAGTATTC ACTACTCAAA TGCCATTTCA AGTAACACAA CC - #GAATCAGA   

     1260                                                                      

     - CAAAAAATAT GGAGCAAGTA GTGAAAAGAT GGCTGCTGCT TACGCTGCCT TT - #GCAAATGG   

     1320                                                                      

     - TGGAACTTAC TATAAACCAA TGTATATCCA TAAAGTCGTC TTTAGTGATG GG - #AGTGAAAA   

     1380                                                                      

     - AGAGTTCTCT AATGTCGGAA CTCGTGCCAT GAAGGAAACG ACAGCCTATA TG - #ATGACCGA   

     1440                                                                      

     - CATGATGAAA ACAGTCTTGA CTTATGGAAC TGGACGAAAT GCCTATCTTG CT - #TGGCTCCC   

     1500                                                                      

     - TCAGGCTGGT AAAACAGGAA CCTCTAACTA TACAGACGAG GAAATTGAAA AC - #CACATCAA   

     1560                                                                      

     - GACCTCTCAA TTTGTAGCAC CTGATGAACT ATTTGCTGGC TATACGCGTA AA - #TATTCAAT   

     1620                                                                      

     - GGCTGTATGG ACAGGCTATT CTAACCGTCT GACACCACTT GTAGGCAATG GC - #CTTACGGT   

     1680                                                                      

     - CGCTGCCAAA GTTTACCGCT CTATGATGAC CTACCTGTCT GAAGGAAGCA AT - #CCAGAAGA   

     1740                                                                      

     - TTGGAATATA CCAGAGGGGC TCTACAGAAA TGGAGAATTC GTATTTAAAA AT - #GGTGCTCG   

     1800                                                                      

     - TTCTACGTGG AACTCACCTG CTCCACAACA ACCCCCATCA ACTGAAAGTT CA - #AGCTCATC   

     1860                                                                      

     - ATCAGATAGT TCAACTTCAC AGTCTAGCTC AACCACTCCA AGCACAAATA AT - #AGTACGAC   

     1920                                                                      

     - TACCAATCCT AACAATAATA CGCAACAATC AAATACAACC CCTGATCAAC AA - #AATCAGAA   

     1980                                                                      

     #                 199 - #9                                                

     - (2) INFORMATION FOR SEQ ID NO:2:                                        

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 666 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                             

     - Lys Ile Tyr Asp Asn Lys Asn Gln Leu Ile Al - #a Asp Leu Gly Ser Glu     

     #                15                                                       

     - Arg Arg Val Asn Ala Gln Ala Asn Asp Ile Pr - #o Thr Asp Leu Val Lys     

     #            30                                                           

     - Ala Ile Val Ser Ile Glu Asp His Arg Phe Ph - #e Asp His Arg Gly Ile     

     #        45                                                               

     - Asp Thr Ile Arg Ile Leu Gly Ala Phe Leu Ar - #g Asn Leu Gln Ser Asn     

     #    60                                                                   

     - Ser Leu Gln Gly Gly Ser Thr Leu Thr Gln Gl - #n Leu Ile Lys Leu Thr     

     #80                                                                       

     - Tyr Phe Ser Thr Ser Thr Ser Asp Gln Thr Il - #e Ser Arg Lys Ala Gln     

     #                95                                                       

     - Glu Ala Trp Leu Ala Ile Gln Leu Glu Gln Ly - #s Ala Thr Lys Gln Glu     

     #           110                                                           

     - Ile Leu Thr Tyr Tyr Ile Asn Lys Val Tyr Me - #t Ser Asn Gly Asn Tyr     

     #       125                                                               

     - Gly Met Gln Thr Ala Ala Gln Asn Tyr Tyr Gl - #y Lys Asp Leu Asn Asn     

     #   140                                                                   

     - Leu Ser Leu Pro Gln Leu Ala Leu Leu Ala Gl - #y Met Pro Gln Ala Pro     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Asn Gln Tyr Asp Pro Tyr Ser His Pro Glu Al - #a Ala Gln Asp Arg Arg     

     #               175                                                       

     - Asn Leu Val Leu Ser Glu Met Lys Asn Gln Gl - #y Tyr Ile Ser Ala Glu     

     #           190                                                           

     - Gln Tyr Glu Lys Ala Val Asn Thr Pro Ile Th - #r Asp Gly Leu Gln Ser     

     #       205                                                               

     - Leu Lys Ser Ala Ser Asn Tyr Pro Ala Tyr Me - #t Asp Asn Tyr Leu Lys     

     #   220                                                                   

     - Glu Val Ile Asn Gln Val Glu Glu Glu Thr Gl - #y Tyr Asn Leu Leu Thr     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Thr Gly Met Asp Val Tyr Thr Asn Val Asp Gl - #n Glu Ala Gln Lys His     

     #               255                                                       

     - Leu Trp Asp Ile Tyr Asn Thr Asp Glu Tyr Va - #l Ala Tyr Pro Asp Asp     

     #           270                                                           

     - Glu Leu Gln Val Ala Ser Thr Ile Val Asp Va - #l Ser Asn Gly Lys Val     

     #       285                                                               

     - Ile Ala Gln Leu Gly Ala Arg His Gln Ser Se - #r Asn Val Ser Phe Gly     

     #   300                                                                   

     - Ile Asn Gln Ala Val Glu Thr Asn Arg Asp Tr - #p Gly Ser Thr Met Lys     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Pro Ile Thr Asp Tyr Ala Pro Ala Leu Glu Ty - #r Gly Val Tyr Asp Ser     

     #               335                                                       

     - Thr Ala Thr Ile Val His Asp Glu Pro Tyr As - #n Tyr Pro Gly Thr Asn     

     #           350                                                           

     - Thr Pro Val Tyr Asn Trp Asp Arg Gly Tyr Ph - #e Gly Asn Ile Thr Leu     

     #       365                                                               

     - Gln Tyr Ala Leu Gln Gln Ser Arg Asn Val Pr - #o Ala Val Glu Thr Leu     

     #   380                                                                   

     - Asn Lys Val Gly Leu Asn Arg Ala Lys Thr Ph - #e Leu Asn Gly Leu Gly     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Ile Asp Tyr Pro Ser Ile His Tyr Ser Asn Al - #a Ile Ser Ser Asn Thr     

     #               415                                                       

     - Thr Glu Ser Asp Lys Lys Tyr Gly Ala Ser Se - #r Glu Lys Met Ala Ala     

     #           430                                                           

     - Ala Tyr Ala Ala Phe Ala Asn Gly Gly Thr Ty - #r Tyr Lys Pro Met Tyr     

     #       445                                                               

     - Ile His Lys Val Val Phe Ser Asp Gly Ser Gl - #u Lys Glu Phe Ser Asn     

     #   460                                                                   

     - Val Gly Thr Arg Ala Met Lys Glu Thr Thr Al - #a Tyr Met Met Thr Asp     

     465                 4 - #70                 4 - #75                 4 -   

     #80                                                                       

     - Met Met Lys Thr Val Leu Thr Tyr Gly Thr Gl - #y Arg Asn Ala Tyr Leu     

     #               495                                                       

     - Ala Trp Leu Pro Gln Ala Gly Lys Thr Gly Th - #r Ser Asn Tyr Thr Asp     

     #           510                                                           

     - Glu Glu Ile Glu Asn His Ile Lys Thr Ser Gl - #n Phe Val Ala Pro Asp     

     #       525                                                               

     - Glu Leu Phe Ala Gly Tyr Thr Arg Lys Tyr Se - #r Met Ala Val Trp Thr     

     #   540                                                                   

     - Gly Tyr Ser Asn Arg Leu Thr Pro Leu Val Gl - #y Asn Gly Leu Thr Val     

     545                 5 - #50                 5 - #55                 5 -   

     #60                                                                       

     - Ala Ala Lys Val Tyr Arg Ser Met Met Thr Ty - #r Leu Ser Glu Gly Ser     

     #               575                                                       

     - Asn Pro Glu Asp Trp Asn Ile Pro Glu Gly Le - #u Tyr Arg Asn Gly Glu     

     #           590                                                           

     - Phe Val Phe Lys Asn Gly Ala Arg Ser Thr Tr - #p Asn Ser Pro Ala Pro     

     #       605                                                               

     - Gln Gln Pro Pro Ser Thr Glu Ser Ser Ser Se - #r Ser Ser Asp Ser Ser     

     #   620                                                                   

     - Thr Ser Gln Ser Ser Ser Thr Thr Pro Ser Th - #r Asn Asn Ser Thr Thr     

     625                 6 - #30                 6 - #35                 6 -   

     #40                                                                       

     - Thr Asn Pro Asn Asn Asn Thr Gln Gln Ser As - #n Thr Thr Pro Asp Gln     

     #               655                                                       

     - Gln Asn Gln Asn Pro Gln Pro Ala Gln Pro                                 

     #           665                                                           

     - (2) INFORMATION FOR SEQ ID NO: 3:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1714 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #3:   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - AAATTACAAT ACGGACTATG AATTGACCTC TGGAGAAAAA TTACCTCTTC CT - #AAAGAGAT   

       60                                                                      

     - TTCAGGTTAC ACTTATATTG GATATATCAA AGAGGGAAAA ACGACTTCTG AG - #TCTGAAGT   

      120                                                                      

     - AAGTAATCAA AAGAGTTCAG TTGCCACTCC TACAAAACAA CAAAAGGTGG AT - #TATAATGT   

      180                                                                      

     - TACACCGAAT TTTGTAGACC ATCCATCAAC AGTACAAGCT ATTCAGGAAC AA - #ACACCTGT   

      240                                                                      

     - TTCTTCAACT AAGCCGACAG AAGTTCAAGT AGTTGAAAAA CCTTTCTCTA CT - #GAATTAAT   

      300                                                                      

     - CAATCCAAGA AAAGAAGAGA AACAATCTTC AGATTCTCAA GAACAATTAG CC - #GAACATAA   

      360                                                                      

     - GAATCTAGAA ACGAAGAAAG AGGAGAAGAT TTCTCCAAAA GAAAAGACTG GG - #GTAAATAC   

      420                                                                      

     - ATTAAATCCA CAGGATGAAG TTTTATCAGG TCAATTGAAC AAACCTGAAC TC - #TTATATCG   

      480                                                                      

     - TGAGGAAACT ATGGAGACAA AAATAGATTT TCAAGAAGAA ATTCAAGAAA AT - #CCTGATTT   

      540                                                                      

     - AGCTGAAGGA ACTGTAAGAG TAAAACAAGA AGGTAAATTA GGTAAGAAAG TT - #GAAATCGT   

      600                                                                      

     - CAGAATATTC TCTGTAAACA AGGAAGAAGT TTCGCGAGAA ATTGTTTCAA CT - #TCAACGAC   

      660                                                                      

     - TGCGCCTAGT CCAAGAATAG TCGAAAAAGG TACTAAAAAA ACTCAAGTTA TA - #AAGGAACA   

      720                                                                      

     - ACCTGAGACT GGTGTAGAAC ATAAGGACGT ACAGTCTGGA GCTATTGTTG AA - #CCCGCAAT   

      780                                                                      

     - TCAGCCTGAG TTGCCCGAAG CTGTAGTAAG TGACAAAGGC GAACCAGAAG TT - #CAACCTAC   

      840                                                                      

     - ATTACCCGAA GCAGTTGTGA CCGACAAAGG TGAGACTGAG GTTCAACCAG AG - #TCGCCAGA   

      900                                                                      

     - TACTGTGGTA AGTGATAAAG GTGAACCAGA GCAGGTAGCA CCGCTTCCAG AA - #TATAAGGG   

      960                                                                      

     - TAATATTGAG CAAGTAAAAC CTGAAACTCC GGTTGAGAAG ACCAAAGAAC AA - #GGTCCAGA   

     1020                                                                      

     - AAAAACTGAA GAAGTTCCAG TAAAACCAAC AGAAGAAACA CCAGTAAATC CA - #AATGAAGG   

     1080                                                                      

     - TACTACAGAA GGAACCTCAA TTCAAGAAGC AGAAAATCCA GTTCAACCTG CA - #GAAGAATC   

     1140                                                                      

     - AACAACGAAT TCAGAGAAAG TATCACCAGA TACATCTAGC AAAAATACTG GG - #GAAGTGTC   

     1200                                                                      

     - CAGTAATCCT AGTGATTCGA CAACCTCAGT TGGAGAATCA AATAAACCAG AA - #CATAATGA   

     1260                                                                      

     - CTCTAAAAAT GAAAATTCAG AAAAAACTGT AGAAGAAGTT CCAGTAAATC CA - #AATGAAGG   

     1320                                                                      

     - CACAGTAGAA GGTACCTCAA ATCAAGAAAC AGAAAAACCA GTTCAACCTG CA - #GAAGAAAC   

     1380                                                                      

     - ACAAACAAAC TCTGGGAAAA TAGCTAACGA AAATACTGGA GAAGTATCCA AT - #AAACCTAG   

     1440                                                                      

     - TGATTCAAAA CCACCAGTTG AAGAATCAAA TCAACCAGAA AAAAACGGAA CT - #GCAACAAA   

     1500                                                                      

     - ACCAGAAAAT TCAGGTAATA CAACATCAGA GAATGGACAA ACAGAACCAG AA - #CCATCAAA   

     1560                                                                      

     - CGGAAATTCA ACTGAGGATG TTTCAACCGA ATCAAACACA TCCAATTCAA AT - #GGAAACGA   

     1620                                                                      

     - AGAAATTAAA CAAGAAAATG AACTAGACCC TGATAAAAAG GTAGAAGAAC CA - #GAGAAAAC   

     1680                                                                      

     #      1714        GTTT CCGACCTAGA GTTA                                   

     - (2) INFORMATION FOR SEQ ID NO:4:                                        

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 571 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:                             

     - Asn Tyr Asn Thr Asp Tyr Glu Leu Thr Ser Gl - #y Glu Lys Leu Pro Leu     

     #                15                                                       

     - Pro Lys Glu Ile Ser Gly Tyr Thr Tyr Ile Gl - #y Tyr Ile Lys Glu Gly     

     #            30                                                           

     - Lys Thr Thr Ser Glu Ser Glu Val Ser Asn Gl - #n Lys Ser Ser Val Ala     

     #        45                                                               

     - Thr Pro Thr Lys Gln Gln Lys Val Asp Tyr As - #n Val Thr Pro Asn Phe     

     #    60                                                                   

     - Val Asp His Pro Ser Thr Val Gln Ala Ile Gl - #n Glu Gln Thr Pro Val     

     #80                                                                       

     - Ser Ser Thr Lys Pro Thr Glu Val Gln Val Va - #l Glu Lys Pro Phe Ser     

     #                95                                                       

     - Thr Glu Leu Ile Asn Pro Arg Lys Glu Glu Ly - #s Gln Ser Ser Asp Ser     

     #           110                                                           

     - Gln Glu Gln Leu Ala Glu His Lys Asn Leu Gl - #u Thr Lys Lys Glu Glu     

     #       125                                                               

     - Lys Ile Ser Pro Lys Glu Lys Thr Gly Val As - #n Thr Leu Asn Pro Gln     

     #   140                                                                   

     - Asp Glu Val Leu Ser Gly Gln Leu Asn Lys Pr - #o Glu Leu Leu Tyr Arg     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Glu Glu Thr Met Glu Thr Lys Ile Asp Phe Gl - #n Glu Glu Ile Gln Glu     

     #               175                                                       

     - Asn Pro Asp Leu Ala Glu Gly Thr Val Arg Va - #l Lys Gln Glu Gly Lys     

     #           190                                                           

     - Leu Gly Lys Lys Val Glu Ile Val Arg Ile Ph - #e Ser Val Asn Lys Glu     

     #       205                                                               

     - Glu Val Ser Arg Glu Ile Val Ser Thr Ser Th - #r Thr Ala Pro Ser Pro     

     #   220                                                                   

     - Arg Ile Val Glu Lys Gly Thr Lys Lys Thr Gl - #n Val Ile Lys Glu Gln     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Pro Glu Thr Gly Val Glu His Lys Asp Val Gl - #n Ser Gly Ala Ile Val     

     #               255                                                       

     - Glu Pro Ala Ile Gln Pro Glu Leu Pro Glu Al - #a Val Val Ser Asp Lys     

     #           270                                                           

     - Gly Glu Pro Glu Val Gln Pro Thr Leu Pro Gl - #u Ala Val Val Thr Asp     

     #       285                                                               

     - Lys Gly Glu Thr Glu Val Gln Pro Glu Ser Pr - #o Asp Thr Val Val Ser     

     #   300                                                                   

     - Asp Lys Gly Glu Pro Glu Gln Val Ala Pro Le - #u Pro Glu Tyr Lys Gly     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Asn Ile Glu Gln Val Lys Pro Glu Thr Pro Va - #l Glu Lys Thr Lys Glu     

     #               335                                                       

     - Gln Gly Pro Glu Lys Thr Glu Glu Val Pro Va - #l Lys Pro Thr Glu Glu     

     #           350                                                           

     - Thr Pro Val Asn Pro Asn Glu Gly Thr Thr Gl - #u Gly Thr Ser Ile Gln     

     #       365                                                               

     - Glu Ala Glu Asn Pro Val Gln Pro Ala Glu Gl - #u Ser Thr Thr Asn Ser     

     #   380                                                                   

     - Glu Lys Val Ser Pro Asp Thr Ser Ser Lys As - #n Thr Gly Glu Val Ser     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Ser Asn Pro Ser Asp Ser Thr Thr Ser Val Gl - #y Glu Ser Asn Lys Pro     

     #               415                                                       

     - Glu His Asn Asp Ser Lys Asn Glu Asn Ser Gl - #u Lys Thr Val Glu Glu     

     #           430                                                           

     - Val Pro Val Asn Pro Asn Glu Gly Thr Val Gl - #u Gly Thr Ser Asn Gln     

     #       445                                                               

     - Glu Thr Glu Lys Pro Val Gln Pro Ala Glu Gl - #u Thr Gln Thr Asn Ser     

     #   460                                                                   

     - Gly Lys Ile Ala Asn Glu Asn Thr Gly Glu Va - #l Ser Asn Lys Pro Ser     

     465                 4 - #70                 4 - #75                 4 -   

     #80                                                                       

     - Asp Ser Lys Pro Pro Val Glu Glu Ser Asn Gl - #n Pro Glu Lys Asn Gly     

     #               495                                                       

     - Thr Ala Thr Lys Pro Glu Asn Ser Gly Asn Th - #r Thr Ser Glu Asn Gly     

     #           510                                                           

     - Gln Thr Glu Pro Glu Pro Ser Asn Gly Asn Se - #r Thr Glu Asp Val Ser     

     #       525                                                               

     - Thr Glu Ser Asn Thr Ser Asn Ser Asn Gly As - #n Glu Glu Ile Lys Gln     

     #   540                                                                   

     - Glu Asn Glu Leu Asp Pro Asp Lys Lys Val Gl - #u Glu Pro Glu Lys Thr     

     545                 5 - #50                 5 - #55                 5 -   

     #60                                                                       

     - Leu Glu Leu Arg Asn Val Ser Asp Leu Glu Le - #u                         

     #               570                                                       

     - (2) INFORMATION FOR SEQ ID NO: 5:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 748 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     -    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:                             

     - TGAGAATCAA GCTACACCCA AAGAGACTAG CGCTCAAAAG ACAATCGTCC TT - #GCTACAGC   

       60                                                                      

     - TGGCGACGTG CCACCATTTG ACTACGAAGA CAAGGGCAAT CTGACAGGCT TT - #GATATCGA   

      120                                                                      

     - AGTTTTAAAG GCAGTAGATG AAAAACTCAG CGACTACGAG ATTCAATTCC AA - #AGAACCGC   

      180                                                                      

     - CTGGGAGAGC ATCTTCCCAG GACTTGATTC TGGTCACTAT CAGGCTGCGG CC - #AATAACTT   

      240                                                                      

     - GAGTTACACA AAAGAGCGTG CTGAAAAATA CCTTTACTCG CTTCCAATTT CC - #AACAATCC   

      300                                                                      

     - CCTCGTCCTT GTCAGCAACA AGAAAAATCC TTTGACTTCT CTTGACCAGA TC - #GCTGGTAA   

      360                                                                      

     - AACAACACAA GAGGATACCG GAACTTCTAA CGCTCAATTC ATCAATAACT GG - #AATCAGAA   

      420                                                                      

     - ACACACTGAT AATCCCGCTA CAATTAATTT TTCTGGTGAG GATATTGGTA AA - #CGAATCCT   

      480                                                                      

     - AGACCTTGCT AACGGAGAGT TTGATTTCCT AGTTTTTGAC AAGGTATCCG TT - #CAAAAGAT   

      540                                                                      

     - TATCAAGGAC CGTGGTTTAG ACCTCTCAGT CGTTGATTTA CCTTCTGCAG AT - #AGCCCCAG   

      600                                                                      

     - CAATTATATC ATTTTCTCAA GCGACCAAAA AGAGTTTAAA GAGCAATTTG AT - #AAAGCGCT   

      660                                                                      

     - CAAAGAACTC TATCAAGACG GAACCCTTGA AAAACTCAGC AATACCTATC TA - #GGTGGTTC   

      720                                                                      

     #            748   TCTC AGTTACAA                                          

     - (2) INFORMATION FOR SEQ ID NO:6:                                        

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 249 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:                             

     - Glu Asn Gln Ala Thr Pro Lys Glu Thr Ser Al - #a Gln Lys Thr Ile Val     

     #                15                                                       

     - Leu Ala Thr Ala Gly Asp Val Pro Pro Phe As - #p Tyr Glu Asp Lys Gly     

     #            30                                                           

     - Asn Leu Thr Gly Phe Asp Ile Glu Val Leu Ly - #s Ala Val Asp Glu Lys     

     #        45                                                               

     - Leu Ser Asp Tyr Glu Ile Gln Phe Gln Arg Th - #r Ala Trp Glu Ser Ile     

     #    60                                                                   

     - Phe Pro Gly Leu Asp Ser Gly His Tyr Gln Al - #a Ala Ala Asn Asn Leu     

     #80                                                                       

     - Ser Tyr Thr Lys Glu Arg Ala Glu Lys Tyr Le - #u Tyr Ser Leu Pro Ile     

     #                95                                                       

     - Ser Asn Asn Pro Leu Val Leu Val Ser Asn Ly - #s Lys Asn Pro Leu Thr     

     #           110                                                           

     - Ser Leu Asp Gln Ile Ala Gly Lys Thr Thr Gl - #n Glu Asp Thr Gly Thr     

     #       125                                                               

     - Ser Asn Ala Gln Phe Ile Asn Asn Trp Asn Gl - #n Lys His Thr Asp Asn     

     #   140                                                                   

     - Pro Ala Thr Ile Asn Phe Ser Gly Glu Asp Il - #e Gly Lys Arg Ile Leu     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Asp Leu Ala Asn Gly Glu Phe Asp Phe Leu Va - #l Phe Asp Lys Val Ser     

     #               175                                                       

     - Val Gln Lys Ile Ile Lys Asp Arg Gly Leu As - #p Leu Ser Val Val Asp     

     #           190                                                           

     - Leu Pro Ser Ala Asp Ser Pro Ser Asn Tyr Il - #e Ile Phe Ser Ser Asp     

     #       205                                                               

     - Gln Lys Glu Phe Lys Glu Gln Phe Asp Lys Al - #a Leu Lys Glu Leu Tyr     

     #   220                                                                   

     - Gln Asp Gly Thr Leu Glu Lys Leu Ser Asn Th - #r Tyr Leu Gly Gly Ser     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Tyr Leu Pro Asp Gln Ser Gln Leu Gln                                     

                     245                                                       

     - (2) INFORMATION FOR SEQ ID NO: 7:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 985 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     -    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:                             

     - TGGTAACCGC TCTTCTCGTA ACGCAGCTTC ATCTTCTGAT GTGAAGACAA AA - #GCAGCAAT   

       60                                                                      

     - CGTCACTGAT ACTGGTGGTG TTGATGACAA ATCATTCAAC CAATCAGCTT GG - #GAAGGTTT   

      120                                                                      

     - GCAGGCTTGG GGTAAAGAAC ACAATCTTTC AAAAGATAAC GGTTTCACTT AC - #TTCCAATC   

      180                                                                      

     - AACAAGTGAA GCTGACTACG CTAACAACTT GCAACAAGCG GCTGGAAGTT AC - #AACCTAAT   

      240                                                                      

     - CTTCGGTGTT GGTTTTGCCC TTAATAATGC AGTTAAAGAT GCAGCAAAAG AA - #CACACTGA   

      300                                                                      

     - CTTGAACTAT GTCTTGATTG ATGATGTGAT TAAAGACCAA AAGAATGTTG CG - #AGCGTAAC   

      360                                                                      

     - TTTCGCTGAT AATGAGTCAG GTTACCTTGC AGGTGTGGCT GCAGCAAAAA CA - #ACTAAGAC   

      420                                                                      

     - AAAACAAGTT GGTTTTGTAG GTGGTATCGA ATCTGAAGTT ATCTCTCGTT TT - #GAAGCAGG   

      480                                                                      

     - ATTCAAGGCT GGTGTTGCGT CAGTAGACCC ATCTATCAAA GTCCAAGTTG AC - #TACGCTGG   

      540                                                                      

     - TTCATTTGGT GATGCGGCTA AAGGTAAAAC AATTGCAGCC GCACAATACG CA - #GCCGGTGC   

      600                                                                      

     - AGATATTGTT TACCAAGTAG CTGGTGGTAC AGGTGCAGGT GTCTTTGCAG AG - #GCAAAATC   

      660                                                                      

     - TCTCAACGAA AGCCGTCCTG AAAATGAAAA AGTTTGGGTT ATCGGTGTTG AT - #CGTGACCA   

      720                                                                      

     - AGAAGCAGAA GGTAAATACA CTTCTAAAGA TGGCAAAGAA TCAAACTTTG TT - #CTTGTATC   

      780                                                                      

     - TACTTTGAAA CAAGTTGGTA CAACTGTAAA AGATATTTCT AACAAGGCAG AA - #AGAGGAGA   

      840                                                                      

     - ATTCCCTGGC GGTCAAGTGA TCGTTTACTC ATTGAAGGAT AAAGGGGTTG AC - #TTGGCAGT   

      900                                                                      

     - AACAAACCTT TCAGAAGAAG GTAAAAAAGC TGTCGAAGAT GCAAAAGCTA AA - #ATCCTTGA   

      960                                                                      

     #              985 CCTG AAAAA                                             

     - (2) INFORMATION FOR SEQ ID NO:8:                                        

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 328 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:                             

     - Gly Asn Arg Ser Ser Arg Asn Ala Ala Ser Se - #r Ser Asp Val Lys Thr     

     #                15                                                       

     - Lys Ala Ala Ile Val Thr Asp Thr Gly Gly Va - #l Asp Asp Lys Ser Phe     

     #            30                                                           

     - Asn Gln Ser Ala Trp Glu Gly Leu Gln Ala Tr - #p Gly Lys Glu His Asn     

     #        45                                                               

     - Leu Ser Lys Asp Asn Gly Phe Thr Tyr Phe Gl - #n Ser Thr Ser Glu Ala     

     #    60                                                                   

     - Asp Tyr Ala Asn Asn Leu Gln Gln Ala Ala Gl - #y Ser Tyr Asn Leu Ile     

     #80                                                                       

     - Phe Gly Val Gly Phe Ala Leu Asn Asn Ala Va - #l Lys Asp Ala Ala Lys     

     #                95                                                       

     - Glu His Thr Asp Leu Asn Tyr Val Leu Ile As - #p Asp Val Ile Lys Asp     

     #           110                                                           

     - Gln Lys Asn Val Ala Ser Val Thr Phe Ala As - #p Asn Glu Ser Gly Tyr     

     #       125                                                               

     - Leu Ala Gly Val Ala Ala Ala Lys Thr Thr Ly - #s Thr Lys Gln Val Gly     

     #   140                                                                   

     - Phe Val Gly Gly Ile Glu Ser Glu Val Ile Se - #r Arg Phe Glu Ala Gly     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Phe Lys Ala Gly Val Ala Ser Val Asp Pro Se - #r Ile Lys Val Gln Val     

     #               175                                                       

     - Asp Tyr Ala Gly Ser Phe Gly Asp Ala Ala Ly - #s Gly Lys Thr Ile Ala     

     #           190                                                           

     - Ala Ala Gln Tyr Ala Ala Gly Ala Asp Ile Va - #l Tyr Gln Val Ala Gly     

     #       205                                                               

     - Gly Thr Gly Ala Gly Val Phe Ala Glu Ala Ly - #s Ser Leu Asn Glu Ser     

     #   220                                                                   

     - Arg Pro Glu Asn Glu Lys Val Trp Val Ile Gl - #y Val Asp Arg Asp Gln     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Glu Ala Glu Gly Lys Tyr Thr Ser Lys Asp Gl - #y Lys Glu Ser Asn Phe     

     #               255                                                       

     - Val Leu Val Ser Thr Leu Lys Gln Val Gly Th - #r Thr Val Lys Asp Ile     

     #           270                                                           

     - Ser Asn Lys Ala Glu Arg Gly Glu Phe Pro Gl - #y Gly Gln Val Ile Val     

     #       285                                                               

     - Tyr Ser Leu Lys Asp Lys Gly Val Asp Leu Al - #a Val Thr Asn Leu Ser     

     #   300                                                                   

     - Glu Glu Gly Lys Lys Ala Val Glu Asp Ala Ly - #s Ala Lys Ile Leu Asp     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Gly Ser Val Lys Val Pro Glu Lys                                         

                     325                                                       

     - (2) INFORMATION FOR SEQ ID NO: 9:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1404 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #9:   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TGTGGAAATT TGACAGGTAA CAGCAAAAAA GCTGCTGATT CAGGTGACAA AC - #CTGTTATC   

       60                                                                      

     - AAAATGTACC AAATCGGTGA CAAACCAGAC AACTTGGATG AATTGTTAGC AA - #ATGCCAAC   

      120                                                                      

     - AAAATCATTG AAGAAAAAGT TGGTGCCAAA TTGGATATCC AATACCTTGG CT - #GGGGTGAC   

      180                                                                      

     - TATGGTAAGA AAATGTCAGT TATCACATCA TCTGGTGAAA ACTATGATAT TG - #CCTTTGCA   

      240                                                                      

     - GATAACTATA TTGTAAATGC TCAAAAAGGT GCTTACGCTG ACTTGACAGA AT - #TGTACAAA   

      300                                                                      

     - AAAGAAGGTA AAGACCTTTA CAAAGCACTT GACCCAGCTT ACATCAAGGG TA - #ATACTGTA   

      360                                                                      

     - AATGGTAAGA TTTACGCTGT TCCAGTTGCA GCCAACGTTG CATCATCTCA AA - #ACTTTGCC   

      420                                                                      

     - TTCAACGGAA CTCTCCTTGC TAAATATGGT ATCGATATTT CAGGTGTTAC TT - #CTTACGAA   

      480                                                                      

     - ACTCTTGAGC CAGTCTTGAA ACAAATCAAA GAAAAAGCTC CAGACGTAGT AC - #CATTTGCT   

      540                                                                      

     - ATTGGTAAAG TTTTCATCCC ATCTGATAAT TTTGACTACC CAGTAGCAAA CG - #GTCTTCCA   

      600                                                                      

     - TTCGTTATCG ACCTTGAAGG CGATACTACT AAAGTTGTAA ACCGTTACGA AG - #TGCCTCGT   

      660                                                                      

     - TTCAAAGAAC ACTTGAAGAC TCTTCACAAA TTCTATGAAG CTGGCTACAT TC - #CAAAAGAC   

      720                                                                      

     - GTCGCAACAA GCGATACTTC CTTTGACCTT CAACAAGATA CTTGGTTCGT TC - #GTGAAGAA   

      780                                                                      

     - ACAGTAGGAC CAGCTGACTA CGGTAACAGC TTGCTTTCAC GTGTTGCCAA CA - #AAGATATC   

      840                                                                      

     - CAAATCAAAC CAATTACTAA CTTCATCAAG NAAAACCAAA CAACACAAGT TG - #CTAACTTT   

      900                                                                      

     - GTCATCTCAA ACAACTCTAA GAACAAAGAA AAATCAATGG AAATCTTGAA CC - #TCTTGAAT   

      960                                                                      

     - ACGAACCCAG AACTCTTGAA CGGTCTTGTT TACGGTCCAG AAGGCAAGAA CT - #GGGAAAAA   

     1020                                                                      

     - ATTGAAGGTA AAGAAAACCG TGTTCGCGTT CTTGATGGCT ACAAAGGAAA CA - #CTCACATG   

     1080                                                                      

     - GGTGGATGGA ACACTGGTAA CAACTGGATC CTTTACATCA ACGAAAACGT TA - #CAGACCAA   

     1140                                                                      

     - CAAATCGAAA ATTCTAAGAA AGAATTGGCA GAAGCTAAAG AATCTCCAGC GC - #TTGGATTT   

     1200                                                                      

     - ATCTTCAATA CTGACAATGT GAAATCTGAA ATCTCAGCTA TTGCTAACAC AA - #TGCAACAA   

     1260                                                                      

     - TTTGATACAG CTATCAACAC TGGTACTGTA GACCCAGATA AAGCGATTCC AG - #AATTGATG   

     1320                                                                      

     - GAAAAATTGA AATCTGAAGG TGCCTACGAA AAAGTATTGA ACGAAATGCA AA - #AACAATAC   

     1380                                                                      

     #              1404ACAA AAAA                                              

     - (2) INFORMATION FOR SEQ ID NO:10:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 468 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:                            

     - Cys Gly Asn Leu Thr Gly Asn Ser Lys Lys Al - #a Ala Asp Ser Gly Asp     

     #                15                                                       

     - Lys Pro Val Ile Lys Met Tyr Gln Ile Gly As - #p Lys Pro Asp Asn Leu     

     #            30                                                           

     - Asp Glu Leu Leu Ala Asn Ala Asn Lys Ile Il - #e Glu Glu Lys Val Gly     

     #        45                                                               

     - Ala Lys Leu Asp Ile Gln Tyr Leu Gly Trp Gl - #y Asp Tyr Gly Lys Lys     

     #    60                                                                   

     - Met Ser Val Ile Thr Ser Ser Gly Glu Asn Ty - #r Asp Ile Ala Phe Ala     

     #80                                                                       

     - Asp Asn Tyr Ile Val Asn Ala Gln Lys Gly Al - #a Tyr Ala Asp Leu Thr     

     #                95                                                       

     - Glu Leu Tyr Lys Lys Glu Gly Lys Asp Leu Ty - #r Lys Ala Leu Asp Pro     

     #           110                                                           

     - Ala Tyr Ile Lys Gly Asn Thr Val Asn Gly Ly - #s Ile Tyr Ala Val Pro     

     #       125                                                               

     - Val Ala Ala Asn Val Ala Ser Ser Gln Asn Ph - #e Ala Phe Asn Gly Thr     

     #   140                                                                   

     - Leu Leu Ala Lys Tyr Gly Ile Asp Ile Ser Gl - #y Val Thr Ser Tyr Glu     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Thr Leu Glu Pro Val Leu Lys Gln Ile Lys Gl - #u Lys Ala Pro Asp Val     

     #               175                                                       

     - Val Pro Phe Ala Ile Gly Lys Val Phe Ile Pr - #o Ser Asp Asn Phe Asp     

     #           190                                                           

     - Tyr Pro Val Ala Asn Gly Leu Pro Phe Val Il - #e Asp Leu Glu Gly Asp     

     #       205                                                               

     - Thr Thr Lys Val Val Asn Arg Tyr Glu Val Pr - #o Arg Phe Lys Glu His     

     #   220                                                                   

     - Leu Lys Thr Leu His Lys Phe Tyr Glu Ala Gl - #y Tyr Ile Pro Lys Asp     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Val Ala Thr Ser Asp Thr Ser Phe Asp Leu Gl - #n Gln Asp Thr Trp Phe     

     #               255                                                       

     - Val Arg Glu Glu Thr Val Gly Pro Ala Asp Ty - #r Gly Asn Ser Leu Leu     

     #           270                                                           

     - Ser Arg Val Ala Asn Lys Asp Ile Gln Ile Ly - #s Pro Ile Thr Asn Phe     

     #       285                                                               

     - Ile Lys Xaa Asn Gln Thr Thr Gln Val Ala As - #n Phe Val Ile Ser Asn     

     #   300                                                                   

     - Asn Ser Lys Asn Lys Glu Lys Ser Met Glu Il - #e Leu Asn Leu Leu Asn     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Thr Asn Pro Glu Leu Leu Asn Gly Leu Val Ty - #r Gly Pro Glu Gly Lys     

     #               335                                                       

     - Asn Trp Glu Lys Ile Glu Gly Lys Glu Asn Ar - #g Val Arg Val Leu Asp     

     #           350                                                           

     - Gly Tyr Lys Gly Asn Thr His Met Gly Gly Tr - #p Asn Thr Gly Asn Asn     

     #       365                                                               

     - Trp Ile Leu Tyr Ile Asn Glu Asn Val Thr As - #p Gln Gln Ile Glu Asn     

     #   380                                                                   

     - Ser Lys Lys Glu Leu Ala Glu Ala Lys Glu Se - #r Pro Ala Leu Gly Phe     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Ile Phe Asn Thr Asp Asn Val Lys Ser Glu Il - #e Ser Ala Ile Ala Asn     

     #               415                                                       

     - Thr Met Gln Gln Phe Asp Thr Ala Ile Asn Th - #r Gly Thr Val Asp Pro     

     #           430                                                           

     - Asp Lys Ala Ile Pro Glu Leu Met Glu Lys Le - #u Lys Ser Glu Gly Ala     

     #       445                                                               

     - Tyr Glu Lys Val Leu Asn Glu Met Gln Lys Gl - #n Tyr Asp Glu Phe Leu     

     #   460                                                                   

     - Lys Asn Lys Lys                                                         

     465                                                                       

     - (2) INFORMATION FOR SEQ ID NO: 11:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 937 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #11:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TGGTCAAGGA ACTGCTTCTA AAGACAACAA AGAGGCAGAA CTTAAGAAGG TT - #GACTTTAT   

       60                                                                      

     - CCTAGACTGG ACACCAAATA CCAACCACAC AGGGCTTTAT GTTGCCAAGG AA - #AAAGGTTA   

      120                                                                      

     - TTTCAAAGAA GCTGGAGTGG ATGTTGATTT GAAATTGCCA CCAGAAGAAA GT - #TCTTCTGA   

      180                                                                      

     - CTTGGTTATC AACGGAAAGG CACCATTTGC AGTGTATTTC CAAGACTACA TG - #GCTAAGAA   

      240                                                                      

     - ATTGGAAAAA GGAGCAGGAA TCACTGCCGT TGCAGCTATT GTTGAACACA AT - #ACATCAGG   

      300                                                                      

     - AATCATCTCT CGTAAATCTG ATAATGTAAG CAGTCCAAAA GACTTGGTTG GT - #AAGAAATA   

      360                                                                      

     - TGGGACATGG AATGACCCAA CTGAACTTGC TATGTTGAAA ACCTTGGTAG AA - #TCTCAAGG   

      420                                                                      

     - TGGAGACTTT GAGAAGGTTG AAAAAGTACC AAATAACGAC TCAAACTCAA TC - #ACACCGAT   

      480                                                                      

     - TGCCAATGGC GTCTTTGATA CTGCTTGGAT TTACTACGGT TGGGATGGTA TC - #CTTGCTAA   

      540                                                                      

     - ATCTCAAGGT GTAGATGCTA ACTTCATGTA CTTGAAAGAC TATGTCAAGG AG - #TTTGACTA   

      600                                                                      

     - CTATTCACCA GTTATCATCG CAAACAACGA CTATCTGAAA GATAACAAAG AA - #GAAGCTCG   

      660                                                                      

     - CAAAGTCATC CAAGCCATCA AAAAAGGCTA CCAATATGCC ATGGAACATC CA - #GAAGAAGC   

      720                                                                      

     - TGCAGATATT CTCATCAAGA ATGCACCTGA ACTCAAGGAA AAACGTGACT TT - #GTCATCGA   

      780                                                                      

     - ATCTCAAAAA TACTTGTCAA AAGAATACGC AAGCGACAAG GAAAAATGGG GT - #CAATTTGA   

      840                                                                      

     - CGCAGCTCGC TGGAATGCTT TCTACAAATG GGATAAAGAA AATGGTATCC TT - #AAAGAAGA   

      900                                                                      

     #     937          TTCA CCAACGAATT TGTGAAA                                

     - (2) INFORMATION FOR SEQ ID NO:12:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 312 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:                            

     - Gly Gln Gly Thr Ala Ser Lys Asp Asn Lys Gl - #u Ala Glu Leu Lys Lys     

     #                15                                                       

     - Val Asp Phe Ile Leu Asp Trp Thr Pro Asn Th - #r Asn His Thr Gly Leu     

     #            30                                                           

     - Tyr Val Ala Lys Glu Lys Gly Tyr Phe Lys Gl - #u Ala Gly Val Asp Val     

     #        45                                                               

     - Asp Leu Lys Leu Pro Pro Glu Glu Ser Ser Se - #r Asp Leu Val Ile Asn     

     #    60                                                                   

     - Gly Lys Ala Pro Phe Ala Val Tyr Phe Gln As - #p Tyr Met Ala Lys Lys     

     #80                                                                       

     - Leu Glu Lys Gly Ala Gly Ile Thr Ala Val Al - #a Ala Ile Val Glu His     

     #                95                                                       

     - Asn Thr Ser Gly Ile Ile Ser Arg Lys Ser As - #p Asn Val Ser Ser Pro     

     #           110                                                           

     - Lys Asp Leu Val Gly Lys Lys Tyr Gly Thr Tr - #p Asn Asp Pro Thr Glu     

     #       125                                                               

     - Leu Ala Met Leu Lys Thr Leu Val Glu Ser Gl - #n Gly Gly Asp Phe Glu     

     #   140                                                                   

     - Lys Val Glu Lys Val Pro Asn Asn Asp Ser As - #n Ser Ile Thr Pro Ile     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Ala Asn Gly Val Phe Asp Thr Ala Trp Ile Ty - #r Tyr Gly Trp Asp Gly     

     #               175                                                       

     - Ile Leu Ala Lys Ser Gln Gly Val Asp Ala As - #n Phe Met Tyr Leu Lys     

     #           190                                                           

     - Asp Tyr Val Lys Glu Phe Asp Tyr Tyr Ser Pr - #o Val Ile Ile Ala Asn     

     #       205                                                               

     - Asn Asp Tyr Leu Lys Asp Asn Lys Glu Glu Al - #a Arg Lys Val Ile Gln     

     #   220                                                                   

     - Ala Ile Lys Lys Gly Tyr Gln Tyr Ala Met Gl - #u His Pro Glu Glu Ala     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Ala Asp Ile Leu Ile Lys Asn Ala Pro Glu Le - #u Lys Glu Lys Arg Asp     

     #               255                                                       

     - Phe Val Ile Glu Ser Gln Lys Tyr Leu Ser Ly - #s Glu Tyr Ala Ser Asp     

     #           270                                                           

     - Lys Glu Lys Trp Gly Gln Phe Asp Ala Ala Ar - #g Trp Asn Ala Phe Tyr     

     #       285                                                               

     - Lys Trp Asp Lys Glu Asn Gly Ile Leu Lys Gl - #u Asp Leu Thr Asp Lys     

     #   300                                                                   

     - Gly Phe Thr Asn Glu Phe Val Lys                                         

     305                 3 - #10                                               

     - (2) INFORMATION FOR SEQ ID NO: 13:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 799 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #13:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TAGCTCAGGT GGAAACGCTG GTTCATCCTC TGGAAAAACA ACTGCCAAAG CT - #CGCACTAT   

       60                                                                      

     - CGATGAAATC AAAAAAAGCG GTGAACTGCG AATCGCCGTG TTTGGAGATA AA - #AAACCGTT   

      120                                                                      

     - TGGCTACGTT GACAATGATG GTTCTACCAA GGTACGCTAC GATATTGAAC TA - #GGGAACCA   

      180                                                                      

     - ACTAGCTCAA GACCTTGGTG TCAAGGTTAA ATACATTTCA GTCGATGCTG CC - #AACCGTGC   

      240                                                                      

     - GGAATACTTG ATTTCAAACA AGGTAGATAT TACTCTTGCT AACTTTACAG TA - #ACTGACGA   

      300                                                                      

     - ACGTAAGAAA CAAGTTGATT TTGCCCTTCC ATATATGAAA GTTTCTCTGG GT - #GTCGTATC   

      360                                                                      

     - ACCTAAGACT GGTCTCATTA CAGACGTCAA ACAACTTGAA GGTAAAACCT TA - #ATTGTCAC   

      420                                                                      

     - AAAAGGAACG ACTGCTGAGA CTTATTTTGA AAAGAATCAT CCAGAAATCA AA - #CTCCAAAA   

      480                                                                      

     - ATACGACCAA TACAGTGACT CTTACCAAGC TCTTCTTGAC GGACGTGGAG AT - #GCCTTTTC   

      540                                                                      

     - AACTGACAAT ACGGAAGTTC TAGCTTGGGC GCTTGAAAAT AAAGGATTTG AA - #GTAGGAAT   

      600                                                                      

     - TACTTCCCTC GGTGATCCCG ATACCATTGC GGCAGCAGTT CAAAAAGGCA AC - #CAAGAATT   

      660                                                                      

     - GCTAGACTTC ATCAATAAAG ATATTGAAAA ATTAGGCAAG GAAAACTTCT TC - #CACAAGGC   

      720                                                                      

     - CTATGAAAAG ACACTTCACC CAACCTACGG TGACGCTGCT AAAGCAGATG AC - #CTGGTTGT   

      780                                                                      

     #799               GAT                                                    

     - (2) INFORMATION FOR SEQ ID NO:14:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 266 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:                            

     - Ser Ser Gly Gly Asn Ala Gly Ser Ser Ser Gl - #y Lys Thr Thr Ala Lys     

     #                15                                                       

     - Ala Arg Thr Ile Asp Glu Ile Lys Lys Ser Gl - #y Glu Leu Arg Ile Ala     

     #            30                                                           

     - Val Phe Gly Asp Lys Lys Pro Phe Gly Tyr Va - #l Asp Asn Asp Gly Ser     

     #        45                                                               

     - Thr Lys Val Arg Tyr Asp Ile Glu Leu Gly As - #n Gln Leu Ala Gln Asp     

     #    60                                                                   

     - Leu Gly Val Lys Val Lys Tyr Ile Ser Val As - #p Ala Ala Asn Arg Ala     

     #80                                                                       

     - Glu Tyr Leu Ile Ser Asn Lys Val Asp Ile Th - #r Leu Ala Asn Phe Thr     

     #                95                                                       

     - Val Thr Asp Glu Arg Lys Lys Gln Val Asp Ph - #e Ala Leu Pro Tyr Met     

     #           110                                                           

     - Lys Val Ser Leu Gly Val Val Ser Pro Lys Th - #r Gly Leu Ile Thr Asp     

     #       125                                                               

     - Val Lys Gln Leu Glu Gly Lys Thr Leu Ile Va - #l Thr Lys Gly Thr Thr     

     #   140                                                                   

     - Ala Glu Thr Tyr Phe Glu Lys Asn His Pro Gl - #u Ile Lys Leu Gln Lys     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Tyr Asp Gln Tyr Ser Asp Ser Tyr Gln Ala Le - #u Leu Asp Gly Arg Gly     

     #               175                                                       

     - Asp Ala Phe Ser Thr Asp Asn Thr Glu Val Le - #u Ala Trp Ala Leu Glu     

     #           190                                                           

     - Asn Lys Gly Phe Glu Val Gly Ile Thr Ser Le - #u Gly Asp Pro Asp Thr     

     #       205                                                               

     - Ile Ala Ala Ala Val Gln Lys Gly Asn Gln Gl - #u Leu Leu Asp Phe Ile     

     #   220                                                                   

     - Asn Lys Asp Ile Glu Lys Leu Gly Lys Glu As - #n Phe Phe His Lys Ala     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Tyr Glu Lys Thr Leu His Pro Thr Tyr Gly As - #p Ala Ala Lys Ala Asp     

     #               255                                                       

     - Asp Leu Val Val Glu Gly Gly Lys Val Asp                                 

     #           265                                                           

     - (2) INFORMATION FOR SEQ ID NO: 15:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1189 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #15:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - CTCCAACTAT GGTAAATCTG CGGATGGCAC AGTGACCATC GAGTATTTCA AC - #CAGAAAAA   

       60                                                                      

     - AGAAATGACC AAAACCTTGG AAGAAATCAC TCGTGATTTT GAGAAGGAAA AC - #CCTAAGAT   

      120                                                                      

     - CAAGGTCAAA GTCGTCAATG TACCAAATGC TGGTGAAGTA TTGAAGACAC GC - #GTTCTCGC   

      180                                                                      

     - AGGAGATGTG CCTGATGTGG TCAATATTTA CCCACAGTCC ATCGAACTGC AA - #GAATGGGC   

      240                                                                      

     - AAAAGCAGGT GTTTTTGAAG ATTTGAGCAA CAAAGACTAC CTGAAACGCG TG - #AAAAATGG   

      300                                                                      

     - CTACGCTGAA AAATATGCTG TAAACGAAAA AGTTTACAAC GTTCCTTTTA CA - #GCTAATGC   

      360                                                                      

     - TTATGGAATT TACTACAACA AAGATAAATT CGAAGAACTG GGCTTGAAGG TT - #CCTGAAAC   

      420                                                                      

     - CTGGGATGAA TTTGAACAGT TAGTCAAAGA TATCGTTGCT AAAGGACAAA CA - #CCATTTGG   

      480                                                                      

     - AATTGCAGGT GCAGATGCTT GGACACTCAA TGGTTACAAT CAATTAGCCT TT - #GCGACAGC   

      540                                                                      

     - AACAGGTGGA GGAAAAGAAG CAAATCAATA CCTTCGTTAT TCTCAACCAA AT - #GCCATTAA   

      600                                                                      

     - ATTGTCGGAT CCGATTATGA AAGATGATAT CAAGGTCATG GACATCCTTC GC - #ATCAATGG   

      660                                                                      

     - ATCTAAGCAA AAGAACTGGG AAGGTGCTGG CTATACCGAT GTTATCGGAG CC - #TTCGCACG   

      720                                                                      

     - TGGGGATGTC CTCATGACAC CAAATGGGTC TTGGGCGATC ACAGCGATTA AT - #GAACAAAA   

      780                                                                      

     - ACCGAACTTT AAGATTGGGA CCTTCATGAT TCCAGGAAAA GAAAAAGGAC AA - #AGCTTAAC   

      840                                                                      

     - CGTTGGTGCG GGAGACTTGG CATGGTCTAT CTCAGCCACC ACCAAACATC CA - #AAAGAAGC   

      900                                                                      

     - CAATGCCTTT GTGGAATATA TGACCCGTCC AGAAGTCATG CAAAAATACT AC - #GATGTGGA   

      960                                                                      

     - CGGATCTCCA ACAGCGATCG AAGGGGTCAA ACAAGCAGGA GAAGATTCAC CG - #CTTGCTGG   

     1020                                                                      

     - TATGACCGAA TATGCCTTTA CGGATCGTCA CTTGGTCTGG TTGCAACAAT AC - #TGGACCAG   

     1080                                                                      

     - TGAAGCAGAC TTCCATACCT TGACCATGAA CTATGTCTTG ACCGGTGATA AA - #CAAGGCAT   

     1140                                                                      

     #             1189TGCCT TCTTTAACCC GATGAAAGCG GATGTGGAT                   

     - (2) INFORMATION FOR SEQ ID NO:16:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 396 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:                            

     - Ser Asn Tyr Gly Lys Ser Ala Asp Gly Thr Va - #l Thr Ile Glu Tyr Phe     

     #                15                                                       

     - Asn Gln Lys Lys Glu Met Thr Lys Thr Leu Gl - #u Glu Ile Thr Arg Asp     

     #            30                                                           

     - Phe Glu Lys Glu Asn Pro Lys Ile Lys Val Ly - #s Val Val Asn Val Pro     

     #        45                                                               

     - Asn Ala Gly Glu Val Leu Lys Thr Arg Val Le - #u Ala Gly Asp Val Pro     

     #    60                                                                   

     - Asp Val Val Asn Ile Tyr Pro Gln Ser Ile Gl - #u Leu Gln Glu Trp Ala     

     #80                                                                       

     - Lys Ala Gly Val Phe Glu Asp Leu Ser Asn Ly - #s Asp Tyr Leu Lys Arg     

     #                95                                                       

     - Val Lys Asn Gly Tyr Ala Glu Lys Tyr Ala Va - #l Asn Glu Lys Val Tyr     

     #           110                                                           

     - Asn Val Pro Phe Thr Ala Asn Ala Tyr Gly Il - #e Tyr Tyr Asn Lys Asp     

     #       125                                                               

     - Lys Phe Glu Glu Leu Gly Leu Lys Val Pro Gl - #u Thr Trp Asp Glu Phe     

     #   140                                                                   

     - Glu Gln Leu Val Lys Asp Ile Val Ala Lys Gl - #y Gln Thr Pro Phe Gly     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Ile Ala Gly Ala Asp Ala Trp Thr Leu Asn Gl - #y Tyr Asn Gln Leu Ala     

     #               175                                                       

     - Phe Ala Thr Ala Thr Gly Gly Gly Lys Glu Al - #a Asn Gln Tyr Leu Arg     

     #           190                                                           

     - Tyr Ser Gln Pro Asn Ala Ile Lys Leu Ser As - #p Pro Ile Met Lys Asp     

     #       205                                                               

     - Asp Ile Lys Val Met Asp Ile Leu Arg Ile As - #n Gly Ser Lys Gln Lys     

     #   220                                                                   

     - Asn Trp Glu Gly Ala Gly Tyr Thr Asp Val Il - #e Gly Ala Phe Ala Arg     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Gly Asp Val Leu Met Thr Pro Asn Gly Ser Tr - #p Ala Ile Thr Ala Ile     

     #               255                                                       

     - Asn Glu Gln Lys Pro Asn Phe Lys Ile Gly Th - #r Phe Met Ile Pro Gly     

     #           270                                                           

     - Lys Glu Lys Gly Gln Ser Leu Thr Val Gly Al - #a Gly Asp Leu Ala Trp     

     #       285                                                               

     - Ser Ile Ser Ala Thr Thr Lys His Pro Lys Gl - #u Ala Asn Ala Phe Val     

     #   300                                                                   

     - Glu Tyr Met Thr Arg Pro Glu Val Met Gln Ly - #s Tyr Tyr Asp Val Asp     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Gly Ser Pro Thr Ala Ile Glu Gly Val Lys Gl - #n Ala Gly Glu Asp Ser     

     #               335                                                       

     - Pro Leu Ala Gly Met Thr Glu Tyr Ala Phe Th - #r Asp Arg His Leu Val     

     #           350                                                           

     - Trp Leu Gln Gln Tyr Trp Thr Ser Glu Ala As - #p Phe His Thr Leu Thr     

     #       365                                                               

     - Met Asn Tyr Val Leu Thr Gly Asp Lys Gln Gl - #y Met Val Asn Asp Leu     

     #   380                                                                   

     - Asn Ala Phe Phe Asn Pro Met Lys Ala Asp Va - #l Asp                     

     385                 3 - #90                 3 - #95                       

     - (2) INFORMATION FOR SEQ ID NO: 17:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 775 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #17:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TGGGAAAAAT TCTAGCGAAA CTAGTGGAGA TAATTGGTCA AAGTACCAGT CT - #AACAAGTC   

       60                                                                      

     - TATTACTATT GGATTTGATA GTACTTTTGT TCCAATGGGA TTTGCTCAGA AA - #GATGGTTC   

      120                                                                      

     - TTATGCAGGA TTTGATATTG ATTTAGCTAC AGCTGTTTTT GAAAAATACG GA - #ATCACGGT   

      180                                                                      

     - AAATTGGCAA CCGATTGATT GGGATTTGAA AGAAGCTGAA TTGACAAAAG GA - #ACGATTGA   

      240                                                                      

     - TCTGATTTGG AATGGCTATT CCGCTACAGA CGAACGCCGT GAAAAGGTGG CT - #TTCAGTAA   

      300                                                                      

     - CTCATATATG AAGAATGAGC AGGTATTGGT TACGAAGAAA TCATCTGGTA TC - #ACGACTGC   

      360                                                                      

     - AAAGGATATG ACTGGAAAGA CATTAGGAGC TCAAGCTGGT TCATCTGGTT AT - #GCGGACTT   

      420                                                                      

     - TGAAGCAAAT CCAGAAATTT TGAAGAATAT TGTCGCTAAT AAGGAAGCGA AT - #CAATACCA   

      480                                                                      

     - AACCTTTAAT GAAGCCTTGA TTGATTTGAA AAACGATCGA ATTGATGGTC TA - #TTGATTGA   

      540                                                                      

     - CCGTGTCTAT GCAAACTATT ATTTAGAAGC AGAAGGTGTT TTAAACGATT AT - #AATGTCTT   

      600                                                                      

     - TACAGTTGGA CTAGAAACAG AAGCTTTTGC GGTTGGAGCC CGTAAGGAAG AT - #ACAAACTT   

      660                                                                      

     - GGTTAAGAAG ATAAATGAAG CTTTTTCTAG TCTTTACAAG GACGGCAAGT TC - #CAAGAAAT   

      720                                                                      

     - CAGCCAAAAA TGGTTTGGAG AAGATGTAGC AACCAAAGAA GTAAAAGAAG GA - #CAG        

      775                                                                      

     - (2) INFORMATION FOR SEQ ID NO:18:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 258 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:                            

     - Gly Lys Asn Ser Ser Glu Thr Ser Gly Asp As - #n Trp Ser Lys Tyr Gln     

     #                15                                                       

     - Ser Asn Lys Ser Ile Thr Ile Gly Phe Asp Se - #r Thr Phe Val Pro Met     

     #            30                                                           

     - Gly Phe Ala Gln Lys Asp Gly Ser Tyr Ala Gl - #y Phe Asp Ile Asp Leu     

     #        45                                                               

     - Ala Thr Ala Val Phe Glu Lys Tyr Gly Ile Th - #r Val Asn Trp Gln Pro     

     #    60                                                                   

     - Ile Asp Trp Asp Leu Lys Glu Ala Glu Leu Th - #r Lys Gly Thr Ile Asp     

     #80                                                                       

     - Leu Ile Trp Asn Gly Tyr Ser Ala Thr Asp Gl - #u Arg Arg Glu Lys Val     

     #                95                                                       

     - Ala Phe Ser Asn Ser Tyr Met Lys Asn Glu Gl - #n Val Leu Val Thr Lys     

     #           110                                                           

     - Lys Ser Ser Gly Ile Thr Thr Ala Lys Asp Me - #t Thr Gly Lys Thr Leu     

     #       125                                                               

     - Gly Ala Gln Ala Gly Ser Ser Gly Tyr Ala As - #p Phe Glu Ala Asn Pro     

     #   140                                                                   

     - Glu Ile Leu Lys Asn Ile Val Ala Asn Lys Gl - #u Ala Asn Gln Tyr Gln     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Thr Phe Asn Glu Ala Leu Ile Asp Leu Lys As - #n Asp Arg Ile Asp Gly     

     #               175                                                       

     - Leu Leu Ile Asp Arg Val Tyr Ala Asn Tyr Ty - #r Leu Glu Ala Glu Gly     

     #           190                                                           

     - Val Leu Asn Asp Tyr Asn Val Phe Thr Val Gl - #y Leu Glu Thr Glu Ala     

     #       205                                                               

     - Phe Ala Val Gly Ala Arg Lys Glu Asp Thr As - #n Leu Val Lys Lys Ile     

     #   220                                                                   

     - Asn Glu Ala Phe Ser Ser Leu Tyr Lys Asp Gl - #y Lys Phe Gln Glu Ile     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Ser Gln Lys Trp Phe Gly Glu Asp Val Ala Th - #r Lys Glu Val Lys Glu     

     #               255                                                       

     - Gly Gln                                                                 

     - (2) INFORMATION FOR SEQ ID NO: 19:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 868 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #19:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TGCTAGCGGA AAAAAAGATA CAACTTCTGG TCAAAAACTA AAAGTTGTTG CT - #ACAAACTC   

       60                                                                      

     - AATCATCGCT GATATTACTA AAAATATTGC TGGTGACAAA ATTGACCTTC AT - #AGTATCGT   

      120                                                                      

     - TCCGATTGGG CAAGACCCAC ACGAATACGA ACCACTTCCT GAAGACGTTA AG - #AAAACTTC   

      180                                                                      

     - TGAGGCTAAT TTGATTTTCT ATAACGGTAT CAACCTTGAA ACAGGTGGCA AT - #GCTTGGTT   

      240                                                                      

     - TACAAAATTG GTAGAAAATG CCAAGAAAAC TGAAAACAAA GACTACTTCG CA - #GTCAGCGA   

      300                                                                      

     - CGGCGTTGAT GTTATCTACC TTGAAGGTCA AAATGAAAAA GGAAAAGAAG AC - #CCACACGC   

      360                                                                      

     - TTGGCTTAAC CTTGAAAACG GTATTATTTT TGCTAAAAAT ATCGCCAAAC AA - #TTGAGCGC   

      420                                                                      

     - CAAAGACCCT AACAATAAAG AATTCTATGA AAAAAATCTC AAAGAATATA CT - #GATAAGTT   

      480                                                                      

     - AGACAAACTT GATAAAGAAA GTAAGGATAA ATTTAATAAG ATCCCTGCTG AA - #AAGAAACT   

      540                                                                      

     - CATTGTAACC AGCGAAGGAG CATTCAAATA CTTCTCTAAA GCCTATGGTG TC - #CCAAGTGC   

      600                                                                      

     - TTACATCTGG GAAATCAATA CTGAAGAAGA AGGAACTCCT GAACAAATCA AG - #ACCTTGGT   

      660                                                                      

     - TGAAAAACTT CGCCAAACAA AAGTTCCATC ACTCTTTGTA GAATCAAGTG TG - #GATGACCG   

      720                                                                      

     - TCCAATGAAA ACTGTTTCTC AAGACACAAA CATCCCAATC TACGCTCAAA TC - #TTTACTGA   

      780                                                                      

     - CTCTATCGCA GAACAAGGTA AAGAAGGCGA CAGCTACTAC AGCATGATGA AA - #TACAACCT   

      840                                                                      

     #            868   GGAT TGGCAAAA                                          

     - (2) INFORMATION FOR SEQ ID NO:20:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 289 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:                            

     - Ala Ser Gly Lys Lys Asp Thr Thr Ser Gly Gl - #n Lys Leu Lys Val Val     

     #                15                                                       

     - Ala Thr Asn Ser Ile Ile Ala Asp Ile Thr Ly - #s Asn Ile Ala Gly Asp     

     #            30                                                           

     - Lys Ile Asp Leu His Ser Ile Val Pro Ile Gl - #y Gln Asp Pro His Glu     

     #        45                                                               

     - Tyr Glu Pro Leu Pro Glu Asp Val Lys Lys Th - #r Ser Glu Ala Asn Leu     

     #    60                                                                   

     - Ile Phe Tyr Asn Gly Ile Asn Leu Glu Thr Gl - #y Gly Asn Ala Trp Phe     

     #80                                                                       

     - Thr Lys Leu Val Glu Asn Ala Lys Lys Thr Gl - #u Asn Lys Asp Tyr Phe     

     #                95                                                       

     - Ala Val Ser Asp Gly Val Asp Val Ile Tyr Le - #u Glu Gly Gln Asn Glu     

     #           110                                                           

     - Lys Gly Lys Glu Asp Pro His Ala Trp Leu As - #n Leu Glu Asn Gly Ile     

     #       125                                                               

     - Ile Phe Ala Lys Asn Ile Ala Lys Gln Leu Se - #r Ala Lys Asp Pro Asn     

     #   140                                                                   

     - Asn Lys Glu Phe Tyr Glu Lys Asn Leu Lys Gl - #u Tyr Thr Asp Lys Leu     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Asp Lys Leu Asp Lys Glu Ser Lys Asp Lys Ph - #e Asn Lys Ile Pro Ala     

     #               175                                                       

     - Glu Lys Lys Leu Ile Val Thr Ser Glu Gly Al - #a Phe Lys Tyr Phe Ser     

     #           190                                                           

     - Lys Ala Tyr Gly Val Pro Ser Ala Tyr Ile Tr - #p Glu Ile Asn Thr Glu     

     #       205                                                               

     - Glu Glu Gly Thr Pro Glu Gln Ile Lys Thr Le - #u Val Glu Lys Leu Arg     

     #   220                                                                   

     - Gln Thr Lys Val Pro Ser Leu Phe Val Glu Se - #r Ser Val Asp Asp Arg     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Pro Met Lys Thr Val Ser Gln Asp Thr Asn Il - #e Pro Ile Tyr Ala Gln     

     #               255                                                       

     - Ile Phe Thr Asp Ser Ile Ala Glu Gln Gly Ly - #s Glu Gly Asp Ser Tyr     

     #           270                                                           

     - Tyr Ser Met Met Lys Tyr Asn Leu Asp Lys Il - #e Ala Glu Gly Leu Ala     

     #       285                                                               

     - Lys                                                                     

     - (2) INFORMATION FOR SEQ ID NO: 21:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1546 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #21:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TGGCTCAAAA AATACAGCTT CAAGTCCAGA TTATAAGTTG GAAGGTGTAA CA - #TTCCCGCT   

       60                                                                      

     - TCAAGAAAAG AAAACATTGA AGTTTATGAC AGCCAGTTCA CCGTTATCTC CT - #AAAGACCC   

      120                                                                      

     - AAATGAAAAG TTAATTTTGC AACGTTTGGA GAAGGAAACT GGCGTTCATA TT - #GACTGGAC   

      180                                                                      

     - CAACTACCAA TCCGACTTTG CAGAAAAACG TAACTTGGAT ATTTCTAGTG GT - #GATTTACC   

      240                                                                      

     - AGATGCTATC CACAACGACG GAGCTTCAGA TGTGGACTTG ATGAACTGGG CT - #AAAAAAGG   

      300                                                                      

     - TGTTATTATT CCAGTTGAAG ATTTGATTGA TAAATACATG CCAAATCTTA AG - #AAAATTTT   

      360                                                                      

     - GGATGAGAAA CCAGAGTACA AGGCCTTGAT GACAGCACCT GATGGGCACA TT - #TACTCATT   

      420                                                                      

     - TCCATGGATT GAAGAGCTTG GAGATGGTAA AGAGTCTATT CACAGTGTCA AC - #GATATGGC   

      480                                                                      

     - TTGGATTAAC AAAGATTGGC TTAAGAAACT TGGTCTTGAA ATGCCAAAAA CT - #ACTGATGA   

      540                                                                      

     - TTTGATTAAA GTCCTAGAAG CTTTCAAAAA CGGGGATCCA AATGGAAATG GA - #GAGGCTGA   

      600                                                                      

     - TGAAATTCCA TTTTCATTTA TTAGTGGTAA CGGAAACGAA GATTTTAAAT TC - #CTATTTGC   

      660                                                                      

     - TGCATTTGGT ATAGGGGATA ACGATGATCA TTTAGTAGTA GGAAATGATG GC - #AAAGTTGA   

      720                                                                      

     - CTTCACAGCA GATAACGATA ACTATAAAGA AGGTGTCAAA TTTATCCGTC AA - #TTGCAAGA   

      780                                                                      

     - AAAAGGCCTG ATTGATAAAG AAGCTTTCGA ACATGATTGG AATAGTTACA TT - #GCTAAAGG   

      840                                                                      

     - TCATGATCAG AAATTTGGTG TTTACTTTAC ATGGGATAAG AATAATGTTA CT - #GGAAGTAA   

      900                                                                      

     - CGAAAGTTAT GATGTTTTAC CAGTACTTGC TGGACCAAGT GGTCAAAAAC AC - #GTAGCTCG   

      960                                                                      

     - TACAAACGGT ATGGGATTTG CACGTGACAA GATGGTTATT ACCAGTGTAA AC - #AAAAACCT   

     1020                                                                      

     - AGAATTGACA GCTAAATGGA TTGATGCACA ATACGCTCCA CTCCAATCTG TG - #CAAAATAA   

     1080                                                                      

     - CTGGGGAACT TACGGAGATG ACAAACAACA AAACATCTTT GAATTGGATC AA - #GCGTCAAA   

     1140                                                                      

     - TAGTCTAAAA CACTTACCAC TAAACGGAAC TGCACCAGCA GAACTTCGTC AA - #AAGACTGA   

     1200                                                                      

     - AGTAGGAGGA CCACTAGCTA TCCTAGATTC ATACTATGGT AAAGTAACAA CC - #ATGCCTGA   

     1260                                                                      

     - TGATGCCAAA TGGCGTTTGG ATCTTATCAA AGAATATTAT GTTCCTTACA TG - #AGCAATGT   

     1320                                                                      

     - CAATAACTAT CCAAGAGTCT TTATGACACA GGAAGATTTG GACAAGATTG CC - #CATATCGA   

     1380                                                                      

     - AGCAGATATG AATGACTATA TCTACCGTAA ACGTGCTGAA TGGATTGTAA AT - #GGCAATAT   

     1440                                                                      

     - TGATACTGAG TGGGATGATT ACAAGAAAGA ACTTGAAAAA TACGGACTTT CT - #GATTACCT   

     1500                                                                      

     #               1546ACT ACGACCAATA CCAAGCAAAC AAAAAC                      

     - (2) INFORMATION FOR SEQ ID NO:22:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 515 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:                            

     - Gly Ser Lys Asn Thr Ala Ser Ser Pro Asp Ty - #r Lys Leu Glu Gly Val     

     #                15                                                       

     - Thr Phe Pro Leu Gln Glu Lys Lys Thr Leu Ly - #s Phe Met Thr Ala Ser     

     #            30                                                           

     - Ser Pro Leu Ser Pro Lys Asp Pro Asn Glu Ly - #s Leu Ile Leu Gln Arg     

     #        45                                                               

     - Leu Glu Lys Glu Thr Gly Val His Ile Asp Tr - #p Thr Asn Tyr Gln Ser     

     #    60                                                                   

     - Asp Phe Ala Glu Lys Arg Asn Leu Asp Ile Se - #r Ser Gly Asp Leu Pro     

     #80                                                                       

     - Asp Ala Ile His Asn Asp Gly Ala Ser Asp Va - #l Asp Leu Met Asn Trp     

     #                95                                                       

     - Ala Lys Lys Gly Val Ile Ile Pro Val Glu As - #p Leu Ile Asp Lys Tyr     

     #           110                                                           

     - Met Pro Asn Leu Lys Lys Ile Leu Asp Glu Ly - #s Pro Glu Tyr Lys Ala     

     #       125                                                               

     - Leu Met Thr Ala Pro Asp Gly His Ile Tyr Se - #r Phe Pro Trp Ile Glu     

     #   140                                                                   

     - Glu Leu Gly Asp Gly Lys Glu Ser Ile His Se - #r Val Asn Asp Met Ala     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Trp Ile Asn Lys Asp Trp Leu Lys Lys Leu Gl - #y Leu Glu Met Pro Lys     

     #               175                                                       

     - Thr Thr Asp Asp Leu Ile Lys Val Leu Glu Al - #a Phe Lys Asn Gly Asp     

     #           190                                                           

     - Pro Asn Gly Asn Gly Glu Ala Asp Glu Ile Pr - #o Phe Ser Phe Ile Ser     

     #       205                                                               

     - Gly Asn Gly Asn Glu Asp Phe Lys Phe Leu Ph - #e Ala Ala Phe Gly Ile     

     #   220                                                                   

     - Gly Asp Asn Asp Asp His Leu Val Val Gly As - #n Asp Gly Lys Val Asp     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Phe Thr Ala Asp Asn Asp Asn Tyr Lys Glu Gl - #y Val Lys Phe Ile Arg     

     #               255                                                       

     - Gln Leu Gln Glu Lys Gly Leu Ile Asp Lys Gl - #u Ala Phe Glu His Asp     

     #           270                                                           

     - Trp Asn Ser Tyr Ile Ala Lys Gly His Asp Gl - #n Lys Phe Gly Val Tyr     

     #       285                                                               

     - Phe Thr Trp Asp Lys Asn Asn Val Thr Gly Se - #r Asn Glu Ser Tyr Asp     

     #   300                                                                   

     - Val Leu Pro Val Leu Ala Gly Pro Ser Gly Gl - #n Lys His Val Ala Arg     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Thr Asn Gly Met Gly Phe Ala Arg Asp Lys Me - #t Val Ile Thr Ser Val     

     #               335                                                       

     - Asn Lys Asn Leu Glu Leu Thr Ala Lys Trp Il - #e Asp Ala Gln Tyr Ala     

     #           350                                                           

     - Pro Leu Gln Ser Val Gln Asn Asn Trp Gly Th - #r Tyr Gly Asp Asp Lys     

     #       365                                                               

     - Gln Gln Asn Ile Phe Glu Leu Asp Gln Ala Se - #r Asn Ser Leu Lys His     

     #   380                                                                   

     - Leu Pro Leu Asn Gly Thr Ala Pro Ala Glu Le - #u Arg Gln Lys Thr Glu     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Val Gly Gly Pro Leu Ala Ile Leu Asp Ser Ty - #r Tyr Gly Lys Val Thr     

     #               415                                                       

     - Thr Met Pro Asp Asp Ala Lys Trp Arg Leu As - #p Leu Ile Lys Glu Tyr     

     #           430                                                           

     - Tyr Val Pro Tyr Met Ser Asn Val Asn Asn Ty - #r Pro Arg Val Phe Met     

     #       445                                                               

     - Thr Gln Glu Asp Leu Asp Lys Ile Ala His Il - #e Glu Ala Asp Met Asn     

     #   460                                                                   

     - Asp Tyr Ile Tyr Arg Lys Arg Ala Glu Trp Il - #e Val Asn Gly Asn Ile     

     465                 4 - #70                 4 - #75                 4 -   

     #80                                                                       

     - Asp Thr Glu Trp Asp Asp Tyr Lys Lys Glu Le - #u Glu Lys Tyr Gly Leu     

     #               495                                                       

     - Ser Asp Tyr Leu Ala Ile Lys Gln Lys Tyr Ty - #r Asp Gln Tyr Gln Ala     

     #           510                                                           

     - Asn Lys Asn                                                             

             515                                                               

     - (2) INFORMATION FOR SEQ ID NO: 23:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 895 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #23:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TAGTACAAAC TCAAGCACTA GTCAGACAGA GACCAGTAGC TCTGCTCCAA CA - #GAGGTAAC   

       60                                                                      

     - CATTAAAAGT TCACTGGACG AGGTCAAACT TTCCAAAGTT CCTGAAAAGA TT - #GTGACCTT   

      120                                                                      

     - TGACCTCGGC GCTGCGGATA CTATTCGCGC TTTAGGATTT GAAAAAAATA TC - #GTCGGAAT   

      180                                                                      

     - GCCTACAAAA ACTGTTCCGA CTTATCTAAA AGACCTAGTG GGAACTGTCA AA - #AATGTTGG   

      240                                                                      

     - TTCTATGAAA GAACCTGATT TAGAAGCTAT CGCCGCCCTT GAGCCTGATT TG - #ATTATCGC   

      300                                                                      

     - TTCGCCACGT ACACAAAAAT TCGTAGACAA ATTCAAAGAA ATCGCCCCAA CC - #GTTCTCTT   

      360                                                                      

     - CCAAGCAAGC AAGGACGACT ACTGGACTTC TACCAAGGCT AATATCGAAT CC - #TTAGCAAG   

      420                                                                      

     - TGCCTTCGGC GAAACTGGTA CACAGAAAGC CAAGGAAGAA TTGACCAAGC TA - #GACAAGAG   

      480                                                                      

     - CATCCAAGAA GTCGCTACTA AAAATGAAAG CTCTGACAAA AAAGCCCTTG CG - #ATCCTCCT   

      540                                                                      

     - TAATGAAGGA AAAATGGCAG CCTTTGGTGC CAAATCTCGT TTCTCTTTCT TG - #TACCAAAC   

      600                                                                      

     - CTTGAAATTC AAACCAACTG ATACAAAATT TGAAGACTCA CGCCACGGAC AA - #GAAGTCAG   

      660                                                                      

     - CTTTGAAAGT GTCAAAGAAA TCAACCCTGA CATCCTCTTT GTCATCAACC GT - #ACCCTTGC   

      720                                                                      

     - CATCGGTGGG GACAACTCTA GCAACGACGG TGTCCTAGAA AATGCCCTTA TC - #GCTGAAAC   

      780                                                                      

     - ACCTGCTGCT AAAAATGGTA AGATTATCCA ACTAACACCA GACCTCTGGT AT - #CTAAGCGG   

      840                                                                      

     - AGGCGGACTT GAATCAACAA AACTCATGAT TGAAGACATA CAAAAAGCTT TG - #AAA        

      895                                                                      

     - (2) INFORMATION FOR SEQ ID NO:24:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 298 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:                            

     - Ser Thr Asn Ser Ser Thr Ser Gln Thr Glu Th - #r Ser Ser Ser Ala Pro     

     #                15                                                       

     - Thr Glu Val Thr Ile Lys Ser Ser Leu Asp Gl - #u Val Lys Leu Ser Lys     

     #            30                                                           

     - Val Pro Glu Lys Ile Val Thr Phe Asp Leu Gl - #y Ala Ala Asp Thr Ile     

     #        45                                                               

     - Arg Ala Leu Gly Phe Glu Lys Asn Ile Val Gl - #y Met Pro Thr Lys Thr     

     #    60                                                                   

     - Val Pro Thr Tyr Leu Lys Asp Leu Val Gly Th - #r Val Lys Asn Val Gly     

     #80                                                                       

     - Ser Met Lys Glu Pro Asp Leu Glu Ala Ile Al - #a Ala Leu Glu Pro Asp     

     #                95                                                       

     - Leu Ile Ile Ala Ser Pro Arg Thr Gln Lys Ph - #e Val Asp Lys Phe Lys     

     #           110                                                           

     - Glu Ile Ala Pro Thr Val Leu Phe Gln Ala Se - #r Lys Asp Asp Tyr Trp     

     #       125                                                               

     - Thr Ser Thr Lys Ala Asn Ile Glu Ser Leu Al - #a Ser Ala Phe Gly Glu     

     #   140                                                                   

     - Thr Gly Thr Gln Lys Ala Lys Glu Glu Leu Th - #r Lys Leu Asp Lys Ser     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Ile Gln Glu Val Ala Thr Lys Asn Glu Ser Se - #r Asp Lys Lys Ala Leu     

     #               175                                                       

     - Ala Ile Leu Leu Asn Glu Gly Lys Met Ala Al - #a Phe Gly Ala Lys Ser     

     #           190                                                           

     - Arg Phe Ser Phe Leu Tyr Gln Thr Leu Lys Ph - #e Lys Pro Thr Asp Thr     

     #       205                                                               

     - Lys Phe Glu Asp Ser Arg His Gly Gln Glu Va - #l Ser Phe Glu Ser Val     

     #   220                                                                   

     - Lys Glu Ile Asn Pro Asp Ile Leu Phe Val Il - #e Asn Arg Thr Leu Ala     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Ile Gly Gly Asp Asn Ser Ser Asn Asp Gly Va - #l Leu Glu Asn Ala Leu     

     #               255                                                       

     - Ile Ala Glu Thr Pro Ala Ala Lys Asn Gly Ly - #s Ile Ile Gln Leu Thr     

     #           270                                                           

     - Pro Asp Leu Trp Tyr Leu Ser Gly Gly Gly Le - #u Glu Ser Thr Lys Leu     

     #       285                                                               

     - Met Ile Glu Asp Ile Gln Lys Ala Leu Lys                                 

     #   295                                                                   

     - (2) INFORMATION FOR SEQ ID NO: 25:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1261 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #25:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TGGCAATTCT GGCGGAAGTA AAGATGCTGC CAAATCAGGT GGTGACGGTG CC - #AAAACAGA   

       60                                                                      

     - AATCACTTGG TGGGCATTCC CAGTATTTAC CCAAGAAAAA ACTGGTGACG GT - #GTTGGAAC   

      120                                                                      

     - TTATGAAAAA TCAATCATCG AAGCGTTTGA AAAAGCAAAC CCAGATATAA AA - #GTGAAATT   

      180                                                                      

     - GGAAACCATC GACTTCAAGT CAGGTCCTGA AAAAATCACA ACAGCCATCG AA - #GCAGGAAC   

      240                                                                      

     - AGCTCCAGAC GTACTCTTTG ATGCACCAGG ACGTATCATC CAATACGGTA AA - #AACGGTAA   

      300                                                                      

     - ATTGGCTGAG TTGAATGACC TCTTCACAGA TGAATTTGTT AAAGATGTCA AC - #AATGAAAA   

      360                                                                      

     - CATCGTACAA GCAAGTAAAG CTGGAGACAA GGCTTATATG TATCCGATTA GT - #TCTGCCCC   

      420                                                                      

     - ATTCTACATG GCAATGAACA AGAAAATGTT AGAAGATGCT GGAGTAGCAA AC - #CTTGTAAA   

      480                                                                      

     - AGAAGGTTGG ACAACTGATG ATTTTGAAAA AGTATTGAAA GCACTTAAAG AC - #AAGGGTTA   

      540                                                                      

     - CACACCAGGT TCATTGTTCA GTTCTGGTCA AGGGGGAGAC CAAGGAACAC GT - #GCCTTTAT   

      600                                                                      

     - CTCTAACCTT TATAGCGGTT CTGTAACAGA TGAAAAAGTT AGCAAATATA CA - #ACTGATGA   

      660                                                                      

     - TCCTAAATTC GTCAAAGGTC TTGAAAAAGC AACTAGCTGG ATTAAAGACA AT - #TTGATCAA   

      720                                                                      

     - TAATGGTTCA CAATTTGACG GTGGGGCAGA TATCCAAAAC TTTGCCAACG GT - #CAAACATC   

      780                                                                      

     - TTACACAATC CTTTGGGCAC CAGCTCAAAA TGGTATCCAA GCTAAACTTT TA - #GAAGCAAG   

      840                                                                      

     - TAAGGTAGAA GTGGTAGAAG TACCATTCCC ATCAGACGAA GGTAAGCCAG CT - #CTTGAGTA   

      900                                                                      

     - CCTTGTAAAC GGGTTTGCAG TATTCAACAA TAAAGACGAC AAGAAAGTCG CT - #GCATCTAA   

      960                                                                      

     - GAAATTCATC CAGTTTATCG CAGATGACAA GGAGTGGGGA CCTAAAGACG TA - #GTTCGTAC   

     1020                                                                      

     - AGGTGCTTTC CCAGTCCGTA CTTCATTTGG AAAACTTTAT GAAGACAAAC GC - #ATGGAAAC   

     1080                                                                      

     - AATCAGCGGC TGGACTCAAT ACTACTCACC ATACTACAAC ACTATTGATG GA - #TTTGCTGA   

     1140                                                                      

     - AATGAGAACA CTTTGGTTCC CAATGTTGCA ATCTGTATCA AATGGTGACG AA - #AAACCAGC   

     1200                                                                      

     - AGATGCTTTG AAAGCCTTCA CTGAAAAAGC GAACGAAACA ATCAAAAAAG CT - #ATGAAACA   

     1260                                                                      

     #             1261                                                        

     - (2) INFORMATION FOR SEQ ID NO:26:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 420 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:                            

     - Gly Asn Ser Gly Gly Ser Lys Asp Ala Ala Ly - #s Ser Gly Gly Asp Gly     

     #                15                                                       

     - Ala Lys Thr Glu Ile Thr Trp Trp Ala Phe Pr - #o Val Phe Thr Gln Glu     

     #            30                                                           

     - Lys Thr Gly Asp Gly Val Gly Thr Tyr Glu Ly - #s Ser Ile Ile Glu Ala     

     #        45                                                               

     - Phe Glu Lys Ala Asn Pro Asp Ile Lys Val Ly - #s Leu Glu Thr Ile Asp     

     #    60                                                                   

     - Phe Lys Ser Gly Pro Glu Lys Ile Thr Thr Al - #a Ile Glu Ala Gly Thr     

     #80                                                                       

     - Ala Pro Asp Val Leu Phe Asp Ala Pro Gly Ar - #g Ile Ile Gln Tyr Gly     

     #                95                                                       

     - Lys Asn Gly Lys Leu Ala Glu Leu Asn Asp Le - #u Phe Thr Asp Glu Phe     

     #           110                                                           

     - Val Lys Asp Val Asn Asn Glu Asn Ile Val Gl - #n Ala Ser Lys Ala Gly     

     #       125                                                               

     - Asp Lys Ala Tyr Met Tyr Pro Ile Ser Ser Al - #a Pro Phe Tyr Met Ala     

     #   140                                                                   

     - Met Asn Lys Lys Met Leu Glu Asp Ala Gly Va - #l Ala Asn Leu Val Lys     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Glu Gly Trp Thr Thr Asp Asp Phe Glu Lys Va - #l Leu Lys Ala Leu Lys     

     #               175                                                       

     - Asp Lys Gly Tyr Thr Pro Gly Ser Leu Phe Se - #r Ser Gly Gln Gly Gly     

     #           190                                                           

     - Asp Gln Gly Thr Arg Ala Phe Ile Ser Asn Le - #u Tyr Ser Gly Ser Val     

     #       205                                                               

     - Thr Asp Glu Lys Val Ser Lys Tyr Thr Thr As - #p Asp Pro Lys Phe Val     

     #   220                                                                   

     - Lys Gly Leu Glu Lys Ala Thr Ser Trp Ile Ly - #s Asp Asn Leu Ile Asn     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Asn Gly Ser Gln Phe Asp Gly Gly Ala Asp Il - #e Gln Asn Phe Ala Asn     

     #               255                                                       

     - Gly Gln Thr Ser Tyr Thr Ile Leu Trp Ala Pr - #o Ala Gln Asn Gly Ile     

     #           270                                                           

     - Gln Ala Lys Leu Leu Glu Ala Ser Lys Val Gl - #u Val Val Glu Val Pro     

     #       285                                                               

     - Phe Pro Ser Asp Glu Gly Lys Pro Ala Leu Gl - #u Tyr Leu Val Asn Gly     

     #   300                                                                   

     - Phe Ala Val Phe Asn Asn Lys Asp Asp Lys Ly - #s Val Ala Ala Ser Lys     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Lys Phe Ile Gln Phe Ile Ala Asp Asp Lys Gl - #u Trp Gly Pro Lys Asp     

     #               335                                                       

     - Val Val Arg Thr Gly Ala Phe Pro Val Arg Th - #r Ser Phe Gly Lys Leu     

     #           350                                                           

     - Tyr Glu Asp Lys Arg Met Glu Thr Ile Ser Gl - #y Trp Thr Gln Tyr Tyr     

     #       365                                                               

     - Ser Pro Tyr Tyr Asn Thr Ile Asp Gly Phe Al - #a Glu Met Arg Thr Leu     

     #   380                                                                   

     - Trp Phe Pro Met Leu Gln Ser Val Ser Asn Gl - #y Asp Glu Lys Pro Ala     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Asp Ala Leu Lys Ala Phe Thr Glu Lys Ala As - #n Glu Thr Ile Lys Lys     

     #               415                                                       

     - Ala Met Lys Gln                                                         

                 420                                                           

     - (2) INFORMATION FOR SEQ ID NO: 27:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 658 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #27:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TTCACAAGAA AAAACAAAAA ATGAAGATGG AGAAACTAAG ACAGAACAGA CA - #GCCAAAGC   

       60                                                                      

     - TGATGGAACA GTCGGTAGTA AGTCTCAAGG AGCTGCCCAG AAGAAAGCAG AA - #GTGGTCAA   

      120                                                                      

     - TAAAGGTGAT TACTACAGCA TTCAAGGGAA ATACGATGAA ATCATCGTAG CC - #AACAAACA   

      180                                                                      

     - CTATCCATTG TCTAAAGACT ATAATCCAGG GGAAAATCCA ACAGCCAAGG CA - #GAGTTGGT   

      240                                                                      

     - CAAACTCATC AAAGCGATGC AAGAGGCAGG TTTCCCTATT AGTGATCATT AC - #AGTGGTTT   

      300                                                                      

     - TAGAAGTTAT GAAACTCAGA CCAAGCTCTA TCAAGATTAT GTCAACCAAG AT - #GGAAAGGC   

      360                                                                      

     - AGCAGCTGAC CGTTACTCTG CCCGTCCTGG CTATAGCGAA CACCAGACAG GC - #TTGGCCTT   

      420                                                                      

     - TGATGTGATT GGGACTGATG GTGATTTGGT GACAGAAGAA AAAGCAGCCC AA - #TGGCTCTT   

      480                                                                      

     - GGATCATGCA GCTGATTATG GCTTTGTTGT CCGTTATCTC AAAGGCAAGG AA - #AAGGAAAC   

      540                                                                      

     - AGGCTATATG GCTGAAGAAT GGCACCTGCG TTATGTAGGA AAAGAAGCTA AA - #GAAATTGC   

      600                                                                      

     - TGCAAGTGGT CTCAGTTTGG AAGAATACTA TGGCTTTGAA GGCGGAGACT AC - #GTCGAT     

      658                                                                      

     - (2) INFORMATION FOR SEQ ID NO:28:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 219 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:28:                            

     - Ser Gln Glu Lys Thr Lys Asn Glu Asp Gly Gl - #u Thr Lys Thr Glu Gln     

     #                15                                                       

     - Thr Ala Lys Ala Asp Gly Thr Val Gly Ser Ly - #s Ser Gln Gly Ala Ala     

     #            30                                                           

     - Gln Lys Lys Ala Glu Val Val Asn Lys Gly As - #p Tyr Tyr Ser Ile Gln     

     #        45                                                               

     - Gly Lys Tyr Asp Glu Ile Ile Val Ala Asn Ly - #s His Tyr Pro Leu Ser     

     #    60                                                                   

     - Lys Asp Tyr Asn Pro Gly Glu Asn Pro Thr Al - #a Lys Ala Glu Leu Val     

     #80                                                                       

     - Lys Leu Ile Lys Ala Met Gln Glu Ala Gly Ph - #e Pro Ile Ser Asp His     

     #                95                                                       

     - Tyr Ser Gly Phe Arg Ser Tyr Glu Thr Gln Th - #r Lys Leu Tyr Gln Asp     

     #           110                                                           

     - Tyr Val Asn Gln Asp Gly Lys Ala Ala Ala As - #p Arg Tyr Ser Ala Arg     

     #       125                                                               

     - Pro Gly Tyr Ser Glu His Gln Thr Gly Leu Al - #a Phe Asp Val Ile Gly     

     #   140                                                                   

     - Thr Asp Gly Asp Leu Val Thr Glu Glu Lys Al - #a Ala Gln Trp Leu Leu     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Asp His Ala Ala Asp Tyr Gly Phe Val Val Ar - #g Tyr Leu Lys Gly Lys     

     #               175                                                       

     - Glu Lys Glu Thr Gly Tyr Met Ala Glu Glu Tr - #p His Leu Arg Tyr Val     

     #           190                                                           

     - Gly Lys Glu Ala Lys Glu Ile Ala Ala Ser Gl - #y Leu Ser Leu Glu Glu     

     #       205                                                               

     - Tyr Tyr Gly Phe Glu Gly Gly Asp Tyr Val As - #p                         

     #   215                                                                   

     - (2) INFORMATION FOR SEQ ID NO: 29:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 790 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #29:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - GAAAGGTCTG TGGTCAAATA ATCTTACCTG CGGTTATGAT GAAAAAATAA TC - #TTGGAAAA   

       60                                                                      

     - TATAAATATA AAAATACCTG AAGAAAAAAT ATCAGTTATT ATTGGGTCAA AT - #GGTTGTGG   

      120                                                                      

     - GAAATCAACA CTCATTAAAA CCTTGTCTCG ACTTATAAAG CCATTAGAGG GA - #GAAGTATT   

      180                                                                      

     - GCTTGATAAT AAATCAATTA ATTCTTATAA AGAAAAAGAT TTAGCAAAAC AC - #ATAGCTAT   

      240                                                                      

     - ATTACCTCAA TCTCCAATAA TCCCTGAATC AATAACAGTA GCTGATCTTG TA - #AGCCGTGG   

      300                                                                      

     - TCGTTTCCCC TACAGAAAGC CTTTTAAGAG TCTTGGAAAA GATGACCTTG AA - #ATAATAAA   

      360                                                                      

     - CAGATCAATG GTTAAGGCCA ATGTTGAAGA TCTAGCAAAT AACCTAGTTG AA - #GAACTTTC   

      420                                                                      

     - TGGGGGTCAA AGGCAAAGAG TATGGATAGC TCTAGCCCTA GCCCAAGATA CA - #AGTATCCT   

      480                                                                      

     - ACTTTTAGAT GAGCCAACTA CTTACTTGGA TATCTCATAT CAAATAGAAC TA - #TTAGACCT   

      540                                                                      

     - CTTGACTGAT CTAAACCAAA AATATAAGAC AACCATTTGC ATGATTTTGC AC - #GATATAAA   

      600                                                                      

     - TCTAACAGCA AGATACGCTG ATTACCTATT TGCAATTAAA GAAGGTAAAC TT - #GTTGCAGA   

      660                                                                      

     - GGGAAAGCCT GAAGATATAC TAAATGATAA ACTAGTTAAA GATATCTTTA AT - #CTTGAAGC   

      720                                                                      

     - AAAAATTATA CGTGACCCTA TTTCCAATTC GCCTCTAATG ATTCCTATTG GC - #AAGCACCA   

      780                                                                      

     #       790                                                               

     - (2) INFORMATION FOR SEQ ID NO:30:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 262 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:30:                            

     - Lys Gly Leu Trp Ser Asn Asn Leu Thr Cys Gl - #y Tyr Asp Glu Lys Ile     

     #                15                                                       

     - Ile Leu Glu Asn Ile Asn Ile Lys Ile Pro Gl - #u Glu Lys Ile Ser Val     

     #            30                                                           

     - Ile Ile Gly Ser Asn Gly Cys Gly Lys Ser Th - #r Leu Ile Lys Thr Leu     

     #        45                                                               

     - Ser Arg Leu Ile Lys Pro Leu Glu Gly Glu Va - #l Leu Leu Asp Asn Lys     

     #    60                                                                   

     - Ser Ile Asn Ser Tyr Lys Glu Lys Asp Leu Al - #a Lys His Ile Ala Ile     

     #80                                                                       

     - Leu Pro Gln Ser Pro Ile Ile Pro Glu Ser Il - #e Thr Val Ala Asp Leu     

     #                95                                                       

     - Val Ser Arg Gly Arg Phe Pro Tyr Arg Lys Pr - #o Phe Lys Ser Leu Gly     

     #           110                                                           

     - Lys Asp Asp Leu Glu Ile Ile Asn Arg Ser Me - #t Val Lys Ala Asn Val     

     #       125                                                               

     - Glu Asp Leu Ala Asn Asn Leu Val Glu Glu Le - #u Ser Gly Gly Gln Arg     

     #   140                                                                   

     - Gln Arg Val Trp Ile Ala Leu Ala Leu Ala Gl - #n Asp Thr Ser Ile Leu     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Leu Leu Asp Glu Pro Thr Thr Tyr Leu Asp Il - #e Ser Tyr Gln Ile Glu     

     #               175                                                       

     - Leu Leu Asp Leu Leu Thr Asp Leu Asn Gln Ly - #s Tyr Lys Thr Thr Ile     

     #           190                                                           

     - Cys Met Ile Leu His Asp Ile Asn Leu Thr Al - #a Arg Tyr Ala Asp Tyr     

     #       205                                                               

     - Leu Phe Ala Ile Lys Glu Gly Lys Leu Val Al - #a Glu Gly Lys Pro Glu     

     #   220                                                                   

     - Asp Ile Leu Asn Asp Lys Leu Val Lys Asp Il - #e Phe Asn Leu Glu Ala     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Lys Ile Ile Arg Asp Pro Ile Ser Asn Ser Pr - #o Leu Met Ile Pro Ile     

     #               255                                                       

     - Gly Lys His His Val Ser                                                 

                 260                                                           

     - (2) INFORMATION FOR SEQ ID NO: 31:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 781 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #31:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - AAACTCAGAA AAGAAAGCAG ACAATGCAAC AACTATCAAA ATCGCAACTG TT - #AACCGTAG   

       60                                                                      

     - CGGTTCTGAA GAAAAACGTT GGGACAAAAT CCAAGAATTG GTTAAAAAAG AC - #GGAATTAC   

      120                                                                      

     - CTTGGAATTT ACAGAGTTCA CAGACTACTC ACAACCAAAC AAAGCAACTG CT - #GATGGCGA   

      180                                                                      

     - AGTAGATTTG AACGCTTTCC AACACTATAA CTTCTTGAAC AACTGGAACA AA - #GAAAACGG   

      240                                                                      

     - AAAAGACCTT GTAGCGATTG CAGATACTTA CATCTCTCCA ATCCGCCTTT AC - #TCAGGTTT   

      300                                                                      

     - GAATGGAAGT GCCAACAAGT ACACTAAAGT AGAAGACATC CCAGCAAACG GA - #GAAATCGC   

      360                                                                      

     - TGTACCGAAT GACGCTACAA ACGAAAGCCG TGCGCTTTAT TTGCTTCAAT CA - #GCTGGCTT   

      420                                                                      

     - GATTAAATTG GATGTTTCTG GAACTGCTCT TGCAACAGTT GCCAACATCA AA - #GAAAATCC   

      480                                                                      

     - AAAGAACTTG AAAATCACTG AATTGGACGC TAGCCAAACA GCTCGTTCAT TG - #TCATCAGT   

      540                                                                      

     - TGACGCTGCC GTTGTAAACA ATACCTTCGT TACAGAAGCA AAATTGGACT AC - #AAGAAATC   

      600                                                                      

     - ACTTTTCAAA GAACAAGCTG ATGAAAACTC AAAACAATGG TACAACATCA TT - #GTTGCAAA   

      660                                                                      

     - AAAAGATTGG GAAACATCAC CTAAGGCTGA TGCTATCAAG AAAGTAATCG CA - #GCTTACCA   

      720                                                                      

     - CACAGATGAC GTGAAAAAAG TTATCGAAGA ATCATCAGAT GGTTTGGATC AA - #CCAGTTTG   

      780                                                                      

     #              781                                                        

     - (2) INFORMATION FOR SEQ ID NO:32:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 260 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:32:                            

     - Asn Ser Glu Lys Lys Ala Asp Asn Ala Thr Th - #r Ile Lys Ile Ala Thr     

     #                15                                                       

     - Val Asn Arg Ser Gly Ser Glu Glu Lys Arg Tr - #p Asp Lys Ile Gln Glu     

     #            30                                                           

     - Leu Val Lys Lys Asp Gly Ile Thr Leu Glu Ph - #e Thr Glu Phe Thr Asp     

     #        45                                                               

     - Tyr Ser Gln Pro Asn Lys Ala Thr Ala Asp Gl - #y Glu Val Asp Leu Asn     

     #    60                                                                   

     - Ala Phe Gln His Tyr Asn Phe Leu Asn Asn Tr - #p Asn Lys Glu Asn Gly     

     #80                                                                       

     - Lys Asp Leu Val Ala Ile Ala Asp Thr Tyr Il - #e Ser Pro Ile Arg Leu     

     #                95                                                       

     - Tyr Ser Gly Leu Asn Gly Ser Ala Asn Lys Ty - #r Thr Lys Val Glu Asp     

     #           110                                                           

     - Ile Pro Ala Asn Gly Glu Ile Ala Val Pro As - #n Asp Ala Thr Asn Glu     

     #       125                                                               

     - Ser Arg Ala Leu Tyr Leu Leu Gln Ser Ala Gl - #y Leu Ile Lys Leu Asp     

     #   140                                                                   

     - Val Ser Gly Thr Ala Leu Ala Thr Val Ala As - #n Ile Lys Glu Asn Pro     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Lys Asn Leu Lys Ile Thr Glu Leu Asp Ala Se - #r Gln Thr Ala Arg Ser     

     #               175                                                       

     - Leu Ser Ser Val Asp Ala Ala Val Val Asn As - #n Thr Phe Val Thr Glu     

     #           190                                                           

     - Ala Lys Leu Asp Tyr Lys Lys Ser Leu Phe Ly - #s Glu Gln Ala Asp Glu     

     #       205                                                               

     - Asn Ser Lys Gln Trp Tyr Asn Ile Ile Val Al - #a Lys Lys Asp Trp Glu     

     #   220                                                                   

     - Thr Ser Pro Lys Ala Asp Ala Ile Lys Lys Va - #l Ile Ala Ala Tyr His     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Thr Asp Asp Val Lys Lys Val Ile Glu Glu Se - #r Ser Asp Gly Leu Asp     

     #               255                                                       

     - Gln Pro Val Trp                                                         

                 260                                                           

     - (2) INFORMATION FOR SEQ ID NO: 33:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 640 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #33:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TTCGAAAGGG TCAGAAGGTG CAGACCTTAT CAGCATGAAA GGGGATGTCA TT - #ACAGAACA   

       60                                                                      

     - TCAATTTTAT GAGCAAGTGA AAAGCAACCC TTCAGCCCAA CAAGTCTTGT TA - #AATATGAC   

      120                                                                      

     - CATCCAAAAA GTTTTTGAAA AACAATATGG CTCAGAGCTT GATGATAAAG AG - #GTTGATGA   

      180                                                                      

     - TACTATTGCC GAAGAAAAAA AACAATATGG CGAAAACTAC CAACGTGTCT TG - #TCACAAGC   

      240                                                                      

     - AGGTATGACT CTTGAAACAC GTAAAGCTCA AATTCGTACA AGTAAATTAG TT - #GAGTTGGC   

      300                                                                      

     - AGTTAAGAAG GTAGCAGAAG CTGAATTGAC AGATGAAGCC TATAAGAAAG CC - #TTTGATGA   

      360                                                                      

     - GTACACTCCA GATGTAACGG CTCAAATCAT CCGTCTTAAT AATGAAGATA AG - #GCCAAAGA   

      420                                                                      

     - AGTTCTCGAA AAAGCCAAGG CAGAAGGTGC TGATTTTGCT CAATTAGCCA AA - #GATAATTC   

      480                                                                      

     - AACTGATGAA AAAACAAAAG AAAATGGTGG AGAAATTACC TTTGATTCTG CT - #TCAACAGA   

      540                                                                      

     - AGTACCTGGA GCAAGTCCAA AAAAGCCGCT TTTCGCTTTT AGATGTGGGA TG - #GTGTTTCT   

      600                                                                      

     #   640            AACT GGGGCACACC AAGCCTACAG                             

     - (2) INFORMATION FOR SEQ ID NO:34:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 213 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:34:                            

     - Ser Lys Gly Ser Glu Gly Ala Asp Leu Ile Se - #r Met Lys Gly Asp Val     

     #                15                                                       

     - Ile Thr Glu His Gln Phe Tyr Glu Gln Val Ly - #s Ser Asn Pro Ser Ala     

     #            30                                                           

     - Gln Gln Val Leu Leu Asn Met Thr Ile Gln Ly - #s Val Phe Glu Lys Gln     

     #        45                                                               

     - Tyr Gly Ser Glu Leu Asp Asp Lys Glu Val As - #p Asp Thr Ile Ala Glu     

     #    60                                                                   

     - Glu Lys Lys Gln Tyr Gly Glu Asn Tyr Gln Ar - #g Val Leu Ser Gln Ala     

     #80                                                                       

     - Gly Met Thr Leu Glu Thr Arg Lys Ala Gln Il - #e Arg Thr Ser Lys Leu     

     #                95                                                       

     - Val Glu Leu Ala Val Lys Lys Val Ala Glu Al - #a Glu Leu Thr Asp Glu     

     #           110                                                           

     - Ala Tyr Lys Lys Ala Phe Asp Glu Tyr Thr Pr - #o Asp Val Thr Ala Gln     

     #       125                                                               

     - Ile Ile Arg Leu Asn Asn Glu Asp Lys Ala Ly - #s Glu Val Leu Glu Lys     

     #   140                                                                   

     - Ala Lys Ala Glu Gly Ala Asp Phe Ala Gln Le - #u Ala Lys Asp Asn Ser     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Thr Asp Glu Lys Thr Lys Glu Asn Gly Gly Gl - #u Ile Thr Phe Asp Ser     

     #               175                                                       

     - Ala Ser Thr Glu Val Pro Gly Ala Ser Pro Ly - #s Lys Pro Leu Phe Ala     

     #           190                                                           

     - Phe Arg Cys Gly Met Val Phe Leu Asp Val As - #p Tyr Ser Asn Trp Gly     

     #       205                                                               

     - Thr Pro Ser Leu Gln                                                     

         210                                                                   

     - (2) INFORMATION FOR SEQ ID NO: 35:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 631 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #35:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - GGGGATGGCA GCTTTTAAAA ATCCTAACAA TCAATACAAA GCTATTACAA TT - #GCTCAAAC   

       60                                                                      

     - TCTAGGTGAT GATGCTTCTT CAGAGGAATT GGCTGGTAGA TATGGTTCTG CT - #GTTCAGTG   

      120                                                                      

     - TACAGAAGTG ACTGCCTCAA ACCTTTCAAC AGTTAAAACT AAAGCTACGG TT - #GTAGAAAA   

      180                                                                      

     - ACCACTGAAA GATTTTAGAG CGTCTACGTC TGATCAGTCT GGTTGGGTGG AA - #TCTAATGG   

      240                                                                      

     - TAAATGGTAT TTCTATGAGT CTGGTGATGT GAAGACAGGT TGGGTGAAAA CA - #GATGGTAA   

      300                                                                      

     - ATGGTACTAT TTGAATGACT TAGGTGTCAT GCAGACTGGA TTTGTAAAAT TT - #TCTGGTAG   

      360                                                                      

     - CTGGTATTAC TTGAGCAATT CAGGTGCTAT GTTTACAGGC TGGGGAACAG AT - #GGTAGCAG   

      420                                                                      

     - ATGGTTCTAC TTTGACGGCT CAGGAGCTAT GAAGACAGGC TGGTACAAGG AA - #AATGGCAC   

      480                                                                      

     - TTGGTATTAC CTTGACGAAG CAGGTATCAT GAAGACAGGT TGGTTTAAAG TC - #GGACCACA   

      540                                                                      

     - CTGGTACTAT GCCTACGGTT CAGGAGCTTT GGCTGTGAGC ACAACAACAC CA - #GATGGTTA   

      600                                                                      

     #         631      GGTG AATGGGTAAA C                                      

     - (2) INFORMATION FOR SEQ ID NO:36:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 210 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:36:                            

     - Gly Met Ala Ala Phe Lys Asn Pro Asn Asn Gl - #n Tyr Lys Ala Ile Thr     

     #                15                                                       

     - Ile Ala Gln Thr Leu Gly Asp Asp Ala Ser Se - #r Glu Glu Leu Ala Gly     

     #            30                                                           

     - Arg Tyr Gly Ser Ala Val Gln Cys Thr Glu Va - #l Thr Ala Ser Asn Leu     

     #        45                                                               

     - Ser Thr Val Lys Thr Lys Ala Thr Val Val Gl - #u Lys Pro Leu Lys Asp     

     #    60                                                                   

     - Phe Arg Ala Ser Thr Ser Asp Gln Ser Gly Tr - #p Val Glu Ser Asn Gly     

     #80                                                                       

     - Lys Trp Tyr Phe Tyr Glu Ser Gly Asp Val Ly - #s Thr Gly Trp Val Lys     

     #                95                                                       

     - Thr Asp Gly Lys Trp Tyr Tyr Leu Asn Asp Le - #u Gly Val Met Gln Thr     

     #           110                                                           

     - Gly Phe Val Lys Phe Ser Gly Ser Trp Tyr Ty - #r Leu Ser Asn Ser Gly     

     #       125                                                               

     - Ala Met Phe Thr Gly Trp Gly Thr Asp Gly Se - #r Arg Trp Phe Tyr Phe     

     #   140                                                                   

     - Asp Gly Ser Gly Ala Met Lys Thr Gly Trp Ty - #r Lys Glu Asn Gly Thr     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Trp Tyr Tyr Leu Asp Glu Ala Gly Ile Met Ly - #s Thr Gly Trp Phe Lys     

     #               175                                                       

     - Val Gly Pro His Trp Tyr Tyr Ala Tyr Gly Se - #r Gly Ala Leu Ala Val     

     #           190                                                           

     - Ser Thr Thr Thr Pro Asp Gly Tyr Arg Val As - #n Gly Asn Gly Glu Trp     

     #       205                                                               

     - Val Asn                                                                 

         210                                                                   

     - (2) INFORMATION FOR SEQ ID NO: 37:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1360 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #37:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - AGACGAGCAA AAAATTAAGC AAGCAGAAGC GGAAGTTGAG AGTAAACAAG CT - #GAGGCTAC   

       60                                                                      

     - AAGGTTAAAA AAAATCAAGA CAGATCGTGA AGAAGCAGAA GAAGAAGCTA AA - #CGAAGAGC   

      120                                                                      

     - AGATGCTAAA GAGCAAGGTA AACCAAAGGG GCGGGCAAAA CGAGGAGTTC CT - #GGAGAGCT   

      180                                                                      

     - AGCAACACCT GATAAAAAAG AAAATGATGC GAAGTCTTCA GATTCTAGCG TA - #GGTGAAGA   

      240                                                                      

     - AACTCTTCCA AGCCCATCCC TGAAACCAGA AAAAAAGGTA GCAGAAGCTG AG - #AAGAAGGT   

      300                                                                      

     - TGAAGAAGCT AAGAAAAAAG CCGAGGATCA AAAAGAAGAA GATCGCCGTA AC - #TACCCAAC   

      360                                                                      

     - CAATACTTAC AAAACGCTTG AACTTGAAAT TGCTGAGTCC GATGTGGAAG TT - #AAAAAAGC   

      420                                                                      

     - GGAGCTTGAA CTAGTAAAAG AGGAAGCTAA GGAACCTCGA AACGAGGAAA AA - #GTTAAGCA   

      480                                                                      

     - AGCAAAAGCG GAAGTTGAGA GTAAAAAAGC TGAGGCTACA AGGTTAGAAA AA - #ATCAAGAC   

      540                                                                      

     - AGATCGTAAA AAAGCAGAAG AAGAAGCTAA ACGAAAAGCA GCAGAAGAAG AT - #AAAGTTAA   

      600                                                                      

     - AGAAAAACCA GCTGAACAAC CACAACCAGC GCCGGCTCCA AAAGCAGAAA AA - #CCAGCTCC   

      660                                                                      

     - AGCTCCAAAA CCAGAGAATC CAGCTGAACA ACCAAAAGCA GAAAAACCAG CT - #GATCAACA   

      720                                                                      

     - AGCTGAAGAA GACTATGCTC GTAGATCAGA AGAAGAATAT AATCGCTTGA CT - #CAACAGCA   

      780                                                                      

     - ACCGCCAAAA ACTGAAAAAC CAGCACAACC ATCTACTCCA AAAACAGGCT GG - #AAACAAGA   

      840                                                                      

     - AAACGGTATG TGGTACTTCT ACAATACTGA TGGTTCAATG GCGACAGGAT GG - #CTCCAAAA   

      900                                                                      

     - CAATGGCTCA TGGTACTACC TCAACAGCAA TGGCGCTATG GCGACAGGAT GG - #CTCCAAAA   

      960                                                                      

     - CAATGGTTCA TGGTACTATC TAAACGCTAA TGGTTCAATG GCAACAGGAT GG - #CTCCAAAA   

     1020                                                                      

     - CAATGGTTCA TGGTACTACC TAAACGCTAA TGGTTCAATG GCGACAGGAT GG - #CTCCAATA   

     1080                                                                      

     - CAATGGCTCA TGGTACTACC TAAACGCTAA TGGTTCAATG GCGACAGGAT GG - #CTCCAATA   

     1140                                                                      

     - CAATGGCTCA TGGTACTACC TAAACGCTAA TGGTGATATG GCGACAGGTT GG - #GTGAAAGA   

     1200                                                                      

     - TGGAGATACC TGGTACTATC TTGAAGCATC AGGTGCTATG AAAGCAAGCC AA - #TGGTTCAA   

     1260                                                                      

     - AGTATCAGAT AAATGGTACT ATGTCAATGG CTCAGGTGCC CTTGCAGTCA AC - #ACAACTGT   

     1320                                                                      

     #  1360            AATG CCAATGGTGA ATGGGTAAAC                             

     - (2) INFORMATION FOR SEQ ID NO:38:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 453 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:38:                            

     - Asp Glu Gln Lys Ile Lys Gln Ala Glu Ala Gl - #u Val Glu Ser Lys Gln     

     #                15                                                       

     - Ala Glu Ala Thr Arg Leu Lys Lys Ile Lys Th - #r Asp Arg Glu Glu Ala     

     #            30                                                           

     - Glu Glu Glu Ala Lys Arg Arg Ala Asp Ala Ly - #s Glu Gln Gly Lys Pro     

     #        45                                                               

     - Lys Gly Arg Ala Lys Arg Gly Val Pro Gly Gl - #u Leu Ala Thr Pro Asp     

     #    60                                                                   

     - Lys Lys Glu Asn Asp Ala Lys Ser Ser Asp Se - #r Ser Val Gly Glu Glu     

     #80                                                                       

     - Thr Leu Pro Ser Pro Ser Leu Lys Pro Glu Ly - #s Lys Val Ala Glu Ala     

     #                95                                                       

     - Glu Lys Lys Val Glu Glu Ala Lys Lys Lys Al - #a Glu Asp Gln Lys Glu     

     #           110                                                           

     - Glu Asp Arg Arg Asn Tyr Pro Thr Asn Thr Ty - #r Lys Thr Leu Glu Leu     

     #       125                                                               

     - Glu Ile Ala Glu Ser Asp Val Glu Val Lys Ly - #s Ala Glu Leu Glu Leu     

     #   140                                                                   

     - Val Lys Glu Glu Ala Lys Glu Pro Arg Asn Gl - #u Glu Lys Val Lys Gln     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Ala Lys Ala Glu Val Glu Ser Lys Lys Ala Gl - #u Ala Thr Arg Leu Glu     

     #               175                                                       

     - Lys Ile Lys Thr Asp Arg Lys Lys Ala Glu Gl - #u Glu Ala Lys Arg Lys     

     #           190                                                           

     - Ala Ala Glu Glu Asp Lys Val Lys Glu Lys Pr - #o Ala Glu Gln Pro Gln     

     #       205                                                               

     - Pro Ala Pro Ala Pro Lys Ala Glu Lys Pro Al - #a Pro Ala Pro Lys Pro     

     #   220                                                                   

     - Glu Asn Pro Ala Glu Gln Pro Lys Ala Glu Ly - #s Pro Ala Asp Gln Gln     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Ala Glu Glu Asp Tyr Ala Arg Arg Ser Glu Gl - #u Glu Tyr Asn Arg Leu     

     #               255                                                       

     - Thr Gln Gln Gln Pro Pro Lys Thr Glu Lys Pr - #o Ala Gln Pro Ser Thr     

     #           270                                                           

     - Pro Lys Thr Gly Trp Lys Gln Glu Asn Gly Me - #t Trp Tyr Phe Tyr Asn     

     #       285                                                               

     - Thr Asp Gly Ser Met Ala Thr Gly Trp Leu Gl - #n Asn Asn Gly Ser Trp     

     #   300                                                                   

     - Tyr Tyr Leu Asn Ser Asn Gly Ala Met Ala Th - #r Gly Trp Leu Gln Asn     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gl - #y Ser Met Ala Thr Gly     

     #               335                                                       

     - Trp Leu Gln Asn Asn Gly Ser Trp Tyr Tyr Le - #u Asn Ala Asn Gly Ser     

     #           350                                                           

     - Met Ala Thr Gly Trp Leu Gln Tyr Asn Gly Se - #r Trp Tyr Tyr Leu Asn     

     #       365                                                               

     - Ala Asn Gly Ser Met Ala Thr Gly Trp Leu Gl - #n Tyr Asn Gly Ser Trp     

     #   380                                                                   

     Tyr Tyr Leu Asn Ala Asn Gly Asp Met Ala Th - #r Gly Trp Val Lys Asp       

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Gly Asp Thr Trp Tyr Tyr Leu Glu Ala Ser Gl - #y Ala Met Lys Ala Ser     

     #               415                                                       

     - Gln Trp Phe Lys Val Ser Asp Lys Trp Tyr Ty - #r Val Asn Gly Ser Gly     

     #           430                                                           

     - Ala Leu Ala Val Asn Thr Thr Val Asp Gly Ty - #r Gly Val Asn Ala Asn     

     #       445                                                               

     - Gly Glu Trp Val Asn                                                     

         450                                                                   

     - (2) INFORMATION FOR SEQ ID NO: 39:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 412 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #39:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - CTGTGGTGAG GAAGAAACTA AAAAGACTCA AGCAGCACAA CAGCCAAAAC AA - #CAAACGAC   

       60                                                                      

     - TGTACAACAA ATTGCTGTTG GAAAAGATGC TCCAGACTTC ACATTGCAAT CC - #ATGGATGG   

      120                                                                      

     - CAAAGAAGTT AAGTTATCTG ATTTTAAGGG TAAAAAGGTT TACTTGAAGT TT - #TGGGCTTC   

      180                                                                      

     - ATGGTGTGGT CCATGCAAGA AAAGTATGCC AGAGTTGATG GAACTAGCGG CG - #AAACCAGA   

      240                                                                      

     - TCGTGATTTC GAAATTCTTA CTGTCATTGC ACCAGGAATT CAAGGTGAAA AA - #ACTGTTGA   

      300                                                                      

     - GCAATTCCCA CAATGGTTCC AGGAACAAGG ATATAAGGAT ATCCCAGTTC TT - #TATGATAC   

      360                                                                      

     - CAAAGCAACC ACTTCCAAGC TTATCAAATT CGAAGCATTC CTACAGAATA TT - #           

      412                                                                      

     - (2) INFORMATION FOR SEQ ID NO:40:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 137 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:40:                            

     - Cys Gly Glu Glu Glu Thr Lys Lys Thr Gln Al - #a Ala Gln Gln Pro Lys     

     #                15                                                       

     - Gln Gln Thr Thr Val Gln Gln Ile Ala Val Gl - #y Lys Asp Ala Pro Asp     

     #            30                                                           

     - Phe Thr Leu Gln Ser Met Asp Gly Lys Glu Va - #l Lys Leu Ser Asp Phe     

     #        45                                                               

     - Lys Gly Lys Lys Val Tyr Leu Lys Phe Trp Al - #a Ser Trp Cys Gly Pro     

     #    60                                                                   

     - Cys Lys Lys Ser Met Pro Glu Leu Met Glu Le - #u Ala Ala Lys Pro Asp     

     #80                                                                       

     - Arg Asp Phe Glu Ile Leu Thr Val Ile Ala Pr - #o Gly Ile Gln Gly Glu     

     #                95                                                       

     - Lys Thr Val Glu Gln Phe Pro Gln Trp Phe Gl - #n Glu Gln Gly Tyr Lys     

     #           110                                                           

     - Asp Ile Pro Val Leu Tyr Asp Thr Lys Ala Th - #r Thr Ser Lys Leu Ile     

     #       125                                                               

     - Lys Phe Glu Ala Phe Leu Gln Asn Ile                                     

     #   135                                                                   

     - (2) INFORMATION FOR SEQ ID NO: 41:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1462 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #41:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - GACTTTTAAC AATAAAACTA TTGAAGAGTT GCACAATCTC CTTGTCTCTA AG - #GAAATTTC   

       60                                                                      

     - TGCAACAGAA TTGACCCAAG CAACACTTGA AAATATCAAG TCTCGTGAGG AA - #GCCCTCAA   

      120                                                                      

     - TTCATTTGTC ACCATCGCTG AGGAGCAAGC TCTTGTTCAA GCTAAAGCCA TT - #GATGAAGC   

      180                                                                      

     - tGGAATTGAT GCTGACAATG TCCTTTCAGG AATTCCACTT GCTGTTAAGG AT - #AACATCTC   

      240                                                                      

     - TACAGACGGT ATTCTCACAA CTGCTGCCTC AAAAATGCTC TACAACTATG AG - #CCAATCTT   

      300                                                                      

     - TGATGCGACa gCTgTTGCCA ATGCAAAAAC CAAGGGCATG ATTGTCGTTG GA - #AAGACCAA   

      360                                                                      

     - CATGGACGAA TTTGCTATGG GTGGTTCAGG tGAAACTTCA CACTACGGAG CA - #ACTAAAAA   

      420                                                                      

     - CGCTTGGAAC CACAGCAAGG TTCCTGGTGG GTCATCAAGT GGTTCTGCCG CA - #GCTGTAGC   

      480                                                                      

     - CTCAGGACAA GTTCGCTTGT CACTTGGTTC TGATACTGGT GGTTCCATCC GC - #CAACCTGC   

      540                                                                      

     - TGCCTTCAAC GGAATCGTTG GTCTCAAACC AACCTACGGA ACAGTTTCAC GT - #TTCGGTCT   

      600                                                                      

     - CATTGCCTTT GGTAGCTCAT TAGACCAGAT TGGACCTTTT GCTCCTACTG TT - #AAGGAAAA   

      660                                                                      

     - TGCCCTCTTG CTCAACGCTA TTGCCAGCGA AGATGCTAAA GACTCTACTT CT - #GCTCCTGT   

      720                                                                      

     - CCGCATCGCC GACTTTACTT CAAAAATCGG CCAAGACATC AAGGGTATGA AA - #ATCGCTTT   

      780                                                                      

     - GCCTAAGGAA TACCTAGGCG AAGGAATTGA TCCAGAGGTT AAGGAAACAA TC - #TTAAACGC   

      840                                                                      

     - GGCCAAACAC TTTGAAAAAT TGGGTGCTAT CGTCGAAGAA GTCAGCCTTC CT - #CACTCTAA   

      900                                                                      

     - ATACGGTGTT GCCGTTTATT ACATCATCGC TTCATCAGAA GCTTCATCAA AC - #TTGCAACG   

      960                                                                      

     - CTTCGACGGT ATCCGTTACG GCTATCGCGC AGAAGATGCA ACCAACCTTG AT - #GAAATCTA   

     1020                                                                      

     - TGTAAACAGC CGAAGCCAAG GTTTTGGTGA AGAGGTAAAA CGTCGTATCA TG - #CTGGGTAC   

     1080                                                                      

     - TTTCAGTCTT TCATCAGGTT ACTATGATGC CTACTACAAA AAGGCTGGTC AA - #GTCCGTAC   

     1140                                                                      

     - CCTCATCATT CAAGATTTCG AAAAAGTCTT CGCGGATTAC GATTTGATTT TG - #GGTCCAAC   

     1200                                                                      

     - TGCTCCAAGT GTTGCCTATG ACTTGGATTC TCTCAACCAT GACCCAGTTG CC - #ATGTACTT   

     1260                                                                      

     - AGCCGACCTA TTGACCATAC CTGTAAACTT GGCAGGACTG CCTGGAATTT CG - #ATTCCTGC   

     1320                                                                      

     - TGGATTCTCT CAAGGTCTAC CTGTCGGACT CCAATTGATT GGTCCCAAGT AC - #TCTGAGGA   

     1380                                                                      

     - AACCATTTAC CAAGCTGCTG CTGCTTTTGA AGCAACAACA GACTACCACA AA - #CAACAACC   

     1440                                                                      

     #               1462ACA AC                                                

     - (2) INFORMATION FOR SEQ ID NO:42:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 487 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:42:                            

     - Thr Phe Asn Asn Lys Thr Ile Glu Glu Leu Hi - #s Asn Leu Leu Val Ser     

     #                15                                                       

     - Lys Glu Ile Ser Ala Thr Glu Leu Thr Gln Al - #a Thr Leu Glu Asn Ile     

     #            30                                                           

     - Lys Ser Arg Glu Glu Ala Leu Asn Ser Phe Va - #l Thr Ile Ala Glu Glu     

     #        45                                                               

     - Gln Ala Leu Val Gln Ala Lys Ala Ile Asp Gl - #u Ala Gly Ile Asp Ala     

     #    60                                                                   

     - Asp Asn Val Leu Ser Gly Ile Pro Leu Ala Va - #l Lys Asp Asn Ile Ser     

     #80                                                                       

     - Thr Asp Gly Ile Leu Thr Thr Ala Ala Ser Ly - #s Met Leu Tyr Asn Tyr     

     #                95                                                       

     - Glu Pro Ile Phe Asp Ala Thr Ala Val Ala As - #n Ala Lys Thr Lys Gly     

     #           110                                                           

     - Met Ile Val Val Gly Lys Thr Asn Met Asp Gl - #u Phe Ala Met Gly Gly     

     #       125                                                               

     - Ser Gly Glu Thr Ser His Tyr Gly Ala Thr Ly - #s Asn Ala Trp Asn His     

     #   140                                                                   

     - Ser Lys Val Pro Gly Gly Ser Ser Ser Gly Se - #r Ala Ala Ala Val Ala     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Ser Gly Gln Val Arg Leu Ser Leu Gly Ser As - #p Thr Gly Gly Ser Ile     

     #               175                                                       

     - Arg Gln Pro Ala Ala Phe Asn Gly Ile Val Gl - #y Leu Lys Pro Thr Tyr     

     #           190                                                           

     - Gly Thr Val Ser Arg Phe Gly Leu Ile Ala Ph - #e Gly Ser Ser Leu Asp     

     #       205                                                               

     - Gln Ile Gly Pro Phe Ala Pro Thr Val Lys Gl - #u Asn Ala Leu Leu Leu     

     #   220                                                                   

     - Asn Ala Ile Ala Ser Glu Asp Ala Lys Asp Se - #r Thr Ser Ala Pro Val     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Arg Ile Ala Asp Phe Thr Ser Lys Ile Gly Gl - #n Asp Ile Lys Gly Met     

     #               255                                                       

     - Lys Ile Ala Leu Pro Lys Glu Tyr Leu Gly Gl - #u Gly Ile Asp Pro Glu     

     #           270                                                           

     - Val Lys Glu Thr Ile Leu Asn Ala Ala Lys Hi - #s Phe Glu Lys Leu Gly     

     #       285                                                               

     - Ala Ile Val Glu Glu Val Ser Leu Pro His Se - #r Lys Tyr Gly Val Ala     

     #   300                                                                   

     - Val Tyr Tyr Ile Ile Ala Ser Ser Glu Ala Se - #r Ser Asn Leu Gln Arg     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Phe Asp Gly Ile Arg Tyr Gly Tyr Arg Ala Gl - #u Asp Ala Thr Asn Leu     

     #               335                                                       

     - Asp Glu Ile Tyr Val Asn Ser Arg Ser Gln Gl - #y Phe Gly Glu Glu Val     

     #           350                                                           

     - Lys Arg Arg Ile Met Leu Gly Thr Phe Ser Le - #u Ser Ser Gly Tyr Tyr     

     #       365                                                               

     - Asp Ala Tyr Tyr Lys Lys Ala Gly Gln Val Ar - #g Thr Leu Ile Ile Gln     

     #   380                                                                   

     - Asp Phe Glu Lys Val Phe Ala Asp Tyr Asp Le - #u Ile Leu Gly Pro Thr     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Ala Pro Ser Val Ala Tyr Asp Leu Asp Ser Le - #u Asn His Asp Pro Val     

     #               415                                                       

     - Ala Met Tyr Leu Ala Asp Leu Leu Thr Ile Pr - #o Val Asn Leu Ala Gly     

     #           430                                                           

     - Leu Pro Gly Ile Ser Ile Pro Ala Gly Phe Se - #r Gln Gly Leu Pro Val     

     #       445                                                               

     - Gly Leu Gln Leu Ile Gly Pro Lys Tyr Ser Gl - #u Glu Thr Ile Tyr Gln     

     #   460                                                                   

     - Ala Ala Ala Ala Phe Glu Ala Thr Thr Asp Ty - #r His Lys Gln Gln Pro     

     465                 4 - #70                 4 - #75                 4 -   

     #80                                                                       

     - Val Ile Phe Gly Gly Asp Asn                                             

                     485                                                       

     - (2) INFORMATION FOR SEQ ID NO: 43:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 433 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #43:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - CTTTACAGGT AAACAACTAC AAGTCGGCGA CAAGGCGCTT GATTTTTCTC TT - #ACTACAAC   

       60                                                                      

     - AGATCTTTCT AAAAAATCTC TGGCTGATTT TGATGGCAAG AAAAAAGTCT TG - #AGTGTCGT   

      120                                                                      

     - TCCTTCTATC GATACAGGCA TCTGCTCAAC TCAAACACGT CGTTTTAATG AA - #GAATTGGC   

      180                                                                      

     - TGGACTGGAC AACACGGTCG TATTGACTGT TTCAATGGAC CTACCTTTTG CT - #CAAAAACG   

      240                                                                      

     - TTGGTGCGGT GCTGAAGGCC TTGACAATGC CATTATGCTT TCAGACTACT TT - #GACCATTC   

      300                                                                      

     - TTTCGGGCGC GATTATGCCC TCTTGATCAA CGAATGGCAC CTATTAGCAC GC - #GCAGTCTT   

      360                                                                      

     - TGTCCTCGAT ACTGACAATA CGATTCGCTA CGTTGAATAC GTGGATAATA TC - #AATTCTGA   

      420                                                                      

     #     433                                                                 

     - (2) INFORMATION FOR SEQ ID NO:44:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 144 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:44:                            

     - Phe Thr Gly Lys Gln Leu Gln Val Gly Asp Ly - #s Ala Leu Asp Phe Ser     

     #                15                                                       

     - Leu Thr Thr Thr Asp Leu Ser Lys Lys Ser Le - #u Ala Asp Phe Asp Gly     

     #            30                                                           

     - Lys Lys Lys Val Leu Ser Val Val Pro Ser Il - #e Asp Thr Gly Ile Cys     

     #        45                                                               

     - Ser Thr Gln Thr Arg Arg Phe Asn Glu Glu Le - #u Ala Gly Leu Asp Asn     

     #    60                                                                   

     - Thr Val Val Leu Thr Val Ser Met Asp Leu Pr - #o Phe Ala Gln Lys Arg     

     #80                                                                       

     - Trp Cys Gly Ala Glu Gly Leu Asp Asn Ala Il - #e Met Leu Ser Asp Tyr     

     #                95                                                       

     - Phe Asp His Ser Phe Gly Arg Asp Tyr Ala Le - #u Leu Ile Asn Glu Trp     

     #           110                                                           

     - His Leu Leu Ala Arg Ala Val Phe Val Leu As - #p Thr Asp Asn Thr Ile     

     #       125                                                               

     - Arg Tyr Val Glu Tyr Val Asp Asn Ile Asn Se - #r Glu Pro Asn Phe Glu     

     #   140                                                                   

     - (2) INFORMATION FOR SEQ ID NO: 45:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 724 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #45:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - CCAGGCTGAT ACAAGTATCG CAGACATTCA AAAAAGAGGC GAACTGGTTG TC - #GGTGTCAA   

       60                                                                      

     - ACAAGACGTT CCCAATTTTG GTTACAAnGA TCCCAAGACC GGTACTTATT CT - #GGTATCGA   

      120                                                                      

     - AaCCGACTTG GCCAAGATGG TAGCTGATGA ACTCAAGGTC AAGATTCGCT AT - #GTGCCGGT   

      180                                                                      

     - TACAGCACAA ACCCGCGGCC CCCTTCTAGA CAATGAACAG GTCGATATGG AT - #ATCGCGAC   

      240                                                                      

     - CTTTACCATC ACGGACGAAC GCAAAAAACT CTACAACTTT ACCAGTCCCT AC - #TACACAGA   

      300                                                                      

     - CGCTTCTGGA TTTTTGGTCA ATAAATCTGC CAAAATCAAA AAGATTGAGG AC - #CTAAACGG   

      360                                                                      

     - CAAAACCATC GGAGTCGCCC AAGGTTCTAT CACCCAACGC CTGATTACTG AA - #CTGGGTAA   

      420                                                                      

     - AAAGAAAGGT CTGAAGTTTA AATTCGTCGA ACTTGGTTCC TACCCAGAAT TG - #ATTACTTC   

      480                                                                      

     - CCTGCACGCT CATCGTATCG ATACCTTTTC CGTTGACCGC TCTATTCTAT CT - #GGCTACAC   

      540                                                                      

     - TAGTAAACGG ACAGCACTAC TAGATGATAG TTTCAAGCCA TCTGACTACG GT - #ATTGTTAC   

      600                                                                      

     - CAAGAAATCA AATACAGAGC TCAACGACTA TCTTGATAAC TTGGTTACTA AA - #TGGAGCAA   

      660                                                                      

     - GGATGGTAGT TTGCAGAAAC TTTATGACCG TTACAAGCTC AAACCATCTA GC - #CATACTGC   

      720                                                                      

     #            724                                                          

     - (2) INFORMATION FOR SEQ ID NO:46:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 241 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     #ID NO:46:(xi) SEQUENCE DESCRIPTION: SEQ                                  

     -      Gln Ala Asp Thr Ser Ile Ala Asp - # Ile Gln Lys Arg Gly Glu Leu    

     Val                                                                       

     #   15                                                                    

     -      Val Gly Val Lys Gln Asp Val Pro - # Asn Phe Gly Tyr Xaa Asp Pro    

     Lys                                                                       

     #                 30                                                      

     -      Thr Gly Thr Tyr Ser Gly Ile Glu - # Thr Asp Leu Ala Lys Met Val    

     Ala                                                                       

     #             45                                                          

     -      Asp Glu Leu Lys Val Lys Ile Arg - # Tyr Val Pro Val Thr Ala Gln    

     Thr                                                                       

     #         60                                                              

     -      Arg Gly Pro Leu Leu Asp Asn Glu - # Gln Val Asp Met Asp Ile Ala    

     Thr                                                                       

     #     80                                                                  

     -      Phe Thr Ile Thr Asp Glu Arg Lys - # Lys Leu Tyr Asn Phe Thr Ser    

     Pro                                                                       

     #   95                                                                    

     -      Tyr Tyr Thr Asp Ala Ser Gly Phe - # Leu Val Asn Lys Ser Ala Lys    

     Ile                                                                       

     #                110                                                      

     -      Lys Lys Ile Glu Asp Leu Asn Gly - # Lys Thr Ile Gly Val Ala Gln    

     Gly                                                                       

     #            125                                                          

     -      Ser Ile Thr Gln Arg Leu Ile Thr - # Glu Leu Gly Lys Lys Lys Gly    

     Leu                                                                       

     #        140                                                              

     -      Lys Phe Lys Phe Val Glu Leu Gly - # Ser Tyr Pro Glu Leu Ile Thr    

     Ser                                                                       

     #    160                                                                  

     -      Leu His Ala His Arg Ile Asp Thr - # Phe Ser Val Asp Arg Ser Ile    

     Leu                                                                       

     #   175                                                                   

     -      Ser Gly Tyr Thr Ser Lys Arg Thr - # Ala Leu Leu Asp Asp Ser Phe    

     Lys                                                                       

     #                190                                                      

     -      Pro Ser Asp Tyr Gly Ile Val Thr - # Lys Lys Ser Asn Thr Glu Leu    

     Asn                                                                       

     #            205                                                          

     -      Asp Tyr Leu Asp Asn Leu Val Thr - # Lys Trp Ser Lys Asp Gly Ser    

     Leu                                                                       

     #        220                                                              

     -      Gln Lys Leu Tyr Asp Arg Tyr Lys - # Leu Lys Pro Ser Ser His Thr    

     Ala                                                                       

     #    240                                                                  

     -      Asp                                                                

     - (2) INFORMATION FOR SEQ ID NO: 47:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1279 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #SEQ ID NO: 47: SEQUENCE DESCRIPTION:                                     

     - GTCTGTATCA TTTGAAAACA AAGAAACAAA CCGTGGTGTC TTgACTTTCA CT - #ATCTCTCA   

       60                                                                      

     - AGACCAAATC AAACCAGAAT TGGACCGTGT CTTCAAGtCA GTGAAGAAAT CT - #CTTAATGT   

      120                                                                      

     - TCCAGGTTTC CGTAAAGGTC ACCTTCCACG CCCTATCTTC GACCAAAAAT TT - #GGTGAAGA   

      180                                                                      

     - AGCTCTTTAT CAAGATGCAA TGAACGCACT TTTGCCAAAC GCTTATGAAG CA - #GCTGTAAA   

      240                                                                      

     - AGAAGCTGGT CTTGAAGTGG TTGCCCAACC AAAAATTGAC GTAACTTCAA TG - #GAAAAAGG   

      300                                                                      

     - TCAAGACTGG GTTATCACTG CTGAAGTCGT TACAAAACCT GAAGTAAAAT TG - #GGTGACTA   

      360                                                                      

     - CAAAAACCTT GAAGTATCAG TTGATGTAGA AAAAGAAGTA ACTGACGCTG AT - #GTCGAAGA   

      420                                                                      

     - GCGTATCGAA CGCGAACGCA ACAACCTGGC TGAATTGGTT ATCAAGGAAG CT - #GCTGCTGA   

      480                                                                      

     - AAACGGCGAC ACTGTTGTGA TCGACTTCGT TGGTTCTATC GACGGTGTTG AA - #TTTGACGG   

      540                                                                      

     - TGGAAAAGGT GAAAACTTCT CACTTGGACT TGGTTCAGGT CAATTCATCC CT - #GGTTTCGA   

      600                                                                      

     - AGACCAATTG GTAGGTCACT CAGCTGGCGA AACCGTTGAT GTTATCGTAA CA - #TTCCCAGA   

      660                                                                      

     - AGACTACCAA GCAGAAGACC TTGCAGGTAA AGAAGCTAAA TTCGTGACAA CT - #ATCCACGA   

      720                                                                      

     - AGTAAAAGCT AAAGAAGTTC CGGCTCTTGA CGATGAACTT GCAAAAGACA TT - #GATGAAGA   

      780                                                                      

     - AGTTGAAACA CTTGCTGACT TGAAAGAAAA ATACAGCAAA GAATTGGCTG CT - #GCTAAAGA   

      840                                                                      

     - AGAAGCTTAC AAAGATGCAG TTGAAGGTGC AGCAATTGAT ACAGCTGTAG AA - #AATGCTGA   

      900                                                                      

     - AATCGTAGAA CTTCCAGAAG AAATGATCCA TGAAGAAGTT CACCGTTCAG TA - #AATGAATT   

      960                                                                      

     - CCTTGGGAAT TTGCAACGTC AAGGGATCAA CCCTGACATG TACTTCCAAA TC - #ACTGGAAC   

     1020                                                                      

     - TACTCAAGAA GACCTTCACA ACCAATACCA AGCAGAAGCT GAGTCACGTA CT - #AAGACTAA   

     1080                                                                      

     - CCTTGTTATC GAAGCAGTTG CCAAAGCTGA AGGATTTGAT GCTTCAGAAG AA - #GAAATCCA   

     1140                                                                      

     - AAAAGAAGTT GAGCAATTGG CAGCAGACTA CAACATGGAA GTTGCACAAG TT - #CAAAACTT   

     1200                                                                      

     - GCTTTCAGCT GACATGTTGA AACATGATAT CACTATCAAA AAAGCTGTTG AA - #TTGATCAC   

     1260                                                                      

     #                 127 - #9                                                

     - (2) INFORMATION FOR SEQ ID NO:48:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 426 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     #ID NO:48:(xi) SEQUENCE DESCRIPTION: SEQ                                  

     -      Ser Val Ser Phe Glu Asn Lys Glu - # Thr Asn Arg Gly Val Leu Thr    

     Phe                                                                       

     #   15                                                                    

     -      Thr Ile Ser Gln Asp Gln Ile Lys - # Pro Glu Leu Asp Arg Val Phe    

     Lys                                                                       

     #                 30                                                      

     -      Ser Val Lys Lys Ser Leu Asn Val - # Pro Gly Phe Arg Lys Gly His    

     Leu                                                                       

     #             45                                                          

     -      Pro Arg Pro Ile Phe Asp Gln Lys - # Phe Gly Glu Glu Ala Leu Tyr    

     Gln                                                                       

     #         60                                                              

     -      Asp Ala Met Asn Ala Leu Leu Pro - # Asn Ala Tyr Glu Ala Ala Val    

     Lys                                                                       

     #     80                                                                  

     -      Glu Ala Gly Leu Glu Val Val Ala - # Gln Pro Lys Ile Asp Val Thr    

     Ser                                                                       

     #   95                                                                    

     -      Met Glu Lys Gly Gln Asp Trp Val - # Ile Thr Ala Glu Val Val Thr    

     Lys                                                                       

     #                110                                                      

     -      Pro Glu Val Lys Leu Gly Asp Tyr - # Lys Asn Leu Glu Val Ser Val    

     Asp                                                                       

     #            125                                                          

     -      Val Glu Lys Glu Val Thr Asp Ala - # Asp Val Glu Glu Arg Ile Glu    

     Arg                                                                       

     #        140                                                              

     -      Glu Arg Asn Asn Leu Ala Glu Leu - # Val Ile Lys Glu Ala Ala Ala    

     Glu                                                                       

     #    160                                                                  

     -      Asn Gly Asp Thr Val Val Ile Asp - # Phe Val Gly Ser Ile Asp Gly    

     Val                                                                       

     #   175                                                                   

     -      Glu Phe Asp Gly Gly Lys Gly Glu - # Asn Phe Ser Leu Gly Leu Gly    

     Ser                                                                       

     #                190                                                      

     -      Gly Gln Phe Ile Pro Gly Phe Glu - # Asp Gln Leu Val Gly His Ser    

     Ala                                                                       

     #            205                                                          

     -      Gly Glu Thr Val Asp Val Ile Val - # Thr Phe Pro Glu Asp Tyr Gln    

     Ala                                                                       

     #        220                                                              

     -      Glu Asp Leu Ala Gly Lys Glu Ala - # Lys Phe Val Thr Thr Ile His    

     Glu                                                                       

     #    240                                                                  

     -      Val Lys Ala Lys Glu Val Pro Ala - # Leu Asp Asp Glu Leu Ala Lys    

     Asp                                                                       

     #   255                                                                   

     -      Ile Asp Glu Glu Val Glu Thr Leu - # Ala Asp Leu Lys Glu Lys Tyr    

     Ser                                                                       

     #                270                                                      

     -      Lys Glu Leu Ala Ala Ala Lys Glu - # Glu Ala Tyr Lys Asp Ala Val    

     Glu                                                                       

     #            285                                                          

     -      Gly Ala Ala Ile Asp Thr Ala Val - # Glu Asn Ala Glu Ile Val Glu    

     Leu                                                                       

     #        300                                                              

     -      Pro Glu Glu Met Ile His Glu Glu - # Val His Arg Ser Val Asn Glu    

     Phe                                                                       

     #    320                                                                  

     -      Leu Gly Asn Leu Gln Arg Gln Gly - # Ile Asn Pro Asp Met Tyr Phe    

     Gln                                                                       

     #   335                                                                   

     -      Ile Thr Gly Thr Thr Gln Glu Asp - # Leu His Asn Gln Tyr Gln Ala    

     Glu                                                                       

     #                350                                                      

     -      Ala Glu Ser Arg Thr Lys Thr Asn - # Leu Val Ile Glu Ala Val Ala    

     Lys                                                                       

     #            365                                                          

     -      Ala Glu Gly Phe Asp Ala Ser Glu - # Glu Glu Ile Gln Lys Glu Val    

     Glu                                                                       

     #        380                                                              

     -      Gln Leu Ala Ala Asp Tyr Asn Met - # Glu Val Ala Gln Val Gln Asn    

     Leu                                                                       

     #    400                                                                  

     -      Leu Ser Ala Asp Met Leu Lys His - # Asp Ile Thr Ile Lys Lys Ala    

     Val                                                                       

     #   415                                                                   

     -      Glu Leu Ile Thr Ser Thr Ala Thr - # Val Lys                        

     #                425                                                      

     - (2) INFORMATION FOR SEQ ID NO: 49:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 490 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #SEQ ID NO: 49: SEQUENCE DESCRIPTION:                                     

     - TGGTCAAAAG GAAAGTCAGA CAGGAAAGGG GATGAAAATT GTGACCAGTT TT - #TATCCTAT   

       60                                                                      

     - CTACGCTATG GTTAAGGAAG TATCTGGTGA CTTGAATGAT GTTCGGATGA TT - #CAGTCAAG   

      120                                                                      

     - TAGTGGTATT CACTCCTTTG AACCTTCGGC AAATGATATC GCAGCCATCT AT - #GATGCAGA   

      180                                                                      

     - TGTCTTTGTT TACCATTCTC ATACACTCGA ATCTTGGGCA GGAAGTCTGG AT - #CCAAATCT   

      240                                                                      

     - AAAAAAATCC AAAGTGAAGG TCTTAGAGGC TTCTGAGGGA ATGACCTTGG AA - #CGTGTCCC   

      300                                                                      

     - TGGACTAGAG GATGTGGAAG CAGGGGATGG AGTTGATGAA AAAACGCTCT AT - #GACCCTCA   

      360                                                                      

     - CACATGGCTA GATCCTGAAA AAGCTGGAGA AGAAGCCCAA ATTATCGCTG AT - #AAACTTTC   

      420                                                                      

     - AGAGGTGGAT AGTGAGCATA AAGAGACTTA TCAAAAAAAT GCGCAACCTT TA - #TCAAAAAA   

      480                                                                      

     #       490                                                               

     - (2) INFORMATION FOR SEQ ID NO:50:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 163 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     #ID NO:50:(xi) SEQUENCE DESCRIPTION: SEQ                                  

     -      Gly Gln Lys Glu Ser Gln Thr Gly - # Lys Gly Met Lys Ile Val Thr    

     Ser                                                                       

     #   15                                                                    

     -      Phe Tyr Pro Ile Tyr Ala Met Val - # Lys Glu Val Ser Gly Asp Leu    

     Asn                                                                       

     #                 30                                                      

     -      Asp Val Arg Met Ile Gln Ser Ser - # Ser Gly Ile His Ser Phe Glu    

     Pro                                                                       

     #             45                                                          

     -      Ser Ala Asn Asp Ile Ala Ala Ile - # Tyr Asp Ala Asp Val Phe Val    

     Tyr                                                                       

     #         60                                                              

     -      His Ser His Thr Leu Glu Ser Trp - # Ala Gly Ser Leu Asp Pro Asn    

     Leu                                                                       

     #     80                                                                  

     -      Lys Lys Ser Lys Val Lys Val Leu - # Glu Ala Ser Glu Gly Met Thr    

     Leu                                                                       

     #   95                                                                    

     -      Glu Arg Val Pro Gly Leu Glu Asp - # Val Glu Ala Gly Asp Gly Val    

     Asp                                                                       

     #                110                                                      

     -      Glu Lys Thr Leu Tyr Asp Pro His - # Thr Trp Leu Asp Pro Glu Lys    

     Ala                                                                       

     #            125                                                          

     -      Gly Glu Glu Ala Gln Ile Ile Ala - # Asp Lys Leu Ser Glu Val Asp    

     Ser                                                                       

     #        140                                                              

     -      Glu His Lys Glu Thr Tyr Gln Lys - # Asn Ala Gln Pro Leu Ser Lys    

     Lys                                                                       

     #    160                                                                  

     -      Leu Arg Asn                                                        

     - (2) INFORMATION FOR SEQ ID NO: 51:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1006 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #SEQ ID NO: 51: SEQUENCE DESCRIPTION:                                     

     - GAAGGATAGA TATATTTTAG CATTTGAGAC ATCCTGTGAT GAGACCAGTG TC - #GCCGTCTT   

       60                                                                      

     - GAAAAACGAC GATGAGCTCT TGTCCAATGT CATTGCTAGT CAAATTGAGA GT - #CACAAACG   

      120                                                                      

     - TTTTGGTGGC GTAGTGCCCG AAGTAGCCAG TCGTCACCAT GTCGAGGTCA TT - #ACAGCCTG   

      180                                                                      

     - TATCGAGGAG GCATTGGCAG AAGCAGGGAT TACCGAAGAG GACGTGACAG CT - #GTTGCGGT   

      240                                                                      

     - TACCTACGGA CCAGGCTTGG TCGGAGCCTT GCTAGTTGGT TTGTCAGCTG CC - #AAGGCCTT   

      300                                                                      

     - TGCTTGGGCT CACGGACTTC CACTGATTCC TGTTAATCAC ATGGCTGGGC AC - #CTCATGGC   

      360                                                                      

     - AGCTCAGAGT GTGGAGCCTT TGGAGTTTCC CTTGCTAGCC CTCTTGGTCA GC - #GGCGGACA   

      420                                                                      

     - CACAGAGTTG GTTTATGTTT CGGAGGCAGG AGATTATAAG ATTGTTGGGG AA - #ACCCGTGA   

      480                                                                      

     - TGATGCGGTT GGTGAGGCTT ATGATAAGGT CGGCCGTGTC ATGGGCTTGA CC - #TATCCTGC   

      540                                                                      

     - AGGTCGTGAG ATTGACGAGC TGGCTCATCA GGGGCAGGAT ATTTATGATT TC - #CCCCGTGC   

      600                                                                      

     - CATGATTAAG GAAGATAATC TGGAGTTCTC CTTCTCAGGT TTGAAATCTG CC - #TTTATCAA   

      660                                                                      

     - TCTTCATCAC AATGCCGAGC AAAAGGGAGA AAGCCTGTCT ACAGAAGATT TG - #TGTGCTTC   

      720                                                                      

     - CTTCCAAGCA GCAGTTATGG ACATTCTCAT GGCAAAAACC AAGAAGGCTT TG - #GAGAAATA   

      780                                                                      

     - TCCTGTTAAA ATCCTAGTTG TGGCAGGTGG TGTGGCAGCC AATAAAGGTC TC - #AGAGAACG   

      840                                                                      

     - CCTAGCAGCC GAAATCACAG ATGTCAAGGT TATCATCCCC CCTCTGCGAC TC - #TGCGGAGA   

      900                                                                      

     - CAATGCAGGT ATGATTGCCT ATGCCAGCGT CAGCNAGTGG AACAAAGAAA AC - #TTCGCAGG   

      960                                                                      

     #               1006AAC CAAGTCTTGC CTTTGATACC ATGGAA                      

     - (2) INFORMATION FOR SEQ ID NO:52:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 335 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     #ID NO:52:(xi) SEQUENCE DESCRIPTION: SEQ                                  

     -      Lys Asp Arg Tyr Ile Leu Ala Phe - # Glu Thr Ser Cys Asp Glu Thr    

     Ser                                                                       

     #   15                                                                    

     -      Val Ala Val Leu Lys Asn Asp Asp - # Glu Leu Leu Ser Asn Val Ile    

     Ala                                                                       

     #                 30                                                      

     -      Ser Gln Ile Glu Ser His Lys Arg - # Phe Gly Gly Val Val Pro Glu    

     Val                                                                       

     #             45                                                          

     -      Ala Ser Arg His His Val Glu Val - # Ile Thr Ala Cys Ile Glu Glu    

     Ala                                                                       

     #         60                                                              

     -      Leu Ala Glu Ala Gly Ile Thr Glu - # Glu Asp Val Thr Ala Val Ala    

     Val                                                                       

     #     80                                                                  

     -      Thr Tyr Gly Pro Gly Leu Val Gly - # Ala Leu Leu Val Gly Leu Ser    

     Ala                                                                       

     #   95                                                                    

     -      Ala Lys Ala Phe Ala Trp Ala His - # Gly Leu Pro Leu Ile Pro Val    

     Asn                                                                       

     #                110                                                      

     -      His Met Ala Gly His Leu Met Ala - # Ala Gln Ser Val Glu Pro Leu    

     Glu                                                                       

     #            125                                                          

     -      Phe Pro Leu Leu Ala Leu Leu Val - # Ser Gly Gly His Thr Glu Leu    

     Val                                                                       

     #        140                                                              

     -      Tyr Val Ser Glu Ala Gly Asp Tyr - # Lys Ile Val Gly Glu Thr Arg    

     Asp                                                                       

     #    160                                                                  

     -      Asp Ala Val Gly Glu Ala Tyr Asp - # Lys Val Gly Arg Val Met Gly    

     Leu                                                                       

     #   175                                                                   

     -      Thr Tyr Pro Ala Gly Arg Glu Ile - # Asp Glu Leu Ala His Gln Gly    

     Gln                                                                       

     #                190                                                      

     -      Asp Ile Tyr Asp Phe Pro Arg Ala - # Met Ile Lys Glu Asp Asn Leu    

     Glu                                                                       

     #            205                                                          

     -      Phe Ser Phe Ser Gly Leu Lys Ser - # Ala Phe Ile Asn Leu His His    

     Asn                                                                       

     #        220                                                              

     -      Ala Glu Gln Lys Gly Glu Ser Leu - # Ser Thr Glu Asp Leu Cys Ala    

     Ser                                                                       

     #    240                                                                  

     -      Phe Gln Ala Ala Val Met Asp Ile - # Leu Met Ala Lys Thr Lys Lys    

     Ala                                                                       

     #   255                                                                   

     -      Leu Glu Lys Tyr Pro Val Lys Ile - # Leu Val Val Ala Gly Gly Val    

     Ala                                                                       

     #                270                                                      

     -      Ala Asn Lys Gly Leu Arg Glu Arg - # Leu Ala Ala Glu Ile Thr Asp    

     Val                                                                       

     #            285                                                          

     -      Lys Val Ile Ile Pro Pro Leu Arg - # Leu Cys Gly Asp Asn Ala Gly    

     Met                                                                       

     #        300                                                              

     -      Ile Ala Tyr Ala Ser Val Ser Xaa - # Trp Asn Lys Glu Asn Phe Ala    

     Gly                                                                       

     #    320                                                                  

     -      Trp Asp Leu Asn Ala Lys Pro Ser - # Leu Ala Phe Asp Thr Met Glu    

     #   335                                                                   

     - (2) INFORMATION FOR SEQ ID NO: 53:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1000 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #SEQ ID NO: 53: SEQUENCE DESCRIPTION:                                     

     - GGTAGTTAAA GTTGGTATTA ACGGTTTCGG ACGTATCGGT CGTCTTGCTT TC - #CGTCGTAT   

       60                                                                      

     - CCAAAACGTA GAAGGTGTTG AAGTTACACG CATCAACGAC CTTACAGATC CA - #GTTATGCT   

      120                                                                      

     - TGCACACTTG TTGAAATACG ACACAACTCA AGGTCGTTTC GACGGTACTG TT - #GAAGTTAA   

      180                                                                      

     - AGAAGGTGGA TTTGAAGTTA ACGGTAAATT CATCAAAGTT TCTGCTGAAC GT - #GATCCAGA   

      240                                                                      

     - ACAAATCGAC TGGGCTACTG ACGGTGTAGA AATCGTTCTT GAAGCTACTG GT - #TTCTTTGC   

      300                                                                      

     - TAAGAAAGAA GCAGCTGAAA AACACCTTAA AGGTGGAGCT AAAAAAGTTG TT - #ATCACTGC   

      360                                                                      

     - TCCTGGTGGA AACGACGTTA AAACAGTTGT ATTCAACACT AACCACGACG TT - #CTTGACGG   

      420                                                                      

     - TACTGAAACA GTTATCTCAG GTGCTTCATG TACTACAAAC TGCTTGGCTC CA - #ATGGCTAA   

      480                                                                      

     - AGCTCTTCAA GACAACTTTG GTGTTGTTGA AGGATTGATG ACTACTATCC AC - #GCTTACAC   

      540                                                                      

     - TGGTGACCAA ATGATCCTTG ACGGACCACA CCGTGGTGGT GACCTTCGCC GT - #GCTCGCGC   

      600                                                                      

     - TGGTGCTGCA AACATCGTTC CTAACTCAAC TGGTGCTGCA AAAGCTATCG GT - #CTTGTAAT   

      660                                                                      

     - CCCAGAATTG AATGGTAAAC TTGACGGATC TGCACAACGC GTTCCAACTC CA - #ACTGGATC   

      720                                                                      

     - AGTTACTGAA TTGGTAGCAG TTCTTGAAAA GAACGTTACT GTTGATGAAG TG - #AACGCAGC   

      780                                                                      

     - TATGAAAGCA GCTTCAAACG AATCATACGG TTACACAGAA GATCCAATCG TA - #TCTTCAGA   

      840                                                                      

     - TATCGTAGGT ATGTCTTACG GTTCATTGTT TGACGCAACT CAAACTAAAG TT - #CTTGACGT   

      900                                                                      

     - TGACGGTAAA CAATTGGTTA AAGTTGTATC ATGGTACGAC AACGAAATGT CA - #TACACTGC   

      960                                                                      

     #  1000            CTTG GAATACTTCG CAAAAATTGC                             

     - (2) INFORMATION FOR SEQ ID NO:54:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 333 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     #ID NO:54:(xi) SEQUENCE DESCRIPTION: SEQ                                  

     -      Val Val Lys Val Gly Ile Asn Gly - # Phe Gly Arg Ile Gly Arg Leu    

     Ala                                                                       

     #   15                                                                    

     -      Phe Arg Arg Ile Gln Asn Val Glu - # Gly Val Glu Val Thr Arg Ile    

     Asn                                                                       

     #                 30                                                      

     -      Asp Leu Thr Asp Pro Val Met Leu - # Ala His Leu Leu Lys Tyr Asp    

     Thr                                                                       

     #             45                                                          

     -      Thr Gln Gly Arg Phe Asp Gly Thr - # Val Glu Val Lys Glu Gly Gly    

     Phe                                                                       

     #         60                                                              

     -      Glu Val Asn Gly Lys Phe Ile Lys - # Val Ser Ala Glu Arg Asp Pro    

     Glu                                                                       

     #     80                                                                  

     -      Gln Ile Asp Trp Ala Thr Asp Gly - # Val Glu Ile Val Leu Glu Ala    

     Thr                                                                       

     #   95                                                                    

     -      Gly Phe Phe Ala Lys Lys Glu Ala - # Ala Glu Lys His Leu Lys Gly    

     Gly                                                                       

     #                110                                                      

     -      Ala Lys Lys Val Val Ile Thr Ala - # Pro Gly Gly Asn Asp Val Lys    

     Thr                                                                       

     #            125                                                          

     -      Val Val Phe Asn Thr Asn His Asp - # Val Leu Asp Gly Thr Glu Thr    

     Val                                                                       

     #        140                                                              

     -      Ile Ser Gly Ala Ser Cys Thr Thr - # Asn Cys Leu Ala Pro Met Ala    

     Lys                                                                       

     #    160                                                                  

     -      Ala Leu Gln Asp Asn Phe Gly Val - # Val Glu Gly Leu Met Thr Thr    

     Ile                                                                       

     #   175                                                                   

     -      His Ala Tyr Thr Gly Asp Gln Met - # Ile Leu Asp Gly Pro His Arg    

     Gly                                                                       

     #                190                                                      

     -      Gly Asp Leu Arg Arg Ala Arg Ala - # Gly Ala Ala Asn Ile Val Pro    

     Asn                                                                       

     #            205                                                          

     -      Ser Thr Gly Ala Ala Lys Ala Ile - # Gly Leu Val Ile Pro Glu Leu    

     Asn                                                                       

     #        220                                                              

     -      Gly Lys Leu Asp Gly Ser Ala Gln - # Arg Val Pro Thr Pro Thr Gly    

     Ser                                                                       

     #    240                                                                  

     -      Val Thr Glu Leu Val Ala Val Leu - # Glu Lys Asn Val Thr Val Asp    

     Glu                                                                       

     #   255                                                                   

     -      Val Asn Ala Ala Met Lys Ala Ala - # Ser Asn Glu Ser Tyr Gly Tyr    

     Thr                                                                       

     #                270                                                      

     -      Glu Asp Pro Ile Val Ser Ser Asp - # Ile Val Gly Met Ser Tyr Gly    

     Ser                                                                       

     #            285                                                          

     -      Leu Phe Asp Ala Thr Gln Thr Lys - # Val Leu Asp Val Asp Gly Lys    

     Gln                                                                       

     #        300                                                              

     -      Leu Val Lys Val Val Ser Trp Tyr - # Asp Asn Glu Met Ser Tyr Thr    

     Ala                                                                       

     #    320                                                                  

     -      Gln Leu Val Arg Thr Leu Gly Ile - # Leu Arg Lys Asn Cys            

     #   330                                                                   

     - (2) INFORMATION FOR SEQ ID NO: 55:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 2389 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #SEQ ID NO: 55: SEQUENCE DESCRIPTION:                                     

     - TTCTTACGAG TTGGGACTGT ATCAAGCTAG AACGGTTAAG GAAAATAATC GT - #GTTTCCTA   

       60                                                                      

     - TATAGATGGA AAACAAGCGA CGCAAAAAAC GGAGAATTTG ACTCCTGATG AG - #GTTAGCAA   

      120                                                                      

     - GCGTGAAGGA ATCAATGCTG AGCAAATCGT CATCAAGATA ACAGACCAAG GC - #TATGTCAC   

      180                                                                      

     - TTCACATGGC GACCACTATC ATTATTACAA TGGTAAGGTT CCTTATGACG CT - #ATCATCAG   

      240                                                                      

     - TGAAGAATTA CTCATGAAAG ATCCAAACTA TAAGCTAAAA GATGAGGATA TT - #GTTAATGA   

      300                                                                      

     - GGTCAAGGGT GGATATGTTA TCAAGGTAGA TGGAAAATAC TATGTTTACC TT - #AAGGATGC   

      360                                                                      

     - TGCCCACGCG GATAACGTCC GTACAAAAGA GGAAATCAAT CGACAAAAAC AA - #GAGCATAG   

      420                                                                      

     - TCAACATCGT GAAGGTGGAA CTCCAAGAAA CGATGGTGCT GTTGCCTTGG CA - #CGTTCGCA   

      480                                                                      

     - AGGACGCTAT ACTACAGATG ATGGTTATAT CTTTAATGCT TCTGATATCA TA - #GAGGATAC   

      540                                                                      

     - TGGTGATGCT TATATCGTTC CTCATGGAGA TCATTACCAT TACATTCCTA AG - #AATGAGTT   

      600                                                                      

     - ATCAGCTAGC GAGTTGGCTG CTGCAGAAGC CTTCCTATCT GGTCGAGGAA AT - #CTGTCAAA   

      660                                                                      

     - TTCAAGAACC TATCGCCGAC AAAATAGCGA TAACACTTCA AGAACAAACT GG - #GTACCTTC   

      720                                                                      

     - TGTAAGCAAT CCAGGAACTA CAAATACTAA CACAAGCAAC AACAGCAACA CT - #AACAGTCA   

      780                                                                      

     - AGCAAGTCAA AGTAATGACA TTGATAGTCT CTTGAAACAG CTCTACAAAC TG - #CCTTTGAG   

      840                                                                      

     - TCAACGACAT GTAGAATCTG ATGGCCTTGT CTTTGATCCA GCACAAATCA CA - #AGTCGAAC   

      900                                                                      

     - AGCTAGAGGT GTTGCAGTGC CACACGGAGA TCATTACCAC TTCATCCCTT AC - #TCTCAAAT   

      960                                                                      

     - GTCTGAATTG GAAGAACGAA TCGCTCGTAT TATTCCCCTT CGTTATCGTT CA - #AACCATTG   

     1020                                                                      

     - GGTACCAGAT TCAAGGCCAG AACAACCAAG TCCACAACCG ACTCCGGAAC CT - #AGTCCAGG   

     1080                                                                      

     - CCCGCAACCT GCACCAAATC TTAAAATAGA CTCAAATTCT TCTTTGGTTA GT - #CAGCTGGT   

     1140                                                                      

     - ACGAAAAGTT GGGGAAGGAT ATGTATTCGA AGAAAAGGGC ATCTCTCGTT AT - #GTCTTTGC   

     1200                                                                      

     - GAAAGATTTA CCATCTGAAA CTGTTAAAAA TCTTGAAAGC AAGTTATCAA AA - #CAAGAGAG   

     1260                                                                      

     - TGTTTCACAC ACTTTAACTG CTAAAAAAGA AAATGTTGCT CCTCGTGACC AA - #GAATTTTA   

     1320                                                                      

     - TGATAAAGCA TATAATCTGT TAACTGAGGC TCATAAAGCC TTGTTTGNAA AT - #AAGGGTCG   

     1380                                                                      

     - TAATTCTGAT TTCCAAGCCT TAGACAAATT ATTAGAACGC TTGAATGATG AA - #TCGACTAA   

     1440                                                                      

     - TAAAGAAAAA TTGGTAGATG ATTTATTGGC ATTCCTAGCA CCAATTACCC AT - #CCAGAGCG   

     1500                                                                      

     - ACTTGGCAAA CCAAATTCTC AAATTGAGTA TACTGAAGAC GAAGTTCGTA TT - #GCTCAATT   

     1560                                                                      

     - AGCTGATAAG TATACAACGT CAGATGGTTA CATTTTTGAT GAACATGATA TA - #ATCAGTGA   

     1620                                                                      

     - TGAAGGAGAT GCATATGTAA CGCCTCATAT GGGCCATAGT CACTGGATTG GA - #AAAGATAG   

     1680                                                                      

     - CCTTTCTGAT AAGGAAAAAG TTGCAGCTCA AGCCTATACT AAAGAAAAAG GT - #ATCCTACC   

     1740                                                                      

     - TCCATCTCCA GACGCAGATG TTAAAGCAAA TCCAACTGGA GATAGTGCAG CA - #GCTATTTA   

     1800                                                                      

     - CAATCGTGTG AAAGGGGAAA AACGAATTCC ACTCGTTCGA CTTCCATATA TG - #GTTGAGCA   

     1860                                                                      

     - TACAGTTGAG GTTAAAAACG GTAATTTGAT TATTCCTCAT AAGGATCATT AC - #CATAATAT   

     1920                                                                      

     - TAAATTTGCT TGGTTTGATG ATCACACATA CAAAGCTCCA AATGGCTATA CC - #TTGGAAGA   

     1980                                                                      

     - TTTGTTTGCG ACGATTAAGT ACTACGTAGA ACACCCTGAC GAACGTCCAC AT - #TCTAATGA   

     2040                                                                      

     - TGGATGGGGC AATGCCAGTG AGCATGTGTT AGGCAAGAAA GACCACAGTG AA - #GATCCAAA   

     2100                                                                      

     - TAAGAACTTC AAAGCGGATG AAGAGCCAGT AGAGGAAACA CCTGCTGAGC CA - #GAAGTCCC   

     2160                                                                      

     - TCAAGTAGAG ACTGAAAAAG TAGAAGCCCA ACTCAAAGAA GCAGAAGTTT TG - #CTTGCGAA   

     2220                                                                      

     - AGTAACGGAT TCTAGTCTGA AAGCCAATGC AACAGAAACT CTAGCTGGTT TA - #CGAAATAA   

     2280                                                                      

     - TTTGACTCTT CAAATTATGG ATAACAATAG TATCATGGCA GAAGCAGAAA AA - #TTACTTGC   

     2340                                                                      

     #             2389TAATC CTTCATCTGT AAGTAAGGAA AAAATAAAC                   

     - (2) INFORMATION FOR SEQ ID NO:56:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 796 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     #ID NO:56:(xi) SEQUENCE DESCRIPTION: SEQ                                  

     -      Ser Tyr Glu Leu Gly Leu Tyr Gln - # Ala Arg Thr Val Lys Glu Asn    

     Asn                                                                       

     #   15                                                                    

     -      Arg Val Ser Tyr Ile Asp Gly Lys - # Gln Ala Thr Gln Lys Thr Glu    

     Asn                                                                       

     #                 30                                                      

     -      Leu Thr Pro Asp Glu Val Ser Lys - # Arg Glu Gly Ile Asn Ala Glu    

     Gln                                                                       

     #             45                                                          

     -      Ile Val Ile Lys Ile Thr Asp Gln - # Gly Tyr Val Thr Ser His Gly    

     Asp                                                                       

     #         60                                                              

     -      His Tyr His Tyr Tyr Asn Gly Lys - # Val Pro Tyr Asp Ala Ile Ile    

     Ser                                                                       

     #     80                                                                  

     -      Glu Glu Leu Leu Met Lys Asp Pro - # Asn Tyr Lys Leu Lys Asp Glu    

     Asp                                                                       

     #   95                                                                    

     -      Ile Val Asn Glu Val Lys Gly Gly - # Tyr Val Ile Lys Val Asp Gly    

     Lys                                                                       

     #                110                                                      

     -      Tyr Tyr Val Tyr Leu Lys Asp Ala - # Ala His Ala Asp Asn Val Arg    

     Thr                                                                       

     #            125                                                          

     -      Lys Glu Glu Ile Asn Arg Gln Lys - # Gln Glu His Ser Gln His Arg    

     Glu                                                                       

     #        140                                                              

     -      Gly Gly Thr Pro Arg Asn Asp Gly - # Ala Val Ala Leu Ala Arg Ser    

     Gln                                                                       

     #    160                                                                  

     -      Gly Arg Tyr Thr Thr Asp Asp Gly - # Tyr Ile Phe Asn Ala Ser Asp    

     Ile                                                                       

     #   175                                                                   

     -      Ile Glu Asp Thr Gly Asp Ala Tyr - # Ile Val Pro His Gly Asp His    

     Tyr                                                                       

     #                190                                                      

     -      His Tyr Ile Pro Lys Asn Glu Leu - # Ser Ala Ser Glu Leu Ala Ala    

     Ala                                                                       

     #            205                                                          

     -      Glu Ala Phe Leu Ser Gly Arg Gly - # Asn Leu Ser Asn Ser Arg Thr    

     Tyr                                                                       

     #        220                                                              

     -      Arg Arg Gln Asn Ser Asp Asn Thr - # Ser Arg Thr Asn Trp Val Pro    

     Ser                                                                       

     #    240                                                                  

     -      Val Ser Asn Pro Gly Thr Thr Asn - # Thr Asn Thr Ser Asn Asn Ser    

     Asn                                                                       

     #   255                                                                   

     -      Thr Asn Ser Gln Ala Ser Gln Ser - # Asn Asp Ile Asp Ser Leu Leu    

     Lys                                                                       

     #                270                                                      

     -      Gln Leu Tyr Lys Leu Pro Leu Ser - # Gln Arg His Val Glu Ser Asp    

     Gly                                                                       

     #            285                                                          

     -      Leu Val Phe Asp Pro Ala Gln Ile - # Thr Ser Arg Thr Ala Arg Gly    

     Val                                                                       

     #        300                                                              

     -      Ala Val Pro His Gly Asp His Tyr - # His Phe Ile Pro Tyr Ser Gln    

     Met                                                                       

     #    320                                                                  

     -      Ser Glu Leu Glu Glu Arg Ile Ala - # Arg Ile Ile Pro Leu Arg Tyr    

     Arg                                                                       

     #   335                                                                   

     -      Ser Asn His Trp Val Pro Asp Ser - # Arg Pro Glu Gln Pro Ser Pro    

     Gln                                                                       

     #                350                                                      

     -      Pro Thr Pro Glu Pro Ser Pro Gly - # Pro Gln Pro Ala Pro Asn Leu    

     Lys                                                                       

     #            365                                                          

     -      Ile Asp Ser Asn Ser Ser Leu Val - # Ser Gln Leu Val Arg Lys Val    

     Gly                                                                       

     #        380                                                              

     -      Glu Gly Tyr Val Phe Glu Glu Lys - # Gly Ile Ser Arg Tyr Val Phe    

     Ala                                                                       

     #    400                                                                  

     -      Lys Asp Leu Pro Ser Glu Thr Val - # Lys Asn Leu Glu Ser Lys Leu    

     Ser                                                                       

     #   415                                                                   

     -      Lys Gln Glu Ser Val Ser His Thr - # Leu Thr Ala Lys Lys Glu Asn    

     Val                                                                       

     #                430                                                      

     -      Ala Pro Arg Asp Gln Glu Phe Tyr - # Asp Lys Ala Tyr Asn Leu Leu    

     Thr                                                                       

     #            445                                                          

     -      Glu Ala His Lys Ala Leu Phe Xaa - # Asn Lys Gly Arg Asn Ser Asp    

     Phe                                                                       

     #        460                                                              

     -      Gln Ala Leu Asp Lys Leu Leu Glu - # Arg Leu Asn Asp Glu Ser Thr    

     Asn                                                                       

     #    480                                                                  

     -      Lys Glu Lys Leu Val Asp Asp Leu - # Leu Ala Phe Leu Ala Pro Ile    

     Thr                                                                       

     #   495                                                                   

     -      His Pro Glu Arg Leu Gly Lys Pro - # Asn Ser Gln Ile Glu Tyr Thr    

     Glu                                                                       

     #                510                                                      

     -      Asp Glu Val Arg Ile Ala Gln Leu - # Ala Asp Lys Tyr Thr Thr Ser    

     Asp                                                                       

     #            525                                                          

     -      Gly Tyr Ile Phe Asp Glu His Asp - # Ile Ile Ser Asp Glu Gly Asp    

     Ala                                                                       

     #        540                                                              

     -      Tyr Val Thr Pro His Met Gly His - # Ser His Trp Ile Gly Lys Asp    

     Ser                                                                       

     #    560                                                                  

     -      Leu Ser Asp Lys Glu Lys Val Ala - # Ala Gln Ala Tyr Thr Lys Glu    

     Lys                                                                       

     #   575                                                                   

     -      Gly Ile Leu Pro Pro Ser Pro Asp - # Ala Asp Val Lys Ala Asn Pro    

     Thr                                                                       

     #                590                                                      

     -      Gly Asp Ser Ala Ala Ala Ile Tyr - # Asn Arg Val Lys Gly Glu Lys    

     Arg                                                                       

     #            605                                                          

     -      Ile Pro Leu Val Arg Leu Pro Tyr - # Met Val Glu His Thr Val Glu    

     Val                                                                       

     #        620                                                              

     -      Lys Asn Gly Asn Leu Ile Ile Pro - # His Lys Asp His Tyr His Asn    

     Ile                                                                       

     #    640                                                                  

     -      Lys Phe Ala Trp Phe Asp Asp His - # Thr Tyr Lys Ala Pro Asn Gly    

     Tyr                                                                       

     #   655                                                                   

     -      Thr Leu Glu Asp Leu Phe Ala Thr - # Ile Lys Tyr Tyr Val Glu His    

     Pro                                                                       

     #                670                                                      

     -      Asp Glu Arg Pro His Ser Asn Asp - # Gly Trp Gly Asn Ala Ser Glu    

     His                                                                       

     #            685                                                          

     -      Val Leu Gly Lys Lys Asp His Ser - # Glu Asp Pro Asn Lys Asn Phe    

     Lys                                                                       

     #        700                                                              

     -      Ala Asp Glu Glu Pro Val Glu Glu - # Thr Pro Ala Glu Pro Glu Val    

     Pro                                                                       

     #    720                                                                  

     -      Gln Val Glu Thr Glu Lys Val Glu - # Ala Gln Leu Lys Glu Ala Glu    

     Val                                                                       

     #   735                                                                   

     -      Leu Leu Ala Lys Val Thr Asp Ser - # Ser Leu Lys Ala Asn Ala Thr    

     Glu                                                                       

     #                750                                                      

     -      Thr Leu Ala Gly Leu Arg Asn Asn - # Leu Thr Leu Gln Ile Met Asp    

     Asn                                                                       

     #            765                                                          

     -      Asn Ser Ile Met Ala Glu Ala Glu - # Lys Leu Leu Ala Leu Leu Lys    

     Gly                                                                       

     #        780                                                              

     -      Ser Asn Pro Ser Ser Val Ser Lys - # Glu Lys Ile Asn                

     #    795                                                                  

     - (2) INFORMATION FOR SEQ ID NO: 57:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1180 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #SEQ ID NO: 57: SEQUENCE DESCRIPTION:                                     

     - TACTGAGATG CATCATAATC TAGGAGCTGA AAAGCGTTCA GCAGTGGCTA CT - #ACTATCGA   

       60                                                                      

     - TAGTTTTAAG GAGCGAAGTC AAAAAGTCAG AGCACTATCT GATCCAAATG TG - #CGTTTTGT   

      120                                                                      

     - TCCCTTCTTT GGCTCTAGTG AATGGCTTCG TTTTGACGGT GCTCATTCTG CG - #GTATTAGC   

      180                                                                      

     - TGAGAAATAC AATCGTTCCT ACCGTCCTTA TCTTTTAGGA CAGGGGGGAG CT - #GCATCGCT   

      240                                                                      

     - TAACCAATAT TTTGGAATGC AACAGATGTT ACCACAGCTG GAGAATAAAC AA - #GTTGTGTA   

      300                                                                      

     - TGTTATCTCA CCTCAGTGGT TCAGTAAAAA TGGCTATGAT CCAGCAGCCT TC - #CAGCAGTA   

      360                                                                      

     - TTTTAATGGA GACCAGTTGA CTAGTTTTCT GAAACATCAA TCTGGGGATC AG - #GCTAGTCA   

      420                                                                      

     - ATATGCAGCG ACTCGCTTAC TGCAACAGTT CCCAAACGTA GCTATGAAGG AC - #CTGGTTCA   

      480                                                                      

     - GAAGTTGGCA AGTAAAGAAG AATTGTCGAC AGCAGACAAT GAAATGATTG AA - #TTATTGGC   

      540                                                                      

     - TCGTTTTAAT GAACGCCAAG CTTCCTTTTT TGGTCAGTTT TCGGTTAGAG GC - #TATGTTAA   

      600                                                                      

     - CTACGATAAG CATGTAGCTA AGTATTTAAA AATCTTGCCA GACCAGTTTT CT - #TATCAGGC   

      660                                                                      

     - AATAGAAGAT GTTGTCAAAG CAGATGCTGA AAAAAATACT TCCAATAATG AG - #ATGGGAAT   

      720                                                                      

     - GGAAAATTAT TTCTATAATG AGCAGATCAA GAAGGATTTG AAGAAATTAA AG - #GATTCTCA   

      780                                                                      

     - GAAAAGCTTT ACCTATCTCA AGTCGCCAGA GTATAATGNN TTGCAGTTGG TT - #TTAACACA   

      840                                                                      

     - GTTTTCTAAA TCTAAGGTAA ACCCGATTTT TATCATTCCA CCTGTTAATA AA - #AAATGGAT   

      900                                                                      

     - GNACTATGCT GGTCTACGAG AGGATATGTA CCAACAAACG GTGCAGAAGA TT - #CGCTACCA   

      960                                                                      

     - GTTAGAAAGT CAAGGTTTTA CCAATATAGC AGATTTTTCT AAGGACGGCG GG - #GAGCCTTT   

     1020                                                                      

     - CTTTATGAAG GACACCATTC ACCTTGGTTG GTTGGGTTGG TTGGCTTTTG AC - #AAGGCAGT   

     1080                                                                      

     - TGATCCTTTC CTATCCAATC CCACACCAGC TCCGACTTAC CATCTGAATG AG - #CGCTTTTT   

     1140                                                                      

     #  1180            ACTT ATGATGGAGA TGTCAAAGAA                             

     - (2) INFORMATION FOR SEQ ID NO:58:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 393 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:58:                            

     - Thr Glu Met His His Asn Leu Gly Ala Glu Ly - #s Arg Ser Ala Val Ala     

     #                15                                                       

     - Thr Thr Ile Asp Ser Phe Lys Glu Arg Ser Gl - #n Lys Val Arg Ala Leu     

     #            30                                                           

     - Ser Asp Pro Asn Val Arg Phe Val Pro Phe Ph - #e Gly Ser Ser Glu Trp     

     #        45                                                               

     - Leu Arg Phe Asp Gly Ala His Ser Ala Val Le - #u Ala Glu Lys Tyr Asn     

     #    60                                                                   

     - Arg Ser Tyr Arg Pro Tyr Leu Leu Gly Gln Gl - #y Gly Ala Ala Ser Leu     

     #80                                                                       

     - Asn Gln Tyr Phe Gly Met Gln Gln Met Leu Pr - #o Gln Leu Glu Asn Lys     

     #                95                                                       

     - Gln Val Val Tyr Val Ile Ser Pro Gln Trp Ph - #e Ser Lys Asn Gly Tyr     

     #           110                                                           

     - Asp Pro Ala Ala Phe Gln Gln Tyr Phe Asn Gl - #y Asp Gln Leu Thr Ser     

     #       125                                                               

     - Phe Leu Lys His Gln Ser Gly Asp Gln Ala Se - #r Gln Tyr Ala Ala Thr     

     #   140                                                                   

     - Arg Leu Leu Gln Gln Phe Pro Asn Val Ala Me - #t Lys Asp Leu Val Gln     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Lys Leu Ala Ser Lys Glu Glu Leu Ser Thr Al - #a Asp Asn Glu Met Ile     

     #               175                                                       

     - Glu Leu Leu Ala Arg Phe Asn Glu Arg Gln Al - #a Ser Phe Phe Gly Gln     

     #           190                                                           

     - Phe Ser Val Arg Gly Tyr Val Asn Tyr Asp Ly - #s His Val Ala Lys Tyr     

     #       205                                                               

     - Leu Lys Ile Leu Pro Asp Gln Phe Ser Tyr Gl - #n Ala Ile Glu Asp Val     

     #   220                                                                   

     - Val Lys Ala Asp Ala Glu Lys Asn Thr Ser As - #n Asn Glu Met Gly Met     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Glu Asn Tyr Phe Tyr Asn Glu Gln Ile Lys Ly - #s Asp Leu Lys Lys Leu     

     #               255                                                       

     - Lys Asp Ser Gln Lys Ser Phe Thr Tyr Leu Ly - #s Ser Pro Glu Tyr Asn     

     #           270                                                           

     - Xaa Leu Gln Leu Val Leu Thr Gln Phe Ser Ly - #s Ser Lys Val Asn Pro     

     #       285                                                               

     - Ile Phe Ile Ile Pro Pro Val Asn Lys Lys Tr - #p Met Xaa Tyr Ala Gly     

     #   300                                                                   

     - Leu Arg Glu Asp Met Tyr Gln Gln Thr Val Gl - #n Lys Ile Arg Tyr Gln     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Leu Glu Ser Gln Gly Phe Thr Asn Ile Ala As - #p Phe Ser Lys Asp Gly     

     #               335                                                       

     - Gly Glu Pro Phe Phe Met Lys Asp Thr Ile Hi - #s Leu Gly Trp Leu Gly     

     #           350                                                           

     - Trp Leu Ala Phe Asp Lys Ala Val Asp Pro Ph - #e Leu Ser Asn Pro Thr     

     #       365                                                               

     - Pro Ala Pro Thr Tyr His Leu Asn Glu Arg Ph - #e Phe Ser Lys Asp Trp     

     #   380                                                                   

     - Ala Thr Tyr Asp Gly Asp Val Lys Glu                                     

     385                 3 - #90                                               

     - (2) INFORMATION FOR SEQ ID NO: 59:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1423 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #59:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - GGTTTTGAGA AAGTATTTGC AGGGGGCCCT GATTGAGTCG ATTGAGCAAG TG - #GAAAATGA   

       60                                                                      

     - CCGTATTGTG GAAATTACAG TTTCCAATAA AAACGAGATT GGAGACCATA TC - #CAGGCTAC   

      120                                                                      

     - CTTGATTATC GAAATTATGG GGAAACACAG TAATATTCTA CTGGTCGATA AA - #AGCAGTCA   

      180                                                                      

     - TAAAATCCTC GAAGTTATCA AACACGTCGG CTTTTCACAA AATAGCTACC GC - #ACCTTACT   

      240                                                                      

     - TCCAGGATCG ACCTATATCG CTCCGCCAAG TACAAAATCT CTCAATCCTT TT - #ACTATCAA   

      300                                                                      

     - GGATGAAAAG CTCTTTGAAA TCCTGCAAAC CCAAGAACTA ACAGCAAAAA AT - #CTTCAAAG   

      360                                                                      

     - CCTCTTTCAA GGTCTGGGAC GCGATACGGC AAATGAATTG GAAAGGATAC TG - #GTTAGTGA   

      420                                                                      

     - AAAACTTTCC GCTTTCCGAA ATTTTTTCAA TCAAGAAACC AAGCCATGCT TG - #ACTGAGAC   

      480                                                                      

     - TTCCTTCAGT CCAGTTCCTT TTGCAAATCA GGTGGGAGAG CCTTTTGCAA AT - #CTTTCTGA   

      540                                                                      

     - TTTGTTGGAC ACCTACTATA AGGATAAGGC TGAGCGCGAC CGCGTCAAAC AG - #CAGGCCAG   

      600                                                                      

     - TGAACTGATT CGTCGTGTTG AAAATGAACT TCAGAAAAAC CGACACAAAC TC - #AAAAAACA   

      660                                                                      

     - GGAAAAAGAG TTACTGGCGA CAGACAACGC TGAAGAATTT CGTCAAAAAG GA - #GAATTGCT   

      720                                                                      

     - GACAACCTTC CTCCACCAAG TGCCTAACGA CCAAGACCAG GTTATCCTAG AC - #AACTACTA   

      780                                                                      

     - TACCAACCAA CCTATCATGA TTGCGCTTGA TAAGGCTCTG ACTCCCAACC AG - #AATGCCCA   

      840                                                                      

     - ACGCTATTTT AAACGGTATC AGAAACTCAA AGAAGCTGTC AAATACTTGA CT - #GATTTGAT   

      900                                                                      

     - TGAAGAAACC AAAGCCACTA TTCTCTATCT GGAAAGTGTA GAAACCGTCC TC - #AACCAAGC   

      960                                                                      

     - TGGACTGGAA GAAATCGCTG AAATCCGTGA AGAATTGATT CAAACAGGTT TT - #ATCCGCAG   

     1020                                                                      

     - AAGACAACGG GAGAAAATCC AGAAACGCAA AAAACTAGAA CAATATCTAG CA - #AGCGATGG   

     1080                                                                      

     - CAAAACCATC ATCTATGTCG GACGAAACAA TCTTCAAAAT GAGGAATTGA CC - #TTTAAAAT   

     1140                                                                      

     - GGCCCGCAAG GAGGAACTTT GGTTCCATGC TAAGGACATT CCTGGAAGCC AT - #GTTGTCAT   

     1200                                                                      

     - CTCAGGAAAT CTTGACCCAT CTGATGCAGT CAAGACAGAC GCAGCAGAGT TA - #GCTGCCTA   

     1260                                                                      

     - CTTCTCTCAA GGGCGCCTGT CGAATCTGGT GCAGGTAGAT ATGATTGAAG TC - #AAAAAACT   

     1320                                                                      

     - CAATAAACCA ACTGGTGGAA AACCCGGCTT TGTCACTTAC ACAGGACAAA AG - #ACCCTCCG   

     1380                                                                      

     #                 142 - #3ATTGCATC CATGAAAAAA TCC                         

     - (2) INFORMATION FOR SEQ ID NO:60:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 474 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:60:                            

     - Val Leu Arg Lys Tyr Leu Gln Gly Ala Leu Il - #e Glu Ser Ile Glu Gln     

     #                15                                                       

     - Val Glu Asn Asp Arg Ile Val Glu Ile Thr Va - #l Ser Asn Lys Asn Glu     

     #            30                                                           

     - Ile Gly Asp His Ile Gln Ala Thr Leu Ile Il - #e Glu Ile Met Gly Lys     

     #        45                                                               

     - His Ser Asn Ile Leu Leu Val Asp Lys Ser Se - #r His Lys Ile Leu Glu     

     #    60                                                                   

     - Val Ile Lys His Val Gly Phe Ser Gln Asn Se - #r Tyr Arg Thr Leu Leu     

     #80                                                                       

     - Pro Gly Ser Thr Tyr Ile Ala Pro Pro Ser Th - #r Lys Ser Leu Asn Pro     

     #                95                                                       

     - Phe Thr Ile Lys Asp Glu Lys Leu Phe Glu Il - #e Leu Gln Thr Gln Glu     

     #           110                                                           

     - Leu Thr Ala Lys Asn Leu Gln Ser Leu Phe Gl - #n Gly Leu Gly Arg Asp     

     #       125                                                               

     - Thr Ala Asn Glu Leu Glu Arg Ile Leu Val Se - #r Glu Lys Leu Ser Ala     

     #   140                                                                   

     - Phe Arg Asn Phe Phe Asn Gln Glu Thr Lys Pr - #o Cys Leu Thr Glu Thr     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Ser Phe Ser Pro Val Pro Phe Ala Asn Gln Va - #l Gly Glu Pro Phe Ala     

     #               175                                                       

     - Asn Leu Ser Asp Leu Leu Asp Thr Tyr Tyr Ly - #s Asp Lys Ala Glu Arg     

     #           190                                                           

     - Asp Arg Val Lys Gln Gln Ala Ser Glu Leu Il - #e Arg Arg Val Glu Asn     

     #       205                                                               

     - Glu Leu Gln Lys Asn Arg His Lys Leu Lys Ly - #s Gln Glu Lys Glu Leu     

     #   220                                                                   

     - Leu Ala Thr Asp Asn Ala Glu Glu Phe Arg Gl - #n Lys Gly Glu Leu Leu     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Thr Thr Phe Leu His Gln Val Pro Asn Asp Gl - #n Asp Gln Val Ile Leu     

     #               255                                                       

     - Asp Asn Tyr Tyr Thr Asn Gln Pro Ile Met Il - #e Ala Leu Asp Lys Ala     

     #           270                                                           

     - Leu Thr Pro Asn Gln Asn Ala Gln Arg Tyr Ph - #e Lys Arg Tyr Gln Lys     

     #       285                                                               

     - Leu Lys Glu Ala Val Lys Tyr Leu Thr Asp Le - #u Ile Glu Glu Thr Lys     

     #   300                                                                   

     - Ala Thr Ile Leu Tyr Leu Glu Ser Val Glu Th - #r Val Leu Asn Gln Ala     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Gly Leu Glu Glu Ile Ala Glu Ile Arg Glu Gl - #u Leu Ile Gln Thr Gly     

     #               335                                                       

     - Phe Ile Arg Arg Arg Gln Arg Glu Lys Ile Gl - #n Lys Arg Lys Lys Leu     

     #           350                                                           

     - Glu Gln Tyr Leu Ala Ser Asp Gly Lys Thr Il - #e Ile Tyr Val Gly Arg     

     #       365                                                               

     - Asn Asn Leu Gln Asn Glu Glu Leu Thr Phe Ly - #s Met Ala Arg Lys Glu     

     #   380                                                                   

     - Glu Leu Trp Phe His Ala Lys Asp Ile Pro Gl - #y Ser His Val Val Ile     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Ser Gly Asn Leu Asp Pro Ser Asp Ala Val Ly - #s Thr Asp Ala Ala Glu     

     #               415                                                       

     - Leu Ala Ala Tyr Phe Ser Gln Gly Arg Leu Se - #r Asn Leu Val Gln Val     

     #           430                                                           

     - Asp Met Ile Glu Val Lys Lys Leu Asn Lys Pr - #o Thr Gly Gly Lys Pro     

     #       445                                                               

     - Gly Phe Val Thr Tyr Thr Gly Gln Lys Thr Le - #u Arg Val Thr Pro Asp     

     #   460                                                                   

     - Ser Lys Lys Ile Ala Ser Met Lys Lys Ser                                 

     465                 4 - #70                                               

     - (2) INFORMATION FOR SEQ ID NO: 61:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 544 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #61:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - GACAACATTT ACTATCCATA CAGTAGAGTC AGCACCAGCA GAAGTGAAAG AA - #ATTCTTGA   

       60                                                                      

     - AACAGTAGAA AAAGACAACA ATGGCTATAT TCCCAACCTA ATCGGTCTCT TG - #GCCAATGC   

      120                                                                      

     - CCCGACTGTT TTAGAAGCCT ACCAAATTGT CTCATCTATC CACCGTCGCA AC - #AGCCTGAC   

      180                                                                      

     - ACCCGTTGAG CGTGAAGTGG TGCAAATCAC GGCAGCCGTG ACCAATGGTT GT - #GCCTTCTG   

      240                                                                      

     - TGTCGCAGGT CACACAGCCT TTTCCATCAA ACAAATCCAG ATGAATGATG AC - #TTGATTCA   

      300                                                                      

     - AGCTCTTCGC AATCGTACTC CAATTGAAAC AGATCCTAAA TTGGATACCC TA - #GCTAAGTT   

      360                                                                      

     - TACCTTGGCA GTTATCAATA CCAAGGGTCG TGTAGGAGAT GAAGCCTTGT CT - #GAGTTTTT   

      420                                                                      

     - AGAAGCTGGC TACACTCAAC AAAATGCCTT GGATGTGGTT TTTGGTGTCA GC - #CTAGCAAT   

      480                                                                      

     - CCTCTGTAAC TATGCCAACA ACTTAGCTAA TACACCAATT AATCCAGAAT TG - #CAACCTTA   

      540                                                                      

     #            544                                                          

     - (2) INFORMATION FOR SEQ ID NO:62:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 181 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:62:                            

     - Thr Thr Phe Thr Ile His Thr Val Glu Ser Al - #a Pro Ala Glu Val Lys     

     #                15                                                       

     - Glu Ile Leu Glu Thr Val Glu Lys Asp Asn As - #n Gly Tyr Ile Pro Asn     

     #            30                                                           

     - Leu Ile Gly Leu Leu Ala Asn Ala Pro Thr Va - #l Leu Glu Ala Tyr Gln     

     #        45                                                               

     - Ile Val Ser Ser Ile His Arg Arg Asn Ser Le - #u Thr Pro Val Glu Arg     

     #    60                                                                   

     - Glu Val Val Gln Ile Thr Ala Ala Val Thr As - #n Gly Cys Ala Phe Cys     

     #80                                                                       

     - Val Ala Gly His Thr Ala Phe Ser Ile Lys Gl - #n Ile Gln Met Asn Asp     

     #                95                                                       

     - Asp Leu Ile Gln Ala Leu Arg Asn Arg Thr Pr - #o Ile Glu Thr Asp Pro     

     #           110                                                           

     - Lys Leu Asp Thr Leu Ala Lys Phe Thr Leu Al - #a Val Ile Asn Thr Lys     

     #       125                                                               

     - Gly Arg Val Gly Asp Glu Ala Leu Ser Glu Ph - #e Leu Glu Ala Gly Tyr     

     #   140                                                                   

     - Thr Gln Gln Asn Ala Leu Asp Val Val Phe Gl - #y Val Ser Leu Ala Ile     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Leu Cys Asn Tyr Ala Asn Asn Leu Ala Asn Th - #r Pro Ile Asn Pro Glu     

     #               175                                                       

     - Leu Gln Pro Tyr Ala                                                     

                 180                                                           

     - (2) INFORMATION FOR SEQ ID NO: 63:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 811 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #63:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - GGCTAAGGAA AGAGTGGATG TACTAGCTTA TAAACAGGGG TTGTTTGAAA CG - #AGAGAGCA   

       60                                                                      

     - GGCCAAGCGA GGTGTGATGG CTGGCCTAGT CGTAGCAGTC CTTAATGGAG AA - #CGGTTTGA   

      120                                                                      

     - CAAGCCAGGA GAGAAAATTC CAGATGACAC CGAATTAAAA CTCAAGGGGG AG - #AAACTCAA   

      180                                                                      

     - GTATGTCAGC CGTGGTGGTT TGAAACTGGA AAAGGCCTTG CAGGTCTTTG AT - #TTGTCGGT   

      240                                                                      

     - GGATGGCGCG ACTACGATTG ATATCGGGGC CTCTACTGGA GGTTTTACCG AT - #GTCATGCT   

      300                                                                      

     - ACAGAATAGT GCCAAGTTGG TCTTTGCAGT CGATGTTGGT ACCAATCAGT TG - #GCTTGGAA   

      360                                                                      

     - ATTACGCCAA GACCCACGAG TTGTCAGCAT GGAGCAGTTC AATTTCCGCT AT - #GCTGAAAA   

      420                                                                      

     - GACTGATTTC GAGCAGGAGC CGAGCTTTGC CAGTATTGAT GTGAGTTTCA TT - #TCCCTTAG   

      480                                                                      

     - TCTGATTTTG CCAGCCTTGC ACCGTGTCTT GGCTGATCAA GGTCAGGTGG TA - #GCACTTGT   

      540                                                                      

     - CAAACCTCAG TTTGAGGCAG GACGTGAGCA GATTGGGAAA AATGGAATTA TT - #CGAGATGC   

      600                                                                      

     - TAAGGTTCAT CAGAATGTCC TTGAATCTGT AACAGCTATG GCAGTAGAGG TA - #GGTTTTTC   

      660                                                                      

     - AGTCCTTGGC TTGGACTTTT CTCCCATCCA AGGTGGACAT GGAAATATTG AA - #TTTTTAGC   

      720                                                                      

     - GTATTTGAAA AAAGAAAAGT CAGCAAGCAA TCAGATTCTT GCTGAGATTA AA - #GAAGCAGT   

      780                                                                      

     #         811      CAAT TTAAAAATGA A                                      

     - (2) INFORMATION FOR SEQ ID NO:64:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 270 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:64:                            

     - Ala Lys Glu Arg Val Asp Val Leu Ala Tyr Ly - #s Gln Gly Leu Phe Glu     

     #                15                                                       

     - Thr Arg Glu Gln Ala Lys Arg Gly Val Met Al - #a Gly Leu Val Val Ala     

     #            30                                                           

     - Val Leu Asn Gly Glu Arg Phe Asp Lys Pro Gl - #y Glu Lys Ile Pro Asp     

     #        45                                                               

     - Asp Thr Glu Leu Lys Leu Lys Gly Glu Lys Le - #u Lys Tyr Val Ser Arg     

     #    60                                                                   

     - Gly Gly Leu Lys Leu Glu Lys Ala Leu Gln Va - #l Phe Asp Leu Ser Val     

     #80                                                                       

     - Asp Gly Ala Thr Thr Ile Asp Ile Gly Ala Se - #r Thr Gly Gly Phe Thr     

     #                95                                                       

     - Asp Val Met Leu Gln Asn Ser Ala Lys Leu Va - #l Phe Ala Val Asp Val     

     #           110                                                           

     - Gly Thr Asn Gln Leu Ala Trp Lys Leu Arg Gl - #n Asp Pro Arg Val Val     

     #       125                                                               

     - Ser Met Glu Gln Phe Asn Phe Arg Tyr Ala Gl - #u Lys Thr Asp Phe Glu     

     #   140                                                                   

     - Gln Glu Pro Ser Phe Ala Ser Ile Asp Val Se - #r Phe Ile Ser Leu Ser     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Leu Ile Leu Pro Ala Leu His Arg Val Leu Al - #a Asp Gln Gly Gln Val     

     #               175                                                       

     - Val Ala Leu Val Lys Pro Gln Phe Glu Ala Gl - #y Arg Glu Gln Ile Gly     

     #           190                                                           

     - Lys Asn Gly Ile Ile Arg Asp Ala Lys Val Hi - #s Gln Asn Val Leu Glu     

     #       205                                                               

     - Ser Val Thr Ala Met Ala Val Glu Val Gly Ph - #e Ser Val Leu Gly Leu     

     #   220                                                                   

     - Asp Phe Ser Pro Ile Gln Gly Gly His Gly As - #n Ile Glu Phe Leu Ala     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Tyr Leu Lys Lys Glu Lys Ser Ala Ser Asn Gl - #n Ile Leu Ala Glu Ile     

     #               255                                                       

     - Lys Glu Ala Val Glu Arg Ala His Ser Gln Ph - #e Lys Asn Glu             

     #           270                                                           

     - (2) INFORMATION FOR SEQ ID NO: 65:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 2290 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     -      (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 65:                      

     - TTGTTCCTAT GAACTTGGTC GTCACCAAGC TGGTCAGGTT AAGAAAGAGT CT - #AATCGAGT   

       60                                                                      

     - TTCTTATATA GATGGTGATC AGGCTGGTCA AAAGGCAGAA AACTTGACAC CA - #GATGAAGT   

      120                                                                      

     - CAGTAAGAGG GAGGGGATCA ACGCCGAACA AATNGTNATC AAGATTACGG AT - #CAAGGTTA   

      180                                                                      

     - TGTGACCTCT CATGGAGACC ATTATCATTA CTATAATGGC AAGGTTCCTT AT - #GATGCCAT   

      240                                                                      

     - CATCAGTGAA GAGCTCCTCA TGAAAGATCC GAATTATCAG TTGAAGGATT CA - #GACATTGT   

      300                                                                      

     - CAATGAAATC AAGGGTGGTT ATGTCATTAA GGTAAACGGT AAATACTATG TN - #TACCTTAA   

      360                                                                      

     - GGATGCAGCT CATGCGGATA ATATTCGGAC AAAAGAAGAG ATTAAACGTC AG - #AAGCAGGA   

      420                                                                      

     - ACGCAGTCAT AATCATAACT CAAGAGCAGA TAATGCTGTT GCTGCAGCCA GA - #GCCCAAGG   

      480                                                                      

     - ACGTTATACA ACGGATGATG GGTATATCTT CAATGCATCT GATATCATTG AG - #GACACGGG   

      540                                                                      

     - TGATGCTTAT ATCGTTCCTC ACGGCGACCA TTACCATTAC ATTCCTAAGA AT - #GAGTTATC   

      600                                                                      

     - AGCTAGCGAG TTAGCTGCTG CAGAAGCCTA TTGGAATGGG AAGCAGGGAT CT - #CGTCCTTC   

      660                                                                      

     - TTCAAGTTCT AGTTATAATG CAAATCCAGC TCAACCAAGA TTGTCAGAGA AC - #CACAATCT   

      720                                                                      

     - GACTGTCACT CCAACTTATC ATCAAAATCA AGGGGAAAAC ATTTCAAGCC TT - #TTACGTGA   

      780                                                                      

     - ATTGTATGCT AAACCCTTAT CAGAACGCCA TGTGGAATCT GATGGCCTTA TT - #TTCGACCC   

      840                                                                      

     - AGCGCAAATC ACAAGTCGAA CCGCCAGAGG TGTAGCTGTC CCTCATGGTA AC - #CATTACCA   

      900                                                                      

     - CTTTATCCCT TATGAACAAA TGTCTGAATT GGAAAAACGA ATTGCTCGTA TT - #ATTCCCCT   

      960                                                                      

     - TCGTTATCGT TCAAACCATT GGGTACCAGA TTCAAGACCA GAACAACCAA GT - #CCACAATC   

     1020                                                                      

     - GACTCCGGAA CCTAGTCCAA GTCCGCAACC TGCACCAAAT CCTCAACCAG CT - #CCAAGCAA   

     1080                                                                      

     - TCCAATTGAT GAGAAATTGG TCAAAGAAGC TGTTCGAAAA GTAGGCGATG GT - #TATGTCTT   

     1140                                                                      

     - TGAGGAGAAT GGAGTTTCTC GTTATATCCC AGCCAAGGAT CTTTCAGCAG AA - #ACAGCAGC   

     1200                                                                      

     - AGGCATTGAT AGCAAACTGG CCAAGCAGGA AAGTTTATCT CATAAGCTAG GA - #GCTAAGAA   

     1260                                                                      

     - AACTGACCTC CCATCTAGTG ATCGAGAATT TTACAATAAG GCTTATGACT TA - #CTAGCAAG   

     1320                                                                      

     - AATTCACCAA GATTTACTTG ATAATAAAGG TCGACAAGTT GATTTTGAGG CT - #TTGGATAA   

     1380                                                                      

     - CCTGTTGGAA CGACTCAAGG ATGTCNCAAG TGATAAAGTC AAGTTAGTGG AN - #GATATTCT   

     1440                                                                      

     - TGCCTTCTTA GCTCCGATTC GTCATCCAGA ACGTTTAGGA AAACCAAATG CG - #CAAATTAC   

     1500                                                                      

     - CTACACTGAT GATGAGATTC AAGTAGCCAA GTTGGCAGGC AAGTACACAA CA - #GAAGACGG   

     1560                                                                      

     - TTATATCTTT GATCCTCGTG ATATAACCAG TGATGAGGGG GATGCCTATG TA - #ACTCCACA   

     1620                                                                      

     - TATGACCCAT AGCCACTGGA TTAAAAAAGA TAGTTTGTCT GAAGCTGAGA GA - #GCGGCAGC   

     1680                                                                      

     - CCAGGCTTAT GCTAAAGAGA AAGGTTTGAC CCCTCCTTCG ACAGACCATC AG - #GATTCAGG   

     1740                                                                      

     - AAATACTGAG GCAAAAGGAG CAGAAGCTAT CTACAACCGC GTGAAAGCAG CT - #AAGAAGGT   

     1800                                                                      

     - GCCACTTGAT CGTATGCCTT ACAATCTTCA ATATACTGTA GAAGTCAAAA AC - #GGTAGTTT   

     1860                                                                      

     - AATCATACCT CATTATGACC ATTACCATAA CATCAAATTT GAGTGGTTTG AC - #GAAGGCCT   

     1920                                                                      

     - TTATGAGGCA CCTAAGGGGT ATACTCTTGA GGATCTTTTG GCGACTGTCA AG - #TACTATGT   

     1980                                                                      

     - CGAACATCCA AACGAACGTC CGCATTCAGA TAATGGTTTT GGTAACGCTA GC - #GACCATGT   

     2040                                                                      

     - TCAAAGAAAC AAAAATGGTC AAGCTGATAC CAATCAAACG GAAAAACCAA GC - #GAGGAGAA   

     2100                                                                      

     - ACCTCAGACA GAAAAACCTG AGGAAGAAAC CCCTCGAGAA GAGAAACCGC AA - #AGCGAGAA   

     2160                                                                      

     - ACCAGAGTCT CCAAAACCAA CAGAGGAACC AGAAGAATCA CCAGAGGAAT CA - #GAAGAACC   

     2220                                                                      

     - TCAGGTCGAG ACTGAAAAGG TTGAAGAAAA ACTGAGAGAG GCTGAAGATT TA - #CTTGGAAA   

     2280                                                                      

     #      2290                                                               

     - (2) INFORMATION FOR SEQ ID NO:66:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 763 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:66:                            

     - Cys Ser Tyr Glu Leu Gly Arg His Gln Ala Gl - #y Gln Val Lys Lys Glu     

     #                15                                                       

     - Ser Asn Arg Val Ser Tyr Ile Asp Gly Asp Gl - #n Ala Gly Gln Lys Ala     

     #            30                                                           

     - Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Ar - #g Glu Gly Ile Asn Ala     

     #        45                                                               

     - Glu Gln Xaa Val Ile Lys Ile Thr Asp Gln Gl - #y Tyr Val Thr Ser His     

     #    60                                                                   

     - Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Va - #l Pro Tyr Asp Ala Ile     

     #80                                                                       

     - Ile Ser Glu Glu Leu Leu Met Lys Asp Pro As - #n Tyr Gln Leu Lys Asp     

     #                95                                                       

     - Ser Asp Ile Val Asn Glu Ile Lys Gly Gly Ty - #r Val Ile Lys Val Asn     

     #           110                                                           

     - Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Al - #a His Ala Asp Asn Ile     

     #       125                                                               

     - Arg Thr Lys Glu Glu Ile Lys Arg Gln Lys Gl - #n Glu Arg Ser His Asn     

     #   140                                                                   

     - His Asn Ser Arg Ala Asp Asn Ala Val Ala Al - #a Ala Arg Ala Gln Gly     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe As - #n Ala Ser Asp Ile Ile     

     #               175                                                       

     - Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro Hi - #s Gly Asp His Tyr His     

     #           190                                                           

     - Tyr Ile Pro Lys Asn Glu Leu Ser Ala Ser Gl - #u Leu Ala Ala Ala Glu     

     #       205                                                               

     - Ala Tyr Trp Asn Gly Lys Gln Gly Ser Arg Pr - #o Ser Ser Ser Ser Ser     

     #   220                                                                   

     - Tyr Asn Ala Asn Pro Ala Gln Pro Arg Leu Se - #r Glu Asn His Asn Leu     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Thr Val Thr Pro Thr Tyr His Gln Asn Gln Gl - #y Glu Asn Ile Ser Ser     

     #               255                                                       

     - Leu Leu Arg Glu Leu Tyr Ala Lys Pro Leu Se - #r Glu Arg His Val Glu     

     #           270                                                           

     - Ser Asp Gly Leu Ile Phe Asp Pro Ala Gln Il - #e Thr Ser Arg Thr Ala     

     #       285                                                               

     - Arg Gly Val Ala Val Pro His Gly Asn His Ty - #r His Phe Ile Pro Tyr     

     #   300                                                                   

     - Glu Gln Met Ser Glu Leu Glu Lys Arg Ile Al - #a Arg Ile Ile Pro Leu     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Arg Tyr Arg Ser Asn His Trp Val Pro Asp Se - #r Arg Pro Glu Gln Pro     

     #               335                                                       

     - Ser Pro Gln Ser Thr Pro Glu Pro Ser Pro Se - #r Pro Gln Pro Ala Pro     

     #           350                                                           

     - Asn Pro Gln Pro Ala Pro Ser Asn Pro Ile As - #p Glu Lys Leu Val Lys     

     #       365                                                               

     - Glu Ala Val Arg Lys Val Gly Asp Gly Tyr Va - #l Phe Glu Glu Asn Gly     

     #   380                                                                   

     - Val Ser Arg Tyr Ile Pro Ala Lys Asp Leu Se - #r Ala Glu Thr Ala Ala     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Gly Ile Asp Ser Lys Leu Ala Lys Gln Glu Se - #r Leu Ser His Lys Leu     

     #               415                                                       

     - Gly Ala Lys Lys Thr Asp Leu Pro Ser Ser As - #p Arg Glu Phe Tyr Asn     

     #           430                                                           

     - Lys Ala Tyr Asp Leu Leu Ala Arg Ile His Gl - #n Asp Leu Leu Asp Asn     

     #       445                                                               

     - Lys Gly Arg Gln Val Asp Phe Glu Ala Leu As - #p Asn Leu Leu Glu Arg     

     #   460                                                                   

     - Leu Lys Asp Val Xaa Ser Asp Lys Val Lys Le - #u Val Xaa Asp Ile Leu     

     465                 4 - #70                 4 - #75                 4 -   

     #80                                                                       

     - Ala Phe Leu Ala Pro Ile Arg His Pro Glu Ar - #g Leu Gly Lys Pro Asn     

     #               495                                                       

     - Ala Gln Ile Thr Tyr Thr Asp Asp Glu Ile Gl - #n Val Ala Lys Leu Ala     

     #           510                                                           

     - Gly Lys Tyr Thr Thr Glu Asp Gly Tyr Ile Ph - #e Asp Pro Arg Asp Ile     

     #       525                                                               

     - Thr Ser Asp Glu Gly Asp Ala Tyr Val Thr Pr - #o His Met Thr His Ser     

     #   540                                                                   

     - His Trp Ile Lys Lys Asp Ser Leu Ser Glu Al - #a Glu Arg Ala Ala Ala     

     545                 5 - #50                 5 - #55                 5 -   

     #60                                                                       

     - Gln Ala Tyr Ala Lys Glu Lys Gly Leu Thr Pr - #o Pro Ser Thr Asp His     

     #               575                                                       

     - Gln Asp Ser Gly Asn Thr Glu Ala Lys Gly Al - #a Glu Ala Ile Tyr Asn     

     #           590                                                           

     - Arg Val Lys Ala Ala Lys Lys Val Pro Leu As - #p Arg Met Pro Tyr Asn     

     #       605                                                               

     - Leu Gln Tyr Thr Val Glu Val Lys Asn Gly Se - #r Leu Ile Ile Pro His     

     #   620                                                                   

     - Tyr Asp His Tyr His Asn Ile Lys Phe Glu Tr - #p Phe Asp Glu Gly Leu     

     625                 6 - #30                 6 - #35                 6 -   

     #40                                                                       

     - Tyr Glu Ala Pro Lys Gly Tyr Thr Leu Glu As - #p Leu Leu Ala Thr Val     

     #               655                                                       

     - Lys Tyr Tyr Val Glu His Pro Asn Glu Arg Pr - #o His Ser Asp Asn Gly     

     #           670                                                           

     - Phe Gly Asn Ala Ser Asp His Val Gln Arg As - #n Lys Asn Gly Gln Ala     

     #       685                                                               

     - Asp Thr Asn Gln Thr Glu Lys Pro Ser Glu Gl - #u Lys Pro Gln Thr Glu     

     #   700                                                                   

     - Lys Pro Glu Glu Glu Thr Pro Arg Glu Glu Ly - #s Pro Gln Ser Glu Lys     

     705                 7 - #10                 7 - #15                 7 -   

     #20                                                                       

     - Pro Glu Ser Pro Lys Pro Thr Glu Glu Pro Gl - #u Glu Ser Pro Glu Glu     

     #               735                                                       

     - Ser Glu Glu Pro Gln Val Glu Thr Glu Lys Va - #l Glu Glu Lys Leu Arg     

     #           750                                                           

     - Glu Ala Glu Asp Leu Leu Gly Lys Ile Gln As - #p                         

     #       760                                                               

     - (2) INFORMATION FOR SEQ ID NO: 67:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 352 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #67:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TTATAAGGGT GAATTAGAAA AAGGATACCA ATTTGATGGT TGGGAAATTT CT - #GGTTTCGA   

       60                                                                      

     - AGGTAAAAAA GACGCTGGCT ATGTTATTAA TCTATCAAAA GATACCTTTA TA - #AAACCTGT   

      120                                                                      

     - ATTCAAGAAA ATAGAGGAGA AAAAGGAGGA AGAAAATAAA CCTACTTTTG AT - #GTATCGAA   

      180                                                                      

     - AAAGAAAGAT AACCCACAAG TAAACCATAG TCAATTAAAT GAAAGTCACA GA - #AAAGAGGA   

      240                                                                      

     - TTTACAAAGA GAAGAGCATT CACAAAAATC TGATTCAACT AAGGATGTTA CA - #GCTACAGT   

      300                                                                      

     - TCTTGATAAA AACAATATCA GTAGTAAATC AACTACTAAC AATCCTAATA AG - #           

      352                                                                      

     - (2) INFORMATION FOR SEQ ID NO:68:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 117 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:68:                            

     - Tyr Lys Gly Glu Leu Glu Lys Gly Tyr Gln Ph - #e Asp Gly Trp Glu Ile     

     #                15                                                       

     - Ser Gly Phe Glu Gly Lys Lys Asp Ala Gly Ty - #r Val Ile Asn Leu Ser     

     #            30                                                           

     - Lys Asp Thr Phe Ile Lys Pro Val Phe Lys Ly - #s Ile Glu Glu Lys Lys     

     #        45                                                               

     - Glu Glu Glu Asn Lys Pro Thr Phe Asp Val Se - #r Lys Lys Lys Asp Asn     

     #    60                                                                   

     - Pro Gln Val Asn His Ser Gln Leu Asn Glu Se - #r His Arg Lys Glu Asp     

     #80                                                                       

     - Leu Gln Arg Glu Glu His Ser Gln Lys Ser As - #p Ser Thr Lys Asp Val     

     #                95                                                       

     - Thr Ala Thr Val Leu Asp Lys Asn Asn Ile Se - #r Ser Lys Ser Thr Thr     

     #           110                                                           

     - Asn Asn Pro Asn Lys                                                     

             115                                                               

     - (2) INFORMATION FOR SEQ ID NO: 69:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1312 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #SEQ ID NO: 69: SEQUENCE DESCRIPTION:                                     

     - GAATGTTCAG GCTCAAGAAA GTTCAGGAAA TAAAATCCAC TTTATCAATG TT - #CAAGAAGG   

       60                                                                      

     - TGGCAGTGAT GCGATTATTC TTGAAAGCAA TGGACATTTT GCCATGGTGG AT - #ACAGGAGA   

      120                                                                      

     - AGATTATGAT TTCCCAGATG GAAGTGATTC TCGCTATCCA TGGAGAGAAG GA - #ATTGAAAC   

      180                                                                      

     - GTCTTATAAG CATGTTCTAA CAGACCGTGT CTTTCGTCGT TTGAAGGAAT TG - #GGTGTCCA   

      240                                                                      

     - AAAACTTGAT TTTATTTTGG TGACCCATAC CCACAGTGAT CATATTGGAA AT - #GTTGATGA   

      300                                                                      

     - ATTACTGTCT ACCTATCCAG TTGACCGAGT CTATCTTAAG AAATATAGTG AT - #AGTCGTAT   

      360                                                                      

     - TACTAATTCT GAACGTCTAT GGGATAATCT GTATGGCTAT GATAAGGTTT TA - #CAGACTGC   

      420                                                                      

     - TGCAGAAAAA GGTGTTTCAG TTATTCAAAA TATCACACAA GGGGATGCTC AT - #TTTCAGTT   

      480                                                                      

     - TGGGGACATG GATATTCAGC TCTATAATTA TGAAAATGAA ACTGATTCAT CG - #GGTGAATT   

      540                                                                      

     - AAAGAAAATT TGGGATGACA ATTCCAATTC CTTGATTAGC GTGGTGAAAG TC - #AATGGCAA   

      600                                                                      

     - GAAAATTTAC CTTGGGGGCG ATTTAGATAA TGTTCATGGA GCAGAAGACA AG - #TATGGTCC   

      660                                                                      

     - TCTCATTGGA AAAGTTGATT TGATGAAGTT TAATCATCAC CATGATACCA AC - #AAATCAAA   

      720                                                                      

     - TACCAAGGAT TTCATTAAAA ATTTGAGTCC GAGTTTGATT GTTCAAACTT CG - #GATAGTCT   

      780                                                                      

     - ACCTTGGAAA AATGGTGTTG ATAGTGAGTA TGTTAATTGG CTCAAAGAAC GA - #GGAATTGA   

      840                                                                      

     - GAGAATCAAC GCAGCCAGCA AAGACTATGA TGCAACAGTT TTTGATATTC GA - #AAAGACGG   

      900                                                                      

     - TTTTGTCAAT ATTTCAACAT CCTACAAGCC GATTCCAAGT TTTCAAGCTG GT - #TGGCATAA   

      960                                                                      

     - GAGTGCATAT GGGAACTGGT GGTATCAAGC GCCTGATTCT ACAGGAGAGT AT - #GCTGTCGG   

     1020                                                                      

     - TTGGAATGAA ATCGAAGGTG AATGGTATTA CTTTAACCAA ACGGGTATCT TG - #TTACAGAA   

     1080                                                                      

     - TCAATGGAAA AAATGGAACA ATCATTGGTT CTATTTGACA GACTCTGGTG CT - #TCTGCTAA   

     1140                                                                      

     - AAATTGGAAG AAAATCGCTG GAATCTGGTA TTATTTTAAC AAAGAAAACC AG - #ATGGAAAT   

     1200                                                                      

     - TGGTTGGATT CAAGATAAAG AGCAGTGGTA TTATTTGGAT GTTGATGGTT CT - #ATGAAGAC   

     1260                                                                      

     - AGGATGGCTT CAATATATGG GGCAATGGTA TTACTTTGCT CCATCAGGGG AA - #           

     1312                                                                      

     - (2) INFORMATION FOR SEQ ID NO:70:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 437 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:70:                            

     - Asn Val Gln Ala Gln Glu Ser Ser Gly Asn Ly - #s Ile His Phe Ile Asn     

     #                15                                                       

     - Val Gln Glu Gly Gly Ser Asp Ala Ile Ile Le - #u Glu Ser Asn Gly His     

     #            30                                                           

     - Phe Ala Met Val Asp Thr Gly Glu Asp Tyr As - #p Phe Pro Asp Gly Ser     

     #        45                                                               

     - Asp Ser Arg Tyr Pro Trp Arg Glu Gly Ile Gl - #u Thr Ser Tyr Lys His     

     #    60                                                                   

     - Val Leu Thr Asp Arg Val Phe Arg Arg Leu Ly - #s Glu Leu Gly Val Gln     

     #80                                                                       

     - Lys Leu Asp Phe Ile Leu Val Thr His Thr Hi - #s Ser Asp His Ile Gly     

     #                95                                                       

     - Asn Val Asp Glu Leu Leu Ser Thr Tyr Pro Va - #l Asp Arg Val Tyr Leu     

     #           110                                                           

     - Lys Lys Tyr Ser Asp Ser Arg Ile Thr Asn Se - #r Glu Arg Leu Trp Asp     

     #       125                                                               

     - Asn Leu Tyr Gly Tyr Asp Lys Val Leu Gln Th - #r Ala Ala Glu Lys Gly     

     #   140                                                                   

     - Val Ser Val Ile Gln Asn Ile Thr Gln Gly As - #p Ala His Phe Gln Phe     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Gly Asp Met Asp Ile Gln Leu Tyr Asn Tyr Gl - #u Asn Glu Thr Asp Ser     

     #               175                                                       

     - Ser Gly Glu Leu Lys Lys Ile Trp Asp Asp As - #n Ser Asn Ser Leu Ile     

     #           190                                                           

     - Ser Val Val Lys Val Asn Gly Lys Lys Ile Ty - #r Leu Gly Gly Asp Leu     

     #       205                                                               

     - Asp Asn Val His Gly Ala Glu Asp Lys Tyr Gl - #y Pro Leu Ile Gly Lys     

     #   220                                                                   

     - Val Asp Leu Met Lys Phe Asn His His His As - #p Thr Asn Lys Ser Asn     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Thr Lys Asp Phe Ile Lys Asn Leu Ser Pro Se - #r Leu Ile Val Gln Thr     

     #               255                                                       

     - Ser Asp Ser Leu Pro Trp Lys Asn Gly Val As - #p Ser Glu Tyr Val Asn     

     #           270                                                           

     - Trp Leu Lys Glu Arg Gly Ile Glu Arg Ile As - #n Ala Ala Ser Lys Asp     

     #       285                                                               

     - Tyr Asp Ala Thr Val Phe Asp Ile Arg Lys As - #p Gly Phe Val Asn Ile     

     #   300                                                                   

     - Ser Thr Ser Tyr Lys Pro Ile Pro Ser Phe Gl - #n Ala Gly Trp His Lys     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Ser Ala Tyr Gly Asn Trp Trp Tyr Gln Ala Pr - #o Asp Ser Thr Gly Glu     

     #               335                                                       

     - Tyr Ala Val Gly Trp Asn Glu Ile Glu Gly Gl - #u Trp Tyr Tyr Phe Asn     

     #           350                                                           

     - Gln Thr Gly Ile Leu Leu Gln Asn Gln Trp Ly - #s Lys Trp Asn Asn His     

     #       365                                                               

     - Trp Phe Tyr Leu Thr Asp Ser Gly Ala Ser Al - #a Lys Asn Trp Lys Lys     

     #   380                                                                   

     - Ile Ala Gly Ile Trp Tyr Tyr Phe Asn Lys Gl - #u Asn Gln Met Glu Ile     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Gly Trp Ile Gln Asp Lys Glu Gln Trp Tyr Ty - #r Leu Asp Val Asp Gly     

     #               415                                                       

     - Ser Met Lys Thr Gly Trp Leu Gln Tyr Met Gl - #y Gln Trp Tyr Tyr Phe     

     #           430                                                           

     - Ala Pro Ser Gly Glu                                                     

             435                                                               

     - (2) INFORMATION FOR SEQ ID NO: 71:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1855 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #71:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - CTTGGGTGTA ACCCATATCC AGCTCCTTCC AGTCTTGTCT TACTACTTTG TC - #AATGAATT   

       60                                                                      

     - GAAAAACCAT GAACGCTTGT CTGACTACGC TTCAAGCAAC AGCAACTACA AC - #TGGGGATA   

      120                                                                      

     - TGACCCTCAA AACTACTTCT CCTTGACTGG TATGTACTCA AGCGATCCTA AG - #AATCCAGA   

      180                                                                      

     - AAAACGAATC GCAGAATTTA AAAACCTCAT CAACGAAATC CACAAACGTG GT - #ATGGGAGC   

      240                                                                      

     - TATCCTAGAT GTCGTTTATA ACCACACAGC CAAAGTCGAT CTCTTTGAAG AT - #TTGGAACC   

      300                                                                      

     - AAACTACTAC CACTTTATGG ATGCCGATGG CACACCTCGA ACTAGCTTTG GT - #GGTGGACG   

      360                                                                      

     - CTTGGGGACA ACCCACCATA TGACCAAACG GCTCCTAATT GACTCTATCA AA - #TACCTAGT   

      420                                                                      

     - TGATACCTAC AAAGTGGATG GCTTCCGTTT CGATATGATG GGAGACCATG AC - #GCCGCTTC   

      480                                                                      

     - TATCGAAGAA GCTTACAAGG CTGCACGCGC CCTCAATCCA AACCTCATCA TG - #CTTGGTGA   

      540                                                                      

     - AGGTTGGAGA ACCTATGCCG GTGATGAAAA CATGCCTACT AAAGCTGCTG AC - #CAAGATTG   

      600                                                                      

     - GATGAAACAT ACCGATACTG TCGCTGTCTT TTCAGATGAC ATCCGTAACA AC - #CTCAAATC   

      660                                                                      

     - TGGTTATCCA AACGAAGGTC AACCTGCCTT TATCACAGGT GGCAAGCGTG AT - #GTCAACAC   

      720                                                                      

     - CATCTTTAAA AATCTCATTG CTCAACCAAC TAACTTTGAA GCTGACAGCC CT - #GGAGATGT   

      780                                                                      

     - CATCCAATAC ATCGCAGCCC ATGATAACTT GACCCTCTTT GACATCATTG CC - #CAGTCTAT   

      840                                                                      

     - CAAAAAAGAC CCAAGCAAGG CTGAGAACTA TGCTGAAATC CACCGTCGTT TA - #CGACTTGG   

      900                                                                      

     - AAATCTCATG GTCTTGACAG CTCAAGGAAC TCCATTTATC CACTCCGGTC AG - #GAATATGG   

      960                                                                      

     - ACGTACTAAA CAATTCCGTG ACCCAGCCTA CAAGACTCCA GTAGCAGAGG AT - #AAGGTTCC   

     1020                                                                      

     - AAACAAATCT CACTTGTTGC GTGATAAGGA CGGCAACCCA TTTGACTATC CT - #TACTTCAT   

     1080                                                                      

     - CCATGACTCT TACGATTCTA GTGATGCAGT CAACAAGTTT GACTGGACTA AG - #GCTACAGA   

     1140                                                                      

     - TGGTAAAGCT TATCCTGAAA ATGTCAAGAG CCGTGACTAT ATGAAAGGTT TG - #ATTGCCCT   

     1200                                                                      

     - TCGTCAATCT ACAGATGCCT TCCGACTTAA GAGTCTTCAA GATATCAAAG AC - #CGTGTCCA   

     1260                                                                      

     - CCTCATCACT GTCCCAGGCC AAAATGGTGT GGAAAAAGAG GATGTAGTGA TT - #GGCTACCA   

     1320                                                                      

     - AATCACTGCT CCAAACGGCG ATATCTACGC AGTCTTTGTC AATGCGGATG AA - #AAAGCTCG   

     1380                                                                      

     - CGAATTTAAT TTGGGAACTG CCTTTGCACA TCTAAGAAAT GCGGAAGTTT TG - #GCAGATGA   

     1440                                                                      

     - AAACCAAGCA GGACCAGTCG GAATTGCCAA CCCGAAAGGA CTTGAATGGA CT - #GAAAAAGG   

     1500                                                                      

     - CTTGAAATTG AATGCCCTTA CAGCTACTGT TCTTCGAGTC TCTCAAAATG GA - #ACTAGCCA   

     1560                                                                      

     - TGAGTCAACT GCAGAAGAGA AACCAGACTC AACCCCTTCC AAGCCTGAAC AT - #CAAAATGA   

     1620                                                                      

     - AGCTTCTCAC CCTGCACATC AAGACCCAGC TCCAGAAGCT AGACCTGATT CT - #ACTAAACC   

     1680                                                                      

     - AGATGCCAAA GTAGCTGATG CGGAAAATAA ACCTAGCCAA GCTACAGCTG AT - #TCACAAGC   

     1740                                                                      

     - TGAACAACCA GCACAAGAAG CACAAGCATC ATCTGTAAAA GAAGCGGTTC GA - #AACGAATC   

     1800                                                                      

     - GGTAGAAAAC TCTAGCAAGG AAAATATACC TGCAACCCCA GATAAACAAG CT - #GAA        

     1855                                                                      

     - (2) INFORMATION FOR SEQ ID NO:72:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 618 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:72:                            

     - Leu Gly Val Thr His Ile Gln Leu Leu Pro Va - #l Leu Ser Tyr Tyr Phe     

     #                15                                                       

     - Val Asn Glu Leu Lys Asn His Glu Arg Leu Se - #r Asp Tyr Ala Ser Ser     

     #            30                                                           

     - Asn Ser Asn Tyr Asn Trp Gly Tyr Asp Pro Gl - #n Asn Tyr Phe Ser Leu     

     #        45                                                               

     - Thr Gly Met Tyr Ser Ser Asp Pro Lys Asn Pr - #o Glu Lys Arg Ile Ala     

     #    60                                                                   

     - Glu Phe Lys Asn Leu Ile Asn Glu Ile His Ly - #s Arg Gly Met Gly Ala     

     #80                                                                       

     - Ile Leu Asp Val Val Tyr Asn His Thr Ala Ly - #s Val Asp Leu Phe Glu     

     #                95                                                       

     - Asp Leu Glu Pro Asn Tyr Tyr His Phe Met As - #p Ala Asp Gly Thr Pro     

     #           110                                                           

     - Arg Thr Ser Phe Gly Gly Gly Arg Leu Gly Th - #r Thr His His Met Thr     

     #       125                                                               

     - Lys Arg Leu Leu Ile Asp Ser Ile Lys Tyr Le - #u Val Asp Thr Tyr Lys     

     #   140                                                                   

     - Val Asp Gly Phe Arg Phe Asp Met Met Gly As - #p His Asp Ala Ala Ser     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Ile Glu Glu Ala Tyr Lys Ala Ala Arg Ala Le - #u Asn Pro Asn Leu Ile     

     #               175                                                       

     - Met Leu Gly Glu Gly Trp Arg Thr Tyr Ala Gl - #y Asp Glu Asn Met Pro     

     #           190                                                           

     - Thr Lys Ala Ala Asp Gln Asp Trp Met Lys Hi - #s Thr Asp Thr Val Ala     

     #       205                                                               

     - Val Phe Ser Asp Asp Ile Arg Asn Asn Leu Ly - #s Ser Gly Tyr Pro Asn     

     #   220                                                                   

     - Glu Gly Gln Pro Ala Phe Ile Thr Gly Gly Ly - #s Arg Asp Val Asn Thr     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Ile Phe Lys Asn Leu Ile Ala Gln Pro Thr As - #n Phe Glu Ala Asp Ser     

     #               255                                                       

     - Pro Gly Asp Val Ile Gln Tyr Ile Ala Ala Hi - #s Asp Asn Leu Thr Leu     

     #           270                                                           

     - Phe Asp Ile Ile Ala Gln Ser Ile Lys Lys As - #p Pro Ser Lys Ala Glu     

     #       285                                                               

     - Asn Tyr Ala Glu Ile His Arg Arg Leu Arg Le - #u Gly Asn Leu Met Val     

     #   300                                                                   

     - Leu Thr Ala Gln Gly Thr Pro Phe Ile His Se - #r Gly Gln Glu Tyr Gly     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Arg Thr Lys Gln Phe Arg Asp Pro Ala Tyr Ly - #s Thr Pro Val Ala Glu     

     #               335                                                       

     - Asp Lys Val Pro Asn Lys Ser His Leu Leu Ar - #g Asp Lys Asp Gly Asn     

     #           350                                                           

     - Pro Phe Asp Tyr Pro Tyr Phe Ile His Asp Se - #r Tyr Asp Ser Ser Asp     

     #       365                                                               

     - Ala Val Asn Lys Phe Asp Trp Thr Lys Ala Th - #r Asp Gly Lys Ala Tyr     

     #   380                                                                   

     - Pro Glu Asn Val Lys Ser Arg Asp Tyr Met Ly - #s Gly Leu Ile Ala Leu     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Arg Gln Ser Thr Asp Ala Phe Arg Leu Lys Se - #r Leu Gln Asp Ile Lys     

     #               415                                                       

     - Asp Arg Val His Leu Ile Thr Val Pro Gly Gl - #n Asn Gly Val Glu Lys     

     #           430                                                           

     - Glu Asp Val Val Ile Gly Tyr Gln Ile Thr Al - #a Pro Asn Gly Asp Ile     

     #       445                                                               

     - Tyr Ala Val Phe Val Asn Ala Asp Glu Lys Al - #a Arg Glu Phe Asn Leu     

     #   460                                                                   

     - Gly Thr Ala Phe Ala His Leu Arg Asn Ala Gl - #u Val Leu Ala Asp Glu     

     465                 4 - #70                 4 - #75                 4 -   

     #80                                                                       

     - Asn Gln Ala Gly Pro Val Gly Ile Ala Asn Pr - #o Lys Gly Leu Glu Trp     

     #               495                                                       

     - Thr Glu Lys Gly Leu Lys Leu Asn Ala Leu Th - #r Ala Thr Val Leu Arg     

     #           510                                                           

     - Val Ser Gln Asn Gly Thr Ser His Glu Ser Th - #r Ala Glu Glu Lys Pro     

     #       525                                                               

     - Asp Ser Thr Pro Ser Lys Pro Glu His Gln As - #n Glu Ala Ser His Pro     

     #   540                                                                   

     - Ala His Gln Asp Pro Ala Pro Glu Ala Arg Pr - #o Asp Ser Thr Lys Pro     

     545                 5 - #50                 5 - #55                 5 -   

     #60                                                                       

     - Asp Ala Lys Val Ala Asp Ala Glu Asn Lys Pr - #o Ser Gln Ala Thr Ala     

     #               575                                                       

     - Asp Ser Gln Ala Glu Gln Pro Ala Gln Glu Al - #a Gln Ala Ser Ser Val     

     #           590                                                           

     - Lys Glu Ala Val Arg Asn Glu Ser Val Glu As - #n Ser Ser Lys Glu Asn     

     #       605                                                               

     - Ile Pro Ala Thr Pro Asp Lys Gln Ala Glu                                 

     #   615                                                                   

     - (2) INFORMATION FOR SEQ ID NO: 73:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1774 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #73:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TAGTGATGGT ACTTGGCAAG GAAAACAGTA TCTGAAAGAA GATGGCAGTC AA - #GCAGCAAA   

       60                                                                      

     - TGAGTGGGTT TTNGATACTC ATTATCAATC TTGGTTCTAT ATAAAAGCAG AT - #GCTAACTA   

      120                                                                      

     - TGCTGAAAAT GAATGGCTAA AGCAAGGTGA CGACTATTTT TACCTCAAAT CT - #GGTGGCTA   

      180                                                                      

     - TATGGCCAAA TCAGAATGGG TAGAAGACAA GGGAGCCTTT TATTATCTTG AC - #CAAGATGG   

      240                                                                      

     - AAAGATGAAA AGAAATGCTT GGGTAGGAAC TTCCTATGTT GGTGCAACAG GT - #GCCAAAGT   

      300                                                                      

     - AATAGAAGAC TGGGTCTATG ATTCTCAATA CGATGCTTGG TTTTATATCA AA - #GCAGATGG   

      360                                                                      

     - ACAGCACGCA GAGAAAGAAT GGCTCCAAAT TAAAGGGAAG GACTATTATT TC - #AAATCCGG   

      420                                                                      

     - TGGTTATCTA CTGACAAGTC AGTGGATTAA TCAAGCTTAT GTGAATGCTA GT - #GGTGCCAA   

      480                                                                      

     - AGTACAGCAA GGTTGGCTTT TTGACAAACA ATACCAATCT TGGTTTTACA TC - #AAAGAAAA   

      540                                                                      

     - TGGAAACTAT GCTGATAAAG AATGGATTTT CGAGAATGGT CACTATTATT AT - #CTAAAATC   

      600                                                                      

     - CGGTGGCTAC ATGGCAGCCA ATGAATGGAT TTGGGATAAG GAATCTTGGT TT - #TATCTCAA   

      660                                                                      

     - ATTTGATGGG AAAATGGCTG AAAAAGAATG GGTCTACGAT TCTCATAGTC AA - #GCTTGGTA   

      720                                                                      

     - CTACTTCAAA TCCGGTGGTT ACATGACAGC CAATGAATGG ATTTGGGATA AG - #GAATCTTG   

      780                                                                      

     - GTTTTACCTC AAATCTGATG GGAAAATAGC TGAAAAAGAA TGGGTCTACG AT - #TCTCATAG   

      840                                                                      

     - TCAAGCTTGG TACTACTTCA AATCTGGTGG CTACATGGCG AAAAATGAGA CA - #GTAGATGG   

      900                                                                      

     - TTATCAGCTT GGAAGCGATG GTAAATGGCT TGGAGGAAAA ACTACAAATG AA - #AATGCTGC   

      960                                                                      

     - TTACTATCAA GTAGTGCCTG TTACAGCCAA TGTTTATGAT TCAGATGGTG AA - #AAGCTTTC   

     1020                                                                      

     - CTATATATCG CAAGGTAGTG TCGTATGGCT AGATAAGGAT AGAAAAAGTG AT - #GACAAGCG   

     1080                                                                      

     - CTTGGCTATT ACTATTTCTG GTTTGTCAGG CTATATGAAA ACAGAAGATT TA - #CAAGCGCT   

     1140                                                                      

     - AGATGCTAGT AAGGACTTTA TCCCTTATTA TGAGAGTGAT GGCCACCGTT TT - #TATCACTA   

     1200                                                                      

     - TGTGGCTCAG AATGCTAGTA TCCCAGTAGC TTCTCATCTT TCTGATATGG AA - #GTAGGCAA   

     1260                                                                      

     - GAAATATTAT TCGGCAGATG GCCTGCATTT TGATGGTTTT AAGCTTGAGA AT - #CCCTTCCT   

     1320                                                                      

     - TTTCAAAGAT TTAACAGAGG CTACAAACTA CAGTGCTGAA GAATTGGATA AG - #GTATTTAG   

     1380                                                                      

     - TTTGCTAAAC ATTAACAATA GCCTTTTGGA GAACAAGGGC GCTACTTTTA AG - #GAAGCCGA   

     1440                                                                      

     - AGAACATTAC CATATCAATG CTCTTTATCT CCTTGCCCAT AGTGCCCTAG AA - #AGTAACTG   

     1500                                                                      

     - GGGAAGAAGT AAAATTGCCA AAGATAAGAA TAATTTCTTT GGCATTACAG CC - #TATGATAC   

     1560                                                                      

     - GACCCCTTAC CTTTCTGCTA AGACATTTGA TGATGTGGAT AAGGGAATTT TA - #GGTGCAAC   

     1620                                                                      

     - CAAGTGGATT AAGGAAAATT ATATCGATAG GGGAAGAACT TTCCTTGGAA AC - #AAGGCTTC   

     1680                                                                      

     - TGGTATGAAT GTGGAATATG CTTCAGACCC TTATTGGGGC GAAAAAATTG CT - #AGTGTGAT   

     1740                                                                      

     #      1774        AAGC TAGGTGGCAA AGAT                                   

     - (2) INFORMATION FOR SEQ ID NO:74:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 591 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:74:                            

     - Ser Asp Gly Thr Trp Gln Gly Lys Gln Tyr Le - #u Lys Glu Asp Gly Ser     

     #                15                                                       

     - Gln Ala Ala Asn Glu Trp Val Xaa Asp Thr Hi - #s Tyr Gln Ser Trp Phe     

     #            30                                                           

     - Tyr Ile Lys Ala Asp Ala Asn Tyr Ala Glu As - #n Glu Trp Leu Lys Gln     

     #        45                                                               

     - Gly Asp Asp Tyr Phe Tyr Leu Lys Ser Gly Gl - #y Tyr Met Ala Lys Ser     

     #    60                                                                   

     - Glu Trp Val Glu Asp Lys Gly Ala Phe Tyr Ty - #r Leu Asp Gln Asp Gly     

     #80                                                                       

     - Lys Met Lys Arg Asn Ala Trp Val Gly Thr Se - #r Tyr Val Gly Ala Thr     

     #                95                                                       

     - Gly Ala Lys Val Ile Glu Asp Trp Val Tyr As - #p Ser Gln Tyr Asp Ala     

     #           110                                                           

     - Trp Phe Tyr Ile Lys Ala Asp Gly Gln His Al - #a Glu Lys Glu Trp Leu     

     #       125                                                               

     - Gln Ile Lys Gly Lys Asp Tyr Tyr Phe Lys Se - #r Gly Gly Tyr Leu Leu     

     #   140                                                                   

     - Thr Ser Gln Trp Ile Asn Gln Ala Tyr Val As - #n Ala Ser Gly Ala Lys     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Val Gln Gln Gly Trp Leu Phe Asp Lys Gln Ty - #r Gln Ser Trp Phe Tyr     

     #               175                                                       

     - Ile Lys Glu Asn Gly Asn Tyr Ala Asp Lys Gl - #u Trp Ile Phe Glu Asn     

     #           190                                                           

     - Gly His Tyr Tyr Tyr Leu Lys Ser Gly Gly Ty - #r Met Ala Ala Asn Glu     

     #       205                                                               

     - Trp Ile Trp Asp Lys Glu Ser Trp Phe Tyr Le - #u Lys Phe Asp Gly Lys     

     #   220                                                                   

     - Met Ala Glu Lys Glu Trp Val Tyr Asp Ser Hi - #s Ser Gln Ala Trp Tyr     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Tyr Phe Lys Ser Gly Gly Tyr Met Thr Ala As - #n Glu Trp Ile Trp Asp     

     #               255                                                       

     - Lys Glu Ser Trp Phe Tyr Leu Lys Ser Asp Gl - #y Lys Ile Ala Glu Lys     

     #           270                                                           

     - Glu Trp Val Tyr Asp Ser His Ser Gln Ala Tr - #p Tyr Tyr Phe Lys Ser     

     #       285                                                               

     - Gly Gly Tyr Met Ala Lys Asn Glu Thr Val As - #p Gly Tyr Gln Leu Gly     

     #   300                                                                   

     - Ser Asp Gly Lys Trp Leu Gly Gly Lys Thr Th - #r Asn Glu Asn Ala Ala     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Tyr Tyr Gln Val Val Pro Val Thr Ala Asn Va - #l Tyr Asp Ser Asp Gly     

     #               335                                                       

     - Glu Lys Leu Ser Tyr Ile Ser Gln Gly Ser Va - #l Val Trp Leu Asp Lys     

     #           350                                                           

     - Asp Arg Lys Ser Asp Asp Lys Arg Leu Ala Il - #e Thr Ile Ser Gly Leu     

     #       365                                                               

     - Ser Gly Tyr Met Lys Thr Glu Asp Leu Gln Al - #a Leu Asp Ala Ser Lys     

     #   380                                                                   

     - Asp Phe Ile Pro Tyr Tyr Glu Ser Asp Gly Hi - #s Arg Phe Tyr His Tyr     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Val Ala Gln Asn Ala Ser Ile Pro Val Ala Se - #r His Leu Ser Asp Met     

     #               415                                                       

     - Glu Val Gly Lys Lys Tyr Tyr Ser Ala Asp Gl - #y Leu His Phe Asp Gly     

     #           430                                                           

     - Phe Lys Leu Glu Asn Pro Phe Leu Phe Lys As - #p Leu Thr Glu Ala Thr     

     #       445                                                               

     - Asn Tyr Ser Ala Glu Glu Leu Asp Lys Val Ph - #e Ser Leu Leu Asn Ile     

     #   460                                                                   

     - Asn Asn Ser Leu Leu Glu Asn Lys Gly Ala Th - #r Phe Lys Glu Ala Glu     

     465                 4 - #70                 4 - #75                 4 -   

     #80                                                                       

     - Glu His Tyr His Ile Asn Ala Leu Tyr Leu Le - #u Ala His Ser Ala Leu     

     #               495                                                       

     - Glu Ser Asn Trp Gly Arg Ser Lys Ile Ala Ly - #s Asp Lys Asn Asn Phe     

     #           510                                                           

     - Phe Gly Ile Thr Ala Tyr Asp Thr Thr Pro Ty - #r Leu Ser Ala Lys Thr     

     #       525                                                               

     - Phe Asp Asp Val Asp Lys Gly Ile Leu Gly Al - #a Thr Lys Trp Ile Lys     

     #   540                                                                   

     - Glu Asn Tyr Ile Asp Arg Gly Arg Thr Phe Le - #u Gly Asn Lys Ala Ser     

     545                 5 - #50                 5 - #55                 5 -   

     #60                                                                       

     - Gly Met Asn Val Glu Tyr Ala Ser Asp Pro Ty - #r Trp Gly Glu Lys Ile     

     #               575                                                       

     - Ala Ser Val Met Met Lys Ile Asn Glu Lys Le - #u Gly Gly Lys Asp         

     #           590                                                           

     - (2) INFORMATION FOR SEQ ID NO: 75:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1105 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #75:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TGGGATTCAA TATGTCAGAG ATGATACTAG AGATAAAGAA GAGGGAATAG AG - #TATGATGA   

       60                                                                      

     - CGCTGACAAT GGGGATATTA TTGTAAAAGT AGCGACTAAA CCTAAGGTAG TA - #ACCAAGAA   

      120                                                                      

     - AATTTCAAGT ACGCGAATTC GTTATGAAAA AGATGAAACA AAAGACCGTA GT - #GAAAATCC   

      180                                                                      

     - TGTTACAATT GATGGAGAGG ATGGCTATGT AACTACGACA AGGACCTACG AT - #GTTAATCC   

      240                                                                      

     - AGAGACTGGT TATGTTACCG AACAGGTTAC TGTTGATAGA AAAGAAGCCA CG - #GATACAGT   

      300                                                                      

     - TATCAAAGTT CCAGCTAAAA GCAAGGTTGA AGAAGTTCTT GTTCCATTTG CT - #ACTAAATA   

      360                                                                      

     - TGAAGCAGAC AATGACCTTT CTGCAGGACA GGAGCAAGAG ATTACTCTAG GA - #AAGAATGG   

      420                                                                      

     - GAAAACAGTT ACAACGATAA CTTATAATGT AGATGGAAAG AGTGGACAAG TA - #ACTGAGAG   

      480                                                                      

     - TACTTTAAGT CAAAAAAAAG ACTCtCAAAC AAGAGTTGTT AAAAAAAGaA CC - #ArkCCCCA   

      540                                                                      

     - AGTTCTTGTC CAAGAAATTC CAATCGAAAC AGAATATCTC GATGGCCCaA CT - #CTTGATAA   

      600                                                                      

     - AaGTCAAGAA GTAGAAGAAG TAGGAGAAAT TGGTAAATTA CTCTTACTAC AA - #TCTATACT   

      660                                                                      

     - GGTAGATGAA CGTGATGGAA CAATTGAAGA AACTACTTCT CGTCAAATTA CT - #AAAGAGAT   

      720                                                                      

     - GGTAAAAAGA CGTATAAGGA GAGGGACGAG AGAACCTGAA AAAGTTGTTG TT - #CCTGAGCA   

      780                                                                      

     - ATCATCTATT CCTTCGTATC CTGTATCTGT TACATCTAAC CAAGGAACAG AT - #GTAGCAGT   

      840                                                                      

     - AGAACCAGCT AAAGCAGTTG CTCCAACAAC AGACTGGAAA CAAGAAAATG GT - #ATGTGGTA   

      900                                                                      

     - TTTTTATAAT ACTGATGGTT CCATGGCAAC AGGTTGGGTA CAAGTTAATA GT - #TCATGGTA   

      960                                                                      

     - CTACCTCAAC AGCAACGGTT CTATGAAAGT CAATCAATGG TTCCAAGTTG GT - #GGTAAATG   

     1020                                                                      

     - GTATTATGTA AATACATCGG GTGAGTTAGC GGTCAATACA AGTATAGATG GC - #TATAGAGT   

     1080                                                                      

     #             1105 TGGG TGCGT                                             

     - (2) INFORMATION FOR SEQ ID NO:76:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 368 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:76:                            

     - Gly Ile Gln Tyr Val Arg Asp Asp Thr Arg As - #p Lys Glu Glu Gly Ile     

     #                15                                                       

     - Glu Tyr Asp Asp Ala Asp Asn Gly Asp Ile Il - #e Val Lys Val Ala Thr     

     #            30                                                           

     - Lys Pro Lys Val Val Thr Lys Lys Ile Ser Se - #r Thr Arg Ile Arg Tyr     

     #        45                                                               

     - Glu Lys Asp Glu Thr Lys Asp Arg Ser Glu As - #n Pro Val Thr Ile Asp     

     #    60                                                                   

     - Gly Glu Asp Gly Tyr Val Thr Thr Thr Arg Th - #r Tyr Asp Val Asn Pro     

     #80                                                                       

     - Glu Thr Gly Tyr Val Thr Glu Gln Val Thr Va - #l Asp Arg Lys Glu Ala     

     #                95                                                       

     - Thr Asp Thr Val Ile Lys Val Pro Ala Lys Se - #r Lys Val Glu Glu Val     

     #           110                                                           

     - Leu Val Pro Phe Ala Thr Lys Tyr Glu Ala As - #p Asn Asp Leu Ser Ala     

     #       125                                                               

     - Gly Gln Glu Gln Glu Ile Thr Leu Gly Lys As - #n Gly Lys Thr Val Thr     

     #   140                                                                   

     - Thr Ile Thr Tyr Asn Val Asp Gly Lys Ser Gl - #y Gln Val Thr Glu Ser     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Thr Leu Ser Gln Lys Lys Asp Ser Gln Thr Ar - #g Val Val Lys Lys Arg     

     #               175                                                       

     - Thr Xaa Pro Gln Val Leu Val Gln Glu Ile Pr - #o Ile Glu Thr Glu Tyr     

     #           190                                                           

     - Leu Asp Gly Pro Thr Leu Asp Lys Ser Gln Gl - #u Val Glu Glu Val Gly     

     #       205                                                               

     - Glu Ile Gly Lys Leu Leu Leu Leu Gln Ser Il - #e Leu Val Asp Glu Arg     

     #   220                                                                   

     - Asp Gly Thr Ile Glu Glu Thr Thr Ser Arg Gl - #n Ile Thr Lys Glu Met     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Val Lys Arg Arg Ile Arg Arg Gly Thr Arg Gl - #u Pro Glu Lys Val Val     

     #               255                                                       

     - Val Pro Glu Gln Ser Ser Ile Pro Ser Tyr Pr - #o Val Ser Val Thr Ser     

     #           270                                                           

     - Asn Gln Gly Thr Asp Val Ala Val Glu Pro Al - #a Lys Ala Val Ala Pro     

     #       285                                                               

     - Thr Thr Asp Trp Lys Gln Glu Asn Gly Met Tr - #p Tyr Phe Tyr Asn Thr     

     #   300                                                                   

     - Asp Gly Ser Met Ala Thr Gly Trp Val Gln Va - #l Asn Ser Ser Trp Tyr     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Tyr Leu Asn Ser Asn Gly Ser Met Lys Val As - #n Gln Trp Phe Gln Val     

     #               335                                                       

     - Gly Gly Lys Trp Tyr Tyr Val Asn Thr Ser Gl - #y Glu Leu Ala Val Asn     

     #           350                                                           

     - Thr Ser Ile Asp Gly Tyr Arg Val Asn Asp As - #n Gly Glu Trp Val Arg     

     #       365                                                               

     - (2) INFORMATION FOR SEQ ID NO: 77:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 661 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #77:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - GGATAATAGA GAAGCATTAA AAACCTTTAT GACGGGTGAA AATTTTTATC TC - #CAACATTA   

       60                                                                      

     - TCTAGGAGCA CATAGGGAAG AACTAAATGG AGAGCATGGC TATACCTTCC GT - #GTTTGGGC   

      120                                                                      

     - ACCTAATGCT CAGGCTGTTC ACTTGGTTGG TGATTTTACC AACTGGATTG AA - #AATCAGAT   

      180                                                                      

     - TCCAATGGTA AGAAATGATT TTGGGGTCTG GGAAGTCTTT ACCAATATGG CT - #CAAGAAGG   

      240                                                                      

     - GCATATTTAC AAATATCATG TCACACGTCA AAATGGTCAT CAACTGATGA AG - #ATTGACCC   

      300                                                                      

     - TTTTGCTGTC AGGTATGAGG CTCGTCCAGG AACAGGGGCA ATCGTAACAG AG - #CTTCCTGA   

      360                                                                      

     - GAAGAAATGG AAGGATGGAC TTTGGCTGGC ACGAAGAAAA CGTTGGGGCT TT - #GAAGAGCG   

      420                                                                      

     - TCCTGTCAAT ATTTATGAAG TTCACGCTGG ATCATGGAAA AGAAATTCTG AT - #GGCAGTCC   

      480                                                                      

     - TTATAGTTTT GCCCAGCTCA AGGATGAACT CATTCCTTAT CTCGTTGAAA TG - #AACTATAC   

      540                                                                      

     - TCATATTGAG TTTATGCCCT TGATGTCCCA TCCTTTGGGC TTGAGTTGGG GG - #TATCAGCT   

      600                                                                      

     - TATGGGTTAC TTCGCTTTAG AGCATGCTTA TGGCCGACCA GAGGAGTTTC AA - #GATTTTGT   

      660                                                                      

     #              661                                                        

     - (2) INFORMATION FOR SEQ ID NO:78:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 220 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:78:                            

     - Asp Asn Arg Glu Ala Leu Lys Thr Phe Met Th - #r Gly Glu Asn Phe Tyr     

     #                15                                                       

     - Leu Gln His Tyr Leu Gly Ala His Arg Glu Gl - #u Leu Asn Gly Glu His     

     #            30                                                           

     - Gly Tyr Thr Phe Arg Val Trp Ala Pro Asn Al - #a Gln Ala Val His Leu     

     #        45                                                               

     - Val Gly Asp Phe Thr Asn Trp Ile Glu Asn Gl - #n Ile Pro Met Val Arg     

     #    60                                                                   

     - Asn Asp Phe Gly Val Trp Glu Val Phe Thr As - #n Met Ala Gln Glu Gly     

     #80                                                                       

     - His Ile Tyr Lys Tyr His Val Thr Arg Gln As - #n Gly His Gln Leu Met     

     #                95                                                       

     - Lys Ile Asp Pro Phe Ala Val Arg Tyr Glu Al - #a Arg Pro Gly Thr Gly     

     #           110                                                           

     - Ala Ile Val Thr Glu Leu Pro Glu Lys Lys Tr - #p Lys Asp Gly Leu Trp     

     #       125                                                               

     - Leu Ala Arg Arg Lys Arg Trp Gly Phe Glu Gl - #u Arg Pro Val Asn Ile     

     #   140                                                                   

     - Tyr Glu Val His Ala Gly Ser Trp Lys Arg As - #n Ser Asp Gly Ser Pro     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Tyr Ser Phe Ala Gln Leu Lys Asp Glu Leu Il - #e Pro Tyr Leu Val Glu     

     #               175                                                       

     - Met Asn Tyr Thr His Ile Glu Phe Met Pro Le - #u Met Ser His Pro Leu     

     #           190                                                           

     - Gly Leu Ser Trp Gly Tyr Gln Leu Met Gly Ty - #r Phe Ala Leu Glu His     

     #       205                                                               

     - Ala Tyr Gly Arg Pro Glu Glu Phe Gln Asp Ph - #e Val                     

     #   220                                                                   

     - (2) INFORMATION FOR SEQ ID NO: 79:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 976 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #79:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - AGATTTTGTC GAGGAGTGTC ATACCCATAA TATTGGGGTT ATTGTGGACT GG - #GTACCAGN   

       60                                                                      

     - TCACTTTACC ATCAACGATG ATGCCTTAGC CTATTATGAT GGGACACCGA CT - #TTTGAATA   

      120                                                                      

     - CCAAGACCAT AATAAGGCTC ATAACCATGG TTGGGGTGCC CTTAATTTTG AC - #CTTGGAAA   

      180                                                                      

     - AAATGAAGTC CAGTCCTTCT TAATTTCTTG CATTAAGCAT TGGATTGATG TC - #TATCATTT   

      240                                                                      

     - GGATGGTATT CGTGTGGATG CTGTTAGCAA CATGCTCTAT TTGGACTATG AT - #GATGCTCC   

      300                                                                      

     - ATGGACACCT AATAAAGATG GCGGAAATCT CAACTATGAA GGTTATTATT TC - #CTTCAGCG   

      360                                                                      

     - CTTGAATGAG GTTATTAAGT TAGAATATCC AGATGTGATG ATGATTGCAG AA - #GAAAGTTC   

      420                                                                      

     - GTCTGCGATC AAGATTACGG GAATGAAAGA GATTGGTGGT CTAGGATTTG AC - #TACAAATG   

      480                                                                      

     - GAACATGGGC TGGATGAATG ATATCCTCCG TTTCTACGAA GAAGATCCGA TC - #TATCGTAA   

      540                                                                      

     - ATATGACTTT AACCTGGTGA CTTTCAGCTT TATGTATGTT TNCAAGGAGA AT - #TATCTCTT   

      600                                                                      

     - GCCATTCTCG CACGATGAAG TGGTTCATGG CAAGAAGAGT ATGATGCATA AG - #ATGTGGGG   

      660                                                                      

     - AGATCGTTAC AATCAATTCG CAGGCTTGCG CAATCTCTAT ACGTACCAAA TT - #TGTCACCC   

      720                                                                      

     - TGGTAAGAAA TTGCTCTTCA TGGGTAGCGA ATACGGTCAA TTCCTAGAAT GG - #AAATCTGA   

      780                                                                      

     - AGAACAGTTG GAATGGTCTA ACCTAGAAGA CCCAATGAAT GCTAAGATGA AG - #TATTTCGC   

      840                                                                      

     - TTCTCAGCTA AACCAGTTTT ACAAAGATCA TCGCTGTCTG TGGGAAATTG AT - #ACCAGCTA   

      900                                                                      

     - TGATGGTATT GAAATCATTG ATGCGGATAA TCGAGACCAG AGTGTTCTTT CC - #TTTATTCG   

      960                                                                      

     #   976                                                                   

     - (2) INFORMATION FOR SEQ ID NO:80:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 325 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:80:                            

     - Asp Phe Val Glu Glu Cys His Thr His Asn Il - #e Gly Val Ile Val Asp     

     #                15                                                       

     - Trp Val Pro Xaa His Phe Thr Ile Asn Asp As - #p Ala Leu Ala Tyr Tyr     

     #            30                                                           

     - Asp Gly Thr Pro Thr Phe Glu Tyr Gln Asp Hi - #s Asn Lys Ala His Asn     

     #        45                                                               

     - His Gly Trp Gly Ala Leu Asn Phe Asp Leu Gl - #y Lys Asn Glu Val Gln     

     #    60                                                                   

     - Ser Phe Leu Ile Ser Cys Ile Lys His Trp Il - #e Asp Val Tyr His Leu     

     #80                                                                       

     - Asp Gly Ile Arg Val Asp Ala Val Ser Asn Me - #t Leu Tyr Leu Asp Tyr     

     #                95                                                       

     - Asp Asp Ala Pro Trp Thr Pro Asn Lys Asp Gl - #y Gly Asn Leu Asn Tyr     

     #           110                                                           

     - Glu Gly Tyr Tyr Phe Leu Gln Arg Leu Asn Gl - #u Val Ile Lys Leu Glu     

     #       125                                                               

     - Tyr Pro Asp Val Met Met Ile Ala Glu Glu Se - #r Ser Ser Ala Ile Lys     

     #   140                                                                   

     - Ile Thr Gly Met Lys Glu Ile Gly Gly Leu Gl - #y Phe Asp Tyr Lys Trp     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Asn Met Gly Trp Met Asn Asp Ile Leu Arg Ph - #e Tyr Glu Glu Asp Pro     

     #               175                                                       

     - Ile Tyr Arg Lys Tyr Asp Phe Asn Leu Val Th - #r Phe Ser Phe Met Tyr     

     #           190                                                           

     - Val Xaa Lys Glu Asn Tyr Leu Leu Pro Phe Se - #r His Asp Glu Val Val     

     #       205                                                               

     - His Gly Lys Lys Ser Met Met His Lys Met Tr - #p Gly Asp Arg Tyr Asn     

     #   220                                                                   

     - Gln Phe Ala Gly Leu Arg Asn Leu Tyr Thr Ty - #r Gln Ile Cys His Pro     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Gly Lys Lys Leu Leu Phe Met Gly Ser Glu Ty - #r Gly Gln Phe Leu Glu     

     #               255                                                       

     - Trp Lys Ser Glu Glu Gln Leu Glu Trp Ser As - #n Leu Glu Asp Pro Met     

     #           270                                                           

     - Asn Ala Lys Met Lys Tyr Phe Ala Ser Gln Le - #u Asn Gln Phe Tyr Lys     

     #       285                                                               

     - Asp His Arg Cys Leu Trp Glu Ile Asp Thr Se - #r Tyr Asp Gly Ile Glu     

     #   300                                                                   

     - Ile Ile Asp Ala Asp Asn Arg Asp Gln Ser Va - #l Leu Ser Phe Ile Arg     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Lys Gly Lys Lys Gly                                                     

                     325                                                       

     - (2) INFORMATION FOR SEQ ID NO: 81:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 2134 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #81:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - ATCTGTAGTT TATGCGGATG AAACACTTAT TACTCATACT GCTGAGAAAC CT - #AAAGAGGA   

       60                                                                      

     - AAAAATGATA GTAGAAGAAA AGGCTGATAA AGCTTTGGAA ACTAAAAATA TA - #GTTGAAAG   

      120                                                                      

     - GACAGAACAA AGTGAACCTA GTTCAACTGA GGCTATTGCA TCTGAGNAGA AA - #GAAGATGA   

      180                                                                      

     - AGCCGTAACT CCAAAAGAGG AAAAAGTGTC TGCTAAACCG GAAGAAAAAG CT - #CCAAGGAT   

      240                                                                      

     - AGAATCACAA GCTTCAAATC AAGAAAAACC GCTCAAGGAA GATGCTAAAG CT - #GTAACAAA   

      300                                                                      

     - TGAAGAAGTG AATCAAATGA TTGAAGACAG GAAAGTGGAT TTTAATCAAA AT - #TGGTACTT   

      360                                                                      

     - TAAACTCAAT GCAAATTCTA AGGAAGCCAT TAAACCTGAT GCAGACGTAT CT - #ACGTGGAA   

      420                                                                      

     - AAAATTAGAT TTACCGTATG ACTGGAGTAT CTTTAACGAT TTCGATCATG AA - #TCTCCTGC   

      480                                                                      

     - ACAAAATGAA GGTGGACAGC TCAACGGTGG GGAAGCTTGG TATCGCAAGA CT - #TTCAAACT   

      540                                                                      

     - AGATGAAAAA GACCTCAAGA AAAATGTTCG CCTTACTTTT GATGGCGTCT AC - #ATGGATTC   

      600                                                                      

     - TCAAGTTTAT GTCAATGGTC AGTTAGTGGG GCATTATCCA AATGGTTATA AC - #CAGTTCTC   

      660                                                                      

     - ATATGATATC ACCAAATACC TTCAAAAAGA TGGTCGTGAG AATGTGATTG CT - #GTCCATGC   

      720                                                                      

     - AGTCAACAAA CAGCCAAGTA GCCGTTGGTA TTCAGGAAGT GGTATCTATC GT - #GATGTGAC   

      780                                                                      

     - TTTACAAGTG ACAGATAAGG TGCATGTTGA GAAAAATGGG ACAACTATTT TA - #ACACCAAA   

      840                                                                      

     - ACTTGAAGAA CAACAACATG GCAAGGTTGA AACTCATGTG ACCAGCAAAA TC - #GTCAATAC   

      900                                                                      

     - GGACGACAAA GACCATGAAC TTGTAGCCGA ATATCAAATC GTTGAACGAG GT - #GGTCATGC   

      960                                                                      

     - TGTAACAGGC TTAGTTCGTA CAGCGAGTCG TACCTTAAAA GCACATGAAT CA - #ACAAGCCT   

     1020                                                                      

     - AGATGCGATT TTAGAAGTTG AAAGACCAAA ACTCTGGACT GTTTTAAATG AC - #AAACCTGC   

     1080                                                                      

     - CTTGTACGAA TTGATTACGC GTGTTTACCG TGACGGTCAA TTGGTTGATG CT - #AAGAAGGA   

     1140                                                                      

     - TTTGTTTGGT TACCGTTACT ATCACTGGAC TCCAAATGAA GGTTTCTCTT TG - #AATGGTGA   

     1200                                                                      

     - ACGTATTAAA TTCCATGGAG TATCCTTGCA CCACGACCAT GGGGCGCTTG GA - #GCAGAAGA   

     1260                                                                      

     - AAACTATAAA GCAGAATATC GCCGTCTCAA ACAAATGAAG GAGATGGGAG TT - #AACTCCAT   

     1320                                                                      

     - CCGTACAACC CACAACCCTG CTAGTGAGCA AACCTTGCAA ATCGCAGCAG AA - #CTAGGTTT   

     1380                                                                      

     - ACTCGTTCAG GAAGAGGCCT TTGATACGTG GTATGGTGGC AAGAAACCTT AT - #GACTATGG   

     1440                                                                      

     - ACGTTTCTTT GAAAAAGATG CCACTCACCC AGAAGCTCGA AAAGGTGAAA AA - #TGGTCTGA   

     1500                                                                      

     - TTTTGACCTA CGTACCATGG TCGAAAGAGG CAAAAACAAC CCTGCTATCT TC - #ATGTGGTC   

     1560                                                                      

     - AATTGGTAAT GAAATAGGTG AAGCTAATGG TGATGCCCAC TCTTTAGCAA CT - #GTTAAACG   

     1620                                                                      

     - TTTGGTTAAG GTTATCAAGG ATGTTGATAA GACTCGCTAT GTTACCATGG GA - #GCAGATAA   

     1680                                                                      

     - ATTCCGTTTC GGTAATGGTA GCGGAGGGCA TGAGAAAATT GCTGATGAAC TC - #GATGCTGT   

     1740                                                                      

     - TGGATTTAAC TATTCTGAAG ATAATTACAA AGCCCTTAGA GCTAAGCATC CA - #AAATGGTT   

     1800                                                                      

     - GATTTATGGA TCAGAAACAT CTTCAGCTAC CCGTACACGT GGAAGTTACT AT - #CGCCCTGA   

     1860                                                                      

     - ACGTGAATTG AAACATAGCA ATGGACCTGA GCGTAATTAT GAACAGTCAG AT - #TATGGAAA   

     1920                                                                      

     - TGATCGTGTG GGTTGGGGGA AAACAGCAAC CGCTTCATGG ACTTTTGACC GT - #GACAACGC   

     1980                                                                      

     - TGGCTATGCT GGACAGTTTA TCTGGACAGG TACGGACTAT ATTGGTGAAC CT - #ACACCATG   

     2040                                                                      

     - GCACAACCAA AATCAAACTC CTGTTAAGAG CTCTTACTTT GGTATCGTAG AT - #ACAGCCGG   

     2100                                                                      

     #      2134        TTCT ATCTCTACCA AAGC                                   

     - (2) INFORMATION FOR SEQ ID NO:82:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 711 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:82:                            

     - Ser Val Val Tyr Ala Asp Glu Thr Leu Ile Th - #r His Thr Ala Glu Lys     

     #                15                                                       

     - Pro Lys Glu Glu Lys Met Ile Val Glu Glu Ly - #s Ala Asp Lys Ala Leu     

     #            30                                                           

     - Glu Thr Lys Asn Ile Val Glu Arg Thr Glu Gl - #n Ser Glu Pro Ser Ser     

     #        45                                                               

     - Thr Glu Ala Ile Ala Ser Glu Xaa Lys Glu As - #p Glu Ala Val Thr Pro     

     #    60                                                                   

     - Lys Glu Glu Lys Val Ser Ala Lys Pro Glu Gl - #u Lys Ala Pro Arg Ile     

     #80                                                                       

     - Glu Ser Gln Ala Ser Asn Gln Glu Lys Pro Le - #u Lys Glu Asp Ala Lys     

     #                95                                                       

     - Ala Val Thr Asn Glu Glu Val Asn Gln Met Il - #e Glu Asp Arg Lys Val     

     #           110                                                           

     - Asp Phe Asn Gln Asn Trp Tyr Phe Lys Leu As - #n Ala Asn Ser Lys Glu     

     #       125                                                               

     - Ala Ile Lys Pro Asp Ala Asp Val Ser Thr Tr - #p Lys Lys Leu Asp Leu     

     #   140                                                                   

     - Pro Tyr Asp Trp Ser Ile Phe Asn Asp Phe As - #p His Glu Ser Pro Ala     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Gln Asn Glu Gly Gly Gln Leu Asn Gly Gly Gl - #u Ala Trp Tyr Arg Lys     

     #               175                                                       

     - Thr Phe Lys Leu Asp Glu Lys Asp Leu Lys Ly - #s Asn Val Arg Leu Thr     

     #           190                                                           

     - Phe Asp Gly Val Tyr Met Asp Ser Gln Val Ty - #r Val Asn Gly Gln Leu     

     #       205                                                               

     - Val Gly His Tyr Pro Asn Gly Tyr Asn Gln Ph - #e Ser Tyr Asp Ile Thr     

     #   220                                                                   

     - Lys Tyr Leu Gln Lys Asp Gly Arg Glu Asn Va - #l Ile Ala Val His Ala     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Val Asn Lys Gln Pro Ser Ser Arg Trp Tyr Se - #r Gly Ser Gly Ile Tyr     

     #               255                                                       

     - Arg Asp Val Thr Leu Gln Val Thr Asp Lys Va - #l His Val Glu Lys Asn     

     #           270                                                           

     - Gly Thr Thr Ile Leu Thr Pro Lys Leu Glu Gl - #u Gln Gln His Gly Lys     

     #       285                                                               

     - Val Glu Thr His Val Thr Ser Lys Ile Val As - #n Thr Asp Asp Lys Asp     

     #   300                                                                   

     - His Glu Leu Val Ala Glu Tyr Gln Ile Val Gl - #u Arg Gly Gly His Ala     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Val Thr Gly Leu Val Arg Thr Ala Ser Arg Th - #r Leu Lys Ala His Glu     

     #               335                                                       

     - Ser Thr Ser Leu Asp Ala Ile Leu Glu Val Gl - #u Arg Pro Lys Leu Trp     

     #           350                                                           

     - Thr Val Leu Asn Asp Lys Pro Ala Leu Tyr Gl - #u Leu Ile Thr Arg Val     

     #       365                                                               

     - Tyr Arg Asp Gly Gln Leu Val Asp Ala Lys Ly - #s Asp Leu Phe Gly Tyr     

     #   380                                                                   

     - Arg Tyr Tyr His Trp Thr Pro Asn Glu Gly Ph - #e Ser Leu Asn Gly Glu     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Arg Ile Lys Phe His Gly Val Ser Leu His Hi - #s Asp His Gly Ala Leu     

     #               415                                                       

     - Gly Ala Glu Glu Asn Tyr Lys Ala Glu Tyr Ar - #g Arg Leu Lys Gln Met     

     #           430                                                           

     - Lys Glu Met Gly Val Asn Ser Ile Arg Thr Th - #r His Asn Pro Ala Ser     

     #       445                                                               

     - Glu Gln Thr Leu Gln Ile Ala Ala Glu Leu Gl - #y Leu Leu Val Gln Glu     

     #   460                                                                   

     - Glu Ala Phe Asp Thr Trp Tyr Gly Gly Lys Ly - #s Pro Tyr Asp Tyr Gly     

     465                 4 - #70                 4 - #75                 4 -   

     #80                                                                       

     - Arg Phe Phe Glu Lys Asp Ala Thr His Pro Gl - #u Ala Arg Lys Gly Glu     

     #               495                                                       

     - Lys Trp Ser Asp Phe Asp Leu Arg Thr Met Va - #l Glu Arg Gly Lys Asn     

     #           510                                                           

     - Asn Pro Ala Ile Phe Met Trp Ser Ile Gly As - #n Glu Ile Gly Glu Ala     

     #       525                                                               

     - Asn Gly Asp Ala His Ser Leu Ala Thr Val Ly - #s Arg Leu Val Lys Val     

     #   540                                                                   

     - Ile Lys Asp Val Asp Lys Thr Arg Tyr Val Th - #r Met Gly Ala Asp Lys     

     545                 5 - #50                 5 - #55                 5 -   

     #60                                                                       

     - Phe Arg Phe Gly Asn Gly Ser Gly Gly His Gl - #u Lys Ile Ala Asp Glu     

     #               575                                                       

     - Leu Asp Ala Val Gly Phe Asn Tyr Ser Glu As - #p Asn Tyr Lys Ala Leu     

     #           590                                                           

     - Arg Ala Lys His Pro Lys Trp Leu Ile Tyr Gl - #y Ser Glu Thr Ser Ser     

     #       605                                                               

     - Ala Thr Arg Thr Arg Gly Ser Tyr Tyr Arg Pr - #o Glu Arg Glu Leu Lys     

     #   620                                                                   

     - His Ser Asn Gly Pro Glu Arg Asn Tyr Glu Gl - #n Ser Asp Tyr Gly Asn     

     625                 6 - #30                 6 - #35                 6 -   

     #40                                                                       

     - Asp Arg Val Gly Trp Gly Lys Thr Ala Thr Al - #a Ser Trp Thr Phe Asp     

     #               655                                                       

     - Arg Asp Asn Ala Gly Tyr Ala Gly Gln Phe Il - #e Trp Thr Gly Thr Asp     

     #           670                                                           

     - Tyr Ile Gly Glu Pro Thr Pro Trp His Asn Gl - #n Asn Gln Thr Pro Val     

     #       685                                                               

     - Lys Ser Ser Tyr Phe Gly Ile Val Asp Thr Al - #a Gly Ile Pro Lys His     

     #   700                                                                   

     - Asp Phe Tyr Leu Tyr Gln Ser                                             

     705                 7 - #10                                               

     - (2) INFORMATION FOR SEQ ID NO: 83:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 2167 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #83:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TTACTTTGGT ATCGTAGATA CAGCCGGCAT TCCAAAACAT GACTTCTATC TC - #TACCAAAG   

       60                                                                      

     - CCAATGGGTT TCTGTTAAGA AGAAACCGAT GGTACACCTT CTTCCTCACT GG - #AACTGGGA   

      120                                                                      

     - AAACAAAGAA TTAGCATCCA AAGTAGCTGA CTCAGAAGGT AAGATTCCAG TT - #CGTGCTTA   

      180                                                                      

     - TTCGAATGCT TCTAGTGTAG AATTGTTCTT GAATGGAAAA TCTCTTGGTC TT - #AAGACTTT   

      240                                                                      

     - CAATAAAAAA CAAACCAGCG ATGGGCGGAC TTACCAAGAA GGTGCAAATG CT - #AATGAACT   

      300                                                                      

     - TTATCTTGAA TGGAAAGTTG CCTATCAACC AGGTACCTTG GAAGCAATTG CT - #CGTGATGA   

      360                                                                      

     - ATCTGGCAAG GAAATTGCTC GAGATAAGAT TACGACTGCT GGTAAGCCAG CG - #GCAGTTCG   

      420                                                                      

     - TCTTATTAAG GAAGACCATG CGATTGCAGC AGATGGAAAA GACTTGACTT AC - #ATCTACTA   

      480                                                                      

     - TGAAATTGTT GACAGCCAGG GGAATGTGGT TCCAACTGCT AATAATCTGG TT - #CGCTTCCA   

      540                                                                      

     - ATTGCATGGC CAAGGTCAAC TGGTCGGTGT AGATAACGGA GAACAAGCCA GC - #CGTGAACG   

      600                                                                      

     - CTATAAGGCG CAAGCAGATG GTTCTTGGAT TCGTAAAGCA TTTAATGGTA AA - #GGTGTTGC   

      660                                                                      

     - CATTGTCAAA TCAACTGAAC AAGCAGGGAA ATTCACCCTG ACTGCCCACT CT - #GATCTCTT   

      720                                                                      

     - GAAATCGAAC CAAGTCACTG TCTTTACTGG TAAGAAAGAA GGACAAGAGA AG - #ACTGTTTT   

      780                                                                      

     - GGGGACAGAA GTGCCAAAAG TACAGACCAT TATTGGAGAG GCACCTGAAA TG - #CCTACCAC   

      840                                                                      

     - TGTTCCGTTT GTATACAGTG ATGGTAGCCG TGCAGAACGT CCTGTAACCT GG - #TCTTCAGT   

      900                                                                      

     - AGATGTGAGC AAGCCTGGTA TTGTAACGGT GAAAGGTATG GCTGACGGAC GA - #GAAGTAGA   

      960                                                                      

     - AGCTCGTGTA GAAGTGATTG CTCTTAAATC AGAGCTACCA GTTGTGAAAC GT - #ATTGCTCC   

     1020                                                                      

     - AAATACTGAC TTGAATTCTG TAGACAAATC TGTTTCCTAT GTTTTGATTG AT - #GGAAGTGT   

     1080                                                                      

     - TGAAGAGTAT GAAGTGGACA AGTGGGAGAT TGCCGAAGAA GATAAAGCTA AG - #TTAGCAAT   

     1140                                                                      

     - TCCAGGTTCT CGTATTCAAG CGACCGGTTA TTTAGAAGGT CAACCAATTC AT - #GCAACCCT   

     1200                                                                      

     - TGTGGTAGAA GAAGGCAATC CTGCGGCACC TGCAGTACCA ACTGTAACGG TT - #GGTGGTGA   

     1260                                                                      

     - GGCAGTAACA GGTCTTACTA GTCAAAAACC AATGCAATAC CGCACTCTTG CT - #TATGGAGC   

     1320                                                                      

     - TAAGTTGCCA GAAGTCACAG CAAGTGCTAA AAATGCAGCT GTTACAGTTC TT - #CAAGCAAG   

     1380                                                                      

     - CGCAGCAAAC GGCATGCGTG CGAGCATCTT TATTCAGCCT AAAGATGGTG GC - #CCTCTTCA   

     1440                                                                      

     - AACCTATGCA ATTCAATTCC TTGAAGAAGC GCCAAAAATT GCTCACTTGA GC - #TTGCAAGT   

     1500                                                                      

     - GGAAAAAGCT GACAGTCTCA AAGAAGACCA AACTGTCAAA TTGTCGGTTC GA - #GCTCACTA   

     1560                                                                      

     - TCAAGATGGA ACGCAAGCTG TATTACCAGC TGATAAAGTA ACCTTCTCTA CA - #AGTGGTGA   

     1620                                                                      

     - AGGGGAAGTC GCAATTCGTA AAGGAATGCT TGAGTTGCAT AAGCCAGGAG CA - #GTCACTCT   

     1680                                                                      

     - GAACGCTGAA TATGAGGGAG CTAAAGACCA AGTTGAACTC ACTATCCAAG CC - #AATACTGA   

     1740                                                                      

     - GAAGAAGATT GCGCAATCCA TCCGTCCTGT AAATGTAGTG ACAGATTTGC AT - #CAGGAACC   

     1800                                                                      

     - AAGTCTTCCA GCAACAGTAA CAGTTGAGTA TGACAAAGGT TTCCCTAAAA CT - #CATAAAGT   

     1860                                                                      

     - CACTTGGCAA GCTATTCCGA AAGAAAAACT AGACTCCTAT CAAACATTTG AA - #GTACTAGG   

     1920                                                                      

     - TAAAGTTGAA GGAATTGACC TTGAAGCGCG TGCAAAAGTC TCTGTAGAAG GT - #ATCGTTTC   

     1980                                                                      

     - AGTTGAAGAA GTCAGTGTGA CAACTCCAAT CGCAGAAGCA CCACAATTAC CA - #GAAAGTGT   

     2040                                                                      

     - TCGGACATAT GATTCAAATG GTCACGTTTC ATCAGCTAAG GTTGCATGGG AT - #GCGATTCG   

     2100                                                                      

     - TCCAGAGCAA TACGCTAAGG AAGGTGTCTT TACAGTTAAT GGTCGCTTAG AA - #GGTACGCA   

     2160                                                                      

     #        2167                                                             

     - (2) INFORMATION FOR SEQ ID NO:84:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 722 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:84:                            

     - Tyr Phe Gly Ile Val Asp Thr Ala Gly Ile Pr - #o Lys His Asp Phe Tyr     

     #                15                                                       

     - Leu Tyr Gln Ser Gln Trp Val Ser Val Lys Ly - #s Lys Pro Met Val His     

     #            30                                                           

     - Leu Leu Pro His Trp Asn Trp Glu Asn Lys Gl - #u Leu Ala Ser Lys Val     

     #        45                                                               

     - Ala Asp Ser Glu Gly Lys Ile Pro Val Arg Al - #a Tyr Ser Asn Ala Ser     

     #    60                                                                   

     - Ser Val Glu Leu Phe Leu Asn Gly Lys Ser Le - #u Gly Leu Lys Thr Phe     

     #80                                                                       

     - Asn Lys Lys Gln Thr Ser Asp Gly Arg Thr Ty - #r Gln Glu Gly Ala Asn     

     #                95                                                       

     - Ala Asn Glu Leu Tyr Leu Glu Trp Lys Val Al - #a Tyr Gln Pro Gly Thr     

     #           110                                                           

     - Leu Glu Ala Ile Ala Arg Asp Glu Ser Gly Ly - #s Glu Ile Ala Arg Asp     

     #       125                                                               

     - Lys Ile Thr Thr Ala Gly Lys Pro Ala Ala Va - #l Arg Leu Ile Lys Glu     

     #   140                                                                   

     - Asp His Ala Ile Ala Ala Asp Gly Lys Asp Le - #u Thr Tyr Ile Tyr Tyr     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Glu Ile Val Asp Ser Gln Gly Asn Val Val Pr - #o Thr Ala Asn Asn Leu     

     #               175                                                       

     - Val Arg Phe Gln Leu His Gly Gln Gly Gln Le - #u Val Gly Val Asp Asn     

     #           190                                                           

     - Gly Glu Gln Ala Ser Arg Glu Arg Tyr Lys Al - #a Gln Ala Asp Gly Ser     

     #       205                                                               

     - Trp Ile Arg Lys Ala Phe Asn Gly Lys Gly Va - #l Ala Ile Val Lys Ser     

     #   220                                                                   

     - Thr Glu Gln Ala Gly Lys Phe Thr Leu Thr Al - #a His Ser Asp Leu Leu     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Lys Ser Asn Gln Val Thr Val Phe Thr Gly Ly - #s Lys Glu Gly Gln Glu     

     #               255                                                       

     - Lys Thr Val Leu Gly Thr Glu Val Pro Lys Va - #l Gln Thr Ile Ile Gly     

     #           270                                                           

     - Glu Ala Pro Glu Met Pro Thr Thr Val Pro Ph - #e Val Tyr Ser Asp Gly     

     #       285                                                               

     - Ser Arg Ala Glu Arg Pro Val Thr Trp Ser Se - #r Val Asp Val Ser Lys     

     #   300                                                                   

     - Pro Gly Ile Val Thr Val Lys Gly Met Ala As - #p Gly Arg Glu Val Glu     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Ala Arg Val Glu Val Ile Ala Leu Lys Ser Gl - #u Leu Pro Val Val Lys     

     #               335                                                       

     - Arg Ile Ala Pro Asn Thr Asp Leu Asn Ser Va - #l Asp Lys Ser Val Ser     

     #           350                                                           

     - Tyr Val Leu Ile Asp Gly Ser Val Glu Glu Ty - #r Glu Val Asp Lys Trp     

     #       365                                                               

     - Glu Ile Ala Glu Glu Asp Lys Ala Lys Leu Al - #a Ile Pro Gly Ser Arg     

     #   380                                                                   

     - Ile Gln Ala Thr Gly Tyr Leu Glu Gly Gln Pr - #o Ile His Ala Thr Leu     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Val Val Glu Glu Gly Asn Pro Ala Ala Pro Al - #a Val Pro Thr Val Thr     

     #               415                                                       

     - Val Gly Gly Glu Ala Val Thr Gly Leu Thr Se - #r Gln Lys Pro Met Gln     

     #           430                                                           

     - Tyr Arg Thr Leu Ala Tyr Gly Ala Lys Leu Pr - #o Glu Val Thr Ala Ser     

     #       445                                                               

     - Ala Lys Asn Ala Ala Val Thr Val Leu Gln Al - #a Ser Ala Ala Asn Gly     

     #   460                                                                   

     - Met Arg Ala Ser Ile Phe Ile Gln Pro Lys As - #p Gly Gly Pro Leu Gln     

     465                 4 - #70                 4 - #75                 4 -   

     #80                                                                       

     - Thr Tyr Ala Ile Gln Phe Leu Glu Glu Ala Pr - #o Lys Ile Ala His Leu     

     #               495                                                       

     - Ser Leu Gln Val Glu Lys Ala Asp Ser Leu Ly - #s Glu Asp Gln Thr Val     

     #           510                                                           

     - Lys Leu Ser Val Arg Ala His Tyr Gln Asp Gl - #y Thr Gln Ala Val Leu     

     #       525                                                               

     - Pro Ala Asp Lys Val Thr Phe Ser Thr Ser Gl - #y Glu Gly Glu Val Ala     

     #   540                                                                   

     - Ile Arg Lys Gly Met Leu Glu Leu His Lys Pr - #o Gly Ala Val Thr Leu     

     545                 5 - #50                 5 - #55                 5 -   

     #60                                                                       

     - Asn Ala Glu Tyr Glu Gly Ala Lys Asp Gln Va - #l Glu Leu Thr Ile Gln     

     #               575                                                       

     - Ala Asn Thr Glu Lys Lys Ile Ala Gln Ser Il - #e Arg Pro Val Asn Val     

     #           590                                                           

     - Val Thr Asp Leu His Gln Glu Pro Ser Leu Pr - #o Ala Thr Val Thr Val     

     #       605                                                               

     - Glu Tyr Asp Lys Gly Phe Pro Lys Thr His Ly - #s Val Thr Trp Gln Ala     

     #   620                                                                   

     - Ile Pro Lys Glu Lys Leu Asp Ser Tyr Gln Th - #r Phe Glu Val Leu Gly     

     625                 6 - #30                 6 - #35                 6 -   

     #40                                                                       

     - Lys Val Glu Gly Ile Asp Leu Glu Ala Arg Al - #a Lys Val Ser Val Glu     

     #               655                                                       

     - Gly Ile Val Ser Val Glu Glu Val Ser Val Th - #r Thr Pro Ile Ala Glu     

     #           670                                                           

     - Ala Pro Gln Leu Pro Glu Ser Val Arg Thr Ty - #r Asp Ser Asn Gly His     

     #       685                                                               

     - Val Ser Ser Ala Lys Val Ala Trp Asp Ala Il - #e Arg Pro Glu Gln Tyr     

     #   700                                                                   

     - Ala Lys Glu Gly Val Phe Thr Val Asn Gly Ar - #g Leu Glu Gly Thr Gln     

     705                 7 - #10                 7 - #15                 7 -   

     #20                                                                       

     - Leu Thr                                                                 

     - (2) INFORMATION FOR SEQ ID NO: 85:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 2329 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #85:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - AGCTAAGGTT GCATGGGATG CGATTCGTCC AGAGCAATAC GCTAAGGAAG GT - #GTCTTTAC   

       60                                                                      

     - AGTTAATGGT CGCTTAGAAG GTACGCAATT AACAACTAAA CTTCATGTTC GC - #GTATCTGC   

      120                                                                      

     - TCAAACTGAG CAAGGTGCAA ACATTTCTGA CCAATGGACC GGTTCAGAAT TG - #CCACTTGC   

      180                                                                      

     - CTTTGCTTCA GACTCAAATC CAAGCGACCC AGTTTCAAAT GTTAATGACA AG - #CTCATTTC   

      240                                                                      

     - CTACAATAAC CAACCAGCCA ATCGTTGGAC AAACTGGAAT CGTACTAATC CA - #GAAGCTTC   

      300                                                                      

     - AGTCGGTGTT CTGTTTGGAG ATTCAGGTAT CTTGAGCAAA CGCTCCGTTG AT - #AATCTAAG   

      360                                                                      

     - TGTCGGATTC CATGAAGACC ATGGAGTTGG TGTACCGAAG TCTTATGTGA TT - #GAGTATTA   

      420                                                                      

     - TGTTGGTAAG ACTGTCCCAA CAGCTCCTAA AAACCCTAGT TTTGTTGGTA AT - #GAGGACCA   

      480                                                                      

     - TGTCTTTAAT GATTCTGCCA ACTGGAAACC AGTTACTAAT CTAAAAGCCC CT - #GCTCAACT   

      540                                                                      

     - CAAGGCTGGA GAAATGAACC ACTTTAGCTT TGATAAAGTT GAAACCTATG CT - #GTTCGTAT   

      600                                                                      

     - TCGCATGGTT AAAGCAGATA ACAAGCGTGG AACGTCTATC ACAGAGGTAC AA - #ATCTTTGC   

      660                                                                      

     - GAAACAAGTT GCGGCAGCCA AGCAAGGACA AACAAGAATC CAAGTTGACG GC - #AAAGACTT   

      720                                                                      

     - AGCAAACTTC AACCCTGATT TGACAGACTA CTACCTTGAG TCTGTAGATG GA - #AAAGTTCC   

      780                                                                      

     - GGCAGTCACA GCAAGTGTTA GCAACAATGG TCTCGCTACC GTCGTTCCAA GC - #GTTCGTGA   

      840                                                                      

     - AGGTGAGCCA GTTCGTGTCA TCGCGAAAGC TGAAAATGGC GACATCTTAG GA - #GAATACCG   

      900                                                                      

     - TCTGCACTTC ACTAAGGATA AGAGCTTACT TTCTCATAAA CCAGTTGCTG CG - #GTTAAACA   

      960                                                                      

     - AGCTCGCTTG CTACAAGTAG GTCAAGCACT TGAATTGCCG ACTAAGGTTC CA - #GTTTACTT   

     1020                                                                      

     - CACAGGTAAA GACGGCTACG AAACAAAAGA CCTGACAGTT GAATGGGAAG AA - #GTTCCAGC   

     1080                                                                      

     - GGAAAATCTG ACAAAAGCAG GTCAATTTAC TGTTCGAGGC CGTGTCCTTG GT - #AGTAACCT   

     1140                                                                      

     - TGTTGCTGAG ATCACTGTAC GAGTGACAGA CAAACTTGGT GAGACTCTTT CA - #GATAACCC   

     1200                                                                      

     - TAACTATGAT GAAAACAGTA ACCAGGCCTT TGCTTCAGCA ACCAATGATA TT - #GACAAAAA   

     1260                                                                      

     - CTCTCATGAC CGCGTTGACT ATCTCAATGA CGGAGATCAT TCAGAAAATC GT - #CGTTGGAC   

     1320                                                                      

     - AAACTGGTCA CCAACACCAT CTTCTAATCC AGAAGTATCA GCGGGTGTGA TT - #TTCCGTGA   

     1380                                                                      

     - AAATGGTAAG ATTGTAGAAC GGACTGTTAC ACAAGGAAAA GTTCAGTTCT TT - #GCAGATAG   

     1440                                                                      

     - TGGTACGGAT GCACCATCTA AACTCGTTTT AGAACGCTAT GTCGGTCCAG AG - #TTTGAAGT   

     1500                                                                      

     - GCCAACCTAC TATTCAAACT ACCAAGCCTA CGACGCAGAC CATCCATTCA AC - #AATCCAGA   

     1560                                                                      

     - AAATTGGGAA GCTGTTCCTT ATCGTGCGGA TAAAGACATT GCAGCTGGTG AT - #GAAATCAA   

     1620                                                                      

     - CGTAACATTT AAAGCTATCA AAGCCAAAGC TATGAGATGG CGTATGGAGC GT - #AAAGCAGA   

     1680                                                                      

     - TAAGAGCGGT GTTGCGATGA TTGAGATGAC CTTCCTTGCA CCAAGTGAAT TG - #CCTCAAGA   

     1740                                                                      

     - AAGCACTCAA TCAAAGATTC TTGTAGATGG AAAAGAACTT GCTGATTTCG CT - #GAAAATCG   

     1800                                                                      

     - TCAAGACTAT CAAATTACCT ATAAAGGTCA ACGGCCAAAA GTCTCAGTTG AA - #GAAAACAA   

     1860                                                                      

     - TCAAGTAGCT TCAACTGTGG TAGATAGTGG AGAAGATAGC TTTCCAGTAC TT - #GTTCGCCT   

     1920                                                                      

     - CGTTTCAGAA AGTGGAAAAC AAGTCAAGGA ATACCGTATC CACTTGACTA AG - #GAAAAACC   

     1980                                                                      

     - AGTTTCTGAG AAGACAGTTG CTGCTGTACA AGAAGATCTT CCAAAAATCG AA - #TTTGTTGA   

     2040                                                                      

     - AAAAGATTTG GCATACAAGA CAGTTGAGAA AAAAGATTCA ACACTGTATC TA - #GGTGAAAC   

     2100                                                                      

     - TCGTGTAGAA CAAGAAGGAA AAGTTGGAAA AGAACGTATC TTTACAGCGA TT - #AATCCTGA   

     2160                                                                      

     - TGGAAGTAAG GAAGAAAAAC TCCGTGAAGT GGTAGAAGTT CCGACAGACC GC - #ATCGTCTT   

     2220                                                                      

     - GGTTGGAACC AAACCAGTAG CTCAAGAAGC TAAAAAACCA CAAGTGTCAG AA - #AAAGCAGA   

     2280                                                                      

     #             2329TTCAA GTGAAGCTAG TCAAACTAAT AAAGCCCAG                   

     - (2) INFORMATION FOR SEQ ID NO:86:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 776 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:86:                            

     - Ala Lys Val Ala Trp Asp Ala Ile Arg Pro Gl - #u Gln Tyr Ala Lys Glu     

     #                15                                                       

     - Gly Val Phe Thr Val Asn Gly Arg Leu Glu Gl - #y Thr Gln Leu Thr Thr     

     #            30                                                           

     - Lys Leu His Val Arg Val Ser Ala Gln Thr Gl - #u Gln Gly Ala Asn Ile     

     #        45                                                               

     - Ser Asp Gln Trp Thr Gly Ser Glu Leu Pro Le - #u Ala Phe Ala Ser Asp     

     #    60                                                                   

     - Ser Asn Pro Ser Asp Pro Val Ser Asn Val As - #n Asp Lys Leu Ile Ser     

     #80                                                                       

     - Tyr Asn Asn Gln Pro Ala Asn Arg Trp Thr As - #n Trp Asn Arg Thr Asn     

     #                95                                                       

     - Pro Glu Ala Ser Val Gly Val Leu Phe Gly As - #p Ser Gly Ile Leu Ser     

     #           110                                                           

     - Lys Arg Ser Val Asp Asn Leu Ser Val Gly Ph - #e His Glu Asp His Gly     

     #       125                                                               

     - Val Gly Val Pro Lys Ser Tyr Val Ile Glu Ty - #r Tyr Val Gly Lys Thr     

     #   140                                                                   

     - Val Pro Thr Ala Pro Lys Asn Pro Ser Phe Va - #l Gly Asn Glu Asp His     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Val Phe Asn Asp Ser Ala Asn Trp Lys Pro Va - #l Thr Asn Leu Lys Ala     

     #               175                                                       

     - Pro Ala Gln Leu Lys Ala Gly Glu Met Asn Hi - #s Phe Ser Phe Asp Lys     

     #           190                                                           

     - Val Glu Thr Tyr Ala Val Arg Ile Arg Met Va - #l Lys Ala Asp Asn Lys     

     #       205                                                               

     - Arg Gly Thr Ser Ile Thr Glu Val Gln Ile Ph - #e Ala Lys Gln Val Ala     

     #   220                                                                   

     - Ala Ala Lys Gln Gly Gln Thr Arg Ile Gln Va - #l Asp Gly Lys Asp Leu     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Ala Asn Phe Asn Pro Asp Leu Thr Asp Tyr Ty - #r Leu Glu Ser Val Asp     

     #               255                                                       

     - Gly Lys Val Pro Ala Val Thr Ala Ser Val Se - #r Asn Asn Gly Leu Ala     

     #           270                                                           

     - Thr Val Val Pro Ser Val Arg Glu Gly Glu Pr - #o Val Arg Val Ile Ala     

     #       285                                                               

     - Lys Ala Glu Asn Gly Asp Ile Leu Gly Glu Ty - #r Arg Leu His Phe Thr     

     #   300                                                                   

     - Lys Asp Lys Ser Leu Leu Ser His Lys Pro Va - #l Ala Ala Val Lys Gln     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Ala Arg Leu Leu Gln Val Gly Gln Ala Leu Gl - #u Leu Pro Thr Lys Val     

     #               335                                                       

     - Pro Val Tyr Phe Thr Gly Lys Asp Gly Tyr Gl - #u Thr Lys Asp Leu Thr     

     #           350                                                           

     - Val Glu Trp Glu Glu Val Pro Ala Glu Asn Le - #u Thr Lys Ala Gly Gln     

     #       365                                                               

     - Phe Thr Val Arg Gly Arg Val Leu Gly Ser As - #n Leu Val Ala Glu Ile     

     #   380                                                                   

     - Thr Val Arg Val Thr Asp Lys Leu Gly Glu Th - #r Leu Ser Asp Asn Pro     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Asn Tyr Asp Glu Asn Ser Asn Gln Ala Phe Al - #a Ser Ala Thr Asn Asp     

     #               415                                                       

     - Ile Asp Lys Asn Ser His Asp Arg Val Asp Ty - #r Leu Asn Asp Gly Asp     

     #           430                                                           

     - His Ser Glu Asn Arg Arg Trp Thr Asn Trp Se - #r Pro Thr Pro Ser Ser     

     #       445                                                               

     - Asn Pro Glu Val Ser Ala Gly Val Ile Phe Ar - #g Glu Asn Gly Lys Ile     

     #   460                                                                   

     - Val Glu Arg Thr Val Thr Gln Gly Lys Val Gl - #n Phe Phe Ala Asp Ser     

     465                 4 - #70                 4 - #75                 4 -   

     #80                                                                       

     - Gly Thr Asp Ala Pro Ser Lys Leu Val Leu Gl - #u Arg Tyr Val Gly Pro     

     #               495                                                       

     - Glu Phe Glu Val Pro Thr Tyr Tyr Ser Asn Ty - #r Gln Ala Tyr Asp Ala     

     #           510                                                           

     - Asp His Pro Phe Asn Asn Pro Glu Asn Trp Gl - #u Ala Val Pro Tyr Arg     

     #       525                                                               

     - Ala Asp Lys Asp Ile Ala Ala Gly Asp Glu Il - #e Asn Val Thr Phe Lys     

     #   540                                                                   

     - Ala Ile Lys Ala Lys Ala Met Arg Trp Arg Me - #t Glu Arg Lys Ala Asp     

     545                 5 - #50                 5 - #55                 5 -   

     #60                                                                       

     - Lys Ser Gly Val Ala Met Ile Glu Met Thr Ph - #e Leu Ala Pro Ser Glu     

     #               575                                                       

     - Leu Pro Gln Glu Ser Thr Gln Ser Lys Ile Le - #u Val Asp Gly Lys Glu     

     #           590                                                           

     - Leu Ala Asp Phe Ala Glu Asn Arg Gln Asp Ty - #r Gln Ile Thr Tyr Lys     

     #       605                                                               

     - Gly Gln Arg Pro Lys Val Ser Val Glu Glu As - #n Asn Gln Val Ala Ser     

     #   620                                                                   

     - Thr Val Val Asp Ser Gly Glu Asp Ser Phe Pr - #o Val Leu Val Arg Leu     

     625                 6 - #30                 6 - #35                 6 -   

     #40                                                                       

     - Val Ser Glu Ser Gly Lys Gln Val Lys Glu Ty - #r Arg Ile His Leu Thr     

     #               655                                                       

     - Lys Glu Lys Pro Val Ser Glu Lys Thr Val Al - #a Ala Val Gln Glu Asp     

     #           670                                                           

     - Leu Pro Lys Ile Glu Phe Val Glu Lys Asp Le - #u Ala Tyr Lys Thr Val     

     #       685                                                               

     - Glu Lys Lys Asp Ser Thr Leu Tyr Leu Gly Gl - #u Thr Arg Val Glu Gln     

     #   700                                                                   

     - Glu Gly Lys Val Gly Lys Glu Arg Ile Phe Th - #r Ala Ile Asn Pro Asp     

     705                 7 - #10                 7 - #15                 7 -   

     #20                                                                       

     - Gly Ser Lys Glu Glu Lys Leu Arg Glu Val Va - #l Glu Val Pro Thr Asp     

     #               735                                                       

     - Arg Ile Val Leu Val Gly Thr Lys Pro Val Al - #a Gln Glu Ala Lys Lys     

     #           750                                                           

     - Pro Gln Val Ser Glu Lys Ala Asp Thr Lys Pr - #o Ile Asp Ser Ser Glu     

     #       765                                                               

     - Ala Ser Gln Thr Asn Lys Ala Gln                                         

     #   775                                                                   

     - (2) INFORMATION FOR SEQ ID NO: 87:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 133 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #87:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - CTATCACTAT GTAAATAAAG AGATTATTTC ACAAGAAGCT AAAGATTTAA TT - #CAGACAGG   

       60                                                                      

     - AAAGCCTGAC AGGAATGAAG TTGTATATGG TTTGGTGTAT CAAAAAGATC AG - #TTGCCTCA   

      120                                                                      

     #     133                                                                 

     - (2) INFORMATION FOR SEQ ID NO:88:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 44 amino                                            

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:88:                            

     - Tyr His Tyr Val Asn Lys Glu Ile Ile Ser Gl - #n Glu Ala Lys Asp Leu     

     #                15                                                       

     - Ile Gln Thr Gly Lys Pro Asp Arg Asn Glu Va - #l Val Tyr Gly Leu Val     

     #            30                                                           

     - Tyr Gln Lys Asp Gln Leu Pro Gln Thr Gly Th - #r Glu                     

     #        40                                                               

     - (2) INFORMATION FOR SEQ ID NO: 89:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 775 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #89:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TGAGACTCCT CAATCAATAA CAAATCAGGA GCAAGCTAGG ACAGAAAACC AA - #GTAGTAGA   

       60                                                                      

     - GACAGAGGAA GCTCCAAAAG AAGAAGCACC TAAAACAGAA GAAAGTCCAA AG - #GAAGAACC   

      120                                                                      

     - AAAATCGGAG GTAAAACCTA CTGACGACAC CCTTCCTAAA GTAGAAGAGG GG - #AAAGAAGA   

      180                                                                      

     - TTCAGCAGAA CCAGCTCCAG TTGAAGAAGT AGGTGGAGAA GTTGAGTCAA AA - #CCAGAGGA   

      240                                                                      

     - AAAAGTAGCA GTTAAGCCAG AAAGTCAACC ATCAGACAAA CCAGCTGAGG AA - #TCAAAAGT   

      300                                                                      

     - TGAACAAGCA GGTGAACCAG TCGCGCCAAG AGAAGACGAA AAGGCACCAG TC - #GAGCCAGA   

      360                                                                      

     - AAAGCAACCA GAAGCTCCTG AAGAAGAGAA GGCTGTAGAG GAAACACCGA AA - #CAAGAAGA   

      420                                                                      

     - GTCAACTCCA GATACCAAGG CTGAAGAAAC TGTAGAACCA AAAGAGGAGA CT - #GTTAATCA   

      480                                                                      

     - ATCTATTGAA CAACCAAAAG TTGAAACGCC TGCTGTAGAA AAACAAACAG AA - #CCAACAGA   

      540                                                                      

     - GGAACCAAAA GTTGAACAAG CAGGTGAACC AGTCGCGCCA AGAGAAGACG AA - #CAGGCACC   

      600                                                                      

     - AACGGCACCA GTTGAGCCAG AAAAGCAACC AGAAGTTCCT GAAGAAGAGA AG - #GCTGTAGA   

      660                                                                      

     - GGAAACACCG AAACCAGAAG ATAAAATAAA GGGTATTGGT ACTAAAGAAC CA - #GTTGATAA   

      720                                                                      

     - AAGTGAGTTA AATAATCAAA TTGATAAAGC TAGTTCAGTT TCTCCTACTG AT - #TAT        

      775                                                                      

     - (2) INFORMATION FOR SEQ ID NO:90:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 258 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:90:                            

     - Glu Thr Pro Gln Ser Ile Thr Asn Gln Glu Gl - #n Ala Arg Thr Glu Asn     

     #                15                                                       

     - Gln Val Val Glu Thr Glu Glu Ala Pro Lys Gl - #u Glu Ala Pro Lys Thr     

     #            30                                                           

     - Glu Glu Ser Pro Lys Glu Glu Pro Lys Ser Gl - #u Val Lys Pro Thr Asp     

     #        45                                                               

     - Asp Thr Leu Pro Lys Val Glu Glu Gly Lys Gl - #u Asp Ser Ala Glu Pro     

     #    60                                                                   

     - Ala Pro Val Glu Glu Val Gly Gly Glu Val Gl - #u Ser Lys Pro Glu Glu     

     #80                                                                       

     - Lys Val Ala Val Lys Pro Glu Ser Gln Pro Se - #r Asp Lys Pro Ala Glu     

     #                95                                                       

     - Glu Ser Lys Val Glu Gln Ala Gly Glu Pro Va - #l Ala Pro Arg Glu Asp     

     #           110                                                           

     - Glu Lys Ala Pro Val Glu Pro Glu Lys Gln Pr - #o Glu Ala Pro Glu Glu     

     #       125                                                               

     - Glu Lys Ala Val Glu Glu Thr Pro Lys Gln Gl - #u Glu Ser Thr Pro Asp     

     #   140                                                                   

     - Thr Lys Ala Glu Glu Thr Val Glu Pro Lys Gl - #u Glu Thr Val Asn Gln     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Ser Ile Glu Gln Pro Lys Val Glu Thr Pro Al - #a Val Glu Lys Gln Thr     

     #               175                                                       

     - Glu Pro Thr Glu Glu Pro Lys Val Glu Gln Al - #a Gly Glu Pro Val Ala     

     #           190                                                           

     - Pro Arg Glu Asp Glu Gln Ala Pro Thr Ala Pr - #o Val Glu Pro Glu Lys     

     #       205                                                               

     - Gln Pro Glu Val Pro Glu Glu Glu Lys Ala Va - #l Glu Glu Thr Pro Lys     

     #   220                                                                   

     - Pro Glu Asp Lys Ile Lys Gly Ile Gly Thr Ly - #s Glu Pro Val Asp Lys     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Ser Glu Leu Asn Asn Gln Ile Asp Lys Ala Se - #r Ser Val Ser Pro Thr     

     #               255                                                       

     - Asp Tyr                                                                 

     - (2) INFORMATION FOR SEQ ID NO: 91:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 199 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #91:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - GGATGCTCAA GAAACTGCGG GAGTTCACTA TAAATATGTG GCAGATTCAG AG - #CTATCATC   

       60                                                                      

     - AGAAGAAAAG AAGCAGCTTG TCTATGATAT TCCGACATAC GTGGAGAATG AT - #GATGAAAC   

      120                                                                      

     - TTATTATCTT GTTTATAAGT TAAATTCTCA AAATCAACTG GCGGAATTGC CA - #AATACTGG   

      180                                                                      

     #199               CAA                                                    

     - (2) INFORMATION FOR SEQ ID NO:92:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 66 amino                                            

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:92:                            

     - Asp Ala Gln Glu Thr Ala Gly Val His Tyr Ly - #s Tyr Val Ala Asp Ser     

     #                15                                                       

     - Glu Leu Ser Ser Glu Glu Lys Lys Gln Leu Va - #l Tyr Asp Ile Pro Thr     

     #            30                                                           

     - Tyr Val Glu Asn Asp Asp Glu Thr Tyr Tyr Le - #u Val Tyr Lys Leu Asn     

     #        45                                                               

     - Ser Gln Asn Gln Leu Ala Glu Leu Pro Asn Th - #r Gly Ser Lys Asn Glu     

     #    60                                                                   

     - Arg Gln                                                                 

     65                                                                        

     - (2) INFORMATION FOR SEQ ID NO: 93:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 835 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #93:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - CGACAAAGGT GAGACTGAGG TTCAACCAGA GTCGCCAGAT ACTGTGGTAA GT - #GATAAAGG   

       60                                                                      

     - TGAACCAGAG CAGGTAGCAC CGCTTCCAGA ATATAAGGGT AATATTGAGC AA - #GTAAAACC   

      120                                                                      

     - TGAAACTCCG GTTGAGAAGA CCAAAGAACA AGGTCCAGAA AAAACTGAAG AA - #GTTCCAGT   

      180                                                                      

     - AAAACCAACA GAAGAAACAC CAGTAAATCC AAATGAAGGT ACTACAGAAG GA - #ACCTCAAT   

      240                                                                      

     - TCAAGAAGCA GAAAATCCAG TTCAACCTGC AGAAGAATCA ACAACGAATT CA - #GAGAAAGT   

      300                                                                      

     - ATCACCAGAT ACATCTAGCA AAAATACTGG GGAAGTGTCC AGTAATCCTA GT - #GATTCGAC   

      360                                                                      

     - AACCTCAGTT GGAGAATCAA ATAAACCAGA ACATAATGAC TCTAAAAATG AA - #AATTCAGA   

      420                                                                      

     - AAAAACTGTA GAAGAAGTTC CAGTAAATCC AAATGAAGGC ACAGTAGAAG GT - #ACCTCAAA   

      480                                                                      

     - TCAAGAAACA GAAAAACCAG TTCAACCTGC AGAAGAAACA CAAACAAACT CT - #GGGAAAAT   

      540                                                                      

     - AGCTAACGAA AATACTGGAG AAGTATCCAA TAAACCTAGT GATTCAAAAC CA - #CCAGTTGA   

      600                                                                      

     - AGAATCAAAT CAACCAGAAA AAAACGGAAC TGCAACAAAA CCAGAAAATT CA - #GGTAATAC   

      660                                                                      

     - AACATCAGAG AATGGACAAA CAGAACCAGA ACCATCAAAC GGAAATTCAA CT - #GAGGATGT   

      720                                                                      

     - TTCAACCGAA TCAAACACAT CCAATTCAAA TGGAAACGAA GAAATTAAAC AA - #GAAAATGA   

      780                                                                      

     - ACTAGACCCT GATAAAAAGG TAGAAGAACC AGAGAAAACA CTTGAATTAA GA - #AAT        

      835                                                                      

     - (2) INFORMATION FOR SEQ ID NO:94:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 278 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:94:                            

     - Asp Lys Gly Glu Thr Glu Val Gln Pro Glu Se - #r Pro Asp Thr Val Val     

     #                15                                                       

     - Ser Asp Lys Gly Glu Pro Glu Gln Val Ala Pr - #o Leu Pro Glu Tyr Lys     

     #            30                                                           

     - Gly Asn Ile Glu Gln Val Lys Pro Glu Thr Pr - #o Val Glu Lys Thr Lys     

     #        45                                                               

     - Glu Gln Gly Pro Glu Lys Thr Glu Glu Val Pr - #o Val Lys Pro Thr Glu     

     #    60                                                                   

     - Glu Thr Pro Val Asn Pro Asn Glu Gly Thr Th - #r Glu Gly Thr Ser Ile     

     #80                                                                       

     - Gln Glu Ala Glu Asn Pro Val Gln Pro Ala Gl - #u Glu Ser Thr Thr Asn     

     #                95                                                       

     - Ser Glu Lys Val Ser Pro Asp Thr Ser Ser Ly - #s Asn Thr Gly Glu Val     

     #           110                                                           

     - Ser Ser Asn Pro Ser Asp Ser Thr Thr Ser Va - #l Gly Glu Ser Asn Lys     

     #       125                                                               

     - Pro Glu His Asn Asp Ser Lys Asn Glu Asn Se - #r Glu Lys Thr Val Glu     

     #   140                                                                   

     - Glu Val Pro Val Asn Pro Asn Glu Gly Thr Va - #l Glu Gly Thr Ser Asn     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Gln Glu Thr Glu Lys Pro Val Gln Pro Ala Gl - #u Glu Thr Gln Thr Asn     

     #               175                                                       

     - Ser Gly Lys Ile Ala Asn Glu Asn Thr Gly Gl - #u Val Ser Asn Lys Pro     

     #           190                                                           

     - Ser Asp Ser Lys Pro Pro Val Glu Glu Ser As - #n Gln Pro Glu Lys Asn     

     #       205                                                               

     - Gly Thr Ala Thr Lys Pro Glu Asn Ser Gly As - #n Thr Thr Ser Glu Asn     

     #   220                                                                   

     - Gly Gln Thr Glu Pro Glu Pro Ser Asn Gly As - #n Ser Thr Glu Asp Val     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Ser Thr Glu Ser Asn Thr Ser Asn Ser Asn Gl - #y Asn Glu Glu Ile Lys     

     #               255                                                       

     - Gln Glu Asn Glu Leu Asp Pro Asp Lys Lys Va - #l Glu Glu Pro Glu Lys     

     #           270                                                           

     - Thr Leu Glu Leu Arg Asn                                                 

             275                                                               

     - (2) INFORMATION FOR SEQ ID NO: 95:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 709 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #95:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - AAATCAATTG GTAGCACAAG ATCCAAAAGC ACAAGATAGC ACTAAACTGA CT - #GCTGAAAA   

       60                                                                      

     - ATCAACTGTT AAAGCACCTG CTCAAAGAGT AGATGTAAAA GATATAACTC AT - #TTAACAGA   

      120                                                                      

     - TGAAGAAAAA GTTAAGGTTG CTATTTTACA AGCAAATGGT TCAGCATTAG AC - #GGAGCGAC   

      180                                                                      

     - AATCAATGTA GCTGGAGATG GTACAGCAAC AATCACATTC CCAGATGGTT CA - #GTAGTGAC   

      240                                                                      

     - GATTCTAGGA AAAGATACAG TTCAACAATC TGCGAAAGGT GAATCTGTAA CT - #CAAGAAGC   

      300                                                                      

     - TACACCAGAG TATAAGCTAG AAAATACACC AGGTGGAGAT AAGGGAGGCA AT - #ACTGGAAG   

      360                                                                      

     - CTCAGATGCT AATGCGAATG AAGGCGGTGG TAGCCAGGCG GGTGGATCAG CT - #CACACAGG   

      420                                                                      

     - TTCACAAAAC TCAGCTCAAT CACAAGCTTC TAAGCAATTA GCTACTGAAA AA - #GAATCAGC   

      480                                                                      

     - TAAAAATGCC ATTGAAAAAG CAGCCAAGGA CAAGCAGGAT GAAATCAAAG GC - #GCACCGCT   

      540                                                                      

     - TTCTGATAAA GAAAAAGCAG AACTTTTAGC AAGAGTGGAA GCAGAAAAAC AA - #GCAGCTCT   

      600                                                                      

     - CAAAGAGATT GAAAATGCGA AAACTATGGA AGATGTGAAG GAAGCAGAAA CG - #ATTGGAGT   

      660                                                                      

     #              709GGTTA CAGTTCCTAA GAGACCAGTG GCTCCTAAT                   

     - (2) INFORMATION FOR SEQ ID NO:96:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 236 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:96:                            

     - Asn Gln Leu Val Ala Gln Asp Pro Lys Ala Gl - #n Asp Ser Thr Lys Leu     

     #                15                                                       

     - Thr Ala Glu Lys Ser Thr Val Lys Ala Pro Al - #a Gln Arg Val Asp Val     

     #            30                                                           

     - Lys Asp Ile Thr His Leu Thr Asp Glu Glu Ly - #s Val Lys Val Ala Ile     

     #        45                                                               

     - Leu Gln Ala Asn Gly Ser Ala Leu Asp Gly Al - #a Thr Ile Asn Val Ala     

     #    60                                                                   

     - Gly Asp Gly Thr Ala Thr Ile Thr Phe Pro As - #p Gly Ser Val Val Thr     

     #80                                                                       

     - Ile Leu Gly Lys Asp Thr Val Gln Gln Ser Al - #a Lys Gly Glu Ser Val     

     #                95                                                       

     - Thr Gln Glu Ala Thr Pro Glu Tyr Lys Leu Gl - #u Asn Thr Pro Gly Gly     

     #           110                                                           

     - Asp Lys Gly Gly Asn Thr Gly Ser Ser Asp Al - #a Asn Ala Asn Glu Gly     

     #       125                                                               

     - Gly Gly Ser Gln Ala Gly Gly Ser Ala His Th - #r Gly Ser Gln Asn Ser     

     #   140                                                                   

     - Ala Gln Ser Gln Ala Ser Lys Gln Leu Ala Th - #r Glu Lys Glu Ser Ala     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Lys Asn Ala Ile Glu Lys Ala Ala Lys Asp Ly - #s Gln Asp Glu Ile Lys     

     #               175                                                       

     - Gly Ala Pro Leu Ser Asp Lys Glu Lys Ala Gl - #u Leu Leu Ala Arg Val     

     #           190                                                           

     - Glu Ala Glu Lys Gln Ala Ala Leu Lys Glu Il - #e Glu Asn Ala Lys Thr     

     #       205                                                               

     - Met Glu Asp Val Lys Glu Ala Glu Thr Ile Gl - #y Val Gln Ala Ile Ala     

     #   220                                                                   

     - Met Val Thr Val Pro Lys Arg Pro Val Ala Pr - #o Asn                     

     225                 2 - #30                 2 - #35                       

     - (2) INFORMATION FOR SEQ ID NO: 97:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 787 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #97:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - CAAACAGTCA GCTTCAGGAA CGATTGAGGT GATTTCACGA GAAAATGGCT CT - #GGGACACG   

       60                                                                      

     - GGGTGCCTTC ACAGAAATCA CAGGGATTCT CAAAAAAGAC GGTGATAAAA AA - #ATTGACAA   

      120                                                                      

     - CACTGCCAAA ACAGCTGTGA TTCAAAATAG TACAGAAGGT GTTCTCTCAG CA - #GTTCAAGG   

      180                                                                      

     - GAATGCTAAT GCTATCGGCT ACATCTCCTT GGGATCTTTA ACGAAATCTG TC - #AAGGCTTT   

      240                                                                      

     - AGAGATTGAT GGTGTCAAGG CTAGTCGAGA CACAGTTTTA GATGGTGAAT AC - #CCTCTTCA   

      300                                                                      

     - ACGTCCCTTC AACATTGTTT GGTCTTCTAA TCTTTCCAAG CTAGGTCAAG AT - #TTTATCAG   

      360                                                                      

     - CTTTATCCAC TCCAAACAAG GTCAACAAGT GGTCACAGAT AATAAATTTA TT - #GAAGCTAA   

      420                                                                      

     - AACCGAAACC ACGGAATATA CAAGCCAACA CTTATCAGGC AAGTTGTCTG TT - #GTAGGTTC   

      480                                                                      

     - CACTTCAGTA TCTTCTTTAA TGGAAAAATT AGCAGAAGCT TATAAAAAAG AA - #AATCCAGA   

      540                                                                      

     - AGTTACGATT GATATTACCT CTAATGGGTC TTCAGCAGGT ATTACCGCTG TT - #AAGGAGAA   

      600                                                                      

     - AACCGCTGAT ATTGGTATGG TTTCTAGGGA ATTAACTCCT GAAGAAGGTA AG - #AGTCTCAC   

      660                                                                      

     - CCATGATGCT ATTGCTTTAG ACGGTATTGC TGTTGTGGTC AATAATGACA AT - #AAGGCAAG   

      720                                                                      

     - CCAAGTCAGT ATGGCTGAAC TTGCAGACGT TTTTAGTGGC AAATTAACCA CC - #TGGGACAA   

      780                                                                      

     #         787                                                             

     - (2) INFORMATION FOR SEQ ID NO:98:                                       

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 262 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:98:                            

     - Lys Gln Ser Ala Ser Gly Thr Ile Glu Val Il - #e Ser Arg Glu Asn Gly     

     #                15                                                       

     - Ser Gly Thr Arg Gly Ala Phe Thr Glu Ile Th - #r Gly Ile Leu Lys Lys     

     #            30                                                           

     - Asp Gly Asp Lys Lys Ile Asp Asn Thr Ala Ly - #s Thr Ala Val Ile Gln     

     #        45                                                               

     - Asn Ser Thr Glu Gly Val Leu Ser Ala Val Gl - #n Gly Asn Ala Asn Ala     

     #    60                                                                   

     - Ile Gly Tyr Ile Ser Leu Gly Ser Leu Thr Ly - #s Ser Val Lys Ala Leu     

     #80                                                                       

     - Glu Ile Asp Gly Val Lys Ala Ser Arg Asp Th - #r Val Leu Asp Gly Glu     

     #                95                                                       

     - Tyr Pro Leu Gln Arg Pro Phe Asn Ile Val Tr - #p Ser Ser Asn Leu Ser     

     #           110                                                           

     - Lys Leu Gly Gln Asp Phe Ile Ser Phe Ile Hi - #s Ser Lys Gln Gly Gln     

     #       125                                                               

     - Gln Val Val Thr Asp Asn Lys Phe Ile Glu Al - #a Lys Thr Glu Thr Thr     

     #   140                                                                   

     - Glu Tyr Thr Ser Gln His Leu Ser Gly Lys Le - #u Ser Val Val Gly Ser     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Thr Ser Val Ser Ser Leu Met Glu Lys Leu Al - #a Glu Ala Tyr Lys Lys     

     #               175                                                       

     - Glu Asn Pro Glu Val Thr Ile Asp Ile Thr Se - #r Asn Gly Ser Ser Ala     

     #           190                                                           

     - Gly Ile Thr Ala Val Lys Glu Lys Thr Ala As - #p Ile Gly Met Val Ser     

     #       205                                                               

     - Arg Glu Leu Thr Pro Glu Glu Gly Lys Ser Le - #u Thr His Asp Ala Ile     

     #   220                                                                   

     - Ala Leu Asp Gly Ile Ala Val Val Val Asn As - #n Asp Asn Lys Ala Ser     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Gln Val Ser Met Ala Glu Leu Ala Asp Val Ph - #e Ser Gly Lys Leu Thr     

     #               255                                                       

     - Thr Trp Asp Lys Ile Lys                                                 

                 260                                                           

     - (2) INFORMATION FOR SEQ ID NO: 99:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 421 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #99:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - ATTCGATGAT GCGGATGAAA AGATGACCCG TGATGAAATT GCCTATATGC TG - #ACAAATAG   

       60                                                                      

     - TGAAGAAACA TTGGATGCTG ATGAGATTGA GATGCTACAA GGTGTCTTTT CG - #CTCGATGA   

      120                                                                      

     - ACTGATGGCA CGAGAGGTTA TGGTTCCTCG AACGGATGCC TTTATGGTGG AT - #ATTCAGGA   

      180                                                                      

     - TGATAGTCAA GCCATTATCC AAAGTATTTT AAAACAAAAT TATTCTCGTA TC - #CCGGTTTA   

      240                                                                      

     - TGATGGGGAT AAGGACAATG TAATTGGAAT CATTCACACC AAGAGTCTCC TT - #AAGGCAGG   

      300                                                                      

     - CTTTGTGGAC GGTTTTGACA ATATTGTTTG GAAGAGAATT TTACAAGATC CA - #CTTTTTGT   

      360                                                                      

     - ACCTGAAACT ATTTTTGTGG ATGACTTGCT AAAAGAACTG CGAAATACCC AA - #AGACAAAT   

      420                                                                      

     #              421                                                        

     - (2) INFORMATION FOR SEQ ID NO:100:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 140 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:100:                           

     - Phe Asp Asp Ala Asp Glu Lys Met Thr Arg As - #p Glu Ile Ala Tyr Met     

     #                15                                                       

     - Leu Thr Asn Ser Glu Glu Thr Leu Asp Ala As - #p Glu Ile Glu Met Leu     

     #            30                                                           

     - Gln Gly Val Phe Ser Leu Asp Glu Leu Met Al - #a Arg Glu Val Met Val     

     #        45                                                               

     - Pro Arg Thr Asp Ala Phe Met Val Asp Ile Gl - #n Asp Asp Ser Gln Ala     

     #    60                                                                   

     - Ile Ile Gln Ser Ile Leu Lys Gln Asn Tyr Se - #r Arg Ile Pro Val Tyr     

     #80                                                                       

     - Asp Gly Asp Lys Asp Asn Val Ile Gly Ile Il - #e His Thr Lys Ser Leu     

     #                95                                                       

     - Leu Lys Ala Gly Phe Val Asp Gly Phe Asp As - #n Ile Val Trp Lys Arg     

     #           110                                                           

     - Ile Leu Gln Asp Pro Leu Phe Val Pro Glu Th - #r Ile Phe Val Asp Asp     

     #       125                                                               

     - Leu Leu Lys Glu Leu Arg Asn Thr Gln Arg Gl - #n Met                     

     #   140                                                                   

     - (2) INFORMATION FOR SEQ ID NO: 101:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 331 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #101: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - GGAGAGTCGA TCAAAAGTAG ATGAAGCTGT GTCTAAGTTT GAAAAGGACT CA - #TCTTCTTC   

       60                                                                      

     - GTCAAGTTCA GACTCTTCCA CTAAACCGGA AGCTTCAGAT ACAGCGAAGC CA - #AACAAGCC   

      120                                                                      

     - GACAGAACCA GGAGAAAAGG TAGCAGAAGC TAAGAAGAAG GTTGAAGAAG CT - #GAGAAAAA   

      180                                                                      

     - AGCCAAGGAT CAAAAAGAAG AAGATCGTCG TAACTACCCA ACCATTACTT AC - #AAAACGCT   

      240                                                                      

     - TGAACTTGAA ATTGCTGAGT CCGATGTGGA AGTTAAAAAA GCGGAGCTTG AA - #CTAGTAAA   

      300                                                                      

     #         331      CCTC GAGACGAGCA A                                      

     - (2) INFORMATION FOR SEQ ID NO:102:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 110 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:102:                           

     - Glu Ser Arg Ser Lys Val Asp Glu Ala Val Se - #r Lys Phe Glu Lys Asp     

     #                15                                                       

     - Ser Ser Ser Ser Ser Ser Ser Asp Ser Ser Th - #r Lys Pro Glu Ala Ser     

     #            30                                                           

     - Asp Thr Ala Lys Pro Asn Lys Pro Thr Glu Pr - #o Gly Glu Lys Val Ala     

     #        45                                                               

     - Glu Ala Lys Lys Lys Val Glu Glu Ala Glu Ly - #s Lys Ala Lys Asp Gln     

     #    60                                                                   

     - Lys Glu Glu Asp Arg Arg Asn Tyr Pro Thr Il - #e Thr Tyr Lys Thr Leu     

     #80                                                                       

     - Glu Leu Glu Ile Ala Glu Ser Asp Val Glu Va - #l Lys Lys Ala Glu Leu     

     #                95                                                       

     - Glu Leu Val Lys Val Lys Ala Asn Glu Pro Ar - #g Asp Glu Gln             

     #           110                                                           

     - (2) INFORMATION FOR SEQ ID NO: 103:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 358 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #103: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - ATGGACAACA GGAAACTGGG ACGAGGTTAT ATCTGGTAAG ATTGACAAGT AC - #AAAGATCC   

       60                                                                      

     - AGATATTCCA ACAGTTGAAT CACAAGAAGT TACGTCAGAC TCTAGTGATA AA - #GAAATAAC   

      120                                                                      

     - GGTAAGGTAT GACCGTTTAT CAACACCAGA AAAACCAATC CCACAACCAA AT - #CCAGAGCA   

      180                                                                      

     - TCCAAGTGTT CCGACACCAA ACCCAGAACT ACCAAATCAA GAGACTCCAA CA - #CCAGATAA   

      240                                                                      

     - ACCAACTCCA GAACCAGGTA CTCCAAAAAC TGAAACTCCA GTGAATCCAG AC - #CCAGAAGT   

      300                                                                      

     - TCCGACTTAT GAGACAGGTA AGAGAGAGGA ATTGCCAAAC ACAGGTACAG AA - #GCTAAT     

      358                                                                      

     - (2) INFORMATION FOR SEQ ID NO:104:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 119 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:104:                           

     - Trp Thr Thr Gly Asn Trp Asp Glu Val Ile Se - #r Gly Lys Ile Asp Lys     

     #                15                                                       

     - Tyr Lys Asp Pro Asp Ile Pro Thr Val Glu Se - #r Gln Glu Val Thr Ser     

     #            30                                                           

     - Asp Ser Ser Asp Lys Glu Ile Thr Val Arg Ty - #r Asp Arg Leu Ser Thr     

     #        45                                                               

     - Pro Glu Lys Pro Ile Pro Gln Pro Asn Pro Gl - #u His Pro Ser Val Pro     

     #    60                                                                   

     - Thr Pro Asn Pro Glu Leu Pro Asn Gln Glu Th - #r Pro Thr Pro Asp Lys     

     #80                                                                       

     - Pro Thr Pro Glu Pro Gly Thr Pro Lys Thr Gl - #u Thr Pro Val Asn Pro     

     #                95                                                       

     - Asp Pro Glu Val Pro Thr Tyr Glu Thr Gly Ly - #s Arg Glu Glu Leu Pro     

     #           110                                                           

     - Asn Thr Gly Thr Glu Ala Asn                                             

             115                                                               

     - (2) INFORMATION FOR SEQ ID NO: 105:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1879 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #105: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - CGATGGGCTC AATCCAACCC CAGGTCAAGT CTTACCTGAA GAGACATCGG GA - #ACGAAAGA   

       60                                                                      

     - GGGTGACTTA TCAGAAAAAC CAGGAGACAC CGTTCTCACT CAAGCGAAAC CT - #GAGGGCGT   

      120                                                                      

     - TACTGGAAAT ACGAATTCAC TTCCGACACC TACAGAAAGA ACTGAAGTGA GC - #GAGGAAAC   

      180                                                                      

     - AAGCCCTTCT AGTCTGGATA CACTTTTTGA AAAAGATGAA GAAGCTCAAA AA - #AATCCAGA   

      240                                                                      

     - GCTAACAGAT GTCTTAAAAG AAACTGTAGA TACAGCTGAT GTGGATGGGA CA - #CAAGCAAG   

      300                                                                      

     - TCCAGCAGAA ACTACTCCTG AACAAGTAAA AGGTGGAGTG AAAGAAAATA CA - #AAAGACAG   

      360                                                                      

     - CATCGATGTT CCTGCTGCTT ATCTTGAAAA AGCTGAAGGG AAAGGTCCTT TC - #ACTGCCGG   

      420                                                                      

     - TGTAAACCAA GTAATTCCTT ATGAACTATT CGCTGGTGAT GGTATGTTAA CT - #CGTCTATT   

      480                                                                      

     - ACTAAAAGCT TCGGATAATG CTCCTTGGTC TGACAATGGT ACTGCTAAAA AT - #CCTGCTTT   

      540                                                                      

     - ACCTCCTCTT GAAGGATTAA CAAAAGGGAA ATACTTCTAT GAAGTAGACT TA - #AATGGCAA   

      600                                                                      

     - TACTGTTGGT AAACAAGGTC AAGCTTTAAT TGATCAACTT CGCGCTAATG GT - #ACTCAAAC   

      660                                                                      

     - TTATAAAGCT ACTGTTAAAG TTTACGGAAA TAAAGACGGT AAAGCTGACT TG - #ACTAATCT   

      720                                                                      

     - AGTTGCTACT AAAAATGTAG ACATCAACAT CAATGGATTA GTTGCTAAAG AA - #ACAGTTCA   

      780                                                                      

     - AAAAGCCGTT GCAGACAACG TTAAAGACAG TATCGATGTT CCAGCAGCCT AC - #CTAGAAAA   

      840                                                                      

     - AGCCAAGGGT GAAGGTCCAT TCACAGCAGG TGTCAACCAT GTGATTCCAT AC - #GAACTCTT   

      900                                                                      

     - CGCAGGTGAT GGCATGTTGA CTCGTCTCTT GCTCAAGGCA TCTGACAAGG CA - #CCATGGTC   

      960                                                                      

     - AGATAACGGC GACGCTAAAA ACCCAGCCCT ATCTCCACTA GGCGAAAACG TG - #AAGACCAA   

     1020                                                                      

     - AGGTCAATAC TTCTATCAAN TAGCCTTGGA CGGAAATGTA GCTGGCAAAG AA - #AAACAAGC   

     1080                                                                      

     - GCTCATTGAC CAGTTCCGAG CAAANGGTAC TCAAACTTAC AGCGCTACAG TC - #AATGTCTA   

     1140                                                                      

     - TGGTAACAAA GACGGTAAAC CAGACTTGGA CAACATCGTA GCAACTAAAA AA - #GTCACTAT   

     1200                                                                      

     - TAACATAAAC GGTTTAATTT CTAAAGAAAC AGTTCAAAAA GCCGTTGCAG AC - #AACGTTAA   

     1260                                                                      

     - NGACAGTATC GATGTTCCAG CAGCCTACCT AGAAAAAGCC AAGGGTGAAG GT - #CCATTCAC   

     1320                                                                      

     - AGCAGGTGTC AACCATGTGA TTCCATACGA ACTCTTCGCA GGTGATGGTA TG - #TTGACTCG   

     1380                                                                      

     - TCTCTTGCTC AAGGCATCTG ACAAGGCACC ATGGTCAGAT AACGGNGACG CT - #AAAAACCC   

     1440                                                                      

     - AGCNCTATCT CCACTAGGTG AAAACGTGAA GACCAAAGGT CAATACTTCT AT - #CAANTAGC   

     1500                                                                      

     - CTTGGACGGA AATGTAGCTG GCAAAGAAAA ACAAGCGCTC ATTGACCAGT TC - #CGAGCAAA   

     1560                                                                      

     - CGGTACTCAA ACTTACAGCG CTACAGTCAA TGTCTATGGT AACAAAGACG GT - #AAACCAGA   

     1620                                                                      

     - CTTGGACAAC ATCGTAGCAA CTAAAAAAGT CACTATTAAG ATAAATGTTA AA - #GAAACATC   

     1680                                                                      

     - AGACACAGCA AATGGTTCAT TATCACCTTC TAACTCTGGT TCTGGCGTGA CT - #CCGATGAA   

     1740                                                                      

     - TCACAATCAT GCTACAGGTA CTACAGATAG CATGCCTGCT GACACCATGA CA - #AGTTCTAC   

     1800                                                                      

     - CAACACGATG GCAGGTGAAA ACATGGCTGC TTCTGCTAAC AAGATGTCTG AT - #ACGATGAT   

     1860                                                                      

     #                 187 - #9                                                

     - (2) INFORMATION FOR SEQ ID NO:106:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 626 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:106:                           

     - Asp Gly Leu Asn Pro Thr Pro Gly Gln Val Le - #u Pro Glu Glu Thr Ser     

     #                15                                                       

     - Gly Thr Lys Glu Gly Asp Leu Ser Glu Lys Pr - #o Gly Asp Thr Val Leu     

     #            30                                                           

     - Thr Gln Ala Lys Pro Glu Gly Val Thr Gly As - #n Thr Asn Ser Leu Pro     

     #        45                                                               

     - Thr Pro Thr Glu Arg Thr Glu Val Ser Glu Gl - #u Thr Ser Pro Ser Ser     

     #    60                                                                   

     - Leu Asp Thr Leu Phe Glu Lys Asp Glu Glu Al - #a Gln Lys Asn Pro Glu     

     #80                                                                       

     - Leu Thr Asp Val Leu Lys Glu Thr Val Asp Th - #r Ala Asp Val Asp Gly     

     #                95                                                       

     - Thr Gln Ala Ser Pro Ala Glu Thr Thr Pro Gl - #u Gln Val Lys Gly Gly     

     #           110                                                           

     - Val Lys Glu Asn Thr Lys Asp Ser Ile Asp Va - #l Pro Ala Ala Tyr Leu     

     #       125                                                               

     - Glu Lys Ala Glu Gly Lys Gly Pro Phe Thr Al - #a Gly Val Asn Gln Val     

     #   140                                                                   

     - Ile Pro Tyr Glu Leu Phe Ala Gly Asp Gly Me - #t Leu Thr Arg Leu Leu     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Leu Lys Ala Ser Asp Asn Ala Pro Trp Ser As - #p Asn Gly Thr Ala Lys     

     #               175                                                       

     - Asn Pro Ala Leu Pro Pro Leu Glu Gly Leu Th - #r Lys Gly Lys Tyr Phe     

     #           190                                                           

     - Tyr Glu Val Asp Leu Asn Gly Asn Thr Val Gl - #y Lys Gln Gly Gln Ala     

     #       205                                                               

     - Leu Ile Asp Gln Leu Arg Ala Asn Gly Thr Gl - #n Thr Tyr Lys Ala Thr     

     #   220                                                                   

     - Val Lys Val Tyr Gly Asn Lys Asp Gly Lys Al - #a Asp Leu Thr Asn Leu     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Val Ala Thr Lys Asn Val Asp Ile Asn Ile As - #n Gly Leu Val Ala Lys     

     #               255                                                       

     - Glu Thr Val Gln Lys Ala Val Ala Asp Asn Va - #l Lys Asp Ser Ile Asp     

     #           270                                                           

     - Val Pro Ala Ala Tyr Leu Glu Lys Ala Lys Gl - #y Glu Gly Pro Phe Thr     

     #       285                                                               

     - Ala Gly Val Asn His Val Ile Pro Tyr Glu Le - #u Phe Ala Gly Asp Gly     

     #   300                                                                   

     - Met Leu Thr Arg Leu Leu Leu Lys Ala Ser As - #p Lys Ala Pro Trp Ser     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Asp Asn Gly Asp Ala Lys Asn Pro Ala Leu Se - #r Pro Leu Gly Glu Asn     

     #               335                                                       

     - Val Lys Thr Lys Gly Gln Tyr Phe Tyr Gln Xa - #a Ala Leu Asp Gly Asn     

     #           350                                                           

     - Val Ala Gly Lys Glu Lys Gln Ala Leu Ile As - #p Gln Phe Arg Ala Xaa     

     #       365                                                               

     - Gly Thr Gln Thr Tyr Ser Ala Thr Val Asn Va - #l Tyr Gly Asn Lys Asp     

     #   380                                                                   

     - Gly Lys Pro Asp Leu Asp Asn Ile Val Ala Th - #r Lys Lys Val Thr Ile     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Asn Ile Asn Gly Leu Ile Ser Lys Glu Thr Va - #l Gln Lys Ala Val Ala     

     #               415                                                       

     - Asp Asn Val Xaa Asp Ser Ile Asp Val Pro Al - #a Ala Tyr Leu Glu Lys     

     #           430                                                           

     - Ala Lys Gly Glu Gly Pro Phe Thr Ala Gly Va - #l Asn His Val Ile Pro     

     #       445                                                               

     - Tyr Glu Leu Phe Ala Gly Asp Gly Met Leu Th - #r Arg Leu Leu Leu Lys     

     #   460                                                                   

     - Ala Ser Asp Lys Ala Pro Trp Ser Asp Asn Gl - #y Asp Ala Lys Asn Pro     

     465                 4 - #70                 4 - #75                 4 -   

     #80                                                                       

     - Ala Leu Ser Pro Leu Gly Glu Asn Val Lys Th - #r Lys Gly Gln Tyr Phe     

     #               495                                                       

     - Tyr Gln Xaa Ala Leu Asp Gly Asn Val Ala Gl - #y Lys Glu Lys Gln Ala     

     #           510                                                           

     - Leu Ile Asp Gln Phe Arg Ala Asn Gly Thr Gl - #n Thr Tyr Ser Ala Thr     

     #       525                                                               

     - Val Asn Val Tyr Gly Asn Lys Asp Gly Lys Pr - #o Asp Leu Asp Asn Ile     

     #   540                                                                   

     - Val Ala Thr Lys Lys Val Thr Ile Lys Ile As - #n Val Lys Glu Thr Ser     

     545                 5 - #50                 5 - #55                 5 -   

     #60                                                                       

     - Asp Thr Ala Asn Gly Ser Leu Ser Pro Ser As - #n Ser Gly Ser Gly Val     

     #               575                                                       

     - Thr Pro Met Asn His Asn His Ala Thr Gly Th - #r Thr Asp Ser Met Pro     

     #           590                                                           

     - Ala Asp Thr Met Thr Ser Ser Thr Asn Thr Me - #t Ala Gly Glu Asn Met     

     #       605                                                               

     - Ala Ala Ser Ala Asn Lys Met Ser Asp Thr Me - #t Met Ser Glu Asp Lys     

     #   620                                                                   

     - Ala Met                                                                 

     625                                                                       

     - (2) INFORMATION FOR SEQ ID NO: 107:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 593 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #107: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TTCCAATCAA AAACAGGCAG ATGGTAAACT CAATATCGTG ACAACCTTTT AC - #CCTGTCTA   

       60                                                                      

     - TGArTTTACC AAGCAAGTCG CAGGAGATAC GGCTAATGTA GAACTCCTAA TC - #GGTGCTGG   

      120                                                                      

     - GACAGAACCT CATGAATACG AACCATCTGC CAAGGCAGTT GCCAAAATCC AA - #GATGCAGA   

      180                                                                      

     - TACCTTCGTT TATGAAAATG AAAACATGGA AACATGGGTA CCTAAATTGC TA - #GATACCTT   

      240                                                                      

     - GGATAAGAAA AAAGTGAAAA CCATCAAGGC GACAGGCGAT ATGTTGCTCT TG - #CCAGGTGG   

      300                                                                      

     - CGAGGAAGAA GAGGGAGACC ATGACCATGG AGAAGAAGGT CATCACCATG AG - #TTTGACCC   

      360                                                                      

     - CCATGTTTGG TTATCACCAG TTCGTGCCAT tAAACTAGTA GAGCACCATC CG - #CGACACTT   

      420                                                                      

     - GTCAGCAGAT TATCCTGATA AAAAAGAGAC CTTTGAGAAG AATGCAGCTG CC - #TATATCGA   

      480                                                                      

     - AAAATTGCAA GCCTTGGATA AGGCTTACGC AGAAGGTTTG TCTCAAGCAA AA - #CAAAAGAG   

      540                                                                      

     - CTTTGTGACT CAACACGCAg CCTTTAACTa TCTTGCCTTG GACTATGGGA CT - #C          

      593                                                                      

     - (2) INFORMATION FOR SEQ ID NO:108:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 197 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:108:                           

     - Ser Asn Gln Lys Gln Ala Asp Gly Lys Leu As - #n Ile Val Thr Thr Phe     

     #                15                                                       

     - Tyr Pro Val Tyr Glu Phe Thr Lys Gln Val Al - #a Gly Asp Thr Ala Asn     

     #            30                                                           

     - Val Glu Leu Leu Ile Gly Ala Gly Thr Glu Pr - #o His Glu Tyr Glu Pro     

     #        45                                                               

     - Ser Ala Lys Ala Val Ala Lys Ile Gln Asp Al - #a Asp Thr Phe Val Tyr     

     #    60                                                                   

     - Glu Asn Glu Asn Met Glu Thr Trp Val Pro Ly - #s Leu Leu Asp Thr Leu     

     #80                                                                       

     - Asp Lys Lys Lys Val Lys Thr Ile Lys Ala Th - #r Gly Asp Met Leu Leu     

     #                95                                                       

     - Leu Pro Gly Gly Glu Glu Glu Glu Gly Asp Hi - #s Asp His Gly Glu Glu     

     #           110                                                           

     - Gly His His His Glu Phe Asp Pro His Val Tr - #p Leu Ser Pro Val Arg     

     #       125                                                               

     - Ala Ile Lys Leu Val Glu His His Pro Arg Hi - #s Leu Ser Ala Asp Tyr     

     #   140                                                                   

     - Pro Asp Lys Lys Glu Thr Phe Glu Lys Asn Al - #a Ala Ala Tyr Ile Glu     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Lys Leu Gln Ala Leu Asp Lys Ala Tyr Ala Gl - #u Gly Leu Ser Gln Ala     

     #               175                                                       

     - Lys Gln Lys Ser Phe Val Thr Gln His Ala Al - #a Phe Asn Tyr Leu Ala     

     #           190                                                           

     - Leu Asp Tyr Gly Thr                                                     

             195                                                               

     - (2) INFORMATION FOR SEQ ID NO: 109:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1003 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #109: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TATCACAGGA TCGAACGGTA AGACAACCAC AACGACTATG ATTGGGGAAG TT - #TTGACTGC   

       60                                                                      

     - TGCTGGCCAA CATGGTCTTT TATCAGGGAA TATCGGCTAT CCAGCTAGTC AG - #GTTGCTCA   

      120                                                                      

     - AATAGCATCA GATAAGGACA CGCTTGTTAT GGAACTTTCT TCTTTCCAAC TC - #ATGGGTGT   

      180                                                                      

     - TCAAGAATTC CATCCAGAGA TTGCGGTTAT TACCAACCTC ATGCCAACTC AT - #ATCGACTA   

      240                                                                      

     - CCATGGGTCA TTTTCGGAAT ATGTAGCAGC CAAGTGGAAT ATCCAGAACA AG - #ATGACAGC   

      300                                                                      

     - AGCTGATTTC CTTGTCTTGA ACTTTAATCA AGACTTGGCA AAAGACTTGA CT - #TCCAAGAC   

      360                                                                      

     - AGAAGCCACT GTTGTACCAT TTTCAACACT TGAAAAGGTT GATGGAGCTT AT - #CTGGAAGA   

      420                                                                      

     - TGGTCAACTC TACTTCCGTG GTGAAGTAGT CATGGCAGCG AATGAAATCG GT - #GTTCCAGG   

      480                                                                      

     - TAGCCACAAT GTGGAAAATG CCCTTGCGAC TATTGCTGTA GCCAAGCTTC GT - #GATGTGGA   

      540                                                                      

     - CAATCAAACC ATCAAGGAAA CTCTTTCAGC CTTCGGTGGT GTCAAACACC GT - #CTCCAGTT   

      600                                                                      

     - TGTGGATGAC ATCAAGGGTG TTAAATTCTA TAACGACAGT AAATCAACTA AT - #ATCTTGGC   

      660                                                                      

     - TACTCAAAAA GCCTTGTCAG GATTTGACAA CAGCAAGGTC GTCTTGATTG CA - #GGTGGTTT   

      720                                                                      

     - GGACCGTGGC AATGAGTTTG ACGAATTGGT GCCAGACATT ACTGGACTCA AG - #AAGATGGT   

      780                                                                      

     - CATCCTGGGT CAATCTGCAG AACGTGTCAA ACGGGCAGCA GACAAGGCTG GT - #GTCGCTTA   

      840                                                                      

     - TGTGGAGGCG ACAGATATTG CAGATGCGAC CCGCAAGGCC TATGAGCTTG CG - #ACTCAAGG   

      900                                                                      

     - AGATGTGGTT CTTCTTAGTC CTGCCAATGC TAGCTGGGAT ATGTATGCTA AC - #TTTGAAGT   

      960                                                                      

     #                 100 - #3ACAGTAGC GGAGTTAAAA GAA                         

     - (2) INFORMATION FOR SEQ ID NO:110:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 335 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:110:                           

     - Gly Ile Thr Gly Ser Asn Gly Lys Thr Thr Th - #r Thr Thr Met Ile Gly     

     #                15                                                       

     - Glu Val Leu Thr Ala Ala Gly Gln His Gly Le - #u Leu Ser Gly Asn Ile     

     #            30                                                           

     - Gly Tyr Pro Ala Ser Gln Val Ala Gln Ile Al - #a Ser Asp Lys Asp Thr     

     #        45                                                               

     - Leu Val Met Glu Leu Ser Ser Phe Gln Leu Me - #t Gly Val Gln Glu Phe     

     #    60                                                                   

     - His Pro Glu Ile Ala Val Ile Thr Asn Leu Me - #t Pro Thr His Ile Asp     

     #80                                                                       

     - Tyr His Gly Ser Phe Ser Glu Tyr Val Ala Al - #a Lys Trp Asn Ile Gln     

     #                95                                                       

     - Asn Lys Met Thr Ala Ala Asp Phe Leu Val Le - #u Asn Phe Asn Gln Asp     

     #           110                                                           

     - Leu Ala Lys Asp Leu Thr Ser Lys Thr Glu Al - #a Thr Val Val Pro Phe     

     #       125                                                               

     - Ser Thr Leu Glu Lys Val Asp Gly Ala Tyr Le - #u Glu Asp Gly Gln Leu     

     #   140                                                                   

     - Tyr Phe Arg Gly Glu Val Val Met Ala Ala As - #n Glu Ile Gly Val Pro     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Gly Ser His Asn Val Glu Asn Ala Leu Ala Th - #r Ile Ala Val Ala Lys     

     #               175                                                       

     - Leu Arg Asp Val Asp Asn Gln Thr Ile Lys Gl - #u Thr Leu Ser Ala Phe     

     #           190                                                           

     - Gly Gly Val Lys His Arg Leu Gln Phe Val As - #p Asp Ile Lys Gly Val     

     #       205                                                               

     - Lys Phe Tyr Asn Asp Ser Lys Ser Thr Asn Il - #e Leu Ala Thr Gln Lys     

     #   220                                                                   

     - Ala Leu Ser Gly Phe Asp Asn Ser Lys Val Va - #l Leu Ile Ala Gly Gly     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Leu Asp Arg Gly Asn Glu Phe Asp Glu Leu Va - #l Pro Asp Ile Thr Gly     

     #               255                                                       

     - Leu Lys Lys Met Val Ile Leu Gly Gln Ser Al - #a Glu Arg Val Lys Arg     

     #           270                                                           

     - Ala Ala Asp Lys Ala Gly Val Ala Tyr Val Gl - #u Ala Thr Asp Ile Ala     

     #       285                                                               

     - Asp Ala Thr Arg Lys Ala Tyr Glu Leu Ala Th - #r Gln Gly Asp Val Val     

     #   300                                                                   

     - Leu Leu Ser Pro Ala Asn Ala Ser Trp Asp Me - #t Tyr Ala Asn Phe Glu     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Val Arg Gly Asp Leu Phe Ile Asp Thr Val Al - #a Glu Leu Lys Glu         

     #               335                                                       

     - (2) INFORMATION FOR SEQ ID NO: 111:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 412 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #111: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - AAGTTCATCG AAGATGGTTG GGAAGTCCAC TATATCGGGG ACAAGTGTGG TA - #TCGAACAC   

       60                                                                      

     - CAAGAAATCC TTAAGTCAGG TTTGGATGTC ACCTTCCATT CTATTGCGAC TG - #GAAAATTG   

      120                                                                      

     - CGTCGCTATT TCTCTTGGCA AAATATGCTG GACGTCTTCA AAGTTGGTTG GG - #GAATTGTC   

      180                                                                      

     - CAATCGCTCT TTATCATGTT GCGACTGCGT CCACAGACCC TTTTTTCAAA GG - #GGGGCTTT   

      240                                                                      

     - GTCTCAGTAC CGCCTGTTAT CGCTGCGCGT GTGTCAGGAG TGCCTGTCTT TA - #TTCACGAA   

      300                                                                      

     - TCTGACCTGT CTATGGGCTT GGCCAATAAA ATCGCCTATA AATTTGCGAC TA - #AGATGTAT   

      360                                                                      

     - TCAACCTTTG AACAAGCTTC GAGTTTGGCT AAGGTTGAGC ATGTGGGAGC GG - #           

      412                                                                      

     - (2) INFORMATION FOR SEQ ID NO:112:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 137 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:112:                           

     - Ser Ser Ser Lys Met Val Gly Lys Ser Thr Il - #e Ser Gly Thr Ser Val     

     #                15                                                       

     - Val Ser Asn Thr Lys Lys Ser Leu Ser Gln Va - #l Trp Met Ser Pro Ser     

     #            30                                                           

     - Ile Leu Leu Arg Leu Glu Asn Cys Val Ala Il - #e Ser Leu Gly Lys Ile     

     #        45                                                               

     - Cys Trp Thr Ser Ser Lys Leu Val Gly Glu Le - #u Ser Asn Arg Ser Leu     

     #    60                                                                   

     - Ser Cys Cys Asp Cys Val His Arg Pro Phe Ph - #e Gln Arg Gly Ala Leu     

     #80                                                                       

     - Ser Gln Tyr Arg Leu Leu Ser Leu Arg Val Cy - #s Gln Glu Cys Leu Ser     

     #                95                                                       

     - Leu Phe Thr Asn Leu Thr Cys Leu Trp Ala Tr - #p Pro Ile Lys Ser Pro     

     #           110                                                           

     - Ile Asn Leu Arg Leu Arg Cys Ile Gln Pro Le - #u Asn Lys Leu Arg Val     

     #       125                                                               

     - Trp Leu Arg Leu Ser Met Trp Glu Arg                                     

     #   135                                                                   

     - (2) INFORMATION FOR SEQ ID NO: 113:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 544 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #113: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - ATCGCTAGCT AGTGAAATGC AAGAAAGTAC ACGTAAATTC AAGGTTACTG CT - #GACCTAAC   

       60                                                                      

     - AGATGCCGGT GTTGGAACGA TTGAAGTTCC TTTGAGCATT GAAGATTTAC CC - #AATGGGCT   

      120                                                                      

     - GACCGCTGTG GCGACTCCGC AAAAAATTAC AGTCAAGATT GGTAAGAAGG CT - #CAGAAGGA   

      180                                                                      

     - TAAGGTAAAG ATTGTACCAG AGATTGACCC TAGTCAAATT GATAGTCGGG TA - #CAAATTGA   

      240                                                                      

     - AAATGTCATG GTGTCAGATA AAGAAGTGTC TATTACGAGT GACCAAGAGA CA - #TTGGATAG   

      300                                                                      

     - AATTGATAAG ATTATCGCTG TTTTGCCAAC TAGCGAACGT ATAACAGGTA AT - #TACAGTGG   

      360                                                                      

     - TTCAGTACCT TTGCAGGCAA TCGACCGCAA TGGTGTTGTC TTACCGGCAG TT - #ATCACTCC   

      420                                                                      

     - GTTTGATACA ATAATGAAGG TGACTACAAA ACCAGTAGCA CCAAGTTCAA GC - #ACATCAAA   

      480                                                                      

     - TTCAAGTACA AGCAGTTCAT CGGAGACATC TTCGTCAACG AAAGCAACTA GT - #TCAAAAAC   

      540                                                                      

     #            544                                                          

     - (2) INFORMATION FOR SEQ ID NO:114:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 181 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:114:                           

     - Ser Leu Ala Ser Glu Met Gln Glu Ser Thr Ar - #g Lys Phe Lys Val Thr     

     #                15                                                       

     - Ala Asp Leu Thr Asp Ala Gly Val Gly Thr Il - #e Glu Val Pro Leu Ser     

     #            30                                                           

     - Ile Glu Asp Leu Pro Asn Gly Leu Thr Ala Va - #l Ala Thr Pro Gln Lys     

     #        45                                                               

     - Ile Thr Val Lys Ile Gly Lys Lys Ala Gln Ly - #s Asp Lys Val Lys Ile     

     #    60                                                                   

     - Val Pro Glu Ile Asp Pro Ser Gln Ile Asp Se - #r Arg Val Gln Ile Glu     

     #80                                                                       

     - Asn Val Met Val Ser Asp Lys Glu Val Ser Il - #e Thr Ser Asp Gln Glu     

     #                95                                                       

     - Thr Leu Asp Arg Ile Asp Lys Ile Ile Ala Va - #l Leu Pro Thr Ser Glu     

     #           110                                                           

     - Arg Ile Thr Gly Asn Tyr Ser Gly Ser Val Pr - #o Leu Gln Ala Ile Asp     

     #       125                                                               

     - Arg Asn Gly Val Val Leu Pro Ala Val Ile Th - #r Pro Phe Asp Thr Ile     

     #   140                                                                   

     - Met Lys Val Thr Thr Lys Pro Val Ala Pro Se - #r Ser Ser Thr Ser Asn     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Ser Ser Thr Ser Ser Ser Ser Glu Thr Ser Se - #r Ser Thr Lys Ala Thr     

     #               175                                                       

     - Ser Ser Lys Thr Asn                                                     

                 180                                                           

     - (2) INFORMATION FOR SEQ ID NO: 115:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1267 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #115: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - GCACCAGATG GGGCACAAGG TTCAGGGATC AGATGTTGAA AAGTACTACT TT - #ACCCAACG   

       60                                                                      

     - CGGTCTTGAG CAGGCAGGAA TTACCATTCT TCCTTTTGAT GAAAAAAATC TA - #GACGGTGA   

      120                                                                      

     - TATGGAAATT ATCGCTGGAA ATGCCTTTCG TCCAGATAAC AACGTCGAAA TT - #GCCTATGC   

      180                                                                      

     - GGACCAAAAT GGTATCAGCT ACAAACGTTA CCATGAGTTT CTAGGTAGCT TT - #ATGCGTGA   

      240                                                                      

     - CTTTGTTAGC ATGGGAGTAG CAGGAGCACA TGGAAAAACT TCAACGACAG GT - #ATGTTGTC   

      300                                                                      

     - TCATGTCTTG TCTCACATTA CAGATACCAG CTTCTTGATT GGAGATGGGA CA - #GGTCGTGG   

      360                                                                      

     - TTCGGCCAAT GCCAAATATT TTGTCTTTGA ATCTGACGAA TATGAGCGTC AC - #TTCATGCC   

      420                                                                      

     - TTACCACCCA GAATACTCTA TTATCACCAA CATTGACTTT GACCATCCAG AT - #TATTTCAC   

      480                                                                      

     - AAGTCTCGAG GATGTTTTTA ATGCCTTTAA CGACTATGCC AAACAAATCA CC - #AAGGGTCT   

      540                                                                      

     - TTTTGTCTAT GGTGAAGATG CTGAATTGCG TAAGATTACG TCTGATGCAC CA - #ATTTATTA   

      600                                                                      

     - TTATGGTTTT GAAGCTGAAG GCAATGACTT TGTAGCTAGT GATCTTCTTC GT - #TCAATAAC   

      660                                                                      

     - TGGTTCAACC TTCACCGTTC ATTTCCGTGG ACAAAACTTG GGGCAATTCC AC - #ATTCCAAC   

      720                                                                      

     - CTTTGGTCGT CACAATATCA TGAATGCGAC AGCCGTTATT GGTCTTCTTT AC - #ACAGCAGG   

      780                                                                      

     - ATTTGATTTG AACTTGGTGC GTGAGCACTT GAAAACATTT GCCGGTGTTA AA - #CGTCGTTT   

      840                                                                      

     - CACTGAGAAA ATTGTCAATG ATACAGTGAT TATCGATGAC TTTGCCCACC AT - #CCAACAGA   

      900                                                                      

     - AATTATTGCG ACCTTGGATG CGGCTCGTCA GAAATACCCA AGCAAGGAAA TT - #GTAGCAGT   

      960                                                                      

     - CTTTCAACCG CATACCTTTA CAAGAACCAT TGCCTTGTTG GACGACTTTG CC - #CATGCTTT   

     1020                                                                      

     - AAACCAAGCA GATGCTGTTT ATCTAGCGCA AATTTATGGC TCGGCTCGTG AA - #GTAGATCA   

     1080                                                                      

     - TGGTGACGTT AAGGTAGAAG ACCTAGCCAA CAAAATCAAC AAAAAACACC AA - #GTGATTAC   

     1140                                                                      

     - TGTTGAAAAT GTTTCTCCAC TCCTAGACCA TGACAATGCT GTTTACGTCT TT - #ATGGGAGC   

     1200                                                                      

     - AGGAGACATC CAAACCTATG AATACTCATT TGAGCGTCTC TTGTCTAACT TG - #ACAAGCAA   

     1260                                                                      

     #        1267                                                             

     - (2) INFORMATION FOR SEQ ID NO:116:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 422 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:116:                           

     - His Gln Met Gly His Lys Val Gln Gly Ser As - #p Val Glu Lys Tyr Tyr     

     #                15                                                       

     - Phe Thr Gln Arg Gly Leu Glu Gln Ala Gly Il - #e Thr Ile Leu Pro Phe     

     #            30                                                           

     - Asp Glu Lys Asn Leu Asp Gly Asp Met Glu Il - #e Ile Ala Gly Asn Ala     

     #        45                                                               

     - Phe Arg Pro Asp Asn Asn Val Glu Ile Ala Ty - #r Ala Asp Gln Asn Gly     

     #    60                                                                   

     - Ile Ser Tyr Lys Arg Tyr His Glu Phe Leu Gl - #y Ser Phe Met Arg Asp     

     #80                                                                       

     - Phe Val Ser Met Gly Val Ala Gly Ala His Gl - #y Lys Thr Ser Thr Thr     

     #                95                                                       

     - Gly Met Leu Ser His Val Leu Ser His Ile Th - #r Asp Thr Ser Phe Leu     

     #           110                                                           

     - Ile Gly Asp Gly Thr Gly Arg Gly Ser Ala As - #n Ala Lys Tyr Phe Val     

     #       125                                                               

     - Phe Glu Ser Asp Glu Tyr Glu Arg His Phe Me - #t Pro Tyr His Pro Glu     

     #   140                                                                   

     - Tyr Ser Ile Ile Thr Asn Ile Asp Phe Asp Hi - #s Pro Asp Tyr Phe Thr     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Ser Leu Glu Asp Val Phe Asn Ala Phe Asn As - #p Tyr Ala Lys Gln Ile     

     #               175                                                       

     - Thr Lys Gly Leu Phe Val Tyr Gly Glu Asp Al - #a Glu Leu Arg Lys Ile     

     #           190                                                           

     - Thr Ser Asp Ala Pro Ile Tyr Tyr Tyr Gly Ph - #e Glu Ala Glu Gly Asn     

     #       205                                                               

     - Asp Phe Val Ala Ser Asp Leu Leu Arg Ser Il - #e Thr Gly Ser Thr Phe     

     #   220                                                                   

     - Thr Val His Phe Arg Gly Gln Asn Leu Gly Gl - #n Phe His Ile Pro Thr     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Phe Gly Arg His Asn Ile Met Asn Ala Thr Al - #a Val Ile Gly Leu Leu     

     #               255                                                       

     - Tyr Thr Ala Gly Phe Asp Leu Asn Leu Val Ar - #g Glu His Leu Lys Thr     

     #           270                                                           

     - Phe Ala Gly Val Lys Arg Arg Phe Thr Glu Ly - #s Ile Val Asn Asp Thr     

     #       285                                                               

     - Val Ile Ile Asp Asp Phe Ala His His Pro Th - #r Glu Ile Ile Ala Thr     

     #   300                                                                   

     - Leu Asp Ala Ala Arg Gln Lys Tyr Pro Ser Ly - #s Glu Ile Val Ala Val     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Phe Gln Pro His Thr Phe Thr Arg Thr Ile Al - #a Leu Leu Asp Asp Phe     

     #               335                                                       

     - Ala His Ala Leu Asn Gln Ala Asp Ala Val Ty - #r Leu Ala Gln Ile Tyr     

     #           350                                                           

     - Gly Ser Ala Arg Glu Val Asp His Gly Asp Va - #l Lys Val Glu Asp Leu     

     #       365                                                               

     - Ala Asn Lys Ile Asn Lys Lys His Gln Val Il - #e Thr Val Glu Asn Val     

     #   380                                                                   

     - Ser Pro Leu Leu Asp His Asp Asn Ala Val Ty - #r Val Phe Met Gly Ala     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Gly Asp Ile Gln Thr Tyr Glu Tyr Ser Phe Gl - #u Arg Leu Leu Ser Asn     

     #               415                                                       

     - Leu Thr Ser Asn Val Gln                                                 

                 420                                                           

     - (2) INFORMATION FOR SEQ ID NO: 117:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 3121 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #117: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TTTTAACCCA ACTGTTGGTA CTTTCCTTTT TACTGCAGGA TTGAGCTTGT TA - #GTTTTATT   

       60                                                                      

     - GGTTTCTAAA AGGGAAAATG GAAAGAAACG ACTTGTTCAT TTTCTGCTGT TG - #ACTAGCAT   

      120                                                                      

     - GGGAGTTCAA TTGTTGCCGG CCAGTGCTTT TGGGTTGACC AGCCAGATTT TA - #TCTGCCTA   

      180                                                                      

     - TAATAGTCAG CTTTCTATCG GAGTCGGGGA ACATTTACCA GAGCCTCTGA AA - #ATCGAAGG   

      240                                                                      

     - TTATCAATAT ATTGGTTATA TCAAAACTAA GAAACAGGAT AATACAGAGC TT - #TCAAGGAC   

      300                                                                      

     - AGTTGATGGG AAATACTCTG CTCAAAGAGA TAGTCAACCA AACTCTACAA AA - #ACATCAGA   

      360                                                                      

     - TGTAGTTCAT TCAGCTGATT TAGAATGGAA CCAAGGACAG GGGAAGGTTA GT - #TTACAAGG   

      420                                                                      

     - TGAAGCATCA GGGGATGATG GACTTTCAGA AAAATCTTCT ATAGCAGCAG AC - #AATCTATC   

      480                                                                      

     - TTCTAATGAT TCATTCGCAA GTCAAGTTGA GCAGAATCCG GATCACAAAG GA - #GAATCTGT   

      540                                                                      

     - AGTTCGACCA ACAGTGCCAG AACAAGGAAA TCCTGTGTCT GCTACAACGG TG - #CAGAGTGC   

      600                                                                      

     - GGAAGAGGAA GTATTGGCGA CGACAAATGA TCGACCAGAG TATAAACTTC CA - #TTGGAAAC   

      660                                                                      

     - CAAAGGCACG CAAGAACCCG GTCATGAGGG TGAAGCCGCA GTCCGTGAAG AC - #TTACCAGT   

      720                                                                      

     - CTACACTAAG CCACTAGAAA CCAAAGGTAC ACAAGGACCC GGACATGAAG GT - #GAAGCTGC   

      780                                                                      

     - AGTTCGCGAG GAAGAACCAG CTTACACAGA ACCGTTAGCA ACGAAAGGCA CG - #CAAGAGCC   

      840                                                                      

     - AGGTCATGAG GGCAAAGCTA CAGTCCGCGA AGAGACTCTA GAGTACACGG AA - #CCGGTAGC   

      900                                                                      

     - GACAAAAGGC ACACAAGAAC CCGAACATGA GGGCGAaCGG sCAGTAGAAG AA - #GAACTTCC   

      960                                                                      

     - GGCTTTAGAG GTCACTACAC GAAATAGAAC GGAAATCCAG AATATTCCTT AT - #ACAACAGA   

     1020                                                                      

     - AGAAATTCAG GATCCAACAC TTCTGAAAAA TCGTCGTAAG ATTGAACGAC AA - #GGGCAAGC   

     1080                                                                      

     - AGGGACACGT ACAATTCAAT ATGAAGACTA CATCGTAAAT GGTAATGTCG TA - #GAAACTAA   

     1140                                                                      

     - AGAAGTGTCA CGAACTGAAG TAGCTCCGGT CAACGAAGTC GTTAAAGTAG GA - #ACACTTGT   

     1200                                                                      

     - GAAAGTTAAA CCTACAGTAG AAATTACAAA CTTAACAAAA GTTGAGAACA AA - #AAATCTAT   

     1260                                                                      

     - AACTGTAAGT TATAACTTAA TAGACACTAC CTCAGCATAT GTTTCTGCAA AA - #ACGCAAGT   

     1320                                                                      

     - TTTCCATGGA GACAAGCTAG TTAAAGAGGT GGATATAGAA AATCCTGCCA AA - #GAGCAAGT   

     1380                                                                      

     - AATATCAGGT TTAGATTACT ACACACCGTA TACAGTTAAA ACACACCTAA CT - #TATAATTT   

     1440                                                                      

     - GGGTGAAAAT AATGAGGAAA ATACTGAAAC ATCAACTCAA GATTTCCAAT TA - #GAGTATAA   

     1500                                                                      

     - GAAAATAGAG ATTAAAGATA TTGATTCAGT AGAATTATAC GGTAAAGAAA AT - #GATCGTTA   

     1560                                                                      

     - TCGTAGATAT TTAAGTCTAA GTGAAGCGCC GACTGATACG GCTAAATACT TT - #GTAAAAGT   

     1620                                                                      

     - GAAATCAGAT CGCTTCAAAG AAATGTACCT ACCTGTAAAA TCTATTACAG AA - #AATACGGA   

     1680                                                                      

     - TGGAACGTAT AAAGTGACGG TAGCCGTTGA TCAACTTGTC GAAGAAGGTA CA - #GACGGTTA   

     1740                                                                      

     - CAAAGATGAT TACACATTTA CTGTAGCTAA ATCTAAAGCA GAGCAACCAG GA - #GTTTACAC   

     1800                                                                      

     - ATCCTTTAAA CAGCTGGTAA CAGCCATGCA AAGCAATCTG TCTGGTGTCT AT - #ACATTGGC   

     1860                                                                      

     - TTCAGATATG ACCGCAGATG AGGTGAGCTT AGGCGATAAG CAGACAAGTT AT - #CTCACAGG   

     1920                                                                      

     - TGCATTTACA GGGAGCTTGA TCGGTTCTGA TGGAACAAAA TCGTATGCCA TT - #TATGATTT   

     1980                                                                      

     - GAAGAAACCA TTATTTGATA CATTAAATGG TGCTACAGTT AGAGATTTGG AT - #ATTAAAAC   

     2040                                                                      

     - TGTTTCTGCT GATAGTAAAG AAAATGTCGC AGCGCTGGCG AAGGCAGCGA AT - #AGCGCGAA   

     2100                                                                      

     - TATTAATAAT GTTGCAGTAG AAGGAAAAAT CTCAGGTGCG AAATCTGTTG CG - #GGATTAGT   

     2160                                                                      

     - AGCGAGCGCA ACAAATACAG TGATAGAAAA CAGCTCGTTT ACAGGGAAAC TT - #ATCGCAAA   

     2220                                                                      

     - TCACCAGGAC AGTAATAAAA ATGATACTGG AGGAATAGTA GGTAATATAA CA - #GGAAATAG   

     2280                                                                      

     - TTCGAGAGTT AATAAAGTTA GGGTAGATGC CTTAATCTCT ACTAATGCAC GC - #AATAATAA   

     2340                                                                      

     - CCAAACAGCT GGAGGGATAG TAGGTAGATT AGAAAATGGT GCATTGATAT CT - #AATTCGGT   

     2400                                                                      

     - TGCTACTGGA GAAATACGAA ATGGTCAAGG ATATTCTAGA GTCGGAGGAA TA - #GTAGGATC   

     2460                                                                      

     - TACGTGGCAA AACGGTCGAG TAAATAATGT TGTGAGTAAC GTAGATGTTG GA - #GATGGTTA   

     2520                                                                      

     - TGTTATCACC GGTGATCAAT ACGCAGCAGC AGATGTGAAA AATGCAAGTA CA - #TCAGTTGA   

     2580                                                                      

     - TAATAGAAAA GCAGACAGAT TCGCTACAAA ATTATCAAAA GACCAAATAG AC - #GCGAAAGT   

     2640                                                                      

     - TGCTGATTAT GGAATCACAG TAACTCTTGA TGATACTGGG CAAGATTTAA AA - #CGTAATCT   

     2700                                                                      

     - AAGAGAAGTT GATTATACAA GACTAAATAA AGCAGAAGCT GAAAGAAAAG TA - #GCTTATAG   

     2760                                                                      

     - CAACATAGAA AAACTGATGC CATTCTACAA TAAAGACCTA GTAGTTCACT AT - #GGTAACAA   

     2820                                                                      

     - AGTAGCGACA ACAGATAAAC TTTACACTAC AGAATTGTTA GATGTTGTGC CG - #ATGAAAGA   

     2880                                                                      

     - TGATGAAGTA GTAACGGATA TTAATAATAA GAAAAATTCA ATAAATAAAG TT - #ATGTTACA   

     2940                                                                      

     - TTTCAAAGAT AATACAGTAG AATACCTAGA TGTAACATTC AAAGAAAACT TC - #ATAAACAG   

     3000                                                                      

     - TCAAGTAATC GAATACAATG TTACAGGAAA AGAATATATA TTCACACCAG AA - #GCATTTGT   

     3060                                                                      

     - TTCAGACTAT ACAGCGATAA CGAATAACGT ACTAAGCGAC TTGCAAAATG TA - #ACACTTAA   

     3120                                                                      

     #             3121                                                        

     - (2) INFORMATION FOR SEQ ID NO:118:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 1040 amino                                          

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:118:                           

     - Phe Asn Pro Thr Val Gly Thr Phe Leu Phe Th - #r Ala Gly Leu Ser Leu     

     #                15                                                       

     - Leu Val Leu Leu Val Ser Lys Arg Glu Asn Gl - #y Lys Lys Arg Leu Val     

     #            30                                                           

     - His Phe Leu Leu Leu Thr Ser Met Gly Val Gl - #n Leu Leu Pro Ala Ser     

     #        45                                                               

     - Ala Phe Gly Leu Thr Ser Gln Ile Leu Ser Al - #a Tyr Asn Ser Gln Leu     

     #    60                                                                   

     - Ser Ile Gly Val Gly Glu His Leu Pro Glu Pr - #o Leu Lys Ile Glu Gly     

     #80                                                                       

     - Tyr Gln Tyr Ile Gly Tyr Ile Lys Thr Lys Ly - #s Gln Asp Asn Thr Glu     

     #                95                                                       

     - Leu Ser Arg Thr Val Asp Gly Lys Tyr Ser Al - #a Gln Arg Asp Ser Gln     

     #           110                                                           

     - Pro Asn Ser Thr Lys Thr Ser Asp Val Val Hi - #s Ser Ala Asp Leu Glu     

     #       125                                                               

     - Trp Asn Gln Gly Gln Gly Lys Val Ser Leu Gl - #n Gly Glu Ala Ser Gly     

     #   140                                                                   

     - Asp Asp Gly Leu Ser Glu Lys Ser Ser Ile Al - #a Ala Asp Asn Leu Ser     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Ser Asn Asp Ser Phe Ala Ser Gln Val Glu Gl - #n Asn Pro Asp His Lys     

     #               175                                                       

     - Gly Glu Ser Val Val Arg Pro Thr Val Pro Gl - #u Gln Gly Asn Pro Val     

     #           190                                                           

     - Ser Ala Thr Thr Val Gln Ser Ala Glu Glu Gl - #u Val Leu Ala Thr Thr     

     #       205                                                               

     - Asn Asp Arg Pro Glu Tyr Lys Leu Pro Leu Gl - #u Thr Lys Gly Thr Gln     

     #   220                                                                   

     - Glu Pro Gly His Glu Gly Glu Ala Ala Val Ar - #g Glu Asp Leu Pro Val     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Tyr Thr Lys Pro Leu Glu Thr Lys Gly Thr Gl - #n Gly Pro Gly His Glu     

     #               255                                                       

     - Gly Glu Ala Ala Val Arg Glu Glu Glu Pro Al - #a Tyr Thr Glu Pro Leu     

     #           270                                                           

     - Ala Thr Lys Gly Thr Gln Glu Pro Gly His Gl - #u Gly Lys Ala Thr Val     

     #       285                                                               

     - Arg Glu Glu Thr Leu Glu Tyr Thr Glu Pro Va - #l Ala Thr Lys Gly Thr     

     #   300                                                                   

     - Gln Glu Pro Glu His Glu Gly Glu Arg Xaa Va - #l Glu Glu Glu Leu Pro     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Ala Leu Glu Val Thr Thr Arg Asn Arg Thr Gl - #u Ile Gln Asn Ile Pro     

     #               335                                                       

     - Tyr Thr Thr Glu Glu Ile Gln Asp Pro Thr Le - #u Leu Lys Asn Arg Arg     

     #           350                                                           

     - Lys Ile Glu Arg Gln Gly Gln Ala Gly Thr Ar - #g Thr Ile Gln Tyr Glu     

     #       365                                                               

     - Asp Tyr Ile Val Asn Gly Asn Val Val Glu Th - #r Lys Glu Val Ser Arg     

     #   380                                                                   

     - Thr Glu Val Ala Pro Val Asn Glu Val Val Ly - #s Val Gly Thr Leu Val     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Lys Val Lys Pro Thr Val Glu Ile Thr Asn Le - #u Thr Lys Val Glu Asn     

     #               415                                                       

     - Lys Lys Ser Ile Thr Val Ser Tyr Asn Leu Il - #e Asp Thr Thr Ser Ala     

     #           430                                                           

     - Tyr Val Ser Ala Lys Thr Gln Val Phe His Gl - #y Asp Lys Leu Val Lys     

     #       445                                                               

     - Glu Val Asp Ile Glu Asn Pro Ala Lys Glu Gl - #n Val Ile Ser Gly Leu     

     #   460                                                                   

     - Asp Tyr Tyr Thr Pro Tyr Thr Val Lys Thr Hi - #s Leu Thr Tyr Asn Leu     

     465                 4 - #70                 4 - #75                 4 -   

     #80                                                                       

     - Gly Glu Asn Asn Glu Glu Asn Thr Glu Thr Se - #r Thr Gln Asp Phe Gln     

     #               495                                                       

     - Leu Glu Tyr Lys Lys Ile Glu Ile Lys Asp Il - #e Asp Ser Val Glu Leu     

     #           510                                                           

     - Tyr Gly Lys Glu Asn Asp Arg Tyr Arg Arg Ty - #r Leu Ser Leu Ser Glu     

     #       525                                                               

     - Ala Pro Thr Asp Thr Ala Lys Tyr Phe Val Ly - #s Val Lys Ser Asp Arg     

     #   540                                                                   

     - Phe Lys Glu Met Tyr Leu Pro Val Lys Ser Il - #e Thr Glu Asn Thr Asp     

     545                 5 - #50                 5 - #55                 5 -   

     #60                                                                       

     - Gly Thr Tyr Lys Val Thr Val Ala Val Asp Gl - #n Leu Val Glu Glu Gly     

     #               575                                                       

     - Thr Asp Gly Tyr Lys Asp Asp Tyr Thr Phe Th - #r Val Ala Lys Ser Lys     

     #           590                                                           

     - Ala Glu Gln Pro Gly Val Tyr Thr Ser Phe Ly - #s Gln Leu Val Thr Ala     

     #       605                                                               

     - Met Gln Ser Asn Leu Ser Gly Val Tyr Thr Le - #u Ala Ser Asp Met Thr     

     #   620                                                                   

     - Ala Asp Glu Val Ser Leu Gly Asp Lys Gln Th - #r Ser Tyr Leu Thr Gly     

     625                 6 - #30                 6 - #35                 6 -   

     #40                                                                       

     - Ala Phe Thr Gly Ser Leu Ile Gly Ser Asp Gl - #y Thr Lys Ser Tyr Ala     

     #               655                                                       

     - Ile Tyr Asp Leu Lys Lys Pro Leu Phe Asp Th - #r Leu Asn Gly Ala Thr     

     #           670                                                           

     - Val Arg Asp Leu Asp Ile Lys Thr Val Ser Al - #a Asp Ser Lys Glu Asn     

     #       685                                                               

     - Val Ala Ala Leu Ala Lys Ala Ala Asn Ser Al - #a Asn Ile Asn Asn Val     

     #   700                                                                   

     - Ala Val Glu Gly Lys Ile Ser Gly Ala Lys Se - #r Val Ala Gly Leu Val     

     705                 7 - #10                 7 - #15                 7 -   

     #20                                                                       

     - Ala Ser Ala Thr Asn Thr Val Ile Glu Asn Se - #r Ser Phe Thr Gly Lys     

     #               735                                                       

     - Leu Ile Ala Asn His Gln Asp Ser Asn Lys As - #n Asp Thr Gly Gly Ile     

     #           750                                                           

     - Val Gly Asn Ile Thr Gly Asn Ser Ser Arg Va - #l Asn Lys Val Arg Val     

     #       765                                                               

     - Asp Ala Leu Ile Ser Thr Asn Ala Arg Asn As - #n Asn Gln Thr Ala Gly     

     #   780                                                                   

     - Gly Ile Val Gly Arg Leu Glu Asn Gly Ala Le - #u Ile Ser Asn Ser Val     

     785                 7 - #90                 7 - #95                 8 -   

     #00                                                                       

     - Ala Thr Gly Glu Ile Arg Asn Gly Gln Gly Ty - #r Ser Arg Val Gly Gly     

     #               815                                                       

     - Ile Val Gly Ser Thr Trp Gln Asn Gly Arg Va - #l Asn Asn Val Val Ser     

     #           830                                                           

     - Asn Val Asp Val Gly Asp Gly Tyr Val Ile Th - #r Gly Asp Gln Tyr Ala     

     #       845                                                               

     - Ala Ala Asp Val Lys Asn Ala Ser Thr Ser Va - #l Asp Asn Arg Lys Ala     

     #   860                                                                   

     - Asp Arg Phe Ala Thr Lys Leu Ser Lys Asp Gl - #n Ile Asp Ala Lys Val     

     865                 8 - #70                 8 - #75                 8 -   

     #80                                                                       

     - Ala Asp Tyr Gly Ile Thr Val Thr Leu Asp As - #p Thr Gly Gln Asp Leu     

     #               895                                                       

     - Lys Arg Asn Leu Arg Glu Val Asp Tyr Thr Ar - #g Leu Asn Lys Ala Glu     

     #           910                                                           

     - Ala Glu Arg Lys Val Ala Tyr Ser Asn Ile Gl - #u Lys Leu Met Pro Phe     

     #       925                                                               

     - Tyr Asn Lys Asp Leu Val Val His Tyr Gly As - #n Lys Val Ala Thr Thr     

     #   940                                                                   

     - Asp Lys Leu Tyr Thr Thr Glu Leu Leu Asp Va - #l Val Pro Met Lys Asp     

     945                 9 - #50                 9 - #55                 9 -   

     #60                                                                       

     - Asp Glu Val Val Thr Asp Ile Asn Asn Lys Ly - #s Asn Ser Ile Asn Lys     

     #               975                                                       

     - Val Met Leu His Phe Lys Asp Asn Thr Val Gl - #u Tyr Leu Asp Val Thr     

     #           990                                                           

     - Phe Lys Glu Asn Phe Ile Asn Ser Gln Val Il - #e Glu Tyr Asn Val Thr     

     #      10050                                                              

     - Gly Lys Glu Tyr Ile Phe Thr Pro Glu Ala Ph - #e Val Ser Asp Tyr Thr     

     #  10205                                                                  

     - Ala Ile Thr Asn Asn Val Leu Ser Asp Leu Gl - #n Asn Val Thr Leu Asn     

     #               10401030 - #                1035                          

     - (2) INFORMATION FOR SEQ ID NO: 119:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1567 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #119: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TTTTAACCCA ACTGTTGGTA CTTTCCTTTT TACTGCAGGA TTGAGCTTGT TA - #GTTTTATT   

       60                                                                      

     - GGTTTCTAAA AGGGAAAATG GAAAGAAACG ACTTGTTCAT TTTCTGCTGT TG - #ACTAGCAT   

      120                                                                      

     - GGGAGTTCAA TTGTTGCCGG CCAGTGCTTT TGGGTTGACC AGCCAGATTT TA - #TCTGCCTA   

      180                                                                      

     - TAATAGTCAG CTTTCTATCG GAGTCGGGGA ACATTTACCA GAGCCTCTGA AA - #ATCGAAGG   

      240                                                                      

     - TTATCAATAT ATTGGTTATA TCAAAACTAA GAAACAGGAT AATACAGAGC TT - #TCAAGGAC   

      300                                                                      

     - AGTTGATGGG AAATACTCTG CTCAAAGAGA TAGTCAACCA AACTCTACAA AA - #ACATCAGA   

      360                                                                      

     - TGTAGTTCAT TCAGCTGATT TAGAATGGAA CCAAGGACAG GGGAAGGTTA GT - #TTACAAGG   

      420                                                                      

     - TGAAGCATCA GGGGATGATG GACTTTCAGA AAAATCTTCT ATAGCAGCAG AC - #AATCTATC   

      480                                                                      

     - TTCTAATGAT TCATTCGCAA GTCAAGTTGA GCAGAATCCG GATCACAAAG GA - #GAATCTGT   

      540                                                                      

     - AGTTCGACCA ACAGTGCCAG AACAAGGAAA TCCTGTGTCT GCTACAACGG TG - #CAGAGTGC   

      600                                                                      

     - GGAAGAGGAA GTATTGGCGA CGACAAATGA TCGACCAGAG TATAAACTTC CA - #TTGGAAAC   

      660                                                                      

     - CAAAGGCACG CAAGAACCCG GTCATGAGGG TGAAGCCGCA GTCCGTGAAG AC - #TTACCAGT   

      720                                                                      

     - CTACACTAAG CCACTAGAAA CCAAAGGTAC ACAAGGACCC GGACATGAAG GT - #GAAGCTGC   

      780                                                                      

     - AGTTCGCGAG GAAGAACCAG CTTACACAGA ACCGTTAGCA ACGAAAGGCA CG - #CAAGAGCC   

      840                                                                      

     - AGGTCATGAG GGCAAAGCTA CAGTCCGCGA AGAGACTCTA GAGTACACGG AA - #CCGGTAGC   

      900                                                                      

     - GACAAAAGGC ACACAAGAAC CCGAACATGA GGGCGAaCGG sCAGTAGAAG AA - #GAACTTCC   

      960                                                                      

     - GGCTTTAGAG GTCACTACAC GAAATAGAAC GGAAATCCAG AATATTCCTT AT - #ACAACAGA   

     1020                                                                      

     - AGAAATTCAG GATCCAACAC TTCTGAAAAA TCGTCGTAAG ATTGAACGAC AA - #GGGCAAGC   

     1080                                                                      

     - AGGGACACGT ACAATTCAAT ATGAAGACTA CATCGTAAAT GGTAATGTCG TA - #GAAACTAA   

     1140                                                                      

     - AGAAGTGTCA CGAACTGAAG TAGCTCCGGT CAACGAAGTC GTTAAAGTAG GA - #ACACTTGT   

     1200                                                                      

     - GAAAGTTAAA CCTACAGTAG AAATTACAAA CTTAACAAAA GTTGAGAACA AA - #AAATCTAT   

     1260                                                                      

     - AACTGTAAGT TATAACTTAA TAGACACTAC CTCAGCATAT GTTTCTGCAA AA - #ACGCAAGT   

     1320                                                                      

     - TTTCCATGGA GACAAGCTAG TTAAAGAGGT GGATATAGAA AATCCTGCCA AA - #GAGCAAGT   

     1380                                                                      

     - AATATCAGGT TTAGATTACT ACACACCGTA TACAGTTAAA ACACACCTAA CT - #TATAATTT   

     1440                                                                      

     - GGGTGAAAAT AATGAGGAAA ATACTGAAAC ATCAACTCAA GATTTCCAAT TA - #GAGTATAA   

     1500                                                                      

     - GAAAATAGAG ATTAAAGATA TTGATTCAGT AGAATTATAC GGTAAAGAAA AT - #GATCGTTA   

     1560                                                                      

     #        1567                                                             

     - (2) INFORMATION FOR SEQ ID NO:120:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 522 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:120:                           

     - Phe Asn Pro Thr Val Gly Thr Phe Leu Phe Th - #r Ala Gly Leu Ser Leu     

     #                15                                                       

     - Leu Val Leu Leu Val Ser Lys Arg Glu Asn Gl - #y Lys Lys Arg Leu Val     

     #            30                                                           

     - His Phe Leu Leu Leu Thr Ser Met Gly Val Gl - #n Leu Leu Pro Ala Ser     

     #        45                                                               

     - Ala Phe Gly Leu Thr Ser Gln Ile Leu Ser Al - #a Tyr Asn Ser Gln Leu     

     #    60                                                                   

     - Ser Ile Gly Val Gly Glu His Leu Pro Glu Pr - #o Leu Lys Ile Glu Gly     

     #80                                                                       

     - Tyr Gln Tyr Ile Gly Tyr Ile Lys Thr Lys Ly - #s Gln Asp Asn Thr Glu     

     #                95                                                       

     - Leu Ser Arg Thr Val Asp Gly Lys Tyr Ser Al - #a Gln Arg Asp Ser Gln     

     #           110                                                           

     - Pro Asn Ser Thr Lys Thr Ser Asp Val Val Hi - #s Ser Ala Asp Leu Glu     

     #       125                                                               

     - Trp Asn Gln Gly Gln Gly Lys Val Ser Leu Gl - #n Gly Glu Ala Ser Gly     

     #   140                                                                   

     - Asp Asp Gly Leu Ser Glu Lys Ser Ser Ile Al - #a Ala Asp Asn Leu Ser     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Ser Asn Asp Ser Phe Ala Ser Gln Val Glu Gl - #n Asn Pro Asp His Lys     

     #               175                                                       

     - Gly Glu Ser Val Val Arg Pro Thr Val Pro Gl - #u Gln Gly Asn Pro Val     

     #           190                                                           

     - Ser Ala Thr Thr Val Gln Ser Ala Glu Glu Gl - #u Val Leu Ala Thr Thr     

     #       205                                                               

     - Asn Asp Arg Pro Glu Tyr Lys Leu Pro Leu Gl - #u Thr Lys Gly Thr Gln     

     #   220                                                                   

     - Glu Pro Gly His Glu Gly Glu Ala Ala Val Ar - #g Glu Asp Leu Pro Val     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Tyr Thr Lys Pro Leu Glu Thr Lys Gly Thr Gl - #n Gly Pro Gly His Glu     

     #               255                                                       

     - Gly Glu Ala Ala Val Arg Glu Glu Glu Pro Al - #a Tyr Thr Glu Pro Leu     

     #           270                                                           

     - Ala Thr Lys Gly Thr Gln Glu Pro Gly His Gl - #u Gly Lys Ala Thr Val     

     #       285                                                               

     - Arg Glu Glu Thr Leu Glu Tyr Thr Glu Pro Va - #l Ala Thr Lys Gly Thr     

     #   300                                                                   

     - Gln Glu Pro Glu His Glu Gly Glu Arg Xaa Va - #l Glu Glu Glu Leu Pro     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Ala Leu Glu Val Thr Thr Arg Asn Arg Thr Gl - #u Ile Gln Asn Ile Pro     

     #               335                                                       

     - Tyr Thr Thr Glu Glu Ile Gln Asp Pro Thr Le - #u Leu Lys Asn Arg Arg     

     #           350                                                           

     - Lys Ile Glu Arg Gln Gly Gln Ala Gly Thr Ar - #g Thr Ile Gln Tyr Glu     

     #       365                                                               

     - Asp Tyr Ile Val Asn Gly Asn Val Val Glu Th - #r Lys Glu Val Ser Arg     

     #   380                                                                   

     - Thr Glu Val Ala Pro Val Asn Glu Val Val Ly - #s Val Gly Thr Leu Val     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Lys Val Lys Pro Thr Val Glu Ile Thr Asn Le - #u Thr Lys Val Glu Asn     

     #               415                                                       

     - Lys Lys Ser Ile Thr Val Ser Tyr Asn Leu Il - #e Asp Thr Thr Ser Ala     

     #           430                                                           

     - Tyr Val Ser Ala Lys Thr Gln Val Phe His Gl - #y Asp Lys Leu Val Lys     

     #       445                                                               

     - Glu Val Asp Ile Glu Asn Pro Ala Lys Glu Gl - #n Val Ile Ser Gly Leu     

     #   460                                                                   

     - Asp Tyr Tyr Thr Pro Tyr Thr Val Lys Thr Hi - #s Leu Thr Tyr Asn Leu     

     465                 4 - #70                 4 - #75                 4 -   

     #80                                                                       

     - Gly Glu Asn Asn Glu Glu Asn Thr Glu Thr Se - #r Thr Gln Asp Phe Gln     

     #               495                                                       

     - Leu Glu Tyr Lys Lys Ile Glu Ile Lys Asp Il - #e Asp Ser Val Glu Leu     

     #           510                                                           

     - Tyr Gly Lys Glu Asn Asp Arg Tyr Arg Arg                                 

     #       520                                                               

     - (2) INFORMATION FOR SEQ ID NO: 121:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1561 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #121: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TCGTAGATAT TTAAGTCTAA GTGAAGCGCC GACTGATACG GCTAAATACT TT - #GTAAAAGT   

       60                                                                      

     - GAAATCAGAT CGCTTCAAAG AAATGTACCT ACCTGTAAAA TCTATTACAG AA - #AATACGGA   

      120                                                                      

     - TGGAACGTAT AAAGTGACGG TAGCCGTTGA TCAACTTGTC GAAGAAGGTA CA - #GACGGTTA   

      180                                                                      

     - CAAAGATGAT TACACATTTA CTGTAGCTAA ATCTAAAGCA GAGCAACCAG GA - #GTTTACAC   

      240                                                                      

     - ATCCTTTAAA CAGCTGGTAA CAGCCATGCA AAGCAATCTG TCTGGTGTCT AT - #ACATTGGC   

      300                                                                      

     - TTCAGATATG ACCGCAGATG AGGTGAGCTT AGGCGATAAG CAGACAAGTT AT - #CTCACAGG   

      360                                                                      

     - TGCATTTACA GGGAGCTTGA TCGGTTCTGA TGGAACAAAA TCGTATGCCA TT - #TATGATTT   

      420                                                                      

     - GAAGAAACCA TTATTTGATA CATTAAATGG TGCTACAGTT AGAGATTTGG AT - #ATTAAAAC   

      480                                                                      

     - TGTTTCTGCT GATAGTAAAG AAAATGTCGC AGCGCTGGCG AAGGCAGCGA AT - #AGCGCGAA   

      540                                                                      

     - TATTAATAAT GTTGCAGTAG AAGGAAAAAT CTCAGGTGCG AAATCTGTTG CG - #GGATTAGT   

      600                                                                      

     - AGCGAGCGCA ACAAATACAG TGATAGAAAA CAGCTCGTTT ACAGGGAAAC TT - #ATCGCAAA   

      660                                                                      

     - TCACCAGGAC AGTAATAAAA ATGATACTGG AGGAATAGTA GGTAATATAA CA - #GGAAATAG   

      720                                                                      

     - TTCGAGAGTT AATAAAGTTA GGGTAGATGC CTTAATCTCT ACTAATGCAC GC - #AATAATAA   

      780                                                                      

     - CCAAACAGCT GGAGGGATAG TAGGTAGATT AGAAAATGGT GCATTGATAT CT - #AATTCGGT   

      840                                                                      

     - TGCTACTGGA GAAATACGAA ATGGTCAAGG ATATTCTAGA GTCGGAGGAA TA - #GTAGGATC   

      900                                                                      

     - TACGTGGCAA AACGGTCGAG TAAATAATGT TGTGAGTAAC GTAGATGTTG GA - #GATGGTTA   

      960                                                                      

     - TGTTATCACC GGTGATCAAT ACGCAGCAGC AGATGTGAAA AATGCAAGTA CA - #TCAGTTGA   

     1020                                                                      

     - TAATAGAAAA GCAGACAGAT TCGCTACAAA ATTATCAAAA GACCAAATAG AC - #GCGAAAGT   

     1080                                                                      

     - TGCTGATTAT GGAATCACAG TAACTCTTGA TGATACTGGG CAAGATTTAA AA - #CGTAATCT   

     1140                                                                      

     - AAGAGAAGTT GATTATACAA GACTAAATAA AGCAGAAGCT GAAAGAAAAG TA - #GCTTATAG   

     1200                                                                      

     - CAACATAGAA AAACTGATGC CATTCTACAA TAAAGACCTA GTAGTTCACT AT - #GGTAACAA   

     1260                                                                      

     - AGTAGCGACA ACAGATAAAC TTTACACTAC AGAATTGTTA GATGTTGTGC CG - #ATGAAAGA   

     1320                                                                      

     - TGATGAAGTA GTAACGGATA TTAATAATAA GAAAAATTCA ATAAATAAAG TT - #ATGTTACA   

     1380                                                                      

     - TTTCAAAGAT AATACAGTAG AATACCTAGA TGTAACATTC AAAGAAAACT TC - #ATAAACAG   

     1440                                                                      

     - TCAAGTAATC GAATACAATG TTACAGGAAA AGAATATATA TTCACACCAG AA - #GCATTTGT   

     1500                                                                      

     - TTCAGACTAT ACAGCGATAA CGAATAACGT ACTAAGCGAC TTGCAAAATG TA - #ACACTTAA   

     1560                                                                      

     #             1561                                                        

     - (2) INFORMATION FOR SEQ ID NO:122:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 520 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:122:                           

     - Arg Arg Tyr Leu Ser Leu Ser Glu Ala Pro Th - #r Asp Thr Ala Lys Tyr     

     #                15                                                       

     - Phe Val Lys Val Lys Ser Asp Arg Phe Lys Gl - #u Met Tyr Leu Pro Val     

     #            30                                                           

     - Lys Ser Ile Thr Glu Asn Thr Asp Gly Thr Ty - #r Lys Val Thr Val Ala     

     #        45                                                               

     - Val Asp Gln Leu Val Glu Glu Gly Thr Asp Gl - #y Tyr Lys Asp Asp Tyr     

     #    60                                                                   

     - Thr Phe Thr Val Ala Lys Ser Lys Ala Glu Gl - #n Pro Gly Val Tyr Thr     

     #80                                                                       

     - Ser Phe Lys Gln Leu Val Thr Ala Met Gln Se - #r Asn Leu Ser Gly Val     

     #                95                                                       

     - Tyr Thr Leu Ala Ser Asp Met Thr Ala Asp Gl - #u Val Ser Leu Gly Asp     

     #           110                                                           

     - Lys Gln Thr Ser Tyr Leu Thr Gly Ala Phe Th - #r Gly Ser Leu Ile Gly     

     #       125                                                               

     - Ser Asp Gly Thr Lys Ser Tyr Ala Ile Tyr As - #p Leu Lys Lys Pro Leu     

     #   140                                                                   

     - Phe Asp Thr Leu Asn Gly Ala Thr Val Arg As - #p Leu Asp Ile Lys Thr     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Val Ser Ala Asp Ser Lys Glu Asn Val Ala Al - #a Leu Ala Lys Ala Ala     

     #               175                                                       

     - Asn Ser Ala Asn Ile Asn Asn Val Ala Val Gl - #u Gly Lys Ile Ser Gly     

     #           190                                                           

     - Ala Lys Ser Val Ala Gly Leu Val Ala Ser Al - #a Thr Asn Thr Val Ile     

     #       205                                                               

     - Glu Asn Ser Ser Phe Thr Gly Lys Leu Ile Al - #a Asn His Gln Asp Ser     

     #   220                                                                   

     - Asn Lys Asn Asp Thr Gly Gly Ile Val Gly As - #n Ile Thr Gly Asn Ser     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Ser Arg Val Asn Lys Val Arg Val Asp Ala Le - #u Ile Ser Thr Asn Ala     

     #               255                                                       

     - Arg Asn Asn Asn Gln Thr Ala Gly Gly Ile Va - #l Gly Arg Leu Glu Asn     

     #           270                                                           

     - Gly Ala Leu Ile Ser Asn Ser Val Ala Thr Gl - #y Glu Ile Arg Asn Gly     

     #       285                                                               

     - Gln Gly Tyr Ser Arg Val Gly Gly Ile Val Gl - #y Ser Thr Trp Gln Asn     

     #   300                                                                   

     - Gly Arg Val Asn Asn Val Val Ser Asn Val As - #p Val Gly Asp Gly Tyr     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Val Ile Thr Gly Asp Gln Tyr Ala Ala Ala As - #p Val Lys Asn Ala Ser     

     #               335                                                       

     - Thr Ser Val Asp Asn Arg Lys Ala Asp Arg Ph - #e Ala Thr Lys Leu Ser     

     #           350                                                           

     - Lys Asp Gln Ile Asp Ala Lys Val Ala Asp Ty - #r Gly Ile Thr Val Thr     

     #       365                                                               

     - Leu Asp Asp Thr Gly Gln Asp Leu Lys Arg As - #n Leu Arg Glu Val Asp     

     #   380                                                                   

     - Tyr Thr Arg Leu Asn Lys Ala Glu Ala Glu Ar - #g Lys Val Ala Tyr Ser     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Asn Ile Glu Lys Leu Met Pro Phe Tyr Asn Ly - #s Asp Leu Val Val His     

     #               415                                                       

     - Tyr Gly Asn Lys Val Ala Thr Thr Asp Lys Le - #u Tyr Thr Thr Glu Leu     

     #           430                                                           

     - Leu Asp Val Val Pro Met Lys Asp Asp Glu Va - #l Val Thr Asp Ile Asn     

     #       445                                                               

     - Asn Lys Lys Asn Ser Ile Asn Lys Val Met Le - #u His Phe Lys Asp Asn     

     #   460                                                                   

     - Thr Val Glu Tyr Leu Asp Val Thr Phe Lys Gl - #u Asn Phe Ile Asn Ser     

     465                 4 - #70                 4 - #75                 4 -   

     #80                                                                       

     - Gln Val Ile Glu Tyr Asn Val Thr Gly Lys Gl - #u Tyr Ile Phe Thr Pro     

     #               495                                                       

     - Glu Ala Phe Val Ser Asp Tyr Thr Ala Ile Th - #r Asn Asn Val Leu Ser     

     #           510                                                           

     - Asp Leu Gln Asn Val Thr Leu Asn                                         

     #       520                                                               

     - (2) INFORMATION FOR SEQ ID NO: 123:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 850 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #123: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - CTTTGGTTTT GAAGGAAGTA AGCGTGGACA ATTTGCTGTA GAAGGAATCA AT - #CAACTTCG   

       60                                                                      

     - TGAGCATGTA GACACTCTAT TGATTATCTC AAACAACAAT TTGCTTGAAA TT - #GTTGATAA   

      120                                                                      

     - GAAAACACCG CTTTTGGAGG CTCTTAGCGA AGCGGATAAC GTTCTTCGTC AA - #GGTGTTCA   

      180                                                                      

     - AGGGATTACC GATTTGATTA CCAATCCAGG ATTGATTAAC CTTGACTTTG CC - #GATGTGAA   

      240                                                                      

     - AACGGTAATG GCAAACAAAG GGAATGCTCT TATGGGTATT GGTATCGGTA GT - #GGAGAAGA   

      300                                                                      

     - ACGTGTGGTA GAAGCGGCAC GTAAGGCAAT CTATTCACCA CTTCTTGAAA CA - #ACTATTGA   

      360                                                                      

     - CGGTGCTGAG GATGTTATCG TCAACGTTAC TGGTGGTCTT GACTTAACCT TG - #ATTGAGGC   

      420                                                                      

     - AGAAGAGGCT TCACAAATTG TGAACCAGGC AGCAGGTCAA GGAGTGAACA TC - #TGGCTCGG   

      480                                                                      

     - TACTTCAATT GATGAAAGTA TGCGTGATGA AATTCGTGTA ACAGTTGTTG CA - #ACGGGTGT   

      540                                                                      

     - TCGTCAAGAC CGCGTAGAAA AGGTTGTGGC TCCACAAGCT AGATCTGCTA CT - #AACTACCG   

      600                                                                      

     - TGAGACAGTG AAACCAGCTC ATTCACATGG CTTTGATCGT CATTTTGATA TG - #GCAGAAAC   

      660                                                                      

     - AGTTGAATTG CCAAAACAAA ATCCACGTCG TTTGGAACCA ACTCAGGCAT CT - #GCTTTTGG   

      720                                                                      

     - TGATTGGGAT CTTCGCCGTG AATCGATTGT TCGTACAACA GATTCAGTCG TT - #TCTCCAGT   

      780                                                                      

     - CGAGCGCTTT GAAGCCCCAA TTTCACAAGA TGAAGATGAA TTGGATACAC CT - #CCATTTTT   

      840                                                                      

     #       850                                                               

     - (2) INFORMATION FOR SEQ ID NO:124:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 283 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:124:                           

     - Phe Gly Phe Glu Gly Ser Lys Arg Gly Gln Ph - #e Ala Val Glu Gly Ile     

     #                15                                                       

     - Asn Gln Leu Arg Glu His Val Asp Thr Leu Le - #u Ile Ile Ser Asn Asn     

     #            30                                                           

     - Asn Leu Leu Glu Ile Val Asp Lys Lys Thr Pr - #o Leu Leu Glu Ala Leu     

     #        45                                                               

     - Ser Glu Ala Asp Asn Val Leu Arg Gln Gly Va - #l Gln Gly Ile Thr Asp     

     #    60                                                                   

     - Leu Ile Thr Asn Pro Gly Leu Ile Asn Leu As - #p Phe Ala Asp Val Lys     

     #80                                                                       

     - Thr Val Met Ala Asn Lys Gly Asn Ala Leu Me - #t Gly Ile Gly Ile Gly     

     #                95                                                       

     - Ser Gly Glu Glu Arg Val Val Glu Ala Ala Ar - #g Lys Ala Ile Tyr Ser     

     #           110                                                           

     - Pro Leu Leu Glu Thr Thr Ile Asp Gly Ala Gl - #u Asp Val Ile Val Asn     

     #       125                                                               

     - Val Thr Gly Gly Leu Asp Leu Thr Leu Ile Gl - #u Ala Glu Glu Ala Ser     

     #   140                                                                   

     - Gln Ile Val Asn Gln Ala Ala Gly Gln Gly Va - #l Asn Ile Trp Leu Gly     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Thr Ser Ile Asp Glu Ser Met Arg Asp Glu Il - #e Arg Val Thr Val Val     

     #               175                                                       

     - Ala Thr Gly Val Arg Gln Asp Arg Val Glu Ly - #s Val Val Ala Pro Gln     

     #           190                                                           

     - Ala Arg Ser Ala Thr Asn Tyr Arg Glu Thr Va - #l Lys Pro Ala His Ser     

     #       205                                                               

     - His Gly Phe Asp Arg His Phe Asp Met Ala Gl - #u Thr Val Glu Leu Pro     

     #   220                                                                   

     - Lys Gln Asn Pro Arg Arg Leu Glu Pro Thr Gl - #n Ala Ser Ala Phe Gly     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Asp Trp Asp Leu Arg Arg Glu Ser Ile Val Ar - #g Thr Thr Asp Ser Val     

     #               255                                                       

     - Val Ser Pro Val Glu Arg Phe Glu Ala Pro Il - #e Ser Gln Asp Glu Asp     

     #           270                                                           

     - Glu Leu Asp Thr Pro Pro Phe Phe Lys Asn Ar - #g                         

     #       280                                                               

     - (2) INFORMATION FOR SEQ ID NO: 125:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1051 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #125: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - CTACTACCTC TCGAGAGAAA GTGACCTAGA GGTGACCGTT TTTGACCATG AG - #CAAGGTCA   

       60                                                                      

     - AGCCACCAAG GCCGCAGCAG GAATTATCAG TCCTTGGTTT TCCAAACGCC GT - #AATAAAGC   

      120                                                                      

     - CTGGTACAAG ATGGCGCGCT TGGGGGCTGA TTTTTATGTG GATTTATTAG CT - #GATTTAGA   

      180                                                                      

     - GAAATCAGGA CAAGAAATCG ACTTTTACCA GCGTTCGGGA GTCTTTCTCT TG - #AAAAAGGA   

      240                                                                      

     - TGAATCCAAT TTGGAAGAAC TTTATCAACT GGCCCTCCAG CGCAGAGAAG AA - #TCTCCCTT   

      300                                                                      

     - GATAGGGCAA TTAGCCATTC TGAACCAAGC CTCAGCTAAT GAATTATTCC CT - #GGTTTGCA   

      360                                                                      

     - GGGATTTGAC CGCCTGCTCT ATGCTTCTGG TGGAGCGAGA GTAGATGGCC AA - #CTTTTAGT   

      420                                                                      

     - GACTCGTTTG CTGGAAGTCA GTCATGTCAA GCTGGTCAAA GAAAAAGTGA CT - #CTGACACC   

      480                                                                      

     - GTTAGCATCA GGCTACCAGA TTGGTGAAGA GGAGTTTGAG CAGGTTATTT TG - #GCGACGGG   

      540                                                                      

     - AGCTTGGTTG GGGGACATGT TAGAGCCTTT AGGTTATGAA GTGGATGTCC GT - #CCTCAAAA   

      600                                                                      

     - AGGACAACTA CGAGATTATC AGCTTGCCCA AGACATGGAA GATTACCCTG TT - #GTCATGCC   

      660                                                                      

     - AGAAGGGGAG TGGGATTTGA TTCCCTTTGC AGGTGGGAAA TTATCCTTAG GC - #GCTACCCA   

      720                                                                      

     - CGAAAATGAC ATGGGATTTG ATTTGACGGT AGATGAAACC TTGCTCCAAC AA - #ATGGAGGA   

      780                                                                      

     - GGCCACCTTG ACTCACTATC TGATTTTGGC TGAAGCTACT TCAAAATCTG AG - #CGTGTTGG   

      840                                                                      

     - AATCCGTGCC TACACCAGTG ATTTCTCTCC TTTCTTTGGG CAGGTGCCTG AC - #TTAACTGG   

      900                                                                      

     - TGTCTATGCA GCCAGTGGAC TAGGTTCATC AGGCCTCACA ACTGGTCCTA TC - #ATTGGTTA   

      960                                                                      

     - CCATCTAGCC CAACTGATCC AAGACAAGGA GTTGACCTTG GACCCTCTAA AT - #TACCCAAT   

     1020                                                                      

     #        1051      CGAG TAAAAAGCGA A                                      

     - (2) INFORMATION FOR SEQ ID NO:126:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 350 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:126:                           

     - Tyr Tyr Leu Ser Arg Glu Ser Asp Leu Glu Va - #l Thr Val Phe Asp His     

     #                15                                                       

     - Glu Gln Gly Gln Ala Thr Lys Ala Ala Ala Gl - #y Ile Ile Ser Pro Trp     

     #            30                                                           

     - Phe Ser Lys Arg Arg Asn Lys Ala Trp Tyr Ly - #s Met Ala Arg Leu Gly     

     #        45                                                               

     - Ala Asp Phe Tyr Val Asp Leu Leu Ala Asp Le - #u Glu Lys Ser Gly Gln     

     #    60                                                                   

     - Glu Ile Asp Phe Tyr Gln Arg Ser Gly Val Ph - #e Leu Leu Lys Lys Asp     

     #80                                                                       

     - Glu Ser Asn Leu Glu Glu Leu Tyr Gln Leu Al - #a Leu Gln Arg Arg Glu     

     #                95                                                       

     - Glu Ser Pro Leu Ile Gly Gln Leu Ala Ile Le - #u Asn Gln Ala Ser Ala     

     #           110                                                           

     - Asn Glu Leu Phe Pro Gly Leu Gln Gly Phe As - #p Arg Leu Leu Tyr Ala     

     #       125                                                               

     - Ser Gly Gly Ala Arg Val Asp Gly Gln Leu Le - #u Val Thr Arg Leu Leu     

     #   140                                                                   

     - Glu Val Ser His Val Lys Leu Val Lys Glu Ly - #s Val Thr Leu Thr Pro     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Leu Ala Ser Gly Tyr Gln Ile Gly Glu Glu Gl - #u Phe Glu Gln Val Ile     

     #               175                                                       

     - Leu Ala Thr Gly Ala Trp Leu Gly Asp Met Le - #u Glu Pro Leu Gly Tyr     

     #           190                                                           

     - Glu Val Asp Val Arg Pro Gln Lys Gly Gln Le - #u Arg Asp Tyr Gln Leu     

     #       205                                                               

     - Ala Gln Asp Met Glu Asp Tyr Pro Val Val Me - #t Pro Glu Gly Glu Trp     

     #   220                                                                   

     - Asp Leu Ile Pro Phe Ala Gly Gly Lys Leu Se - #r Leu Gly Ala Thr His     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Glu Asn Asp Met Gly Phe Asp Leu Thr Val As - #p Glu Thr Leu Leu Gln     

     #               255                                                       

     - Gln Met Glu Glu Ala Thr Leu Thr His Tyr Le - #u Ile Leu Ala Glu Ala     

     #           270                                                           

     - Thr Ser Lys Ser Glu Arg Val Gly Ile Arg Al - #a Tyr Thr Ser Asp Phe     

     #       285                                                               

     - Ser Pro Phe Phe Gly Gln Val Pro Asp Leu Th - #r Gly Val Tyr Ala Ala     

     #   300                                                                   

     - Ser Gly Leu Gly Ser Ser Gly Leu Thr Thr Gl - #y Pro Ile Ile Gly Tyr     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - His Leu Ala Gln Leu Ile Gln Asp Lys Glu Le - #u Thr Leu Asp Pro Leu     

     #               335                                                       

     - Asn Tyr Pro Ile Glu Asn Tyr Val Lys Arg Va - #l Lys Ser Glu             

     #           350                                                           

     - (2) INFORMATION FOR SEQ ID NO: 127:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 352 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     -      (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 127:                     

     - TAAGGTCAAA AGTCAGACCG CTAAGAAAGT GCTAGAAAAG ATTGGAGCTG AC - #TCGGTTAT   

       60                                                                      

     - CTCGCCAGAG TATGAAATGG GGCAGTCTCT AGCACAGACC ATTCTTTTCC AT - #AATAGTGT   

      120                                                                      

     - TGATGTCTTT CAGTTGGATA AAAATGTGTC TATCGTGGAG ATGAAAATTC CT - #CAGTCTTG   

      180                                                                      

     - GGCAGGTCAA AGTCTGAGTA AATTAGACCT CCGTGGCAAA TACAATCTGA AT - #ATTTTGGG   

      240                                                                      

     - TTTCCGAGAG CAGGAAAATT CCCCATTGGA TGTTGAATTT GGACCAGATG AC - #CTCTTGAA   

      300                                                                      

     - AGCAGATACC TATATTTTGG CAGTCATCAA CAACCAGTAT TTGGATACCC TA - #           

      352                                                                      

     - (2) INFORMATION FOR SEQ ID NO:128:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 117 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:128:                           

     - Lys Val Lys Ser Gln Thr Ala Lys Lys Val Le - #u Glu Lys Ile Gly Ala     

     #                15                                                       

     - Asp Ser Val Ile Ser Pro Glu Tyr Glu Met Gl - #y Gln Ser Leu Ala Gln     

     #            30                                                           

     - Thr Ile Leu Phe His Asn Ser Val Asp Val Ph - #e Gln Leu Asp Lys Asn     

     #        45                                                               

     - Val Ser Ile Val Glu Met Lys Ile Pro Gln Se - #r Trp Ala Gly Gln Ser     

     #    60                                                                   

     - Leu Ser Lys Leu Asp Leu Arg Gly Lys Tyr As - #n Leu Asn Ile Leu Gly     

     #80                                                                       

     - Phe Arg Glu Gln Glu Asn Ser Pro Leu Asp Va - #l Glu Phe Gly Pro Asp     

     #                95                                                       

     - Asp Leu Leu Lys Ala Asp Thr Tyr Ile Leu Al - #a Val Ile Asn Asn Gln     

     #           110                                                           

     - Tyr Leu Asp Thr Leu                                                     

             115                                                               

     - (2) INFORMATION FOR SEQ ID NO: 129:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 247 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     -      (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 129:                     

     - TGACGGGTCT CAGGATCAGA CTCAGGAAAT CGCTGAGTGT TTAGCTAGCA AG - #TATCCTAA   

       60                                                                      

     - TATCGTTAGA GCCATCTATC AGGAAAATAA ATGCCATGGC GGTGCGGTCA AT - #CGTGGCTT   

      120                                                                      

     - GGTAGAGGCT TCTGGGCGCT ATTTTAAAGT AGTTGACAGT GATGACTGGG TG - #GATCCTCG   

      180                                                                      

     - TGCCTACTTG AAAATTCTTG AAACTTGCAG GAACTTGAGA GCAAAGGTCA AG - #AGGTGGAT   

      240                                                                      

     #         247                                                             

     - (2) INFORMATION FOR SEQ ID NO:130:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 82 amino                                            

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:130:                           

     - Asp Gly Ser Gln Asp Gln Thr Gln Glu Ile Al - #a Glu Cys Leu Ala Ser     

     #                15                                                       

     - Lys Tyr Pro Asn Ile Val Arg Ala Ile Tyr Gl - #n Glu Asn Lys Cys His     

     #            30                                                           

     - Gly Gly Ala Val Asn Arg Gly Leu Val Glu Al - #a Ser Gly Arg Tyr Phe     

     #        45                                                               

     - Lys Val Val Asp Ser Asp Asp Trp Val Asp Pr - #o Arg Ala Tyr Leu Lys     

     #    60                                                                   

     - Ile Leu Glu Thr Cys Arg Asn Leu Arg Ala Ly - #s Val Lys Arg Trp Met     

     #80                                                                       

     - Ser Leu                                                                 

     - (2) INFORMATION FOR SEQ ID NO: 131:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1744 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #131: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TAGAGGCTTT GCCAAATGGT GGGAAGGGCA CGAGCGTCGA AAAGAGGAAC GC - #TTTGTCAA   

       60                                                                      

     - ACAAGAAGAA AAAGCTCGCC AAAAGGCTGA GAAAGAGGCT AGATTAGAAC AA - #GAAGAGAC   

      120                                                                      

     - TGAAAAAGCC TTACTCGATT TGCCTCCTGT TGATATGGAA ACGGGTGAAA TT - #CTGACAGA   

      180                                                                      

     - GGAAGCTGTT CAAAATCTTC CACCTATTCC AGAAGAAAAG TGGGTGGAAC CA - #GAAATCAT   

      240                                                                      

     - CCTGCCTCAA GCTGAACTTA AATTCCCTGA ACAGGAAGAT GACTCAGATG AC - #GAAGATGT   

      300                                                                      

     - TCAGGTCGAT TTTTCAGCCA AAGAAGCCCT TGAATACAAA CTTCCAAGCT TA - #CAACTCTT   

      360                                                                      

     - TGCACCAGAT AAACCAAAAG ATCAGTCTAA AGAGAAGAAA ATTGTCAGAG AA - #AATATCAA   

      420                                                                      

     - AATCTTAGAA GCAACCTTTG CTAGCTTTGG TATTAAGGTA ACAGTTGAAC GG - #GCCGAAAT   

      480                                                                      

     - TGGGCCATCA GTGACCAAGT ATGAAGTCAA GCCGGCTGTT GGTGTAAGGG TC - #AACCGCAT   

      540                                                                      

     - TTCCAATCTA TCAGATGACC TCGCTCTAGC CTTGGCTGCC AAAGATGTCC GG - #ATTGAAGC   

      600                                                                      

     - ACCAATCCCT GGGAAATCCC TAATCGGAAT TGAAGTGCCC AACTCCGATA TT - #GCCACTGT   

      660                                                                      

     - ATCTTTCCGA GAACTATGGG AACAATCGCA AACGAAAGCA GAAAATTTCT TG - #GAAATTCC   

      720                                                                      

     - TTTAGGGAAG GCTGTTAATG GAACCGCAAG AGCTTTTGAC CTTTCTAAAA TG - #CCCCACTT   

      780                                                                      

     - GCTAGTTGCA GGTTCAACGG GTTCAGGGAA GTCAGTAGCA GTTAACGGCA TT - #ATTGCTAG   

      840                                                                      

     - CATTCTCATG AAGGCGAGAC CAGATCAAGT TAAATTTATG ATGGTCGATC CC - #AAGATGGT   

      900                                                                      

     - TGAGTTATCT GTTTACAATG ATATTCCCCA CCTCTTGATT CCAGTCGTGA CC - #AATCCACG   

      960                                                                      

     - CAAAGCCAGC AAGGCTCTGC AAAAGGTTGT GGATGAAATG GAAAACCGTT AT - #GAACTCTT   

     1020                                                                      

     - TGCCAAGGTG GGAGTTCGGA ATATTGCAGG TTTTAATGCC AAGGTAGAAG AG - #TTCAATTC   

     1080                                                                      

     - CCAGTCTGAG TACAAGCAAA TTCCGCTACC ATTCATTGTC GTGATTGTGG AT - #GAGTTGGC   

     1140                                                                      

     - TGACCTCATG ATGGTGGCCA GCAAGGAAGT GGAAGATGCT ATCATCCGTC TT - #GGGCAGAA   

     1200                                                                      

     - GGCGCGTGCT GCAGGTATCC ACATGATTCT TGCAACTCAG CGTCCATCTG TT - #GATGTCAT   

     1260                                                                      

     - CTCTGGTTTG ATTAAGGCCA ATGTTCCATC TCGTGTAGCA TTTGCGGTTT CA - #TCAGGAAC   

     1320                                                                      

     - AGACTCCCGT ACGATTTTGG ATGAAAATGG AGCAGAAAAA CTTCTTGGTC GA - #GGAGACAT   

     1380                                                                      

     - GCTCTTTAAA CCGATTGATG AAAATCATCC AGTTCGTCTC CAAGGCTCCT TT - #ATCTCGGA   

     1440                                                                      

     - TGACGATGTT GAGCGCATTG TGAACTTCAT CAAGACTCAG GCAGATGCAG AC - #TACGATGA   

     1500                                                                      

     - GAGTTTTGAT CCAGGTGAGG TTTCTGAAAA TGAAGGAGAA TTTTCGGATG GA - #GATGCTGG   

     1560                                                                      

     - TGGTGATCCG CTTTTTGAAG AAGCTAAGTC TTTGGTTATC GAAACACAGA AA - #GCCAGTGC   

     1620                                                                      

     - GTCTATGATT CAGCGTCGTT TATCAGTTGG ATTTAACCGT GCGACCCGTC TC - #ATGGAAGA   

     1680                                                                      

     - ACTGGAGATA GCAGGTGTCA TCGGTCCAGC TGAAGGTACC AAACCTCGAA AA - #GTGTTACA   

     1740                                                                      

     #           1744                                                          

     - (2) INFORMATION FOR SEQ ID NO:132:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 581 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:132:                           

     - Arg Gly Phe Ala Lys Trp Trp Glu Gly His Gl - #u Arg Arg Lys Glu Glu     

     #                15                                                       

     - Arg Phe Val Lys Gln Glu Glu Lys Ala Arg Gl - #n Lys Ala Glu Lys Glu     

     #            30                                                           

     - Ala Arg Leu Glu Gln Glu Glu Thr Glu Lys Al - #a Leu Leu Asp Leu Pro     

     #        45                                                               

     - Pro Val Asp Met Glu Thr Gly Glu Ile Leu Th - #r Glu Glu Ala Val Gln     

     #    60                                                                   

     - Asn Leu Pro Pro Ile Pro Glu Glu Lys Trp Va - #l Glu Pro Glu Ile Ile     

     #80                                                                       

     - Leu Pro Gln Ala Glu Leu Lys Phe Pro Glu Gl - #n Glu Asp Asp Ser Asp     

     #                95                                                       

     - Asp Glu Asp Val Gln Val Asp Phe Ser Ala Ly - #s Glu Ala Leu Glu Tyr     

     #           110                                                           

     - Lys Leu Pro Ser Leu Gln Leu Phe Ala Pro As - #p Lys Pro Lys Asp Gln     

     #       125                                                               

     - Ser Lys Glu Lys Lys Ile Val Arg Glu Asn Il - #e Lys Ile Leu Glu Ala     

     #   140                                                                   

     - Thr Phe Ala Ser Phe Gly Ile Lys Val Thr Va - #l Glu Arg Ala Glu Ile     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Gly Pro Ser Val Thr Lys Tyr Glu Val Lys Pr - #o Ala Val Gly Val Arg     

     #               175                                                       

     - Val Asn Arg Ile Ser Asn Leu Ser Asp Asp Le - #u Ala Leu Ala Leu Ala     

     #           190                                                           

     - Ala Lys Asp Val Arg Ile Glu Ala Pro Ile Pr - #o Gly Lys Ser Leu Ile     

     #       205                                                               

     - Gly Ile Glu Val Pro Asn Ser Asp Ile Ala Th - #r Val Ser Phe Arg Glu     

     #   220                                                                   

     - Leu Trp Glu Gln Ser Gln Thr Lys Ala Glu As - #n Phe Leu Glu Ile Pro     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Leu Gly Lys Ala Val Asn Gly Thr Ala Arg Al - #a Phe Asp Leu Ser Lys     

     #               255                                                       

     - Met Pro His Leu Leu Val Ala Gly Ser Thr Gl - #y Ser Gly Lys Ser Val     

     #           270                                                           

     - Ala Val Asn Gly Ile Ile Ala Ser Ile Leu Me - #t Lys Ala Arg Pro Asp     

     #       285                                                               

     - Gln Val Lys Phe Met Met Val Asp Pro Lys Me - #t Val Glu Leu Ser Val     

     #   300                                                                   

     - Tyr Asn Asp Ile Pro His Leu Leu Ile Pro Va - #l Val Thr Asn Pro Arg     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Lys Ala Ser Lys Ala Leu Gln Lys Val Val As - #p Glu Met Glu Asn Arg     

     #               335                                                       

     - Tyr Glu Leu Phe Ala Lys Val Gly Val Arg As - #n Ile Ala Gly Phe Asn     

     #           350                                                           

     - Ala Lys Val Glu Glu Phe Asn Ser Gln Ser Gl - #u Tyr Lys Gln Ile Pro     

     #       365                                                               

     - Leu Pro Phe Ile Val Val Ile Val Asp Glu Le - #u Ala Asp Leu Met Met     

     #   380                                                                   

     - Val Ala Ser Lys Glu Val Glu Asp Ala Ile Il - #e Arg Leu Gly Gln Lys     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Ala Arg Ala Ala Gly Ile His Met Ile Leu Al - #a Thr Gln Arg Pro Ser     

     #               415                                                       

     - Val Asp Val Ile Ser Gly Leu Ile Lys Ala As - #n Val Pro Ser Arg Val     

     #           430                                                           

     - Ala Phe Ala Val Ser Ser Gly Thr Asp Ser Ar - #g Thr Ile Leu Asp Glu     

     #       445                                                               

     - Asn Gly Ala Glu Lys Leu Leu Gly Arg Gly As - #p Met Leu Phe Lys Pro     

     #   460                                                                   

     - Ile Asp Glu Asn His Pro Val Arg Leu Gln Gl - #y Ser Phe Ile Ser Asp     

     465                 4 - #70                 4 - #75                 4 -   

     #80                                                                       

     - Asp Asp Val Glu Arg Ile Val Asn Phe Ile Ly - #s Thr Gln Ala Asp Ala     

     #               495                                                       

     - Asp Tyr Asp Glu Ser Phe Asp Pro Gly Glu Va - #l Ser Glu Asn Glu Gly     

     #           510                                                           

     - Glu Phe Ser Asp Gly Asp Ala Gly Gly Asp Pr - #o Leu Phe Glu Glu Ala     

     #       525                                                               

     - Lys Ser Leu Val Ile Glu Thr Gln Lys Ala Se - #r Ala Ser Met Ile Gln     

     #   540                                                                   

     - Arg Arg Leu Ser Val Gly Phe Asn Arg Ala Th - #r Arg Leu Met Glu Glu     

     545                 5 - #50                 5 - #55                 5 -   

     #60                                                                       

     - Leu Glu Ile Ala Gly Val Ile Gly Pro Ala Gl - #u Gly Thr Lys Pro Arg     

     #               575                                                       

     - Lys Val Leu Gln Gln                                                     

                 580                                                           

     - (2) INFORMATION FOR SEQ ID NO: 133:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 829 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #133: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TCAAAAAGAG AAGGAAAACT TGGTTATTGC TGGGAAAATA GGTCCAGAAC CA - #GAAATTTT   

       60                                                                      

     - GGCCAATATG TATAAGTTGC TGATTGAAGA AAATACCAGC ATGACTGCGA CT - #GTTAAACC   

      120                                                                      

     - GAATTTTGGG AAGACAAGCT TCCTTTATGA AGCTCTGAAA AAAGGCGATA TT - #GACATCTA   

      180                                                                      

     - TCCTGAATTT ACTGGTACGG TGACTGAAAG TTTGCTTCAA CCATCACCCA AG - #GTGAGTCA   

      240                                                                      

     - TGAACCAGAA CAGGTTTATC AGGTGGCGCG TGATGGCATT GCTAAGCAGG AT - #CATCTAGC   

      300                                                                      

     - CTATCTCAAA CCCATGTCTT ATCAAAACAC CTATGCTGTA GCTGTTCCGA AA - #AAGATTGC   

      360                                                                      

     - TCAAGAATAT GGCTTGAAGA CCATTTCAGA CTTGAAAAAA GTGGAAGGGC AG - #TTGAAGGC   

      420                                                                      

     - AGGTTTTACA CTCGAGTTTA ACGACCGTGA AGATGGAAAT AAGGGCTTGC AA - #TCAATGTA   

      480                                                                      

     - TGGTCTCAAT CTCAATGTAG CGACCATTGA GCCAGCCCTT CGCTATCAGG CT - #ATTCAGTC   

      540                                                                      

     - AGGGGATATT CAAATCACGG ATGCCTATTC GACTGATGCG GAATTGGAGC GT - #TATGATTT   

      600                                                                      

     - ACAGGTCTTG GAAGATGACA AGCAACTCTT CCCACCTTAT CAAGGGGCTC CA - #CTCATGAA   

      660                                                                      

     - AGAAGCTCTT CTCAAGAAAC ACCCAGAGTT GGAAAGAGTT CTTAATACAT TG - #GCTGGTAA   

      720                                                                      

     - GATTACAGAA AGCCAGATGA GCCAGCTCAA CTACCAAGTC GGTGTTGAAG GC - #AAGTCAGC   

      780                                                                      

     #              829GGAGT TTCTCCAAGA ACAAGGTTTG TTGAAGAAA                   

     - (2) INFORMATION FOR SEQ ID NO:134:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 276 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:134:                           

     - Gln Lys Glu Lys Glu Asn Leu Val Ile Ala Gl - #y Lys Ile Gly Pro Glu     

     #                15                                                       

     - Pro Glu Ile Leu Ala Asn Met Tyr Lys Leu Le - #u Ile Glu Glu Asn Thr     

     #            30                                                           

     - Ser Met Thr Ala Thr Val Lys Pro Asn Phe Gl - #y Lys Thr Ser Phe Leu     

     #        45                                                               

     - Tyr Glu Ala Leu Lys Lys Gly Asp Ile Asp Il - #e Tyr Pro Glu Phe Thr     

     #    60                                                                   

     - Gly Thr Val Thr Glu Ser Leu Leu Gln Pro Se - #r Pro Lys Val Ser His     

     #80                                                                       

     - Glu Pro Glu Gln Val Tyr Gln Val Ala Arg As - #p Gly Ile Ala Lys Gln     

     #                95                                                       

     - Asp His Leu Ala Tyr Leu Lys Pro Met Ser Ty - #r Gln Asn Thr Tyr Ala     

     #           110                                                           

     - Val Ala Val Pro Lys Lys Ile Ala Gln Glu Ty - #r Gly Leu Lys Thr Ile     

     #       125                                                               

     - Ser Asp Leu Lys Lys Val Glu Gly Gln Leu Ly - #s Ala Gly Phe Thr Leu     

     #   140                                                                   

     - Glu Phe Asn Asp Arg Glu Asp Gly Asn Lys Gl - #y Leu Gln Ser Met Tyr     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Gly Leu Asn Leu Asn Val Ala Thr Ile Glu Pr - #o Ala Leu Arg Tyr Gln     

     #               175                                                       

     - Ala Ile Gln Ser Gly Asp Ile Gln Ile Thr As - #p Ala Tyr Ser Thr Asp     

     #           190                                                           

     - Ala Glu Leu Glu Arg Tyr Asp Leu Gln Val Le - #u Glu Asp Asp Lys Gln     

     #       205                                                               

     - Leu Phe Pro Pro Tyr Gln Gly Ala Pro Leu Me - #t Lys Glu Ala Leu Leu     

     #   220                                                                   

     - Lys Lys His Pro Glu Leu Glu Arg Val Leu As - #n Thr Leu Ala Gly Lys     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Ile Thr Glu Ser Gln Met Ser Gln Leu Asn Ty - #r Gln Val Gly Val Glu     

     #               255                                                       

     - Gly Lys Ser Ala Lys Gln Val Ala Lys Glu Ph - #e Leu Gln Glu Gln Gly     

     #           270                                                           

     - Leu Leu Lys Lys                                                         

             275                                                               

     - (2) INFORMATION FOR SEQ ID NO: 135:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 712 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #135: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - ACGTTCTATT GAGGACCACT TTGATTCAAA CTTCGAATTG GAATATAACC TC - #AAAGAAAA   

       60                                                                      

     - AGGGAAAACA GATCTTTTGA AGCTAGTTGA TAAAACAACT GACATGCGTC TG - #CATTTTAT   

      120                                                                      

     - CCGCCAAACT CATCCACGCG GTCTCGGAGA TGCTGTTTTG CAAGCCAAGG CT - #TTCGTCGG   

      180                                                                      

     - AAATGAACCT TTTGTCGTTA TGCTTGGTGA TGACTTGATG GATATCACAG AC - #GAAAAGGC   

      240                                                                      

     - TGTTCCACTT ACCAAACAAC TCATGGATGA CTACGAGCGT ACCCACGCGT CT - #ACTATCGC   

      300                                                                      

     - TGTCATGCCA GTCCCTCATG ACGAAGTATC TGCTTACGGG GTTATTGCTC CG - #CAAGGCGA   

      360                                                                      

     - AGGAAAAGAT GGTCTTTACA GTGTTGAAAC CTTTGTTGAA AAACCAGCTC CA - #GAGGACGC   

      420                                                                      

     - TCCTAGCGAC CTTGCTATTA TCGGACGCTA CCTCCTCACG CCTGAAATTT TT - #GAGATTCT   

      480                                                                      

     - CGAAAAGCAA GCTCCAGGTG CAGGAAATGA AATTCAGCTG ACAGATGCAA TC - #GACACCCT   

      540                                                                      

     - CAATAAAACA CAACGTGTAT TTGCTCGTGA GTTCAAAGGG GCTCGTTACG AT - #GTCGGAGA   

      600                                                                      

     - CAAGTTTGGC TTCATGAAAA CATCCATCGA CTACGCCCTC AAACACCCAC AA - #GTCAAAGA   

      660                                                                      

     - TGATTTGAAG AATTACCTCA TCCAACTTGG AAAAGAATTG ACTGAGAAGG AA - #           

      712                                                                      

     - (2) INFORMATION FOR SEQ ID NO:136:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 237 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:136:                           

     - Arg Ser Ile Glu Asp His Phe Asp Ser Asn Ph - #e Glu Leu Glu Tyr Asn     

     #                15                                                       

     - Leu Lys Glu Lys Gly Lys Thr Asp Leu Leu Ly - #s Leu Val Asp Lys Thr     

     #            30                                                           

     - Thr Asp Met Arg Leu His Phe Ile Arg Gln Th - #r His Pro Arg Gly Leu     

     #        45                                                               

     - Gly Asp Ala Val Leu Gln Ala Lys Ala Phe Va - #l Gly Asn Glu Pro Phe     

     #    60                                                                   

     - Val Val Met Leu Gly Asp Asp Leu Met Asp Il - #e Thr Asp Glu Lys Ala     

     #80                                                                       

     - Val Pro Leu Thr Lys Gln Leu Met Asp Asp Ty - #r Glu Arg Thr His Ala     

     #                95                                                       

     - Ser Thr Ile Ala Val Met Pro Val Pro His As - #p Glu Val Ser Ala Tyr     

     #           110                                                           

     - Gly Val Ile Ala Pro Gln Gly Glu Gly Lys As - #p Gly Leu Tyr Ser Val     

     #       125                                                               

     - Glu Thr Phe Val Glu Lys Pro Ala Pro Glu As - #p Ala Pro Ser Asp Leu     

     #   140                                                                   

     - Ala Ile Ile Gly Arg Tyr Leu Leu Thr Pro Gl - #u Ile Phe Glu Ile Leu     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Glu Lys Gln Ala Pro Gly Ala Gly Asn Glu Il - #e Gln Leu Thr Asp Ala     

     #               175                                                       

     - Ile Asp Thr Leu Asn Lys Thr Gln Arg Val Ph - #e Ala Arg Glu Phe Lys     

     #           190                                                           

     - Gly Ala Arg Tyr Asp Val Gly Asp Lys Phe Gl - #y Phe Met Lys Thr Ser     

     #       205                                                               

     - Ile Asp Tyr Ala Leu Lys His Pro Gln Val Ly - #s Asp Asp Leu Lys Asn     

     #   220                                                                   

     - Tyr Leu Ile Gln Leu Gly Lys Glu Leu Thr Gl - #u Lys Glu                 

     225                 2 - #30                 2 - #35                       

     - (2) INFORMATION FOR SEQ ID NO: 137:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 502 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #137: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - CGCTCAAAAT ACCAGAGGTG TTCAGCTAAT CGAGCACGTT TCTCCTCAAA TG - #TTGAAAGC   

       60                                                                      

     - CCAATTGGAG AGTGTCTTTT CTGATATTCC ACCTCAGGCT GTAAAAACTG GA - #ATGTTGGC   

      120                                                                      

     - TACTACTGAA ATCATGGAAA TCATCCAACC CTATCTTAAA AAACTGGATT GT - #CCCTATGT   

      180                                                                      

     - CCTTGATCCT GTTATGGTTG CTACAAGTGG AGATGCCTTG ATTGACTCAA AT - #GCTAGAGA   

      240                                                                      

     - CTATCTCAAA ACAAACTTAC TACCTCTAGC AACTATTATT ACGCCAAATC TT - #CCTGAAGC   

      300                                                                      

     - AGAAGAGATT GTTGGTTTTT CAATCCATGA CCCCGAAGAC ATGCAGCGTG CT - #GGTCGCCT   

      360                                                                      

     - GATTTTAAAA GAATTTGGTC CTCAGTCTGT GGTTATCAAA GGCGGACATC TC - #AAAGGTGG   

      420                                                                      

     - TGCTAAAGAT TTCCTCTTTA CCAAGAATGA ACAATTTGTC TGGGAAAGCC CA - #CGAATTCA   

      480                                                                      

     #                502GTA CT                                                

     - (2) INFORMATION FOR SEQ ID NO:138:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 167 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:138:                           

     - Ala Gln Asn Thr Arg Gly Val Gln Leu Ile Gl - #u His Val Ser Pro Gln     

     #                15                                                       

     - Met Leu Lys Ala Gln Leu Glu Ser Val Phe Se - #r Asp Ile Pro Pro Gln     

     #            30                                                           

     - Ala Val Lys Thr Gly Met Leu Ala Thr Thr Gl - #u Ile Met Glu Ile Ile     

     #        45                                                               

     - Gln Pro Tyr Leu Lys Lys Leu Asp Cys Pro Ty - #r Val Leu Asp Pro Val     

     #    60                                                                   

     - Met Val Ala Thr Ser Gly Asp Ala Leu Ile As - #p Ser Asn Ala Arg Asp     

     #80                                                                       

     - Tyr Leu Lys Thr Asn Leu Leu Pro Leu Ala Th - #r Ile Ile Thr Pro Asn     

     #                95                                                       

     - Leu Pro Glu Ala Glu Glu Ile Val Gly Phe Se - #r Ile His Asp Pro Glu     

     #           110                                                           

     - Asp Met Gln Arg Ala Gly Arg Leu Ile Leu Ly - #s Glu Phe Gly Pro Gln     

     #       125                                                               

     - Ser Val Val Ile Lys Gly Gly His Leu Lys Gl - #y Gly Ala Lys Asp Phe     

     #   140                                                                   

     - Leu Phe Thr Lys Asn Glu Gln Phe Val Trp Gl - #u Ser Pro Arg Ile Gln     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Thr Cys His Thr His Gly Thr                                             

                     165                                                       

     - (2) INFORMATION FOR SEQ ID NO: 139:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 805 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #139: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - AATTGTACAA TTAGAAAAAG ATAGCAAATC AGACAAAGAA CAAGTTGATA AA - #CTATTTGA   

       60                                                                      

     - ATCATTTGAT GCATCTTCAG ATGAATCTAT TTCTAAATTA AAAGAACTAT CT - #GAAACTTC   

      120                                                                      

     - ACTTAAAACC GATGCAGGTA AAGACTATCT TAATAACAAA GTCAAAGAAT CA - #TCTAAAGC   

      180                                                                      

     - AATTGTAGAT TTTCATTTGC AAAAAGGTTT GGCTTATGAT GTTAAAGATT CA - #GATGACAA   

      240                                                                      

     - ATTTAAAGAT AAAGCAACTC TTGAAACAAA TGTAAAAGAA ATTACAAAAC AA - #ATTGATTT   

      300                                                                      

     - TATCAAAAAA GTTGATGAAA CTTTTAAACA AGAGAATTTG GAAGAAACTC TT - #AAATCTCT   

      360                                                                      

     - AAATGATCTT GTTGATAAAT ATCAAAAACA AATCGAACTT TTGAAGAAAG AA - #GAAGAAAA   

      420                                                                      

     - AGCTGCTGAA AAAGCTGCTG AAAAAGCAAA GGAATCTTCT AGTCAAAGTA AT - #TCTTCTGG   

      480                                                                      

     - TAGTGCTTCT AATGAGTCTT ATAATGGATC TTCCAATTCA AATGTAGATT AT - #AGTTCATC   

      540                                                                      

     - TGAACAAACT AATGGATATT CAAATAATTA TGGCGGTCAA GATTATTCTG GT - #TCAGGAGA   

      600                                                                      

     - TAGTTCAACA AATGGTGGAT CATCAGAACA ATATTCATCT AGCAATTCAA AC - #AGCGGAGC   

      660                                                                      

     - AAATAATGTC TACAGATATA AAGGCACTGG TGCTGACGGC TATCAAAGAT AC - #TACTACAA   

      720                                                                      

     - AGATCATAAT AATGGAGATG TGTATGATGA CGATGGAAAT TACCTTGGGA AC - #TTTGGTGG   

      780                                                                      

     #              805 AGTC AACGC                                             

     - (2) INFORMATION FOR SEQ ID NO:140:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 268 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:140:                           

     - Ile Val Gln Leu Glu Lys Asp Ser Lys Ser As - #p Lys Glu Gln Val Asp     

     #                15                                                       

     - Lys Leu Phe Glu Ser Phe Asp Ala Ser Ser As - #p Glu Ser Ile Ser Lys     

     #            30                                                           

     - Leu Lys Glu Leu Ser Glu Thr Ser Leu Lys Th - #r Asp Ala Gly Lys Asp     

     #        45                                                               

     - Tyr Leu Asn Asn Lys Val Lys Glu Ser Ser Ly - #s Ala Ile Val Asp Phe     

     #    60                                                                   

     - His Leu Gln Lys Gly Leu Ala Tyr Asp Val Ly - #s Asp Ser Asp Asp Lys     

     #80                                                                       

     - Phe Lys Asp Lys Ala Thr Leu Glu Thr Asn Va - #l Lys Glu Ile Thr Lys     

     #                95                                                       

     - Gln Ile Asp Phe Ile Lys Lys Val Asp Glu Th - #r Phe Lys Gln Glu Asn     

     #           110                                                           

     - Leu Glu Glu Thr Leu Lys Ser Leu Asn Asp Le - #u Val Asp Lys Tyr Gln     

     #       125                                                               

     - Lys Gln Ile Glu Leu Leu Lys Lys Glu Glu Gl - #u Lys Ala Ala Glu Lys     

     #   140                                                                   

     - Ala Ala Glu Lys Ala Lys Glu Ser Ser Ser Gl - #n Ser Asn Ser Ser Gly     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Ser Ala Ser Asn Glu Ser Tyr Asn Gly Ser Se - #r Asn Ser Asn Val Asp     

     #               175                                                       

     - Tyr Ser Ser Ser Glu Gln Thr Asn Gly Tyr Se - #r Asn Asn Tyr Gly Gly     

     #           190                                                           

     - Gln Asp Tyr Ser Gly Ser Gly Asp Ser Ser Th - #r Asn Gly Gly Ser Ser     

     #       205                                                               

     - Glu Gln Tyr Ser Ser Ser Asn Ser Asn Ser Gl - #y Ala Asn Asn Val Tyr     

     #   220                                                                   

     - Arg Tyr Lys Gly Thr Gly Ala Asp Gly Tyr Gl - #n Arg Tyr Tyr Tyr Lys     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Asp His Asn Asn Gly Asp Val Tyr Asp Asp As - #p Gly Asn Tyr Leu Gly     

     #               255                                                       

     - Asn Phe Gly Gly Gly Ile Ala Glu Pro Ser Gl - #n Arg                     

     #           265                                                           

     - (2) INFORMATION FOR SEQ ID NO: 141:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 211 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #141: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TCTGACCAAG CAAAAAGAAG CAGTCAATGA CAAAGGAAAA GCAGCTGTTG TT - #AAGGTGGT   

       60                                                                      

     - GGAAAGCCAG GCAGAACTTT ATAGCTTAGA AAAGAATGAA GATGCTAGCC TA - #AGAAAGTT   

      120                                                                      

     - ACAAGCAGAT GGACGCATCA CGGAAGAACA GGCTAAAGCT TATAAAGAAT AC - #AATGATAA   

      180                                                                      

     #         211      CGTA AAGTCAATGA T                                      

     - (2) INFORMATION FOR SEQ ID NO:142:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 70 amino                                            

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:142:                           

     - Leu Thr Lys Gln Lys Glu Ala Val Asn Asp Ly - #s Gly Lys Ala Ala Val     

     #                15                                                       

     - Val Lys Val Val Glu Ser Gln Ala Glu Leu Ty - #r Ser Leu Glu Lys Asn     

     #            30                                                           

     - Glu Asp Ala Ser Leu Arg Lys Leu Gln Ala As - #p Gly Arg Ile Thr Glu     

     #        45                                                               

     - Glu Gln Ala Lys Ala Tyr Lys Glu Tyr Asn As - #p Lys Asn Gly Gly Ala     

     #    60                                                                   

     - Asn Arg Lys Val Asn Asp                                                 

     #70                                                                       

     - (2) INFORMATION FOR SEQ ID NO: 143:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 331 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #143: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - GTCCGGCTCT GTCCAGTCCA CTTTTTCAGC GGTAGAGGAA CAGATTTTCT TT - #ATGGAGTT   

       60                                                                      

     - TGAAGAACTC TATCGGGAAA CCCAAAAACG CAGTGTAGCC AGTCAGCAAA AG - #ACTAGTCT   

      120                                                                      

     - GAACTTAGAT GGGCAGACGC TTAGCAATGG CAGTCAAAAG TTGCCAGTCC CT - #AAAGGAAT   

      180                                                                      

     - TCAGGCCCCA TCAGGCCAAA GTATTACATT TGACCGAGCT GGGGGCAATT CG - #TCCCTGGC   

      240                                                                      

     - TAAGGTTGAA TTTCAGACCA GTAAAGGAGC GATTCGCTAT CAATTATATC TA - #GGAAATGG   

      300                                                                      

     #         331      AAGG AAACAAAAAA T                                      

     - (2) INFORMATION FOR SEQ ID NO:144:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 110 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:144:                           

     - Ser Gly Ser Val Gln Ser Thr Phe Ser Ala Va - #l Glu Glu Gln Ile Phe     

     #                15                                                       

     - Phe Met Glu Phe Glu Glu Leu Tyr Arg Glu Th - #r Gln Lys Arg Ser Val     

     #            30                                                           

     - Ala Ser Gln Gln Lys Thr Ser Leu Asn Leu As - #p Gly Gln Thr Leu Ser     

     #        45                                                               

     - Asn Gly Ser Gln Lys Leu Pro Val Pro Lys Gl - #y Ile Gln Ala Pro Ser     

     #    60                                                                   

     - Gly Gln Ser Ile Thr Phe Asp Arg Ala Gly Gl - #y Asn Ser Ser Leu Ala     

     #80                                                                       

     - Lys Val Glu Phe Gln Thr Ser Lys Gly Ala Il - #e Arg Tyr Gln Leu Tyr     

     #                95                                                       

     - Leu Gly Asn Gly Lys Ile Lys Arg Ile Lys Gl - #u Thr Lys Asn             

     #           110                                                           

     - (2) INFORMATION FOR SEQ ID NO: 145:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 196 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #145: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - GGGACAAATT CAAAAAAATA GGCAAGAGGA AGCAAAAATC TTGCAAAAGG AA - #GAAGTCTT   

       60                                                                      

     - GAGGGTAGCT AAGATGGCCC TGCAGACGGG GCAAAATCAG GTAAGCATCA AC - #GGAGTTGA   

      120                                                                      

     - GATTCAGGTA TTTTCTAGTG AAAAAGGATT GGAGGTCTAC CATGGTTCAG AA - #CAGTTGTT   

      180                                                                      

     #   196                                                                   

     - (2) INFORMATION FOR SEQ ID NO:146:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 65 amino                                            

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:146:                           

     - Gly Gln Ile Gln Lys Asn Arg Gln Glu Glu Al - #a Lys Ile Leu Gln Lys     

     #                15                                                       

     - Glu Glu Val Leu Arg Val Ala Lys Met Ala Le - #u Gln Thr Gly Gln Asn     

     #            30                                                           

     - Gln Val Ser Ile Asn Gly Val Glu Ile Gln Va - #l Phe Ser Ser Glu Lys     

     #        45                                                               

     - Gly Leu Glu Val Tyr His Gly Ser Glu Gln Le - #u Leu Ala Ile Lys Glu     

     #    60                                                                   

     - Pro                                                                     

     65                                                                        

     - (2) INFORMATION FOR SEQ ID NO: 147:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 319 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #147: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TCGCTACCAG CAACAAAGCG AGCAAAAGGA GTGGCTCTTG TTTGTGGACC AA - #CTTGAGGT   

       60                                                                      

     - AGAATTAGAC CGTTCGCAGT TCGAAAAAGT AGAAGGCAAT CGCCTATACA TG - #AAGCAAGA   

      120                                                                      

     - TGGCAAGGAC ATCGCCATCG GTAAGTCAAA GTCAGATGAT TTCCGTAAAA CG - #AATGCTCG   

      180                                                                      

     - TGGTCGAGGT TATCAGCCTA TGGTTTATGG ACTCAAATCT GTACGGATTA CA - #GAGGACAA   

      240                                                                      

     - TCAACTGGTT CGCTTTCATT TCCAGTTCCA AAAAGGCTTA GAAAGGGAGT TC - #ATCTATCG   

      300                                                                      

     #319               AGT                                                    

     - (2) INFORMATION FOR SEQ ID NO:148:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 106 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:148:                           

     - Arg Tyr Gln Gln Gln Ser Glu Gln Lys Glu Tr - #p Leu Leu Phe Val Asp     

     #                15                                                       

     - Gln Leu Glu Val Glu Leu Asp Arg Ser Gln Ph - #e Glu Lys Val Glu Gly     

     #            30                                                           

     - Asn Arg Leu Tyr Met Lys Gln Asp Gly Lys As - #p Ile Ala Ile Gly Lys     

     #        45                                                               

     - Ser Lys Ser Asp Asp Phe Arg Lys Thr Asn Al - #a Arg Gly Arg Gly Tyr     

     #    60                                                                   

     - Gln Pro Met Val Tyr Gly Leu Lys Ser Val Ar - #g Ile Thr Glu Asp Asn     

     #80                                                                       

     - Gln Leu Val Arg Phe His Phe Gln Phe Gln Ly - #s Gly Leu Glu Arg Glu     

     #                95                                                       

     - Phe Ile Tyr Arg Val Glu Lys Glu Lys Ser                                 

     #           105                                                           

     - (2) INFORMATION FOR SEQ ID NO: 149:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 322 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #149: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - GAACCGACAA GTCGCCCACT ATCAAGACTA TGCTTTGAAT AAAGAAAAAT TG - #GTTGCTTT   

       60                                                                      

     - TGCTATGGCT AAACGAACCA AAGATAAGGT TGAGCAAGAA AGTGGGGAAC AG - #TTTTTTAA   

      120                                                                      

     - TCTAGGTCAG GTAAGCTATC AAAACAAGAA AACTGGCTTA GTGACGAGGG TT - #CGTACGGA   

      180                                                                      

     - TAAGAGCCAA TATGAGTTTC TGTTTCCTTC AGTCAAAATC AAAGAAGAGA AA - #AGAGATAA   

      240                                                                      

     - AAAGGAAGAG GTAGCGACCG ATTCAAGCGA AAAAGTGGAG AAGAAAAAAT CA - #GAAGAGAA   

      300                                                                      

     #                322ATT CA                                                

     - (2) INFORMATION FOR SEQ ID NO:150:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 107 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:150:                           

     - Asn Arg Gln Val Ala His Tyr Gln Asp Tyr Al - #a Leu Asn Lys Glu Lys     

     #                15                                                       

     - Leu Val Ala Phe Ala Met Ala Lys Arg Thr Ly - #s Asp Lys Val Glu Gln     

     #            30                                                           

     - Glu Ser Gly Glu Gln Phe Phe Asn Leu Gly Gl - #n Val Ser Tyr Gln Asn     

     #        45                                                               

     - Lys Lys Thr Gly Leu Val Thr Arg Val Arg Th - #r Asp Lys Ser Gln Tyr     

     #    60                                                                   

     - Glu Phe Leu Phe Pro Ser Val Lys Ile Lys Gl - #u Glu Lys Arg Asp Lys     

     #80                                                                       

     - Lys Glu Glu Val Ala Thr Asp Ser Ser Glu Ly - #s Val Glu Lys Lys Lys     

     #                95                                                       

     - Ser Glu Glu Lys Pro Glu Lys Lys Glu Asn Se - #r                         

     #           105                                                           

     - (2) INFORMATION FOR SEQ ID NO: 151:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 784 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #151: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - GGTTGTCGGC TGGCAATATA TCCCGTTTCC ATCTAAAGGT AGTACAATTG GT - #CCTTACCC   

       60                                                                      

     - AAATGGTATC AGATTAGAAG GTTTTCCAAA GTCAGAGTGG TACTACTTCG AT - #AAAAATGG   

      120                                                                      

     - AGTGCTACAA GAGTTTGTTG GTTGGAAAAC ATTAGAGATT AAAACTAAAG AC - #AGTGTTGG   

      180                                                                      

     - AAGAAAGTAC GGGGAAAAAC GTGAAGATTC AGAAGATAAA GAAGAGAAGC GT - #TATTATAC   

      240                                                                      

     - GAACTATTAC TTTAATCAAA ATCATTCTTT AGAGACAGGT TGGCTTTATG AT - #CAGTCTAA   

      300                                                                      

     - CTGGTATTAT CTAGCTAAGA CGGAAATTAA TGGAGAAAAC TACCTTGGTG GT - #GAAAGACG   

      360                                                                      

     - TGCGGGGTGG ATAAACGATG ATTCGACTTG GTACTACCTA GATCCAACAA CT - #GGTATTAT   

      420                                                                      

     - GCAAACAGGT TGGCAATATC TAGGTAATAA GTGGTACTAC CTCCGTTCCT CA - #GGAGCAAT   

      480                                                                      

     - GGCCACTGGC TGGTATCAGG AAGGTACCAC TTGGTATTAT TTAGACCACC CA - #AATGGCGA   

      540                                                                      

     - TATGAAAACA GGTTGGCAAA ACCTTGGGAA CAAATGGTAC TATCTCCGTT CA - #TCAGGAGC   

      600                                                                      

     - TATGGCAACT GGTTGGTATC AAGATGGTTC AACTTGGTAC TACCTAAATG CA - #GGTAATGG   

      660                                                                      

     - AGACATGAAG ACAGGTTGGT TCCAGGTCAA TGGCAACTGG TACTATGCTT AT - #AGCTCAGG   

      720                                                                      

     - TGCTTTGGCA GTGAATACGA CCGTAGATGG CTATTCTGTC AACTATAATG GC - #GAATGGGT   

      780                                                                      

     #            784                                                          

     - (2) INFORMATION FOR SEQ ID NO:152:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 261 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:152:                           

     - Val Val Gly Trp Gln Tyr Ile Pro Phe Pro Se - #r Lys Gly Ser Thr Ile     

     #                15                                                       

     - Gly Pro Tyr Pro Asn Gly Ile Arg Leu Glu Gl - #y Phe Pro Lys Ser Glu     

     #            30                                                           

     - Trp Tyr Tyr Phe Asp Lys Asn Gly Val Leu Gl - #n Glu Phe Val Gly Trp     

     #        45                                                               

     - Lys Thr Leu Glu Ile Lys Thr Lys Asp Ser Va - #l Gly Arg Lys Tyr Gly     

     #    60                                                                   

     - Glu Lys Arg Glu Asp Ser Glu Asp Lys Glu Gl - #u Lys Arg Tyr Tyr Thr     

     #80                                                                       

     - Asn Tyr Tyr Phe Asn Gln Asn His Ser Leu Gl - #u Thr Gly Trp Leu Tyr     

     #                95                                                       

     - Asp Gln Ser Asn Trp Tyr Tyr Leu Ala Lys Th - #r Glu Ile Asn Gly Glu     

     #           110                                                           

     - Asn Tyr Leu Gly Gly Glu Arg Arg Ala Gly Tr - #p Ile Asn Asp Asp Ser     

     #       125                                                               

     - Thr Trp Tyr Tyr Leu Asp Pro Thr Thr Gly Il - #e Met Gln Thr Gly Trp     

     #   140                                                                   

     - Gln Tyr Leu Gly Asn Lys Trp Tyr Tyr Leu Ar - #g Ser Ser Gly Ala Met     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Ala Thr Gly Trp Tyr Gln Glu Gly Thr Thr Tr - #p Tyr Tyr Leu Asp His     

     #               175                                                       

     - Pro Asn Gly Asp Met Lys Thr Gly Trp Gln As - #n Leu Gly Asn Lys Trp     

     #           190                                                           

     - Tyr Tyr Leu Arg Ser Ser Gly Ala Met Ala Th - #r Gly Trp Tyr Gln Asp     

     #       205                                                               

     - Gly Ser Thr Trp Tyr Tyr Leu Asn Ala Gly As - #n Gly Asp Met Lys Thr     

     #   220                                                                   

     - Gly Trp Phe Gln Val Asn Gly Asn Trp Tyr Ty - #r Ala Tyr Ser Ser Gly     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Ala Leu Ala Val Asn Thr Thr Val Asp Gly Ty - #r Ser Val Asn Tyr Asn     

     #               255                                                       

     - Gly Glu Trp Val Arg                                                     

                 260                                                           

     - (2) INFORMATION FOR SEQ ID NO: 153:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1708 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #153: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - GGCCAAATCA GAATGGGTAG AAGACAAGGG AGCCTTTTAT TATCTTGACC AA - #GATGGAAA   

       60                                                                      

     - GATGAAAAGA AATGCTTGGG TAGGAACTTC CTATGTTGGT GCAACAGGTG CC - #AAAGTAAT   

      120                                                                      

     - AGAAGACTGG GTCTATGATT CTCAATACGA TGCTTGGTTT TATATCAAAG CA - #GATGGACA   

      180                                                                      

     - GCACGCAGAG AAAGAATGGC TCCAAATTAA AGGGAAGGAC TATTATTTCA AA - #TCCGGTGG   

      240                                                                      

     - TTATCTACTG ACAAGTCAGT GGATTAATCA AGCTTATGTG AATGCTAGTG GT - #GCCAAAGT   

      300                                                                      

     - ACAGCAAGGT TGGCTTTTTG ACAAACAATA CCAATCTTGG TTTTACATCA AA - #GAAAATGG   

      360                                                                      

     - AAACTATGCT GATAAAGAAT GGATTTTCGA GAATGGTCAC TATTATTATC TA - #AAATCCGG   

      420                                                                      

     - TGGCTACATG GCAGCCAATG AATGGATTTG GGATAAGGAA TCTTGGTTTT AT - #CTCAAATT   

      480                                                                      

     - TGATGGGAAA ATGGCTGAAA AAGAATGGGT CTACGATTCT CATAGTCAAG CT - #TGGTACTA   

      540                                                                      

     - CTTCAAATCC GGTGGTTACA TGACAGCCAA TGAATGGATT TGGGATAAGG AA - #TCTTGGTT   

      600                                                                      

     - TTATCTCAAA TCTGATGGGA AAATAGCTGA AAAAGAATGG GTCTACGATT CT - #CATAGTCA   

      660                                                                      

     - AGCTTGGTAC TACTTCAAAT CCGGTGGTTA CATGACAGCC AATGAATGGA TT - #TGGGATAA   

      720                                                                      

     - GGAATCTTGG TTTTACCTCA AATCTGATGG GAAAATAGCT GAAAAAGAAT GG - #GTCTACGA   

      780                                                                      

     - TTCTCATAGT CAAGCTTGGT ACTACTTCAA ATCTGGTGGC TACATGGCGA AA - #AATGAGAC   

      840                                                                      

     - AGTAGATGGT TATCAGCTTG GAAGCGATGG TAAATGGCTT GGAGGAAAAA CT - #ACAAATGA   

      900                                                                      

     - AAATGCTGCT TACTATCAAG TAGTGCCTGT TACAGCCAAT GTTTATGATT CA - #GATGGTGA   

      960                                                                      

     - AAAGCTTTCC TATATATCGC AAGGTAGTGT CGTATGGCTA GATAAGGATA GA - #AAAAGTGA   

     1020                                                                      

     - TGACAAGCGC TTGGCTATTA CTATTTCTGG TTTGTCAGGC TATATGAAAA CA - #GAAGATTT   

     1080                                                                      

     - ACAAGCGCTA GATGCTAGTA AGGACTTTAT CCCTTATTAT GAGAGTGATG GC - #CACCGTTT   

     1140                                                                      

     - TTATCACTAT GTGGCTCAGA ATGCTAGTAT CCCAGTAGCT TCTCATCTTT CT - #GATATGGA   

     1200                                                                      

     - AGTAGGCAAG AAATATTATT CGGCAGATGG CCTGCATTTT GATGGTTTTA AG - #CTTGAGAA   

     1260                                                                      

     - TCCCTTCCTT TTCAAAGATT TAACAGAGGC TACAAACTAC AGTGCTGAAG AA - #TTGGATAA   

     1320                                                                      

     - GGTATTTAGT TTGCTAAACA TTAACAATAG CCTTTTGGAG AACAAGGGCG CT - #ACTTTTAA   

     1380                                                                      

     - GGAAGCCGAA GAACATTACC ATATCAATGC TCTTTATCTC CTTGCCCATA GT - #GCCCTAGA   

     1440                                                                      

     - AAGTAACTGG GGAAGAAGTA AAATTGCCAA AGATAAGAAT AATTTCTTTG GC - #ATTACAGC   

     1500                                                                      

     - CTATGATACG ACCCCTTACC TTTCTGCTAA GACATTTGAT GATGTGGATA AG - #GGAATTTT   

     1560                                                                      

     - AGGTGCAACC AAGTGGATTA AGGAAAATTA TATCGATAGG GGAAGAACTT TC - #CTTGGAAA   

     1620                                                                      

     - CAAGGCTTCT GGTATGAATG TGGAATATGC TTCAGACCCT TATTGGGGCG AA - #AAAATTGC   

     1680                                                                      

     #           1708   ATCA ATGAGAAG                                          

     - (2) INFORMATION FOR SEQ ID NO:154:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 569 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:154:                           

     - Ala Lys Ser Glu Trp Val Glu Asp Lys Gly Al - #a Phe Tyr Tyr Leu Asp     

     #                15                                                       

     - Gln Asp Gly Lys Met Lys Arg Asn Ala Trp Va - #l Gly Thr Ser Tyr Val     

     #            30                                                           

     - Gly Ala Thr Gly Ala Lys Val Ile Glu Asp Tr - #p Val Tyr Asp Ser Gln     

     #        45                                                               

     - Tyr Asp Ala Trp Phe Tyr Ile Lys Ala Asp Gl - #y Gln His Ala Glu Lys     

     #    60                                                                   

     - Glu Trp Leu Gln Ile Lys Gly Lys Asp Tyr Ty - #r Phe Lys Ser Gly Gly     

     #80                                                                       

     - Tyr Leu Leu Thr Ser Gln Trp Ile Asn Gln Al - #a Tyr Val Asn Ala Ser     

     #                95                                                       

     - Gly Ala Lys Val Gln Gln Gly Trp Leu Phe As - #p Lys Gln Tyr Gln Ser     

     #           110                                                           

     - Trp Phe Tyr Ile Lys Glu Asn Gly Asn Tyr Al - #a Asp Lys Glu Trp Ile     

     #       125                                                               

     - Phe Glu Asn Gly His Tyr Tyr Tyr Leu Lys Se - #r Gly Gly Tyr Met Ala     

     #   140                                                                   

     - Ala Asn Glu Trp Ile Trp Asp Lys Glu Ser Tr - #p Phe Tyr Leu Lys Phe     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Asp Gly Lys Met Ala Glu Lys Glu Trp Val Ty - #r Asp Ser His Ser Gln     

     #               175                                                       

     - Ala Trp Tyr Tyr Phe Lys Ser Gly Gly Tyr Me - #t Thr Ala Asn Glu Trp     

     #           190                                                           

     - Ile Trp Asp Lys Glu Ser Trp Phe Tyr Leu Ly - #s Ser Asp Gly Lys Ile     

     #       205                                                               

     - Ala Glu Lys Glu Trp Val Tyr Asp Ser His Se - #r Gln Ala Trp Tyr Tyr     

     #   220                                                                   

     - Phe Lys Ser Gly Gly Tyr Met Thr Ala Asn Gl - #u Trp Ile Trp Asp Lys     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Glu Ser Trp Phe Tyr Leu Lys Ser Asp Gly Ly - #s Ile Ala Glu Lys Glu     

     #               255                                                       

     - Trp Val Tyr Asp Ser His Ser Gln Ala Trp Ty - #r Tyr Phe Lys Ser Gly     

     #           270                                                           

     - Gly Tyr Met Ala Lys Asn Glu Thr Val Asp Gl - #y Tyr Gln Leu Gly Ser     

     #       285                                                               

     - Asp Gly Lys Trp Leu Gly Gly Lys Thr Thr As - #n Glu Asn Ala Ala Tyr     

     #   300                                                                   

     - Tyr Gln Val Val Pro Val Thr Ala Asn Val Ty - #r Asp Ser Asp Gly Glu     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Lys Leu Ser Tyr Ile Ser Gln Gly Ser Val Va - #l Trp Leu Asp Lys Asp     

     #               335                                                       

     - Arg Lys Ser Asp Asp Lys Arg Leu Ala Ile Th - #r Ile Ser Gly Leu Ser     

     #           350                                                           

     - Gly Tyr Met Lys Thr Glu Asp Leu Gln Ala Le - #u Asp Ala Ser Lys Asp     

     #       365                                                               

     - Phe Ile Pro Tyr Tyr Glu Ser Asp Gly His Ar - #g Phe Tyr His Tyr Val     

     #   380                                                                   

     - Ala Gln Asn Ala Ser Ile Pro Val Ala Ser Hi - #s Leu Ser Asp Met Glu     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Val Gly Lys Lys Tyr Tyr Ser Ala Asp Gly Le - #u His Phe Asp Gly Phe     

     #               415                                                       

     - Lys Leu Glu Asn Pro Phe Leu Phe Lys Asp Le - #u Thr Glu Ala Thr Asn     

     #           430                                                           

     - Tyr Ser Ala Glu Glu Leu Asp Lys Val Phe Se - #r Leu Leu Asn Ile Asn     

     #       445                                                               

     - Asn Ser Leu Leu Glu Asn Lys Gly Ala Thr Ph - #e Lys Glu Ala Glu Glu     

     #   460                                                                   

     - His Tyr His Ile Asn Ala Leu Tyr Leu Leu Al - #a His Ser Ala Leu Glu     

     465                 4 - #70                 4 - #75                 4 -   

     #80                                                                       

     - Ser Asn Trp Gly Arg Ser Lys Ile Ala Lys As - #p Lys Asn Asn Phe Phe     

     #               495                                                       

     - Gly Ile Thr Ala Tyr Asp Thr Thr Pro Tyr Le - #u Ser Ala Lys Thr Phe     

     #           510                                                           

     - Asp Asp Val Asp Lys Gly Ile Leu Gly Ala Th - #r Lys Trp Ile Lys Glu     

     #       525                                                               

     - Asn Tyr Ile Asp Arg Gly Arg Thr Phe Leu Gl - #y Asn Lys Ala Ser Gly     

     #   540                                                                   

     - Met Asn Val Glu Tyr Ala Ser Asp Pro Tyr Tr - #p Gly Glu Lys Ile Ala     

     545                 5 - #50                 5 - #55                 5 -   

     #60                                                                       

     - Ser Val Met Met Lys Ile Asn Glu Lys                                     

                     565                                                       

     - (2) INFORMATION FOR SEQ ID NO: 155:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 946 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #155: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - ATTTGCAGAT GATTCTGAAG GATGGCAGTT TGTCCAAGAA AATGGTAGAA CC - #TACTACAA   

       60                                                                      

     - AAAGGGGGAT CTAAAAGAAA CCTACTGGAG AGTGATAGAT GGGAAGTACT AT - #TATTTTGA   

      120                                                                      

     - TCCTTTATCC GGAGAGATGG TTGTCGGCTG GCAATATATA CCTGCTCCAC AC - #AAGGGGGT   

      180                                                                      

     - TACGATTGGT CCTTCTCCAA GAATAGAGAT TGCTCTTAGA CCAGATTGGT TT - #TATTTTGG   

      240                                                                      

     - TCAAGATGGT GTATTACAAG AATTTGTTGG CAAGCAAGTT TTAGAAGCAA AA - #ACTGCTAC   

      300                                                                      

     - GAATACCAAC AAACATCATG GGGAAGAATA TGATAGCCAA GCAGAGAAAC GA - #GTCTATTA   

      360                                                                      

     - TTTTGAAGAT CAGCGTAGTT ATCATACTTT AAAAACTGGT TGGATTTATG AA - #GAGGGTCA   

      420                                                                      

     - TTGGTATTAT TTACAGAAGG ATGGTGGCTT TGATTCGCGC ATCAACAGAT TG - #ACGGTTGG   

      480                                                                      

     - AGAGCTAGCA CGTGGTTGGG TTAAGGATTA CCCTCTTACG TATGATGAAG AG - #AAGCTAAA   

      540                                                                      

     - AGCAGCTCCA TGGTACTATC TAAATCCAGC AACTGGCATT ATGCAAACAG GT - #TGGCAATA   

      600                                                                      

     - TCTAGGTAAT AGATGGTACT ACCTCCATTC GTCAGGAGCT ATGGCAACTG GC - #TGGTATAA   

      660                                                                      

     - GGAAGGCTCA ACTTGGTACT ATCTAGATGC TGAAAATGGT GATATGAGAA CT - #GGCTGGCA   

      720                                                                      

     - AAACCTTGGG AACAAATGGT ACTATCTCCG TTCATCAGGA GCTATGGCAA CT - #GGTTGGTA   

      780                                                                      

     - TCAGGAAAGT TCGACTTGGT ACTATCTAAA TGCAAGTAAT GGAGATATGA AA - #ACAGGCTG   

      840                                                                      

     - GTTCCAAGTC AATGGTAACT GGTACTATGC CTATGATTCA GGTGCTTTAG CT - #GTTAATAC   

      900                                                                      

     #                946ACT TAAACTATAA TGGTGAATGG GTTAAG                      

     - (2) INFORMATION FOR SEQ ID NO:156:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 316 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:156:                           

     - Val Phe Ala Asp Asp Ser Glu Gly Trp Gln Ph - #e Val Gln Glu Asn Gly     

     #                15                                                       

     - Arg Thr Tyr Tyr Lys Lys Gly Asp Leu Lys Gl - #u Thr Tyr Trp Arg Val     

     #            30                                                           

     - Ile Asp Gly Lys Tyr Tyr Tyr Phe Asp Pro Le - #u Ser Gly Glu Met Val     

     #        45                                                               

     - Val Gly Trp Gln Tyr Ile Pro Ala Pro His Ly - #s Gly Val Thr Ile Gly     

     #    60                                                                   

     - Pro Ser Pro Arg Ile Glu Ile Ala Leu Arg Pr - #o Asp Trp Phe Tyr Phe     

     #80                                                                       

     - Gly Gln Asp Gly Val Leu Gln Glu Phe Val Gl - #y Lys Gln Val Leu Glu     

     #                95                                                       

     - Ala Lys Thr Ala Thr Asn Thr Asn Lys His Hi - #s Gly Glu Glu Tyr Asp     

     #           110                                                           

     - Ser Gln Ala Glu Lys Arg Val Tyr Tyr Phe Gl - #u Asp Gln Arg Ser Tyr     

     #       125                                                               

     - His Thr Leu Lys Thr Gly Trp Ile Tyr Glu Gl - #u Gly His Trp Tyr Tyr     

     #   140                                                                   

     - Leu Gln Lys Asp Gly Gly Phe Asp Ser Arg Il - #e Asn Arg Leu Thr Val     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Gly Glu Leu Ala Arg Gly Trp Val Lys Asp Ty - #r Pro Leu Thr Tyr Asp     

     #               175                                                       

     - Glu Glu Lys Leu Lys Ala Ala Pro Trp Tyr Ty - #r Leu Asn Pro Ala Thr     

     #           190                                                           

     - Gly Ile Met Gln Thr Gly Trp Gln Tyr Leu Gl - #y Asn Arg Trp Tyr Tyr     

     #       205                                                               

     - Leu His Ser Ser Gly Ala Met Ala Thr Gly Tr - #p Tyr Lys Glu Gly Ser     

     #   220                                                                   

     - Thr Trp Tyr Tyr Leu Asp Ala Glu Asn Gly As - #p Met Arg Thr Gly Trp     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Gln Asn Leu Gly Asn Lys Trp Tyr Tyr Leu Ar - #g Ser Ser Gly Ala Met     

     #               255                                                       

     - Ala Thr Gly Trp Tyr Gln Glu Ser Ser Thr Tr - #p Tyr Tyr Leu Asn Ala     

     #           270                                                           

     - Ser Asn Gly Asp Met Lys Thr Gly Trp Phe Gl - #n Val Asn Gly Asn Trp     

     #       285                                                               

     - Tyr Tyr Ala Tyr Asp Ser Gly Ala Leu Ala Va - #l Asn Thr Thr Val Gly     

     #   300                                                                   

     - Gly Tyr Tyr Leu Asn Tyr Asn Gly Glu Trp Va - #l Lys                     

     305                 3 - #10                 3 - #15                       

     - (2) INFORMATION FOR SEQ ID NO: 157:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1415 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #157: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TGTCGCTGCA AATGAAACTG AAGTAGCAAA AACTTCGCAG GATACAACGA CA - #GCTTCAAG   

       60                                                                      

     - TAGTTCAGAG CAAAATCAGT CTTCTAATAA AACGCAAACG AGCGCAGAAG TA - #CAGACTAA   

      120                                                                      

     - TGCTGCTGCC CACTGGGATG GGGATTATTA TGTAAAGGAT GATGGTTCTA AA - #GCTCAAAG   

      180                                                                      

     - TGAATGGATT TTTGACAACT ACTATAAGGC TTGGTTTTAT ATTAATTCAG AT - #GGTCGTTA   

      240                                                                      

     - CTCGCAGAAT GAATGGCATG GAAATTACTA CCTGAAATCA GGTGGATATA TG - #GCCCAAAA   

      300                                                                      

     - CGAGTGGATC TATGACAGTA ATTACAAGAG TTGGTTTTAT CTCAAGTCAG AT - #GGGGCTTA   

      360                                                                      

     - TGCTCATCAA GAATGGCAAT TGATTGGAAA TAAGTGGTAC TACTTCAAGA AG - #TGGGGTTA   

      420                                                                      

     - CATGGCTAAA AGCCAATGGC AAGGAAGTTA TTTCTTGAAT GGTCAAGGAG CT - #ATGATGCA   

      480                                                                      

     - AAATGAATGG CTSCTATGAT CCAGCCTATT CTGCTTATTT TTATCTAAAA TC - #CGATGGAA   

      540                                                                      

     - CTTATGCTAA CCAAGAGTGG CAAAAAGTGG GCGGCAAATG GTACTATTTC AA - #GAAGTGGG   

      600                                                                      

     - GCTATATGGC TCGGAATGAG TGGCAAGGCA ACTACTATTT GACTGGAAGT GG - #TGCCATGG   

      660                                                                      

     - CGACTGACGA AGTGATTATG GATGGTACTC GCTATATCTT TGCGGCCTCT GG - #TGAGCTCA   

      720                                                                      

     - AAGAAAAAAA AGATTTGAAT GTCGGCTGGG TTCACAGAGA TGGTAAGCGC TA - #TTTCTTTA   

      780                                                                      

     - ATAATAGAGA AGAACAAGTG GGAACCGAAC ATGCTAAGAA AGTCATTGAT AT - #TAGTGAGC   

      840                                                                      

     - ACAATGGTCG TATCAATGAT TGGAAAAAGG TTATTGATGA GAACGAAGTG GA - #TGGTGTCA   

      900                                                                      

     - TTGTTCGTCT AGGTTATAGC GGTAAAGAAG ACAAGGAATT GGCGCATAAC AT - #TAAGGAGT   

      960                                                                      

     - TAAACCGTCT GGGAATTCCT TATGGTGTCT ATCTCTATAC CTATGCTGAA AA - #TGAGACCG   

     1020                                                                      

     - ATGCTGAGAG TGACGCTAAA CAGACCATTG AACTTATAAA GAAATACAAT AT - #GAACCTGT   

     1080                                                                      

     - CTTACCCTAT CTATTATGAT GTTGAGAATT GGGAATATGT AAATAAGAGC AA - #GAGAGCTC   

     1140                                                                      

     - CAAGTGATAC AGGCACTTGG GTTAAAATCA TCAACAAGTA CATGGACACG AT - #GAAGCAGG   

     1200                                                                      

     - CGGGTTATCA AAATGTGTAT GTCTATAGCT ATCGTAGTTT ATTACAGACG CG - #TTTAAAAC   

     1260                                                                      

     - ACCCAGATAT TTTAAAACAT GTAAACTGGG TAGCGGCCTA TACGAATGCT TT - #AGAATGGG   

     1320                                                                      

     - AAAACCCTCA TTATTCAGGA AAAAAAGGTT GGCAATATAC CTCTTCTGAA TA - #CATGAAAG   

     1380                                                                      

     #     1415         AGAT GTCAGCGTTT GGTAT                                  

     - (2) INFORMATION FOR SEQ ID NO:158:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 471 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:158:                           

     - Val Ala Ala Asn Glu Thr Glu Val Ala Lys Th - #r Ser Gln Asp Thr Thr     

     #                15                                                       

     - Thr Ala Ser Ser Ser Ser Glu Gln Asn Gln Se - #r Ser Asn Lys Thr Gln     

     #            30                                                           

     - Thr Ser Ala Glu Val Gln Thr Asn Ala Ala Al - #a His Trp Asp Gly Asp     

     #        45                                                               

     - Tyr Tyr Val Lys Asp Asp Gly Ser Lys Ala Gl - #n Ser Glu Trp Ile Phe     

     #    60                                                                   

     - Asp Asn Tyr Tyr Lys Ala Trp Phe Tyr Ile As - #n Ser Asp Gly Arg Tyr     

     #80                                                                       

     - Ser Gln Asn Glu Trp His Gly Asn Tyr Tyr Le - #u Lys Ser Gly Gly Tyr     

     #                95                                                       

     - Met Ala Gln Asn Glu Trp Ile Tyr Asp Ser As - #n Tyr Lys Ser Trp Phe     

     #           110                                                           

     - Tyr Leu Lys Ser Asp Gly Ala Tyr Ala His Gl - #n Glu Trp Gln Leu Ile     

     #       125                                                               

     - Gly Asn Lys Trp Tyr Tyr Phe Lys Lys Trp Gl - #y Tyr Met Ala Lys Ser     

     #   140                                                                   

     - Gln Trp Gln Gly Ser Tyr Phe Leu Asn Gly Gl - #n Gly Ala Met Met Gln     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Asn Glu Trp Leu Tyr Asp Pro Ala Tyr Ser Al - #a Tyr Phe Tyr Leu Lys     

     #               175                                                       

     - Ser Asp Gly Thr Tyr Ala Asn Gln Glu Trp Gl - #n Lys Val Gly Gly Lys     

     #           190                                                           

     - Trp Tyr Tyr Phe Lys Lys Trp Gly Tyr Met Al - #a Arg Asn Glu Trp Gln     

     #       205                                                               

     - Gly Asn Tyr Tyr Leu Thr Gly Ser Gly Ala Me - #t Ala Thr Asp Glu Val     

     #   220                                                                   

     - Ile Met Asp Gly Thr Arg Tyr Ile Phe Ala Al - #a Ser Gly Glu Leu Lys     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Glu Lys Lys Asp Leu Asn Val Gly Trp Val Hi - #s Arg Asp Gly Lys Arg     

     #               255                                                       

     - Tyr Phe Phe Asn Asn Arg Glu Glu Gln Val Gl - #y Thr Glu His Ala Lys     

     #           270                                                           

     - Lys Val Ile Asp Ile Ser Glu His Asn Gly Ar - #g Ile Asn Asp Trp Lys     

     #       285                                                               

     - Lys Val Ile Asp Glu Asn Glu Val Asp Gly Va - #l Ile Val Arg Leu Gly     

     #   300                                                                   

     - Tyr Ser Gly Lys Glu Asp Lys Glu Leu Ala Hi - #s Asn Ile Lys Glu Leu     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Asn Arg Leu Gly Ile Pro Tyr Gly Val Tyr Le - #u Tyr Thr Tyr Ala Glu     

     #               335                                                       

     - Asn Glu Thr Asp Ala Glu Ser Asp Ala Lys Gl - #n Thr Ile Glu Leu Ile     

     #           350                                                           

     - Lys Lys Tyr Asn Met Asn Leu Ser Tyr Pro Il - #e Tyr Tyr Asp Val Glu     

     #       365                                                               

     - Asn Trp Glu Tyr Val Asn Lys Ser Lys Arg Al - #a Pro Ser Asp Thr Gly     

     #   380                                                                   

     - Thr Trp Val Lys Ile Ile Asn Lys Tyr Met As - #p Thr Met Lys Gln Ala     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Gly Tyr Gln Asn Val Tyr Val Tyr Ser Tyr Ar - #g Ser Leu Leu Gln Thr     

     #               415                                                       

     - Arg Leu Lys His Pro Asp Ile Leu Lys His Va - #l Asn Trp Val Ala Ala     

     #           430                                                           

     - Tyr Thr Asn Ala Leu Glu Trp Glu Asn Pro Hi - #s Tyr Ser Gly Lys Lys     

     #       445                                                               

     - Gly Trp Gln Tyr Thr Ser Ser Glu Tyr Met Ly - #s Gly Ile Gln Gly Arg     

     #   460                                                                   

     - Val Asp Val Ser Val Trp Tyr                                             

     465                 4 - #70                                               

     - (2) INFORMATION FOR SEQ ID NO: 159:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1924 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #159: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TACGTCTCAG CCTACTTTTG TAAGAGCAGA AGAATCTCCA CAAGTTGTCG AA - #AAATCTTC   

       60                                                                      

     - ATTAGAGAAG AAATATGAGG AAGCAAAAGC AAAAGCTGAT ACTGCCAAGA AA - #GATTACGA   

      120                                                                      

     - AACGGCTAAA AAGAAAGCAG AAGACGCTCA GAAAAAGTAT GAAGATGATC AG - #AAGAGAAC   

      180                                                                      

     - TGAGGAGAAA GCTCGAAAAG AAGCAGAAGC ATCTCAAAAA TTGAATGATG TG - #GCGCTTGT   

      240                                                                      

     - TGTTCAAAAT GCATATAAAG AGTACCGAGA AGTTCAAAAT CAACGTAGTA AA - #TATAAATC   

      300                                                                      

     - TGACGCTGAA TATCAGAAAA AATTAACAGA GGTCGACTCT AAAATAGAGA AG - #GCTAGGAA   

      360                                                                      

     - AGAGCAACAG GACTTGCAAA ATAAATTTAA TGAAGTAAGA GCAGTTGTAG TT - #CCTGAACC   

      420                                                                      

     - AAATGCGTTG GCTGAGACTA AGAAAAAAGC AGAAGAAGCT AAAGCAGAAG AA - #AAAGTAGC   

      480                                                                      

     - TAAGAGAAAA TATGATTATG CAACTCTAAA GGTAGCACTA GCGAAGAAAG AA - #GTAGAGGC   

      540                                                                      

     - TAAGGAACTT GAAATTGAAA AACTTCAATA TGAAATTTCT ACTTTGGAAC AA - #GAAGTTGC   

      600                                                                      

     - TACTGCTCAA CATCAAGTAG ATAATTTGAA AAAACTTCTT GCTGGTGCGG AT - #CCTGATGA   

      660                                                                      

     - TGGCACAGAA GTTATAGAAG CTAAATTAAA AAAAGGAGAA GCTGAGCTAA AC - #GCTAAACA   

      720                                                                      

     - AGCTGAGTTA GCAAAAAAAC AAACAGAACT TGAAAAACTT CTTGACAGCC TT - #GATCCTGA   

      780                                                                      

     - AGGTAAGACT CAGGATGAAT TAGATAAAGA AGCAGAAGAA GCTGAGTTGG AT - #AAAAAAGC   

      840                                                                      

     - TGATGAACTT CAAAATAAAG TTGCTGATTT AGAAAAAGAA ATTAGTAACC TT - #GAAATATT   

      900                                                                      

     - ACTTGGAGGG GCTGATNCTG AAGATGATAC TGCTGCTCTT CAAAATAAAT TA - #GCTACTAA   

      960                                                                      

     - AAAAGCTGAA TTGGAAAAAA CTCAAAAAGA ATTAGATGCA GCTCTTAATG AG - #TTAGGCCC   

     1020                                                                      

     - TGATGGAGAT GAAGAAGAAA CTCCAGCGCC GGCTCCTCAA CCAGAGCAAC CA - #GCTCCTGC   

     1080                                                                      

     - ACCAAAACCA GAGCAACCAG CTCCAGCTCC AAAACCAGAG CAACCAGCTC CT - #GCACCAAA   

     1140                                                                      

     - ACCAGAGCAA CCAGCTCCAG CTCCAAAACC AGAGCAACCA GCTCCAGCTC CA - #AAACCAGA   

     1200                                                                      

     - GCAACCAGCT AAGCCGGAGA AACCAGCTGA AGAGCCTACT CAACCAGAAA AA - #CCAGCCAC   

     1260                                                                      

     - TCCAAAAACA GGCTGGAAAC AAGAAAACGG TATGTGGTAT TTCTACAATA CT - #GATGGTTC   

     1320                                                                      

     - AATGGCAATA GGTTGGCTCC AAAACAACGG TTCATGGTAC TACCTAAACG CT - #AACGGCGC   

     1380                                                                      

     - TATGGCAACA GGTTGGGTGA AAGATGGAGA TACCTGGTAC TATCTTGAAG CA - #TCAGGTGC   

     1440                                                                      

     - TATGAAAGCA AGCCAATGGT TCAAAGTATC AGATAAATGG TACTATGTCA AC - #AGCAATGG   

     1500                                                                      

     - CGCTATGGCG ACAGGCTGGC TCCAATACAA TGGCTCATGG TACTACCTCA AC - #GCTAATGG   

     1560                                                                      

     - TGATATGGCG ACAGGATGGC TCCAATACAA CGGTTCATGG TATTACCTCA AC - #GCTAATGG   

     1620                                                                      

     - TGATATGGCG ACAGGATGGG CTAAAGTCAA CGGTTCATGG TACTACCTAA AC - #GCTAACGG   

     1680                                                                      

     - TGCTATGGCT ACAGGTTGGG CTAAAGTCAA CGGTTCATGG TACTACCTAA AC - #GCTAACGG   

     1740                                                                      

     - TTCAATGGCA ACAGGTTGGG TGAAAGATGG AGATACCTGG TACTATCTTG AA - #GCATCAGG   

     1800                                                                      

     - TGCTATGAAA GCAAGCCAAT GGTTCAAAGT ATCAGATAAA TGGTACTATG TC - #AATGGCTT   

     1860                                                                      

     - AGGTGCCCTT GCAGTCAACA CAACTGTAGA TGGCTATAAA GTCAATGCCA AT - #GGTGAATG   

     1920                                                                      

     #           1924                                                          

     - (2) INFORMATION FOR SEQ ID NO:160:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 641 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:160:                           

     - Thr Ser Gln Pro Thr Phe Val Arg Ala Glu Gl - #u Ser Pro Gln Val Val     

     #                15                                                       

     - Glu Lys Ser Ser Leu Glu Lys Lys Tyr Glu Gl - #u Ala Lys Ala Lys Ala     

     #            30                                                           

     - Asp Thr Ala Lys Lys Asp Tyr Glu Thr Ala Ly - #s Lys Lys Ala Glu Asp     

     #        45                                                               

     - Ala Gln Lys Lys Tyr Glu Asp Asp Gln Lys Ar - #g Thr Glu Glu Lys Ala     

     #    60                                                                   

     - Arg Lys Glu Ala Glu Ala Ser Gln Lys Leu As - #n Asp Val Ala Leu Val     

     #80                                                                       

     - Val Gln Asn Ala Tyr Lys Glu Tyr Arg Glu Va - #l Gln Asn Gln Arg Ser     

     #                95                                                       

     - Lys Tyr Lys Ser Asp Ala Glu Tyr Gln Lys Ly - #s Leu Thr Glu Val Asp     

     #           110                                                           

     - Ser Lys Ile Glu Lys Ala Arg Lys Glu Gln Gl - #n Asp Leu Gln Asn Lys     

     #       125                                                               

     - Phe Asn Glu Val Arg Ala Val Val Val Pro Gl - #u Pro Asn Ala Leu Ala     

     #   140                                                                   

     - Glu Thr Lys Lys Lys Ala Glu Glu Ala Lys Al - #a Glu Glu Lys Val Ala     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Lys Arg Lys Tyr Asp Tyr Ala Thr Leu Lys Va - #l Ala Leu Ala Lys Lys     

     #               175                                                       

     - Glu Val Glu Ala Lys Glu Leu Glu Ile Glu Ly - #s Leu Gln Tyr Glu Ile     

     #           190                                                           

     - Ser Thr Leu Glu Gln Glu Val Ala Thr Ala Gl - #n His Gln Val Asp Asn     

     #       205                                                               

     - Leu Lys Lys Leu Leu Ala Gly Ala Asp Pro As - #p Asp Gly Thr Glu Val     

     #   220                                                                   

     - Ile Glu Ala Lys Leu Lys Lys Gly Glu Ala Gl - #u Leu Asn Ala Lys Gln     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Ala Glu Leu Ala Lys Lys Gln Thr Glu Leu Gl - #u Lys Leu Leu Asp Ser     

     #               255                                                       

     - Leu Asp Pro Glu Gly Lys Thr Gln Asp Glu Le - #u Asp Lys Glu Ala Glu     

     #           270                                                           

     - Glu Ala Glu Leu Asp Lys Lys Ala Asp Glu Le - #u Gln Asn Lys Val Ala     

     #       285                                                               

     - Asp Leu Glu Lys Glu Ile Ser Asn Leu Glu Il - #e Leu Leu Gly Gly Ala     

     #   300                                                                   

     - Asp Xaa Glu Asp Asp Thr Ala Ala Leu Gln As - #n Lys Leu Ala Thr Lys     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Lys Ala Glu Leu Glu Lys Thr Gln Lys Glu Le - #u Asp Ala Ala Leu Asn     

     #               335                                                       

     - Glu Leu Gly Pro Asp Gly Asp Glu Glu Glu Th - #r Pro Ala Pro Ala Pro     

     #           350                                                           

     - Gln Pro Glu Gln Pro Ala Pro Ala Pro Lys Pr - #o Glu Gln Pro Ala Pro     

     #       365                                                               

     - Ala Pro Lys Pro Glu Gln Pro Ala Pro Ala Pr - #o Lys Pro Glu Gln Pro     

     #   380                                                                   

     - Ala Pro Ala Pro Lys Pro Glu Gln Pro Ala Pr - #o Ala Pro Lys Pro Glu     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Gln Pro Ala Lys Pro Glu Lys Pro Ala Glu Gl - #u Pro Thr Gln Pro Glu     

     #               415                                                       

     - Lys Pro Ala Thr Pro Lys Thr Gly Trp Lys Gl - #n Glu Asn Gly Met Trp     

     #           430                                                           

     - Tyr Phe Tyr Asn Thr Asp Gly Ser Met Ala Il - #e Gly Trp Leu Gln Asn     

     #       445                                                               

     - Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gl - #y Ala Met Ala Thr Gly     

     #   460                                                                   

     - Trp Val Lys Asp Gly Asp Thr Trp Tyr Tyr Le - #u Glu Ala Ser Gly Ala     

     465                 4 - #70                 4 - #75                 4 -   

     #80                                                                       

     - Met Lys Ala Ser Gln Trp Phe Lys Val Ser As - #p Lys Trp Tyr Tyr Val     

     #               495                                                       

     - Asn Ser Asn Gly Ala Met Ala Thr Gly Trp Le - #u Gln Tyr Asn Gly Ser     

     #           510                                                           

     - Trp Tyr Tyr Leu Asn Ala Asn Gly Asp Met Al - #a Thr Gly Trp Leu Gln     

     #       525                                                               

     - Tyr Asn Gly Ser Trp Tyr Tyr Leu Asn Ala As - #n Gly Asp Met Ala Thr     

     #   540                                                                   

     - Gly Trp Ala Lys Val Asn Gly Ser Trp Tyr Ty - #r Leu Asn Ala Asn Gly     

     545                 5 - #50                 5 - #55                 5 -   

     #60                                                                       

     - Ala Met Ala Thr Gly Trp Ala Lys Val Asn Gl - #y Ser Trp Tyr Tyr Leu     

     #               575                                                       

     - Asn Ala Asn Gly Ser Met Ala Thr Gly Trp Va - #l Lys Asp Gly Asp Thr     

     #           590                                                           

     - Trp Tyr Tyr Leu Glu Ala Ser Gly Ala Met Ly - #s Ala Ser Gln Trp Phe     

     #       605                                                               

     - Lys Val Ser Asp Lys Trp Tyr Tyr Val Asn Gl - #y Leu Gly Ala Leu Ala     

     #   620                                                                   

     - Val Asn Thr Thr Val Asp Gly Tyr Lys Val As - #n Ala Asn Gly Glu Trp     

     625                 6 - #30                 6 - #35                 6 -   

     #40                                                                       

     - Val                                                                     

     - (2) INFORMATION FOR SEQ ID NO: 161:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 670 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #161: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TGGACAGGTG AAAGGTCATG CTACATTTGT GAAATCCATG ACAACTGAAA TG - #TACCAAGA   

       60                                                                      

     - ACAACAGAAC CATTCTCTCG CCTACAATCA ACGCTTGGNT TCGCAAAATC GC - #ATTGTAGA   

      120                                                                      

     - TCCTTTTTTG GCGGAGGGAT ATGAGGTCAA TTACCAAGTG TCTGACGACC CT - #GATGCAGT   

      180                                                                      

     - CTATGGTTAC TTGTCTATTC CAAGTTTGGA AATCATGGAG CCGGTTTATT TG - #GGAGCAGA   

      240                                                                      

     - TTATCATCAT TTAGGGATGG GCTTGGCTCA TGTGGATGGT ACACCGCTGC CT - #CTGGATGG   

      300                                                                      

     - TACAGGGATT CGCTCAGTGA TTGCTGGGCA CCGTGCAGAG CCAAGCCATG TC - #TTTTTCCG   

      360                                                                      

     - CCATTTGGAT CAGCTAAAAG TTGGAGATGC TCTTTATTAT GATAATGGCC AG - #GAAATTGT   

      420                                                                      

     - AGAATATCAG ATGATGGACA CAGAGATTAT TTTACCGTCG GAATGGGAAA AA - #TTAGAATC   

      480                                                                      

     - GGTTAGCTCT AAAAATATCA TGACCTTGAT AACCTGCGAT CCGATTCCTA CC - #TTTAATAA   

      540                                                                      

     - ACGCTTATTA GTGAATTTTG AACGAGTCGC TGTTTATCAA AAATCAGATC CA - #CAAACAGC   

      600                                                                      

     - TGCAGTTGCG AGGGTTGCTT TTACGAAAGA AGGACAATCT GTATCGCGTG TT - #GCAACCTC   

      660                                                                      

     #       670                                                               

     - (2) INFORMATION FOR SEQ ID NO:162:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 223 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:162:                           

     - Gly Gln Val Lys Gly His Ala Thr Phe Val Ly - #s Ser Met Thr Thr Glu     

     #                15                                                       

     - Met Tyr Gln Glu Gln Gln Asn His Ser Leu Al - #a Tyr Asn Gln Arg Leu     

     #            30                                                           

     - Xaa Ser Gln Asn Arg Ile Val Asp Pro Phe Le - #u Ala Glu Gly Tyr Glu     

     #        45                                                               

     - Val Asn Tyr Gln Val Ser Asp Asp Pro Asp Al - #a Val Tyr Gly Tyr Leu     

     #    60                                                                   

     - Ser Ile Pro Ser Leu Glu Ile Met Glu Pro Va - #l Tyr Leu Gly Ala Asp     

     #80                                                                       

     - Tyr His His Leu Gly Met Gly Leu Ala His Va - #l Asp Gly Thr Pro Leu     

     #                95                                                       

     - Pro Leu Asp Gly Thr Gly Ile Arg Ser Val Il - #e Ala Gly His Arg Ala     

     #           110                                                           

     - Glu Pro Ser His Val Phe Phe Arg His Leu As - #p Gln Leu Lys Val Gly     

     #       125                                                               

     - Asp Ala Leu Tyr Tyr Asp Asn Gly Gln Glu Il - #e Val Glu Tyr Gln Met     

     #   140                                                                   

     - Met Asp Thr Glu Ile Ile Leu Pro Ser Glu Tr - #p Glu Lys Leu Glu Ser     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Val Ser Ser Lys Asn Ile Met Thr Leu Ile Th - #r Cys Asp Pro Ile Pro     

     #               175                                                       

     - Thr Phe Asn Lys Arg Leu Leu Val Asn Phe Gl - #u Arg Val Ala Val Tyr     

     #           190                                                           

     - Gln Lys Ser Asp Pro Gln Thr Ala Ala Val Al - #a Arg Val Ala Phe Thr     

     #       205                                                               

     - Lys Glu Gly Gln Ser Val Ser Arg Val Ala Th - #r Ser Gln Trp Leu         

     #   220                                                                   

     - (2) INFORMATION FOR SEQ ID NO: 163:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 784 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #163: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - GATTGCTCCT TTGAAGGATT TGAGAGAAAC CATGTTGGAA ATTGCTTCTG GT - #GCTCAAAA   

       60                                                                      

     - TCTTCGTGCC AAGGAAGTTG GTGCCTATGA ACTGAGAGAA GTAACTCGCC AA - #TTTAATGC   

      120                                                                      

     - TATGTTGGAT CAGATTGATC AGTTGATGGT AGCTATTCGT AGCCAGGAAG AA - #ACGACCCG   

      180                                                                      

     - TCAGTACCAA CTTCAAGCCC TTTCGAGCCA GATTAATCCA CATTTCCTCT AT - #AACACTTT   

      240                                                                      

     - GGACACCATC ATCTGGATGG CTGAATTTCA TGATAGTCAG CGAGTGGTGC AG - #GTGACCAA   

      300                                                                      

     - GTCCTTGGCA ACCTATTTCC GCTTGGCGCT CAATCAAGGC AAGGACTTGA TT - #TGTCTCTC   

      360                                                                      

     - TGACGAAATC AATCATGTCC GCCAGTATCT CTTTATCCAG AAACAACGCT AT - #GGAGATAA   

      420                                                                      

     - GCTGGAATAC GAAATTAATG AAAATGTTGC CTTTGATAAT TTAGTCTTAC CC - #AAGCTGGT   

      480                                                                      

     - CCTACAACCC CTTGTAGAAA ATGCTCTTTA CCATGGCATT AAGGAAAAGG AA - #GGTCAGGG   

      540                                                                      

     - CCATATTAAA CTTTCTGTCC AGAAACAGGA TTCGGGATTG GTCATCCGTA TT - #GAGGATGA   

      600                                                                      

     - TGGCGTTGGC TTCCAAGATG CTGGTGATAG TAGTCAAAGT CAACTCAAAC GT - #GGGGGAGT   

      660                                                                      

     - TGGTCTTCAA AATGTCGATC AACGGCTCAA ACTTCATTTT GGAGCCAATT AC - #CATATGAA   

      720                                                                      

     - GATTGATTCT AGACCCCAAA AAGGGACGAA AGTTGAAATA TATATAAATA GA - #ATAGAAAC   

      780                                                                      

     #            784                                                          

     - (2) INFORMATION FOR SEQ ID NO:164:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 261 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:164:                           

     - Ile Ala Pro Leu Lys Asp Leu Arg Glu Thr Me - #t Leu Glu Ile Ala Ser     

     #                15                                                       

     - Gly Ala Gln Asn Leu Arg Ala Lys Glu Val Gl - #y Ala Tyr Glu Leu Arg     

     #            30                                                           

     - Glu Val Thr Arg Gln Phe Asn Ala Met Leu As - #p Gln Ile Asp Gln Leu     

     #        45                                                               

     - Met Val Ala Ile Arg Ser Gln Glu Glu Thr Th - #r Arg Gln Tyr Gln Leu     

     #    60                                                                   

     - Gln Ala Leu Ser Ser Gln Ile Asn Pro His Ph - #e Leu Tyr Asn Thr Leu     

     #80                                                                       

     - Asp Thr Ile Ile Trp Met Ala Glu Phe His As - #p Ser Gln Arg Val Val     

     #                95                                                       

     - Gln Val Thr Lys Ser Leu Ala Thr Tyr Phe Ar - #g Leu Ala Leu Asn Gln     

     #           110                                                           

     - Gly Lys Asp Leu Ile Cys Leu Ser Asp Glu Il - #e Asn His Val Arg Gln     

     #       125                                                               

     - Tyr Leu Phe Ile Gln Lys Gln Arg Tyr Gly As - #p Lys Leu Glu Tyr Glu     

     #   140                                                                   

     - Ile Asn Glu Asn Val Ala Phe Asp Asn Leu Va - #l Leu Pro Lys Leu Val     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Leu Gln Pro Leu Val Glu Asn Ala Leu Tyr Hi - #s Gly Ile Lys Glu Lys     

     #               175                                                       

     - Glu Gly Gln Gly His Ile Lys Leu Ser Val Gl - #n Lys Gln Asp Ser Gly     

     #           190                                                           

     - Leu Val Ile Arg Ile Glu Asp Asp Gly Val Gl - #y Phe Gln Asp Ala Gly     

     #       205                                                               

     - Asp Ser Ser Gln Ser Gln Leu Lys Arg Gly Gl - #y Val Gly Leu Gln Asn     

     #   220                                                                   

     - Val Asp Gln Arg Leu Lys Leu His Phe Gly Al - #a Asn Tyr His Met Lys     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Ile Asp Ser Arg Pro Gln Lys Gly Thr Lys Va - #l Glu Ile Tyr Ile Asn     

     #               255                                                       

     - Arg Ile Glu Thr Ser                                                     

                 260                                                           

     - (2) INFORMATION FOR SEQ ID NO: 165:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 325 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #165: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TAGGTCATAT GGGACTTTTT TTCTACAACA AAATAGGCTC CATAATATCT AT - #AAGGGATT   

       60                                                                      

     - TACCCACTAC AAATATTATA GAGCCGAAAA TTCACATCTA ATATATGCAG AC - #TACTTTGA   

      120                                                                      

     - AATGAAATTA AAAAAATTAT TAAAGGATGA CACAAAAGTT TTTGAAAAAT CT - #ACATTCAA   

      180                                                                      

     - ATTTGTAGAA GGATATAAAA TATACCTGAC AGAATCTAAA GAATCTGGAA TT - #AAACAAAT   

      240                                                                      

     - GGACAATGTC ATAAAATATT TTGAGTTTAT TGAATCTAAA AGTATTGCTT TA - #TATTTTCA   

      300                                                                      

     #              325 CTGA TAGAT                                             

     - (2) INFORMATION FOR SEQ ID NO:166:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 108 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:166:                           

     - Arg Ser Tyr Gly Thr Phe Phe Leu Gln Gln As - #n Arg Leu His Asn Ile     

     #                15                                                       

     - Tyr Lys Gly Phe Thr His Tyr Lys Tyr Tyr Ar - #g Ala Glu Asn Ser His     

     #            30                                                           

     - Leu Ile Tyr Ala Asp Tyr Phe Glu Met Lys Le - #u Lys Lys Leu Leu Lys     

     #        45                                                               

     - Asp Asp Thr Lys Val Phe Glu Lys Ser Thr Ph - #e Lys Phe Val Glu Gly     

     #    60                                                                   

     - Tyr Lys Ile Tyr Leu Thr Glu Ser Lys Glu Se - #r Gly Ile Lys Gln Met     

     #80                                                                       

     - Asp Asn Val Ile Lys Tyr Phe Glu Phe Ile Gl - #u Ser Lys Ser Ile Ala     

     #                95                                                       

     - Leu Tyr Phe Gln Lys Arg Leu Asn Glu Leu Il - #e Asp                     

     #           105                                                           

     - (2) INFORMATION FOR SEQ ID NO: 167:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 238 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #167: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - CAACGTTGAG AATTATTTGC GAATGTGTTT GGATAGCATT CAGAATCAGA CG - #TATCAAAA   

       60                                                                      

     - TTTTGAGTGT TTATTAATCA ATGATGGCTC TCCAGATCAT TCATCCAAAA TA - #TGTGAAGA   

      120                                                                      

     - ATTTGTAGAG AAAGATTCTC GTTTCAAATA TTTTGAGAAA GCAAACGGCG GT - #CTTTCATC   

      180                                                                      

     - AGCTCGTAAC CTAGGTATTG AATGTTCGGG GGGGGGCGTA CATTACTTTT GT - #AGACTC     

      238                                                                      

     - (2) INFORMATION FOR SEQ ID NO:168:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 79 amino                                            

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:168:                           

     - Asn Val Glu Asn Tyr Leu Arg Met Cys Leu As - #p Ser Ile Gln Asn Gln     

     #                15                                                       

     - Thr Tyr Gln Asn Phe Glu Cys Leu Leu Ile As - #n Asp Gly Ser Pro Asp     

     #            30                                                           

     - His Ser Ser Lys Ile Cys Glu Glu Phe Val Gl - #u Lys Asp Ser Arg Phe     

     #        45                                                               

     - Lys Tyr Phe Glu Lys Ala Asn Gly Gly Leu Se - #r Ser Ala Arg Asn Leu     

     #    60                                                                   

     - Gly Ile Glu Cys Ser Gly Gly Gly Val His Ty - #r Phe Cys Arg Leu         

     #75                                                                       

     - (2) INFORMATION FOR SEQ ID NO: 169:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 742 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #169: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - CTACTATCAA TCAAGTTCTT CAGCCATTGA GGCCACCATT GAGGGCAACA GC - #CAAACGAC   

       60                                                                      

     - CATCAGCCAG ACTAGCCACT TTATTCAGTC TTATATCAAA AAACTAGAAA CC - #ACCTCGAC   

      120                                                                      

     - TGGTTTGACC CAGCAGACGG ATGTTCTGGC CTATGCTGAG AATCCCAGTC AA - #GACAAGGT   

      180                                                                      

     - CGAGGGAATC CGAGATTTGT TTTTGACCAT CTTGAAGTCA GATAAGGACT TG - #AAAACTGT   

      240                                                                      

     - TGTGCTGGTG ACCAAATCTG GTCAGGTCAT TTCTACAGAT GACAGTGTGC AG - #ATGAAAAC   

      300                                                                      

     - TTCCTCTGAT ATGATGGCTG AGGATTGGTA CCAAAAGGCC ATTCATCAGG GA - #GCTATGCC   

      360                                                                      

     - TGTTTTGACT CCAGCTCGTA AATCAGATAG TCAGTGGGTC ATTTCTGTCA CT - #CAAGAACT   

      420                                                                      

     - TGTTGATGCA AAGGGAGCCA ATCTTGGTGT GCTTCGTTTG GATATTTCTT AT - #GAAACTCT   

      480                                                                      

     - GGAAGCCTAT CTCAATCAAC TCCAGTTGGG GCAGCAGGGC TTTGCCTTCA TT - #ATCAATGA   

      540                                                                      

     - AAACCATGAA TTTGTCTACC ATCCTCAACA CACAGTTTAT AGTTCGTCTA GC - #AAAATGGA   

      600                                                                      

     - GGCTATGAAA CCCTACATCG ATACAGGTCA GGGTTATACT CCTGGTCACA AA - #TCCTACGT   

      660                                                                      

     - CAGTCAAGAG AAGATTGCAG GAACTGATTG GACGGTGCTT GGCGTGTCAT CA - #TTGGAAAA   

      720                                                                      

     #                742GTC AG                                                

     - (2) INFORMATION FOR SEQ ID NO:170:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 247 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:170:                           

     - Tyr Tyr Gln Ser Ser Ser Ser Ala Ile Glu Al - #a Thr Ile Glu Gly Asn     

     #                15                                                       

     - Ser Gln Thr Thr Ile Ser Gln Thr Ser His Ph - #e Ile Gln Ser Tyr Ile     

     #            30                                                           

     - Lys Lys Leu Glu Thr Thr Ser Thr Gly Leu Th - #r Gln Gln Thr Asp Val     

     #        45                                                               

     - Leu Ala Tyr Ala Glu Asn Pro Ser Gln Asp Ly - #s Val Glu Gly Ile Arg     

     #    60                                                                   

     - Asp Leu Phe Leu Thr Ile Leu Lys Ser Asp Ly - #s Asp Leu Lys Thr Val     

     #80                                                                       

     - Val Leu Val Thr Lys Ser Gly Gln Val Ile Se - #r Thr Asp Asp Ser Val     

     #                95                                                       

     - Gln Met Lys Thr Ser Ser Asp Met Met Ala Gl - #u Asp Trp Tyr Gln Lys     

     #           110                                                           

     - Ala Ile His Gln Gly Ala Met Pro Val Leu Th - #r Pro Ala Arg Lys Ser     

     #       125                                                               

     - Asp Ser Gln Trp Val Ile Ser Val Thr Gln Gl - #u Leu Val Asp Ala Lys     

     #   140                                                                   

     - Gly Ala Asn Leu Gly Val Leu Arg Leu Asp Il - #e Ser Tyr Glu Thr Leu     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Glu Ala Tyr Leu Asn Gln Leu Gln Leu Gly Gl - #n Gln Gly Phe Ala Phe     

     #               175                                                       

     - Ile Ile Asn Glu Asn His Glu Phe Val Tyr Hi - #s Pro Gln His Thr Val     

     #           190                                                           

     - Tyr Ser Ser Ser Ser Lys Met Glu Ala Met Ly - #s Pro Tyr Ile Asp Thr     

     #       205                                                               

     - Gly Gln Gly Tyr Thr Pro Gly His Lys Ser Ty - #r Val Ser Gln Glu Lys     

     #   220                                                                   

     - Ile Ala Gly Thr Asp Trp Thr Val Leu Gly Va - #l Ser Ser Leu Glu Lys     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Leu Asp Gln Val Arg Ser Gln                                             

                     245                                                       

     - (2) INFORMATION FOR SEQ ID NO: 171:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1282 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #171: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - GACAAAAACA TTAAAACGTC CTGAGGTTTT ATCACCTGCA GGGACTTTAG AG - #AAGCTAAA   

       60                                                                      

     - GGTAGCTGTT CAGTATGGAG CAGATGCTGT CTTTATCGGT GGTCAGGCCT AT - #GGTCTTCG   

      120                                                                      

     - TAGCCGTGCG GGAAACTTTA CTTTCGAACA GATGGAAGAA GGCGTGCAGT TT - #GCGGCCAA   

      180                                                                      

     - GTATGGTGCC AAGGTCTATG TAGCGGCTAA TATGGTTATG CACGAAGGAA AT - #GAAGCTGG   

      240                                                                      

     - TGCTGGTGAG TGGTTCCGTA AACTGCGTGA TATCGGGATT GCAGCAGTTA TC - #GTATCTGA   

      300                                                                      

     - CCCAGCCTTG ATTATGATTG CAGTGACTGA AGCACCAGGC CTTGAAATCC AC - #CTTTCTAC   

      360                                                                      

     - CCAAGCCAGT GCCACTAACT ATGAAACCCT TGAGTTCTGG AAAGAGCTAG GC - #TTGACTCG   

      420                                                                      

     - TGTCGTTTTA GCGCGTGAGG TTTCAATGGA AGAATTAGCT GAGATCCGCA AA - #CGTACAGA   

      480                                                                      

     - TGTTGAAATT GAAGCCTTTG TCCATGGAGC TATGTGTATT TCATACTCTG GA - #CGTTGTAC   

      540                                                                      

     - TCTTTCAAAC CACATGAGTA TGCGTGATGC CAACCGTGGT GGATGTTCTC AG - #TCATGCCG   

      600                                                                      

     - TTGGAAATAC GACCTTTACG ATATGCCATT TGGGAAAGAA CGTAAGAGTT TG - #CAGGGTGA   

      660                                                                      

     - GATTCCAGAA GAATTTTCAA TGTCAGCCGT TGACATGTCT ATGATTGACC AN - #ATTCCAGA   

      720                                                                      

     - TATGATTGAA AATGGTGTGG ACAGTCTAAA AATCGAAGGA CGTATGNAGT CT - #ATTCACTA   

      780                                                                      

     - NGTATCAACA GTAACCAACT GCTACAAGGC GGCTGTGGAT GCCTATCTTG AA - #AGTCCTGA   

      840                                                                      

     - AAAGTTTGAA GCTATCAAAC AAGACTTGGT GGACGAGATG TGGAAGGTTG CC - #CAACGTGA   

      900                                                                      

     - ACTGGCTACA GGATTTTACT ATGGTACACC ATCTGAAAAT GAGCAGTTGT TT - #GGTGCTCG   

      960                                                                      

     - TCGTAAAATC CCTGAGTACA AGTTTGTCGC TGAAGTGGTT TCTTATGATG AT - #GCGGCACA   

     1020                                                                      

     - AACAGCAACT ATTCGTCAAC GAAACGTCAT TAACGAAGGG GACCAAGTTG AG - #TTTTATGG   

     1080                                                                      

     - TCCAGGTTTC CGTCATTTTG AAACCTATAT TGAAGATTTG CATGATGCTA AA - #GGCAATAA   

     1140                                                                      

     - AATCGACCGC GCTCCAAATC CAATGGAACT ATTGACTATT AAAGTCCCAC AA - #CCTGTTCA   

     1200                                                                      

     - ATCAGGAGAC ATGGTTCGAG CTCTTAAAGA GGGGCTTATC AATCTTTATA AG - #GAAGATGG   

     1260                                                                      

     #               1282GTG CT                                                

     - (2) INFORMATION FOR SEQ ID NO:172:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 427 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:172:                           

     - Thr Lys Thr Leu Lys Arg Pro Glu Val Leu Se - #r Pro Ala Gly Thr Leu     

     #                15                                                       

     - Glu Lys Leu Lys Val Ala Val Gln Tyr Gly Al - #a Asp Ala Val Phe Ile     

     #            30                                                           

     - Gly Gly Gln Ala Tyr Gly Leu Arg Ser Arg Al - #a Gly Asn Phe Thr Phe     

     #        45                                                               

     - Glu Gln Met Glu Glu Gly Val Gln Phe Ala Al - #a Lys Tyr Gly Ala Lys     

     #    60                                                                   

     - Val Tyr Val Ala Ala Asn Met Val Met His Gl - #u Gly Asn Glu Ala Gly     

     #80                                                                       

     - Ala Gly Glu Trp Phe Arg Lys Leu Arg Asp Il - #e Gly Ile Ala Ala Val     

     #                95                                                       

     - Ile Val Ser Asp Pro Ala Leu Ile Met Ile Al - #a Val Thr Glu Ala Pro     

     #           110                                                           

     - Gly Leu Glu Ile His Leu Ser Thr Gln Ala Se - #r Ala Thr Asn Tyr Glu     

     #       125                                                               

     - Thr Leu Glu Phe Trp Lys Glu Leu Gly Leu Th - #r Arg Val Val Leu Ala     

     #   140                                                                   

     - Arg Glu Val Ser Met Glu Glu Leu Ala Glu Il - #e Arg Lys Arg Thr Asp     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Val Glu Ile Glu Ala Phe Val His Gly Ala Me - #t Cys Ile Ser Tyr Ser     

     #               175                                                       

     - Gly Arg Cys Thr Leu Ser Asn His Met Ser Me - #t Arg Asp Ala Asn Arg     

     #           190                                                           

     - Gly Gly Cys Ser Gln Ser Cys Arg Trp Lys Ty - #r Asp Leu Tyr Asp Met     

     #       205                                                               

     - Pro Phe Gly Lys Glu Arg Lys Ser Leu Gln Gl - #y Glu Ile Pro Glu Glu     

     #   220                                                                   

     - Phe Ser Met Ser Ala Val Asp Met Ser Met Il - #e Asp Xaa Ile Pro Asp     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Met Ile Glu Asn Gly Val Asp Ser Leu Lys Il - #e Glu Gly Arg Met Xaa     

     #               255                                                       

     - Ser Ile His Xaa Val Ser Thr Val Thr Asn Cy - #s Tyr Lys Ala Ala Val     

     #           270                                                           

     - Asp Ala Tyr Leu Glu Ser Pro Glu Lys Phe Gl - #u Ala Ile Lys Gln Asp     

     #       285                                                               

     - Leu Val Asp Glu Met Trp Lys Val Ala Gln Ar - #g Glu Leu Ala Thr Gly     

     #   300                                                                   

     - Phe Tyr Tyr Gly Thr Pro Ser Glu Asn Glu Gl - #n Leu Phe Gly Ala Arg     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Arg Lys Ile Pro Glu Tyr Lys Phe Val Ala Gl - #u Val Val Ser Tyr Asp     

     #               335                                                       

     - Asp Ala Ala Gln Thr Ala Thr Ile Arg Gln Ar - #g Asn Val Ile Asn Glu     

     #           350                                                           

     - Gly Asp Gln Val Glu Phe Tyr Gly Pro Gly Ph - #e Arg His Phe Glu Thr     

     #       365                                                               

     - Tyr Ile Glu Asp Leu His Asp Ala Lys Gly As - #n Lys Ile Asp Arg Ala     

     #   380                                                                   

     - Pro Asn Pro Met Glu Leu Leu Thr Ile Lys Va - #l Pro Gln Pro Val Gln     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Ser Gly Asp Met Val Arg Ala Leu Lys Glu Gl - #y Leu Ile Asn Leu Tyr     

     #               415                                                       

     - Lys Glu Asp Gly Thr Ser Val Thr Val Arg Al - #a                         

     #           425                                                           

     - (2) INFORMATION FOR SEQ ID NO: 173:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 778 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #173: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TTCTCAGGAG ACCTTTAAAA ATATCACCAA TAGCTTCTCC ATGCAAATCA AT - #CGTCGCGT   

       60                                                                      

     - CAACCAAGGA ACGCCTCGTG GTGCTGGGAA TATCAAGGGT GAAGACATCA AA - #AAAATCAC   

      120                                                                      

     - CGAAAACAAG GCCATTGAGT CTTATGTCAA ACGTATCAAC GCTATCGGAG AT - #TTGACTGG   

      180                                                                      

     - ATATGACCTG ATTGAAACGC CAGAAACCAA GAAGAATCTC ACTGCTGATC GT - #GCCAAGCG   

      240                                                                      

     - TTTTGGAAGT AGCTTGATGA TTACAGGTGT CAATGACTCC TCTAAAGAAG AC - #AAGTTTGT   

      300                                                                      

     - CTCTGGTTCT TATAAACTAG TCGAAGGAGA GCACTTAACC AACGACGACA AG - #GATAAAAT   

      360                                                                      

     - CCTCTTGCAC AAGGACTTGG CAGCCAAACA CGGCTGGAAA GTAGGGGACA AG - #GTTAAACT   

      420                                                                      

     - GGACTCTAAT ATCTACGATG CAGATAATGA AAAAGGAGCC AAGGAAACAG TT - #GAAGTGAC   

      480                                                                      

     - AATCAAGGGA CTCTTTGATG GTCATAATAA GTCAGCAGTA ACCTACTCAC AA - #GAACTTTA   

      540                                                                      

     - CGAAAACACA GCTATTACAG ACATTCACAC TGCTGCAAAA CTTTATGGAT AC - #ACAGAAGA   

      600                                                                      

     - CACAGCCATT TATGGGGACG CAACCTTCTT TGTAACAGCA GACAAGAACT TG - #GATGATGT   

      660                                                                      

     - TATGAAAGAG TTGAATGGCA TCAGTGGTAT CAACTGGAAG AGCTACACAC TC - #GTCAAGAG   

      720                                                                      

     - CTCCTCTAAC TACCCAGCTC TTGAGCAATC TATCTCTGGT ATGTACAAGA TG - #GCCAAC     

      778                                                                      

     - (2) INFORMATION FOR SEQ ID NO:174:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 259 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:174:                           

     - Ser Gln Glu Thr Phe Lys Asn Ile Thr Asn Se - #r Phe Ser Met Gln Ile     

     #                15                                                       

     - Asn Arg Arg Val Asn Gln Gly Thr Pro Arg Gl - #y Ala Gly Asn Ile Lys     

     #            30                                                           

     - Gly Glu Asp Ile Lys Lys Ile Thr Glu Asn Ly - #s Ala Ile Glu Ser Tyr     

     #        45                                                               

     - Val Lys Arg Ile Asn Ala Ile Gly Asp Leu Th - #r Gly Tyr Asp Leu Ile     

     #    60                                                                   

     - Glu Thr Pro Glu Thr Lys Lys Asn Leu Thr Al - #a Asp Arg Ala Lys Arg     

     #80                                                                       

     - Phe Gly Ser Ser Leu Met Ile Thr Gly Val As - #n Asp Ser Ser Lys Glu     

     #                95                                                       

     - Asp Lys Phe Val Ser Gly Ser Tyr Lys Leu Va - #l Glu Gly Glu His Leu     

     #           110                                                           

     - Thr Asn Asp Asp Lys Asp Lys Ile Leu Leu Hi - #s Lys Asp Leu Ala Ala     

     #       125                                                               

     - Lys His Gly Trp Lys Val Gly Asp Lys Val Ly - #s Leu Asp Ser Asn Ile     

     #   140                                                                   

     - Tyr Asp Ala Asp Asn Glu Lys Gly Ala Lys Gl - #u Thr Val Glu Val Thr     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Ile Lys Gly Leu Phe Asp Gly His Asn Lys Se - #r Ala Val Thr Tyr Ser     

     #               175                                                       

     - Gln Glu Leu Tyr Glu Asn Thr Ala Ile Thr As - #p Ile His Thr Ala Ala     

     #           190                                                           

     - Lys Leu Tyr Gly Tyr Thr Glu Asp Thr Ala Il - #e Tyr Gly Asp Ala Thr     

     #       205                                                               

     - Phe Phe Val Thr Ala Asp Lys Asn Leu Asp As - #p Val Met Lys Glu Leu     

     #   220                                                                   

     - Asn Gly Ile Ser Gly Ile Asn Trp Lys Ser Ty - #r Thr Leu Val Lys Ser     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Ser Ser Asn Tyr Pro Ala Leu Glu Gln Ser Il - #e Ser Gly Met Tyr Lys     

     #               255                                                       

     - Met Ala Asn                                                             

     - (2) INFORMATION FOR SEQ ID NO: 175:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 694 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #175: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - AGTAAATGCG CAATCAAATT CATTAATATT AATAGATGAA CCTGAAATCT CA - #CTTCATCC   

       60                                                                      

     - GAGTGCAATC TATAAATTTA AAGAGTTTTT ACTTCAAGAG TGTTTAAATA AA - #AAACATCA   

      120                                                                      

     - AATTATTATC ACTACACATT CTACACAACT TATAAAAGAT TTTCCTAGAG AA - #GCCGTGAA   

      180                                                                      

     - ACTTTTAGTG AAAAACGGAG AAAAGGTAGA TGTTATTGAA AATATTGATT AT - #CAGGATGC   

      240                                                                      

     - ATTTTTTGAA TTAGGTGATG TGTATCATTC TAGGAAGATG ATTTATGTTG AA - #GATAGACT   

      300                                                                      

     - AGCTAAATAT ATTCTAGAGT TTGTTATCAC TCATTCAGGT AGTGAGAATC TT - #AAACAGAA   

      360                                                                      

     - TTTAGTAGTG AGATATATTC CTGGTGGAGC AAATCAAATA ATTTGTAATA AT - #ATTTTAAA   

      420                                                                      

     - CTCATCGTAT TTAGATTCCG ATAACCATTA TTTTTGGCTT GATGGAGATC AA - #AACACTAA   

      480                                                                      

     - TGTTAGTGAA TCAAATAATT TAATGAACTA TCTTGAAAAT GGTGTTGTTA TA - #TCAGATAA   

      540                                                                      

     - AATTCCTGAA TCAGATAATA AAAATCTTGA TGATATTATA AAATTGATAA NG - #GGATGTCC   

      600                                                                      

     - AATTAAATTT AATGTTTCAG GTAATAAAGG GCAAAAAAAT AATATTGAAT TA - #ATTGCGAA   

      660                                                                      

     #       694        GATT ATTGGGCTAA ATAC                                   

     - (2) INFORMATION FOR SEQ ID NO:176:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 231 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:176:                           

     - Val Asn Ala Gln Ser Asn Ser Leu Ile Leu Il - #e Asp Glu Pro Glu Ile     

     #                15                                                       

     - Ser Leu His Pro Ser Ala Ile Tyr Lys Phe Ly - #s Glu Phe Leu Leu Gln     

     #            30                                                           

     - Glu Cys Leu Asn Lys Lys His Gln Ile Ile Il - #e Thr Thr His Ser Thr     

     #        45                                                               

     - Gln Leu Ile Lys Asp Phe Pro Arg Glu Ala Va - #l Lys Leu Leu Val Lys     

     #    60                                                                   

     - Asn Gly Glu Lys Val Asp Val Ile Glu Asn Il - #e Asp Tyr Gln Asp Ala     

     #80                                                                       

     - Phe Phe Glu Leu Gly Asp Val Tyr His Ser Ar - #g Lys Met Ile Tyr Val     

     #                95                                                       

     - Glu Asp Arg Leu Ala Lys Tyr Ile Leu Glu Ph - #e Val Ile Thr His Ser     

     #           110                                                           

     - Gly Ser Glu Asn Leu Lys Gln Asn Leu Val Va - #l Arg Tyr Ile Pro Gly     

     #       125                                                               

     - Gly Ala Asn Gln Ile Ile Cys Asn Asn Ile Le - #u Asn Ser Ser Tyr Leu     

     #   140                                                                   

     - Asp Ser Asp Asn His Tyr Phe Trp Leu Asp Gl - #y Asp Gln Asn Thr Asn     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Val Ser Glu Ser Asn Asn Leu Met Asn Tyr Le - #u Glu Asn Gly Val Val     

     #               175                                                       

     - Ile Ser Asp Lys Ile Pro Glu Ser Asp Asn Ly - #s Asn Leu Asp Asp Ile     

     #           190                                                           

     - Ile Lys Leu Ile Xaa Gly Cys Pro Ile Lys Ph - #e Asn Val Ser Gly Asn     

     #       205                                                               

     - Lys Gly Gln Lys Asn Asn Ile Glu Leu Ile Al - #a Lys Gln Arg Ser Phe     

     #   220                                                                   

     - Ile Asp Tyr Trp Ala Lys Tyr                                             

     225                 2 - #30                                               

     - (2) INFORMATION FOR SEQ ID NO: 177:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 550 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #177: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TTACCGCGTT CATCAAGATG TCAAACAAGT CATGACCTAT CAACCCATGG TG - #CGAGAAAT   

       60                                                                      

     - ATTGAGTGAA CAAGACACCC CAGCAAACGA AGAGCTTGTG CTTGCTATGA TT - #TATACTGA   

      120                                                                      

     - AACAAAAGGA AAAGAAGGCG ATGTTATGCA GTCTAGTGAG TCTGCAAGTG GT - #TCCACCAA   

      180                                                                      

     - CACCATCAAT GATAATGCCT CTAGCATTCG GCAAGGCATT CAAACTCTGA CA - #GGCAATCT   

      240                                                                      

     - CTATCTGGCG CAGAAGAAGG GGGTAGATAT CTGGACAGCT GTTCAAGCCT AT - #AATTTTGG   

      300                                                                      

     - ACCTGCCTAT ATCGATTTTA TCGCCCAAAA TGGCAAGGAA AATACCCTGG CT - #CTAGCCAA   

      360                                                                      

     - ACAGTACTCT CGTGAGACTG TTGCCCCCTT GCTTGGTAAT AGGACTGGAA AG - #ACTTATAG   

      420                                                                      

     - TTATATTCAC CCCATTTCCA TTTTTCACGG TGCTGAACTC TATGTAAATG GA - #GGAAACTA   

      480                                                                      

     - TTATTATTCT AGACAGGTAC GACTTAACCT TTACATCATC AAATGTTTCA CT - #CTCTTTTC   

      540                                                                      

     #       550                                                               

     - (2) INFORMATION FOR SEQ ID NO:178:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 183 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:178:                           

     - Tyr Arg Val His Gln Asp Val Lys Gln Val Me - #t Thr Tyr Gln Pro Met     

     #                15                                                       

     - Val Arg Glu Ile Leu Ser Glu Gln Asp Thr Pr - #o Ala Asn Glu Glu Leu     

     #            30                                                           

     - Val Leu Ala Met Ile Tyr Thr Glu Thr Lys Gl - #y Lys Glu Gly Asp Val     

     #        45                                                               

     - Met Gln Ser Ser Glu Ser Ala Ser Gly Ser Th - #r Asn Thr Ile Asn Asp     

     #    60                                                                   

     - Asn Ala Ser Ser Ile Arg Gln Gly Ile Gln Th - #r Leu Thr Gly Asn Leu     

     #80                                                                       

     - Tyr Leu Ala Gln Lys Lys Gly Val Asp Ile Tr - #p Thr Ala Val Gln Ala     

     #                95                                                       

     - Tyr Asn Phe Gly Pro Ala Tyr Ile Asp Phe Il - #e Ala Gln Asn Gly Lys     

     #           110                                                           

     - Glu Asn Thr Leu Ala Leu Ala Lys Gln Tyr Se - #r Arg Glu Thr Val Ala     

     #       125                                                               

     - Pro Leu Leu Gly Asn Arg Thr Gly Lys Thr Ty - #r Ser Tyr Ile His Pro     

     #   140                                                                   

     - Ile Ser Ile Phe His Gly Ala Glu Leu Tyr Va - #l Asn Gly Gly Asn Tyr     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Tyr Tyr Ser Arg Gln Val Arg Leu Asn Leu Ty - #r Ile Ile Lys Cys Phe     

     #               175                                                       

     - Thr Leu Phe Ser Thr Ser Gly                                             

                 180                                                           

     - (2) INFORMATION FOR SEQ ID NO: 179:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 334 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #179: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - GTGGATGGGC TTTAACTATC TTCGTATTCG CCGTGCGGCT AAAATTGTGG AC - #AATGAGGA   

       60                                                                      

     - GTTTGAAGCC TTGATTCGTA CGGGTCAATT GATTGATTTG CGCGACCCAG CA - #GAATTCCA   

      120                                                                      

     - CAGAAAACAT ATCCTTGGTG CACGCAATAT TCCTTCAAGT CAGTTGAAAA CT - #AGTCTTGC   

      180                                                                      

     - AGCCCTTCGT AAAGATAAAC CTGTCCTTCT CTACGAAAAC CAACGTGCGC AA - #CGAGTTAC   

      240                                                                      

     - AAATGCAGCT CTTTACTTGA AAAAACAAGG TTTTTCTGAG ATTTATATCC TT - #TCTTATGG   

      300                                                                      

     #       334        GGGA AAGTGAAGAC TAGC                                   

     - (2) INFORMATION FOR SEQ ID NO:180:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 111 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:180:                           

     - Trp Met Gly Phe Asn Tyr Leu Arg Ile Arg Ar - #g Ala Ala Lys Ile Val     

     #                15                                                       

     - Asp Asn Glu Glu Phe Glu Ala Leu Ile Arg Th - #r Gly Gln Leu Ile Asp     

     #            30                                                           

     - Leu Arg Asp Pro Ala Glu Phe His Arg Lys Hi - #s Ile Leu Gly Ala Arg     

     #        45                                                               

     - Asn Ile Pro Ser Ser Gln Leu Lys Thr Ser Le - #u Ala Ala Leu Arg Lys     

     #    60                                                                   

     - Asp Lys Pro Val Leu Leu Tyr Glu Asn Gln Ar - #g Ala Gln Arg Val Thr     

     #80                                                                       

     - Asn Ala Ala Leu Tyr Leu Lys Lys Gln Gly Ph - #e Ser Glu Ile Tyr Ile     

     #                95                                                       

     - Leu Ser Tyr Gly Leu Asp Ser Trp Lys Gly Ly - #s Val Lys Thr Ser         

     #           110                                                           

     - (2) INFORMATION FOR SEQ ID NO: 181:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1342 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #181: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - ACTAAACCAG CATCGTTCGC AGGAAAATAA GGACAATAAT CGTGTCTCTT AT - #GTGGATGG   

       60                                                                      

     - CAGCCAGTCA AGTCAGAAAA GTGAAAACTT GACACCAGAC CAGGTTAGCC AG - #AAAGAAGG   

      120                                                                      

     - AATTCAGGCT GAGCAAATTG TAATCAAAAT TACAGATCAG GGCTATGTAA CG - #TCACACGG   

      180                                                                      

     - TGACCACTAT CATTACTATA ATGGGAAAGT TCCTTATGAT GCCCTCTTTA GT - #GAAGAACT   

      240                                                                      

     - CTTGATGAAG GATCCAAACT ATCAACTTAA AGACGCTGAT ATTGTCAATG AA - #GTCAAGGG   

      300                                                                      

     - TGGTTATATC ATCAAGGTCG ATGGAAAATA TTATGTCTAC CTGAAAGATG CA - #GCTCATGC   

      360                                                                      

     - TGATAATGTT CGAACTAAAG ATGAAATCAA TCGTCAAAAA CAAGAACATG TC - #AAAGATAA   

      420                                                                      

     - TGAGAAGGTT AACTCTAATG TTGCTGTAGC AAGGTCTCAG GGACGATATA CG - #ACAAATGA   

      480                                                                      

     - TGGTTATGTC TTTAATCCAG CTGATATTAT CGAAGATACG GGTAATGCTT AT - #ATCGTTCC   

      540                                                                      

     - TCATGGAGGT CACTATCACT ACATTCCCAA AAGCGATTTA TCTGCTAGTG AA - #TTAGCAGC   

      600                                                                      

     - AGCTAAAGCA CATCTGGCTG GAAAAAATAT GCAACCGAGT CAGTTAAGCT AT - #TCTTCAAC   

      660                                                                      

     - AGCTAGTGAC AATAACACGC AATCTGTAGC AAAAGGATCA ACTAGCAAGC CA - #GCAAATAA   

      720                                                                      

     - ATCTGAAAAT CTCCAGAGTC TTTTGAAGGA ACTCTATGAT TCACCTAGCG CC - #CAACGTTA   

      780                                                                      

     - CAGTGAATCA GATGGCCTGG TCTTTGACCC TGCTAAGATT ATCAGTCGTA CA - #CCAAATGG   

      840                                                                      

     - AGTTGCGATT CCGCATGGCG ACCATTACCA CTTTATTCCT TACAGCAAGC TT - #TCTGCCTT   

      900                                                                      

     - AGAAGAAAAG ATTGCCAGAA TGGTGCCTAT CAGTGGAACT GGTTCTACAG TT - #TCTACAAA   

      960                                                                      

     - TGCAAAACCT AATGAAGTAG TGTCTAGTCT AGGCAGTCTT TCAAGCAATC CT - #TCTTCTTT   

     1020                                                                      

     - AACGACAAGT AAGGAGCTCT CTTCAGCATC TGATGGTTAT ATTTTTAATC CA - #AAAGATAT   

     1080                                                                      

     - CGTTGAAGAA ACGGCTACAG CTTATATTGT AAGACATGGT GATCATTTCC AT - #TACATTCC   

     1140                                                                      

     - AAAATCAAAT CAAATTGGGC AACCGACTCT TCCAAACAAT AGTCTAGCAA CA - #CCTTCTCC   

     1200                                                                      

     - ATCTCTTCCA ATCAATCCAG GAACTTCACA TGAGAAACAT GAAGAAGATG GA - #TACGGATT   

     1260                                                                      

     - TGATGCTAAT CGTATTATCG CTGAAGATGA ATCAGGTTTT GTCATGAGTC AC - #GGAGACCA   

     1320                                                                      

     #               1342AGA AG                                                

     - (2) INFORMATION FOR SEQ ID NO:182:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 447 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:182:                           

     - Leu Asn Gln His Arg Ser Gln Glu Asn Lys As - #p Asn Asn Arg Val Ser     

     #                15                                                       

     - Tyr Val Asp Gly Ser Gln Ser Ser Gln Lys Se - #r Glu Asn Leu Thr Pro     

     #            30                                                           

     - Asp Gln Val Ser Gln Lys Glu Gly Ile Gln Al - #a Glu Gln Ile Val Ile     

     #        45                                                               

     - Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser Hi - #s Gly Asp His Tyr His     

     #    60                                                                   

     - Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Le - #u Phe Ser Glu Glu Leu     

     #80                                                                       

     - Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys As - #p Ala Asp Ile Val Asn     

     #                95                                                       

     - Glu Val Lys Gly Gly Tyr Ile Ile Lys Val As - #p Gly Lys Tyr Tyr Val     

     #           110                                                           

     - Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Va - #l Arg Thr Lys Asp Glu     

     #       125                                                               

     - Ile Asn Arg Gln Lys Gln Glu His Val Lys As - #p Asn Glu Lys Val Asn     

     #   140                                                                   

     - Ser Asn Val Ala Val Ala Arg Ser Gln Gly Ar - #g Tyr Thr Thr Asn Asp     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Gly Tyr Val Phe Asn Pro Ala Asp Ile Ile Gl - #u Asp Thr Gly Asn Ala     

     #               175                                                       

     - Tyr Ile Val Pro His Gly Gly His Tyr His Ty - #r Ile Pro Lys Ser Asp     

     #           190                                                           

     - Leu Ser Ala Ser Glu Leu Ala Ala Ala Lys Al - #a His Leu Ala Gly Lys     

     #       205                                                               

     - Asn Met Gln Pro Ser Gln Leu Ser Tyr Ser Se - #r Thr Ala Ser Asp Asn     

     #   220                                                                   

     - Asn Thr Gln Ser Val Ala Lys Gly Ser Thr Se - #r Lys Pro Ala Asn Lys     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Ser Glu Asn Leu Gln Ser Leu Leu Lys Glu Le - #u Tyr Asp Ser Pro Ser     

     #               255                                                       

     - Ala Gln Arg Tyr Ser Glu Ser Asp Gly Leu Va - #l Phe Asp Pro Ala Lys     

     #           270                                                           

     - Ile Ile Ser Arg Thr Pro Asn Gly Val Ala Il - #e Pro His Gly Asp His     

     #       285                                                               

     - Tyr His Phe Ile Pro Tyr Ser Lys Leu Ser Al - #a Leu Glu Glu Lys Ile     

     #   300                                                                   

     - Ala Arg Met Val Pro Ile Ser Gly Thr Gly Se - #r Thr Val Ser Thr Asn     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Ala Lys Pro Asn Glu Val Val Ser Ser Leu Gl - #y Ser Leu Ser Ser Asn     

     #               335                                                       

     - Pro Ser Ser Leu Thr Thr Ser Lys Glu Leu Se - #r Ser Ala Ser Asp Gly     

     #           350                                                           

     - Tyr Ile Phe Asn Pro Lys Asp Ile Val Glu Gl - #u Thr Ala Thr Ala Tyr     

     #       365                                                               

     - Ile Val Arg His Gly Asp His Phe His Tyr Il - #e Pro Lys Ser Asn Gln     

     #   380                                                                   

     - Ile Gly Gln Pro Thr Leu Pro Asn Asn Ser Le - #u Ala Thr Pro Ser Pro     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Ser Leu Pro Ile Asn Pro Gly Thr Ser His Gl - #u Lys His Glu Glu Asp     

     #               415                                                       

     - Gly Tyr Gly Phe Asp Ala Asn Arg Ile Ile Al - #a Glu Asp Glu Ser Gly     

     #           430                                                           

     - Phe Val Met Ser His Gly Asp His Asn His Ty - #r Phe Phe Lys Lys         

     #       445                                                               

     - (2) INFORMATION FOR SEQ ID NO: 183:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 934 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #183: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TGACTACCTT GAAATCCCAC TTTACAGCTA TCTTGGTGGA TTCAACACTA AA - #GTTCTTCC   

       60                                                                      

     - AACTCCAATG ATGAACATCA TCAACGGTGG TTCTCACTCT GACGCTCCAA TC - #GCTTTCCA   

      120                                                                      

     - AGAGTTCATG ATCTTGCCAG TTGGTGCGCC AACATTTAAA GAAGCCCTTC GT - #TACGGTGC   

      180                                                                      

     - TGAAATCTTC CACGCTCTTA AGAAAATCCT TAAATCACGT GGTTTGGAAA CT - #GCCGTAGG   

      240                                                                      

     - TGACGAAGGT GGATTCGCTC CTCGTTTCGA AGGAACTGAA GATGGTGTTG AA - #ACTATCCT   

      300                                                                      

     - TGCTGCGATT GAAGCTGCTG GATATGTACC AGGTAAAGAC GTATTTATCG GA - #TTTGACTG   

      360                                                                      

     - TGCTTCATCA GAATTCTACG ATAAAGAACG TAAAGTTTAC GACTACACTA AA - #TTTGAAGG   

      420                                                                      

     - TGAAGGTGCT GCTGTTCGTA CATCTGCAGA ACAAATCGAC TACCTTGAAG AA - #TTGGTTAA   

      480                                                                      

     - CAAATACCCA ATCATCACTA TTGAAGATGG TATGGATGAA AACGACTGGG AT - #GGTTGGAA   

      540                                                                      

     - AGCTCTTACT GAACGTCTTG GTAAGAAAGT ACAACTTGTT GGTGACGACT TC - #TTCGTAAC   

      600                                                                      

     - AAACACTGAC TACCTTGCAC GTGGTATCCA AGAAGGTGCT GCTAACTCAA TC - #CTTATCAA   

      660                                                                      

     - AGTTAACCAA ATCGGTACTC TTACTGAAAC TTTTGAAGCT ATCGAAATGG CT - #AAAGAAGC   

      720                                                                      

     - TGGTTACACT GCTGTTGTAT CACACCGTTC AGGTGAAACT GAAGATTCAA CA - #ATCGCTGA   

      780                                                                      

     - TATTGCAGTT GCAACTAACG CAGGACAAAT CAAGACTGGT TCACTTTCAC GT - #ACAGACCG   

      840                                                                      

     - CATCGCTAAA TACAACCAAT TGCTTCGTAT CGAAGACCAA CTTGGTGAAG TA - #GCTGAATA   

      900                                                                      

     #       934        TTCT ACAACCTTAA AAAA                                   

     - (2) INFORMATION FOR SEQ ID NO:184:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 311 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:184:                           

     - Asp Tyr Leu Glu Ile Pro Leu Tyr Ser Tyr Le - #u Gly Gly Phe Asn Thr     

     #                15                                                       

     - Lys Val Leu Pro Thr Pro Met Met Asn Ile Il - #e Asn Gly Gly Ser His     

     #            30                                                           

     - Ser Asp Ala Pro Ile Ala Phe Gln Glu Phe Me - #t Ile Leu Pro Val Gly     

     #        45                                                               

     - Ala Pro Thr Phe Lys Glu Ala Leu Arg Tyr Gl - #y Ala Glu Ile Phe His     

     #    60                                                                   

     - Ala Leu Lys Lys Ile Leu Lys Ser Arg Gly Le - #u Glu Thr Ala Val Gly     

     #80                                                                       

     - Asp Glu Gly Gly Phe Ala Pro Arg Phe Glu Gl - #y Thr Glu Asp Gly Val     

     #                95                                                       

     - Glu Thr Ile Leu Ala Ala Ile Glu Ala Ala Gl - #y Tyr Val Pro Gly Lys     

     #           110                                                           

     - Asp Val Phe Ile Gly Phe Asp Cys Ala Ser Se - #r Glu Phe Tyr Asp Lys     

     #       125                                                               

     - Glu Arg Lys Val Tyr Asp Tyr Thr Lys Phe Gl - #u Gly Glu Gly Ala Ala     

     #   140                                                                   

     - Val Arg Thr Ser Ala Glu Gln Ile Asp Tyr Le - #u Glu Glu Leu Val Asn     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Lys Tyr Pro Ile Ile Thr Ile Glu Asp Gly Me - #t Asp Glu Asn Asp Trp     

     #               175                                                       

     - Asp Gly Trp Lys Ala Leu Thr Glu Arg Leu Gl - #y Lys Lys Val Gln Leu     

     #           190                                                           

     - Val Gly Asp Asp Phe Phe Val Thr Asn Thr As - #p Tyr Leu Ala Arg Gly     

     #       205                                                               

     - Ile Gln Glu Gly Ala Ala Asn Ser Ile Leu Il - #e Lys Val Asn Gln Ile     

     #   220                                                                   

     - Gly Thr Leu Thr Glu Thr Phe Glu Ala Ile Gl - #u Met Ala Lys Glu Ala     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Gly Tyr Thr Ala Val Val Ser His Arg Ser Gl - #y Glu Thr Glu Asp Ser     

     #               255                                                       

     - Thr Ile Ala Asp Ile Ala Val Ala Thr Asn Al - #a Gly Gln Ile Lys Thr     

     #           270                                                           

     - Gly Ser Leu Ser Arg Thr Asp Arg Ile Ala Ly - #s Tyr Asn Gln Leu Leu     

     #       285                                                               

     - Arg Ile Glu Asp Gln Leu Gly Glu Val Ala Gl - #u Tyr Arg Gly Leu Lys     

     #   300                                                                   

     - Ser Phe Tyr Asn Leu Lys Lys                                             

     305                 3 - #10                                               

     - (2) INFORMATION FOR SEQ ID NO: 185:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 541 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #185: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TCGTATCTTT TTTTGGAGCA ATGTTCGCGT AGAAGGACAT TCCATGGATC CG - #ACCCTAGC   

       60                                                                      

     - GGATGGCGAA ATTCTCTTCG TTGTAAAACA CCTTCCTATT GACCGTTTTG AT - #ATCGTGGT   

      120                                                                      

     - GGCCCATGAG GAAGATGGCA ATAAGGACAT CGTCAAGCGC GTGATTGGAA TG - #CCTGGCGA   

      180                                                                      

     - CACCATTCGT TACGAAAATG ATAAACTCTA CATCAATGAC AAAGAAACGG AC - #GAGCCTTA   

      240                                                                      

     - TCTAGCAGAC TATATCAAAC GCTTCAAGGA TGACAAACTC CAAAGCACTT AC - #TCAGGCAA   

      300                                                                      

     - GGGCTTTGAA GGAAATAAAG GAACTTTCTT TAGAAGTATC GCTCAAAAAG CT - #CAAGCCTT   

      360                                                                      

     - CACAGTTGAT GTCAACTACA ACACCAACTT TAGCTTTACT GTTCCAGAAG GA - #GAATACCT   

      420                                                                      

     - TCTCCTCGGA GATGACCGCT TGGTTTCGAG CGACAGCCGC CACGTAGGTA CC - #TTCAAAGC   

      480                                                                      

     - AAAAGATATC ACAGGGGAAG CTAAATTCCG CTTATGGCCA ATCACCCGTA TC - #GGAACATT   

      540                                                                      

     #              541                                                        

     - (2) INFORMATION FOR SEQ ID NO:186:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 180 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:186:                           

     - Arg Ile Phe Phe Trp Ser Asn Val Arg Val Gl - #u Gly His Ser Met Asp     

     #                15                                                       

     - Pro Thr Leu Ala Asp Gly Glu Ile Leu Phe Va - #l Val Lys His Leu Pro     

     #            30                                                           

     - Ile Asp Arg Phe Asp Ile Val Val Ala His Gl - #u Glu Asp Gly Asn Lys     

     #        45                                                               

     - Asp Ile Val Lys Arg Val Ile Gly Met Pro Gl - #y Asp Thr Ile Arg Tyr     

     #    60                                                                   

     - Glu Asn Asp Lys Leu Tyr Ile Asn Asp Lys Gl - #u Thr Asp Glu Pro Tyr     

     #80                                                                       

     - Leu Ala Asp Tyr Ile Lys Arg Phe Lys Asp As - #p Lys Leu Gln Ser Thr     

     #                95                                                       

     - Tyr Ser Gly Lys Gly Phe Glu Gly Asn Lys Gl - #y Thr Phe Phe Arg Ser     

     #           110                                                           

     - Ile Ala Gln Lys Ala Gln Ala Phe Thr Val As - #p Val Asn Tyr Asn Thr     

     #       125                                                               

     - Asn Phe Ser Phe Thr Val Pro Glu Gly Glu Ty - #r Leu Leu Leu Gly Asp     

     #   140                                                                   

     - Asp Arg Leu Val Ser Ser Asp Ser Arg His Va - #l Gly Thr Phe Lys Ala     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Lys Asp Ile Thr Gly Glu Ala Lys Phe Arg Le - #u Trp Pro Ile Thr Arg     

     #               175                                                       

     - Ile Gly Thr Phe                                                         

                 180                                                           

     - (2) INFORMATION FOR SEQ ID NO: 187:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 400 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #187: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - GGACTCTCTC AAAGATGTGA AAGCAAATGC TAGCGACAGC AAGCCTGCAC AG - #GACAAGAA   

       60                                                                      

     - GGATGCAAAA CAAGGAACGG AAGATAGTAA GGATTCAGAT AAGATGACTG AA - #ACAAACTC   

      120                                                                      

     - AGTTCCGGCA GGAGTGATTG TGGTCAGTCT ACTTGCCCTC CTAGGCGTGA TT - #GCCTTCTG   

      180                                                                      

     - GCTGATTCGC CGTAAGAAAG AGTCAGAAAT CCAGCAATTA AGCACGGAAT TG - #ATCAAGGT   

      240                                                                      

     - TCTAGGACAG CTAGATGCAG AAAAAGCGGA TAAAAAAGTC CTTGCCAAAG CC - #CAAAACCT   

      300                                                                      

     - TCTCCAAGAA ACCCTTGATT TCGTGAAAGA AGAAAATGGC TCAGCAGAGA CA - #GAAACTAA   

      360                                                                      

     #   400            AAAG CAATCCTTGA CAAACTCAAG                             

     - (2) INFORMATION FOR SEQ ID NO:188:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 133 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:188:                           

     - Asp Ser Leu Lys Asp Val Lys Ala Asn Ala Se - #r Asp Ser Lys Pro Ala     

     #                15                                                       

     - Gln Asp Lys Lys Asp Ala Lys Gln Gly Thr Gl - #u Asp Ser Lys Asp Ser     

     #            30                                                           

     - Asp Lys Met Thr Glu Thr Asn Ser Val Pro Al - #a Gly Val Ile Val Val     

     #        45                                                               

     - Ser Leu Leu Ala Leu Leu Gly Val Ile Ala Ph - #e Trp Leu Ile Arg Arg     

     #    60                                                                   

     - Lys Lys Glu Ser Glu Ile Gln Gln Leu Ser Th - #r Glu Leu Ile Lys Val     

     #80                                                                       

     - Leu Gly Gln Leu Asp Ala Glu Lys Ala Asp Ly - #s Lys Val Leu Ala Lys     

     #                95                                                       

     - Ala Gln Asn Leu Leu Gln Glu Thr Leu Asp Ph - #e Val Lys Glu Glu Asn     

     #           110                                                           

     - Gly Ser Ala Glu Thr Glu Thr Lys Leu Val Gl - #u Glu Leu Lys Ala Ile     

     #       125                                                               

     - Leu Asp Lys Leu Lys                                                     

         130                                                                   

     - (2) INFORMATION FOR SEQ ID NO: 189:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1201 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #189: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - CAAGAAATCC TATCATCTCT TCCAGAAGCA AACAGAGACG AGGGGAATTC AG - #ACTCAGTT   

       60                                                                      

     - GATTGAAGAA TCGCTTAGTC AGCAGACTAT AATCCAGTCC TTCAATGCTC AA - #ACAGAATT   

      120                                                                      

     - TATCCAAAGA TTGCGTGAGG CTCATGACAA CTACTCAGGC TATTCTCAGT CA - #GCCATCTT   

      180                                                                      

     - TTATTCTTCA ACGGTCAATC CTTCGACTCG CTTTGTAAAT GCACTCATTT AT - #GCCCTTTT   

      240                                                                      

     - AGCTGGAGTA GGAGCTTATC GTATCATGAT GGGTTCAGCC TTGACCGTCG GT - #CGTTTAGT   

      300                                                                      

     - GACTTTTTTG AACTATGTTC AGCAATACAC CAAGCCCTTT AACGATATTT CT - #TCAGTGCT   

      360                                                                      

     - AGCTGAGTTG CAAAGTGCTC TGGCTTGCGT AGAGCGTATC TATGGAGTCT TA - #GATAGCCC   

      420                                                                      

     - TGAAGTGGCT GAAACAGGTA AGGAAGTCTT GACGACCAGT GACCAAGTTA AG - #GGAGCTAT   

      480                                                                      

     - TTCCTTTAAA CATGTCTCTT TTGGCTACCA TCCTGAAAAA ATTTTGATTA AG - #GACTTGTC   

      540                                                                      

     - TATCGATATT CCAGCTGGTA GTAAGGTAGC CATCGTTGGT CCGACAGGTG CT - #GGAAAATC   

      600                                                                      

     - AACTCTTATC AATCTCCTTA TGCGTTTTTA TCCCATTAGC TCGGGAGATA TC - #TTGCTGGA   

      660                                                                      

     - TGGGCAATCC ATTTATGATT ATACACGAGT ATCATTGAGA CAGCAGTTTG GT - #ATGGTGCT   

      720                                                                      

     - TCAAGAAACC TGGCTCACAC AAGGGACCAT TCATGATAAT ATTGCCTTTG GC - #AATCCTGA   

      780                                                                      

     - AGCCAGTCGA GAGCAAGTAA TTGCTGCTGC CAAAGCAGCT AATGCAGACT TT - #TTCATCCA   

      840                                                                      

     - ACAGTTGCCA CAGGGATACG ATACCAAGTT GGAAAATGCT GGAGAATCTC TC - #TCTGTCGG   

      900                                                                      

     - CCAAGCTCAG CTCTTGACCA TAGCCCGAGT CTTTCTGGCT ATTCCAAAGA TT - #CTTATCTT   

      960                                                                      

     - AGACGAGGCA ACTTCTTCCA TTGATACACG GACAGAAGTG CTGGTACAGG AT - #GCCTTTGC   

     1020                                                                      

     - AAAACTCATG AAGGGCCGCA CAAGTTTCAT CATTGCTCAC CGTTTGTCAA CC - #ATTCAGGA   

     1080                                                                      

     - TGCGGATTTA ATTCTTGTCT TAGTAGATGG TGATATTGTT GAATATGGTA AC - #CATCAAGA   

     1140                                                                      

     - ACTCATGGAT AGAAAGGGTA AGTATTACCA AATGCAAAAA GCTGCGGCTT TT - #AGTTCTGA   

     1200                                                                      

     #             1201                                                        

     - (2) INFORMATION FOR SEQ ID NO:190:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 400 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:190:                           

     - Lys Lys Ser Tyr His Leu Phe Gln Lys Gln Th - #r Glu Thr Arg Gly Ile     

     #                15                                                       

     - Gln Thr Gln Leu Ile Glu Glu Ser Leu Ser Gl - #n Gln Thr Ile Ile Gln     

     #            30                                                           

     - Ser Phe Asn Ala Gln Thr Glu Phe Ile Gln Ar - #g Leu Arg Glu Ala His     

     #        45                                                               

     - Asp Asn Tyr Ser Gly Tyr Ser Gln Ser Ala Il - #e Phe Tyr Ser Ser Thr     

     #    60                                                                   

     - Val Asn Pro Ser Thr Arg Phe Val Asn Ala Le - #u Ile Tyr Ala Leu Leu     

     #80                                                                       

     - Ala Gly Val Gly Ala Tyr Arg Ile Met Met Gl - #y Ser Ala Leu Thr Val     

     #                95                                                       

     - Gly Arg Leu Val Thr Phe Leu Asn Tyr Val Gl - #n Gln Tyr Thr Lys Pro     

     #           110                                                           

     - Phe Asn Asp Ile Ser Ser Val Leu Ala Glu Le - #u Gln Ser Ala Leu Ala     

     #       125                                                               

     - Cys Val Glu Arg Ile Tyr Gly Val Leu Asp Se - #r Pro Glu Val Ala Glu     

     #   140                                                                   

     - Thr Gly Lys Glu Val Leu Thr Thr Ser Asp Gl - #n Val Lys Gly Ala Ile     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Ser Phe Lys His Val Ser Phe Gly Tyr His Pr - #o Glu Lys Ile Leu Ile     

     #               175                                                       

     - Lys Asp Leu Ser Ile Asp Ile Pro Ala Gly Se - #r Lys Val Ala Ile Val     

     #           190                                                           

     - Gly Pro Thr Gly Ala Gly Lys Ser Thr Leu Il - #e Asn Leu Leu Met Arg     

     #       205                                                               

     - Phe Tyr Pro Ile Ser Ser Gly Asp Ile Leu Le - #u Asp Gly Gln Ser Ile     

     #   220                                                                   

     - Tyr Asp Tyr Thr Arg Val Ser Leu Arg Gln Gl - #n Phe Gly Met Val Leu     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Gln Glu Thr Trp Leu Thr Gln Gly Thr Ile Hi - #s Asp Asn Ile Ala Phe     

     #               255                                                       

     - Gly Asn Pro Glu Ala Ser Arg Glu Gln Val Il - #e Ala Ala Ala Lys Ala     

     #           270                                                           

     - Ala Asn Ala Asp Phe Phe Ile Gln Gln Leu Pr - #o Gln Gly Tyr Asp Thr     

     #       285                                                               

     - Lys Leu Glu Asn Ala Gly Glu Ser Leu Ser Va - #l Gly Gln Ala Gln Leu     

     #   300                                                                   

     - Leu Thr Ile Ala Arg Val Phe Leu Ala Ile Pr - #o Lys Ile Leu Ile Leu     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Asp Glu Ala Thr Ser Ser Ile Asp Thr Arg Th - #r Glu Val Leu Val Gln     

     #               335                                                       

     - Asp Ala Phe Ala Lys Leu Met Lys Gly Arg Th - #r Ser Phe Ile Ile Ala     

     #           350                                                           

     - His Arg Leu Ser Thr Ile Gln Asp Ala Asp Le - #u Ile Leu Val Leu Val     

     #       365                                                               

     - Asp Gly Asp Ile Val Glu Tyr Gly Asn His Gl - #n Glu Leu Met Asp Arg     

     #   380                                                                   

     - Lys Gly Lys Tyr Tyr Gln Met Gln Lys Ala Al - #a Ala Phe Ser Ser Glu     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - (2) INFORMATION FOR SEQ ID NO: 191:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1033 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #191: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - ACGAAATGCA GGGCAGACAG ATGCCTCGCA AATTGAAAAG GCGGCAGTTA GC - #CAAGGAGG   

       60                                                                      

     - AAAAGCAGTG AAAAAAACAG AAATTAGTAA AGACGCAGAC TTGCACGAAA TT - #TATCTAGC   

      120                                                                      

     - TGGAGGTTGT TTCTGGGGAG TGGAGGAATA TTTCTCACGT GTTCCCGGGG TG - #ACGGATGC   

      180                                                                      

     - CGTTTCAGGC TATGCAAATG GTAGAGGAGA AACAACCAAG TACGAATTGA TT - #AACCAAAC   

      240                                                                      

     - AGGTCATGCA GAAACCGTCC ATGTCACCTA TGATGCCAAG CAAATTTCTC TC - #AAGGAAAT   

      300                                                                      

     - CCTGCTTCAC TATTTCCGCA TTATCAATCC AACCAGCAAA AATAAACAAG GA - #AATGATGT   

      360                                                                      

     - GGGGACCCAG TACCGTACTG GTGTTTATTA CACAGATGAC AAGGATTTGG AA - #GTGATTAA   

      420                                                                      

     - CCAAGTCTTT GATGAGGTGG CTAAGAAATA CGATCAACCT CTAGCAGTTG AA - #AAGGAAAA   

      480                                                                      

     - CTTGAAGAAT TTTGTGGTGG CTGAGGATTA CCATCAAGAC TATCTCAAGA AA - #AATCCAAA   

      540                                                                      

     - TGGCTACTGC CATATCAATG TTAATCAGGC GGCCTATCCT GTCATTGATG CC - #AGCAAATA   

      600                                                                      

     - TCCAAAACCA AGTGATGAGG AATTGAAAAA GACCCTGTCA CCTGAGGAGT AT - #GCAGTTAC   

      660                                                                      

     - CCAGGAAAAT CAAACAGAAC GAGCTTTCTC AAACCGTTAC TGGGATAAAT TT - #GAATCCGG   

      720                                                                      

     - TATCTATGTG GATATAGCAA CTGGGGAACC TCTCTTTTCA TCAAAAGACA AA - #TTTGAGTC   

      780                                                                      

     - TGGTTGTGGC TGGCCTAGTT TTACCCAACC CATCAGTCCA GATGTTGTCA CC - #TACAAGGA   

      840                                                                      

     - AGATAAGTCC TACAATATGA CGCGTATGGA AGTGCGGAGC CGAGTAGGAG AT - #TCTCACCT   

      900                                                                      

     - TGGGCATGTC TTTACGGATG GTCCACAGGA CAAGGGCGGC TTACGTTACT GT - #ATCAATAG   

      960                                                                      

     - CCTCTCTATC CGCTTTATTC CCAAAGACCA AATGGAAGAA AAAGGCTACG CT - #TATTTACT   

     1020                                                                      

     #    1033                                                                 

     - (2) INFORMATION FOR SEQ ID NO:192:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 344 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:192:                           

     - Arg Asn Ala Gly Gln Thr Asp Ala Ser Gln Il - #e Glu Lys Ala Ala Val     

     #                15                                                       

     - Ser Gln Gly Gly Lys Ala Val Lys Lys Thr Gl - #u Ile Ser Lys Asp Ala     

     #            30                                                           

     - Asp Leu His Glu Ile Tyr Leu Ala Gly Gly Cy - #s Phe Trp Gly Val Glu     

     #        45                                                               

     - Glu Tyr Phe Ser Arg Val Pro Gly Val Thr As - #p Ala Val Ser Gly Tyr     

     #    60                                                                   

     - Ala Asn Gly Arg Gly Glu Thr Thr Lys Tyr Gl - #u Leu Ile Asn Gln Thr     

     #80                                                                       

     - Gly His Ala Glu Thr Val His Val Thr Tyr As - #p Ala Lys Gln Ile Ser     

     #                95                                                       

     - Leu Lys Glu Ile Leu Leu His Tyr Phe Arg Il - #e Ile Asn Pro Thr Ser     

     #           110                                                           

     - Lys Asn Lys Gln Gly Asn Asp Val Gly Thr Gl - #n Tyr Arg Thr Gly Val     

     #       125                                                               

     - Tyr Tyr Thr Asp Asp Lys Asp Leu Glu Val Il - #e Asn Gln Val Phe Asp     

     #   140                                                                   

     - Glu Val Ala Lys Lys Tyr Asp Gln Pro Leu Al - #a Val Glu Lys Glu Asn     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Leu Lys Asn Phe Val Val Ala Glu Asp Tyr Hi - #s Gln Asp Tyr Leu Lys     

     #               175                                                       

     - Lys Asn Pro Asn Gly Tyr Cys His Ile Asn Va - #l Asn Gln Ala Ala Tyr     

     #           190                                                           

     - Pro Val Ile Asp Ala Ser Lys Tyr Pro Lys Pr - #o Ser Asp Glu Glu Leu     

     #       205                                                               

     - Lys Lys Thr Leu Ser Pro Glu Glu Tyr Ala Va - #l Thr Gln Glu Asn Gln     

     #   220                                                                   

     - Thr Glu Arg Ala Phe Ser Asn Arg Tyr Trp As - #p Lys Phe Glu Ser Gly     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Ile Tyr Val Asp Ile Ala Thr Gly Glu Pro Le - #u Phe Ser Ser Lys Asp     

     #               255                                                       

     - Lys Phe Glu Ser Gly Cys Gly Trp Pro Ser Ph - #e Thr Gln Pro Ile Ser     

     #           270                                                           

     - Pro Asp Val Val Thr Tyr Lys Glu Asp Lys Se - #r Tyr Asn Met Thr Arg     

     #       285                                                               

     - Met Glu Val Arg Ser Arg Val Gly Asp Ser Hi - #s Leu Gly His Val Phe     

     #   300                                                                   

     - Thr Asp Gly Pro Gln Asp Lys Gly Gly Leu Ar - #g Tyr Cys Ile Asn Ser     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Leu Ser Ile Arg Phe Ile Pro Lys Asp Gln Me - #t Glu Glu Lys Gly Tyr     

     #               335                                                       

     - Ala Tyr Leu Leu Asp Tyr Val Asp                                         

                 340                                                           

     - (2) INFORMATION FOR SEQ ID NO: 193:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 396 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #193: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TGTATAGTTT TTAGCGCTTG TTCTTCTAAT TCTGNTAAAA ATGAAGAAAA TA - #CTTCTAAA   

       60                                                                      

     - GAGCATGCGC CTGATAAAAT AGTTTTAGAT CATGCTTTCG GTCAAACTAT AT - #TAGATAAA   

      120                                                                      

     - AAACCTGAAA GAGTTGCAAC TATTGCTTGG GGAAATCATG ATGTAGCATT AG - #CTTTAGGA   

      180                                                                      

     - ATAGTTCCTG TTGGATTTTC AAAAGCAAAT TACGGTGTAA GTGCTGATAA AG - #GAGTTTTA   

      240                                                                      

     - CCATGGACAG AAGAAAAAAT CAAAGAACTA AATGGTAAAG CTAACCTATT TG - #ACGATTTG   

      300                                                                      

     - GATGGACTTA ACTTTGAAGC AATATCAAAT TCTAAACCAG ATGTTATCTT AG - #CAGGTTAT   

      360                                                                      

     #      396         AAGA TTATGACACT CTATCA                                 

     - (2) INFORMATION FOR SEQ ID NO:194:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 132 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:194:                           

     - Cys Ile Val Phe Ser Ala Cys Ser Ser Asn Se - #r Xaa Lys Asn Glu Glu     

     #                15                                                       

     - Asn Thr Ser Lys Glu His Ala Pro Asp Lys Il - #e Val Leu Asp His Ala     

     #            30                                                           

     - Phe Gly Gln Thr Ile Leu Asp Lys Lys Pro Gl - #u Arg Val Ala Thr Ile     

     #        45                                                               

     - Ala Trp Gly Asn His Asp Val Ala Leu Ala Le - #u Gly Ile Val Pro Val     

     #    60                                                                   

     - Gly Phe Ser Lys Ala Asn Tyr Gly Val Ser Al - #a Asp Lys Gly Val Leu     

     #80                                                                       

     - Pro Trp Thr Glu Glu Lys Ile Lys Glu Leu As - #n Gly Lys Ala Asn Leu     

     #                95                                                       

     - Phe Asp Asp Leu Asp Gly Leu Asn Phe Glu Al - #a Ile Ser Asn Ser Lys     

     #           110                                                           

     - Pro Asp Val Ile Leu Ala Gly Tyr Ser Gly Il - #e Thr Lys Glu Asp Tyr     

     #       125                                                               

     - Asp Thr Leu Ser                                                         

         130                                                                   

     - (2) INFORMATION FOR SEQ ID NO: 195:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 844 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #195: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - GTGTGTCGAG CATATTCTGA AGCAAACCTA TCAAAATATA GAAATTATTT TA - #GTTGATGA   

       60                                                                      

     - CGGTTCTACG GATAATTCTG GGGAAATTTG TGATGCTTTT ATGATGCAAG AT - #AATCGTGT   

      120                                                                      

     - GCGAGTATTG CATCAAGAAA ATAAGGGGGG GGCAGCACAA GCTAAAAATA TG - #GGGATTAG   

      180                                                                      

     - TGTAGCTAAG GGAGAGTACA TCACGATTGT TGATTCAGAT GATATCGTAA AA - #GAAAATAT   

      240                                                                      

     - GATTGAAACT CTTTATCAGC AAGTCCAAGA AAAGGATGCA GATGTTGTTA TA - #GGGAATTA   

      300                                                                      

     - CTATAATTAT GACGAAAGTG ACGGGAATTT TTATTTTTAT GTAACAGGGC AA - #GATTTTTG   

      360                                                                      

     - CGTCGAAGAA TTAGCTATAC AAGAAATTAT GAACCGTCAA GCAGGAGATT GG - #AAATTCAA   

      420                                                                      

     - TAGCTCGGCC TTTATATTGC CGACATTTAA GTTGATTAAA AAAGAATTAT TC - #AATGAAGT   

      480                                                                      

     - TCACTTTTCA AATGGTCGCC GCTTTGATGA TGAAGCAACT ATGCATCGCT TT - #TATCTTTT   

      540                                                                      

     - AGCCTCTAAA ATCGTCTTTA TAAACGATAA TCTCTATCTG TATAGAAGAC GT - #TCAGGAAG   

      600                                                                      

     - CATCATGAGA ACGGAATTTG ATCTTTCCTG GGCAAGAGAT ATTGTTGAAG TG - #TTTTCTAA   

      660                                                                      

     - GAAAATATCG GATTGTGTCT TGGCTGGTTT GGATGTCTCC GTTCTGCGTA TT - #CGATTTGT   

      720                                                                      

     - CAATCTTTTA AAAGATTATA AGCAAACTTT AGAATACCAT CAATTAACAG AT - #ACTGAGGA   

      780                                                                      

     - ATATAAAGAT ATTTGTTTCA GATTAAAGTT GTTTTTTGAT GCAGAACAAA GA - #AATGGTAA   

      840                                                                      

     #            844                                                          

     - (2) INFORMATION FOR SEQ ID NO:196:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 281 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:196:                           

     - Cys Val Glu His Ile Leu Lys Gln Thr Tyr Gl - #n Asn Ile Glu Ile Ile     

     #                15                                                       

     - Leu Val Asp Asp Gly Ser Thr Asp Asn Ser Gl - #y Glu Ile Cys Asp Ala     

     #            30                                                           

     - Phe Met Met Gln Asp Asn Arg Val Arg Val Le - #u His Gln Glu Asn Lys     

     #        45                                                               

     - Gly Gly Ala Ala Gln Ala Lys Asn Met Gly Il - #e Ser Val Ala Lys Gly     

     #    60                                                                   

     - Glu Tyr Ile Thr Ile Val Asp Ser Asp Asp Il - #e Val Lys Glu Asn Met     

     #80                                                                       

     - Ile Glu Thr Leu Tyr Gln Gln Val Gln Glu Ly - #s Asp Ala Asp Val Val     

     #                95                                                       

     - Ile Gly Asn Tyr Tyr Asn Tyr Asp Glu Ser As - #p Gly Asn Phe Tyr Phe     

     #           110                                                           

     - Tyr Val Thr Gly Gln Asp Phe Cys Val Glu Gl - #u Leu Ala Ile Gln Glu     

     #       125                                                               

     - Ile Met Asn Arg Gln Ala Gly Asp Trp Lys Ph - #e Asn Ser Ser Ala Phe     

     #   140                                                                   

     - Ile Leu Pro Thr Phe Lys Leu Ile Lys Lys Gl - #u Leu Phe Asn Glu Val     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - His Phe Ser Asn Gly Arg Arg Phe Asp Asp Gl - #u Ala Thr Met His Arg     

     #               175                                                       

     - Phe Tyr Leu Leu Ala Ser Lys Ile Val Phe Il - #e Asn Asp Asn Leu Tyr     

     #           190                                                           

     - Leu Tyr Arg Arg Arg Ser Gly Ser Ile Met Ar - #g Thr Glu Phe Asp Leu     

     #       205                                                               

     - Ser Trp Ala Arg Asp Ile Val Glu Val Phe Se - #r Lys Lys Ile Ser Asp     

     #   220                                                                   

     - Cys Val Leu Ala Gly Leu Asp Val Ser Val Le - #u Arg Ile Arg Phe Val     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Asn Leu Leu Lys Asp Tyr Lys Gln Thr Leu Gl - #u Tyr His Gln Leu Thr     

     #               255                                                       

     - Asp Thr Glu Glu Tyr Lys Asp Ile Cys Phe Ar - #g Leu Lys Leu Phe Phe     

     #           270                                                           

     - Asp Ala Glu Gln Arg Asn Gly Lys Ser                                     

     #       280                                                               

     - (2) INFORMATION FOR SEQ ID NO: 197:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 811 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #197: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - GTGTTTGGAT AGCATTCAGA ATCAGACGTA TCAAAATTTT GAGTGTTTAT TA - #ATCAATGA   

       60                                                                      

     - TGGCTCTCCA GATCATTCAT CCAAAATATG TGAAGAATTT GTAGAGAAAG AT - #TCTCGTTT   

      120                                                                      

     - CAAATATTTT GAGAAAGCAA ACGGCGGTCT TTCATCAGCT CGTAACCTAG GT - #ATTGAATG   

      180                                                                      

     - TTCGGGGGGG GCGTACATTA CTTTTGTAGA CTCTGATGAT TGGTTGGAAC AT - #GATGCTTT   

      240                                                                      

     - AGACCGATTA TATGGTGCTT TGAAAAAGGA AAACGCAGAT ATTAGTATCG GG - #CGTTATAA   

      300                                                                      

     - TTCTTATGAT GAAACACGCT ATGTGTATAT GACTTATGTT ACGGATCCAG AT - #GATTCTCT   

      360                                                                      

     - AGAAGTGATA GAAGGTAAAG CAATTATGGA TAGGGAAGGT GTCGAAGAAG TC - #AGAAATGG   

      420                                                                      

     - GAACTGGACT GTAGCTGTCT TGAAGTTATT CAAGAGAGAG TTACTACAAG AT - #TTACCATT   

      480                                                                      

     - TCCTATAGGA AAAATTGCAG AGGATACTTA CTGGACATGG AAGGTACTTC TA - #AGAGCTTC   

      540                                                                      

     - GAGGATAGTC TATTTGAATC GTTGTGTTTA CTGGTACCGT GTTGGTTTAT CT - #GATACTTT   

      600                                                                      

     - ATCGAATACA TGGAGTGAAA AGCGTATGTA TGATGAAATT GGGGCTAGGG AA - #GAAAAGAT   

      660                                                                      

     - AGCTATTTTA GCAAGTTCAG ACTATGACTT GACCAATCAT ATTTTGATTT AT - #AAAAATAG   

      720                                                                      

     - ATTACAAAGA GTGATAGCAA AATTAGAAGA ACAAAATATG CAGTTCACAG AG - #ATTTACAG   

      780                                                                      

     #         811      TTGT CTTTACTTCC G                                      

     - (2) INFORMATION FOR SEQ ID NO:198:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 270 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:198:                           

     - Cys Leu Asp Ser Ile Gln Asn Gln Thr Tyr Gl - #n Asn Phe Glu Cys Leu     

     #                15                                                       

     - Leu Ile Asn Asp Gly Ser Pro Asp His Ser Se - #r Lys Ile Cys Glu Glu     

     #            30                                                           

     - Phe Val Glu Lys Asp Ser Arg Phe Lys Tyr Ph - #e Glu Lys Ala Asn Gly     

     #        45                                                               

     - Gly Leu Ser Ser Ala Arg Asn Leu Gly Ile Gl - #u Cys Ser Gly Gly Ala     

     #    60                                                                   

     - Tyr Ile Thr Phe Val Asp Ser Asp Asp Trp Le - #u Glu His Asp Ala Leu     

     #80                                                                       

     - Asp Arg Leu Tyr Gly Ala Leu Lys Lys Glu As - #n Ala Asp Ile Ser Ile     

     #                95                                                       

     - Gly Arg Tyr Asn Ser Tyr Asp Glu Thr Arg Ty - #r Val Tyr Met Thr Tyr     

     #           110                                                           

     - Val Thr Asp Pro Asp Asp Ser Leu Glu Val Il - #e Glu Gly Lys Ala Ile     

     #       125                                                               

     - Met Asp Arg Glu Gly Val Glu Glu Val Arg As - #n Gly Asn Trp Thr Val     

     #   140                                                                   

     - Ala Val Leu Lys Leu Phe Lys Arg Glu Leu Le - #u Gln Asp Leu Pro Phe     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Pro Ile Gly Lys Ile Ala Glu Asp Thr Tyr Tr - #p Thr Trp Lys Val Leu     

     #               175                                                       

     - Leu Arg Ala Ser Arg Ile Val Tyr Leu Asn Ar - #g Cys Val Tyr Trp Tyr     

     #           190                                                           

     - Arg Val Gly Leu Ser Asp Thr Leu Ser Asn Th - #r Trp Ser Glu Lys Arg     

     #       205                                                               

     - Met Tyr Asp Glu Ile Gly Ala Arg Glu Glu Ly - #s Ile Ala Ile Leu Ala     

     #   220                                                                   

     - Ser Ser Asp Tyr Asp Leu Thr Asn His Ile Le - #u Ile Tyr Lys Asn Arg     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Leu Gln Arg Val Ile Ala Lys Leu Glu Glu Gl - #n Asn Met Gln Phe Thr     

     #               255                                                       

     - Glu Ile Tyr Arg Arg Met Met Glu Lys Leu Se - #r Leu Leu Pro             

     #           270                                                           

     - (2) INFORMATION FOR SEQ ID NO: 199:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 2023 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #199: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - GTGCCTAGAT AGTATTATTA CTCAAACATA TAAAAATATT GAGATTGTTG TC - #GTTAATGA   

       60                                                                      

     - TGGTTCTACG GATGCTTCAG GTGAAATTTG TAAAGAATTT TCAGAAATGG AT - #CACCGAAT   

      120                                                                      

     - TCTCTATATA GAACAAGAAA ATGCTGGTCT TTCTGCCGCA CGAAACACCG GT - #CTGAATAA   

      180                                                                      

     - TATGTCCGGA AATTATGTGA CCTTTGTGGA CTCGGATGAT TGGATTGAGC AA - #GATTATGT   

      240                                                                      

     - AGAAACTCTA TATAAAAAAA TAGTAGAGTA TCAGGCTGAT ATTGCAGTTG GT - #AATTATTA   

      300                                                                      

     - TTCTTTCAAC GAAAGTGAAG GAATGTTCTA CTTTCATATA TTGGGAGACT CC - #TATTATGA   

      360                                                                      

     - GAAAGTATAT GATAATGTTT CTATCTTTGA GAACTTGTAT GAAACTCAAG AA - #ATGAAGAG   

      420                                                                      

     - TTTTGCTTTG ATATCTGCTT GGGGTAAACT CTATAAGGCA AGATTGTTTG AG - #CAGTTGCG   

      480                                                                      

     - CTTTGACATA GGTAAATTAG GAGAAGATGG TTACCTCAAT CAAAAGGTAT AT - #TTATTATC   

      540                                                                      

     - AGAAAAGGTA ATTTATTTAA ATAAAAGTCT TTATGCTTAT CGGATTAGAA AA - #GGTAGTTT   

      600                                                                      

     - ATCAAGAGTT TGGACAGAAA AGTGGATGCA CGCTTTAGTT GATGCTATGT CT - #GAACGTAT   

      660                                                                      

     - TACGCTACTA GCTAATATGG GTTATCCTCT AGAGAAACAC TTGGCAGTTT AT - #CGTCAGAT   

      720                                                                      

     - GTTGGAAGTC AGTCTCGCCA ACGGTCAAGC TAGTGGTTTA TCTGACACAG CA - #ACGTATAA   

      780                                                                      

     - AGAGTTTGAA ATGAAACAAA GGCTTTTAAA TCAGCTATCG AGACAAGAGG AA - #AGTGAAAA   

      840                                                                      

     - GAAAGCCATT GTCCTCGCAG CAAACTATGG CTATGTAGAC CAAGTTTTAA CG - #ACAATCAA   

      900                                                                      

     - GTCTATTTGT TATCATAATC GTTCGATTCG TTTTTATCTG ATTCATAGCG AT - #TTTCCAAA   

      960                                                                      

     - TGAATGGATT AAGCAATTAA ATAAGCGCTT AGAGAAGTTT GACTCAGAAA TT - #ATTAATTG   

     1020                                                                      

     - TCGGGTAACT TCTGAGCAAA TTTCATGTTA TAAATCGGAT ATTAGTTACA CA - #GTCTTTTT   

     1080                                                                      

     - ACGCTATTTC ATAGCTGATT TCGTGCAAGA AGACAAGGCC CTCTACTTGG AC - #TGTGATCT   

     1140                                                                      

     - AGTTGTAACG AAAAATCTGG ATGACTTGTT TGCTACAGAC TTACAAGATT AT - #CCTTTGGC   

     1200                                                                      

     - TGCTGTTAGA GATTTTGGGG GCAGAGCTTA TTTTGGTCAA GAAATCTTTA AT - #GCCGGTGT   

     1260                                                                      

     - TCTCTTGGTA AACAATGCTT TTTGGAAAAA AGAGAATATG ACCCAAAAAT TA - #ATTGATGT   

     1320                                                                      

     - AACCAATGAA TGGCATGATA AGGTGGATCA GGCAGATCAG AGCATCTTGA AT - #ATGCTTTT   

     1380                                                                      

     - TGAACATAAA TGGTTGGAAT TGGACTTTGA TTATAATCAT ATTGTCATTC AT - #AAACAGTT   

     1440                                                                      

     - TGCTGATTAT CAATTGCCTG AGGGTCAGGA TTATCCTGCT ATTATTCACT AT - #CTTTCTCA   

     1500                                                                      

     - TCGGAAACCG TGGAAAGATT TGGCGGCCCA AACCTATCGT GAAGTTTGGT GG - #TACTATCA   

     1560                                                                      

     - TGGGCTTGAA TGGACAGAAT TGGGACAAAA CCATCATTTA CATCCATTAC AA - #AGATCTCA   

     1620                                                                      

     - CATCTATCCA ATAAAGGAAC CTTTCACTTG TCTAATCTAT ACTGCCTCAG AC - #CATATTGA   

     1680                                                                      

     - ACAAATTGAG ACATTGGTTC AATCCTTGCC TGATATTCAG TTTAAGATAG CA - #GCTAGAGT   

     1740                                                                      

     - AATAGTTAGT GATCGATTGG CTCAGATGAC AATTTATCCA AACGTGACTA TA - #TTTAACGG   

     1800                                                                      

     - AATTCACTAT TTGGTAGATG TCGATAATGA ATTGGTAGAA ACCAGTCAAG TA - #CTTTTAGA   

     1860                                                                      

     - TATTAATCAT GGCGAAAAGA CAGAAGAAAT TCTCGATCAA TTTGCTAATC TT - #GGCAAGCC   

     1920                                                                      

     - TATCTTATCC TTTGAAAATA CTAAAACCTA TGAAGTAGGT CAGGAGGCAT AT - #GCTGTTGA   

     1980                                                                      

     #                 202 - #3AAATTGAG AGAAATAAGC AAA                         

     - (2) INFORMATION FOR SEQ ID NO:200:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 674 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:200:                           

     - Cys Leu Asp Ser Ile Ile Thr Gln Thr Tyr Ly - #s Asn Ile Glu Ile Val     

     #                15                                                       

     - Val Val Asn Asp Gly Ser Thr Asp Ala Ser Gl - #y Glu Ile Cys Lys Glu     

     #            30                                                           

     - Phe Ser Glu Met Asp His Arg Ile Leu Tyr Il - #e Glu Gln Glu Asn Ala     

     #        45                                                               

     - Gly Leu Ser Ala Ala Arg Asn Thr Gly Leu As - #n Asn Met Ser Gly Asn     

     #    60                                                                   

     - Tyr Val Thr Phe Val Asp Ser Asp Asp Trp Il - #e Glu Gln Asp Tyr Val     

     #80                                                                       

     - Glu Thr Leu Tyr Lys Lys Ile Val Glu Tyr Gl - #n Ala Asp Ile Ala Val     

     #                95                                                       

     - Gly Asn Tyr Tyr Ser Phe Asn Glu Ser Glu Gl - #y Met Phe Tyr Phe His     

     #           110                                                           

     - Ile Leu Gly Asp Ser Tyr Tyr Glu Lys Val Ty - #r Asp Asn Val Ser Ile     

     #       125                                                               

     - Phe Glu Asn Leu Tyr Glu Thr Gln Glu Met Ly - #s Ser Phe Ala Leu Ile     

     #   140                                                                   

     - Ser Ala Trp Gly Lys Leu Tyr Lys Ala Arg Le - #u Phe Glu Gln Leu Arg     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Phe Asp Ile Gly Lys Leu Gly Glu Asp Gly Ty - #r Leu Asn Gln Lys Val     

     #               175                                                       

     - Tyr Leu Leu Ser Glu Lys Val Ile Tyr Leu As - #n Lys Ser Leu Tyr Ala     

     #           190                                                           

     - Tyr Arg Ile Arg Lys Gly Ser Leu Ser Arg Va - #l Trp Thr Glu Lys Trp     

     #       205                                                               

     - Met His Ala Leu Val Asp Ala Met Ser Glu Ar - #g Ile Thr Leu Leu Ala     

     #   220                                                                   

     - Asn Met Gly Tyr Pro Leu Glu Lys His Leu Al - #a Val Tyr Arg Gln Met     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Leu Glu Val Ser Leu Ala Asn Gly Gln Ala Se - #r Gly Leu Ser Asp Thr     

     #               255                                                       

     - Ala Thr Tyr Lys Glu Phe Glu Met Lys Gln Ar - #g Leu Leu Asn Gln Leu     

     #           270                                                           

     - Ser Arg Gln Glu Glu Ser Glu Lys Lys Ala Il - #e Val Leu Ala Ala Asn     

     #       285                                                               

     - Tyr Gly Tyr Val Asp Gln Val Leu Thr Thr Il - #e Lys Ser Ile Cys Tyr     

     #   300                                                                   

     - His Asn Arg Ser Ile Arg Phe Tyr Leu Ile Hi - #s Ser Asp Phe Pro Asn     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Glu Trp Ile Lys Gln Leu Asn Lys Arg Leu Gl - #u Lys Phe Asp Ser Glu     

     #               335                                                       

     - Ile Ile Asn Cys Arg Val Thr Ser Glu Gln Il - #e Ser Cys Tyr Lys Ser     

     #           350                                                           

     - Asp Ile Ser Tyr Thr Val Phe Leu Arg Tyr Ph - #e Ile Ala Asp Phe Val     

     #       365                                                               

     - Gln Glu Asp Lys Ala Leu Tyr Leu Asp Cys As - #p Leu Val Val Thr Lys     

     #   380                                                                   

     - Asn Leu Asp Asp Leu Phe Ala Thr Asp Leu Gl - #n Asp Tyr Pro Leu Ala     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Ala Val Arg Asp Phe Gly Gly Arg Ala Tyr Ph - #e Gly Gln Glu Ile Phe     

     #               415                                                       

     - Asn Ala Gly Val Leu Leu Val Asn Asn Ala Ph - #e Trp Lys Lys Glu Asn     

     #           430                                                           

     - Met Thr Gln Lys Leu Ile Asp Val Thr Asn Gl - #u Trp His Asp Lys Val     

     #       445                                                               

     - Asp Gln Ala Asp Gln Ser Ile Leu Asn Met Le - #u Phe Glu His Lys Trp     

     #   460                                                                   

     - Leu Glu Leu Asp Phe Asp Tyr Asn His Ile Va - #l Ile His Lys Gln Phe     

     465                 4 - #70                 4 - #75                 4 -   

     #80                                                                       

     - Ala Asp Tyr Gln Leu Pro Glu Gly Gln Asp Ty - #r Pro Ala Ile Ile His     

     #               495                                                       

     - Tyr Leu Ser His Arg Lys Pro Trp Lys Asp Le - #u Ala Ala Gln Thr Tyr     

     #           510                                                           

     - Arg Glu Val Trp Trp Tyr Tyr His Gly Leu Gl - #u Trp Thr Glu Leu Gly     

     #       525                                                               

     - Gln Asn His His Leu His Pro Leu Gln Arg Se - #r His Ile Tyr Pro Ile     

     #   540                                                                   

     - Lys Glu Pro Phe Thr Cys Leu Ile Tyr Thr Al - #a Ser Asp His Ile Glu     

     545                 5 - #50                 5 - #55                 5 -   

     #60                                                                       

     - Gln Ile Glu Thr Leu Val Gln Ser Leu Pro As - #p Ile Gln Phe Lys Ile     

     #               575                                                       

     - Ala Ala Arg Val Ile Val Ser Asp Arg Leu Al - #a Gln Met Thr Ile Tyr     

     #           590                                                           

     - Pro Asn Val Thr Ile Phe Asn Gly Ile His Ty - #r Leu Val Asp Val Asp     

     #       605                                                               

     - Asn Glu Leu Val Glu Thr Ser Gln Val Leu Le - #u Asp Ile Asn His Gly     

     #   620                                                                   

     - Glu Lys Thr Glu Glu Ile Leu Asp Gln Phe Al - #a Asn Leu Gly Lys Pro     

     625                 6 - #30                 6 - #35                 6 -   

     #40                                                                       

     - Ile Leu Ser Phe Glu Asn Thr Lys Thr Tyr Gl - #u Val Gly Gln Glu Ala     

     #               655                                                       

     - Tyr Ala Val Asp Gln Val Gln Ala Met Ile Gl - #u Lys Leu Arg Glu Ile     

     #           670                                                           

     - Ser Lys                                                                 

     - (2) INFORMATION FOR SEQ ID NO: 201:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 910 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #201: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - CATTCAGAAG CAGACCTATC AAAATCTGGA AATTATTCTT GTTGATGATG GT - #GCAACAGA   

       60                                                                      

     - TGAAAGTGGT CGCTTGTGTG ATTCAATCGC TGAACAAGAT GACAGGGTGT CA - #GTGCTTCA   

      120                                                                      

     - TAAAAAGAAC GAAGGATTGT CGCAAGCACG AAATGATGGG ATGAAGCAGG CT - #CACGGGGA   

      180                                                                      

     - TTATCTGATT TTTATTGACT CAGATGATTA TATCCATCCA GAAATGATTC AG - #AGCTTATA   

      240                                                                      

     - TGAGCAATTA GTTCAAGAAG ATGCGGATGT TTCGAGCTGT GGTGTCATGA AT - #GTCTATGC   

      300                                                                      

     - TAATGATGAA AGCCCACAGT CAGCCAATCA GGATGACTAT TTTGTCTGTG AT - #TCTCAAAC   

      360                                                                      

     - ATTTCTAAAG GAATACCTCA TAGGTGAAAA AATACCTGGG ACGATTTGCA AT - #AAGCTAAT   

      420                                                                      

     - CAAGAGACAG ATTGCAACTG CCCTATCCTT TCCTAAGGGG TTGATTTACG AA - #GATGCCTA   

      480                                                                      

     - TTACCATTTT GATTTAATCA AGTTGGCCAA GAAGTATGTG GTTAATACTA AA - #CCCTATTA   

      540                                                                      

     - TTACTATTTC CATAGAGGGG ATAGTATTAC GACCAAACCC TATGCAGAGA AG - #GATTTAGC   

      600                                                                      

     - CTATATTGAT ATCTACCAAA AGTTTTATAA TGAAGTTGTG AAAAACTATC CT - #GACTTGAA   

      660                                                                      

     - AGAGGTCGCT TTTTTCAGAT TGGCCTATGC CCACTTCTTT ATTCTGGATA AG - #ATGTTGCT   

      720                                                                      

     - AGATGATCAG TATAAACAGT TTGAAGCCTA TTCTCAGATT CATCGTTTTT TA - #AAAGGCCA   

      780                                                                      

     - TGCCTTTGCT ATTTCTAGGA ATCCAATTTT CCGTAAGGGG AGAAGAATTA GT - #GCTTTGGC   

      840                                                                      

     - CCTATTCATA AATATTTCCT TATATCGATT CTTATTACTG AAAAATATTG AA - #AAATCTAA   

      900                                                                      

     #       910                                                               

     - (2) INFORMATION FOR SEQ ID NO:202:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 303 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:202:                           

     - Ile Gln Lys Gln Thr Tyr Gln Asn Leu Glu Il - #e Ile Leu Val Asp Asp     

     #                15                                                       

     - Gly Ala Thr Asp Glu Ser Gly Arg Leu Cys As - #p Ser Ile Ala Glu Gln     

     #            30                                                           

     - Asp Asp Arg Val Ser Val Leu His Lys Lys As - #n Glu Gly Leu Ser Gln     

     #        45                                                               

     - Ala Arg Asn Asp Gly Met Lys Gln Ala His Gl - #y Asp Tyr Leu Ile Phe     

     #    60                                                                   

     - Ile Asp Ser Asp Asp Tyr Ile His Pro Glu Me - #t Ile Gln Ser Leu Tyr     

     #80                                                                       

     - Glu Gln Leu Val Gln Glu Asp Ala Asp Val Se - #r Ser Cys Gly Val Met     

     #                95                                                       

     - Asn Val Tyr Ala Asn Asp Glu Ser Pro Gln Se - #r Ala Asn Gln Asp Asp     

     #           110                                                           

     - Tyr Phe Val Cys Asp Ser Gln Thr Phe Leu Ly - #s Glu Tyr Leu Ile Gly     

     #       125                                                               

     - Glu Lys Ile Pro Gly Thr Ile Cys Asn Lys Le - #u Ile Lys Arg Gln Ile     

     #   140                                                                   

     - Ala Thr Ala Leu Ser Phe Pro Lys Gly Leu Il - #e Tyr Glu Asp Ala Tyr     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Tyr His Phe Asp Leu Ile Lys Leu Ala Lys Ly - #s Tyr Val Val Asn Thr     

     #               175                                                       

     - Lys Pro Tyr Tyr Tyr Tyr Phe His Arg Gly As - #p Ser Ile Thr Thr Lys     

     #           190                                                           

     - Pro Tyr Ala Glu Lys Asp Leu Ala Tyr Ile As - #p Ile Tyr Gln Lys Phe     

     #       205                                                               

     - Tyr Asn Glu Val Val Lys Asn Tyr Pro Asp Le - #u Lys Glu Val Ala Phe     

     #   220                                                                   

     - Phe Arg Leu Ala Tyr Ala His Phe Phe Ile Le - #u Asp Lys Met Leu Leu     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Asp Asp Gln Tyr Lys Gln Phe Glu Ala Tyr Se - #r Gln Ile His Arg Phe     

     #               255                                                       

     - Leu Lys Gly His Ala Phe Ala Ile Ser Arg As - #n Pro Ile Phe Arg Lys     

     #           270                                                           

     - Gly Arg Arg Ile Ser Ala Leu Ala Leu Phe Il - #e Asn Ile Ser Leu Tyr     

     #       285                                                               

     - Arg Phe Leu Leu Leu Lys Asn Ile Glu Lys Se - #r Lys Lys Leu His         

     #   300                                                                   

     - (2) INFORMATION FOR SEQ ID NO: 203:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1972 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #203: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TAAGGCTGAT AATCGTGTTC AAATGAGAAC GACGATTAAT AATGAATCGC CA - #TTGTTGCT   

       60                                                                      

     - TTCTCCGTTG TATGGCAATG ATAATGGTAA CGGATTATGG TGGGGGAACA CA - #TTGAAGGG   

      120                                                                      

     - AGCATGGGAA GCTATTCCTG AAGATGTAAA GCCATATGCA GCGATTGAAC TT - #CATCCTGC   

      180                                                                      

     - AAAAGTCTGT AAACCAACAA GTTGTATTCC ACGAGATACG AAAGAATTGA GA - #GAATGGTA   

      240                                                                      

     - TGTCAAGATG TTGGAGGAAG CTCAAAGTCT AAACATTCCA GTTTTCTTGG TT - #ATTATGTC   

      300                                                                      

     - GGCTGGAGAG CGTAATACAG TTCCTCCAGA GTGGTTAGAT GAACAATTCC AA - #AAGTATAG   

      360                                                                      

     - TGTGTTAAAA GGTGTTTTAA ATATTGAGAA TTATTGGATT TACAATAACC AG - #TTAGCTCC   

      420                                                                      

     - GCATAGTGCT AAATATTTGG AAGTTTGTGC CAAATATGGA GCGCATTTTA TC - #TGGCATGA   

      480                                                                      

     - TCATGAAAAA TGGTTCTGGG AAACTATTAT GAATGATCCG ACATTCTTTG AA - #GCGAGTCA   

      540                                                                      

     - AAAATATCAT AAAAATTTGG TGTTGGCAAC TAAAAATACG CCAATAAGAG AT - #GATGCGGG   

      600                                                                      

     - TACAGATTCT ATCGTTAGTG GATTTTGGTT GAGTGGCTTA TGTGATAACT GG - #GGCTCATC   

      660                                                                      

     - AACAGATACA TGGAAATGGT GGGAAAAACA TTATACAAAC ACATTTGAAA CT - #GGAAGAGC   

      720                                                                      

     - TAGGGATATG AGATCCTATG CATCGGAACC AGAATCAATG ATTGCTATGG AA - #ATGATGAA   

      780                                                                      

     - TGTATATACT GGGGGAGGCA CAGTTTATAA TTTCGAATGT GCCGCGTATA CA - #TTTATGAC   

      840                                                                      

     - AAATGATGTA CCAACTCCAG CATTTACTAA AGGTATTATT CCTTTCTTTA GA - #CATGCTAT   

      900                                                                      

     - ACAAAATCCA GCTCCAAGTA AGGAAGAAGT TGTAAATAGA ACAAAAGCTG TA - #TTTTGGAA   

      960                                                                      

     - TGGAGAAGGT AGGATTAGTT CATTAAACGG ATTTTATCAA GGACTTTATT CG - #AATGATGA   

     1020                                                                      

     - AACAATGCCT TTATATAATA ATGGGAGATA TCATATTCTT CCTGTAATAC AT - #GAGAAAAT   

     1080                                                                      

     - TGATAAGGAA AAGATTTCAT CTATATTCCC TAATGCAAAA ATTTTGACTA AA - #AATAGTGA   

     1140                                                                      

     - GGAATTGTCT AGTAAAGTCA ACTATTTAAA CTCGCTTTAT CCAAAACTTT AT - #GAAGGAGA   

     1200                                                                      

     - TGGGTATGCT CAGCGTGTAG GTAATTCCTG GTATATTTAT AATAGTAATG CT - #AATATCAA   

     1260                                                                      

     - TAAAAATCAG CAAGTAATGT TGCCTATGTA TACTAATAAT ACAAAGTCGT TA - #TCGTTAGA   

     1320                                                                      

     - TTTGACGCCA CATACTTACG CTGTTGTTAA AGAAAATCCA AATAATTTAC AT - #ATTTTATT   

     1380                                                                      

     - GAATAATTAC AGGACAGATA AGACAGCTAT GTGGGCATTA TCAGGAAATT TT - #GATGCATC   

     1440                                                                      

     - AAAAAGTTGG AAGAAAGAAG AATTAGAGTT AGCGAACTGG ATAAGCAAAA AT - #TATTCCAT   

     1500                                                                      

     - CAATCCTGTA GATAATGACT TTAGGACAAC AACACTTACA TTAAAAGGGC AT - #ACTGGTCA   

     1560                                                                      

     - TAAACCTCAG ATAAATATAA GTGGCGATAA AAATCATTAT ACTTATACAG AA - #AATTGGGA   

     1620                                                                      

     - TGAGAATACC CATGTTTATA CCATTACGGT TAATCATAAT GGAATGGTAG AG - #ATGTCTAT   

     1680                                                                      

     - AAATACTGAG GGGACAGGTC CAGTCTCTTT CCCAACACCA GATAAATTTA AT - #GATGGTAA   

     1740                                                                      

     - TTTGAATATA GCATATGCAA AACCAACAAC ACAAAGTTCT GTAGATTACA AT - #GGAGACCC   

     1800                                                                      

     - TAATAGAGCT GTGGATGGTA ACAGAAATGG TAATTTTAAC TCTGGTTCGG TA - #ACACACAC   

     1860                                                                      

     - TAGGGCAGAT AATCCCTCTT GGTGGGAAGT CGATTTGAAA AAAATGGATA AA - #GTTGGGCT   

     1920                                                                      

     - TGTTAAAATT TATAATCGCA CAGATGCTGA GACTCAACGT CTATCTAATT TT - #           

     1972                                                                      

     - (2) INFORMATION FOR SEQ ID NO:204:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 657 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:204:                           

     - Lys Ala Asp Asn Arg Val Gln Met Arg Thr Th - #r Ile Asn Asn Glu Ser     

     #                15                                                       

     - Pro Leu Leu Leu Ser Pro Leu Tyr Gly Asn As - #p Asn Gly Asn Gly Leu     

     #            30                                                           

     - Trp Trp Gly Asn Thr Leu Lys Gly Ala Trp Gl - #u Ala Ile Pro Glu Asp     

     #        45                                                               

     - Val Lys Pro Tyr Ala Ala Ile Glu Leu His Pr - #o Ala Lys Val Cys Lys     

     #    60                                                                   

     - Pro Thr Ser Cys Ile Pro Arg Asp Thr Lys Gl - #u Leu Arg Glu Trp Tyr     

     #80                                                                       

     - Val Lys Met Leu Glu Glu Ala Gln Ser Leu As - #n Ile Pro Val Phe Leu     

     #                95                                                       

     - Val Ile Met Ser Ala Gly Glu Arg Asn Thr Va - #l Pro Pro Glu Trp Leu     

     #           110                                                           

     - Asp Glu Gln Phe Gln Lys Tyr Ser Val Leu Ly - #s Gly Val Leu Asn Ile     

     #       125                                                               

     - Glu Asn Tyr Trp Ile Tyr Asn Asn Gln Leu Al - #a Pro His Ser Ala Lys     

     #   140                                                                   

     - Tyr Leu Glu Val Cys Ala Lys Tyr Gly Ala Hi - #s Phe Ile Trp His Asp     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - His Glu Lys Trp Phe Trp Glu Thr Ile Met As - #n Asp Pro Thr Phe Phe     

     #               175                                                       

     - Glu Ala Ser Gln Lys Tyr His Lys Asn Leu Va - #l Leu Ala Thr Lys Asn     

     #           190                                                           

     - Thr Pro Ile Arg Asp Asp Ala Gly Thr Asp Se - #r Ile Val Ser Gly Phe     

     #       205                                                               

     - Trp Leu Ser Gly Leu Cys Asp Asn Trp Gly Se - #r Ser Thr Asp Thr Trp     

     #   220                                                                   

     - Lys Trp Trp Glu Lys His Tyr Thr Asn Thr Ph - #e Glu Thr Gly Arg Ala     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Arg Asp Met Arg Ser Tyr Ala Ser Glu Pro Gl - #u Ser Met Ile Ala Met     

     #               255                                                       

     - Glu Met Met Asn Val Tyr Thr Gly Gly Gly Th - #r Val Tyr Asn Phe Glu     

     #           270                                                           

     - Cys Ala Ala Tyr Thr Phe Met Thr Asn Asp Va - #l Pro Thr Pro Ala Phe     

     #       285                                                               

     - Thr Lys Gly Ile Ile Pro Phe Phe Arg His Al - #a Ile Gln Asn Pro Ala     

     #   300                                                                   

     - Pro Ser Lys Glu Glu Val Val Asn Arg Thr Ly - #s Ala Val Phe Trp Asn     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Gly Glu Gly Arg Ile Ser Ser Leu Asn Gly Ph - #e Tyr Gln Gly Leu Tyr     

     #               335                                                       

     - Ser Asn Asp Glu Thr Met Pro Leu Tyr Asn As - #n Gly Arg Tyr His Ile     

     #           350                                                           

     - Leu Pro Val Ile His Glu Lys Ile Asp Lys Gl - #u Lys Ile Ser Ser Ile     

     #       365                                                               

     - Phe Pro Asn Ala Lys Ile Leu Thr Lys Asn Se - #r Glu Glu Leu Ser Ser     

     #   380                                                                   

     - Lys Val Asn Tyr Leu Asn Ser Leu Tyr Pro Ly - #s Leu Tyr Glu Gly Asp     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Gly Tyr Ala Gln Arg Val Gly Asn Ser Trp Ty - #r Ile Tyr Asn Ser Asn     

     #               415                                                       

     - Ala Asn Ile Asn Lys Asn Gln Gln Val Met Le - #u Pro Met Tyr Thr Asn     

     #           430                                                           

     - Asn Thr Lys Ser Leu Ser Leu Asp Leu Thr Pr - #o His Thr Tyr Ala Val     

     #       445                                                               

     - Val Lys Glu Asn Pro Asn Asn Leu His Ile Le - #u Leu Asn Asn Tyr Arg     

     #   460                                                                   

     - Thr Asp Lys Thr Ala Met Trp Ala Leu Ser Gl - #y Asn Phe Asp Ala Ser     

     465                 4 - #70                 4 - #75                 4 -   

     #80                                                                       

     - Lys Ser Trp Lys Lys Glu Glu Leu Glu Leu Al - #a Asn Trp Ile Ser Lys     

     #               495                                                       

     - Asn Tyr Ser Ile Asn Pro Val Asp Asn Asp Ph - #e Arg Thr Thr Thr Leu     

     #           510                                                           

     - Thr Leu Lys Gly His Thr Gly His Lys Pro Gl - #n Ile Asn Ile Ser Gly     

     #       525                                                               

     - Asp Lys Asn His Tyr Thr Tyr Thr Glu Asn Tr - #p Asp Glu Asn Thr His     

     #   540                                                                   

     - Val Tyr Thr Ile Thr Val Asn His Asn Gly Me - #t Val Glu Met Ser Ile     

     545                 5 - #50                 5 - #55                 5 -   

     #60                                                                       

     - Asn Thr Glu Gly Thr Gly Pro Val Ser Phe Pr - #o Thr Pro Asp Lys Phe     

     #               575                                                       

     - Asn Asp Gly Asn Leu Asn Ile Ala Tyr Ala Ly - #s Pro Thr Thr Gln Ser     

     #           590                                                           

     - Ser Val Asp Tyr Asn Gly Asp Pro Asn Arg Al - #a Val Asp Gly Asn Arg     

     #       605                                                               

     - Asn Gly Asn Phe Asn Ser Gly Ser Val Thr Hi - #s Thr Arg Ala Asp Asn     

     #   620                                                                   

     - Pro Ser Trp Trp Glu Val Asp Leu Lys Lys Me - #t Asp Lys Val Gly Leu     

     625                 6 - #30                 6 - #35                 6 -   

     #40                                                                       

     - Val Lys Ile Tyr Asn Arg Thr Asp Ala Glu Th - #r Gln Arg Leu Ser Asn     

     #               655                                                       

     - Phe                                                                     

     - (2) INFORMATION FOR SEQ ID NO: 205:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 811 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #205: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - CTGTGGCAAT CAGTCAGCTG CTTCCAAACA GTCAGCTTCA GGAACGATTG AG - #GTGATTTC   

       60                                                                      

     - ACGAGAAAAT GGCTCTGGGA CACGGGGTGC CTTCACAGAA ATCACAGGGA TT - #CTCAAAAA   

      120                                                                      

     - AGACGGTGAT AAAAAAATTG ACAACACTGC CAAAACAGCT GTGATTCAAA AT - #AGTACAGA   

      180                                                                      

     - AGGTGTTCTC TCAGCAGTTC AAGGGAATGC TAATGCTATC GGCTACATCT CC - #TTGGGATC   

      240                                                                      

     - TTTAACGAAA TCTGTCAAGG CTTTAGAGAT TGATGGTGTC AAGGCTAGTC GA - #GACACAGT   

      300                                                                      

     - TTTAGATGGT GAATACCCTC TTCAACGTCC CTTCAACATT GTTTGGTCTT CT - #AATCTTTC   

      360                                                                      

     - CAAGCTAGGT CAAGATTTTA TCAGCTTTAT CCACTCCAAA CAAGGTCAAC AA - #GTGGTCAC   

      420                                                                      

     - AGATAATAAA TTTATTGAAG CTAAAACCGA AACCACGGAA TATACAAGCC AA - #CACTTATC   

      480                                                                      

     - AGGCAAGTTG TCTGTTGTAG GTTCCACTTC AGTATCTTCT TTAATGGAAA AA - #TTAGCAGA   

      540                                                                      

     - AGCTTATAAA AAAGAAAATC CAGAAGTTAC GATTGATATT ACCTCTAATG GG - #TCTTCAGC   

      600                                                                      

     - AGGTATTACC GCTGTTAAGG AGAAAACCGC TGATATTGGT ATGGTTTCTA GG - #GAATTAAC   

      660                                                                      

     - TCCTGAAGAA GGTAAGAGTC TCACCCATGA TGCTATTGCT TTAGACGGTA TT - #GCTGTTGT   

      720                                                                      

     - GGTCAATAAT GACAATAAGG CAAGCCAAGT CAGTATGGCT GAACTTGCAG AC - #GTTTTTAG   

      780                                                                      

     #         811      TGGG ACAAGATTAA A                                      

     - (2) INFORMATION FOR SEQ ID NO:206:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 270 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:206:                           

     - Cys Gly Asn Gln Ser Ala Ala Ser Lys Gln Se - #r Ala Ser Gly Thr Ile     

     #                15                                                       

     - Glu Val Ile Ser Arg Glu Asn Gly Ser Gly Th - #r Arg Gly Ala Phe Thr     

     #            30                                                           

     - Glu Ile Thr Gly Ile Leu Lys Lys Asp Gly As - #p Lys Lys Ile Asp Asn     

     #        45                                                               

     - Thr Ala Lys Thr Ala Val Ile Gln Asn Ser Th - #r Glu Gly Val Leu Ser     

     #    60                                                                   

     - Ala Val Gln Gly Asn Ala Asn Ala Ile Gly Ty - #r Ile Ser Leu Gly Ser     

     #80                                                                       

     - Leu Thr Lys Ser Val Lys Ala Leu Glu Ile As - #p Gly Val Lys Ala Ser     

     #                95                                                       

     - Arg Asp Thr Val Leu Asp Gly Glu Tyr Pro Le - #u Gln Arg Pro Phe Asn     

     #           110                                                           

     - Ile Val Trp Ser Ser Asn Leu Ser Lys Leu Gl - #y Gln Asp Phe Ile Ser     

     #       125                                                               

     - Phe Ile His Ser Lys Gln Gly Gln Gln Val Va - #l Thr Asp Asn Lys Phe     

     #   140                                                                   

     - Ile Glu Ala Lys Thr Glu Thr Thr Glu Tyr Th - #r Ser Gln His Leu Ser     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Gly Lys Leu Ser Val Val Gly Ser Thr Ser Va - #l Ser Ser Leu Met Glu     

     #               175                                                       

     - Lys Leu Ala Glu Ala Tyr Lys Lys Glu Asn Pr - #o Glu Val Thr Ile Asp     

     #           190                                                           

     - Ile Thr Ser Asn Gly Ser Ser Ala Gly Ile Th - #r Ala Val Lys Glu Lys     

     #       205                                                               

     - Thr Ala Asp Ile Gly Met Val Ser Arg Glu Le - #u Thr Pro Glu Glu Gly     

     #   220                                                                   

     - Lys Ser Leu Thr His Asp Ala Ile Ala Leu As - #p Gly Ile Ala Val Val     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Val Asn Asn Asp Asn Lys Ala Ser Gln Val Se - #r Met Ala Glu Leu Ala     

     #               255                                                       

     - Asp Val Phe Ser Gly Lys Leu Thr Thr Trp As - #p Lys Ile Lys             

     #           270                                                           

     - (2) INFORMATION FOR SEQ ID NO: 207:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 805 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #207: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TTGTCAACAA CAACATGCTA CTTCTGAGGG GACGAATCAA AGGCAAAGCA GT - #TCAGCGAA   

       60                                                                      

     - AGTTCCATGG AAAGCTTCAT ACACCAACCT AAACAACCAG GTAAGTACAG AA - #GAGGTCAA   

      120                                                                      

     - ATCTCTCTTA TCAGCTCACT TGGATCCAAA TAGTGTTGAT GCATTTTTTA AT - #CTCGTTAA   

      180                                                                      

     - TGACTATAAT ACCATTGTCG GCTCAACTGG CTTATCAGGA GATTTCACTT CC - #TTTACTCA   

      240                                                                      

     - CACCGAATAC GATGTTGAGA AAATCAGTCA TCTCTGGAAT CAAAAGAAGG GC - #GATTTTGT   

      300                                                                      

     - TGGGACCAAC TGCCGTATCA ATAGTTATTG TCTTTTGAAA AATTCAGTCA CC - #ATTCCAAA   

      360                                                                      

     - GCTTGAAAAG AATGACCAGT TGCTTTTCCT AGATAATGAT GCGATTGATA AA - #GGAAAGGT   

      420                                                                      

     - CTTTGATTCA CAAGATAAGG AAGAGTTTGA TATTCTATTT TCGAGAGTTC CA - #ACTGAGTC   

      480                                                                      

     - AACTACAGAT GTCAAGGTTC ACGCTGAAAA GATGGAAGCA TTCTTCTCAC AA - #TTTCAATT   

      540                                                                      

     - CAATGAAAAA GCTCGAATGC TGTCTGTAGT CTTGCACGAC AATTTGGATG GC - #GAGTATCT   

      600                                                                      

     - GTTTGTAGGC CACGTTGGGG TCTTAGTACC TGCTGATGAC GGTTTCTTAT TT - #GTAGAGAA   

      660                                                                      

     - ATTGACTTTC GAAGAGCCCT ACCAAGCGAT TAAATTTGCT AGTAAGGAAG AT - #TGCTACAA   

      720                                                                      

     - GTATTTGGGC ACCAAGTATG CGGATTATAC AGGCGAGGGA CTGGCTAAGC CT - #TTTATCAT   

      780                                                                      

     #              805 GTTA AACTT                                             

     - (2) INFORMATION FOR SEQ ID NO:208:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 268 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:208:                           

     - Cys Gln Gln Gln His Ala Thr Ser Glu Gly Th - #r Asn Gln Arg Gln Ser     

     #                15                                                       

     - Ser Ser Ala Lys Val Pro Trp Lys Ala Ser Ty - #r Thr Asn Leu Asn Asn     

     #            30                                                           

     - Gln Val Ser Thr Glu Glu Val Lys Ser Leu Le - #u Ser Ala His Leu Asp     

     #        45                                                               

     - Pro Asn Ser Val Asp Ala Phe Phe Asn Leu Va - #l Asn Asp Tyr Asn Thr     

     #    60                                                                   

     - Ile Val Gly Ser Thr Gly Leu Ser Gly Asp Ph - #e Thr Ser Phe Thr His     

     #80                                                                       

     - Thr Glu Tyr Asp Val Glu Lys Ile Ser His Le - #u Trp Asn Gln Lys Lys     

     #                95                                                       

     - Gly Asp Phe Val Gly Thr Asn Cys Arg Ile As - #n Ser Tyr Cys Leu Leu     

     #           110                                                           

     - Lys Asn Ser Val Thr Ile Pro Lys Leu Glu Ly - #s Asn Asp Gln Leu Leu     

     #       125                                                               

     - Phe Leu Asp Asn Asp Ala Ile Asp Lys Gly Ly - #s Val Phe Asp Ser Gln     

     #   140                                                                   

     - Asp Lys Glu Glu Phe Asp Ile Leu Phe Ser Ar - #g Val Pro Thr Glu Ser     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Thr Thr Asp Val Lys Val His Ala Glu Lys Me - #t Glu Ala Phe Phe Ser     

     #               175                                                       

     - Gln Phe Gln Phe Asn Glu Lys Ala Arg Met Le - #u Ser Val Val Leu His     

     #           190                                                           

     - Asp Asn Leu Asp Gly Glu Tyr Leu Phe Val Gl - #y His Val Gly Val Leu     

     #       205                                                               

     - Val Pro Ala Asp Asp Gly Phe Leu Phe Val Gl - #u Lys Leu Thr Phe Glu     

     #   220                                                                   

     - Glu Pro Tyr Gln Ala Ile Lys Phe Ala Ser Ly - #s Glu Asp Cys Tyr Lys     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Tyr Leu Gly Thr Lys Tyr Ala Asp Tyr Thr Gl - #y Glu Gly Leu Ala Lys     

     #               255                                                       

     - Pro Phe Ile Met Asp Asn Asp Lys Trp Val Ly - #s Leu                     

     #           265                                                           

     - (2) INFORMATION FOR SEQ ID NO: 209:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 508 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #209: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TTGTTCAGGC AAGTCCGTGA CTAGTGAACA CCAAACGAAA GATGAAATGA AG - #ACGGAGCA   

       60                                                                      

     - GACAGCTAGT AAAACAAGCG CAGCTAAAGG GAAAGAGGTG GCTGATTTTG AA - #TTGATGGG   

      120                                                                      

     - AGTAGATGGC AAGACCTACC GTTTATCTGA TTACAAGGGC AAGAAAGTCT AT - #CTCAAATT   

      180                                                                      

     - CTGGGCTTCT TGGTGTTCCA TCTGTCTGGC TAGTCTTCCA GATACGGATG AG - #ATTGCTAA   

      240                                                                      

     - AGAAGCTGGT GATGACTATG TGGTCTTGAC AGTAGTGTCA CCAGGACATA AG - #GGAGAGCA   

      300                                                                      

     - ATCTGAAGCG GACTTTAAGA ATTGGTATAA GGGATTGGAT TATAAAAATC TC - #CCAGTCCT   

      360                                                                      

     - AGTTGACCCA TCAGGCAAAC TTTTGGAAAC TTATGGTGTC CGTTCTTACC CA - #ACCCAAGC   

      420                                                                      

     - CTTTATAGAC AAAGAAGGCA AGCTGGTCAA AACACATCCA GGATTCATGG AA - #AAAGATGC   

      480                                                                      

     #            508   AAGG AATTAGCC                                          

     - (2) INFORMATION FOR SEQ ID NO:210:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 169 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:210:                           

     - Cys Ser Gly Lys Ser Val Thr Ser Glu His Gl - #n Thr Lys Asp Glu Met     

     #                15                                                       

     - Lys Thr Glu Gln Thr Ala Ser Lys Thr Ser Al - #a Ala Lys Gly Lys Glu     

     #            30                                                           

     - Val Ala Asp Phe Glu Leu Met Gly Val Asp Gl - #y Lys Thr Tyr Arg Leu     

     #        45                                                               

     - Ser Asp Tyr Lys Gly Lys Lys Val Tyr Leu Ly - #s Phe Trp Ala Ser Trp     

     #    60                                                                   

     - Cys Ser Ile Cys Leu Ala Ser Leu Pro Asp Th - #r Asp Glu Ile Ala Lys     

     #80                                                                       

     - Glu Ala Gly Asp Asp Tyr Val Val Leu Thr Va - #l Val Ser Pro Gly His     

     #                95                                                       

     - Lys Gly Glu Gln Ser Glu Ala Asp Phe Lys As - #n Trp Tyr Lys Gly Leu     

     #           110                                                           

     - Asp Tyr Lys Asn Leu Pro Val Leu Val Asp Pr - #o Ser Gly Lys Leu Leu     

     #       125                                                               

     - Glu Thr Tyr Gly Val Arg Ser Tyr Pro Thr Gl - #n Ala Phe Ile Asp Lys     

     #   140                                                                   

     - Glu Gly Lys Leu Val Lys Thr His Pro Gly Ph - #e Met Glu Lys Asp Ala     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Ile Leu Gln Thr Leu Lys Glu Leu Ala                                     

                     165                                                       

     - (2) INFORMATION FOR SEQ ID NO: 211:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 994 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #211: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - CTCGCAAATT GAAAAGGCGG CAGTTAGCCA AGGAGGAAAA GCAGTGAAAA AA - #ACAGAAAT   

       60                                                                      

     - TAGTAAAGAC GCAGACTTGC ACGAAATTTA TCTAGCTGGA GGTTGTTTCT GG - #GGAGTGGA   

      120                                                                      

     - GGAATATTTC TCACGTGTTC CCGGGGTGAC GGATGCCGTT TCAGGCTATG CA - #AATGGTAG   

      180                                                                      

     - AGGAGAAACA ACCAAGTACG AATTGATTAA CCAAACAGGT CATGCAGAAA CC - #GTCCATGT   

      240                                                                      

     - CACCTATGAT GCCAAGCAAA TTTCTCTCAA GGAAATCCTG CTTCACTATT TC - #CGCATTAT   

      300                                                                      

     - CAATCCAACC AGCAAAAATA AACAAGGAAA TGATGTGGGG ACCCAGTACC GT - #ACTGGTGT   

      360                                                                      

     - TTATTACACA GATGACAAGG ATTTGGAAGT GATTAACCAA GTCTTTGATG AG - #GTGGCTAA   

      420                                                                      

     - GAAATACGAT CAACCTCTAG CAGTTGAAAA GGAAAACTTG AAGAATTTTG TG - #GTGGCTGA   

      480                                                                      

     - GGATTACCAT CAAGACTATC TCAAGAAAAA TCCAAATGGC TACTGCCATA TC - #AATGTTAA   

      540                                                                      

     - TCAGGCGGCC TATCCTGTCA TTGATGCCAG CAAATATCCA AAACCAAGTG AT - #GAGGAATT   

      600                                                                      

     - GAAAAAGACC CTGTCACCTG AGGAGTATGC AGTTACCCAG GAAAATCAAA CA - #GAACGAGC   

      660                                                                      

     - TTTCTCAAAC CGTTACTGGG ATAAATTTGA ATCCGGTATC TATGTGGATA TA - #GCAACTGG   

      720                                                                      

     - GGAACCTCTC TTTTCATCAA AAGACAAATT TGAGTCTGGT TGTGGCTGGC CT - #AGTTTTAC   

      780                                                                      

     - CCAACCCATC AGTCCAGATG TTGTCACCTA CAAGGAAGAT AAGTCCTACA AT - #ATGACGCG   

      840                                                                      

     - TATGGAAGTG CGGAGCCGAG TAGGAGATTC TCACCTTGGG CATGTCTTTA CG - #GATGGTCC   

      900                                                                      

     - ACAGGACAAG GGCGGCTTAC GTTACTGTAT CAATAGCCTC TCTATCCGCT TT - #ATTCCCAA   

      960                                                                      

     #       994        AAAG GTACGCTTAT TTAC                                   

     - (2) INFORMATION FOR SEQ ID NO:212:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 331 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:212:                           

     - Ser Gln Ile Glu Lys Ala Ala Val Ser Gln Gl - #y Gly Lys Ala Val Lys     

     #                15                                                       

     - Lys Thr Glu Ile Ser Lys Asp Ala Asp Leu Hi - #s Glu Ile Tyr Leu Ala     

     #            30                                                           

     - Gly Gly Cys Phe Trp Gly Val Glu Glu Tyr Ph - #e Ser Arg Val Pro Gly     

     #        45                                                               

     - Val Thr Asp Ala Val Ser Gly Tyr Ala Asn Gl - #y Arg Gly Glu Thr Thr     

     #    60                                                                   

     - Lys Tyr Glu Leu Ile Asn Gln Thr Gly His Al - #a Glu Thr Val His Val     

     #80                                                                       

     - Thr Tyr Asp Ala Lys Gln Ile Ser Leu Lys Gl - #u Ile Leu Leu His Tyr     

     #                95                                                       

     - Phe Arg Ile Ile Asn Pro Thr Ser Lys Asn Ly - #s Gln Gly Asn Asp Val     

     #           110                                                           

     - Gly Thr Gln Tyr Arg Thr Gly Val Tyr Tyr Th - #r Asp Asp Lys Asp Leu     

     #       125                                                               

     - Glu Val Ile Asn Gln Val Phe Asp Glu Val Al - #a Lys Lys Tyr Asp Gln     

     #   140                                                                   

     - Pro Leu Ala Val Glu Lys Glu Asn Leu Lys As - #n Phe Val Val Ala Glu     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Asp Tyr His Gln Asp Tyr Leu Lys Lys Asn Pr - #o Asn Gly Tyr Cys His     

     #               175                                                       

     - Ile Asn Val Asn Gln Ala Ala Tyr Pro Val Il - #e Asp Ala Ser Lys Tyr     

     #           190                                                           

     - Pro Lys Pro Ser Asp Glu Glu Leu Lys Lys Th - #r Leu Ser Pro Glu Glu     

     #       205                                                               

     - Tyr Ala Val Thr Gln Glu Asn Gln Thr Glu Ar - #g Ala Phe Ser Asn Arg     

     #   220                                                                   

     - Tyr Trp Asp Lys Phe Glu Ser Gly Ile Tyr Va - #l Asp Ile Ala Thr Gly     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Glu Pro Leu Phe Ser Ser Lys Asp Lys Phe Gl - #u Ser Gly Cys Gly Trp     

     #               255                                                       

     - Pro Ser Phe Thr Gln Pro Ile Ser Pro Asp Va - #l Val Thr Tyr Lys Glu     

     #           270                                                           

     - Asp Lys Ser Tyr Asn Met Thr Arg Met Glu Va - #l Arg Ser Arg Val Gly     

     #       285                                                               

     - Asp Ser His Leu Gly His Val Phe Thr Asp Gl - #y Pro Gln Asp Lys Gly     

     #   300                                                                   

     - Gly Leu Arg Tyr Cys Ile Asn Ser Leu Ser Il - #e Arg Phe Ile Pro Lys     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Asp Gln Met Glu Glu Lys Gly Thr Leu Ile Ty - #r                         

     #               330                                                       

     - (2) INFORMATION FOR SEQ ID NO: 213:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 625 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #213: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TTGTCAGTCA GGTTCTAATG GTTCTCAGTC TGCTGTGGAT GCTATCAAAC AA - #AAAGGGAA   

       60                                                                      

     - ATTAGTTGTG GCAACCAGTC CTGACTATGC ACCCTTTGAA TTTCAATCAT TG - #GTTGATGG   

      120                                                                      

     - AAAGAACCAG GTAGTCGGTG CAGACATCGA CATGGCTCAG GCTATCGCTG AT - #GAACTTGG   

      180                                                                      

     - GGTTAAGTTG GAAATCTCAA GCATGAGTTT TGACAATGTT TTGACCAGTC TT - #CAAACTGG   

      240                                                                      

     - TAAGGCTGAC CTAGCAGTTG CAGGAATTAG TGCTACTGAC GAGAGAAAAG AA - #GTCTTTGA   

      300                                                                      

     - TTTTTCAATC CCATACTATG AAAACAAGAT TAGTTTCTTG GTTCGTAAGG CT - #GATGTGGA   

      360                                                                      

     - AAAATACAAG GATTTAACTA GCCTAGAAAG TGCTAATATT GCAGCCCAAA AA - #GGGACTGT   

      420                                                                      

     - TCCAGAATCA ATGGTCAAGG AACAATTGCC AAAAGTTCAA TTAACTTCCC TA - #ACTAATAT   

      480                                                                      

     - GGGTGAAGCA GTCAATGAAT TGCAGGCTGG AAAAATAGAT GCTGTTCATA TG - #GATGAGCC   

      540                                                                      

     - TGTTGCACTT AGTTATGCTG CTAAAAACGC TGGCTTAGCT GTCGCAACTG TC - #AGCTTGAA   

      600                                                                      

     #              625 GCCA ATGCC                                             

     - (2) INFORMATION FOR SEQ ID NO:214:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 208 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:214:                           

     - Cys Gln Ser Gly Ser Asn Gly Ser Gln Ser Al - #a Val Asp Ala Ile Lys     

     #                15                                                       

     - Gln Lys Gly Lys Leu Val Val Ala Thr Ser Pr - #o Asp Tyr Ala Pro Phe     

     #            30                                                           

     - Glu Phe Gln Ser Leu Val Asp Gly Lys Asn Gl - #n Val Val Gly Ala Asp     

     #        45                                                               

     - Ile Asp Met Ala Gln Ala Ile Ala Asp Glu Le - #u Gly Val Lys Leu Glu     

     #    60                                                                   

     - Ile Ser Ser Met Ser Phe Asp Asn Val Leu Th - #r Ser Leu Gln Thr Gly     

     #80                                                                       

     - Lys Ala Asp Leu Ala Val Ala Gly Ile Ser Al - #a Thr Asp Glu Arg Lys     

     #                95                                                       

     - Glu Val Phe Asp Phe Ser Ile Pro Tyr Tyr Gl - #u Asn Lys Ile Ser Phe     

     #           110                                                           

     - Leu Val Arg Lys Ala Asp Val Glu Lys Tyr Ly - #s Asp Leu Thr Ser Leu     

     #       125                                                               

     - Glu Ser Ala Asn Ile Ala Ala Gln Lys Gly Th - #r Val Pro Glu Ser Met     

     #   140                                                                   

     - Val Lys Glu Gln Leu Pro Lys Val Gln Leu Th - #r Ser Leu Thr Asn Met     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Gly Glu Ala Val Asn Glu Leu Gln Ala Gly Ly - #s Ile Asp Ala Val His     

     #               175                                                       

     - Met Asp Glu Pro Val Ala Leu Ser Tyr Ala Al - #a Lys Asn Ala Gly Leu     

     #           190                                                           

     - Ala Val Ala Thr Val Ser Leu Lys Met Lys As - #p Gly Asp Ala Asn Ala     

     #       205                                                               

     - (2) INFORMATION FOR SEQ ID NO: 215:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 3022 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #215: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - GGAAACTTCA CAGGATTTTA AAGAGAAGAA AACAGCAGTC ATTAAGGAAA AA - #GAAGTTGT   

       60                                                                      

     - TAGTAAAAAT CCTGTGATAG ACAATAACAC TAGCAATGAA GAAGCAAAAA TC - #AAAGAAGA   

      120                                                                      

     - AAATTCCAAT AAATCCCAAG GAGATTATAC GGACTCATTT GTGAATAAAA AC - #ACAGAAAA   

      180                                                                      

     - TCCCAAAAAA GAAGATAAAG TTGTCTATAT TGCTGAATTT AAAGATAAAG AA - #TCTGGAGA   

      240                                                                      

     - AAAAGCAATC AAGGAACTAT CCAGTCTTAA GAATACAAAA GTTTTATATA CT - #TATGATAG   

      300                                                                      

     - AATTTTTAAC GGTAGTGCCA TAGAAACAAC TCCAGATAAC TTGGACAAAA TT - #AAACAAAT   

      360                                                                      

     - AGAAGGTATT TCATCGGTTG AAAGGGCACA AAAAGTCCAA CCCATGATGA AT - #CATGCCAG   

      420                                                                      

     - AAAGGAAATT GGAGTTGAGG AAGCTATTGA TTACCTAAAG TCTATCAATG CT - #CCGTTTGG   

      480                                                                      

     - GAAAAATTTT GATGGTAGAG GTATGGTCAT TTCAAATATC GATACTGGAA CA - #GATTATAG   

      540                                                                      

     - ACATAAGGCT ATGAGAATCG ATGATGATGC CAAAGCCTCA ATGAGATTTA AA - #AAAGAAGA   

      600                                                                      

     - CTTAAAAGGC ACTGATAAAA ATTATTGGTT GAGTGATAAA ATCCCTCATG CG - #TTCAATTA   

      660                                                                      

     - TTATAATGGT GGCAAAATCA CTGTAGAAAA ATATGATGAT GGAAGGGATT AT - #TTTGACCC   

      720                                                                      

     - ACATGGGATG CATATTGCAG GGATTCTTGC TGGAAATGAT ACTGAACAAG AC - #ATCAAAAA   

      780                                                                      

     - CTTTAACGGC ATAGATGGAA TTGCACCTAA TGCACAAATT TTCTCTTACA AA - #ATGTATTC   

      840                                                                      

     - TGACGCAGGA TCTGGGTTTG CGGGTGATGA AACAATGTTT CATGCTATTG AA - #GATTCTAT   

      900                                                                      

     - CAAACACAAC GTTGATGTTG TTTCGGTATC ATCTGGTTTT ACAGGAACAG GT - #CTTGTAGG   

      960                                                                      

     - TGAGAAATAT TGGCAAGCTA TTCGGGCATT AAGAAAAGCA GGCATTCCAA TG - #GTTGTCGC   

     1020                                                                      

     - TACGGGTAAC TATGCGACTT CTGCTTCAAG TTCTTCATGG GATTTAGTAG CA - #AATAATCA   

     1080                                                                      

     - TCTGAAAATG ACCGACACTG GAAATGTAAC ACGAACTGCA GCACATGAAG AT - #GCGATAGC   

     1140                                                                      

     - GGTCGCTTCT GCTAAAAATC AAACAGTTGA GTTTGATAAA GTTAACATAG GT - #GGAGAAAG   

     1200                                                                      

     - TTTTAAATAC AGAAATATAG GGGCCTTTTT CGATAAGAGT AAAATCACAA CA - #AATGAAGA   

     1260                                                                      

     - TGGAACAAAA GCTCCTAGTA AATTAAAATT TGTATATATA GGCAAGGGGC AA - #GACCAAGA   

     1320                                                                      

     - TTTGATAGGT TTGGATCTTA GGGGCAAAAT TGCAGTAATG GATAGAATTT AT - #ACAAAGGA   

     1380                                                                      

     - TTTAAAAAAT GCTTTTAAAA AAGCTATGGA TAAGGGTGCA CGCGCCATTA TG - #GTTGTAAA   

     1440                                                                      

     - TACTGTAAAT TACTACAATA GAGATAATTG GACAGAGCTT CCAGCTATGG GA - #TATGAAGC   

     1500                                                                      

     - GGATGAAGGT ACTAAAAGTC AAGTGTTTTC AATTTCAGGA GATGATGGTG TA - #AAGCTATG   

     1560                                                                      

     - GAACATGATT AATCCTGATA AAAAAACTGA AGTCAAAAGA AATAATAAAG AA - #GATTTTAA   

     1620                                                                      

     - AGATAAATTG GAGCAATACT ATCCAATTGA TATGGAAAGT TTTAATTCCA AC - #AAACCGAA   

     1680                                                                      

     - TGTAGGTGAC GAAAAAGAGA TTGACTTTAA GTTTGCACCT GACACAGACA AA - #GAACTCTA   

     1740                                                                      

     - TAAAGAAGAT ATCATCGTTC CAGCAGGATC TACATCTTGG GGGCCAAGAA TA - #GATTTACT   

     1800                                                                      

     - TTTAAAACCC GATGTTTCAG CACCTGGTAA AAATATTAAA TCCACGCTTA AT - #GTTATTAA   

     1860                                                                      

     - TGGCAAATCA ACTTATGGCT ATATGTCAGG AACTAGTATG GCGACTCCAA TC - #GTGGCAGC   

     1920                                                                      

     - TTCTACTGTT TTGATTAGAC CGAAATTAAA GGAAATGCTT GAAAGACCTG TA - #TTGAAAAA   

     1980                                                                      

     - TCTTAAGGGA GATGACAAAA TAGATCTTAC AAGTCTTACA AAAATTGCCC TA - #CAAAATAC   

     2040                                                                      

     - TGCGCGACCT ATGATGGATG CAACTTCTTG GAAAGAAAAA AGTCAATACT TT - #GCATCACC   

     2100                                                                      

     - TAGACAACAG GGAGCAGGCC TAATTAATGT GGCCAATGCT TTGAGAAATG AA - #GTTGTAGC   

     2160                                                                      

     - AACTTTCAAA AACACTGATT CTAAAGGTTT GGTAAACTCA TATGGTTCCA TT - #TCTCTTAA   

     2220                                                                      

     - AGAAATAAAA GGTGATAAAA AATACTTTAC AATCAAGCTT CACAATACAT CA - #AACAGACC   

     2280                                                                      

     - TTTGACTTTT AAAGTTTCAG CATCAGCGAT AACTACAGAT TCTCTAACTG AC - #AGATTAAA   

     2340                                                                      

     - ACTTGATGAA ACATATAAAG ATGAAAAATC TCCAGATGGT AAGCAAATTG TT - #CCAGAAAT   

     2400                                                                      

     - TCACCCAGAA AAAGTCAAAG GAGCAAATAT CACATTTGAG CATGATACTT TC - #ACTATAGG   

     2460                                                                      

     - CGCAAATTCT AGCTTTGATT TGAATGCGGT TATAAATGTT GGAGAGGCCA AA - #AACAAAAA   

     2520                                                                      

     - TAAATTTGTA GAATCATTTA TTCATTTTGA GTCAGTGGAA GCGATGGAAG CT - #CTAAACTC   

     2580                                                                      

     - CAGCGGGAAG AAAATAAACT TCCAACCTTC TTTGTCGATG CCTCTAATGG GA - #TTTGCTGG   

     2640                                                                      

     - GAATTGGAAC CACGAACCAA TCCTTGATAA ATGGGCTTGG GAAGAAGGGT CA - #AGATCAAA   

     2700                                                                      

     - AACACTGGGA GGTTATGATG ATGATGGTAA ACCGAAAATT CCAGGAACCT TA - #AATAAGGG   

     2760                                                                      

     - AATTGGTGGA GAACATGGTA TAGATAAATT TAATCCAGCA GGAGTTATAC AA - #AATAGAAA   

     2820                                                                      

     - AGATAAAAAT ACAACATCCC TGGATCAAAA TCCAGAATTA TTTGCTTTCA AT - #AACGAAGG   

     2880                                                                      

     - GATCAACGCT CCATCATCAA GTGGTTCTAA GATTGCTAAC ATTTATCCTT TA - #GATTCAAA   

     2940                                                                      

     - TGGAAATCCT CAAGATGCTC AACTTGAAAG AGGATTAACA CCTTCTCCAC TT - #GTATTAAG   

     3000                                                                      

     #               3022TGA TT                                                

     - (2) INFORMATION FOR SEQ ID NO:216:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 1007 amino                                          

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:216:                           

     - Glu Thr Ser Gln Asp Phe Lys Glu Lys Lys Th - #r Ala Val Ile Lys Glu     

     #                15                                                       

     - Lys Glu Val Val Ser Lys Asn Pro Val Ile As - #p Asn Asn Thr Ser Asn     

     #            30                                                           

     - Glu Glu Ala Lys Ile Lys Glu Glu Asn Ser As - #n Lys Ser Gln Gly Asp     

     #        45                                                               

     - Tyr Thr Asp Ser Phe Val Asn Lys Asn Thr Gl - #u Asn Pro Lys Lys Glu     

     #    60                                                                   

     - Asp Lys Val Val Tyr Ile Ala Glu Phe Lys As - #p Lys Glu Ser Gly Glu     

     #80                                                                       

     - Lys Ala Ile Lys Glu Leu Ser Ser Leu Lys As - #n Thr Lys Val Leu Tyr     

     #                95                                                       

     - Thr Tyr Asp Arg Ile Phe Asn Gly Ser Ala Il - #e Glu Thr Thr Pro Asp     

     #           110                                                           

     - Asn Leu Asp Lys Ile Lys Gln Ile Glu Gly Il - #e Ser Ser Val Glu Arg     

     #       125                                                               

     - Ala Gln Lys Val Gln Pro Met Met Asn His Al - #a Arg Lys Glu Ile Gly     

     #   140                                                                   

     - Val Glu Glu Ala Ile Asp Tyr Leu Lys Ser Il - #e Asn Ala Pro Phe Gly     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Lys Asn Phe Asp Gly Arg Gly Met Val Ile Se - #r Asn Ile Asp Thr Gly     

     #               175                                                       

     - Thr Asp Tyr Arg His Lys Ala Met Arg Ile As - #p Asp Asp Ala Lys Ala     

     #           190                                                           

     - Ser Met Arg Phe Lys Lys Glu Asp Leu Lys Gl - #y Thr Asp Lys Asn Tyr     

     #       205                                                               

     - Trp Leu Ser Asp Lys Ile Pro His Ala Phe As - #n Tyr Tyr Asn Gly Gly     

     #   220                                                                   

     - Lys Ile Thr Val Glu Lys Tyr Asp Asp Gly Ar - #g Asp Tyr Phe Asp Pro     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - His Gly Met His Ile Ala Gly Ile Leu Ala Gl - #y Asn Asp Thr Glu Gln     

     #               255                                                       

     - Asp Ile Lys Asn Phe Asn Gly Ile Asp Gly Il - #e Ala Pro Asn Ala Gln     

     #           270                                                           

     - Ile Phe Ser Tyr Lys Met Tyr Ser Asp Ala Gl - #y Ser Gly Phe Ala Gly     

     #       285                                                               

     - Asp Glu Thr Met Phe His Ala Ile Glu Asp Se - #r Ile Lys His Asn Val     

     #   300                                                                   

     - Asp Val Val Ser Val Ser Ser Gly Phe Thr Gl - #y Thr Gly Leu Val Gly     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Glu Lys Tyr Trp Gln Ala Ile Arg Ala Leu Ar - #g Lys Ala Gly Ile Pro     

     #               335                                                       

     - Met Val Val Ala Thr Gly Asn Tyr Ala Thr Se - #r Ala Ser Ser Ser Ser     

     #           350                                                           

     - Trp Asp Leu Val Ala Asn Asn His Leu Lys Me - #t Thr Asp Thr Gly Asn     

     #       365                                                               

     - Val Thr Arg Thr Ala Ala His Glu Asp Ala Il - #e Ala Val Ala Ser Ala     

     #   380                                                                   

     - Lys Asn Gln Thr Val Glu Phe Asp Lys Val As - #n Ile Gly Gly Glu Ser     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Phe Lys Tyr Arg Asn Ile Gly Ala Phe Phe As - #p Lys Ser Lys Ile Thr     

     #               415                                                       

     - Thr Asn Glu Asp Gly Thr Lys Ala Pro Ser Ly - #s Leu Lys Phe Val Tyr     

     #           430                                                           

     - Ile Gly Lys Gly Gln Asp Gln Asp Leu Ile Gl - #y Leu Asp Leu Arg Gly     

     #       445                                                               

     - Lys Ile Ala Val Met Asp Arg Ile Tyr Thr Ly - #s Asp Leu Lys Asn Ala     

     #   460                                                                   

     - Phe Lys Lys Ala Met Asp Lys Gly Ala Arg Al - #a Ile Met Val Val Asn     

     465                 4 - #70                 4 - #75                 4 -   

     #80                                                                       

     - Thr Val Asn Tyr Tyr Asn Arg Asp Asn Trp Th - #r Glu Leu Pro Ala Met     

     #               495                                                       

     - Gly Tyr Glu Ala Asp Glu Gly Thr Lys Ser Gl - #n Val Phe Ser Ile Ser     

     #           510                                                           

     - Gly Asp Asp Gly Val Lys Leu Trp Asn Met Il - #e Asn Pro Asp Lys Lys     

     #       525                                                               

     - Thr Glu Val Lys Arg Asn Asn Lys Glu Asp Ph - #e Lys Asp Lys Leu Glu     

     #   540                                                                   

     - Gln Tyr Tyr Pro Ile Asp Met Glu Ser Phe As - #n Ser Asn Lys Pro Asn     

     545                 5 - #50                 5 - #55                 5 -   

     #60                                                                       

     - Val Gly Asp Glu Lys Glu Ile Asp Phe Lys Ph - #e Ala Pro Asp Thr Asp     

     #               575                                                       

     - Lys Glu Leu Tyr Lys Glu Asp Ile Ile Val Pr - #o Ala Gly Ser Thr Ser     

     #           590                                                           

     - Trp Gly Pro Arg Ile Asp Leu Leu Leu Lys Pr - #o Asp Val Ser Ala Pro     

     #       605                                                               

     - Gly Lys Asn Ile Lys Ser Thr Leu Asn Val Il - #e Asn Gly Lys Ser Thr     

     #   620                                                                   

     - Tyr Gly Tyr Met Ser Gly Thr Ser Met Ala Th - #r Pro Ile Val Ala Ala     

     625                 6 - #30                 6 - #35                 6 -   

     #40                                                                       

     - Ser Thr Val Leu Ile Arg Pro Lys Leu Lys Gl - #u Met Leu Glu Arg Pro     

     #               655                                                       

     - Val Leu Lys Asn Leu Lys Gly Asp Asp Lys Il - #e Asp Leu Thr Ser Leu     

     #           670                                                           

     - Thr Lys Ile Ala Leu Gln Asn Thr Ala Arg Pr - #o Met Met Asp Ala Thr     

     #       685                                                               

     - Ser Trp Lys Glu Lys Ser Gln Tyr Phe Ala Se - #r Pro Arg Gln Gln Gly     

     #   700                                                                   

     - Ala Gly Leu Ile Asn Val Ala Asn Ala Leu Ar - #g Asn Glu Val Val Ala     

     705                 7 - #10                 7 - #15                 7 -   

     #20                                                                       

     - Thr Phe Lys Asn Thr Asp Ser Lys Gly Leu Va - #l Asn Ser Tyr Gly Ser     

     #               735                                                       

     - Ile Ser Leu Lys Glu Ile Lys Gly Asp Lys Ly - #s Tyr Phe Thr Ile Lys     

     #           750                                                           

     - Leu His Asn Thr Ser Asn Arg Pro Leu Thr Ph - #e Lys Val Ser Ala Ser     

     #       765                                                               

     - Ala Ile Thr Thr Asp Ser Leu Thr Asp Arg Le - #u Lys Leu Asp Glu Thr     

     #   780                                                                   

     - Tyr Lys Asp Glu Lys Ser Pro Asp Gly Lys Gl - #n Ile Val Pro Glu Ile     

     785                 7 - #90                 7 - #95                 8 -   

     #00                                                                       

     - His Pro Glu Lys Val Lys Gly Ala Asn Ile Th - #r Phe Glu His Asp Thr     

     #               815                                                       

     - Phe Thr Ile Gly Ala Asn Ser Ser Phe Asp Le - #u Asn Ala Val Ile Asn     

     #           830                                                           

     - Val Gly Glu Ala Lys Asn Lys Asn Lys Phe Va - #l Glu Ser Phe Ile His     

     #       845                                                               

     - Phe Glu Ser Val Glu Ala Met Glu Ala Leu As - #n Ser Ser Gly Lys Lys     

     #   860                                                                   

     - Ile Asn Phe Gln Pro Ser Leu Ser Met Pro Le - #u Met Gly Phe Ala Gly     

     865                 8 - #70                 8 - #75                 8 -   

     #80                                                                       

     - Asn Trp Asn His Glu Pro Ile Leu Asp Lys Tr - #p Ala Trp Glu Glu Gly     

     #               895                                                       

     - Ser Arg Ser Lys Thr Leu Gly Gly Tyr Asp As - #p Asp Gly Lys Pro Lys     

     #           910                                                           

     - Ile Pro Gly Thr Leu Asn Lys Gly Ile Gly Gl - #y Glu His Gly Ile Asp     

     #       925                                                               

     - Lys Phe Asn Pro Ala Gly Val Ile Gln Asn Ar - #g Lys Asp Lys Asn Thr     

     #   940                                                                   

     - Thr Ser Leu Asp Gln Asn Pro Glu Leu Phe Al - #a Phe Asn Asn Glu Gly     

     945                 9 - #50                 9 - #55                 9 -   

     #60                                                                       

     - Ile Asn Ala Pro Ser Ser Ser Gly Ser Lys Il - #e Ala Asn Ile Tyr Pro     

     #               975                                                       

     - Leu Asp Ser Asn Gly Asn Pro Gln Asp Ala Gl - #n Leu Glu Arg Gly Leu     

     #           990                                                           

     - Thr Pro Ser Pro Leu Val Leu Arg Ser Ala Gl - #u Glu Gly Leu Ile         

     #      10050                                                              

     - (2) INFORMATION FOR SEQ ID NO: 217:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1696 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #217: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TGTGGTCGAA GTTGAGACTC CTCAATCAAT AACAAATCAG GAGCAAGCTA GG - #ACAGAAAA   

       60                                                                      

     - CCAAGTAGTA GAGACAGAGG AAGCTCCAAA AGAAGAAGCA CCTAAAACAG AA - #GAAAGTCC   

      120                                                                      

     - AAAGGAAGAA CCAAAATCGG AGGTAAAACC TACTGACGAC ACCCTTCCTA AA - #GTAGAAGA   

      180                                                                      

     - GGGGAAAGAA GATTCAGCAG AACCAGCTCC AGTTGAAGAA GTAGGTGGAG AA - #GTTGAGTC   

      240                                                                      

     - AAAACCAGAG GAAAAAGTAG CAGTTAAGCC AGAAAGTCAA CCATCAGACA AA - #CCAGCTGA   

      300                                                                      

     - GGAATCAAAA GTTGAACAAG CAGGTGAACC AGTCGCGCCA AGAGAAGACG AA - #AAGGCACC   

      360                                                                      

     - AGTCGAGCCA GAAAAGCAAC CAGAAGCTCC TGAAGAAGAG AAGGCTGTAG AG - #GAAACACC   

      420                                                                      

     - GAAACAAGAA GAGTCAACTC CAGATACCAA GGCTGAAGAA ACTGTAGAAC CA - #AAAGAGGA   

      480                                                                      

     - GACTGTTAAT CAATCTATTG AACAACCAAA AGTTGAAACG CCTGCTGTAG AA - #AAACAAAC   

      540                                                                      

     - AGAACCAACA GAGGAACCAA AAGTTGAACA AGCAGGTGAA CCAGTCGCGC CA - #AGAGAAGA   

      600                                                                      

     - CGAACAGGCA CCAACGGCAC CAGTTGAGCC AGAAAAGCAA CCAGAAGTTC CT - #GAAGAAGA   

      660                                                                      

     - GAAGGCTGTA GAGGAAACAC CGAAACCAGA AGATAAAATA AAGGGTATTG GT - #ACTAAAGA   

      720                                                                      

     - ACCAGTTGAT AAAAGTGAGT TAAATAATCA AATTGATAAA GCTAGTTCAG TT - #TCTCCTAC   

      780                                                                      

     - TGATTATTCT ACAGCAAGTT ACAATGCTCT TGGACCTGTT TTAGAAACTG CA - #AAAGGTGT   

      840                                                                      

     - CTATGCTTCA GAGCCTGTAA AACAGCCTGA GGTAAATAGC GAGACAAATA AA - #CTTAAAAC   

      900                                                                      

     - GGCTATTGAC GCTCTAAACG TTGATAAAAC TGAATTAAAC AATACGATTG CA - #GATGCAAA   

      960                                                                      

     - AACAAAGGTA AAAGAACATT ACAGTGATAG AAGTTGGCAA AACCTCCAAA CT - #GAAGTTAC   

     1020                                                                      

     - AAAGGCTGAA AAAGTTGCAG CTAATACAGA TGCTAAACAA AGTGAAGTTA AC - #GAAGCTGT   

     1080                                                                      

     - TGAAAAATTA ACTGCAACTA TTGAAAAATT GGTTGAATTA TCTGAAAAGC CA - #ATATTAAC   

     1140                                                                      

     - ATTGACTAGT ACCGATAAGA AAATATTGGA ACGTGAAGCT GTTGCTAAGT AT - #ACTCTAGA   

     1200                                                                      

     - AAATCAAAAC AAAACAAAAA TCAAATCAAT CACAGCTGAA TTGAAAAAAG GA - #GAAGAAGT   

     1260                                                                      

     - TATTAATACT GTAGTCCTTA CAGATGACAA GGTAACAACA GAAACTATAA GC - #GCTGCATT   

     1320                                                                      

     - TAAGAACCTA GAGTACTACA AAGAATACAC CCTATCTACA ACTATGATTT AC - #GACAGAGG   

     1380                                                                      

     - TAACGGTGAA GAAACTGAAA CTCTAGAAAA TCAAAATATT CAATTAGATC TT - #AAAAAAGT   

     1440                                                                      

     - TGAGCTTAAA AATATTAAAC GTACAGATTT AATCAAATAC GAAAATGGAA AA - #GAAACTAA   

     1500                                                                      

     - TGAATCACTG ATAACAACTA TTCCTGATGA TAAGAGCAAT TATTATTTAA AA - #ATAACTTC   

     1560                                                                      

     - AAATAATCAG AAAACTACAT TACTAGCTGT TAAAAATATA GAAGAAACTA CG - #GTTAACGG   

     1620                                                                      

     - AACACCTGTA TATAAAGTTA CAGCAATCGC AGACAATTTA GTCTCTAGAA CT - #GCTGATAA   

     1680                                                                      

     #  1696                                                                   

     - (2) INFORMATION FOR SEQ ID NO:218:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 565 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:218:                           

     - Val Val Glu Val Glu Thr Pro Gln Ser Ile Th - #r Asn Gln Glu Gln Ala     

     #                15                                                       

     - Arg Thr Glu Asn Gln Val Val Glu Thr Glu Gl - #u Ala Pro Lys Glu Glu     

     #            30                                                           

     - Ala Pro Lys Thr Glu Glu Ser Pro Lys Glu Gl - #u Pro Lys Ser Glu Val     

     #        45                                                               

     - Lys Pro Thr Asp Asp Thr Leu Pro Lys Val Gl - #u Glu Gly Lys Glu Asp     

     #    60                                                                   

     - Ser Ala Glu Pro Ala Pro Val Glu Glu Val Gl - #y Gly Glu Val Glu Ser     

     #80                                                                       

     - Lys Pro Glu Glu Lys Val Ala Val Lys Pro Gl - #u Ser Gln Pro Ser Asp     

     #                95                                                       

     - Lys Pro Ala Glu Glu Ser Lys Val Glu Gln Al - #a Gly Glu Pro Val Ala     

     #           110                                                           

     - Pro Arg Glu Asp Glu Lys Ala Pro Val Glu Pr - #o Glu Lys Gln Pro Glu     

     #       125                                                               

     - Ala Pro Glu Glu Glu Lys Ala Val Glu Glu Th - #r Pro Lys Gln Glu Glu     

     #   140                                                                   

     - Ser Thr Pro Asp Thr Lys Ala Glu Glu Thr Va - #l Glu Pro Lys Glu Glu     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Thr Val Asn Gln Ser Ile Glu Gln Pro Lys Va - #l Glu Thr Pro Ala Val     

     #               175                                                       

     - Glu Lys Gln Thr Glu Pro Thr Glu Glu Pro Ly - #s Val Glu Gln Ala Gly     

     #           190                                                           

     - Glu Pro Val Ala Pro Arg Glu Asp Glu Gln Al - #a Pro Thr Ala Pro Val     

     #       205                                                               

     - Glu Pro Glu Lys Gln Pro Glu Val Pro Glu Gl - #u Glu Lys Ala Val Glu     

     #   220                                                                   

     - Glu Thr Pro Lys Pro Glu Asp Lys Ile Lys Gl - #y Ile Gly Thr Lys Glu     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Pro Val Asp Lys Ser Glu Leu Asn Asn Gln Il - #e Asp Lys Ala Ser Ser     

     #               255                                                       

     - Val Ser Pro Thr Asp Tyr Ser Thr Ala Ser Ty - #r Asn Ala Leu Gly Pro     

     #           270                                                           

     - Val Leu Glu Thr Ala Lys Gly Val Tyr Ala Se - #r Glu Pro Val Lys Gln     

     #       285                                                               

     - Pro Glu Val Asn Ser Glu Thr Asn Lys Leu Ly - #s Thr Ala Ile Asp Ala     

     #   300                                                                   

     - Leu Asn Val Asp Lys Thr Glu Leu Asn Asn Th - #r Ile Ala Asp Ala Lys     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Thr Lys Val Lys Glu His Tyr Ser Asp Arg Se - #r Trp Gln Asn Leu Gln     

     #               335                                                       

     - Thr Glu Val Thr Lys Ala Glu Lys Val Ala Al - #a Asn Thr Asp Ala Lys     

     #           350                                                           

     - Gln Ser Glu Val Asn Glu Ala Val Glu Lys Le - #u Thr Ala Thr Ile Glu     

     #       365                                                               

     - Lys Leu Val Glu Leu Ser Glu Lys Pro Ile Le - #u Thr Leu Thr Ser Thr     

     #   380                                                                   

     - Asp Lys Lys Ile Leu Glu Arg Glu Ala Val Al - #a Lys Tyr Thr Leu Glu     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Asn Gln Asn Lys Thr Lys Ile Lys Ser Ile Th - #r Ala Glu Leu Lys Lys     

     #               415                                                       

     - Gly Glu Glu Val Ile Asn Thr Val Val Leu Th - #r Asp Asp Lys Val Thr     

     #           430                                                           

     - Thr Glu Thr Ile Ser Ala Ala Phe Lys Asn Le - #u Glu Tyr Tyr Lys Glu     

     #       445                                                               

     - Tyr Thr Leu Ser Thr Thr Met Ile Tyr Asp Ar - #g Gly Asn Gly Glu Glu     

     #   460                                                                   

     - Thr Glu Thr Leu Glu Asn Gln Asn Ile Gln Le - #u Asp Leu Lys Lys Val     

     465                 4 - #70                 4 - #75                 4 -   

     #80                                                                       

     - Glu Leu Lys Asn Ile Lys Arg Thr Asp Leu Il - #e Lys Tyr Glu Asn Gly     

     #               495                                                       

     - Lys Glu Thr Asn Glu Ser Leu Ile Thr Thr Il - #e Pro Asp Asp Lys Ser     

     #           510                                                           

     - Asn Tyr Tyr Leu Lys Ile Thr Ser Asn Asn Gl - #n Lys Thr Thr Leu Leu     

     #       525                                                               

     - Ala Val Lys Asn Ile Glu Glu Thr Thr Val As - #n Gly Thr Pro Val Tyr     

     #   540                                                                   

     - Lys Val Thr Ala Ile Ala Asp Asn Leu Val Se - #r Arg Thr Ala Asp Asn     

     545                 5 - #50                 5 - #55                 5 -   

     #60                                                                       

     - Lys Phe Glu Glu Glu                                                     

                     565                                                       

     - (2) INFORMATION FOR SEQ ID NO: 219:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1879 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #219: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - AACACCTGTA TATAAAGTTA CAGCAATCGC AGACAATTTA GTCTCTAGAA CT - #GCTGATAA   

       60                                                                      

     - TAAATTTGAA GAAGAATACG TTCACTATAT TGAAAAACCT AAAGTCCACG AA - #GATAATGT   

      120                                                                      

     - ATATTATAAT TTCAAAGAAT TAGTGGAAGC TATTCAAAAC GATCCTTCAA AA - #GAATATCG   

      180                                                                      

     - TCTGGGACAA TCAATGAGCG CTAGAAATGT TGTTCCTAAT GGAAAATCAT AT - #ATCACTAA   

      240                                                                      

     - AGAATTCACA GGAAAACTTT TAAGTTCTGA AGGAAAACAA TTTGCTATTA CT - #GAATTGGA   

      300                                                                      

     - ACATCCATTA TTTAATGTGA TAACAAACGC AACGATAAAT AATGTGAATT TT - #GAAAATGT   

      360                                                                      

     - AGAGATAGAA CGTTCTGGTC AAGATAATAT TGCATCATTA GCCAATACTA TG - #AAAGGTTC   

      420                                                                      

     - TTCAGTTATT ACAAATGTCA AAATTACAGG CACACTTTCA GGTCGTAATA AT - #GTTGCTGG   

      480                                                                      

     - ATTTGTAAAT AATATGAATG ATGGAACTCG TATTGAAAAT GTTGCTTTCT TT - #GGCAAACT   

      540                                                                      

     - ACACTCTACA AGTGGAAATG GCTCTCATAC AGGGGGAATT GCAGGTACAA AC - #TATAGAGG   

      600                                                                      

     - AATTGTTAGA AAAGCATATG TTGATGCTAC TATTACAGGA AACAAAACAC GC - #GCCAGCTT   

      660                                                                      

     - GTTAGTTCCT AAAGTAGATT ATGGATTAAC TCTAGACCAT CTTATTGGTA CA - #AAAGCTCT   

      720                                                                      

     - CCTAACTGAG TCGGTTGTAA AAGGTAAAAT AGATGTTTCA AATCCAGTAG AA - #GTTGGAGC   

      780                                                                      

     - AATAGCAAGT AAGACTTGGC CTGTAGGTAC GGTAAGTAAT TCTGTCAGCT AT - #GCTAAGAT   

      840                                                                      

     - TATCCGTGGA GAGGAGTTAT TCGGCTCTAA CGACGTTGAT GATTCTGATT AT - #GCTAGTGC   

      900                                                                      

     - TCATATAAAA GATTTATATG CGGTAGAGGG ATATTCGTCA GGTAATAGAT CA - #TTTAGGAA   

      960                                                                      

     - ATCTAAAACA TTTACTAAAT TAACTAAAGA ACAAGCTGAT GCTAAAGTTA CT - #ACTTTCAA   

     1020                                                                      

     - TATTACTGCT GATAAATTAG AAAGTGATCT ATCTCCTCTT GCAAAACTTA AT - #GAAGAAAA   

     1080                                                                      

     - AGCCTATTCT AGTATTCAAG ATTATAACGC TGAATATAAC CAAGCCTATA AA - #AATCTTGA   

     1140                                                                      

     - AAAATTAATA CCATTCTACA ATAAAGATTA TATTGTATAT CAAGGTAATA AA - #TTAAATAA   

     1200                                                                      

     - AGAACACCAT CTAAATACTA AAGAAGTTCT TTCTGTTACC GCGATGAACA AC - #AATGAGTT   

     1260                                                                      

     - TATCACAAAC CTAGATGAAG CTAATAAAAT TATTGTTCAC TATGCGGACG GT - #ACAAAAGA   

     1320                                                                      

     - TTACTTTAAC TTGTCTTCTA GCAGTGAAGG TTTAAGTAAT GTAAAAGAAT AT - #ACTATAAC   

     1380                                                                      

     - TGACTTAGGA ATTAAATATA CACCTAATAT CGTTCAAAAA GATAACACTA CT - #CTTGTTAA   

     1440                                                                      

     - TGATATAAAA TCTATTTTAG AATCAGTAGA GCTTCAGTCT CAAACGATGT AT - #CAGCATCT   

     1500                                                                      

     - AAATCGATTA GGTGACTATA GAGTTAATGC AATCAAAGAT TTATATTTAG AA - #GAAAGCTT   

     1560                                                                      

     - CACAGATGTT AAAGAAAACT TAACAAACCT AATCACAAAA TTAGTTCAAA AC - #GAAGAACA   

     1620                                                                      

     - TCAACTAAAT GATTCTCCAG CTGCTCGTCA AATGATTCGT GATAAAGTCG AG - #AAAAACAA   

     1680                                                                      

     - AGCAGCTTTA TTACTAGGTT TAACTTACCT AAATCGTTAC TATGGAGTTA AA - #TTTGGTGA   

     1740                                                                      

     - TGTTAATATT AAAGAATTAA TGCTATTCAA ACCAGATTTC TATGGTGAAA AA - #GTTAGCGT   

     1800                                                                      

     - ATTAGACAGA TTAATTGAAA TCGGTTCTAA AGAGAACAAC ATTAAAGGTT CA - #CGTACATT   

     1860                                                                      

     #                 187 - #9                                                

     - (2) INFORMATION FOR SEQ ID NO:220:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 626 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:220:                           

     - Thr Pro Val Tyr Lys Val Thr Ala Ile Ala As - #p Asn Leu Val Ser Arg     

     #                15                                                       

     - Thr Ala Asp Asn Lys Phe Glu Glu Glu Tyr Va - #l His Tyr Ile Glu Lys     

     #            30                                                           

     - Pro Lys Val His Glu Asp Asn Val Tyr Tyr As - #n Phe Lys Glu Leu Val     

     #        45                                                               

     - Glu Ala Ile Gln Asn Asp Pro Ser Lys Glu Ty - #r Arg Leu Gly Gln Ser     

     #    60                                                                   

     - Met Ser Ala Arg Asn Val Val Pro Asn Gly Ly - #s Ser Tyr Ile Thr Lys     

     #80                                                                       

     - Glu Phe Thr Gly Lys Leu Leu Ser Ser Glu Gl - #y Lys Gln Phe Ala Ile     

     #                95                                                       

     - Thr Glu Leu Glu His Pro Leu Phe Asn Val Il - #e Thr Asn Ala Thr Ile     

     #           110                                                           

     - Asn Asn Val Asn Phe Glu Asn Val Glu Ile Gl - #u Arg Ser Gly Gln Asp     

     #       125                                                               

     - Asn Ile Ala Ser Leu Ala Asn Thr Met Lys Gl - #y Ser Ser Val Ile Thr     

     #   140                                                                   

     - Asn Val Lys Ile Thr Gly Thr Leu Ser Gly Ar - #g Asn Asn Val Ala Gly     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Phe Val Asn Asn Met Asn Asp Gly Thr Arg Il - #e Glu Asn Val Ala Phe     

     #               175                                                       

     - Phe Gly Lys Leu His Ser Thr Ser Gly Asn Gl - #y Ser His Thr Gly Gly     

     #           190                                                           

     - Ile Ala Gly Thr Asn Tyr Arg Gly Ile Val Ar - #g Lys Ala Tyr Val Asp     

     #       205                                                               

     - Ala Thr Ile Thr Gly Asn Lys Thr Arg Ala Se - #r Leu Leu Val Pro Lys     

     #   220                                                                   

     - Val Asp Tyr Gly Leu Thr Leu Asp His Leu Il - #e Gly Thr Lys Ala Leu     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Leu Thr Glu Ser Val Val Lys Gly Lys Ile As - #p Val Ser Asn Pro Val     

     #               255                                                       

     - Glu Val Gly Ala Ile Ala Ser Lys Thr Trp Pr - #o Val Gly Thr Val Ser     

     #           270                                                           

     - Asn Ser Val Ser Tyr Ala Lys Ile Ile Arg Gl - #y Glu Glu Leu Phe Gly     

     #       285                                                               

     - Ser Asn Asp Val Asp Asp Ser Asp Tyr Ala Se - #r Ala His Ile Lys Asp     

     #   300                                                                   

     - Leu Tyr Ala Val Glu Gly Tyr Ser Ser Gly As - #n Arg Ser Phe Arg Lys     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Ser Lys Thr Phe Thr Lys Leu Thr Lys Glu Gl - #n Ala Asp Ala Lys Val     

     #               335                                                       

     - Thr Thr Phe Asn Ile Thr Ala Asp Lys Leu Gl - #u Ser Asp Leu Ser Pro     

     #           350                                                           

     - Leu Ala Lys Leu Asn Glu Glu Lys Ala Tyr Se - #r Ser Ile Gln Asp Tyr     

     #       365                                                               

     - Asn Ala Glu Tyr Asn Gln Ala Tyr Lys Asn Le - #u Glu Lys Leu Ile Pro     

     #   380                                                                   

     - Phe Tyr Asn Lys Asp Tyr Ile Val Tyr Gln Gl - #y Asn Lys Leu Asn Lys     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Glu His His Leu Asn Thr Lys Glu Val Leu Se - #r Val Thr Ala Met Asn     

     #               415                                                       

     - Asn Asn Glu Phe Ile Thr Asn Leu Asp Glu Al - #a Asn Lys Ile Ile Val     

     #           430                                                           

     - His Tyr Ala Asp Gly Thr Lys Asp Tyr Phe As - #n Leu Ser Ser Ser Ser     

     #       445                                                               

     - Glu Gly Leu Ser Asn Val Lys Glu Tyr Thr Il - #e Thr Asp Leu Gly Ile     

     #   460                                                                   

     - Lys Tyr Thr Pro Asn Ile Val Gln Lys Asp As - #n Thr Thr Leu Val Asn     

     465                 4 - #70                 4 - #75                 4 -   

     #80                                                                       

     - Asp Ile Lys Ser Ile Leu Glu Ser Val Glu Le - #u Gln Ser Gln Thr Met     

     #               495                                                       

     - Tyr Gln His Leu Asn Arg Leu Gly Asp Tyr Ar - #g Val Asn Ala Ile Lys     

     #           510                                                           

     - Asp Leu Tyr Leu Glu Glu Ser Phe Thr Asp Va - #l Lys Glu Asn Leu Thr     

     #       525                                                               

     - Asn Leu Ile Thr Lys Leu Val Gln Asn Glu Gl - #u His Gln Leu Asn Asp     

     #   540                                                                   

     - Ser Pro Ala Ala Arg Gln Met Ile Arg Asp Ly - #s Val Glu Lys Asn Lys     

     545                 5 - #50                 5 - #55                 5 -   

     #60                                                                       

     - Ala Ala Leu Leu Leu Gly Leu Thr Tyr Leu As - #n Arg Tyr Tyr Gly Val     

     #               575                                                       

     - Lys Phe Gly Asp Val Asn Ile Lys Glu Leu Me - #t Leu Phe Lys Pro Asp     

     #           590                                                           

     - Phe Tyr Gly Glu Lys Val Ser Val Leu Asp Ar - #g Leu Ile Glu Ile Gly     

     #       605                                                               

     - Ser Lys Glu Asn Asn Ile Lys Gly Ser Arg Th - #r Phe Asp Ala Phe Gly     

     #   620                                                                   

     - Gln Val                                                                 

     625                                                                       

     - (2) INFORMATION FOR SEQ ID NO: 221:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 1777 base                                           

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #221: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - ATTAGACAGA TTAATTGAAA TCGGTTCTAA AGAGAACAAC ATTAAAGGTT CA - #CGTACATT   

       60                                                                      

     - CGACGCATTC GGTCAAGTAT TGGCTAAATA TACTAAATCA GGTAATTTAG AT - #GCATTTTT   

      120                                                                      

     - AAATTATAAT AGACAATTGT TCACAAATAT AGACAATATG AACGATTGGT TT - #ATTGATGC   

      180                                                                      

     - TACAGAAGAC CATGTCTACA TCGCAGAACG CGCTTCTGAG GTCGAAGAAA TT - #AAAAATTC   

      240                                                                      

     - TAAACATCGT GCATTCGATA ATTTAAAACG AAGTCACCTT AGAAATACTA TA - #CTCCCACT   

      300                                                                      

     - ACTGAATATT GATAAAGCAC ATCTTTATTT AATTTCAAAT TATAATGCAA TT - #GCCTTTGG   

      360                                                                      

     - TAGTGCAGAG CGATTAGGTA AAAAATCATT AGAAGATATT AAAGATATCG TT - #AACAAAGC   

      420                                                                      

     - TGCAGATGGT TATAGAAACT ATTATGATTT CTGGTATCGT CTAGCGTCTG AT - #AACGTTAA   

      480                                                                      

     - ACAACGACTA CTAAGAGATG CTGTTATTCC TATTTGGGAA GGTTATAACG CT - #CCTGGTGG   

      540                                                                      

     - ATGGGTTGAA AAATATGGCC GCTATAATAC CGACAAAGTA TATACTCCTC TT - #AGAGAATT   

      600                                                                      

     - CTTTGGTCCT ATGGATAAGT ATTATAATTA TAATGGAACA GGAGCTTATG CT - #GCTATATA   

      660                                                                      

     - TCCTAACTCT GATGATATTA GAACTGATGT AAAATATGTT CATTTAGAAA TG - #GTTGGTGA   

      720                                                                      

     - ATACGGTATT TCAGTTTACA CACATGAAAC AACACACGTC AACGACCGTG CG - #ATTTACTT   

      780                                                                      

     - AGGTGGCTTT GGACACCGTG AAGGTACTGA TGCTGAAGCA TATGCTCAGG GT - #ATGCTACA   

      840                                                                      

     - AACTCCTGTT ACTGGTAGTG GATTTGATGA GTTTGGTTCT TTAGGTATTA AT - #ATGGTATT   

      900                                                                      

     - TAAACGCAAA AATGATGGGA ATCAGTGGTA TATTACAGAT CCAAAAACTC TA - #AAAACACG   

      960                                                                      

     - AGAAGATATT AATAGATATA TGAAGGGTTA TAATGACACT TTAACTCTTC TT - #GATGAAAT   

     1020                                                                      

     - TGAGGCTGAA TCTGTGATTT CTCAACAAAA TAAAGATTTA AATAGTGCAT GG - #TTCAAAAA   

     1080                                                                      

     - AATAGATAGA GAATACCGTG ATAACAATAA ATTAAATCAA TGGGATAAAA TT - #CGAAATCT   

     1140                                                                      

     - AAGTCAAGAA GAGAAAAATG AATTAAATAT TCAATCTGTT AATGATTTAG TT - #GATCAACA   

     1200                                                                      

     - ATTAATGACT AATCGCAATC CAGGTAATGG TATCTATAAA CCCGAAGCAA TT - #AGCTATAA   

     1260                                                                      

     - CGATCAATCA CCTTATGTAG GTGTTAGAAT GATGACCGGT ATCTACGGAG GT - #AATACTAG   

     1320                                                                      

     - TAAAGGTGCT CCTGGAGCTG TTTCATTCAA ACATAATGCT TTTAGATTAT GG - #GGTTACTA   

     1380                                                                      

     - CGGATACGAA AATGGGTTCT TAGGTTATGC TTCAAATAAA TATAAACAAC AA - #TCTAAAAC   

     1440                                                                      

     - AGATGGTGAG TCTGTTCTAA GTGATGAATA TATTATCAAG AAAATATCTA AC - #AATACATT   

     1500                                                                      

     - TAATACTATT GAAGAATTTA AAAAAGCTTA CTTCAAAGAA GTTAAAGATA AA - #GCAACGAA   

     1560                                                                      

     - AGGATTAACA ACATTCGAAG TAAATGGTTC TTCCGTTTCA TCATACGATG AT - #TTACTGAC   

     1620                                                                      

     - ATTGTTTAAA GAAGCTGTTA AAAAAGATGC CGAAACTCTT AAACAAGAAG CA - #AACGGTAA   

     1680                                                                      

     - TAAAACAGTA TCTATGAATA ATACAGTTAA ATTAAAAGAA GCTGTTTATA AG - #AAACTTCT   

     1740                                                                      

     #    1777          TTTA AAACTTCAAT CTTTAAA                                

     - (2) INFORMATION FOR SEQ ID NO:222:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 592 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:222:                           

     - Leu Asp Arg Leu Ile Glu Ile Gly Ser Lys Gl - #u Asn Asn Ile Lys Gly     

     #                15                                                       

     - Ser Arg Thr Phe Asp Ala Phe Gly Gln Val Le - #u Ala Lys Tyr Thr Lys     

     #            30                                                           

     - Ser Gly Asn Leu Asp Ala Phe Leu Asn Tyr As - #n Arg Gln Leu Phe Thr     

     #        45                                                               

     - Asn Ile Asp Asn Met Asn Asp Trp Phe Ile As - #p Ala Thr Glu Asp His     

     #    60                                                                   

     - Val Tyr Ile Ala Glu Arg Ala Ser Glu Val Gl - #u Glu Ile Lys Asn Ser     

     #80                                                                       

     - Lys His Arg Ala Phe Asp Asn Leu Lys Arg Se - #r His Leu Arg Asn Thr     

     #                95                                                       

     - Ile Leu Pro Leu Leu Asn Ile Asp Lys Ala Hi - #s Leu Tyr Leu Ile Ser     

     #           110                                                           

     - Asn Tyr Asn Ala Ile Ala Phe Gly Ser Ala Gl - #u Arg Leu Gly Lys Lys     

     #       125                                                               

     - Ser Leu Glu Asp Ile Lys Asp Ile Val Asn Ly - #s Ala Ala Asp Gly Tyr     

     #   140                                                                   

     - Arg Asn Tyr Tyr Asp Phe Trp Tyr Arg Leu Al - #a Ser Asp Asn Val Lys     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Gln Arg Leu Leu Arg Asp Ala Val Ile Pro Il - #e Trp Glu Gly Tyr Asn     

     #               175                                                       

     - Ala Pro Gly Gly Trp Val Glu Lys Tyr Gly Ar - #g Tyr Asn Thr Asp Lys     

     #           190                                                           

     - Val Tyr Thr Pro Leu Arg Glu Phe Phe Gly Pr - #o Met Asp Lys Tyr Tyr     

     #       205                                                               

     - Asn Tyr Asn Gly Thr Gly Ala Tyr Ala Ala Il - #e Tyr Pro Asn Ser Asp     

     #   220                                                                   

     - Asp Ile Arg Thr Asp Val Lys Tyr Val His Le - #u Glu Met Val Gly Glu     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Tyr Gly Ile Ser Val Tyr Thr His Glu Thr Th - #r His Val Asn Asp Arg     

     #               255                                                       

     - Ala Ile Tyr Leu Gly Gly Phe Gly His Arg Gl - #u Gly Thr Asp Ala Glu     

     #           270                                                           

     - Ala Tyr Ala Gln Gly Met Leu Gln Thr Pro Va - #l Thr Gly Ser Gly Phe     

     #       285                                                               

     - Asp Glu Phe Gly Ser Leu Gly Ile Asn Met Va - #l Phe Lys Arg Lys Asn     

     #   300                                                                   

     - Asp Gly Asn Gln Trp Tyr Ile Thr Asp Pro Ly - #s Thr Leu Lys Thr Arg     

     305                 3 - #10                 3 - #15                 3 -   

     #20                                                                       

     - Glu Asp Ile Asn Arg Tyr Met Lys Gly Tyr As - #n Asp Thr Leu Thr Leu     

     #               335                                                       

     - Leu Asp Glu Ile Glu Ala Glu Ser Val Ile Se - #r Gln Gln Asn Lys Asp     

     #           350                                                           

     - Leu Asn Ser Ala Trp Phe Lys Lys Ile Asp Ar - #g Glu Tyr Arg Asp Asn     

     #       365                                                               

     - Asn Lys Leu Asn Gln Trp Asp Lys Ile Arg As - #n Leu Ser Gln Glu Glu     

     #   380                                                                   

     - Lys Asn Glu Leu Asn Ile Gln Ser Val Asn As - #p Leu Val Asp Gln Gln     

     385                 3 - #90                 3 - #95                 4 -   

     #00                                                                       

     - Leu Met Thr Asn Arg Asn Pro Gly Asn Gly Il - #e Tyr Lys Pro Glu Ala     

     #               415                                                       

     - Ile Ser Tyr Asn Asp Gln Ser Pro Tyr Val Gl - #y Val Arg Met Met Thr     

     #           430                                                           

     - Gly Ile Tyr Gly Gly Asn Thr Ser Lys Gly Al - #a Pro Gly Ala Val Ser     

     #       445                                                               

     - Phe Lys His Asn Ala Phe Arg Leu Trp Gly Ty - #r Tyr Gly Tyr Glu Asn     

     #   460                                                                   

     - Gly Phe Leu Gly Tyr Ala Ser Asn Lys Tyr Ly - #s Gln Gln Ser Lys Thr     

     465                 4 - #70                 4 - #75                 4 -   

     #80                                                                       

     - Asp Gly Glu Ser Val Leu Ser Asp Glu Tyr Il - #e Ile Lys Lys Ile Ser     

     #               495                                                       

     - Asn Asn Thr Phe Asn Thr Ile Glu Glu Phe Ly - #s Lys Ala Tyr Phe Lys     

     #           510                                                           

     - Glu Val Lys Asp Lys Ala Thr Lys Gly Leu Th - #r Thr Phe Glu Val Asn     

     #       525                                                               

     - Gly Ser Ser Val Ser Ser Tyr Asp Asp Leu Le - #u Thr Leu Phe Lys Glu     

     #   540                                                                   

     - Ala Val Lys Lys Asp Ala Glu Thr Leu Lys Gl - #n Glu Ala Asn Gly Asn     

     545                 5 - #50                 5 - #55                 5 -   

     #60                                                                       

     - Lys Thr Val Ser Met Asn Asn Thr Val Lys Le - #u Lys Glu Ala Val Tyr     

     #               575                                                       

     - Lys Lys Leu Leu Gln Gln Thr Asn Ser Phe Ly - #s Thr Ser Ile Phe Lys     

     #           590                                                           

     - (2) INFORMATION FOR SEQ ID NO: 223:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 460 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #223: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - TAAGACAGAT GAACGGAGCA AGGTGTTTGA CTTTTCCATT CCCTACTATA CT - #GCAAAAAA   

       60                                                                      

     - TAAACTCATT GTCAAAAAAT CTGACTTGAC TACTTATCAG TCTGTAAACG AC - #TTGGCGCA   

      120                                                                      

     - GAAAAAGGTT GGAGCGCAGA AAGGTTCGAT TCAAGAGACG ATGGCGAAAG AT - #TTGCTACA   

      180                                                                      

     - AAATTCTTCC CTCGTATCTC TGCCTAAAAA TGGGAATTTA ATCACAGATT TA - #AAATCAGG   

      240                                                                      

     - ACAAGTGGAT GCCGTTATCT TTGAAGAACC TGTTTCCAAG GGATTTGTGG AA - #AATAATCC   

      300                                                                      

     - TGATTTAGCA ATCGCAGACC TCAATTTTGA AAAAGAGCAA GATGATTCCT AC - #GCGGTAGC   

      360                                                                      

     - CATgAAAAAA GATAGCAAGA AATTGAAGAG GCAGTTCGAT AAAACCATTC AA - #AAGTTGAA   

      420                                                                      

     #   460            GACA AACTCATTGA GGAAGCCTTA                             

     - (2) INFORMATION FOR SEQ ID NO:224:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 153 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:224:                           

     - Lys Thr Asp Glu Arg Ser Lys Val Phe Asp Ph - #e Ser Ile Pro Tyr Tyr     

     #                15                                                       

     - Thr Ala Lys Asn Lys Leu Ile Val Lys Lys Se - #r Asp Leu Thr Thr Tyr     

     #            30                                                           

     - Gln Ser Val Asn Asp Leu Ala Gln Lys Lys Va - #l Gly Ala Gln Lys Gly     

     #        45                                                               

     - Ser Ile Gln Glu Thr Met Ala Lys Asp Leu Le - #u Gln Asn Ser Ser Leu     

     #    60                                                                   

     - Val Ser Leu Pro Lys Asn Gly Asn Leu Ile Th - #r Asp Leu Lys Ser Gly     

     #80                                                                       

     - Gln Val Asp Ala Val Ile Phe Glu Glu Pro Va - #l Ser Lys Gly Phe Val     

     #                95                                                       

     - Glu Asn Asn Pro Asp Leu Ala Ile Ala Asp Le - #u Asn Phe Glu Lys Glu     

     #           110                                                           

     - Gln Asp Asp Ser Tyr Ala Val Ala Met Lys Ly - #s Asp Ser Lys Lys Leu     

     #       125                                                               

     - Lys Arg Gln Phe Asp Lys Thr Ile Gln Lys Le - #u Lys Glu Ser Gly Glu     

     #   140                                                                   

     - Leu Asp Lys Leu Ile Glu Glu Ala Leu                                     

     145                 1 - #50                                               

     - (2) INFORMATION FOR SEQ ID NO: 225:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 751 base                                            

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #225: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     - CTGTGAGAAT CAAGCTACAC CCAAAGAGAC TAGCGCTCAA AAGACAATCG TC - #CTTGCTAC   

       60                                                                      

     - AGCTGGCGAC GTGCCACCAT TTGACTACGA AGACAAGGGC AATCTGACAG GC - #TTTGATAT   

      120                                                                      

     - CGAAGTTTTA AAGGCAGTAG ATGAAAAACT CAGCGACTAC GAGATTCAAT TC - #CAAAGAAC   

      180                                                                      

     - CGCCTGGGAG AGCATCTTCC CAGGACTTGA TTCTGGTCAC TATCAGGCTG CG - #GCCAATAA   

      240                                                                      

     - CTTGAGTTAC ACAAAAGAGC GTGCTGAAAA ATACCTTTAC TCGCTTCCAA TT - #TCCAACAA   

      300                                                                      

     - TCCCCTCGTC CTTGTCAGCA ACAAGAAAAA TCCTTTGACT TCTCTTGACC AG - #ATCGCTGG   

      360                                                                      

     - TAAAACAACA CAAGAGGATA CCGGAACTTC TAACGCTCAA TTCATCAATA AC - #TGGAATCA   

      420                                                                      

     - GAAACACACT GATAATCCCG CTACAATTAA TTTTTCTGGT GAGGATATTG GT - #AAACGAAT   

      480                                                                      

     - CCTAGACCTT GCTAACGGAG AGTTTGATTT CCTAGTTTTT GACAAGGTAT CC - #GTTCAAAA   

      540                                                                      

     - GATTATCAAG GACCGTGGTT TAGACCTCTC AGTCGTTGAT TTACCTTCTG CA - #GATAGCCC   

      600                                                                      

     - CAGCAATTAT ATCATTTTCT CAAGCGACCA AAAAGAGTTT AAAGAGCAAT TT - #GATAAAGC   

      660                                                                      

     - GCTCAAAGAA CTCTATCAAG ACGGAACCCT TGAAAAACTC AGCAATACCT AT - #CTAGGTGG   

      720                                                                      

     #         751      CAAT CTCAGTTACA A                                      

     - (2) INFORMATION FOR SEQ ID NO:226:                                      

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #acids    (A) LENGTH: 250 amino                                           

               (B) TYPE: amino acid                                            

               (C) STRANDEDNESS: single                                        

               (D) TOPOLOGY: linear                                            

     -     (ii) MOLECULE TYPE: protein                                         

     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:226:                           

     - Cys Glu Asn Gln Ala Thr Pro Lys Glu Thr Se - #r Ala Gln Lys Thr Ile     

     #                15                                                       

     - Val Leu Ala Thr Ala Gly Asp Val Pro Pro Ph - #e Asp Tyr Glu Asp Lys     

     #            30                                                           

     - Gly Asn Leu Thr Gly Phe Asp Ile Glu Val Le - #u Lys Ala Val Asp Glu     

     #        45                                                               

     - Lys Leu Ser Asp Tyr Glu Ile Gln Phe Gln Ar - #g Thr Ala Trp Glu Ser     

     #    60                                                                   

     - Ile Phe Pro Gly Leu Asp Ser Gly His Tyr Gl - #n Ala Ala Ala Asn Asn     

     #80                                                                       

     - Leu Ser Tyr Thr Lys Glu Arg Ala Glu Lys Ty - #r Leu Tyr Ser Leu Pro     

     #                95                                                       

     - Ile Ser Asn Asn Pro Leu Val Leu Val Ser As - #n Lys Lys Asn Pro Leu     

     #           110                                                           

     - Thr Ser Leu Asp Gln Ile Ala Gly Lys Thr Th - #r Gln Glu Asp Thr Gly     

     #       125                                                               

     - Thr Ser Asn Ala Gln Phe Ile Asn Asn Trp As - #n Gln Lys His Thr Asp     

     #   140                                                                   

     - Asn Pro Ala Thr Ile Asn Phe Ser Gly Glu As - #p Ile Gly Lys Arg Ile     

     145                 1 - #50                 1 - #55                 1 -   

     #60                                                                       

     - Leu Asp Leu Ala Asn Gly Glu Phe Asp Phe Le - #u Val Phe Asp Lys Val     

     #               175                                                       

     - Ser Val Gln Lys Ile Ile Lys Asp Arg Gly Le - #u Asp Leu Ser Val Val     

     #           190                                                           

     - Asp Leu Pro Ser Ala Asp Ser Pro Ser Asn Ty - #r Ile Ile Phe Ser Ser     

     #       205                                                               

     - Asp Gln Lys Glu Phe Lys Glu Gln Phe Asp Ly - #s Ala Leu Lys Glu Leu     

     #   220                                                                   

     - Tyr Gln Asp Gly Thr Leu Glu Lys Leu Ser As - #n Thr Tyr Leu Gly Gly     

     225                 2 - #30                 2 - #35                 2 -   

     #40                                                                       

     - Ser Tyr Leu Pro Asp Gln Ser Gln Leu Gln                                 

     #               250                                                       

     - (2) INFORMATION FOR SEQ ID NO: 227:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 33 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #227: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #         33       CTAC GACAATAAAA ATC                                    

     - (2) INFORMATION FOR SEQ ID NO: 228:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 27 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #228: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #             27   TGCT GGTTGAG                                           

     - (2) INFORMATION FOR SEQ ID NO: 229:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 30 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #229: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #           30     CAAT ACGGACTATG                                        

     - (2) INFORMATION FOR SEQ ID NO: 230:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 28 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #230: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #             28   TAGG TCGGAAAC                                          

     - (2) INFORMATION FOR SEQ ID NO: 231:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 35 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #231: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       35         TCAA GCTACACCCA AAGAG                                  

     - (2) INFORMATION FOR SEQ ID NO: 232:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 31 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #232: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          31      CTGA GATTGATCTG G                                      

     - (2) INFORMATION FOR SEQ ID NO: 233:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #233: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          CCGC TCTTCTCGTA ACGCAGC                                

     - (2) INFORMATION FOR SEQ ID NO: 234:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 32 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #234: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          32      GAAC TTTTACGCTT CC                                     

     - (2) INFORMATION FOR SEQ ID NO: 235:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 45 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #235: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #45                AATT TGACAGGTAA CAGCAAAAAA GCTGC                       

     - (2) INFORMATION FOR SEQ ID NO: 236:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 33 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #236: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #         33       TTTT CAAGAATTCA TCG                                    

     - (2) INFORMATION FOR SEQ ID NO: 237:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 35 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #237: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       35         AGGA ACTGCTTCTA AAGAC                                  

     - (2) INFORMATION FOR SEQ ID NO: 238:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 32 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #238: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          32      ATTC GTTGGTGAAG CC                                     

     - (2) INFORMATION FOR SEQ ID NO: 239:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 38 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #239: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #     38           AGGT GGAAACGCTG GTTCATCC                               

     - (2) INFORMATION FOR SEQ ID NO: 240:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 35 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #240: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       35         TTTT CCACCTTCAA CAACC                                  

     - (2) INFORMATION FOR SEQ ID NO: 241:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #241: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          CTAT GGTAAATCTG CGGATGG                                

     - (2) INFORMATION FOR SEQ ID NO: 242:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 42 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #242: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #  42              ATCC GCTTTCATCG GGTTAAAGAA GG                          

     - (2) INFORMATION FOR SEQ ID NO: 243:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #243: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          AAAT TCTAGCGAAA CTAGTGG                                

     - (2) INFORMATION FOR SEQ ID NO: 244:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #244: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          TTCT TTTACTTCTT TGGTTGC                                

     - (2) INFORMATION FOR SEQ ID NO: 245:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 40 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #245: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #    40            CGGA AAAAAAGATA CAACTTCTGG                             

     - (2) INFORMATION FOR SEQ ID NO: 246:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #246: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          CAAT CCTTCAGCAA TCTTGTC                                

     - (2) INFORMATION FOR SEQ ID NO: 247:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #247: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          AAAA AATACAGCTT CAAGTCC                                

     - (2) INFORMATION FOR SEQ ID NO: 248:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 35 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #248: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       35         GTTT GCTTGGTATT GGTCG                                  

     - (2) INFORMATION FOR SEQ ID NO: 249:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 41 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #249: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #   41             AAAC TCAAGCACTA GTCAGACAGA G                           

     - (2) INFORMATION FOR SEQ ID NO: 250:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 34 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #250: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #        34        AGCT TTTTGTATGT CTTC                                   

     - (2) INFORMATION FOR SEQ ID NO: 251:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #251: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          TTCT GGCGGAAGTA AAGATGC                                

     - (2) INFORMATION FOR SEQ ID NO: 252:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 36 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #252: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       36         TAGC TTTTTTGATT GTTTCG                                 

     - (2) INFORMATION FOR SEQ ID NO: 253:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 40 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #253: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #    40            AGAA AAAACAAAAA ATGAAGATGG                             

     - (2) INFORMATION FOR SEQ ID NO: 254:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 31 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #254: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          31      GTAG TCTCCGCCTT C                                      

     - (2) INFORMATION FOR SEQ ID NO: 255:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 38 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #255: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #     38           TCTG TGGTCAAATA ATCTTACC                               

     - (2) INFORMATION FOR SEQ ID NO: 256:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #256: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          AACA TGGTGCTTGC CAATAGG                                

     - (2) INFORMATION FOR SEQ ID NO: 257:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #257: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          AGAA AAGAAAGCAG ACAATGC                                

     - (2) INFORMATION FOR SEQ ID NO: 258:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 36 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #258: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       36         TGGT TGATCCAAAC CATCTG                                 

     - (2) INFORMATION FOR SEQ ID NO: 259:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 36 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #259: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       36         AGGG TCAGAAGGTG CAGACC                                 

     - (2) INFORMATION FOR SEQ ID NO: 260:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 36 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #260: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       36         GCTT GGTGTGCCCC AGTTGC                                 

     - (2) INFORMATION FOR SEQ ID NO: 261:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 33 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #261: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #         33       GGCA GCTTTTAAAA ATC                                    

     - (2) INFORMATION FOR SEQ ID NO: 262:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 31 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #262: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          31      CCAT TCACCATTAC C                                      

     - (2) INFORMATION FOR SEQ ID NO: 263:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 29 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #263: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #            29    GCAA AAAATTAAG                                         

     - (2) INFORMATION FOR SEQ ID NO: 264:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 28 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #264: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #             28   CCAT TCACCATT                                          

     - (2) INFORMATION FOR SEQ ID NO: 265:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 35 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #265: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       35         TGAG GAAGAAACTA AAAAG                                  

     - (2) INFORMATION FOR SEQ ID NO: 266:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 38 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #266: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #     38           CTGT AGGAATGCTT CGAATTTG                               

     - (2) INFORMATION FOR SEQ ID NO: 267:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 38 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #267: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #     38           TAAC AATAAAACTA TTGAAGAG                               

     - (2) INFORMATION FOR SEQ ID NO: 268:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 33 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #268: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #         33       ACCT CCAAAAATCA CGG                                    

     - (2) INFORMATION FOR SEQ ID NO: 269:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #269: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          AGGT AAACAACTAC AAGTCGG                                

     - (2) INFORMATION FOR SEQ ID NO: 270:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 32 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #270: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          32      GTTT GGCTCAGAAT TG                                     

     - (2) INFORMATION FOR SEQ ID NO: 271:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 32 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #271: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          32      TGAT ACAAGTATCG CA                                     

     - (2) INFORMATION FOR SEQ ID NO: 272:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 31 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #272: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          31      AGTA TGGCTAGATG G                                      

     - (2) INFORMATION FOR SEQ ID NO: 273:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #273: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          ATCA TTTGAAAACA AAGAAAC                                

     - (2) INFORMATION FOR SEQ ID NO: 274:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 32 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #274: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          32      TGTT GCTGTGCTTG TG                                     

     - (2) INFORMATION FOR SEQ ID NO: 275:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 39 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #275: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #    39            AAAG GAAAGTCAGA CAGGAAAGG                              

     - (2) INFORMATION FOR SEQ ID NO: 276:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 41 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #276: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #   41             GAGC TTTTTTGATA AAGGTTGCGC A                           

     - (2) INFORMATION FOR SEQ ID NO: 277:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 40 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #277: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #    40            TAGA TATATTTTAG CATTTGAGAC                             

     - (2) INFORMATION FOR SEQ ID NO: 278:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 35 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #278: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       35         TATC AAAGGCAAGA CTTGG                                  

     - (2) INFORMATION FOR SEQ ID NO: 279:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 34 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #279: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #        34        TAAA GTTGGTATTA ACGG                                   

     - (2) INFORMATION FOR SEQ ID NO: 280:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 36 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #280: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       36         TTTG CGAAGTATTC CAAGAG                                 

     - (2) INFORMATION FOR SEQ ID NO: 281:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #281: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          CGAG TTGGGACTGT ATCAAGC                                

     - (2) INFORMATION FOR SEQ ID NO: 282:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 40 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #282: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #    40            TTTT TCCTTACTTA CAGATGAAGG                             

     - (2) INFORMATION FOR SEQ ID NO: 283:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #283: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          GATG CATCATAATC TAGGAGC                                

     - (2) INFORMATION FOR SEQ ID NO: 284:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #284: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          GACA TCTCCATCAT AAGTCGC                                

     - (2) INFORMATION FOR SEQ ID NO: 285:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 35 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #285: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       35         GAGA AAGTATTTGC AGGGG                                  

     - (2) INFORMATION FOR SEQ ID NO: 286:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 39 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #286: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #    39            TTTC ATGGATGCAA TTTTTTTGG                              

     - (2) INFORMATION FOR SEQ ID NO: 287:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 43 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #287: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     # 43               ATTT ACTATCCATA CAGTAGAGTC AGC                         

     - (2) INFORMATION FOR SEQ ID NO: 288:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 40 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #288: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #    40            AGGT TGCAATTCTG GATTAATTGG                             

     - (2) INFORMATION FOR SEQ ID NO: 289:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 30 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #289: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #           30     GGAA AGAGTGGATG                                        

     - (2) INFORMATION FOR SEQ ID NO: 290:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #290: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          TTTA AATTGACTAT GCGCCCG                                

     - (2) INFORMATION FOR SEQ ID NO: 291:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 36 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #291: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       36         CTAT GAACTTGGTC GTCACC                                 

     - (2) INFORMATION FOR SEQ ID NO: 292:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 35 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #292: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       35         GATT TTTCCAAGTA AATCT                                  

     - (2) INFORMATION FOR SEQ ID NO: 293:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 34 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #293: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #        34        GGGT GAATTAGAAA AAGG                                   

     - (2) INFORMATION FOR SEQ ID NO: 294:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 33 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #294: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #         33       AGGA TTGTTAGTAG TTG                                    

     - (2) INFORMATION FOR SEQ ID NO: 295:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #295: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          TCAG GCTCAAGAAA GTTCAGG                                

     - (2) INFORMATION FOR SEQ ID NO: 296:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 36 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #296: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       36         TGAT GGAGCAAAGT AATACC                                 

     - (2) INFORMATION FOR SEQ ID NO: 297:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 40 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #297: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #    40            TGTA ACCCATATCC AGCTCCTTCC                             

     - (2) INFORMATION FOR SEQ ID NO: 298:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 34 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #298: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #        34        TTGT TTATCTGGGG TTGC                                   

     - (2) INFORMATION FOR SEQ ID NO: 299:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 38 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #299: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #     38           TGGT ACTTGGCAAG GAAAACAG                               

     - (2) INFORMATION FOR SEQ ID NO: 300:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 35 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #300: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       35         GCCA CCTAGCTTCT CATTG                                  

     - (2) INFORMATION FOR SEQ ID NO: 301:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 40 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #301: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #    40            TCAA TATGTCAGAG ATGATACTAG                             

     - (2) INFORMATION FOR SEQ ID NO: 302:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 35 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #302: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       35         CCAT TCACCATTAT CATTG                                  

     - (2) INFORMATION FOR SEQ ID NO: 303:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 35 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #303: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       35         TAGA GAAGCATTAA AAACC                                  

     - (2) INFORMATION FOR SEQ ID NO: 304:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 36 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #304: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       36         ATCT TGAAACTCCT CTGGTC                                 

     - (2) INFORMATION FOR SEQ ID NO: 305:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 35 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #305: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       35         TGTC GAGGAGTGTC ATACC                                  

     - (2) INFORMATION FOR SEQ ID NO: 306:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 36 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #306: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       36         TTTA CCCTTACGAA TCCAGG                                 

     - (2) INFORMATION FOR SEQ ID NO: 307:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 43 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #307: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     # 43               AGTT TATGCGGATG AAACACTTAT TAC                         

     - (2) INFORMATION FOR SEQ ID NO: 308:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 34 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #308: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #        34        GTAG AGATAGAAGT CATG                                   

     - (2) INFORMATION FOR SEQ ID NO: 309:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 38 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #309: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #     38           TGGT ATCGTAGATA CAGCCGGC                               

     - (2) INFORMATION FOR SEQ ID NO: 310:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #310: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          TTGC GTACCTTCTA AGCGACC                                

     - (2) INFORMATION FOR SEQ ID NO: 311:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #311: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          GGTT GCATGGGATG CGATTCG                                

     - (2) INFORMATION FOR SEQ ID NO: 312:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 34 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #312: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #        34        TTTA TTAGTTTGAC TAGC                                   

     - (2) INFORMATION FOR SEQ ID NO: 313:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 33 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #313: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #         33       CTAT GTAAATAAAG AGA                                    

     - (2) INFORMATION FOR SEQ ID NO: 314:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 30 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #314: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #           30     CCCT GTTTGAGGCA                                        

     - (2) INFORMATION FOR SEQ ID NO: 315:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 34 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #315: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #        34        TCCT CAATCAATAA CAAA                                   

     - (2) INFORMATION FOR SEQ ID NO: 316:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 34 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #316: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #        34        AGTA GGAGAAACTG AACT                                   

     - (2) INFORMATION FOR SEQ ID NO: 317:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 30 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #317: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #           30     TCAA GAAACTGCGG                                        

     - (2) INFORMATION FOR SEQ ID NO: 318:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 31 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #318: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          31      CTCA TTCTTGCTTC C                                      

     - (2) INFORMATION FOR SEQ ID NO: 319:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 29 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #319: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #            29    AGGT GAGACTGAG                                         

     - (2) INFORMATION FOR SEQ ID NO: 320:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 36 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #320: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       36         TAAT TCAAGTGTTT TCTCTG                                 

     - (2) INFORMATION FOR SEQ ID NO: 321:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 34 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #321: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #        34        ATTG GTAGCACAAG ATCC                                   

     - (2) INFORMATION FOR SEQ ID NO: 322:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 29 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #322: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #            29    AGCC ACTGGTCTC                                         

     - (2) INFORMATION FOR SEQ ID NO: 323:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 31 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #323: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          31      GTCA GCTTCAGGAA C                                      

     - (2) INFORMATION FOR SEQ ID NO: 324:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 30 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #324: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #           30     CTTG TCCCAGGTGG                                        

     - (2) INFORMATION FOR SEQ ID NO: 325:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 32 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #325: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          32      TGAT GCGGATGAAA AG                                     

     - (2) INFORMATION FOR SEQ ID NO: 326:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 33 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #326: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #         33       TCTT TGGGTATTTC GCA                                    

     - (2) INFORMATION FOR SEQ ID NO: 327:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 30 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #327: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #           30     TCGA TCAAAAGTAG                                        

     - (2) INFORMATION FOR SEQ ID NO: 328:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 28 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #328: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #             28   GTCT CGAGGTTC                                          

     - (2) INFORMATION FOR SEQ ID NO: 329:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 32 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #329: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          32      AACA GGAAACTGGG AC                                     

     - (2) INFORMATION FOR SEQ ID NO: 330:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 33 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #330: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #         33       TTCT GTACCTGTGT TTG                                    

     - (2) INFORMATION FOR SEQ ID NO: 331:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 41 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #331: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #   41             GCTC AATCCAACCC CAGGTCAAGT C                           

     - (2) INFORMATION FOR SEQ ID NO: 332:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 40 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #332: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #    40            TTTA TCCTCTGACA TCATCGTATC                             

     - (2) INFORMATION FOR SEQ ID NO: 333:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 34 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #333: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #        34        TCAA AAACAGGCAG ATGG                                   

     - (2) INFORMATION FOR SEQ ID NO: 334:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 30 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #334: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #           30     CATA GTCCAAGGCA                                        

     - (2) INFORMATION FOR SEQ ID NO: 335:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 38 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #335: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #     38           AGGA TCGAACGGTA AGACAACC                               

     - (2) INFORMATION FOR SEQ ID NO: 336:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 34 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #336: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #        34        TAAC TCCGCTACTG TGTC                                   

     - (2) INFORMATION FOR SEQ ID NO: 337:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 38 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #337: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #     38           ATCG AAGATGGTTG GGAAGTCC                               

     - (2) INFORMATION FOR SEQ ID NO: 338:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 32 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #338: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          32      CCAC ATGCTCAACC TT                                     

     - (2) INFORMATION FOR SEQ ID NO: 339:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #339: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          AGCT AGTGAAATGC AAGAAAG                                

     - (2) INFORMATION FOR SEQ ID NO: 340:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #340: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          TTTT GAACTAGTTG CTTTCGT                                

     - (2) INFORMATION FOR SEQ ID NO: 341:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #341: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          GATG GGGCACAAGG TTCAGGG                                

     - (2) INFORMATION FOR SEQ ID NO: 342:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 35 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #342: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       35         TAAC GAACAGTTCA ATCTG                                  

     - (2) INFORMATION FOR SEQ ID NO: 343:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 36 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #343: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       36         CCCA ACTGTTGGTA CTTTCC                                 

     - (2) INFORMATION FOR SEQ ID NO: 344:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 34 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #344: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #        34        TGTT ACATTTTGAC CGTC                                   

     - (2) INFORMATION FOR SEQ ID NO: 345:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 35 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #345: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       35         CCCA ACTGTTGGTA CTTTC                                  

     - (2) INFORMATION FOR SEQ ID NO: 346:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #346: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          ATAA CGATCATTTT CTTTACC                                

     - (2) INFORMATION FOR SEQ ID NO: 347:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 38 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #347: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #     38           ATAT TTAAGTCTAA GTGAAGCG                               

     - (2) INFORMATION FOR SEQ ID NO: 348:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 34 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #348: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #        34        TGTT ACATTTTGCA AGTC                                   

     - (2) INFORMATION FOR SEQ ID NO: 349:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 32 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #349: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          32      TTTT GAAGGAAGTA AG                                     

     - (2) INFORMATION FOR SEQ ID NO: 350:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #350: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          TTTG AAAAATGGAG GTGTATC                                

     - (2) INFORMATION FOR SEQ ID NO: 351:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 31 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #351: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          31      CCTC TCGAGAGAAA G                                      

     - (2) INFORMATION FOR SEQ ID NO: 352:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 32 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #352: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          32      TTTT ACTCGTTTGA CA                                     

     - (2) INFORMATION FOR SEQ ID NO: 353:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 42 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #353: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #  42              CAAA AGTCAGACCG CTAAGAAAGT GC                          

     - (2) INFORMATION FOR SEQ ID NO: 354:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 38 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #354: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #     38           ATCC AAATACTGGT TGTTGATG                               

     - (2) INFORMATION FOR SEQ ID NO: 355:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 36 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #355: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       36         GTCT CAGGATCAGA CTCAGG                                 

     - (2) INFORMATION FOR SEQ ID NO: 356:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 36 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #356: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       36         CATC CACCTCTTGA CCTTTG                                 

     - (2) INFORMATION FOR SEQ ID NO: 357:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 38 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #357: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #     38           CTTT GCCAAATGGT GGGAAGGG                               

     - (2) INFORMATION FOR SEQ ID NO: 358:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 40 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #358: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #    40            TAAC ACTTTTCGAG GTTTGGTACC                             

     - (2) INFORMATION FOR SEQ ID NO: 359:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 33 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #359: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #         33       AGAG AAGGAAAACT TGG                                    

     - (2) INFORMATION FOR SEQ ID NO: 360:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 34 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #360: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #        34        CAAC AAACCTTGTT CTTG                                   

     - (2) INFORMATION FOR SEQ ID NO: 361:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 31 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #361: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          31      TATT GAGGACCACT T                                      

     - (2) INFORMATION FOR SEQ ID NO: 362:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 34 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #362: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #        34        CTCA GTCAATTCTT TTCC                                   

     - (2) INFORMATION FOR SEQ ID NO: 363:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 35 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #363: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       35         AAAT ACCAGAGGTG TTCAG                                  

     - (2) INFORMATION FOR SEQ ID NO: 364:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 36 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #364: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       36         ATGG GTGTGACAGG TTTGAA                                 

     - (2) INFORMATION FOR SEQ ID NO: 365:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 35 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #365: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       35         ACAA TTAGAAAAAG ATAGC                                  

     - (2) INFORMATION FOR SEQ ID NO: 366:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 34 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #366: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #        34        ACTA GGTTCTGCAA TGCC                                   

     - (2) INFORMATION FOR SEQ ID NO: 367:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 40 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #367: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #    40            CAAG CAAAAAGAAG CAGTCAATGA                             

     - (2) INFORMATION FOR SEQ ID NO: 368:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 34 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #368: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #        34        GACT TTACGATTTG CTCC                                   

     - (2) INFORMATION FOR SEQ ID NO: 369:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 41 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #369: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #   41             CTCT GTCCAGTCCA CTTTTTCAGC G                           

     - (2) INFORMATION FOR SEQ ID NO: 370:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 33 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #370: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #         33       TGTT TCCTTAATGC GTT                                    

     - (2) INFORMATION FOR SEQ ID NO: 371:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 39 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #371: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #    39            AATT CAAAAAAATA GGCAAGAGG                              

     - (2) INFORMATION FOR SEQ ID NO: 372:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 34 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #372: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #        34        TTTG ATTGCCAACA ACTG                                   

     - (2) INFORMATION FOR SEQ ID NO: 373:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 39 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #373: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #    39            CCAG CAACAAAGCG AGCAAAAGG                              

     - (2) INFORMATION FOR SEQ ID NO: 374:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 32 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #374: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          32      TTCT TTTTCCACAC GA                                     

     - (2) INFORMATION FOR SEQ ID NO: 375:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 39 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #375: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #    39            ACAA GTCGCCCACT ATCAAGACT                              

     - (2) INFORMATION FOR SEQ ID NO: 376:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 33 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #376: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #         33       CTCT TTCTTTTCAG GCT                                    

     - (2) INFORMATION FOR SEQ ID NO: 377:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 36 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #377: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       36         CGGC TGGCAATATA TCCCGT                                 

     - (2) INFORMATION FOR SEQ ID NO: 378:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #378: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          CCAT TCGCCATTAT AGTTGAC                                

     - (2) INFORMATION FOR SEQ ID NO: 379:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 35 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #379: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       35         ATCA GAATGGGTAG AAGAC                                  

     - (2) INFORMATION FOR SEQ ID NO: 380:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 34 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #380: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #        34        ATTG ATTTTCATCA TCAC                                   

     - (2) INFORMATION FOR SEQ ID NO: 381:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 35 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #381: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       35         AGAT GATTCTGAAG GATGG                                  

     - (2) INFORMATION FOR SEQ ID NO: 382:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 38 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #382: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #     38           CCAT TCACCATTCT AGTTTAAG                               

     - (2) INFORMATION FOR SEQ ID NO: 383:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #383: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          TGCA AATGAAACTG AAGTAGC                                

     - (2) INFORMATION FOR SEQ ID NO: 384:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 34 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #384: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #        34        AACG CTGACATCTA CGCG                                   

     - (2) INFORMATION FOR SEQ ID NO: 385:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #385: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          TCAG CCTACTTTTG TAAGAGC                                

     - (2) INFORMATION FOR SEQ ID NO: 386:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 34 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #386: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #        34        TTCA CCATTGGCAT TGAC                                   

     - (2) INFORMATION FOR SEQ ID NO: 387:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 41 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #387: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #   41             GGTG AAAGGTCATG CTACATTTGT G                           

     - (2) INFORMATION FOR SEQ ID NO: 388:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 33 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #388: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #         33       TTGA GACCTTGCAA CAC                                    

     - (2) INFORMATION FOR SEQ ID NO: 389:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 41 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #389: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #   41             TCCT TTGAAGGATT TGAGAGAAAC C                           

     - (2) INFORMATION FOR SEQ ID NO: 390:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 41 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #390: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #   41             AAGA TAAGATAAAT ATATATAAAG T                           

     - (2) INFORMATION FOR SEQ ID NO: 391:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 47 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #391: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #                47ATAT GGGACTTTTT TTCTACAACA AAATAGG                     

     - (2) INFORMATION FOR SEQ ID NO: 392:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #392: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          CAGC TCATTTAATC GTTTTTG                                

     - (2) INFORMATION FOR SEQ ID NO: 393:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 35 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #393: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       35         TGAG AATTATTTGC GAATG                                  

     - (2) INFORMATION FOR SEQ ID NO: 394:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 31 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #394: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          31      ACAA AAGTAATGTA C                                      

     - (2) INFORMATION FOR SEQ ID NO: 395:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 35 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #395: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       35         TCAA TCAAGTTCTT CAGCC                                  

     - (2) INFORMATION FOR SEQ ID NO: 396:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #396: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          GGCT TGGACCAGAT TGAAAAG                                

     - (2) INFORMATION FOR SEQ ID NO: 397:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 36 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #397: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       36         AACA TTAAAACGTC CTGAGG                                 

     - (2) INFORMATION FOR SEQ ID NO: 398:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 34 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #398: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #        34        AACT GTGACGCTGG TTCC                                   

     - (2) INFORMATION FOR SEQ ID NO: 399:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 35 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #399: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       35         GGAG ACCTTTAAAA ATATC                                  

     - (2) INFORMATION FOR SEQ ID NO: 400:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 31 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #400: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          31      CATC TTGTACATAC C                                      

     - (2) INFORMATION FOR SEQ ID NO: 401:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 31 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #401: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          31      TGCG CAATCAAATT C                                      

     - (2) INFORMATION FOR SEQ ID NO: 402:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #402: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          AGCC CAATAATCTA TAAAGCT                                

     - (2) INFORMATION FOR SEQ ID NO: 403:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 32 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #403: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          32      CGTT CATCAAGATG TC                                     

     - (2) INFORMATION FOR SEQ ID NO: 404:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 32 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #404: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          32      TGTT GAAAAGAGAG TG                                     

     - (2) INFORMATION FOR SEQ ID NO: 405:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 40 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #405: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #    40            GGGC TTTAACTATC TTCGTATTCG                             

     - (2) INFORMATION FOR SEQ ID NO: 406:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 33 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #406: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #         33       CTTC ACTTTCCCTT TCC                                    

     - (2) INFORMATION FOR SEQ ID NO: 407:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 34 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #407: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #        34        CCAG CATCGTTCGC AGGA                                   

     - (2) INFORMATION FOR SEQ ID NO: 408:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 35 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #408: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       35         GAAG AAATAATGAT TGTGG                                  

     - (2) INFORMATION FOR SEQ ID NO: 409:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 32 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #409: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          32      CCTT GAAATCCCAC TT                                     

     - (2) INFORMATION FOR SEQ ID NO: 410:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 39 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #410: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #    39            AAGG TTGTAGAATG ATTTCAATC                              

     - (2) INFORMATION FOR SEQ ID NO: 411:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 35 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #411: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       35         CTTT TTTTGGAGCA ATGTT                                  

     - (2) INFORMATION FOR SEQ ID NO: 412:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 33 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #412: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #         33       TCCG ATACGGGTGA TTG                                    

     - (2) INFORMATION FOR SEQ ID NO: 413:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 33 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #413: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #         33       TCTC AAAGATGTGA AAG                                    

     - (2) INFORMATION FOR SEQ ID NO: 414:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 34 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #414: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #        34        TTTG TCAAGGATTG CTTT                                   

     - (2) INFORMATION FOR SEQ ID NO: 415:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 38 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #415: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #     38           ATCC TATCATCTCT TCCAGAAG                               

     - (2) INFORMATION FOR SEQ ID NO: 416:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 33 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #416: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #         33       ACTA AAAGCCGCAG CTT                                    

     - (2) INFORMATION FOR SEQ ID NO: 417:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 30 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #417: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #           30     TGCA GGGCAGACAG                                        

     - (2) INFORMATION FOR SEQ ID NO: 418:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 36 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #418: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       36         ATAA TCTAGTAAAT AAGCGT                                 

     - (2) INFORMATION FOR SEQ ID NO: 419:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 36 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #419: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       36         GTTT TTAGCGCTTG TTCTTC                                 

     - (2) INFORMATION FOR SEQ ID NO: 420:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 36 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #420: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       36         AGTG TCATAATCTT CTTTAG                                 

     - (2) INFORMATION FOR SEQ ID NO: 421:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 32 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #421: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          32      CGAG CATATTCTGA AG                                     

     - (2) INFORMATION FOR SEQ ID NO: 422:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #422: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          ACCA TTTCTTTGTT CTGCATC                                

     - (2) INFORMATION FOR SEQ ID NO: 423:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 38 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #423: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #     38           GGAT AGCATTCAGA ATCAGACG                               

     - (2) INFORMATION FOR SEQ ID NO: 424:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 32 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #424: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          32      TAAA GACAATTTTT CC                                     

     - (2) INFORMATION FOR SEQ ID NO: 425:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #425: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          AGAT AGTATTATTA CTCAAAC                                

     - (2) INFORMATION FOR SEQ ID NO: 426:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 34 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #426: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #        34        TATT TCTCTCAATT TTTC                                   

     - (2) INFORMATION FOR SEQ ID NO: 427:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 36 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #427: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       36         GAAG CAGACCTATC AAAATC                                 

     - (2) INFORMATION FOR SEQ ID NO: 428:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 43 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #428: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     # 43               TTTT TTAGATTTTT CAATATTTTT CAG                         

     - (2) INFORMATION FOR SEQ ID NO: 429:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 35 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #429: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       35         TGAT AATCGTGTTC AAATG                                  

     - (2) INFORMATION FOR SEQ ID NO: 430:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 31 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #430: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          31      AGAT AGACGTTGAG T                                      

     - (2) INFORMATION FOR SEQ ID NO: 431:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 35 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #431: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       35         CAAT CAGTCAGCTG CTTCC                                  

     - (2) INFORMATION FOR SEQ ID NO: 432:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 40 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #432: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #    40            CTTG TCCCAGGTGG TTAATTTGCC                             

     - (2) INFORMATION FOR SEQ ID NO: 433:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 38 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #433: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #     38           ACAA CAACATGCTA CTTCTGAG                               

     - (2) INFORMATION FOR SEQ ID NO: 434:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 35 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #434: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       35         AACC CACTTATCAT TATCC                                  

     - (2) INFORMATION FOR SEQ ID NO: 435:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 39 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #435: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #    39            AGGC AAGTCCGTGA CTAGTGAAC                              

     - (2) INFORMATION FOR SEQ ID NO: 436:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 33 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #436: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #         33       TTCC TTCAAAGTTT GCA                                    

     - (2) INFORMATION FOR SEQ ID NO: 437:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 39 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #437: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #    39            AATT GAAAAGGCGG CAGTTAGCC                              

     - (2) INFORMATION FOR SEQ ID NO: 438:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 35 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #438: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       35         AAGC GTACCTTTTT CTTCC                                  

     - (2) INFORMATION FOR SEQ ID NO: 439:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 38 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #439: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #     38           GTCA GGTTCTAATG GTTCTCAG                               

     - (2) INFORMATION FOR SEQ ID NO: 440:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 32 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #440: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          32      GGCG TCGCCGTCCT TC                                     

     - (2) INFORMATION FOR SEQ ID NO: 441:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 38 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #441: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #     38           TTCA CAGGATTTTA AAGAGAAG                               

     - (2) INFORMATION FOR SEQ ID NO: 442:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 34 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #442: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #        34        TCCT TCTTCTGCAC TTCT                                   

     - (2) INFORMATION FOR SEQ ID NO: 443:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #443: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          CGAA GTTGAGACTC CTCAATC                                

     - (2) INFORMATION FOR SEQ ID NO: 444:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 31 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #444: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #          31      AAAT TTATTATCAG C                                      

     - (2) INFORMATION FOR SEQ ID NO: 445:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 39 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #445: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #    39            TGTA TATAAAGTTA CAGCAATCG                              

     - (2) INFORMATION FOR SEQ ID NO: 446:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 37 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #446: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #      37          ACCG AATGCGTCGA ATGTACG                                

     - (2) INFORMATION FOR SEQ ID NO: 447:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 34 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #447: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #        34        CAGA TTAATTGAAA TCGG                                   

     - (2) INFORMATION FOR SEQ ID NO: 448:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 34 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #448: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #        34        GATT GAAGTTTTAA AGCT                                   

     - (2) INFORMATION FOR SEQ ID NO: 449:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 35 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #449: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       35         AGAT GAACGGAGCA AGGTG                                  

     - (2) INFORMATION FOR SEQ ID NO: 450:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 35 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #450: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #       35         TTCC TCAATGAGTT TGTCT                                  

     - (2) INFORMATION FOR SEQ ID NO: 451:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 33 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #451: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #         33       GAAT CAAGCTACAC CCA                                    

     - (2) INFORMATION FOR SEQ ID NO: 452:                                     

     -      (i) SEQUENCE CHARACTERISTICS:                                      

     #pairs    (A) LENGTH: 34 base                                             

               (B) TYPE: nucleic acid                                          

               (C) STRANDEDNESS: double                                        

               (D) TOPOLOGY: linear                                            

     #452: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                               

     #        34        CTGA GATTGATCTG GGAG                                   

     __________________________________________________________________________

Claims

1. An isolated nucleic acid molecule comprising a polynucleotide selected from the group consisting of:

(a) a polynucleotide encoding amino acids 1 to 796 of SEQ ID NO:56;
(b) a polynucleotide encoding amino acids 2 to 796 of SEQ ID NO:56;
(c) a polynucleotide encoding an epitope-bearing portion of amino acid residues 1 to 796 of SEQ ID NO:56;
(d) a polynucleotide probe encoding at least 15 contiguous amino acid residues of amino acid residues 1 to 796 of SEQ ID NO:56;
(e) a polynucleotide consisting of at least 50 contiguous nucleotides of nucleotides 1-2389 of SEQ ID NO:55; and
(f) a polynucleotide completely complementary to a polynucleotide in (a), (b), (c), (d), or (e) above.

2. The isolated nucleic acid molecule of claim 1, wherein said polynucleotide is (a).

3. The isolated nucleic acid molecule of claim 1, wherein said polynucleotide is (b).

4. The isolated nucleic acid molecule of claim 1, wherein said polynucleotide is (c).

5. The isolated nucleic acid molecule of claim 1, wherein said polynucleotide is (d).

6. The isolated nucleic acid molecule of claim 5, wherein said polynucleotide encodes at least 30 contiguous amino acid residues of amino acid residues 1 to 796 of SEQ ID NO:56.

7. The isolated nucleic acid molecule of claim 1, wherein said polynucleotide is (e).

8. The isolated nucleic acid molecule of claim 7, wherein said polynucleotide consists of at least 100 contiguous nucleotides of nucleotides 1-2389 of SEQ ID NO:55.

9. The isolated nucleic acid molecule of claim 1, wherein said polynucleotide is (f).

10. The isolated nucleic acid molecule of claim 2, wherein said polynucleotide is SEQ ID NO:55.

11. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Arg-10 to Arg-17 of SEQ ID NO:56.

12. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Lys-29 to Ser-39 of SEQ ID NO:56.

13. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Ser-140 to Ala-153 of SEQ ID NO:56.

14. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Arg-158 to Tyr-169 of SEQ ID NO:56.

15. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Asp-175 to Ala-183 of SEQ ID NO:56.

16. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Gly-216 to Asn-236 of SEQ ID NO:56.

17. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Ala-261 to Leu-270 of SEQ ID NO:56.

18. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Arg-282 to Phe-291 of SEQ ID NO:56.

19. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Thr-297 to Ala-305 of SEQ ID NO:56.

20. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Pro-342 to Gln-362 of SEQ ID NO:56.

21. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Phe-455 to Asp-463 of SEQ ID NO:56.

22. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues His-497 to Thr-511 of SEQ ID NO:56.

23. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Ala-521 to Gly-529 of SEQ ID NO:56.

24. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Ile-537 to Val-546 of SEQ ID NO:56.

25. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Ile-556 to Ala-568 of SEQ ID NO:56.

26. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Pro-581 to Ser-595 of SEQ ID NO:56.

27. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Glu-670 to Ala-685 of SEQ ID NO:56.

28. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Ser-696 to Ala-705 of SEQ ID NO:56.

29. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Leu-782 to Ser-791 of SEQ ID NO:56.

30. A method of producing a polypeptide comprising:

(a) culturing a host cell containing the polynucleotide of claim 4 under conditions suitable to produce a polypeptide encoded by said polynucleotide;
(b) isolating said polypeptide from said host cell.

31. A polypeptide produced by the method of claim 30.

32. A polypeptide produced according to the method of claim 30.

33. The isolated nucleic acid molecule of claim 1, wherein said nucleic acid molecule comprises a heterologous polynucleotide sequence.

34. The isolated nucleic acid molecule of claim 33, wherein said heterologous polynucleotide sequence encodes a heterologous polypeptide.

35. A method for making a recombinant vector comprising inserting the isolated nucleic acid molecule of claim 1 into a vector.

36. A recombinant vector comprising the isolated nucleic acid molecule of claim 1.

37. The recombinant vector of claim 36, wherein said nucleic acid molecule is operably associated with a heterologous regulatory sequence that controls gene expression.

38. A recombinant host cell comprising the isolated nucleic acid molecule of claim 1.

39. The recombinant host cell of claim 38, wherein said nucleic acid molecule is operably associated with a heterologous regulatory sequence that controls gene expression.

40. A method of using the polynucleotide of claim 1 to detect a nucleic acid molecule which hybridizes to said polynucleotide in a biological sample comprising:

(a) obtaining a biological sample suspected of containing said nucleic acid molecule;
(b) contacting said biological sample with said polynucleotide under conditions suitable for hybridization of said polynucleotide to said nucleic acid molecule; and
(c) determining the presence or absence of said nucleic acid molecule in said biological sample.

41. A method of detecting Streptococcus nucleic acids in a biological sample obtained from an animal comprising:

(a) contacting the biological sample with the nucleic acid of claim 1 under conditions such that hybridization occurs, and
(b) detecting hybridization of said nucleic acid to the Streptococcus nucleic acid sequences present in the biological sample, wherein the detection of said hybridization is indicative of the presence of Streptococcus nucleic acids in said biological sample.

42. An isolated polypeptide comprising a member selected from the group consisting of:

(a) a polypeptide of amino acid residues 1 to 796 of SEQ ID NO:56;
(b) a polypeptide of amino acid residues 2 to 796 of SEQ ID NO:56; and
(c) an epitope-bearing portion of a polypeptide consisting of amino acid residues 1 to 796 of SEQ ID NO:56.

43. The isolated polypeptide of claim 42, wherein said polypeptide is (c).

44. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Arg-10 to Arg-17 of SEQ ID NO:56.

45. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Lys-29 to Ser-39 of SEQ ID NO:56.

46. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Ser-140 to Ala-153 of SEQ ID NO:56.

47. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Arg-158 to Tyr-169 of SEQ ID NO:56.

48. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Asp-175 to Ala-183 of SEQ ID NO:56.

49. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Gly-216 to Asn-236 of SEQ ID NO:56.

50. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Ala-261 to Leu-270 of SEQ ID NO:56.

51. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Arg-282 to Phe-291 of SEQ ID NO:56.

52. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Thr-297 to Ala-305 of SEQ ID NO:56.

53. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Pro-342 to Gln-362 of SEQ ID NO:56.

54. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Phe-455 to Asp-463 of SEQ ID NO:56.

55. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues His-497 to Thr-511 of SEQ ID NO:56.

56. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Ala-521 to Gly-529 of SEQ ID NO:56.

57. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Ile-537 to Val-546 of SEQ ID NO:56.

58. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Ile-556 to Ala-568 of SEQ ID NO:56.

59. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Pro-581 to Ser-595 of SEQ ID NO:56.

60. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Glu-670 to Ala-685 of SEQ ID NO:56.

61. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Ser-696 to Ala-705 of SEQ ID NO:56.

62. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Leu-782 to Ser-791 of SEQ ID NO:56.

63. The isolated polypeptide of claim 42, wherein said polypeptide is (b).

64. The isolated polypeptide of claim 42, wherein said polypeptide is fused to a heterologous polypeptide.

65. A method of producing the polypeptide of claim 42 comprising:

(a) culturing a host cell capable of expressing said polypeptide under conditions suitable to produce said polypeptide; and
(b) isolating said polypeptide from host cell.

66. A polypeptide produced according to the method of claim 65.

67. The isolated polypeptide of claim 42, wherein said polypeptide is (a).

68. A method of using the polypeptide of claim 67 to detect an antibody which specifically binds said polypeptide comprising:

(a) obtaining a biological sample containing said antibody;
(b) obtaining a biological sample containing said polypeptide;
(c) contacting said antibody containing sample with said polypeptide containing sample under conditions suitable for said antibody to specifically bind said polypeptide; and
(d) detecting the presence of said antibody bound to said polypeptide.

69. A method of attenuating an infection caused by a member of the Streptococcus genus in an animal, comprising administering to said animal the polypeptide of claim 42, wherein said polypeptide is administered in an amount effective to attenuate said infection.

70. The method of claim 69 wherein the polypeptide comprises amino acid residues 1 to 796 of SEQ ID NO:56.

71. The method of claim 65 wherein the animal is a human.

72. The method of claim 69 wherein the polypeptide comprises an epitope-bearing portion of amino acid residues 1 to 796 of SEQ ID NO:56.

73. The method of claim 72 wherein the animal is a human.

Referenced Cited
Foreign Patent Documents
622081 November 1994 EPX
687688 December 1995 EPX
93/10238 May 1993 WOX
95/06732 March 1995 WOX
95/14712 June 1995 WOX
95/31548 November 1995 WOX
96/05859 February 1996 WOX
96/08582 March 1996 WOX
96/16082 May 1996 WOX
96/33276 October 1996 WOX
97/43303 November 1997 WOX
98/18930 May 1998 WOX
98/26072 June 1998 WOX
Other references
  • Martin et al., The EMBO Journal, 11(11):3831-3836 (1992). Altschul et al., J. Mol. Biol., 215:403-410 (1990). Pearson et al., Proc. Natl. Acad. Sci., 85:2444-2448 (1988).
Patent History
Patent number: 6159469
Type: Grant
Filed: Oct 30, 1997
Date of Patent: Dec 12, 2000
Assignee: Human Genome Sciences, Inc. (Rockville, MD)
Inventors: Gil H. Choi (Rockville, MD), Charles A. Kunsch (Atlanta, GA), Steven C. Barash (Rockville, MD), Patrick J. Dillon (Carlsbad, CA), Brian Dougherty (Killingworth, CT), Michael R. Fannon (Silver Spring, MD), Craig A. Rosen (Laytonsville, MD)
Primary Examiner: James C. Housel
Assistant Examiner: Ja-Na A. Hines
Law Firm: Human Genome Sciences, Inc.
Application Number: 8/961,083
Classifications
Current U.S. Class: 424/1651; 424/1651; 424/1841; 424/19711; 424/2001; 424/2441; Preparing Compound Containing Saccharide Radical (435/72); 435/734; Streptococcus; Staphylococcus (435/36); 435/693; 435/2523; 435/2534; Binds A Bacterium Or Similar Microorganism Or Component Or Product Thereof (e.g., Streptococcus, Legionella, Mycoplasma, Bacterium Associated Antigen, Exotoxin, Etc.) (435/340); 435/3884; 536/243; 536/2432; 536/253
International Classification: A61K 3940; A61K 3900; A61K 39385; A61K 3902;