Nucleic acids and proteins from Streptococcus pneumoniae
Novel proteins from Streptococcus pneumoniae are described, together with nucleic acid sequences encoding them. Their use in vaccines and in screening methods is also described.
[0001] The present invention relates to proteins derived from Streptococcus pneumoniae, nucleic acid molecules encoding such proteins, the use of the nucleic acid and/or proteins as antigens/immunogens and in detection/diagnosis, as well as methods for screening the proteins/nucleic acid sequences as potential anti-microbial targets.
[0002] Streptococcus pneumoniae, commonly referred to as the pneumococcus, is an important pathogenic organism. The continuing significance of Streptoccocus pneumoniae infections in relation to human disease in developing and developed countries has been authoritatively reviewed (Fiber, G. R., Science, 265: 1385-1387 (1994)). That indicates that on a global scale this organism is believed to be the most common bacterial cause of acute respiratory infections, and is estimated to result in 1 million childhood deaths each year, mostly in developing countries (Stansfield, S. K., Pediatr. Infect. Dis., 6: 622 (1987)). In the USA it has been suggested (Breiman et al, Arch. Intern. Med., 150: 1401 (1990)) that the pneumococcus is still the most common cause of bacterial pneumonia, and that disease rates are particularly high in young children, in the elderly, and in patients with predisposing conditions such as asplenia, heart, lung and kidney disease, diabetes, alcoholism, or with immunosupressive disorders, especially AIDS. These groups are at higher risk of pneumococcal septicaemia and hence meningitis and therefore have a greater risk of dying from pneumococcal infection. The pneumococcus is also the leading cause of otitis media and sinusitis, which remain prevalent infections in children in developed countries, and which incur substantial costs.
[0003] The need for effective preventative strategies against pneumococcal infection is highlighted by the recent emergence of penicillin-resistant pneumococci. It has been reported that 6.6% of pneumoccal isolates in 13 US hospitals in 12 states were found to be resistant to penicillin and some isolates were also resistant to other antibiotics including third generation cyclosporins (Schappert, S. M., Vital and Health Statistics of the Centres for Disease Control/National Centre for Health Statistics, 214:1 (1992)). The rates of penicillin resistance can be higher (up to 20%) in some hospitals (Breiman et al, J. Am. Med. Assoc., 271: 1831 (1994)). Since the development of penicillin resistance among pneumococci is both recent and sudden, coming after decades during which penicillin remained an effective treatment, these findings are regarded as alarming.
[0004] For the reasons given above, there are therefore compelling grounds for considering improvements in the means of preventing, controlling, diagnosing or treating pneumococcal diseases.
[0005] Various approaches have been taken in order to provide vaccines for the prevention of pneumococcal infections. Difficulties arise for instance in view of the variety of serotypes (at least 90) based on the structure of the polysaccharide capsule surrounding the organism. Vaccines against individual serotypes are not effective against other serotypes and this means that vaccines must include polysaccharide antigens from a whole range of serotypes in order to be effective in a majority of cases. An additional problem arises because it has been found that the capsular polysaccharides (each of which determines the serotype and is the major protective antigen) when purified and used as a vaccine do not reliably induce protective antibody responses in children under two years of age, the age group which suffers the highest incidence of invasive pneumococcal infection and meningitis.
[0006] A modification of the approach using capsule antigens relies on conjugating the polysaccharide to a protein in order to derive an enhanced immune response, particularly by giving the response T-cell dependent character. This approach has been used in the development of a vaccine against Haemophilus influenzae, for instance. There are, however, issues of cost concerning both the multi-polysaccharide vaccines and those based on conjugates.
[0007] A third approach is to look for other antigenic components which offer the potential to be vaccine candidates. This is the basis of the present invention. Using a specially developed bacterial expression system, we have been able to identify a group of protein antigens from pneomococcus which are associated with the bacterial envelope or which are secreted.
[0008] Thus, in a first aspect the present invention provides a Streptococcus pneumoniae protein or polypeptide having a sequence selected from those shown in table 1.
[0009] In a second aspect, the present invention provides a Streptococcus pneumoniae protein or polypeptide having a sequence selected from those shown in table 2.
[0010] A protein or polypeptide of the present invention may be provided in substantially pure form. For example, it may be provided in a form which is substantially free of other proteins.
[0011] As discussed herein, the proteins and polypeptides of the invention are useful as antigenic material. Such material can be “antigenic” and/or “immunogenic”,. Generally, “antigenic” is taken to mean that the protein or polypeptide is capable of being used to raise antibodies or indeed is capable of inducing an antibody response in a subject. “Immunogenic” is taken to mean that the protein or polypeptide is capable of eliciting a protective immune response in a subject. Thus, in the latter case, the protein or polypeptide may be capable of not only generating an antibody response but, in addition, a non-antibody based immune response.
[0012] The skilled person will appreciate that homologues or derivatives of the proteins or polypeptides of the invention will also find use in the context of the present invention, ie as antigenic/immunogenic material. Thus, for instance proteins or polypeptides which include one or more additions, deletions, substitutions or the like are encompassed by the present invention. In addition, it may be possible to replace one amino acid with another of similar “type”. For instance replacing one hydrophobic amino acid with another.
[0013] One can use a program such as the CLUSTAL program to compare amino acid sequences. This program compares amino acid sequences and finds the optimal alignment by inserting spaces in either sequence as appropriate. It is possible to calculate amino acid identity or similarity (identity plus conservation of amino acid type) for an optimal alignment. A program like BLASTx will align the longest stretch of similar sequences and assign a value to the fit. It is thus possible to obtain a comparison where several regions of similarity are found, each having a different score. Both types of identity analysis are contemplated in the present invention.
[0014] In the case of homologues and derivatives, the degree of identity with a protein or polypeptide as described herein is less important than that the homologue or derivative should retain the antigenicity or immunogenicity of the original protein or polypeptide. However, suitably, homologues or derivatives having at least 60% similarity (as discussed above) with the proteins or polypeptides described herein are provided. Preferably, homologues or derivatives having at least 70% similarity, more preferably at least 80% similarity are provided. Most preferably, homologues or derivatives having at least 90% or even 95% similarity are provided.
[0015] In an alternative approach, the homologues or derivatives could be fusion proteins, incorporating moieties which render purification easier, for example by effectively tagging the desired protein or polypeptide. It may be necessary to remove the “tag” or it may be the case that the fusion protein itself retains sufficient antigenicity to be useful.
[0016] In an additional aspect of the invention there are provided antigenic/immunogenic fragments of the proteins or polypeptides of the invention, or of homologues or derivatives thereof.
[0017] For fragments of the proteins or polypeptides described herein, or of homologues or derivatives thereof, the situation is slightly different. It is well known that is possible to screen an antigenic protein or polypeptide to identify epitopic regions, ie those regions which are responsible for the protein or polypeptide's antigenicity or immunogenicity. Methods for carrying out such screening are well known in the art. Thus; the fragments of the present invention should include one or more such epitopic regions or be sufficiently similar to such regions to retain their antigenic/immunogenic properties. Thus, for fragments according to the present invention the degree of identity is perhaps irrelevant, since they may be 100% identical to a particular part of a protein or polypeptide, homologue or derivative as described herein. The key issue, once again, is that the fragment retains the antigenic/immunogenic properties.
[0018] Thus, what is important for homologues, derivatives and fragments is that they possess at least a degree of the antigenicity/immunogenicity of the protein or polypeptide from which they are derived.
[0019] Gene cloning techniques may be used to provide a protein of the invention in substantially pure form. These techniques are disclosed, for example, in J. Sambrook et al Molecular Cloning 2nd Edition, Cold Spring Harbor Laboratory Press (1989). Thus, in a third aspect, the present invention provides a nucleic acid molecule comprising or consisting of a sequence which is:
[0020] (i) any of the DNA sequences set out in Table 1 or their RNA equivalents;
[0021] (ii) a sequence which is complementary to any of the sequences of (i);
[0022] (iii) a sequence which codes for the same protein or polypeptide, as those sequences of (i) or (ii);
[0023] (iv) a sequence which has substantial identity with any of those of (i), (ii) and (iii);
[0024] (v) a sequence which codes for a homologue, derivative or fragment of a protein as defined in Table 1.
[0025] In a fourth aspect the present invention provides a nucleic acid molecule comprising or consisting of a sequence which is:
[0026] (i) any of the DNA sequences set out in Table 2 or their RNA equivalents;
[0027] (ii) a sequence which is complementary to any of the sequences of (i);
[0028] (iii) a sequence which codes for the same protein or polypeptide, as those sequences of (i) or (ii);
[0029] (iv) a sequence which has substantial identity with any of those of (i), (ii) and (iii); or
[0030] (v) a sequence which codes for a homologue, derivative or fragment of a protein as defined in Table 2.
[0031] The nucleic acid molecules of the invention may include a plurality of such sequences, and/or fragments. The skilled person will appreciate that the present invention can include novel variants of those particular novel nucleic acid molecules which are exemplified herein. Such variants are encompassed by the present invention. These may occur in nature, for example because of strain variation. For example, additions, substitutions and/or deletions are included. In addition, and particularly when utilising microbial expression systems, one may wish to engineer the nucleic acid sequence by making use of known preferred codon usage in the particular organism being used for expression. Thus, synthetic or non-naturally occurring variants are also included within the scope of the invention.
[0032] The term “RNA equivalent” when used above indicates that a given RNA molecule has a sequence which is complementary to that of a given DNA molecule (allowing for the fact that in RNA “U” replaces “T” in the genetic code).
[0033] When comparing nucleic acid sequences for the purposes of determining the degree of homology or identity one can use programs such as BESTFIT and GAP (both from the Wisconsin Genetics Computer Group (GCG) software package) BESTFIT, for example, compares two sequences and produces an optimal alignment of the most similar segments. GAP enables sequences to be aligned along their whole length and finds the optimal alignment by inserting spaces in either sequence as appropriate. Suitably, in the context of the present invention when discussing identity of nucleic acid sequences, the comparison is made by alignment of the sequences along their whole length.
[0034] Preferably, sequences which have substantial identity have at least 50% sequence identity, desirably at least 75% sequence identity and more desirably at least 90 or at least 95% sequence identity with said sequences. In some cases the sequence identity may be 99% or above.
[0035] Desirably, the term “substantial identity” indicates that said sequence has a greater degree of identity with any of the sequences described herein than with prior art nucleic acid sequences.
[0036] It should however be noted that where a nucleic acid sequence of the present invention codes for at least part of a novel gene product the present invention includes within its scope all possible sequence coding for the gene product or for a novel part thereof.
[0037] The nucleic acid molecule may be in isolated or recombinant form. It may be incorporated into a vector and the vector may be incorporated into a host. Such vectors and suitable hosts form yet further aspects of the present invention.
[0038] Therefore, for example, by using probes based upon the nucleic acid sequences provided herein, genes in Streptococcus pneumoniae can be identified. They can then be excised using restriction enzymes and cloned into a vector. The vector can be introduced into a suitable host for expression.
[0039] Nucleic acid molecules of the present invention may be obtained from S. pneumoniae by the use of appropriate probes complementary to part of the sequences of the nucleic acid molecules. Restriction enzymes or sonication techniques can be used to obtain appropriately sized fragments for probing.
[0040] Alternatively PCR techniques may be used to amplify a desired nucleic acid sequence. Thus the sequence data provided herein can be used to design two primers for use in PCR so that a desired sequence, including whole genes or fragments thereof, can be targeted and then amplified to a high degree.
[0041] Typically primers will be at least 15-25 nucleotides long.
[0042] As a further alternative chemical synthesis may be used. This may be automated. Relatively short sequences may be chemically synthesised and ligated together to provide a longer sequence.
[0043] There is another group of proteins from S. pneumoniae which have been identified using the bacterial expression system described herein. These are known proteins from S. pneumoniae, which have not previously been identified as antigenic proteins. The amino acid sequences of this group of proteins, together with DNA sequences coding for them are shown in Table 3. These proteins, or homologues, derivatives and/or fragments thereof also find use as antigens/immunogens. Thus, in another aspect the present invention provides the use of a protein or polypeptide having a sequence selected from those shown in Tables 1-3, or homologues, derivatives and/or fragments thereof, as an immunogen/antigen.
[0044] In yet a further aspect the present invention provides an immunogenic/antigenic composition comprising one or more proteins or polypeptides selected from those whose sequences are shown in Tables 1-3, or homologues or derivatives thereof, and/or fragments of any of these. In preferred embodiments, the immunogenic/antigenic composition is a vaccine or is for use in a diagnostic assay.
[0045] In the case of vaccines suitable additional excipients, diluents, adjuvants or the like may be included. Numerous examples of these are well known in the art.
[0046] It is also possible to utilise the nucleic acid sequences shown in Tables 1-3 in the preparation of so-called DNA vaccines. Thus, the invention also provides a vaccine composition comprising one or more nucleic acid sequences as defined herein. DNA vaccines are described in the art (see for instance, Donnelly et al, Ann. Rev. Immunol., 15:617-648 (1997)) and the skilled person can use such art described techniques to produce and use DNA vaccines according to the present invention.
[0047] As already discussed herein the proteins or polypeptides described herein, their homologues or derivatives, and/or fragments of any of these, can be used in methods of detecting/diagnosing S. pneumoniae. Such methods can be based on the detection of antibodies against such proteins which may be present in a subject. Therefore the present invention provides a method for the detection/diagnosis of S. pneumoniae which comprises the step of bringing into contact a sample to be tested with at least one protein, or homologue, derivative or fragment thereof, as described herein. Suitably, the sample is a biological sample, such as a tissue sample or a sample of blood or saliva obtained from a subject to be tested.
[0048] In an alternative approach, the proteins described herein, or homologues, derivatives and/or fragments thereof, can be used to raise antibodies, which in turn can be used to detect the antigens, and hence S. pneumoniae. Such antibodies form another aspect of the invention. Antibodies within the scope of the present invention may be monoclonal or polyclonal.
[0049] Polyclonal antibodies can be raised by stimulating their production in a suitable animal host (e.g. a mouse, rat, guinea pig, rabbit, sheep, goat or monkey) when a protein as described herein, or a homologue, derivative or fragment thereof, is injected into the animal. If desired, an adjuvant may be administered together with the protein. Well-known adjuvants include Freund's adjuvant (complete and incomplete) and aluminium hydroxide. The antibodies can then be purified by virtue of their binding to a protein as described herein.
[0050] Monoclonal antibodies can be produced from hybridomas. These can be formed by fusing myeloma cells and spleen cells which produce the desired antibody in order to form an immortal cell line. Thus the well-known Kohler & Milstein technique (Nature 256 (1975)) or subsequent variations upon this technique can be used.
[0051] Techniques for producing monoclonal and polyclonal antibodies that bind to a particular polypeptide/protein are now well developed in the art. They are discussed in standard immunology textbooks, for example in Roitt et al, Immunology second edition (1989), Churchill Livingstone, London.
[0052] In addition to whole antibodies, the present invention includes derivatives thereof which are capable of binding to proteins etc as described herein. Thus the present invention includes antibody fragments and synthetic constructs. Examples of antibody fragments and synthetic constructs are given by Dougall et al in Tibtech 12 372-379 (September 1994).
[0053] Antibody fragments include, for example, Fab, F(ab′)2 and Fv fragments. Fab fragments (These are discussed in Roitt et al [supra]). Fv fragments can be modified to produce a synthetic construct known as a single chain Fv (scFv) molecule. This includes a peptide linker covalently joining Vh and Vl regions, which contributes to the stability of the molecule. Other synthetic constructs that can be used include CDR peptides. These are synthetic peptides comprising antigen-binding determinants. Peptide mimetics may also be used. These molecules are usually conformationally restricted organic rings that mimic the structure of a CDR loop and that include antigen-interactive side chains.
[0054] Synthetic constructs include chimaeric molecules. Thus, for example, humanised (or primatised) antibodies or derivatives thereof are within the scope of the present invention. An example of a humanised antibody is an antibody having human framework regions, but rodent hypervariable regions. Ways of producing chimaeric antibodies are discussed for example by Morrison et al in PNAS, 81, 6851-6855 (1984) and by Takeda et al in Nature. 314, 452454 (1985).
[0055] Synthetic constructs also include molecules comprising an additional moiety that provides the molecule with some desirable property in addition to antigen binding. For example the moiety may be a label (e.g. a fluorescent or radioactive label). Alternatively, it may be a pharmaceutically active agent.
[0056] Antibodies, or derivatives thereof, find use in detection/diagnosis of S. pneumoniae. Thus, in another aspect the present invention provides a method for the detection/diagnosis of S. pneumoniae which comprises the step of bringing into contact a sample to be tested and antibodies capable of binding to one or more proteins described herein, or to homologues, derivatives and/or fragments thereof.
[0057] In addition, so-called “Affibodies” may be utilised. These are binding proteins selected from combinatorial libraries of an alpha-helical bacterial receptor domain (Nord et al,) Thus, Small protein domains, capable of specific binding to different target proteins can be selected using combinatorial approaches.
[0058] It will also be clear that the nucleic acid sequences described herein may be used to detect/diagnose S. pneumoniae. Thus, in yet a further aspect, the present invention provides a method for the detection/diagnosis of S. pneumoniae which comprises the step of bringing into contact a sample to be tested with at least one nucleic acid sequence as described herein. Suitably, the sample is a biological sample, such as a tissue sample or a sample of blood or saliva obtained from a subject to be tested. Such samples may be pretreated before being used in the methods of the invention. Thus, for example, a sample may be treated to extract DNA. Then, DNA probes based on the nucleic acid sequences described herein (ie usually fragments of such sequences) may be used to detect nucleic acid from S. pneumoniae.
[0059] In additional aspects, the present invention provides:
[0060] (a) a method of vaccinating a subject against S. pneumoniae which comprises the step of administering to a subject a protein or polypeptide of the invention, or a derivative, homologue or fragment thereof, or an immunogenic composition of the invention;
[0061] (b) a method of vaccinating a subject against S. pneumoniae which comprises the step of administering to a subject a nucleic acid molecule as defined herein;
[0062] (c) a method for the prophylaxis or treatment of S. pneumoniae infection which comprises the step of administering to a subject a protein or polypeptide of the invention, or a derivative, homologue or fragment thereof, or an immunogenic composition of the invention;
[0063] (d) a method for the prophylaxis or treatment of S. pneumoniae infection which comprises the step of administering to a subject a nucleic acid molecule as defined herein;
[0064] (e) a kit for use in detecting/diagnosing S. pneumoniae infection comprising one or more proteins or polypeptides of the invention, or homologues, derivatives or fragments thereof, or an antigenic composition of the invention; and
[0065] (f) a kit for use in detecting/diagnosing S. pneumoniae infection comprising one or more nucleic acid molecules as defined herein.
[0066] Given that we have identified a group of important proteins, such proteins are potential targets for anti-microbial therapy. It is necessary, however, to determine whether each individual protein is essential for the organism's viability. Thus, the present invention also provides a method of determining whether a protein or polypeptide as described herein represents a potential anti-microbial target which comprises antagonising, inhibiting or otherwise interfering with the function or a expression of said protein and determining whether S. pneumoniae is still viable.
[0067] A suitable method for inactivating the protein is to effect selected gene knockouts, ie prevent expression of the protein and determine whether this results in a lethal change. Suitable methods for carrying out such gene knockouts are described in Li et al, P.N.A.S., 94:13251-13256 (1997) and Kolkman et al, 178:3736-3741(1996).
[0068] In a final aspect the present invention provides the use of an agent capable of antagonising, inhibiting or otherwise interfering with the function or expression of a protein or polypeptide of the invention in the manufacture of a medicament for use in the treatment or prophylaxis of S. pneumoniae infection.
[0069] As mentioned above, we have used a bacterial expression system as a means of identifying those proteins which are surface associated, secreted or exported and thus, would find use as antigens.
[0070] The information necessary for the secretion/export of proteins has been extensively studied in bacteria. In the majority of cases, protein export requires a signal peptide to be present at the N-terminus of the precursor protein so that it becomes directed to the translocation machinery on the cytoplasmic membrane. During or after translocation, the signal peptide is removed by a membrane associated signal peptidase. Ultimately the localization of the protein (i.e. whether it be secreted, an integral membrane protein or attached to the cell wall) is determined by sequences other than the leader peptide itself.
[0071] We are specifically interested in surface located or exported proteins as these are likely to be antigens for use in vaccines, as diagnostic reagents or as targets for therapy with novel chemical entities. We have therefore developed a screening vector-system in Lactococcus lactis that permits genes encoding exported proteins to be identified and isolated. We provide below a representative example showing how given novel surface associated proteins from Streptococcus pneumoniae have been identified and characterized. The screening vector incorporates the staphylococcal nuclease gene nuc lacking its own export signal as a secretion reporter. Staphylococcal nuclease is a naturally secreted heat-stable, monomeric enzyme which has been efficiently expressed and secreted in a range of Gram positive bacteria (Shortle, Gene, 22:181-189 (1983); Kovacevic et al., J. Bacteriol., 162:521-528 (1985); Miller et al., J. Bacteriol., 169:3508-3514 (1987); Liebl et al., J. Bacterol., 174:1854-1861 (1992); Le Loir et al., J. Bacteriol., 176:5135-5139 (1994); Poquet et al., J. Bacteriol., 180:1904-1912 (1998)).
[0072] Recently, Poquet et al. ((1998), supra) have described a screening vector incorporating the nuc gene lacking its own signal leader as a reporter to identify exported proteins in Gram positive bacteria, and have applied it to L. lactis. This vector (pFUN) contains the pAM&bgr;1 replicon which functions in a broad host range of Gram-positive bacteria in addition to the ColE1 replicon that promotes replication in Escherichia coli and certain other Gram negative bacteria. Unique cloning sites present in the vector can be used to generate transcriptional and translational fusions between cloned genomic DNA fragments and the open reading frame of the truncated nuc gene devoid of its own signal secretion leader. The nuc gene makes an ideal reporter gene because the secretion of nuclease can readily be detected using a simple and sensitive plate test: Recombinant colonies secreting the nuclease develop a pink halo whereas control colonies remain white (Shortle, (1983), supra; Le Loir et al., (1994), supra).
[0073] Thus, the invention will now be described with reference to the following representative example, which provides details of how the proteins, polypeptides and nucleic acid sequences described herein identified as antigenic targets.
[0074] We describe herein the construction of three reporter vectors and their use in L. lactis to identify and isolate genomic DNA fragments from Streptococcus pneumoniae encoding secreted or surface associated proteins.
[0075] The invention will now be described with reference to the examples, which should not be construed as in any way limiting the invention. The examples refer to the figures in which:
[0076] FIG. 1: shows the results of a number of DNA vaccine trials; and
[0077] FIG. 2: shows the results of further DNA vaccine trials.
EXAMPLE 1[0078] (i) Construction of the pTREP1-nuc Series of Reporter Vectors
[0079] (a) Construction of Expression Plasmid PTREP1
[0080] The pTREP1 plasmid is a high-copy number (40-80 per cell) theta-replicating gram positive plasmid, which is a derivative of the pTREX plasmid which is itself a derivative of the previously published pIL253 plasmid. pIL253 incorporates the broad Gram-positive host range replicon of pAM&bgr;1 (Simon and Chopin, Biochimie, 70:559-567 (1988)) and is non-mobilisable by the L lactis sex-factor. pIL253 also lacks the tra function which is necessary for transfer or efficient mobilisation by conjugative parent plasmids exemplified by pIL501. The Enterococcal pAM&bgr;1 replicon has previously been transferred to various species including Streptococcus, Lactobacillus and Bacillus species as well as Clostridium acetobutylicum, (Oultram and Klaenhammer, FEMS Microbiological Letters, 27:129-134 (1985); Gibson et al., (1979); LeBlanc et al., Proceedings of the National Academy of Science USA, 75:3484-3487 (1978)) indicating the potential broad host range utility. The pTREP1 plasmid represents a constitutive transcription vector.
[0081] The pTREX vector was constructed as follows. An artificial DNA fragment containing a putative RNA stabilising sequence, a translation initiation region (TIR), a multiple cloning site for insertion of the target genes and a transcription terminator was created by annealing 2 complementary oligonucleotides and extending with Tfl DNA polymerase. The sense and anti-sense oligonucleotides contained the recognition sites for NheI and BamHI at their 5′ ends respectively to facilitate cloning. This fragment was cloned between the XbaI and BamHI sites in pUC19NT7, a derivative of pUC19 which contains the T7 expression cassette from pLET1 (Wells et al, J. Appl. Bacteriol., 74:629-636 (1993)) cloned between the EcoRI and HindIII sites. The resulting construct was designated pUCLEX. The complete expression cassette of pUCLEX was then removed by cutting with HindIII and blunting followed by cutting with EcoRI before cloning into EcoRI and SacI (blunted) sites of pIL253 to generate the vector pTREX (Wells and Schofield, In Current advances in metabolism, genetics and applications-NATO ASI Series, H 98:37-62 (1996)). The putative RNA stabilising sequence and TIR are derived from the Escherichia coli T7 bacteriophage sequence and modified at one nucleotide position to enhance the complementarity of the Shine Dalgarno (SD) motif to the ribosomal 16s RNA of Lactococcus lactis (Schofield et al. pers. corns. University of Cambridge Dept. Pathology.).
[0082] A Lactococcus lactis MG1363 chromosomal DNA fragment exhibiting promoter activity which was subsequently designated P7 was cloned between the EcoRI and BglII sites present in the expression cassette, creating pTREX7. This active promoter region had been previously isolated using the promoter probe vector pSB292 (Waterfield et al, Gene, 165:9-15 (1995)). The promoter fragment was amplified by PCR using the Vent DNA polymerase according to the manufacturer.
[0083] The pTREP1 vector was then constructed as follows. An artificial DNA fragment which included a transcription terminator, the forward pUC sequencing primer, a promoter multiple -cloning site region and a universal translation stop sequence was created by annealing two overlapping partially complementary synthetic oligonucleotides together and extending with sequenase according to manufacturers instructions. The sense and anti-sense (pTREPF and pTREPR) oligonucleotides contained the recognition sites for EcoRV and BamHI at their 5′ ends respectively to facilitate cloning into pTREX7. The transcription terminator was that of the Bacillus penicillinase gene, which has been shown to be effective in Lactococcus (Jos et al., Applied and Environmental Microbiology, 50:540-542 (1985)). This was considered necessary as expression of target genes in the pTREX vectors was observed to be leaky and is thought to be the result of cryptic promoter activity in the origin region (Schofield et al. pers. coms. University of Cambridge Dept. Pathology.). The forward pUC primer sequencing was included to enable direct sequencing of cloned DNA fragments. The translation stop sequence which encodes a stop codon in 3 different frames was included to prevent translational fusions between vector genes and cloned DNA fragments The pTREX7 vector was first digested with EcoRI and blunted using the 5′-3′ polymerase activity of T4 DNA polymerase (NEB) according to manufacturer's instructions. The EcoRI digested and blunt ended pTREX7 vector was then digested with Bgl II thus removing the P7 promoter. The artificial DNA fragment derived from the annealed synthetic oligonucleotides was then digested with EcoRV and Bam HI and cloned into the EcoRI(blunted)-Bgl II digested pTREX7 vector to generate pTREP. A Lactococcus lactis MG1363 chromosomal promoter designated P1 was then cloned between the EcoRI and BglII sites present in the pTREP expression cassette forming pTREP1. This promoter was also isolated using the promoter probe vector pSB292 and characterised by Waterfield et al., (1995), supra. The P1 promoter fragment was originally amplified by PCR using vent DNA polymerase according to manufacturers instructions and cloned into the pTREX as an EcoRI-BglII DNA fragment. The EcoRI-BglII P1 promoter containing fragment was removed from pTREX1 by restriction enzyme digestion and used for cloning into pTREP (Schofield et al. pers. coms. University of Cambridge, Dept. Pathology.).
[0084] (b) PCR Amplification of the S. aureus nuc Gene.
[0085] The nucleotide sequence of the S. aureus nuc gene (EMBL database accession number V01281) was used to design synthetic oligonucleotide primers for PCR amplification. The primers were designed to amplify the mature form of the nuc gene designated nucA which is generated by proteolytic cleavage of the N-terminal 19 to 21 amino acids of the secreted propeptide designated Snase B (Shortle, (1983), supra). Three sense primers (nucS1, nucS2 and nucS3, Appendix 1) were designed, each one having a blunt-ended restriction endonuclease cleavage site for EcoRV or SmaI in a different reading frame with respect to the nuc gene. Additionally BglII and BamHI were incorporated at the 5′ ends of the sense and anti-sense primers respectively to facilitate cloning into BamHI and BglII cut pTREP1. The sequences of all the primers are given in Appendix 1. Three nuc gene DNA fragments encoding the mature form of the nuclease gene (NucA) were amplified by PCR using each of the sense primers combined with the anti-sense primer described above. The nuc gene fragments were amplified by PCR using S. aureus genomic DNA template, Vent DNA Polymerase (NEB) and the conditions recommended by the manufacturer. An initial denaturation step at 93° C. for 2 min was followed by 30 cycles of denaturation at 93° C. for 45 sec, annealing at 50° C. for 45 seconds, and extension at 73° C. for 1 minute and then a final 5 Tin extension step at 73° C. The PCR amplified products were purified using a Wizard clean up column (Promega) to remove unincorporated nucleotides and primers.
[0086] (c) Construction of the pTREP1-nuc Vectors
[0087] The purified nuc gene fragments described in section b were digested with Bgl II and BamHI using standard conditions and ligated to BamHI and BglII cut and dephosphorylated pTREP1 to generate the pTREP1-nuc1, pTREP1-nuc2 and pTREP1-nuc3 series of reporter vectors. General molecular biology techniques were carried out using the reagents and buffer supplied by the manufacture or using standard conditions (Sambrook and Maniatis, (1989), supra). In each of the pTREP1-nuc vectors the expression cassette comprises a transcription terminator, lactococcal promoter P1, unique cloning sites (BglII, EcoRV or SmaI) followed by the mature form of the nuc gene and a second transcription terminator. Note that the sequences required for translation and secretion of the nuc gene were deliberately excluded in this construction. Such elements can only be provided by appropriately digested foreign DNA fragments (representing the target bacterium) which can be cloned into the unique restriction sites present immediately upstream of the nuc gene.
[0088] In possessing a promoter, the pTREP1-nuc vectors differ from the pFUN vector described by Poquet et al. (1998), supra, which was used to identify L. lactis exported proteins by screening directly for Nuc activity directly in L. lactis. As the pFUN vector does not contain a promoter upstream of the nuc open reading frame the cloned genomic DNA fragment must also provide the signals for transcription in addition to those elements required for translation initiation and secretion of Nuc. This limitation may prevent the isolation of genes that are distant from a promoter for example genes which are within polycistronic operons. Additionally there can be no guarantee that promoters derived from other species of bacteria will be recognised and functional in L. lactis. Certain promoters may be under stringent regulation in the natural host but not in L. lactis. In contrast, the presence of the P1 promoter in the pTREP1-nuc series of vectors ensures that promoterless DNA fragments (or DNA fragments containing promoter sequences not active in L. lactis) will still be transcribed.
[0089] (d) Screening for Secreted Proteins in S. pneumoniae
[0090] Genomic DNA isolated from S. pneumoniae was digested with the restriction enzyme Tru9I. This enzyme which recognises the sequence 5′-TTAA -3′ was used because it cuts A/T rich genomes efficiently and can generate random genomic DNA fragments within the preferred size range (usually averaging 0.5-1.0 kb). This size range was preferred because there is an increased probability that the PI promoter can be utilised to transcribe a novel gene sequence. However, the PI promoter may not be necessary in all cases as it is possible that many Streptococcal promoters are recognised in L. lactis. DNA fragments of different size ranges were purified from partial Tru9I digests of S. pneumoniae genomic DNA. As the Tru 9I restriction enzyme generates staggered ends the DNA fragments had to be made blunt ended before ligation to the EcoRV or SmaI cut pTREP1-nuc vectors. This was achieved by the partial fill-in enzyme reaction using the 5′-3′ polymerase activity of Klenow enzyme. Briefly Tru9I digested DNA was dissolved in a solution (usually between 10-20 &mgr;l in total) supplemented with T4 DNA ligase buffer (New England Biolabs; NEB) (1×) and 33 &mgr;M of each of the required dNTPs, in this case dATP and dTTP. Klenow enzyme was added (1 unit Klenow enzyme (NEB) per &mgr;g of DNA) and the reaction incubated at 25° C. for 15 minutes. The reaction was stopped by incubating the mix at 75° C. for 20 minutes. EcoRV or SmaI digested pTREP-nuc plasmid DNA was then added (usually between 200400 ng). The mix was then supplemented with 400 units of T4 DNA ligase (NEB) and T4 DNA ligase buffer (1×) and incubated overnight at 16° C. The ligation mix was precipitated directly in 100% Ethanol and {fraction (1/10)} volume of 3M sodium acetate (pH 5.2) and used to transform L. lactis MG1363 (Gasson, 1983). Alternatively, the gene cloning site of the pTREP-nuc vectors also contains a BglII site which can be used to clone for example Sau3AI digested genomic DNA fragments.
[0091] L. lactis transformant colonies were grown on brain heart infusion agar and nuclease secreting (Nuc+) clones were detected by a toluidine blue-DNA-agar overlay (0.05 M Tris pH 9.0, 10 g of agar per liter, 10 g of NaCl per liter, 0.1 mM CaCl2, 0.03% wt/vol. salmon sperm DNA and 90 mg of Toluidine blue 0 dye) essentially as -described by Shortle, 1983, supra and Le Loir et al., 1994, supra). The plates were then incubated at 37° C. for up to 2 hours. Nuclease secreting clones develop an easily identifiable pink halo. Plasmid DNA was isolated from Nuc+ recombinant L. lactis clones and DNA inserts were sequenced on one strand using the NucSeq sequencing primer described in Appendix 1, which sequences directly through the DNA insert.
[0092] Isolation of Genes Encoding Exported Proteins from S. pneumoniae
[0093] A large number of gene sequences putatively encoding exported proteins in S. pneumoniae have been identified using the nuclease screening system. These have now been further analysed to remove artefacts. The sequences identified using the screening system have been analysed using a number of parameters.
[0094] 1. All putative surface proteins were analysed for leader/signal peptide sequences using the software programs Sequencher (Gene Codes Corporation) and DNA Strider (Marck, Nucleic Acids Res., 16:1829-1836 (1988)). Bacterial signal peptide sequences share a common design. They are characterised by a short positively charged N-terminus (N region) immediately preceding a stretch of hydrophobic residues (central portion-h region) followed by a more polar C-terminal portion which contains the cleavage site &agr;-region). Computer software is available which allows hydropathy profiling of putative proteins and which can readily identify the very distinctive hydrophobic portion (h-region) typical of leader peptide sequences. In addition, the sequences were checked for the presence of or absence of a potential ribosomal binding site (Shine-Dalgarno motif) required for translation initiation of the putative nuc reporter fusion protein.
[0095] 2. All putative surface protein sequences were also matched with all of the protein/DNA sequences using the publicly databases [OWL-proteins inclusive of SwissProt and GenBank translations]. This allows us to identify sequences similar to known genes or homologues of genes for which some function has been ascribed. Hence it has been possible to predict a function for some of the genes identified using the LEEP system and to unequivocally establish that the system can be used to identify and isolate gene sequences of surface associated proteins. We should also be able to confirm that these proteins are indeed surface related and not artifacts. The LEEP system has been used to identify novel gene targets for vaccine and therapy.
[0096] 3. Some of the genes identified proteins did not possess a typical leader peptide sequence and did not show homology with any DNA/protein sequences in the database. Indeed these proteins may indicate the primary advantage of our screening method, i.e. the isolation of atypical surface-related proteins, which may have been missed in all previously described screening protocols or approaches based on sequence homology searches.
[0097] In all cases, only partial gene sequences were initially obtained. Full length genes were obtained in all cases by reference to the TIGR S. pneumoniae database (www@tigr.org). Thus, by matching the originally obtained partial sequences with the database, we were able to identify the full length gene sequences. In this way, as described herein, three groups of genes were clearly identified, ie a group of genes encoding previously unidentified S. pneumoniae proteins, a second group exhibiting some homology with known proteins from a variety of sources and a third group which encoded known S. pneumoniae proteins, which were, however, not known as antigens.
EXAMPLE 2 Vaccine Trials[0098] pcDNA3.1+ as a DNA Vaccine Vector
[0099] pcDNA3.1+
[0100] The vector chosen for use as a DNA vaccine vector was pcDNA3.1 (Invitrogen) (actually pcDNA3.1+, the forward orientation was used in all cases but may be referred to as pcDNA3.1 here on). This vector has been widely and successfully employed as a host vector to test vaccine candidate genes to give protection against pathogens in the literature (Zhang, et al., Kurar and Splitter, Anderson et al.). The vector was designed for high-level stable and non-replicative transient expression in mammalian cells. pcDNA3.1 contains the ColE1 origin of replication which allows convenient high-copy number replication and growth in E. coli. This in turn allows rapid and efficient cloning and testing of many genes. The pcDNA3.1 vector has a large number of cloning sites and also contains the gene encoding ampicillin resistance to aid in cloning selection and the human cytomegalovirus (CMV) immediate-early promoter/enhancer which permits efficient, high-level expression of the recombinant protein. The CMV promoter is a strong viral promoter in a wide range of cell types including both muscle and immune (antigen presenting) cells. This is important for optimal immune response as it remains unknown as to which cells types are most important in generating a protective response in vivo. A T7 promoter upstream of the multiple cloning site affords efficient expression of the modified insert of interest and which allows in vitro transcription of a cloned gene in the sense orientation.
[0101] Zhang, D., Yang, X., Berry, J. Shen, C., McClarty, G. and Brunham, R. C. (1997) “DNA vaccination with the major outer-membrane protein genes induces acquired immunity to Chlamydia trachomatis (mouse pneumonitis) infection”. Infection and Immunity, 176, 103540.
[0102] Kurar, E. and Splitter, G. A. (1997) “Nucleic acid vaccination of Brucella abortus ribosomal L7/L12 gene elicits immune response”. Vaccine, 15, 1851-57.
[0103] Anderson, R., Gao, X. -M., Papakonstantinopoulou, A., Roberts, M. and Dougan, G. (1996) “Immune response in mice following immunisation with DNA encoding fragment C of tetanus toxin”. Infection and Immunity, 64, 3168-3173.
[0104] Preparation of DNA Vaccines
[0105] Oligonucleotide primers were designed for each individual gene of interest derived using the LEEP system. Each gene was examined thoroughly, and where possible, primers were designed such that they targeted that portion of the gene thought to encode only the mature portion of the gene protein. It was hoped that expressing those sequences that encode only the mature portion of a target gene protein, would facilitate its correct folding when expressed in mammalian cells. For example, in the majority of cases primers were designed such that putative N-terminal signal peptide sequences would not be included in the final amplification product to be cloned into the pcDNA3.1 expression vector. The signal peptide directs the polypeptide precursor to the cell membrane via the protein export pathway where it is normally cleaved off by signal peptidase I (or signal peptidase Id if a lipoprotein). Hence the signal peptide does not make up any part of the mature protein whether it be displayed on the surface of the bacteria surface or secreted. Where an N-terminal leader peptide sequence was not immediately obvious, primers were designed to target the whole of the gene sequence for cloning and ultimately, expression in pcDNA3.1.
[0106] Having said that, however, other additional features of proteins may also affect the expression and presentation of a soluble protein. DNA sequences encoding such features in the genes encoding the proteins of interest were excluded during the design of oligonucleotides. These features included:
[0107] 1. LPXTG cell wall anchoring motifs.
[0108] 2. LXXC ipoprotein attachment sites.
[0109] 3. Hydrophobic C-terminal domain.
[0110] 4. Where no N-terminal signal peptide or LXXC was present the start codon was excluded.
[0111] 5. Where no hydrophobic C-terminal domain or LPXTG motif was present the stop codon was removed.
[0112] Appropriate PCR primers were designed for each gene of interest and any and all of the regions encoding the above features was removed from the gene when designing these primers. The primers were designed with the appropriate enzyme restriction site followed by a conserved Kozak nucleotide sequence (in most cases(NB except in occasional instances for example ID59) GCCACC was used. The Kozak sequence facilitates the recognition of initiator sequences by eukaryotic ribosomes) and an ATG start codon upstream of the insert of the gene of interest. For example the forward primer using a BamH1 site the primer would begin GCGGGATCCGCCACCATG followed by a small section of the 5′ end of the gene of interest. The reverse primer was designed to be compatible with the forward primer and with a NotI restriction site at the 5′ end in most cases (this site is TTGCGGCCGC) (NB except in occasional instances for example ID59 where a XhoI site was used instead of NotI).
[0113] PCR Primers
[0114] The following PCR primers were designed and used to amplify the truncated genes of interest.
[0115] ID5
[0116] Forward Primer 5′
[0117] CGGATCCGCCACCATGGGTCTAATTGAAGACTTAAAAAATCAA 3′
[0118] Reverse Primer 5′ TTGCGGCCGCCAATGCTAGACTAAACACAAGACTCA 3′
[0119] ID59
[0120] Forward Primer 5′ CGCGGATCCATGAAAAAAATCTATTCATTTTTAGCA 3′
[0121] Reverse Primer 5′ CCCTCGAGGGCTACTTCCGATACATTTTAAACTGTAGG 3′
[0122] ID51
[0123] Forward Primer 5′ CGGATCCGCCACCATGAGTCATGTCGCTGCAAATG 3′
[0124] Reverse Primer 5′ TTGCGGCCGCATACCAAACGCTGACATCTACG 3′
[0125] ID29
[0126] Forward Primer 5′ CGGATCCGCCACCATGCAAAAAGAGCGGTATGGTTATG 3′
[0127] Reverse Primer 5′ TTGCGGCCGCACCCCCATTCTTAATCCCTT 3′
[0128] ID50
[0129] Forward Primer 5′
[0130] CGGATCCGCCACCATGGAGGTATGTGAAATGTCACGTAAA 3′
[0131] Reverse Primer 5′TTGCGGCCGCTTTTACAAAGTCAAGCAAAGCC 3′
[0132] Cloning
[0133] The insert along with the flanking features described above was amplified using PCR against a template of genomic DNA isolated from type 4 S. pneumoniae strain 11886 obtained from the National Collection of Type Cultures. The PCR product was cut with the appropriate restriction enzymes and cloned in to the multiple cloning site of pcDNA3.1 using conventional molecular biological techniques. Suitably mapped clones of the genes of interested were cultured and the plasmids isolated on a large scale (>1.5 mg) using Plasmid Mega Kits (Qiagen). Successful cloning and maintenance of genes was confirmed by restriction mapping and sequencing 700 base pairs through the 5′ cloning junction of each large scale preparation of each construct.
[0134] Strain Validation
[0135] A strain of type 4 was used in cloning and challenge methods which is the strain from which the S. pneumoniae genome was sequenced. A freeze dried ampoule of a homogeneous laboratory strain of type 4 S. pneumoniae strain NCTC 11886 was obtained from the National Collection of Type Strains. The ampoule was opened and the cultured re suspended with 0.5 ml of tryptic soy broth (0.5% glucose, 5% blood). The suspension was subcultured into 10 ml tryptic soy broth (0.5% glucose, 5% blood) and incubated statically overnight at 37° C. This culture was streaked on to 5% blood agar plates to check for contaminants and confirm viability and on to blood agar slopes and the rest of the culture was used to make 20% glycerol stocks. The slopes were sent to the Public Health Laboratory Service where the type 4 serotype was confirmed.
[0136] A glycerol stock of NCTC 11886 was streaked on a 5% blood agar plate and incubated overnight in a CO2 gas jar at 37° C. Fresh streaks were made and optochin sensitivity was confirmed.
[0137] Pneumococcal Challenge
[0138] A standard inoculum of type 4 S. pneumoniae was prepared and frozen down by passaging a culture of pneumococcus 1× through mice, harvesting from the blood of infected animals, and grown up to a predetermined viable count of around 109 cfu/ml in broth before freezing down. The preparation is set out below as per the flow chart. 1 Streak pneumococcal culture and confirm identity | V Grow over-night culture from 4-5 colonies on plate above | V Animal passage pneumococcal culture (i.p. injection of cardiac bleed to harvest) | V Grow over-night culture from animal passaged pneumococcus | V Grow day culture (to pre-determined optical density) from over-night of animal passage and freeze down at −70° C. - This is standard minimum | V Thaw one aliquot of standard inoculum to viable count | V Use standard inoculum to determine effective dose (called Virulence Testing) | V
[0139] All subsequent challenges—use standard inoculum to effective dose
[0140] An aliquot of standard inoculum was diluted 500× in PBS and used to inoculate the mice.
[0141] Mice were lightly anaesthetised using halothane and then a dose of 1.4×105 cfu of pneumococcus was applied to the nose of each mouse. The uptake was facilitated by the normal breathing of the mouse, which was left to recover on its back.
[0142] S. pneumoniae Vaccine Trials
[0143] Vaccine trials in mice were carried out by the administration of DNA to 6 week old CBA/ca mice (Harlan, UK). Mice to be vaccinated were divided into groups of six and each group was immunised with recombinant pcDNA3.1+ plasmid DNA containing a specific target-gene sequence of interest. A total of 100 Ag of DNA in Dulbecco's PBS (Sigma) was injected intramuscularly into the tibialis anterior muscle of both legs (50 &mgr;l in each leg). A boost was carried using the same procedure 4 weeks later. For comparison, control groups were included in all vaccine trials. These control groups were either unvaccinated animals or those administered with non-recombinant pcDNA3.1+ DNA (sham vaccinated) only, using the same time course described above. 3 weeks after the second immunisation, all mice groups were challenged intra-nasally with a lethal dose of S. pneumoniae serotype 4 (strain NCTC 11886). The number of bacteria administered was monitored by plating serial dilutions of the inoculum on 5% blood agar plates. A problem with intranasal immunisations is that in some mice the inoculum bubbles out of the nostrils, this has been noted in results table and taken account of in calculations. A less obvious problem is that a certain amount of the inoculum for each mouse may be swallowed. It is assumed that this amount will be the same for each mouse and will average out over the course of inoculations. However, the sample sizes that have been used are small and this problem may have significant effects in some experiments. All mice remaining after the challenge were killed 3 or 4 days after infection. During the infection process, challenged mice were monitored for the development of symptoms associated with the onset of S. pneumoniae induced-disease. Typical symptoms in an appropriate order included piloerection, an increasingly hunched posture, discharge from eyes, increased lethargy and reluctance to move. The latter symptoms usually coincided with the development of a moribund state at which stage the mice were culled to prevent further suffering. These mice were deemed to be very close to death, and the time of culling was used to determine a survival time for statistical analysis. Where mice were found dead, the survival time was taken as the last time point when the mouse was monitored alive.
[0144] Interpretation of Results
[0145] A positive result was taken as any DNA sequence that was cloned and used in challenge experiments as described above which gave protection against that challenge. Protection was taken as those DNA sequences that gave statistically significant protection (to a 95% confidence level (p<0.05)) and also those which were marginal or close to significant using Mann-Whitney or which show some protective features for example there were one or more outlying mice or because the time to the first death was prolonged. It is acceptable to allow marginal or non-significant results to be considered as potential positives when it is considered that the clarity of some of the results may be clouded by the problems associated with the administration of intranasal infections.
[0146] Results
[0147] Trials 1-6 (see FIG. 1) 2 Mean survival times (hours) Unvacc Unvacc Unvacc Unvacc Mouse control pcDNA ID5 control ID59 control ID59 control ID51 number (1) 3.1 + (1) (1) (2) (2) (5) (5) (6) (6) 1 47.5 61.0 61.0 49.0 55.0 58.0 55.3 71.6* 50.0 2 57.0 47.5 61.0 51.0 55.0 75.0 98.0 60.7 99.9T 3 47.5 50.5 57.0 49.0 55.0 48.0 58.5 98.5 53.6 4 47.5 50.5 72.0 55.0 69.5 46.7 55.3 (101.2)*T 99.9 5 77.0 72.0 47.5 49.0 74.0 58.0 53.5 60.7 59.4 6 57.0 50.5 mouse 49.0 mouse 75.0 98.0 50.8 50.0* died died Mean 55.6 55.3 59.7 50.3 61.7 60.1 69.8 68.4 68.8 sd 11.5 9.4 8.8 2.4 9.3 12.5 21.9 18.3 24.4 p value 1 — — 0.1722 — 0.0064 — 0.2862 — <36.0 p value 2 — — 0.2565 — — — — — — *—bubbled when dosed so may not have received full inoculum. T—terminated at end of experiment having no symptoms of infection. Numbers in brackets—survival times disregarded assuming incomplete dosing p value 1 refers to significance tests compared to unvaccinated controls p value 2 refers to significance tests compared to pcDNA3.1 + vaccinated controls
[0148] Statistical Analyses.
[0149] Trial 1—None of the other groups had significantly longer survival times than the controls. The survival times of the unvaccinated and pcDNA3. 1 control groups were not significantly different. One of the mice from ID5 was an outlying result and the mean survival times for ID5 were extended but not significantly so.
[0150] Trial 2—The group vaccinated with ID59 had significantly longer survival times than the unvaccinated control group.
[0151] Trial 5—The group vaccinated with ID59 again survived for an average of almost 10 hours longer than the controls but the results were not quite statistically significant.
[0152] Trial 6—The group vaccinated with ID51 did not have survival times significantly higher than unvaccinated controls (p=<36.0), however, there were 2 outlying mice in the vaccinated group.
[0153] Vaccine Trials 7 and 8 (See FIG. 2) 3 Mean survival times (hours) Mouse Unvacc ID29 Unvacc ID50 number control (7) (7) control (8) (8) 1 59.6 73.1 45.1 60.6 2 47.2 54.8 50.8 60.6 3 59.6 59.3 60.4 51.1 4 70.9 54.8* 55.2 60.6 5 68.6* 59.3 45.1 60.6 6 76.0 54.8 45.1 60.6 Mean 63.6 59.35 50.2 59.1 sd 10.3 7.1 6.4 3.9 p value 1 — <39.0 — 0.0048 *bubbled when dosed so may not have received full inoculum. T - terminated at end of experiment having no symptoms of infection. Numbers in brackets - survival times disregarded assuming incomplete dosing p value 1 refers to significance tests compared to unvaccinated controls
[0154] Numbers in brackets—survival times disregarded assuming incomplete dosing
[0155] p value 1 refers to significance tests compared to unvaccinated controls
[0156] Statistical Analyses.
[0157] Trial 7—The ID29 vaccinated group showed prolonged times to the first death. T
[0158] Trial 8—The group vaccinated with ID50 survived significantly longer than unvaccinated controls.
[0159] Appendix 1—Oligonucleotide Primers
[0160] nucS1 4 Bgl II Eco RV 5′-cgagatctgatatctcacaaacagataacggcgtaaatag-3′ nucS2 Bgl II Sma I 5′-gaagatcttccccgggatcacaaacagataacggcgtaaatag-3′ nucS3 Bgl II Eco RV 5′-cgagatctgatatccatcacaaacagataacggcgtaaatag-3′ nucR Bam HI 5′-cgggatccttatggacctgaatcagcgttgtc-3′ NucSeq 5′-ggatgctttgtttcaggtgtatc-3′ pTREPF 5′-catgatatcggtacctcaagctcatatcattgtccggcaatggtgtg ggctttttttgttttagcggataacaatttcacac-3′ pTREPR 5′-gcggatcccccgggcttaattaatgtttaaacactagtcgaagatct cgcgaattctcctgtgtgaaattgttatccgcta-3′ PUCF 5′-cgccagggttttcccagtcacgac-3′ VR 5′-tcaggggggcggagcctatg-3′ V1 5′-tcgtatgttgtgtggaattgtg-3′ V2 5′-tccggctcgtatgttgtgtggaattg-3′
[0161] 5 TABLE 1 ID4 1200 bp ATGAGAAATATGTGGGTTGTAATCAAGGAAACCTATCTTCGACATGTCGAGTCATGGAGTTTCTTCTTTATGGTGA TTTCGCCGTTCCTCTTTTTAGGAATCTCTGTAGGAATTGGGCATCTCCAAGGTTCTTCTATGGCTAAAAATAATAA AGTGGCAGTAGTGACAACAGTGCCATCTGTAGCAGAAGGACTGAAGAATGTAAATGGTGTTAACTTCGACTATAA AGACGAAGCAAGTGCCAAAGAAGCAATTAAAGAAGAAAAATTAAAAGGTTATTTGACCATTGATCAAGAAGATA GTGTTCTAAAGGCAGTTTATCTGGCGAAACATCGCTTGAAAATGGAATTAAATTTGAGGTTACAGGTACACTCA ATGAACTGCAAAATCAGCTTAATCGTTCAACTGCTTCCTTGTCTCAAGAGCAGGAAAAACGCTTAGCGCAGACAA TTCAATTCACAGAAAAGATTGATGAAGCCAAGGAAAATAAAAAGTTTATTCAAACAATTGCAGCAGGTGCCTTAG GATTCTTTCTTTATATGATTCTGATTACCTATGCGGGTGTAACAGCTCAGGAAGTTGCCAGTGAAAAAGGCACCAA AATTATGGAAGTCGTTTTTTCTAGCATAAGGGCAAGTCACTATTTCTATGCGCGGATGATGGCTCTGTTTCTAGTG ATTTTAACGCATATTGGGATCTATGTTGTAGGTGGTCTGGCTGCCGTTTTGCTCTTTAAAGATTTGCCATTCTTGGC TCAGTCTGGTATTTTGGATCACTTGGGAGATGCTATCTCACTGAATACCTTGCTCTTTATTTTGATCAGTCTTTTCA TGTACGTAGTCTTGGCAGCCTTCCTAGGATCTATGGTTTCTCCTCCTGAGGACTCAGGGAAAGCCTTGTCGCCTTT GATGATTTTGATTATGGGTGGTTTTTTTGGAGTGACAGCTCTAGGTGCAGCTGGTGACAATCTCCTCTTGAAGATT GGTTCTTATATTCCCTTTATTTCGACCTTCTTTATGCCGTTTCGAACGATTAATGACTATGCGGGGGGAGCAGAAG CATGGATTTCACTTGCTATTACAGTGATTTTTGCGGTGGTAGCAACAGGATTTATCGGACGCATGTATGCTAGTCT CGTTCTTCAAACGGATGATTTAGGGATTTGGAAAACCTTTAAACGTGCCTTATCTTATAAATAG MRNMWVVIKETYLRHVESWSFFFMVISPFLFLGISVGIGHLQGSSMAKNNKVAVVTTVPSVAEGLKNVNGVNFDYKD RASAKEAIKEEKLKGYLTIDQEDSVLKAVYHGETSLENGIKFEVTGTLNELQNQNRSTASLSQEQEKRLAQTIQFTEKI DEAKENKKFIQTIAAGALGFFLYMILITYAGVTAQEVASEKGTKIMEVVFSSIRASHYFYARMMALFLVILTHIGIYVVG GLAAVLLFKDLPFLAQSGILDHLGDAISLNTLLFILISLFMYVVLAAFLGSMVSRPEDSGKALSPLMILIMGGFFGVTALG AAGDNLLLKIGSYIPFISTFFMPFRTINDYAGGAEAWISLAITVIFAVVATGFIGRMYASLVLQTDDLGIWKTFKRALSYK Z ID5 1125 bp CCTGGGAAAGTCTTGAAAATTATGATAGAATGGTGGAAGGAAAAATTCAGGAGAGTAGTAGTGACTCAAAATGTT GAAAGTCTTCTCGTATCCATTGTAATCAGTGCATACAATGAAGAAAAATATCTGCCTGGTCTAATTGAAGACTTAA AAAATCAAACCTATCCTAAAGAGGATATTGAAATTCTATTTATAAATGCTATGTCCACAGATGGGACCACAGTA TCATTCAGCAATTTATAAAGGAAGATACAGAGTTAACTCAATTAGATTGTATAACAATCCTAAGAAAAATCAAG CTAGTGGTTTTAACCTGGGAGTTAAACATTCTGTAGGGGACCTTATTTTAAAAATTGATGCTCATTCAAAAGTTAC TGAGACTTTTGTAATGAACAATGTGGCTATTATTCAACAAGGTGAATTTGTCTGTGGGGGGCCTAGACCGACGATT GTCGAAGGAAAAGGAAAATGGGCAGAGACCTTGCATCTTGTTGAGGAAAATATGTTTGGCAGTAGCATTGCCAAT TATCGAAATAGTTCTGAGGATAGATATGTTTCTTCTATTTTTCATGGAATGTATAAACGAGAGGTTTTCCAGAAGG TTGGTTTAGTAAATGAGCAACTTGGCCGAACTGAAGATAATGATATTCATTATAGAATTCGAGAATATGGTTATAA AATCCGCTATGCCCAAGTATTCTATCTTATCAGTATATTCGACCAACATTCAAGAAAATGCTGCATCAAAAGTAT TCAAATGGTTTGTGGATTGGCTTGACAAGTCATGTTCAGTTTAAGTGTTTATCATTATTTCACTATGTTCCTTGTTT ATTTGTTTTGAGTCTTGTGTTTAGTCTAGCATTGTTACCGATCACATTCGTATTCATAACTTTACTATTAGGTGCCT ATTTTCTACTTTTGTCATTACTCACTTTGCTGACTTTATTAAAACATAAAAATGGATTTCTAATTGTGATGCCCTTT ATTTTATTTTCCATTCACTTTGCTTATGGCCTTGGGACGATTGTAGGTTTAATTAGAGGATTTAAATGGAGAAGG AGTACAAGAGAACAATAATTTATTTGGATAAAATAAGCCAAATAAATCAAAATATGCTATAA PGKVLKIMIEWWKEKTRRVVVTQNVESLLVSIVISAYNEEKYLPGLIEDLKNQTYPKEDIENAMTGTAIIQQFIK EDTEPNSIRLYNNPKKNQASGTNLGVKHSVGDLILKIDAHSKTETTVMNNVAIIQQGETCGGPPTIVEGKGKWAT LHLVEENMTGSSIANYRNSSEDRYVSSIFHGMYKREVFQKVGLVNEQLGRTEDNDIHYRIREYGYKIRYSPSILSYQIRP TFKKMLHQKYSNGLWIGLTSHVQFKCLSFHYVPCLFVLSLVFSLALLPITFVFITLLLGAYFLLLSLLTLLTLLKHKNGF LIVMPFILFSIHFAYGLGTIVGLIRGFKWKKEYKRTIIYLDKISQINQNMT ID11 696 bp ATGATGAAAGAACAAAATACGATAGAAATCGATGTATTTCAATTAGTTAAAAGCTTGTGGAAACGCAAGCTAATG ATTTTAATAGTGGCACTTGTGACAGGTGCGGGGCTTTTGCATATAGCACTTTTATTGTTAAGCCAGAATATACGA GTACCACGCGAATTTACGTAGTGAATCGCAATCAAGGAGACAAGCCGGGGTTGACAAATCAGGATTTGCAGGCAG GAACTTATCTGGTAAAAGACTACCGTGAGATTATCCTTTCGCAGGATGTTTTGGAGGAAGTTGTTTCTGATTTGAA ACTAGATTTGACGCCAAAAGGTTTGGCTAATAAAATTAAAGTGACAGTACCAGTTGATACCCGTATTGTCTCTATT TCAGTTAATGATCGAGTTCCTGAAGAGGCAAGCCGTATCGCTAACTCTTTGAGAGAAGTAGCTGCTCAAAAAATT ATCAGTATTACTCGTGTTTCTGACGTGACAACACTGGAGGAGGCAAGGCCGGCGATATCCCCGTCTTCGCCAAAT ATTAAACGCAATACACTAATTGGTTTTTTGGCAGGGGTGATTGGAACTAGTGTTATAGTTCTTCATCTTGAACTTTT GGATACTCGTGTGAAACGTCCGGAAGATATCGAAAATACATTGCAGATGACACTTTTGGGAGTTGTGCCAAACTT GGGTAAGTTGAAATAG MMKEQNTIEDVFQLVKSLWKRKLMILIVALVTGAGAFAYSTFIVKPEYTSTTRIYVVNRNQGDKPGLTNQDLQAGTYL VKDYREIILSQDVLEEVVSDLKLDLTPKGLANKIKVTVPVDTRIVSISVNDRVPEEASRIANSLREVAAQKIISITRVSDVT TLEEARPAISPSSPNIKRNTLIGFLAGVIGTSVIVLHLELLDTRVKRPEDIENTLQMTLLGVVPNLGKLKZ ID19 555 bp ATGGTAAAAGTAGCAGTTATATTAGCTCAGGGCTTTGAAGAAATTGAAGCCTTGACAGTTGTAGATGTCTTGCGTC GAGCCAATATCACATGTGATATGGTTGGTTTTGAAGAGCAAGTAACGGGTTCGCATGCAATCCAAGTAATGAGCAG ATCATGTCTTTGATGGAGATTTATCAGACTATGATATGATTGTTCTTCCTGGAGGTATGCCTGGTTCTGCACATTTA CGTGATAATCAGACCTTGATTCAAGAATGCAAAGCTTCGAGCAAGAAGGGAAGAAACTAGCAGCCATTTGTGCG GCACCAATTGCCCTCAATCAAGCAGAGATATTGAAAAATAAGCGATACACTTGTTATGACGGCGTTCAAGAGCAA ATCCTTGATGGTCACTACGTCAAGGAAACAGTAGTGGTAGATGGTCAGTTGACAACCAGTCGGGGTCCTTCAACA GCCCTTGCCTTTGCCTACGAGTTGGTGGAGCAACTAGGAGGGGACGCAGAGAGTTTACGAACAGGAATGCTCTAT CGAGATGTCTTTGGTAAAAATCAGTAA MVKVAVILAQGFEEIEALTVVDVLRRANITVDMVGFEEQVTGSHAIQVRADHFDGDLSDYDMIVLPGGMPGSAHLR DNQTLIQELQSFEQEGKKLAAICAAPIALNQAEILKNKRYTCYDGVQEQILDGHYVKTVVVDGQLTSRGPTALAFA YELVEQLGGDAESLRTGMLYRDVFGKNQZ ID27 306 bp GTGGTAGGGATGGTAGAACCAAACCTAGAAAGCCTTATAAAAGATCTTTACAATCATGCTCGACATGATTTGAGT GAAGATTTAGTTGCTGCTCTCCTAGAGACTACTAAAAAACTGCCTACTACAAATGAGCAATTGCAGGCAGTTCGTC TCTCAGGCCTGGTCAATCGTGAATTGCTCCTAAATCCCAAACATCCAGCACCTGAGTTGCTCAACTTGGCTCGCTT TGTCAAAAGAGAAGAAGCCAAGTACAGAGGAACTGCGACTTCTGCGCTTATGTATGAGGAACTCTTTAAAATGCT TTGA MVGMVEPNLESLIKDLYNHARHDLSEDLVAALLEKIITNEQLQAVRLSGLVNRELLLNPKHPAPELLNITARTVK REEAKYRGTATSALMYEELFKMLZ ID29 945 bp TTGTTCTTAAAAAAGGAAAGAGAGGTAATCAGCATGCGTAAATGGACAAAAGGATTTCTCATCTTTGGTGTGGTG ACTACCGTTATCGGCTTTATCCTGCTTTTTGTAGGTATCCAATCTGACGGGATAAGAGCCTACTTTCCATGTCCAA CCAACACACGCTCACCATCACAGACTCTTTCGATGATCAAATCCACATTTCTTACCATCCATCTCTTTCTGCTCAC CATGATCTTATCACCAATCAGAACGATAGAACTCTGAGTCTCACTGATAAGAAACTGTCTGAAACTCCGTTTCTCT CTTCTGGAATTGGTGGGATTCTTCATATCGCAAGTAGCTACTCTAGTCGTTTTGAAGAAGTTATTCTCCGACTACC AAAAGGGAGAACTCTAAAAGGGATCAACATCTCAGCCAATCGCGGACAAACCACCATCATAAATGCTAGCCTTGA AAATGCGACCCTCAATACAAACAGCTATATCCTCCGAATTGAAGGAAGTCGTATCAAAAACAGTAAACTCACAAC GCCCAATATCGTTAATATCTTTGATACAGTTCTTACAGATAGTCAGCTAGAGTCAACAGAGAATCACTTCCACGCT GAAAATATCCAAGTCCATGGCAAGGTTGAACTGACTGCCAAAGATTATCTCAGAATCATCCTAGACCAGAAAGAA AGCCAACGAATTAACTGGGACATCTCAAGCAACTATGGTTCTATCTTCCAATTCACAAGAGAAAAGCCTGAATCA AGAGGTACGGAATTAAGCAACCCTTACAAAACTGAAAAAACCGATGTCAAGGATCAACTCATTGCGAGATCTGAT GATAATATTGATCTAATATCCACACCAAGCAGACGTGA MFLKKEREVISMRKWTKGFLIFGVVTTVIGFILLFVGIQSDGIKSLLSMSKEPVYDSRTEKLTFGKEVENLEITLHQHTLTI TDSFDDQIHISYHPSLSAHHDLITNQNDRTLSLTDKKLSETPFLSSGIGGILHIASSYSSRFEEVILRLPKGRTLKGINISANR GQTTIINASLENATLNTNSYILRIEGSRIKNSKLTTPNIVNIFDTVLTDSQLESTENHFHAENIQVHGKVELTAKDYLRIILD QKESQRINWDISSNYGSIGQFTREKPESRGTELSNPYKTEKTDVKDQLTPTDDNIDLISTPSRRZ ID30 879 bp ATGAAACAAGAATGGTTTGAAAGTAATGATTTTGTAAAAACAACAAGCAAGAACAAGCCTGAAGAGCAAGCTCA AGAGGTTGCAGACAAGGCTGAAGAAACGATAGCCGATCTCGATACACCAATTGAAAAAAATACTCAGTTAGAGG AGGAAGTCCCTCAAGCTGAAGTCGAATTGGAAAGCCAGCAAGAAGAGAAAATTGAAGCTCCTGAAGACAGTGAA GCGAGAACAGAAATAGAAGAAAAGAAGGCATCTAATTCTACTGAAGAAGAGCCAGACCTTTCTAAAGAAACAGA AAAAGTCACTATAGCTGAAGAGAGCCAAGAAGCTCTTCCTCAGCAAAAGCAACCACGAAAGAGCCACTTCTTAT CAGTAAATCTTTAGAAAGTCCTTATATCCCCGACCAAGCTCCAAAATCTAGGGATAAATGGAAAGAGCAAGTGCT TGATTTTTGGTCTTGGCTAGTGGAAGCGATCAAATCTCCTACAAGTAAGTTGGAAACAAGTATCACACACAGTTAC ACAGCCTTTCTCTTGCTCATTCTGTTTTCTGCATCTTCCTTTTTCTTTAGTATCTATCACATCAAACATGCTTACTAT GGACATATAGCAAGCATTAACAGTCGCTTCCCTGAGCAGCTAGCTCCTTTAACTCTTTTTTCTATCATCTCTATCCT AGTAGCGACAACACTCTTCTTCTTTTCATTCCTCTTGGGTAGTTTCGTTGTGACACGATTTATCCACCAGGAAAAG GACTGGACGCTAGACAAGGTTCTCCAACAATATAGTCAACTCTTGGCAATTCCAATCTCCTCACTGCTATTGCTAG TTTCTTTGCTTTCTTTGATAGCCTACGATTTACAGCCCTCTTGTGTGTGA MKQEWFESNDFVKTTSKNKPEEQAQEVADKAEETIADLDTPIEKNTQLEEEVPQAEVELESQQEEKIEAPEDSEARTEIE EKKASNSTEEEPDLSKETEKVTIAEESQEALPQQKATTKEPLLISKSLESPYIPDQAPKSRDKWKEQVLDTWSWLVEAIKS PTSKLESITHSYTAFLLLILFSASSFFFSIYHIKHAYYGHIASINSRFPEQLAPLTLFSIISILVATTLFFFSFLLGSFVVRRFIH QEKDWTLDKVLQQYSQLLAIPISSLLLLVSLLSLIAYDLQPSCVZ ID105 990 bp ATGCAACTCGCTCTTCGGTCTACTCATTGTTCGTCTGGTACAATTTGTTCTTAAAAAAGGAAAGAGAGGTAATCA GCATGCGTAAATGGACAAAAGGATTTCTCATCTTGGTGTGGTGACTACCGTTATCGGCTTTATCCTGCTTTTTGTA GGTATCCAATCTGACGGGATTAAGAGCCTACTCCATGTCCAAAGAACCTGTCTATGATAGCCGTACGGAAAAG CTAACCTTTGGCAAGGAAGTCGAAAACCTAGAAATTACTCTCCACCAACACACGCTCACCATCACAGACTCTTTC GATGATCAAATCCACATTTCTTACCATCCATCTCTTTCTGCTCACCATGATCTTATCACCAATCAGAACGATAGAA CTCTGAGTCTCACTGATAAGAAACTGTCTGAAACTCCGTTTCTCTCTTCTGGAATTGGTGGGATTCTTCATATCGC AAGTAGCTACTCTAGTCGTTTTGAAGAAGTTATTCTCCGACTACCAAAAGGGAGAACTCTAAAATGGGATCAATAT CTCAGCCAATCGCGGACAAACCACCATCATAAATGCTAGCCTTGAAAATGCGACCCTCAATACAAACAGCTATAT CCTCCGAATTGAAGGAAGTCGTATCAAAAACAGTAAACTCACAACGCCCAATATCGTTAATATCTTTGATACAGTT CTTACAGATAGTCAGCTAGAGTCAACAGAGAATCACTTTCCACGCTGAAAATATCCAAGTCCATGGCAAGGTTGAA CTGACTGCCAAAGATTATCTCAGAATCATCCTAGACCAGAAAGAAAGCCAACGAATTAACTGGGACATCTCAAGC AACTATGGTTCTATCTTCCAATTCACAAGAGAAAAGCCTGAATCAAGAGGTACGGAATTAAGCAACCCTTACAAA ACTGAAAAAACCGATGTCAAGGATCAACTCATTGCGAGATCTGATGATAATATTGATCTAATATCCACACCAAGC AGACGTTTGA MQLASSVYSLFVWYNLFLKKEREVISMRKWTKGTLITGVVTTVIGTILLTVGIQSDGIKSLLSMSKEPVYDSRTEKLTFG KEVENLEITLHQHTLTITDSFDDQIHISYHPSLSAHHDLITNQNDRTLSLTDKKLSETPFLSSGIGGILHIASSYSSRFEEVIL RLPKGRTLKGINISANRGQTTIINASLENATLNTNSYILRIEGSRIKNSKLTTPNIVNFDTVLTDSQLESTENHTHAENIQV HGKVELTAKDYLRIILDQKESQRINWDISSNYGSITQTTREKPESRGTELSNPYKTEKTDVKDQLIARSDDNIDLISTPSRR Z ID107-78 bp ATGATATGTAAAATGAAGCAGGGAGGGAGCAGGGCGTGCTGGGGATGGAGAGTGGGGGAGGGACGCTGCTATTT TAATC MICKMKQGGSRACWGWRVGEGRCYFN ID109 714 bp CGATAAAGAGGCCTTGAGTAATCTCAATTTGCAGATTGAAAATGGAGAGATTATGGGCTTGATTGGTCATAATGG GGCTGGAAAATCGACCACTATAAAATCCCTAGTCAGTATCKTTTCACCCAGCAGTGGTCGTATTTTGGTAGACGGT CAGGAGTTATCGGAAAATCGCTTGGCTATTAAACGAAAGAITGGCTACGTAGCAGACTCGCCTGACTTATTTTTAC GCTTAACGGCCAATTGAATTTTGGGAATTGATCGCCTCATCCTATGATCTGAGTAGATCTGACTTGGAGGTAGTCT AGCTAGGCTATTGAACGTTTTTGATTTTGCTGAAAATCGCTATCAGGTTATTGAAACTCTTTCTCACGGAATGCGT CAGAAAGTCTTTGTCATCGGAGCACTCTTGTCTGATCCCGATATTTGGGTTTTGGACGAACCCTTGACTGGTTTGG ATCCCCAGGCTGCCTTTGATTTGAAACAGATGATGAAGGAACATGCACAAAAAGGGAAGACAGTCTTGTTTTCAA CTCATGTCCTAGAGGTGGCAGAGCAAGTCTGTGATCGGATTGCCATTTTGAAAAAGGGGCATTTGATTTATTGTGG TAAGGTAGAGGACTTGAGGAAAGACCACCCAGACCAGTCTTTGGAAAGTATCTACCTTAGTCTTGCTGGTAGTAAA AGAGGAGGTIGCGGATGCGTCTCAAGGTCATTAA DKEALSNLNLQIENGEIMGLIGHNGAGKSTTIKSLVSIISPSSGRILVDGQELSENRLAIKRKIGYVADSPDLFLRLTANEF WELIASSYDLSRSDLEASLARLLNVFDFAENRYQVIETLSHGMRQKVFVIGALLSDPDIWVLDEPLTGLDPQAAFDLKQ MMKEHAQKGKTVLFSTHVLEVAEQVCDRIAILKKGHLIYCGKVEDLRKDHPDQSLESIYLSLAGRKEEVADASQGHZ ID112 360 bp ATGGCTGTTTTCAGAGAGAGGAGCAGTACGGAAGACACCAATGGCAAGTCCAATAATGAGACCTATGATGTTT CCGACGATAGAGATTAAAAGAGTGATACCAGCACCACGCAAGAGTTGTTGCCAGTTTTCAGAAAGAATTTTAGCA ACTTGGCTAAAGAAACTACTGCTAGTTTCAGTTGTTGTAGCTTCGGCAGGTTGTTCCTTGATCATACGATCCA TCAAGGCAACTGGTCATCTTGAAATGGTTTCAATGCTGGCATTGATTTGGCTAATACGATTGTCATTTTTACGA AGCCCGATAGCGATAGCTGTATCTTCTTCCCCAGTTTTGAAACCAGGTTCTACTTGA MALTSERGAVRKTPMASPIMRPMMVPTIEIKRVIPAPRKSCCQTSERILATWLTZKLLLVSSVVVASAGCSLIIRSIKATWSS FEMVSMLALIWLIRLSFLRSPIAIAVSSSPVLKPGSTZ ID 128-3.43 ATGAAATTTAGTAAAAAATATATAGCAGCTGGATCAGCTGTTATCGTATC CTTGAGTCTATGTGCCTATGCACTAAACCAGCATCGTTCGCAGGAAAATA AGGACAATAATCGTGTCTCTATGTGGATGGCAGCCAGTCAAGTCAGAAA AGTGAAAACTTGACACCAGACCAGGTTAGCCAGAAAGAAGGAATTCAGGC TGAGCAAATTGTAATCAAAATTACAGATCAGGGCTATGTAACGTCACACG GTGACCACTATCATTACTATAATGGGAAAGTTCCTTATGATGCCCTCTTT AGTGAAGAACTCTTGATGAAGGATCCAAACTATCAACTTAAAGACGCTGA TATTGTCAATGAAGTCAAGGGTGGTTATATCATCAAGGTCGATGGAAAAT ATTATGTCTACCGAAAGATGCAGCTCATGCTGATAATGTTCGAACTAAA GATGAAATCAATCGTCAAAAACAAGAACATGTCAAAGATAATGAGAAGGT TAACTCTAATGTTGCTGTAGCAAGGTCTCAGGGACGATATACGACAAATG ATGGTTATGTCTTTAATCCAGCTGATATTATCGAAGATACGGGTAATGCT TATATCGTTCCTCATGGAGGTCACTATCACTACATTCCCAAAAGCGATTT ATCTGCTAGTGAATTAGCAGCAGCTAAAGCACATCTGGCTGGAAAAAATA TGCAACCGAGTCAGTTAAGCTATCTTCAACAGCTAGTGACAATAACACG CAATCTGTAGCAAAAGGATCAACTAGCAAGCCAGCAAATAAATCTGAAAA TCTCCAGAGTCTTTTGAAGGAACTCTATGATTCACCTAGCGCCCAACGTT ACAGTGAATCAGATGGCCTGGTCTTTGACCCTGCTAAGATTATCAGTCGT ACACCAAATGGAGTTGCGATTCCGCATGGCGACCATTACCACTTTATTCC TTACAGCAAGCTTTCTGCCTTAGAAGAAAAGATTGCCAGAATGGTGCCTA TCAGTGGAACTGGTTCTACAGTTTCTACAAATGCAAAACCTAATGAAGTA GTGTCTAGTCTAGGCAGTCTTTCAAGCAATCCTTCTTCTTTAACGACAAG TAAGGAGCTCTCTTCAGCATCTGATGGTTATATTTTTAATCCAAAAGATA TCGTTGAAGAAACGGCTACAGCTTATATTGTAAGACATGGTGATCATTTC CATTACATTCCAAAATCAAATCAAATTGGGCAACCGACTCTCCAAACAA TAGTCTAGCAACACCTTCTCCATCTCTTCCAATCAATCAGGAACTTCAC ATGAGAAACATGAAGAAGATGGATACGGATTTGATGCTAATCGTATTATC GCTGAAGATGAATCAGGTTTTGTCATGAGTCACGGAGACCACAATCATTA TTTCTTCAAGAAGGACTTGACAGAAGAGCAAATTAAGGTGCGCAAAAACA TTTAG MKTSKKYIAAGSAVIVSLSLCAYALNQIIRSQENKDNNRVSYVDGSQSSQK SENLTPDQVSQKEGIQAEQTTTKTTDQGYVTSHGDHYHYYNGKVPYDALF SEELLMKDPNYQLKDADIVNEVKGGYIIKVDGICYYVYLKDAAHADNVRTK DEINRQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVTNPADIIEDTGNA YIVPHGGHYHYIPKSDLSASELAAAKAHLAGKNMQPSQLSYSSTASDNNT QSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVTDPAKIISR TPNGVMPHGDHYHFIPYSKLSALEEKIARMVPISGTGSTVSTNAKPNEV VSSLGSLSSNPSSLTTSKSASDGYWNPKDIVEETATAYIVRHGDHF WYIPKSNQIC3QPTLPNNSLATPSPSLPINPGTSHEKHEEDGYGFDANRII AEDESGTVMSHGDHNHYTTKKDLTEEQIKVRIKNI*
[0162] 6 TABLE 2 ID2 840 bp ATGGGAATTGCTCTAGAAAATGTGAATTTTACATATCAAGAAGGTACTCCCTTAGCTTCAGCAGCTTTGTCGGATG TTTCTTTGACGATTGAAGATGGCTC1TATACAGCTTTAATTGGGCACACAGGTAGTGGTAAATCAACTATTTTACA ACTCTTAAATGGTTTATTGGTGCCAAGTCAAGGGAGTGTGAGGGTTTTTGATACCTTAATCACCTCGACTTCTAAA AATAAAGATATTCGTCAAATTAGAAAACAGGTTGGCTTGGTATTTCAGTTTGCTGAAAACAGATTTTTGAAGAAA CGGTTTTGAAGGACGTTGCTTTTGGACCGCAAAATTTTGGAGTTTCTGAAGAAGATGCTGTGAAGACTGCGCGTGA GAAACTGGCTCTGGTTGGAATTGATGAATCACTTTTTGATCGTAGTCCGTTTGAGCTGTCAGGGGGACAAATGAGA CGTGTTGCCATTGCAGGCATACTTGCCATGGAGCCAGCTATATTAGTCTTAGATGAGCCAACAGCTGGTCTAGATC CTCTAGGGAGAAAAGAGTTGATGACCCTGTTCAAAAAACTCCACCAGTCAGGGATGACCATCGTCTTGGTAACGC ATTTGATGGATGATGTTGCTGAATATGCGAATCAAGTCTATGTAATGGAAAAGGGACGTTTAGTAAATGGGGGGCA AACCAAGTGATGTCTTTCAAGACGTTGTTTATGGAAGAAGTTCAGTTGGGAGTACCTAAAATTACGGCCTTTTG TAAACGATTGGCTGATAGAGGCGTGTCATTTAAACGATTACCGATTAAGATAGAGGAGTTCAAGGAGTCGCTAAA TGGATAG MGIALENVNFTYQEGTPLASAALSDVSLTIEDGSYTALIGHTGSGKSTTLQLLNGLLVPSQGSVRVTDTLTTSTSKNKDIR QIRKQVGLVFQFAENQIFEETVLKDVATGPQNTGVSEEDAVKTAREKLALVGDESLTDRSPTELSGGQMRRVARGILA MEPAILVLDEPTAGLDPLGRKELMTLFKKLIIQSGMTTVLVTHLMDDVAEYANQVYVMEKGRLVKGGKPSDVTQDVV TMEEVQLGVPKTTATCKRLADRGVSTKRLPIKTTETKBSLNGZ ID 3 6360 bp TACCCGGTAGTCTTAGCAGACACATCTAGCTCTGAAGATGCTTAAACATCTCTGATAAAGAAAAAGTAGCAGAA AATAAAGAGAAACATGAAAATATCCATAGTGCTATGGAAACTTCACAGGATTTTAAAGAGAAGAAAACAGCAGTC ATTAAGGAAAAAGAAGTTGTTAGTAAAAATCCTGTGATAGACAATAACACTAGCAATGAAGAAGCAAAAATCAA AGAAGAAAATTCCAATAAATCCCAAGGAGATTATACGGACTCATTTGTGAATAAAAACACAGAAAATCCCAAAAA AGAAGATAAAGTTGTCTATATTGCTGAATTTAAAGATAAAGAATCTGGAGAAAAAGCAATCAAGGACTATCCAG TCTTAAGAATACAAAAGTTTTATATACTTATGATAGAATTTTTAACGGTAGTGCCATAGAAACAACTCCAGATAAC TTGGACAAAATTAAACAAATAGAAGGTATTTTCATCGGTTGAAAGGGCACAAAAAGTCCAACCCATGATGAATCAT GCCAGAAAGGAAATTGGAGTTGAGGAAGCTATTGATTACCTAAAGTCTATCAATGCTCCGTTTGGGAAAAATTTT GATGGTAGAGGTATGGTCATTTCAAATATCGATACTGGAACAGATTATAGACATAAGGCTATGAGAATCGATGAT GATGCCAAAGCCTCAATGAGATTTAAAAAAGAAGACTTAAAAGGCACTGATAAAAATTATTGGTTGAGTGATAAA ATCCCTCATGCGTTCAATTATTATAATGGTGGCAAAATCACTGTAGAAAAATATGATGATGGAAGGGATTATTTTG ACCCACATGGGATGCATATTGCAGGGATTCTTGCTGGAAATGATACTGAACAAGACATCAAAAACTTTAACGGCA TAGATGGAATTGCACCTAATGCACAAATTTTCTCTTACAAAATGTATTCTGACGCAGGATCTGGGTTTGCGGGTGA TGAAACAATGTTTCATGCTATTGAAGATTCTATCAAACACAACGTTGATGTTTGTTCGGTATCATTTGTTTTACA CCAACAGGTCTTGTAGGTGAGAAATATTGGCAAGCTATTCGGGCATTAAGAAAAGCAGGCATTCCAATGGTTGTC GCTACGGGTAACTATGCGACTTCTGCTTCAAGTTCTTTCATGGGATTTAGTAGCAAATAATCATCTGATTGACCG ACACTGGAAATGTAACACGAACTGCAGCACATGAAGATGCGATAGCGGTCGCTTCTGCTAAAAATCTATCAGTTG AGTTTGATAAAGTTAACATAGGTGGAGAAAGTTTTAAATACAGAAATATAGGGGCCTTTTTTCGATAAGAGTTAT TCACAACAAATGAAGATGGAACAAAAGCTCCTAGTAAATTAAAATATGTATATATAGGCATGGGGCAAGACCAAG ATTTGATAGGTTTGGATCTTAGGGGCAAAATTGCAGTAATGGATAGAATTTATACAAAGGATTTAAAAAATGCTTT TAAAAAAGCTATGGATAAGGGTGCACGCGCCATTATGGTTGTAAATACTGTAAATTACTACAATAGAGATAATTG GACAGAGCTTCCAGCTATGGGATATGAAGCGGATGAAGGTACTAAAAGTCAAGTGTTTTCAATTTCAGGAGATGA TGGTGTAAAGCTATGGAACATGATTAATCCTGATAAAAAAACTGAAGTCAAAAGAAATAATAAAGAAGATTTTAA AGATAAATTGGAGCAATACTATCCAATTGATATGGAAAGTTTTAATTCCAACAAACCGAATGTAGGTGACGAAAA AGAGATTGACTTTAAGTTTGCACCTGACACAGACAAAGAACTCTATAAAGAAGATATCATCGTTCCAGCAGGATC TACATCTTGGGGGCCAAGAATAGATTTACTTTTAAAACCCGATGTTTCAGCACCTGGTAAAAATATTAAATCCACG CTTAATGTTATTAATGGCAAATCAACTTATGGCTATATGTCAGGAACTAGTATGGCGATCCAATCGTGGCAGCTT CTACTGTTTTGATTAGACCGAAATTAAAGGAAATGCTTGAAAGACCTGTATTGAAAAATCTTAAGGGAGATGACA AAATAGATCTTACAAGTCTTACAAAAATTGCCCTACAAAATACTGCGCGACCTATGATGGATGCAACTTCTTGGA AAGAAAAAAGTCAATACTTTGCATCACCTAGACAACAGGGAGCAGGCCTAATTAATGTGGCCAATGCTTTGAGAA ATGAAGTTGTAGCAACTTTCAAAAACACTGATTCTAAAGGTTTGGTAAACTCATATGGTTCCATTTCTCTTAAAGA AATAAAAGGTGATAAAAAATACTTTACAATCAAGCTTCACAATACATCAAACAGACCTTTGACTTTTAAAGTTCA GCATCAGCGATAACTACAGATTCTCTAACTGACAGATTAAAACTTGATGAAACATATAAAGATGAAAAATCTCCA GATGGTAAGCAAATTGTTCCAGAAATTCACCCAGAAAAAGTCAAAGGAGCAAATATCACATTTGAGCATGATACT TTCACTATAGGCGCAAATTCTAGCTTTGATTTGAATGCGGTTATAAATGTTGGAGAGGCCAAAAACAAAAATAAA TTTGTAGAATCATTTATTCATTTTGAGTCAGTGGAAGCGATGGAAGCTCTAAACTCCAGCGGGAAGAAAATAAAC TTCCAACCTTCTTTGTCGATGCCTCTAATGGGATTTGCTGGGAATTGGAACCACGAACCAATCTGATTAAATGGG CTTGGGAAGAAGGGTCAAGATCAAAAACACTGGGAGGTTATGATGATGATGGTAAACCGAAAATTCCAGGAACCT TAAATAAGGGAATTGGTGGAGAACATGGTATAGATAAATTTAATCCAGCAGGAGTTATACAAAATAGAAAAGATA AAAATACAACATCCCTGGATCAAAATCCAGAATTATTTGCTTTCAATAACGAAGGGATCAACGTCCATCATCAA GTGGTTCTAAGATTGCTAACATTTATCCTTTAGATTCAAATGGAAATCCTCAAGATGCTCAACTTGAAAGAGGATT AACACCTTCTCCACTTGTATTAAGAAGTGCAGAAGAAGGATTGATTTCAATAGTAAATACAAATAAAGAGGGAGA AAATCAAAGAGACTTAAAAGTCATTTCGAGAGAACACTTTATTAGAGGAATTTTAAATTTCTAAAAGCAATGATGC AAAGGGAATCAAATCATCTAAACTAAAAGTTTGGGGTGACTTGAAGTGGGATGGACTCATCTATAATCCTAGAGG TAGAGAAGAAAATGCACCAGAAAGTAAGGATAATCAAGATCCTGCTACTAAGATAAGAGGTCAATTTGAACCGAT TGCGGAAGGTCAATATTTCTATAAATTTAAATATAGATTAACTAAAGATTACCCATGGCAGCTTTCCTATATTCCT GTAAAAATTGATAACACCGCCCCTAAGATTGTTTCGGTTGATTTTTCAAATCCTGAAAAAATTAAGTTGATTACAA AGGATACTTATCATAAGGTAAAAGATCAGTATAAGAATGAAACGCTACTTTGCGAGAGATCAAAAAGAACATCCTG AAAAATTTGACGAGATTGCGAACGAAGTTTGGTATGCTGGCGCCGCTCTTGTTAATGAAGATGGAGAGGTTGAAA AAAATCTTGAAGTAACTTACGCAGGTGAGGGTCAAGGAAGAAATAGAAAACTTGATAAAGACGGAAATACCATTT ATGAAATTAAAGGTGCGGGAGATTAAGGGGAAAAATCATTGAAGTCATTGCATTAGATGGTTCTAGCAATTTCA CAAAGATTCATAGAATTAAATTGCTAATCAGGTGATGAAAAGGGGATGATTTCCTATTATCTAGTAGATCCTGA TCAAGATTCATCTAAATATCAAAAGCTTGGCGAGATTGCAGAATCTAAATTTAAAAATTTAGGAAATGGAAAAGA GGGTAGTCTAAAAAAAGATACAACTGCGGTAGAACATCATCATCAAGAAAATGAAGAGTCTATTAAAGAAAAAT CTAGTTTTACTATTGATAGAAATATTTCAACAATTAGAGACTTTGAAAATAAAGACTTAAAGAAACTCATTAAAAA GAAATTTAGAGAAGTTGATGKTTTTTACAAGTGAAACTGGTAAGAGAATGGAGGAATACGATTATAAATACGATGA TAAAGGAAATATAATAGCCTACGATGATGGGACTGATCTAGAATATGAAACTGAGAAACTTGACGAAATCAAATC AAAAATTTATGGTGTCTAAGTCCGTCTAAAGATGGACACTTTGAAATTCTTGGAAAGATAAGTAATGTTTCTAAA AATGCCAAGGTATATTATGGGAATAACTATAAATCTATAGAAATCAAAGCGACCAAGTATGATTTCCACTCAAAA ACGATGACATTTGATCTATACGCTAATATTAATGATATTGTGGATGGATTAGCTTTTGCAGGAGATATGAGATTAT TTGTTAAAGATAATGATCAGAAAAAAGCTGAAATTAAAATTAGAATGCCTGAAAAAATTAAGGAAACTAAATCAG AATATCCCTATGTATCAAGTTATGGGAATGTCATAGAATTAGGGGAAGGAGATCTTTCAAAAAACAAACCAGACA ATTTAACTAAAATGGAATCTGGTAAAATCTATTCTGATTCAGAAAAACAACAATATCTGTTAAAGGATAATATCAT TCTAAGAAAAGGCTATGCACTAAAAGTGACTACCTATAATCCTGGAAAAACGGATATGTTAGAAGGAAATGGAGT CTATAGCAAGGAAGATATAGCAAAAATACAAAAGGCCAATCCTAATCTAAGAGCCCTTTCAGAAACAACAATTTA TGCTGATAGTAGAAATGTTGAAGATGGAAGAAGTACCCAATCTGTATTAATGTCGGCTTTGGACGGCTTTAACATT ATAAGGTATCAAGTGTTTACATTTAAAATGAACGATAAAGGGGAAGCTATCGATAAAGACGGAAATCTTGTGACA GATTCTTCTAAACTTGTATTATTTGGTAAGGATGATAAAGAATACACTGGAGAGGATAAGTTCAATGTAGAAGCTA TAAAAGAAGATGGCTCCATGTTATTTATTGATACCAAACCAGTAAACCTTTCAATGGATAAGAACTACTTTAATCC ATCTAAATCTAATAAAATTTATGTACGAAATCCAGAATTTTATTTAAGAGGTAAGATTTCTGATAAGGGTGGTTTT AACTGGGAATTGAGAGTTAATGAATCGGTTGTAGATAATTATTTAATCTACGGAGATTTACACATTGATAACACTA GAGATTTTAATATTAAGCTGAATGTTAAAGACGGTGACATCATGGACTGGGGAATGAAAGACTATAAAGCAAACG GATTCCAGATAAGGTAACAGATATGGATGGAAATGTTTATCTTCAAACTGGCTATAGCGATTTGAATGCTAAAGC AGTTGGAGTCCACTATCAGTTTTATATGATAATGTTAAACCCGAAGTAAACKTTGATCCTAAGGGAAATACTAGT ATCGAATATGCTGATGGAAAATCTGTAGTCTAACATCAATGATAAAAGAAATTAATGGATTCGNTGGTGAGATT CAAGAACAACATATATATAAATGGAAAAGAATATACATCATTAATGATATTAAACAAATATAATAGACATGACA CTAAACATTAAGATTGTTGTAAAAGATTTTGCAAGAAATACAACCGTAAAAGAATTCATTTTAAATAAAGATACG GGAGAGGTAAGTGAATTAAAACCTCATAGGGTAACTGTGACCATTCAAAATGGAAAAGAAATGAGTTCAACGATA GTGTCGGAAGAAGATTTTATTTTACCTGTTATAAGGGTGAATTAGAAAAAGGATACCAATTTGATGGTTGGGAAA TTTCTGGTTTCGAAGGTAAAAAAGACGCTGGCTATGTTATTAATCTATCAAAAGATACCTATAATACCTGTATT CAAGAAAATAGAGGAGAAAAAGGAGGAAGAAAATAAACCTACTTTTGATGTATCGAAAAGAAAGATATACCCAC AAGTAAACCATAGTCAATTAAATGAAAGTCACAGAAAAGAGGATTTACAAAGAGAAGAGCATTCACAACAATTCT GATTCAACTAAGGATGTTACAGCTACAGTTCTGATAAAAACAATATCAGTAGTAAATCAACTACTAACAATCCT AATAAGTTGCCAAAAACTGGAACAGCAAGCGGAGCCCAGACACTATTAGCTGCCGGAATAATGTTTATAGTAGGA ATTTTTCTTGGATTTGAAGAAAAAAAATCAAGATTAA YPVVLADTSSSEDALNISDKEKVAENKEKHENIHSAMETSQFKEKKTAVIKEKEVVSKNPVIDNNTSNEEAKIKEENSN KSQGDYTDSTVNKNTENPKKEDKVVYIAEFKDKESGEKATKELSSLKNTKVLYTYDRTTNGSAIETTPDNLDKIKQIEGIS SVERAQKVQPMMNHARKEIGVEEAIDYLKSINAPFGKNDGRGMVISNIDTGTDYRHKAMRIDDDAKASMRFKKEDL KGTDKNYWLSDKIPHAFNYYNGGKTTVEKYDDGRDYTDPHGMHIAGILAGNDTEQDIKNPNGIDGIAPNAQTFSYKMY SDAGSGFAGDETMTHATEDSIKHNVDVVSVSSGFTGTGLVGEKYWQATRALRKAGIPMVVATCNYATSASSSSWDLVA NNHLKMTDTGNVTRTAAHEDATAVASAKNQTVEFDKVNIGGESFKYRNIGATTDKSKTTTNEDGTKAPSKLKPVYIGK GQDQGLIGLDLRGKIAVMDRIYTKDLKNAFKKAMDKGARAIMVVNTVNYYNRDNWTELPAMGYEADEGTKSQVFSI SGDDGVKLWNMINPDKKTEVKRNNKEDTKDKLEQYYPIDMESFNSNKPNVGDEKEIDTKFAPDTDKELYKEDIIPAG STSWGPRIDLLLKPDVSAPGKNIKSTLNVINGKSTYGYMSGTSMATPIVAASTVLIRPKLKEMLERPVLKNLKGDDKIDL TSLTKIALQNTARPMMDATSWKEKSQYFASPRQQGAGLINVANALRNEVVATTKNTDSKGLVNSYGSISLICEIKGDKK YFTIKLHNTSNRPLTTKVSASATTDSLTDRLKLDETYKDEKSPDGKQIVPEIHPEKVKGANITTEHDTFTIGANSSTDLN AVINVGEAKNKNKFVESFIHPESVEAMEALNSSGKKINFQPSLSMPLMGTAGNWNHEPILDKWAWEEGSRSICTLGGYD DDGKPKIPGTLNKGIGGEHGIDKFNPAGVIQNRKGKNTTSLDQNPELFAFNNEGINAPSSSGSKIANIYPLDSNGNPQDA QLERGLTPSPLVLRSAEEGLISIVNTNKEGENQRDLKVISREHFIRGILNSKSNDAKGTTTSSKLKVWGDLKWDGLIYNPRG REENAPESKDNQDPATKIRGQTEPIAEGQYFYKFKYRLTKDYPWQVSYIPVKIDNTAPKIVSVDTSNPEKIKLITKDTYHK VKDQYKNETLFARDQKEHPEKFGEIANEVWYAGAALVNEDGEVEKNLEVTYAGEGQGRNRKLDKDGNTIYEIKGAG DLRCKIIEVIALDGSSNTKIHRKTANQADEKGMISYYLVDPDQDSSKYQKLGEJAESKTKNLGNGKEGSLKKDTTGVE HHHQENEESIKEKSSFTIDRNISTIRDFENKDLKKLIKKKFREVDDFTSETGKRMEEYDYKYDDKGNIIAYDDGTLEYE TEKLDEIKSKIYGVLSPSKDGHFEILGKISNVSKNAKVYYGNNYKSIEIKATKYDFHSKTMTFDLYANINDIVDGLAFAG DMRLFVKDNDQKKAEIK1RMPEKIKETKSEYPYVSSYGNVIELGEGDLSKNKPDNLTKMESGKIYSDSEKQQYLLKDNII LRKGYALICVTTYNPGKTDMLEGNGVYSKEDIAKJQKANPNLRALSETTTYADSRNVEDGRSTQSVLMSALDGTNIIRYQ VFTFKMNDKGEATDKDGNLWSSKLVLTGKDDKEYTGEDKPNVEATKEDGSMLPIDTTTPVNLSMDKNYTNPSKSNKI YVRNPETYLRGTCISDKGGFNWELRVNESVVGNYLIYGDLHIDNTRDFNIKLNVKDGDIMDWGMKDYKANGTPDKVTD MDGNVYLQTGYSDLNAKAVGVHYQFLYDNVKPEVNIGPKGNTSIEYADGKSVVFNINDKRNNGPDGEIQEQHIYINGIC EYTSPNDIKQIIDKTLNIKIVVKDFARNTTVKETTLNKDTGEVSELKPHRVTVTTQNGKEMSSTTVSSEDFILPVYKGELEK GYQFDGWEISGFEGKKDAGYVINLSKDTFIKPVFKKIEEKKEEENKPThDVSKKKDNPQVNHSQLNESHRKEDLQREEH SQKSDSTKDVTATVLDKNNISSKSTTNNPNKLPKTGTASGAQTLLAAGIMFIVGTTLGLKKKNQDZ ID6 597 bp CTTGAATTAAATAAAAAACGTCATGCGACTAAGCATTTTACTGATAAGCTTGTTGATCCCAAAGATGTGCGTACGG CTATCGAAATTGCAACCTTAGCGCCAAGCGCCCACAACAGCCAGCCTTGGAAATTTGTGGTGGTACGTGAGAAAA ATGCGAACTGGCAAAGTTAGCTTATGGTTCCAATTTTGAACAGGTATCATCAGCGCCTGTAACCATTGCCTTGTT TACAGATACGGACTTAGCCAAACGTGCTCGTAAGATTGCCCGTGTTGGTGGTGCTAATAACTTTTCTGAAGAGCAA CTTCAATATTTTATGAAAAATCTGCCAGCTGAGTTTGCCCGTTACAGTGAGCAACAAGTCAGCGACTACCTAGCTC TCAATGCAGGTTGGTTGCCATGAACTTGGTTCTTGCATTGACAGACCAAGGAATTGGTTCTAACATTATTCTTGG TTTTGACAAATCAAAAGTTAATGAAGTTTTGGAAATCGAAGACCGTTTCCGCCCAGAACTCTTGATCACAGTGGGT TATACAGACGAAAAATTGGAACCAAGCTACCGCTTGCCAGTAGATGAAATCATCGAGAAAAGATAG LELNKKRHATKHTTDKLVDPKDVRTATEIATLAPSAHNSQPWKFVVVREKNAELAKLAYGSNTEQVSSAPVTTALFTDT DLAKRARKIARVGGANNFSEEQLQYTMKNLPAEFARYSEQQVSDYLALNAGLVAMNLVLALTDQGIGSNLILGFDKSK VNEVLEIEDRFRPELLTTVGYTDEKLEPSYRLPVDEIIEKRZ ID7 1401 bp ATGACAGCAATTGATTTTACAGCAGAAGTAGAAAAACGCAAAGAAGACCTCTTGGCTGACTTGTTTAGCCTTTTG GAAATCAATTCAGAACGTGATGACAGCAAGGCTGATGCCCAGCATCCATTTGGGCCTGGTCCAGTAAAAGCCTTG GAGAAATTCCTTGAAATCGCAGACCGCGATGGCTACCCAACTAAGAATGTTGATAACTATGCAGGACATTTTGAG TTTGGTGATGGAGAAGAAGTTCTCGGAATCTTTGCCCATATGGATGTGGTGCCTGCTGGTAGCGGTGGGACACAG ACCCTTACACACCAACTATCAAAGATGGTCGCCTTTATGCGCGCGGGGCTTCGGACGATAAGGGTCCTACAACAG CTTGTTACTATGGTTTGAAAATCATCAAAGAATTGGGTCTTCCAACTTCTAAGAAAGTTCGCTTCATCGTTGGAAC AGACGAAGAATCAGGCTGGGCAGACATGGACTACTACTTTGAGCACGTAGGACTTTGCCAAACCAGATTTCGGTTT CTCACCAGATGCTGAATTTCCAATCATCAATGGTGAAAAAGGAAATATCACGGAATACCTCCACTTTGCAGGAGA AAATACAGGTGTTGCCCGTCTTCACAGCTTTACAGGTGGTTTACGTGAAAATATGGTACCAGAATCAGCAACAGC AGTCGTTTCAGGTGACTTGGCTGACTTGCAAGCTAAACTAGATGCCTTTGTTGCAGAACACAAACTTAGAGGAGA ACTCCAAGAAGAAGCTGGCAAATACAAGGTGACGATCATTGGTAAATCAGCCCACGGTGCTATGCCTGCTTCAGG TGTCAATGGCGCAACTACCTTGCCCTCTTCCTCAGCCAGTTTGGCTTTGCTGGTCCAGCCAAAGACTACCTTGAC ATCGCAGGTAAAATTCTCTTGAACGATCATGAGGGTGAAAATCTTAAGATTGCTCATGTGGATGAAAAGATGGGT GCTCTTTCTATGAATGCCGGCGTCTTCCACTTCGATGAAACAAGTGCTGATAATACCATTGCCCTCAACATCCGCT ATCCAAAAGGAACAAGTCCAGAACAAATCAAGTCAATCCTGAAAACTTGCCAGTTGTTTCTGTTAGCCTGTCTGA ACACGGTCACACGCCTCACTATGTGCCAATGGAAGATCCACTGTGCAAACCTTGTTGAATATCTATGAAAAACA AACTGGCTTTAAAGGTCATGAACAAGTCATCGGTGGTGGAACCTTTGGTCGCTTGCTAGAACGCGGAGTTGCCTA CGGTGCTATGTTCCCAGACTCGATTGATACCATGCACCAAGCCAATGANTTTKTCGCCTTGGATGATCTTTTCCGA GCAGCAGCAATTTATGCCGAAGCTATTTACGAATTGATCAAATAA MTAIDTAEVEKRKEDLLADLTSLLEINSERDDSKADAQHPTGPGPVKALEKTLEIADRDGYPTKNVDNYAGHTEIGD GEEVLGIFAHMDVVPAGSGWDTDPYTTTKDGRLYARGASDDKGPTTACYYGLKIIKELGLPTSKKVRFIVGTDEESGW ADMDYYFEHVGLAKPDFGTSPDAEFPIINGEKGNTTBYLHFAGENTGVARLHSFTGGLRENMVPESATAVVSGDLADL QAKLDAFVAEHKLRGELQEEAGTCYKVTTTGKSAHGAMPASGVNGATYLALFLSQTGFAGPAKDYLDIAGKILLNDHEG ENLKIAHVDEKMGALSMNAGVFHFDETSADNTTALNIRYPKGTSPEQIKSILENLPVVSVSLSEHGHTPHYVPMWPLVQ TLLNIYEKQTGFKGHEQVIGGGTFGRLLERGVAYGAMTPDSIDTMHQANETTALDDLFRAAATYAEATYELIKZ ID8 1617 bp GTGTATACTATTATAAAATCAAATATAAAAAAATTTAGTTTATTAACGATATTTATTGTTGCTGGTCAATTATTGCT AATTTATGCAGCAACTATTAATGCTCTGGTGTTGAATGAATTAATTGCGATGAATTTAGAGCGGTTTTTGAAATTG TCAATCTACCAAATGATTGTCTGGTGTGGGATAATATTCCTTGACTGGGTAGTGAAAAATTATCAGGTTGAAGTGA TCCAAGAGTTTAATCTAGAGATTCGAAATAGAGTTGCCACAGACATCTCTAACTCTACCTATCAAGAATTTCATAG TAAATCATCAGGAACATATCTTTCGTGGCTAAATAATGATGTTCAGACTTTAAATGATCAGGCGTTTAAACAACTT TTTTAGTAATAAAAGGAATTTCTGGTACTATATTTGCAGTTGTGACTCTTAATCACTATCATTGGTCKTTGACTGT AGCCACCTTGTTTTCATTAATGATTATGCTACTTGTACCAAAAATCTTTGCATCGAAAATGCGAGAAGTTAGTCTA AATTTAACTAACCAAAATGAAGCTTTTTTAAAATCTAGTGAGACTATATTGAATGGATTTGATGTGTTAGCGTCCT TGAATCTTTTATATGTATTGCCTAAGAAAATTAAAGAAGCAGGAATTTTATTAAAGATGGTTATACAAAGAAAGA CAACTGTAGAAACGTTAGCAGGCGCTATTAGCTTCTTTCTCAATATTTTTTTTCAGATATCTCTCGTTTTTTTAAT GGCTATCTTTGCAATAAAAGGAATAGTGAAAATTGGTACTATTGAAGCAATAGGAGCACTAACAGGTGTTATTTTT ACAGCGCTAGGTGAATTAGGAGGTCAATTATCCTCTATTATTGGTACGAAGCCTATTTTTTTAAAATTGTATTCAA TTAATCCAATTGAGTCAAATAAAATGAATGATATCGAACCAAATGAGGTGAATAGAGATTTTCCGTTATATGAAG CAAAAAATATTTGCTATAAGTATGGAGATAAAGAAATATTAAAAAACTTAAATTTTTGTTTTCAACGTAATGAAAA GTATTTAATTTTAGGTGAAAGTGGAAGCGGGAAATCTACATTATTAAAATTATTGAATGGCTTTTTGAGAGATTAT AGTGGAGAATTGCGATTCTGCGGGGATGATATAAAAAAAACCTCCTATTTAAATATGGTTTCGAATGTTCTATATG TAGATCAAAAAGCTTATTTGTTTGAAGGTACGATTAGAGATAATATTTTATTGGAAGAAAATTATACTGATGAAGA AATACTACAGTCTTTAGAGCAAGTTGGTTTGAGTGTAAAAGATTTTCCTAATAACATTTTAGATTATTATGTTGGT GATGATGGGAGATTACTGTCAGGAGGGCAGAAACAAAAAATTACTTTAGCTAGAGGGCTAATTAGAAATAAGAA AATAGTATTAATTGACGAGGGAACTTCTGCTATCGATAGGAGAACTCGTTAGCGATTGAACGTAAGATATTAGA TAGAGAGGATTTGACTGTCATTATTGTTACCCATGCTCCGCATCCGGAACTTAAACAATATTTTACTAAGATATAT CAATTTCCAAAGGATTTTATTTAA MYTIIKSNLIKTSLLTTIVAGQLLLIYAATINALVLNELIAMNLERFLKTTYQMIVWCGTTTLDWVTNYQVEVIQEFNL EIRNRVATDISNSTYQETHSKSSGTYLSWLNNDVQTLNDQAFKQLFLVIKGISGTIFAVVTLNHYHWSLTVATLTSLMIM LLVPKIFASKMREVSLNLTNQNEAFLKSSETTLNGFDVLASLNLLYVLPiIKEAGILLKMVIQRKTTVETLAGAISFFLNI FFQISLVFLTGYLAIKGIVKIGTIEIGALTGVIFTALGELGGQLSSIIGTKPIFLKLYSINPIESNKMNDIEPNEVNRDFPLYE AKNICYKYGDKEILKNLNFCFQRNEKYLILGESGSGKSTLLKLLNGFLRDYSGELPTTCGDDIKKTSYLNMVSNVLYVDQ KAYLTEGTTRDNILLEENYTDEEILQSLEQVGLSVKDTPNNILDYYVGDDGTLLSGGQKQKTTLARGLIPTKKIVLIDEGT SAIDRRTSLAIERKILDRBDLTVIIVTHAPHPELKQYFTKIYQFPKDFIZ ID9 705 bp ATAACAGTTAAACAGATTATGGACGAAATAGCCGTTTCAGATATGACTGCAAGGCGATATTTACAGGAATTAGCT GATAAAGATTTGCTGATTCGTGTGCATGGTGGAGCTGAAAAACTTCGAACCAACTCCCTTTTGACTAATGAGCGAT CAAATATTGAAAAACAAGCCCTCCAAACGGCAGAAAAACAAGAAATAGCCCATTTTGCAGGCAGTCTAGTAGAA GAAAGAGAAACTATTTTCATTGGACCAGGAACAACATTAGAGTTTTTTGCGCGTGAGTTGCCTATTGACAATATCC GCGTCGTAACCAACAGTCTACCTGTTTTTCTGATTTTAAGCGAACGAAAATTAACAGATTTGATTTTAATAGGTGG AAATTATCGCGATATTACAGGTGCTTTTGTTGGTACATTGACCCTACAAAATCTCTCTAATCTCCAATTTTCTAAA GCTTTCGTTAGCTGTAATGGTATTCAAAACGGAGCTCTAGCTACTTTTAGCGAGGAAGAGGGAGAGGCTCAACGC ATCGCTTTAAATAATTCTAATAAAAAATATTTACTCGCAGATCATAGCAAGTTCAATAAGTTTGATTTTTATACTTT TTATAATGTATCAAATCTTGATACTATTGTTTCAGATTCTAAACTAAGTGATTCAATCCTTTTTAAGCTATCTAAAC ACATTAAAGTCATCAAGCCTTAA ITVKQIMDEIAVSDMTARRYLQELADKDLLIRVHGGAEKLRTNSLLTNETNIEKQALQTTKQEIAHFAGSLVEERETI FIGPTTLEFFARELPIDNIRVVTNSLPVFLILSERKLTDLILIGGNYRDITGAFVGTLTLQNLSNLQFSKAFVSCNGIQNGA LATFSEEEGEAQRIALNNSNKKYLLADHSKFNKFDFYTFYNVSNLDTIVSDSKLSDSILFKLSKHIKVIKPZ ID10 483 bp ATGACTGAGTTTTCGTTAGATCTTCTTCTAGAAGCCATTAAACTAGCTCGTTGGACCTACTACTATCACTTGAAAC AGCTAGACAAAACAGATAAAGACCAAGAGCTTAAAACTGAAATTCAATCCATCTTTATCGAACACAAGGGAAATT ATGCTTATCGCCGGGTTCATTTAGAACTAAGAAATCGTGGTTATCTGGTAAATCATAAAAGAGTTCAAGGCTTGaT GAAAGTACTCAATTTACAAGCTAAAATGCGAAAGAAACGAAAATATTCTTCTCATAAAGGAGACGTTGTAAGAA GGCAGAGAATCTCATTCAAGCCCAATTTGAAGGCTCTAAAACAATGGAAAAGTGCTACACAGATGTGACTGAATT TGCCATTCCAGCAAGTACTCAAAAGCTTTACTTATCACCAGTTTTAGATGGCTTTAACAGCGAAATTATTGCTTTT AATCTTTCTTGTTCGCCTAATTTAGAATAA MTEFSLDLLLEAIKLARWTYYYHLKQLDKTDKQELKTEIQSIFIEHKGNYAYRRVHLELRNRGYLVNHKRVQGLMK VLNLQAKMRKKRKYSSHKGDVGKKAENLIQAQFEGSKTMEKCYTDVTEFAIPASTQKLYLSPVDGFNSEIIAFNLSCS1 PNLEZ ID14 1266 bp CCAGGATTGGTACCGTTGCAAGTGGTGTGCCTTTCCTCCTAAAGGAAAATGGAGGAAAAATCAATCAATCAGCA CATTCAGATATCAAAGTTGCTAAGGTATTGGTCAAGGATGAAGATGAAAAAAATCGCTTGCTTGCAGCAGGGAAT GACTTTAACTTTGTAACCAATGTGGATGATATTTATCAGACCAGGATATTACTATCGTAGTGGAATTGATGGGGC GTATTGAGCCTGCTAAAACCTTTATCACTCGTGCCTTGGAAGCTGGAAAACACGTTGTTACTGCTAACAAGGACCT TTTAGCTGTCCATGGCGCAGAATTGCTAGAAATCGCTCAAGCTAACAAGGTAGCACTTTACTACGAAGCAGCAGT TGCTGGTGGGATTCCAATTCTTCGTACTTTAGCAAATTCCTTGGCTTCTGATAAAATTACGCGCGTCTTGGAGTA GTCAACGGAACTTCCAACTTCATGGTGACCAAGATGGTGGAAGAAGGCTGGTCTTACGATGATGCTCTTGCGGAA GCACAACGTCTAGGATTTGCAGAAAGCGATCCGACGAATGACGTAGATGGGATTGATGCAGCCTACAAGATGGTT ATTTTGAGCCAATTTGCCTTTGGCATGAAGATTGCCTTTGATGATGTAGCCCACAAGGGAATCCGCAATATTCAC CAGAAGACGTAGCTGTAGCTCAAGAGCTTGGTTACGTAGTGAAATTGGTTGGTTCTATTGAGGAAACTTCTTCAGG TATTGCTGCAGAAGTGACTCCAACCTTCCTACCTAAAGCGCACCCACTTGCTAGTGTGAATGGCGTAATGAACGCT GTCTTTGTAGAATCTATCGGTATTGGTGAGTCTATGTACTACGGACCAGGTGCGGGTCAAAAACCAACTGCAACA AGTGTTGTAGCTGATATTGTCCGTATCGTTCGTCGTTTGAATGATGGTACTATTGGCAAAGACTTCAACGAATATA GCCGTGACTTGGTCTTGGCAAATCCTGAAGATGTCAAAGCAAACTACTATTTCTCAATCTTGGCTCTAGACTCAAA AGGTCAGGTCTTGAAGTTGGCTGAAATCTTCAATGCTCAAGATATTTCCTTTAAGCAAATCCTTCAAGATGGCAAA GAGGGTGACAAGGCGCGTGTCGTTATCATCACACACAAGATTTAATAAAGCCCAGCTTGAAAATGTCTCAGCTGAA TTGAAGAAGGTTTCAGAATTCGACCTCTTGAATACCTTCAAGGTGCTAGGAGAATAA PGFGTVASGVPFLLKENGGKINQSAMSDIICVAKVLVKDEDEKNRLLAAGNDFNFVTNVDDILSDQDITIVVELMGRIEP AKTFITRALEAGKHVVTANKDLLAVHGAELLEIAQANKVALYYEAAVAGGIPILRTLANSLASDKTTRVLGVVNGTSNF MVTKMVEEGWSYDDALAEAQRLGFAESDPTNDVDGIDAAYKMVlLSQFAFGMKIAFDDVAHKGIRNITPEDVAVAQE LGYVVKLVGSIEETSSGIAAEVTPTFLPKAHPLASVNGVMNAVFVESIGIGESMYYGPGAGQKPTATSVVADIVRIVRRL NDGTTGKDFNEYSRDLVLANPEDVKANYYFSILALDSKGQVLKLAETTNAQDISFKQILQDGKEGDKARVVITTMKINKA QLENVSAELKKVSETDLLNTTKVLGEZ ID16 1725 bp ATGAAACACCTATTATCTTACTTCAAACCCTACATCAAGGAATCAATTTTAGCCCCCTTGTTCAAGCTGTTAGAAG CTGTTTTTGAGCTCTTGGTTCCCATGGTGATTGCTGGGATTGTTGACCAATCTTTACCTCAGGGAGATCAAGGTCA TCTCTGGATGCAGATTGGCCTGCTCCTTATCTTTGCAGTAKTTGGCGTTTTAGTGGCCTTGATAGCTCAATTTTACT CAGCAAAGGCAGCAGTAGGTTCTGCTAAGGAATTGACAAACGATCTTTATCGTCATATTCTTTCCTTGCCCAAGGA CAGCAGAGACCGTCTGACAACTTCTAGTTTGGTCACTCGCTTGACTTCGGATACCTACCAGATTCAGACTGGTATC AATCAATTCCTGCGTCTCTTnTACGAGCGCCCATTATCGTTTTTGGTGCCATTTTTATGGCTTATCGAATCTCAGC TGAGTTGACTTTCTGGTTCTAGTCTTGGTTGCCATTTTGACCATTGTCATTGTAGGGTTATCTCGATTGGTCAATC CTTTCTACAGTAGTCTCAGAAAGAAAACGGACCAACTGGTTCAGGAAACGCGCCAGCAATTGCAAGGGATGCGGG TTATTCGTGCTTTTGGTCAAGAAAAACGAGAGTTACAGATTTTTCAAACCCTTAACCAAGTTTATGCTAGATACA AGAAAAGACAGGTTCTGGTCTAGTTTATTAACACCTCTGACCTATCTGATTGTCAATGGAACTCTTCTCGTTATT ATCTGGCAAGGCTATATTTCAATTCAAGGAGGAGTGCTCAGTCAAGGTGCTCTCATTGCTCTTATCAATTACCTCT TACAGATTTTGGTGGAATTGGTCAAGCTAGCCATGTTGATCAATTCCCTCAACCAGTCCTATATCTCAGTCAAGCG AATCGAGGAAGTCTTTGTTGAGGCTCCAGAGGATATCCATTCAGAGTTAGAACAAAAGCAAGCTACCAGAGATAA GGTTTTACAAGTCCAAGAATTGACCTTTACCTATCCTGATGCGGCCCAGCCTTCTCTGAGATACATTTCCTTTGAT ATGACTCAAGGACAAATTCTAGGTATCATCGGGGGAACTGGTTCTGGTAAATCAAGCTTGGTGCAACTCTTACTTG GACTTTATCCAGTAGACAAGGGGAACATTGACCTTTATCAAAATGGACGTAGTCCTCTTAATTTGGAGCAGTGGC GGTCTTGGATTGCCTATGTACCTCAAAAGGTCGAACTCTTTAAAGGAACCATTCGTTCCAACTTGACTCTAGGTTT CAATCAAGAAGTATCTGACCAGGAACTCTGGCAGGCCTTGGAGATTGCGCAAGCTAAGGATTTTGTCAGTGAAAA GGAAGGACTCTGGATGCTCTAGTTGAGGCAGGGGGGCGAAATTTCTCAGGTGGACAAAAACAAAGATTGTCTAT CGCCCGAGCAGTCTTGCGCCAGGCTCCGTTTCTCATCCTAGATGATGCAACCTCGGCACTGGATACCATTACAGAG TCCAAGCTCTTGAAAGCTATTAGAGAAAATTTTCCAAACACGAGCTTAATTTTGATCTCTCAACGAACCTCAACTT TACAGATGGCGGACCAGATTCTCCTCTTGGAAAAAGGTGAGTTGCTAGCTGTTGGCAAGCACGATGACTTGATGA AATCCAGCCAAGTCTATTGTGAAATCAATGCATCCCAACATGGAAAGGAGGACTAG MKHLLSYTKPYIESILAPLFKLLEAVFELLVPMTTAGIVDQSLPQGDQGHLWMQIGLLLIFAVIGVLVATIAQFYSAKA AVGSAKELTNDLYRHILSLPKDSRDRLTTSSLVTRLTSDTYQIQTGINQTLRLFLRAPIIVTGATTMAYRISAELTWFLVL VAILTIVIVGLSRLVNPFYSSLRKKTDQLVQETRQQLQGMRVIRAFGQEKRELQTTQTLNQVYARLQEKTGTWSSLLTPL TYLIVNGTLLVIIWQGYISIQGGVLSQGALIALINYLLQILVELVKLAMLINSLNQSYISVKRIEEVTVEAPEDIHSELEQKQ ATRDKVLQVQELTFTYPDAAQPSLRYISFDMTQGQILGIIGGTGSGKSSLVQLLLGLYPVDKGNIDLYQNGRSPLNLEQ WRSWIAYVPQKVELTKGTTRSNLTLGTNQEVSDQELWQALE1AQAKDTVSEKEGLLDALVEAGGRNFSGGQKQRLSIA RAVLRQAPTLILDDATSALDTTTESKLLKATRBNTPNTSLILISQRTSTLQMADQILLLEKGELLAVGKHDDLMKSSQVYC EINASQHGKEDZ ID18 1224 bp ATGAAACGTTCTCTCGACTCAAGAGTCGATTACAGTTTGCTCTTGCCAGTATTTTTCTACTGGTCATCGGTGTGGT GGCTATCTATATAGCCGTTAGTCATGATTATCCCAATAATATTCTTGCCCATTTTAGGGCAGCAGGTCGCTGGATT GCCTTGGGGCTTGTGATTGGTTTTGTGGTCATGCTCTTTAATACAGAATTTCTTTGGAAGGTGACCCCCTTTCTATA TATTTTAGGCTTGGGACTTATGATCTTGCCGATGTATTTTATAATCCAAGCTTAGTTGCATCAACGGGTGCCAAA AACTGGGTATCAATAAATGGAATTACCCTATTCCAACCGTCAGAATTTATGAAGATATCCTATATCCTCATGTTGG CTCGTGTCATTGTCCAATTTACAAAGAAACATAAGGAATGGAGACGCACGGTTCCGCTGGACTTTTTGTTAATTTT CTGGATGATTCTCTTTACCATTCCAGTCCTAGTTCTTTTAGCACTTCAAAGTGACTTGGGGACGGCTTTGTTTTTG TAGCCATTTTCTCAGGAATCTTTTATTATCAGGGGTTTCTTGGAAAATTATTATCCCAGTATTTGTGACTGCTGTA ACAGGAGTTGCTGGTTTCTTAGCTATCTTTATTAGCAAGGACGGACGAGCTTTTCTTCACCAGATTGGAATGCCGA CCTACCAAATTAATCGGATTTTGGCTTGGCTCAATCCCTTTGAGTTTGCCCAAACAACGACTTACCAGCAGGCTCA AGGGCAGATTGCCATTGGGAGTGGTGGCTTATTTGGTCAGGGATTTAATGCTTCGAATCTGCTTATCCCAGTTCGA GAGTCAGATATGATTTTTACGGTTATTGCAGAAGATTTTGGCTTTATTGGCTCTGTCCTGGTTATTGCCCTCTATCT CATGTTGATTTACCGTATGTTGAAGATTACTCTTAAATCAAATAACCAGTTCTACACTTATATTTCCACAGGTTTGA TTATGATGTTGCTCTTCCACATCTTTGAGAATATCGGTGCTGTGACTGGACTACTTCCTTTGACGGGGATTCCCTTG CCTTTCATTTCGCAAGGGGGATCAGCTATTATCAGTAATCTGATTGGTGTTGGTTTGCTTTTATCGATGAGTTACCA GACTAATCTAGCTGAAGAAAAGAGCGGAAAAGTCCCATTCAAACGGAAAAAGGTTGTATTAAAACAAATTAAATA A MKRSLDSRVDYSLLLPVTTLLVIGVVATYIAVSHDYPNNILPILGQQVAWIALGLTTGTVVMLTNTETLWKVTPTLYILGL GLMILPIVFYNPSLVASTGAKNWVSINGITLFQPSEFMKISYILMLARVIVQFTKKHKEWRJTTVPLDTLLTTWMILTPVL VLLALQSDLGTALVFVAIFSGIVLLSGVSWKIIIPVFVTAVTGVAGFLAIFISKDGRAFLHQIGMPTYQINRILAWLNPFEF AQTTYQQAQGQIAIGSGGLFGQGFNASNLLIPVRESDMIFTVIAEDFGFIGSVLVIALYLMLIYRMLKITLKSNNQFYTY ISTGLIMMLLFHIFENIGAVTGLLPLTGIPLPFISQGGSAIISNLIGVGLLISMSYQTNLAEEKSGKVPFKRKKVVLKQIKZ ID22 987 bp ATGGTGGCTAAGAAAAAAATCTTATTTTTTATGTGGTCTTTTTCTCTTGGAGGTGGTGCAGAGAAGATTCTATCAA CCATTGTTTCAAATCTGGATCCAGAAAAGTATGATATTGATATTCTTGAAATGGAGCACTTTGACAAGGGATATGA ATCTGTTCCAAAGCATGTACGCATTTTAAAATCCCTTCAAGATTATCGCCAAACCAGATGGTTACGAGCTTTTTTG TGGAGAATGAGAATTTATTTTCCAAGACTGACTCGTCGTTTGCTTGTAAAAGATGATTATGATGTTAAGTTTCTT TTACCATTATGAATCCACCACTGTTGTTCTCTAAAAGAAGAGAAGTCAAGAAGATATCTTGGATTCATGGAAGTAT TGAAGAACTTCTTAAGGATAGCTCTAAAAGAGAATCACATAGAAGCCAGTTGGATGCTGCGAATACAATTGTAGG GATTTCAAAAAAGACCAGCAATTCTATCAAGGAAGTTTATCCAGATTATACTTCTAAATTACAGACAATCTACAAT GGATATGATTTTCAGACTATTCTAGAAAAATCTCAAGAGAAGATCGATATCGAGATTGCTCCTCAAAGTATCTGTA CTATCGGACGGATTGAGGAAAATAAGGGTTCTGACCGTGTAGTGGAAGTGATACGATTATTACACCAAGAGGGAA AAAACTATCATCTCTATTTTATCGGGGCTGGTGATATGGAAGAGGAATGAAAAAATTCGAGTCAAAGAGTATGGGA TTGAGGACTATGTACATTTCCTTGGTTATCAAAAAAATCCTTATCAGTATCTATCTCAGACGAAAGTTCTTTTGTCT ATGTCTAAACAAGAAGGTTTTCCTGGAGTGTATGTGGAGGCCTTGAGTCTGGGACTCCCTTTTATCTCTACGGACG TTGGAGGGGCTGAGGAATTATCCCAAGAAGGACGATTTGGACAAATCATTGAGAGCAATCAAGAGGCAGCTCAG GCGATTACTAATTACATGACTTCTGCCTCAAACTTTGATGTCGATGAGGCTAGCCAATTCATTCAACAATTTACAA TTACAAAACAAATCGAACAAGTAGAAAAACTATTAGAGGAGTAG MVAKKKILFFMWSFSLGGGAEKILSTIVSNLDPEKYDIDILEMEHFDKGYESVPKHVRILKSLQDYRQTRWLRAFLWRM RIYFPRLTRRLLVKDDYDVEVSFTIMNPPLLFSKRREVKKISWIHGSIEELLKDSSKRESHRSQLDAAVTIVGISKKTSNSIK EVYPDYTSKLQTTYNGYDTQTTLECSQEKIDIE1APQSTTTGREENKGSDRVVETTPTLHQEGKNYKLYTGAGDMEEEEL KKRVKEYGIEDYVHFLGYQKNPYQYLSQTKVLLSMSKQEGFPGVYVEALSLGLPFISTDVGGAEELSQEGRFGIIESNQ EAAQAITNYMTSASNFDVDEASQFIQQVTTTKQIEQVEKLLEEZ ID23 1434 bp ATGGAAACTGCATTAATTAGTGTGATTGTGCCAGTCTATAATGTGGCGCAGTACCTAGAAAAATCGATAGCTTCCA TTCAGAAGCAGACCTATCAAAATCTGGAAATTATTCTTGTTGATGATGGTGCAACAGATGAAAGTGGTCGTTTGTG TGATTCAATCGCTGAACAAGATGACAGGGTGTCAGTGCTTCATAAAAAGAACGAAGGATTGTCGCAAGCACGAAA TGATGGGATGAAGCAGGCTCACGGGGATTATCTGATTTTTATTGACTCAGATGATTATATCCATCCAGAAATGATT CAGAGCTTATATGAGCAATTAGTTCAAGAAGATGCGGATGTTTCGAGCTGTGGTGTCATGAATGTCTATGCTAATG ATGAAAGCCCACAGTCAGCCAATCAGGATGACTATTTTGTCTGTGATTCTCAAACATTTCTAAAGGAATACCTCAT AGGTGAAAAAATACCTGGGACGATTTGCAATAAGCTAATCAAGAGACAGATTGCAACTGCCCTATCCTTTCCTAA GGGGTTGATTTACGAAGATGCCTATTACCATTTTGATTTAATCAAGTTGGCCAAGAAGTATGTGGTTAATACTAAA CCCTATTATTACTATTTCCATAGAGGGGATAGTATTACGACCAAACCCTATGCAGAGAAGGATTTAGCCTATATTG ATATCTACCAAAAGTTTTATAATGAAGTTGTGAAAAACTATCCTGACTTGAAAGAGGTCGCTTTTTTCAGATTGGC CTATGCCCACTTCTTTATTCTGGATAAGATGTTGCTAGATGATCAGTATAAACAGTTTGAAGCCTATTCTCAGATT CATCGTTTTTTAAAAGGCCATGCCTTTGCTATTTCTAGGAATCCAATTTTCCGTAAGGGGAGAAGAATTAGTGCTT TGGCCCTATTCATAAATATTTCCTTATATCGATTCTTATTACTGAAAAATATTGAAAAATCTAAAAAATTACATTA G METALISVIVPVYNVAQYLEKSIASIQKQTYQNLEIILVDDGATDESGRLCDSIAEQDDRVSVLHKKNEGLSQARNDGM KQAHGDYLIFIDSDDYIHPEMIQSLYEQLVQEDADVSSCGVMNVYANDESPQSANQDDYFVCDSQTFLKEYLIGEKIPG TICNKLIKRQIATALSFPKGLIYEDAYYHFDLIKLAKKYVVNTKPYYYYFHRGDSITTKPYAEKDLAYIDIYQKTYNEVV KNYPDLKEVAFFRLAYHFFILDKMLLDDQYKFEAYSQIHRFLKGHAFAISRNPIFRKGRRISALALFINISLYRFLLLK NIEKSKKLHZ ID24 735 bp ATGAGAATCAAAGAGAAAACCAATAATATTAATGGAGGAATAAAAAATGTAAGTAAGCATTATGGTCATTCAATC ATTCTCAAAGATATAAATTTTGCACTTAACAAGGGTGAAATTGTTGGTCTAGCAGGGAGAAATGGAGTTGGTAAG AGTACGTTGATGAAAATTCTTGTTCAGAATAATCAACCGACTTCAGGTAATATTATAAGCAGTGATAATGTTGGGT ATTTAATCGAAGAACCAAAATTATTTTTATCTAAAACAGGTTTAGAGAATTTAAAATATTTGTCAAATTTATATGG TGTTGACTACAATCAAGAAAGATTTAGATGTTTGATCCAAGAGTTAGATTTGACTCAGTCTATTAATAAAAAAGTA AAGACCTATTCTTTGGGTACAAAACAAAAATTAGCTTTGCTTCTAACTCTCGTTACGGAACCTGATATATTGATTT TAGATGAACCGACTAATGGTTTAGATATTGAATCATCACAAATAGTTTTAGCGGTTCTAAAAAAATTAGCTTTACA TAGATGAACCGACTAATGGTTTAGATATTGAATCATCACAAATAGTTTTAGCGGTTCTAAAAAAATTAGCTTTACA TGAAAATGTGGGAATTTTAATATCGAGTCATAAATTAGAAGACATTGAAGAAATTTGTGAGAGAGTTCTTTTCTTG GAGAACGGGCTTTTGACATTTCAAAAAGTAGGAAAAGATAGTCATAATTTCTTGTTTGAGATAGCTTTTTCATCAG CTACAGATAGAGACATTTTCATTACCAAACAAGAATTGGGATATTGTTTAG MRIKEKTNNINGGIKNVSKHYGHSIILKDINTALNKGEIVGLAGRNGVGKSTLMKILVQNNQPTSGNLISSDNVGYLIEEP KLFLSKTGLENLKYLSNLYGVDYNQERTRCLIQELDLTQSINKKVKTYSLGTKQKLALLLTLVTTPDILILDEPTNGLDIE SSQIVLAVLKKLALHENVGILISSHKLEDIEEICERVLFLENGLLTFQKVGKDSHNLFEIAFSSATDRDIFITKQEFWDIVZ ID25 1704 bp ATGACTGAATTAGATAAACGTCACCGCAGTAGCATTATGACAGCATGGTTAAATCACCTAACCGTGCTATGCTTC GTGCGACTGGTATGACAGATAAGGACTTGAAACATCGATTGTGGGAGTGATTTCGACTTGGGCGGAAAATACAC CATGTACATTCACTTGCATGATTTCGGGAAACTGGCTAAAGAAGGTGTCAAATCTGCAGGCGCTTGGCCTGTAC AGTTTGGAACCATTACCGTAGCGGACGGGATCGCTATGGGAACGCCTGGTATGCGTTTCTCTCTAACATCTCGTGA CATCATCGCGGACTCCATCGAGGCGGCTATGAGTGGTCACAACGTGGATGCCTTCGTCGCTATCGGTGGCTGTGA CAAGAACATGCCTGGATCTATGATTGCTATTGCTAATATGGATATCCCAGCTATTTTCGCCTATGGTGGAACTAT GCACCGGGAAATCTTGATGGTAAAGATATCGACTTGGTTTCTGTCTTGAAGGTATCGGAAAAATTGAACCACGGT GACATGACAGCTGAGGACGTGAAACGTCTTGAATGTAATGCCTGCCCTGGCCCTGGTGGTTGTGGTGGTATGTAT ACTGCTAATACCATGGCAACTGCTATCGAAGTTCTAGGGATGAGTTTGCCAGGGTCATCCTCTCACCCAGCTGAAT CAGTGATAAGAAAGAAGATATCGAAGCAGCAGGACGTGCTGTTGTTAAGATGTTGGAACTTGGTCTCAAATCCAT CAGATATCTTGACTCGTGAAGCCTTTGAAGATGCTATCACTGTAACGATGGCTCTCGGTGGTTCTACAAACGCCAC TCTTCACTTGCTCGCCATTGCCCATGCCGCAAATGTTGACTTGTCACTTGAGGACTTCAATACGATTCAAGAACGT GTGCCTCACTTGGCCGACTTGAAACCATCTGGTCAGTATGTCTTCCAAGACCTCTACGAAGTCGGTGGTGTCCCTG CGGTTATGAAGTATTTGTTGGCAAATGGTTTCCTTCACGGAGATCGCATCACATGTACTGGTAAGACTGTAGCTGA AAACTTGGCTGACTTTGCAGACTTGACTCCAGGCCAAAAAGTTATCATGCCACTTGAAAATCCAAAACGTGCGGA TGGTCCGCTTATCATCTTGAACGGGAACCTTGCTCCTGACGGTGCAGTTGCCAAGGTATCAGGTGTTAAAGTGCT CGTCACGTTGGGCCAGCTAAGGTCTTTGACTCAGAAGAAGATGCGATTCAGGCCGTTCTGACAGATGAAATCGTT GATGGCGATGTAGTCGTTGTTCGTTTTGTTGGACCTAAAGGTGGTCCTGGTATGCCTGAGATGTATCACTTCTTC AATGATTGTTGGTAAAGGTCAGGGAGATAAGGTGGCCCTCTTGACGGACGGACGTTTCTCTGGTGGTACTTATGGT CTGGTTGTTGGACATATCGCTCCTGAAGCTCAGGATGGTGGACCAATTGCCTATCTCCGTACCGGCGATATCGTTA CGGTTGACCAAGATACCAAAGAAATTTCTATGGCCGTATCCGAAGAAGAACTTGAAAAACGCAAGGCAGAAACA ACCTTGCCACCACTTTACAGCCGTGGTGTCCTCGGTAAATATGCCCACATCGTATCATCTGTCACGCGGAGCCG TGACAGACTTCTGGAATATGGACAAGTCAGGTAAAAAATAA MTELDKRHRSSIYDSMVKPNRAMLRATGMTDKDFETSIVGVISTWAENTPCNIHLHFGKLAKEGVKSAGAWPVQPG TITVADGTAMGTPGMRTSLTSRIIADSIEAAMSGHNVDATVATGGCDKNMPGSMTIAIANMDIPATTAYGGTTAPGNLT KDIDLVSVFEGIGKWNHGDMTAEDVKRLECNACPGPGGCGGMYTANTMATAIEVLGMSLPGSSSHPAESADKKEDIE AAGRAVVKMLELGLKPSDILTREATEDATTVTMALGGSTNATLHLLATAHAANVDLSLEDRATTQERVPHLADLKPSGQ YVPQDLYEVGGVPAVMKYLLANGTLHGDRTTCTGKTVAENLADTADLTPGQKVIMPLENPKRADGPLIILNGNLAPDG AVAKVSGVKVRRNVGPAKVPDSEEDATQAVLTDEIVDGDVVVVILIWGPKGGPGMPEMLSLSSMIVGKGQGDKVALLT DGRTSGGTYGLVVGHIAPEAQDGGPIAYLRTGDIVTVDQDTKEISMAVSEEELEKPKAETTLPPLYSRGVLGKYAHIVSS ASRGAVTDTWNMDKSGKKZ ID26 274 bp ATGTTATAATAAAAATAAAGAATTTAAGGAGAAATACAATATGTCAATTTTTATTGGAGGAGCATGGCCATATGC AAACGGTTCGTTACATATTGGTCACGCGGCAGCGCTTTACCGGGGGATATTCTTGCkALGATACTATCGTCAGAA GGGAGAGGAAGTTTTATATGTTTCTGGAAGTGATTGTAATGGAACCCCTATTTCTATCAGAGCTAAAAAAGAAAA TAAGTCTGTGAAAGAAATTGCTGATTTTTATCATAAGGANTTTAATCCA CYNKKETKEKYNMSTTGGAWPYANGSLHIGHAAALLPGDILARYYRQKGEEVLYVSGSDCNGTPISIPTKJCENKSVK ETADTYHKETNP ID28 1065 bp ATGACAACATTATTTTCAAAAATTAAAGAAGTAACAGAACTTGCTGCAGTCTCAGGTCATGTGCGCCTGTCCGT GCTTATCTCGTGAAAAGTTGACACCGCATGTGGATGAAGTGGTGACAGATGGCTTGGGTGGTATTTTTGGTATT AACATTCAGAAGCTGTGGATGCACCGCGCGTCTTGGTCGCTTCTCATATGGACGAAGTGGTTTATGGTCAGCGA AATCAAGCCAGATGGTACCTTCCGTGTCGTAGAAATCGGCTGGCTGGAACCCCATGGTGGTTAGCAGCTCGTTT CAAACTCTTGACTCGTGATGGTCATGAAATTCCTGTGATTTCAGGTTCTGTTCCTCCGCATTTGACTCGTGGAAAC GGGGGACCAACCATGCCAGCCATTGCCGATATCGTTTTTGATGGTGGTTTTGCGGACAAGGCTGAGGCAGAAAGT TTTGGCATCCGTCCTGGTGATACCATTGTACCAGATAGTTCTGCATTTTGACAGCCAATGAAAAAAATATCATCT CAAAAGCTTGGGATAACCGCTACGGTGTCCTCATGGTAAGCGAGCTAGCTGAAGCTTTATCGGGTCAAAAACTCG GCAATGAACTCTATCTGGGTTCTAACGTCCAAGAAGAAGTTGGTCTGCGTGGCGCTCATACCTCTACAACCAAGTT TGACCCAGAAGTCTTCCTCGCAGTTGATTGCTCACCAGCAGGTGATGTCTACCGTGGTCAAGGCAAGATTGGAGA TGGAACCTTGATTCGTTTCTATGATCCAGGTCACTTGCTCTCCCAGGGATGATGGATTTCCTTTTGACAACGGCT GAAGAAGCTGGTATCAAGTACCAATACTACTGTGGTAAAGGCGGAACAGATGCAGGTGCAGCTCATCTGAAAAAT GGTGGTGTCCCATCAACAACTATCGGTGTCTGCGCTCGTTATATCCATTCTCACCAAACCCTCTATGCAATGGATG ACTTCCTAGAAGCGCAAGCTTTCTTACAAGCCVTGGTGAAGAAATTGGATCGTTCAACGGTTGATTTGATTAAACA TATTAA MTTLFSKIKEVELAAVSGHEAPVRAYLREKLTPHVDEVVTDGLGGIFGIHSHEAVDAPRVLVASHMDEVGTMVSEIKP DGTFRVVEIGGWNPMVVSSQRFKLLTRDGHEIPVISGSVPPHLTRGKGGPTMPAIADIVFDGGFADKAEAESFGIRPGDT IVPDSSATLTANEKNIISKAWDNRYGVLMVSELEALSGQKLGNELYLGSNVQEEVGLRGAHTSTTKYDPEVTLAVTTS PAGDVYGGQGKIGDGTLIRTYDPGHLLLPGMKDTLLTTAEEAGIKYQYYCGKGGTDAGAAHLKNGGVPSTTTGVCARY IHSHQTLYAMDDTLEAQARTQALVKKLDRSTVDLIKHYZ ID31 1182 bp ATGGAATTTTCTATGAAATCAGTCAAAGGACTACTCTTTATCATAGCTAGTTTTATCTTGACTCTTTTGACTTGGAT GAACACTTCTCCCCAATTCATGATTCCAGGACTAGCTTTAACAAGCCTATCTCTGACTTTTATCCTAGCCACTCGT CTCCCACTACTAGAAAGCTGGTTTCACAGTTTGGAGAAGGTCTACACCGTCCACAAATTCACAGCCTTTCTCTCAA TCATCCTACTAATCTTTCATAACTTTAGTATGGGCGTTTGTGGGGCTCTCGCTTAGCTGCTCAGTTTGGCAATCTT GCCATCTATATCTTTGCCAGCATCATCCTTGTCGCCTATTTAGGCAAATACATCCAATACGAAGCTTGGCGATGGA TTCACCGCCTGGTTTACCTAGCCTATATTTTAGGACTCTTTCACATCTACATGATAATTGGGCAATCGWTCTTACA TTTAATCTTCTAAGTTTTCTTGTTGGTAGCTATGCCCTTTTAGGCTTACTAGCTGGTTTTTATATCATTTTTCTATAT CAAAAGATTTCCTTCCCCTATCTAGGGAAAATTACCCATCTCAAACGCTTAAATCACGATACTAGAGAAATTCAA ATCCATCTTAGCAGACCTTTCAACTATCAATCAGGACAATTTGCCTTTCTAAAGATTTTCCAAGAAGGCTTTGAAA GTGCTCCGCATCCCTTTTCTATCTCAGGAGGTCATGGTCAAACTCTTTCTTTACTGTTAAAACTTCAGGCGACCA TACCAAGAATATCTATGATAATCTTCAAGCCGGCAGCAAAGTAACCCTAGACAGAGCTTACGGACACATGATCAT AGAAGAAGGACGAGAAAATCAGGTTTGGATTGCTGGAGGTATTGGGATCACCCCCTTCATCTCTTACATCCGTGA ACATCCTATTTTAGATAAACAGGTTCACTTCTACTATAGCTTCCGTGGAGATGAAAATGCAGTCTACCTAGATTTA CTCCGTAACTATGCTCAGAAAAATCCTAATTTTGAACTCCATCTAATCGACAGTACGAAAGACGGCTATCTTAATT TTGAACAAAAAGAAGTGCCCGAACATGCAACCGTCTATATGTGTGGTCCTATTTCTATGATGAAGGCACTTGCCA AACAGATTAAGAAACAAAATCCAAAAACAGAGCATATTTAC MEFSMKSVKGLLFIIASFILTLLTWMNTSPQTMIPGLALTSLSLTFILATRLPLLESWFHSLEKVYTVHKFTAFLSIILLIFH NFSMGGLWGSRLAAQFGNLAIYTTASIILVAYLGKYIQYEAWRWIHRLYLAYILGLFHIYMIMGNRLLTFNLLSFLVGS YALLGLLAGFYIIFLYQKISTPYLGKITHLKRLNHDTREIQIHLSRPFNYQSGQFAFLKIFQEGFESAPHPFSISGGHGQTLY TTVKTSGDHTKNIYDNLQAGSKVTLDRAYGHMIIEEGRBNQVWIAGGIGTTPTTSYIREHPILDKQVHIYYSTRGDENAV YLDLLRNYAQKNPNFELHLIDSTKDGYLNFEQKEVPEHATVYMCGPISMMKALAKQIKKQNPKTEHIY ID32 900 bp ATGACTTTTAAATCAGGCTTTGTAGCCATTTTAGGACGTCCCAATGTTGGGAAGTCAACCTTTTTAAATCACGTTA TGGGGCAAAAGATTGCCATCATGAGTGACAAGGCGCAGACAACGCGCAATAAAATCATGGGAATTTACACGACTG ATAAGGAGCAAATTGTCTTTATCGACACACCAGGGATTCACAAGCCTAAAACAGCTCTCGGAGATTTCATGGTTG AGTCTGCCTACAGTACCCTTCGCGAAGTGGACACTGTTCTTTTCATGGTGCCTGCTGATGAAGCGCTGGTAAGGG GGACGATATGATTATCGAGCGTCTCAAGGCTGCCAAGGTTCCGTGATTTTTGGTGGTGAATAAAATCGATAAGGTC CATCCAGACCAGCTCTTGTCTCAGATTGATGACTTCCGTAATCAAATGGACTTTAAGGAAATTGTTCCAATCTCAG CCCTCAGGGAAATAACGTGTCTCGTCTAGTGGATATTTTGAGTGAAAATCTGGATGAAGGTTTCCAATATTTCCC GTCTGATCAAATCACAGACCATCCAGAACGTTTCTGGTTTCAGAAATGGTTCGCGAGAAAGTCTTGCACCTAACT CGTGAAGAGATTCCGCATTCTGTAGCAGTAGTTGTTGACTCTATGAAACGAGACGAAGAGACAGACAAGGTTCAC ATCCGTGCAACCATCATGGTCGAGCGCGATAGCCAAAAAGGGATTATCATCGGTAAAGGTGGCGCTATGCTTAAG AAAATCGGTAGCATGGCCCGTCGTGATATCGAACTCATGCTAGGAGACAAGGTCTTCCTAGAAACCTGGGTCAAG GTCAAGAAAAACTGGCGCGATAAAAAGCTAGATTTGGCTGACTTTGGCTATAATGAAATGAGAATATAA MTFKSGFVAILGRPNVGKSTFLNHVMGQKIATMSDKAQTTRNKIMGIYTTDKEQIVFIDTPGIHPKTALGDFMVESAYS TLREVDTVLFMVPADEARGKGDDMIIERLKAAKVPVILVVNKIDKVHPDQLLSQIDDFRNQMDFKEIVPISALQGNNVS RLVDILSBNLDEGTQYTPSDQTTDHPERTLVSEMVREKVLHLTREEIPHSVAVVVDSMKRDEBTDKVHIRATTMVEPDSQ KGIIIGKGGAMLKKIGSMARRDIELMLGDKVTLETWVKVKKNWRDKKLDLADTGYNERBYZ ID33 855 bp CTGCTTCTTGTTTTTACAGAAGGAGGACTTATGCCTGAATTACCTGAGGTTGAAACCGTTTGTCGTGGCTTAGAAA AATTGATTATAGGAAAGAAGATTTCGAGTATAGAAATTCGCTACCCCAAGATGATTAAGACGGATTTGGAAGAGT TTCAAAGGGAATTGCCTAGTCAGATTATCGAGTCAATGGGACGTCGTGGAAAATATTTGCTTTTTATCTGACAGA CAAGGTCTTGATTTCCCATTTGCGGATGGAGGGCAAGTATTTTTACTATCCAGACCAAGGACCTGTCGCAAGCAT GCCCATGTTTTCTTTCATTTTGAAGATGGTGGCACGCTTGTTTTATGAGGATGTTCGCTGTTTGGTCCATGGAAC TCTTGGTGCCTGACCTAGACGTCTACTTTATflTCTAAAAAATTAGGTCCTGAACCAAGCGAACAAGACTTTGA TTTACAGGTCTTTCAATCTGCCCTTGCCAAGTCCAAAAAGCCTATCAAATCCCATCTCCTAGACCAGACCTTGGTA GCTGGACTTGGCAATATCTATGTGGATGAGGTTCTCTGGCGAGCTCAGGTTCATCCAGCTAGACCTTCCCAGACTT TGACAGCAGAAGAAGCGACTGCCATTCATGACCAGACCATTGCTGTTTTGGGCCAGGCTGTTGAAAAAGGTGGCT CCACCATTCGGACTTATACCAATGCCTTTGGGGAAGATGGAAGCATGCAGGACTTTCATCAGGTCTATGATAAGA CTGGTCAAGAATGTGTACGCTGTGGTACCATCATTGAGAAAATTCAACTAGGCGGACGTGGAACCCACTTTTGTCC AAACTGTCAAAGGAGGGACTGA MLLVTTEGGLMPELPEVETVCRGLEKLLIGKKISSIEIRYPKMIKTDLEETQRBLPSQLIESMGRRGKYLLTYLTDKVLISHL RMEGKYTYYPDQGPERKHAHVPTTTEDGGTLVYEDVRKTGTMELLVPDLLDVYTTSKKLGPBPSEQDTDLQVTQSALA KSKKPLKSHLLDQTLVAGLGNIYVDEVLWRAQVHPARPSQTLTAEEATATHDQTTAVLGQAVEKGGSTTRTYTNATGED GSMQDTHQVYDKTGQECVRCGTLIEICIQLGGRGTHTCPNCQRRDZ ID34 633 bp TGTCCAAACTGTCAAAGGAGGGACGATGGGAAAAATCATCGGAATCATGGGGGAATGCCGTTT ACTGTGACAAKflTTCTAAGACAGCAAGGCTITTCAAGTAGTGGATGCCGACGCAGTCGTCCACCAACTACAGAAA CCTGGTGGTCGTCTGTGAGGCTCTAGTACAGCACTGGGCAAGAAATCATTCTGAATACGGAGTCTCJ&TTC GCCCTCTCCTAGCTAGTCTCATCTCAAATCCTGATGAACGAGAATGGTCTATAGCAATTCTGGGGAGATAT CCGTGAGGAACTGGCTACTTGAGAGAACAGTTGGCTCAGACAGAAGAGATTCTCATGGATAnCCCaACT TGAGCAGGACTACAGCGATTGGTTGCTGAGACTGGTTGGTCTATGTGGACCGAGATGCCCAAGTGGAACGC TTTATGAAAAGGGACCAGTTGTCCAAAGATGAAGCTGAGTCTCGTCTGGCAGCCCAGTGGCCTTAGTJTJTG AAAGATTTGGCCAGCCAGGTTGATAATAATGGCAATCAGAACCAGCTCTAATCAAGTGCATATCCnCTTG AGGGAGGTAGGCAAGATGACAGAGATTAA MSKLSTTEGLMGKflGTTGGIASGKSTVTNTLRQQGTQVVDADAVVNQLQKPGGRLTEALVQHTGQEllLENGELNRPLL ASLTTSNPDEREWSKQIQGEIIREELATLRBQLAQTEEITTMDIPLLTEQDYSDWTAETWLVYVDRDAQVERLMKRDQLS KDEAESRLAAQWPLEKKKDLASQVLDNNGNQNQLLNQVHILLEGGRQDDRDZ ID35 1269 bp TTGATAATAATGGCAATCAGAACCAGCTCTTAATCAAGTGCATATCCTCTTGAGGGAGGTAGGCJTGATGACA GAGA7AACTGGAAGGATAATCTGCGCATTGCCTGGTGGTAATTCTGACAGGAGCCAGTATTGTTGTG TACCTTTTATGCCCATCTTCGTGGAAAATCTAGGTGTAGGGAGTCAGCAAGTCGCtTTTTATGCAGGCTTAGCAAT TTCTGTCTCTGCTATTCCGCGGCGCTGTTCTCCTATTGGGGTATCTTGCTGACAAATACGGCCGTJATCCCA TGATGATTCGGCCAGGTCTGCTATGACTATCACTATGGGAGGCTGGCTGTCCCAT&ATATCTATTGGTTT CTCTCGTTACTAAACGGTGTATTTGCAGGnTTGTTCCTAATGCAACGGCACTGATAGCCAGTCAGGnCCA AAGGAGAAATCAGGCTCTGCCTAGGTACTGTCTACAGGCGTAGTTGCAGGTAATGGTCCTTTAITG GTGGCTTTATCGCAGAATTATTTGGCMTCGTACAGTTTTTTCTTACTQGTTGGTAGTTTTCTATTTTTAGCTGCTATTT TGACTATTTGCTTATCAAGGAAGATTCAACCAGTAGCCAAGCATAGGCTATTCCAACAAAGGTAflIAC T GCCCTNTTTTGGCTCTTATGTACGCGACTAGGGCAGACAGAGAATCTTCTTTTTGTCTCTGGTTTGATTGTGTCC AGTATGGGCTTCCAGCATGATGAGTGCAGGAG7CATGGGCAAGCTAGGTGACAAGGTGGGCTJJCATCGTTC TGGTTGTCGCCCAGTATTCAGTCATCATCTATCTCCTCTGTGCCAATGCCTCTAGCCCCTCTAGGAT CTATCGTCCTCTGGKTTGGGAACCGGTGCCTTGATTCCCGGGGTTAATGCCCTACTCAGCAAAATGACTCCC AAAGCCGGCATTTCGAGGGTCtITTGCCCAATCAGGTATTCTTATCTGGGAGGTGTTGYTGGTCCCATGQTQ GTTCTGCAG7AGCAGGTCAAtYTGGCTACCATGCTGTCTTATGCGACAAGCTTTGTGTTGCAGTGTTC TAACCTGATTCAATTCGAACATTATTAAAAGTAAAGGAAAWTAG MIIMATRTSTLIKCTSTLRBVGTTMTEINWKDNLRIAWTGNTLTGASSLVVPTMPTTVENLGVGSQQVATYAGLATSVSAT AALTSPIWGILADKYGRKPMMJRAGLAMTTTMGGLATVPNIYWLTTLRiLNGVTAGTVPNATALxASQVPKTSGSAiG TLTGVVAGTLTGPTTGGRAELTGIRTVTLLVGSTLTLAATLTTCTTKEDTQPVAKEKATPTK2LTTSVKYPYLLLNLTLTS TVIQTSAQSTGPILALYVRDLGQTENLLIVSGLIVSSMGTSSMMSAGVMGKLGDKVGNHRLLVVAQTYSVIIYLLCANAS SPLQLGLYRTLTGLGTGALIPGVNALLSKMPKAGSRVTATNQVTTYLGGVVGPMAGSAVAGQTGYHAVTYATSLCV ATSCLTNLIQTRTLLKVKEIZ ID36 1311 bp ATGGCCCTACCAACTATTGCCATTGTAGGACGTCCCAATGTTGGGAAATCAACCCTATTTTAATCGGATCGCTGGTG AGCGAATCTCCATTGTAGAAGATGTCGAAGGAGTGACACGTGACCGTATTTATGCAACGGGTGAGTGTAYTC GTTCTTTAGCATGATTGATACAGGAGGAATTGATGATGTCGATGCTCCTTTCATGGAACAAATCAAGCACCAGGC AGAAKTTGCCATGGAAGAAGCAGATGTTATCGTTTTTGTCGTGTCTGGTAAGGAAGGAATTACTGATGCAGACGA ATACGTAGCTCGTAAGCTATAAGACCCACAAACCAGTTATCCTCGCAGTCATCATGGTGGACTCCTGAGAT GAGAAATGATATATATGKTTTCTATGCTCTCGGTTTGGGTGAACCATTGCCTATCTCATCTGTCCATGGAATCGG? ACAGGGGATGTGCTAGATGCGATCGTAGAAAATCTTCCAAATGAATATGAGGAAGAAAATCCAGATGTCATMG TAGCTTGATTGGTCGTCCTAACGTTGGAAAATCAAGGTGATCAATGCTATCTGGGAGAAGACCGTGflATJG CTAGTCCTGTTGCTGGAACAACTCGTGATGCCATTGATACCCACTTTACAGATACAGATGGTCAAGAGTTTACCAT GATTGATACGGCTGGTATGCGTAAGTCTGGTAAGGTTTATGAAAATACTGAGAAATACTCTGTTATGCGTGCCATG CGTGCTATTGACCGTTCAGATGTGGTCTTGATGGTCATCAATGCGGAAGAAGGCATTCGTGAGTACGACAAGCGT ATCGCAGGATTTGCCCATGAAGCTGGTAAAGGGATGATTATCGTGGTCAACAAGTGGGATACGCTTGAAAAAGAT AACCACACTATGAAAAACTGGGAAGAAGATATCCGTGAGCAGTTCCAATACCTGCCTTACGCACCGATTATCTTT GTATCAGCTTTAACCAAGCAACGTCTCCACAAACTTCCTGAGATGATTAAGCAAATCAGCGAAAGTCAAAATTACA CGTATTCCATCAGCTGTCTTGAACGATGTCATCATGGATGCCATTGCCATCAACCCAACACCGACAGACAAAGGA AAACGTCTCAAGATTTTCTATGCGACCCAAGTGGCAACCAAACCACCAACCTTTGTCATCTTTGTCAATGAAGAAG AACTCATGCACTTTCTTACCTGCGTTTCTTGGAAAATCAAATCCGCAAGGCCTTTGTTTTTGAGGGAACACCGAT TCATCTCATCGCAAGAAAACGCAAATAA MALPTIAIVGRPNVGKSTLTNRIAGERISIVEDVEGVTRDRIYATGEWLNRSTSMIDTGGIDDVDAPTMEQIKHQAEIAM EEADVIVTVVSGKEGTTDADEYVARKLYTZTHKPTTLAVNKVDNPEMRNDIYDTYALGLGEPLPISSVHGIGTGDVLDAT VENLPNEYEEENPDVIKTSLIGRPNVGKSSLINATLGEDRVIASPVAGTTRDATDTHTTDTDGQETTMIDTAGMRKSGKV YENTEKYSVMRAMRAIDRSDVVLMVINAEEGIREYDKRIAGFAHEAGKGMIIVVNKWDTLEKDNHTMKNWEEDIREQ TQYLPYAPITTVSALTKQRLHKLPEMIKQISESQNTRIPSAVLNDVIMDATATNVTDKGKRLKTTYATQVATKPPTTVTTV NEEELMHFSYLRFLENQIRAFVTEGTPIHLIARKRKZ ID37 714 bp ATGACAGAAACCATTAAATTGATGAAGGCTCATACTTCAGTGCGCAGGTTTAAAGAGCAAGAAATCCCCAAGTA GACTTAAATGAGATTTTGACAGCAGCCCAGATGGCATCATCTGGAAGAATTTCCAATCCTACTCTGTGATGTGG TACGAAGTCAAGAGAAGAAAGATGCCTTGTATGAATTGGTACCTCAAGAAGCCTTCGCCAGTCTGCTGTTTCCT TCTCTTTGTCGGAGATTTGAACCGAGCAGAAAAGGGAGCCCGACTTCATACCGACACCTTCCAACCCCAAGGTGT GGAAGGTCTCTTTGATTAGTTCGGTCGATGCAGCTCTTGCTGGACAAAACGCCTTGTTGGCAGCTGAAAGCTTGGGC TATGGTGGTGTGATTATCGGTTTGGTTCGATACAAGTCTGAAGAAGTGGCAGAGCTCATAACCTACCTGACTACA CCTATTCTGTCTTTGGGATGGCACTGGGTGTGCCAAATCAACATCATGATATGAAACCGAGACTGCCACTAGAGA ATGTTGTCTTTGAGGAAGAATACCAAGAACAGTCAACTGAGGCAATCCAAGCTTATGACCGTGTTCAGGCTGACT ATGCTGGGGCGCGTGCGACCACAAGCTGGAGTCAGCGCCTAGCAGAACAGTTTGGTCAAGCTGAACCAAGCTCAA CTAGAAAAAATCTTGAACAGAAGAAATATTATTGTAG MTETIKLMKAHTSVRRFKEQEIPQVDLNEILTAAQMASSWKNFQSYSVIVVRSQEKKDALYELVPQEAIRQSAVFLLFV GDLNRAEKGARLHTDTTQPQGVEGLLISSVDAALAGQNATLAAESLGYGGVIIGLVRYKSEEVAELTNLPDYTYSVTG MALGVPNQHHDMKPRPLPLENVVFEEEYQEQSTEAIQAYDTVQADYAGARATTSWSQRLAEQFGQAEPSSTRKNLEQK KLLZ ID38 729 bp ATGACAGAAATTAGACTAGAGCACGTCAGTTATGCCTATGGTCAGGAGAGGATTTTAGAGGATATCAACCTACAG GTGACTTCAGCCGAAGTGGTTTCCATCCTAGGCCCAAGTGGTGYTGGAAAGACCACCCTCITTAATCTAATCGCTG GGATTTTAGAAGTTCAGTCAGGGAGAATTGTCCTTGATGGTGAAGAAAATCCCAAGGGGCGCGTGAGTTATATGT TGCAAAAGGATCTGCTCTTTGGAGCACAAGACGGTGCTGGAAATATCATTCTGCCCCTCTTGATTCAAAAGGTGG ATAAGGCAGAAGCTATTTCCCGAGCGGATAAAATTCTTGCGACCTTCCAGCTGACAGCTGTAAGAGACAAGTATC CTCATGAACTTAGCGGTGGGATGCGCCAGCGTGTAGCCTTACTCCGGACCTACCTTTTTGGGCACAAGCTCTTTCT CTTAGATGAGGCCTTTAGCGCCTTGGATGAGATGACAAAGATGGAACTCCACGCTTTGGTATCTTGAGATCACAA GCAGTTGCAGCTAACAACCCTGATCATCACGCATAGTATTGAGGAGGCCCTCAATCTCAGCGACCGTATCTATATC TTGAAAAATCGCCCTGGGCAGATTGTTTCAGAAATTAAACTAGATTGGTCTGAAGATGAGGACAAGGAAGTCCAA AAGATTGCCTACAAACGTCAAATTTTGGCGGAATTAGGCTTAGATAAGTAG MTEIRLEHVSYAYGQERILEDINLQVTSGEVVSILGPSGVGKTTLFNLIAGILEVQSGRVLDGEENPKGRVSYMLQKDLL LEHKTVLGNIILPLLIQKVDKAEMSRADKKLATTQLTAVRDKYPHELSGGMRQRVALLRTYLTGHKLRTTDEATSALDE MTKMELHAWYLEiHKQLQLTTLITTHSIEEALNLSDRIYILKNRPGQIVSEIKLDWSEDEDKEVQKIAYKRQILAELGLDK Z ID39 2433 bp ATGAACTATTCAAAAGCATTGAATGAATGTATCGAAAGTGCCTACATGGTTGCTGGACATTTTGGAGCTCGTTATC TAGAGTCGTGGCACTTGTTGATTGCCATGTCTAATCACAGTTATAGTGTAGCAGGGGCAACTTTAAATGATTATCC GTATGAGATGGACCGTTTAGAAGAGGTGGCTTTGGAACTGACTGAAACGGACTATAGCCAGGATGAAACCTTTAC GGAATTGCCGTTCTCCCGTCGTTTGCAGGTTCTTTTTGATGAAGCAGAGTATGTAGCGTCAGTGGTCCATGCTAAG GTACTAGGGACAGAGCACGTCCTCTATGCGATTTTGCATGATAGCAATGCCTTGGCGACTCGTATCTTGGAGAGG GCTGGTTTTTCTTATGAAGACAAGAAAGATCAGGTCAAGATTGCTGCTCTTCGTCGAATTTTAGAAGAACGGGCA GGCTGGACTCGTGAAGATCTCAAGGCTTTACGCCAACGCCATCGTACAGTAGCTGACAAGCAAAATTCTATGGCC AATATGATGGGCATGCCGCAGACTCCTAGTGGTGGTCTCGAGGATTATACGCATGATTTGACAGAGCAAGCGCGT TCTGGCAAGTTTAGAACCAGTCATCGGTCGGGACAAGGAAATCTCACGTATGATTCAAATCTTGAGCCGGAAGACT AAGAACAACCCTGTCTTGGTTGGGGATGCTGGTGTCGGGAAAACAGCTCTTCCGCTTGGTCTTGCCCAGCGTTATG CTAGTGGTGACGTGCCTGCGGAAATGGCTAAGATGCGCGTGTTAGAACTTGATTTGATGAATCTCGTTGCAGGGA CACGCTTCCGTGGTGACTTTGAAGAACGCATGAATAATATCATCAAGGATATTGAAGAAGATGCCCAAGTCATCC TCTATCGATGAACTCCACACCATCATGGGTTCTTGGTAGCGGGATTGATTCGACTCTGGATGCGGCCAATATCTT GAAACCAGCCTTGCCGCGTGGAACTTTGAGAACGGTTGGTGCCACTACTCAGGAAGAATATCAAAAACATATCGA AAAAGATGCGGCACITTCTCGTCGTVICGCTAAAGTGACGATTGAAGAACCAAGTGTGGCAGATAGTATGACTAT TTTACAAGGTTTGAAGGCGACTTATGAGAAACATCACCGTGTACAAATCACAGATGAAGCGGTTTGAAACAGCGGT TAAGATGGCTCATCGTTATTTAACCAGTCGTCACTTGCCAGACTCTGCTATCGATCTCTTGGATGAGGCGGCAGCA ACAGTGCAAAATAAGGCAAAGCATGTAAAAGCAGACGATTCAGATTTGAGTCCAGCTGACAAGGCCCTGATGGAT GGCAAGTGGAAACAGGCAGCCCAGCTAATCGCAAAAGAAGAGGAAGTACCTGTCTACAAAGACTTGGTGACAGA GTCTGATATTTTGACCACCTTGAGTCGCTTGTCAGGAATCCCAGTTCAAAAACTGACTCAAACGGATGCTAAGTG TATTTAAATCTTGAAGCAGAACTCCATAAACGGGTTATCGGTCAAGATCAAGCTGTTTCAAGCATTAGCCGTGCCA TTCGCCGCAACCAGTCAGGGATTCGCAGTCATAAGCGTCCGATTGTTCCTTTATGTTCCTAGGGCCTACAGGTGT CGGGAAAACTGAATTAGCCAAGGCTCTGGCAGAAGTTCTTTTTGACGACGAATCAGCCCTTATCCGCTTTGATATG AGTGAGTATATGGAGAAATTTGCAGCTAGTCGTCTCAACGGAGCTCCTCCAGGCTATGTAGGATATGAAGAAGGT GGGGAGTTGACAGAGAAGGTTCGCAATAAACCCTATTCCGTTCTCCTCTTTGATGAGGTAGAGAAGGCCCACCCA GATATCTTTAATGTTCTVTTGCAGGTTCTGGATGACGGTGTCTTGACAGATAGCAAGGGACGCAAGGTCGATTTTT CAAATACCATTATCATTATGACATCGAATCTAGGTGCGACTGCCCTTCGTGATGATAAGACTGTTGGTTTTGGGGC TAAGGATATTCGTTTTGACCAGGAAAATATGGAAAAACGCATCTTTTGAAGAACTGAAAAAAGCTTATAGACCGGA ATCATCAACCGTATTTGATGAGAAGGTGGTCTTCCATAGCCTATCTAGTGATCATATGCAGGAAGTGGTGAAGATT ATGGTCAAGCCTTTAGTGGCAAGTTTGACTGAAAAAGGCATTGACTTGAAATTACAAGCTTCAGCTCTGAAATTGT TAGCAAATCAAGGATATGACCCAGAGATGGGAGCTCGCCCACTTCGCAGAACCCTGCAAACAGAAGTGGAGGAC AAGTTGGCAGAACTTCTTCTCAAGGGAGATTTAGTGGCAGGCAGCACACTTAAGKTTGGTGTCAAAGCAGGCCAG TTAAAATTTGATATTGCATAA MNYSKALNECIESAYMVAGHTGARYLESWHLLIAMSNHSYSVAGATLNDYPYEMDRLEEVALELTETDYSQDETFTE LPTSRRLQVLFDEAEYVASVVHAKVLGTEHVLYATLEDSNALATRILERAGTSYEDKKDQVKIAALRRNLEERAGWTR EDLKALRQRERTVADKQNSMANMMGMPQTPSGGLEDYTHDLTEQARSGKLEPVIGRDKEISRMIQILSRKTKNNPVLV GDAGVGKTALALGLAQRLIASGDVPAEMAKMRVLELDLMNVVAGTRTRGDTEERMNNHKDIEEDGQVILRDELHTTM GSGSGIDSTLDAANILKPALARGTLRTVGATTQEEYQKHIEKDAALSRRFAKVTIEEPSVADSMTILQGLKATYEKHHRV QITDEAVETAVKMAHRYLTSRHLPDSATDLLDEAAATVQNICAKHVKADDSDLSPADKALMDGKWKQAAQLIAKEEEV PVKDLVTESDILTTLSRLSGIPVQKLTQTDAKKYLNLEAELHKRVIGQDQAVSSlSRATRRNQSGIRSHKRPIGSTMFLGP TGVGKTELAKALAEVLTDDESALIRFDMSEYMEKFAASRLNGAPPGYVGYEEGGELTEKVRNKPYSVLLTDEVEKAHP DIFNVLLQVLDDGVLTDSKGRKVDTSNTTIIMTSNLGATALRDDKTVGFGAKDIRTDQENMBKRMTEELICKAYRPEFIN RIDEKVVFHSLSSDHMQEVVKIMVKPLVASLTEKGIDLTLQASALKLLANQGYDPEMGARPLRRTLQTEVEDKLAELL LKGDLVAGSTLKIGVKAGQLICTDEAZ ID40 1008 bp ATGAAGAAAACATGGAAAGTGTTTTTAACGCTTGTAACAGCTCTTGTAGCTGTTGTGCTTGTGGCCTGTGGTCAAG GAACTGCTTCTAAAGACAACAAAGAGGCAGAACTTAAGAAGGTTGACTTTATCCTAGACTGGACACCAAATACCA ACCACACAGGGCTTTATGTTGCCAAGGAAAAAGGTTATTCAAAGAAGCTGGAGTGGATTTTTGATTTGAAATTGC CACCAGAAGAAAGTTCTTCTGACTGGTTATCAACGGAAAGGCACCATTTGCAGTGTATTTCCAAGACTACATGGC TAAGAAATTGGAAAAAGGAGCAGGAATCACTGCCGTTGCAGCTATTGTTGAACACAATACATCAGGAATCATCTC TCGTAAATCTGATAATGTAAGCAGTCCAAAAGACTTGGTTGGTAAGAAATATGGGACATGGAATGACCCAACTGA ACTTGCTATGTTGAAAACCTTGGTAGAATCTCAAGGTGGAGACTTTGAGAAGGTTGAAAAAGTACCAAATAACGA CTCAAACTCAATCACACCGATTGCCAATGGCGTCTTTGATACTGCTGGKTTTACTACGGTTGGGATGGTATCCTT GCTAAATCTCAAGGTGTAGATGCTAACTTCATGTACTTGAAAGACTATGTCAAGGAGTTTGACTACTATCACCAG TTATCATCGCAAACAACGACTATCTGAAAGATAACAAAGAAGAAGCTCGCAAAGTCATCCAAGCCATCAAAAAA GGCTACCAATATGCCATGGAACATCCAGAAGAAGCTGCAGATATYCTCATCAAGAATGCACCTGAATTCAAGGAA AAACGTGACTTTGTCATCGAATCTCAAAAATACTTGTCAAAAGAATACGCAAGCGACAAGGAAAAATGGGGTCAA TTTGACGCAGCTCGCTGGAATGCTTTCTACAAATGGGATAAAGAAAATGGTATCCTTAAAGAAGACTTGACAGAC AAAGGCTTCACCAACGAATTTGTGAAATAA MKKTWKVFLTLVTALVAVVLVACGQGTASKDNKEAELKKVDFILDWTPNTNHTGLYVAKEKGYFKEAGVDVDLKLP PEESSSDLVINGKAPFAVYGQDYMAKKLEKGAGITAVAAIVEHNTSGIISRKSDNVSSPKDLVGKKYGTWNDPTELAML KTLVESQGGDTEKVEKVPNNDSNSTTPIANGVTDTAWIYYGWDGILAKSQGVDANTMYLKDYVKBTDYYSPVIIANND YLKDNKEEARKVIQAIKKGYQYAMEHPEEAADILIKNAPELKEKRDFVIESQKYLSKEYASDKSKWGQPDTRWNAPY KWDKENGTLKEDLTDKGTTNETVKZ ID41 762 bp TTGATGAGAAACTTGAGAAGTATACTGAGACGACACATTAGTCTATTGGGCTTTCTCGGAGTATTGTCAATCTGGC AGTTAGCAGGTTTTCTTAAACTTCTCCCCAAGTTTATCCTGCCGACACCTCTTGAAATTCTCCAGCCCTTTGTTCGT GACAGAGAATTTCTCTGGCACCATAGCTGGGCGACCTTGAGAGTGGCTTTACTGGGGCTGATTTTGGGAGTTTTGA TTGCCTGTCTTATGGCTGTGCTCATGGATAGTTTGACTTGGCTCAATGACCTGATTTACCCTATGATGGTGGTCATT CAGACCATTCCGACCATTGCCATAGCTCCTATCCTGGTCTTGTGGCTAGGTTATGGGATTTTGCCCAAGATTGTCT TGATTATCTTAACGACAACCTTTCCCATCATCGTTAGTATTTTGGACGGTTTTAGGCATTGCGACAAGGATATGCT GACCTTGTTTAGTCTGATGCGGGCCAAGCCTTGGCAAATCCTGTGGCATTTTAAAATCCCAGTTAGCCTGCCTTAC TTTTATGCAGGTCTGAGGGTCAGTGTCTCCTACGCCTTTATCACAACTGTGGTATCTGAGTGGTTGGGAGGTTTTG AAGGTCTTGGTGTTTATATGATTCAGTCTAAAAAACTGTTTCAGTATGATACCATGTTTGCCATTATTATTCTGGTG TCGATTATCAGTCTTTTGGGTATGAAGCTGGTCGATATCAGTGAAAAATATGTGATTAAATGGAAACGTTCGTAG MMRNLRSILRRHISLLGFLGVLSIWQLAGFLKLLPKFILPTPLEILQPFVRDREFLWHHSWATLRVALLGLILGVLIACLM AVLMDSLTWLNDLIYPMMVVIQTIPTIAIAPILVLWLGYGILPKIVIILTTTFPIIVSILDGFRHCDKDMLTLFSLMRAKP WQILWHFKIPVSLPYFYAGLRVSVSYAFITTVVSEWLGGFEGLGVYMIQSKKLFQYDTMTATTTLVSIISLLGMKLVDISE KYVIKWKRSZ ID42 372 bp TTGATTTTTAATCCTATTTGCTGTATGATAAGGGAAAAGAAAGGGGACAGAGATATGGCTTTTACCAATACCCACA TGCGATCTGCTAGTTTTGGTATTGTTACCAGCTTGCCTGATGACATCATTGACTCTTTTTGGTATATCATCGACCAT TTCTTAAAAAATGTCTTTGAATTGGAAGAAGAACTCGAGTTTCAATTGCTTAATAACCAAGGAAAGATTACCTTCC ACTTTTCAAGTCAACACCTCCCTACAGCCATTGATTTTGACTTTAACCATCCTTTCGACCCTCGTTATCCCCCAAGA GTACTGGTTTTAGACATGGACGGTAGAGAAACTATCCTCCTCCCAGAAGAAAATGACCTATTTTAA MIFNPICCMIREKKGDRDMAFNTHMRSASFGIVTSLPDDIIDSFWYIIDHFLKFELEEELEFQLLNNQGKTTTHFSSQ HLPTAIDFDFNHPFDPRYPPRVLVLDMDGRETILLPEENDLT ID43 1569 bp ACAGCGGTGTCATTCTATCTATTTTAAGAAAAGTAATAATCAATTGTTAAAAATAGTAAAAAAATTGGAGGTTCTG ATGAAATATTTTGTTCCTAATGAGGTATTCAGTATTCGTAAATTAAAGGTGGGGACTTGCTCGGTACTATTGGCAA TTTCAATTTTGGGAAGCCAAGGTATTTTATCGGATGAAGTTGTTACTAGTTCTTCACCGATGGCTACAAAAGAGTC TTCTAATGCAATTACTAATGATTTAGATAATTCACCAACTGTTAATCAGAATCGTTCTGCTGAAATGATTGCCTCT AATTCAACCACTAATGGTTTAGATAATTCGTTAAGTGTTAATAGCATCAGCTCTAATGGTACTATTCGTTCCAATT CACAATTAGACAACAGAACAGTTGAATCTACAGTAACATCTACTAATGAAAATAAGAGTTATAAGGAAGATGTTA TAAGTGACAGAATTATCAAAAAAGAATTTGAAGATACTGCTTTAAGTGTAAAAGATTATGGTGCAGTAGGTGATG GGATTCATGATGATCGACAAGCAATTCAAGATGCAATAGATGCTGCAGCTCAAGGGCTAGGTGGAGGAAATGTAT ATTTTCCTGAAGGAACTTATTTAGTAAAAGAAATTGTTTTTTTAAAAAGTCATACACACTTAGAATTGAATGAGAA AGCTACAATTCTAAATGGTATAAATATTAAGAATCACCCTTCCATTGTTTTTATGACAGGTTTATTTACGGATGAT GGTGCGCAAGTAGAATGGGGCCCAACAGAAGATATTAGTTATTCTGGTGGTACGATTGATATGAACGGTGCTTTG AATGAAGAAGGAACTAAAGCAAAAAAATCTACCACTTATAAATTCTTCAGGTGCATTTGCTATTGGGAATTCAAAT AACGTAACTATAAAAAATGTAACATTCAAGGATAGTTATCAAGGGCATGCTATTCAAATTGCAGGTTCGAAAAAT GTATTAGTTGATAATTCTCGTTTTCTTGGGCAAGCCTTACCCAAAACGATGAAGGATGGGCAAATCATAAGTAAGG AGAGCATTCAGATTGAACCATTAACTAGAAAAGGTTTTCCTTATGCCTTGAATGATGATGGGAAAAAATCTGAAA ATGTGACTATTCAAAATTCCTATTTTGGCAAAAGTGATAAATCTGGGGAATTAGTAACAGCAATTGGCACACACTA TCAAACATTGTCGACACAGAACCCCTCTAATATTAAAATTCAAAATAATCATTTTGATAACATGATGTATGCAGGT GTACGTTTTACAGGATTCACTGATGTATTAATCAAAGGAAATCGCTTTGATAAGAAAGTTAAAGGAGAGAGTGTA CATTATCGAGAAAGCGGAGCAGCTTTAGTAAATGCTTATAGCTATAAAAACACTAAAGACCTATTAGATTTAAAT AAACAGGTGGTTATCGCCGAAAATATATTTAATATTGCCGATCCTAAAACAAAAGCGATACGAGTTGCAAAAGAT AGTGCAGAATGTTTAGGAAAAGTATCAGATATTACTGTAACAAAAAATGTAATTAATAATAATTCTAAGGAAACA CAACAACCAAATATTGAATTATTACGAGTTAGTGATAATTTAGTAGTCTCAGAGAATAGT QRCHSIYFKKSNNQLLKIVKKLEVLMKYFVPNEVFSIRKLKVGTCSVLLAISILGSQGILSDEVVTSSSPMATKESSNAITN DLDNSPTVNQNRSAEMIASNSTTNGLDNSLSVNSISSNGTIRSNSQLDNRTVESTVTSTNENKSYKEDVISDRIIKKEFEDT ALSVKDYGAVGDGIHDDRQAIQDAIDAAAQGLGGGNVYFPEGTYLVKEIVFLKSHTHLELNEKATILNGINIKNHPSIVF MTGLLFTDDGAQVEWGPTEDISYSGGTIDMNGALNEEGTKAKNLPLINSSGAFAIGNSNNVTIKNVTFKDSYQGHAIQIA GSKNVLVDNSRFLGQALPKTMKDGQIISKESIQIEPLTRKGFPYALNDDGKKSENVTIQNSYFGKSDKSGELVTAIGTHY QTLSTQNPSNIKIQNNHFDNMMYAGVRFTGFTDVLIKGNRFDKKVKGESVHYRESGAALVNAYSYKNTKDLLDLNKQ VVIAENTTNIADPKTKATRVAKDSAECLGKVSDTTVTKNVINNNSKETEQPNIELLRVSDNLVVSENS ID44 324 bp GTGATGAAAGAAACTCAGCTATTAAAAGGTGTTCTTGAAGGTTGTGTCTTGGATATGATTGGTCAAAAAGAGCGG TATGGTTATGAGTTGGTTCAGACTTTGCGAGAGGCTGGATTTGATACTATCGTTCCAGGAACTATTTATCCTTTGTT GCAAAAGTTAGAAAAAAATCAATGGATAAGAGGCGACATGCGCCCGTCGCCAGATGGTCCAGATCGGAAGTATTT TTCATTAATGAAAGAAGGAGAAGAGCGTGTCTCATCTTTTGGCAACAATGGGACGATTTGAGTCAAAAAGTAGA AGGGATTAAGAATGGGGGTTAA MMKETQLLKGVLEGCVLDMIGQKERYGYELVQTLREAGFDTIVPGTIYPLLQKLEKNQWIRGDMRPSPDGPDRKYFSL MKEGEERVSVTWQQWDDLSQKVEGIKNGGZ D45 816 bp ATGAAGAAAATGAAGTATTACGAAGAAACAAGCGCTTTGCTACATGAGTTTTCTGAGGAGAATCAAAAGTATTTT GAGGAGTTGTGGGAAAGTTTTAATCTTGCTGGMTTCTCTATGATGAAGACTATCTCAGAGAGCAGATCTATTTGA TGATGCTAGATTTCTCAGAAGCAGAACGAGATGGCATGAGTGCAGAGGATTATCTAGGTAAGAATCCTAAAAAAA TAATGAAAGAGATTCTCAAGGGAGCACCTCGCAGTTCTATCAAAGAGTCCCTTTTGACGCCAATTCTTGTCCTGGC GGTATTACGTTATTATCAACTACTAAGTGTTTTTCTAAAGGTCCTCTCTTAACAGTCAATTTGCTCACATTTTTAG GGCAACTCTTATTTTTCTGATTGGATTTGGACTTGTGGCCACAATTTTACGAAGAAGTTTAGTCCAAGATTCTCCT AAAATGAAAATTGGCACTTACATTGTTGTTGGGACTATAGTTCTTCTAGTTGTTTTAGGATATGTAGGAATGGCAA GCTTCATACAAGAAGGAGCCTTTTATATTCCGGCTCCCTGGGATAGTTTGTCTGTCTTTACGATTTCGCTAGTTATC GGTATTGGAATTGGAAAGAAGCGGTCTTTCGTCCATTTGTCAGTATGATTATTGCCCATCTTGTGGTGGGTTCTCT GCTCCGTTATTATGAGTGGATGGGAATTTCAAATGTTTTCCTTACAAAAGTTATTCCTTTAGCTGTCCTCTTTATTG GAATCTTTGTCTTGTTCCGTGGGTTTAAGAAGATAAAATGGAGTGAAGTATAG MKKMKYYEETSALLHEFSEENQKYFEELWESFNLAGFLYDEDYLREQIYLMMLDFSEAERDGMSAEDYLGKNPKKIM KEILKGAPSSISLLTPILVLAVLRYYQLTDTSKGPLLTVNLLTPLGQLLTTLIGTGLVATTLRRSLVQDSPKMTTGTYI VVGTIVLLVVLGYVGMASFIQEGAFYIPAPWDSLSVFTISLVIGIWNWKEAVFRPFVSMIIAHLVVGSLLRYYEWMGISN VTLTKVIPLAVLTTGTTVLTRGTKKIKWSEVZ ID46 348 bp CTGTTTTTTTATTTATACTCAATGAAAATCAAAGAGCAAACTAGGAAGCTAGCCGCAGGTTGCTCAAAACACTGTT TTGAGGTTGTAGACGAAACTGACGAAGTCAGCTCAAAACATGTVTTTGAGGTTGTAGATGATACTGACGAAGTCA GCTCAAAACACTGTTTTGAGGTTGTAGAATGAAAGTCACGAAGTCAGCTCAAAACAGTCTTTTGAGTTGTAGATG AAACTGACGAAGTCAGCTCAAAACATGTTTTTGAGGTTGTAGATGAAACTGACGAAGTCAGTAACCATACATACG GTAGGGCGACGCTGACGTGTTTGAAGAGATTCGAAGAGTATTAA MTTYLYSMKIKEQTRKLAAGCSKHCTEVVDETDEVSSKHVTEVVDETDEVSSKHCTEVVDETDEVSSKHCTEVVDETD EVSSKHVTEVVDETDEVSNHTYGRATLTWTEETTEEYZ ID47 1260 bp ATGCAGAATCTGAAATTTGCCTTTTCATCTATCATGGCTCACAAGATGCGTTCTTTGCTTACTATGATTGGGATTAT TATCGGTGTTTCATCAGTTGTTGTGATTATGGCTTTGGGTGATTCCCTATCTCGTCAAGTCAATAAAGATATGACTA AATCTCAGAAAAATATTAGCGTCTTTTTCTCTCCTAAAAAAAGTAAAGACGGGTCTTTTACTCAGAAACAAATCAGC TTTTACGGTTTCTGGAAAGGAAGAGGAAGTTCCTGTTTGAACCGCCAAAACCGCAAGAATCCTGGGTCCAAGAGGC AGCTAAACTGAAGGGAGTGGATAGTTACTATGTAACCAATTCAACGAATGCCATCTTGACCTATCAAGATAAAAA GGTTTGAGAATGCTAATTTGACAGGTGGAAACAGAACTTACATGGACGCTGTTAAGAATGATATTATTGCAGGTCG TAGTCTGAGAGAGCAAGATTTCAAAGAGTTTGCAAGTGTCATTTTGCTAGATGAGGAATTGTCCATTAGTTTATTT GAATCTCCTCAAGAGGCTATTAACAAGGTTTGTAGAAGTCAATGGATTTAGTTACCGGGTCATTGGGGTTTATATA GTCCGGAGGCTAAAAGATCAAAAATATATGGGTTTGGTGGCTTGCCTATTACTACCAATATCTCCCTTGCTGCGAA TTTTAATGTAGATGAAATAGCTAATATTGTCTTTCGAGTGAATGATACCAGTTTAACCCCAACTCTGGGTCCAGAA CTGGCACGAAAAATGACAGAGCTTGCAGGCTTACAACAGGGAGAATACCAGGTGGCAGATGAGTCCGTTGTATTT GCAGAAATTCAACAATCGTTTAGTTTTATGACGACGATTATTAGTTCCATCGCAGGGATTTCTCTCTTTGTTGGAG GAACTGGTGTCATGAACATCATGCTGGTTTCGGTGACAGAGCGCACTCGTGAGATTGGTCTTCGTAAGGCTTTGGG TGCAACAGCTGCCAATATTTTAATTCAGTTTTTGATTGAATCCATGATTTTGACCTTGTTAGGTGGCTTAATTGGCT TGACAATTGCAAGTGGTTTAACTGCCTTAGCAGGTTTGTACTGCAAGGTTTAATAGAAGGTATAGAAGTTGGAGT AAGGCATCGAAACTTGATCCAATTGAAGCCCTTCGTTATGAATGA MQNLKTATSSIMAHKMRLTMIGIIIGVSSVVVIMALGDSLSRQVNKDMTKSQKNISVTTSPKKSKDGSTQKQSATFVS GKEEEVPVEPPKPQESWVQEAAKLKGVDSYYVTNSTNAILTYQDKKVENANLTGGNRTYMDAVKNEIIAGPTLPTQDT KEFASVILLDEELSISLFESPQEAINKVVEVNGFSYRVIGVYTSPEAKRSKIYGFGGLPITTNISLAANFNVDEIANIVFRVN DTSLTPTLGPELARKMTELAGLQQGEYQVADESVVFAEIQQSFSFMTTIISSIAGISLFVGGTGVMNIMLVSTERTREIG LRALGATRANILIQTLIESMILTLLGGLIGLTIASGLTALAGLLLQGLIEGTEVGVSIPVALTSTTVSTVGMIFGVLPANK ASKLDPIEALRYEZ ID48 705 bp CTGATGAAGCAACTAATTAGTCTAAAAAATATCTTCAGAAGTTACCGTAATGGTGACCAAGAACTGCAGGTTCTC AAAAATATCAATCTAGAAGTGAATGAGGGTGAATTTGTAGCCATCATGGGACCATCTGGGTCTGGTAAGTCCACT CTGATGAATACGATTGGCATGTTGGATACACCAACCAGTGGAGAATATTATCTTGAAGGTCAAGAAGTGGCTGGG CTTGGTGAAAAACAACTAGCTAAGGTCCGTAACCAACAAATCGGTTTTGTCTTTCAGCAGTTCTTTCTTCTATCGA AGTCAATGCTCTGCAAAATGTAGAATTGCCCTTGATTTACGCAGGAGTTTCGTCTTCAAAACGTCGCAAGTTGGC TGAGGAATATTTAGACAAGGTTGAATTGACAGAACGTAGTCACCATTTACCTTCAGAATTATCTGGTGGTCAAAA GCAACGTGTAGCCATTGCGCGTGCCTTGGTAAACAATCCTTCTATTATCCTAGCGGATGAACCGACAGGAGCCTTG GATACCAAAACAGGTAACCAAATTATGCAATTATTGGTTGATTTGAATAAAGAAGGAAAAACCATTATCATGGTA ACGCATGAGCCTGAGATTGCTGCCTATGCCAAACGTCAGATTGTCATTCGGGATGGGGTCATTTCGTCTGACAGTG CTCAGTTAGGAAAGGAGGAAAACTAA MMKQLISLKNIRSYRNGDQELQVLKNINLEVNEGEFVAIMGPSGSGKSTLMNTIGMLDTPTSGEYYLEGQEVAGLGEK QLAKVRNQQIGFVFQQFFLLSKLNALQNVELPLIYAGVSSSRRKLAEEYLDKVELTERSHHLPSELSGGQKQRVAIARA LVNNPSIILADEPTGALDTKTGNQIMQLLVDLNEGKTIIMVTHEPEIAAYAKRQIVIRDGVISSDSAQLGKEENZ ID49 1200 bp ATGAAGAAAAAGAATGGTAAAGCTAAAAAGTGGCAACTGTATGCAGCAATCGGTGCTGCGAGTGTAGTTGTATTG GGTGCTGGGGGGATTTTACTCTTAGACAACCTTCTCAGACTGCTCTAAAAGATGAGCCTACTCATCTTGTTGTTG CCAAGGAAGGAAGCGTGGCCTCCTCTGTTTTATTGTCAGGGACAGTAACAGCAAAAAATGAACAATATGTTTATT TTGATGCTAGTAAGGGTGATTTAGATGAAATCCTTGTTTCTGTGGGCGATAAGGTCAGCGAAGGGCAGGCTTTAGT CAAGTACAGTAGTTCAGAAGCGCAGGCGGCCTATGATTCAGCTAGTCGAGCAGTAGCTAGGGCAGATCGTCATAT CAATGAACTCATCAAGCACGAAATGAAGCCGCTTCAGCTCCGGCTCCACAGTTACCAGCGCCAGTAGGAGGAGA AGATGCAACGGTGCAAAGCCCAACTCCAGTGGCTGGAAATTCTGTTGCTTCTATTGACGCTCAATTGGGTGATGCC CGTGATGCGCGTGCAGATGCTGCGGCGCAATTAAGCAAGGCTCAAAGTCAATTGGATGCAACAACTGTTCTCAGT ACCCTAGAGGGAACTGTGGTCGAAGTCAATAGCAATGTTTCTAAATCTCCAACAGGGGCGAGTCAAGTTATGGTT CATATTGTCAGCAATGAAAATTTACAAGTCAAGGGAGAATTGTCTGAGTACAATCTAGCCAACCTTTCTGTAGGTC AAGAAGTAAGCTTTACTTCTAAAGTGTATCCTGATAAAAAATGGACTGGGAAATTAAGCTATATTTCTGACTATCC TAAAAACAATGGTGAAGCAGCTAGTCCAGCAGCCGGGAATAATACAGGTTCTAAATACCCTTATACTATTGATGT GACAGGCGAGGTTGGTGATTTGAAACAAGGTTTTTCTGTCAACATTGAGGTTAAAAGCAAAACTAAGGCTATTCTT GTTCCTGTTAGCAGTCTAGTAATGGATGATAGTAAAAATTATGTCTGGATTGTGGATGAACAACAAAAGGCTAAA AAAGTTGAGGTTTCATTGGGAAATGCTGACGCAGAAAATCAAGAAATCACTTCTGGTTTAACGAACGGTGCTAAG GTCATCAGTAATCCAACATCTTCCTTGGAAGAAGGAAAAGAGGTGAAGGCTGATGAAGCAACTAATTAG MKKKNGKAKKWQYLYAAIGAASVVVLGAGGILLFRQPSQTALKDEPTHLVVAKEGVASSVLLSGTVTAKNEQYVYFD ASKGDLDEILVSVGDKSEGQALVKYSSSEAQAAYDSASRAVARADRHINELNQARNEAASAPAPQLPAPVGGEDATV QSPTPVAGNSVAISDAQLGDARDARADAAAQLSKAQSQLDATTVLSTLEGTVVEVNSNVSKSPTGASQVMVHIVSNEN LQVKGELSEYNLANLSVGQEVSFTSKVYPDKKWTGKLSYISDYPKNNGEAASPAAGNNTGSKYPYTIDVTGEVGDLKQ GFSVNIEVKSKTKAILVPSSLVMDDSKNYVWIVDEQQKAKKVEVSLGNADAENQEITSGLTNGAKVISNPTSSLEEGKE VKADEATNZ D50 759 bp ATGTCACGTAAACCATTTATCGCTGGTAACTGGAAAATGAACAAAAATCCAGAAGAAGCTAAAGCATTCGTTGAA GCAGTTGCATCAAAACTTCCTTCATCAGATCTTGTTGAAGCAGGTATCGCTGCTCCAGCTCTTGATTTGACACTG TTCTTGCTGTTGCAAAAGGCTCAAACCTTAAAGTTGCTGCTCAAAACTGCTACTTTGAAAATGCAGGTGCTTTCAC TGGTGAAACTAGCCCACAAGTTTTGAAAGAAATCGGTACTGACTACGTTGTTATCGGTCACTCAGAACGCCGTGA CTACTTCCATGAAACTGATGAAGATATCAACAAAAAAGCAAAAGCAATCTTTGCGAACGGTATGCTTCCAATCAT CTGTTGTGGTGAATCACTTGAAACTTACGAAGCTGGTAAAGCTGCTGAATTCGTAGGTGCTCAAGTATCTGCTGCA TTGGCTGGATTGACTGCTGAACAAGTTGCTGCCTCAGTTATCGCTTATGAGCCAATCTGGGCTATCGGTACTGGTA AATCAGCTTCACAAGACGATGCACAAAAAATGTGTAAAGTTGTTCGTGACGTTGTAGCTGCTGACTTTGGTCAAG AAGTCGCAGACAAAGTTCGTGTTCAATACGGTGGTTCTGTTAAACCTGAAAATGTTGCTTCATACATGGCTTGCCC AGACGTTGACGGTGCCCTTGTAGGTGGTGCGTCACTTGAAGCTGAAAGCTTCTTGGCTTTGCTTGACTTTGTAAAA TAA MSRKPFIAGNWKMNKNPEEAKATVEAVASKLPSSDLVEAGIAAPALDLTTVLAVAKGSNLKVAAQNCYFENAGAFTG ETSPQVLKEIGTDYVVIGHSERRDYFHETDEDINKKAKAIFANGMLPIICCGESLETYEAGKAAEFVGAQVSAALAGLTA EQVAASVIAYEPIWAIGTGKSASQDDAQKMCKVVRDVVAADFGQEVADKVRVQYGGSVKPENVASYMACPDVDGAL VGGASLEAESFLALLDFVKZ ID51 1473 bp TTGAAAACAAAAATTGGATTAGCAAGTATCTGTTTACTAGGCTTGGCAACTAGTCATGTCGCTGCAAATGAAACTG AAGTAGCAAAAACTTCGCAGGATACAACGACAGCTTCAAGTAGTTCAGAGCAAAATCAGTCTTCTAATAAAAGCG AAACGAGCGCAGAAGTACAGACTAATGCTGCTGCCCACTGGGATGGGGATTATTATGTAAAGGATGATGGTTCTA AAGCTCAAAGTGAATGGATTTTTGACAACTACTATAAGGCTTGGTTTTATATTAATTCAGATGGTCGTTACTCGCA GAATGAATGGCATGGAAATTACTACCTGAAATCAGGTGGATATATGGCCCAAAACGAGTGGATCTATGACAGTAA TTACAAGAGTTGGTTTTATCTCAAGTCAGATGGGGCTTATGCTCATCAAGAATGGCAATTGATTGGAAATAAGTGG TACTACTTCAAGAAGTGGGGTTACATGGCTAAAAGCCAATGGCAAGGAAGTTATTTCTTGAATGGTCAAGGAGCT ATGATGCAAAATGAATGGCTCTATGATCCAGCCTATTCTGCTTATTTTTATCTAAAATCCGATGGAACTTATGCTA ACCAAGAGTGGCAAAAAGTGGGCGGCAAATGGTACTATTTCAAGAAGTGGGGCTATATGGCTCGGAATGAGTGGC AAGGCAACTACTATTTGACTGGAAGTGGTGCCATGGCGACTGACGAAGTGATTATGGATGGTACTCGCTATATCTT TGCGGCCTCTGGTGAGCTCAAAGAAAAAAAAGATTTGAATGTCGGCTGGGTTCACAGAGATGGTAAGCGCTATTT CTTTAATAATAGAGAAGAACAAGTGGGAACCGAACATGCTAAGAAAGTCATTGATATTAGTGAGCACAATGGTCG TATCAATGATTGGAAAAAGGTTATTGATGAGAACGAAGTGGATGGTGTCATTGTTCGTCTAGGTTATAGCGGTAA AGAAGACAAGGAATTGGCGCATAACATTAAGGAGTTAAACCGTCTGGGAATTCCTTATGGTGTCTATCTCTATAC CTATGCTGAAAATGAGACCGTGCTGAGAGTGACGCTAAACAGACCATTGAACTTATAAAGAAATACAATATGAAC CTGTCTTACCCTATCTATTATGATGTTGAGAATTGGGAATATGTAAATAAGAGCAAGAGAGCTCCAAGTGATACA GGCACTTGGGTTAAAATCATCAACAAGTACATGGACACGATGAAGCAGGCGGGTTATCAAAATGTGTATGTCTAT AGCTATCGTAGTTTATTACAGACGCGTTAAAACACCCAGATATTTTAAAACATCTAATCTGGGTAGCGGCCTATA CGAATGCTTTAGAATGGGAAAACCCTCATTATTCAGGAAAAAAAGGTTGGCAATATACCTCTTCTGAATACATGA AAGGAATCCAAGGGCGCGTAGATGTCAGCGTTTGGTATTAA MKTKIGLASICLLGLATSHVAANETEVAKTSQDTTTASSSSEQNQSSNKTQTSAEVQTNAAAHWDGDYYVKDDGSKAQ SEWIFDNYYKAWFYINSDGRYSQNEWHGNYYLKSGGYMAQNEWIYDSNYKSWFYLKSDGAYAHQEWQLIGNKWYY FKKWGYMAKSQWQGSYFLNGQGAMMQNEWLYDPAYSAYFYLKSDGTYANQEWQKVGGKWYYFKKWGYMARNE WQGNYYLTGSGAMATDEVIMDGTRYIFAASGELKEKKDLNVGWVHRDGKRYFFNNREEQVGTEHAKKVIDISEHNG RINDWKKVIDENEVDGVIVRLGYSGKEDKELAHNIKELNRLGIPYGVYLYTYAEETDAESDAKQTIELIKKYNMNLSY PIYYDVENWEYVNKSKRAPSDTGTWVKIINKYMDTMKQAGYQNVYVYSYRSLLQTRLKHYDILKHVNWVTYTTNAL EWENPHYSGKKGWQYTSSEYMKGIQGRVDVSVWYZ ID52 774 bp ATGAAAAAATTTGCCAACCTTTATCTGGGACTGGTCTTTCTGGTCCTCTACCTGCCTATCTTTTACTTGATTGGCTA TGCCTTTAATGCTGGTGATGATATGAATAGCTTTACAGGTTTTAGCTGGACTCACTTTGAAACCATGTTTGGAGAT GGGAGACTCATGCTGATTTTGGCTCAGACATTTTTCTTGGCCTTCCTATCAGCCTTGATAGCGACCATTATCGGGA CTTTTGGTGCCATTTACATCTACCAGTCTCGTAAGAAATACCAAGAAGCCTTTCTATCACTCAATAATATCCTCAT GGTTGCGCCTGACGTTATGATTGGTGCTAGCTTCTTGATTCTCTTTACCCAACTCAAGTTTTCATTGGCTTTTTGA CCGTTCTATCTAGTCACGTGGCCTTCTCCATTCCTATCGTGGTCTTGATGGTCTTGCCTCGACTCAAGGAAATGAA TGGCGACATGATTCATGCGGCCTATGACTTGGGAGCTAGTCAATTTCAGATGTTCAAGGAAATCATGCTTCTTTAC CTGACTCCGTCTATCATTACTGGTTATTTCATGGCCYCACCTATTCGTTAGATGACTTTGCCGTGACCTTCTTTGT AACAGGAAATGGCTTTTCAACCCTATCAGTCGAGATTTACTCTCGTGCTCGCAAGGGGATTTCCTTAGAAATCAAT GCCCTGTCTGCTCTAGTCTTTCTCTTTAGTATTATCCTAGTTGTAGGTTATTACTTTATCTCTCGTGAGAAGGAGGA GCAAGCATGA MKKFANLYLGLVFLVLYLPIFYLIGYAFNAGDDMNSFTGFSWTHFETMFGDGRLMLILAQTFFLAFLSALIATIIGTFGA IYIYQSRKKYQEAFLSLNNILMVAPDVMIGASFLILFTQLGSLGFLTVLSSHVAFSIPIVVLMVLPRLKEMNGDMIHAAY DLGASQFQMPKEIMLPYLTPSITTGYFMAFTYSLDDFAVTFFVTGNGFSTLSVEIYSRARKISLEINALSALVFLFSIILVV GYYTTSREKEEQAZ ID59 1071 bp ATGAAAAAAATCTATTCATTTTTAGCAGGAATTGCAGCGATTATCCTTGTCTTGTGGGGAATTGCGACTCATTTAG ATAGTAAAATCAATAGTCGAGATAGTCAAAAATTGGTTATCTATAACTGGGGAGACTATATCGATCTTGAACTCTT GACTCAGTTTACAGAAGAAACAGGAATTCAAGTTCAGTACGAGACTTTTGACTCCAACGAAGCCATGTACACTAA GATAAAGCAGGGTGGAACGACCTACGATATTGCCATTCCAAGTGAATACATGATTAACAAGATGAAGGACGAAG ACTTCTGGTTCCGCTGATTATTCAAAAATTGAAGGAATCGAAAATATCGGACCAGAGTTTTTCATCCAGTCCTT TGACCCAGGTAATAAATTCTCCATCCCTTACTCTGGGGAACCTTAGGAATTGTCTACAACGAAACCATGGTAGAT GAAGCGCCTGAGCATTGGGATGACCTTTGGAAGCCGGAGTATAAGAATTCTATCATGCTCTTTGATGGGGCGCT GAGGTGCTGGGACTAGGACTCAATTCCCTCGGCTACAGCCTCAACTCCAAGGATCTGCAGCAGTTGGAAGAGACA GTGGATAAGCTCTACAAACTGACTCCAAATATCAAGGCTATCGTTGCGGACGAGATGAAGGGCTATATGATTCAG AATAATGTTGCAATCGGCGTGACCTTCTCTGGTGAAGCCAGCCAAATGTTAGAAAAAAATGAAAATCTACGTTAT GTGGTACCGACAGAGGCCAGCAATCTTTGGTTTGACAATATGGTCATTCCCAAAACAGTTAAAAACCAAAACTCA GCCTATGCCTTTATCAACTTTATGTTGAAACCTGAAAATGCTCTCCAAAATGCGGAGTATGTCGGCTATTCAACAC CAAACCTACCAGCGAAGGAATTGCTCCCAGAGGAAACAAAGGAAGATAAGGCCTTCTATCCCGATGTTGAAACCA TGAAACACCTAGAAGTTTATGAGAAATTTGACCATAAATGGACAGGGAAATATAGCGACCTCTTCCTACAGTTTA AAATGTATCGGAAGTAG MKKIYSFLAGIAAIILVLWGIATHLDSKINSRDSQKLVIYNWGDYIDPELLTQFTEETGIQVQYETFDSNEAMYTKIKQGG TTYDIAIPSEYMINKMKDEDLLPVDYSKIEGIENIGPEFLNQSFDPGNKFSIPYFWGTLGIVYNETMVDEAPEHWDDLW KPEYKNSIMLFDGAREVLGLGLNSLGYSLNSKDLQQLEETVDKYKLTPNIKAIVADEMKGYMIQNNVAIGTFSGEAS QMLEKNENLRYVVPTEASNLWFDNMVIPKTVKNQNSAYAFINFMLKPENALQNAEYVGYSTPNLPAKELLPETTKED KATYPDVETMKHLEVYEKTDHKWTGKYSDLTLQTKMYRKZ ID61 1851 bp ATGAATAAAAAACTAACAGATTATGTGATTGATCTGGTGGAAATTTTAAATAAACAACAAAAGCAGGTTTTCTGG GGAATATTTGATATTTTCAGTATGGTGGTTTCCCATCATTGTATCTTATATTTTATTTTATGGGCTGATTAATCCAGC ACCTGTTGACTACATTATCTATACGAGTTTGGCCTTCCTGTTCTATCAATTGATGATTGGTTTTTGGGGGTTGAACG CGAGCATTAGTCGTTACAGCAAGATTACGGATTTCATGAAAATCTTTTTTGGTGTGACTGCTAGCAGTGTCTTGTC ATATAGTATCTGTTATGCCTTCTTGCCACTCTTCTCCATCCGTTTCATCATTCTCTTTATCTTGTTGATACCTTCTT GATTTTATTGCCACGGATTACTTGGCAGTTAATCTACTCCAGACGCAAAAAAGGTAGTGGTGATGGAGAACACCG TCGGACCTTCTTGATTGGTGCCGGTGATGGTGGGGCTCTTTTTATGGATAGTTACCAACATCCAACCAGTGAATTA GAACTGGTCGGTATTTTGGATAAGGATCTAAGAAAAAGGGTCAAAAACTTGGTGGTATTCCTGTTTTGGGCTCTT ATGACAATCTGCCTGAATTAGCCAAACGCCATCAAATCGAGCGTGTCATCGTTGCGATTCCGTCGTCTGGATCCGTC AGAATATGAGCGTATCCTGCAGATGTGTAATAAGCTGGGTGTCAAATGTTACAAGATGCCTAAGGTTGAAACTGT TGTTCAGGGCCTTCACCAAGCAGGTACTGGCTTCCAAAAAATTGATATTACGGACCTTTTGGGTCGTCAGGAAATC CGTCTTGACGAATCGCGTCTGGGTGCAGAACTGACAGGTAAGACCATCTTAGTCACAGGAGCTGGAGGTTCAATC GGTTCTGAAATCTGTCGTCAAGTTAGTCGCTTCAATCCTGAACGCATTGTCTTGCTCGGTCATGGGGAAAACTCAA TCTACCTTGTTTATCATGAATTGATTCGTAAGTTCCAAGGGATTGATTATGTACCTGTGATTGCGGACATTCAAGA CTATGATCGTTTGTTGCAAGTCTTTGAGCAGTACAAACCTGCTATTGTTTATCATGCGGCAGCCCACAAGCATGTT CCTATGATGGAGCGCAATCCAAAAGAAGCCTTCAAAAACAATATCCGTGGAACTTACAATGTTGCTAAGGCTGTT GATGAAGCTAAAGTGTCTAAGATGGTTATGATTTCGACAGATAAGGCAGTCAATCCACCAAATGTTATGGGAGCA ACCAAGCGCGTGGCGGAGTTGATTGTCACTGGCTTTAACCAACGTAGCCAATCAACCTACTGTGCAGTTCGTTTTG GGAATGTTCTTGGTAGCCGTGGTAGTGTCATTCCAGTCTTTGAACGTCAGATTGCTGAAGGTGGGCCTGTAACGGT GACAGACTTCGTATGACCCGTTACTTTATGACCATTCCAGAAGCTAGCCGTCTGGTTATCCATGCTGGTGCTTAT GCCAAAGATGGGGAAGTCTTTATCCTTGATATGGGCAAACCAGTCAAGATTTATGACTTGGCCAAGAAGATGGTG CTTCTAAGTGGCCACACTGAAAGTGAAATTCCAATCGTTGAAGTTGGAATCCGCCCAGGTGAAAAACTCTACGAA GAACTCTTGGTATCAACCGAACTCGTTGATAATCAAGTTATGGATAAGATTTTCGTGGTAAGGTAATTTTCATGC CTTTAGAATCCATCAATCAAAAGATTGGAGAGTTCCGCACTCTCAGTGGAGATGAGTTGAAGCAAGCTATTATCG CCTTTGCTAATCAAACAACCCACATTGAATAA MNKKLTDYVIDLVEILNKQQKQVFWGIFDIFSMVVSIIVSYILFYGLINPAPVDYIIYTSLAFLFYQLMIGFWGLNASISRY SKITDFMKIFFGVTASSVLSYSICYAFLPLFSIRFIILFILLSTFLILLPRITWQLIYSRRKKGSGDGEHRRTFLIGAGDGGALF MDSYQHPTSELELVGILDKDSKKKGQKLGGIPVLGSYDNLPELAKRHQIERVIVAIPSLDPSEYERILQMCNKLGVKCYK MPKVETVVQGLHQAGTGFQKIDITDLLGRQEIRLDESRLGAELTGKTILVTGAGGISGSEICRQVSRFNPERIVLLGHGEN SIYLVYHELIRKFQGIDYVPVIADIQDYDRLLQVFEQYKPAIVYHAAAHKHVPMMERNPKEAFNNIRGTYVAKAVD EAKVSKMVMISTDKAVNPPNVMGATKRVAELIVTGFNQRSQSTYCAVRFGNVLGSRGSVIPFERQIAEGGPVTVTDFR MTRYFMTIPEASRLVIHAGAYAKDGEVFTLDMGKPVKIYDLAKKMVLLSGHTESEIPIVEVGRPGEKLYEELLVSTELV DNQVMDKKTVGKVNVMPLEINQKIGETRThSGDELKQATIATANQTHIEZ ID101 1338 bp ATGATTGAACTTTATGATAGTTACAGTCAAGAAAGTCGAGATTTACATGAAAGTCTAGTCGTACTGGTCTTTCTC AACTTGGAGTGGTCATCGATGCAGATGGTTTTCTGCCTGATGGTCTGCTTTCTCCTTTTACCTATTATCTAGGTTAC GAGGATGGAAAACCTCTCTATTTTAATCAAGTTCCCGTTTCAGATTTTTGGGAAATTTTAGGAGATAATCAGTCTG CTTGTATTGAAGATGTGACGCAGGAGAGGGCTGTCATTCATTATGCTGATGGAATGCAGGCTCGCTTGGTTAAACA GGTAGACTGGAAAGACCTAGAAGGTCGAGTACGTCAGGTTGACCACTACAATCGCTTCGGAGCTTGTTTTGCTAC AACGACTTATAGCGCAGATAGCGAGCCGATTATGACAGTTTACCAAGATGTCAATGGTCAACAAGTTTTACTGGA AAACCATGTGACGGGTGATATCTTATTGACTTTGCCAGGTCAGTCCATGCGTTACTTTGCAAATAAAGTTGAATTT ATCACCTTCTTTTTGCAAGATTTGGAAATAGATACCAGTCAGCTTATCTTTAATACTCTAGCGACTCCTTTCTTGGT TTCCTTCCATCATCCAGATAAATCTGGCTCGGATGTCTTGGTATGGCAGGAACCTCTCTATGATGCCATTCCAGGT AATATGCAGTTGATTTTGGAAAGTGATAATGTGCGTACTAAGAAGATCATCATTCCAAATAAGGCGACTTATGAG CGCGCTTTAGAGTTAACTGACGAGAAATACCATGATCAGTTTGTGCACTTGGGTTATCATTACCAGTTCAAACGTG ATAATTTCCTAAGACGAGATGCCTTAATCTTGACCAATTCAGATCAGATTGAGCAAGTAGAAGCAATCGCAGGAG CCTTGCCTGATGTCACTTTCCGTATTGCAGCGGTGACAGAGATGTCTTCTAAGCTCTTAGACATGCTTTGCTATCCT AATGTGGCCCTTTACCAGAACGCTAGTCCACAGAAGATTCAGGAGCTGTATCAATGTCGGATATTTACTTGGATA TAAACCACAGTAATGAGTTGCTACAGGCAGTGCGTCAGGCCTTTGAGCACAATCTCTTGATTCTTGGCTTTAATC GACGGTGCACAATAGACTTTATATCGCTCCAGACCATCTATTTGAAAGTAGTGAGTTGCTGCTTTGGTTGAGACC ATTAAATTGGCCCTTTCAGATGTTGATCAAATGCGTCAGGCACTTGGCAAACAAGGCCAACATGCAAATTATGTTG ACTTGGTGAGATATCAGGAAACCATGCAAACTGTTTTAGGAGGCTAA MIELYDSYSQESRDLHESLVATGLSQLGVVIDADGFLPDGLLSPFTYYLGYEDGKPLYFNQVPVSDFWEILGDNQSACIE DVTQERAVIHYADGMQARLVKQVDWKDLEGRVRQVDHYNRFGACFATTTYSADSEPIMTVYQDVNGQQVLLENHV TGDILLTLPGQSMRYFANKVEFITFFLQDLEIDTSQLIFNTLATPFLVSTFHHPDKSGSDVLVWQEPLYDAIPGNMQLILES DNVRTKKIIIPNKATYERALELTDEKYHDQFVHLGYHYQFKRDNFLRRDALILTNSDQIEQVEAIAGALPVTFRIAAT EMSSKLLDMLCYPNVALYQNASPQKIQELTQLSDIYLDINHSNELLQAVRQAFEHNLLILGFNQTVHNRLYIAPDHLFE SSEVAALVETIKLALSDVDQMRQALGKQGQHANYVDLVRYQETMQTVLGGZ ID102 1512 bp ATGACAATTTACAATATAAATTTAGGAATTGGTTGGGCTAGTAGCGGTGTTGAATACGCTCAAGCCTATCGTGCTG GTGTTTTCGGAAATTAAATCTGTCCTCTAAGTTTATVTTTACAGATATGATTTTAGCCGATAATATTCAGCACTTA ACAGCCAATATTGGTTTTGATGATAATCAGGTTATCTGGCTTTATAATCATTTCACAGATATCAAAATTGCACCTA CTAGCGTGACAGTGGATGATGTCTTGGCTTACTTTGGTGGTGAAGAAAGTCACAGAGAAAAAAAATGGCAAGGTTT TACGTGTATTCTTTTTTGACCAAGATAAGTTTGTAACCTGTTATTTGGTTGATGAGAACAAGGACTTGGTTCAACA TGCCGAGTATGTTTTTAAGGGAAACCTGATTCGGAAGGATTACTTTTCTTATACGCGTTATTGTAGCGAGTATTTT GCTCCCAAGGACAATGTTGCAGTCTTATACCAACGACTTTTTATAATGAAGTGACGGGACTCCAGTCTATGATATT TGATGAATCAAGGGAAGGAAGAAGTTTATCATTTCAAGGATAAGATTTTCTATGGAAAGCAAGCTTTTGTGCGTG CCTTTATGAAATCTTTGAATTGAATAAGTCTGATTTGGTCATTTTCGATAGGGAGACAGGTATTGGACAGGTTGT GTTTGAGGAAGCACAGACAGCACATCTAGCGGTAGTTGTTCATGCGCAGCATTATAGTGAAAATGCTACAAATGA GGACTATATCCTTTGGAATAACTATTATGACTATCAGTTTACCAATGCAGATAAGGTTGACTTCTTTATCGTGTCT ACTGATAGACAAAATGAAGTTCTACAAGAGCAATTTGCCAAATATACTCAGCATCAGCCAAAGATTGTTACCATT CCTGTAGGCAGTATTGATTCCTGACAGATTCAAGTCAAGGGCGCAAACCATTTTCATTGATTACGGCTTCACGTC TTGCCAAAGAAAAGCACATTGATTGGCTTGTGAAAGCTGTGATTGAAGCTCATAAGGAGTTACCGGAACTAACCT TGATATCTATGGTAGTGGTGGAGAAGATTCTCTGCTTAGAGATATTATTGCAAATCATCAGGCAGAGGACTATAT CCAACTCAAGGGGCATGCGGAACTTTCGCAGATTTATAGCCAGTATGAGGTCTACTTAACGGCTTCTACCAGCGA AGGATTTGGTCTGACCTTGATGGAAGCTATTGGTTCAGGTCTACCTCTAATTGGTTTTGATGTGCCTTATGGTAATC AGACCTTTATAGAGGATGGGCAAAATGGTTATTTGATTCCAAGTTCATCTGACCATGTAGAAGACCAAATCAAGC AAGCTTATGCCGCTAAGATTTGTCAATTGTATCAAGAAAATCGTTTGGAAGCTATGCGTGCCTATTTTACCAAAT TGCAGAAGGCTTCTTGACCAAAGAAATTTTAGAAAAGTGGAAGAAAACAGTAGAGGAGGTGCTCCATGATTGA MTIYNINLGIGWASSGVEYAQAYRAGVFRKLNLSSKFIFTDMILADNIQHLTANIGFDDNQVIWLYNHFTDIKIAPTSVT VDDVLAYFGGEESHREKNGKVLRVFFFDQDKFVTCYLVDENKDLVQHAEYVFKGNLIRKDYFSTRYCSEYFAPKDN VAVLYQRTFYNEDGTPVYDILMNQGKEEVYNFKDKIFYGKQAFVRAFMKSLNLNKSDLVILDRETGIGQVVFEEAQTA HLAVVVHAEHYSENATNEDYILWNNYYDYQFTNADKVDFFIVSTDRQNEVLQEQFAKTYTQHQPKIVTTPVGSIDSLTDS SQGRKPFSLITASRLAKEKHIDWLVKAVIEAHKELPELTFDIYGSGGEDSLLREIIANHQAEDYIQLKGHAELSQIYSQYE VYTASTSEGFGLTLMEAIGSGLPLIGFDVPYGNQTFIEDGQNGYLIPSSSDHVEDQIKQAYAAKICQLYQENRLEAMRA YSYQIAEGTLTKEILEKWKKTVEEVLMDZ D103 2292 bp ATGTCCTCTCTCGGATCAAGAATTTAGTAGCTAAAACAGTAGAGTTTCGTCAGCGTCTTTCCGAGGGAGAAAGTC TAGACGATATTTTGGTTGAAGCTTTTGCTGTGGTGCGTGAAGCAGATAAGCGGATTTAGGGATGTTTCTTATGA TGTTCAAGTCATGGGAGCTATTGTCATGCACTATGGAAATGTTGCTGAGATGAATACGGGGGAAGGTAAGACCTT GACAGCTACCATGCCTGTCTATTTGAACGCTTTTTCAGGAGAAGGAGTGATGGTTGTGACTCTAATGATATTTA TCAAAGCGTGATGCCGAGGAAATGGGTCAAGTTTATCGTTTTCTAGGATTGACCATTGGTGTACCATTTACGGAAG ATCCAAAGAAGGAGATGAAAGCTGAAGAAAAGAAGCTTATCTATGCTTCGGATATCATCTACACAACCAATAGTA ATTTAGGTTTTGATTATCTAAATGATAACCTAGCCTCGAATGAAGAAGGTAAGTTTTTACGACCGTTTAACTATGT GATTATTGATGAAATTGATGATATCTTGCTTGATAGTGCACAAACTCCTCTGATTATTGCGGGTTCTCCTCGTGTTC AGTCTAATTACTATGCGATCATTGATACACTTGTAACAACCTTGGTCGTGGAAGGAGAGGATTATATCTTTAAAGAGGA GAAAGAGGAGGTTTGGCTCACTACTAAGGGGGCCAAGTCTGCTGAGAATTTCCTAGGGATTGATAATTTATACAA GGAAGAGCATGCGTCTTTTGCTCGTCATTTGGTTTATGCGATTCGAGCTCATAAGCTCTTTACTAAAGATTGGAC TATATCATTCGTGGAAATGAGATGGTACTGGTTGATAAGGGAACAGGGCGTCTAATGGAATGACTAAACTTCAA GGAGGTCTCCATCAGGCTATTGAAGCCAAGGAACATGTCAAATTATCTCCTGAGACGCGGGCTATGGCCTCGATC ACCTATCAGAGTCTTTTTAAGATGTTAATAAGATATCTGGTATGACAGGGACAGGTAAGGTCGCGGAAAAAGAG TTTATTGAAACTTACAATATGTCTGTAGTACGCATTCCAACCAATCGTCCGAGACAACGGATTGACTATCCAGATA ATCTATATATCACTTTACCTGAAAAAGTGTATGCATCCTTGGAGTACATCAAGCAATACCATGCTAAGGGAAATCC TTTACTCGTTTTTGTAGGCTCAGTTGAAATGTCTCAACTCTATTCGTCTCTCTTGTTTCGTGAAGGGATTGCCCATA ATGTCCTAAATGCTAATAATGCGGCGCGTGAGGCTCAGATTATCTCCGAGTCAGGTCAGATGGGGGCTGTGACAG TGGCTACCTCTATGGCAGGACGTGGTACGGATATCAAGCTTGGTAAAGGAGTCGCAGAGCTTGGGGGTTGATTG TTATTGGGACTGAGCGGATGGAAAGTCAGCGGATCGACCTACAAATCGTGGCCGTTCTGGTCGTCAGGGAGATC CTGGTATGAGTAAATTTTTTGTATCCTAGAGGATGATGTTATCAAGAAATTTGGTCCATCTTGGGTGCATTAAA GTACAAAGACTATCAGGTTCAAGATATGACTCAACCGGAAGTATTGAAAGGTCGTAAATATACCGGTTAGTCGA AAAGGCTCAGCATGCCAGTGATAGTGCTGGACGTTCAGCACGTCGTCAGACTGGAGTATGTGTAAAGTATGAA TATACAACGGGATATAGTCTATAAAGAGAGAAATCGTCTAATAGATGTCTCGTGACTAGAGGATGTTGTTGTG GATATCATTGAGAGATATACAGAAGAGGTAGCGGCTGATCACTATGCTAGTCGTGAATTATTGTTTCACTTTATTG TGACCAATATTAGTTTTCATGTTAAAGAGGTTCCAGATTATATAGATGTAACTGACAAAACTGCAGTTCGTAGCTT TATGAAGCAGGTGATTGATAAAGAACTTTCTGAAAAGAAAGAATTACTTAATCAACATGACTTATATGAACAGTT TTTACGACTTTCACTGCTTAAAGCCATTGATGACAACTGGGTAGAGCAGGTAGACTATCTACAACAGCTATCCATG GCTATCGGTGGTCAATCTGCTAGTCAGAAAAATCCAATCGTAGAGTACTATCAAGAAGCCTACGCGGGCTTTGAA GCTATGAAAGAACAGATTCATGCGGATATGGTGCGTAATCTCCTGATGGGGCTGGTTGAGGTCACTCCAAAAGGT GAAATCGTGACTCATTTTCCATAA MSSLSDQELVAKTVEFRQRLSEGESLDDILVEAFAVVREADKRILGMFPYDVQVMGAIVMHYGNVAEMNTGEGKTLT ATMPVYLNAFSGEGVMVVTPNEYLSKRDAEEMGQVYRFLGLTIGPFTEDPKKEMKAEEKKLIYASDIIYTTNSNLGF DYLNDNLASNEEGKFLRPFNYYVIIDDILLDSAQTPLIIAGSPRVQSNYYAIIDTLVTTLVEGDYIFKEEKEEVWLTTK GAKSAENFLGIDNLYKEEHASFARHLVYAIRAHKLFTKDKYIIRGNEMVLVDKGTGRLMEMTKLQGGLHQAIEAKEH VKLSPETRAMASITYQSLFKMFNKISGMTGTGKVAEKEFIETYNMSVVRIPTNRPRQRIDYPDNLYITLPEKVYASLEYIK QYHAKGNPLLVFVGSVEMSQLYSSFREGIAHNVLNANNAAREAQIISESGQMGAVTVATSMAGRTDIKLGKGVAE LGGLIVIGTERMESQRIDLQIRGRSGRQGDPGMSKFFVSLEDDVIKKFGPSWVHKKYKDYQVQDMTQPEVLKGRKYRK LVEKAQHASDSAGRSARRQTLEYAESMNIQRDIVYKERNRLIDGSRDLEDVVVDIIERYTEEVAADHYASRELLFHFIVT NISFHVKEVPDYIDVTDKTAVRSFMKQVIDKELSEKKELLNQHDLYEQFLRSLLKAIDDNWVEQVDTLQQLSMAIGG QSASQKNPIVEYYQEAYAGFEAMKEQIHADMVRNLLMGLVEVTPKGEIVTHFPZ ID104 879 bp ATGAAACAAGAATGGTTTGAAAGTAATGATTTTGTAAAAACAACAAGCAAGAACAAGCCTGAAGAGCAAGCTCA AGAGGTTGCAGACAAGGCTGAAGAAAGGATACCCGATCTCGATACACCAATTGAAAAAAATACTCAGTTAGAGG AGGAAGTCTCTCAAGCTGAAGTCGAATTGGAAAGCCAGCAAGAAGAGAAAATTGAAGCTCCTGAAGACAGTGAA GCGAGAACAGAAATAGAAGAAAAGAAGGCATCTAATTCTACTGAAGAAGAGCCAGACCTTTCTAAAGAAACAGA AAAAGTCACTATAGCTGAAGAGAGCCAAGAAGCTCTTCCTCAGCAAAAAGCAACCACGAAAGAGCCACTTCTTAT CAGTAAATCTTTAGAAAGTCCTTATATCCCCGACCAAGCTCCAAAATCTAGGGATAAATGGAAAGAGCAAGTGCT TGATTTTTGGTCTTGGCTAGTGGAAGCGATCAAATCTCCTACAAGTAAGTTGGAAACAAGTATCACACACAGTTAC ACAGCCTTTCTCTTGCTCATTCTGTTTTCTGCATCTTCCTTTTTCTTTAGTATCTATCACATCAAACATGCTTACTAT GGACATATAGCAAGCATTAACAGTCGCTTCCCTGAGCAGCTAGCTCCTTTAACTCTTTTTTCTATCATCTCTATCCT AGTAGCGACAACACTCTTCTTCTTTTCATTCCTCTTGGGTAGTTTCGTTGTGAGACGATTTATCCACCAGGAAAAG GACTGGACGCTAGACAAGGTTCTCCAACAATATAGTCAACTCTTGGCAATTCCAATCTCCTCACTGCTATTGCTAG TTTCTTTGCTTTCTTTGATAGCCTACGATTTACAGCCCTCTTGTGTGTGA MKQEWTESNDTVKTTSKNKPEEQAQEVADKAEETPDLDTPIEKNTQLEEEVSQAEVELTQQEEKIEAPEDSEARTEIE EKKASNSTEEEPDLSKETEKVTIAEESQEALPQQKATTKEPLLISKSLESPYIPDQAPKSRDKWKEQVLDFWSWLVEAIKS PTSKLETSITHSYTAFLLLILFSASSFFFSIYHIKHAYYGHIASINSRFPEQLAPLTLFSIISILVATTLFFFSFLLGSFVVRRFIH QEKDWTLDKVLQQYSQLLAIPISSLLLLVSLLSLIAYDLQPSCVZ ID106 327 bp TGTACTTTCCAACATCCTCTGCCTTGATTGAATTTCTCATCTTGGCTGTACTGGAGCAGGGTGATTCTTATGGTTA TGAGATTAGCCAAACCATTAAGCTGATCGCTAATATCAAAGAATCCACACTCTATCCCATTCTCAAAAAATTGGA AGGCAATAGCTTTCTGACAACCTATTCTAGAGAGTTCCAAGGTCGCATGCGCAAATACTACTCCTTGACAAACGG TGGTATAGAGCAGCTCTTGACCCTAAAAGATGAATGGGCACTCTATACAGACACCATCAATGGCATCATAGTAAGG GAGTATCCGCCATGACAAGAACTGA MYFPTSSALIEFLILAVLEQGDSYGYEISQTIKLIANIKESTLYPILKKLEGNSFLTTYSREFQGRMRKYYSLTNGGIEQLLT LKDEWALYTDTINGIIEGSIRHDKNZ D108 954 bp ATGGATTTTGAAAAAATTGAACAAGCTTATATCTATTTACTAGAGAATGTCCAAGTCATCCAAAGTGATTTGGCGA CCAACTTTTATGACGCCTTGGTGGAGCAAAATAGCATCTATCTGGATGGTGAAACTGAGCTAAACCAGGTCAAAG ACAACAATCAGGCCCTTAAGCGTTTAGCACTACGCAAAGAAGAATGGCTCAAGACCTACCAGTTTCTCTTGATGA AGGCTGGGCAAACAGAACCCTTGCAGGCCAATCACCAGTTTACACCGGATGCTATTGCTTTGCTTTTGGTGTTTAT TGTGGAAGAGTTGTTTAAAGAGGAGGAAATTACTATCCTCGAAATGGGTTCTGGGATGGGAATTCTAGGCGCTAT TTTCTTGACCTCGCTTACTAAAAAGGTGGATTACTTGGGAATGGAAGTGGATGATTTGCTGATTGATCTGGCAGCT AGCATGGCAGATGTAATTGGTTTGCAGGCTGGCTTTGTCCAAGGAGATGCCGTTCGCCCACAAATGCTCAAAGAA AGCGATGTGGTCATCAGTGACTTGCCTGTCGGCTATTATCCTGATGATGCCGTTGCGTCGCGCCATCAAGTTGCTT CTAGCCAAGAACATACTTACGCCCATCACTTGCTCATGGAACAAGGGCTTAAGTACCTCAAGTCAGACGGATACG CTATTTTTCTAGCTCCGAGTGATTTGTTGACCAGTCCTCAAAGTGATTTGTTAAAAGAATGGCTGAAAGAAGAGGC GAGTCTGGTTGCTATGATTAGTCTGCCTGAAAATCTCTTTGCTAATGCCAAACAATCTAAGACTATTTTTATCTTAC AGAAGAAAAATGAAATAGCAGTAGAGCCTTTTGTTTATCCACTTGCTAGCTTGCAAGATGCAAGTGTTTTAATGAA ATTTAAAGAAAATTTTCAAAAATGGACTCAAGGTACTGAAATATAA GQTEPLQANHQVPDAIALLLVTTVEELTKEEETTLEMGSGMGILGATTLTSLTVDYLGMEVDDLLIDLTSMADVI GLQAGFVQGDAVRPQMLKESDVVISDLPVGYYPDDAVASRHQVASSQEHTYAHHLLMEQGLKYLKSDGYAIFLAPSD LLTSPQSDLLKEWLKEEASLVAMISLPENLFANAKQSKTIFILQKKNEIAVEPFVYPLASLQDASVLMKFKENFQKWTQG TEIZ ID110 1902 bp ATGATTATTTTACAAGCTAATAAAATTGAACGTTCTTTTGCAGGAGAGGTTCTTTTCGATAATATCAACCTGCAGG TTGATGAACGAGATCGGATTGCTCTTGTTGGGAAAAATGGTGCAGGTAAGTCTACTCTTTTGAAGKETTTAGTTGG AGAAGAGGAGCCAACTAGCGGAGAAATCAATAAGAAAAAAGATATTTCTCTGTCTTACCTAGCCCAAGATAGCCG TTTTGAGTCTGAAAATACCATCTACGATGAAATGCTTCATGWITTAATGATTTGCGTCGGACGGAGAGACAACTG CGTCAGATGGAGCTGGAGATGGGTGAAAAGTCTGGTGAGGATTTGGATAAACTGATGTCAGATTATGACCGCTTA TCTGAGAATTTTCGCCAAGCAGGTGGCTTTACCTATGAAGCTGATATTCGAGCGATTTTGAATGGATTCAAGTTTG ACGAGTCTATGTGGCAGATGAAAATTGCTGAGCTTTCTGGTGGTCAAAATACTCGTTTGGCACTTGCCAAAATGCT CCTTGAAAAGCCCAATCTCTTGGTCTTGGACGAGCCAACTAACCACTTGGATATTGAAACCATCGCCTGGCTAGA GAATTACTTGGTAAACTATAGCGGTGCCCTCATTATCGTCAGCCACGACCGTTATTTCTTGGACAAGGTTGCGACA ATTACGCTAGATTTGACCAAGCATTCCTTGGATCGCTATGTGGGGAATTACTCTCGTTTTGTCGAATTGAAGGAGC AAAAGCTAGTTACTGAGGCAAAAAACTATGAAAAGCAACAGAAGGAAATCGCTGCTCTGGAAGACTTTGTCAATC GCAATCTAGTTCGTGCTTCAACGACTAAACGTGCTCAATCTCGCCGTAAACAACTAGAAAAAATGGAGCGTTTGG ACAAGCCTGAAGCTGGCAAGAAAGCAGCCAACATGACCTTCCAGTCTGAAAAAACGTCGGGCAATGTTGTTTTGA CTGTTGAAAATGCAGCGTTGGCTATGACGGGGAAGTCTTGTCACAACCTATCAACCTAGATCTTCGTAAGATGAA TGCTGTCGCTATCGTTGGTCCAAATGGTATCGGCAAGTCAACCTTATCAAGTCTATTGTGGACCAGATTCCTTTT ATCAAGGGAGAAAAGCGCTTTGGCGCTAATGTTGAGGTTGGTTACTATGACCAAACCCAAAGCAAGCTGACACCA AGTAATACGGTGCTGGATGAATCTGGAATGAITTTCAAACTGACACCAGAAGTTGAAATCCGCAACCGTCTTGGA GCCTTCCTTTTCTCAGGAGATGATGTTAAAAAATCAGTCGGCATGCTATCTGGTGGCGAAAAAGCTCGTTTGCTTT TAGCTAAATTGTCTATGGAAAACAATAACTTTTTGATCTGGATGAGCCGACCAACCACTTGGATATTGATAGTAA GGAAGTGCTAGAAAATGCCTTGATTGACTTTGATGGAACCTTGCTGTTTGTCAGTCATGATCGTTACTTTATCAAT CGTGTGGCAACTCATGTTTTGGAATTGTCTGAGAATGGTTCAACTCTCTACCTTGGAGATTACGACTACTATGTTG AGAAGAAAGCAACAGCAGAAATGAGTCAGACTGAGGAAGCTTCAACTAGCAATCAAGCAAAGGAAGCAAGTCCA GTCAATGACTATCAGGCCCAGAAAGAAAGTCAAAAAGAAGTTCGCAAACTCATGCGACAAATCGAAAGTCTAGA AGCTGAAATTGAAGAGCTAGAAAGTCAAAGCCAAGCCATTTCTGAACAAATGTTGGAAACAAACGATGCCGACA AACTCATGGAATTACAGGCTGAGCTGGACAAAATCAGCCATCGTTCAGGAAGAAGCTATGCTTGAGTGGGAAGAAT TATCAGAGCAGGTGTAA MIILQANKIERSFAGEVLFDNINLQVDERDRIALVGKNGAGKSTLLKILVGEEEPTSGEINKKKDISLSYLAQDSRFESENT IYDEMLHVTNDLRRTERQLRQMELEMGTKSGEDLDKLMSDYDRLSENTRQAGGTTYEADIRATLNGTKTDESMWQMK IAELSGGQNTRLALAKMLLEKPNLLVLDEPTNHLDLETTAWLENYLVNYSGALHVSHDRYFLDKVATTTLDLTKHSLDR YVGNYSRTELKEQKLVTEAKNYEKQQKEIAALEDFVNRNLVRASTTKRAQSRRKQLEKMERLDKPEAGKKAANMTF QSEKTSGNVVLTVENAAVGYDGEVLSQPINLDLRKMNAVATVGPNdIIGKSTTTKSWDQIFITKGEKRTGANVSVGYYDQ TQSKLTPSNTVLDELWNDTKLTPEVEIRNRLGATLTSGDDVKKSVGMLSGGEKARLLLAKLSMENNNPLILDEPTNHL DIDSKEVLENALIDPDGTLLTVSHDRYTTNRVATHVLSLSENGSTLYLGDYDYYVEKKATAEMSQTEEASTSNQAKEAS PVNDYQAQKESQKEVRKLMRQIESLEAEIEELESQSQATSEQMLETNDADKLMELQAELDKISHRQEEAMLEWEELSEQ VZ ID111 1189 bp ATGAATCGCTATGCAGTGCAGTTGATTAGCCGTGGGGCTATCAATAAAATGGGAAATATGCTCTATGATTATGGA AATAGTGTCTGGTTGGCTTCTATGGGGACTATAGGACAGACAGTTTTAGGAATGTATCAGATTTCTGAGCTCGTCA CATCTATTCTCGTCAATCCCTTTGGCGGAGTTATTTCAGACCGTTTTTCTCGTCGTAAGATTTTAATGACGGCAGAT CTTGTTTGTGGGATTCTTTGTCTGGCTATTTCTTTCATAAGGAATGATAGCTGGATGATTGGCGCTTTGATTGTTGC TAACATTGTGCAGGCTATTGCTTTTGCCTTTTCTCGCACAGCCAATAAAGCTATCATAACTGAAGTGGTGGAGAAA GATGAGATTGTGATCTATAATTCTCGCTTAGAGCTGGTTTTGCAGGTTGTAGGTGTTAGCTCTCCTGTTCTTTCTT CCTTGTTTTACAGTTGCAAGTCTCCATATGACGCTACTGCTAGACTCGCTGACTTTTTTCATTGCTTTTGTTCTAG TGGCTTTCCTTTCCAAAAGAGGAAGCAAAAGTTCAAGAGAAAAAGGCTTTTACTGGGAGAGATATTTTTGTAGATA TCAAGGATGGGTTACACTATATCTGGCATCAGCAAGAAATTTTCTTCCTTTTGCTGGTAGCTTCCAGCGTTAATTT CTTTTTTGCAGCTTTTGAATTTCTACTTCCCITTTCGAATCAGCTTTACGGGTCAGAAGGAGCCTATGCAAGTATTT TAACTATGGGGGCTATTGGTTCCATCATTGGGGCTCTTCTAGCTAGTAAAATTAAAGCTAATATTTATAATCTTTT GATTTTACTGGCTTTGACAGGTGTCGGAGTTTTTATGATGGGATTACCACTTCCAACTTTTCTTTCTTTTCTGGAA ATTTAGTTTGTGAATTGTTATGACGATTTTTAATATTCACTTTTTTACTCAAGTACAAACCAAGGTTGAGAGCGAA TTTCTTGGAAGAGTACTGAGTACAATTTTTACCTTAGCTATTCTATTTATGCCTATTGCAAAAGGATTTATGACAGT CTTGCCAAGTGTCCATCTTTATTCTTTCTTGATTATTGGACTTGGAGTTGTAGCCTTATATTTCTTAGCTCTCGGAT ATGTTCGAACTCATTTTGAAAAATTGATATAA MNRYAVQLISRGAINKMGNMLYDYGNSVWLASMGTTGQTVLGMYQISELVTSILVNPTGGVISDRPSRRTTTLMTADLV CGILCLATSTTRNDSWMIGALIVANIVQATATATSRTANKATTTEVVEKDEIVIYNSRLELVLQVVGVSSPVLSTLVLQTASL HMTLLLDSLTTTTATVLVATLPKEEAKVQEKKATGRDTTVDIKDGLHYIWHQQETTTLLLVASSVNTTTAATALPFSN QLYGSEGAYASILTMGATGSIIGALLSKKANIYNLLILLALTGVGVTMMGLPLIFPLSTSGNLVCELTMTTTNIHFFTQV QTKVESETLGRVLTTTTLATLTMPIAKGTMTVLPSVMLYSTLLIGLGVVALYTLALGYVRTHTEKLIZ ID113 2466 bp ATGCAAAATCAATTAAATGAATTAAAACGAAAAATGCTGGAATTTTTCCAGCAAAAACAAAAAAATAAAAAATCA GCTAGACCTGGCAAGAAAGGTTCAAGTACCAAAAAATCTAAAACCTTAGATAAGTCAGCCATTTTCCCAGCTATT TTACTGAGTATAAAAGCCTTATTTAACTTACTCTTTGTACTCGGTTTTCTAGGAGGAATGTTGGGAGCTGGGATTG CTTTGGGATACGGAGTGGCCTTATTTGACAAGGTTCGGGTGCCTCAGACAGAAGAATTGGTGAATCAGGTCAAGG ACATCTCTTCTATTTCAGAGATTACCTATTCGGACGGGACGGTGATTGCTTCCATAGAGAGTGATTTGTTGCGCAC TTCTATCTCATCTGAGCAANTTTCGGAAAATCTGAAGAAGGCTATCATTGCGACAGAAGATGAACACTTTAAAGA ACATAAGGGTGTAGTACCCAAGGCGGTGATCGTGCGACCTGGGGAAATTTGTAGGTTTGGGTTTCCTCTAGTGGG GGTTCAACCTTGACCCAGCAACTAATTAAACAGCAGGTGGTTGGGGATGCGCCGACCTTGGCTCGTAAGGCGGCA GAGATTGTGGATGCTCTTGCCTTGGAACGCGCCATGAATAAAGATGAGATTTTAACGACCTATCTCAATGTGGCTC CCTTTGGCCGAAATAATAAGGGACAGAATATTGCAGGGGCTCGGCAAGCAGCTGAGGGAATTTTCGGTGTAGATG CCAGTCAGTTGACTGTTCCTCAAGCAGCATTTTTAGCAGGACTTCCACAGAGTCCCATTACTTACTCTCCTTATGA AAATACTGGGGAGTTGAAGAGTGATGAAGACCTAGAAATTGGCTTAAGACGGGCTAAGGCAGTTCTTTACAGTAT GTATCGTACAGGTGCATTAAGCAAAGACGAGTATTCTCAGTACAAGGATTATGACCTTAAACAGGACTTTTTACC ATCGGGCACGGTTACAGGAATTTCACGAGACTATTTATACTTTACAACTTTGGCAGAAGCTCAAGAACGTATGTAT GACTATCTAGCTCAGAGAGACAATGTCTCCGCtAAGGAGTTGAAAAATGAGGCAACTCAGAAGTTTTATCGAGAT TTGGCAGCCAAGGAAKTTGAAAATGGTGGTTATAAGATTACTACTACCATAGATCAGAAAATTCATTCTGCCATG CAAAGTGCGGTTGCTGATTATGGCTATCTTTTAGACGATGGAACAGGTCGTGTAGAAGTAGGGAATGTCTTGATG GATAACCAAACAGGTGCTATTCTAGGCTTTGTAGGTGGTCGTAATATCAAGAAAATCAAAATAATCATGCCTTTG ATACCAAACGTTCGCCAGCTTCTACTACCAAGCCCTTGCTGGCCTACGGTATTGCTATTGACCAGGGCTTGATGGG AAGTGAAACGATTCTATCTAACTATCCAACAAACTYTGCTAATGGCAATCCGATTATGTATGCTAATAGCAAGGG AACAGGAATGATGACCTTGGGAGAAGCTCTGAACTATTCATGGAATATCCCTGCTTACTGGACCTATCGTATGCTC CGTGAAAAGGGTGTTGATGTCAAGGGTTATATGGAAAAGATGGGTTACGAGATTCCTGAGTACGGTATTGAGAGC TTGCCAATGGGTGGTGGTATTGAAGTCACAGTTGCCCAGCATACCAATGGCTATCAGACCTTAGCTAATAATGGA GTTTATCATCAGAAGCATGTGATTTCAAAGATTGAAGCAGCAGATGGTAGAGTGGTGTATGAGTATCAGGATAAA CCGGTTCAAGTCTATTCAAAAGCTACTGCGACGATTATGCAGGGATTGCTACGAGAAGTTCTATCCTCTCGTGTGA CAACAACCTTCAAGTCTAACCTGACTTCTTTAAATCCTACTCTGGCTAATGCAGATTGGATTGGGAAGACTGGTAC AACCAACCAAGACGAAAATATGTGGCTCATGCTTTCGACACCTAGATTAACCCTAGGTGGCTGGATTGGGCATGA TGATAATCATTCATTGTCACGTAGAGCAGGTTATTCTAATAACTCTAATTACATGGCTCATCTGGTAAATGCGATT CAGCAAGCTTCCCCAAGCATTTGGGGGAACGAGCGCTTTGCTTTAGATCCTAGTGTAGTGAAATCGGAAGTCTTG AAATCAACAGGTCAAAAACCAGAGAAGGTTTCTGTTGAAGGAAAAGAAGTAGAGGTCACAGGTTCGACTGTTACC AGCTATTGGGCTAATAAGTCAGGAGCGCCAGCGACAAGTTATCGCTTTGCTATTGGCGGAAGTGATGCGGATTAT CAGAATGCTTGGTTAGTATTGTGGGGAGTCTACCAACTCCATCCAGCTCCAGCAGTTCAAGTAGTAGTTCTAGCG ATAGCAGTAACTCAAGTACTACACGACCTCTCTTCAAGGGCGAGACGATAA MQNQLNELKRKMLETTQQKQICNKKSARPGKKGSSTKKSKTLDKSATTPATLLSIKALTNLLTVLGTLGGMLGAGIALGY GVALPPKVRVPQTEELVNQVKDISSJETYSDGTVIASlESDLLRTSISSEQISENLKKAHATEDEHTKEHIKVVPKAVTR ATLGKFVGLGSSSGGSTLTQQLIKQQVVGDAPTLARKAAEIVDALALERAMNKDEILTTYLNVAPFGRNNKGQNIAGA RQAAEGTTGVDASQLTVPQAAPLAGLPQSPTTYSPYENTGELKSDEDLEIGLRRAKAVLYSMYRTGALSKDEYSQYKDY DLKQDFLPSGTVTGISRDYLYTTTLAEAQERMYDYLAQRDNVSAKELKNEATQKFYRDLAAKEIENGGYKTTTTTDQKI HSAMQSAVADYGYLLDDGTGRVEVGNVLMDNQTGAILGFVGGRNYQENQNNHAFDTKRSPASTTKPLLAYGIAIDQG LMGSETTLSNYPTNTANGNPIMYANSKGTGMMTLGEALNYSWNIPAYVTTYRMLRBKGVDVKGYMEKMGYEIPEYGIE SLPMGGGIEVTVAQHTNGYQTLANNGVYNQKHVISKIEAADGRVVYEYQDKPVQVYSKATATIMQGLLREVLSSRVTT TFKSNLTSLNTTLANADWIGKTGTTNQDENMWLMLSTPRLTLGGWIGMDDNHSLSRRAGYSNNSNYMAHLVNAIQQA SPSIWGNEREALDPSVVKSEVLKSTGQKPEKVSVEGKEVEVTGSTVTSYWANKSGAPATSYRTATGGSDADYQNAWSSI VGSLTTPSSSSSSSSSSSDSSNSSTTRPSSSRARRZ ID114 1974 bp ATGAAAAAATTTATGTAAGTCCAATTTTTCCTATTCTAGTAGGATTGATTGCGTTTGGAGTCTTATCCACTTTCAT TATTTTTGTTAATAATAATCTGTTGACGGTTTTAATTTTGTTTCTTTTTGTAGGAGGCTATGTTTTTTTATTTAAGAA ACTGAGAGTGCATTATACAAGGAGTGATGTAGAACAGATACAGTATGTAAACCACCAAGCGGAAGAAAGTTTGAC AGCTCTATTGGAACAGATGCCTGTAGGTGTTATGAAATTGAATTTATCTTCTGGAGAGGTTGAGTGGTTTAATCCC TATGCTGAATTGATTTTGACCAAGGAAGATGGTGATTTTGATTTAGAAGCTGTTCAAACGATTATCAAGGCTTCAG TAGGAAATCCGTCTACTTATGCCAAGCTTGGTGAGAAGCGTTATGCTGTTCATATGGATGCTTCTTCCGGTGTTTT GTATTTTGTAGATGTATCCAGGGAACAAGCCATAACAGATGAATTGGTAACAAGTAGACCAGTGATTGGGATTGT CTCTGTGGATAATTATGATGATTGGAGGATGAAACTCTGAGTCAGATATTAGTCAAATCAJTTAGTTTTGTAGCT AATTTTATATCAGAGTTCAGAAAAACACATGATGTTTTCTCGTCGGGTAAGTATGGATCGKTTTTATCTATTTAC TGACTACACGGTGCTTGAGGGCTTGATGAATGATAAATTTCTGTTATTGATGCTTTCAGAGAAGAGTCGAAACAG AGACAGTTGCCCTTGACCTTAAGTATGGGATTTTCTTATGGCGATGGAAATCATGATGAGATAGGGAAAGTTGCTT TGCTCAATTTGAACTTGGCTGAAGTACGTGGTGGCGACCAGGTGGTTGTTAAGGAAAACGACGAAACGAAAAATC CAGTTTATTTTGCTGGTGGGTCTGCTGCTTCAATCAAGCGTACACGGACTCGTACGCGCGCTATGATGACAGCTAT TTCAGATAAGATTCGGAGTGTAGATCAGGTTTTTTTAGTCGGTCACAAAAATTTAGACATGGATGCTTTGGGCTCT GCTGTAGGTATGCAGTTTGTTCGCCAGCAATGTGKTTGAAAATAGCTATGCTCTTTTATGATGAAGAACAAATGTCTC CAGATATTGAACGAGCTGTTTCATTCATAGAAAAAGAAGGAGTTACGAAGTTGTTGTCTGTTAAGGATGCAATGG GGATGGTGACCAATCGTTCTTTGTGATTCTTGTAGACCATTCAAAGACAGCCTTAACATTATCAAAAGAATTTTA TGATTTATTTACCCAAACCATTGTTATTGACCACCATAGAAGGGATCAGGATTTTCCAGATAATGCGGTTATTACT TATATCGAAAGTGGTGCAAGTAGTGCCAGTGAGTTTGGTAACGGAATTGATTCAGTTCCAGAATTCTAAGAAAAAT CGTTTGAGTCGTATGCAAGCAAGTGTCTTGATGGCTGGTATGATGTTGGATACTAAAAATTTCACCTCGCGAGTAA CTAGTCGGACATTTTGATGTTTGCTAGCTATCTCAGAACGCGCGGAAGTGATAGTATTGCTATCCAGGAAATCGCTGC GACAGATTTTGAAGAATATCGTGAGGTCAATGAACTTATTTTACAGGGGCGTAAATTAGGTTCAGATGTACTAATA GCAGAGGCTAAGGACATGAAATGCTATGATACAGTTGTTATTAGTAAGGCAGCAGATGCCATGTTAGCCATGTCA GGTATTGAAGCGAGTTTTGTTCTGCGAAGAATACACAAGGATLTATCTCTATCTCAGCTCGAAGTCGTAGTAAAC TGAATGTACAACGGATTATGGAAGAGTTTAGGCGGTGGAGGCCACTTTAATTTTGGCAGCAGCTCAAATTAAAGATG TAACCTTGTCAGAAGCAGGTGAAAAACTGACAGAAATTGTATTAAATGAAATGAAGGAAAAGGAGAAAGAAGAA TGA MKKTYVSPTTPILVGLIATGVLSTTTTTVNNNLLTVLILTLTVGGYVTLTKKLRVHYTRSDVEQIQYVNHQABESLTALLE QMPVGVMKLNLSSGEVEWFNPYAELILTKEDGDFDLEAVQTIIKASVGNPSTYAKLGEKRYAVHMDASSGVLYTVDVS REQATTDELVTSRPTTGIVSVDNYDDLEDETSESDISQINSTVANTTSETSEKHMMPSRRVSMDRPYLTTDYTVLEGLMN DKTSVIDATREESKQRQLPLTLSMGTSYGDGNHDEIGKVALLNLNLAEVRGGDQVVVKENDETKNPVYTGGGSAASIK RTRTRTRAMMTATSDKIRSVDQVTVVGHKNLDMDALGSAVGMQLTASNVIENSYALYDEEQMSPDIERAVSTTEKEGV TKLLSVKDAMGMVTNRSLLILVDHSKTALTLSKETYDLTTQTTVIDNEIRRDQDTPDNAVTTYIESGASSASELVTELIQTQ NSKKNRLSRMQASVLMAGMMLDTKNTTSRVTSRTTDVASYLRTRGSDSIATQEIAATDTEEYREVNELILQGRKLGSDV LLAEAKDMKCYDTVVISKAADAMLAMSGIEASTVLAKNTQGTTSISARSRSKLNVQRIMEELGGGGHFNLAAAQIKDVT LSEAGEKLTEIVLNEMKEKEKEEZ ID115 663 bp ATGAAGTGCTTGTTATGTGGGCAGACTATGAAGACTGTTTTAACTTTTAGTAGTCTCTTACTTCTGAGGAATGATG ACTCTTGTCTTTGTTCAGACTGTGATTCTACTTTTGAAAGAATTGGGGAAGAGAACTGTCCAAATTGTATGAAAAC AGAGTTGTCAACAAAGTGTCAAGATTGTCAACTTTGGTGTAAAGAGGGAGTTGAAGTCAGTCATAGAGCGATTTT TACTTACAATCAAGCTATGAAGGATTfTTCAGTCGGTATAAGTTTGATGGAGACTTCCTGTTAAGAAAAGTTTTC GCTTCATTTTTAAGTGAGGAGTTGAAAAAGTACAAAGAGTATCAATTTGTTGTAATTCCCCTAAGTCCTGATAGAT ATGCTAATAGAGGATTTAATCAGGTTGAGGGCTTGGTAGAGGCAGCAGGCTTTGAGTATCTGGATTTATTAGAGA AAAGAGAAGAGAGAGCCAGTTCTTCTAAAAATCGTTCAGAGCGCTTGGGGACAGAACTTCCTTTCTTTATTAAAA GTGGAGTCACTATTCCTAAAAAAATCCTACTTATAGATGATATCTATACTACAGGAGCAACTATAAATCGTGTTAA GAAACTGTTGGAAGAAGCTGGTGCTAAGGATGTAAAAACATTTTCCCTTGTAAGATGA MKCLLCGQTMKTVLTTSSLLLLRNDDSCLCSDCDSTTERIGEENCPNCMKTELSTKCQDCQLWCKEGVEVSHRAIFTY NQAMKDTTSRYKTDGDILLRKVTASPLSEELKKYKEYQTVVIPLSPDRYANRGTNQVEGLVEAAGTBYLDLLEKREER ASSSKNRSERLGTELPTTTKSGVTTPKKILLIDDIYTTGATTNRVKKLLEEAGAKDVKTTSLVRZ ID116 1299 bp ATGAAAGTAAATTTTAGATTATTCGGTCGTTTATTTACTGAGAATGAATTAACAGAAGAAGAACGTCAGTTGGCG GAGAAACTTCCAGCAATGAGAAAGGAGAAGGGGAAACTTTTCTGTCAACGCTGTAATAGTACTATTCTAGAAGAA TGGTATTTGCCCATCGGTGCTACTATTGTCGAGAGTGCTTGCTGATGAAGCGAGTCAGAAGTGATCAAACTTTAT ACTATTTTCCGCAGGAGGATTTTCCAAAGCAAGATGTTCTCAAATGGCGCGGCCAATTAACTCCTTTTCAAGAGAA GGTGTCAGAGGGATTGCTTCAAGTAGTAGACAAGCAAAAGCCAACCTTAGTTCATGCGGTATCAGGAGCTGGAAA GACAGAAATGATTTATCAAGTAQTGGCTAAAGTGATCAATGCGGGTGGTGCAGTGTGTTTGGCTAGTCCTCGCAT AGATGTTTGTTTGGAGCTGTACAAGCGCCTGCAACAGGATTTTTCTTGCGGGATAGCTTTGCTACATGGAGAATCG GAACTTATTTCGAACACCACTAGTTGTTGCAACAACCCATCAGTTATTGAAGTTTTATCAAGCTTTYGATTTGCT GATAGTGGATGAAGTAGATGCTTTCCTTATGTTGATAATCCCATGCTTTACCACGCTGTCAAGAATAGTGTAAAT GAGAATGGATTGAGAATCTTTTTAACAGCGACTTCGACCAATGAGTTAGATAkAAAGGTCCGTTTAGGAGAACTA AAAAGACTGAATTTACCGAGACGGTTCATGGAAATCCGTTGATTATTCCAAAACCAATTIGGTTATCGGATTTTA ATCGCTACTTAGACAAGAATCGTTTGTCACCAAAGTTAAAGTCCTATATTGAGAAGCAGAGAAAGACAGCTTATC CGTTACTCATTTTTGCTTCAGAAATTAAGAAAGGGGAGCAGTTAGCAGAAATCTTACAGGAGCAATTTCCAAATG AGAAAATTGGCTTTGTATGTCTGTAACAGAGGATCGATTTAGAGCAAGTACAAGCTTTTCGAGATGGAGAACTGA CAATACTTATCAGTACGACAATCTTGGAGCGCGGAGTACCTCCCTTGTGTGGATTTTTTTCGTAGTAGAGGCCAA TCATCGTTTGTTTACCAAGTCTAGTTGATTCAGATTGGTGGACGAGTTGGACGAAGCATTGATAGACCGATAGGA GATTTGCTTTTCTCCATGATGGGTTAAATGCTCAATCAAGAAGGCGATTATGGAAATTCAGATGATGAATTAAGG AGGCTGGTCTATGA MKVNLDYLGRLFTENELTEEERQLAEKPAMRKEKGTTLTCQRCNSTTLEEWYLPIGAYYCRECLLMTRVTTSDQTTTYYF PQEDTPKQDVLKWRGQLTPFQEKVSEGLLQVVDKQKPTLVHAVTGAGKTBMIYQVVAKVINAGGAVCLASPRIDVCL ELYKRLQQDTSCGIALLHGESEPYTRTPL,VVATTHQLLKTYQATDLLLVDEVDATPYVDNPMLYHAVKNSVKENGLRTT LTATSTNELDKKVRLGELKRLNLPRRTHGNPLIIPKPIWLSDINRYLDKNRLSPKLKSYIEKQRKTAYPLLIIASEIKKGE QLAEILQEQTPNEKIGTVSSVTBDRLEQVQATRDGELTTLISTTTLERGVTTPCVDVTVVEANHRLTTKSSLIQIGGRVGRS MDRPTGDLLTTHDGLNASIKKATKEIQMMNKBAGLZ ID117 870 bp ATGCAAATTCAAAAAAGTTTTAAGGGGCAGTCTCCCTATGGCAAGCTGTATCTAGTGGCAACGCCGATTGGCAAT CTAGATGATATGACTTTTCGTGCTATCCAGACCTTGAAAGAAGTGGACTGGATTGCTGCTGAGGATACGCGCAAT ACAGGGCTTTTGCTCAAGCATTTTGACATTTCCACCAAGCAGATCAGTTTTCATGAGCACAATGCCAAGGAAAAA ATTCCTGATTGATTGGTTTCTTGAAAGCAGGGCAAAGTATTGCTCAGGTCTCTGATGCCGGTTTGCCTAGCATTT CAGACCCTGGTCATGATTTAGTTAAGGCAGCTATTGAGGAAGAAATTGCAGTTGTGACAGTTCCAGGTGCCTCTGC AGGAATTTCTGCCTTGATTGCCAGTGGTTTAGCGCCACAGCCACATATCTTTTACGGTTTTTTACCGAGAAAATCA GGTCAGCAGAAGCAATTTTTTGGCTTGAAAAAAGATTATCCTGAAACACAGATTTTTTATGAATCACCTCATCTG TAGCAGACACGTTGGAAAATATGTTAGAACTCTACGGTGACCGCTCCGTTGTCTTGGTCAGGGAATTGACCAAAA TCTATGAAGAATACCAACGAGGTACATCTCTGAGTTATTAGAAAGCATTGCTGAAACGCCACTCAAGGGCGAAAT GTCTTCTCATTGTTGAGGGTGCCAGTCAGGGTGTGGAGGAAAAGGACGAGGAAGACTTGTTCGTAGAAATTCAAA CCCGCATCCAGCAAGGTGTGAAGAAAAACCAAGCTATCAAGGAAGTCGCTAAGATTTACCAGTGGAATAAAAGTC AGCTCTACGCTGCCTACCACGACTGGGAAGAAAAACAATAA MQIQKSFKGQSPYGKLYLVATPIGNLDDMTTRATQThKBVDWIAAEDTRNTGLLLKHTDISTKQISTHEHNAKEKTPKLI GTLKAGQSIAQVSDAGLPSISDPGNDLVKAATEEEIAVVTVPGASAGISALIASGLAPQPHTTYGTLPRKSGQQKQTPGLKK DYPETQTTYESPHRVADTLENMLEVYGDRSVVLVRBLTKIYEEYQRGTTSELLESIAETPLKGECLLIVEGASQGVEEKDE EDLTVEIQTRIQQGVKKNQATKEVAKIYQWNKSQLYAAYHDWEEKQZ ID118 345 bp ATGATAAAGAAAGGAAAGGGCTGTTTTATGGACAAAAAAGAATTATTTGACGCGCTGGATGATTTTTCCCAACAA TTATTGGTAACCTTAGCCGATGTGGAAGCCATCAAGAAAAATCTCAAGAGCCTGGTAGAGGAAAATACAGCTCTT CGCTTGGAAAATAGTAAGTTGCGAGAACGCTTGGGTGAGGTGGAAGCAGATGCTCCTGTCAAGGCCAAGCATGTT CGCGAAAGTGTCCGTCGTATTACCGTGATGGATTTCACGTATGTAATGATTTTTATGGACAACGTCGAGAGCAGG ACGAAGAATGTATGTTTTGTGACGAGTTTGTTATACAGGGAGTAA MIKKGKGCTMDKKELTDALDDFSQQLLVTLADVEAIKKNLKSLVEENTALRNSKRBRLGEVEADAPVKAKHVRES VRRIYPDGTHVCNDTYGQRREQDEECMTCDELLYREZ ID119 639 bp ATGTCAAAAGGATTTTTAGTCTCTCTTGAGGGACCAGAGGGAGCAGGCAAGACCAGTGTTTTAGAGGCTCTGCTA CCAATTTTAGAGGAAAAAGGAGTAGAGGTGTTGACGACCCGTGAACCTGGCGGAGTCTTGATTGGGGAGAAGATT CGGGAAGTGATTTTGGATCCAAGTCATACTCAGATGGATGCTAAAACAGAGCTACTTCTCTATATTGCCAGTCGCA GACAGCATTTGGTGGAAAAAGTTCTTCCAGCCCTTGAAGCTGGCAAGTTGGTCATCATGGATCGTTTTATCGATAG TTCTGTTGCCTATCAGGGATTTGGTCGTGGCTAGATATTGAAGCCATTGACTGGCTCAATCAGTTTGCGACAGAT GGCCTCAAACCCGATTTGACACTCTATTTTGACATCGAGGTGGAAGAAGGGCTGGCTCGTATTGTGTTAATAGTG ACCGCGAGGTTAATCGTTTGGATTTGGAAGGGTTGGACTTGCATAAAAAAGTTCGTCAAGGCTACCTTTCTCTTCT GGATAAAGAGGGAAATCGCATTGTCAAGATTGATGCTAGTCTCCCTTTGGAGCAAGTTGTGGAAACTACCAAGGC TGTCTTGTTTGACGGAATGGGCTTGGCCAAATGA MSKGFLVSLEGPEGAGKSVLEALLPILEEKGVEVLTTREPGGVLIGEKIREVILDPSHTQMDAKTELLLYIASTQHLVE KVLPALEAGKLTTMDRTTDSSVAYQGTGRGLDIEATDWLNQTATGGLKPDLTLYTDTTVEEGLAANSDRTVNRLDL EGLDLHKKVRQGYLSLLDKEGNRIVKIDASLPLEQVVETTKAVLTDGMGLAKZ ID120 408 bp ATGGTAGAACAAAGAAAATCAATTACCATGAAAGATGTTGCTTTAGAAGCAGGAGTTAGTGTTGGTACTGTTTCA CGTGTAATTAATAAAGAAAAAGGCATTAAAGAAGTAACTTTGAAAAAAGTGGAACAAGCGATTAAAACTTTGAAT TACATTCCAGATTACTACGCTAGAGGAATGAAAAAAAATCGAACAGAAACGATTGCAATCATTGTACCAAGTATC TGGCATCCCTTCTTTTCAGAATTTGCTATGCATGTGGAAAATGAAGTCTATAAGAGAAATAACATTTACTCTTAT GTTCTATCAATGGTACAAATAGAGAGCAAGACTATCTGGAGATGTTGCGTCATAATAAATGTTGATGGAGTGTTG CCATTACCTATAGGCCAATTGAACATTACTTGACGTCAGGAATTCCCTTTGTTAGTATTGACCGCACATATCAGA GATTGCCATCCVTGTGTTTCA MVEQRKSTTMKDVALEAGVSVGTVSRTTNKKGTTTEVTLKKVEQATTLNYIPDYYARGMKKNRTETTATTVPSIWHPFF SEFAMHVENEVYKRNNKLLLCSINGTNEQDYLEMLRAINKVDGVVATTYRPIEHYLTSGIPTVSEDRTYSEIAIPCVS ID126 1281 bp TTGTTTAACAAAAATAAAGACATTCTTAATATTGCATTGCCAGCTATGGGTGAAAACTTTTTGCAGATGCTAATGG GAATGGTGGACAGTTATGGTTGCTCATTTAGGATTGATAGCTATTTCAGGGGTTTTCAGTAGCTGGTAATATTAT CACCATTTATCAGGCGATTTTCATCGCTCTGGGAGCTGCTATTTCCAGTGTTATTTCAAAAAGCATAGGGCAGAAA GACCAGTCGAAGTTGGCCTATCATGTGACTGAGGCGTTGAAGATTACCTACTATTAAGTTTCCTTTTAGGATTTT TGTCCATCTTCGCTGGGAAAGAGATGATAGGACTTTTGGGGACGGAGAGGGATGTAGCTGAGAGTGGTGGACTGT ATCTATCTTTGGTAGGCGGATCGATTGTTCTCTTAGGTTTAATGACTAGTCTAGGAGCCTTGATTCGTGCAACGCA TAATCCACGTCTGCCTCTCTATGTTAGTTTTTTATCCAATGCCTTGAATATTCTTTTTTCAAGTCTAGCTATTTTTGT TCTGGATATGGGGATAGCTGGTGTTGCTGGGGGACAATTGTGTCTCGTTGGTTGGTCTTGTGATTTTGTGGTCAC AATTAAAACTGCCTTATGGGAAGCCAACTTTTGGTTTAGATAAGGAACTGTTGACCTTGGCTTTACCAGCAGCTGG AGAGCGACTTATGATGAGGGCThGAGATGTAGTGATCATTGCCTTGGTCGTTTCTTTTGGGACGGAGGCAGTTGCT GGGAATGCAATCGGAGAAGTCTTGACCCAGTTTAACTATATGCCTGCCTTTGGCGTCGCTACGGCAACGGTCATG CTGTTGGCCCGAGCAGTTGGAGAGGATGATTGGAAAAGAGTTGCTAGTTGAGTAAACAATCCTTTTGGCTTTCTC TGTTTCCTCATGTTGCCCCTGTCCTTTTTAGTATATATGTCTTGGGTGTACCATTAACTCATCTCTATACGACTTCT CTAGCGGTGGAGGCTAGTGTTCTAGTGACACTGTTTTCACTACTTGGGACCCCTATGACGACAGGAACAGTCATCT ATACGGCAGTCTGGCAGGGATTAGGAAATGCACGCCTCCCTTTTTATGCGACAAGTATAGGAATGTGGTGTATCC GCATTGGGACAGGATATCTGATGGGGATTGTGCTTGGTTGGGGdTTGCCTGGTATTTGGGCAGGGTCTCTCTTGGA TAATGGTTTTCGCTGGTTATTTCTACGCTATCGTTACCAGCGCTATATGAGCTTGAAAGGATAG LFKKNKDILNIALPAMGENTLQMLMGMVDSYLVAHLGLIATSGVSVAGNITTTYQATTTALGAATSSVISKSIGQKDQSKLA YHVTEALKITLLLSFLLGFSIFAGKEMIGLLGTERDVAESGGLYLSLVGGSIVLLGLMTSLGALIRATHNPRLPLYVSFL SNALNIILFSSLATTVLDMGIAGVAWGTTVSRLVGLTTLWSQLKLPYGKGLPKELLTLALPATGERLMMTGDVVIIA LVVSTGTEAVAGNATGEVLTQPNYMPATGVATATVMLLARAVGEGDWKRVASLSKQTTWLSLTLMLPLSTSIYVLGVP LTTILYTTTDSLAVEASVLVTLTSLLGTPMTTGTVIYTAVWQGLGNARLPTYATSIGMWCTRIGTGYLMGIVLGWGLPGIW AGSLLDNGTRWLTLRYRYQRYMSLKGZ ID127 894 bp GTGGGAAGAATTATCAGAGCAGGTGTAAAGATGGAACATCTTGGAAAAGTATTTCGTGAATTTCGAACAAGTGGA AATTATTCTTTAAAGGAAGCAGCAGGCGAATCCTGCTCTACCTCTCAGTTATCTCGCTTTGAGCTTGGGGAGTCTG ACCTGGCAGTCTCCCGTTTCTTTGAGATTTTGGATAACATTCATGTAACAATCGAAAATTTCATGGATAAGGCAAG GAATTTTCATAATCATGAACATGTGTCTATGATGGCACAGATTATCCCACTTTACTATTCAAACGATATTGCAGGT TTTCAAAAGCTTCAAAGAGAACAACTTGAAAAGTCTAAGAGTTCGACGACTCCCCTTTATTTTGAGCTGAACTGGA TTTTGCTACAAGGTCTGATTTGTCAAAGAGATGCGAGTTATGATATGAAGCAGGATGATTTGGGTAAGGTAGCAG ATTATCTCTTCAAAACAGAAGAATGGACCATGTATGAGTTGKTTCTTTTCGGTAACCTCTATAGTTTCTACGATGT AGACTATGTCACTCGGATTGGTAGAGAAGTTATGGAGAGGGAGGAATTTTACCAAGAGATTAGTCGCCATAAGAG ATTAGTGTTGATTTGGCCCTCAATTGTTACCAGCATTGTTTAGAGCATTGTCTTTTTATAATGCCAACTATTTTG AGGCTTATACAGAGAAGATTATTGACAAAGGTATTAAGCTTTATGAGCGTAATGTTTTCCATTATTTAAAAGGTTT TGCCTTATATCAAAAAGGACAGTGTAAAGAAGGCTGTAAGCAGATGCAAGAGGCCATGCATATTTTTGATGTGTT AGGTCTTCCAGAGCAAGTAGCCTATTATCAGGAACACTACGAAAAATTTGTCAAAAGTTAA VGRIIRAGVKMEHLGKVFREFRTSGNYSKEAAGESCSTSQLRFELGESDLAVSRFFEILDNIHVTIENFMDKARNFHN HEHVSMMAQIIPLYYSNDIAGTQKLQREQLEKSKSSIIPLYTELNWILLQGLICQRDASYDMKQGDDLGKVADYLTKTEE WTMYELILTGNLYSTYDVDYVTRIGREVMEREEPYQEISRHKRLVLILALNCYQHCLEHSSTYNANYFEAYTEKIIDKGI KLYERNVTHYLKGTALYQKGQCKEGCKQMQEAMHTTDVLGLPEQVAYYQEHYEKTVKSZ
[0163] 7 TABLE 3 ID1 1068 bp ATGTCTAACATTCAAAACATGTCCCTGGAGGACATCATGGGAGAGCGCTTTGGTCGCTACTCCAAGTACATTATTC AAGACCGGGCTTTGCCAGATATTCGTGATGGGTTGAAGCCGGTTCAGCGCCGTATTCTTTATTCTATGAATAAGGA TAGCAATACTTTTGACAAGAGCTACCGTAAGTCGGCCAAGTCAGTCGGGAACATCATGGGGAATTTCCACCCACA CGGGGATTCTATCTATGATGCCATGGTTCGTATGTCACAGAACTGGAAAAATCGTGAGATTCTAGTTGAAATG CACGGTAATAACGGTTCTATGGACGGAGATCCTCCTGCGGCTATGCGTTATACTGAGGCACGTTTGTCTGAAATTG CAGGCTACCTTCTTCAGGATATCGAGAAAAAGACAGTTCCTTTGCATGGAACTTTGACGATACGGAGAAAGAAC CAACGGTCTTGCCAGCAGCCTTTCCAAACCTCTTGGTCAATGGTTCGACTGGGATTTCGGCTGGTTATGCCACAGA CATTCCTCCCCATAATTTAGCTGAGGTCATAGATGCTGCAGTTTACATGATTGACCACCCAACTGCAAAGATTGAT AAACTCATGGAATTCTGCCTGGACCAGACTTCCCTACAGGGGCTATTATTCAGGGTCGTGATGAAATCAAGAAA GCTTATGAGACTGGGAAAGGGCGCGTGGTTGTTCGTTCCAAGACTGAAATTGAAAAGCTAAAAGGTGGTAAGGA CAAATCGTTATTATTGAGATTCCTTATGAAATCAATAAGGCCAATCTAGTCAAGAAAATCGATGATGTTCGTGTTA ATAACAAGGTAGCTGGGATTGCTGAGGTTCGTGATGAGTCTGACCGTGATGGTCTTCGTATCGCTATCGAACTTAA GAAAGACGCTAATACTGAGCTTGTTCTCAACTACTTATTTAAGTACACCGACCTACAAATCACATACAACTTTAAT ATGGTGGCGATTGACAATTTCACACCTCGTGAGGTTGGATTGTTCCAATCCTGTCTAGCTATATCGCTCACCGTCG AGAAGTGA MSNIQNMSLEDIMGERFGRYSKYIIQDRALPBIRDGLKPVQRRILYSMNKDSNTFDKSYRKSAKVGNIMGNFHPHGDS SIYDAMVRMSQNWKNREILVEMHGNNGSMDGDPPAAMRYTEARLSEIAGYLLQDIEKKTVPPFAWNFDDTEKEPTVLP AAFPNLLVNGSTGISAGYATDIPPHNLAEVIDAAVYMIDHPTAKIDKLMEFLPGPDFPTGAIIQGRDEIKKAYETGKGRV VVRSKTEIEKLKGGKEQIVIIEIPYEINKANLVKKIDDVRVNNKVAGIAEVRDESDRDGLRIAIELKKDANTELVLNYLFK YTDLQINYNFNMVAIDNFTPRQVGLFQSCLAISLTVEKZ ID12 684 bp ATGCCGACATTAGAAATAGCACAAAAAAAACTGGAGTTCATTAAGAAGGCAGAAGAATATTACAATGCCTTGTGT ACAAATATACAGTTGAGCGGAGATAAACTAAAAGTAATTCCGTTACTTCTGTTAACCCTGGGGAAGGAAAAAACA ACTACTTCCATAAATATAGCATGGTCGTTTGCGCGTGCAGGCTATAAAACTCTTTTGATCGATGGCGATACTCGAA ATTCAGTTATGTTAGGAGTTTTTAAATCTCGTGAAAAAATTACAGGGCTAACAGAATTTTTATCTGGGACAGCTGA TTTATCTCACGGTTTATGTGATACAAATATTGAAAATTATTTGTAGTTCAATCGGGATCTGTATCACCAAACCCT ACAGCCTTGTTACAAAGTAAAAATTTAATGATATGATTGAAACATTGCGTAAATATTTTGATTATATCATTATTG ATACACCGCCTATTGGAATTGTTATTGATGCGGCAATTATCACTCAAAAGTGTGATGCGTCCATCTTGGTAACAGC AACAGGTGAGGCGAATAAACGTGATATCCAAAAAGCGAAACAACAATTAAAACAAACAGGGAAACTGTTCCTAG GAGTTGTTTTAAATAAATTGGATATCTCGGTTAATAAGTATGGAGTTTACGGTTCCTATGGAAATTATGGTAAAAA ATAA MPTLEIAQKKLEFIKKAEEYYNALCTNIQLSGDKLKVISVTSVNPGEGKTTTSINIAWSFARAGYKTLLIDGDTRNSVML GVFKSREKITGLTEFLSGTADLSHGLCDTNIENLFVVQSGSVSPNPTALLQSKNFNDMIETLRKYFDYIIIDTPPIGIVIDAA IITQKCDASILVTATGEANKRDIQKAKQQLKQTGKLFLGVVLNKLDISVNKYGVYGSYGNYGKKZ ID13 1182 bp ATGGAGGCAAATATGAAACATCTAAAAACATTTTACAAAAAATGGTTTCAATTATTAGTCGTTATCGTCATTAGCT TTTTTAGTGGAGCCTTGGGTAGTTTTTCAATAACTCAACTAACTCAAAAAAGTAGTGTAAACAACTCTAACAACAA TGTTTCTGTTATTACTTATTCGGCAAACAGACAAAATAGCGTATTTGGCAATGATGATACTGACACAGATTCTCAG CGAATCTCTAGTGAAGGATCTGGAGTTATTTATAAAAAGAATGATAAAGAAGCTTACATCGTCACCAACAATCAC GTTATTAATGGCGCCAGCAAAGTAGATATTCGATTGTCAGATGGGACTAAAGTACCTGGAGAAATTGTCGGAGCT GACACTTTCTCTGATATTGCTGTCGTCAAAATCTCTTCAGAAAAAGTGACAACAGTAGCTGAGTTTGGTGATTCTA GTAAGTTAACTGTAGGAGAAACTGCTATTGCCATCGGTAGCCCGTTAGGTTCTGAATATGCAAATACTGTCACTCA AGGTATCGTATCCAGTCTCAATAGAAATGTATCCTTAAAATCGGAAGATGGACAAGCTATTTCTACAAAAGCCAT CCAAACTGATACTGCTATTAACCCAGGTAACTCTGGCGGCCCACTGATCAATATTCAAGGGCAGGTTATCGGAAT TACCTCAAGTAAAATTGCTACAAATGGAGGAACATCTGTAGAAGGTCTTGGTTTCGCAATTCCTGCAAATGATGCT ATCAATATTATTGAACAGTTAGAAAAAAACGGAAAAGTGACGCGTCCAGCTTTGGGAATCCAGATGTTCGTTCGG TACAAAGTAATATGCCTGCCAATGGTCACCTTGAAAAATACGATGTAATTACAAAAGTAGATGACAAAGAGATTG CTTCATCAACAGACTTACAAAGTGCTCTTTACAACCATTCTATCGGAGACACCATTAAGATAACCTACTATCGTAA CGGGAAAGAAGAAACTACCTCTATCAAACTTAACAAGAGTTCAGGTGATTTAGAATCTTAA MEANMKHLKTFYKKWFQLLVVIVISFFSGALGSFSITQLTQKSSVNNSNNNSTITQTAYKNENSTTQAVNKVKDAVVSV ITYSANRQNSVFGNDDTDTDSQRISSEGSGVIYKKNDKEAYIVTNNHVINGASKVDIRLSDGTKVPGEIVGADTFSDLAV VKISSEKVTTVAEFGDSSKLTVGETAIAIGSPLGSEYANTVTQGIVSSLNRNVSLKEDGQAISTKAIQTDTAINPGNSGGP LINIQGQVIGITSSKIATNGGTSVEGLGFAIPANDAINIIEQLEKNGKVTRPALGIQMVNLSNVSTSDIRRLNIPSNVTSGVIV RSVQSNMPANGHLEKYDVITKVDDKEIASSTDLQSALYNHSIGDTIKITYYRNGKEETTSIKLNKSSGDLESZ ID15 939 bp ATGGCAGAAATTTATCTAGCAGGTGGTTGTTTTTGGGGCCTAGAGGAATATTTTTCACGCATTTCTGGAGTGCTAG AAACCAGTGTTGGCTACGCTAATGGTCAAGTCGAAACGACCAATTACCAGTTGCTCAAGGAAACAGACCATGCAG AAACGGTCCAAGTGATTTACGATGAGAAGGAAGTGTCACTCAGAGAGATTTTACTTTATTATTTCCGAGTTATCGA TCCTATCTATCAATCAACAAGGGAATGACCGTGGTCGCCAATATCGACTGGGATTTAT1ATCAGGATGAAGC AGATTTGCCAGCTATCTACACAGTGGTGCAGGAGCAGGAACGCATGCTGGGTCGAAAGATTGCAGTAGAAGTGGA GCAATTACGCCACTACATTCTGGCTGAAGACTACCACCAAGACTATCTCAGGAAGAATCCTTCAGGTTACTGTCAT ATCGATGTGACCGATGCTGATAAGCCATTGATTGATGCAGCAAACTATGAAAAGCCTAGTCAAGAGGTGTTGAAG GCCAGTCTATCTGAAGAGTCTTATCGTGTCACACAAGAAGCTGCTACAGAGGCTCCATTFACCAATGCCTATGACC AAACCTTTGAAGAGGGGATTTATGTAGATATTACGACAGGTGAGCCACTCTTTTTTGCCAAGGATAAGTTTGCTTC AGGTTGTGGTTGGCCAAGTTTTAGCCGTCCGATTTCCAAAGAGTTGATTCATTATTACAAGGATCTGAGGCCATGGA ATGGAGCGAATTGAAGTTCGTTCTCGTTCAGGCAGTGCTCACTTGGGTCATGTTTTCACAGATGGACCGCGGGAGT TAGGCGGCCTCCGTTACTGTATCAATTCTGCTTCTTTACGCTTTGTGGCCAAGGATGAGATGGAAAAAGCAGGATA TGGCTATCTATTGCCTTACTTAAACAAATAA MAEIYLAGGCFWGLEEYFSRISGVLETSVGYANGQVETTNYQLLKETDHAETVQVIYDEKEVSLREILLYYFRVIDPLSI NQQGNDRGRQYRTGIYYQDEADLPAIYTVVQEQERMLGRKIAVEVEQLRHYILAEDYHQDYLRKNPSGYCHIDVTDA DKPLIDAANYEKPSQEVLKASLSEESYRVTQEAATEAPFTNAYDQTFEEGIYVDITTGEPLFFAKDKFASGCGWPSFSRPI SKELIHYYKDLSHGMERIEVRSRSGSAHLGHVFTDGPRELGGLRYCINSASLRFVAKDEMEKAGYGYLLPYLNKZ ID17 870 bp ATGAAGATTATTGTACCTGCAACCAGTGCCAATATCGGGCCAGGTTTTGACTCGGYCGGTGTAGCTGTAACCAAGT ATCTTCAAATTGAGGTCTGCGAAGAACGAGATGAGTGGCTGATTGAACACCAGATTGGCAAATGGATTCCACATG ACGAGCGTAATCTCTTGCTCAAAATCGCTTTGCAAATTGTACCAGACTTGCAACCAAGACGCTTGAAAATGACCA GTGATGTCCCTTTGGCGCGCGGTTTGGGTTCTTCCAGCTCGGTTATCGTTGCTGGGATTGAACTAGCCAACCAACT GGGTCAACTCAACTTATCAGACCATGAAAATTGCAGTTAGCGACCAAGATTGAAGGGCATCCTGACAATGTGGC TCCAGCCATTTATGGTAATCTCGTTATTGCAAGTTCTGTTGAAGGGCAAGTCTCTGCTATCGTAGCAGACTTTCCA GAGTGTGATTTTCTAGCTTACATTCCAAACTATGAATTACGTACTCGCGACAGCCGTAGTGTCTTGCCTAAAAAAT TGTCTTATAAGGAAGCTGTTGCTGCAAGTTCTATCGCCAATGTAGCGGTTGCTGCCTTGTTGGCAGGAGACATGGT GACCGCTGGGCAAGCAATCGAGGGAGACCTCTTCCATGAGCGCTATCGTCAGGACTTGGTAAGAGAATTTGCGAT GATTAAGCAAGTGACCAAAGAAAATGGGGCCTATGCAACCTACCTTTCTGGTGCTGGGCCGACAGTTATGGTTCT GGCTTCTCATGACAAGATGCCAACAATTAAGGCAGAATTGGAAAAGCAACCTTTCAAAGGAAAACTGCATGACTT GAGAGTTGATACCCAAGGTGTCCGTGTAGAAGCAAAATAA MKIIVPATSANIGPGFDSVGVAVTKYLQIEVCEERDEWLIEHQIGKWIPHDERNLLLKIALQIVPDLQPRRLKMTSDVPLA RGLGSSSSVIVAGIELANQLGQLNLSDHEKLQLATKIEGHPDNVAPAIYGNLVIASSVEGQVSAIVADFPECDFLAYIPNY ELRTRDSRSVLPKKLSYKEAVAASSLANVAVAALLAGDMVTAGQAIEGDLFHERYRQDLVREFAMIKQVTKENGAYAT YLSGAGPTVMVLASHDKMPTIKAELEKQPFKGKLHDLRVDTQGVRVEAKZ ID20 564 bp ATGAAATATCACGATTACATCTGGGATTTAGGTGGAACTTTACTGGATAATTATGAAACTTCAACAGCTGCATTTG TTGAAACATTGGCACTGTATGGTATCACACAAGACCATGACAGTGTCTATCAAGCTTTAAAGGTTTCTACTCCTTT TGCGATTGAGACATTCGCTCCCAATTTAGAGAATTTTTTAGAAAAGTACAAGGAAAATGAAGCCAGAGAGCTTGA ACACCCGATTTTATTTGAAGGAGTTTCTGACCTATTGGAAGACATTTCAAATCAAGGTGGCCGTCATTTTTTGGTC TCTCATCGAAATGATCAGGTTTTGGAAATTTTAGAAAAAACCTCTATAGCAGCTTATTTTACAGAAGTGGTGACTT CTAGCTCAGGCTTTAAGAGAAAGCCAAATCCCGAATCCATGCTTTATTTAAGAGAAAAGTATCAGATTAGCTCTG GTCTTGTCATTGGTGATCGGCCGATTGATATCGAAGCAGGTCAAGCTGCAGGACTTGATACCCACTTGTTTACCAG TATCGTGAATTTAAGACAAGTATTAGACATATAA MKYHDYIWDLGGTLLDNYETSTAAFVETLALYGITQDHDSVYQALKVSTPFAIETFAPNLENFLEKYKENEARELEHPI LFEGVSDLLEDISNQGGRHFLVSHRNDQVLEILEKTSIAAYFTEVVTSSSGFKRXPNPESMLYLREKYQISSGLVIGDRPID IEAGQAAGLDTHLFTSIVNLRQVLDIZ ID21 1875 bp ATGACAGAAGAAATCAAAAATCTGCAGGCACAGGATTATGATGCCAGTCAAATTCAAGTTTTAGAGGGCTTAGAG GTCGTTCGTATGCGTCCAGGGATGTACATTGGATCAACCTCAAAAGAAGGTCTTCACCATCTAGTCTGGGAAATTG TTGATAACTCAATTGACGAGGCCTTGGCAGGATTTGCCAGCCATATTCAAGTTTTTATTGAGCCAGATGATTCGAT TACTGTTGTGGATGATGGGCGTGGTATCCCAGTCGATATTCAGGAAAAAACAGGCCGTCCTGCTGTTGAGACCGT CTTTACAGTCCTTCACGCTGGAGGAAAGTTCGGCGGTGGTGGATACAAGGTTTCAGGTGGTCTTCACGGGGTGGG GTCGTCAGTAGTTAATGCCCTTTCCACTCAATTAGACGTTCATGTTCACAAAAATGGTAAGATTCATTACCAAGAA TACCGTCGTGGTCATGTTGTCGCAGATCTTGAAATAGTTGGAGATACGGATAAAACAGGAACAACTGTTCACTTC ACACCGGACCCAAAAATCTTCACTGAAACAACAATCTTTCATTTTGATAAATTAAATAAACGGATTCAAGAGTTG GCCTTTCTAAATCGCGGTCTTCAAATTTCAATTACAGATAAGCGCCAAGGTTTGGAACAAACCAAGCATTATCATT ATGAAGGTGGGATTGCTAGTTACGTTGAATATATCAACGAGAACAAGGATCTAATCTTTGATACACCAATCTATA CAGACGGTGAGATGGATGATATCACAGTTGAGGTAGCCATGCAGTACACAACTGGTTACCATGAAAATGTCATGA GTTTCGCCAATAATATTCATACCCATGAAGGTGGAACACATGAACAAGGTTTCCGTACAGCCTTGACACGTGTTAT CAACGATTATGCTCGTAAAAATAAGTTACTGAAAGACAATGAAGATAATTTAACAGGGGAAGATGTTCGCGAAGG CTTAACTGCAGTTATCTCAGTTAAACACCCAAATCCACAGTTTGAAGGACAAACCAAGACCAAATTGGGAAATAG CGAAGTGGTCAAGATTACCAATCGCCTCTTCAGTGAAGCTTTCTCCGATTTCCTCATGGAAAATCCACAGATTGCC AAACGTATCGTAGAAAAAGGAATTTTGGCTGCCAAGGCTCGTGTGGCTGCCAAGCGTGCGCGTGAAGTCACACGT AAAAAATCTGGTTTGGAAATTTCCAACCTTCCAGGGAAACTAGCAGACTGTTCTTCTAATAACCCTGCTGAAACAG AACTCTTCATCGTCGAAGGAGACTCAGCTGGTGGATCAGCCAAATCTGGTCGTAACCGTGAGTTTCAGGCTATCCT TCCAATTCGCGGTAAGATTTTGAACGTTGAAAAAGCAAGTATGGATAAGATTCTAGCCAACGAAGAAATTCGTAG TCTTTTCACAGCCATGGGAACAGGATTTGGCGCAGAATTTGATGTTTCGAAAGCCCGTTACCAAAAACTCGTTTTG ATGACCGATGCCGATGTCGATGGAGCCCACATTCGTACCCTTCTTTTAACCTTGATTTATCGTTATATGAAACCAA TCCTAGAAGCTGGTTATGTTTATATTGCCCAACCACCAATCTATGGTGTCAAGGTTGGAAGCGAGATTAAAGAATA TATCCAGCCGGGTGCAGATCAAGAAATCAAACTCCAAGAAGCTTTAGCCCGTTATAGTGAAGGTCGTACCAAACC GACTATTCAGCGTTATAAGGGGCTAGGTGAAATGGACGATCATCAGCTGTGGGAAACAACCATGGATCCCGAACA TCGCTTGATGGCTAGAGTTTCTGTAGATGATGTGCAGAAGCAGATAAAATCTTTGATATGTTGA MTEEIKNLQAQDYDASQIQVLEGLEAVRMRPGMYIGSTSKEGLHHLVWEIVDNSIDEALAGFASHIQVFIEPDDSITVVD DGRGIPVDIQEKTGRPAVETVFTVLHAGGKFGGGGYKVSGGLHGVGSSVVNALSTQLDVHVHKNGKIHYQEYRRGHV VADLEIVGDTDKTGTTVHFTPDPKIFTETTIFDFDKLNKRIQELAFLNRGLQISITDKRQGLEQTKHYHYEGGIASYVEYI NENKDVIFDTPIYTDGEMDDITVEVAMQYTTGYHENVMSFANNIHTHEGGTHEQGFRTALTRVINDYARKNKLLKDN EDNLTGEDVREGLTAVISVKHPNPQFEGQTKTKLGNSEVVKITNRLFSEAFSDFLMENPIAKRIVEKGILAAKARVAAK RAREVTRKKSGLEISNLPGKLADCSSNNPAETELFIVEGDSAGGSAKSGRNREFQAILPIRGKILNVEKASMDKILANEEI RSLFTAMGTGFGAEFDVSKARYQKLVLMTDADVDGAHIRTLLLTLIYRYMKPILEAGYVYIAQPPIYGVKVGSEIKEYI QPGADQEIKLQEALARYSEGRTKPTIQRYKGLGEMDDHQLWETTMDPEHRLMARVSVDDVQKQIKSLICZ ID54 1446 bp ATGAGTAGACGTTTTAAAAAATCACGTTCACAGAAAGTGAAGCGAAGTGTTAATATAGTTTTGCTGACTATTTATT TATTGTTAGTTTGTTTTTTATTGTTCTTAATCTTTAAGTACAATATCCTTGCTTTTAGATATCTTAATCTAGTGGTAA CTGCGTTAGTCCTACTAGTTGCCTTGGTAGGGCTACTCTTGATTATCTATAAAAAAGCTGAAAAGTTTACTATTTTT CTGTTGGTGTTCTCTATCCTTGTCAGCTGTGTCGCTCTTTGCAGTACAGCAGTTTGTTGGACTGACCAATCGTTT AAATGCGACTTCTAATTACTCAGAATATTCAATCAGTGTCGCTCTTTTAGCAGATAGTGAGATCGAAAATCTTACG CAACTGACGAGTGTGACAGCACCGACTGGGACTAATAATGAAAATATTCAGAAATTACTAGCTGATATCAAGTCA AGTCAGAATACCGATTTGACGGTCAACCAGAGTTCGTCTTACTTGGCAGCTTACAAGAGTTTGATTGCAGGGGAG ACTAAGGCCATTGTCCTAAATAGTGTCTTTGAAAACATCATCGAGTCAGAGTATCCAGACTACGCATCGAAGATA AAAAAGATTTATACTAAGGGATTCACTAAAAAAGTAGAAGCTCCTAAGACGTCTAAGAGTCAGTCTTTCAATATC TATGTTAGTGGAATTGACACCTATGGTCCTATTAGTTCGGTGTCGCGATCAGATGTCAACATCCTGATGACTGTCA ATCGAGATACCAAGAAAATCCTCTTGACCACAACGCCACGTGATGCCTATGTACCAATCGCAGATGGTGGAAATA AGTGGATATCAATTACTATGTGCGATTGAACTTCACTTCGTTTTTGAAATTGATTGATTTGTTGGGTGGAATTGATG TTTATAATGATCAAGAATTTACTGCCCATACGAATGGAAAGTATTACCCTGCAGGCAATGTTCATCTTGATTCAGA ACAGGCTCTCGGTTTTGTTCGTGAGCGCTACTCCCTAGCAGATGGCGATCGTGACCGCGGGGCGCCATCAACAAAA GGTGATTGTGGCTATCCTTCAAAAATTAACGTCAACCGAAGTGCTGAAAAATTATAGTACGATCATTAATAGCTTG CAAGATTCTATCCAAACAAATATGCCACTTGAGACCATGATAAATTTGGTCAATGCTCAGTTAGAAAGTGGAGGG AATTATAAAGTAAATTCTCAAGATTTAAAAGGGACAGGTCGATGGATCTTCCTTCTTATGCAATGCCAGACAGTA AATTATAAAGTAAATTCTCAAGATTTAAAAGGGACAGGTCGGATGGATCTTCCTTCTTATGCAATGCCAGACAGTA ACCTCTATGTGATGGAAATAGATGATAGTAGTTTAGCTGTACTTAAAGCAGCTATACAGGATGTGATGGAAGGGTA GATGA MSRRFKKSRSQKVKRSVNIVLLTIYLLLVCFLLFLIFKYNILAFRYLNLVVTALVLLVALVGLLLIIYKKAEKFTIFLLVFS ILVSSVSFAVQQFVGLTNRLNATSNYSEYSISVAVLADSEIENVTQLTSTAPTGTNNENIQKLLADIKSSQNTDLTVNQ SSSYLAAYKSLIAGETKAIVLNSVFENIIESEYPDYASKIKKIYTKGFTKKVEAPKTSKSQSFNIYVSGIDTYGPISSVSRSDV NILMTVNRDTKKILLTTTPRDAYVPIADGGNNQKDKLTHAGIYGVDSSIHTLENLYGVDINYYVRLNFTSFLKLIDLLGG IDVYNDQEFTAHTNGKYYPAGNVHLDSEQALGFVRERYSLADGDRDRGRHQQKVIVAILQKLTSTEVLKNYSTIINLQ DSIQTNMPLETMINLVNAQLESGGNYKVNSQDLKGTGRMDLPSYAMPDSNLYVMEIDDSSLAVVKAAIQDVMEGRZ ID55 732 bp ATGATAGACATCCATTCGCATATCGTTTTTGATGTACATGACGGTCCCAAGTCAAGAGAGGAAAGCAAGGCTCTC TTGGCAGAATCCTACAGACAGGGGGTGCGAACCATTGTTTTCTACCTCTCACCGTCGCAAGGGCATGTTTGAAACTC CGGAAGAGAAGATAGCAGAAAACTTTCTTCAGGTTCGGGAAATAGCTAAGGAAGTGGCGAGTGACTTGGTCATTG CTTACGGGGCTGAAATTTATTACACACCAGATGTTCTGGATAAGCTGGAAAAAAAGCGGATTCCGACCCTCAATG ATAGTCGTTATGCCTTGATAGAGTTTAGTATGAACACTCCTTATCGCGATATTCATAGCGCCTTGAGCAAGATCTT GATGTTGGGAATTACTCCAGTCATTGCCCACATTGAGCGCTATGATGCTCTTGAAAATAATGAAAAACGCGTTCGA GAACTGATCGATATGGGCTGTTACACGCAAGTAAATAGTTCACATGTCCTCAAACCCAAACTTTTTGGCGAACGTT ATAAATTCATGAAAAAAAGAGCTCAGTATTTTTTTAGAGAGGATTTGGTTCATGTCATTGCAAGTGATATGCACAA TCTAGACGGTAGACCTCCTCATATGGCAGAAGCATATGACCTTGTTACCCAAAAATACGGAGAAGCGAAGGCTCA GGAACTTTTTATAGACAATCCTCGAAAAATTGTAATGGATCAACTAATTTAG MIDIHSHIVFDVDDGPKSREESKALLAESYRQGVRTIVSTSHRRKGMFETPEEKIAENFLQVREIAKEVASDLVIAYGAEI YYTPDVLDKLEKKRIPTLNDSRYALIEFSMNTPYRDIHSALSKILMLGITPVIAHIERYDALENNEKRVELIDMGCYTQV NSSHVLKPKLFGERYKFMKKRAQYFLEQDLVHVIASDMHNLDGRPPHMAEAYDLVTOKYGEAKAOELFIDNPRKIVM DQLIZ ID58 3990 bp TTGATTTATATAATCGCTATCAATATAACAATGCAATCAGGAGGTTTTGCAATGAAACATGAAAAACAACAGCGT TTTTCTATTCGTAAATACGCTGTAGGAGCAGCTTCTGTTCTAATTGGATTTGCCTTCCAAGCACAGACTGTTGCAG CCGATGGAGTTACTCCTACTACTACAGAAAACCAACCGACCATCCATACGGTTTCTGATTCCCCTCAATCATCCGA AAATCGGACTGAGGAAACACCTAAAGCAGTGCTTCAACCAGAAGCTCCAAAAACTGTAGAAACAGAAACTCCAG CTACTGATAAGGTAGCTAGTCTTCCAAAAACAGAAGAAAAACCACAAGAGGAAGTTAGTTCAACTCCTAGTGATA AAGCAGAAGTGGTAACTCCAACTTCTGCTGAAAAAGAAACTGCTAATAAAAAGGCAAGAAGAAGCTAGCCCTAAA AAGGAAGAAGCGAAAGAGGTTGATTCTAAAGAGTCAAATACAGACAAGACTGACAAGGATAAACCAGCTAAAAA AGATGAAGCGAAAGCAGAGGCTGACAAACCGGCAACAGAGGCAGGAAAGGAACGTGCTGCAACTGTAAATGAAA AACTAGCGAAAAAGAAAATTGTTTCTATTGATGCTGGACGTAAATATTTCTCACCAGAACAGCTCAAGGAAATCA TCGATAAAGCGAAACATTATGGCTACACTGATTTACACCTATTAGTCGGAAATGATGGACTCCGTTTCATGTTGGA CGATATGAGCATCACAGCTAACGGCAAGACCTATGCCAGTGACGATGTCAAACGCGCCATTGAAAAAGGTACAAA TGATTATTACAACGATCCAAACGGCAATCACTTAACAGAAAGTCAAATGACAGATCTGATTAACTATGCCAAAGA TAAAGGTATCGGTCTCATTCCGACAGTAAATAGTCCTGGACACATGGATGCGATTCTCAATGCCATGAAAGAATT GGGAATCCAAAACCCTAACTTTAGCTATTTTGGGAAGAAATCAGCCCGTACTGTCGATCTTGACAACGAACAAGC TGTCGCTTTTACAAAAGCCCTTATCGACAAGTATGCTGCTTATTTCGCGAAAAAGACTGAAATCTTCAACATCGGA CTTGATGAATATGCCAATGATGCGACAGATGCTAAAGGTTGGAGTGTGCTTCAAGCTGATAAATACTATCCAAAC GAAGGCTACCCTGTAAAAGGCTATGAAAAATTTATTGCCTACGCCAATGACCTCGCTCGTATTGTAAAATCGCAC GGTCTCAAACCAATGGCTTTTAACGACGGTATCTACTACAATAGCGACACAAGCTTTGGTAGTTTTGACAAAGAC ATCATCGTTTCTATGTGGACTGGTGGTTGGGGAGGCTACGATGTCGCTTCTTCTAAACTACTAGCTGAAAAAGGTC ACCAAATCCTTAATACCAATGATGCTTGGTACTACGTTCTTGGACGAAACGCTGATGGCCAAGGCTGGTACAATCT CGATCAGGGGCTCAATGGTATTAAAAACACACCAATCACTTCTGTACCAAAAACAGAAGGAGCTGATATCCCAAT CATCGGTGGTATGGTAGCTGCTTGGGCTGACACTCCATCTGCACGTTATTCACCATCACGCCTCTTCAAACTCATG CGTCATTTTTGCAAATGCCAACGCTGAATACTTCGCAGCTGATTATGAATCTGCAGAGCAAGCACTTAACGAGGTA CCAAAAGACCTGAACCGTTATACTGCAGAAAGCGTCACGGCCGTAAAAGAAGCTGAAAAAGCTATTCGCTCTCTC GATAGCAACCTTAGCCGTGCCCAACAAGATACGATTGATCAAGCCATTGCTAAACTTCAAGAAACTGTCAACAAC TTGACCCTCACGCCTGAAGCTCAAAAAGAAGAAGAAGCTAAACGTGAGGTTGAAAAACTTGCCAAAAACAAGGT AATCTCAATCGATGCTGGACGCAAATACTTTACTCTGAACCAGCTCAAACGCATCGTAGACAAGGCCAGTGAGCT CGGATATTCTGATGTCCATCTCCTTCTAGGAAATGACGGACTTCGCTTTCTACTCGATGATATGACCATTACTGCC AACGGAAAAACCTATGCTAGTGATGACGTTAAAAAAGCTATTATCGAAGGAACTAAAGCTTACTACGACGATCCA AACGGTACTGCACTAACACCGGCAGAAGTAACAGAGCTAATTGAATACGCTAAATCTAAGGACATCGGTCTCATC CCAGCTATTAACAGTCCAGGTCACATGGATGCTATGCTGGTTGCCATGGAAAAATTAGGTATTAAAAATCCTCAA GCCCACTTTGATAAAGTTTCAAAAACAACTATGGACTTGAAAAACGAAGAAGCGATGAACTTTGTAAAAAGCCCTC ATCGGTAAATACATGGACTTCTTTGCAGGTAAAACAAAGATTTTCAACTTTGGTACTGACGAATACGCCAACGAT GCGACTAGTGCCCAAGGCTGGTACTACCTCAAGTGGTATCAACTCTATGGCAAATTTGCCGAATATGCCAACACC CTCCAGCTATGGCCAAAGAAAGAGGGCTTCAACCAATGGCCTTCAACGATGGCTTCTACTATGAAGACAAGGAC GATGTTCAGTTTGACAAAGATGTCTTGATTTCTTACTGGTCTAAAGGCTGGTGGGGATATAACCTCGCATCACCTC AATACCTAGCAAGCAAAGGCTATAAATTCTTGAATACCAACGGTGACTGGTACTACATTCTTGGTCAAAAACCAG AAGATGGTGGTGGTTTCCTCAAGAAAGCTATTGAGAATACTGGAAAAACACCATTCAATCAACTAGCTTCTACCA AATATCCTGAAGTAGATCTTCCAACAGTCGGAAGTATGCTTTCAATCTGGGCAGATAGACCAAGCGCTGAATACA AGGAAGAGGAAATCTTTGAACTCATGACTGCCTTTGCAGACCACAACAAAGACTACTTTCGTGCTAATTATAATG CTCTCCGCGAAGAATTAGCTAAAATTCCTACAAACTTAGAAGGATATAGTAAAGAAAGTCTTGAGGCCCTTGACG CAGCTAAAACAGCTCTAAATTACAACCTCAACCGTAATAAACAAGCTGAGCTTGACACGCTTGTAGCCAACCTAA AAGCCGCTCTTCAAGGCCTCAAACCAGCTGTAACTCATTCAGGAAGCCTCGATGAAAATGAAGTGGCTCGGAATG TTGAAACCAGACCAGAACTCATCACAAGAACTGAAGAAATTCCATTTCAAGTTATCAAGAAAGAAAATCCTAACC TCCCAGCCGGTCAGGAAAATATTATCACAGCAGGAGTCAAAGGTGAACGAACTCATTACATCTCTGTACTCACTG AAAATGGAAAAACAACAGAAACAGTCCTTCATAGCCAGGTAACCAAAGAAGTTATAAACCAAGTGGTTGAAGTT GGCGCTCCTGTAACTCAGAAGGGTGATGAAAGTGGTCTTGCACCAACTACTGAGGTAAAACCTAGACTGGATATC CAAGAAGAAGAAATTCCATTTACCACAGTGACTTGTGAAAATCCACTCTTACTCAAAGGAAAAACACAAGTCATT ACTAAGGGCGTCAATGGACATCGTAGCAACTTCTACTCTGTGAGCACTTCTGCCGATGGTAAGGAAGTGAAAACA CTTGTAAATAGTGTCGTAGCACAGGAAGCCGTTACTCAAATAGTCGAAGTCGGAACTATGGTAACACATGTAGGC GATGAAAACGGACAAGCCGCTATTGCTGAAGAAAAACCAAAACTAGAAATCCCAAGCCAACCAGCTCCATCAAC TGCTCCTGCTGAGGAAAGCAAAGTTCTTCCTCAAGATCCAGCTCCTGTGGTAACAGAGAAAAAACTTCCTGAAAC AGGAACTCACGATTCTGCAGGACTAGTAGTCGCAGGACTCATGTCCACACTAGCAGCCTATGGACTCACTAAAAG AAAAGAAGACTAA MIYILAINITMQSGGFAMKHEKQQRFSIRKYAVGAASVLIGFAFQAQTVAADGVTPTTTENQPTIHTVSDSPQSSENRTEE TPKAVLQPEAPKTVETETPATDKVASLPKTEEKPQEEVSSTPSDKAEVVTPTSAEKETANKKAEEASPKKEEAKEVDSKE SNTDKTDKDKPAKKDEAKAEADKPATEAGKERAATVNEKLAKKKIVSIDAGRKYFSPEQLKEIIDKAKHYGYTDLHLL VGNDGLRFMLDDMSITANGKTYASDDVKRAIEKGTNDYYNDPNGNHLTESQMTDLINYAKDKGIGLIPTVNSPGHMD AILNAMKELGIQNPNFSYFGKKSARTVDLDNEQAVAFTKALIDKYAAYFAKKTEIFNIGLDEYANDATDAKGWSVLQA DKYYPNEGYPVKGYEKFIAYANDLARIVKSHGLKPMAFNDGIYYNSDTSFGSFDKDIIVSMWTGGWGGYDVASSKLLA EKGHQILNTNDAWYYVLGRNADGQGWYNLDQGLNGIKNTPITSVPKTEGADIPIIGGMVAAWADTPSARYSPSRLFKL MRHFANANAEYFADADYESAEQALNEVPKDLNRYTAESVTAVKEAEKAIRSLDSNLSRAQQDTIDQAIAKLQETVNNLT LTPEAQKEEEAKREVEKLAKNKVISIDAGRKYFTLNQLKRIVDKASELGYSDVHLLLGNDGLRFLLDDMTTTANGKTYA SDDVKKAIIEGTKAYYDDPNGTALTQAEVTELIEYAKSKDIGLIPAINSPGHMDAMLVAMEKLGIKNPQAHFDKVSKTT MDLKNEEAMNFVKALIGKYMDFFAGKTKIFNFGTDEYANDATSAQGWYYLKWYQLYGKFAEYANTLAAMAKERGL QPMAFNDGFYYEDKDDVQFDKDVLISYWSKGWWGYNLASPQYLASKGYKFLNTNGDWYYILGQKPEDGGGFLKKAI ENTGKTPFNQLASTKYPEVDLPTVGSMLSIWADRPSAEYKEEEIFELMTAFADHNKDYFRANYNALREELAKIPTNLEG YSKESLEALDAAKTALNYNLNRNKQAELDTLVANLKAALQGLKPAVTHSGSLDENEVAANVETRPELITRTEEIPFEVI KKENPNLPAGQENIITAGVKGERTHYISVLTENGKTTETVLDSQVTKEVINQVVEVGAPVTHKGDESGLAPTTEVKPRL DIQEEEIPFTTVTCENPLLLKGKTQVITKGVNGHRSNFYSVSTSADGKEVKTLVNSVVAQEAVTQIVEVGTMVTHVGDE NGQAAIAEEKPKLEIPSQPAPSTAPAEESKVLPQDPAPVVTEKKLPETGTHDSAGLVVAGLMSTLAAYGLKRKEDZ ID122 825 bp ATGAACAAAAAAACAAGACAGACACTAATCGGACTGCTAGTGTTATTGTTTTGTCTACAGGGAGCTATTATATC AAGCAGATGCCGTCGGCACCTAATAGTCCCAAAACCAATCTTAGTCAGAAAAAACAAGCGTCTGAAGCTCCTAGT CAAGCATTGGCAGAGAGTGTCTTAACAGACGCAGTCAAGAGTCAAATAAAAAAAAGTCTGGAGTGGAATGGCTC AGGTGCTTTTATCGTCAATGGTAATAAAACAAATCTAGATGCCAAGGTTTCAAGTAAGCCCTACGCTGACAATAA AACAAAGACAGTGGGCAAGGAAACTGTTCCAACCGTAGCTAATGCCCTCTTGTCTAAGGCCACTCGTCAGTACAA GAATCGTAAAGAAACTGGGAATGGTTCAACTTCTTGGACTCCTCCAGGTTGGCATCAGGTCAAGAATCTAAAGGG CTCTTACCCATGCAGTCGATAGAGGTCATTTGTTAGGCTATGCCTTAATCGGTGGTTTGGATGGTTTTGATGCCT CAACAAGCAATCCTAAAAACATTGCTGTTCAGACAGCCTGGGCAAATCAGGCACAAGCCGAGTATTCGACTGGTC AAAACTACTATGAAAGCAAGGTGCGTAAAGCCTTGGACCAAAACAAGCGTGTCCGTTACCGTGTAACCCTTTACT ACGCTTCAAACGAGGATTTAGTTCCCTCAGCTTCACAGATTGAAGCCAAGTCTTCGGATGGAGAATTGGAATTCA ATGTTCTAGTTCCCAATGTTCAAAAGGGACTTCAACTGGATTACCGAACTGGAGAAGTAACTGTAACTCAGTAA MNKKTRQTLIGLLVLLLLSTGSYYIKQMPSAPNSPKTNLSQKKQASEAPSQALAESVLTDAVKSQIKGSLEWNGSGAFIV NGNKTNLDAKVSSKPYADNKTKTVGKETVPTVANALLSKATRQYKNRKETGNGSTSWTPPGWHQVKNLKGSYTHAV DRGHLLGYALIGGLDGFASTSNPKNIAVQTAWANQAQAEYSTGQNYYESKVRKALDQNKRVRYRVTLYYASNEDL VPSASQIEAKSSDGELEFNVLVPNVQKGLQDYRTGEVTVTQZ ID123 225 bp GTGCTAAGATTCAGCGGATTGAGGCAAGTGATGAAGATGAATAAGAAATCAAGCTACGTAGTCAAGCGTTTACTT TTAGTCATCATAGTACTGATTTTAGGTACTCTGGCTCTAGGAATCGGTTTAATGGTAGGTTATGGAATCTTGGGCA AGGGTCAAGATCCATGGGCTATCCTGTCTCCAGCAAAATGGCAGGAATTGATTCATAAATTTACAGGAAATTAG VLRFSGLRQVMKMKKSSYVVKRLLLVIIVLILGTLALGIGLMVGTYGILGKGQDPQAILSPAKWQELIHKFTGNZ
[0164]
Claims
1. A Streptococcus pneumoniae protein or polypeptide having a sequence selected from those shown in table 1.
2. A Streptococcus pneumoniae protein or polypeptide having a sequence selected from those shown in table 2.
3. A protein or polypeptide as claimed in claim 1 or claim 2 provided in substantially pure form.
4. A protein or polypeptide which is substantially identical to one defined in any one of claims 1 to 3.
5. A homologue or derivative of a protein or polypeptide as defined in any one of claims 1 to 4.
6. An antigenic and/or immunogenic fragment of a protein or polypeptide as defined in Tables 1-3.
7. A nucleic acid molecule comprising or consisting of a sequence which is:
- (i) any of the DNA sequences set out in Table 1 or their RNA equivalents;
- (ii) a sequence which is complementary to any of the sequences of (i);
- (iii) a sequence which codes for the same protein or polypeptide, as those sequences of (i) or (ii);
- (iv) a sequence which is substantially identical with any of those of (i), (ii) and (iii);
- (v) a sequence which codes for a homologue, derivative or fragment of a protein as defined in Table 1.
8. A nucleic acid molecule comprising or consisting of a sequence which is:
- (i) any of the DNA sequences set out in Table 2 or their RNA equivalents;
- (ii) a sequence which is complementary to any of the sequences of (i);
- (iii) a sequence which codes for the same protein or polypeptide, as those sequences of (i) or (ii);
- (iv) a sequence which is substantially identical with any of those of (i), (ii) and (iii);
- (v) a sequence which codes for a homologue, derivative or fragment of a protein as defined in Table 2.
9. The use of a protein or polypeptide having a sequence selected from those shown in Tables 1-3, or homologues, derivatives and/or fragments thereof, as an immunogen and/or antigen.
10. An immunogenic and/or antigenic composition comprising one or more proteins or polypeptides selected from those whose sequences are shown in Tables 1-3, or homologues or derivatives thereof, and/or fragments of any of these.
11. An immunogenic and/or antigenic composition as claimed in claim 10 which is a vaccine or is for use in a diagnostic assay.
12. A vaccine as claimed in claim 11 which comprises one or more additional components selected from excipients, diluents, adjuvants or the like.
13. A vaccine composition comprising one or more nucleic acid sequences as defined in Tables 1-3.
14. A method for the detection/diagnosis of S. pneumoniae which comprises the step of bringing into contact a sample to be tested with at least one protein or polypeptide as defined in Tables 1-3, or homologue, derivative or fragment thereof.
15. An antibody capable of binding to a protein or polypeptide as defined in Tables 1-3, or for a homologue, derivative or fragment thereof.
16. An antibody as defined in claim 15 which is a monoclonal antibody.
17. A method for the detection/diagnosis of S. pneumoniae which comprises the step of bringing into contact a sample to be tested and at least one antibody as define din claim 15 or claim 16.
18. A method for the detection/diagnosis of S. pneumoniae which comprises the step of bringing into contact a sample to be tested with at least one nucleic acid sequence as defined in claim 7 or claim 8.
19. A method of determining whether a protein or polypeptide as defined in Tables 1-3 represents a potential anti-microbial target which comprises inactivating said protein or polypeptide and determining whether S. pneumoniae is still viable.
20. The use of an agent capable of antagonising, inhibiting or otherwise interfering with the function or expression of a protein or polypeptide as defined in Tables 1-3 in the manufacture of a medicament for use in-the treatment or prophylaxis of S. pneumoniae infection
Type: Application
Filed: Jan 26, 2001
Publication Date: Jul 17, 2003
Inventors: Richard William Falla Le Page (London), Jeremy Mark Wells (Norwich), Sean Bosco Hanniffy (Cambridge), Philip Michael Hansbro (Newcastle)
Application Number: 09769744
International Classification: C07H021/02; C07H021/04; C12N001/20; C07K001/00; C07K014/00; C07K017/00;