METHOD TO PREDICT THE PATTERN OF LOCOMOTION IN HORSES
The present invention provides methods for predicting the pattern of locomotion in a horse including the ability of a horse to use different gaits and the ability to trot at a fast speed. The methods comprise determining in a sample of DNA obtained from a horse the presence or absence of at least one genetic marker, wherein said at least one genetic marker is located on horse chromosome 23, said marker being associated with the ability to use different gaits. The invention further provides primers that amplify markers being associated with the ability to use different gaits and hybridization probes to detect markers being associated with the ability to use different gaits and the ability to trot at a fast speed.
Latest CAPILET GENETICS AB Patents:
The present invention relates to methods for predicting the pattern of locomotion in horses including the ability of a horse to use different gaits and the ability to trot or pace at a fast speed. The methods comprise determining in a sample of DNA obtained from a horse the allele of at least one genetic marker, wherein said at least one genetic marker is located on horse chromosome 23, said marker being associated with the ability to use different gaits.
BACKGROUNDHorses show a considerable variation in their pattern of locomotion both within and between breeds. The three basic gaits in horses are walk, trot and gallop. The horses use these different gaits according to their speed, walk is used at slow speed, trot is a faster mode of locomotion and gallop is the gait horses normally use to run fast. However, some horses have the ability to also use alternative gaits, for example pace and toelt, and such horses are called gaited horses. A horse that pace moves the two legs on the same side in a lateral movement in contrast to a trotting horse that makes a diagonal movement where the diagonal front and hind legs move forward and backwards together. Furthermore, Icelandic horses are able to perform a fifth gait named toelt, which is a four beet gait with the same foot fall pattern as the walk. A characteristic feature of toelt is that the horse then always has at least one hoof touching the ground, giving a very smooth gait. Examples of other similar alternative gaits, also known as ambling gaits, are fox trot, the rack, running walk and paso cort. The alternative gaits vary in footfall pattern, timing, and cadence, and can be generally divided into four categories: pace, regular rhythm ambling, lateral ambling and diagonal ambling. Table 1 provides a classification of breeds as gaited or non-gaited horses. Most horse breeds are in fact non-gaited and only representative examples of such breeds are listed in the table. Horses representing breeds classified as non-gaited never or rarely are able to perform the alternative gaits whereas most or all horses from the gaited breeds can perform alternative gaits. There are more gaited breeds worldwide in addition to the ones listed in table 1. Sometimes, there is a considerable variation also within breeds as regards the pattern of locomotion. For instance, Icelandic horses are classified as four-gaited or five-gaited, where the former can perform walk, trot, gallop and toelt whereas the latter can also pace.
The Standardbred horse, used for harness racing has a unique ability to trot or pace at a very fast speed without falling into gallop which is the normal gait at high speed for a horse. In North America, a subpopulation of Standardbred horses that pace at very high speed has been developed. Other horse breeds used for harness racing includes breeds like the Cold-blooded trotter, Finnhorses, the Frensch trotter and the Orlove trotter.
The pattern of locomotion in horses is under strong selection in horse breeding. For instance, the ability to race using gallop, trot and pace are selected in Thoroughbred horses, Standardbred trotters and Standardbred pacers, respectively. Horses with the ability to use alternative gaits are also highly desired by some riders and is a trait upon which many specialized breeds have been developed. Methods for predicting the pattern of locomotion in a horse, i.e. its ability to use different gaits, would therefore have a great utility in the horse breeding industry.
BRIEF DESCRIPTION OF INVENTIONThe present inventors have identified a genetic locus in horses that determines the horse's ability to use different gaits and the ability to trot at a fast speed. A premature stop codon in the gene for the doublesex and mab-3 related transcription factor 3 (DMRT3) was found in all tested horses with the ability to perform alternative gaits. Mutant horses express a truncated DMRT3 protein which lacks the last 174 amino acid residues but maintains a functional DNA-binding domain. DMRT3 is expressed in a subset of neurons in the spinal cord of the horse.
Accordingly the present invention provides methods for predicting the pattern of locomotion in horses including the ability of a horse to use different gaits, the ability to trot or pace at a fast speed, and the ability to perform in dressage.
A first aspect of the invention provides methods for predicting the pattern of locomotion in horses including the ability of a horse to use alternative gaits, the ability to trot at a fast speed, and the ability to perform in dressage which comprise extracting protein from a sample obtained from a horse. The methods further comprise determining in said protein sample the presence or absence of a truncated form of the DMRT3 protein. The DMRT3 protein can be a DMRT3 protein truncated at amino acid position 300 corresponding to the protein SEQ ID NO: 4. The determination can be made by use of an immunochemical method, such as Western blot, using an anti DMRT3 antibody.
A second aspect of the invention provides methods for predicting the pattern of locomotion in horses including the ability of a horse to use alternative gaits, the ability to trot at a fast speed, and the ability to perform in dressage which comprise extracting DNA from a sample obtained from a horse. The methods further comprise determining in said DNA the allele of at least one genetic marker, wherein said at least one genetic marker is located in the region between the flanking SNPs at nucleotide positions 22,628,976 (corresponding to position 51 in SEQ ID NO: 6) and 23,315,071 (corresponding to position 51 in SEQ ID NO: 7) on horse chromosome 23.
The genetic marker can be selected from single nucleotide polymorphisms (SNPs) and insertion/deletions (INDELs).
Preferably, the genetic marker is selected from the genetic markers listed in Tables 4, 5, 7 and 8.
Preferably the genetic marker is located in the region between the flanking SNPs at nucleotide positions 22,919,878 and 23,011,289 on horse chromosome 23.
Preferably, the genetic marker is selected from the genetic markers listed in Table 8.
Most preferably the genetic marker is located at position 22,999,655 on horse chromosome 23, corresponding to position 939 in SEQ ID NO:1.
More specifically, the methods can comprise identifying in said DNA the nucleotide in one or more specific position(s) selected from the positions 22,919,878; 22,920,361; 22,920,434; 22,920,646; 22,920,717; 22,921,203; 22,922,079; 22,922,780; 22,923,569; 22,924,120; 22,924,142; 22,924,299; 22,924,380; 22,924,407; 22,926,098; 22,926,188; 22,926,872; 22,927,387; 22,927,607; 22,928,220; 22,928,537; 22,928,587; 22,929,137; 22,930,011; 22,932,024; 22,932,895; 22,933,218; 22,936,034; 22,940,759; 22,942,423; 22,945,643; 22,946,599; 22,948,774; 22,949,055; 22,949,108; 22,949,240; 22,949,710; 22,956,846; 22,960,132; 22,960,528; 22,960,710; 22,964,042; 22,965,059; 22,967,119; 22,967,656; 22,967,915; 22,968,898; 22,973,984; 22,974,589; 22,979,124; 22,980,014; 22,982,879; 22,984,588; 22,985,746; 22,988,210; 22,988,991; 22,993,092; 22,994,591; 22,999,058; 22,999,655; 23,002,606; 23,003,956; 23,008,772; 23,008,789; 23,009,648; 23,010,164; and 23,011,289, on horse chromosome 23.
Most preferably the methods comprise identifying in said DNA the nucleotide in the specific position 22,999,655 on horse chromosome 23.
More specifically, the methods can comprise determining in said DNA the presence or absence of:
-
- i) the nucleotide C in a nucleotide position corresponding to position 939 in SEQ ID NO: 1,
- ii) the nucleotide A in a nucleotide position corresponding to position 939 in SEQ ID NO: 3,
- iii) the nucleotide C and/or T in a nucleotide position corresponding to position 51 in SEQ ID NO: 5,
- iv) the nucleotide A and/or G in a nucleotide position corresponding to position 51 in SEQ ID NO: 6,
- v) the nucleotide C and/or T in a nucleotide position corresponding to position 51 in SEQ ID NO: 7,
- vi) the nucleotide G and/or C in a nucleotide position corresponding to position 51 in SEQ ID NO: 8,
- vii) the nucleotide A and/or G in a nucleotide position corresponding to position 51 in SEQ ID NO: 9,
- viii) the nucleotide T and/or G in a nucleotide position corresponding to position 51 in SEQ ID NO: 10,
- ix) the nucleotide T and/or C in a nucleotide position corresponding to position 51 in SEQ ID NO: 11,
- x) the nucleotide C and/or T in a nucleotide position corresponding to position 51 in SEQ ID NO: 12,
- xi) the nucleotide A and/or G in a nucleotide position corresponding to position 51 in SEQ ID NO: 13,
- xii) the nucleotide A and/or C in a nucleotide position corresponding to position 51 in SEQ ID NO: 14
- xiii) the nucleotide G and/or C in a nucleotide position corresponding to position 51 in SEQ ID NO: 15,
- xiv) the nucleotide C and/or T in a nucleotide position corresponding to position 51 in SEQ ID NO: 16,
- xv) the nucleotide G and/or A in a nucleotide position corresponding to position 51 in SEQ ID NO: 17,
- xvi) the nucleotide G and/or C in a nucleotide position corresponding to position 51 in SEQ ID NO: 18,
- xvii) the nucleotide C and/or A in a nucleotide position corresponding to position 51 in SEQ ID NO: 19,
- xviii) the nucleotide T and/or C in a nucleotide position corresponding to position 51 in SEQ ID NO: 20,
- xix) the nucleotide C and/or T in a nucleotide position corresponding to position 51 in SEQ ID NO: 21,
- xx) the nucleotide C and/or T in a nucleotide position corresponding to position 51 in SEQ ID NO: 22,
- xxi) the nucleotide C and/or A in a nucleotide position corresponding to position 51 in SEQ ID NO: 23,
- xxii) the nucleotide C and/or G in a nucleotide position corresponding to position 51 in SEQ ID NO: 24,
- xxiii) the nucleotide A and/or T in a nucleotide position corresponding to position 51 in SEQ ID NO: 25,
Preferably the methods comprise determining in said DNA the presence or absence of:
-
- i) the nucleotide C in a nucleotide position corresponding to position 939 in SEQ ID NO: 1,
- ii) the nucleotide A in a nucleotide position corresponding to position 939 in SEQ ID NO: 3,
- iii) the nucleotide C and/or T in a nucleotide position corresponding to position 51 in SEQ ID NO: 5,
- iv) the nucleotide C and/or T in a nucleotide position corresponding to position 51 in SEQ ID NO: 7,
- v) the nucleotide T and/or C in a nucleotide position corresponding to position 51 in SEQ ID NO: 20,
- vi) the nucleotide C and/or T in a nucleotide position corresponding to position 51 in SEQ ID NO: 21,
- vii) the nucleotide C and/or T in a nucleotide position corresponding to position 51 in SEQ ID NO: 22,
- viii) the nucleotide C and/or A in a nucleotide position corresponding to position 51 in SEQ ID NO: 23,
- ix) the nucleotide C and/or G in a nucleotide position corresponding to position 51 in SEQ ID NO: 24,
- x) the nucleotide A and/or T in a nucleotide position corresponding to position 51 in SEQ ID NO: 25,
Most preferably the methods comprise determining in said DNA the presence or absence of:
-
- i) the nucleotide C in a nucleotide position corresponding to position 939 in SEQ ID NO: 1 or
- ii) the nucleotide A in a nucleotide position corresponding to position 939 in SEQ ID NO: 3.
The horse can be selected from any horse or breed of horses belonging to the species Equus caballus. Examples of horse breeds can be found in Table 1.
According to one aspect of the invention the methods according to the present invention can be used for paternity testing of horses.
It is contemplated that any method or composition described herein can be implemented with respect to any other method or composition described herein. The use of the word “a” or “an” when used in conjunction with the term “comprising” in the claims and/or the specification may mean “one,” but it is also consistent with the meaning of “one or more,” “at least one,” and “one or more than one.” These, and other, embodiments of the invention will be better appreciated and understood when considered in conjunction with the following description and the accompanying drawings. It should be understood, however, that the following description, while indicating various embodiments of the invention and numerous specific details thereof, is given by way of illustration and not of limitation. Many substitutions, modifications, additions and/or rearrangements may be made within the scope of the invention without departing from the spirit thereof, and the invention includes all such substitutions, modifications, additions and/or rearrangements.
The following drawings form part of the present specification and are included to further demonstrate certain aspects of the present invention. The invention may be better understood by reference to one or more of these drawings in combination with the detailed description of specific embodiments presented herein.
Super-shifts were demonstrated using an anti-myc antibody (that recognizes both forms) or with an anti-DMRT3 antibody that recognizes the C-terminal part of the wild-type protein, but not the truncated form. An oligonucleotide corresponding to a DMRT1-binding site was also used and gave similar results (data not shown). The cold competing oligonucleotide was added in 150× excess. GS=gel-shift representing complex between DMRT3 protein and oligonucleotide; SS=super-shift representing complex between antibody, DMRT3 protein and oligonucleotide.
DETAILED DESCRIPTION OF THE INVENTIONThe present inventors have demonstrated that there is a locus, here named Gait, on horse chromosome 23 that has a major impact on the pattern of locomotion in horses. The present results show that homozygosity for a recessive allele at this locus is required for the ability of a horse to pace. It is postulated that the nonsense mutation at nucleotide position 22,999,655 in exon 2 of the DMRT3 gene is the causative mutation for the Gait allele. DMRT3 is a highly conserved gene present in all vertebrates studied so far. The function of the DMRT3 protein has not been established by any previous studies but the fact that it is expressed in brain and in the spinal cord of the mouse (MGI, www.informatics.jax.org) is consistent with a critical role for controlling locomotion as demonstrated by the present study. The nonsense mutation underlying the gait allele may very well have a phenotypic effect in the heterozygous condition since it occurs in the last exon of DMRT3 and is expected to encode a truncated form of the protein (SEQ ID NO:4) that lacks the last 174 amino acids (
This study has established a genetic marker that can be used to predict the genetic constitution of a horse as regards its pattern of locomotion. We predict that the gait allele is present in most, if not all, gaited breeds some of which are listed as gaited in Table 1 and it may occur at a low frequency in other breeds as well. The marker also predicts a horse capacity to trot or pace at a high speed as it is found at a high frequency in horses used for harness racing. Further, we predict that horses with at least one wild-type allel are better at showjumping, traditional dressage, and completion racing in gallop.
The pattern of locomotion determines the ability of a horse to use alternative gaits, as well as the horse's ability to trot or pace at a fast speed, and its ability to perform in dressage. Alternative gaits include, pace, and the ambling gaits exemplified by toelt, running walk, rack, classic fino, paso corto, paso largo, paso ilano, sobreandando, fox trot.
A horse being homozygous or heterozygous for the gait allele can be predicted to have the ability to use alternative gaits and to trot at high speed. A horse being homozygous or heterozygote for the wild type allele can be predicted to have better ability to perform in showjumping, dressage, and completion racing in gallop.
The utility of this invention in the horse breeding industry includes the determination of the genotype of potential breeding animals to maximise the chance to obtain a progeny with a favoured pattern of locomotion. The information about the genotype at the DMRT3 locus may also be used by sellers and buyers of horses to predict the ability of the horse to perform different gaits. Furthermore, the methods according to the invention can be used to effectively introgress the gait allele into non-gaited breeds.
According to one aspect of the invention the methods according to the present invention can be used for selecting horses for breeding.
Accordingly, one aspect of the invention provides methods for selecting a horse for breeding, said methods comprising determining in a DNA sample obtained from said horse the allele of at least one genetic marker, wherein said at least one genetic marker is located in the region between the flanking SNPs at nucleotide positions 22,628,976 on horse chromosome 23. The genetic marker can be selected from single nucleotide polymorphisms (SNPs) and insertion/deletions (INDELs).
Preferably, the genetic marker is selected from the genetic markers listed in Tables 4, 5, 7 and 8.
Preferably the genetic marker is located in the region between the flanking SNPs at nucleotide positions 22,919,878 and 23,011,289 on horse chromosome 23.
Preferably, the genetic marker is selected from the genetic markers listed in Table 8.
Most preferably the genetic marker is located at position 22,999,655 on horse chromosome 23, corresponding to position 939 in SEQ ID NO:1.
The most reliable test for determining the genotype at the Gait locus is to determine the presence and/or absence of the nonsense mutation in exon 2 of DMRT3 (nucleotide position 22,999,655 on chromosome 23, corresponding to nucleotide position 939 in SEQ ID NO:3). However, genetic markers located in the interval between the flanking markers at nucleotide positions 22,628,976 and 23,315,071, and more specifically genetic markers located in the interval between positions 22,919,878 and 23,011,289, exemplified by the markers listed in Table 8, show a more or less strong association to the genotype for the causative SNP at nucleotide position 22,999,655 due to the presence of linkage disequilibrium in the region. Accordingly, one or more of these markers, individually or in combination, can be used to determine the genotype at the Gait locus, and can consequently as well be used in the methods according to the present invention.
The term “sample” or “biological sample” according to the present invention refers to any material containing nucleated cells from said horse to be tested. In a preferred embodiment the biological sample to be used in the methods of the present invention is selected from the group consisting of blood, sperm, hair roots, milk, body fluids as well as tissues including nucleated cells.
DNA extraction, isolation and purification methods are well-known in the art and can be applied in the present invention. Standard protocols for the isolation of genomic DNA are inter alia referred to in Sambrook, J., Russell. D. W. Molecular Cloning: A Laboratory Manual, the third edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor. New York, 1.31-1.38, 2001 and Sharma. R. C., et al. “A rapid procedure for isolation of RNA-free genomic DNA from mammalian cells”, BioTechniques, 14. 176-178. 1993.
According to the present invention the term “SNP” refers to a single nucleotide polymorphism at a particular position in the horse genome that varies among a population of individuals. SNPs can be identified by their location within the disclosed particular sequence, i.e. within the interval of 22,628,976 and 23,315,071 base pairs on horse chromosome 23 or their name as shown in Tables 4, 5, 7 and 8. SNPs identified as being useful for predicting the ability of a horse to use different gaits according to the present invention are shown in Tables 4, 5, 7 and 8. For example, the SNP BIEC2-620109 of Table 5 indicates that the nucleotide base (or the allele) at nucleotide position 22,967,656 on chromosome 23 of the reference sequence as referred to herein may be either Cytosine (C) or Thymidine (T). The allele associated with or indicative for a horse able to use five gaits is in the case of SNP BIEC2-620109 of Table 5 Thymidine (T).
The term “determining in said DNA the allele of at least one genetic marker” in accordance with the present invention refers to a method for determining or identifying whether a particular nucleotide sequence is present in a DNA sample.
The term “identifying in said DNA the nucleotide in one or more specific position on the horse chromosome 23” refers to a method for determining the identity of the nucleotide in said specific position on the horse chromosome 23, i.e. to determine whether the nucleotide in said specific position is Adenosine (A), Cytosine (C), Guanosine (G), or Thymidine (T).
There are several methods known by those skilled in the art for determining whether a particular nucleotide sequence is present in a DNA sample and for identifying the nucleotide in a specific position in a DNA sequence. These include the amplification of a DNA segment encompassing the genetic marker by means of the polymerase chain reaction (PCR) or any other amplification method, interrogate the genetic marker by means of allele specific hybridization, the 3′exonuclease assay (Taqman assay), fluorescent dye and quenching agent-based PCR assay, the use of allele-specific restriction enzymes (RFLP-based techniques), direct sequencing, the oligonucleotide ligation assay (OLA), pyrosequencing, the invader assay, minisequencing, DHPLC-based techniques, single strand conformational polymorphism (SSCP), allele-specific PCR, denaturating gradient gel electrophoresis (DGGE), temperature gradient gel electrophoresis (TGGE), chemical mismatch cleavage (CMC), heteroduplex analysis based system, techniques based on mass spectroscopy, invasive cleavage assay, polymorphism ratio sequencing (PRS), microarrays, a rolling circle extension assay, HPLC-based techniques, extension based assays, ARMS (Amplification Refractory Mutation System), ALEX (Amplification Refractory Mutation Linear Extension), SBCE (Single base chain extension), molecular beacon assays, invader (Third wave technologies), ligase chain reaction assays, 5′-nuclease assay-based techniques, hybridization capillary array electrophoresis (CAE), protein truncation assays (PTT), immunoassays, and solid phase hybridization (dot blot, reverse dot blot, chips). This list of methods is not meant to be exclusive, but just to illustrate the diversity of available methods. Some of these methods can be performed in accordance with the methods of the present invention in microarray format (microchips) or on beads.
The invention thus also relates to the use of primers or primer pairs, wherein the primers or primer pairs hybridize(s) under stringent conditions to the DNA comprising the interval between nucleotide positions 22,628,976 and 23,315,071, preferably between positions 22,919,878 and 23,011,289, base pairs on horse chromosome 23, or to the complementary strand thereof.
Preferably the primers or primer pairs hybridize(s) under stringent conditions to the sequences SEQ ID NO: 1, 3 and 5 to 25.
Preferably, the primers of the invention have a length of at least 14 nucleotides such as 17 or 21 nucleotides.
More specifically the primers can be selected from SEQ NO:26, SEQ ID NO:27, SEQ ID NO:30, and SEQ ID NO:31.
In one embodiment, the primers actually binds to the position of the SNPs as referred to in Tables 4, 5, 7 and 8. Such an allele specific oligonucleotide in accordance with the present invention is typically an oligonucleotide of at least 14 to 21 nucleotide bases in length designed to detect a difference of a single base in the target's genetic sequence of the horse to be tested. In accordance with the present invention one or more specific primers can be applied in order to identify more than a single SNP as referred to herein. As a consequence, when binding is performed under stringent conditions, such primer or such primers is/are useful to distinguish between different polymorphic variants as binding only occurs if the sequences of the primer and the target have full complementarily. It is further preferred that the primers have a maximum length of 24 nucleotides. Such primers can be coupled with an appropriate detection method such as an elongation reaction or an amplification reaction, which may be used to differentiate between the polymorphic variants and then draw conclusions with regard to the horse as regards its ability to use different gaits.
Hybridisation is preferably performed under stringent or highly stringent conditions. “Stringent or highly stringent conditions” of hybridization are well known to or can be established by the person skilled in the art according to conventional protocols. Appropriate stringent conditions for each sequence may be established on the basis of well-known parameters such as temperature, composition of the nucleic acid molecules, salt conditions etc.: see, for example, Sambrook et al. “Molecular Cloning, A Laboratory Manual”, CSH Press, Cold Spring Harbor, 1989 or Higgins and Hames (eds.), “Nucleic acid hybridization, a practical approach”, IRL Press, Oxford 1985, see in particular the chapter “Hybridization Strategy” by Britten & Davidson. Typical (highly stringent) conditions comprise hybridization at 65° C. in 0.5×SSC and 0.1% SDS or hybridization at 42° C. in 50% formamide, 4×SSC and 0.1% SDS. Hybridization is usually followed by washing to remove unspecific signals. Washing conditions include conditions such as 65° C., 0.2×SSC and 0.1% SDS or 2×SSC and 0.1% SDS or 0.3×SSC and 0.1% SDS at 25° C.-65° C.
The term “nucleotide positions 22,628,976 and 23,315,071 base pairs on horse chromosome 23” and other similar denoted nucleotide positions refer to the horse reference sequence according to the September 2007 Equus caballus draft assembly EquCab2 (UCSC version equCab2). EquCab2 was produced by The Broad Institute. EquCab2 is available at the www.genome.ucsc.edu genome browser.
ExamplesA genome-wide screen for genes affecting pattern of locomotion using the horse SNP chip comprising assays for 54,602 single nucleotide polymorphisms in the horse genome (Illumina EquineSNP50 BeadChip; http://www.illumina.com/products/equine_snp50_whole_genome_genotyping_kits.ilmn) was performed. A population material comprising 70 Icelandic horses in which 30 were classified as four-gaited and 40 were classified as five-gaited, i.e. only the latter had a documented ability to pace, was used in the assay.
Animal Material.
Blood samples were collected from 70 Icelandic horses from Sweden. Genomic DNA was prepared from all horses using QIAamp DNA Blood Midi Kit (Qiagen). The owners of the horses were asked to classify their horses as four-gaited or five-gaited. Hair samples were collected from 61 Swedish Standardbred horses and 2 North-Swedish Trotter. DNA from six hair roots was extracted by adding 97 μl Chelex solution and 7 μl Proteinas K and incubated in 56° C. for 60 minutes followed by an incubation in 95° C. for 10 minutes.
Genome-Wide Analysis (GWA).
The GWA was performed using the Illumina EquineSNP50 BeadChip (http://www.illumina.com/products/equine_snp50_whole_genome_genotyping_kits.ilmn). The statistical analysis of the data was carried out using the software PLINK (Purcell et al. 2007. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81:559-575).
DNA Sequencing.
A number of coding and non-coding regions located between the flanking SNPs at nucleotide positions 22,628,976 and 23,315,071 base pairs on horse chromosome 23 was PCR amplified and sequenced to identify sequence polymorphisms. All primers used for these experiments are listed in Table 2. The amplicons were amplified with standard PCR conditions and (2720 Thermal Cycler, Applied Biosystems, Foster City, Calif.). Standard Sanger sequencing was performed using an AB3730 capillary sequencer (Applied Biosystems, Foster City, Calif.).
In Depth Genome Resequencing.
DNA samples from two Icelandic horses, one female mutant DMRT3 homozygote and one male control (homozygous wild-type) were prepared for sequencing. Illumina paired-end libraries were generated from these DNA samples (mean insert sizes of approximately 220 bases). The two libraries were sequenced (2×100 bp) on seven and five lanes, respectively, using an Illumina HiSeq instrument. The reads were mapped to the horse genome (EquCab2 reference assembly) using the software BWA, and PCR-duplicates were removed using the software Picard (http://picard.sourceforge.net). The average read depth obtained for each sample was approximately 30×. SNPs and small insertions/deletions were called from the mapping data after subjecting the alignments to realignment around indels and then variant calling using the Genome Analysis Toolkit (GATK). The variant calls were subjected to recommended VariantFiltrationWalker filters for SNPs listed in the GATK wild page (http://www.broadinstitute.org/gsa/wiki/index.php/The_Genome_Analysis_Toolkit) and read alignments overlapping SNP and insertion/deletion calls within the 438 kb Gait locus were then manually reviewed to remove obvious artifact calls. Read depths observed in one kilobase windows were used to call candidate duplications in the minimum IBD region, and mapping distances and orientations between paired reads were used to detect structural variations in relation to the reference assembly. The software ANNOVAR was used to annotate SNPs in relation to Ensembl genes.
SNP Analysis Using TaqMan Assays.
TaqMan assays were designed to screen the SNPs at chromosome 23, nucleotide position 22,967,656 (BIEC2—620109; the SNP included in the Illumina SNP panel showing the strongest association to the phenotype) and at nucleotide position 22,999,655 (DMRT3.3; the SNP causing a premature Stop codon in DMRT3 exon 2). Custom TaqMan SNP Genotyping assays (Applied Biosystems, Foster City, Calif.) designed for these two SNPs are summarized in Table 3. Probe and primer designs were obtained from the Applied Biosystems web page (http://www5.appliedbiosystems.com/tools/cadt/) using the custom genotyping assays order option. The ABI PRISM 7900 HT sequence detection system for 384-well format (Applied Biosystems, Foster City, Calif.) was used for the analysis.
Genome-Wide Analysis Reveals a Locus on Horse Chromosome 23 Controlling the Pattern of Locomotion.
Statistical analysis of the SNP-chip data for the 70 Icelandic horses with a phenotypic classification as four-gaited or five-gaited was carried using PLINK; 39,695 SNPs passed the quality control. A chi-square test was performed for each marker separately in order to test for a significant difference in genotype frequencies between four-gaited versus five-gaited horses. A genetic model assuming a recessive mode of inheritance was used. Ten thousand permutations were used to correct for multiple testing. The statistical analysis revealed a highly significant association between a SNP (BIEC2—620109, SEQ ID NO: 5) at nucleotide position 22,967,656 base pair on horse chromosome 23 and the gait phenotype (P=0.0002, genome-wide significance;
Resequencing of Selected Regions Refine the Localization of the Gait Locus.
A number of amplicons (Table 2) from the genomic region harbouring the Gait locus as defined by the genome-wide screen (from nucleotide position 22,628,976 to position 23,315,071 on chromosome 23) were resequenced in a small set of four-gaited and five-gaited horses in order to refine the localization of the Gait locus. All the sequence polymorphisms detected in this analysis are summarized in Table 4. The results showed that there is a distinct haplotype associated with the recessive gait allele and that the haplotype block showing a complete association to gait in this breed breaks up at nucleotide position 22,877,015 just upstream of the DMRT1 gene. The results refine the localization of the Gait locus to the interval from nucleotide position 22,877,015 base pair to position 23,315,071 base pair; ANKRD15 is located outside the critical interval for Gait.
A Nonsense Mutation Located in Exon 2 of DMRT3 Shows Complete Concordance with the Ability to Pace.
The critical interval for the Gait locus comprises the four genes DMRT1, DMRT2, DMRT3 and GTF2A2. The DMRT genes belong to a family of transcription factors that contains the zinc-finger like DNA binding DM domain (Murphy et al. 2007. Vertebrate DM domain proteins bind similar DNA sequences and can heterodimerize on DNA. BMC Mot. Biol. 8:58). We sequenced most of the DMRT exons in this region and identified a small number of sequence polymorphisms (Table 4). One of these (DMRT3.3), located in exon 2 of DMRT3 at nucleotide position 22,999,655, caused a nonsense mutation in the allele associated with the ability to pace (
TaqMan assays were designed for the polymorphisms at nucleotide positions 22,967,656 (the most significantly associated SNP in the GWA analysis) and at position 22,999,655 (the mutation in DMRT3 creating a premature Stop codon): These were used to screen all 70 Icelandic horses included in this study. Both SNPs showed complete association between homozygosity for the non-reference allele at both, loci and the phenotype (Table 5), the statistical support for an association was overwhelming (P=6.73×10−10 for both SNPs, Fisher's Exact Test). The results imply that there is very strong linkage disequilibrium between these two SNPs in the studied population, the two SNPs are located 32 kilo base pairs apart. Nine animals that were classified as four-gaited were homozygous for the haplotype associated with the gait allele (Table 5). These animals were either misclassified by their owners, which is fully possible, or the Gait genotype shows incomplete penetrance due to interaction with environmental factors (for instance training) or other unknown genetic factors.
We tested 2 North-Swedish Trotters and 61 Swedish Standardbred horses (both used for harness racing in Sweden) to investigate if the gait allele is present in other horse breeds. We found that both the 2 North-Swedish Trotters and 59. Standard bred horses were homozygous for the DMRT3 nonsense mutation at nucleotide position 22,999,655 on horse chromosome 23 while the remaining 2 Standardbred horses were heterozygous A/C. The high frequency of this allele in these breeds strongly suggests that the mutation has a favourable effect on the ability to trot at a fast speed. In deed, the two horses identified as being heterozygous for the gait allele were also considered as being poor trotters. We predict that the gait allele is present at a high frequency in most, if not all, gaited horse breeds as well as horses used for harness racing.
Electrophoretic Mobility Shift Assays (EMSA).
The oligonucleotide 5′-ggatccTCGAGAACAATGTAACAATTTCGCCC-3′ and its complementary sequence were annealed in 10 mM Tris pH 7.5, 1 mM EDTA, 50 mM KCl by firstly heating to 95° C. for 2 min and thereafter cooled to 25° C. (2 min/degree). The duplex was labelled with Klenow DNA polymerase and [α-32P]-dCTP and purified using a Bio-Rad Micro Bio-Spin 30 column. DMRT3 wild type and mutant protein were produced by in vitro-translation using a TNT Quick Coupled Transcription/Translation System (Promega). EMSA was performed as described by Culbertson & Leeds, 2003 (Looking at mRNA decay pathways through the window of molecular evolution. Curr. Opin. Genet. Dev. 13, 207-214) with the following modifications. No plasmid DNA was added and 1.0 μl in vitro-translated protein and 150× cold competitor were used. The reaction mixture was incubated on ice for 20 min before adding the radioactive oligo and thereafter incubated at room temperature for 30 min. Gels were run at 150 V in room temperature. Both full-length wild-type and mutant DMRT3 protein were found to bind a previously defined DMRT-binding motif (
We have presented abundant evidence that the DMRT3_Ser301STOP mutation has a major effect on gaits in horses. Our interpretation of the phenotypic consequences of this mutation is that homozygosity for the mutation is required but not sufficient for pacing, as many Standardbred Trotters and some Icelandic horses that are homozygous mutant do not pace. On the other hand heterozygosity or homozygosity for the mutation are permissive to enable a variety of four-beat ambling gaits to be performed, with genetic modifiers that may be unique to each gaited breed. The mutation promotes ambling gaits and pace and it inhibits the transition from trot or pace to gallop, which explains its high frequency in pacers and trotters used for harness racing. It is an open question if the mutation alters the fate of DMRT3-neurons or changes their transcriptional regulation, but it is clear that these neurons must have a key role for the control centre in the spinal cord coordinating limb movements.
All of the compositions and/or methods disclosed and claimed herein can be made and executed without undue experimentation in light of the present disclosure. While the compositions and methods of this invention have been described in terms of preferred embodiments, it will be apparent to those of skill in the art that variations may be applied to the compositions and/or methods and in the steps or in the sequence of steps of the method described herein without departing from the concept, spirit and scope of the invention. More specifically, it will be apparent that certain agents which are both chemically and physiologically related may be substituted for the agents described herein while the same or similar results would be achieved. All such similar substitutes and modifications apparent to those skilled in the art are deemed to be within the spirit, scope and concept of the invention as defined by the appended claims.
Claims
1. A method for predicting the pattern of locomotion in a horse including the ability to use alternative gaits, to trot or pace at a fast speed, and to perform in dressage, said method comprising steps of;
- i) extracting DNA from a sample obtained from a horse,
- ii) determining in said DNA the presence or absence of at least one genetic marker, wherein said at least one genetic marker is located in the region between the flanking SNPs at nucleotide positions 22,628,976 and 23,315,071 base pairs on horse chromosome 23.
2. The method according to claim 1, wherein said at least one genetic marker is located in the region between the flanking SNPs at nucleotide positions 22,919,878 and 23,011,289 base pairs on horse chromosome 23.
3. The method according to claim 1, wherein the genetic marker is selected from the genetic markers listed in Table 4, Table 5, Table 7, and Table 8.
4. The method according to claim 2, wherein the genetic marker is selected from the genetic markers listed in Table 8.
5. The method according to claim 2 comprising identifying in said DNA the nucleotide in one or more specific position selected from the positions
- 22,919,878; 22,920,361; 22,920,434; 22,920,646; 22,920,717; 22,921,203; 22,922,079; 22,922,780;22,923,569; 22,924,120; 22,924,142; 22,924,299; 22,924,380; 22,924,407; 22,926,098; 22,926,188; 22,926,872; 22,927,387; 22,927,607; 22,928,220; 22,928,537; 22,928,587; 22,929,137; 22,930,011; 22,932,024; 22,932,895; 22,933,218; 22,936,034; 22,940,759; 22,942,423; 22,945,643; 22,946,599; 22,948,774; 22,949,055; 22,949,108; 22,949,240; 22,949,710; 22,956,846; 22,960,132; 22,960,528; 22,960,710; 22,964,042; 22,965,059; 22,967,119; 22,967,656; 22,967,915; 22,968,898; 22,973,984; 22,974,589; 22,979,124; 22,980,014; 22,982,879; 22,984,588; 22,985,746; 22,988,210; 22,988,991; 22,993,092; 22,994,591; 22,999,058; 22,999,655; 23,002,606; 23,003,956; 23,008,772; 23,008,789; 23,009,648; 23,010,164; and 23,011,289, on horse chromosome 23.
6. The method according to claim 1 comprising determining in said DNA the presence or absence of:
- i) the nucleotide C in a nucleotide position corresponding to position 939 in SEQ ID NO: 1,
- ii) the nucleotide A in a nucleotide position corresponding to position 939 in SEQ ID NO: 3,
- iii) the nucleotide C and/or T in a nucleotide position corresponding to position 51 in SEQ ID NO: 5,
- iv) the nucleotide A and/or G in a nucleotide position corresponding to position 51 in SEQ ID NO: 6,
- v) the nucleotide C and/or T in a nucleotide position corresponding to position 51 in SEQ ID NO: 7,
- vi) the nucleotide G and/or C in a nucleotide position corresponding to position 51 in SEQ ID NO: 8,
- vii) the nucleotide A and/or G in a nucleotide position corresponding to position 51 in SEQ ID NO: 9,
- viii) the nucleotide T and/or G in a nucleotide position corresponding to position 51 in SEQ ID NO: 10,
- ix) the nucleotide T and/or C in a nucleotide position corresponding to position 51 in SEQ ID NO: 11,
- x) the nucleotide C and/or T in a nucleotide position corresponding to position 51 in SEQ ID NO: 12,
- xi) the nucleotide A and/or G in a nucleotide position corresponding to position 51 in SEQ ID NO: 13,
- xii) the nucleotide A and/or C in a nucleotide position corresponding to position 51 in SEQ ID NO: 14
- xiii) the nucleotide G and/or C in a nucleotide position corresponding to position 51 in SEQ ID NO: 15,
- xiv) the nucleotide C and/or T in a nucleotide position corresponding to position 51 in SEQ ID NO: 16,
- xv) the nucleotide G and/or A in a nucleotide position corresponding to position 51 in SEQ ID NO: 17,
- xvi) the nucleotide G and/or C in a nucleotide position corresponding to position 51 in SEQ ID NO: 18,
- xvii) the nucleotide C and/or A in a nucleotide position corresponding to position 51 in SEQ ID NO: 19,
- xviii) the nucleotide T and/or C in a nucleotide position corresponding to position 51 in SEQ ID NO: 20,
- xix) the nucleotide C and/or T in a nucleotide position corresponding to position 51 in SEQ ID NO: 21,
- xx) the nucleotide C and/or T in a nucleotide position corresponding to position 51 in SEQ ID NO: 22,
- xxi) the nucleotide C and/or A in a nucleotide position corresponding to position 51 in SEQ ID NO: 23,
- xxii) the nucleotide C and/or G in a nucleotide position corresponding to position 51 in SEQ ID NO: 24, and/or
- xxiii) the nucleotide A and/or T in a nucleotide position corresponding to position 51 in SEQ ID NO: 25.
7. The method according to claim 4 comprising determining in said DNA the presence or absence of:
- i) the nucleotide C in a nucleotide position corresponding to position 939 in SEQ ID NO: 1, and/or
- ii) the nucleotide A in a nucleotide position corresponding to position 939 in SEQ ID NO: 3.
8. A method for predicting the pattern of locomotion in a horse including the ability to use alternative gaits, to trot or pace at a fast speed, and to perform in dressage, said method comprising steps of;
- i) extracting protein from a sample obtained from a horse, and
- ii) determining in said protein sample the presence or absence of a truncated form of the DMRT3 protein.
9. The use of the method according to any of claims 1-8 for paternity testing.
10. The use of the method according to any of claims 1-8 for selection a horse for breeding,
11. A method for selection a horse for breeding, said method comprising determining in a DNA sample obtained from said horse the allele of at least one genetic marker, wherein said at least one genetic marker is located in the region between the flanking SNPs at nucleotide positions 22,628,976 on horse chromosome 23.
Type: Application
Filed: May 4, 2012
Publication Date: Feb 27, 2014
Applicant: CAPILET GENETICS AB (JARFALLA)
Inventors: Lisa S. Andersson (Uppsala), Leif Andersson (Uppsala), Gabriella Lindgren (Knivsta)
Application Number: 13/696,128
International Classification: C12Q 1/68 (20060101);