METHODS OF DIAGNOSING ALZHEIMER'S DISEASE AND MARKERS IDENTIFIED BY SET ASSOCIATION
The present disclosure relates to genetic markers and methods of diagnosing and screening for late-onset Alzheimer's disease (LOAD). As such, the disclosure encompasses a whole-genome association analysis of single nucleotide polymorphisms (SNPs) of which a number are located within the GRB2-associated binding protein 2 (GAB2) gene as well as other markers associated with other genes. The disclosure identifies two novel haplotypes within the GAB2 gene, i.e., a LOAD risk-enhancing and a LOAD risk-decreasing haplotype. These haplotypes modify LOAD risk differentially in combination with APOE alleles. Further encompassed are therapeutic methods and agents of decreasing the deterioration of cells associated with LOAD.
Latest Translational Genomics Research Institute Patents:
- Molecular tagging methods and sequencing libraries
- Systems and methods of diagnosing and characterizing infections
- Methods for tracking improved athletic performance by detecting expression of dysferlin
- Methods of detecting somatic and germline variants in impure tumors
- Methods for predicting onset of a migraine in a subject by detecting STXBP3 RNA levels
This application is a continuation application of U.S. Ser. No. 12/148,728, filed Apr. 21, 2008, which claims the benefit of U.S. Provisional Application Ser. No. 60/933,583, filed Jun. 6, 2007, and U.S. Provisional Application Ser. No. 60/925,638, filed Apr. 20, 2007, all of which are hereby incorporated by reference in their entireties.
BACKGROUND1. Field
The present disclosure relates to genetic markers, compositions for the detection of genetic markers and methods of diagnosing and screening for Alzheimer's disease (AD). Further encompassed are therapeutic methods and agents for decreasing the deterioration of cells associated with AD and methods of screening such agents.
2. Related Art
Disorders of the brain are serious medical conditions causing disability and diminished quality of life. Neurological damage is largely irreversible and thus early diagnosis and close monitoring are critical to the successful treatment of patients. Alzheimer's disease (AD) is a neurodegenerative disease associated with progressive memory loss and cognitive dysfunction. It is associated with abnormal clumps (amyloid plaques) and tangled bundles of fibers (neurofibrillary tangles) in the brain which are considered signs of AD. An estimated 4 million Americans have AD. By the year 2030 approximately 1 in every 80 persons in the U.S. will have AD.
Familial Alzheimer's disease (FAD) is known to be inherited. In affected families, members of at least two generations have had the disease. FAD is rare, accounting for less than 1% of all cases of AD. FAD has an earlier onset, i.e., about 40 years of age and can be observed to run in families.
Early-onset Alzheimer's disease (EOAD) is a rare form of Alzheimer's disease in which individuals are diagnosed with the disease before age 65. Less than 10% of all Alzheimer's disease patients have EOAD. Younger individuals who develop Alzheimer's disease exhibit more of the brain abnormalities that are normally associated with Alzheimer's disease. EOAD is usually familial and follows an autosomal dominant inheritance pattern. To date, mutations in three genes including amyloid precursor protein (APP) on chromosome 21, presenilin 1 (PSEN1) on chromosome 14 and presenilin 2 (PSEN2) on chromosome 1 have been identified in families with EOAD. Mutations in the APP, PSEN1 and PSEN2 genes account for about 50% of the disease. Most of the pathogenic mutations in the APP and presenilin genes are associated with abnormal processing of APP, which leads to the overproduction of toxic Aβ-1-42. Down syndrome patients, who have three copies of chromosome 21 which includes the APP gene, begin to develop the characteristic senile plaques and tau tangles at the ages of 30 and 40 (M. Ilyas Kamboh (2004), Molecular Genetics of Late-Onset Alzheimer's Disease, Annals of Human Genetics 68(4):381-404).
Late-onset Alzheimer's disease (LOAD) is the most common form of Alzheimer's disease, accounting for about 90% of cases and usually occurring after age 65. LOAD strikes almost half of all individuals over the age of 85 and may or may not be hereditary. It is a complex and multifactorial disease with the possible involvement of several genes. Genome-wide linkage or linkage disequilibrium studies on LOAD have provided informative data for the existence of multiple putative genes for AD on several chromosomes, with the strongest evidence on chromosomes 12, 10, 9 and 6. LOAD cases tend to be sporadic, wherein there is no family history of the disease. Genetic susceptibility at multiple genes and interaction between these genes as well as environmental factors are most likely responsible for the etiology of LOAD. Twin data on incident cases indicates that almost 80% of the LOAD risk is attributable to genetic factors. The Apolipoprotein E (APOE) gene on chromosome 19q13 has been identified as a strong risk factor for LOAD. In fact, the APOE-ε4 allele has been established as a strong susceptibility marker that accounts for nearly 30% of the risk in late-onset AD. More specifically, three variants of APOE, encoded by codons 112 and 158, have been found to modify the risk of LOAD. As compared to the common APOE-ε3 allele (codon 112=Cys and codon 158=Arg), the APOE-ε4 allele (codon 112=Arg and codon 158=Arg) increases the risk of AD, while the APOE-ε2 allele (codon 112=Cys and codon 158=Cys) decreases the risk of AD. The effect of the APOE-ε4 allele is dose related, wherein one or two copies of the APOE-ε4 allele are associated with 3-fold or 15-fold risk, respectively. However, the effect of the APOE-ε4 allele on AD risk appears to decline with increasing age (M. Ilyas Kamboh (2004), supra).
From the time of diagnosis, people with AD survive about half as long as those of similar age without dementia. Medicare costs for beneficiaries with AD were $91 billion in 2005 and may increase to as much as $160 billion in 2010. Finding a treatment that could delay the onset by five years could reduce the number of individuals with AD by nearly 50 percent after 50 years. Drug development for AD is very active and sensitive diagnostic and screening technologies could identify patients for therapy and monitor their response. Improved diagnostic tools for AD would thus be a significant advancement to drug development for this disease and would also provide a means to guide therapeutic decision making thus improving outcomes and reducing unnecessary exposure of patients to costly medications with unwanted side effects.
SUMMARYThe present disclosure relates to genetic markers and methods of diagnosing and screening for late-onset Alzheimer's disease (LOAD). As such, the disclosure encompasses a whole-genome association analysis of single nucleotide polymorphisms (SNPs) of which a number are located within the GRB2-associated binding protein 2 (GAB2) gene and other genes as set forth in Table 7. The disclosure identifies two novel haplotypes within the GAB2 gene, i.e., a LOAD risk-enhancing and a LOAD risk-decreasing haplotype as well as other markers disclosed on Table 7 associated with LOAD risk. These haplotypes and some of the markers modify LOAD risk differentially in combination with APOE alleles. Further encompassed are therapeutic methods and agents of decreasing the deterioration of cells associated with LOAD.
One aspect of the disclosure provides a method of assigning a subject to a late onset Alzheimer's disease (LOAD) risk group wherein the method includes providing a biological sample from the subject, detecting a marker in the biological sample selected from a group comprising: a marker associated with a haplotype of GAB2 associated with LOAD and the markers in Table 7; and assigning the subject to the late onset Alzheimer's disease (LOAD) risk group based upon the presence or absence of the marker. The method involves directly or indirectly detecting the presence or absence of the marker. In one embodiment, the marker is associated with increased risk of LOAD and the subject is assigned to a high LOAD risk group if the subject has the marker. Thus, the presence of this haplotype indicates that the subject is at an increased risk of developing LOAD. As such, the haplotype of GAB2 has the SNP haploblock identified by SEQ ID NO: 2. In another embodiment, the marker is associated with a decreased risk of LOAD and the subject is assigned to a low LOAD risk group if the subject has the marker. Thus, the presence of this haplotype indicates that the subject is at a decreased risk of developing LOAD. As such, the haplotype of GAB2 has the SNP haploblock identified by SEQ ID NO: 1. In addition, the subject usually carries an apolipoprotein E allele associated with increased LOAD risk regardless of the haplotype. Most often, the subject will carry APOE-ε4, an allele commonly associated with LOAD. In certain instances, the subject will carry the APOC-1a SNP rs4420638.
The disclosure contemplates that the marker is detected by a method including but not limited to nucleic acid hybridization, antibody binding, activity assay, polymerase chain reaction (PCR), S1 nuclease assay and via gene chip. For example, the marker can be detected by nucleic acid hybridization using at least one hybridization probe specific to a SNP specific, by way of example, to the SNP haploblock identified by SEQ ID NO: 1 or SEQ ID NO: 2.
Another aspect of the disclosure provides for an isolated and/or recombinant nucleic acid encoding the amino acid sequence of GAB2-R2, wherein the amino acid sequence codes for a GAB2 protein that is associated with increased risk of LOAD in individuals that carry the APOE-ε4 allele. The corresponding nucleotide sequence is provided by GAB2-R1. The nucleic acid can be provided in combination with an expression vector which can be transformed into a host cell such that the host cell expresses the GAB2 protein that is associated with increased risk of LOAD.
Another aspect of the disclosure provides for an isolated and/or recombinant nucleic acid encoding the amino acid sequence of GAB2-P2, wherein the amino acid sequence codes for a GAB2 protein that is associated with decreased risk of LOAD in individuals that carry the APOE-ε4 allele. The corresponding nucleotide sequence is provided by GAB2-P1. The nucleic acid can be provided in combination with an expression vector which can be transformed into a host cell such that the host cell expresses the GAB2 protein that is associated with decreased risk of LOAD. This protein has a protective effect and thus, it decreases the likelihood of developing LOAD.
The disclosure further encompasses a set of molecular probes for detection, including at least two probes capable of detecting, directly or indirectly, at least two markers disclosed herein associated with increased or decreased risk of LOAD, wherein the molecular probes are not associated with a microarray of greater than 1000 elements. In certain embodiments, at least one of the probes is specific for a marker associated with increased risk of LOAD and at least one other probe is specific for a marker associated with decreased risk of LOAD. Thus, the probes are able to detect directly or indirectly the presence or absence of the marker and are useful for diagnosis, monitoring and risk assessment of LOAD.
Another aspect of the disclosure provides a method of decreasing neuronal cell deterioration in a subject with dementia (e.g., LOAD), wherein the method includes administering a construct with a nucleic acid of SEQ ID NO: 4 or a fragment thereof to the subject. The nucleic acid codes for GAB2 and the construct includes a recombinant vector in order to transform the construct into cells or tissues of the subject such that the protein can be expressed. In one embodiment, the construct is provided to the subject in form a pharmaceutical composition. Thus, the construct or pharmaceutical composition thereof can be used to treat LOAD. Alternatively, the disclosure provides for a method of decreasing neuronal cell deterioration in a subject with dementia, wherein the method includes administering a polypeptide of SEQ ID NO: 5 or a fragment thereof to the subject. The polypeptide codes for GAB2 which can be administered in form of a pharmaceutical composition. Thus, the protein or pharmaceutical composition thereof can be used to treat LOAD. The subject to be treated for LOAD usually carries an apolipoprotein E allele such as APOE-ε4. In a preferred embodiment, the subject is a human or other mammal.
The disclosure further includes a method of decreasing or preventing neuronal cell deterioration in a subject at risk for LOAD, wherein the method includes administering a compound that modulates PI3K or ERK activity. Such a compound can mimic the function of GAB2 which is encoded by the nucleic acid sequence of SEQ ID NO: 4. The corresponding amino acid sequence is encoded by SEQ ID NO: 5.
The disclosure also encompasses a method of decreasing or preventing neuronal cell deterioration in a subject at risk for LOAD, wherein the method includes administering an siRNA specific for the haplotype of GAB2 with the SNP haploblock identified by SEQ ID NO: 2. Herein, the subject is heterozygous for the haplotype of GAB2 with the SNP haploblock identified by SEQ ID NO: 2. In a preferred embodiment, the subject is a human or other mammal.
Yet, another aspect of the disclosure provides a method of treating or preventing neuronal cell deterioration in a subject at risk for LOAD, wherein the method includes isolating stem cells from the subject; contacting the stem cells with a nucleic acid encoding GAB2 or a fragment thereof which is not the haplotype of GAB2 with the SNP haploblock identified by SEQ ID NO: 2, wherein the nucleic acid homologously recombines with at least one genomic copy of GAB2 in the stem cells; and contacting the subject with the stem cells to treat or prevent neuronal cell deterioration. Alternatively, the disclosure provides a method of treating or preventing neuronal cell deterioration in a subject at risk for LOAD, wherein the method includes isolating stem cells from the subject; contacting the stem cells with a nucleic acid encoding GAB2 or a fragment thereof which is the haplotype of GAB2 with the SNP haploblock identified by SEQ ID NO: 1, wherein the nucleic acid homologously recombines with at least one genomic copy of GAB2 in the stem cells; and contacting the subject with the stem cells to treat or prevent neuronal cell deterioration. The subject usually carries an apolipoprotein E allele associated with increased LOAD risk such as the APOE-ε4 allele.
Still, another aspect of the disclosure provides a method for screening for a therapeutic agent effective to inhibit development of cell deterioration associated with dementia, wherein the method includes incubating neuronal cells in vitro with a mixture of PI3K and an agent to be tested; measuring PI3K activity; comparing the activity to PI3K activity in the absence of the agent to be tested; and identifying the therapeutic agent by indication of PI3K activity in the presence of the agent. The therapeutic agent can inhibit the cell deterioration associated with dementia such as LOAD.
The disclosure further contemplates a non-human transgenic animal with a genome including SEQ ID NO: 1 or SEQ ID NO: 2 in association with a gene for GAB2. The non-human animal can be used as a disease model for screening therapeutic compounds for treatment of LOAD.
In yet another aspect, the disclosure provides a method for screening for a therapeutic agent effective to inhibit development of cell deterioration associated with dementia, wherein the method includes providing a transgenic animal with a genome including SEQ ID NO: 1 or SEQ ID NO: 2 in association with a gene for GAB2; administering to the transgenic animal an agent to be tested for its effectiveness in treating or preventing the cell deterioration associated with dementia; and assessing the effectiveness of the agent. This method can be used to screen for therapeutic agents that are effective in the treatment and prevention of LOAD.
The present disclosure is best understood when read in conjunction with the accompanying figures, which serve to illustrate the preferred embodiments. It is understood, however, that the disclosure is not limited to the specific embodiments disclosed in the Figures.
The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawings will be provided by the Office upon request and payment of the necessary fee.
-
- A—Kaplan-Meier plot for carriers of the risk-modifying haplotypes of GAB2.
- B—Signaling Cascade for GAB2: GAB2 is an important scaffolding molecule that mediates signaling from growth factor and cytokine receptors to tau phosphorylation, ApoE synthesis, mast cell degranulation and learning and memory.
-
- A) LOAD hippocampus (neutral red counterstain) (40× objective). The arrow indicates a highly dystrophic cell with the size and morphology of a cortical pyramidal neuron, the cell type selected by laser capture for genomic analyses. Arrowheads point to one of many structures in the sections that resemble dystrophic neurites or neuropil threads.
- B) LOAD hippocampus (neutral red counterstain) (40×). The arrow denotes a putative neurofibrillary tangle containing neuron. Arrowheads again indicate a dystrophic neurite. Immunohistochemistry with a rabbit polyclonal antibody directed at GAB2 residues 285-676 (Upstate Cell Signaling Solutions, Lake Placid, N.Y.) gave similar but weaker staining (not shown).
- C) AD posterior cingulate gyrus (40×). Filled arrows point to relatively normal appearing, putative neurons. The open arrow points to a cell with the features of a neurofibrillary tangle bearing neuron. Immunoreactive structures clearly resembling pyramidal cell apical dendrites were also observed ascending through the cortical layers (arrowheads).
- D) AD posterior cingulate cortex (100× objective). GAB2 immunoreactive cell with the flame-shaped cytoplasmic inclusion typical of the neurofibrillary tangle.
The present disclosure encompasses the following sequences with assigned sequence identifiers:
SEQ ID NO: 1—TC-GCA-TGAGGTGTCTT-Haploblock of SNPs that identify the haplotype of GAB2 that decreases LOAD risk.
SEQ ID NO: 2—CT-AAG-CAGATCAGACG-Haploblock of SNPs that identify the Haplotype of GAB2 that increases LOAD risk.
SEQ ID NO: 3—CT-AAG-CAGAGCAGCCG-Haploblock of SNPs that identify the Haplotype of GAB2 that does not alter risk for LOAD.
SEQ ID NO: 4—cDNA for GAB2 (from a fourth haplotype).
SEQ ID NO: 5—protein sequence for GAB2 (from a fourth haplotype).
GAB2-R1—cDNA sequence for LOAD risk increasing allele of human GAB2.
GAB2-R2—polypeptide sequence for LOAD risk increasing allele of human GAB2.
GAB2-P1—cDNA sequence for LOAD risk decreasing (protective) allele of human GAB2.
GAB2-P2—polypeptide sequence for LOAD risk decreasing (protective) allele of human GAB2.
GAB2-R1-Genomic—genomic sequence for LOAD risk increasing allele of human GAB2.
GAB2-P1-Genomic—genomic sequence for LOAD risk decreasing (protective) allele of human GAB2.
GAB2-Neutral-Genomic—genomic sequence for non-LOAD risk altering allele of human GAB2.
DETAILED DESCRIPTION OF THE DISCLOSURE i.) General OverviewAlzheimer's disease (AD) is a progressive neurodegenerative disorder characterized by memory and cognitive impairments and other non-cognitive behavioral symptoms. Age is the strongest risk factor, wherein almost 50% of people over the age of 85 are affected. Early-onset AD (EOAD) is associated with genetic mutations in amyloid precursor protein (APP), presenilin 1 (PSEN1) and presenilin 2 (PSEN2). However, sporadic or late-onset AD (LOAD) is multi-factorial and genetically more complex. In addition, genetic factors may account for as much as 80% of the disease risk associated with LOAD (Gatz et al. (2006) Arch. Gen. Psychiatry 63(2):168-174). While monogenic mutations cause EOAD, the only extensively validated susceptibility gene for LOAD is the apolipoprotein E (APOE-ε4) allele (Saunders et al. (1993) Neurology 43(8):1467-1472 and Farrer et al. (1997) JAMA 278(16):1349-1356). But alleles of the APOE gene do not account for all of the genetic load responsible for LOAD predisposition. Stratification by APOE-ε4 carrier status allows for the detection of association signals that are normally overwhelmed and thus, masked by the signal of APOE alleles in a non-stratified study design. Specifically, it allowed the present inventors to detect a locus that modifies the LOAD risk of individuals who carry the APOE-ε4 allele.
Late-onset Alzheimer's Disease (LOAD) is the most common age-related dementia, and is caused by neuronal death of poorly understood etiology. The most accurate method of diagnosis is the presence of characteristic plaques and tangles in the brain which can be revealed through histological examination. The apolipoprotein E (APOE-ε4) allele (supra) accounts for approximately 65% of the total risk. The disclosure encompasses a new whole-genome association analysis of single nucleotide polymorphisms (SNPs) using 502,627 SNPs in 760 histopathologically verified Alzheimer's disease (AD) cases and controls. Of the 25 SNPs with the most significant p-values in the APOE-ε4 carrier group, 10 SNPs are found to be located within the GRB2-associated binding protein 2 (GAB2) gene. There is a single haplotype block encompassing the entire gene and which contains four potentially functional non-synonymous nucleotide changes. The disclosure identifies a risk-enhancing and a risk-decreasing haplotype within the GAB2 gene. These haplotypes modify risk differentially in combination with APOE alleles. In addition, the present disclosure includes additional markers associated with increased or decreased risk of LOAD as shown in Tables 7.
ii.) DefinitionsThe following definitions are set forth to illustrate and define the meaning and scope of the various terms used to describe the present disclosure.
The term “single polynucleotide polymorphism (SNP)” refers to a DNA sequence variation that involves a change in a single nucleotide. They present in humans with a frequency of about once in every 1000 bases and contribute to differences among individuals. The majority of SNPs have no effect. However, some affect the risk for certain diseases.
The term “haplotype” means one or more closely linked alleles, i.e., genes or DNA polymorphisms (e.g., SNPs) inherited as a unit. Different combinations of polymorphisms are known as haplotypes. The difference of a single genetic marker can delineate a distinct haplotype. Alternatively, the results from several loci could be referred to as a haplotype. For example, a haplotype can be a set of SNPs on a single chromatid that are statistically associated (i.e., inherited as a unit). It is thought that these associations, and the identification of a few alleles of a “haplotype block”, can help to identify all other polymorphic sites in its region. With the identification of a haplotype block associated with a particular haplotype, one of skill in the art may readily identify all other DNA polymorphisms associated with the particular haplotype by routine sequencing of the genomic DNA of an individual having such haplotype, preferably homozygous for such haplotype. Thus, such information is valuable for investigating the genetics behind common diseases.
A “SNP haploblock identified by SEQ ID NO:” means set of at least two SNPs that are associated with an allele of a gene such as GAB2, wherein said SNPs are grouped together in form of a synthetic nucleotide sequence. The SNPs in a given haploblock may be associated with a risk increasing or risk decreasing haplotype for LOAD. Thus, when a nucleic acid is “specific to a SNP haploblock” or “specific to a SNP within a haploblock” then that nucleic acid (e.g., nucleic acid probe) is complementary to at least one SNP that is grouped within that haploblock, which may include one of the SNPs that are associated with the synthetic nucleotide sequence or any other SNP associated with the haplotype identified by the SNP haploblock. A “SNP that is specific to a SNP haploblock” is a SNP where the DNA polymorphism is found only in the particular haplotype and no other. By way of example, considering the three GAB2 haplotypes identified by the haploblocks in Table 1 below, SNP 5 is specific to the SNP haploblock identified by SEQ ID NO: 1 since the SNP haploblock identified by SEQ ID NO: 1 is the risk decreasing haplotype and this haplotype has a T at SNP 5 while the other two haplotypes have a C.
The term “HapMap” refers to a catalog of common genetic variants that occur in human beings. It describes what these variants are, where they occur in the DNA, and how they are distributed among individuals within populations and among populations in different parts of the world (A haplotype map of the human genome (2005) Nature 437:1299-1320).
iii.) Haplotypes of GAB2 As Genetic Markers for ADThe genetic sequences of different individuals are remarkably similar. When the chromosomes of two humans are compared, their DNA sequences can be identical for hundreds of bases. But at about one in every 1000 to 1,200 bases, on average, the sequences will differ. As such, one individual might have an A at that location, while another individual has a G, or a person might have extra bases at a given location or a missing segment of DNA. Differences in individual bases are the most common type of genetic variation. These genetic differences are known as single nucleotide polymorphisms (SNPs) (supra). SNPs act as markers to locate genes in DNA. Given the relatively close spacing between these SNPs, SNPs are typically inherited in blocks.
GAB2 is scaffolding protein involved in multiple signaling pathways that impact AD pathology (see
Despite the postulation of many genes as possible susceptibility factors in LOAD, the only well-established genetic link is the APOE-ε4 allele. Association studies traditionally fail to replicate due to underlying genetic structure, low sample number, poor genomic coverage and incorrect diagnosis. However, the inventors have addressed these issues by controlling for underlying genetic structure and interrogating 502,627 SNPs in 1469 samples of which 1086 samples are histopathologically confirmed based on the presence or absence of plaques and tangles in the postmortem samples. The inventors have further enhanced the analysis of this complex disorder by stratifying the data for APOE-ε4 carrier status in order to detect signals that would normally be masked in a non-stratified design. Thus, the disclosure identifies two GAB2 haplotype blocks wherein one is associated with increased risk for LOAD (SEQ ID NO: 1) and the other is associated with decreased risk for LOAD (SEQ ID NO: 2), wherein the latter exhibits a strong protective effect. These haplotypes modify risk differentially in combination with APOE alleles.
There is a single haplotype block encompassing the entire GAB2 gene and which contains four potentially functional non-synonymous nucleotide changes. The haplotype block is shown in
Determining the functional variant in the GAB2 haplotype block is challenging because the block spans 189 kb and includes at least 614 known SNPs, each of which reflects a significant association with AD due to the significant haplotype block. Of the 614 described SNPs, 4 are non-synonymous and are the best candidates for causing the disease risk-modifying effects of the haplotype. It is unlikely that a complex, partially penetrant disease like LOAD would have a truncated protein or other catastrophic risk or protective allele. However, it is likely that subtle genetic differences lead to altered protein expression or phosphorylation. PupaSNP is used to identify known SNPs that are likely to play a significant physiological role due to their location in exonic splicing enhancers (ESEs) or triplex forming regions (TRPs), both of which provide for sequence specific regulation of expression and splicing (see
An array platform (e.g., AFFYMETRIX 500K array) can be used to genotype all three populations that are utilized. However, any other platform can be employed as well. There are two reasons for confidence that the genotype calls are correct and that the association effect observed is indeed accurate (and not due to a bias of any single platform used). First, the observation is not a single SNP association, but rather the association of a large haplotype block which is in agreement with the LD structure of the HapMap CEPH population. Second, one benefit of using SNiPer-HD for genotype calls is the generation of a quality score measuring how well the clustering algorithm is able to separate the chip output into three distinct Gaussians. All six SNPs showing replication in the region have high quality scores (>0.45), indicating that the data for each SNP clustered into three distinct Gaussians corresponding to the three possible genotype calls.
The whole genome association study uses an evenly distributed panel of SNPs across the genome with an average inter-marker distance of 6 kb in a sample of 1469 well characterized AD cases and age-matched unaffected controls. As such, it identified risk-modifying haplotypes of GAB2 for APOE-ε4 carriers. Ultra-high density SNP genotyping is successfully employed to identify risk and protective loci in complex diseases because it is rigorously controlled and performed using a sufficiently reliable and dense set of markers. Further efforts are in progress to identify the functional variant(s) present in GAB2 in order to include pre-symptomatic risk assessment strategies and actual drug targets.
Several additional markers were identified in or near other genes that have utility in diagnosing and monitoring late onset Alzheimer's disease which may be used alone or in combination with the markers disclosed herein or otherwise known in the art such as the APOE-ε4 allele.
iv.) Methods of Diagnosing ADThe disclosure provides a method of assigning a subject to a late onset Alzheimer's disease (LOAD) risk group in order to assess the likelihood of the subject being afflicted with the disease. This method can be employed to assess the risk at early stages of disease progression. The method includes providing a biological sample from the subject, detecting a marker in a biological sample, which can be a haplotype of GAB2 associated with LOAD or any of the markers on Table 7, and assigning the subject to the late onset Alzheimer's disease (LOAD) risk group based upon the presence or absence of the haplotype. The method involves directly or indirectly detecting the presence or absence of the markers. For example, the haplotype of GAB2 may be associated with increased risk of LOAD and the subject is then assigned to a high LOAD risk group if the subject has the haplotype of GAB2. As an example of such is the haplotype of GAB2 identified by the SNP haploblock identified by SEQ ID NO: 2. Thus, the presence of haploblock identified by SEQ ID NO: 2 indicates that the subject is at increased risk for developing the disease. Alternatively, the marker may be associated with decreased risk of LOAD and the subject is assigned to a low LOAD risk group if the subject has the marker. An example of such is the haplotype of GAB2 identified by the SNP haploblock identified by SEQ ID NO: 1. Thus, the presence of haploblock identified by SEQ ID NO: 1 indicates that the subject is at a decreased risk for developing the disease. In addition, the subject may be further stratified by LOAD risk group based upon whether the subject carries an apolipoprotein E allele associated with increased or altered LOAD risk. By way of example, APOE-ε4, is commonly associated with LOAD. In addition, various haplotypes of APOE have been associated with LOAD risk groups as set forth in U.S. Patent Publ. 2005/0277129. Finally multiple markers disclosed herein may be used in combination to improve the accuracy, preferably two or more, three or more, four or more, five or more, or ten or more of the markers may be used.
The disclosure contemplates that the markers may be detected by a variety of methodologies or procedures that are well know in the art including, but not limited to, nucleic acid hybridization, antibody binding, activity assay, polymerase chain reaction (PCR), S1 nuclease assay and via gene chip or microarray as well as any other assay known in the art that may be used to detect the SNPs associated with a haplotype or the gene product produced from the gene of the haplotype including mRNA and protein. For example, the haplotype of GAB2 can be detected by nucleic acid hybridization using at least one hybridization probe specific to a SNP specific to the SNP haploblock identified by SEQ ID NO: 1 or SEQ ID NO: 2 (see
The step of detecting the presence or absence of a marker disclosed herein or a close isoform thereof may be carried out either directly or indirectly by any suitable means. A variety of techniques are known to those skilled in the art (supra). All generally involve the step of collecting a biological sample containing DNA or protein from the subject, and then detecting whether or not the subject possesses the marker or a close isoform thereof. For example, the detecting step may be carried out by collecting a biological sample from the subject (any fluid or tissue containing the markers associated with the haplotype), and then determining the presence or absence of the haplotype in the sample (e.g., by nucleic acid hybridization, immunoassay such as antibody binding, activity assay, polymerase chain reaction (PCR), S1 nuclease assay, via gene chip or microarray, via isoelectric focusing, etc.). Other markers may also be used that are associated with the markers disclosed herein such as SNPs or other polymorphic markers that are in close enough proximity to have a statistically significant association with the marker disclosed herein (i.e., other markers in linkage disequilibrium with a marker disclosed herein). It will be readily appreciated by those of skill in the art that the detecting steps described herein may be carried out directly or indirectly. Thus, for example, if a marker or a close isoform thereof is detected in the subject, then it is determined that the subject is either at increased or decreased risk for LOAD depending on which marker or close isoform thereof is identified (i.e., a significant enough number of markers associated with a haplotype). If, for example, the subject is identified to carry the risk increasing GAB2 haplotype in combination with the APOE-ε4 allele then the subject is very likely to be at increased risk of developing LOAD. If, for example, the subject is identified to carry the risk decreasing GAB2 haplotype in combination with the APOE-ε4 allele then the subject is very likely not at increased risk of developing LOAD or even at a lower risk of developing LOAD compared to APOE-ε4 carriers that do not carry any GAB2 haplotype associated with either SEQ ID NO: 1 or SEQ ID NO: 2. Other means of indirectly determining a risk for LOAD could be by measuring other polymorphic markers that are in statistically significant proximity to any of the SNPs of the GAB2 haplotypes.
The disclosure also provides set of molecular probes for detection, including at least two probes capable of detecting, directly or indirectly, a marker disclosed herein associated with increased or decreased risk of LOAD, wherein the molecular probes are not associated with a microarray of greater than 1000 elements, a microarray with greater than 500 elements, a microarray with greater than 100 elements a microarray with greater than 50 elements, or are not associated with a microarray. In preferred embodiments, at least one of the probes is capable of detecting, directly or indirectly, a marker disclosed herein associated with increased risk of LOAD and at least one other probe is capable of detecting, directly or indirectly, a marker disclosed herein associated with decreased risk of LOAD. Thus, the probes are able to detect directly or indirectly the presence or absence of such marker disclosed herein.
Detecting the presence or absence of a marker disclosed herein or a close isoform thereof may be carried out either directly or indirectly. For example, SNP testing technologies can be employed to detect the haplotype of GAB2. Based upon a patient's specific genotype (SNP pattern) an estimate of the patient's relative risk for LOAD could be provided. This would include genotyping first for ApoE 2/3/4 status, and then further genotyping for the GAB2 haplotype status. This includes Odds Ratio estimates based on carrier status of these two genes.
Detection of the disease also includes detection of the haplotype by any SNPs/markers within the haplotype, but also indirectly through SNPs/markers outside the haplotype and leveraging linkage disequilibrium to identify carriers of the haplotype.
In addition to determining a patient's relative risk for LOAD, the diagnosis may include prescribing therapeutic regimens to treat, prevent or delay onset of LOAD.
In certain preferred embodiments, the method of diagnosis will further include direct or indirect detection of APOE alleles associated with LOAD, preferably the APOE-ε4 allele. Such detection may be performed using any of the detection methods available to one of skill in the art and the markers disclosed herein and APOE alleles may be detected using the same or different methods and may be detected at the same or different times. Further, the method of diagnosis may rely upon the information regarding the APOE alleles of the subject that had been previously determined. With information regarding the marker disclosed herein and APOE alleles of a subject, the diagnosis of risk may be determined and present in the form of Odds Ratio estimates for the set of alleles the subject has.
v.) Screening MethodsThe disclosure also provides a method for screening for a therapeutic agent effective to inhibit development of cell deterioration associated with dementia. This method includes incubating neuronal cells in vitro with an agent to be tested, measuring PI3K activity, comparing the activity to PI3K activity in the absence of the agent to be tested, and identifying the therapeutic agent by indication of PI3K activity in the presence of the agent. The therapeutic agent can inhibit the cell deterioration associated with dementia such as LOAD. These methods also includes incubating neuronal cells in vitro with an agent to be tested, measuring GAB2 activity directly or indirectly, comparing the activity to GAB2 activity in the absence of the agent to be tested, and identifying the therapeutic agent by detection of modulation of GAB2 activity in the presence of the agent. Examples of measuring GAB2 activity include measuring mRNA level, mRNA transcription rate, protein level, assaying the activity of the GAB2 protein or one of the proteins that it regulates such as PI3K, and measuring activity of a reporter gene operably linked to a GAB2 regulatory element which can include the GAB2 promoter and/or one or more GAB2 enhancer or repressor elements. The therapeutic agent can inhibit the cell deterioration associated with dementia such as LOAD. Thus, the therapeutic agent will act as an agonist to activate PI3K. For the purpose of screening for such agents, the disclosure also includes a non-human transgenic animal or mammalian cell based assay (preferably a human cell based assay) with a genome including the human GAB2 gene associated with the haplotype identified by the SNP haploblock of SEQ ID NO: 1 or SEQ ID NO: 2. The non-human animal or mammalian cell based assay can be used as a disease model for screening therapeutic compounds for treatment of LOAD. Thus, the disclosure provides a method for screening for a therapeutic agent effective to inhibit development of cell deterioration associated with dementia, wherein the method includes providing a transgenic animal or mammalian cell based assay with a genome including the human GAB2 gene associated with the haplotype identified by the SNP haploblock of SEQ ID NO: 1 or SEQ ID NO: 2. The method further includes administering to the transgenic animal an agent to be tested for its effectiveness in treating or preventing the cell deterioration associated with dementia, and assessing the effectiveness of the agent.
vi.) Therapeutic Methods and FormulationsIn people with AD, changes in the brain may begin 10 to 20 years before any visible signs or symptoms appear. Some regions of the brain may begin to shrink, resulting in memory loss, the first visible sign of AD. Over time, AD progresses through three main stages: mild, moderate, and severe. These stages are characterized by a collection of signs and symptoms and behaviors that individuals with AD experience. People with mild symptoms of AD often seem healthy, but they are actually having difficulty making sense of the world around them. Initial symptoms are often confused with changes that take place in normal aging. Symptoms and early signs of AD may include difficulty learning and remembering new information, difficulty managing finances planning meals, taking medication on schedule, depression symptoms (sadness, decreased interest in usual activities, loss of energy), getting lost in familiar places, etc. In moderate AD, the damaging processes occurring in the brain worsen and spread to other areas that control language, reasoning, sensory processing, and thought. In this stage, symptoms and signs of AD become more pronounced and behavioral problems may become more obvious. Signs and symptoms of moderate AD may include forgetting old facts, repeating stories and/or questions over and over, making up stories to fill gaps, difficulty performing tasks, following written notes, agitation, restlessness, repetitive movements, wandering, paranoia, delusions, hallucinations, deficits in intellect and reasoning, lack of concern for appearance, hygiene, and sleep, etc. In the advanced stage of AD, damage to the brain's nerve cells is widespread. At this point, full-time care is typically required. People with severe AD may have difficulty walking, and they often suffer complications from other illnesses, such as pneumonia. Signs of severe AD may include screaming, mumbling, speaking gibberish, refusing to eat, failing to recognize family or faces, and difficulty with all essential activities of daily living.
The disclosure provides a method of decreasing or preventing neuronal cell deterioration in a subject at risk for LOAD, wherein the method includes administering a compound that mimics the function of GAB2. By way of example, such a compound may modulate PI3K or ERK activity. Such a compound is provided by the instant disclosure and is encoded by the amino acid sequence of SEQ ID NO: 5. The corresponding nucleic acid sequence is encoded by SEQ ID NO: 4. Additional compounds and agents may be readily identified by the screening methods discussed above. Such modulating may include, for example, turning on PI3K activity which will in turn activate the protein AKT which will lead to inhibition of TAU which is associated with increased neurofibrillary tangles and plaques in the brain that are characteristic of LOAD, and to inhibition of the FOXO proteins and BAD protein which increase cell death. Since PI3K is a known cancer-related gene, many compounds that modulate PI3K activity have been identified and either may be used directly in the methods herein or may be used to design libraries for further screening of PI3K modulators. Examples of such PI3K modulators may be found in U.S. Patent Application Nos: 2006005832, 20060128732, 20060106038, and 20020151549. Such modulating may also include, for example, turning on the ERK 1 and 2 cascades will lead to activated CREB which enhances learning and memory. Thus, a compound that mimics GAB2 can be used as a therapeutic agent or agonist that is expected to decrease or prevent neuronal cell deterioration associated with LOAD.
GAB2 agonists, mimics, and modulatory agents (e.g., drugs based on SEQ ID NO: 5) are formulated as pharmaceuticals to be used in the methods of the disclosure such as for the treatment and prevention of AD, particularly LOAD. Any composition or compound that can stimulate a biological response associated with the binding of a ligand analogue (i.e., an agonist or modulator) to GAB2 or that can mimic GAB2 can be used as a pharmaceutical in the disclosure. General details on techniques for formulation and administration are well described in the scientific literature (see “Remington's Pharmaceutical Sciences”, Maack Publishing Co, Easton Pa.). GAB2 agonist, mimic, and modulatory agent pharmaceutical formulations can be prepared according to any method known in the art for the manufacture of pharmaceuticals. The GAB2 agonists, mimics, and modulatory agents used in the methods of the disclosure can be formulated for administration in any conventionally acceptable way including via intravenous injection, IM, IP, orally, topically, vaginally or rectally. Oral administration is preferred. Illustrative examples are set forth below.
Pharmaceutical formulations for oral administration can be formulated using pharmaceutically acceptable carriers well known in the art in dosages suitable for oral administration. Such carriers enable the pharmaceutical formulations to be formulated in unit dosage forms as tablets, pills, powder, capsules, liquids, lozenges, gels, syrups, slurries, suspensions, etc., suitable for ingestion by the patient. Pharmaceutical preparations for oral use can be obtained through combination of GAB2 agonist compounds with a solid excipient, optionally grinding a resulting mixture, and processing the mixture of granules, after adding suitable additional compounds, if desired, to obtain tablets or pills. Suitable solid excipients are carbohydrate or protein fillers which include, but are not limited to, sugars, including lactose, sucrose, mannitol, or sorbitol; starch from corn, wheat, rice, potato, or other plants; cellulose such as methyl cellulose, hydroxypropylmethyl-cellulose, or sodium carboxymethylcellulose; and gums including arabic and tragacanth; as well as proteins such as gelatin and collagen. If desired, disintegrating or solubilizing agents may be added, such as the cross-linked polyvinyl pyrrolidone, agar, alginic acid, or a salt thereof, such as sodium alginate. Pharmaceutical preparations of the disclosure that can also be used orally are, for example, push-fit capsules made of gelatin, as well as soft, sealed capsules made of gelatin and a coating such as glycerol or sorbitol. Push-fit capsules can contain GAB2 agonists, mimics, and modulatory agents mixed with a filler or binders such as lactose or starches, lubricants such as talc or magnesium stearate, and, optionally, stabilizers. In soft capsules, the GAB2 agonist, mimic, and modulatory agent compounds may be dissolved or suspended in suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene glycol with or without stabilizers.
Aqueous suspensions of the disclosure contain a GAB2 agonist, mimic, and modulatory agent in admixture with excipients suitable for the manufacture of aqueous suspensions. Such excipients include a suspending agent, such as sodium carboxymethylcellulose, methylcellulose, hydroxypropylmethylcellulose, sodium alginate, polyvinylpyrrolidone, gum tragacanth and gum acacia, and dispersing or wetting agents such as a naturally occurring phosphatide (e.g., lecithin), a condensation product of an alkylene oxide with a fatty acid (e.g., polyoxyethylene stearate), a condensation product of ethylene oxide with a long chain aliphatic alcohol (e.g., heptadecaethylene oxycetanol), a condensation product of ethylene oxide with a partial ester derived from a fatty acid and a hexitol (e.g., polyoxyethylene sorbitol mono-oleate), or a condensation product of ethylene oxide with a partial ester derived from fatty acid and a hexitol anhydride (e.g., polyoxyethylene sorbitan monooleate). The aqueous suspension can also contain one or more preservatives such as ethyl or n-propyl p-hydroxybenzoate, one or more coloring agents, one or more flavoring agents and one or more sweetening agents, such as sucrose, aspartame or saccharin. Formulations can be adjusted for osmolarity.
Oil suspensions can be formulated by suspending a GAB2 agonist, mimic, and modulatory agent in a vegetable oil, such as arachis oil, olive oil, sesame oil or coconut oil, or in a mineral oil such as liquid paraffin. The oil suspensions can contain a thickening agent, such as beeswax, hard paraffin or acetyl alcohol. Sweetening agents can be added to provide a palatable oral preparation. These formulations can be preserved by the addition of an antioxidant such as ascorbic acid.
Dispersible powders and granules of the disclosure suitable for preparation of an aqueous suspension by the addition of water can be formulated from a GAB2 agonist, mimic, and modulatory agent in admixture with a dispersing, suspending and/or wetting agent, and one or more preservatives. Suitable dispersing or wetting agents and suspending agents are exemplified by those disclosed above. Additional excipients, for example sweetening, flavoring and coloring agents, can also be present.
The pharmaceutical formulations of the disclosure can also be in the form of oil-in-water emulsions. The oily phase can be a vegetable oil, such as olive oil or arachis oil, a mineral oil, such as liquid paraffin, or a mixture of these. Suitable emulsifying agents include naturally-occurring gums, such as gum acacia and gum tragacanth, naturally occurring phosphatides, such as soybean lecithin, esters or partial esters derived from fatty acids and hexitol anhydrides, such as sorbitan mono-oleate, and condensation products of these partial esters with ethylene oxide, such as polyoxyethylene sorbitan mono-oleate. The emulsion can also contain sweetening and flavoring agents. Syrups and elixirs can be formulated with sweetening agents, such as glycerol, sorbitol or sucrose. Such formulations can also contain a demulcent, a preservative, a flavoring or a coloring agent.
When the drugs are delivered by intravenous injection, the GAB2 agonist, mimic, and modulatory agent pharmaceutical formulations of the disclosure can be in the form of a sterile injectable preparation, such as a sterile injectable aqueous or oleaginous suspension. This suspension can be formulated according to the known art using those suitable dispersing or wetting agents and suspending agents which have been mentioned above. The sterile injectable preparation can also be a sterile injectable solution or suspension in a nontoxic parenterally-acceptable diluent or solvent. Among the acceptable vehicles and solvents that can be employed are water and Ringer's solution, an isotonic sodium chloride. In addition, sterile fixed oils can conventionally be employed as a solvent or suspending medium. For this purpose any bland fixed oil can be employed including synthetic mono- or diglycerides. In addition, fatty acids such as oleic acid can likewise be used in the preparation of injectables.
vii.) Administration and Dosing Regimen of GAB2 Agonists, Mimics, and Modulatory AgentsThe GAB2 agonists, mimics, and modulatory agents used in the methods of the disclosure can be administered in any conventionally acceptable way including via intravenous injection, IM, IP, orally, topically, vaginally or rectally. Oral administration is preferred. Administration will vary with the pharmacokinetics and other properties of the drugs and the patients' condition of health. The methods of the disclosure reduce cellular deterioration in patients who suffer from LOAD. The amount of GAB2 agonist, mimic, and modulatory agent that is adequate to accomplish this is considered the therapeutically effective dose. Precise dose schedules cannot be stated. The dosage schedule and amounts effective for this use, i.e., the “dosing regimen,” will depend upon a variety of factors, including the stage of LOAD, the severity of LOAD, the severity of the adverse side effects, the general state of the patient's health, the patient's physical status, age and the like. In calculating the dosage regimen for a patient, the mode of administration is also taken into consideration. The dosage regimen must also take into consideration the pharmacokinetics, i.e., the agonist, mimic, and modulatory agent rate of absorption, bioavailability, metabolism, clearance, and the like.
The state of the art allows the clinician to determine the dosage regimen for each individual patient who suffers from LOAD (mild, moderate and severe stages). The formulations should provide a sufficient quantity of GAB2 agonist, mimic, and modulatory agent to effectively ameliorate LOAD which is characterized by symptoms such as memory loss, depression, restlessness, etc. A typical pharmaceutical formulation for oral administration of GAB2 agonist, mimic, and modulatory agent would depend on the stage of LOAD and condition of the patient. For example, a GAB2 agonist, mimic, and modulatory agent may be administered to a patient through mono therapy (i.e., with no other Alzheimer's medications) or in combination therapy with other Alzheimer's drugs including cholinesterase inhibitors such as DONEPEZIL (Aricept), GALANTAMINE (Razadyne), RIVASTIGMINE (Exelon), and TACRINE (Cognex), drugs that generally used for treatment of mild to moderate AD. NAMENDA (memantine HCl), a drug used for treatment of moderate to severe AD may also be used in combination with GAB2 agonists, mimics, and modulatory agents. Thus, the dosages of selective GAB2 agonists, mimics, and modulatory agents administered to a patient may vary depending on age, degree of illness, drug tolerance, and concomitant medications and conditions.
viii.) ExamplesThe following specific examples are intended to illustrate the disclosure and should not be construed as limiting the scope of the claims.
In ε4 carriers from neuropathologically verified discovery, neuropathologically verified replication, and clinically characterized replication cohorts of 1425 cases and controls, LOAD was associated with six SNPs from the GRB-associated binding protein 2 (GAB2) gene and with a common haplotype encompassing the entire GAB2 gene. Of those homozygous for both the ε4 allele and the GAB2 risk haplotype, 98.7% had LOAD. In addition, the GAB2 gene was over-expressed in pathologically vulnerable neurons, the GAB2 protein was detected in neurons, tangle-bearing neurons and dystrophic neurites, and interference with GAB2 gene expression increased Tau phosphorylation. Collectively, the inventors have shown that GAB2 modifies LOAD risk in APOE ε4 carriers and influences Alzheimer's neuropathology.
More specifically, whole-genome association analysis of single nucleotide polymorphisms (SNPs) was performed using 502,627 SNPs in 760 histopathologically verified AD cases and controls (discovery cohort). All samples were stratified for APOE-ε4 carrier status prior to analysis to identify additional LOAD risk loci. Of the 25 SNPs with the most significant p-values in the APOE-ε4 carrier group, 10 were located within the GRB2-associated binding protein 2 (GAB2) gene (p-valmax=5.9×10-7; OR=0.2571 (0.1173-0.5636). These results were replicated in an additional 326 histopathologically diagnosed cases and controls, also stratified for APOE-ε4 carrier status (replication cohort 1). An additional cohort of 383 clinically diagnosed LOAD samples provided further validation for the GAB2 association (replication cohort 2). There is a single haplotype block encompassing the entire gene and which contains four potentially functional non-synonymous nucleotide changes. This analysis identified a risk-enhancing and a risk-decreasing haplotype within the GAB2 gene which modify risk differentially in combination with APOE alleles in LOAD.
Thus, the inventors conducted individualized genome-wide surveys of 502,627 single nucleotide polymorphisms (SNPs) in an effort to characterize and confirm novel LOAD susceptibility genes in three separate cohorts of LOAD cases and controls, including a discovery cohort of clinically and neuropathologically characterized expired brain donors, a replication cohort of similarly characterized expired brain donors, and a replication cohort of clinically characterized living subjects. The expired brain donor cohorts were selected to exclude clinically misdiagnosed cases and cognitively normal but neuropathologically affected elderly controls; the clinical cohort was selected to confirm genetic associations independent of any brain donor selection bias. Within each cohort, LOAD cases and controls were stratified into sub-groups of APOE ε4 carriers and non-carriers, permitting the investigation of genes that modify LOAD risk in the ε4 carriers and genes that might otherwise be masked by disproportionately large ε4 effects in the non-carriers. The inventors discovered associations between a common gene and LOAD in APOE ε4 carriers in the three neuropathological and clinical cohorts and showed that the implicated gene is associated with AD neuropathology in neuronal microarray and immunohistochemical studies. Further provided is a possible mechanism by which GAB2 modifies AD risk in a preliminary small-interfering RNA (siRNA) study.
Example 1 DesignThe software program STRUCTURE (Pritchard et al. (2000) Genetics 155:945-59) was used to test for underlying genetic stratification, using 5000 randomly selected SNPs with at least 100 SNPs per chromosome. Initial analysis yielded empirical evidence of three populations (K=3). One, which contained fourteen samples and was far removed from the rest of the study population, was eliminated from further analyses. STRUCTURE was re-run with K=2, and each sample was assigned an admixture of the resulting populations. Comparison of the resulting case and control populations defined by these admixture vectors yielded a silhouette score of 0.06, indicating that while the samples in this study are the likely result of admixture of two populations, the distribution of those populations is equivalent in cases and controls (Pritchard et al. (2000) Genetics 155(2):945-59).
Example 2 Introduction to High-Density Genome-Wide Association StudyThe genome-wide association study was performed using 502,627 SNPs on DNA samples extracted from brain tissue or blood of donors who were at least 65 years of age at the time of their death. The donors included a postmortem cohort of 664 patients who satisfied clinical and neuropathological criteria for the diagnosis of AD and 422 persons that did not meet clinical or neuropathological criteria for AD as controls (broken up into a 70% “discovery cohort” and a 30% “replication cohort 1” design). Tissue and neuropathological diagnoses were supplied by investigators from twenty National Institute on Aging Alzheimer's Disease Centers (ADCs) in accordance with an agreement with these Centers, the National Institute on Aging, and the National Alzheimer's Coordinating Center (NACC). Additional post-mortem samples were received from Sun Health Research Institute and the Netherlands Brain Bank. An additional sample series including 228 clinically assessed antemortem AD patients and 155 clinically assessed antemortem controls (“replication cohort 2”) was provided by the Mayo Clinic Rochester. APOE genotypes were obtained by either pyrosequencing (Ahmadian et al. (2000) Anal. Biochem. 280(1): 103-110) or restriction fragment length polymorphism (RFLP) analysis (Lai et al. (1998) Genomics 54(1):31-38).
A three-tiered approach was used for the analysis of the genome-wide association study. The postmortem set was split into two cohorts matched for sex and age, a discovery cohort containing 70% of the postmortem sample (“discovery cohort”) and a replication cohort containing 30% of the postmortem samples (“replication cohort 1”). An ante-mortem series from the Mayo Clinic Rochester was used as a second replication cohort (“replication cohort 2”). SNiPer-HD (Hua et al. (2007) Bioinformatics 23(1):57-63) and BRLMM were used to call the genotypes and the results were filtered to exclude any SNPs with SNiPer-HD quality score less than 0.45 (excludes monomorphic loci, and loci that do not cluster into three distinct Gaussians), those with Hardy Weinberg equilibrium p-value less than 0.01 or minor allele frequency less than 3%, and those where concordance between SNiPer-HD calls and BRLMM calls is less than 98%. After filtering, 321,953 SNPs remained. Allelic chi squares were computed for each SNP in the discovery cohort (70%) in 3 classes: all samples, APOE-ε4 carriers and APOE-ε4 non-carriers.
There were no loci in the ApoE-ε4 non-carriers that had a significant p-value after Bonnferroni correction for multiple testing (321,953 independent tests). Only one SNP, rs4420638, the only SNP on the AFFYMETRIX 500K Mapping Array in linkage disequilibrium (LD) with the ApoE-ε4 locus, was significant in the entire sample after multiple testing correction. Two SNPs in the ApoE-ε4 carrier population were significant after Bonferroni multiple testing correction. One of these SNPs, rs2373115, is located in an intron of GRB2-associated binding protein 2 (GAB2). Visual inspection revealed that ten of the top twenty-five SNPs in the ApoE-ε4 carriers stratified analysis were located in the introns or coding sequence of GAB2. These ten SNPs were then interrogated in the 30% replication cohort and the Mayo Clinic Jacksonville replication cohort (see Table 2 below). All ten SNPs examined were significant (p-value <0.05) in the 30% cohort, and six SNPs were significant in the Mayo Rochester ante-mortem cohort. The most significant p-values and lowest odds ratios (indicating a protective allele) were found to be located toward the distal, 5′ end of GAB2 (see Table 2 below).
Individualized genome-wide surveys were performed in a “neuropathological discovery cohort” of 749 expired brain donors, a “neuropathological replication cohort” of 307 expired brain donors, and an additional “clinical replication cohort” of 369 living subjects who were at least 65 years old at the time of their death or last clinical assessment and who were independently assessed for their APOE genotype. For the two neuropathological cohorts, brain tissue for DNA extraction, neuropathological diagnoses and data were supplied by investigators from twenty of the National Institute on Aging (NIA) sponsored Alzheimer's Disease Centers (ADCs) (in accordance with agreements with the NIA, the ADCs and the National Alzheimer's Coordinating Center) and from the Netherlands Brain Bank. 70% of the 1,056 expired LOAD cases and controls were randomly assigned to the hypothesis-generating neuropathological discovery cohort and the remaining 30% were assigned to the hypothesis-testing neuropathological replication cohort. For the hypothesis-testing clinical replication cohort, DNA extracted from blood, clinical diagnoses and data from subjects assessed in Rochester, Minn., were supplied by investigators from the Mayo Clinic.
The neuropathological discovery cohort included 454 LOAD cases (274 ε4 carriers and 180 ε4 non-carriers) and 295 controls (105 ε4 carriers and 190 ε4 non-carriers); the neuropathological replication cohort included 193 LOAD cases (102 ε4 carriers and 91 ε4 non-carriers) and 114 controls (27 ε4 carriers and 87 ε4 non-carriers); and the clinical replication cohort included 224 LOAD cases (117 ε4 carriers and 107 ε4 non-carriers) and 145 controls (28 ε4 carriers and 117 ε4 non-carriers). The expired brain donor cases satisfied clinical and neuropathological criteria for LOAD, and were age 73.5±6.2 at the time of death. The expired brain donor controls did not have significant cognitive impairment or significant neuropathological features of AD, and were age 75.8±7.5 at the time of death. The clinical cases satisfied criteria for probable AD, and were ages 78.9±7.8 at their last clinical visit. The clinical controls did not have clinically significant cognitive impairment and were age 81.7±6.6 at their last clinical assessment. APOE genotypes were obtained in each subject by either pyrosequencing (Ahmadian, supra) or restriction fragment length polymorphism (RFLP) analysis (Lai, supra).
The 500K GeneChip (Affymetrix, Santa Clara, Calif.) was used to survey 502,267 SNPs in each subject. Genotypes were extracted using both SNiPer-HD (Hua et al. (2007) Bioinformatics 23:57-63) and BRLMM (Affymetrix, Santa Clara, Calif.) software, and 321,953 SNPs were analyzed after excluding those that were monomorphic, did not cluster into three distinct Gaussian distributions, were associated with Hardy Weinberg equilibrium significance levels less than 0.01, had minor allele frequencies less than 3%, or exhibited less than 98% concordance between the SNiPer-HD and BRLMM calls. The software program STRUCTURE (supra) was employed to test for underlying genetic stratification, using 5,000 randomly selected SNPs and including at least 100 SNPs per chromosome. The initial analysis yielded empirical evidence of three populations. Since fourteen subjects belonged to a population far removed from the rest of the study population, they were eliminated from further analyses. STRUCTURE was used to demonstrate a comparable admixture of the two populations in the cases and controls. After stratifying the LOAD cases and controls for presence or absence of the APOE ε4 allele, allelic chi squares were computed for each SNP.
The inventors initially surveyed SNPs in the neuropathological cohort to explore LOAD associations in the ε4 carrier and non-carrier sub-groups. Within the discovery sub-group of APOE ε4 carriers, ten of the twenty-five SNPs with the most significant LOAD-association significance levels (p=2×10−4 to 6×10−7) were located in the GRB-associated binding protein 2 (GAB2) gene on chromosome 11q14.1 (see
Haploview v3.32 was used to determine the linkage disequilibrium (LD) structure of the chromosome 11q14.1 region surrounding GAB2 in each of the three APOE ε4-stratified cohorts (
Data from the 1425 subjects (including 653 APOE ε4 carriers and 772 non-carriers) in all three cohorts were combined to characterize odds ratios (ORs) and 95% confidence intervals (CIs) for GAB2 risk haplotype homozygotes (see Table 6 below) and GAB2 protective haplotype carriers.
In ε4 carriers, 67% of the LOAD cases and 43% of the controls were homozygous for the GAB2 risk haplotype. In comparison with the other ε4 carriers, GAB2 risk haplotype homozygotes had a significantly higher risk of LOAD (OR 2.70, 95% CI 1.72-4.73, P=9.4×10−6). Indeed, 78 out of 79 persons (98.7%) homozygous for both the ε4 allele and the GAB2 risk haplotype had LOAD, a significantly higher proportion than the 35 out of 39 APOE ε4 homozygotes (89.7%) with fewer than 2 copies of the GAB2 risk haplotype (P=0.023). Within the same APOE ε4 carrier group, 22% of the LOAD cases and 49% of the controls had at least one copy of the GAB2 protective haplotype. In comparison with the other ε4 carriers, carriers of the GAB2 protective haplotype had a significantly lower risk of LOAD (OR 0.37, 95% CI 0.23-0.59, P=1.5×10−6). In contrast, APOE ε4 non-carriers homozygous for the GAB2 risk haplotype did not have a higher LOAD risk than the other ε4 non-carriers (OR 1.01, 95% CI 0.73-1.38, P=0.97) and ε4 non-carriers with the GAB2 protective haplotype did not have a lower LOAD risk than the other ε4 non-carriers (OR 1.05, 95% CI 0.74-1.50, P=0.79). Whereas the inventors confirmed a younger age at dementia onset in the APOE ε4 carriers than in non-carriers (age 80.8±7.7 versus 86.0±7.5, P=3.8×10−8), there was no significant GAB2 haplotype effect on age at dementia onset in either the ε4 carriers (P=0.32) or non-carriers (P=0.84). Hence, the effects of GAB2 on LOAD risk most likely depends on both the ε4 allele and a person's age.
Example 4 Identification of a Protective GAB2 Haplotype in LOADHaploview v3.32 was used to determine the linkage disequilibrium (LD) structure of the region on chromosome 11q14.1 surrounding GAB2 for each of the three APOE-ε4 stratified cohorts (discovery, replication 1, and replication 2; see
In order to provide converging evidence that GAB2 is biologically relevant to AD neuropathology, the inventors analyzed data from a neuronal microarray study of LOAD cases and controls. Expression profiling using the Affymetrix Human Genome U133 Plus 2.0 array was used to characterize and compare GAB2 expression in laser-capture microdissected non-tangle bearing neurons of cases and controls in six brain regions differentially affected by AD. LOAD cases had significantly greater neuronal GAB2 expression in the posterior cingulate cortex (9 cases, 13 controls, 4.50-fold change, P=0.00039) and hippocampus (9 cases, 13 controls, 2.94-fold change, P=0.00085), and no significant differences in the entorhinal cortex (10 cases, 13 controls, 1.20-fold change, P=0.46), middle temporal gyrus (13 cases, 12 controls, 1.44-fold change, P=0.14), superior frontal gyrus (22 cases, 11 controls, 1.25-fold change, P=0.47), or primary visual cortex (17 cases, 12 controls, 1.53-fold change, P=0.14).
The hippocampus is especially vulnerable to AD-related neurofibrillary tangles (Braak et al. (1991) Acta. Neuropathol. 82:239-259), neuronal loss and brain atrophy (Bobinski et al. (2000) Neuroscience 95:721-725). It is preferentially involved in AD-related memory impairment (Jack et al. (1999) Neurology 52:1397-1403) and is associated with the highest cerebral GAB2 expression in the rodent brain (Lein et al. (2007) Nature 445:160-161). The posterior cingulate cortex is vulnerable to AD-related hypometabolic abnormalities and fibrillar amyloid deposition, and is also involved in AD-related memory impairment. While the entorhinal cortex, temporal and prefrontal regions are also affected by AD neuropathology, the visual cortex is relatively spared. Using a repeated measures analysis of variance to analyze neuronal GAB2 gene expression data from the same 8 LOAD cases and 10 controls, there was a significant group-by-region interaction (P=0.011), with LOAD-related increases in neuronal GAB2 gene expression that were greater in the posterior cingulate cortex and hippocampus than in the visual cortex.
Example 7 Immunohistochemical ValidationIn order to show neuronal expression of GAB2 at the protein level, GAB2 immunohistochemistry was assessed in LOAD hippocampus and posterior cingulate cortex in LOAD cases. In hippocampus, GAB2 immunoreactivity was observed in structures with the morphology of dystrophic neurites or neuropil threads, neurons, and corpora amylacea. The putative neurons were almost entirely dystrophic in appearance (
In addition to its other properties, GAB2 is the principal activator of the phosphatidylinositol 3-kinase (PI3K) signaling pathway (Pratt et al. (2001) J. Immunol. 165:4158-4163). PI3K activates Akt, which in turn promotes glycogen synthase kinase-3 (GSK-3) phosphorylation/inactivation, suppressing GSK-3-dependent phosphorylation of tau at AD-related hyperphosphorylated tau residues, the principal component of neurofibrillary tangles, and preventing apoptosis of confluent cells (Baki et al. (2004) EMBO J. 23:2586-2596; Kang et al. (2005) J. Biol. Chem. 208:31537-31547). Based on this relationship, GAB2 could function to protect cells from neuronal tangle formation and cell death and a loss-of-function GAB2 haplotype would lose some of this protection function. Thus, interference with GAB2 expression using siRNA treatment would increase tau expression at the serine-262 residue known to be hyperphosphorylated in AD.
Neuroglioma cells overexpressing tau protein were grown in 96-well plates and transfected with siRNA directed at GAB2 mRNA. Following four days of transfection, cells were fixed, permeablized, and immunostained with antibodies against total tau protein and tau protein phosphorylated on serine-262. A FITC- and Cy5-conjugated secondary antibody cocktail was then applied. After incubation and washing, images were captured and quantitated using the InCell imager 3000 (General Electric). The fold increase in serine-262 phosphorylated tau levels was calculated against control samples that had been transfected with a scrambled siRNA sequence. As shown in
The present inventors were able to use their genome-wide survey of more than 500,000 SNPs, two unusually large clinically characterized and neuropathologically verified cohorts of AD cases and controls, a third cohort of clinically well characterized subjects, and stratification of the samples carriers and non-carriers of a major susceptibility gene to characterize and confirm associations between the GAB2 gene and LOAD risk in APOE ε4 carriers. Six SNPs and a common haplotype block encompassing the entire GAB2 gene were implicated in three independent cohorts. The data from the microarray study of laser-capture microdissected neurons in LOAD cases and controls, immunohistochemistry, and a siRNA study provided converging evidence for the relevance of GAB2 to the neuropathology of LOAD. This allows the inventors to link to a mechanism by which GAB2 can modify LOAD risk in ε4 carriers and, thus, provide targets at which to aim new treatments.
Although only one genotyping platform was used in all three cohorts, the inventor's findings are unlikely to be attributable to any platform-related bias in genotyping calls since the observed association was not limited to a single SNP but was related to a large haplotype block in agreement with the LD structure of the HapMap CEPH population. Further, all six of the implicated scores had high-quality SNiPer-HD scores (greater than 0.45), indicating that the data for each SNP clustered into three distinct Gaussian distributions corresponding to the three possible genotype calls. While GAB2 haplotypes were characterized in only a few of the subjects from the microarray studies, inspection of posterior cingulate GAB2 gene expression in the three GAB2-characterized controls homozygous for the risk haplotype (10.8±10.1 raw expression units) and the three GAB2-characterized non-homozygous controls (32.8±28.6 raw expression units) support that GAB2 is under-expressed in GAB2 risk haplotype homozygotes (See also,
GAB2 is a scaffolding protein involved in multiple signaling pathways, which can affect AD-related tau, amyloid, metabolic, or other aspects of AD pathology and cell survival. Thus, discovery of this novel LOAD susceptibility gene provides new opportunities to investigate LOAD pathogenesis, predisposition, treatment and prevention.
Various modifications and variations of the present disclosure will be apparent to those skilled in the art without departing from the scope and spirit of the disclosure. Although the disclosure has been described in connection with specific preferred embodiments, it should be understood that the disclosure as claimed should not be unduly limited to such specific embodiments. Indeed, various modifications of the described modes for carrying out the disclosure which are understood by those skilled in the art are intended to be within the scope of the claims.
Example 10 Additional MarkersGenetically complex phenotypes and diseases result from the interplay between genetic variants and environmental factors. Despite this fact, most genetic association studies to date have either evaluated one genetic variation at a time or have analyzed multiple variations independently. However, this approach ignores the polygenic nature of the studied phenotypes, focuses on marginal gene effects, inflates the probability of type I error, and erroneously assumes that the impact of the variation under study is comparable to a major gene effect. Recent developments in high-throughput genotyping and computerized algorithms now facilitate multi-locus analyses for the localization of complex trait genes and clusters thereof. As shown for schizophrenia and Alzheimer's disease, multi-locus analyses acknowledge the polygenic nature of disease, provide sets of trait-associated markers and also control for multiple testing. Importantly, such analyses may provide individual genetic clusters with potentially important clinical implications for future diagnosis, prognosis, and personalized treatments.
One of the recently used data-reduction methods which yields clusters of genes related to the phenotype of interest and which has already worked well in our hands is the set-association method. It uses relevant sources of genetic information, such as allelic association and Hardy-Weinberg disequilibrium (HWD). Information is combined over multiple markers and genes in the genome, quality control is improved by trimming SNPs with high HWD values, and permutation testing limits the overall false-positive rate. For each candidate marker, two chi-square statistics were computed: one for the allelic association with disease status and one for HWD in cases. The products of these statistics for each marker are then ranked from largest to smallest. Progressively larger sums (Sj) are then calculated over the j largest chi-square statistics. For example, S1 is the largest chi-square statistic of association. S2 is the sum of the largest and second largest. S3 is the sum of the largest, second largest and third largest, etc. The empirical significance level (Pj) for each Sj is evaluated by permutation methods carried-out under the null hypothesis of no genetic association with LOAD. The smallest of the empirical significance levels (i.e. Pjmin) identifies the best and most parsimonious model predicting disease status. Importantly, the set-association method has been shown to be of superior power compared with conventional locus-by-locus analyzes and to successfully capture statistical interactions between genes.
This set-association method was applied to the hypothesis-free whole-genome association study described in the previous examples, which utilized the Affymetrix 550K SNPchip in 593 histopathologically verified AD cases and 320 histopathologically verified controls. To minimize the possibility of type I errors, cases and controls were a priori and randomly separated in two samples: 70% of cases and controls composed a hypothesis-generating sample (i.e., the set-association method including the generation of a SNP cluster was applied in this sample), whereas the remaining 30% were used as a hypothesis-testing sample to evaluate independently the significance of the SNP cluster. After removing SNPs with low call rates, the top 1000 SNPs of the hypothesis-generating sample, as assessed by their genotypic P-value for association with AD in single-locus χ2 tests, were included in the set-association procedure. A cluster of 20 SNPs (including APOE) was significantly (P<10−10) associated with LOAD and was then used for the generation of a composite cluster score, which was based upon the combined set-association statistic for each of the 20 cluster SNPs. Table 7 shows the SNPs excluding APOE. Thus, the score corresponded to the number of individual risk alleles weighted by the contribution (i.e. effect size) of each risk allele to the overall cluster. The genetic score of individuals not carrying any risk allele was set equal to 0, which is the minimal possible score value. Accordingly, the score of individuals possessing one risk allele was set equal to the corresponding set-association statistic for the particular SNP. For two or more risk alleles, set-association statistics of the corresponding SNPs were added.
In the hypothesis-generating sample, AD patients (n=407) had a mean score (±SD) of 429±108, the score of the controls (n=224) was 300±97 (P<10−10). The discriminatory ability between cases and controls (as assessed by ROC analysis, See
Claims
1. A method of assigning a subject to a late onset Alzheimer's disease (LOAD) risk group, the method comprising:
- a.) providing a biological sample from the subject;
- c.) detecting, directly or indirectly, a marker in the biological sample selected from a group comprising: a marker associated with a haplotype of GAB2 associated with LOAD and a marker in Table 7; and
- d.) assigning the subject to the late onset Alzheimer's disease (LOAD) risk group based upon the presence or absence of the haplotype.
2. The method of claim 1, wherein the marker is associated with increased risk of LOAD and the subject is assigned to a high LOAD risk group if the subject has the marker.
3. The method of claim 2, wherein the marker is associated with the haplotype of GAB2 and is selected from the SNP haploblock identified by SEQ ID NO: 2.
4. The method of claim 3, wherein said assigning is further based upon the presence or absence in the subject of an apolipoprotein E allele associated with increased LOAD risk.
5. The method of claim 4, wherein said apolipoprotein E allele is APOE-ε4.
6. The method of claim 3, wherein said assigning is further based upon the presence or absence in the subject of an apolipoprotein C allele associated with increased LOAD risk.
7. The method of claim 6, wherein said apolipoprotein C allele is APOC-1a as identified by SNP rs4420638.
8. The method of claim 1, wherein the haplotype of GAB2 is associated with decreased risk of LOAD and the subject is assigned to a low LOAD risk group if the subject has the haplotype of GAB2.
9. The method of claim 8, wherein the marker is associated with the haplotype of GAB2 and is selected from the SNP haploblock identified by SEQ ID NO: 1.
10. The method of claim 9, wherein said assigning is further based upon the presence or absence in the subject of an apolipoprotein E allele associated with increased LOAD risk.
11. The method of claim 10, wherein said apolipoprotein E allele is APOE-ε4.
12. The method of claim 1, wherein the marker is detected by a method selected from the group consisting of nucleic acid hybridization, GAB2 antibody binding, and GAB2 activity assay.
13. The method of claim 12, wherein the marker is detected by nucleic acid hybridization using at least one hybridization probe specific to a SNP specific to the SNP haploblock identified by SEQ ID NO:1 or SEQ ID NO:2.
14. The method of claim 1, wherein the marker is detected by polymerase chain reaction (PCR).
15. The method of claim 1, wherein the marker is detected through a S1 nuclease assay.
16. The method of claim 1, wherein the marker is detected by a gene chip.
17. A set of molecular probes for diagnosis or monitoring a late onset Alzheimer's disease (LOAD) comprising at least two probes capable of detecting directly or indirectly at least two markers selected from a group comprising: a marker associated with a haplotype of GAB2 associated with LOAD, a marker in Table 7, wherein said molecular probes are not associated with a microarray of greater than 1000 elements.
18. A non-human transgenic animal with a genome comprising the human GAB2 gene of the haplotype identified by the SNP haploblock of SEQ ID NO: 1 or SEQ ID NO: 2.
19. A method for screening for a therapeutic agent effective to inhibit development of cell deterioration associated with dementia, said method comprising:
- a.) providing the transgenic animal of claim 18;
- b.) administering to the transgenic animal an agent to be tested for its effectiveness in treating or preventing said cell deterioration associated with dementia; and
- c.) assessing the effectiveness of the agent.
20. The method of claim 19, wherein said dementia is LOAD.
Type: Application
Filed: Jun 18, 2009
Publication Date: Oct 15, 2009
Applicants: Translational Genomics Research Institute (Phoenix, AZ), Kronos Science Laboratory (Phoenix, AZ)
Inventors: Dietrich A. STEPHAN (San Francisco, CA), Eric M. Reiman (Scottsdale, AZ), Jennifer Webster (Phoenix, AZ), Christopher B. Heward (Mesa, AZ), Pamela Heward (Mesa, AZ), Andreas Papassotiropoulos (Zurich)
Application Number: 12/487,608
International Classification: A01K 67/027 (20060101); C12Q 1/68 (20060101); G01N 33/53 (20060101); C40B 30/04 (20060101); C07H 21/04 (20060101);