Patents by Inventor Keith D. Noto
Keith D. Noto has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 10679729Abstract: Novel haplotype cluster Markov models are used to phase genomic samples. After the models are built, they rapidly and accurately phase new samples without requiring that the new samples be used to re-build the models. The models set transition probabilities such that the probability for an appearance of any allele within any haplotype is a non-zero number. Furthermore, the most unlikely pairs of haplotypes are discarded from each model at each level until c of the likelihood mass at each level is discarded. The models are also constructed such that contributing windows of SNPs partially overlap so that phasing decisions near one of the extreme ends of any model is are not significantly determinative of the phase. Additionally, the models are configured such that two or more nodes can be merged during the building/updating procedure to consolidate haplotype clusters having similar distributions.Type: GrantFiled: October 19, 2015Date of Patent: June 9, 2020Assignee: Ancestry.com DNA, LLCInventors: Catherine Ann Ball, Keith D. Noto, Kenneth G. Chahine, Mathew J. Barber, Yong Wang
-
Publication number: 20200160202Abstract: An input sample SNP genotype is divided into a plurality of windows, each including a sequence of SNPs. For each window, a diploid hidden Markov Model (HMM) is built and from a haplotype Markov Model (MM). The diploid HMM for a window is used to determine the probability that the window corresponds to a pair of labels (e.g., ethnicity labels). An inter-window HMM, with a set of states for each window, is built based on the diploid HMMs for each window. Labels are assigned to the input sample genotype based on the inter-window HMM.Type: ApplicationFiled: January 8, 2020Publication date: May 21, 2020Inventors: Keith D. Noto, Yong Wang
-
Publication number: 20200098445Abstract: Described are computational methods to reconstruct the chromosomes (and genomes) of ancestors given genetic data, IBD information, and full or partial pedigree information of some number of their descendantsType: ApplicationFiled: December 3, 2019Publication date: March 26, 2020Inventors: Julie M. Granka, Keith D. Noto
-
Publication number: 20200082903Abstract: An input genotype is divided into a plurality of windows, each including a sequence of SNPs. For each window, a diploid HMM is computed based on genotypes and/or phased haplotypes to determine a probability of a haplotype sequence being associated with a particular label. For example, the diploid HMM for a window is used to determine the emission probability that the window corresponds to a set of labels. An inter-window HMM, with a set of states for each window, is computed. Labels are assigned to the input genotype based on the inter-window HMM. Upper and lower bounds are estimated to produce a range of likely percentage values an input can be assigned to a given label. Confidence values are determined indicating a likelihood that an individual inherits DNA from a certain population. Maps are generated with polygons representing regions where a measure of ethnicity of population falls within specific ranges.Type: ApplicationFiled: September 11, 2019Publication date: March 12, 2020Inventors: Shiya Song, Keith D. Noto, Yong Wang
-
Patent number: 10558930Abstract: An input sample SNP genotype is divided into a plurality of windows, each including a sequence of SNPs. For each window, a diploid hidden Markov Model (HMM) is built and from a haplotype Markov Model (MM). The diploid HMM for a window is used to determine the probability that the window corresponds to a pair of labels (e.g., ethnicity labels). An inter-window HMM, with a set of states for each window, is built based on the diploid HMMs for each window. Labels are assigned to the input sample genotype based on the inter-window HMM.Type: GrantFiled: July 13, 2016Date of Patent: February 11, 2020Assignee: Ancestry.com DNA, LLCInventors: Keith D. Noto, Yong Wang
-
Patent number: 10504611Abstract: Described are computational methods to reconstruct the chromosomes (and genomes) of ancestors given genetic data, IBD information, and full or partial pedigree information of some number of their descendants.Type: GrantFiled: October 19, 2015Date of Patent: December 10, 2019Assignee: Ancestry.com DNA, LLCInventors: Julie M. Granka, Keith D. Noto
-
Publication number: 20190139624Abstract: Identification of inheritance-by-descent haplotype matches between individuals is described. A set of tables including word match, haplotypes and segment match tables are populated. DNA samples are received and stored. A word identification module extracts haplotype values from each sample. The word match table is indexed according to the unique combination of position and haplotype. Each column represents a different sample, and each cell indicates whether that sample includes that haplotype at that position. The haplotypes table includes the raw haplotype data for each sample. The segment match table is indexed by sample identifier, and columns represent other samples. Each cell is populated to indicate for each identified sample pair which position range(s) include matching haplotypes for both samples. The tables are persistently stored in databases of the matching system. As new sample data is received, each table is updated to include the newly received samples, and additional matching takes place.Type: ApplicationFiled: October 4, 2018Publication date: May 9, 2019Inventors: Jake Kelly Byrnes, Aaron Ling, Keith D. Noto, Jeremy Pollack, Catherine Ann Ball, Kenneth Gregory Chahine
-
Patent number: 10114922Abstract: Identification of inheritance-by-descent haplotype matches between individuals is described. A set of tables including word match, haplotypes and segment match tables are populated. DNA samples are received and stored. A word identification module extracts haplotype values from each sample. The word match table is indexed according to the unique combination of position and haplotype. Each column represents a different sample, and each cell indicates whether that sample includes that haplotype at that position. The haplotypes table includes the raw haplotype data for each sample. The segment match table is indexed by sample identifier, and columns represent other samples. Each cell is populated to indicate for each identified sample pair which position range(s) include matching haplotypes for both samples. The tables are persistently stored in databases of the matching system. As new sample data is received, each table is updated to include the newly received samples, and additional matching takes place.Type: GrantFiled: September 17, 2013Date of Patent: October 30, 2018Assignee: Ancestry.com DNA, LLCInventors: Jake Kelly Byrnes, Aaron Ling, Keith D. Noto, Jeremy Pollack, Catherine Ann Ball, Kenneth Gregory Chahine
-
Publication number: 20170277827Abstract: Described are computational methods to reconstruct the chromosomes (and genomes) of ancestors given genetic data, IBD information, and full or partial pedigree information of some number of their descendants.Type: ApplicationFiled: October 19, 2015Publication date: September 28, 2017Applicant: ANCESTRY.COM DNA, LLCInventors: Julie M. GRANKA, Keith D. NOTO
-
Publication number: 20170262577Abstract: Novel haplotype cluster Markov models are used to phase genomic samples. After the models are built, they rapidly and accurately phase new samples without requiring that the new samples be used to re-build the models. The models set transition probabilities such that the probability for an appearance of any allele within any haplotype is a non-zero number. Furthermore, the most unlikely pairs of haplotypes are discarded from each model at each level until c of the likelihood mass at each level is discarded. The models are also constructed such that contributing windows of SNPs partially overlap so that phasing decisions near one of the extreme ends of any model is are not significantly determinative of the phase. Additionally, the models are configured such that two or more nodes can be merged during the building/updating procedure to consolidate haplotype clusters having similar distributions.Type: ApplicationFiled: October 19, 2015Publication date: September 14, 2017Inventors: Catherine Ann BALL, Keith D. NOTO, Kenneth G. CHAHINE, Mathew J. BARBER, Yong WANG
-
Publication number: 20170220738Abstract: System, computer program products, and methods are disclosed for estimating a degree of ancestral relatedness between two individuals. The haplotype data for a population of individuals is divided into segment windows based on genetic markers, and matched segments for the haplotype data are generated. Each matched segment having a first cM width that exceeds a threshold cM width is included in counting the matched segments in each segment window. A weight associated with each segment window is estimated based on the count of matched segments in the associated segment window. A weighted sum of per-window cM widths for each matched segment is calculated based on the first cM width and the weights associated with the segment windows of the matched segment. The weighted sum of per-window cM widths are used to estimate a degree of ancestral relatedness between two individuals.Type: ApplicationFiled: October 14, 2015Publication date: August 3, 2017Inventors: Mathew J BARBER, Yong WANG, Keith D. NOTO, Kenneth G. CHAHINE, Catherine Ann BALL
-
Publication number: 20170017752Abstract: An input sample SNP genotype is divided into a plurality of windows, each including a sequence of SNPs. For each window, a diploid hidden Markov Model (HMM) is built and from a haplotype Markov Model (MM). The diploid HMM for a window is used to determine the probability that the window corresponds to a pair of labels (e.g., ethnicity labels). An inter-window HMM, with a set of states for each window, is built based on the diploid HMMs for each window. Labels are assigned to the input sample genotype based on the inter-window HMM.Type: ApplicationFiled: July 13, 2016Publication date: January 19, 2017Inventors: Keith D. Noto, Yong Wang
-
Publication number: 20170011042Abstract: A system identifies ancestral birth locations or surnames estimated to be associated with an individual's ancestors using an individual's genetic sample. The system identifies users who are genetic matches to the individual and determines whether and how often a birth location or surname appears in the pedigrees of those users. Birth locations or surnames that appear frequently throughout the pedigrees of genetically matching users may represent birth locations or surnames that are affiliated with the individual's ancestors. The system determines whether the frequency of appearance of a birth location or surname is statistically significant to eliminate biases for certain birth locations or surnames that appear more frequently than others. The birth location or surname may be provided to the individual based on an also-determined enrichment score.Type: ApplicationFiled: July 6, 2016Publication date: January 12, 2017Inventors: Amir R. Kermany, Julie M. Granka, Keith D. Noto
-
Publication number: 20160026755Abstract: Identification of inheritance-by-descent haplotype matches between individuals is described. A set of tables including word match, haplotypes and segment match tables are populated. DNA samples are received and stored. A word identification module extracts haplotype values from each sample. The word match table is indexed according to the unique combination of position and haplotype. Each column represents a different sample, and each cell indicates whether that sample includes that haplotype at that position. The haplotypes table includes the raw haplotype data for each sample. The segment match table is indexed by sample identifier, and columns represent other samples. Each cell is populated to indicate for each identified sample pair which position range(s) include matching haplotypes for both samples. The tables are persistently stored in databases of the matching system. As new sample data is received, each table is updated to include the newly received samples, and additional matching takes place.Type: ApplicationFiled: September 17, 2013Publication date: January 28, 2016Applicant: Ancestry.com DNA, LLCInventors: Jake Kelly Byrnes, Aaron Ling, Keith D. Noto, Jeremy Pollack, Catherine Ann Ball, Kenneth Gregory Chahine
-
Publication number: 20140067355Abstract: Phased haplotype features are used to infer an individual's ancestry. Reference genomic data is obtained for individuals of known ancestral origin. Haplotype features are identified based on consecutive SNPs from each individual. Sample genomic data is obtained for an individual of unknown ancestral origin. The data is phased and divided into features analogous to the features in the reference data. An admixture estimator then performs an admixture estimation based on the observed feature values in the sample data and the reference data. The estimation indicates a contribution of each of the known populations to the genome of the sample individual.Type: ApplicationFiled: September 6, 2013Publication date: March 6, 2014Applicant: Ancestry.com DNA, LLCInventors: Keith D. Noto, Jake Kelly Byrnes, Catherine Ann Ball, Kenneth Gregory Chahine