Patents by Inventor Shiya Song

Shiya Song has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Graphical user interface displaying relatedness based on shared DNA

Patent number: 11887697

Abstract: A user may select one or more potential common ancestors with a DNA match to view the target individual's relationship with them. The process may include identifying, from a first genealogical profile of the target individual. A first individual has a first linkage that connects the target individual towards the selected potential common ancestor. The process may also include identifying, from a second genealogical profile of the DNA match, a second individual who has a second linkage that connects the DNA match towards the selected potential common ancestor. The process may further include connecting the first linkage and the second linkage with the selected potential common ancestor by adding one or more individuals whose profiles are retrieved from other searchable genealogical profiles stored in the online system. With the nodes and connections available, the process may generate a map of visual connections between the target individual and the DNA match.

Type: Grant

Filed: October 7, 2022

Date of Patent: January 30, 2024

Assignee: Ancestry.com DNA, LLC

Inventors: Shiya Song, Neal Craig Varner, Ross E. Curtis, Brian Jerel Kerr, Kelly McCloy Becker, Brett Frederick Jorgensen, Bryce Damon Ririe, Michael Joseph Mulligan, Justin Matthew Robert Van Dyke, Michaela Black Bonkemeyer
ESTIMATION OF PHENOTYPES USING DNA, PEDIGREE, AND HISTORICAL DATA

Publication number: 20230352115

Abstract: Disclosed are techniques for predicting a trait of an individual and identifying a set of enriched record collections of a genetic community. To predict a trait of an individual, DNA features and non-DNA features of the individual are accessed to generate a feature vector that is inputted into a machine learning model. The machine learning model generates a prediction of the trait. The prediction may be based on an inheritance prediction and/or a community prediction. To identify a set of enriched record collections, individuals belonging to a genetic community are identified and a set of candidate record collections are accessed. A community count and a background count is determined for each candidate record collection. The set of enriched record collections are identified based on a comparison of the community count and the background count. The genetic community may be annotated using the set of enriched record collections.

Type: Application

Filed: June 8, 2023

Publication date: November 2, 2023

Inventors: Ahna R. Girshick, Natalie Telis, Julie M. Granka, Asher Keith Haug Baltzell, Shiya Song, Genevieve Heather Linnea Roberts, Shannon Ries McCurdy, Jialiang Gu
ACCELERATED HIDDEN MARKOV MODELS FOR GENOTYPE ANALYSIS

Publication number: 20230335217

Abstract: Disclosed is a configuration for determining a genotyping label composition of a target individual using direct acyclic paths. The configuration includes receiving a phased genotype of the target individual, including a first haplotype and a second haplotype. The configuration initiates a full-ethnicity hidden Markov model (HMM) including nodes with a set of ethnicity labels. The first haplotype is input to determine a first subset of ethnicity labels that match the first haplotype. The second haplotype is input to determine a second subset of ethnicity labels that match the second haplotype. The first and second subsets of ethnicity labels are combined to create a candidate subset of ethnicity labels for the target individual. The configuration initiates a simplified HMM with nodes from the candidate subset of ethnicity labels. The phased genotype of the target individual is input to the simplified HMM to determine genotyping label composition of the target individual.

Type: Application

Filed: April 13, 2023

Publication date: October 19, 2023

Inventors: Keith Daniel Noto, James Parker Ferry, Bryan Joseph Johnson, Alisa Sedghifar, Yong Wang, Shiya Song, Jeffrey Adrion
Estimation of phenotypes using DNA, pedigree, and historical data

Patent number: 11735290

Abstract: Disclosed are techniques for predicting a trait of an individual and identifying a set of enriched record collections of a genetic community. To predict a trait of an individual, DNA features and non-DNA features of the individual are accessed to generate a feature vector that is inputted into a machine learning model. The machine learning model generates a prediction of the trait. The prediction may be based on an inheritance prediction and/or a community prediction. To identify a set of enriched record collections, individuals belonging to a genetic community are identified and a set of candidate record collections are accessed. A community count and a background count is determined for each candidate record collection. The set of enriched record collections are identified based on a comparison of the community count and the background count. The genetic community may be annotated using the set of enriched record collections.

Type: Grant

Filed: January 14, 2021

Date of Patent: August 22, 2023

Assignee: Ancestry.com DNA, LLC

Inventors: Ahna R. Girshick, Natalie Telis, Julie M. Granka, Asher Keith Haug Baltzell, Shiya Song, Genevieve Heather Linnea Roberts, Shannon Ries McCurdy, Jialiang Gu
GRAPHICAL USER INTERFACE DISPLAYING RELATEDNESS BASED ON SHARED DNA

Publication number: 20230116793

Abstract: A user may select one or more potential common ancestors with a DNA match to view the target individual's relationship with them. The process may include identifying, from a first genealogical profile of the target individual. A first individual has a first linkage that connects the target individual towards the selected potential common ancestor. The process may also include identifying, from a second genealogical profile of the DNA match, a second individual who has a second linkage that connects the DNA match towards the selected potential common ancestor. The process may further include connecting the first linkage and the second linkage with the selected potential common ancestor by adding one or more individuals whose profiles are retrieved from other searchable genealogical profiles stored in the online system. With the nodes and connections available, the process may generate a map of visual connections between the target individual and the DNA match.

Type: Application

Filed: October 7, 2022

Publication date: April 13, 2023

Inventors: Shiya Song, Neal Craig Varner, Ross E. Curtis, Brian Jerel Kerr, Kelly McCloy Becker, Brett Frederick Jorgensen, Bryce Damon Ririe, Michael Joseph Mulligan, Justin Matthew Robert Van Dyke, Michaela Black Bonkemeyer
LINKING INDIVIDUAL DATASETS TO A DATABASE

Publication number: 20220365934

Abstract: The disclosed system links an individual dataset to a database. The system receives a target individual dataset associated with a target individual and identifies candidate individual datasets that are potentially related to the target individual dataset. The system identifies a related individual dataset that has data bits that match some data bits in the target individual dataset. The system then identifies a parent node that is a common parent node to both the target individual dataset and the related individual dataset. The system retrieves a data tree that the parent node belongs to with the data tree containing information describing inter-relationships among datasets in the data tree. A node in the data tree is identified to assign the target individual dataset based on strings of matched data bits and number of the matched strings between the target individual dataset and the datasets in the data tree.

Type: Application

Filed: July 20, 2022

Publication date: November 17, 2022

Inventors: Shiya Song, Jingwen Pei, Brett Frederick Jorgensen, Aaron James Stern, Ross E. Curtis
Graphical user interface displaying relatedness based on shared DNA

Patent number: 11482306

Abstract: A user may select one or more potential common ancestors with a DNA match to view the target individual's relationship with them. The process may include identifying, from a first genealogical profile of the target individual. A first individual has a first linkage that connects the target individual towards the selected potential common ancestor. The process may also include identifying, from a second genealogical profile of the DNA match, a second individual who has a second linkage that connects the DNA match towards the selected potential common ancestor. The process may further include connecting the first linkage and the second linkage with the selected potential common ancestor by adding one or more individuals whose profiles are retrieved from other searchable genealogical profiles stored in the online system. With the nodes and connections available, the process may generate a map of visual connections between the target individual and the DNA match.

Type: Grant

Filed: February 27, 2020

Date of Patent: October 25, 2022

Assignee: Ancestry.com DNA, LLC

Inventors: Shiya Song, Neal Craig Varner, Ross E. Curtis, Brian Jerel Kerr, Kelly McCloy Becker, Brett Frederick Jorgensen, Bryce Damon Ririe, Michael Joseph Mulligan, Justin Matthew Robert Van Dyke, Michaela Black Bonkemeyer
Linking individual datasets to a database

Patent number: 11429615

Abstract: The disclosed system links an individual dataset to a database. The system receives a target individual dataset associated with a target individual and identifies candidate individual datasets that are potentially related to the target individual dataset. The system identifies a related individual dataset that has data bits that match some data bits in the target individual dataset. The system then identifies a parent node that is a common parent node to both the target individual dataset and the related individual dataset. The system retrieves a data tree that the parent node belongs to with the data tree containing information describing inter-relationships among datasets in the data tree. A node in the data tree is identified to assign the target individual dataset based on strings of matched data bits and number of the matched strings between the target individual dataset and the datasets in the data tree.

Type: Grant

Filed: December 19, 2020

Date of Patent: August 30, 2022

Assignee: Ancestry.com DNA, LLC

Inventors: Shiya Song, Jingwen Pei, Brett Frederick Jorgensen, Aaron James Stern, Ross E. Curtis
LINKING INDIVIDUAL DATASETS TO A DATABASE

Publication number: 20210216556

Abstract: The disclosed system links an individual dataset to a database. The system receives a target individual dataset associated with a target individual and identifies candidate individual datasets that are potentially related to the target individual dataset. The system identifies a related individual dataset that has data bits that match some data bits in the target individual dataset. The system then identifies a parent node that is a common parent node to both the target individual dataset and the related individual dataset. The system retrieves a data tree that the parent node belongs to with the data tree containing information describing inter-relationships among datasets in the data tree. A node in the data tree is identified to assign the target individual dataset based on strings of matched data bits and number of the matched strings between the target individual dataset and the datasets in the data tree.

Type: Application

Filed: December 19, 2020

Publication date: July 15, 2021

Inventors: Shiya Song, Jingwen Pei, Brett Frederick Jorgensen, Aaron James Stern, Ross E. Curtis
ESTIMATION OF PHENOTYPES USING DNA, PEDIGREE, AND HISTORICAL DATA

Publication number: 20210134391

Abstract: Disclosed are techniques for predicting a trait of an individual and identifying a set of enriched record collections of a genetic community. To predict a trait of an individual, DNA features and non-DNA features of the individual are accessed to generate a feature vector that is inputted into a machine learning model. The machine learning model generates a prediction of the trait. The prediction may be based on an inheritance prediction and/or a community prediction. To identify a set of enriched record collections, individuals belonging to a genetic community are identified and a set of candidate record collections are accessed. A community count and a background count is determined for each candidate record collection. The set of enriched record collections are identified based on a comparison of the community count and the background count. The genetic community may be annotated using the set of enriched record collections.

Type: Application

Filed: January 14, 2021

Publication date: May 6, 2021

Inventors: Ahna R. Girshick, Natalie Telis, Julie M. Granka, Asher Keith Haug Baltzell, Shiya Song, Genevieve Heather Linnea Roberts, Shannon Ries McCurdy, Jialiang Gu
ANCESTRY INFERENCE BASED ON CONVOLUTIONAL NEURAL NETWORK

Publication number: 20210134387

Abstract: A system divides an input genotype dataset into a plurality of windows, each including a sequence of SNPs and determines a pair of phased haplotype datasets from the plurality of windows of genotype datasets. For at least one window, a plurality of emission probabilities are determined using one or more CNN models that take phased haplotypes as input and generates emission probabilities as output, where the emission probability corresponds to a probability of observing the pair of phased haplotype datasets within the window given a pair of ethnicity labels. The system then generates a directed acyclic graph that comprises a plurality of node groups and a plurality of edges, wherein the node group corresponding to the particular window comprises a plurality of nodes and each node is associated with one of the emission probabilities. Based on the directed acyclic graph, the system generates information on ethnic origin of the individual.

Type: Application

Filed: January 15, 2021

Publication date: May 6, 2021

Inventors: Joshua Goodwin Jon McMaster-Schraiber, Shiya Song, Yong Wang
Estimation of phenotypes using DNA, pedigree, and historical data

Patent number: 10896742

Abstract: Disclosed are techniques for predicting a trait of an individual and identifying a set of enriched record collections of a genetic community. To predict a trait of an individual, DNA features and non-DNA features of the individual are accessed to generate a feature vector that is inputted into a machine learning model. The machine learning model generates a prediction of the trait. The prediction may be based on an inheritance prediction and/or a community prediction. To identify a set of enriched record collections, individuals belonging to a genetic community are identified and a set of candidate record collections are accessed. A community count and a background count is determined for each candidate record collection. The set of enriched record collections are identified based on a comparison of the community count and the background count. The genetic community may be annotated using the set of enriched record collections.

Type: Grant

Filed: October 31, 2019

Date of Patent: January 19, 2021

Assignee: Ancestry.com DNA, LLC

Inventors: Ahna R. Girshick, Natalie Telis, Julie M. Granka, Asher Keith Haug Baltzell, Shiya Song
GLOBAL ANCESTRY DETERMINATION SYSTEM

Publication number: 20200286579

Abstract: An input genotype is divided into a plurality of windows, each including a sequence of SNPs. For each window, a diploid HMM is computed based on genotypes and/or phased haplotypes to determine a probability of a haplotype sequence being associated with a particular label. For example, the diploid HMM for a window is used to determine the emission probability that the window corresponds to a set of labels. An inter-window HMM, with a set of states for each window, is computed. Labels are assigned to the input genotype based on the inter-window HMM. Upper and lower bounds are estimated to produce a range of likely percentage values an input can be assigned to a given label. Confidence values are determined indicating a likelihood that an individual inherits DNA from a certain population. Maps are generated with polygons representing regions where a measure of ethnicity of population falls within specific ranges.

Type: Application

Filed: May 13, 2020

Publication date: September 10, 2020

Inventors: Shiya Song, Keith D. Noto, Yong Wang
GRAPHICAL USER INTERFACE DISPLAYING RELATEDNESS BASED ON SHARED DNA

Publication number: 20200273542

Abstract: A user may select one or more potential common ancestors with a DNA match to view the target individual's relationship with them. The process may include identifying, from a first genealogical profile of the target individual. A first individual has a first linkage that connects the target individual towards the selected potential common ancestor. The process may also include identifying, from a second genealogical profile of the DNA match, a second individual who has a second linkage that connects the DNA match towards the selected potential common ancestor. The process may further include connecting the first linkage and the second linkage with the selected potential common ancestor by adding one or more individuals whose profiles are retrieved from other searchable genealogical profiles stored in the online system. With the nodes and connections available, the process may generate a map of visual connections between the target individual and the DNA match.

Type: Application

Filed: February 27, 2020

Publication date: August 27, 2020

Inventors: Shiya Song, Neal Craig Varner, Ross E. Curtis, Brian Jerel Kerr, Kelly McCloy Becker, Brett Frederick Jorgensen, Bryce Damon Ririe, Michael Joseph Mulligan, Justin Matthew Robert Van Dyke, Michaela Black Bonkemeyer
Global ancestry determination system

Patent number: 10692587

Abstract: An input genotype is divided into a plurality of windows, each including a sequence of SNPs. For each window, a diploid HMM is computed based on genotypes and/or phased haplotypes to determine a probability of a haplotype sequence being associated with a particular label. For example, the diploid HMM for a window is used to determine the emission probability that the window corresponds to a set of labels. An inter-window HMM, with a set of states for each window, is computed. Labels are assigned to the input genotype based on the inter-window HMM. Upper and lower bounds are estimated to produce a range of likely percentage values an input can be assigned to a given label. Confidence values are determined indicating a likelihood that an individual inherits DNA from a certain population. Maps are generated with polygons representing regions where a measure of ethnicity of population falls within specific ranges.

Type: Grant

Filed: September 11, 2019

Date of Patent: June 23, 2020

Assignee: Ancestry.com DNA, LLC

Inventors: Shiya Song, Keith D. Noto, Yong Wang
ESTIMATION OF PHENOTYPES USING DNA, PEDIGREE, AND HISTORICAL DATA

Publication number: 20200135296

Abstract: Disclosed are techniques for predicting a trait of an individual and identifying a set of enriched record collections of a genetic community. To predict a trait of an individual, DNA features and non-DNA features of the individual are accessed to generate a feature vector that is inputted into a machine learning model. The machine learning model generates a prediction of the trait. The prediction may be based on an inheritance prediction and/or a community prediction. To identify a set of enriched record collections, individuals belonging to a genetic community are identified and a set of candidate record collections are accessed. A community count and a background count is determined for each candidate record collection. The set of enriched record collections are identified based on a comparison of the community count and the background count. The genetic community may be annotated using the set of enriched record collections.

Type: Application

Filed: October 31, 2019

Publication date: April 30, 2020

Inventors: Ahna R. Girshick, Natalie Telis, Julie M. Granka, Asher Keith Haug Baltzell, Shiya Song
ADMIXED SYNTHETIC REFERENCE PANEL

Publication number: 20200082905

Abstract: An input genotype is divided into a plurality of windows, each including a sequence of SNPs. For each window, a diploid HMM is computed based on genotypes and/or phased haplotypes to determine a probability of a haplotype sequence being associated with a particular label. For example, the diploid HMM for a window is used to determine the emission probability that the window corresponds to a set of labels. An inter-window HMM, with a set of states for each window, is computed. Labels are assigned to the input genotype based on the inter-window HMM. Upper and lower bounds are estimated to produce a range of likely percentage values an input can be assigned to a given label. Confidence values are determined indicating a likelihood that an individual inherits DNA from a certain population. Maps are generated with polygons representing regions where a measure of ethnicity of population falls within specific ranges.

Type: Application

Filed: September 11, 2019

Publication date: March 12, 2020

Inventors: Shiya Song, David Andrew Turissini, Yong Wang, Jake Kelly Byrnes
GLOBAL ANCESTRY DETERMINATION SYSTEM

Publication number: 20200082903

Abstract: An input genotype is divided into a plurality of windows, each including a sequence of SNPs. For each window, a diploid HMM is computed based on genotypes and/or phased haplotypes to determine a probability of a haplotype sequence being associated with a particular label. For example, the diploid HMM for a window is used to determine the emission probability that the window corresponds to a set of labels. An inter-window HMM, with a set of states for each window, is computed. Labels are assigned to the input genotype based on the inter-window HMM. Upper and lower bounds are estimated to produce a range of likely percentage values an input can be assigned to a given label. Confidence values are determined indicating a likelihood that an individual inherits DNA from a certain population. Maps are generated with polygons representing regions where a measure of ethnicity of population falls within specific ranges.

Type: Application

Filed: September 11, 2019

Publication date: March 12, 2020

Inventors: Shiya Song, Keith D. Noto, Yong Wang
CONFIDENCE AND RANGE OF ETHNICITY ESTIMATES IN A GLOBAL ANCESTRY DETERMINATION SYSTEM

Publication number: 20200082909

Abstract: An input genotype is divided into a plurality of windows, each including a sequence of SNPs. For each window, a diploid HMM is computed based on genotypes and/or phased haplotypes to determine a probability of a haplotype sequence being associated with a particular label. For example, the diploid HMM for a window is used to determine the emission probability that the window corresponds to a set of labels. An inter-window HMM, with a set of states for each window, is computed. Labels are assigned to the input genotype based on the inter-window HMM. Upper and lower bounds are estimated to produce a range of likely percentage values an input can be assigned to a given label. Confidence values are determined indicating a likelihood that an individual inherits DNA from a certain population. Maps are generated with polygons representing regions where a measure of ethnicity of population falls within specific ranges.

Type: Application

Filed: September 11, 2019

Publication date: March 12, 2020

Inventors: Yong Wang, Alisa Sedghifar, Shiya Song, David Andrew Turissini