Patents by Inventor Shiya Song

Shiya Song has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11887697
    Abstract: A user may select one or more potential common ancestors with a DNA match to view the target individual's relationship with them. The process may include identifying, from a first genealogical profile of the target individual, a first individual who has a first linkage that connects the target individual towards the selected potential common ancestor. The process may also include identifying, from a second genealogical profile of the DNA match, a second individual who has a second linkage that connects the DNA match towards the selected potential common ancestor. The process may further include connecting the first linkage and the second linkage with the selected potential common ancestor by adding one or more individuals whose profiles are retrieved from other searchable genealogical profiles stored in the online system. With the nodes and connections available, the process may generate a map of visual connections between the target individual and the DNA match.
    Type: Grant
    Filed: October 7, 2022
    Date of Patent: January 30, 2024
    Assignee: Ancestry.com DNA, LLC
    Inventors: Shiya Song, Neal Craig Varner, Ross E. Curtis, Brian Jerel Kerr, Kelly McCloy Becker, Brett Frederick Jorgensen, Bryce Damon Ririe, Michael Joseph Mulligan, Justin Matthew Robert Van Dyke, Michaela Black Bonkemeyer
  • Publication number: 20230352115
    Abstract: Disclosed are techniques for predicting a trait of an individual and identifying a set of enriched record collections of a genetic community. To predict a trait of an individual, DNA features and non-DNA features of the individual are accessed to generate a feature vector that is inputted into a machine learning model. The machine learning model generates a prediction of the trait. The prediction may be based on an inheritance prediction and/or a community prediction. To identify a set of enriched record collections, individuals belonging to a genetic community are identified and a set of candidate record collections are accessed. A community count and a background count is determined for each candidate record collection. The set of enriched record collections are identified based on a comparison of the community count and the background count. The genetic community may be annotated using the set of enriched record collections.
    Type: Application
    Filed: June 8, 2023
    Publication date: November 2, 2023
    Inventors: Ahna R. Girshick, Natalie Telis, Julie M. Granka, Asher Keith Haug Baltzell, Shiya Song, Genevieve Heather Linnea Roberts, Shannon Ries McCurdy, Jialiang Gu
  • Publication number: 20230335217
    Abstract: Disclosed is a configuration for determining a genotyping label composition of a target individual using direct acyclic paths. The configuration includes receiving a phased genotype of the target individual, including a first haplotype and a second haplotype. The configuration initiates a full-ethnicity hidden Markov model (HMM) including nodes with a set of ethnicity labels. The first haplotype is input to determine a first subset of ethnicity labels that match the first haplotype. The second haplotype is input to determine a second subset of ethnicity labels that match the second haplotype. The first and second subsets of ethnicity labels are combined to create a candidate subset of ethnicity labels for the target individual. The configuration initiates a simplified HMM with nodes from the candidate subset of ethnicity labels. The phased genotype of the target individual is input to the simplified HMM to determine genotyping label composition of the target individual.
    Type: Application
    Filed: April 13, 2023
    Publication date: October 19, 2023
    Inventors: Keith Daniel Noto, James Parker Ferry, Bryan Joseph Johnson, Alisa Sedghifar, Yong Wang, Shiya Song, Jeffrey Adrion
  • Patent number: 11735290
    Abstract: Disclosed are techniques for predicting a trait of an individual and identifying a set of enriched record collections of a genetic community. To predict a trait of an individual, DNA features and non-DNA features of the individual are accessed to generate a feature vector that is inputted into a machine learning model. The machine learning model generates a prediction of the trait. The prediction may be based on an inheritance prediction and/or a community prediction. To identify a set of enriched record collections, individuals belonging to a genetic community are identified and a set of candidate record collections are accessed. A community count and a background count is determined for each candidate record collection. The set of enriched record collections are identified based on a comparison of the community count and the background count. The genetic community may be annotated using the set of enriched record collections.
    Type: Grant
    Filed: January 14, 2021
    Date of Patent: August 22, 2023
    Assignee: Ancestry.com DNA, LLC
    Inventors: Ahna R. Girshick, Natalie Telis, Julie M. Granka, Asher Keith Haug Baltzell, Shiya Song, Genevieve Heather Linnea Roberts, Shannon Ries McCurdy, Jialiang Gu
  • Publication number: 20230116793
    Abstract: A user may select one or more potential common ancestors with a DNA match to view the target individual's relationship with them. The process may include identifying, from a first genealogical profile of the target individual, a first individual who has a first linkage that connects the target individual towards the selected potential common ancestor. The process may also include identifying, from a second genealogical profile of the DNA match, a second individual who has a second linkage that connects the DNA match towards the selected potential common ancestor. The process may further include connecting the first linkage and the second linkage with the selected potential common ancestor by adding one or more individuals whose profiles are retrieved from other searchable genealogical profiles stored in the online system. With the nodes and connections available, the process may generate a map of visual connections between the target individual and the DNA match.
    Type: Application
    Filed: October 7, 2022
    Publication date: April 13, 2023
    Inventors: Shiya Song, Neal Craig Varner, Ross E. Curtis, Brian Jerel Kerr, Kelly McCloy Becker, Brett Frederick Jorgensen, Bryce Damon Ririe, Michael Joseph Mulligan, Justin Matthew Robert Van Dyke, Michaela Black Bonkemeyer
  • Publication number: 20220365934
    Abstract: The disclosed system links an individual dataset to a database. The system receives a target individual dataset associated with a target individual and identifies candidate individual datasets that are potentially related to the target individual dataset. The system identifies a related individual dataset that has data bits that match some data bits in the target individual dataset. The system then identifies a parent node that is a common parent node to both the target individual dataset and the related individual dataset. The system retrieves a data tree that the parent node belongs to with the data tree containing information describing inter-relationships among datasets in the data tree. A node in the data tree is identified to assign the target individual dataset based on strings of matched data bits and number of the matched strings between the target individual dataset and the datasets in the data tree.
    Type: Application
    Filed: July 20, 2022
    Publication date: November 17, 2022
    Inventors: Shiya Song, Jingwen Pei, Brett Frederick Jorgensen, Aaron James Stern, Ross E. Curtis
  • Patent number: 11482306
    Abstract: A user may select one or more potential common ancestors with a DNA match to view the target individual's relationship with them. The process may include identifying, from a first genealogical profile of the target individual, a first individual who has a first linkage that connects the target individual towards the selected potential common ancestor. The process may also include identifying, from a second genealogical profile of the DNA match, a second individual who has a second linkage that connects the DNA match towards the selected potential common ancestor. The process may further include connecting the first linkage and the second linkage with the selected potential common ancestor by adding one or more individuals whose profiles are retrieved from other searchable genealogical profiles stored in the online system. With the nodes and connections available, the process may generate a map of visual connections between the target individual and the DNA match.
    Type: Grant
    Filed: February 27, 2020
    Date of Patent: October 25, 2022
    Assignee: Ancestry.com DNA, LLC
    Inventors: Shiya Song, Neal Craig Varner, Ross E. Curtis, Brian Jerel Kerr, Kelly McCloy Becker, Brett Frederick Jorgensen, Bryce Damon Ririe, Michael Joseph Mulligan, Justin Matthew Robert Van Dyke, Michaela Black Bonkemeyer
  • Patent number: 11429615
    Abstract: The disclosed system links an individual dataset to a database. The system receives a target individual dataset associated with a target individual and identifies candidate individual datasets that are potentially related to the target individual dataset. The system identifies a related individual dataset that has data bits that match some data bits in the target individual dataset. The system then identifies a parent node that is a common parent node to both the target individual dataset and the related individual dataset. The system retrieves a data tree that the parent node belongs to with the data tree containing information describing inter-relationships among datasets in the data tree. A node in the data tree is identified to assign the target individual dataset based on strings of matched data bits and number of the matched strings between the target individual dataset and the datasets in the data tree.
    Type: Grant
    Filed: December 19, 2020
    Date of Patent: August 30, 2022
    Assignee: Ancestry.com DNA, LLC
    Inventors: Shiya Song, Jingwen Pei, Brett Frederick Jorgensen, Aaron James Stern, Ross E. Curtis
  • Publication number: 20210216556
    Abstract: The disclosed system links an individual dataset to a database. The system receives a target individual dataset associated with a target individual and identifies candidate individual datasets that are potentially related to the target individual dataset. The system identifies a related individual dataset that has data bits that match some data bits in the target individual dataset. The system then identifies a parent node that is a common parent node to both the target individual dataset and the related individual dataset. The system retrieves a data tree that the parent node belongs to with the data tree containing information describing inter-relationships among datasets in the data tree. A node in the data tree is identified to assign the target individual dataset based on strings of matched data bits and number of the matched strings between the target individual dataset and the datasets in the data tree.
    Type: Application
    Filed: December 19, 2020
    Publication date: July 15, 2021
    Inventors: Shiya Song, Jingwen Pei, Brett Frederick Jorgensen, Aaron James Stern, Ross E. Curtis
  • Publication number: 20210134391
    Abstract: Disclosed are techniques for predicting a trait of an individual and identifying a set of enriched record collections of a genetic community. To predict a trait of an individual, DNA features and non-DNA features of the individual are accessed to generate a feature vector that is inputted into a machine learning model. The machine learning model generates a prediction of the trait. The prediction may be based on an inheritance prediction and/or a community prediction. To identify a set of enriched record collections, individuals belonging to a genetic community are identified and a set of candidate record collections are accessed. A community count and a background count is determined for each candidate record collection. The set of enriched record collections are identified based on a comparison of the community count and the background count. The genetic community may be annotated using the set of enriched record collections.
    Type: Application
    Filed: January 14, 2021
    Publication date: May 6, 2021
    Inventors: Ahna R. Girshick, Natalie Telis, Julie M. Granka, Asher Keith Haug Baltzell, Shiya Song, Genevieve Heather Linnea Roberts, Shannon Ries McCurdy, Jialiang Gu
  • Publication number: 20210134387
    Abstract: A system divides an input genotype dataset into a plurality of windows, each including a sequence of SNPs and determines a pair of phased haplotype datasets from the plurality of windows of genotype datasets. For at least one window, a plurality of emission probabilities are determined using one or more CNN models that take phased haplotypes as input and generates emission probabilities as output, where the emission probability corresponds to a probability of observing the pair of phased haplotype datasets within the window given a pair of ethnicity labels. The system then generates a directed acyclic graph that comprises a plurality of node groups and a plurality of edges, wherein the node group corresponding to the particular window comprises a plurality of nodes and each node is associated with one of the emission probabilities. Based on the directed acyclic graph, the system generates information on ethnic origin of the individual.
    Type: Application
    Filed: January 15, 2021
    Publication date: May 6, 2021
    Inventors: Joshua Goodwin Jon McMaster-Schraiber, Shiya Song, Yong Wang
  • Patent number: 10896742
    Abstract: Disclosed are techniques for predicting a trait of an individual and identifying a set of enriched record collections of a genetic community. To predict a trait of an individual, DNA features and non-DNA features of the individual are accessed to generate a feature vector that is inputted into a machine learning model. The machine learning model generates a prediction of the trait. The prediction may be based on an inheritance prediction and/or a community prediction. To identify a set of enriched record collections, individuals belonging to a genetic community are identified and a set of candidate record collections are accessed. A community count and a background count is determined for each candidate record collection. The set of enriched record collections are identified based on a comparison of the community count and the background count. The genetic community may be annotated using the set of enriched record collections.
    Type: Grant
    Filed: October 31, 2019
    Date of Patent: January 19, 2021
    Assignee: Ancestry.com DNA, LLC
    Inventors: Ahna R. Girshick, Natalie Telis, Julie M. Granka, Asher Keith Haug Baltzell, Shiya Song
  • Publication number: 20200286579
    Abstract: An input genotype is divided into a plurality of windows, each including a sequence of SNPs. For each window, a diploid HMM is computed based on genotypes and/or phased haplotypes to determine a probability of a haplotype sequence being associated with a particular label. For example, the diploid HMM for a window is used to determine the emission probability that the window corresponds to a set of labels. An inter-window HMM, with a set of states for each window, is computed. Labels are assigned to the input genotype based on the inter-window HMM. Upper and lower bounds are estimated to produce a range of likely percentage values an input can be assigned to a given label. Confidence values are determined indicating a likelihood that an individual inherits DNA from a certain population. Maps are generated with polygons representing regions where a measure of ethnicity of population falls within specific ranges.
    Type: Application
    Filed: May 13, 2020
    Publication date: September 10, 2020
    Inventors: Shiya Song, Keith D. Noto, Yong Wang
  • Publication number: 20200273542
    Abstract: A user may select one or more potential common ancestors with a DNA match to view the target individual's relationship with them. The process may include identifying, from a first genealogical profile of the target individual, a first individual who has a first linkage that connects the target individual towards the selected potential common ancestor. The process may also include identifying, from a second genealogical profile of the DNA match, a second individual who has a second linkage that connects the DNA match towards the selected potential common ancestor. The process may further include connecting the first linkage and the second linkage with the selected potential common ancestor by adding one or more individuals whose profiles are retrieved from other searchable genealogical profiles stored in the online system. With the nodes and connections available, the process may generate a map of visual connections between the target individual and the DNA match.
    Type: Application
    Filed: February 27, 2020
    Publication date: August 27, 2020
    Inventors: Shiya Song, Neal Craig Varner, Ross E. Curtis, Brian Jerel Kerr, Kelly McCloy Becker, Brett Frederick Jorgensen, Bryce Damon Ririe, Michael Joseph Mulligan, Justin Matthew Robert Van Dyke, Michaela Black Bonkemeyer
  • Patent number: 10692587
    Abstract: An input genotype is divided into a plurality of windows, each including a sequence of SNPs. For each window, a diploid HMM is computed based on genotypes and/or phased haplotypes to determine a probability of a haplotype sequence being associated with a particular label. For example, the diploid HMM for a window is used to determine the emission probability that the window corresponds to a set of labels. An inter-window HMM, with a set of states for each window, is computed. Labels are assigned to the input genotype based on the inter-window HMM. Upper and lower bounds are estimated to produce a range of likely percentage values an input can be assigned to a given label. Confidence values are determined indicating a likelihood that an individual inherits DNA from a certain population. Maps are generated with polygons representing regions where a measure of ethnicity of population falls within specific ranges.
    Type: Grant
    Filed: September 11, 2019
    Date of Patent: June 23, 2020
    Assignee: Ancestry.com DNA, LLC
    Inventors: Shiya Song, Keith D. Noto, Yong Wang
  • Publication number: 20200135296
    Abstract: Disclosed are techniques for predicting a trait of an individual and identifying a set of enriched record collections of a genetic community. To predict a trait of an individual, DNA features and non-DNA features of the individual are accessed to generate a feature vector that is inputted into a machine learning model. The machine learning model generates a prediction of the trait. The prediction may be based on an inheritance prediction and/or a community prediction. To identify a set of enriched record collections, individuals belonging to a genetic community are identified and a set of candidate record collections are accessed. A community count and a background count is determined for each candidate record collection. The set of enriched record collections are identified based on a comparison of the community count and the background count. The genetic community may be annotated using the set of enriched record collections.
    Type: Application
    Filed: October 31, 2019
    Publication date: April 30, 2020
    Inventors: Ahna R. Girshick, Natalie Telis, Julie M. Granka, Asher Keith Haug Baltzell, Shiya Song
  • Publication number: 20200082905
    Abstract: An input genotype is divided into a plurality of windows, each including a sequence of SNPs. For each window, a diploid HMM is computed based on genotypes and/or phased haplotypes to determine a probability of a haplotype sequence being associated with a particular label. For example, the diploid HMM for a window is used to determine the emission probability that the window corresponds to a set of labels. An inter-window HMM, with a set of states for each window, is computed. Labels are assigned to the input genotype based on the inter-window HMM. Upper and lower bounds are estimated to produce a range of likely percentage values an input can be assigned to a given label. Confidence values are determined indicating a likelihood that an individual inherits DNA from a certain population. Maps are generated with polygons representing regions where a measure of ethnicity of population falls within specific ranges.
    Type: Application
    Filed: September 11, 2019
    Publication date: March 12, 2020
    Inventors: Shiya Song, David Andrew Turissini, Yong Wang, Jake Kelly Byrnes
  • Publication number: 20200082903
    Abstract: An input genotype is divided into a plurality of windows, each including a sequence of SNPs. For each window, a diploid HMM is computed based on genotypes and/or phased haplotypes to determine a probability of a haplotype sequence being associated with a particular label. For example, the diploid HMM for a window is used to determine the emission probability that the window corresponds to a set of labels. An inter-window HMM, with a set of states for each window, is computed. Labels are assigned to the input genotype based on the inter-window HMM. Upper and lower bounds are estimated to produce a range of likely percentage values an input can be assigned to a given label. Confidence values are determined indicating a likelihood that an individual inherits DNA from a certain population. Maps are generated with polygons representing regions where a measure of ethnicity of population falls within specific ranges.
    Type: Application
    Filed: September 11, 2019
    Publication date: March 12, 2020
    Inventors: Shiya Song, Keith D. Noto, Yong Wang
  • Publication number: 20200082909
    Abstract: An input genotype is divided into a plurality of windows, each including a sequence of SNPs. For each window, a diploid HMM is computed based on genotypes and/or phased haplotypes to determine a probability of a haplotype sequence being associated with a particular label. For example, the diploid HMM for a window is used to determine the emission probability that the window corresponds to a set of labels. An inter-window HMM, with a set of states for each window, is computed. Labels are assigned to the input genotype based on the inter-window HMM. Upper and lower bounds are estimated to produce a range of likely percentage values an input can be assigned to a given label. Confidence values are determined indicating a likelihood that an individual inherits DNA from a certain population. Maps are generated with polygons representing regions where a measure of ethnicity of population falls within specific ranges.
    Type: Application
    Filed: September 11, 2019
    Publication date: March 12, 2020
    Inventors: Yong Wang, Alisa Sedghifar, Shiya Song, David Andrew Turissini
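
Several abstracts in this listing (for example, patent number 11735290) describe identifying enriched record collections by comparing a community count with a background count for each candidate collection. The abstracts do not say how that comparison is made, so the following is only an illustrative sketch and not the patented method. It assumes the comparison is a ratio of per-individual rates checked against a threshold; the function name, parameter names, and the `min_ratio` default are all hypothetical.

```python
from typing import Dict, List


def enriched_collections(
    community_counts: Dict[str, int],
    background_counts: Dict[str, int],
    community_size: int,
    background_size: int,
    min_ratio: float = 2.0,  # hypothetical threshold, not taken from the patents
) -> List[str]:
    """Return collections whose community rate is at least min_ratio times the background rate."""
    enriched = []
    for name, c_count in community_counts.items():
        b_count = background_counts.get(name, 0)
        community_rate = c_count / community_size
        background_rate = b_count / background_size
        if background_rate == 0:
            # Assumption: a collection absent from the background counts as
            # enriched whenever it appears in the community at all.
            if c_count > 0:
                enriched.append(name)
        elif community_rate / background_rate >= min_ratio:
            enriched.append(name)
    return sorted(enriched)


# Example with made-up counts: collection "A" is over-represented in the
# community, "B" occurs at the background rate, "C" never occurs in the background.
result = enriched_collections(
    community_counts={"A": 30, "B": 5, "C": 2},
    background_counts={"A": 100, "B": 500},
    community_size=100,
    background_size=10_000,
)
print(result)
```

A production system would more likely use a statistical test than a fixed ratio; the ratio is used here only to keep the sketch short.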