Patents by Inventor Laxmi P. Parida

Laxmi P. Parida has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11514087
    Abstract: A method includes generating, by a processor system, a graph. The graph is based at least in part on a plurality of instances in which operational taxonomic units are identified as being represented within an environment. The method can also include determining, using the processor system, that at least one instance of the plurality of instances corresponds to a false-positive identification of an operational taxonomic unit. The determining is based on the properties of the graph. The method can also include reporting the determination.
    Type: Grant
    Filed: September 27, 2019
    Date of Patent: November 29, 2022
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Niina S. Haiminen, Laxmi P. Parida
  • Patent number: 11410749
    Abstract: Various embodiments perform stable gene analysis of transcriptome sequencing data. In one embodiment, a plurality of datasets each including transcriptome sequencing data are received by a processor. Each of the plurality of datasets includes a plurality of genes and a respective ranking value for each of the plurality of genes. A plurality of rank normalized input datasets is generated based on assigning, for each of the plurality of datasets, a rank to each of the plurality of genes. One or more longest increasing subsequence (LIS) of ranks are identified between each pair of the plurality of rank normalized input datasets. A set of stable genes from the plurality of genes is identified based on each of the one or more LIS of ranks across the plurality of rank normalized input datasets.
    Type: Grant
    Filed: August 30, 2018
    Date of Patent: August 9, 2022
    Assignee: International Business Machines Corporation
    Inventors: Niina S. Haiminen, Laxmi P. Parida
  • Patent number: 11335434
    Abstract: Various embodiments select markers for modeling epistasis effects. In one embodiment, a processor receives a set of genetic markers and a phenotype. A relevance score is determined with respect to the phenotype for each of the set of genetic markers. A threshold is set based on the relevance score of a genetic marker with a highest relevancy score. A relevance score is determined for at least one genetic marker in the set of genetic markers for at least one interaction between the at least one genetic marker and at least one other genetic marker in the set of genetic markers. The at least one interaction is added to a top-k feature set based on the relevance score of the at least one interaction satisfying the threshold.
    Type: Grant
    Filed: September 14, 2018
    Date of Patent: May 17, 2022
    Assignee: International Business Machines Corporation
    Inventors: David C. Haws, Dan He, Laxmi P. Parida
  • Patent number: 11335433
    Abstract: Various embodiments select markers for modeling epistasis effects. In one embodiment, a processor receives a set of genetic markers and a phenotype. A relevance score is determined with respect to the phenotype for each of the set of genetic markers. A threshold is set based on the relevance score of a genetic marker with a highest relevancy score. A relevance score is determined for at least one genetic marker in the set of genetic markers for at least one interaction between the at least one genetic marker and at least one other genetic marker in the set of genetic markers. The at least one interaction is added to a top-k feature set based on the relevance score of the at least one interaction satisfying the threshold.
    Type: Grant
    Filed: September 14, 2018
    Date of Patent: May 17, 2022
    Assignee: International Business Machines Corporation
    Inventors: David C. Haws, Dan He, Laxmi P. Parida
  • Patent number: 11194922
    Abstract: Embodiments of the invention include systems and methods for protecting study participant data for aggregate analysis. Aspects include sending a broker encryption key to a plurality of subjects. Aspects also include receiving double-encrypted subject data from the plurality of subjects. Aspects also include decrypting the double-encrypted subject data with a broker decryption key to generate single-encrypted subject data for the plurality of subjects. Aspects also include aggregating the single-encrypted subject data for the plurality of subjects to generate an aggregated single-homomorphically encrypted data set. Aspects also include including a plurality of random factors in the aggregated single-encrypted data set. Aspects also include sending the aggregated single-homomorphically encrypted data set to a researcher.
    Type: Grant
    Filed: February 28, 2018
    Date of Patent: December 7, 2021
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Laxmi P. Parida, Daniel Enoch Platt
  • Patent number: 11177018
    Abstract: Various embodiments perform stable gene analysis of transcriptome sequencing data. In one embodiment, a plurality of datasets each including transcriptome sequencing data are received by a processor. Each of the plurality of datasets includes a plurality of genes and a respective ranking value for each of the plurality of genes. A plurality of rank normalized input datasets is generated based on assigning, for each of the plurality of datasets, a rank to each of the plurality of genes. One or more longest increasing subsequence (LIS) of ranks are identified between each pair of the plurality of rank normalized input datasets. A set of stable genes from the plurality of genes is identified based on each of the one or more LIS of ranks across the plurality of rank normalized input datasets.
    Type: Grant
    Filed: August 28, 2018
    Date of Patent: November 16, 2021
    Assignee: International Business Machines Corporation
    Inventors: Niina S. Haiminen, Laxmi P. Parida
  • Patent number: 10733214
    Abstract: A method includes generating, by a processor system, a graph. The graph is based at least in part on a plurality of instances in which operational taxonomic units are identified as being represented within an environment. The method can also include determining, using the processor system, that at least one instance of the plurality of instances corresponds to a false-positive identification of an operational taxonomic unit. The determining is based on the properties of the graph. The method can also include reporting the determination.
    Type: Grant
    Filed: March 20, 2017
    Date of Patent: August 4, 2020
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Niina S. Haiminen, Laxmi P. Parida
  • Patent number: 10607718
    Abstract: Embodiments of the present invention include method, systems and computer program products for algebraic phasing of polyploids. Aspects of the invention include receiving a matrix including a set of two or more single-nucleotide poloymorphisms (SNPs) for two or more sample organisms. Each row of the matrix is set to a ploidy based on a number of ploidies present in the two or more sample organisms. Each allele in the set of two or more SNPs is represented as a binary number. A set of algebraic rules is received, wherein the set of algebraic rules include an algebraic phasing algorithm. And the set of algebraic rules are applied to the matrix to determine a haplotype of a parent of the two or more sample organisms.
    Type: Grant
    Filed: May 14, 2019
    Date of Patent: March 31, 2020
    Assignee: INTERNATIONAL BUSINESS MACHINES CORORATION
    Inventors: Laxmi P. Parida, Filippo Utro
  • Publication number: 20200026718
    Abstract: A method includes generating, by a processor system, a graph. The graph is based at least in part on a plurality of instances in which operational taxonomic units are identified as being represented within an environment. The method can also include determining, using the processor system, that at least one instance of the plurality of instances corresponds to a false-positive identification of an operational taxonomic unit. The determining is based on the properties of the graph. The method can also include reporting the determination.
    Type: Application
    Filed: September 27, 2019
    Publication date: January 23, 2020
    Inventors: NIINA S. HAIMINEN, LAXMI P. PARIDA
  • Patent number: 10468122
    Abstract: Various embodiments reconstruct haplotypes from genotype data. In one embodiment, a set of progeny genotype data comprising n progenies encoded with m genetic markers is accessed. A first set of parent haplotypes associated with a first parent of the n progenies and a second set of parent haplotypes associated with a second parent of the n progenies are identified based on at least the set of progeny genotype data. A total minimum number of observable crossovers in the n progenies is determined. An agglomerate data structure comprising a collection of sets of haplotype sequences characterizing the n progenies is constructed based on the set of progeny genotype data and the first and second sets of parent haplotypes. Each set of haplotype sequences includes a number of crossovers equal to the total minimum number of observable crossovers in the n progenies.
    Type: Grant
    Filed: June 21, 2012
    Date of Patent: November 5, 2019
    Assignee: International Business Machines Corporation
    Inventors: Niina S. Haiminen, Laxmi P. Parida, Filippo Utro
  • Patent number: 10460832
    Abstract: A system for reconstructing haplotypes from genotype data includes a memory, a processor, and a reconstruction module. The reconstruction module is configured to access a set of progeny genotype data including n progenies encoded with m genetic markers. A first set of parent haplotypes associated with a first parent of the n progenies and a second set of parent haplotypes associated with a second parent of the n progenies are identified based on at least the set of progeny genotype data. An agglomerate data structure including a collection of sets of haplotype sequences characterizing the n progenies is constructed based on the set of progeny genotype data and the first and second sets of parent haplotypes. Each set of haplotype sequences includes a number of crossovers equal to a total minimum number of observable crossovers in the n progenies.
    Type: Grant
    Filed: February 7, 2013
    Date of Patent: October 29, 2019
    Assignee: International Business Machines Corporation
    Inventors: Niina S. Haiminen, Laxmi P. Parida, Filippo Utro
  • Publication number: 20190267111
    Abstract: Embodiments of the present invention include method, systems and computer program products for algebraic phasing of polyploids. Aspects of the invention include receiving a matrix including a set of two or more single-nucleotide poloymorphisms (SNPs) for two or more sample organisms. Each row of the matrix is set to a ploidy based on a number of ploidies present in the two or more sample organisms. Each allele in the set of two or more SNPs is represented as a binary number. A set of algebraic rules is received, wherein the set of algebraic rules include an algebraic phasing algorithm. And the set of algebraic rules are applied to the matrix to determine a haplotype of a parent of the two or more sample organisms.
    Type: Application
    Filed: May 14, 2019
    Publication date: August 29, 2019
    Inventors: Laxmi P. Parida, Filippo Utro
  • Publication number: 20190266343
    Abstract: Embodiments of the invention include systems and methods for protecting study participant data for aggregate analysis. Aspects include sending a broker encryption key to a plurality of subjects. Aspects also include receiving double-encrypted subject data from the plurality of subjects. Aspects also include decrypting the double-encrypted subject data with a broker decryption key to generate single-encrypted subject data for the plurality of subjects. Aspects also include aggregating the single-encrypted subject data for the plurality of subjects to generate an aggregated single-homomorphically encrypted data set. Aspects also include including a plurality of random factors in the aggregated single-encrypted data set. Aspects also include sending the aggregated single-homomorphically encrypted data set to a researcher.
    Type: Application
    Filed: February 28, 2018
    Publication date: August 29, 2019
    Inventors: Laxmi P. PARIDA, Daniel Enoch PLATT
  • Patent number: 10373707
    Abstract: Embodiments of the present invention include method, systems and computer program products for algebraic phasing of polyploids. Aspects of the invention include receiving a matrix including a set of two or more single-nucleotide poloymorphisms (SNPs) for two or more sample organisms. Each row of the matrix is set to a ploidy based on a number of ploidies present in the two or more sample organisms. Each allele in the set of two or more SNPs is represented as a binary number. A set of algebraic rules is received, wherein the set of algebraic rules include an algebraic phasing algorithm. And the set of algebraic rules are applied to the matrix to determine a haplotype of a parent of the two or more sample organisms.
    Type: Grant
    Filed: December 31, 2018
    Date of Patent: August 6, 2019
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Laxmi P. Parida, Filippo Utro
  • Publication number: 20190220572
    Abstract: Embodiments include methods, systems, and computer program products for analyzing mutational evolution. Aspects include receiving a whole genome data set for a patient including a plurality of mutations. Aspects also include determining a variant allele frequency for each of the plurality of mutations. Aspects also include labeling each of the plurality of mutations with a gene set designation. Aspects also include constructing an evolution topology comprising an ordered representation of the plurality of mutations, wherein each of the plurality of mutations comprises one of the gene set designations.
    Type: Application
    Filed: January 16, 2018
    Publication date: July 18, 2019
    Inventors: Chaya LEVOVITZ, Laxmi P. PARIDA, Kahn RHRISSORRAKRAI
  • Publication number: 20190138687
    Abstract: Embodiments include methods, systems, and computer program products for identifying content in a metagenomic sample. Aspects include receiving a plurality of metagenomic reads for a sample. Aspects also include comparing a portion of the plurality of metagenomic reads to a genomic database and identifying a plurality of associated nodes, wherein the genomic database includes a plurality of gene sequences having known taxonomies. Aspects also include generating a probabilistic score for each of the associated nodes per metagenomic read. Aspects also include generating an output including a plurality of identifications and a final probability score for each of the identifications based at least in part upon the probabilistic score for each of the associated nodes per metagenomic read.
    Type: Application
    Filed: November 3, 2017
    Publication date: May 9, 2019
    Inventors: Niina S. HAIMINEN, Laxmi P. PARIDA, Enrico SIRAGUSA, Filippo UTRO
  • Publication number: 20190139626
    Abstract: Embodiments of the present invention include method, systems and computer program products for algebraic phasing of polyploids. Aspects of the invention include receiving a matrix including a set of two or more single-nucleotide poloymorphisms (SNPs) for two or more sample organisms. Each row of the matrix is set to a ploidy based on a number of ploidies present in the two or more sample organisms. Each allele in the set of two or more SNPs is represented as a binary number. A set of algebraic rules is received, wherein the set of algebraic rules include an algebraic phasing algorithm. And the set of algebraic rules are applied to the matrix to determine a haplotype of a parent of the two or more sample organisms.
    Type: Application
    Filed: December 31, 2018
    Publication date: May 9, 2019
    Inventors: Laxmi P. Parida, Filippo Utro
  • Publication number: 20190114392
    Abstract: Embodiments include methods, systems, and computer program products for identifying a metagenomic read. Aspects include receiving a metagenomic read of a sample. Aspects also include comparing the metagenomic read to a genomic database including a plurality of gene sequences. Aspects also include randomly fragmenting the metagenomic read into two read fragments. Aspects also include comparing the read fragments to the genomic database including a plurality of gene sequences. Aspects also include generating an identification based at least in part on a determination that a read fragment exactly matches a gene sequence.
    Type: Application
    Filed: October 17, 2017
    Publication date: April 18, 2019
    Inventors: Laxmi P. PARIDA, Filippo UTRO
  • Publication number: 20190102514
    Abstract: Embodiments of the present invention are directed to a computer-implemented method for generating a list of genetic variants. A non-limiting example of the computer-implemented method includes receiving genetic and biological data. The exemplary method also includes generating data patterns from the genetic and biological data with data mining. The method also includes determining redescription distances between each of a plurality of data patterns. The method also includes generating computational homology filtrations from the redescription distances using a topological data analysis and homology groups including homology group elements based upon the computational homology filtrations. The method also includes generating a single nucleotide polymorphism combination list based upon the homology group elements and redescription clusters.
    Type: Application
    Filed: November 2, 2017
    Publication date: April 4, 2019
    Inventors: Laxmi P. PARIDA, Daniel E. PLATT
  • Publication number: 20190102513
    Abstract: Embodiments of the present invention are directed to a computer-implemented method for generating a list of genetic variants. A non-limiting example of the computer-implemented method includes receiving genetic and biological data. The exemplary method also includes generating data patterns from the genetic and biological data with data mining. The method also includes determining redescription distances between each of a plurality of data patterns. The method also includes generating computational homology filtrations from the redescription distances using a topological data analysis and homology groups including homology group elements based upon the computational homology filtrations. The method also includes generating a single nucleotide polymorphism combination list based upon the homology group elements and redescription clusters.
    Type: Application
    Filed: October 4, 2017
    Publication date: April 4, 2019
    Inventors: Laxmi P. PARIDA, Daniel E. PLATT