Patents by Inventor Michael Ruehle

Michael Ruehle has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240127905
    Abstract: This disclosure describes methods, non-transitory computer readable media, and systems that can generate genotype calls from a combined pipeline for processing nucleotide reads from multiple read types/sources for robust, accurate genotype calls. For example, the disclosed systems can train and/or utilize a genotype-call-integration machine-learning model to generate predictions for genotype calls based on data associated with a first type of nucleotide reads (e.g., short reads) and a second type of nucleotide reads (e.g., long reads). As disclosed, the disclosed systems can determine sequencing metrics and can utilize a genotype-call-integration machine-learning model to generate predictions (e.g., genotype probabilities, variant call classifications) for generating output genotype calls based on the sequencing metrics.
    Type: Application
    Filed: October 4, 2023
    Publication date: April 18, 2024
    Inventors: Gavin Derek Parnaby, Seyedmohammadjafar Hashemidoulabi, Aaron L. Halpern, Michael Ruehle
  • Publication number: 20240061843
    Abstract: Methods, systems, and apparatuses, including computer programs for generating and using a hash table configured to improve mapping of reads are disclosed that include obtaining a first seed of K nucleotides from a reference sequence, generating a seed extension tree having a nodes, wherein each node of the nodes corresponds to (i) an extended seed that is an extension of the first seed and has a nucleotide length of K* and (ii) one or more locations, in a seed extension table, that include data describing reference sequence locations that match the extended seed, and for each node: storing interval information at a location of the hash table that corresponds to an index key for the extended seed, wherein the interval information references one or more locations in the seed extension table that include reference sequence locations that match the extended seed associated with the node.
    Type: Application
    Filed: October 30, 2023
    Publication date: February 22, 2024
    Inventor: Michael Ruehle
  • Publication number: 20230420075
    Abstract: This disclosure describes methods, non-transitory computer readable media, and systems that can determine allele likelihoods of a genomic region exhibiting certain haplotype alleles using one or both of consolidated computations and data exchanges across specialized hardware. For instance, the disclosed systems can determine an intermediate allele likelihood of a genomic region comprising a haplotype allele by running a single-pass-concurrent-multiplication operation. In some cases, the disclosed systems determine and store subsets of intermediate allele likelihoods corresponding to marker-variant groups and extemporaneously generate sets of intermediate allele likelihoods for a set of marker variants by using the intermediate-allele-likelihood subsets as hot-start points.
    Type: Application
    Filed: June 27, 2023
    Publication date: December 28, 2023
    Inventors: Mark David Hahm, Sven Bilke, Andrew Christopher Du Preez, Michael Ruehle
  • Publication number: 20230420080
    Abstract: The present disclosure relates to systems, non-transitory computer-readable media, and methods for efficiently identifying and selecting split groups corresponding to one or more nucleotide reads. Generally, split groups comprise chains of fragments forming split-alignments of one read. The disclosed system utilizes dynamic programming to generate and evaluate candidate split groups. The disclosed system can generate split group scores for each of the candidate split groups. To generate the split group scores, the disclosed system considers fragment alignment scores and geometries of fragment alignments within the candidate split groups. The disclosed systems select a predicted split group from the candidate split groups based on the split group scores.
    Type: Application
    Filed: June 23, 2023
    Publication date: December 28, 2023
    Inventor: Michael Ruehle
  • Patent number: 11842796
    Abstract: A system, method and apparatus for executing a bioinformatics analysis on genetic sequence data includes an integrated circuit formed of a set of hardwired digital logic circuits that are interconnected by physical electrical interconnects. One of the physical electrical interconnects forms an input to the integrated circuit that may be connected with an electronic data source for receiving reads of genomic data. The hardwired digital logic circuits may be arranged as a set of processing engines, each processing engine being formed of a subset of the hardwired digital logic circuits to perform one or more steps in the bioinformatics analysis on the reads of genomic data. Each subset of the hardwired digital logic circuits may be formed in a wired configuration to perform the one or more steps in the bioinformatics analysis.
    Type: Grant
    Filed: March 27, 2020
    Date of Patent: December 12, 2023
    Assignee: Edico Genome Corporation
    Inventors: Pieter van Rooyen, Michael Ruehle, Rami Mehio
  • Patent number: 11803554
    Abstract: Methods, systems, and apparatuses, including computer programs for generating and using a hash table configured to improve mapping of reads are disclosed that include obtaining a first seed of K nucleotides from a reference sequence, generating a seed extension tree having a nodes, wherein each node of the nodes corresponds to (i) an extended seed that is an extension of the first seed and has a nucleotide length of K* and (ii) one or more locations, in a seed extension table, that include data describing reference sequence locations that match the extended seed, and for each node: storing interval information at a location of the hash table that corresponds to an index key for the extended seed, wherein the interval information references one or more locations in the seed extension table that include reference sequence locations that match the extended seed associated with the node.
    Type: Grant
    Filed: May 22, 2020
    Date of Patent: October 31, 2023
    Assignee: Illumina, Inc.
    Inventor: Michael Ruehle
  • Publication number: 20230298703
    Abstract: Disclosed herein include systems, machines, devices, and methods for single-pass methylation mapping. C-to-T converted sequence reads and G-to-A converted sequence reads generated from a sample subjected to a methylation assay can be mapped to a mapping reference sequence comprising a C-to-T converted reference sequence and a G-to-A converted reference sequence generated to a reference genome sequence. The counts of Cs and Ts of sequence reads mapped to each of one or more positions with Cs in the reference genome sequence can be used to determine whether the position is a methylated C or an unmethylated C in the sample.
    Type: Application
    Filed: March 14, 2023
    Publication date: September 21, 2023
    Inventors: John Cooper Roddey, Asaf Levy, Mengchi Wang, Adam Birnbaum, Brian Y. Tsui, Michael Ruehle
  • Publication number: 20220230084
    Abstract: A system and method for a reduced computation hidden markov model (HMM) in computational biology applications is disclosed herein. The method includes performing a correlation between the haplotype sequence and the read sequence at the HMM pre-filter engine. The method includes computing a MINI metric from a reduced number of cells at the HMM computation engine.
    Type: Application
    Filed: January 27, 2022
    Publication date: July 21, 2022
    Inventors: Mark Hahm, Michael Ruehle, Rami Mehio
  • Publication number: 20220019919
    Abstract: A system and method for a reduced computation hidden markov model (HMM) in computational biology applications is disclosed herein. The method includes performing a correlation between the haplotype sequence and the read sequence at the HMM pre-filter engine. The method includes computing a HMM metric from a reduced number of cells at the HMM computation engine.
    Type: Application
    Filed: July 29, 2015
    Publication date: January 20, 2022
    Inventors: Mark Hahm, Michael Ruehle, Rami Mehio
  • Publication number: 20210313014
    Abstract: A system, method and apparatus for executing a sequence analysis pipeline on genetic sequence data includes an integrated circuit formed of a set of hardwired digital logic circuits that are interconnected by physical electrical interconnects. One of the physical electrical interconnects forms an input to the integrated circuit connected with an electronic data source for receiving reads of genomic data. The hardwired digital logic circuits are arranged as a set of processing engines, each processing engine being formed of a subset of the hardwired digital logic circuits to perform one or more steps in the sequence analysis pipeline on the reads of genomic data. Each subset of the hardwired digital logic circuits is formed in a wired configuration to perform the one or more steps in the sequence analysis pipeline.
    Type: Application
    Filed: June 17, 2021
    Publication date: October 7, 2021
    Inventors: Pieter Van Rooyen, Robert J. McMillen, Michael Ruehle
  • Publication number: 20210313009
    Abstract: Methods, systems, and apparatus for hardware-accelerated generation of a K-mer graph using a programmable logic device. In one aspect, a method includes actions of obtaining a first set of nucleic acid sequences, generating a K-mer graph using the obtained first set of nucleic acid sequences and using a plurality of non-pipelined hardware logic units of a programmable logic device, and periodically updating, with a control machine, graph description data for the K-mer graph after performance of the one or more operations by each hardware logic unit.
    Type: Application
    Filed: April 7, 2021
    Publication date: October 7, 2021
    Inventor: Michael Ruehle
  • Publication number: 20210257052
    Abstract: A system, method and apparatus for executing a bioinformatics analysis on genetic sequence data is provided. Particularly, a genomics analysis platform for executing a sequence analysis pipeline is provided. The genomics analysis platform includes one or more of a first integrated circuit, where each first integrated circuit forms a central processing unit (CPU) that is responsive to one or more software algorithms that are configured to instruct the CPU to perform a first set of genomic processing steps of the sequence analysis pipeline.
    Type: Application
    Filed: February 2, 2021
    Publication date: August 19, 2021
    Inventors: Pieter Van Rooyen, Michael Ruehle, Rami Mehio, Gavin Stone, Mark Hahm, Eric Ojard, Amnon Ptashek
  • Patent number: 11049588
    Abstract: A system, method and apparatus for executing a bioinformatics analysis on genetic sequence data is provided. Particularly, a genomics analysis platform for executing a sequence analysis pipeline is provided. The genomics analysis platform includes one or more of a first integrated circuit, where each first integrated circuit forms a central processing unit (CPU) that is responsive to one or more software algorithms that are configured to instruct the CPU to perform a first set of genomic processing steps of the sequence analysis pipeline.
    Type: Grant
    Filed: August 31, 2018
    Date of Patent: June 29, 2021
    Assignee: Illumina, Inc.
    Inventors: Pieter van Rooyen, Michael Ruehle, Rami Mehio, Gavin Stone, Mark Hahm, Eric Ojard, Amnon Ptashek
  • Publication number: 20210193261
    Abstract: A system, method and apparatus for executing a sequence analysis pipeline on genetic sequence data includes a integrated circuit formed of a set of hardwired digital logic circuits that are interconnected by physical electrical interconnects. One of the physical electrical interconnects forms an input to the integrated circuit connected with an electronic data source for receiving reads of genomic data. The hardwired digital logic circuits are arranged as a set of processing engines, each processing engine being formed of a subset of the hardwired digital logic circuits to perform one or more steps in the sequence analysis pipeline on the reads of genomic data. Each subset of the hardwired digital logic circuits is formed in a wired configuration to perform the one or more steps in the sequence analysis pipeline.
    Type: Application
    Filed: September 25, 2020
    Publication date: June 24, 2021
    Inventors: Pieter van Rooyen, Robert J. McMillen, Michael Ruehle, Rami Mehio
  • Publication number: 20210193254
    Abstract: Methods, systems, and apparatuses, including computer programs for identifying a gene fusion in a biological sample is disclosed. The method can include actions of obtaining first data that represents a plurality of aligned reads, identifying a plurality of fusion candidates included within the obtained first data, filtering the plurality of fusion candidates to determine a filtered set of fusion candidates, for each particular fusion candidate of the filtered set of fusion candidates: generating, by one or more computers, input data for input to a machine learning model that includes extracted feature data that to represents the particular fusion candidate, providing the generated input data as an input to the machine learning model that has been trained to generate output data representing a likelihood that a fusion candidate is a valid gene fusion, and determining whether the particular fusion candidate corresponds to a valid gene fusion based on the output data.
    Type: Application
    Filed: December 4, 2020
    Publication date: June 24, 2021
    Inventors: Viraj Deshpande, Johann Felix Wilhelm Schlesinger, Sean Truong, John Cooper Roddey, Michael Ruehle, Severine Catreux, Rami Mehio
  • Patent number: 11043285
    Abstract: A system, method and apparatus for executing a sequence analysis pipeline on genetic sequence data includes an integrated circuit formed of a set of hardwired digital logic circuits that are interconnected by physical electrical interconnects. One of the physical electrical interconnects forms an input to the integrated circuit connected with an electronic data source for receiving reads of genomic data. The hardwired digital logic circuits are arranged as a set of processing engines, each processing engine being formed of a subset of the hardwired digital logic circuits to perform one or more steps in the sequence analysis pipeline on the reads of genomic data. Each subset of the hardwired digital logic circuits is formed in a wired configuration to perform the one or more steps in the sequence analysis pipeline.
    Type: Grant
    Filed: February 20, 2018
    Date of Patent: June 22, 2021
    Assignee: Edico Genome Corporation
    Inventors: Pieter Van Rooyen, Robert McMillen, Michael Ruehle
  • Publication number: 20210183468
    Abstract: Systems, methods, and computer programs for analyzing genetic sequence data is disclosed. In one aspect, the system can include one or more of a first integrated circuit, with each first integrated circuit forming a central processing unit (CPU) that is responsive to one or more software algorithms that are configured to instruct the CPU to perform a first set of genomic processing steps of a sequence analysis pipeline. Additionally, the system can include one or more second integrated circuits, with each second integrated circuit forming a field programmable gate array (FPGA). The FPGA can be configured by firmware to arrange a set of hardwired digital logic circuits to perform a second set of genomic processing stages of the sequence analysis pipeline, the set of hardwired digital logic circuits of each FPGA being arranged as a set of processing engines to perform the second set of genomic processing stages.
    Type: Application
    Filed: February 2, 2021
    Publication date: June 17, 2021
    Inventors: Mark David Hahm, Jacobus de Beer, Varun Jain, Rami Mehio, Eric Ojard, Michael Ruehle, Amnon Ptashek, Severine Catreux, Arun Visvanath
  • Publication number: 20200372031
    Abstract: Methods, systems, and apparatuses, including computer programs for generating and using a hash table configured to improve mapping of reads are disclosed that include obtaining a first seed of K nucleotides from a reference sequence, generating a seed extension tree having a nodes, wherein each node of the nodes corresponds to (i) an extended seed that is an extension of the first seed and has a nucleotide length of K* and (ii) one or more locations, in a seed extension table, that include data describing reference sequence locations that match the extended seed, and for each node: storing interval information at a location of the hash table that corresponds to an index key for the extended seed, wherein the interval information references one or more locations in the seed extension table that include reference sequence locations that match the extended seed associated with the node.
    Type: Application
    Filed: May 22, 2020
    Publication date: November 26, 2020
    Inventor: Michael Ruehle
  • Patent number: 10847251
    Abstract: A system, method and apparatus for executing a sequence analysis pipeline on genetic sequence data includes a integrated circuit formed of a set of hardwired digital logic circuits that are interconnected by physical electrical interconnects. One of the physical electrical interconnects forms an input to the integrated circuit connected with an electronic data source for receiving reads of genomic data. The hardwired digital logic circuits are arranged as a set of processing engines, each processing engine being formed of a subset of the hardwired digital logic circuits to perform one or more steps in the sequence analysis pipeline on the reads of genomic data. Each subset of the hardwired digital logic circuits is formed in a wired configuration to perform the one or more steps in the sequence analysis pipeline.
    Type: Grant
    Filed: January 11, 2017
    Date of Patent: November 24, 2020
    Assignee: Illumina, Inc.
    Inventors: Pieter van Rooyen, Robert J. McMillen, Michael Ruehle, Rami Mehio
  • Publication number: 20200350038
    Abstract: A system, method and apparatus for executing a bioinformatics analysis on genetic sequence data includes an integrated circuit formed of a set of hardwired digital logic circuits that are interconnected by physical electrical interconnects. One of the physical electrical interconnects forms an input to the integrated circuit that may be connected with an electronic data source for receiving reads of genomic data. The hardwired digital logic circuits may be arranged as a set of processing engines, each processing engine being formed of a subset of the hardwired digital logic circuits to perform one or more steps in the bioinformatics analysis on the reads of genomic data. Each subset of the hardwired digital logic circuits may be formed in a wired configuration to perform the one or more steps in the bioinformatics analysis.
    Type: Application
    Filed: March 27, 2020
    Publication date: November 5, 2020
    Inventors: Pieter van Rooyen, Michael Ruehle, Rami Mehio