Patents Assigned to SEVEN BRIDGES GENOMICS INC.
  • Patent number: 12205679
    Abstract: Genomic data is written to disk in a compact format by dividing the data into segments and encoding each segment with the smallest number of bits per character necessary for whatever alphabet of characters appears in that segment. A computer system dynamically chooses the segment boundaries for maximum space savings. A first one of the segments may use a different number of bits per character than a second one of the segments. In one embodiment, dividing the data into segments comprises scanning the data and keeping track of a number of unique characters, noting positions in the sequence where the number increases to a power of two, calculating a compression that would be obtained by dividing the genomic data into one of the plurality of segments at ones of the noted positions, and dividing the genomic data into the plurality of segments at the positions that yield the best compression.
    Type: Grant
    Filed: August 20, 2020
    Date of Patent: January 21, 2025
    Assignee: SEVEN BRIDGES GENOMICS INC.
    Inventor: Vladimir Semenyuk
  • Patent number: 12119089
    Abstract: Embodiments of the invention utilize a graph-based approach for simulating genomic datasets from large scale populations. Genomic data may be represented as a directed acyclic graph (DAG) that incorporates individual sample data including variant type, position, and zygosity. A simulator may operate on the DAG to generate variant datasets based on probabilistic traversal of the DAG. This probabilistic traversal reflects genomic variant types associated with the subpopulation used to build the DAG, and as a result, the generated variant datasets maintain statistical fidelity to the original sample data.
    Type: Grant
    Filed: April 6, 2020
    Date of Patent: October 15, 2024
    Assignee: SEVEN BRIDGES GENOMICS, INC.
    Inventors: Agata Foryciarz, Dennis A. Dean, II
  • Patent number: 11558487
    Abstract: A method for stream-processing biomedical data includes receiving, by a file system on a computing device, a first request for access to at least a first portion of a file stored on a remotely located storage device. The method includes receiving, by the file system, a second request for access to at least a second portion of the file. The method includes determining, by a pre-fetching component executing on the computing device, whether the first request and the second request are associated with a sequential read operation. The method includes automatically retrieving, by the pre-fetching component, a third portion of the requested file, before receiving a third request for access to least the third portion of the file, based on a determination that the first request and the second request are associated with the sequential read operation.
    Type: Grant
    Filed: March 3, 2021
    Date of Patent: January 17, 2023
    Assignee: SEVEN BRIDGES GENOMICS INC.
    Inventor: Nemanja Zbiljic
  • Patent number: 11327792
    Abstract: Various approaches for data storage and retrieval for a computer memory include processing a computational workflow having multiple data-processing steps, generating and storing a first hash value associated with a first step of the data-processing steps based on an input to the first step, generating and storing a second hash value associated with a second step of the data-processing steps based on the generated first hash value, and reconstructing a computational state of the workflow based on the second hash value, and thereby avoid re-execution of a portion of the workflow corresponding to the second hash value.
    Type: Grant
    Filed: December 19, 2019
    Date of Patent: May 10, 2022
    Assignee: SEVEN BRIDGES GENOMICS, INC.
    Inventor: Yilong Li
  • Patent number: 11269697
    Abstract: Some embodiments relate to systems for processing one or more computational workflows. In one embodiment, a description of a computational comprises a plurality of applications, in which applications are represented as nodes and edges connect the nodes indicate the flow of data elements between applications. A task execution module is configured to create and execute tasks. An application programming interface (API) is in communication with the task execution module and comprises a plurality of function calls for controlling at least one function of the task execution module. An API script includes instructions to the API to create and execute a plurality of tasks corresponding to the execution of the computational workflow for a plurality of samples. A graphical user interface (GUI) is in communication with the task execution module and configured to receive input from an end user to initiate execution of the API script.
    Type: Grant
    Filed: April 29, 2020
    Date of Patent: March 8, 2022
    Assignee: SEVEN BRIDGES GENOMICS, INC.
    Inventors: Christian Frech, Raunaq Malhotra
  • Patent number: 11055135
    Abstract: In one aspect, a method for scheduling jobs in a computational workflow includes identifying, from a computational workflow by a workflow execution engine executing on a processor, a plurality of jobs ready for execution. The method includes sorting, based on computational resource requirements associated with each identified job, the identified jobs into a prioritized queue. The method includes provisioning one or more computational instances based on the computational resource requirements of the identified jobs in the prioritized queue, wherein at least one computational instance is provisioned based on a highest priority job in the queue. The method includes submitting the prioritized jobs for execution to the one or more computational instances.
    Type: Grant
    Filed: May 29, 2018
    Date of Patent: July 6, 2021
    Assignee: SEVEN BRIDGES GENOMICS, INC.
    Inventors: Milos Popovic, Goran Rakocevic, Mihailo Andrejevic, Aleksandar Minic
  • Patent number: 11037345
    Abstract: In one embodiment, a method of processing a computational workflow comprises receiving a description of a computational workflow. The description comprises a plurality of steps, in which each step has at least one input and at least one output, and further wherein an input from a second step depends on an output from a first step. The description is translated into a static workflow graph stored in a memory, the static workflow graph comprising a plurality of nodes having input ports and output ports, wherein dependencies between inputs and outputs are specified as edges between input ports and output ports. Information about a first set of nodes is then extracted from the static workflow graph and placed into a dynamic graph. A first actionable job is identified from the dynamic graph and executed.
    Type: Grant
    Filed: April 24, 2020
    Date of Patent: June 15, 2021
    Assignee: SEVEN BRIDGES GENOMICS, INC.
    Inventors: Janko Simonovic, Sinisa Ivkovic, Nebojsa Tijanic
  • Patent number: 10790044
    Abstract: Genomic data is written to disk in a compact format by dividing the data into segments and encoding each segment with the smallest number of bits per character necessary for whatever alphabet of characters appears in that segment. A computer system dynamically chooses the segment boundaries for maximum space savings. A first one of the segments may use a different number of bits per character than a second one of the segments. In one embodiment, dividing the data into segments comprises scanning the data and keeping track of a number of unique characters, noting positions in the sequence where the number increases to a power of two, calculating a compression that would be obtained by dividing the genomic data into one of the plurality of segments at ones of the noted positions, and dividing the genomic data into the plurality of segments at the positions that yield the best compression.
    Type: Grant
    Filed: May 17, 2017
    Date of Patent: September 29, 2020
    Assignee: SEVEN BRIDGES GENOMICS INC.
    Inventor: Vladimir Semenyuk
  • Patent number: 10678613
    Abstract: Some embodiments relate to systems for processing one or more computational workflows. In one embodiment, a description of a computational comprises a plurality of applications, in which applications are represented as nodes and edges connect the nodes indicate the flow of data elements between applications. A task execution module is configured to create and execute tasks. An application programming interface (API) is in communication with the task execution module and comprises a plurality of function calls for controlling at least one function of the task execution module. An API script includes instructions to the API to create and execute a plurality of tasks corresponding to the execution of the computational workflow for a plurality of samples. A graphical user interface (GUI) is in communication with the task execution module and configured to receive input from an end user to initiate execution of the API script.
    Type: Grant
    Filed: October 31, 2018
    Date of Patent: June 9, 2020
    Assignee: SEVEN BRIDGES GENOMICS INC.
    Inventors: Christian Frech, Raunaq Malhotra
  • Patent number: 10672156
    Abstract: In one embodiment, a method of processing a computational workflow comprises receiving a description of a computational workflow. The description comprises a plurality of steps, in which each step has at least one input and at least one output, and further wherein an input from a second step depends on an output from a first step. The description is translated into a static workflow graph stored in a memory, the static workflow graph comprising a plurality of nodes having input ports and output ports, wherein dependencies between inputs and outputs are specified as edges between input ports and output ports. Information about a first set of nodes is then extracted from the static workflow graph and placed into a dynamic graph. A first actionable job is identified from the dynamic graph and executed.
    Type: Grant
    Filed: March 9, 2017
    Date of Patent: June 2, 2020
    Assignee: SEVEN BRIDGES GENOMICS INC.
    Inventors: Janko Simonovic, Sinisa Ivkovic, Nebojsa Tijanic
  • Patent number: 10629292
    Abstract: Embodiments of the invention utilize a graph-based approach for simulating genomic datasets from large scale populations. Genomic data may be represented as a directed acyclic graph (DAG) that incorporates individual sample data including variant type, position, and zygosity. A simulator may operate on the DAG to generate variant datasets based on probabilistic traversal of the DAG. This probabilistic traversal reflects genomic variant types associated with the subpopulation used to build the DAG, and as a result, the generated variant datasets maintain statistical fidelity to the original sample data.
    Type: Grant
    Filed: December 19, 2016
    Date of Patent: April 21, 2020
    Assignee: SEVEN BRIDGES GENOMICS, INC.
    Inventors: Agata Foryciarz, Dennis A. Dean, II
  • Patent number: 10545955
    Abstract: A method for generating a query of a genomic data store includes receiving, by a query generator executing on a computing device, from a graphical user interface, an identification of a first entity of a first entity class for inclusion in a resource description framework (RDF) query. The method includes receiving from the graphical user interface, an identification of a second entity of the first entity class, the second entity having a bi-directional relationship with the first entity. The method includes automatically generating an RDF query based upon the received identification of the first entity and the received identification of the second entity. The method includes executing the RDF query to select, from a plurality of genomic data sets, at least one genomic data set for at least one patient cohort. The method includes providing a listing of genomic data sets resulting from executing the RDF query.
    Type: Grant
    Filed: January 13, 2017
    Date of Patent: January 28, 2020
    Assignee: SEVEN BRIDGES GENOMICS INC.
    Inventors: Dragan Djordjevic, Filip Jelic, Dragan Bajcic, Jovan Cejovic, Adam Stanojevic, Milos Nesic, Jelena Radenkovic, Vladimir Mladenovic
  • Patent number: 10545792
    Abstract: Systems and methods for data storage and retrieval for a computer memory include processing a computational workflow having multiple data-processing steps, generating and storing a first hash value associated with a first step of the data-processing steps based on an input to the first step, generating and storing a second hash value associated with a second step of the data-processing steps based on the generated first hash value, and reconstructing a computational state of the workflow based on the second hash value, and thereby avoid re-execution of a portion of the workflow corresponding to the second hash value.
    Type: Grant
    Filed: September 8, 2017
    Date of Patent: January 28, 2020
    Assignee: SEVEN BRIDGES GENOMICS INC.
    Inventor: Yilong Li
  • Patent number: 10460829
    Abstract: In one embodiment, a method of encoding variation data for a population comprises receiving, by a variant encoding engine executing on a processor, information describing genetic variation of a population of individuals. The information comprises a plurality of variable sites within the reference genome of the population and the genotypes of a plurality of individuals in the population with respect to those variable sites. The method further comprises selecting an encoding strategy for the information based on the characteristics of the genetic variation across the population, and encoding the information according to the selected encoding strategy. In certain embodiments, selecting an encoding strategy may comprise determining the variability of a variable site within the population, and encoding information associated with the variable site based on the variability.
    Type: Grant
    Filed: January 26, 2016
    Date of Patent: October 29, 2019
    Assignee: SEVEN BRIDGES GENOMICS INC.
    Inventor: Benjamin Murray
  • Patent number: 10275567
    Abstract: The invention relates to methods for determining a haplotype for an organism by using a system for transforming SNP alleles found in sequence fragments into vertices in a graph with edges connecting vertices for alleles that appear together in a sequence fragment. A community detection operation can be used to infer the haplotype from the graph. The system may produce a report that includes the haplotype of the SNPs found in the genome of that organism.
    Type: Grant
    Filed: January 21, 2016
    Date of Patent: April 30, 2019
    Assignee: SEVEN BRIDGES GENOMICS INC.
    Inventors: Milos Popovic, Goran Rakocevic
  • Patent number: 9898575
    Abstract: The invention includes methods for aligning reads (e.g., nucleic acid reads, amino acid reads) to a reference sequence construct, methods for building the reference sequence construct, and systems that use the alignment methods and constructs to produce sequences. The method is scalable, and can be used to align millions of reads to a construct thousands of bases or amino acids long. The invention additionally includes methods for identifying a disease or a genotype based upon alignment of nucleic acid reads to a location in the construct.
    Type: Grant
    Filed: September 3, 2013
    Date of Patent: February 20, 2018
    Assignee: SEVEN BRIDGES GENOMICS INC.
    Inventor: Deniz Kural
  • Publication number: 20150112602
    Abstract: Methods of analyzing a transcriptome that involves obtaining at least one pair of paired-end reads from a transcriptome from an organism, finding an alignment with an optimal score between a first read of the pair and a node in a directed acyclic data structure (the data structure has nodes representing RNA sequences such as exons or transcripts and edges connecting pairs of nodes), identifying candidate paths that include the node connected to a downstream node by a path having a length substantially similar to an insert length of the pair of paired-end reads, and aligning the paired-end rends to the candidate paths to determine an optimal-scoring alignment.
    Type: Application
    Filed: January 17, 2014
    Publication date: April 23, 2015
    Applicant: SEVEN BRIDGES GENOMICS INC.
    Inventors: Deniz Kural, Nathan Meyvis
  • Publication number: 20150112658
    Abstract: The invention generally provides systems and methods for analysis of RNA-Seq reads in which an annotated reference is represented as a directed acyclic graph (DAG) or similar data structure. Features such as exons and introns from the reference provide nodes in the DAG and those features are linked as pairs in their canonical genomic order by edges. The DAG can scale to any size and can in fact be populated in the first instance by import from an extrinsic annotated reference.
    Type: Application
    Filed: January 17, 2014
    Publication date: April 23, 2015
    Applicant: SEVEN BRIDGES GENOMICS INC.
    Inventors: Deniz Kural, Nathan Meyvis
  • Publication number: 20150057946
    Abstract: The invention includes methods for aligning reads (e.g., nucleic acid reads, amino acid reads) to a reference sequence construct, methods for building the reference sequence construct, and systems that use the alignment methods and constructs to produce sequences. The method is scalable, and can be used to align millions of reads to a construct thousands of bases or amino acids long. The invention additionally includes methods for identifying a disease or a genotype based upon alignment of nucleic acid reads to a location in the construct.
    Type: Application
    Filed: September 3, 2013
    Publication date: February 26, 2015
    Applicant: SEVEN BRIDGES GENOMICS INC.
    Inventor: Deniz Kural
  • Publication number: 20150056613
    Abstract: The invention provides methods for identifying rare variants near a structural variation in a genetic sequence, for example, in a nucleic acid sample taken from a subject. The invention additionally includes methods for aligning reads (e.g., nucleic acid reads) to a reference sequence construct accounting for the structural variation, methods for building a reference sequence construct accounting for the structural variation or the structural variation and the rare variant, and systems that use the alignment methods to identify rare variants. The method is scalable, and can be used to align millions of reads to a construct thousands of bases long, or longer.
    Type: Application
    Filed: September 30, 2013
    Publication date: February 26, 2015
    Applicant: SEVEN BRIDGES GENOMICS INC.
    Inventor: Deniz Kural