Patents by Inventor Kai-How FARH

Kai-How FARH has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20230108368
    Abstract: The technology disclosed relates to training a pathogenicity predictor.
    Type: Application
    Filed: September 26, 2022
    Publication date: April 6, 2023
    Applicants: Illumina, Inc., Illumina Cambridge Limited
    Inventors: Tobias HAMP, Hong GAO, Kai-How FARH
  • Publication number: 20230108241
    Abstract: The technology disclosed relates to determining pathogenicity of nucleotide variants. In particular, the technology disclosed relates to specifying a particular amino acid at a particular position in a protein as a gap amino acid, and specifying remaining amino acids at remaining positions in the protein as non-gap amino acids, generating a gapped spatial representation of the protein that includes spatial configurations of the non-gap amino acids, and excludes a spatial configuration of the gap amino acid, determining an evolutionary conservation at the particular position of respective amino acids of respective amino acid classes based at least in part on the gapped spatial representation, and based at least in part on the evolutionary conservation of the respective amino acids, determining a pathogenicity of respective nucleotide variants that respectively substitute the particular amino acid with the respective amino acids in alternate representations of the protein.
    Type: Application
    Filed: September 26, 2022
    Publication date: April 6, 2023
    Applicants: Illumina, Inc., Illumina Cambridge Limited
    Inventors: Tobias HAMP, Hong GAO, Kai-How FARH
  • Publication number: 20230059877
    Abstract: The technology disclosed relates to constructing a convolutional neural network-based classifier for variant classification. In particular, it relates to training a convolutional neural network-based classifier on training data using a backpropagation-based gradient update technique that progressively match outputs of the convolutional network network-based classifier with corresponding ground truth labels. The convolutional neural network-based classifier comprises groups of residual blocks, each group of residual blocks is parameterized by a number of convolution filters in the residual blocks, a convolution window size of the residual blocks, and an atrous convolution rate of the residual blocks, the size of convolution window varies between groups of residual blocks, the atrous convolution rate varies between groups of residual blocks. The training data includes benign training examples and pathogenic training examples of translated sequence pairs generated from benign variants and pathogenic variants.
    Type: Application
    Filed: October 20, 2022
    Publication date: February 23, 2023
    Applicant: Illumina, Inc.
    Inventors: Kishore JAGANATHAN, Kai-How FARH, Sofia KYRIAZOPOULOU PANAGIOTOPOULOU, Jeremy Francis MCRAE
  • Publication number: 20230047347
    Abstract: The technology disclosed describes determination of which elements of a sequence are nearest to uniformly spaced cells in a grid, where the elements have element coordinates, and the cells have dimension-wise cell indices and cell coordinates. The determination includes generating an element-to-cells mapping that maps, to each of the elements, a subset of the cells. The subset of the cells mapped to a particular element in the sequence includes a nearest cell in the grid and one or more neighborhood cells in the grid, and the nearest cell is selected based on matching element coordinates of the particular element to the cell coordinates. The determination further includes generating a cell-to-elements mapping that maps, to each of the cells, a subset of the elements, and using the cell-to-elements mapping to determine, for each of the cells, a nearest element in the sequence.
    Type: Application
    Filed: October 26, 2022
    Publication date: February 16, 2023
    Applicants: Illumina, Inc., Illumina Cambridge Limited
    Inventors: Tobias HAMP, Hong GAO, Kai-How FARH
  • Publication number: 20230044917
    Abstract: The technology disclosed relates to a variant pathogenicity prediction network. The variant pathogenicity classifier includes memory, a variant encoding sub-network, a protein contact map generation sub-network, and a pathogenicity scoring sub-network. The memory stores a reference amino acid sequence of a protein, and an alternative amino acid sequence of the protein that contains a variant amino acid caused by a variant nucleotide. The variant encoding sub-network is configured to process the alternative amino acid sequence, and generate a processed representation of the alternative amino acid sequence. The protein contact map generation sub-network is configured to process the reference amino acid sequence and the processed representation of the alternative amino acid sequence, and generate a protein contact map of the protein. The pathogenicity scoring sub-network is configured to process the protein contact map, and generate a pathogenicity indication of the variant amino acid.
    Type: Application
    Filed: July 28, 2022
    Publication date: February 9, 2023
    Applicant: ILLUMINA, INC.
    Inventors: Chen CHEN, Hong GAO, Laksshman S. SUNDARAM, Kai-How FARH
  • Publication number: 20230045003
    Abstract: The technology disclosed relates to a variant pathogenicity classifier. The variant pathogenicity classifier comprises memory and runtime logic. The memory stores (i) a reference amino acid sequence of a protein, (ii) an alternative amino acid sequence of the protein that contains a variant amino acid caused by a variant nucleotide, and (iii) a protein contact map of the protein. The runtime logic has access to the memory, and is configured to provide (i) the reference amino acid sequence, (ii) the alternative amino acid sequence, and (iii) the protein contact map as input to a first neural network, and to cause the first neural network to generate a pathogenicity indication of the variant amino acid as output in response to processing (i) the reference amino acid sequence, (ii) the alternative amino acid sequence, and (iii) the protein contact map.
    Type: Application
    Filed: July 28, 2022
    Publication date: February 9, 2023
    Applicant: ILLUMINA, INC.
    Inventors: Chen CHEN, Hong GAO, Laksshman S. SUNDARAM, Kai-How FARH
  • Patent number: 11538555
    Abstract: The technology disclosed relates to determining pathogenicity of nucleotide variants. In particular, the technology disclosed relates to specifying a particular amino acid at a particular position in a protein as a gap amino acid, and specifying remaining amino acids at remaining positions in the protein as non-gap amino acids. The technology disclosed further relates to generating a gapped spatial representation of the protein that includes spatial configurations of the non-gap amino acids, and excludes a spatial configuration of the gap amino acid, and determining a pathogenicity of a nucleotide variant based at least in part on the gapped spatial representation, and a representation of an alternate amino acid created by the nucleotide variant at the particular position.
    Type: Grant
    Filed: November 22, 2021
    Date of Patent: December 27, 2022
    Assignees: Illumina, Inc., Illumina Cambridge Limited
    Inventors: Tobias Hamp, Hong Gao, Kai-How Farh
  • Publication number: 20220406411
    Abstract: An artificial intelligence-based system comprises an input preparation module that accesses a sequence database and generates an input base sequence. The input base sequence comprises a target base sequence with target bases, wherein the target base sequence is flanked by a right base sequence with downstream context bases, and a left base sequence with upstream context bases. A sequence-to-sequence model processes the input base sequence and generates an alternative representation of the input base sequence. An output module processes the alternative representation of the input base sequence and produces at least one per-base output for each of the target bases in the target base sequence. The per-base output specifies, for a corresponding target base, signal levels of a plurality of epigenetic tracks.
    Type: Application
    Filed: September 18, 2020
    Publication date: December 22, 2022
    Applicant: Illumina, Inc.
    Inventors: Sofia KYRIAZOPOULOU PANAGIOTOPOULOU, Kai-How FARH
  • Patent number: 11515010
    Abstract: The technology disclosed relates to determining pathogenicity of variants. In particular, the technology disclosed relates to generating amino acid-wise distance channels for a plurality of amino acids in a protein. Each of the amino acid-wise distance channels has voxel-wise distance values for voxels in a plurality of voxels. A tensor includes the amino acid-wise distance channels and at least an alternative allele of the protein expressed by a variant. A deep convolutional neural network determines a pathogenicity of the variant based at least in part on processing the tensor. The technology disclosed further augments the tensor with supplemental information like a reference allele of the protein, evolutionary conservation data about the protein, annotation data about the protein, and structure confidence data about the protein.
    Type: Grant
    Filed: September 7, 2021
    Date of Patent: November 29, 2022
    Assignees: Illumina, Inc., Illumina Cambridge Limited
    Inventors: Tobias Hamp, Hong Gao, Kai-How Farh
  • Patent number: 11488009
    Abstract: The technology disclosed relates to constructing a convolutional neural network-based classifier for variant classification. In particular, it relates to training a convolutional neural network-based classifier on training data using a backpropagation-based gradient update technique that progressively match outputs of the convolutional neural network-based classifier with corresponding ground truth labels. The convolutional neural network-based classifier comprises groups of residual blocks, each group of residual blocks is parameterized by a number of convolution filters in the residual blocks, a convolution window size of the residual blocks, and an atrous convolution rate of the residual blocks, the size of convolution window varies between groups of residual blocks, the atrous convolution rate varies between groups of residual blocks. The training data includes benign training examples and pathogenic training examples of translated sequence pairs generated from benign variants and pathogenic variants.
    Type: Grant
    Filed: October 15, 2018
    Date of Patent: November 1, 2022
    Assignee: Illumina, Inc.
    Inventors: Kishore Jaganathan, Kai-How Farh, Sofia Kyriazopoulou Panagiotopoulou, Jeremy Francis McRae
  • Publication number: 20220336054
    Abstract: The technology disclosed relates to determining pathogenicity of variants. In particular, the technology disclosed relates to generating amino acid-wise distance channels for a plurality of amino acids in a protein. Each of the amino acid-wise distance channels has voxel-wise distance values for voxels in a plurality of voxels. A tensor includes the amino acid-wise distance channels and at least an alternative allele of the protein expressed by a variant. A deep convolutional neural network determines a pathogenicity of the variant based at least in part on processing the tensor. The technology disclosed further augments the tensor with supplemental information like a reference allele of the protein, evolutionary conservation data about the protein, annotation data about the protein, and structure confidence data about the protein.
    Type: Application
    Filed: April 15, 2021
    Publication date: October 20, 2022
    Applicants: Illumina, Inc., Illumina Cambridge Limited
    Inventors: Tobias HAMP, Hong GAO, Kai-How FARH
  • Publication number: 20220336057
    Abstract: The technology disclosed relates to efficiently determining which atoms in a protein are nearest to voxels in a grid. The atoms have three-dimensional (3D) atom coordinates, and the voxels have 3D voxel coordinates. The technology disclosed generates an atom-to-voxels mapping that maps, to each of the atoms, a containing voxel selected based on matching 3D atom coordinates of a particular atom of the protein to the 3D voxel coordinates in the grid. The technology disclosed generates a voxel-to-atoms mapping that maps, to each of the voxels, a subset of the atoms. The subset of the atoms mapped to a particular voxel in the grid includes those atoms in the protein that are mapped to the particular voxel by the atom-to-voxels mapping. The technology disclosed includes using the voxel-to-atoms mapping to determine, for each of the voxels, a nearest atom in the protein.
    Type: Application
    Filed: March 24, 2022
    Publication date: October 20, 2022
    Applicants: Illumina, Inc., Illumina Cambridge Limited
    Inventors: Tobias HAMP, Kai-How FARH, Hong GAO
  • Publication number: 20220336056
    Abstract: A system includes at least a voxelizer, an alternative allele encoder, an evolutionary conservation encoder, and a convolutional neural network. The voxelizer accesses a three-dimensional structure of a reference amino acid sequence of a protein and fits a three-dimensional grid of voxels on atoms in the three-dimensional structure on an amino acid-basis to generate amino acid-wise distance channels. The alternative allele encoder encodes an alternative allele sequence to each voxel in the three-dimensional grid of voxels. The evolutionary conservation encoder encodes an evolutionary conservation sequence to each voxel in the three-dimensional grid of voxels. The convolutional neural network applies three-dimensional convolutions to a tensor that includes the amino acid-wise distance channels encoded with the alternative allele sequence and respective evolutionary conservation sequences and determines a pathogenicity of a variant nucleotide based at least in part on the tensor.
    Type: Application
    Filed: March 24, 2022
    Publication date: October 20, 2022
    Applicants: Illumina, Inc., Illumina Cambridge Limited
    Inventors: Tobias HAMP, Kai-How FARH, Hong GAO
  • Publication number: 20220336055
    Abstract: The technology disclosed relates to determining pathogenicity of variants. In particular, the technology disclosed relates to generating amino acid-wise distance channels for a plurality of amino acids in a protein. Each of the amino acid-wise distance channels has voxel-wise distance values for voxels in a plurality of voxels. A tensor includes the amino acid-wise distance channels and at least an alternative allele of the protein expressed by a variant. A deep convolutional neural network determines a pathogenicity of the variant based at least in part on processing the tensor. The technology disclosed further augments the tensor with supplemental information like a reference allele of the protein, evolutionary conservation data about the protein, annotation data about the protein, and structure confidence data about the protein.
    Type: Application
    Filed: September 7, 2021
    Publication date: October 20, 2022
    Applicants: Illumina, Inc., Illumina Cambridge Limited
    Inventors: Tobias HAMP, Hong GAO, Kai-How FARH
  • Publication number: 20220237457
    Abstract: The technology disclosed relates to constructing a computer-implemented method for variant classification. In particular, the method includes using a pathogenicity prediction neural network to process as input, (i) a reference protein sequence that has a first chain of amino acids with at least twenty amino acids, (ii) an alternative protein sequence aligned with the reference sequence, where the alternative protein sequence has a second chain of amino acids with at least twenty amino acids, and the first and second chains of amino acids differ by a variant amino acid caused by a nucleotide substitution, and (iii) a primate conservation profile generated using a primate cross-species multiple sequence alignment that aligns the reference protein sequence with other protein sequences from primate species. The method further includes based on the processing of the input by the neural network, generating as output a pathogenicity prediction for the nucleotide substitution.
    Type: Application
    Filed: April 6, 2022
    Publication date: July 28, 2022
    Applicant: Illumina, Inc.
    Inventors: Laksshman Sundaram, Kai-How Farh, Hong Gao, Samskruthi Reddy Padigepati, Jeremy Francis McRae
  • Patent number: 11397889
    Abstract: The technology disclosed relates to constructing a convolutional neural network-based classifier for variant classification. In particular, it relates to training a convolutional neural network-based classifier on training data using a backpropagation-based gradient update technique that progressively match outputs of the convolutional network network-based classifier with corresponding ground truth labels. The convolutional neural network-based classifier comprises groups of residual blocks, each group of residual blocks is parameterized by a number of convolution filters in the residual blocks, a convolution window size of the residual blocks, and an atrous convolution rate of the residual blocks, the size of convolution window varies between groups of residual blocks, the atrous convolution rate varies between groups of residual blocks. The training data includes benign training examples and pathogenic training examples of translated sequence pairs generated from benign variants and pathogenic variants.
    Type: Grant
    Filed: October 15, 2018
    Date of Patent: July 26, 2022
    Assignee: Illumina, Inc.
    Inventors: Kishore Jaganathan, Kai-How Farh, Sofia Kyriazopoulou Panagiotopoulou, Jeremy Francis McRae
  • Patent number: 11386324
    Abstract: The technology disclosed relates to constructing a convolutional neural network-based classifier for variant classification. In particular, it relates to training a convolutional neural network-based classifier on training data using a backpropagation-based gradient update technique that progressively match outputs of the convolutional neural network-based classifier with corresponding ground truth labels. The convolutional neural network-based classifier comprises groups of residual blocks, each group of residual blocks is parameterized by a number of convolution filters in the residual blocks, a convolution window size of the residual blocks, and an atrous convolution rate of the residual blocks, the size of convolution window varies between groups of residual blocks, the atrous convolution rate varies between groups of residual blocks. The training data includes benign training examples and pathogenic training examples of translated sequence pairs generated from benign variants and pathogenic variants.
    Type: Grant
    Filed: January 27, 2020
    Date of Patent: July 12, 2022
    Assignee: Illumina, Inc.
    Inventors: Hong Gao, Kai-How Farh, Laksshman Sundaram, Jeremy Francis McRae
  • Publication number: 20220164710
    Abstract: Systems, computer-implemented methods, and non-transitory computer readable media are provided for sharing medical data. The disclosed systems may be configured to create a first workgroup having a first knowledgebase. This first knowledgebase may be federated with a common knowledgebase, and with a second knowledgebase of a second workgroup. At least one of the first knowledgebase, common knowledgebase, and second knowledgebase may be configured to store data items comprising associations, signs, and evidence. The signs may comprise measurements and contexts, and the associations may describe the relationships between the measurements and contexts. The evidence may support these associations. The disclosed systems may be configured to receive a request from a user in the first workgroup, retrieve matching data items, and optionally then output to the user at least some of the retrieved matching data items. The request may comprise at least one of a first association and a first measurement.
    Type: Application
    Filed: February 7, 2022
    Publication date: May 26, 2022
    Inventors: Kai-How Farh, Donavan Cheng, John Casey Shon, Jorg Hakenberg, Eugene Bolotin, James Geaney, Hong Gao, Pam Cheng, Inderjit Singh, Daniel Roche, Milan Karangutkar
  • Patent number: 11315016
    Abstract: The technology disclosed relates to constructing a convolutional neural network-based classifier for variant classification. In particular, it relates to training a convolutional neural network-based classifier on training data using a backpropagation-based gradient update technique that progressively match outputs of the convolutional neural network-based classifier with corresponding ground truth labels. The convolutional neural network-based classifier comprises groups of residual blocks, each group of residual blocks is parameterized by a number of convolution filters in the residual blocks, a convolution window size of the residual blocks, and an atrous convolution rate of the residual blocks, the size of convolution window varies between groups of residual blocks, the atrous convolution rate varies between groups of residual blocks. The training data includes benign training examples and pathogenic training examples of translated sequence pairs generated from benign variants and pathogenic variants.
    Type: Grant
    Filed: October 15, 2018
    Date of Patent: April 26, 2022
    Assignee: Illumina, Inc.
    Inventors: Laksshman Sundaram, Kai-How Farh, Hong Gao, Samskruthi Reddy Padigepati, Jeremy Francis McRae
  • Patent number: 11244246
    Abstract: Systems, computer-implemented methods, and non-transitory computer readable media are provided for sharing medical data. The disclosed systems may be configured to create a first workgroup having a first knowledgebase. This first knowledgebase may be federated with a common knowledgebase, and with a second knowledgebase of a second workgroup. At least one of the first knowledgebase, common knowledgebase, and second knowledgebase may be configured to store data items comprising associations, signs, and evidence. The signs may comprise measurements and contexts, and the associations may describe the relationships between the measurements and contexts. The evidence may support these associations. The disclosed systems may be configured to receive a request from a user in the first workgroup, retrieve matching data items, and optionally then output to the user at least some of the retrieved matching data items. The request may comprise at least one of a first association and a first measurement.
    Type: Grant
    Filed: February 24, 2020
    Date of Patent: February 8, 2022
    Assignee: ILLUMINA, INC.
    Inventors: Kai-How Farh, Donavan Cheng, John Shon, Jorg Hakenberg, Eugene Bolotin, James Casey Geaney, Hong Gao, Pam Cheng, Inderjit Singh, Daniel Roche, Milan Karangutkar