Patents by Inventor Kai-How FARH

Kai-How FARH has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Deep convolutional neural networks to predict variant pathogenicity using three-dimensional (3D) protein structures

Patent number: 11515010

Abstract: The technology disclosed relates to determining pathogenicity of variants. In particular, the technology disclosed relates to generating amino acid-wise distance channels for a plurality of amino acids in a protein. Each of the amino acid-wise distance channels has voxel-wise distance values for voxels in a plurality of voxels. A tensor includes the amino acid-wise distance channels and at least an alternative allele of the protein expressed by a variant. A deep convolutional neural network determines a pathogenicity of the variant based at least in part on processing the tensor. The technology disclosed further augments the tensor with supplemental information like a reference allele of the protein, evolutionary conservation data about the protein, annotation data about the protein, and structure confidence data about the protein.

Type: Grant

Filed: September 7, 2021

Date of Patent: November 29, 2022

Assignees: Illumina, Inc., Illumina Cambridge Limited

Inventors: Tobias Hamp, Hong Gao, Kai-How Farh
Deep learning-based splice site classification

Patent number: 11488009

Abstract: The technology disclosed relates to constructing a convolutional neural network-based classifier for variant classification. In particular, it relates to training a convolutional neural network-based classifier on training data using a backpropagation-based gradient update technique that progressively match outputs of the convolutional neural network-based classifier with corresponding ground truth labels. The convolutional neural network-based classifier comprises groups of residual blocks, each group of residual blocks is parameterized by a number of convolution filters in the residual blocks, a convolution window size of the residual blocks, and an atrous convolution rate of the residual blocks, the size of convolution window varies between groups of residual blocks, the atrous convolution rate varies between groups of residual blocks. The training data includes benign training examples and pathogenic training examples of translated sequence pairs generated from benign variants and pathogenic variants.

Type: Grant

Filed: October 15, 2018

Date of Patent: November 1, 2022

Assignee: Illumina, Inc.

Inventors: Kishore Jaganathan, Kai-How Farh, Sofia Kyriazopoulou Panagiotopoulou, Jeremy Francis McRae
Deep Convolutional Neural Networks to Predict Variant Pathogenicity using Three-Dimensional (3D) Protein Structures

Publication number: 20220336055

Abstract: The technology disclosed relates to determining pathogenicity of variants. In particular, the technology disclosed relates to generating amino acid-wise distance channels for a plurality of amino acids in a protein. Each of the amino acid-wise distance channels has voxel-wise distance values for voxels in a plurality of voxels. A tensor includes the amino acid-wise distance channels and at least an alternative allele of the protein expressed by a variant. A deep convolutional neural network determines a pathogenicity of the variant based at least in part on processing the tensor. The technology disclosed further augments the tensor with supplemental information like a reference allele of the protein, evolutionary conservation data about the protein, annotation data about the protein, and structure confidence data about the protein.

Type: Application

Filed: September 7, 2021

Publication date: October 20, 2022

Applicants: Illumina, Inc., Illumina Cambridge Limited

Inventors: Tobias HAMP, Hong GAO, Kai-How FARH
Deep Convolutional Neural Networks to Predict Variant Pathogenicity using Three-Dimensional (3D) Protein Structures

Publication number: 20220336054

Abstract: The technology disclosed relates to determining pathogenicity of variants. In particular, the technology disclosed relates to generating amino acid-wise distance channels for a plurality of amino acids in a protein. Each of the amino acid-wise distance channels has voxel-wise distance values for voxels in a plurality of voxels. A tensor includes the amino acid-wise distance channels and at least an alternative allele of the protein expressed by a variant. A deep convolutional neural network determines a pathogenicity of the variant based at least in part on processing the tensor. The technology disclosed further augments the tensor with supplemental information like a reference allele of the protein, evolutionary conservation data about the protein, annotation data about the protein, and structure confidence data about the protein.

Type: Application

Filed: April 15, 2021

Publication date: October 20, 2022

Applicants: Illumina, Inc., Illumina Cambridge Limited

Inventors: Tobias HAMP, Hong GAO, Kai-How FARH
MULTI-CHANNEL PROTEIN VOXELIZATION TO PREDICT VARIANT PATHOGENICITY USING DEEP CONVOLUTIONAL NEURAL NETWORKS

Publication number: 20220336056

Abstract: A system includes at least a voxelizer, an alternative allele encoder, an evolutionary conservation encoder, and a convolutional neural network. The voxelizer accesses a three-dimensional structure of a reference amino acid sequence of a protein and fits a three-dimensional grid of voxels on atoms in the three-dimensional structure on an amino acid-basis to generate amino acid-wise distance channels. The alternative allele encoder encodes an alternative allele sequence to each voxel in the three-dimensional grid of voxels. The evolutionary conservation encoder encodes an evolutionary conservation sequence to each voxel in the three-dimensional grid of voxels. The convolutional neural network applies three-dimensional convolutions to a tensor that includes the amino acid-wise distance channels encoded with the alternative allele sequence and respective evolutionary conservation sequences and determines a pathogenicity of a variant nucleotide based at least in part on the tensor.

Type: Application

Filed: March 24, 2022

Publication date: October 20, 2022

Applicants: Illumina, Inc., Illumina Cambridge Limited

Inventors: Tobias HAMP, Kai-How FARH, Hong GAO
EFFICIENT VOXELIZATION FOR DEEP LEARNING

Publication number: 20220336057

Abstract: The technology disclosed relates to efficiently determining which atoms in a protein are nearest to voxels in a grid. The atoms have three-dimensional (3D) atom coordinates, and the voxels have 3D voxel coordinates. The technology disclosed generates an atom-to-voxels mapping that maps, to each of the atoms, a containing voxel selected based on matching 3D atom coordinates of a particular atom of the protein to the 3D voxel coordinates in the grid. The technology disclosed generates a voxel-to-atoms mapping that maps, to each of the voxels, a subset of the atoms. The subset of the atoms mapped to a particular voxel in the grid includes those atoms in the protein that are mapped to the particular voxel by the atom-to-voxels mapping. The technology disclosed includes using the voxel-to-atoms mapping to determine, for each of the voxels, a nearest atom in the protein.

Type: Application

Filed: March 24, 2022

Publication date: October 20, 2022

Applicants: Illumina, Inc., Illumina Cambridge Limited

Inventors: Tobias HAMP, Kai-How FARH, Hong GAO
VARIANT PATHOGENICITY PREDICTION USING NEURAL NETWORK

Publication number: 20220237457

Abstract: The technology disclosed relates to constructing a computer-implemented method for variant classification. In particular, the method includes using a pathogenicity prediction neural network to process as input, (i) a reference protein sequence that has a first chain of amino acids with at least twenty amino acids, (ii) an alternative protein sequence aligned with the reference sequence, where the alternative protein sequence has a second chain of amino acids with at least twenty amino acids, and the first and second chains of amino acids differ by a variant amino acid caused by a nucleotide substitution, and (iii) a primate conservation profile generated using a primate cross-species multiple sequence alignment that aligns the reference protein sequence with other protein sequences from primate species. The method further includes based on the processing of the input by the neural network, generating as output a pathogenicity prediction for the nucleotide substitution.

Type: Application

Filed: April 6, 2022

Publication date: July 28, 2022

Applicant: Illumina, Inc.

Inventors: Laksshman Sundaram, Kai-How Farh, Hong Gao, Samskruthi Reddy Padigepati, Jeremy Francis McRae
Aberrant splicing detection using convolutional neural networks (CNNs)

Patent number: 11397889

Abstract: The technology disclosed relates to constructing a convolutional neural network-based classifier for variant classification. In particular, it relates to training a convolutional neural network-based classifier on training data using a backpropagation-based gradient update technique that progressively match outputs of the convolutional network network-based classifier with corresponding ground truth labels. The convolutional neural network-based classifier comprises groups of residual blocks, each group of residual blocks is parameterized by a number of convolution filters in the residual blocks, a convolution window size of the residual blocks, and an atrous convolution rate of the residual blocks, the size of convolution window varies between groups of residual blocks, the atrous convolution rate varies between groups of residual blocks. The training data includes benign training examples and pathogenic training examples of translated sequence pairs generated from benign variants and pathogenic variants.

Type: Grant

Filed: October 15, 2018

Date of Patent: July 26, 2022

Assignee: Illumina, Inc.

Inventors: Kishore Jaganathan, Kai-How Farh, Sofia Kyriazopoulou Panagiotopoulou, Jeremy Francis McRae
Recurrent neural network-based variant pathogenicity classifier

Patent number: 11386324

Abstract: The technology disclosed relates to constructing a convolutional neural network-based classifier for variant classification. In particular, it relates to training a convolutional neural network-based classifier on training data using a backpropagation-based gradient update technique that progressively match outputs of the convolutional neural network-based classifier with corresponding ground truth labels. The convolutional neural network-based classifier comprises groups of residual blocks, each group of residual blocks is parameterized by a number of convolution filters in the residual blocks, a convolution window size of the residual blocks, and an atrous convolution rate of the residual blocks, the size of convolution window varies between groups of residual blocks, the atrous convolution rate varies between groups of residual blocks. The training data includes benign training examples and pathogenic training examples of translated sequence pairs generated from benign variants and pathogenic variants.

Type: Grant

Filed: January 27, 2020

Date of Patent: July 12, 2022

Assignee: Illumina, Inc.

Inventors: Hong Gao, Kai-How Farh, Laksshman Sundaram, Jeremy Francis McRae
FEDERATED SYSTEMS AND METHODS FOR MEDICAL DATA SHARING

Publication number: 20220164710

Abstract: Systems, computer-implemented methods, and non-transitory computer readable media are provided for sharing medical data. The disclosed systems may be configured to create a first workgroup having a first knowledgebase. This first knowledgebase may be federated with a common knowledgebase, and with a second knowledgebase of a second workgroup. At least one of the first knowledgebase, common knowledgebase, and second knowledgebase may be configured to store data items comprising associations, signs, and evidence. The signs may comprise measurements and contexts, and the associations may describe the relationships between the measurements and contexts. The evidence may support these associations. The disclosed systems may be configured to receive a request from a user in the first workgroup, retrieve matching data items, and optionally then output to the user at least some of the retrieved matching data items. The request may comprise at least one of a first association and a first measurement.

Type: Application

Filed: February 7, 2022

Publication date: May 26, 2022

Inventors: Kai-How Farh, Donavan Cheng, John Casey Shon, Jorg Hakenberg, Eugene Bolotin, James Geaney, Hong Gao, Pam Cheng, Inderjit Singh, Daniel Roche, Milan Karangutkar
Deep convolutional neural networks for variant classification

Patent number: 11315016

Abstract: The technology disclosed relates to constructing a convolutional neural network-based classifier for variant classification. In particular, it relates to training a convolutional neural network-based classifier on training data using a backpropagation-based gradient update technique that progressively match outputs of the convolutional neural network-based classifier with corresponding ground truth labels. The convolutional neural network-based classifier comprises groups of residual blocks, each group of residual blocks is parameterized by a number of convolution filters in the residual blocks, a convolution window size of the residual blocks, and an atrous convolution rate of the residual blocks, the size of convolution window varies between groups of residual blocks, the atrous convolution rate varies between groups of residual blocks. The training data includes benign training examples and pathogenic training examples of translated sequence pairs generated from benign variants and pathogenic variants.

Type: Grant

Filed: October 15, 2018

Date of Patent: April 26, 2022

Assignee: Illumina, Inc.

Inventors: Laksshman Sundaram, Kai-How Farh, Hong Gao, Samskruthi Reddy Padigepati, Jeremy Francis McRae
Federated systems and methods for medical data sharing

Patent number: 11244246

Abstract: Systems, computer-implemented methods, and non-transitory computer readable media are provided for sharing medical data. The disclosed systems may be configured to create a first workgroup having a first knowledgebase. This first knowledgebase may be federated with a common knowledgebase, and with a second knowledgebase of a second workgroup. At least one of the first knowledgebase, common knowledgebase, and second knowledgebase may be configured to store data items comprising associations, signs, and evidence. The signs may comprise measurements and contexts, and the associations may describe the relationships between the measurements and contexts. The evidence may support these associations. The disclosed systems may be configured to receive a request from a user in the first workgroup, retrieve matching data items, and optionally then output to the user at least some of the retrieved matching data items. The request may comprise at least one of a first association and a first measurement.

Type: Grant

Filed: February 24, 2020

Date of Patent: February 8, 2022

Assignee: ILLUMINA, INC.

Inventors: Kai-How Farh, Donavan Cheng, John Shon, Jorg Hakenberg, Eugene Bolotin, James Casey Geaney, Hong Gao, Pam Cheng, Inderjit Singh, Daniel Roche, Milan Karangutkar
VARIANT PATHOGENICITY SCORING AND CLASSIFICATION AND USES THEREOF

Publication number: 20220028485

Abstract: Derivation and use of pathogenicity scores for gene variants are described herein. Applications, uses, and variations of the pathogenicity scoring process include, but are not limited to, the derivation and use of thresholds to characterize a variant as pathogenic or benign, the estimation of selection effects associated with a gene variant, the estimation of genetic disease prevalence using pathogenicity scores, and the recalibration of methods used to assess pathogenicity scores.

Type: Application

Filed: July 21, 2021

Publication date: January 27, 2022

Inventors: Hong Gao, Kai-How Farh, Jeremy Francis McRae
VARIANT PATHOGENICITY SCORING AND CLASSIFICATION AND USES THEREOF

Publication number: 20220027388

Abstract: Derivation and use of pathogenicity scores for gene variants are described herein. Applications, uses, and variations of the pathogenicity scoring process include, but are not limited to, the derivation and use of thresholds to characterize a variant as pathogenic or benign, the estimation of selection effects associated with a gene variant, the estimation of genetic disease prevalence using pathogenicity scores, and the recalibration of methods used to assess pathogenicity scores.

Type: Application

Filed: July 21, 2021

Publication date: January 27, 2022

Inventors: Hong Gao, Kai-How Farh, Jeremy Francis McRae
FEDERATED SYSTEMS AND METHODS FOR MEDICAL DATA SHARING

Publication number: 20200380412

Abstract: Systems, computer-implemented methods, and non-transitory computer readable media are provided for sharing medical data. The disclosed systems may be configured to create a first workgroup having a first knowledgebase. This first knowledgebase may be federated with a common knowledgebase, and with a second knowledgebase of a second workgroup. At least one of the first knowledgebase, common knowledgebase, and second knowledgebase may be configured to store data items comprising associations, signs, and evidence. The signs may comprise measurements and contexts, and the associations may describe the relationships between the measurements and contexts. The evidence may support these associations. The disclosed systems may be configured to receive a request from a user in the first workgroup, retrieve matching data items, and optionally then output to the user at least some of the retrieved matching data items. The request may comprise at least one of a first association and a first measurement.

Type: Application

Filed: February 24, 2020

Publication date: December 3, 2020

Inventors: Kai-How FARH, Donavan CHENG, John SHON, Jorg HAKENBERG, Eugene BOLOTIN, James Casey GEANEY, Hong GAO, Pam CHENG, Inderjit SINGH, Daniel ROCHE, Milan KARANGUTKAR
Deep Learning-Based Techniques for Training Deep Convolutional Neural Networks

Publication number: 20200279157

Abstract: The technology disclosed relates to constructing a convolutional neural network-based classifier for variant classification. In particular, it relates to training a convolutional neural network-based classifier on training data using a backpropagation-based gradient update technique that progressively match outputs of the convolutional neural network-based classifier with corresponding ground truth labels. The convolutional neural network-based classifier comprises groups of residual blocks, each group of residual blocks is parameterized by a number of convolution filters in the residual blocks, a convolution window size of the residual blocks, and an atrous convolution rate of the residual blocks, the size of convolution window varies between groups of residual blocks, the atrous convolution rate varies between groups of residual blocks. The training data includes benign training examples and pathogenic training examples of translated sequence pairs generated from benign variants and pathogenic variants.

Type: Application

Filed: January 27, 2020

Publication date: September 3, 2020

Applicant: Illumina, Inc.

Inventors: Hong GAO, Kai-How FARH, Laksshman SUNDARAM, Jeremy Francis McRAE
Deep Learning-Based Framework for Identifying Sequence Patterns that Cause Sequence-Specific Errors (SSEs)

Publication number: 20200251183

Abstract: The technology disclosed presents a deep learning-based framework, which identifies sequence patterns that cause sequence-specific errors (SSEs). Systems and methods train a variant filter on large-scale variant data to learn causal dependencies between sequence patterns and false variant calls. The variant filter has a hierarchical structure built on deep neural networks such as convolutional neural networks and fully-connected neural networks. Systems and methods implement a simulation that uses the variant filter to test known sequence patterns for their effect on variant filtering. The premise of the simulation is as follows: when a pair of a repeat pattern under test and a called variant is fed to the variant filter as part of a simulated input sequence and the variant filter classifies the called variant as a false variant call, then the repeat pattern is considered to have caused the false variant call and identified as SSE-causing.

Type: Application

Filed: July 8, 2019

Publication date: August 6, 2020

Applicant: Illumina, Inc.

Inventors: Dorna KASHEFHAGHIGHI, Amirali KIA, Kai-How FARH
Federated systems and methods for medical data sharing

Patent number: 10607156

Abstract: Systems, computer-implemented methods, and non-transitory computer readable media are provided for sharing medical data. The disclosed systems may be configured to create a first workgroup having a first knowledgebase. This first knowledgebase may be federated with a common knowledgebase, and with a second knowledgebase of a second workgroup. At least one of the first knowledgebase, common knowledgebase, and second knowledgebase may be configured to store data items comprising associations, signs, and evidence. The signs may comprise measurements and contexts, and the associations may describe the relationships between the measurements and contexts. The evidence may support these associations. The disclosed systems may be configured to receive a request from a user in the first workgroup, retrieve matching data items, and optionally then output to the user at least some of the retrieved matching data items. The request may comprise at least one of a first association and a first measurement.

Type: Grant

Filed: August 22, 2017

Date of Patent: March 31, 2020

Assignee: ILLUMINA, INC.

Inventors: Kai-How Farh, Donavan Cheng, John Shon, Jorg Hakenberg, Eugene Bolotin, James Casey Geaney, Hong Gao, Pam Cheng, Inderjit Singh, Daniel Roche, Milan Karangutkar
Deep Convolutional Neural Networks for Variant Classification

Publication number: 20200065675

Abstract: The technology disclosed relates to constructing a convolutional neural network-based classifier for variant classification. In particular, it relates to training a convolutional neural network-based classifier on training data using a backpropagation-based gradient update technique that progressively match outputs of the convolutional network network-based classifier with corresponding ground truth labels. The convolutional neural network-based classifier comprises groups of residual blocks, each group of residual blocks is parameterized by a number of convolution filters in the residual blocks, a convolution window size of the residual blocks, and an atrous convolution rate of the residual blocks, the size of convolution window varies between groups of residual blocks, the atrous convolution rate varies between groups of residual blocks. The training data includes benign training examples and pathogenic training examples of translated sequence pairs generated from benign variants and pathogenic variants.

Type: Application

Filed: October 15, 2018

Publication date: February 27, 2020

Applicant: Illumina, Inc.

Inventors: Laksshman Sundaram, Kai-How Farh, Hong Gao, Samskruthi Reddy Padigepati, Jeremy Francis McRae
Display screen or portion thereof with graphical user interface

Patent number: D875773

Type: Grant

Filed: September 11, 2019

Date of Patent: February 18, 2020

Assignee: Illumina, Inc.

Inventors: Kai-How Farh, Donavan Cheng, Andrew Warren, Ian D. Patrick

prev 1 2 3 4 next