Patents by Inventor Mark Kunitomi
Mark Kunitomi has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12100486Abstract: Provided is a deep learning algorithm that analyzes fragments of biological sequences. The input for the deep learning algorithm is a biological sequence fragment of unknown origin and the output is the closest known biological genome that could share phenotypic properties with the biological species of unknown origin. The workflow thus has application for genomic classification, identification of mutations within known genomes, and the identification of the closest class for unknown species.Type: GrantFiled: May 14, 2021Date of Patent: September 24, 2024Assignee: International Business Machines CorporationInventors: Vito Paolo Pastore, Mark Kunitomi, Simone Bianco
-
Patent number: 12020007Abstract: Information received from a graphical user interface (GUI) and a list of user-curated command line patterns are received by an auto-wrapper system, wherein the auto-wrapper system is associated with an analytics workflow service. A module including a parameter space having one or more parameters and options used in the list of user-curated command line patterns is generated, by the auto-wrapper system, wherein content for each parameter is derived from the parameter's presence in the list of user-curated command line patterns combined with the information received from a GUI.Type: GrantFiled: September 15, 2022Date of Patent: June 25, 2024Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Mark Kunitomi, David Chambliss, Forest Dussault
-
Publication number: 20240094996Abstract: Information received from a graphical user interface (GUI) and a list of user-curated command line patterns are received by an auto-wrapper system, wherein the auto-wrapper system is associated with an analytics workflow service. A module including a parameter space having one or more parameters and options used in the list of user-curated command line patterns is generated, by the auto-wrapper system, wherein content for each parameter is derived from the parameter's presence in the list of user-curated command line patterns combined with the information received from a GUI.Type: ApplicationFiled: September 15, 2022Publication date: March 21, 2024Inventors: Mark Kunitomi, David Chambliss, Forest Dussault
-
Patent number: 11830581Abstract: An iterative process for optimizing one or more parameters used by a k-mer based de novo genome assembler program to assemble a set of sequenced nucleic acids is described. The method utilizes quality metrics whose desired values are initially specified. Computed values of the quality metrics are calculated during the assembly process and compared to the desired values. The assembly process stops when the computed values are not desired values. After modification of one or more of the parameters (e.g., k-mer value), the assembly process re-initiates using the modified parameter set. This process repeats until the computed values of the quality metrics meet the desired values. The final parameter set is then used to generate or complete one or more final assembled genomes.Type: GrantFiled: March 7, 2019Date of Patent: November 28, 2023Assignee: International Business Machines CorporationInventors: Matthew A. Davis, Mark Kunitomi, Kun Hu
-
Patent number: 11830580Abstract: A large collection of sample genomes containing misclassified k-mers and metadata errors from a reference taxonomy was converted to a self-consistent k-mer database comprising a self-consistent taxonomy. The self-consistent taxonomy was based on genetic distances calculated using the MinHash method or the Meier-Koltoff method. An agglomerative clustering algorithm was used to calculate the self-consistent taxonomy. Each k-mer of the sample genomes was assigned to only one node of the self-consistent taxonomy. In another step, each node of the self-consistent taxonomy was mapped to the reference taxonomy, thereby preserving in the self-consistent taxonomy links to the reference taxonomy while correcting for the misclassification errors therein. The self-consistent k-mer database can be used to taxonomically profile sequenced nucleic acids with greater specificity compared to systems relying on the reference taxonomy.Type: GrantFiled: September 30, 2018Date of Patent: November 28, 2023Assignees: International Business Machines Corporation, Mars, IncorporatedInventors: James H. Kaufman, Matthew A. Davis, Mark Kunitomi, Bart C. Weimer
-
Patent number: 11809498Abstract: Methods are disclosed for reducing the size of a k-mer reference database used for queries and/or taxonomic classifications when available computer storage and/or memory are inadequate. The k-mers of the reference database have been previously classified to a taxonomy, preferably based on genetic distances. In one method, the k-mers are separated into one or more groups followed by removing k-mers common to the groups. In another method, k-mers are removed based on a selected taxonomic threshold level. A third method combines the features of the previous two methods. The methods are adaptable to machine learning.Type: GrantFiled: November 7, 2019Date of Patent: November 7, 2023Assignee: International Business Machines CorporationInventors: Matthew A. Davis, Mark Kunitomi, James H. Kaufman
-
Patent number: 11781190Abstract: A bioinformatics method is provided for identifying candidate biological sequences, such as DNA, RNA, and proteins, with high sensitivity and specificity for application in procedures such as PCR and gene and protein sequencing. The method involves categorizing a collection of biological sequences within an out-group and an in-group, identifying the intersection between the in-group and the out-group, the union of the out-group, and a relative complement of sequences that are members of the in-group, but not the out-group. A biological signature for a species of interest with high sensitivity and specificity will be a member of the relative complement that has an out-group frequency of zero.Type: GrantFiled: August 6, 2020Date of Patent: October 10, 2023Assignee: International Business Machines CorporationInventors: Mark Kunitomi, Daniel Waddington
-
Patent number: 11766451Abstract: Techniques regarding treating one or more microbe infections with combination therapy are provided. For example, one or more embodiments described herein can comprise a method, which can comprise enhancing an antimicrobial activity of an antibiotic by a combination therapy. The combination therapy can comprise the antibiotic and a polycarbonate polymer functionalized with a guanidinium functional group.Type: GrantFiled: December 29, 2020Date of Patent: September 26, 2023Assignee: International Business Machines CorporationInventors: James L Hedrick, Simone Bianco, Mark Kunitomi, Yi Yan Yang, Xin Ding, Chuan Yang, Zhen Chang Liang, Paola Florez de Sessions, Balamurugan Periaswamy
-
Patent number: 11646117Abstract: A method is described that utilizes non-negative matrix factorization to predict susceptibility of a microorganism to an antimicrobial drug. A sparse adjacency matrix is constructed from existing ground truth datasets that include antibiogram data and other data associated with microorganisms. The rows of the adjacency matrix correspond to biosamples, and the columns correspond to instances of metadata and drugs associated with one or more of the biosamples. The elements of the adjacency matrix are assigned non-zero numerical values or zero depending on whether a known association exists. The adjacency matrix is then factored using a selected number of latent factors, thereby producing a reconstruction matrix approximating the adjacency matrix. The values of the reconstruction matrix are used to predict antimicrobial susceptibility of a biosample ID to a drug when antibiogram data are lacking.Type: GrantFiled: June 4, 2019Date of Patent: May 9, 2023Assignee: International Business Machines CorporationInventors: Mark Kunitomi, Dmitry Zubarev, Sarathkrishna Swaminathan
-
Publication number: 20220415437Abstract: A computer-implemented method for generating a cover set of biological sequences to detect a group of biological members. The method includes one or more computer processors receiving a request to generate a cover set of k-mers used to detect a first group of biological members that are related via a taxonomic lineage. The method further includes obtaining a plurality of biological sequence data corresponding to biological members of the first group. The method further includes determining a set of k-mers respectively associated with a biological member included within the first group of biological members. The method further includes determining the cover set of k-mers utilized to detect the biological members of the first group by selecting a subset of k-mers from a superset of k-mers associated with the first group of biological members based on preventing false-positive detections of biological members different from the first group of biological members.Type: ApplicationFiled: June 24, 2021Publication date: December 29, 2022Inventors: Mark Kunitomi, Samyukta Satish Rao, Daniel Waddington, Amir Abboud
-
Publication number: 20220399078Abstract: One embodiment provides for a method including determining, by at least one processor, sequence-sequence distances for a biological sequence collection. The at least one processor generates a matrix Mij of the sequence-sequence distances, where i and j are positive integers. The at least one processor further generates clusters for the matrix Mij by performing hierarchical clustering. A self-consistent taxonomy is created from the clusters. A visual heat map display of the matrix Mij is selectively controlled using metadata, zoom input and opacity input.Type: ApplicationFiled: August 4, 2022Publication date: December 15, 2022Inventors: Matthew Davis, James Kaufman, Mark Kunitomi
-
Patent number: 11515011Abstract: A computer-implemented method includes receiving genomic data associated with a plurality of genomes and identifying k-mer sets within the genomic data. The method includes constructing a k-mer subset tree according to the following process: performing iterative pairwise comparisons on the k-mer sets, wherein the iterative pairwise comparisons identify fragments with the most shared k-mers, merging the identified fragments into non-leaf nodes of the k-mer subset tree, and placing each remaining k-mer into a leaf node of the k-mer subset tree. The method includes storing the k-mer subset tree. A computer program product for data compression includes a computer readable storage medium having program instructions embodied therewith. The program instructions are executable by a computer to cause the compute to perform the foregoing method. A system includes a processor and logic. The logic is configured to perform the foregoing method.Type: GrantFiled: August 9, 2019Date of Patent: November 29, 2022Assignee: International Business Machines CorporationInventors: Daniel Waddington, Mark Kunitomi, Amir Abboud, Samyukta Satish Rao
-
Publication number: 20220367011Abstract: Provided is a deep learning algorithm that analyzes fragments of biological sequences. The input for the deep learning algorithm is a biological sequence fragment of unknown origin and the output is the closest known biological genome that could share phenotypic properties with the biological species of unknown origin. The workflow thus has application for genomic classification, identification of mutations within known genomes, and the identification of the closest class for unknown species.Type: ApplicationFiled: May 14, 2021Publication date: November 17, 2022Inventors: Vito Paolo Pastore, Mark Kunitomi, Simone Bianco
-
Patent number: 11456057Abstract: One embodiment provides for a method including determining, by at least one processor, sequence-sequence distances for a biological sequence collection. The at least one processor generates a matrix Mij of the sequence-sequence distances, where i and j are positive integers. The at least one processor further generates clusters for the matrix Mij by performing hierarchical clustering. A self-consistent taxonomy is created from the clusters. A visual heat map display of the matrix Mij is selectively controlled using metadata, zoom input and opacity input.Type: GrantFiled: March 29, 2018Date of Patent: September 27, 2022Assignee: International Business Machines CorporationInventors: Matthew Davis, James Kaufman, Mark Kunitomi
-
Patent number: 11347810Abstract: A method is described for automatically correcting metadata errors in a k-mer database. A k-mer database having a self-consistent taxonomy based on genome-genome distance was constructed from a set of sample and reference genomes whose metadata included taxonomic labeling from a reference taxonomy (the standard NCBI taxonomy), which is not based on genetic distance. As a result, genomes of a given taxonomic ID of the self-consistent taxonomy could be separated into clusters based on the differences in the metadata. Genomes of the clusters less than a minimum cluster size Cmin were removed and profiled against the remaining genomes, correcting metadata automatically for those genomes that could be mapped back. The resulting k-mer database showed improved specificity for genetic profiling. Another method is described for identifying and handling chimeric genomes using the self-consistent taxonomy. Another method is described for correcting a classification database.Type: GrantFiled: December 20, 2018Date of Patent: May 31, 2022Assignee: International Business Machines CorporationInventors: James H. Kaufman, Matthew A. Davis, Mark Kunitomi, Kenneth L. Clarkson
-
Publication number: 20220042114Abstract: A bioinformatics method is provided for identifying candidate biological sequences, such as DNA, RNA, and proteins, with high sensitivity and specificity for application in procedures such as PCR and gene and protein sequencing. The method involves categorizing a collection of biological sequences within an out-group and an in-group, identifying the intersection between the in-group and the out-group, the union of the out-group, and a relative complement of sequences that are members of the in-group, but not the out-group. A biological signature for a species of interest with high sensitivity and specificity will be a member of the relative complement that has an out-group frequency of zero.Type: ApplicationFiled: August 6, 2020Publication date: February 10, 2022Inventors: Mark Kunitomi, Daniel Waddington
-
Publication number: 20210141833Abstract: Methods are disclosed for reducing the size of a k-mer reference database used for queries and/or taxonomic classifications when available computer storage and/or memory are inadequate. The k-mers of the reference database have been previously classified to a taxonomy, preferably based on genetic distances. In one method, the k-mers are separated into one or more groups followed by removing k-mers common to the groups. In another method, k-mers are removed based on a selected taxonomic threshold level. A third method combines the features of the previous two methods. The methods are adaptable to machine learning.Type: ApplicationFiled: November 7, 2019Publication date: May 13, 2021Inventors: Matthew A. Davis, Mark Kunitomi, James H. Kaufman
-
Publication number: 20210121498Abstract: Techniques regarding treating one or more microbe infections with combination therapy are provided. For example, one or more embodiments described herein can comprise a method, which can comprise enhancing an antimicrobial activity of an antibiotic by a combination therapy. The combination therapy can comprise the antibiotic and a polycarbonate polymer functionalized with a guanidinium functional group.Type: ApplicationFiled: December 29, 2020Publication date: April 29, 2021Inventors: James L. Hedrick, Simone Bianco, Mark Kunitomi, Yi Yan Yang, Xin Ding, Chuan Yang, Zhen Chang Liang, Paola Florez de Sessions, Balamurugan Periaswamy
-
Patent number: 10953039Abstract: Techniques regarding treating one or more microbe infections with combination therapy are provided. For example, one or more embodiments described herein can comprise a method, which can comprise enhancing an antimicrobial activity of an antibiotic by a combination therapy. The combination therapy can comprise the antibiotic and a polycarbonate polymer functionalized with a guanidinium functional group.Type: GrantFiled: September 27, 2018Date of Patent: March 23, 2021Assignees: INTERNATIONAL BUSINESS MACHINES CORPORATION, AGENCY FOR SCIENCE, TECHNOLOGY AND RESEARCHInventors: James L. Hedrick, Simone Bianco, Mark Kunitomi, Yi Yan Yang, Xin Ding, Chuan Yang, Zhen Chang Liang, Paola Florez de Sessions, Balamurugan Periaswamy
-
Publication number: 20210043282Abstract: A computer-implemented method includes receiving genomic data associated with a plurality of genomes and identifying k-mer sets within the genomic data. The method includes constructing a k-mer subset tree according to the following process: performing iterative pairwise comparisons on the k-mer sets, wherein the iterative pairwise comparisons identify fragments with the most shared k-mers, merging the identified fragments into non-leaf nodes of the k-mer subset tree, and placing each remaining k-mer into a leaf node of the k-mer subset tree. The method includes storing the k-mer subset tree. A computer program product for data compression includes a computer readable storage medium having program instructions embodied therewith. The program instructions are executable by a computer to cause the compute to perform the foregoing method. A system includes a processor and logic. The logic is configured to perform the foregoing method.Type: ApplicationFiled: August 9, 2019Publication date: February 11, 2021Inventors: Daniel Waddington, Mark Kunitomi, Amir Abboud, Samyukta Satish Rao