Patents by Inventor Yuan-Chi Chang
Yuan-Chi Chang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11163751Abstract: A resource exploitation management system, method and a computer program product therefor. A description of new geological evidence for a geological resource is received, e.g., as one or more triples describing the evidence. Keywords in the description are matched against keywords in representations in a geological resource database. Geological relations are inferred from the descriptions and matched against predefined geological relations from the geological resource database. Consistent triple matches are merged with the geological resource database. The confidence level for merged matches is updated in the geological resource database.Type: GrantFiled: January 17, 2019Date of Patent: November 2, 2021Assignee: International Business Machines CorporationInventors: Sonia Mariette Embid Droz, Cristina Ibanez-Llano, Giorgio De Paola, Ruben Rodriguez Torrado, Akiko Suzuki, Mustafa Canim, Yuan-Chi Chang, Robert Farrell, Sharon M. Trewin
-
Patent number: 11126503Abstract: Techniques are provided for pre-filtering of join execution over multi-column range summaries and other synopses. An exemplary method comprises maintaining a synopsis for a plurality of data tables, wherein a given synopsis summarizes a set of records in a corresponding data table; and, in response to a request for a join operation for a set of the data tables: joining the synopses associated with the set of data tables to generate a joined synopsis; for joined records in the joined synopsis, obtaining corresponding records from the set of data tables as candidate records; and joining the candidate records. Two or more of the set of data tables can be distributed across a plurality of nodes and the synopses can be replicated and/or broadcasted across the plurality of nodes. Incremental updates to broadcasted and/or replicated synopses are optionally provided to at least one node.Type: GrantFiled: August 10, 2016Date of Patent: September 21, 2021Assignee: International Business Machines CorporationInventors: Yuan Chi Chang, Timothy Ray Malkemus, Mohammad Sadoghi Hamedani
-
Patent number: 11099979Abstract: A mechanism is provided to identify wall-clock time reference dependency in one or more software components of a data analytics solution. The data analytics solution is decomposed into a set of software components. A first software component of the set of software components is deployed to a first computer server and the remaining software components are deployed to a second computer server. A system clock time on the first computer server is changed to differ from the system clock of the second computer server. Based on executing a test on the data analytics solution, a determination is made of whether the first software component, is wall-clock time independent. Responsive to the test of the of the software component failing indicating that the wall-clock time of the software component is dependent of the system clock time difference, the software component is recorded as wall-clock time dependent and an administrator is notified.Type: GrantFiled: October 31, 2019Date of Patent: August 24, 2021Assignee: International Business Machines CorporationInventors: Yuan-Chi Chang, Long Vu, Timothy R. Dinger, Venkata N. Pavuluri, Lingtao Cao
-
Patent number: 11030561Abstract: Techniques for scenario planning are provided. In one example, a computer-implemented method can comprise analyzing, by a device operatively coupled to a processor, content using a topic model. The content can be associated with a defined source and is related to one or more current events. The computer-implemented method can also comprise determining, by the device, one or more portions of the analyzed content that are relevant to one or more key risk drivers using a risk driver model. The computer-implemented method can also comprise aggregating, by the device, the determined one or more portions into one or more emerging storylines based on values of one or more attributes of the topic model.Type: GrantFiled: December 14, 2017Date of Patent: June 8, 2021Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Yuan-Chi Chang, Mark D. Feblowitz, Nagui Halim, Stuart S. Horn, Edward J. Pring, Anton V. Riabov, Edward W. Shay, Shirin Sohrabi Araghi, Deepak S. Turaga, Octavian Udrea, Fang Yuan, Peter Zimmer
-
Patent number: 11023840Abstract: Techniques for scenario planning are provided. In one example, a computer-implemented method can comprise analyzing, by a device operatively coupled to a processor, content using a topic model. The content can be associated with a defined source and is related to one or more current events. The computer-implemented method can also comprise determining, by the device, one or more portions of the analyzed content that are relevant to one or more key risk drivers using a risk driver model. The computer-implemented method can also comprise aggregating, by the device, the determined one or more portions into one or more emerging storylines based on values of one or more attributes of the topic model.Type: GrantFiled: January 27, 2017Date of Patent: June 1, 2021Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Yuan-Chi Chang, Mark D. Feblowitz, Nagui Halim, Stuart S. Horn, Edward J. Pring, Anton V. Riabov, Edward W. Shay, Shirin Sohrabi Araghi, Deepak S. Turaga, Octavian Udrea, Fang Yuan, Peter Zimmer
-
Publication number: 20210158084Abstract: Mechanism are provided to select a machine learning model from an analytics model library based on ingested data. One or more pieces of clarified data are fused to provide time-correlated data tuples of data streams. One or more features are extracted from the time-correlated data tuples and scored based on a set of predetermined rules thereby generating discriminative scoring of trigger data. Utilizing the discriminative scoring of the trigger data, trigger data of a current analytics model being utilized by the data processing and one or more new analytics models from the analytics model library are scored. Responsive to the scoring of the trigger data indicating a selection of a different analytics model from the analytics model library, the current analytics model is replaced with a selected analytics model from the analytics model library such that the data processing system executes the selected analytics model.Type: ApplicationFiled: November 22, 2019Publication date: May 27, 2021Inventors: Timothy R. Dinger, Yuan-Chi Chang, Long Vu, Venkata N. Pavuluri, Lingtao Cao
-
Publication number: 20210142211Abstract: A mechanism is provided for implementing a model update mechanism to update new models in real time while avoiding data loss and system downtime. Responsive to receiving a request to update a scorer model currently being executed by an existing worker thread in the data processing system, the model update mechanism initializing a new worker thread. The model update mechanism loads an updated scorer model into the new worker thread and initializes a state transfer from the existing worker thread to the new worker thread. The model update mechanism executes the updated scorer model such that the updated scorer model scores the input data. The model update mechanism then outputs a prediction based on the updated scorer model processing of the input data.Type: ApplicationFiled: November 12, 2019Publication date: May 13, 2021Inventors: Long Vu, Yuan-Chi Chang, Timothy R. Dinger, Venkata N. Pavuluri, Lingtao Cao
-
Publication number: 20210142222Abstract: Split an input dataset into training and test datasets; the former includes a plurality of data examples, each represented as a feature vector, and having an associated true label. Split the training dataset into a plurality of training data subsets; for each, train a corresponding machine learning model to obtain a plurality of such models, and apply same to the test dataset to obtain a plurality of predicted labels and prediction scores. For each of the plurality of examples, compute an agreement metric based on a corresponding one of the associated true labels; corresponding ones of the predicted labels; and corresponding ones of the prediction scores. Based on the computed metric, select, for at least some of the true label values, appropriate ones of the data examples to be added to a regression set. Add the appropriate ones of the data examples from the test dataset to the regression set.Type: ApplicationFiled: November 13, 2019Publication date: May 13, 2021Inventors: Yuan-Chi Chang, Deepak Srinivas Turaga, Long Vu, Venkata Nagaraju Pavuluri, Saket Sathe, Rodrigue Ngueyep Tzoumpe
-
Publication number: 20210133090Abstract: A mechanism is provided to identify wall-clock time reference dependency in one or more software components of a data analytics solution. The data analytics solution is decomposed into a set of software components. A first software component of the set of software components is deployed to a first computer server and the remaining software components are deployed to a second computer server. A system clock time on the first computer server is changed to differ from the system clock of the second computer server. Based on executing a test on the data analytics solution, a determination is made of whether the first software component, is wall-clock time independent. Responsive to the test of the of the software component failing indicating that the wall-clock time of the software component is dependent of the system clock time difference, the software component is recorded as wall-clock time dependent and an administrator is notified.Type: ApplicationFiled: October 31, 2019Publication date: May 6, 2021Inventors: Yuan-Chi Chang, Long Vu, Timothy R. Dinger, Venkata N. Pavuluri, Lingtao Cao
-
Publication number: 20200327456Abstract: Embodiments for implementing enhanced ensemble model diversity and learning by a processor. One or more data sets may be created by combining one or more clusters of data points of a minority class with selected data points of a majority class. One or more ensemble models may be created from the one or more data sets using a supervised machine learning operation. An occurrence of an event may be predicted using the one or more ensemble models.Type: ApplicationFiled: April 11, 2019Publication date: October 15, 2020Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Saket SATHE, Deepak TURAGA, Charu AGGARWAL, Raju PAVULURI, Yuan-Chi CHANG
-
Publication number: 20200233851Abstract: A resource exploitation management system, method and a computer program product therefor. A description of new geological evidence for a geological resource is received, e.g., as one or more triples describing the evidence. Keywords in the description are matched against keywords in representations in a geological resource database. Geological relations are inferred from the descriptions and matched against predefined geological relations from the geological resource database. Consistent triple matches are merged with the geological resource database. The confidence level for merged matches is updated in the geological resource database.Type: ApplicationFiled: January 17, 2019Publication date: July 23, 2020Applicants: REPSOL, S. A., International Business Machines CorporationInventors: Sonia Mariette Embid Droz, Cristina Ibanez-Llano, Giorgio De Paola, Ruben Rodriguez Torrado, Akiko Suzuki, Mustafa Canim, Yuan-Chi Chang, Robert Farrell, Sharon M. Trewin
-
Publication number: 20200142879Abstract: A method, system and computer program product for index maintenance in a computer system comprising a plurality of nodes, a database, and an index to the database. In one embodiment, the method comprises, for a defined period of time, building a snapshot of selected change requests received by the nodes to change the database. After this defined period of time, a selection is made, based on specified criteria, whether to rebuild a new index to the database, or to add entries to a current index. When the selection is to rebuild a new index, the new index is rebuilt based on data in the database and in the change requests in the snapshot. When the selection is to add entries to a current index, entries are added to the current index based on data in the database and in the change requests in the snapshot.Type: ApplicationFiled: January 8, 2020Publication date: May 7, 2020Inventors: Yuan-chi Chang, Jason Crawford, Liana L. Fong, Wei Tan
-
Patent number: 10579608Abstract: A method, system and computer program product for index maintenance in a computer system comprising a plurality of nodes, a database, and an index to the database. In one embodiment, the method comprises, for a defined period of time, building a snapshot of selected change requests received by the nodes to change the database. After this defined period of time, a selection is made, based on specified criteria, whether to rebuild a new index to the database, or to add entries to a current index. When the selection is to rebuild a new index, the new index is rebuilt based on data in the database and in the change requests in the snapshot. When the selection is to add entries to a current index, entries are added to the current index based on data in the database and in the change requests in the snapshot.Type: GrantFiled: June 7, 2018Date of Patent: March 3, 2020Assignee: International Business Machines CorporationInventors: Yuan-chi Chang, Jason Crawford, Liana L. Fong, Wei Tan
-
Patent number: 10409828Abstract: Methods and apparatus are provided for incremental frequent subgraph mining on dynamic graphs. An exemplary subgraph mining method comprises maintaining a set of embeddings comprising matching embeddings of a given subgraph in an input graph; maintaining a first fringe set of subgraphs comprising subgraphs substantially on a fringe of frequent subgraphs in the input graph that satisfy a predefined support threshold; maintaining a second fringe set of subgraphs comprising subgraphs substantially on a fringe of infrequent subgraphs in the input graph that do not satisfy the predefined support threshold; for an edge addition, checking a support of the subgraphs in the second fringe set based on the set of the embeddings and searching for new embeddings created by the edge addition; and for an edge deletion, removing obsolete embeddings that comprise the deleted edge from the first fringe set based on the set of embeddings.Type: GrantFiled: July 29, 2016Date of Patent: September 10, 2019Assignee: International Business Machines CorporationInventors: Ehab Abdelhamid Mohammed Abdelhamid, Bishwaranjan Bhattacharjee, Mustafa Canim, Yuan Chi Chang, Mohammad Sadoghi Hamedani
-
Patent number: 10289707Abstract: Conventionally, in addition to indexing, a synopsis of a base table of a database is used to skip and compress data. However, scanning of the entire synopsis for all queries is required, which takes a long time when the synopsis gets significantly big in a large data warehouse. A method for efficient data skipping and compression through vertical partitioning of data is provided to eliminate the cost of synopsis storage overhead while enabling the synopsis search functionality.Type: GrantFiled: August 10, 2015Date of Patent: May 14, 2019Assignee: International Business Machines CorporationInventors: Yuan-Chi Chang, Timothy R. Malkemus, Mohammad Sadoghi Hamedani
-
Patent number: 10242406Abstract: Systems, methods and articles of manufacture are disclosed for building and executing analytics solutions. Such a solution may provide a comprehensive analytics solution (e.g., a risk assessment, fraud detection solution, dynamic operational risk evaluations, regulatory compliance assessments, etc.). The analytics solution may perform an analytics task using operational data distributed across a variety of independently created and governed data repositories in different departments of an organization. A framework is disclosed which allows a user (e.g., a risk analyst) to compose analytical tools that can access data from a variety of sources (both internal and external to an enterprise) and perform a variety of analytic functions.Type: GrantFiled: September 14, 2009Date of Patent: March 26, 2019Assignee: International Business Machines CorporationInventors: Manoj Kumar, Yuan-Chi Chang, Chung-Sheng Li
-
Patent number: 10127299Abstract: Systems, methods and articles of manufacture are disclosed for building and executing analytics solutions. Such a solution may provide a comprehensive analytics solution (e.g., a risk assessment, fraud detection solution, dynamic operational risk evaluations, regulatory compliance assessments, etc.). The analytics solution may perform an analytics task using operational data distributed across a variety of independently created and governed data repositories in different departments of an organization. A framework is disclosed which allows a user (e.g., a risk analyst) to compose analytical tools that can access data from a variety of sources (both internal and external to an enterprise) and perform a variety of analytic functions.Type: GrantFiled: September 14, 2009Date of Patent: November 13, 2018Assignee: International Business Machines CorporationInventors: Manoj Kumar, Yuan Chi Chang, Chung-Sheng Li
-
Publication number: 20180285400Abstract: A method, system and computer program product for index maintenance in a computer system comprising a plurality of nodes, a database, and an index to the database. In one embodiment, the method comprises, for a defined period of time, building a snapshot of selected change requests received by the nodes to change the database. After this defined period of time, a selection is made, based on specified criteria, whether to rebuild a new index to the database, or to add entries to a current index. When the selection is to rebuild a new index, the new index is rebuilt based on data in the database and in the change requests in the snapshot. When the selection is to add entries to a current index, entries are added to the current index based on data in the database and in the change requests in the snapshot.Type: ApplicationFiled: June 7, 2018Publication date: October 4, 2018Inventors: Yuan-chi Chang, Jason Crawford, Liana L. Fong, Wei Tan
-
Patent number: 10050866Abstract: A method for searching the top-K simple shortest paths between a specified source node and a specified target node in a graph, with graph data partitioned and distributed across a plurality of computing servers, the method including a parallel path search initialized from either one or both of the source and target nodes and traversing the graph by building likely path sequences for a match. Each computing server determines and forwards a path sequence as discovery progresses until the top-K paths are discovered.Type: GrantFiled: December 3, 2015Date of Patent: August 14, 2018Assignee: International Business Machines CorporationInventors: Yuan-Chi Chang, Mustafa Canim
-
Publication number: 20180218300Abstract: Techniques for scenario planning are provided. In one example, a computer-implemented method can comprise analyzing, by a device operatively coupled to a processor, content using a topic model. The content can be associated with a defined source and is related to one or more current events. The computer-implemented method can also comprise determining, by the device, one or more portions of the analyzed content that are relevant to one or more key risk drivers using a risk driver model. The computer-implemented method can also comprise aggregating, by the device, the determined one or more portions into one or more emerging storylines based on values of one or more attributes of the topic model.Type: ApplicationFiled: December 14, 2017Publication date: August 2, 2018Inventors: Yuan-Chi Chang, Mark D. Feblowitz, Nagui Halim, Stuart S. Horn, Edward J. Pring, Anton V. Riabov, Edward W. Shay, Shirin Sohrabi Araghi, Deepak S. Turaga, Octavian Udrea, Fang Yuan, Peter Zimmer