Patents by Inventor Tin Kam Ho

Tin Kam Ho has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 10783202
    Abstract: A method and apparatus are provided for automatically generating and processing first and second concept vector sets extracted, respectively, from a first set of concept sequences and from a second, temporally separated, concept sequences by performing a natural language processing (NLP) analysis of the first concept vector set and second concept vector set to detect changes in the corpus over time by identifying changes for one or more concepts included in the first and/or second set of concept sequences.
    Type: Grant
    Filed: February 13, 2019
    Date of Patent: September 22, 2020
    Assignee: International Business Machines Corporation
    Inventors: Tin Kam Ho, Luis A. Lastras-Montano, Oded Shmueli
  • Publication number: 20200265102
    Abstract: A method and apparatus are provided for automatically generating and processing first and second concept vector sets extracted, respectively, from a first set of concept sequences and from a second, temporally separated, concept sequences by performing a natural language processing (NLP) analysis of the first concept vector set and second concept vector set to detect changes in the corpus over time by identifying changes for one or more concepts included in the first and/or second set of concept sequences.
    Type: Application
    Filed: May 5, 2020
    Publication date: August 20, 2020
    Inventors: Tin Kam Ho, Luis A. Lastras-Montano, Oded Shmueli
  • Patent number: 10740678
    Abstract: A method and apparatus are provided for generating concept hierarchies in response to a user request to produce a hierarchy of concepts from a first concept set by performing a natural language processing (NLP) analysis comparison of the vector representations of the concepts in the first concept set to determine a similarity measure for each pair of distinct concepts Ci and Cj in the first concept set, and to construct therefrom a concept hierarchy based on one or more specified hierarchy parameters and the similarity measure for each pair of distinct concepts Ci and Cj in the first concept set.
    Type: Grant
    Filed: March 31, 2016
    Date of Patent: August 11, 2020
    Assignee: International Business Machines Corporation
    Inventors: Tin Kam Ho, Luis A. Lastras-Montano, Oded Shmueli, Livio Soares
  • Patent number: 10713323
    Abstract: A method and apparatus are provided for automatically generating and processing first and second concept vector sets extracted, respectively, from a first set of concept sequences and from a second, temporally separated, concept sequences by performing a natural language processing (NLP) analysis of the first concept vector set and second concept vector set to detect changes in the corpus over time by identifying changes for one or more concepts included in the first and/or second set of concept sequences.
    Type: Grant
    Filed: September 27, 2018
    Date of Patent: July 14, 2020
    Assignee: International Business Machines Corporation
    Inventors: Tin Kam Ho, Luis A. Lastras-Montano, Oded Shmueli
  • Patent number: 10691766
    Abstract: A method and apparatus are provided for automatically generating and processing first and second concept vector sets extracted, respectively, from a first set of concept sequences and from a second, temporally separated, concept sequences by performing a natural language processing (NLP) analysis of the first concept vector set and second concept vector set to detect changes in the corpus over time by identifying changes for one or more concepts included in the first and/or second set of concept sequences.
    Type: Grant
    Filed: September 27, 2018
    Date of Patent: June 23, 2020
    Assignee: International Business Machines Corporation
    Inventors: Tin Kam Ho, Luis A. Lastras-Montano, Oded Shmueli
  • Patent number: 10671683
    Abstract: A method and apparatus are provided for automatically generating and processing first and second concept vector sets extracted, respectively, from a first set of concept sequences and from a second, temporally separated, concept sequences by performing a natural language processing (NLP) analysis of the first concept vector set and second concept vector set to detect changes in the corpus over time by identifying changes for one or more concepts included in the first and/or second set of concept sequences.
    Type: Grant
    Filed: August 31, 2018
    Date of Patent: June 2, 2020
    Assignee: International Business Machines Corporation
    Inventors: Tin Kam Ho, Luis A. Lastras-Montano, Oded Shmueli
  • Publication number: 20200167604
    Abstract: Embodiments for creating compact example subsets for intent classification in a conversational system are provided. A set of content used for training an intent classifier is received from a conversational corpus. Entries within the set of content are separated into a first subset and a second subset, and a cross-validation operation is performed on the first and second subsets to identify a correctly labeled portion and an incorrectly labeled portion of the set of content. A reduced content used for performing a final training of the intent classifier is formed by combining a first number of the entries from the correctly labeled portion and a second number of the entries from the incorrectly labeled portion of the set of content.
    Type: Application
    Filed: November 28, 2018
    Publication date: May 28, 2020
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Abhishek SHAH, Tin Kam HO
  • Publication number: 20200159853
    Abstract: From a first attribute-value pair in a record, new data comprising a first token is created. From each token using a processor and a memory, new data including a corresponding vector is computed. From the record, a target row is selected, wherein a target attribute-value pair in the target row includes a value requiring correction. Using a similarity measure, a set of most similar rows to the target row is determined, wherein each row in the set of most similar rows to the target row has a corresponding similarity measure above a threshold similarity measure and wherein each row in the set of most similar rows includes the target attribute. From values corresponding to the target attribute in the set of most similar rows, a replacement value is determined. The value requiring correction in the target row is replaced with the replacement value.
    Type: Application
    Filed: November 20, 2018
    Publication date: May 21, 2020
    Applicant: International Business Machines Corporation
    Inventors: Rajesh Bordawekar, Tin Kam Ho
  • Publication number: 20200152174
    Abstract: A method, apparatus, and system are provided for resolving conflicts between training data conflicts by retrieving independent training data sets, each comprising a plurality of intents and end-user utterances for use in training one or more classifiers to recognize a corresponding intent from one or more of the end-user utterances, providing a first test end-user utterance associated with a first intent from the first independent training data set to the one or more classifiers to select an output intent generated by the one or more classifiers; identifying a first conflict when the first intent does not match the output intent, and automatically generating, by the system, one or more conflict resolution recommendations for display and selection by an end user to resolve the first conflict.
    Type: Application
    Filed: January 15, 2020
    Publication date: May 14, 2020
    Inventors: David Amid, David Boaz, Tin Kam Ho, Amir Kantor, Luis A. Lastras-Montano, Neil R. Mallinar
  • Publication number: 20200142959
    Abstract: Embodiments provide for driving intent expansion via anomaly detection by ranking, according to anomaly scores, a plurality of historic utterances that have been associated by a classifier with a given intent of a plurality of predefined intents; identifying a given utterance from the plurality of historic utterances having a given anomaly score greater than an anomaly threshold; in response to verifying that the given utterance is associated with the given intent, adding the given utterance to a training dataset as a positive example for the given intent; and in response to verifying that the given utterance is not associated with the given intent, adding the given utterance to the training dataset as a complement example for the given intent. A complement example for one intent may be added as a positive example for a different intent. The training dataset may be used to train or retrain an intent classifier.
    Type: Application
    Filed: November 5, 2018
    Publication date: May 7, 2020
    Inventors: Neil R. MALLINAR, Tin Kam HO
  • Publication number: 20200142960
    Abstract: Embodiments provide for class balancing for intent authoring using search via: receiving a positive example of an utterance associated with an intent, building an in-intent pool of utterances from a conversation log using the positive example in a first search query of the conversation log; adding the in-intent pool of utterances as a positive class to a training dataset; applying Boolean operators to negate the positive example to form a complement example; building an out-intent pool of utterances from the conversation log using the complement example in a first search query of the conversation log; and adding the out-intent pool of utterances as a complement class to the training dataset. The training dataset may be balanced to include a predefined ratio of positive and complement examples. The training dataset may be used to train or retrain an intent classifier.
    Type: Application
    Filed: November 5, 2018
    Publication date: May 7, 2020
    Inventors: Abhishek SHAH, Tin Kam HO
  • Patent number: 10628507
    Abstract: A method and apparatus are provided for automatically generating and processing first and second concept vector sets extracted, respectively, from a first set of concept sequences and from a second, temporally separated, concept sequences by performing a natural language processing (NLP) analysis of the first concept vector set and second concept vector set to detect changes in the corpus over time by identifying changes for one or more concepts included in the first and/or second set of concept sequences.
    Type: Grant
    Filed: August 31, 2018
    Date of Patent: April 21, 2020
    Assignee: International Business Machines Corporation
    Inventors: Tin Kam Ho, Luis A. Lastras-Montano, Oded Shmueli
  • Publication number: 20200074984
    Abstract: Embodiments for training an automated response system using weak supervision and co-training in a computing environment are provided. A plurality of conversational logs comprising interactive dialog sessions between agents and clients for a given product or service are received. A subset of the plurality of conversational logs are retrieved according to a defined criterion, and a selected set of the subset of the plurality of retrieved conversational logs are labeled by a user. The labeling is associated with a semantic scope of intent considered by the clients. A combination of propagation operations and learning algorithms using the selected set of labeled conversational logs are applied to a remaining corpus of the plurality of conversational logs to train the automated response system according to the semantic scope of intent.
    Type: Application
    Filed: August 31, 2018
    Publication date: March 5, 2020
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Tin Kam HO, Robert L. YATES, Blake MCGREGOR, Rajendra G. UGRANI, Neil R. MALLINAR, Abhishek SHAH, Ayush GUPTA
  • Patent number: 10553202
    Abstract: A method, apparatus, and system are provided for resolving conflicts between training data conflicts by retrieving independent training data sets, each comprising a plurality of intents and end-user utterances for use in training one or more classifiers to recognize a corresponding intent from one or more of the end-user utterances, providing a first test end-user utterance associated with a first intent from the first independent training data set to the one or more classifiers to select an output intent generated by the one or more classifiers; identifying a first conflict when the first intent does not match the output intent, and automatically generating, by the system, one or more conflict resolution recommendations for display and selection by an end user to resolve the first conflict.
    Type: Grant
    Filed: October 31, 2017
    Date of Patent: February 4, 2020
    Assignee: International Business Machines Corporation
    Inventors: David Amid, David Boaz, Tin Kam Ho, Amir Kantor, Luis A. Lastras-Montano, Neil R. Mallinar
  • Patent number: 10546247
    Abstract: An approach is provided in which an information handling system trains multiple classifiers using a set of training samples. The information handling system selects a leader classifier from the multiple classifiers that generates the most amount of correct decisions corresponding to the set of training samples. Next, the information handling system identifies an endorser classifier from the multiple classifiers that generates the highest proportion of correct decisions among the endorser classifier's decisions matching the leader classifier's decisions, and combines the leader classifier and the endorser classifier into a combined classifier stage. In turn, the information handling system utilizes the combined classifier stage to process inquiries and generate results.
    Type: Grant
    Filed: December 1, 2015
    Date of Patent: January 28, 2020
    Assignee: International Business Machines Corporation
    Inventors: Tin Kam Ho, Luis A. Lastras-Montano, Vinith Misra
  • Patent number: 10510336
    Abstract: A method, system, and apparatus are provided for resolving conflicts between training data conflicts by retrieving independent training data sets, each comprising a plurality of intents and end-user utterances for use in training one or more classifiers to recognize a corresponding intent from one or more of the end-user utterances, providing a first test end-user utterance associated with a first intent from the first independent training data set to the one or more classifiers to select an output intent generated by the one or more classifiers; identifying a first conflict when the first intent does not match the output intent, and automatically generating, by the system, one or more conflict resolution recommendations for display and selection by an end user to resolve the first conflict.
    Type: Grant
    Filed: June 12, 2017
    Date of Patent: December 17, 2019
    Assignee: International Business Machines Corporation
    Inventors: David Amid, David Boaz, Tin Kam Ho, Amir Kantor, Luis A. Lastras-Montano, Neil R. Mallinar
  • Publication number: 20190294953
    Abstract: A first observation window in a first time series is identified. The first observation window is preceded by a first portion of the first time series. A neural network is trained using the first portion of the first time series and the first observation window, and weights are extracted from the middle layers of the neural network. A first feature vector is generated based on the weights. A second observation window in a second time series is identified, where the second observation window is preceded by a first portion of the second time series. A second feature vector associated with the second observation window is determined. The second feature vector is based at least in part on the first set of weights. A similarity between the first and second observation windows is determined based on comparing the first feature vector and the second feature vector.
    Type: Application
    Filed: March 20, 2018
    Publication date: September 26, 2019
    Inventors: Rajesh BORDAWEKER, Tin Kam HO
  • Publication number: 20190266167
    Abstract: A method and apparatus are provided for recommending concepts from a first concept set in response to user selection of a first concept Ci by performing a natural language processing (NLP) analysis comparison of vector representations of user concepts contained in written content authored by the user and candidate concepts in a first concept set to determine a similarity measure for each candidate concept, and to select therefrom one or more of the candidate concepts for display as recommended concepts which are related to the user concepts contained in written content authored by the user based on the similarity measure between each candidate concept and each user concept.
    Type: Application
    Filed: May 8, 2019
    Publication date: August 29, 2019
    Inventors: Michele M. Franceschini, Tin Kam Ho, Luis A. Lastras-Montano, Oded Shmueli, Livio Soares
  • Patent number: 10394829
    Abstract: A method and apparatus are provided for recommending concepts from a first concept set in response to user selection of a first concept Ci by performing a natural language processing (NLP) analysis comparison of vector representations of user concepts contained in written content authored by the user and candidate concepts in a first concept set to determine a similarity measure for each candidate concept, and to select therefrom one or more of the candidate concepts for display as recommended concepts which are related to the user concepts contained in written content authored by the user based on the similarity measure between each candidate concept and each user concept.
    Type: Grant
    Filed: December 8, 2015
    Date of Patent: August 27, 2019
    Assignee: International Business Machines Corporation
    Inventors: Michele M. Franceschini, Tin Kam Ho, Luis A. Lastras-Montano, Oded Shmueli, Livio Soares
  • Publication number: 20190179862
    Abstract: A method and apparatus are provided for automatically generating and processing first and second concept vector sets extracted, respectively, from a first set of concept sequences and from a second, temporally separated, concept sequences by performing a natural language processing (NLP) analysis of the first concept vector set and second concept vector set to detect changes in the corpus over time by identifying changes for one or more concepts included in the first and/or second set of concept sequences.
    Type: Application
    Filed: February 13, 2019
    Publication date: June 13, 2019
    Inventors: Tin Kam Ho, Luis A. Lastras-Montano, Oded Shmueli