Patents by Inventor Md Faisal Mahbub Chowdhury

Md Faisal Mahbub Chowdhury has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

NATURAL LANGUAGE DATA GENERATION USING AUTOMATED KNOWLEDGE DISTILLATION TECHNIQUES

Publication number: 20240111969

Abstract: Methods, systems, and computer program products for natural language data generation using automated knowledge distillation techniques are provided herein. A computer-implemented method includes retrieving, in response to an input query, a set of passages from at least one knowledge base by processing the input query using a first set of artificial intelligence techniques; ranking at least a portion of the set of passages by processing the set of passages using a second set of artificial intelligence techniques; generating at least one natural language answer, in response to the input query, by processing a subset of the set of passages in connection with automated knowledge distillation techniques based on the ranking of the at least a portion of the set of passages; and performing automated actions based on the ranking of the at least a portion of the set of passages and/or the at least one generated natural language answer.

Type: Application

Filed: September 30, 2022

Publication date: April 4, 2024

Inventors: Michael Robert Glass, Gaetano Rossiello, Md Faisal Mahbub Chowdhury, Alfio Massimiliano Gliozzo
Unsupervised discriminative facet generation for dynamic faceted search

Patent number: 11940996

Abstract: With a computerized search engine, retrieve a plurality of electronic documents relevant to a query. Obtaining, via computerized term embedding, from the retrieved documents, a plurality of most similar terms with respect to the query. For each of the most similar terms, determine a pervasiveness score and a relevance score. Filter out, from the most similar terms, those of the terms that are pervasive, based on the pervasiveness score, those of the terms that are irrelevant, based on the relevance score, and those of the terms that are redundant. Output a top number of terms remaining in the most similar terms after the filtering, based on similarity to the query, as discriminative facets.

Type: Grant

Filed: December 26, 2020

Date of Patent: March 26, 2024

Assignee: International Business Machines Corporation

Inventor: Md Faisal Mahbub Chowdhury
Dynamic facet ranking

Patent number: 11941010

Abstract: Embodiments of the present invention provide a computer system, a computer program product, and a method that comprises analyzing a performed query by identifying a plurality of indicative markers based on a pre-stored classification database associated with the performed query; generating a plurality of facets based on the analysis of the performed query; selecting at least two facets within the generated plurality of facets by determining a quantitative similarity value between each respective facet and the plurality of identified indicative markers associated with the performed query; dynamically ranking the selected facets by prioritizing the selected facets based on a calculated overall score associated with assigned weighted values for each selected facet in the generated plurality of facets using a supervised machine learning algorithm; and displaying the dynamically ranked facets within a user interface of a computing device associated with a user.

Type: Grant

Filed: December 22, 2020

Date of Patent: March 26, 2024

Assignee: International Business Machines Corporation

Inventors: Soumitra Sarkar, Md Faisal Mahbub Chowdhury, Ruchi Mahindru, Gaetano Rossiello, Alfio Massimiliano Gliozzo, Nicolas Rodolfo Fauceglia
EXTRACTING CORRECTIVE ACTIONS FROM INFORMATION TECHNOLOGY OPERATIONS

Publication number: 20230412475

Abstract: A computer-implemented method, a computer system and a computer program product create a database of corrective actions from IT operations. The method includes obtaining a plurality of tickets from a server. A ticket in the plurality of tickets comprises text. The method also includes generating a plurality of clusters of tickets from the plurality of tickets using a machine learning clustering algorithm. In addition, the method includes identifying the corrective action in the text of the ticket using a natural language processing algorithm. The method further includes determining that the corrective action represents a successful action. lastly, the method includes storing the corrective action in the database of corrective actions, where the database associates the corrective action with the cluster of tickets.

Type: Application

Filed: June 20, 2022

Publication date: December 21, 2023

Inventors: Boris Sobolev, Yu Deng, Md Faisal Mahbub Chowdhury, Paulina Toro Isaza
Hypernym detection using strict partial order networks

Patent number: 11694035

Abstract: One embodiment of the present invention provides a method comprising receiving a text corpus, and generating a first list of triples based on the text corpus. Each triple of the first list comprises a first term representing a candidate hyponym, a second term representing a candidate hypernym, and a frequency value indicative of a number of times a hypernymy relation is observed between the candidate hyponym and the candidate hypernym in the text corpus. The method further comprises training a neural network for hypernym induction based on the first list. The trained neural network is a strict partial order network (SPON) model.

Type: Grant

Filed: June 9, 2021

Date of Patent: July 4, 2023

Assignee: International Business Machines Corporation

Inventors: Sarthak Dash, Alfio Massimiliano Gliozzo, Md Faisal Mahbub Chowdhury
Unsupervised corpus expansion using domain-specific terms

Patent number: 11615154

Abstract: In an approach to unsupervised corpus expansion using domain-specific terms, one or more computer processors retrieve one or more domain-specific terms from a corpus of text. One or more computer processors search the World Wide Web for the one or more domain-specific terms to produce a plurality of web pages associated with each of the one or more domain-specific terms. One or more computer processors determine a confidence score for each of the plurality of web pages. One or more computer processors determine the confidence score of at least one of the plurality of web pages exceeds a pre-defined threshold. One or more computer processors add the at least one of the plurality of web pages to the corpus of text.

Type: Grant

Filed: February 17, 2021

Date of Patent: March 28, 2023

Assignee: International Business Machines Corporation

Inventors: Md Faisal Mahbub Chowdhury, Alfio Massimiliano Gliozzo
Natural language text generation from a set of keywords using machine learning and templates

Patent number: 11586829

Abstract: An embodiment of the present invention generates natural language content from a set of keywords in accordance with a template. Keyword vectors representing a context for the keywords are generated. The keywords are associated with language tags, while the template includes a series of language tags indicating an arrangement for the generated natural language content. Template vectors are generated from the series of language tags of the template and represent a context for the template. Contributions from the contexts for the keywords and the template are determined based on a comparison of the series of language tags of the template with the associated language tags of the keywords. One or more words for each language tag of the template are generated to produce the natural language content based on combined contributions from the contexts for the keywords and the template.

Type: Grant

Filed: May 1, 2020

Date of Patent: February 21, 2023

Assignee: International Business Machines Corporation

Inventors: Abhijit Mishra, Md Faisal Mahbub Chowdhury, Sagar Manohar, Dan Gutfreund
Discovering ranked domain relevant terms using knowledge

Patent number: 11526688

Abstract: One embodiment of the invention provides a method for terminology ranking for use in natural language processing. The method comprises receiving a list of terms extracted from a corpus, where the list comprises a ranking of the terms based on frequencies of the terms across the corpus. The method further comprises accessing a domain ontology associated with the corpus, and re-ranking the list based on the domain ontology. The resulting re-ranked list comprises a different ranking of the terms based on relevance of the terms using knowledge from the domain ontology. The method further comprises generating clusters of terms via a trained model adapted to the corpus, and boosting a rank of at least one term of the re-ranked list based on the clusters to increase a relevance of the at least one term using knowledge from the trained model.

Type: Grant

Filed: April 16, 2020

Date of Patent: December 13, 2022

Assignee: International Business Machines Corporation

Inventors: Nandana Mihindukulasooriya, Ruchi Mahindru, Md Faisal Mahbub Chowdhury, Yu Deng, Alfio Massimiliano Gliozzo, Sarthak Dash, Nicolas Rodolfo Fauceglia, Gaetano Rossiello
Unsupervised hypernym induction machine learning

Patent number: 11507828

Abstract: Training a machine learning model such as a neural network, which can automatically extract a hypernym from unstructured data, is disclosed. A preliminary candidate list of hyponym-hypernym pairs can be parsed from the corpus. A preliminary super-term—sub-term glossary can be generated from the corpus, the preliminary super-term—sub-term glossary containing one or more super-term—sub-term pairs. A super-term—sub-term pair can be filtered from the preliminary super-term—sub-term glossary, responsive to detecting that the super-term—sub-term pair is not a candidate for hyponym-hypernym pair, to generate a final super-term—sub-term glossary. The preliminary candidate list of hyponym-hypernym pairs and the final super-term—sub-term glossary can be combined to generate a final list of hyponym-hypernym pairs. An artificial neural network can be trained using the final list of hyponym-hypernym pairs as a training data set, the artificial neural network trained to identify a hypernym given new text data.

Type: Grant

Filed: October 29, 2019

Date of Patent: November 22, 2022

Assignee: International Business Machines Corporation

Inventors: Md Faisal Mahbub Chowdhury, Robert G. Farrell, Nicholas Brady Garvan Monath, Michael Robert Glass, Md Arafat Sultan
Taxonomy generation to insert out of vocabulary terms and hypernym-hyponym pair induction

Patent number: 11501070

Abstract: An approach to induction of unknown terms into a term taxonomy graph may be provided. The approach may include analyzing a domain specific corpus to generate a term taxonomy graph using a term taxonomy graph generation model with a term knowledge base and determining which terms within the domain specific corpus are out of vocabulary (OOV) terms. The approach may also analyze the terms in the domain specific corpus with a semantic representation model to generate feature vectors of the OOV terms and terms known within the generated term taxonomy graph. The approach may determine if an OOV can be a hyponym of a term within the term taxonomy graph based on the feature vectors and insert the OOV term into the graph at the appropriate location.

Type: Grant

Filed: July 1, 2020

Date of Patent: November 15, 2022

Assignee: International Business Machines Corporation

Inventors: Feifei Pan, Md Faisal Mahbub Chowdhury, Alfio Massimiliano Gliozzo
Automated evaluation of information retrieval

Patent number: 11481404

Abstract: A method, system, and computer program product for automated evaluation of information retrieval systems are provided. The method accesses a natural language query from a set of natural language queries. The natural language query is associated with a query difficulty level. The method generates one or more natural language responses to the natural language natural language query. Each natural language response is associated with at least one facet of the plurality of facets. The method generates a set of feedback cues. A set of search results for the natural language query are returned. The set of search results include a highest ranked natural language response of the one or more natural language responses. The method generates an evaluation result for the HCIR system for the query difficulty level based on the one or more natural language responses, the set of search results, and the set of feedback cues.

Type: Grant

Filed: September 16, 2020

Date of Patent: October 25, 2022

Assignee: International Business Machines Corporation

Inventors: Md Faisal Mahbub Chowdhury, Yu Deng, Alfio Massimiliano Gliozzo, Ruchi Mahindru, Nandana Mihindukulasooriya, Nicolas Rodolfo Fauceglia, Gaetano Rossiello
Taxonomy construction via graph-based cross-domain knowledge transfer

Patent number: 11423307

Abstract: A system, computer program product, and method are provided for employing a graph neural network (GNN) to construct a taxonomy. The GNN is subject to a training cycle and an inference cycle. The training cycle encodes cross-domain terms pairs from a set of noisy cross domain pairs extracted from a corpora, and outputs a preliminary taxonomy. The inference cycle identifies candidate term pairs and selectively subjects the candidate term pairs to selective filtering to produce a system predicted taxonomy from the preliminary taxonomy.

Type: Grant

Filed: June 3, 2020

Date of Patent: August 23, 2022

Assignee: International Business Machines Corporation

Inventors: Chao Shang, Sarthak Dash, Md Faisal Mahbub Chowdhury, Alfio Massimiliano Gliozzo
UNSUPERVISED CORPUS EXPANSION USING DOMAIN-SPECIFIC TERMS

Publication number: 20220261444

Abstract: In an approach to unsupervised corpus expansion using domain-specific terms, one or more computer processors retrieve one or more domain-specific terms from a corpus of text. One or more computer processors search the World Wide Web for the one or more domain-specific terms to produce a plurality of web pages associated with each of the one or more domain-specific terms. One or more computer processors determine a confidence score for each of the plurality of web pages. One or more computer processors determine the confidence score of at least one of the plurality of web pages exceeds a pre-defined threshold. One or more computer processors add the at least one of the plurality of web pages to the corpus of text.

Type: Application

Filed: February 17, 2021

Publication date: August 18, 2022

Inventors: Md Faisal Mahbub Chowdhury, Alfio Massimiliano Gliozzo
UNSUPERVISED DISCRIMINATIVE FACET GENERATION FOR DYNAMIC FACETED SEARCH

Publication number: 20220207030

Abstract: With a computerized search engine, retrieve a plurality of electronic documents relevant to a query. Obtaining, via computerized term embedding, from the retrieved documents, a plurality of most similar terms with respect to the query. For each of the most similar terms, determine a pervasiveness score and a relevance score. Filter out, from the most similar terms, those of the terms that are pervasive, based on the pervasiveness score, those of the terms that are irrelevant, based on the relevance score, and those of the terms that are redundant. Output a top number of terms remaining in the most similar terms after the filtering, based on similarity to the query, as discriminative facets.

Type: Application

Filed: December 26, 2020

Publication date: June 30, 2022

Inventor: Md Faisal Mahbub Chowdhury
OPTIMISTIC FACET SET SELECTION FOR DYNAMIC FACETED SEARCH

Publication number: 20220207087

Abstract: Determining an initial rank and a probability of relevance of each of a retrieved plurality of electronic documents relevant to a query. For each of a plurality of candidate facets, determine a revised rank for each of the retrieved plurality of electronic documents relevant to the query. Selecting, for each of the retrieved plurality of electronic documents relevant to the query, a minimum rank from among the initial rank and the revised rank for each of the plurality of candidate facets. Determine an expected discounted cumulative gain based on the probability of relevance and the minimum rank for each of the retrieved plurality of electronic documents relevant to the query. Select a set of optimistic facets based on maximizing the expected discounted cumulative gain.

Type: Application

Filed: December 26, 2020

Publication date: June 30, 2022

Inventors: Michael Robert Glass, Md Faisal Mahbub Chowdhury, Alfio Massimiliano Gliozzo
DYNAMIC FACET RANKING

Publication number: 20220197916

Abstract: Embodiments of the present invention provide a computer system, a computer program product, and a method that comprises analyzing a performed query by identifying a plurality of indicative markers based on a pre-stored classification database associated with the performed query; generating a plurality of facets based on the analysis of the performed query; selecting at least two facets within the generated plurality of facets by determining a quantitative similarity value between each respective facet and the plurality of identified indicative markers associated with the performed query; dynamically ranking the selected facets by prioritizing the selected facets based on a calculated overall score associated with assigned weighted values for each selected facet in the generated plurality of facets using a supervised machine learning algorithm; and displaying the dynamically ranked facets within a user interface of a computing device associated with a user.

Type: Application

Filed: December 22, 2020

Publication date: June 23, 2022

Inventors: Soumitra Sarkar, Md Faisal Mahbub Chowdhury, Ruchi Mahindru, Gaetano Rossiello, Alfio Massimiliano Gliozzo, Nicolas Rodolfo Fauceglia
AUTOMATED EVALUATION OF INFORMATION RETRIEVAL

Publication number: 20220083559

Abstract: A method, system, and computer program product for automated evaluation of information retrieval systems are provided. The method accesses a natural language query from a set of natural language queries. The natural language query is associated with a query difficulty level. The method generates one or more natural language responses to the natural language natural language query. Each natural language response is associated with at least one facet of the plurality of facets. The method generates a set of feedback cues. A set of search results for the natural language query are returned. The set of search results include a highest ranked natural language response of the one or more natural language responses. The method generates an evaluation result for the HCIR system for the query difficulty level based on the one or more natural language responses, the set of search results, and the set of feedback cues.

Type: Application

Filed: September 16, 2020

Publication date: March 17, 2022

Inventors: Md Faisal Mahbub Chowdhury, Yu Deng, Alfio Massimiliano Gliozzo, Ruchi Mahindru, NANDANA MIHINDUKULASOORIYA, Nicolas Rodolfo Fauceglia, Gaetano Rossiello
Dynamic faceted search on a document corpus

Patent number: 11275796

Abstract: A query-focused faceted structure generation method, system, and computer program product for generating a query-focused faceted structure from a taxonomy for searching a document collection, including ingesting a document corpus, generating a vector space representation of a query and instances from a taxonomy of the document corpus, and producing a dynamic structure of a relevant facet categories and facet values using a two-vector space representation from the generated vector space representation.

Type: Grant

Filed: April 30, 2019

Date of Patent: March 15, 2022

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Biying Kong, Nidhi Rajshree, Alfio Massimiliano Gliozzo, Nicolas Rodolfo Fauceglia, Robert G. Farrell, Md Faisal Mahbub Chowdhury, Anish Mathur
KNOWLEDGE INDUCTION USING CORPUS EXPANSION

Publication number: 20220067539

Abstract: A method, a computer program product, and a computer system induce knowledge from a knowledge graph. The method includes receiving a request indicative of a domain. The method includes determining a corpus corresponding to the domain and determining a quality of the corpus in generating the knowledge graph relative to a quality threshold. If the quality threshold is not met, the method includes determining a candidate expansion corpus to incorporate further data therefrom into the corpus relative to an expansion threshold. If the expansion threshold is met, the method includes generating an expanded corpus by expanding the corpus with the further data. The method includes generating the knowledge graph based on the expanded corpus from which the knowledge is induced and generating a response to the request based on the knowledge graph.

Type: Application

Filed: September 1, 2020

Publication date: March 3, 2022

Inventors: NANDANA MIHINDUKULASOORIYA, Md Faisal Mahbub Chowdhury, Yu Deng, Ruchi Mahindru, Nicolas Rodolfo Fauceglia, Alfio Massimiliano Gliozzo
HYPERNYM-HYPONYM PAIR INDUCTION

Publication number: 20220004711

Abstract: An approach to induction of unknown terms into a term taxonomy graph may be provided. The approach may include analyzing a domain specific corpus to generate a term taxonomy graph using a term taxonomy graph generation model with a term knowledge base and determining which terms within the domain specific corpus are out of vocabulary (OOV) terms. The approach may also analyze the terms in the domain specific corpus with a semantic representation model to generate feature vectors of the OOV terms and terms known within the generated term taxonomy graph. The approach may determine if an OOV can be a hyponym of a term within the term taxonomy graph based on the feature vectors and insert the OOV term into the graph at the appropriate location.

Type: Application

Filed: July 1, 2020

Publication date: January 6, 2022

Inventors: Feifei Pan, Md Faisal Mahbub Chowdhury, Alfio Massimiliano Gliozzo

1 2 next