Dictionary Building, Modification, Or Prioritization Patents (Class 704/10)

Contextual assistant using mouse pointing or touch cues

Patent number: 12346634

Abstract: A method for a contextual assistant to use mouse pointing or touch cues includes receiving audio data corresponding to a query spoken by a user, receiving, in a graphical user interface displayed on a screen, a user input indication indicating a spatial input applied at a first location on the screen, and processing the audio data to determine a transcription of the query. The method also includes performing query interpretation on the transcription to determine that the query is referring to an object displayed on the screen without uniquely identifying the object, and requesting information about the object. The method further includes disambiguating, using the user input indication indicating the spatial input applied at the first location on the screen, the query to uniquely identify the object that the query is referring to, obtaining the information about the object requested by the query, and providing a response to the query.

Type: Grant

Filed: June 8, 2023

Date of Patent: July 1, 2025

Assignee: Google LLC

Inventor: Dongeek Shin
Rapid and efficient case opening from negative news

Patent number: 12328201

Abstract: Disclosed is an approach in which news alerts are scanned in real-time or near real-time, relevant alerts identified through a topic extraction model, and associated actors identified through an entity extraction model. An entity resolution model may be applied to determine which actors are clients. The topic extraction, entity extraction, and/or entity resolution models may apply, for example, natural language processing models. The alert may be enriched by being packaged with client and transactional data to generate an enriched alert. A predictive model may be applied to the enriched alert to identify events with a high probability of law enforcement referral, and the enriched alert may be automatically transmitted to certain identified devices. The predictive model is trained using a combination of news alerts and data on clients and transactions, yielding enhanced predictions.

Type: Grant

Filed: August 2, 2023

Date of Patent: June 10, 2025

Assignee: Wells Fargo Bank, N.A.

Inventors: Angelica Bullard, Mauricio Flores, Ian Kloville, Jeremy Norvell, Sameer Shetty, Michael Traverso
System and methods for annotating offensive content

Patent number: 12316592

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage medium, to facilitate interception of messages that include offensive content. In one aspect, a method includes actions of receiving input on a user device that includes message content, determining, on the user device, whether the message content includes offensive content, and in response to determining, on the user device, that the message content includes offensive content, generating an alert message for display on the user device that provides an indication that the message includes offensive content.

Type: Grant

Filed: March 21, 2024

Date of Patent: May 27, 2025

Inventor: Trisha N. Prabhu
Discovering new question and answer knowledge from conversation

Patent number: 12299589

Abstract: New question and answer (QA) pairs can be automatically discovered from a corpus of data such as online chats and conversations. Newly discovered QA pairs can augment QA database, which can be used by a computer processor or device, e.g., by a chatbot, an automated machine, and/or another. Existing QA knowledge can be used to learn the structures of QA knowledge distribution in conversations, and new QA knowledge can be automatically learned through the structure of learned QA knowledge distribution in conversations. The structure of learned QA knowledge distribution can be refined by adding more semantics based on labeled data.

Type: Grant

Filed: June 30, 2021

Date of Patent: May 13, 2025

Assignee: International Business Machines Corporation

Inventors: Lijun Mei, Qi Cheng Li, Xue Han, Xin Zhou, Zi Ming Huang, Ya Bin Dang
Machine content generation

Patent number: 12299385

Abstract: Computerized systems and methods are disclosed to generate a document from one or more first and second text prompts, generating one or more context-sensitive text suggestions using a transformer with an encoder on the text prompts and a decoder that produces a text expansion to provide the context-sensitive text suggestions based on the one or more first and second text prompts by applying generative artificial intelligence with token biased weights for zero-shot, one-shot or some-shot generation of the artificial intelligence context-sensitive text suggestions from the one or more first and second text prompts.

Type: Grant

Filed: July 11, 2023

Date of Patent: May 13, 2025

Inventor: Bao Tran
Technologies for using machine learning to determine product certification eligibility

Patent number: 12293375

Abstract: Systems and methods for using machine learning to assess eligibility for product certification are disclosed. According to certain aspects, a server computer may train a set of machine learning models using a set of training data, where the set of machine learning models may be specific to products and certifications for the products. The server computer may access product specifications associated with a set of products sought to be certified, and may analyze the product specifications using the an appropriate machine learning model(s), the output of which may indicate whether the set of products is eligible for certification, the set of products is ineligible for certification, or the product specifications need further review.

Type: Grant

Filed: October 30, 2020

Date of Patent: May 6, 2025

Assignee: UL LLC

Inventors: Scot Webster, Mahmood Tabaddor, John C. Jones, Mario Xerri
System and method for generating training data for machine learning classifier

Patent number: 12282861

Abstract: Systems and methods are provided for generating training data for a machine-learning classifier. A knowledge representation synthesized based on an object of interest is used to assign labels to content items. The labeled content items can be used as training data for training a machine learning classifier. The labeled content items can also be used as validation data for the classifier.

Type: Grant

Filed: December 22, 2022

Date of Patent: April 22, 2025

Inventors: Mathew Whitney Wilson, Ihab Ilyas, Peter J. Sweeney
Text mining based on document structure information extraction

Patent number: 12277389

Abstract: Frequent sequences extracted from a set of documents according to a common rule are obtained. Based on comparing occurrence frequencies of various sequences, confidence of the first frequent sequence being a label expression representing a document part in a target document is evaluated. Keywords are extracted from the target document based on evaluation of the confidence.

Type: Grant

Filed: May 10, 2021

Date of Patent: April 15, 2025

Assignee: International Business Machines Corporation

Inventors: Tetsuya Nasukawa, Shoko Suzuki, Daisuke Takuma, Issei Yoshida
Systems, methods, and apparatuses for implementing an adaptive and scalable AI-driven personalized learning platform

Patent number: 12272265

Abstract: Processing circuitry of a learning platform may be configured to maintain a graph database describing student learners. Processing circuitry may obtain new student learner data and load the data into the graph database. Processing circuitry may receive an engagement or interaction from the new student learner and responsively extract new learnings about the new student learner which are loaded into the graph database. Processing circuitry may receive an inquiry from the new student learner and in response, extract the new student learner data and the new learnings from the graph database and contextualize, using a large language model, a learning unit from the educational content provided by the learning platform as a response to the inquiry using the new student learner data and the new learnings. Processing circuitry may further return the learning unit contextualized by the large language model to the new student learner.

Type: Grant

Filed: May 10, 2024

Date of Patent: April 8, 2025

Assignee: Arizona Board of Regents on Behalf of Arizona State University

Inventor: Mark Naufel
Systems and methods for detecting offensive content in a single responsive message

Patent number: 12273312

Abstract: Methods, systems, and computer programs for identifying offensive content. A method can include for each particular responsive message received in response to an initial message: providing the particular responsive message as an input to a machine learning model trained to predict a likelihood that an initial message includes offensive content based on processing of a responsive message received responsive to the initial message, processing the content of the particular responsive message through the machine learning model to generate output data indicating a likelihood that the initial message includes offensive content, and storing the generated output data.

Type: Grant

Filed: March 29, 2024

Date of Patent: April 8, 2025

Inventor: Trisha N. Prabhu
Generating synthetic code-switched data for training language models

Patent number: 12242820

Abstract: Techniques for training a language model for code switching content are disclosed. Such techniques include, in some embodiments, generating a dataset, which includes identifying one or more portions within textual content in a first language, the identified one or more portions each including one or more of offensive content or non-offensive content; translating the identified one or more salient portions to a second language; and reintegrating the translated one or more portions into the textual content to generate code-switched textual content. In some cases, the textual content in the first language includes offensive content and non-offensive content, the identified one or more portions include the offensive content, and the translated one or more portions include a translated version of the offensive content. In some embodiments, the code-switched textual content is at least part of a synthetic dataset usable to train a language model, such as a multilingual classification model.

Type: Grant

Filed: February 17, 2022

Date of Patent: March 4, 2025

Assignee: Adobe Inc.

Inventors: Cesa Salaam, Seunghyun Yoon, Trung Huu Bui, Franck Dernoncourt
Grouping similar words in a language model

Patent number: 12236946

Abstract: Systems and methods are provided for performing automated speech recognition. The systems and methods access a LM that includes a plurality of n-grams, each of the plurality of n-grams comprising a respective sequence of words and corresponding LM score and receive a list of words associated with a group classification, each word in the list of words being associated with a respective weight. The systems and method compute, based on the LM scores of the plurality of n-grams, a probability that a given word in the list of words associated with the group classification appears in an n-gram in the LM comprising an individual sequence of words and adds one or more new n-grams to the LM comprising one or more words in the list of words in combination with the individual sequence of words and associated with a particular LM score based on the computed probability.

Type: Grant

Filed: August 22, 2022

Date of Patent: February 25, 2025

Inventors: Jacob Assa, Alan Bekker, Zach Moshe
Alternating positioning of primary text

Patent number: 12229492

Abstract: Methods and systems of displaying substring pairs where visual characteristics delineate adjacent substring pairs from each other, specifically in the case of at least one of the primary text string and the secondary text string, the placement of the first substring alternates position on an electronic display above and below the second substring. The method may comprise receiving a plurality of the primary substrings, a plurality of the secondary substrings, and a plurality of visual characteristics, displaying, on an electronic display, the primary substrings and the secondary substrings arranged into substring pairs, and one of the visual characteristics in each of the correspondence areas. Additional desired visual effects may be achieved through the use of specific demarcations, demarcation placements, and substring modifications.

Type: Grant

Filed: August 26, 2024

Date of Patent: February 18, 2025

Assignee: Read Twogether Ltd

Inventors: David Allen Fesbinder, Alexander Postnikov
Data source evaluation platform for improved generation of supervised learning models

Patent number: 12216740

Abstract: Aspects of the disclosure relate to evaluating sources of training data for model generation. A computing platform may receive, from one or more data sources, a labelled data set. The computing platform may apply, to the labelled data set, an unsupervised learning algorithm, resulting in a clustered data set. The computing platform may compare, for each data point in the labelled data set, corresponding clustering information and labelling information to identify discrepancies. The computing platform may flag, for data points with identified discrepancies between the clustering information and labelling information, a labelling error. The computing platform may grade, based on the flagged labelling errors, each of the one or more data sources. Using remaining data of the labelled data set, not flagged with labelling errors, the computing platform may train a supervised learning model by weighting the remaining data based on: a corresponding data source and its grade.

Type: Grant

Filed: January 8, 2021

Date of Patent: February 4, 2025

Assignee: Bank of America Corporation

Inventors: Maharaj Mukherjee, Utkarsh Raj
Character input device, character input method, and computer-readable storage medium storing a character input program for searching a dictionary to determine registration of a character string

Patent number: 12190058

Abstract: A character input device according to one or more embodiments may include: an output unit configured to output a first character string to an application program having a suggestion function; a detector configured to detect selection of a second character string that is presented in correspondence with the first character string by the application program; and a registration unit configured to register, in a dictionary database, the second character string that is detected, by the detector, to have been selected.

Type: Grant

Filed: December 22, 2022

Date of Patent: January 7, 2025

Assignee: OMRON Corporation

Inventor: Yui Nonomura
Query expansion using a graph of question and answer vocabulary

Patent number: 12189668

Abstract: A method and/or system for query expansion may include: providing a set of training data in a given domain in the form of training question texts and training answer texts, identifying disjoint answer words in the training answer text that do not occur in the associated training question text, generating a graph of question word nodes and answer word nodes generated from the set of training data for the given domain in the form of the training question texts and the training answer texts, wherein edges are provided between a disjoint pair of a question word node for a question word in a training question and an answer word node for a disjoint answer word in an associated training answer, and applying spreading activation through the graph to result in a top n most highly activated nodes that are used as candidate words for expansion of a user query input.

Type: Grant

Filed: April 20, 2022

Date of Patent: January 7, 2025

Assignee: International Business Machines Corporation

Inventors: Seamus R. McAteer, Ahmed M. M. R. Salem, Daniel J. McCloskey, Mikhail Sogrin
Password reset for multi-domain environment

Patent number: 12164623

Abstract: A computer implemented method is used for changing a password in a multi-domain environment. The method includes obtaining a private key and a public key from a security card at a user device in a user domain, transferring the public key to a controller in a secure domain, requesting a password change, receiving a public key encrypted new password from the secure domain, and decrypting the new password using the private key.

Type: Grant

Filed: April 1, 2021

Date of Patent: December 10, 2024

Assignee: Microsoft Technology Licensing, LLC

Inventors: Kameshwar Jayaraman, Nicholas Elliot Claunch, Priyanshu Kumar Jha, Shankaranand Arunachalam
Regular expression matching in dictionary-encoded strings

Patent number: 12164574

Abstract: Techniques are described for generating an encoded-string automaton for a regex pattern from a decoded-string automaton of the regex pattern. In an embodiment, the process obtains a decoded-string automaton of the regex pattern and applies unique decoded string value(s) from the dictionary of the encoding. When applied at a selected state in the decoded-string automaton, the application may yield a transition to at least one target state in the decoded-string automaton for a unique dictionary value. Such a transition generates a transition in the encoded-string automaton from an encoded state corresponding to the selected state in the decoded-string automaton to a target state in the encoded-string automaton corresponding to the target state in the decoded-string automaton. The generated transition in the encoded-string automaton is conditioned on the token of the unique decoded string value in the dictionary.

Type: Grant

Filed: November 29, 2022

Date of Patent: December 10, 2024

Assignee: Oracle International Corporation

Inventors: Giacomo Fabris, Aleksei Kashuba, Alexander Ulrich
Distributed endpoint security architecture enabled by artificial intelligence

Patent number: 12155692

Abstract: A system for protecting an endpoint device of a user includes a web interface module that identifies a present URL visited by the user and target URLs to which navigation is available. A password management module installed on the endpoint device stores multiple entries. One entry includes a username, a password, and a login URL. The password management module selectively supplies credentials to the web interface module, including supplying the password to the web interface module in response to the web interface module identifying the login URL as the present URL. A URL analysis module evaluates the target URLs to classify each of the target URLs as either safe or suspicious and initiates a warning to the user in response to one of the target URLs being classified as suspicious. The URL analysis module performs the classification based in part on login URLs stored by the password management module.

Type: Grant

Filed: June 22, 2021

Date of Patent: November 26, 2024

Assignee: AADYA SECURITY, INC.

Inventors: Raffaele Mauro-Aniello Mautone, Chad Sterling Priest
Method for mathematical language processing via tree embeddings

Patent number: 12147407

Abstract: A method for processing formulae includes encoding a formula by: training, with a server, a model by using a machine learning algorithm with a data set that includes a plurality of formulae; transforming, with a processor, a first formula into a tree format using the trained model; converting, with the processor, the tree format of the first formula into a plurality of lists; and encoding, with the processor, the plurality of lists into a fixed dimension vector by leveraging a stacked attention module; and generating one or more formula candidates by: obtaining, with the processor, input information; and generating, with the processor, one or more second formula candidates based on input information by using the stacked attention module with a tree beam search algorithm.

Type: Grant

Filed: April 21, 2023

Date of Patent: November 19, 2024

Assignees: William Marsh Rice University, University of Massachusetts

Inventors: Zichao Wang, Shiting Lan, Richard G. Baraniuk
Method for editing text information

Patent number: 12141521

Abstract: Disclosed is a method of correcting text information. The method can be performed by a computing device. The method includes obtaining the text information. The method includes determining problem text within the text information. The method includes generating alternative text to correct the problem text by utilizing expanded text associated with the problem text or non-text type information associated with the problem text. The method includes providing information about the alternative text for correcting the problem text.

Type: Grant

Filed: April 11, 2024

Date of Patent: November 12, 2024

Assignee: ActionPower Corp.

Inventors: Jihwa Lee, Jaeyup Song
Search queries matching system

Patent number: 12124438

Abstract: A computer implemented method for managing search queries. The method uses a number of processor units to receive data records. The number of processor units identify a set of data record pairs from the data records. The number of processor units generates a list of long data records based on frequencies of occurrences for long data records associated with each short data record in the set of data record pairs. The number of processor units receive a search query comprises a number of short data records in the set of data record pairs. The number of processor units identify a number of long data records for each short data record in the number of short data records using the lists of long data records for short data records. The number of processor units expand the search query by adding the number of long data records to the search query.

Type: Grant

Filed: June 14, 2023

Date of Patent: October 22, 2024

Assignee: S&P Gloal Inc.

Inventor: Craig William Schmidt
Parameter learning apparatus, parameter learning method, and computer readable recording medium

Patent number: 12118314

Abstract: A parameter learning apparatus 100 extracts one entity in a document and a related text representation as a one-term document fact, outputs a one-term partial predicate fact including only the one entity using a predicate fact that includes entities and a predicate, calculates a first one-term score indicating the degree of establishment of the one-term document fact using a one-term partial predicate feature vector, a one-term text representation feature vector, and a one-term entity feature vector that are calculated from parameters, calculates a second one-term score with respect to a combination of one entity and a predicate or a text representation that is not extracted as the one-term partial predicate fact, updates the parameters such that the first one-term score is higher than the second one-term score, and calculates a score indicating the degree of establishment of the predicate fact and a score indicating the degree of establishment of a combination of entities and a predicate that is not obtained

Type: Grant

Filed: May 31, 2019

Date of Patent: October 15, 2024

Assignee: NEC CORPORATION

Inventors: Kosuke Akimoto, Takuya Hiraoka, Kunihiko Sadamasa
Systems for real-time intelligent haptic correction to typing errors and methods thereof

Patent number: 12118209

Abstract: Systems and methods of the present disclosure enable context-aware haptic error notifications. The systems and methods include a processor to receive input segments into a software application from a character input component and determine a destination. A context identification model predicts a context classification of the input segments based at least in part on the software application and the destination. Potential errors are determined in the input segments based on the context classification. An error characterization machine learning model determines an error type classification and an error severity score associated with each potential error and a haptic feedback pattern is determined for each potential error based on the error type classification and the error severity score of each potential error of the one or more potential errors. And a haptic event latency is determined based on the error type classification and the error severity score of each potential error.

Type: Grant

Filed: November 28, 2023

Date of Patent: October 15, 2024

Assignee: Capital One Services, LLC

Inventors: Abdelkader M'hamed Benkreira, Nimma Bhusri, Tyler Maiman
Vocabulary generation for neural machine translation

Patent number: 12112139

Abstract: Implementations of the present disclosure relate to methods, devices, and computer program products for generating a destination vocabulary from a source vocabulary. In a method, a group of candidate vocabularies are determined from the source vocabulary based on a corpus, a size of a candidate vocabulary in the group of candidate vocabularies being different from a size of the source vocabulary. A group of marginal scores are obtained for the group of candidate vocabularies, respectively, a marginal score in the group of marginal scores being obtained for the candidate vocabulary based on a corpus entropy of the candidate vocabulary and a size of the candidate vocabulary. The destination vocabulary is selected from the group of candidate vocabularies based on the group of marginal scores.

Type: Grant

Filed: November 24, 2021

Date of Patent: October 8, 2024

Assignee: Beijing Youzhuju Network Technology Co. Ltd.

Inventors: Jingjing Xu, Chun Gan, Hao Zhou, Lei Li, Zaixiang Zheng
Sentiment analysis

Patent number: 12086544

Abstract: Polarity classifications of writing samples are obtained by sentiment analysis operations including embedding each word of a writing sample into a word vector based on surrounding words, extracting one or more features of the writing sample, applying a feature learning function to the one or more features, estimating a polarity of the writing sample based on output from the word learning function and output from the feature learning function, and training the word learning function and the feature learning function based on a loss function relating the estimated polarity to the word vector to produce a model for writing sample polarity classification.

Type: Grant

Filed: December 22, 2021

Date of Patent: September 10, 2024

Assignee: RAKUTEN MOBILE, INC.

Inventors: Petrit Nahi, Madhukiran Medithe
Maintenance of a data glossary

Patent number: 12050866

Abstract: A system may receive a data glossary comprising a list of terms. The system may then measure a usage dimension for a set of the terms from the list of terms. The system may select a candidate term from the set based on the usage dimension and perform a maintenance action on the candidate terms.

Type: Grant

Filed: December 13, 2020

Date of Patent: July 30, 2024

Assignee: International Business Machines Corporation

Inventors: Albert Maier, Michael Baessler, Peter Gerstl, Oliver Suhre, Thomas Schwarz
Semantic duplicate normalization and standardization

Patent number: 12050873

Abstract: Systems, methods, and computer-readable media are disclosed for list attribute normalization and standardization for creation of a controlled vocabulary. A vocabulary set comprising a plurality of vocabulary term may be received. For each vocabulary term, semantic duplicates may be identified. The semantic duplicates may be identified by analyzing semantics, syntactics, or phonetics of the vocabulary terms. Semantic chains may be formed from each vocabulary term and the corresponding semantic duplicates. The terms in each semantic chain may be ranked to determine a most probable vocabulary term. The most probable vocabulary term may then replace the semantic chain. The most probable vocabulary term across all semantic chains from the vocabulary set may form the controlled vocabulary.

Type: Grant

Filed: October 28, 2021

Date of Patent: July 30, 2024

Assignee: SAP SE

Inventor: Hans-Martin Ramsl
Method and system for classifying word as obscene word

Patent number: 12026465

Abstract: There is disclosed a method and system for classifying a word as an obscene word, the method comprising, at a training phrase: acquiring a first word, the first word corresponding to a given obscene word; generating a first set of misspelled words, the first set of misspelled words comprising a plurality of misspelled variations of the first word; generating a training pairs, the training pairs comprising: a set of positive training pairs comprising the first word paired with each misspelled variations of the first word; training a machine learning algorithm, the training comprising: determining, for each training pairs, a set of features representative of a property of the training pairs; generating an inferred function based on the set of features, the inferred function being configured to assign, in use, an indecency score, the decency score being indicative of a likelihood of the word being obscene.

Type: Grant

Filed: December 17, 2021

Date of Patent: July 2, 2024

Assignee: Direct Cursus Technology L.L.C

Inventor: Mikhail Borisovich Libman
Systems and methods for processing nuances in natural language

Patent number: 12001806

Abstract: Systems, apparatuses, methods, and computer program products are disclosed for processing electronic information indicative of natural language. An example method includes generating a natural language attribute data set based on a first word in a sequence of words provided by a user, a first natural language of the word, and one or more exogenous events. The example method further includes generating a natural language transliteration data set based on the natural language attribute data set. The example method further includes generating a translation of the first word in a second natural language based on the natural language transliteration data set. The example method further includes generating, using machine learning and based at least in part on the translation, a response signal for transmission to a client device.

Type: Grant

Filed: December 23, 2021

Date of Patent: June 4, 2024

Assignee: Wells Fargo Bank, N.A.

Inventors: Romica Juneja, Abhijit Rao
Using communicative discourse trees to detect distributed incompetence

Patent number: 12001804

Abstract: Techniques are disclosed for detecting distributed incompetence in text of a conversation using communicative discourse trees and then inserting an automatic response from an autonomous agent (chatbot) or other entity. For example, a computing system generates a communicative discourse tree from utterances from multiple agents to a user. The computing system obtains a prediction of whether the text includes distributed incompetence by applying a trained predictive model to the communicative discourse tree. Based on the detection, the computing system generates an updated response to a user device.

Type: Grant

Filed: May 19, 2022

Date of Patent: June 4, 2024

Assignee: Oracle International Corporation

Inventor: Boris Galitsky
Adjusting explainable rules using an exploration framework

Patent number: 11989515

Abstract: A computer-implemented method according to one embodiment includes receiving a plurality of linguistic expressions (LEs); changing one or more conditions of the plurality of linguistic expressions to create an updated plurality of linguistic expressions, utilizing a visual exploration framework (VEF) that visually presents to a user each of the plurality of linguistic expressions; and including the updated plurality of linguistic expressions in a model used to classify input sentences. According to another embodiment, a computer-implemented method includes receiving (i) a set of linguistic expressions (LEs) and (ii) a set of labeled data as input, where the LEs are logical combinations of predicates learned from the labeled data, and each data point in the labeled data comprises a piece of text and ground-truth labels; presenting the LEs in a visual exploration framework; and allowing a user to sort, filter, subset, and select LEs based on different criteria, utilizing the framework.

Type: Grant

Filed: February 28, 2020

Date of Patent: May 21, 2024

Assignee: International Business Machines Corporation

Inventors: Prithviraj Sen, Yiwei Yang, Yunyao Li, Eser Kandogan
Learning method using machine learning to generate correct sentences, extraction method, and information processing apparatus

Patent number: 11977975

Abstract: A learning method to be executed by a computer, the learning method includes when a first input sentence in which a predetermined target is represented by a first named entity is input to a first machine learning model, learning a first parameter of the first machine learning model such that a value output from the first machine learning model approaches correct answer information corresponding to the first input sentence; and when an intermediate representation generated when the first input sentence is input to the first machine learning model and a second input sentence in which the predetermined target is represented by a second named entity are input to a second machine learning model, learning the first parameter and a second parameter of the second machine learning model such that a value output from the second machine learning model approaches correct answer information corresponding to the second input sentence.

Type: Grant

Filed: February 26, 2020

Date of Patent: May 7, 2024

Assignee: FUJITSU LIMITED

Inventor: Tomoya Iwakura
Method and apparatus of NER-oriented chinese clinical text data augmentation

Patent number: 11972214

Abstract: Disclosed is a method and an apparatus NER-orientated Chinese clinical text data augmentation, and unannotated data and annotated data of label linearization processing through data preprocessing. A concealed part is predicted based on retained information by using the unannotated data and concealing part of information in text, and meanwhile an entity word-level discrimination task is introduced for pre-training of a span-based language model; and a plurality of decoding mechanisms are introduced in a fine-tune stage, a relationship between a text vector and text data is obtained based on the pre-trained span-based language model, linearized data with entity labels is converted into the text vector, and text generation is performed through forward decoding and reverse decoding in a prediction stage of a text generation model to obtain enhanced data with annotation information.

Type: Grant

Filed: July 6, 2023

Date of Patent: April 30, 2024

Assignee: ZHEJIANG LAB

Inventors: Jingsong Li, Lixin Shi, Ran Xin, Zongfeng Yang, Yu Tian, Tianshu Zhou
Generating replacement sentences for a particular sentiment

Patent number: 11966703

Abstract: Certain aspects of the present disclosure provide techniques for generating a replacement sentence with the same or similar meaning but a different sentiment than an input sentence. The method generally includes receiving a request for a replacement sentence and iteratively determining a next word of the replacement sentence word-by-word based on an input sentence. Iteratively determining the next word generally includes evaluating a set of words of the input sentence using a language model configured to output candidate sentences and evaluating the candidate sentences using a sentiment model configured to output sentiment scores for the candidates sentences. Iteratively determining the next word further includes calculating convex combinations for the candidate sentences and selecting an ending word of one of the candidate sentences as the next word of the replacement sentence. The method further includes transmitting the replacement sentence in response to the request for the replacement sentence.

Type: Grant

Filed: December 14, 2022

Date of Patent: April 23, 2024

Assignee: Intuit Inc.

Inventors: Manav Kohli, Cynthia Joann Osmon, Nicholas Roberts
Perspective annotation for numerical representations

Patent number: 11947903

Abstract: Various techniques for providing perspective annotation to numerical representations are disclosed herein. For example, a method includes detecting a numerical representation in an original content and retrieving one or more perspectives from a database based on the detected numerical representation. The one or more perspectives individually include a restatement of information contained in the numerical representation. The method can also include annotating the original content with the retrieved one or more perspectives to form an annotated content.

Type: Grant

Filed: October 22, 2018

Date of Patent: April 2, 2024

Assignee: Microsoft Technology Licensing, LLC

Inventors: Jake Hofman, Miroslav Dudik, Daniel Goldstein
Data augmentation

Patent number: 11947570

Abstract: A computer-implemented method for data augmentation is provided according an embodiment of the present disclosure. In the method, a first feature vector for input data may be obtained based on a first model. The input data may be clustered to a plurality of clusters. For each of the clusters, a second feature vector may be obtained based on the first model. Then, a similarity between the first feature vector and the second feature vector may be estimated for each of the clusters. At least one cluster of the plurality of clusters for which the similarity is lower than a threshold may be determined. Moreover, data augmentation may be performed to the at least one cluster.

Type: Grant

Filed: September 3, 2019

Date of Patent: April 2, 2024

Assignee: International Business Machines Corporation

Inventors: Qing Wang, Shi Lei Zhang, Yonghua Lin
Method and system for detecting harmful web resources

Patent number: 11936673

Abstract: A method and a system for detecting harmful content on a network are provided. The method comprises: receiving a URL; obtaining, from the URL, an HTML document associated therewith; converting the HTML document into a text; normalizing the text associated with the HTML document, thereby generating a plurality of tokens associated therewith; aggregating, each one of the plurality of tokens into a token vector associated with the HTML document; and applying, one or more classifiers to the token vector associated with the HTML document to determine a likelihood parameter indicative of the URL being associated with the harmful content; in response to the likelihood parameter being equal to or greater than a predetermined likelihood parameter threshold: identifying, the URL as being associated with the harmful content; and storing, the URL in a database of harmful URLs.

Type: Grant

Filed: December 10, 2020

Date of Patent: March 19, 2024

Assignee: GROUP IB, LTD

Inventor: Nikolay Prudkovskiy
Partial execution of translation in browser

Patent number: 11900073

Abstract: A computer-implemented method is programmed to support efficient and rapid generation of machine translation suggestions on client devices. Network latency is substantially reduced or eliminated by separating certain aspects of the translation workload across multiple classes of tasks, including final neural network output, between a client device and server device. The client device and server device may be connected such that a decoder portion of a machine translation system may be downloaded onto the client device, along with an initial translation suggestion and encoder outputs associated with a document, which document is in a source language to be translated into a target language. The initial translation suggestion may be replaced by an updated machine translation suggestion as a user inputs text in the target language called a prefix. This updated machine translation is generated on the client-side decoder using the previously-downloaded encoder outputs as input and the prefix as constraint.

Type: Grant

Filed: September 7, 2021

Date of Patent: February 13, 2024

Assignee: Lilt, Inc.

Inventors: Geza Kovacs, John DeNero
Training a machine learning system for keyword prediction with neural likelihood

Patent number: 11893348

Abstract: Computer implemented methods and systems are provided for generating diverse key phrases while maintaining competitive output quality. A system for training a sequence to sequence (S2S) machine learning model is proposed where neural unlikelihood objective approaches are used at (1) a target token level to discourage the generation of repeating tokens, and (2) a copy token level to avoid copying repetitive tokens from the source text. K-step ahead token prediction approaches are also proposed as an additional mechanism to augment the approach to further enhance the overall diversity of key phrase outputs.

Type: Grant

Filed: June 30, 2021

Date of Patent: February 6, 2024

Assignee: ROYAL BANK OF CANADA

Inventors: Hareesh Pallikara Bahuleyan, Layla El Asri
Classification code parser for identifying a classification code to a text

Patent number: 11886819

Abstract: A classification code parser and method can include: reading a classification code having a description; reading a required keyword, and a total number of keywords associated with the classification code; reading text of a note; tokenizing the text of the note to create a note token stream, the note token stream having a note token and a position of the note token within the note token stream; creating a keyword map including a total number of matched keywords; determining a match ratio from the total number of the matched keywords and the total number of the keywords; determining a proximity factor based on a shortest span of tokens within the note token stream containing all the matched keywords; and determining a strength of a match between the classification code and the note based on the match ratio being multiplied by the proximity factor.

Type: Grant

Filed: February 8, 2023

Date of Patent: January 30, 2024

Assignee: IQVIA Inc.

Inventors: Brian Berns, Kirk Junker
Bot dialog manager

Patent number: 11886816

Abstract: A method manages bot dialogue. A user input is converted to a phrase vector. A set of identified tokens are identified by a token identification engine from the phrase vector. An unsupervised token is selected from the set of identified tokens. A supervised token is selected from the set of identified tokens. A voted token selected from the unsupervised token and the supervised token. A next token is identified based on a set of recent tokens that includes the voted token. The next token is presented as one of a voice communication and an email communication.

Type: Grant

Filed: February 22, 2021

Date of Patent: January 30, 2024

Assignee: Prosper Funding LLC

Inventor: Paul Golding
Document data management via graph cliques for layout understanding

Patent number: 11880403

Abstract: One example method includes, for each document in a group of annotated documents, extracting a set of words from the annotated document, and each of the words is positioned in a respective field of the annotated document. The method further includes using an aggregation function to determine, for one of the fields, a similarity of each one of the annotated documents to all of the other annotated documents, creating a document layout graph with nodes that each correspond to a respective annotated document, and each node is connected to all other nodes for which a similarity threshold for the one field has been met, and running an algorithm on the document layout graph to identify a clique of the annotated documents, and each annotated document in the clique has a similar layout to respective layouts of the other annotated documents in the clique.

Type: Grant

Filed: October 8, 2021

Date of Patent: January 23, 2024

Assignee: EMC IP HOLDING COMPANY LLC

Inventors: Paulo Abelha Ferreira, Pablo Nascimento da Silva, Rômulo Teixeira de Abreu Pinho, Vinicius Michel Gottin
Detecting hypocrisy in text

Patent number: 11880652

Abstract: Techniques are disclosed for identifying hypocrisy in text. A computer system creates, from fragments of text, a syntactic tree that represents syntactic relationships between words in the fragments. The system identifies, in the syntactic tree, a first entity and a second entity. The system further determines that the first entity is opposite to the second entity. The system further determines a first sentiment score for a first fragment comprising the first entity and a second sentiment score for a second fragment comprising the second entity. The system, responsive to determining that the first sentiment score and the second sentiment score indicate opposite emotions, identifies the text as comprising hypocrisy and providing the text to an external device.

Type: Grant

Filed: January 6, 2023

Date of Patent: January 23, 2024

Assignee: Oracle International Corporation

Inventor: Boris Galitsky
Creation of indexes for information retrieval

Patent number: 11874860

Abstract: The present invention may be a system for creating indexes for information retrieval comprises a processor and a memory. The memory has program instructions embodied therewith. The program instructions are executable by the processor to cause the system to read a document having hinting information into a memory, where the hinting information is associated with each unique expression in an original document. The program instructions are further executable to create the indexes from the document, where a first analysis method for generating a contiguous sequence of items from a text in the document is used for creating the indexes for each sequence in the unique expression with which the hinting information is associated and a second analysis method for dividing the text into meaningful units is used for creating the indexes for each word in the text other than the unique expression.

Type: Grant

Filed: August 29, 2019

Date of Patent: January 16, 2024

Assignee: International Business Machines Corporation

Inventors: Hidekazu Fujiwara, Yoko Nameki, Soh Ohta
Identification of content gaps based on relative user-selection rates between multiple discrete content sources

Patent number: 11868341

Abstract: Identification of content gaps based on relative user-selection rates between multiple discrete content sources. A system analyzes search log activity to determine whether users that are conducting particular types of search activities are ultimately selecting and relying upon content resources from a predefined content source of interest or, alternatively, whether such users are unsatisfied with the predefined content source of interest and are instead relying upon other third-party content sources. This particular type of analysis provides valuable insights into whether content gaps exist within the predefined content source of interest.

Type: Grant

Filed: December 15, 2020

Date of Patent: January 9, 2024

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Junia Anna George, Chetan Bansal, Nikitha Rao, Casey Jo Gossard, Dung Nguyen, David Boyd Ludwig, IV, Curtis Dean Anderson
System and process for concept tagging and content retrieval

Patent number: 11853287

Abstract: A system and process for tagging electronic documents or other electronic content with concepts mentioned, contained, or otherwise described in that content. Once tagged, the content may be searchable, indexable, and retrievable in order to provide that content to an end user or another recipient. The system may be configured to handle a considerable number of asset files and a large number of users, workflows, and access applications simultaneously. The system may auto-tag the content and also may include a user interface for confirming and updating those tags and for manually creating new or additional tags. Content may include documents such as medical documents relating to procedures, diagnoses, medications or other domains. Alternatively, the content may include information about various care providers, in order to allow a user to locate a physician meeting one or more desired criteria.

Type: Grant

Filed: August 15, 2016

Date of Patent: December 26, 2023

Assignee: Intelligent Medical Objects, Inc.

Inventors: Regis J P Charlot, Frank Naeymi-Rad, Alina E. Oganesova, Andre L. Young, Jr., Andrei Naeymi-Rad, Aziz M. Bodal, David O. Haines, Jose A. Maldonado, Masayo Kobashi, Stephanie J. Schaefer
Machine translation system for entertainment and media

Patent number: 11847425

Abstract: A process receives, with a processor, audio corresponding to media content. Further, the process converts, with the processor, the audio to text. In addition, the process concatenates, with the processor, the text with one or more time codes. The process also parses, with the processor, the concatenated text into one or more text chunks according to one or more subtitle parameters. Further, the process automatically translates, with the processor, the parsed text from a first spoken language to a second spoken language. Moreover, the process determines, with the processor, if the language translation complies with the one or more subtitle parameters. Additionally, the process outputs, with the processor, the language translation to a display device for display of the one or more text chunks as one or more subtitles at one or more times corresponding to the one or more time codes.

Type: Grant

Filed: August 1, 2018

Date of Patent: December 19, 2023

Assignee: Disney Enterprises, Inc.

Inventor: Erika Doggett
Interpreting a text classifier

Patent number: 11842159

Abstract: Techniques for interpreting a text classifier model are described. An exemplary method includes receiving a request to interpret the text classifier; receiving input text to be used to interpret the text classifier; interpreting the text classifier using the input text and masked input text to determine two or more of a counterfactual score for the received input text or an aspect thereof, an importance score for the received input text or an aspect thereof, and a bias score for the received input text or an aspect thereof as requested by the request, and providing the determined one or more scores is provided to a requester.

Type: Grant

Filed: March 16, 2021

Date of Patent: December 12, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Sawan Kumar, Kalpit Dixit, Syed Kashif Hussain Shah
Natural language processing techniques using joint sentiment-topic modeling

Patent number: 11842162

Abstract: There is a need for more effective and efficient natural language processing (NLP) solutions. This need can be addressed by, for example, solutions for performing NLP-based document prioritization by utilizing joint sentiment-topic (JST) modeling.

Type: Grant

Filed: October 3, 2022

Date of Patent: December 12, 2023

Assignee: Optum Technology, Inc.

Inventors: Ayan Sengupta, Suman Roy, Tanmoy Chakraborty, Gaurav Ranjan, William Scott Paka

1 2 3 4 5 … next