Natural Language Patents (Class 704/9)
-
Patent number: 12210847
Abstract: A sentence generation device has: an estimation unit for receiving input of a first sentence and a focus point related to generation of a second sentence to be generated based on the first sentence, and estimating importance of each word constituting the first sentence using a pre-trained model; and a generation unit for generating the second sentence based on the importance, and thus makes it possible to evaluate importance of a constituent element of an input sentence in correspondence with a designated focus point.
Type: Grant
Filed: February 21, 2020
Date of Patent: January 28, 2025
Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Inventors: Itsumi Saito, Kyosuke Nishida, Atsushi Otsuka, Kosuke Nishida, Hisako Asano, Junji Tomita
-
Patent number: 12210575
Abstract: A document management system analyzes document clauses using document clause clusters. The document management system uses measures of similarity between document clauses from different documents to assign clauses to clause clusters. Clause clusters may be used to perform various analyses, such as to assign clauses a classification corresponding to a relevant clause cluster. The document management system provides analyses performed using document clause clusters for user review, such as to approve clause clusters, classify clause clusters, modify clause clusters, or some combination thereof.
Type: Grant
Filed: June 29, 2021
Date of Patent: January 28, 2025
Assignee: Docusign, Inc.
Inventors: Kenneth Patrick Slattery, James Kenneth Wagner, Jr.
-
Patent number: 12210829
Abstract: An entity recognition method, apparatus, electronic device, and computer-readable storage medium are provided. The method includes: determining at least one entity boundary word corresponding to a text sequence; determining at least one entity candidate region in the text sequence based on the at least one entity boundary word; and performing entity recognition on the text sequence and identifying at least one entity in the text sequence based on the at least one entity candidate region.
Type: Grant
Filed: April 7, 2022
Date of Patent: January 28, 2025
Assignees: SAMSUNG ELECTRONICS CO., LTD., BEIJING SAMSUNG TELECOM R&D CENTER
Inventors: Huadong Wang, Ting Chen
-
Patent number: 12204848
Abstract: Systems, methods, and devices including smart interfaces with facilitated input and mistake recovery are described. For example, a smart interface system can identify one or more portions of user input as alterable decisions, and, for each of the one or more alterable decisions, store, in a memory, information about one or more alternative options for the alterable decision. The system can also identify one of the alterable decisions as the currently alterable decision, and upon receiving an input indicative of an actuation of the alteration key, alter the currently alterable decision to another of the one or more alternative options based on the stored information.
Type: Grant
Filed: December 20, 2023
Date of Patent: January 21, 2025
Assignee: I.Q. JOE, LLC
Inventor: Jeffrey James Hatch
-
Patent number: 12204590
Abstract: An information processing method includes receiving first request information entered by a user, determining a first task engine for the first request information, where a first slot is set in the first task engine, extracting key information from the first request information based on the first slot, and if the key information fails to be extracted from the first request information based on the first slot, or if the key information is extracted from the first request information based on the first slot, but the extracted key information does not meet a condition, obtaining target key information from a shared parameter list of the user.
Type: Grant
Filed: May 7, 2020
Date of Patent: January 21, 2025
Assignee: HUAWEI TECHNOLOGIES CO., LTD.
Inventors: Zhefeng Yan, Lifeng Shang, Tao Cai, Li Qian
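
The fallback logic described in the abstract of patent 12204590 can be illustrated with a small sketch. This is not the patented implementation: the regular-expression extractor, the `is_valid` condition, the slot name `city`, and the dictionary standing in for the shared parameter list are all assumptions made for illustration.

```python
import re

def extract_slot(request, pattern):
    """Try to pull a slot value out of the request text (hypothetical extractor)."""
    match = re.search(pattern, request)
    return match.group(1) if match else None

def resolve_slot(request, slot_name, pattern, is_valid, shared_params):
    """Return the extracted value, or fall back to the user's shared parameters.

    The fallback is taken when extraction fails or when the extracted value
    does not meet the (assumed) validity condition.
    """
    value = extract_slot(request, pattern)
    if value is None or not is_valid(value):
        return shared_params.get(slot_name)
    return value

# Hypothetical data: a destination slot and a per-user shared parameter list.
KNOWN_CITIES = {"Beijing", "Shanghai"}
shared = {"city": "Shanghai"}
pattern = r"to ([A-Z]\w+)"
is_known_city = lambda v: v in KNOWN_CITIES

direct = resolve_slot("book a flight to Beijing", "city", pattern, is_known_city, shared)
missing = resolve_slot("book a flight", "city", pattern, is_known_city, shared)
invalid = resolve_slot("book a flight to Mars", "city", pattern, is_known_city, shared)
```

In this sketch `direct` comes from the request itself, while `missing` and `invalid` both come from the shared parameters, mirroring the two fallback conditions named in the abstract.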
-
Patent number: 12204867
Abstract: Provided is a computer-implemented method, system, and computer program product for process mining asynchronous support conversations using attributed directly follows graphing. A processor may collect a plurality of conversation threads from an asynchronous data stream. The processor may label each utterance of a plurality of utterances from the plurality of conversation threads with an event label. The processor may analyze the event label for each utterance of the plurality of utterances. The processor may generate, based on the analyzing of the event label for each utterance, an attributed directly follows graph (DFG).
Type: Grant
Filed: March 22, 2022
Date of Patent: January 21, 2025
Assignee: International Business Machines Corporation
Inventors: Sampath Dechu, Monika Gupta, Prerna Agarwal, Renuka Sindhgatta Rajan, Naveen Eravimangalath Purushothaman
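
As a rough illustration of what a directly follows graph is (for patent 12204867 above), the sketch below counts how often one event label is immediately followed by another within a thread. The event labels are invented, and edge frequency is the only attribute kept here; the abstract does not say which attributes the patented method attaches.

```python
from collections import Counter

def build_dfg(threads):
    """Build a frequency-annotated directly follows graph.

    `threads` is a list of conversation threads, each an ordered list of
    event labels (one label per utterance). The result maps each
    (source_label, target_label) edge to the number of times the target
    directly follows the source.
    """
    edges = Counter()
    for labels in threads:
        for src, dst in zip(labels, labels[1:]):
            edges[(src, dst)] += 1
    return edges

# Hypothetical, already-labeled support threads.
threads = [
    ["question", "clarification", "answer"],
    ["question", "answer", "thanks"],
    ["question", "answer"],
]
dfg = build_dfg(threads)
```

Here the edge `("question", "answer")` has frequency 2, because an answer directly follows a question in two of the three threads.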
-
Patent number: 12204859
Abstract: A text processing method, a model training method, and an apparatus related to the field of artificial intelligence are provided. The method includes: obtaining target knowledge data; processing the target knowledge data to obtain a target knowledge vector; processing to-be-processed text to obtain a target text vector; fusing the target text vector and the target knowledge vector based on a target fusion model, to obtain a fused target text vector and a fused target knowledge vector; and processing the fused target text vector and/or the fused target knowledge vector based on a target processing model, to obtain a processing result corresponding to a target task. The foregoing technical solution can improve accuracy of a result of processing a target task by the target processing model.
Type: Grant
Filed: November 15, 2021
Date of Patent: January 21, 2025
Assignees: Huawei Technologies Co., Ltd., TSINGHUA UNIVERSITY
Inventors: Yasheng Wang, Xin Jiang, Xiao Chen, Qun Liu, Zhengyan Zhang, Fanchao Qi, Zhiyuan Liu
-
Patent number: 12197866
Abstract: A method executed by a computing device includes determining a set of identigens for each phrase word of a phrase to produce sets of identigens. A set of identigens of the sets of identigens represents one or more different meanings of a phrase word of the phrase. The method further includes obtaining sensor information for one or more phrase words of the phrase. The method further includes selecting an identigen of a first set of identigens based on the sensor information to produce a first identigen selection for the first set of identigens having a selected meaning of one or more different meanings of the first phrase word. The method further includes interpreting remaining sets of identigens of the sets of identigens to produce an entigen group so that the entigen group represents a most likely meaning interpretation of the phrase.
Type: Grant
Filed: October 27, 2023
Date of Patent: January 14, 2025
Assignee: entigenlogic LLC
Inventors: Frank John Williams, Stephen Emerson Sundberg, Ameeta Vasant Reed, Dennis Arlen Roberson, Thomas James MacTavish, Karl Olaf Knutson, Jessy Thomas, Niklas Josiah MacTavish, David Michael Corns, II, Andrew Chu, Kyle Edward Alberth, Ali Fattahian, Zachary John McCord, Ahmad Abdelqader Abunaser, Gary W. Grube
-
Patent number: 12197504
Abstract: A method, computer system, and a computer program for conducting a conversational search. In one embodiment, the method includes monitoring a dialogue involving at least one user and capturing user utterances provided during the dialogue. These user utterances are then analyzed and classified according to the context of the dialogue. The dialogue is intervened upon the determination that a user needs additional information and/or upon execution of an action on behalf of the user and based on the plurality of user utterances and context. The required information may be provided back to the user using a Documentation Recommendation Module. The Documentation Recommendation Module determines a valid resource recommendation as determined by a combination of the context and a resource that includes additional information.
Type: Grant
Filed: April 22, 2022
Date of Patent: January 14, 2025
Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Jatin Ganhotra, Nathaniel Mills, Chulaka Gunasekara, Kshitij Fadnis, Sachindra Joshi, Luis A. Lastras-Montano
-
Patent number: 12199834
Abstract: Described herein are systems and methods for generating a computing network from natural language descriptions of the computing network. In one or more examples, the systems and methods described below can be used to harvest publicly available (or even privately available) natural language descriptions of a computing network, and convert those descriptions into an operational replica of the described computing network. By harvesting both publicly available data, as well as other resources, the systems and methods described herein can allow for an analyst tasked with analyzing a given computing network to generate a fully operational replica of the computing network without having to have direct access to the network. In one or more examples, the analyst can recreate a fully-functional replica of the network to be analyzed based on partial descriptions of the computing network, thereby greatly improving the level of analysis capable of being performed.
Type: Grant
Filed: October 19, 2022
Date of Patent: January 14, 2025
Assignee: The MITRE Corporation
Inventors: Michael Bastian Kouremetis, Christopher David Jellen, William Bryan Booth, Andrew Fred Applebaum
-
Patent number: 12197846
Abstract: Provided is a method, a computer program product, and a system for associating mathematical functions to numerical text in a natural language sample. The method includes inputting a natural language sample from a text dataset and identifying a numerical text within the natural language sample. The method further includes displaying a mathematical function corresponding to the numerical text to be selected. The mathematical function can be selected via graphical user interface displayed on a computing device. The method also includes receiving and inserting the mathematical function as a feature into a feature vector of the natural language sample and selecting an output label for the natural language sample. The output label relates to the mathematical function selected for the numerical text. The method further includes exporting the natural language sample into a labeled dataset which can be used to train a machine learning model.
Type: Grant
Filed: November 19, 2019
Date of Patent: January 14, 2025
Assignee: International Business Machines Corporation
Inventors: Lalit Agarwalla, Gandhi Sivakumar, Maharaj Mukherjee, Rashida A. Hodge
-
Patent number: 12197499
Abstract: When creators generate media content in accordance with media programs, the media content is evaluated to identify any number of violations of policies, and to generate scores representing a level of risk that the creators will violate one or more of the policies in the future. Subsequently, media content of the creators is transmitted to listeners in accordance with the scores. In addition to audio data of creators or transcripts of the audio data, scores may be generated based on images associated with the creators, titles or summaries of media programs, or reports received from listeners. Scores calculated for creators may increase or decrease over time, depending on numbers of violations of policies by such creators, or other factors, and be utilized with a goal of protecting listeners against exposure to harmful content.
Type: Grant
Filed: May 23, 2022
Date of Patent: January 14, 2025
Assignee: Amazon Technologies, Inc.
Inventors: Rakshit Karnawat, Madhuri R. Marri, Mikesh Narendra Vora
-
Patent number: 12197503
Abstract: Methods, systems, and computer-readable media for interactive command generation for natural language input are disclosed. A natural language dialog system receives a natural language input for a dialog with a user. The system determines a state representation of the dialog based at least in part on the natural language input. The state representation indicates an operation offered by a service. The system generates a natural language output based at least in part on the natural language input. The natural language output solicits an additional natural language input for the dialog. The system determines an updated state representation of the dialog based at least in part on the additional natural language input and the state representation. The updated state representation indicates parameter value(s) for the operation. Based at least in part on the updated state representation, the system generates a command invoking the operation with the parameter value(s).
Type: Grant
Filed: September 30, 2020
Date of Patent: January 14, 2025
Inventors: Rashmi Gangadharaiah, Jonathan James Pezzino, James W. Horsley, Mira E Hall
-
Patent number: 12197861
Abstract: A system, computer program product, and method are provided for jointly learning dictionary based rules and dictionary candidates. Natural language text is received and parsed into subsets, with the subset being subjected to natural language processing to identify one or more verbs within the subset. The identified verbs are evaluated with respect to a dictionary and one or more rules. The evaluation is directed at each predicate in the rules with respect to the identified verbs. A neural network is leveraged to jointly induce modification of the rules and one or more dictionaries responsive to the evaluation.
Type: Grant
Filed: February 19, 2021
Date of Patent: January 14, 2025
Assignee: International Business Machines Corporation
Inventors: Prithviraj Sen, Marina Danilevsky Hailpern, Yunyao Li
-
Patent number: 12190055
Abstract: A computer system includes memory configured to store a document database and a machine learning model. The document database includes multiple historical documents each having at least one version labeled as compliant and at least one version labeled as non-compliant. The system includes a creator user interface, a compliance user interface, an automated distribution module, and a model building module configured to train the machine learning model to classify a document according to a compliance score indicating a likelihood of document compliance with one or more compliance criteria. The system also includes an orchestrator module configured to receive the compliance score for the submitted document from the machine learning model, determine whether the compliance score is greater than or equal to a compliance score threshold, and supply the submitted document to the compliance user interface for transmission to the compliance team device when the compliance score is above a threshold.
Type: Grant
Filed: September 28, 2020
Date of Patent: January 7, 2025
Assignee: Charles Schwab & Co., Inc
Inventors: Sean Ming-Yin Law, Logan Sommers Ahlstrom
-
Patent number: 12190046
Abstract: Text editing apparatus comprises a database memory configured to store a text database, in which the text database is configured to store a plurality of text portions and a set of links between text portions, the set of links defining a document as a linked list of the text portions; and a data processor configured, in response to user input, to perform an editing operation to edit the text database so as to define an edited document by changing at least one of: (i) text within a text portion and (ii) the set of links between text portions.
Type: Grant
Filed: November 26, 2021
Date of Patent: January 7, 2025
Assignee: SONY GROUP CORPORATION
Inventors: Vittorio Loreto, Pietro Gravino
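
The data structure in the abstract of patent 12190046, a document defined as a linked list of text portions, can be sketched as follows. The class name `TextDatabase`, its fields, and the two edit operations are hypothetical names chosen for illustration, not the apparatus the patent claims.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional

@dataclass
class TextDatabase:
    """Text portions plus next-links; following the links yields the document."""
    portions: Dict[str, str] = field(default_factory=dict)         # portion id -> text
    links: Dict[str, Optional[str]] = field(default_factory=dict)  # portion id -> next id
    head: Optional[str] = None                                     # first portion id

    def render(self) -> List[str]:
        """Walk the links from the head and collect the portion texts in order."""
        out, cur = [], self.head
        while cur is not None:
            out.append(self.portions[cur])
            cur = self.links.get(cur)
        return out

    def edit_text(self, portion_id: str, text: str) -> None:
        """Edit type (i): change the text within a portion."""
        self.portions[portion_id] = text

    def insert_after(self, prev_id: str, new_id: str, text: str) -> None:
        """Edit type (ii): change the links so a new portion follows prev_id."""
        self.portions[new_id] = text
        self.links[new_id] = self.links.get(prev_id)
        self.links[prev_id] = new_id

db = TextDatabase(
    portions={"a": "Hello", "b": "world"},
    links={"a": "b", "b": None},
    head="a",
)
before = db.render()
db.edit_text("b", "there")
db.insert_after("a", "c", "brave")
after = db.render()
```

Keeping the links separate from the text means reordering or inserting portions only touches the link table, while wording changes only touch a single portion.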
-
Patent number: 12190072
Abstract: In some embodiments, text for user consumption may be generated based on an intended user action category and a user profile. In some embodiments, an action category, a plurality of text seeds, and a profile comprising feature values may be obtained. Context values may be generated based on the feature values, and text generation models may be obtained based on the text seeds. In some embodiments, messages may be generated using the text generation models based on the action category and the context values. Weights associated with the messages may be determined, and a first text message of the messages may be sent to an address associated with the profile based on the weights. Based on a reaction value obtained in response to the first message, a first expected allocation value may be updated based on the reaction value.
Type: Grant
Filed: June 22, 2023
Date of Patent: January 7, 2025
Assignee: Capital One Services, LLC
Inventors: Huong Nguyen, Isha Chaturvedi, Kalanand Mishra
-
Patent number: 12190064
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for announcing and detecting automated conversation are disclosed. One of the methods includes initiating, over a natural language communication channel, a conversation with a communication participant using a natural language communication method that includes a dialogue of natural language communications. The communication participant is determined to be automated using a pre-defined adaptive interactive protocol that specifies natural language linguistic transformations defined in a sequence. The conversation can be transitioned to a communication method that is different from the natural language communication method in response to determining that the communication participant is automated.
Type: Grant
Filed: June 30, 2023
Date of Patent: January 7, 2025
Assignee: GOOGLE LLC
Inventors: Sebastian Millius, Sandro Feuz
-
Patent number: 12182498
Abstract: Portions of text data generated from inverse text normalization may be redacted. Text data for redaction may be obtained. One or more inverse text normalization models may be applied to the text data to generate normalized text data. A machine learning model, trained to recognize text for redaction, may be applied to identify portions of the normalized text data for redaction. The identified portions may be redacted and the redacted normalized text provided to a destination.
Type: Grant
Filed: June 30, 2022
Date of Patent: December 31, 2024
Assignee: Amazon Technologies, Inc.
Inventors: Monica Lakshmi Sunkara, Deepthi Devaiah Devanira, Chaitanya Shivade, Sravan Babu Bodapati, Katrin Kirchhoff, Srikanth Ronanki
-
Patent number: 12182179
Abstract: An apparatus for generating obfuscated data within a computing environment, comprising a processor and a memory containing instructions configuring the processor to access a database containing a plurality of private data elements belonging to at least a private record, generate a set of obfuscated data elements, representative of the at least a private record, as a function of the plurality of private data elements using a generative model, determine a first distance measure between at least an obfuscated data element within the set of obfuscated data elements and at least a private data element of the plurality of private data elements within the database, and verify the first distance measure is within a distance range, wherein a minimum threshold of the distance range is determined as a function of a deidentification parameter and a maximum threshold of the distance range is determined as a function of an obfuscation parameter.
Type: Grant
Filed: April 8, 2024
Date of Patent: December 31, 2024
Assignee: nference, Inc.
Inventors: Murali Aravamudan, Ajit Rajasekharan
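
The verification step in the abstract of patent 12182179 amounts to a range check on a distance. The sketch below assumes numeric feature vectors, Euclidean distance, and thresholds passed in directly; the abstract specifies none of these, only that the minimum derives from a deidentification parameter and the maximum from an obfuscation parameter.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two equal-length numeric vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def within_distance_range(obfuscated, private, min_threshold, max_threshold):
    """Check that the obfuscated element is neither too close nor too far.

    Too close (below min_threshold) suggests the private element could be
    re-identified; too far (above max_threshold) suggests the obfuscated
    element no longer represents the record. How the thresholds are derived
    from the deidentification and obfuscation parameters is assumed here.
    """
    distance = euclidean(obfuscated, private)
    return min_threshold <= distance <= max_threshold

private_element = [0, 0]
obfuscated_element = [3, 4]   # distance 5 from the private element

ok = within_distance_range(obfuscated_element, private_element, 1, 10)
too_close = within_distance_range(obfuscated_element, private_element, 6, 10)
too_far = within_distance_range(obfuscated_element, private_element, 1, 4)
```

Only the first check passes: the second fails because the distance is under the minimum, and the third because it exceeds the maximum.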
-
Patent number: 12182526
Abstract: Implementations relate to effectively localizing system responses, that include dynamic information, to target language(s), such that the system responses are grammatical and/or natural in the target language(s). Some of those implementations relate to various techniques for resource efficient generation of templates for a target language. Some versions of those implementations relate to resource efficient generation of target language natural language generation (NLG) templates and, more particularly, to techniques that enable a human user to generate a target language NLG template more efficiently and/or with greater accuracy. The more efficient target language NLG template generation enables less utilization of various client device resources and/or can mitigate the risk of flawed NLG templates being provided for live use in one or more systems.
Type: Grant
Filed: May 12, 2021
Date of Patent: December 31, 2024
Assignee: GOOGLE LLC
Inventors: Katherine Vadella, Joshua Andrews, Max Copperman, Gabrielle Gayles, Shanjian Li, Jieyu Lu, Luchuan Xu
-
Patent number: 12182148
Abstract: A computing system includes a processor and memory storing instructions that, when executed by the processor, cause the processor to perform acts. The acts include receiving a query provided by a user. The acts additionally include determining that the query is related to a comparison between entities. Based upon the determining that the query is related to the comparison between entities, the computing system generates a prompt that is to be input to a generative language model, where the prompt includes: 1) an instruction for the generative language model to generate a table based upon the query; and 2) attribute values for entities identified by a search system based upon the query. The acts also include providing the prompt as input to the generative language model, where the generative language model generates a table based upon the prompt, and further where the table includes the attribute values for the entities.
Type: Grant
Filed: May 30, 2023
Date of Patent: December 31, 2024
Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
Inventors: Arun Kumar Sacheti, Parthasarathy Govindarajen, Marcelo Medeiros De Barros, Yucan Zhang, Sharada Chandrasekaran, Sumit Chatterjee, Aditya Khandelwal, Achraf AbdelMoneim Chalabi
-
Patent number: 12182749
Abstract: An apparatus and method for classifying an entity datum into instruction sets is provided. The apparatus includes a processor that may receive the entity data, which includes data describing entity operations and an assessment of operations of each entity. The processor may receive instruction sets including an impact datum describing an effect of a respective instruction set on the entity data and classify elements of data describing assessments into instruction sets. The processor uses a machine learning model including a classifier to correlate data describing the assessment with data describing instruction sets and to generate a user interface. The user interface configures a display device to display a sequence based on classification of elements of data describing assessments of respective instruction sets into at least some instruction sets. The sequence may change a color, or an order based on the impact datum relative to the entity data.
Type: Grant
Filed: April 30, 2023
Date of Patent: December 31, 2024
Assignee: The Strategic Coach Inc.
Inventors: Barbara Sue Smith, Daniel J. Sullivan
-
Patent number: 12182098
Abstract: Methods and systems for curating data by a data manager are disclosed. Data may be curated from various data sources before being provided to downstream consumers that may rely on the trustworthiness of the curated data in order to provide desired computer-implemented services. During the data curation process, data curation resources are used to improve the trustworthiness and/or value of the collected data. However, data curation resources (e.g., data curators, computing resources) may be limited and/or insufficient to perform the data curation process as desired, which may result in unusable and/or uncurated (e.g., untrustworthy) data. Thus, the data may be screened for ambiguous values. A potential replacement value for each ambiguous value may be provided to the data source and the data source may indicate whether the potential replacement value should be used in the data pipeline as a final replacement value for the ambiguous value.
Type: Grant
Filed: June 29, 2023
Date of Patent: December 31, 2024
Assignee: Dell Products L.P.
Inventors: Ofir Ezrielev, Hanna Yehuda, Kristen Jeanne Walsh
-
Patent number: 12183350
Abstract: Techniques for detecting a fraudulent attempt by an adversarial user to voice verify as a user are presented. An authenticator component can determine characteristics of voice information received in connection with a user account based on analysis of the voice information. In response to determining the characteristics sufficiently match characteristics of a voice print associated with the user account, authenticator component can determine a similarity score based on comparing the characteristics of the voice information and other characteristics of a set of previously stored voice prints associated with the user account. Authenticator component can determine whether the similarity score is higher than a threshold similarity score to indicate whether the voice information is a replay of a recording or a deep fake emulation of the voice of the user. Above the threshold can indicate the voice information is fraudulent, and below the threshold can indicate the voice information is valid.
Type: Grant
Filed: April 12, 2021
Date of Patent: December 31, 2024
Assignee: PayPal, Inc.
Inventors: Karl Anton Hennig, Ajay Aswal, Bisrat Zerihun
-
Patent number: 12182527
Abstract: A translating method using visually represented elements, and device therefor is provided.
Type: Grant
Filed: September 24, 2021
Date of Patent: December 31, 2024
Inventor: Hyun Jin Kim
-
Patent number: 12183347
Abstract: Processing stacked data structures is provided. A system receives an input audio signal detected by a sensor of a local computing device, identifies an acoustic signature, and identifies an account corresponding to the signature. The system establishes a session and a profile stack data structure including a first profile layer having policies configured by a third-party device. The system pushes, to the profile stack data structure, a second profile layer retrieved from the account. The system parses the input audio signal to identify a request and a trigger keyword. The system generates, based on the trigger keyword and the second profile layer, a first action data structure compatible with the first profile layer. The system provides the first action data structure for execution. The system disassembles the profile stack data structure to remove the first profile layer or the second profile layer from the profile stack data structure.
Type: Grant
Filed: January 13, 2023
Date of Patent: December 31, 2024
Assignee: GOOGLE LLC
Inventors: Anshul Kothari, Gaurav Bhaya, Tarun Jain
-
Patent number: 12183333
Abstract: A speech recognition system comprises: an input, for receiving an input signal from at least one microphone; a first buffer, for storing the input signal; a noise reduction block, for receiving the input signal and generating a noise reduced input signal; a speech recognition engine, for receiving either the input signal output from the first buffer or the noise reduced input signal from the noise reduction block; and a selection circuit for directing either the input signal output from the first buffer or the noise reduced input signal from the noise reduction block to the speech recognition engine.
Type: Grant
Filed: December 13, 2021
Date of Patent: December 31, 2024
Assignee: Cirrus Logic Inc.
Inventors: John Paul Lesso, Robert James Hatfield
-
Patent number: 12175460
Abstract: The present invention discloses a transfer information determination method and device through deep learning, by which a determination is made on whether transfer information is included in a message, and the transfer information is classified. A method for determining transfer information within a message through natural language processing based on deep learning according to the present invention comprises the steps of: pre-processing an acquired message in a user terminal according to a reference word; extracting an embedding vector corresponding to each segmented text from the preprocessed message to determine whether transfer information is included, through weighted calculation of the extracted embedding vector; and classifying the transfer information within the pre-processed message determined to include the transfer information.
Type: Grant
Filed: July 11, 2022
Date of Patent: December 24, 2024
Assignee: KakaoBank Corp.
Inventors: Sang Hyun Jeon, Dong Hwa Shin, Jae Eui Sohn
-
Patent number: 12175187
Abstract: Methods for correcting raw text generated by deep learning techniques are disclosed. The methods may be performed by systems/computing devices described herein. Raw text previously generated by the deep learning techniques may be obtained. A search query can be generated from a raw text sentence of the raw text. The search query is executed against a knowledge base or a corpus of text to obtain a set of search results, the set of search results comprising a plurality of candidate true sentences that can potentially be utilized to correct one or more entities or phrases of the raw text sentence. A candidate true sentence is selected from the plurality and used to correct the raw text sentence. For example, at least one entity or phrase of the candidate true sentence can be used to replace a corresponding entity or phrase of the raw text sentence.
Type: Grant
Filed: February 2, 2022
Date of Patent: December 24, 2024
Assignee: Oracle International Corporation
Inventor: Boris Galitsky
-
Patent number: 12175196
Abstract: A natural language understanding (NLU) framework includes a modeling and optimization system that enables enhanced understanding and explainability to the operation of the NLU framework. The NLU framework includes a configuration vector storing settings of various components that may be applied during NLU inference of an utterance, such as which components should be activated or deactivated, as well as which numerical values (e.g., threshold values, coefficients, weight values) that are used by these components during operation. By using this configuration vector to systematically disable and adjust numerical parameters of the components of the NLU framework, and then determining the performance of the NLU framework in these configurations, the modeling and optimization system determines relationships between, as well as the relative importance of, the components of the NLU framework.
Type: Grant
Filed: January 19, 2022
Date of Patent: December 24, 2024
Assignee: ServiceNow, Inc.
Inventors: Roshnee Sharma, Edwin Sapugay, Sathwik Tejaswi Madhusudhan, Anil Kumar Madamala, Hari Subramani, Jonggun Park, Srinivas Satyasai Sunkara
-
Patent number: 12176083
Abstract: A method and a device are provided for generating a summary of a hospital stay by a patient, including maintaining a database of electronic medical records (EMRs) where the EMRs include clinical notes pertaining to a patient during a time interval, identifying a set of significant physician notes for the time interval, generating a candidate set of summaries for each of the significant physician notes, for each significant physician note, analyzing the factuality of each of the candidate summaries and selecting the most factual summary, and generating a daily section for inclusion in a hospital course section of a discharge note that includes the selected factual summary for each of the significant physician notes.
Type: Grant
Filed: August 11, 2022
Date of Patent: December 24, 2024
Assignee: Abstractive Health, Inc.
Inventors: Vincent Christopher Hartman, Sanika Bapat
-
Patent number: 12174874
Abstract: Systems, apparatuses, methods, and computer program products are disclosed for automated prototyping of a topic model. An example method includes a data manipulation engine ingesting and pre-processing source data from a set of data sources, a feature extraction engine that thereafter transforms the pre-processed data into a set of numeric representations of the pre-processed data, and an autonomous model generator that automatically generates a trained topic model using the set of numeric representations. Embodiments further enable visualization of topic model output, which permits a user to easily consume and utilize information from a topic model for any number of purposes.
Type: Grant
Filed: January 20, 2021
Date of Patent: December 24, 2024
Assignee: Wells Fargo Bank, N.A.
Inventors: Brian Karp, William Thompson, Antonio Iniguez, James Ma, Kelley Impoco, Richard Penfil, II
-
Patent number: 12174844Abstract: Provided are systems, methods, and computer program products for searching a plurality of documents based on a text string. The system includes at least one processor programmed or configured to identify a plurality of documents including a plurality of document types, each document of the plurality of documents including a document type, receive a text string based on user input, generate, with a machine-learning model, an ordered list of document types based on the text string, search the plurality of documents for the text string to identify a subset of documents based on similarity between the text string and each document of the subset of documents, rank the subset of documents based at least partially on the similarity, a document type of each document of the subset of documents, and the ordered list of document types, and generate a graphical user interface based on the ranked list of documents.Type: GrantFiled: June 26, 2024Date of Patent: December 24, 2024Assignee: Clearbrief, Inc.Inventors: Jacqueline Grace Schafer, Jose Demetrio Saura
-
Patent number: 12169689Abstract: A crime type inference system based on text data, may include: a keywords dictionary construction unit configured to receive crime source data, and generate a crime type keywords dictionary by extracting crime keywords; a data set construction unit configured to generate a dataset for crime type learning by using the crime source data and the keywords dictionary; a crime type prediction model training unit configured to generate a crime type prediction model by using the dataset, and train the crime type prediction model; and a crime type inference unit configured to infer a crime type by using new crime data.Type: GrantFiled: May 24, 2022Date of Patent: December 17, 2024Assignee: Electronics and Telecommunications Research InstituteInventors: Myung Sun Baek, Seung Hee Kim, Young Soo Park, Won Joo Park, Sang Yun Lee, Yong Tae Lee
-
Patent number: 12170133Abstract: In one example, a method being performed by a computer system comprises: receiving an image file containing a pathology report; performing an image recognition operation on the image file to extract input text strings; detecting, using a natural language processing (NLP) model, entities from the input text strings, each entity including a label and a value; extracting, using the NLP model, the values of the entities from the input text strings; converting, based on a mapping table that maps entities and values to pre-determined terminologies, the values of at least some of the entities to the corresponding pre-determined terminologies; and generating a post-processed pathology report including the entities detected from the input text strings and the corresponding pre-determined terminologies.Type: GrantFiled: September 8, 2020Date of Patent: December 17, 2024Assignee: Roche Molecular Systems, Inc.Inventors: Vishakha Sharma, Yogesh Pandit, Ram Balasubramanian
-
Patent number: 12169850Abstract: A system and method for enhancing e-commerce product listings is disclosed, performed on a server. The method involves importing listing data through Application Programming Interface (API) connections and analyzing this data to calculate a multimodal vector embedding. A quality score is estimated based on the embedding and real-time market data metrics. The method generates content elements, including product images, textual descriptions, and infographics, by applying a controlled generation algorithm through a text-to-image diffusion model. This model integrates loss-guidance and attention injection mechanisms to produce a controlled layout of the product images, producing content that is visually appealing and market-relevant. The resulting content elements are stored in the server's data storage, ready for e-commerce display.Type: GrantFiled: March 22, 2024Date of Patent: December 17, 2024Assignee: ECOMTENT INC.Inventors: Timur Luguev, Zakaria Patel, Hantang Li, Max Sinclair
-
Patent number: 12164873Abstract: Aspects of the subject disclosure may include, for example, identifying an entity of a natural language question, locating a node of a knowledge graph corresponding to the entity, and generating a candidate answer set including a group of other entities located a predetermined proximity to the node. Contextual information for the group of other entities is determined from the knowledge graph, and the natural language question and contextual information are separately encoded to obtain separate encoded vectorial representations of the natural language question and members of the candidate answer set. The encoding uses pre-trained language model embeddings obtained via a bidirectional encoder representations from transformer encoding process. The encoded vectorial representations of the question under an influence of aspects of the contextual information are scored and a member of the candidate answer set selected according to the score to obtain an answer to the original question.Type: GrantFiled: March 10, 2021Date of Patent: December 10, 2024Assignee: AT&T Intellectual Property I, L.P.Inventors: Sai Sharath Japa, Roderic William Paulk, Joseph Samuel Miller, Lal Payyappilly Paul
-
Patent number: 12164870Abstract: The present disclosure provides a neural document embedding based ontology mapping. Conventional methods that map ontology concepts across domains/species extensively take help of bridging ontologies. Initially the system receives a Human Phenotype (HP) Identification number (ID) pertaining to a phenotype. A first HP ID vector is computed from the HP ID using a trained word2vec model. A second HP ID vector is computed from the HP ID using a trained Doc2vec model. An average HP ID vector is computed based on the first HP ID vector and the second HP ID vector. A plurality of cosine similarity scores are computed based on a comparison between the average HP ID vector and a plurality of average MP ID vectors. The plurality of MP IDs are sorted based on the plurality of cosine similarity scores. The plurality of MP IDs corresponding to the HP ID are selected based on a selection threshold.Type: GrantFiled: September 22, 2022Date of Patent: December 10, 2024Assignee: TATA CONSULTANCY SERVICES LIMITEDInventors: Sadhna Rana, Rajgopal Srinivasan, Swatantra Pradhan
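The vector steps in this abstract (average two vectors per ID, score by cosine similarity, sort, apply a threshold) can be illustrated with a minimal sketch. It is not the patented implementation: the toy vectors stand in for word2vec and Doc2vec outputs, the names `average`, `cosine`, and `rank_mp_ids` are hypothetical, and treating the selection threshold as a minimum cosine score is an assumption.

```python
import math


def average(u, v):
    """Element-wise mean of two equal-length vectors."""
    return [(a + b) / 2.0 for a, b in zip(u, v)]


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def rank_mp_ids(hp_avg, mp_avg_by_id, threshold):
    """Score every MP ID against the HP vector, sort descending, and keep
    those at or above the threshold (assumed to be a minimum score)."""
    scored = [(mp_id, cosine(hp_avg, vec)) for mp_id, vec in mp_avg_by_id.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return [(mp_id, score) for mp_id, score in scored if score >= threshold]


# Toy data: in practice these vectors would come from trained embedding models.
hp_avg = average([1.0, 0.0], [1.0, 0.0])
mp_avg_by_id = {"MP:A": [1.0, 0.0], "MP:B": [0.0, 1.0], "MP:C": [1.0, 1.0]}
matches = rank_mp_ids(hp_avg, mp_avg_by_id, threshold=0.5)
```

With these toy inputs, `MP:B` is orthogonal to the HP vector and falls below the threshold, so only `MP:A` and `MP:C` remain, in that order.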
-
Patent number: 12164860Abstract: In an embodiment, a programmed computer system implemented via client-server Software as a Service (SaaS) techniques provides an interactive user interface for identifying specific portions of a digital document susceptible for review and improvement. A server computer may receive a representation of a digital document, such as an email, comprising words arranged into sentences. An embodiment may tokenize a set of all sentences comprising the sequence of sentences into a document-specific vocabulary, then compute a corresponding first and second score for each sentence of the sequence of sentences. The first score may represent a calculated probability of semantic importance of the corresponding sentence to an overall meaning of the digital document. The second score may represent a calculated likelihood that the corresponding sentence will be read by a future reader of the digital document. An embodiment may identify key sentences using the first scores and second scores.Type: GrantFiled: August 24, 2023Date of Patent: December 10, 2024Assignee: Grammarly, Inc.Inventors: Roman Khlystik, Karun Singh, Dimitrios Alikaniotis, Jonathan Vandamme
-
Patent number: 12159135Abstract: In an example embodiment, machine learning techniques are utilized to create virtual tables that connect to actual tables in a user's own system. The virtual table predicts how the user's data can be used to populate fields in newer versions of software that the user already runs, even when those fields are not present in the version that the user already runs. These tables may then be used in a specialized tool, which displays in one area of the display a screen of the version of the software that the user is currently running (“the existing version”) and displays in another area of the display a screen of the version of the software that the user is comparing to the existing version. Both display areas display the same screen, as rendered by their respective different versions of the software, using the same underlying base data.Type: GrantFiled: November 21, 2022Date of Patent: December 3, 2024Assignee: SAP SEInventor: Ramalingam Tv
-
Patent number: 12159122Abstract: A method comprising a deep learning based translation system translating a source sentence from language A to a target sentence in language B, placing constraints on the target sentence in language B.Type: GrantFiled: August 11, 2020Date of Patent: December 3, 2024Assignee: SONY GROUP CORPORATIONInventors: Javier Alonso Garcia, Fabien Cardinaux, Thomas Kemp, Lukas Mauch, Stefan Uhlich, Stephen Tiedemann
-
Patent number: 12159115Abstract: Examples described herein generate training data for machine learning (ML) for natural language (NL) processing (such as semantic parsing for translating NL). A formula tree is generated based on sampling both a formula grammar and NL templates. Using the formula tree, an ML training data instance pair is generated comprising a formula example and an NL example. A context example may also be used during instantiation of the formula tree. An ML model is trained with training data including the ML training data instance pair, and ML output is generated from NL input. The ML output includes, for example, a machine-interpretable formula, a database querying language command, or a general programming language instruction. Some examples support context-free grammar, probabilistic context-free grammar, and/or non-context-free production rules.Type: GrantFiled: October 19, 2021Date of Patent: December 3, 2024Assignee: Microsoft Technology Licensing, LLC.Inventors: Zeqi Lin, Yu Hu, Haiyuan Cao, Yi Liu, Jian-Guang Lou, Kuralmani Elango, PalaniRaj Kaliyaperumal, Weizhu Chen, Kunal Mukerjee
-
Patent number: 12159252Abstract: A system and method for document summarization generates summarized articles and risk factor categorizations for display at a graphical user interface (GUI) dashboard. A transaction monitoring system includes an adverse media dashboard pipeline for processing risk factor alerts and generating document summarizations for display at a user device. Document summarization extracts several sentences from a source text and stacks the sentences to create a summary. The method creates a vector representation of each sentence using a machine learning word embedding model and generates a sentence similarity matrix by computing cosine similarity values. A sentence graph creation algorithm creates a graph corresponding to the sentence similarity matrix and calculates importance scores used in selecting sentences for the document summary.Type: GrantFiled: September 13, 2022Date of Patent: December 3, 2024Assignee: BANK OF MONTREALInventors: Drew R. Galow, Meera Das
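As an illustration of the extractive pipeline this abstract outlines (sentence vectors, a cosine similarity matrix, a sentence graph, importance scores), here is a minimal sketch. The abstract does not name the scoring algorithm, so the PageRank-style power iteration below is an assumption, as are the toy sentence vectors and the hypothetical names `similarity_matrix`, `importance_scores`, and `summarize`.

```python
import math


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def similarity_matrix(vectors):
    """Pairwise cosine similarities, clamped at 0, with a zero diagonal."""
    n = len(vectors)
    return [[0.0 if i == j else max(0.0, cosine(vectors[i], vectors[j]))
             for j in range(n)] for i in range(n)]


def importance_scores(sim, damping=0.85, iterations=50):
    """PageRank-style scores over the weighted sentence graph (assumed)."""
    n = len(sim)
    scores = [1.0 / n] * n
    row_sums = [sum(row) for row in sim]
    for _ in range(iterations):
        scores = [
            (1 - damping) / n + damping * sum(
                sim[j][i] / row_sums[j] * scores[j]
                for j in range(n) if row_sums[j] > 0)
            for i in range(n)
        ]
    return scores


def summarize(sentences, vectors, k):
    """Pick the k highest-scoring sentences and keep their source order."""
    scores = importance_scores(similarity_matrix(vectors))
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)[:k]
    return [sentences[i] for i in sorted(top)]


# Toy vectors standing in for word-embedding-based sentence representations.
sentences = ["first", "second", "third"]
vectors = [[1.0, 0.0], [1.0, 0.1], [0.0, 1.0]]
summary = summarize(sentences, vectors, k=2)
```

Keeping the selected sentences in their original order is a design choice made here so the stacked summary reads in source order; the abstract itself only says the sentences are stacked.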
-
Patent number: 12159476Abstract: A classification system is provided that separates unclassified pages into unclassified, separated documents and classifies the separated documents. The classification system applies a page-level recognition model to the unclassified pages to recognize the logical boundaries between documents and, based on the logical boundaries, separates the pages into unclassified, separated documents. The classification system further applies a document-level recognition model to classify the separated documents.Type: GrantFiled: January 2, 2023Date of Patent: December 3, 2024Assignee: OPEN TEXT SA ULCInventors: Sangeetha Yanamandra, Srirama Chandra Akella, Satish Chandra Paled, Newton Isaac Rajkumar
-
Patent number: 12159119Abstract: A first set of text generation prompts may be determined based on an input document and a first text generation prompt template. The first set of text generation prompts may include an instruction to identify factual assertions in the input text. The prompts may be sent to a remote text generation modeling system, which may respond by identifying factual assertions in the input text. A second set of text generation prompts may be determined based on the factual assertions and a second text generation prompt template. The second set of text generation prompts may include an instruction to respond to the factual assertions. A response to the input text may be generated based on written responses provided by the remote text generation modeling system.Type: GrantFiled: February 15, 2023Date of Patent: December 3, 2024Assignee: Casetext, Inc.Inventors: Jake Heller, Pablo Arredondo, Walter DeFoor, Ryan Walker, Javed Qadrud-Din
-
Patent number: 12159109Abstract: Embodiments of the present invention provide systems, methods, and computer storage media for pre-training entity extraction models to facilitate domain adaptation in resource-constrained domains. In an example embodiment, a first machine learning model is used to encode sentences of a source domain corpus and a target domain corpus into sentence embeddings. The sentence embeddings of the target domain corpus are combined into a target corpus embedding. Training sentences from the source domain corpus within a threshold of similarity to the target corpus embedding are selected. A second machine learning model is trained on the training sentences selected from the source domain corpus.Type: GrantFiled: November 12, 2021Date of Patent: December 3, 2024Assignee: Adobe Inc.Inventors: Aniruddha Mahapatra, Sharmila Reddy Nangi, Aparna Garimella, Anandha velu Natarajan
-
Patent number: 12159107Abstract: A method for automatically identifying word repetition errors includes the following steps: after performing word segmentation on a large-scale training corpus, performing statistics to obtain two-tuple and three-tuple structures including repeated words in the training corpus, and repeated combination degrees, left contextual adjacent word information entropy and right contextual adjacent word information entropy in the training corpus; performing statistics and recording words containing repeated characters in a Chinese dictionary and establishing a repeated word library of the Chinese dictionary; judging the repeated words appearing in the text to be subjected to error checking based on the repeated words in the Chinese dictionary; and judging the repeated words appearing in the text to be subjected to error checking based on the repeated combination degrees, left contextual adjacent word information entropy and right contextual adjacent word information entropy obtained by performing statistics.Type: GrantFiled: February 3, 2021Date of Patent: December 3, 2024Assignee: CHINA NATIONAL INSTITUTE OF STANDARDIZATIONInventors: Haitao Wang, Xinyu Cao, Liangliang Liu, Changqing Zhou
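The left and right contextual adjacent word information entropy mentioned in this abstract can be sketched as the Shannon entropy of the words seen immediately before or after a repeated-word tuple in a segmented corpus. This is a minimal sketch under that reading: the function name `neighbor_entropy`, the toy corpus, and the use of base-2 logarithms are assumptions, and the "repeated combination degree" statistic is not modeled.

```python
import math
from collections import Counter


def neighbor_entropy(sentences, target, side):
    """Shannon entropy (bits) of the tokens adjacent to `target`.

    sentences: list of token lists (already word-segmented).
    target:    tuple of tokens, e.g. a repeated-word two-tuple.
    side:      "left" or "right".
    """
    n = len(target)
    counts = Counter()
    for tokens in sentences:
        for i in range(len(tokens) - n + 1):
            if tuple(tokens[i:i + n]) != target:
                continue
            j = i - 1 if side == "left" else i + n
            if 0 <= j < len(tokens):
                counts[tokens[j]] += 1
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


# Toy segmented corpus; "x x" is the repeated-word tuple under inspection.
corpus = [
    ["a", "x", "x", "b"],
    ["c", "x", "x", "b"],
    ["a", "x", "x", "d"],
]
left = neighbor_entropy(corpus, ("x", "x"), "left")
right = neighbor_entropy(corpus, ("x", "x"), "right")
```

A tuple that appears with many different neighbors gets a high entropy, which is commonly taken as a sign that it is a free-standing unit rather than an accidental repetition; how the patented method combines these values into a decision is not stated in the abstract.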
-
Patent number: 12153891Abstract: A system and method for machine learning classification of user sentiment is disclosed. The method includes storing a plurality of category information. The plurality of category information includes a set of domain-specific category information. The method further includes extracting a plurality of aspects from textual data. The method further includes generating a sentiment by a machine learning model. The method further includes receiving the plurality of aspects and the set of domain-specific category information. The method further includes generating a sentiment based on the plurality of aspects and the set of domain-specific category information.Type: GrantFiled: June 21, 2021Date of Patent: November 26, 2024Assignee: Home Depot Product Authority, LLCInventors: Haozheng Tian, James Morgan White
-
Patent number: 12154356Abstract: A document to be analyzed and a set of key names to be extracted from the document are received. A set of strings of characters contained within the document and a location for each string of characters are identified. Moreover, a document graph for the document is generated. The document graph includes a set of nodes and a set of edges. Each node of the set of nodes corresponds to a string of characters of the set of strings of characters. Each edge of the set of edges connects two or more nodes together. Additionally, based on the document graph and the received set of key names, a set of keys is identified. Furthermore, a set of values is extracted from the document, and a set of key-value pairs is generated based on the identified set of keys and the extracted set of values.Type: GrantFiled: March 2, 2022Date of Patent: November 26, 2024Assignee: Alteryx, Inc.Inventors: Jad Dino Raad, Adam Blacke