Word Recognition Patents (Class 704/251)

Preliminary matching (Class 704/252)

Endpoint detection (Class 704/253)

Subportions (Class 704/254)

Specialized models (Class 704/255)

Markov (Class 704/256)

Hidden Markov Model (HMM) (EPO) (Class 704/256.1)

Training of HMM (EPO) (Class 704/256.2)

With insufficient amount of training data, e.g., state sharing, tying, deleted interpolation (EPO) (Class 704/256.3)

Continuous density, e.g, Gaussian distribution, Lapalce (EPO) (Class 704/256.7)
Discrete density, e.g., Vector Quantization preprocessor, look up tables (EPO) (Class 704/256.8)

Natural language (Class 704/257)

System and method for distributed learning

Patent number: 12373727

Abstract: A computer implemented method of distributed learning in a system comprising a parameter server configured to maintain a global parameter set of a model to be trained and a plurality of workers. The method comprises transmitting a current global parameter set to a worker. The worker performs a training step based on training data available to the worker, thereby generating a local set of parameters of the model, determines a likelihood of the local set of parameters being suitable for improving the global parameter set and omits transmission of the local set of parameters to the parameter server if it is determined that the local set of parameters is not likely suitable for improving the global parameter set.

Type: Grant

Filed: July 25, 2018

Date of Patent: July 29, 2025

Assignee: Kabushiki Kaisha Toshiba

Inventors: Abdussalam Ahmed Elturki, Aftab Khan
Voice interaction scripts

Patent number: 12360738

Abstract: This disclosure describes systems and methods that identify activities for which scripts can be built to perform an activity when requested by a user. The scripts can be voice-activated by a defined customized voice command and can include delivery preferences. The user's identity can be verified by analyzing voice biometrics of the customized voice command. After performance of the activity, results can be delivered to the device in the format indicated in the script.

Type: Grant

Filed: January 27, 2023

Date of Patent: July 15, 2025

Assignee: United Services Automobile Association (USAA)

Inventors: Charise Renee Whitaker, Michael J. Maciolek
Natural language processing

Patent number: 12340797

Abstract: Devices and techniques are generally described for inference reduction in natural language processing using semantic similarity-based caching. In various examples, first automatic speech recognition (ASR) data representing a first natural language input may be determined. A cache may be searched using the first ASR data. A first skill associated with the first ASR data may be determined from the cache. In some examples, first intent data representing a semantic interpretation of the first natural language input data may be determined by using a first natural language process associated with the first skill.

Type: Grant

Filed: March 6, 2023

Date of Patent: June 24, 2025

Assignee: AMAZON TECHNOLOGIES, INC.

Inventors: Kiana Hajebi, Vivek Yadav, Pradeep Natarajan
Dynamic workflow engine in an enterprise bot hub system

Patent number: 12339960

Abstract: Aspects of the disclosure relate to monitoring, evaluating, and repairing bots in a hashchain-based distributed bot hub that process a workflow. In some embodiments, a computing platform may receive workflow information associated with performing a first workflow that includes executing one or more tasks using a plurality of virtual bots, instantiate a first subset of the plurality of bots to process the one or more tasks of the first workflow, and instantiate a first subset of the plurality of bots to process the one or more tasks of the first workflow. identifying a potential anomalous activity may include causing the monitor bot hub to remove the identified bot to a quarantine hub, and execute a repair process on the identified bot in the quarantine hub.

Type: Grant

Filed: May 8, 2023

Date of Patent: June 24, 2025

Assignee: Bank of America Corporation

Inventors: Sakshi Bakshi, Sudhakar Balu, Siva Paini
Computing similarity of tree data structures using metric functions defined on sets

Patent number: 12332861

Abstract: Example embodiments facilitate efficient comparison operations of tree structures, resulting in comparison metrics (e.g., similarity or distance metrics or scores) used enhance software systems, such as search algorithms, code optimization software, enterprise database applications, and so on. Trees to be compared are converted into sets, i.e., serialized using a novel enumeration method. Metric functions can then be efficiently applied to the sets to facilitate the comparison operations. In an illustrative embodiment, subtrees of larger trees can be compared individually, pairwise, where the comparison results of the subtree comparisons can be selectively weighted and summed to yield an aggregated comparison metric that is tailored for a specific application or comparison priority.

Type: Grant

Filed: June 7, 2022

Date of Patent: June 17, 2025

Assignee: Oracle International Corporation

Inventor: Eugene Perkov
Enhanced searching using fine-tuned machine learning models

Patent number: 12314318

Abstract: An advanced search system leverages a pre-trained large language model to enhance user query responses. The system, equipped with hardware processors, a search query via an interface and accesses a pre-trained large language model designed to respond to the search query. The system fine-tunes the model to generate a task-specific generative model. The system employs the task-specific generative model to generate a search result to the search query and analyzes the search result based on a performance metric associated with the task-specific generative model. The system refines the task-specific generative model based on the analyzing of the search result.

Type: Grant

Filed: February 16, 2024

Date of Patent: May 27, 2025

Assignee: Snowflake Inc.

Inventors: Rahil Bathwal, Daniel Fernando Campos, Ashwin Devaraj, Seth Michael Li, Yash Pande, Vivek Raghunathan, Rajhans Samdani, Danmei Xu
System and method for detecting cognitive decline using speech analysis

Patent number: 12277929

Abstract: System and method for detecting cognitive decline in a subject using a classification system for detecting cognitive decline in the subject based on a speech sample. The classification system is trained using speech data corresponding to audio recordings of speech from normal and cognitive decline patients to generate an ensemble classifier comprising a plurality of component classifiers and an ensemble module. Each of the plurality of component classifiers is a machine-learning classifier configured to generate a component output identifying a sample data as corresponding to a normal patient or a cognitive patient. The machine-learning classifier is generated based on a subset of available features. The ensemble module receives component outputs from all of the component classifiers and generates an ensemble output identifying the sample data as corresponding to a normal or cognitive decline patient based on the component outputs.

Type: Grant

Filed: April 17, 2023

Date of Patent: April 15, 2025

Assignee: Janssen Pharmaceutica NV

Inventors: Srinivasan Vairavan, Vaibhav Narayan
Preprocessor system for natural language avatars

Patent number: 12271987

Abstract: A preprocessor for use with natural language processors for control of computerized avatars provides for an embedding of avatar control information in a speech response file of the natural language processor providing avatars with improved perception of emotional intelligence. Rapid avatar response is provided by independent end of speech detection and a response cache bypassing text-to-speech conversion times. The preprocessor may be shared among multiple websites to provide a shared analysis of query optimization.

Type: Grant

Filed: January 13, 2023

Date of Patent: April 8, 2025

Assignee: Codebaby, Inc.

Inventors: Dan German, Michelle Collins, Tyler W. Chase-Nason, Navroz J. Daroga
Transferring dialog data from an initially invoked automated assistant to a subsequently invoked automated assistant

Patent number: 12260858

Abstract: Systems and methods for providing dialog data, from an initially invoked automated assistant to a subsequently invoked automated assistant. A first automated assistant may be invoked by a user utterance, followed by a dialog with the user that is processed by the first automated assistant. During the dialog, a request to transfer dialog data to a second automated assistant is received. The request may originate with the user, by the first automated assistant, and/or by the second automated assistant. Once authorized, the first automated assistant provides the previous dialog data to the second automated assistant. The second automated assistant performs one or more actions based on the dialog data.

Type: Grant

Filed: November 22, 2021

Date of Patent: March 25, 2025

Assignee: GOOGLE LLC

Inventors: Matthew Sharifi, Victor Carbune
Techniques for anonymized searching of medical providers

Patent number: 12259933

Abstract: A user device can be used to generate medical term expressions, which represent medical terms of a health record. The user device can identify a medical concept present in the health record based on a medical term expression. The user device can generate a node in a personalized relational graph that corresponds to the medical concept. One or more sub-nodes can be added to the node. Responsive to a request, a user interface is presented that identifies the medical concept and some of the additional information.

Type: Grant

Filed: April 24, 2023

Date of Patent: March 25, 2025

Assignee: Apple Inc.

Inventors: David W. Padgett, Jason B. Morley, Christian Schroeder, Zhe Li, Mark E. Pennell, Kevin M. Lynch
Accounting for item attributes when selecting items satisfying a query based on item embeddings and an embedding for the query

Patent number: 12259894

Abstract: An online system maintains various items and maintains values for different attributes of the items, as well as an item embedding for each item. When the online system receives a query for retrieving one or more items, the online system generates an embedding for the query. Based on measures of similarity between the embedding for the query and item embeddings, the online system selects a set of items. The online system identifies a specific attribute of items and generates a whitelist of values for the specific attribute based on measures of similarity between item embeddings for items in the selected set and the embedding for the query. The online system removes items having values for the selected attribute outside of the whitelist of values from the selected set of items to identify items more likely to be relevant to the query.

Type: Grant

Filed: February 7, 2022

Date of Patent: March 25, 2025

Assignee: Maplebear Inc.

Inventors: Taesik Na, Zhihong Xu, Guanghua Shu, Tejaswi Tenneti, Haixun Wang
Speaker attributed transcript generation

Patent number: 12243534

Abstract: A computer implemented method processes audio streams recorded during a meeting by a plurality of distributed devices.

Type: Grant

Filed: April 4, 2022

Date of Patent: March 4, 2025

Assignee: Microsoft Technology Licensing, LLC

Inventors: Takuya Yoshioka, Andreas Stolcke, Zhuo Chen, Dimitrios Basile Dimitriadis, Nanshan Zeng, Lijuan Qin, William Isaac Hinthorn, Xuedong Huang
Subquery generation from a query

Patent number: 12229173

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating subqueries from a query. In one aspect, a method includes obtaining a query, generating a set of two subqueries from the query, where the set includes a first subquery and a second subquery, determining a quality score for the set of two subqueries, determining whether the quality score for the set of two subqueries satisfies a quality threshold, and in response to determining that the quality score for the set of two subqueries satisfies the quality threshold, providing a first response to the first subquery that is responsive to a first operation that receives the first subquery as input and providing a second response to the second subquery that is responsive to a second operation that receives the second subquery as input.

Type: Grant

Filed: December 9, 2020

Date of Patent: February 18, 2025

Assignee: Google LLC

Inventors: Vladimir Vuskovic, Joseph Lange, Behshad Behzadi, Marcin M. Nowak-Przygodzki
Detection and/or enrollment of hot commands to trigger responsive action by automated assistant

Patent number: 12217740

Abstract: Techniques are described herein for detecting and/or enrolling (or commissioning) new “hot commands” that are useable to cause an automated assistant to perform responsive action(s) without having to be first explicitly invoked. In various implementations, an automated assistant may be transitioned from a limited listening state into a full speech recognition state in response to a trigger event. While in the full speech recognition state, the automated assistant may receive and perform speech recognition processing on a spoken command from a user to generate a textual command. The textual command may be determined to satisfy a frequency threshold in a corpus of textual commands. Consequently, data indicative of the textual command may be enrolled as a hot command. Subsequent utterance of another textual command that is semantically consistent with the textual command may trigger performance of a responsive action by the automated assistant, without requiring explicit invocation.

Type: Grant

Filed: February 19, 2024

Date of Patent: February 4, 2025

Assignee: GOOGLE LLC

Inventors: Yuan Yuan, Bibo Xu, Tianyu Wang, Anurag Jain
Systems and methods for enabling topic-based verbal interaction with a virtual assistant

Patent number: 12217754

Abstract: Systems and methods are disclosed for enabling verbal interaction with an NLUI application without relying on express wake terms. The NLUI application receives an audio input comprising a plurality of terms. In response to determining that none of the terms is an express wake term pre-programmed into the NLUI application, the NLUI application determines a topic for the plurality of terms. The NLUI application then determines whether the topic is within a plurality of topics for which a response should be generated. If the determined topic of the audio input is within the plurality of topics, the NLUI application generates a response to the audio input.

Type: Grant

Filed: August 1, 2023

Date of Patent: February 4, 2025

Assignee: Adeia Guides Inc.

Inventors: Vikram Makam Gupta, Sukanya Agarwal, Gyanveer Singh
Using multiple modality input to feedback context for natural language understanding

Patent number: 12217750

Abstract: Input context for a statistical dialog manager may be provided. Upon receiving a spoken query from a user, the query may be categorized according to at least one context clue. The spoken query may then be converted to text according to a statistical dialog manager associated with the category of the query and a response to the spoken query may be provided to the user.

Type: Grant

Filed: January 21, 2022

Date of Patent: February 4, 2025

Assignee: Microsoft Technology Licensing, LLC

Inventors: Michael Bodell, John Bain, Robert Chambers, Karen M. Cross, Michael Kim, Nick Gedge, Daniel Frederick Penn, Kunal Patel, Edward Mark Tecot, Jeremy C. Waltmunson
Medical voice command integration

Patent number: 12207954

Abstract: System and methods for controlling healthcare devices and systems using voice commands are presented. In some aspects a listening device may receive voice command from a person. The voice command may be translated into human readable or machine readable text via a speech-to-text service. A control component may receive the text and send device-specific instructions to a medical device associated with a patient based on the translated voice command. In response to the instructions, the medical device may take an action on a patient. Some examples of actions taken may include setting an alarm limit on a monitor actively monitoring a patient and adjusting the amount of medication delivered by an infusion pump. Because these devices may be controlled using a voice command, in some cases, no physical or manual interaction is needed with the device. As such, multiple devices may be hands-free controlled from any location.

Type: Grant

Filed: May 9, 2023

Date of Patent: January 28, 2025

Assignee: CERNER INNOVATION, INC.

Inventors: Chad Hays, Randy Lantz
Information processing system that executes command corresponding to utterance, image processing apparatus, control method for information processing system, and storage medium storing control program for information processing system

Patent number: 12212724

Abstract: An information processing system that a user easily masters a relation between an execution process and an utterance instruction. The information processing system includes a display device, a microphone, an output unit, a display control unit, and an execution unit. The display device can display information. The microphone can obtain voice. The output unit outputs word information based on voice in natural language obtained with the microphone. The display control unit additionally displays utterance examples in association with touch objects included in a screen that is currently displayed on the display device. The execution unit executes a predetermined process linked to a touch object based on words included in a corresponding utterance example and the output word information at least.

Type: Grant

Filed: December 3, 2021

Date of Patent: January 28, 2025

Assignee: CANON KABUSHIKI KAISHA

Inventor: Kazuhiro Sugawara
Determining dialog states for language models

Patent number: 12205586

Abstract: Systems, methods, devices, and other techniques are described herein for determining dialog states that correspond to voice inputs and for biasing a language model based on the determined dialog states. In some implementations, a method includes receiving, at a computing system, audio data that indicates a voice input and determining a particular dialog state, from among a plurality of dialog states, which corresponds to the voice input. A set of n-grams can be identified that are associated with the particular dialog state that corresponds to the voice input. In response to identifying the set of n-grams that are associated with the particular dialog state that corresponds to the voice input, a language model can be biased by adjusting probability scores that the language model indicates for n-grams in the set of n-grams. The voice input can be transcribed using the adjusted language model.

Type: Grant

Filed: February 10, 2022

Date of Patent: January 21, 2025

Assignee: Google LLC

Inventors: Petar Aleksic, Pedro Jose Moreno Mengibar
Systems and methods for processing and utilizing video data

Patent number: 12206517

Abstract: A method includes receiving, from an entity, a request to organize a survey on a topic, based on the request, organizing a survey of a plurality of people, recording a video of the survey, obtaining a transcription of the video and linking the transcription of the video in time to the video to yield a processed video. The method can further include presenting, on a user interface to the entity based on the processed video, the video and the transcription of the video, wherein each word in the transcription of the video is selectable by the entity, receiving a selection of text by the entity from the transcription of the video and, based on the selection of the text, presenting a portion of the video at a time that is associated with when a participant in the video spoke the text.

Type: Grant

Filed: February 12, 2024

Date of Patent: January 21, 2025

Assignee: Mercury Analytics, LLC

Inventors: Scott James Brickner, Matthew Thomas Williams, Peter Calvin Viss, Elizabeth Michael Karen, James Lord Ardery
Digital assistance development system

Patent number: 12198688

Abstract: A system includes a development system and a digital assistance system. The development system includes a network interface configured to communicate with a plurality of communication channels, a processing system configured to interface with a project management subsystem, a scheduling subsystem, and the network interface, and an application programming interface configured to receive a command sequence for the project management subsystem and the scheduling subsystem. The digital assistance system includes a natural language processing engine configured to interface with a voice-enabled communication session through one of the communication channels. The digital assistance system also includes a command generator configured to generate the command sequence based on one or more requested tasks detected through the voice-enabled communication session and provide the command sequence to the application programming interface to execute the one or more requested tasks.

Type: Grant

Filed: June 23, 2021

Date of Patent: January 14, 2025

Assignee: THE TRAVELERS INDEMNITY COMPANY

Inventors: Obaid Shaikh, Ajay Srinivasulu, Madhavi Atluri, Sandhya Narayanamoorthy
Graphical user interface and pipeline for text analytics

Patent number: 12197481

Abstract: A graphical user interface (GUI) and pipeline for processing text documents is provided herein. In one example, a system can receive unstructured text documents. The system can determine entity-issue descriptions corresponding to the unstructured text documents. The system can then generate a GUI indicating the entity-issue descriptions. The GUI can also indicate assignments of the unstructured text documents to categories of a predefined schema. The GUI can allow the user to adjust the assignments of the unstructured text documents to the categories. The GUI can also include a table of rows, where each row corresponds to one of the unstructured text documents. Each row can indicate an entity-issue description in the corresponding unstructured text document and the categories assigned to the unstructured text document. Each row can also include a graphical button that is selectable to allow the user to view the unstructured text document corresponding to the row.

Type: Grant

Filed: June 7, 2024

Date of Patent: January 14, 2025

Assignee: SAS Institute Inc.

Inventors: Murali Krishna Pagolu, Corey Kyle Kozak
Filtering device, control system, and filtering method

Patent number: 12198670

Abstract: A filtering device is configured to estimate the characteristics of noise superposed on measurement data relating to the status of a controlled machine based on the status information representing the status of a controlled machine, thus adjusting the filtering to eliminate noise based on the estimated noise characteristics.

Type: Grant

Filed: March 30, 2020

Date of Patent: January 14, 2025

Assignee: NEC CORPORATION

Inventors: Daisuke Ohta, Hiroshi Yoshida, Tatsuya Yoshimoto
Robot response method, apparatus, device and storage medium

Patent number: 12182183

Abstract: The present application provides a robot response method, apparatus, device and storage medium. The method includes: obtaining, by a robot, current query voice; extracting semantic information of the current query voice; matching the semantic information of the current query voice with multiple semantic information clusters stored in advance to get a matched target semantic information cluster, where each semantic information cluster includes: at least one Q&A instance, and each Q&A instance includes: semantic information corresponding to a historical query voice and a query question selected in a query list corresponding to the historical query voice; and obtaining, by the robot, the number of times each query question was selected in the target semantic information cluster, determining, according to the number of times each query question was selected, a target query question corresponding to the current query voice, and outputting a query response corresponding to the target query question.

Type: Grant

Filed: April 20, 2020

Date of Patent: December 31, 2024

Assignee: JINGDONG TECHNOLOGY HOLDING CO., LTD.

Inventor: Yuyu Zheng
Artificial intelligence communication assistance

Patent number: 12166809

Abstract: A method of electronic communication assistance is provided. The method includes receiving, via an artificial intelligence assistant computing facility, an electronic communication from a first user intended to be received by a second user; and determining, via the artificial intelligence assistant computing facility, a capacity of the second user to receive the electronic communication. The method further includes determining, via the artificial intelligence assistant computing facility and based at least in part on the capacity of the second user, a time to send the electronic communication; and transmitting, via the artificial intelligence assistant computing facility, the time to the first user.

Type: Grant

Filed: June 16, 2023

Date of Patent: December 10, 2024

Assignee: Grammarly, Inc.

Inventors: Oleksiy Shevchenko, Ayan Mandal, Bradley Jon Hoover, Joel Tetreault, Maksym Lytvyn, Dmytro Lider
Devices, systems, and methods for distributed voice processing

Patent number: 12165643

Abstract: Systems and methods for distributed voice processing are disclosed herein. In one example, the method includes detecting sound via a microphone array of a first playback device and analyzing, via a first wake-word engine of the first playback device, the detected sound. The first playback device may transmit data associated with the detected sound to a second playback device over a local area network. A second wake-word engine of the second playback device may analyze the transmitted data associated with the detected sound. The method may further include identifying that the detected sound contains either a first wake word or a second wake word based on the analysis via the first and second wake-word engines, respectively. Based on the identification, sound data corresponding to the detected sound may be transmitted over a wide area network to a remote computing device associated with a particular voice assistant service.

Type: Grant

Filed: March 29, 2023

Date of Patent: December 10, 2024

Assignee: Sonos, Inc.

Inventors: Connor Kristopher Smith, John Tolomei, Betty Lee
Contextual utterance recommendations for natural language interfaces that support conversational visual analysis

Patent number: 12159116

Abstract: A computing device receives user selection of a data source. In accordance with the user selection, the device generates one or more initial natural language utterances according to metrics of data fields in the data source and/or previous user interaction with the data source. Each of the initial natural language utterances corresponds to a respective suggestion to guide visual analysis of the data source. The device displays the initial utterances in a graphical user interface. The device receives user selection of a first initial utterance of the initial utterances. In response to the user selection, the device generates and displays a first data visualization in accordance with data fields and/or analytical operations specified in the first initial utterance. The device also generates updated natural language utterances in accordance with the first initial utterance and the first data visualization, and displays the updated utterances with the first data visualization.

Type: Grant

Filed: January 10, 2022

Date of Patent: December 3, 2024

Assignee: Tableau Software, LLC

Inventors: Arjun Srinivasan, Vidya Raghavan Setlur
System and method for generating a block in a blockchain network using a voice-based hash value generated by a voice signature

Patent number: 12155748

Abstract: A system receives a speech of a user that indicates a request. The system extracts a plurality of voice features from the speech. The system converts the speech into a plurality of binary digits. The system determines a first voice feature constant value associated with a first voice feature, where the first voice feature constant value is an average of the first voice feature. The system determines a second voice feature constant value associated with the second voice feature, where the second voice feature constant value is an average of the second voice feature. The system encrypts the plurality of binary digits using the first and second voice feature constant values, where the encrypted plurality of binary digits corresponds to a voice-based hash value. The system generates a new block in a blockchain network using the voice-based hash value.

Type: Grant

Filed: April 7, 2022

Date of Patent: November 26, 2024

Assignee: Bank of America Corporation

Inventors: Prashant Khare, Abhishek Trivedi, Gaurav Dadhich, Saurabh Dutta, Shruti Nandini Thakur, Parneet Kaur Gujral, Zeno Valerian Anthony
AI based voice ordering system and method therefor

Patent number: 12154565

Abstract: The present invention relates to an AI-based voice ordering system and a method therefor and provides a voice ordering method and system, the voice ordering method comprising: a first step of an ordering smart terminal standing by for voice data reception; a second step of the ordering smart terminal analyzing whether an input signal has been received by an input unit corresponding to a microphone activation button; and a third step of, if the analysis result indicates that an input signal has not been received, returning to the first step and, conversely, if an input signal has been received, the ordering smart terminal receiving a voice signal from a microphone, converting the voice signal into voice data of a preset format, and then transmitting the converted voice data to a voice ordering server via a host terminal connected to a network, so that analysis of text data is performed.

Type: Grant

Filed: November 19, 2020

Date of Patent: November 26, 2024

Inventors: Sung Jin Park, Eun Jin Park
Voice-based scene selection for video content on a computing device

Patent number: 12149773

Abstract: Voice-based interaction with video content being presented by a media player application is enhanced through the use of an automated assistant capable of identifying when a spoken utterance by a user is a request to playback a specific scene in the video content. A query identified in a spoken utterance may be used to access stored scene metadata associated with video content being presented in the vicinity of the user to identify one or more locations in the video content that correspond to the query, such that a media control command may be issued to the media player application to cause the media player application to seek to a particular location in the video content that satisfies the query.

Type: Grant

Filed: September 2, 2022

Date of Patent: November 19, 2024

Assignee: GOOGLE LLC

Inventors: Matthew Sharifi, Victor Carbune
Voice information processing apparatus and voice information processing method

Patent number: 12142272

Abstract: A voice information processing apparatus sequentially converts an utterance of a user into text during a voice reception period that is a period in which an uttered voice to be converted into text is received from a user, and in a case where it can be regarded that the utterance of the user has been interrupted, the voice information processing apparatus automatically causes utterance content already uttered by the user to be output by a voice during the voice reception period. As a result, the voice information processing apparatus can cause the user to recognize a content of a sentence that has been uttered by the user so far and converted into text, when it can be regarded that the utterance of the user has been interrupted.

Type: Grant

Filed: September 9, 2021

Date of Patent: November 12, 2024

Assignee: ALPS ALPINE CO., LTD.

Inventor: Hongda Zheng
Systems and methods for natural language processing using a plurality of natural language models

Patent number: 12135945

Abstract: A virtual assistant server receives an utterance provided by an end user via a channel of a virtual assistant rendered in a client device. The virtual assistant server identifies a current-node of execution from a plurality of nodes of a conversation definition of the virtual assistant and identifies a first set of language models from a group of language models of the virtual assistant to interpret the utterance. Further, the virtual assistant server executes the first set of language models in an order based on the current-node until an intent of the utterance is determined. Subsequently, the virtual assistant server generates a response based on the intent and outputs the response to the client device.

Type: Grant

Filed: November 30, 2021

Date of Patent: November 5, 2024

Assignee: Kore.ai, Inc.

Inventors: Rajkumar Koneru, Prasanna Kumar Arikala Gunalan, Thirupathi Bandam, Girish Ahankari
Providing command bundle suggestions for an automated assistant

Patent number: 12135748

Abstract: Generating and/or recommending command bundles for a user of an automated assistant. A command bundle comprises a plurality of discrete actions that can be performed by an automated assistant. One or more of the actions of a command bundle can cause transmission of a corresponding command and/or other data to one or more devices and/or agents that are distinct from devices and/or agents to which data is transmitted based on other action(s) of the bundle. Implementations determine command bundles that are likely relevant to a user, and present those command bundles as suggestions to the user. In some of those implementations, a machine learning model is utilized to generate a user action embedding for the user, and a command bundle embedding for each of a plurality of command bundles. Command bundle(s) can be selected for suggestion based on comparison of the user action embedding and the command bundle embeddings.

Type: Grant

Filed: June 9, 2023

Date of Patent: November 5, 2024

Assignee: GOOGLE LLC

Inventor: Yuzhao Ni
Training keyword spotters

Patent number: 12136412

Abstract: A method of training a custom hotword model includes receiving a first set of training audio samples. The method also includes generating, using a speech embedding model configured to receive the first set of training audio samples as input, a corresponding hotword embedding representative of a custom hotword for each training audio sample of the first set of training audio samples. The speech embedding model is pre-trained on a different set of training audio samples with a greater number of training audio samples than the first set of training audio samples. The method further includes training the custom hotword model to detect a presence of the custom hotword in audio data. The custom hotword model is configured to receive, as input, each corresponding hotword embedding and to classify, as output, each corresponding hotword embedding as corresponding to the custom hotword.

Type: Grant

Filed: May 4, 2022

Date of Patent: November 5, 2024

Assignee: Google LLC

Inventors: Matthew Sharifi, Kevin Kilgour, Dominik Roblek, James Lin
Load current derived switch timing of switching resonant topology

Patent number: 12112919

Abstract: Systems, devices, and methods are discussed relating to plasma sources using load current switch timing of zero volt switching resonant topology.

Type: Grant

Filed: October 10, 2023

Date of Patent: October 8, 2024

Assignee: Kaufman & Robinson, Inc.

Inventor: Steven J. Geissler
Artificial intelligence models for composing audio scores

Patent number: 12100374

Abstract: A method for training one or more AI models for generating audio scores accompanying visual datasets includes obtaining training data comprising a plurality of audiovisual datasets and analyzing each of the plurality of audiovisual datasets to extract multiple visual features, textual features, and audio features. The method also includes correlating the multiple visual features and textual features with the multiple audio features via a machine learning network. Based on the correlations between the visual features, textual features, and audio features, one or more AI models are trained for composing one or more audio scores for accompanying a given dataset.

Type: Grant

Filed: May 13, 2021

Date of Patent: September 24, 2024

Assignee: Microsoft Technology Licensing, LLC

Inventor: Todd Matthew Williams
Method and apparatus for training speech recognition model, electronic device and storage medium

Patent number: 12100388

Abstract: A method and apparatus for training a speech recognition model, an electronic device and a storage medium are provided.

Type: Grant

Filed: May 18, 2022

Date of Patent: September 24, 2024

Assignee: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor: Qingen Zhao
Electronic device and voice recognition method thereof

Patent number: 12094460

Abstract: An electronic device is disclosed. The electronic device comprises: a voice reception unit for receiving user's voice; a storage unit for storing a first speech recognition module for recognizing user's voice and a second speech recognition module for recognizing only predetermined voice in the user's voice; and a processor for performing speech recognition of only a part of the user's voice through the first speech recognition module, when a result of speech recognition through the second speech recognition module shows that the user's voice includes the predetermined voice.

Type: Grant

Filed: July 18, 2017

Date of Patent: September 17, 2024

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventor: Jae-Hyun Bae
Conversation detector to insert audible announcements

Patent number: 12090924

Abstract: Systems and methods for detecting a convenient time to play an audible announcement. The techniques described herein can be implemented for audible announcements in various settings, including, for example, audible announcements in an autonomous vehicle and audible announcements from a mapping service on a mobile device. In an autonomous vehicle, interior microphones can be used to detect voices within the vehicle and identify pauses in conversation. Audible notifications and announcements within the autonomous vehicle can then be made during the pauses.

Type: Grant

Filed: December 21, 2021

Date of Patent: September 17, 2024

Assignee: GM Cruise Holdings LLC

Inventor: Brian Vaughn Gilbert
Determining dialog states for language models

Patent number: 12080290

Abstract: Systems, methods, devices, and other techniques are described herein for determining dialog states that correspond to voice inputs and for biasing a language model based on the determined dialog states. In some implementations, a method includes receiving, at a computing system, audio data that indicates a voice input and determining a particular dialog state, from among a plurality of dialog states, which corresponds to the voice input. A set of n-grams can be identified that are associated with the particular dialog state that corresponds to the voice input. In response to identifying the set of n-grams that are associated with the particular dialog state that corresponds to the voice input, a language model can be biased by adjusting probability scores that the language model indicates for n-grams in the set of n-grams. The voice input can be transcribed using the adjusted language model.

Type: Grant

Filed: February 10, 2022

Date of Patent: September 3, 2024

Assignee: Google LLC

Inventors: Petar Aleksic, Pedro Jose Moreno Mengibar
Multi-task automatic speech recognition system

Patent number: 12079587

Abstract: Disclosed herein are methods, systems, and computer-readable media for generating an output transcript from an input audio segment using a multi-task transformer model. In some embodiments, the transformer model can be trained to transcribe or translate audio data in multiple languages using labeled audio data. The labeled audio data can include first audio segments associated with first same-language transcripts of the first audio segments and second audio segments associated with second different-language transcripts of the second audio segments. In some embodiments, a vocabulary of the model can include special purpose and time stamp tokens. The special purpose tokens can specify tasks for the model to perform.

Type: Grant

Filed: April 18, 2023

Date of Patent: September 3, 2024

Assignee: OpenAI OpCo, LLC

Inventors: Alec Radford, Jong Wook Kim, Tao Xu, Greg Brockman, Christine McLeavey-Payne, Ilya Sutskever
Supplementing voice inputs to an automated assistant according to selected suggestions

Patent number: 12073832

Abstract: Implementations described herein relate to providing suggestions, via a display modality, for completing a spoken utterance for an automated assistant, in order to reduce a frequency and/or a length of time that the user will participate in a current and/or subsequent dialog session with the automated assistant. A user request can be compiled from content of an ongoing spoken utterance and content of any selected suggestion elements. When a currently compiled portion of the user request (from content of a selected suggestion(s) and an incomplete spoken utterance) is capable of being performed via the automated assistant, any actions corresponding to the currently compiled portion of the user request can be performed via the automated assistant. Furthermore, any further content resulting from performance of the actions, along with any discernible context, can be used for providing further suggestions.

Type: Grant

Filed: January 31, 2022

Date of Patent: August 27, 2024

Assignee: GOOGLE LLC

Inventors: Gleb Skobeltsyn, Olga Kapralova, Konstantin Shagin, Vladimir Vuskovic, Yufei Zhao, Bradley Nelson, Alessio Macrì, Abraham Lee
Multiple digital assistant coordination in vehicular environments

Patent number: 12073834

Abstract: The present disclosure is generally related to a data processing system to selectively invoke applications for execution. A data processing system can receive an input audio signal and can parse the input audio signal to identify a command. The data processing system can identify a first functionality of a first digital assistant application hosted on the data processing system in the vehicle and a second functionality of a second digital assistant application accessible via a client device. The data processing system can determine that one of the first functionality or the second functionality supports the command. The data processing system can select one of the first digital assistant application or the second digital assistant application based on the determination. The data processing system invoke one of the first digital assistant application or the second digital assistant application based on the selection.

Type: Grant

Filed: March 23, 2023

Date of Patent: August 27, 2024

Assignee: GOOGLE LLC

Inventors: Haris Ramic, Vikram Aggarwal, Moises Morgenstern Gali, Brandon Stuut
Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant

Patent number: 12073147

Abstract: An electronic device with one or more processors and memory includes a procedure for enabling conversation persistence across two or more instances of a digital assistant. In some embodiments, the device displays a first dialogue in a first instance of a digital assistant user interface. In response to a request to display a user interface different from the digital assistant user interface, the device displays the user interface different from the digital assistant user interface. In response to a request to invoke the digital assistant, the device displays a second instance of the digital assistant user interface, including displaying a second dialogue in the second instance of the digital assistant user interface, where the first dialogue remains available for display in the second instance of the digital assistant user interface.

Type: Grant

Filed: June 9, 2021

Date of Patent: August 27, 2024

Assignee: Apple Inc.

Inventors: David Carson, Daniel Keen, Evan Dibiase, Harry J. Saddler, Marco Iacono, Stephen O. Lemay, Donald W. Pitschel, Thomas R. Gruber
Dialog generation method and apparatus, device, and storage medium

Patent number: 12056167

Abstract: The present disclosure provides a dialog generation method, performed by a human-machine dialog system. The method includes obtaining an input dialog sequence from a dialog client; obtaining associated information related to the input dialog sequence; encoding, by an encoder, the input dialog sequence to obtain an input encoding vector; encoding, by the encoder, the associated information to obtain an associated encoding vector; decoding, by a decoder, the input encoding vector and the associated encoding vector to obtain an output dialog sequence, the output dialog sequence comprising an out-of-vocabulary word corresponding to the associated information; and transmitting the output dialog sequence to the dialog client.

Type: Grant

Filed: June 11, 2021

Date of Patent: August 6, 2024

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Yizhang Tan, Jiachen Ding, Changyu Miao
Evaluating screen content for accessibility

Patent number: 12051399

Abstract: In one example, a method for evaluating screen content for accessibility with a screen reader device is disclosed. The method provides a baseline document including a script of expected screen content that conforms accessibility requirements. The method may generate an audio file based on screen content elements. For some implementations, the method uses a machine learning model to transcribe the audio file into an output transcription file. The method may determine whether output transcription file matches the baseline document and a corresponding output report is generated.

Type: Grant

Filed: December 2, 2021

Date of Patent: July 30, 2024

Assignee: JPMORGAN CHASE BANK, N.A.

Inventors: Chandrasekar Murugesan, Sushama Addepalli, Xiang Zhang, Sudharsan Selvakumar, Sanjay Durgadin
Unmuted microphone notification

Patent number: 12046235

Abstract: One embodiment provides a method, including: receiving, at an input device associated with an information handling device, audio input; determining, using a processor, that an audible anomaly exists in the audio input, wherein the audible anomaly corresponds to a deviation from an established speech input pattern of a user; and performing, responsive to determining that the audible anomaly exists in the audio input, a remedial action to address the audible anomaly. Other aspects are described and claimed.

Type: Grant

Filed: July 29, 2021

Date of Patent: July 23, 2024

Assignee: LENOVO (SINGAPORE) PTE. LTD.

Inventor: Matthew Tucker
Machine learning for targeting help content

Patent number: 12039351

Abstract: Media, methods, and systems of recommending personalized help content within a group-based communication system. A machine learning model trained with prior user interaction data and historical user engagement data is used to generate a list of recommended help content based at least in part on received user interaction data for a user.

Type: Grant

Filed: November 29, 2022

Date of Patent: July 16, 2024

Assignee: Salesforce, Inc.

Inventors: Andrew Timmons, Fiona Condon, Joel Bartlett, Elijah Joseph-Young, Jason Kranker, Mihailo Milic, Shreya Mohan Shetty
Automated initiation and adaptation of a dialog with a user via user interface devices of a computing device of the user

Patent number: 12026530

Abstract: Methods and apparatus directed to utilizing an automated messaging system to initiate and/or adapt a dialog with at least one user, where the dialog occurs via user interface input and output devices of at least one computing device of the user. In some of those implementations, the automated messaging system identifies at least one task associated with the user and initiates the dialog with the user based on identifying the task. The automated messaging system may initiate the dialog to provide the user with additional information related to the task and/or to determine, based on user input provided during the dialog, values for one or more parameters of the task. In some implementations, the automated messaging system may further initiate performance of the task utilizing parameters determined during the dialog.

Type: Grant

Filed: November 7, 2022

Date of Patent: July 2, 2024

Assignee: GOOGLE LLC

Inventors: Guangqiang Zhang, Zhou Bailiang
Methods and systems for a compliance framework database schema

Patent number: 12026183

Abstract: Generating a compliance framework. The compliance framework facilitates an organization's compliance with multiple authority documents by providing efficient methodologies and refinements to existing technologies, such as providing hierarchical fidelity to the original authority document; separating auditable citations from their context (e.g., prepositions and or informational citations); asset focused citations; SNED and Live values, among others.

Type: Grant

Filed: January 27, 2021

Date of Patent: July 2, 2024

Assignee: UNIFIED COMPLIANCE FRAMEWORK (NETWORK FRONTIERS)

Inventor: Dorian J. Cougias

1 2 3 4 5 … next