Word Recognition Patents (Class 704/251)

Preliminary matching (Class 704/252)

Endpoint detection (Class 704/253)

Subportions (Class 704/254)

Specialized models (Class 704/255)

Markov (Class 704/256)

Hidden Markov Model (HMM) (EPO) (Class 704/256.1)

Training of HMM (EPO) (Class 704/256.2)

With insufficient amount of training data, e.g., state sharing, tying, deleted interpolation (EPO) (Class 704/256.3)

Continuous density, e.g., Gaussian distribution, Laplace (EPO) (Class 704/256.7)
Discrete density, e.g., Vector Quantization preprocessor, look up tables (EPO) (Class 704/256.8)

Natural language (Class 704/257)

Multimodal entity and coreference resolution for assistant systems

Patent number: 11966986

Abstract: In one embodiment, a method includes receiving, at a client system, an audio input, where the audio input comprises a coreference to a target object, accessing visual data from one or more cameras associated with the client system, where the visual data comprises images portraying one or more objects, resolving the coreference to the target object from among the one or more objects, resolving the target object to a specific entity, and providing, at the client system, a response to the audio input, where the response comprises information about the specific entity.

Type: Grant

Filed: August 1, 2022

Date of Patent: April 23, 2024

Assignee: Meta Platforms, Inc.

Inventors: Shivani Poddar, Seungwhan Moon, Paul Anthony Crook, Rajen Subba
Methods and apparatus for leveraging machine learning for generating responses in an interactive response system

Patent number: 11967309

Abstract: Apparatus and methods for leveraging machine learning and artificial intelligence to generate a response to an utterance expressed by a user during an interaction between an interactive response system and the user are provided. The methods may include a natural language processor processing the utterance to output an utterance intent. The methods may also include a signal extractor processing the utterance, the utterance intent and previous utterance data to output utterance signals. The methods may additionally include an utterance sentiment classifier using a hierarchy of rules to extract, from a database, a label, the extracting being based on the utterance signals. The methods may further include a sequential neural network classifier using a trained algorithm to process the label and a sequence of historical labels to output a sentiment score. The methods may further include outputting a response based on the utterance intent, the label and the score.

Type: Grant

Filed: December 1, 2021

Date of Patent: April 23, 2024

Assignee: Bank of America Corporation

Inventors: Isaac Persing, Emad Noorizadeh, Ramakrishna R. Yannam, Sushil Golani, Hari Gopalkrishnan, Dana Patrice Morrow Branch
Hotword suppression

Patent number: 11967323

Abstract: A method includes adding, by a first computing device, a first audio watermark to first speech data corresponding to playback of a first utterance including a hotword used to invoke an attention of a second computing device. The method includes outputting, by the first computing device, the playback of the first utterance corresponding to the watermarked first speech data. The second computing device is configured to receive the watermarked first speech data and determine to cease processing of the watermarked first speech data.

Type: Grant

Filed: June 24, 2022

Date of Patent: April 23, 2024

Assignee: GOOGLE LLC

Inventors: Alexander H. Gruenstein, Taral Pradeep Joglekar, Vijayaditya Peddinti, Michiel A. U. Bacchiani
Multi-tag concurrent identification method and system for query tree based on feature groups

Patent number: 11966807

Abstract: A multi-tag concurrent identification method and a system for a query tree based on feature groups are provided in this disclosure. In the disclosure, a whole data string space is divided into a plurality of disjoint subsets according to features of data strings returned by tags, where each of the subsets contains several different data strings, each of the data strings in the each of the subsets is regarded as a complete tag ID or a partial ID, and the each of the subsets corresponds to a unique query prefix, a length of the prefix is fixed and does not dynamically increase with an actual location of a collision, and when multiple data strings from a same subset return at a same time, a reader is capable of identifying them at a same time in a slot.

Type: Grant

Filed: August 15, 2023

Date of Patent: April 23, 2024

Assignee: Nanjing University of Information Science and Technology

Inventors: Jian Su, Jialin Zhou, Wei Zhuang, Ling Tan
Localized wakeword verification

Patent number: 11961519

Abstract: In one aspect, a networked microphone device is configured to (i) receive sound data, (ii) determine, via the wake-word engine, that a first portion of the sound data is representative of a wake word, (iii) determine that a second networked microphone device was added to a media playback system, (iv) transmit the first portion of the sound data to a second networked microphone device, (v) begin determining a command to be performed by the first networked microphone device, (vi) receive an indication of whether the first portion of the sound data is representative of the wake word, and (vii) output a response indicative of whether the first portion of the sound data is representative of the wake word.

Type: Grant

Filed: April 18, 2022

Date of Patent: April 16, 2024

Assignee: Sonos, Inc.

Inventor: Connor Kristopher Smith
Electronic apparatus and controlling method thereof

Patent number: 11961506

Abstract: An electronic apparatus including a memory configured to store first voice recognition information related to a first language and second voice recognition information related to a second language, and a processor to obtain a first text corresponding to a user voice that is received on the basis of first voice recognition information, based on an entity name being included in the user voice according to the obtained first text, identify a segment in the user voice in which the entity name is included. The processor is to obtain a second text corresponding to the identified segment of the user voice on the basis of the second voice recognition information, and obtain control information corresponding to the user voice on the basis of the first text and the second text.

Type: Grant

Filed: February 23, 2023

Date of Patent: April 16, 2024

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Chansik Bok, Jihun Park
Multi-stage entity resolution

Patent number: 11947913

Abstract: Techniques for performing multi-stage entity resolution (ER) processing are described. A system may determine a portion of a user input corresponding to an entity name, and may request an entity provider component to perform a search to determine one or more entities corresponding to the entity name. The preliminary search results may be sent to a skill selection component for processing, while the entity provider component performs a complete search to determine entities corresponding to the entity name. A selected skill component may request the complete search results to perform its processing, including determining an output responsive to the user input.

Type: Grant

Filed: June 24, 2021

Date of Patent: April 2, 2024

Assignee: Amazon Technologies, Inc.

Inventors: David Paul Ramos, Tonytip Ketudat, Vikas Chawla, Lukas Leon Brower
Detection and/or enrollment of hot commands to trigger responsive action by automated assistant

Patent number: 11948556

Abstract: Techniques are described herein for detecting and/or enrolling (or commissioning) new “hot commands” that are useable to cause an automated assistant to perform responsive action(s) without having to be first explicitly invoked. In various implementations, an automated assistant may be transitioned from a limited listening state into a full speech recognition state in response to a trigger event. While in the full speech recognition state, the automated assistant may receive and perform speech recognition processing on a spoken command from a user to generate a textual command. The textual command may be determined to satisfy a frequency threshold in a corpus of textual commands. Consequently, data indicative of the textual command may be enrolled as a hot command. Subsequent utterance of another textual command that is semantically consistent with the textual command may trigger performance of a responsive action by the automated assistant, without requiring explicit invocation.

Type: Grant

Filed: December 11, 2019

Date of Patent: April 2, 2024

Assignee: GOOGLE LLC

Inventors: Yuan Yuan, Bibo Xu, Tianyu Wang, Anurag Jain
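
The enrollment step in the abstract of patent 11948556 above (a textual command is enrolled once it satisfies a frequency threshold in a corpus) can be illustrated with a minimal Python sketch. This is not the patented method: the function names, the relative-frequency form of the threshold, and the whitespace and case normalization are all assumptions made for illustration, and the "semantically consistent" matching of later utterances is not modeled.

```python
from collections import Counter


def normalize(command: str) -> str:
    """Lowercase and collapse whitespace (an assumed, simplistic normalization)."""
    return " ".join(command.lower().split())


def enroll_hot_commands(corpus: list[str], min_fraction: float) -> set[str]:
    """Return the normalized commands whose share of the corpus meets the threshold.

    `min_fraction` is a hypothetical parameter: the abstract only says a
    frequency threshold is satisfied, not how it is expressed.
    """
    counts = Counter(normalize(c) for c in corpus)
    total = sum(counts.values())
    if total == 0:
        return set()
    return {cmd for cmd, n in counts.items() if n / total >= min_fraction}
```

Calling `enroll_hot_commands(commands, 0.5)` on a list of transcribed commands would return only those that make up at least half of the corpus after normalization.
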
Methods and apparatus for leveraging sentiment values in flagging and/or removal of real time workflows

Patent number: 11948557

Abstract: Aspects of the disclosure relate to using an apparatus for flagging and removing real time workflows that produce sub-optimal results. Such an apparatus may include an utterance sentiment classifier. The apparatus stores a hierarchy of rules. Each of the rules is associated with one or more rule signals. In response to receiving the one or more utterance signals, the classifier iterates through the hierarchy of rules in sequential order to identify a first rule for which the one or more utterance signals are a superset of the rule's one or more rule signals. In response to receiving the one or more alternate utterance signals from the signal extractor, the classifier may iterate through the hierarchy of rules in sequential order to identify the first rule in the hierarchy for which the one or more alternate utterance signals are a superset of the first rule's one or more rule signals.

Type: Grant

Filed: December 1, 2021

Date of Patent: April 2, 2024

Assignee: Bank of America Corporation

Inventors: Ramakrishna R. Yannam, Isaac Persing, Emad Noorizadeh
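
The rule lookup in the abstract of patent 11948557 above (iterate through a hierarchy of rules in order and stop at the first rule whose signals are all contained in the utterance signals) can be sketched in a few lines of Python. The rule labels, signal names, and the `first_matching_rule` function are hypothetical; the sketch covers only the superset test, not the flagging or removal of workflows.

```python
from typing import Iterable, Optional, Sequence, Tuple

# A rule is (label, required signals); the sequence order is the hierarchy order.
Rule = Tuple[str, frozenset]


def first_matching_rule(rules: Sequence[Rule],
                        utterance_signals: Iterable[str]) -> Optional[str]:
    """Return the label of the first rule whose signals are a subset of the utterance signals."""
    signals = set(utterance_signals)
    for label, rule_signals in rules:
        # The utterance signals must be a superset of the rule's signals.
        if signals >= rule_signals:
            return label
    return None  # no rule in the hierarchy matched
```

Placing a catch-all rule with an empty signal set last in the sequence guarantees a match, since every set is a superset of the empty set.
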
Method and apparatus for using image data to aid voice recognition

Patent number: 11942087

Abstract: A device performs a method for using image data to aid voice recognition. The method includes the device capturing image data of a vicinity of the device and adjusting, based on the image data, a set of parameters for voice recognition performed by the device. The set of parameters for the device performing voice recognition include, but are not limited to: a trigger threshold of a trigger for voice recognition; a set of beamforming parameters; a database for voice recognition; and/or an algorithm for voice recognition. The algorithm may include using noise suppression or using acoustic beamforming.

Type: Grant

Filed: January 13, 2021

Date of Patent: March 26, 2024

Assignee: Google Technology Holdings LLC

Inventors: Robert A. Zurek, Adrian M. Schuster, Fu-Lin Shau, Jincheng Wu
Recognizing speech in the presence of additional audio

Patent number: 11942083

Abstract: The technology described in this document can be embodied in a computer-implemented method that includes receiving, at a processing system, a first signal including an output of a speaker device and an additional audio signal. The method also includes determining, by the processing system, based at least in part on a model trained to identify the output of the speaker device, that the additional audio signal corresponds to an utterance of a user. The method further includes initiating a reduction in an audio output level of the speaker device based on determining that the additional audio signal corresponds to the utterance of the user.

Type: Grant

Filed: May 21, 2021

Date of Patent: March 26, 2024

Assignee: Google LLC

Inventors: Diego Melendo Casado, Ignacio Lopez Moreno, Javier Gonzalez-Dominguez
Electronic apparatus for recognizing voice and method of controlling the same

Patent number: 11942089

Abstract: Disclosed is an electronic apparatus which may receive a user voice of a user to be registered as a start command for activating a speech recognition function of the electronic apparatus; identify whether the user voice is misrecognized as a reference audio, wherein the reference audio includes a voice uttered by a user with high frequency from among a plurality of voices previously uttered by the user; and based on the user voice being not misrecognized as the reference audio, register the user voice as the start command.

Type: Grant

Filed: September 10, 2021

Date of Patent: March 26, 2024

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventor: Chanhee Choi
System and techniques for inferring a threat model in a cloud-native environment

Patent number: 11936678

Abstract: In some aspects, a server device may identify one or more services of a cloud infrastructure via a management layer. The server device may determine service information and configuration information for the one or more services. The server device may generate an environment model based at least in part on the service information and the configuration information, the environment model providing information on relationship between one or more components of the cloud infrastructure. The server device may determine one or more threats to the one or more services based at least in part on analyzing the environment model and accessing a threat information database. The server device may generate a threat model that lists the one or more threats to the one or more services. The server device may generate one or more recommendations for the cloud infrastructure based at least on the threat model.

Type: Grant

Filed: January 6, 2022

Date of Patent: March 19, 2024

Assignee: Oracle International Corporation

Inventors: Olgierd Stanislaw Pieczul, Junaid Khan
Preserving speech hypotheses across computing devices and/or dialog sessions

Patent number: 11935519

Abstract: Implementations can receive, at a computing device, audio data corresponding to a spoken utterance of a user, process the audio data to generate, for one or more parts of the spoken utterance, a plurality of speech hypotheses, select a given one of the speech hypotheses, cause the given one of the speech hypotheses to be incorporated as a portion of a transcription associated with the software application, and store the plurality of speech hypotheses. In some implementations, the plurality of speech hypotheses can be loaded at an additional computing device when the transcription is accessed at the additional computing device. In additional or alternative implementations, the plurality of speech hypotheses can be loaded into memory of the computing device when the software application is reactivated and/or when a subsequent dialog session associated with the transcription is initiated.

Type: Grant

Filed: October 15, 2020

Date of Patent: March 19, 2024

Assignee: GOOGLE LLC

Inventors: Matthew Sharifi, Victor Carbune
Methods and apparatus for leveraging an application programming interface (“API”) request for storing a list of sentiment values in real time interactive response systems

Patent number: 11935532

Abstract: Aspects of the disclosure relate to receiving a stateless application programming interface (“API”) request. The API request may store an utterance, previous utterance data and a sequence of labels, each label in the sequence of labels being associated with a previous utterance expressed by a user during an interaction. The previous utterance data may, in certain embodiments, be limited to a pre-determined number of utterances occurring prior to the utterance. Embodiments process the utterance, using a natural language processor in electronic communication with the first processor, to output an utterance intent, a semantic meaning of the utterance and an utterance parameter. The utterance parameter may include words in the utterance and be associated with the intent. The natural language processor may append the utterance intent, the semantic meaning of the utterance and the utterance parameter to the API request. A signal extractor processor may append the plurality of utterance signals to the API request.

Type: Grant

Filed: December 1, 2021

Date of Patent: March 19, 2024

Assignee: Bank of America Corporation

Inventors: Ramakrishna R. Yannam, Emad Noorizadeh, Isaac Persing, Sushil Golani, Hari Gopalkrishnan, Dana Patrice Morrow Branch
Multi-tier rule and AI processing for high-speed conversation scoring and selecting of optimal responses

Patent number: 11935531

Abstract: Apparatus and methods for leveraging machine learning and artificial intelligence to assess a sentiment of an utterance expressed by a user during an interaction between an interactive response system and the user are provided. The methods may include a natural language processor processing the utterance to output an utterance intent. The methods may also include a signal extractor processing the utterance, the utterance intent and previous utterance data to output utterance signals. The methods may additionally include an utterance sentiment classifier using a hierarchy of rules to extract, from a database, a label, the extracting being based on the utterance signals. The methods may further include a sequential neural network classifier using a trained algorithm to process the label and a sequence of historical labels to output a sentiment score.

Type: Grant

Filed: December 1, 2021

Date of Patent: March 19, 2024

Assignee: Bank of America Corporation

Inventors: Isaac Persing, Emad Noorizadeh, Ramakrishna R. Yannam, Sushil Golani, Hari Gopalkrishnan, Dana Patrice Morrow Branch
Electronic device for managing user model and operating method thereof

Patent number: 11929079

Abstract: Disclosed is an operating method of an electronic device, including receiving a first user model from another electronic device of a user registered in the electronic device, through a communication circuit of the electronic device, refining a user model of the electronic device based on the first user model, and identifying the user based on a first voice input of the user by using the refined user model, wherein the user model is trained by the electronic device based on a second user model of at least one another user other than the user before the user is registered, and wherein the first user model is trained by the another electronic device based on a second voice input of the user obtained by the another electronic device of the user.

Type: Grant

Filed: September 23, 2021

Date of Patent: March 12, 2024

Assignee: Samsung Electronics Co., Ltd

Inventors: Taegu Kim, Dale Noh, Yoonju Lee
Hub device, multi-device system including the hub device and plurality of devices, and operating method of the hub device and multi-device system

Patent number: 11930230

Abstract: A method, performed by a hub device, of storing a voice assistant model for controlling a device includes receiving information about a first device connected to the hub device; requesting a voice assistant server to update a device determination model stored in the hub device; receiving the updated device determination model from the voice assistant server and storing the received updated device determination model; requesting, from the voice assistant server, information about a function determination model corresponding to the first device; receiving the information about the function determination model corresponding to the first device from the voice assistant server, and determining whether to store the function determination model in the hub device based on the received information; and based on the function determination model being determined to be stored in the hub device, storing, in the hub device, the function determination model corresponding to the first device.

Type: Grant

Filed: October 30, 2020

Date of Patent: March 12, 2024

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Jiyeon Hong, Hyeonmok Ko, Sungja Choi, Changho Paeon
Multi-tier rule and AI processing for high-speed conversation scoring

Patent number: 11922928

Abstract: Apparatus and methods for leveraging machine learning and artificial intelligence to assess a sentiment of an utterance expressed by a user during an interaction between an interactive response system and the user are provided. The methods may include a natural language processor processing the utterance to output an utterance intent. The methods may also include a signal extractor processing the utterance, the utterance intent and previous utterance data to output utterance signals. The methods may additionally include an utterance sentiment classifier using a hierarchy of rules to extract, from a database, a label, the extracting being based on the utterance signals. The methods may further include a sequential neural network classifier using a trained algorithm to process the label and a sequence of historical labels to output a sentiment score.

Type: Grant

Filed: December 1, 2021

Date of Patent: March 5, 2024

Assignee: Bank of America Corporation

Inventors: Ramakrishna R. Yannam, Isaac Persing, Emad Noorizadeh, Sushil Golani, Hari Gopalkrishnan, Dana Patrice Morrow Branch
Phrase alternatives representation for automatic speech recognition and methods of use

Patent number: 11922944

Abstract: Phrase alternative data structures are generated from the lattice output of an audio input to Automatic Speech Recognition (ASR) system. A user interface is supported for users to view phrase alternatives to selected portions of an audio transcript of the audio input, search the transcript based on query phrases, or edit the transcript based on phrase alternatives.

Type: Grant

Filed: June 28, 2021

Date of Patent: March 5, 2024

Assignee: MOD9 TECHNOLOGIES

Inventors: Arlo Mukai Faria, Adam Louis Janin, Deanna Dee Gelbart, Aniruddh Rangaprasad Iyengar, Elliot Zhenbin Lin
Interface and mode selection for digital action execution

Patent number: 11922193

Abstract: Interface and mode selection for digital action execution is provided. For example, a system loads a script library embedded in an electronic resource. The system determines a historic level of engagement between a client computing device and one or more digital assistants. The system selects, based on a first property of the client computing device and the historic level of engagement, a type of digital interface. The system generates, based on the type of digital interface, a digital interface with the call-to-action. The system determines, responsive to an instruction to execute the call-to-action, a mode of execution. The system selects a digital assistant and a second client device to execute the call-to-action. The system transmits the call-to-action to the second client device for execution.

Type: Grant

Filed: February 28, 2020

Date of Patent: March 5, 2024

Assignee: GOOGLE LLC

Inventors: Prithvi Balaram, Nikhil Rao, Ian Baker, Adam Coimbra
Systems and methods for detecting voice commands to generate a peer-to-peer communication link

Patent number: 11917092

Abstract: A voice-based peer-to-peer communication system may be used to detect voice commands from users to provide a wireless communication voice connection that allows the users to directly communicate with each other. The system may include a first computing device of a first user communicatively coupled to a second computing device of a second user over the wireless connection. The system may process the detected voice command having a phrase, contact name, and voice message. The phrase may include a wake, answer, or stop phrase. The contact name may be utilized to determine whether that contact name matches an entry within a predetermined contact list of the first user, where the matched contact name may be associated with the second user. Finally, the system may generate audio data based on the processed voice command that is then transmitted to the second computing device of the second user over the wireless connection.

Type: Grant

Filed: June 3, 2021

Date of Patent: February 27, 2024

Assignee: SYNTIANT

Inventor: Jeremiah H. Holleman, III
Multi-modal input on an electronic device

Patent number: 11914925

Abstract: A computer-implemented input-method editor process includes receiving a request from a user for an application-independent input method editor having written and spoken input capabilities, identifying that the user is about to provide spoken input to the application-independent input method editor, and receiving a spoken input from the user. The spoken input corresponds to input to an application and is converted to text that represents the spoken input. The text is provided as input to the application.

Type: Grant

Filed: July 13, 2022

Date of Patent: February 27, 2024

Assignee: Google LLC

Inventors: Brandon M. Ballinger, Johan Schalkwyk, Michael H. Cohen, William J. Byrne, Gudmundur Hafsteinsson, Michael J. Lebeau
Dynamically extracting n-grams for automated vocabulary updates

Patent number: 11907657

Abstract: Systems and methods dynamically extract n-grams for automated vocabulary updates. Text is received. An n-gram extracted from the text is matched to a canonical n-gram from a vocabulary to identify a tag for the text. An n-gram weight is computed for the n-gram extracted from the text. The n-gram weight may be computed by adjusting a term frequency of the n-gram. A relevancy score is computed for the tag using the n-gram weight and using an n-gram frequency of the canonical n-gram. The relevancy score is computed by dividing the n-gram weight by a value proportional to the n-gram frequency of the canonical n-gram. The relevancy score of the n-gram is presented.

Type: Grant

Filed: June 30, 2023

Date of Patent: February 20, 2024

Assignee: Intuit Inc.

Inventors: Byungkyu Kang, Shivakumara Narayanaswamy, Andrew Mattarella-Micke
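
The scoring described in the abstract of patent 11907657 above (an n-gram weight derived from an adjusted term frequency, divided by a value proportional to the canonical n-gram's frequency) can be illustrated with a short Python sketch. The abstract does not say how the term frequency is adjusted or what the proportionality constant is, so the log damping `1 + log(tf)`, the `scale` parameter, and all names below are assumptions rather than the patented formula.

```python
import math
from collections import Counter


def extract_ngrams(tokens: list[str], n: int) -> list[str]:
    """Return all contiguous n-grams of the token list, joined by spaces."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def relevancy_scores(text: str,
                     vocabulary: dict[str, str],
                     canonical_freq: dict[str, float],
                     max_n: int = 2,
                     scale: float = 1.0) -> dict[str, float]:
    """Map each matched tag to a relevancy score.

    vocabulary maps a canonical n-gram to its tag; canonical_freq maps a
    canonical n-gram to its frequency in some reference corpus (both hypothetical).
    """
    tokens = text.lower().split()
    counts = Counter(g for n in range(1, max_n + 1)
                     for g in extract_ngrams(tokens, n))
    scores: dict[str, float] = {}
    for ngram, tf in counts.items():
        tag = vocabulary.get(ngram)
        freq = canonical_freq.get(ngram, 0.0)
        if tag is None or freq <= 0:
            continue  # not a canonical n-gram, or no usable frequency
        weight = 1.0 + math.log(tf)      # assumed adjustment of term frequency
        score = weight / (scale * freq)  # divide by a value proportional to frequency
        scores[tag] = max(scores.get(tag, 0.0), score)
    return scores
```

Dividing by the canonical frequency plays a role similar to inverse document frequency: n-grams that are common everywhere contribute less to a tag's relevancy than rare ones.
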
Systems and methods for processing and utilizing video data

Patent number: 11902042

Abstract: A method includes receiving, from an entity, a request to organize a survey on a topic, based on the request, organizing a survey of a plurality of people, recording a video of the survey, obtaining a transcription of the video and linking the transcription of the video in time to the video to yield a processed video. The method can further include presenting, on a user interface to the entity based on the processed video, the video and the transcription of the video, wherein each word in the transcription of the video is selectable by the entity, receiving a selection of text by the entity from the transcription of the video and, based on the selection of the text, presenting a portion of the video at a time that is associated with when a participant in the video spoke the text. The user can also select a “clip to next speaker” option to generate a clip.

Type: Grant

Filed: February 28, 2022

Date of Patent: February 13, 2024

Assignee: Mercury Analytics, LLC

Inventors: Scott James Brickner, Matthew Thomas Williams, Peter Calvin Viss, Elizabeth Michael Karen, James Lord Ardery
Context driven device arbitration

Patent number: 11875820

Abstract: This disclosure describes, in part, context-driven device arbitration techniques to select a speech interface device from multiple speech interface devices to provide a response to a command included in a speech utterance of a user. In some examples, the context-driven arbitration techniques may include executing multiple pipeline instances to analyze audio signals and device metadata received from each of the multiple speech interface devices which detected the speech utterance. A remote speech processing service may execute the multiple pipeline instances and analyze the audio signals and/or metadata, at various stages of the pipeline instances, to determine which speech interface device is to respond to the speech utterance.

Type: Grant

Filed: September 24, 2021

Date of Patent: January 16, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Tony Roy Hardie, Brian Alexander Oliver, Vikram Kumar Gundeti
Systems and methods for capturing, processing, and rendering one or more context-aware moment-associating elements

Patent number: 11869508

Abstract: Computer-implemented method and system for receiving and processing one or more moment-associating elements. For example, the computer-implemented method includes receiving the one or more moment-associating elements, transforming the one or more moment-associating elements into one or more pieces of moment-associating information, and transmitting at least one piece of the one or more pieces of moment-associating information.

Type: Grant

Filed: April 28, 2021

Date of Patent: January 9, 2024

Assignee: Otter.ai, Inc.

Inventors: Yun Fu, Simon Lau, Kaisuke Nakajima, Julius Cheng, Sam Song Liang, James Mason Altreuter, Kean Kheong Chin, Zhenhao Ge, Hitesh Anand Gupta, Xiaoke Huang, James Francis McAteer, Brian Francis Williams, Tao Xing
Agent device, agent system, and computer-readable storage medium

Patent number: 11869488

Abstract: In cases in which a confidence score of an inferred intent label is a predetermined threshold or less, an intent inference section searches for whether or not wording pertaining to a location, such as “on the door”, is present in a question. In cases in which a word relating to a location is present, the intent inference section consults individual function identification data associated with OM item codes in order to find intent labels including individual functions relevant to the location (such as “door”). In cases in which an intent label including an individual function relevant to the “door” is found, an OMA interaction control section consults QA data to find and acquire associated response information based on the found intent label and the OM item code, and notifies a HMI interaction control section of such response information.

Type: Grant

Filed: November 30, 2020

Date of Patent: January 9, 2024

Assignee: TOYOTA JIDOSHA KABUSHIKI KAISHA

Inventors: Chikage Kubo, Keiko Nakano, Eiichi Maeda, Hiroyuki Nishizawa
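
The fallback described in the abstract of patent 11869488 above (when the confidence of the inferred intent label is at or below a threshold, look for a location word in the question and use it to find a relevant intent label) can be sketched as follows. The `location_to_labels` mapping, the choice to return the first listed label, and the `None` result when nothing matches are assumptions; the OM item codes and the QA data lookup mentioned in the abstract are not modeled.

```python
import re
from typing import Mapping, Optional, Sequence


def resolve_intent(question: str,
                   inferred_label: str,
                   confidence: float,
                   threshold: float,
                   location_to_labels: Mapping[str, Sequence[str]]) -> Optional[str]:
    """Keep the inferred label when confident, otherwise fall back to a location word."""
    if confidence > threshold:
        return inferred_label
    # Low confidence: search the question for wording about a location.
    words = set(re.findall(r"[a-z]+", question.lower()))
    for location, labels in location_to_labels.items():
        if location in words and labels:
            return labels[0]  # assumed tie-break: first label for that location
    return None  # no location word found; the abstract does not cover this case
```

Matching on whole words rather than substrings avoids treating a question about "indoors" as a question about the "door".
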
Acoustic method and system for providing digital data

Patent number: 11867539

Abstract: An acoustic system and method are disclosed for providing spatial and temporal classification of a range of different types of sound producing targets in a geographical area. The system includes an optical signal transmitter arrangement for repeatedly transmitting, at multiple instants, interrogating optical signals into each of one or more optical fibres distributed across the geographical area and forming at least part of an installed fibre-optic communications network. An optical signal detector arrangement receives, during an observation period following each of the multiple instants, returning optical signals scattered in a distributed manner over distance along the one or more optical fibres.

Type: Grant

Filed: January 10, 2023

Date of Patent: January 9, 2024

Assignee: Fiber Sense Limited

Inventor: Mark A. Englund
Methods, systems, and media for providing direct and hybrid data acquisition approaches

Patent number: 11861531

Abstract: Methods, systems, and media for providing direct and hybrid data acquisition approaches are provided.

Type: Grant

Filed: August 1, 2022

Date of Patent: January 2, 2024

Assignee: Integral Ad Science, Inc.

Inventors: Joshua M. Attenberg, Foster J. Provost
Apparatus and method for recognition of patient activities

Patent number: 11862033

Abstract: A medication confirmation method and apparatus. The method of an embodiment of the invention includes the steps of capturing one or more video sequences of a user administering medication via a medication administration apparatus, storing the captured one or more video sequences, and analyzing at least one of the stored video sequences to confirm that the user has properly administered the medication, and has properly positioned the medication administration apparatus.

Type: Grant

Filed: May 11, 2020

Date of Patent: January 2, 2024

Assignee: AIC Innovations Group, Inc.

Inventors: Adam Hanina, Gordon Kessler, Lei Guan
Dynamic microphone system for autonomous vehicles

Patent number: 11854541

Abstract: Devices, systems and processes for a dynamic microphone system that enhances the passenger experience in autonomous vehicles are described. One example method for enhancing a passenger experience includes generating, using an artificial intelligence algorithm, a plurality of filters based on a plurality of stored waveforms previously recorded by each of one or more passengers and a plurality of recordings of one or more noise sources, capturing voice commands from at least one of the one or more passengers inside the autonomous vehicle, generating voice commands with reduced distortion based on processing the voice commands using the plurality of filters, and instructing, based on the voice commands with reduced distortion, the autonomous vehicle to perform one or more actions.

Type: Grant

Filed: December 1, 2020

Date of Patent: December 26, 2023

Assignee: ALPINE ELECTRONICS OF SILICON VALLEY, INC.

Inventors: Thomas Yamasaki, Rocky Chau-Hsiung Lin, Koichiro Kanda
Message playing method and terminal

Patent number: 11837217

Abstract: A message playing method includes: receiving a first message, and asking, in a voice manner, whether to play the first message; if a first voice of a user does not match a keyword of a positive reply, continuing to detect a voice of the user; if a detected second voice of the user matches the keyword of the positive reply, playing the first message in the voice manner, and recording a quantity of times of using a text corresponding to the first voice; and when the recorded quantity of times of using the text corresponding to the first voice is greater than a first threshold, adding the text to the keyword of the positive reply.

Type: Grant

Filed: July 4, 2018

Date of Patent: December 5, 2023

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Yue Zhang, Qiang Tao
Rear door warning system

Patent number: 11833996

Abstract: A vehicle includes a rear door, at least one sensor component proximate to the rear door, an alert system, a memory in which program instructions are stored, and a processor operably connected to the alert system, the at least one sensor component, and the memory. The processor executes the program instructions to detect an object within a detection zone of the at least one sensor component relative to the rear door. The processor further activates the alert system in response to the object being within a predefined protection zone of the detection zone when the rear door is not moving.

Type: Grant

Filed: April 22, 2021

Date of Patent: December 5, 2023

Assignee: Robert Bosch GmbH

Inventor: Robert Kaster
Diagnostic systems and methods of vehicles using olfaction

Patent number: 11828210

Abstract: A vehicle system includes: at least one of: (a) a particulate matter sensor configured to measure an amount of particulate within a passenger cabin of a vehicle; and (b) a volatile organic compounds (VOC) sensor configured to measure an amount of VOCs within the passenger cabin of the vehicle; and a control module configured to, based on at least one of the amount of particulate and the amount of VOCs within the passenger cabin of the vehicle, diagnose at least one of: a characteristic of the vehicle; presence of a chemical in the passenger cabin of the vehicle; occurrence of an event within the passenger cabin of the vehicle; and a remaining life of engine oil of the vehicle.

Type: Grant

Filed: March 31, 2021

Date of Patent: November 28, 2023

Assignee: DENSO International America, Inc.

Inventors: Sibu Varughese, Martín Nespolo, Thomas Krzyzak, Gareth Webb, Wilson Yim, Matthew Johnson
Expediting interaction with a digital assistant by predicting user responses

Patent number: 11823661

Abstract: A computer-implemented technique is described herein for expediting a user's interaction with a digital assistant. In one implementation, the technique involves receiving a system prompt generated by a digital assistant in response to an input command provided by a user via an input device. The technique then generates a predicted response based on linguistic content of the system prompt, together with contextual features pertaining to a circumstance in which the system prompt was issued. The predicted response corresponds to a prediction of how the user will respond to the system prompt. The technique then selects one or more dialogue actions from a plurality of dialogue actions, based on a confidence value associated with the predicted response. The technique expedites the user's interaction with the digital assistant by reducing the number of system prompts that the user is asked to respond to.

Type: Grant

Filed: November 24, 2020

Date of Patent: November 21, 2023

Assignee: Microsoft Technology Licensing, LLC

Inventors: Vipul Agarwal, Rahul Kumar Jha, Soumya Batra, Karthik Tangirala, Mohammad Makarechian, Imed Zitouni
Method and system of audio false keyphrase rejection using speaker recognition

Patent number: 11823679

Abstract: Techniques related to a method and system of audio false keyphrase rejection using speaker recognition are described herein. Such techniques use speaker recognition of a computer-originated voice to omit actions that are triggered when a keyphrase is present in captured audio, the actions being omitted when speech of the captured audio was spoken by the computer-originated voice.

Type: Grant

Filed: July 13, 2022

Date of Patent: November 21, 2023

Assignee: Intel Corporation

Inventors: Jacek Ossowski, Tobias Bocklet, Kuba Lopatka
Load current derived switch timing of switching resonant topology

Patent number: 11823867

Abstract: Systems, devices, and methods are discussed relating to plasma sources using load current switch timing of zero volt switching resonant topology.

Type: Grant

Filed: December 21, 2021

Date of Patent: November 21, 2023

Assignee: Kaufman & Robinson, Inc.

Inventor: Steven J. Geissler
Sensitivity mode for an audio spotting system

Patent number: 11823707

Abstract: An audio spotting system configured for various operating modes including a regular mode and sensitivity mode is described. An example cascade audio spotting system may include a high-power subsystem including a high-power trigger and a transfer module. This high-power trigger includes one or more detection models used to detect whether a target sound activity is included in the one or more audio streams. The one or more detection models are associated with a first set of hyperparameters when the cascade audio spotting system is in a regular mode, and the one or more detection models are associated with a second set of hyperparameters when the cascade audio spotting system is in a sensitivity mode. The transfer module provides at least one of one or more processed audio streams for further processing in response to the high-power trigger detecting the target sound activity in the one or more audio streams.

Type: Grant

Filed: January 10, 2022

Date of Patent: November 21, 2023

Assignee: Synaptics Incorporated

Inventor: Saeed Mosayyebpour Kaskari
Architecture for context-augmented word embedding

Patent number: 11823671

Abstract: Features are disclosed for training and using a word embedding model configured to receive textual and context data associated with an utterance of a user. A word embedding model can be trained with text data and context data to account for context associated with the text data. The word embedding model can receive an input vector including text data and one or more sets of context data associated with the text data and perform word embedding based on the input vector. In some embodiments, the input vector can include an automatic speech recognition (“ASR”) confidence score generated by an ASR model and one or more labels generated by an NLU model. In some embodiments, the input vector can include user information associated with the user.

Type: Grant

Filed: May 11, 2020

Date of Patent: November 21, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Prathap Ramachandra, Lan Ma, Liu Yang, Yi Qin, Kyung Jae Lee, Wenbo Yan, Charlotte Alizerine Dzialo, Darshan Ashok Fofadiya
User profile linking

Patent number: 11810555

Abstract: Described are techniques for generating a skill-stored user profile and linking it with a natural language processing (NLP) system-stored user profile. In at least some examples, a user may provide a natural language input to an NLP system. The NLP system may determine a skill is to process to perform an action responsive to the natural language input. To perform the action, the skill may require the user have a user profile stored by the skill, but the user may not have such a user profile. However, the NLP system may store a user profile for the user. The NLP system may determine the user profile stored thereby and may send, with user permission, information in the user profile to the skill. The skill may use the received information to generate and store a user profile for the user. Thereafter, the skill may provide the NLP system with a user profile identifier that the skill may use to identify the user's profile stored thereby.

Type: Grant

Filed: August 18, 2021

Date of Patent: November 7, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Fabien Hertschuh, Mohit Mittal, John Montague Howard, Akshit Proothi, Rahul Manne
Systems and methods for addressing possible interruption during interaction with digital assistant

Patent number: 11810566

Abstract: Systems and methods are described for handling interruptions during a digital assistant session between a user and a digital assistant by detecting if an interruption event is to occur during the digital assistant session. In response to detecting that the interruption event is to occur, options to address the interruption are provided.

Type: Grant

Filed: February 18, 2022

Date of Patent: November 7, 2023

Assignee: Rovi Guides, Inc.

Inventors: Vikram Makam Gupta, Vishwas Sharadanagar Panchaksharaiah
Low power integrated circuit to analyze a digitized audio stream

Patent number: 11810569

Abstract: Methods, devices, and systems for processing audio information are disclosed. An exemplary method includes receiving an audio stream. The audio stream may be monitored by a low power integrated circuit. The audio stream may be digitized by the low power integrated circuit. The digitized audio stream may be stored in a memory, wherein storing the digitized audio stream comprises replacing a prior digitized audio stream stored in the memory with the digitized audio stream. The low power integrated circuit may analyze the stored digitized audio stream for recognition of a keyword. The low power integrated circuit may induce a processor to enter an increased power usage state upon recognition of the keyword within the stored digitized audio stream. The stored digitized audio stream may be transmitted to a server for processing. A response received from the server based on the processed audio stream may be rendered.

Type: Grant

Filed: June 14, 2021

Date of Patent: November 7, 2023

Assignee: QUALCOMM Incorporated

Inventors: Eric Liu, Stefan Johannes Walter Marti, Seung Wook Kim
Unsupervised keyword spotting and word discovery for fraud analytics

Patent number: 11810559

Abstract: Embodiments described herein provide for a computer that detects one or more keywords of interest using acoustic features, to detect or query commonalities across multiple fraud calls. Embodiments described herein may implement unsupervised keyword spotting (UKWS) or unsupervised word discovery (UWD) in order to identify commonalities across a set of calls, where both UKWS and UWD employ Gaussian Mixture Models (GMM) and one or more dynamic time-warping algorithms. A user may indicate a training exemplar or occurrence of call-specific information, referred to herein as “a named entity,” such as a person's name, an account number, account balance, or order number. The computer may perform a redaction process that computationally nullifies the import of the named entity in the modeling processes described herein.

Type: Grant

Filed: June 6, 2022

Date of Patent: November 7, 2023

Assignee: PINDROP SECURITY, INC.

Inventor: Hrishikesh Rao
On-device custom wake word detection

Patent number: 11798535

Abstract: Generally discussed herein are devices, systems, and methods for on-device detection of a wake word. A device can include a memory including model parameters that define a custom wake word detection model, the wake word detection model including a recurrent neural network transducer (RNNT) and a lookup table (LUT), the LUT indicating a hidden vector to be provided in response to a phoneme of a user-specified wake word, a microphone to capture audio, and processing circuitry to receive the audio from the microphone, determine, using the wake word detection model, whether the audio includes an utterance of the user-specified wake word, and wake up a personal assistant after determining the audio includes the utterance of the user-specified wake word.

Type: Grant

Filed: September 14, 2021

Date of Patent: October 24, 2023

Assignee: Microsoft Technology Licensing, LLC

Inventors: Emilian Stoimenov, Rui Zhao, Kaustubh Prakash Kalgaonkar, Ivaylo Andreanov Enchev, Khuram Shahid, Anthony Phillip Stark, Guoli Ye, Mahadevan Srinivasan, Yifan Gong, Hosam Adel Khalil
Learning offline voice commands based on usage of online voice commands

Patent number: 11790890

Abstract: Methods, systems, apparatus, including computer programs encoded on a computer storage medium, for a user device to learn offline voice actions. In one aspect, the method includes actions of detecting, by the user device, an utterance at a first time when the user device is connected to a server by a network, providing, by the user device, the utterance to the server using the network, receiving, by the user device and from the server, an update to the grammar of the user device, detecting, by the user device, a subsequent utterance of the utterance at a second time when the user device is not connected to the server by a network, and in response to detecting, by the user device, the subsequent utterance of the utterance at the second time, identifying, by the user device, an operation to perform based on (i) the subsequent utterance, and (ii) the updated grammar.

Type: Grant

Filed: November 8, 2021

Date of Patent: October 17, 2023

Assignee: GOOGLE LLC

Inventors: Vikram Aggarwal, Moises Morgenstern Gali
Systems, methods, and apparatuses for providing assistant deep links to effectuate third-party dialog session transfers

Patent number: 11790004

Abstract: Methods, apparatus, systems, and computer-readable media are provided for transferring dialog sessions between devices using deep links. The dialog sessions can correspond to interactions, mediated by an automated assistant, between a user and a third party application. During the dialog session, a user can request that the dialog session be transferred to a different device, for example, to interact with the third party application through a different modality. In response, the automated assistant and/or the third party application can generate a link that can be transferred to the transferee device to allow the transferee device to seamlessly take over the dialog session. In this way, computational resources and electrical power can be preserved by not requiring a recipient device to re-process natural language inputs previously provided during the dialog session.

Type: Grant

Filed: January 9, 2023

Date of Patent: October 17, 2023

Assignee: GOOGLE LLC

Inventors: Justin Lewis, Scott Davies
System and method for synthesizing automated test cases from natural interactions

Patent number: 11790886

Abstract: A system for synthesizing real IVR scenarios, automatically reproducing test scenarios, and providing accurate reports on those scenarios. An IVR tester uses a telephony device (website, mobile app, VOIP, etc.) and an automated IVR test case generator to place test calls to an IVR system. The automated IVR test case generator intercepts and collects the phone call's audio stream, dual-tone multi-frequency signals, and available metadata (e.g., response times, post-speech silence timeout, etc.). Once the call is complete, the automated IVR test case generator pre-populates a webpage or other user-interface technology and a database with the audio stream, dual-tone multi-frequency signals, and available metadata from the test scenario phone call. The IVR tester may then save that test scenario and continue to the next scenario. The saved scenario may now be used automatically by the IVR test case generator for regression testing or duplicated and edited for use in other systems.

Type: Grant

Filed: November 6, 2020

Date of Patent: October 17, 2023

Assignee: CYARA SOLUTIONS PTY LTD

Inventors: Alok Kulkarni, Geoff Willshire, Thomas Fejes
Information processing apparatus and destination search method

Patent number: 11769494

Abstract: An information processing apparatus is connected to a voice processing server that analyzes text data transmitted from a voice input/output apparatus that converts an instruction by an utterance of a user to the text data and outputs the text data, and outputs an instruction obtained by analysis and utterance language information indicating a language of the utterance, and the information processing apparatus includes: a communicator that communicates with the voice processing server; a destination searcher that determines on the basis of the utterance language information whether to include a space character in a target of the search, and searches for a name indicated in a search character string from a destination list on the basis of a result of the determination; and a hardware processor that performs control to transmit a search result of a destination by the destination searcher to the voice processing server via the communicator.

Type: Grant

Filed: April 15, 2021

Date of Patent: September 26, 2023

Assignee: KONICA MINOLTA, INC.

Inventor: Yusuke Mikoshiba
Method for processing multimodal mobile donations via text message and email communication

Patent number: 11769138

Abstract: A donation processing system receives a text message from a donor indicating that they would like to make a donation to a non-profit organization. In response to the text message, a response message indicating the particulars of the donation including the donation amount and the non-profit organization is sent to the user. The donor reviews the information and, if the information is correct, a confirmation message is sent from the donor in response to the response message. The confirmation message confirms the donation. The donation is then made to the non-profit organization in response to the confirmation message.

Type: Grant

Filed: February 22, 2017

Date of Patent: September 26, 2023

Assignee: SWOOP IP HOLDINGS LLC

Inventor: Gabriel L. Sierra
