Speech Controlled System Patents (Class 704/275)

Systems and methods for voice-assisted media content selection

Patent number: 12360734

Abstract: Systems and methods for media playback via a media playback system include (i) capturing a voice input comprising a request for media content, (ii) receiving information derived at least from the request for media content, (iii) requesting and receiving information from at least one remote computing device associated with a first media content service and at least one remote computing device associated with a second media content service, wherein (a) the information identifies first media content available via the first media content service for playback and identifies second media content available via the second media content service for playback, and (b) the first and second media content are related to the requested media content, and (iv) after receiving at least one of the first information and the second information, (a) selecting the first media content instead of the second media content, and (b) playing back the first media content.

Type: Grant

Filed: October 10, 2023

Date of Patent: July 15, 2025

Assignee: Sonos, Inc.

Inventors: Sherwin Liu, Paul Bates
Processing system having a machine learning engine for providing an output via a digital assistant system

Patent number: 12360739

Abstract: Aspects of the disclosure relate to generating outputs using a digital personal assistant computing control platform and machine learning. A computing platform may receive, from a digital personal assistant computing device, a first voice command input. The computing platform may then determine, via machine learning algorithms, an identifier output indicating a user associated with the first voice command input and a location output indicating a geographic location associated with the user. The computing platform may determine, via a stored calendar, an availability output indicating availability associated with the user. Based on the identifier output, the location output, and the availability output, a charitable opportunity output indicating a charitable opportunity may be determined by the computing platform and may be transmitted to a computing device associated with the charitable opportunity.

Type: Grant

Filed: December 6, 2023

Date of Patent: July 15, 2025

Assignee: Allstate Insurance Company

Inventors: Elizabeth C. Schreier, Jamie E. Grahn
Speech skill jumping method for man machine dialogue, electronic device and storage medium

Patent number: 12347432

Abstract: Disclosed is a speech skill jumping method for man-machine dialogue, comprising constructing a field migration map in advance based on user's historical man-machine dialogue data; receiving external speech; determining a dialogue field that the external speech hits; and judging whether the hit dialogue field belongs to one of the plurality of dialogue fields in the field migration map, and ignoring the external speech if not, or jumping to a speech skill corresponding to the hit dialogue field if yes. A field migration map is generated based on a user's historical man-machine dialogue data which reflects the user's interaction habits, and whether to perform a speech skill jump is judged based on the field migration map, obviously abnormal input content can be shielded, improving the task completion and interaction efficiency.

Type: Grant

Filed: October 21, 2020

Date of Patent: July 1, 2025

Assignee: AI SPEECH CO., LTD.

Inventors: Hongbo Song, Shuai Fan, Chun Li
Multi-task learning for personalized keyword spotting

Patent number: 12347439

Abstract: Systems and techniques are provided for processing audio data. For example, the systems and techniques can be used for personalized keyword spotting through multi-task learning (PK-MTL). A process can include obtaining an audio sample, generating a representation of a keyword based on the audio sample, and generating a representation of a speaker based on the audio sample. The speaker can be associated with the keyword. A first similarity score can be determined based on a reference representation and one or more of the representation of the keyword and a representation of the speaker. The reference representation can be associated with one or more of the keyword and the speaker. A keyword spotting (KWS) output can be generated based on analyzing the first similarity score against at least a first threshold, wherein the KWS output accepts or rejects the audio sample as including a target keyword.

Type: Grant

Filed: January 12, 2023

Date of Patent: July 1, 2025

Assignee: QUALCOMM Incorporated

Inventors: Seunghan Yang, Byeonggeun Kim, Inseop Chung, Simyung Chang
Binary file feature information extraction through binary file immobilization and wavelet signal processing

Patent number: 12339964

Abstract: Disclosed is a method of extracting file feature information, the method being performed by a computing device including at least one processor, the method including: converting input data in a form of a binary file into data with a preset size; and extracting feature information of the input data from the data with the preset size. The representative drawing may be FIG. 2.

Type: Grant

Filed: July 15, 2022

Date of Patent: June 24, 2025

Assignee: Korea University Research and Business Foundation

Inventors: Huy Kang Kim, Sang Min Park, Sang Hoon Jeon
Determining a current system utterance with connective and content portions from a user utterance

Patent number: 12340803

Abstract: A voice dialogue system includes: a voice input unit which acquires a user utterance, a dialogue text creator which creates a text of a system utterance, a voice output unit which outputs the system utterance as voice data, and a setting unit that sets a response deadline for system utterances. When a first system utterance is output by the voice output unit, a second system utterance is output after the first system utterance without having acquired a user utterance, and the second system utterance comprises a connective portion for connecting following sentences and a content portion that is a subject of the second system utterance, the setting unit sets a timing at which output of the connective portion of the second system utterance ends or output of the content portion of the second system utterance starts as the response deadline for the first system utterance.

Type: Grant

Filed: December 14, 2023

Date of Patent: June 24, 2025

Assignee: TOYOTA JIDOSHA KABUSHIKI KAISHA

Inventors: Atsushi Ikeno, Yusuke Jinguji, Toshifumi Nishijima, Fuminori Kataoka, Hiromi Tonegawa, Norihide Umeyama
Selectively invoking an automated assistant based on detected environmental conditions without necessitating voice-based invocation of the automated assistant

Patent number: 12327552

Abstract: Implementations set forth herein relate to an automated assistant that is invoked according to contextual signals—in lieu of requiring a user to explicitly speak an invocation phrase. When a user is in an environment with an assistant-enabled device, contextual data characterizing features of the environment can be processed to determine whether a user intends to invoke the automated assistant. Therefore, when such features are detected by the automated assistant, the automated assistant can bypass requiring an invocation phrase from a user and, instead, be responsive to one or more assistant commands from the user. The automated assistant can operate based on a trained machine learning model that is trained using instances of training data that characterize previous interactions in which one or more users invoked or did not invoke the automated assistant.

Type: Grant

Filed: January 17, 2020

Date of Patent: June 10, 2025

Assignee: GOOGLE LLC

Inventors: Petar Aleksic, Pedro Jose Moreno Mengibar
Intelligent device grouping

Patent number: 12327550

Abstract: Systems and methods for intelligent device grouping are disclosed. An environment, such as a home, may have a number of voice-enabled devices and accessory devices that may be controlled by the voice-enabled devices. One or more models, such as linguistics model(s) and/or device affinity models may be utilized to determine which accessory devices are candidates for inclusion in a device group, and a recommendation for grouping the devices may be provided. Device-group naming recommendations may also be generated and may be sent to users.

Type: Grant

Filed: June 23, 2022

Date of Patent: June 10, 2025

Assignee: Amazon Technologies, Inc.

Inventors: Zeya Chen, Charles Edwin Ashton Brett, Jay Patel, Lizhen Peng, Aniruddha Basak, Hongyang Wang, Yunfeng Jiang, Sven Eberhardt, Akshay Kumar, William Evan Welbourne, Sara Hillenmeyer
Driving assistance apparatus, driving assistance method, and computer-readable storage medium storing driving assistance program

Patent number: 12311934

Abstract: A driving assistance apparatus executes a moving control to autonomously control a moving of a vehicle. The apparatus informs a driver of contents of a voice operation process planned to be executed for the moving control in accordance with utterance contents of the driver acquired by voice recognition and request the driver to perform an approval operation to approve the informed contents. The apparatus executes the voice operation process when the approval operation is performed. When a moving situation of the vehicle is not a predetermined situation which needs the approval operation, the apparatus execute the voice operation process without the approval operation being performed, and set an upper limit of an acceleration of the vehicle by the moving control to a value smaller than the upper limit set when executing the voice operation process in response to the approval operation being performed.

Type: Grant

Filed: May 25, 2023

Date of Patent: May 27, 2025

Assignee: TOYOTA JIDOSHA KABUSHIKI KAISHA

Inventor: Yuma Ito
Support for syntax analysis during processing instructions for execution

Patent number: 12300223

Abstract: The present disclosure relates to computer-implemented methods, software, and systems for data processing of instructions requested for execution at a given version of a software system or application. One example method includes defining rules for processing instructions at a syntax analyzer of an application. The rules can include hint metadata for evaluating validity statuses of instructions at different versions of the application. An instruction that includes tokens is receive at the syntax analyzer to determine whether a token sequence corresponds to a rule from the rules. In response to determining that the token sequence corresponds to the rule and the rule is inactive for a current version of the application associated with the syntax analyzer, a second version of the application can be determined that is associated with the corresponding rule as an active rule. An indication of the second version of the application can be provided.

Type: Grant

Filed: January 4, 2022

Date of Patent: May 13, 2025

Assignee: SAP SE

Inventor: Marcos Del Puerto Garcia
Method and an apparatus for executing operation/s on device/s

Patent number: 12300229

Abstract: Aspects of the invention are directed towards an apparatus and method for executing operation/s on device/s. One or more embodiments of the invention describe the method comprising steps of receiving a voice command from a user for enabling one or more devices to execute an operation and determining validity of the voice command received from the user. The method further describes steps of converting the voice command to a generic command based on the validity of the command and transmitting the generic command for enabling the one or more devices to execute the operation.

Type: Grant

Filed: September 29, 2020

Date of Patent: May 13, 2025

Assignee: KIDDE FIRE PROTECTION, LLC

Inventors: Anjaneyulu Goriparti, Pratibha Kothakota
Multi-mode guard for voice commands

Patent number: 12293762

Abstract: Embodiments may be implemented by a computing device, such as a head-mountable display, in order to use a single guard phrase to enable different voice commands in different interface modes. An example device includes an audio sensor and a computing system configured to analyze audio data captured by the audio sensor to detect speech that includes a predefined guard phrase, and to operate in a plurality of different interface modes comprising at least a first and a second interface mode. During operation in the first interface mode, the computing system may initially disable one or more first-mode speech commands, and respond to detection of the guard phrase by enabling the one or more first-mode speech commands. During operation in the second interface mode, the computing system may initially disable a second-mode speech command, and to respond to the guard phrase by enabling the second-mode speech command.

Type: Grant

Filed: December 16, 2022

Date of Patent: May 6, 2025

Assignee: GOOGLE LLC

Inventors: Michael J. LeBeau, Mat Balez
Systems and methods for generating wake signals from known users

Patent number: 12293196

Abstract: Provided herein is an integrated circuit including, in some embodiments, a host processor, a digitally implemented neural network co-processor, and a communications interface between the host processor and the co-processor configured to transmit information therebetween. The special-purpose host processor can be operable as a stand-alone processor. The neural network co-processor may include a digitally implemented neural network. The co-processor is configured to enhance special-purpose processing of the host processor through an artificial neural network. In such embodiments, the host processor is wake keyword identifier processor configured to transmit one or more detected patterns to the co-processor over a communications interface. The co-processor is configured to transmit the recognized patterns to the host processor which can then identify and verify wake keywords spoken by a known user.

Type: Grant

Filed: January 16, 2021

Date of Patent: May 6, 2025

Assignee: SYNTIANT

Inventors: Kurt F. Busch, Jeremiah H. Holleman, III, Pieter Vorenkamp, Stephen W. Bailey, David Christopher Garrett
Co-located full-body gestures

Patent number: 12282592

Abstract: A method for detecting full-body gestures by a mobile device includes a host mobile device detecting the tracked body of a co-located participant in a multi-party session. When the participant's tracked body provides a full-body gesture, the host's mobile device recognizes that there is a tracked body providing a full-body gesture. The host mobile device iterates through the list of participants in the multi-party session and finds the closest participant mobile device with respect to the screen-space position of the head of the gesturing participant. The host mobile device then obtains the user ID of the closest participant mobile device and broadcasts the recognized full-body gesture event to all co-located participants in the multi-party session, along with the obtained user ID. Each participant's mobile device may then handle the gesture event as appropriate for the multi-party session. For example, a character or costume may be assigned to a gesturing participant.

Type: Grant

Filed: September 1, 2022

Date of Patent: April 22, 2025

Assignee: Snap Inc.

Inventors: Daekun Kim, Lei Zhang, Youjean Cho, Ava Robinson, Yu Jiang Tham, Rajan Vaish, Andrés Monroy-Hernández
Cooking appliance and control knob with integrated display

Patent number: 12276427

Abstract: A control knob for an appliance deploys a round bezel region that surround a fixed electronic display panel that is touch sensitive. The appliance state or operating condition is modified by the user rotating the bezel and selecting graphic user interface (GUI) icons on the electronic display. The bezel orientation modifies the GUI to display icons that operate the appliance in different modes. In a first mode of power control, the GUI displays power levels and the bezel orientation modifies the power level. In a second mode, the GUI displays temperature settings and the bezel orientation modifies the temperature. The GUI displays icons that when activated switch between the first and second mode as well as turn off the appliance. Another icon may activate a third mode that enables control and display of other settings, such as temperature units, time and date, count down timers and the like.

Type: Grant

Filed: July 9, 2021

Date of Patent: April 15, 2025

Assignee: Hestan Commercial Corporation

Inventors: Raymond Nilssen, Jairad Sloyer
Graph-based data compliance using natural language text

Patent number: 12271498

Abstract: Various embodiments of the present disclosure provide automated data compliance techniques for complex access controlled datasets subject to a plurality of data access constraints. Some of the techniques may include generating, using one or more natural language models, entity-relationship data for an access controlled dataset and generating a knowledge graph based on the entity-relationship data. The knowledge graph includes a plurality of vertices connected by a plurality of edges that may be traversed to identify a data access condition indicative of a data access violation or a data coverage violation. Some of the techniques may include generating, using the knowledge graph, a natural language condition description based on the data access condition and providing a condition alert indicative of the natural language condition description.

Type: Grant

Filed: August 21, 2023

Date of Patent: April 8, 2025

Assignee: Optum, Inc.

Inventors: Donald E. Johnson, Jr., Somadev Pasala, Ravi Kondadadi, Hadi D. Halim, Ramin Anushiravani, Ayush Tomar, Adam Russell, Robert K. Rossmiller
Multi-service business platform system having conversation intelligence systems and methods

Patent number: 12271869

Abstract: The disclosure is directed to various ways of improving the functioning of computer systems, information networks, data stores, search engine systems and methods, and other advantages. Among other things, provided herein are methods, systems, components, processes, modules, blocks, circuits, sub-systems, articles, and other elements (collectively referred to in some cases as the “platform” or the “system”) that collectively enable, in one or more datastores (e.g., where each datastore may include one or more databases) and systems. A system and method for providing conversation intelligence services may include pre-processing, transcribing, and post-processing. A conversation recording may be pre-processed generating a conversation record (e.g., conversation object). The pre-processed conversation recording may be transcribed into a transcript.

Type: Grant

Filed: March 11, 2022

Date of Patent: April 8, 2025

Assignee: HubSpot, Inc.

Inventors: Ian Leaman, Kevin M. Walsh, Hector Urdiales
Vehicle speed management

Patent number: 12269498

Abstract: An improved system and method of defining speeding severity levels, receiving over a wireless network from a vehicle device speed data and metadata, determining associated speeding severity levels, based at least in part on the determined associated speeding severity levels, determining whether an alert communication is to be transmitted the vehicle device, and in response to determining that an alert communication is to be transmitted the vehicle device, transmitting the alert communication via a wireless interface to the vehicle device, the alert communication configured to cause the vehicle device to provide a corresponding in-vehicle alert.

Type: Grant

Filed: February 6, 2023

Date of Patent: April 8, 2025

Assignee: Samsara Inc.

Inventors: Cassandra Lee Rommel, Casey Takahashi, Ava O'Neill, Matthew Basham, Salil Gupta, Xicheng Xiong, Aaron Zeisler
Systems and methods for predicting and providing automated online chat assistance

Patent number: 12271803

Abstract: Methods and systems are presented for providing automated online chat assistance in an online chat session. One or more utterances transmitted from a user device of a user via the online chat session are obtained. The one or more utterances are provided to a first prediction model to predict an intent of a user. If it is determined that the first prediction model is unable to predict the intent of the user based on the one or more utterances, the one or more utterances are provided to a second prediction model. After predicting the intent of the user by the second prediction model, the intent is used by a chat robot to provide a dialogue with the user via the online chat session. The one or more utterances and the predicted intent are used to re-train the first prediction model.

Type: Grant

Filed: February 27, 2023

Date of Patent: April 8, 2025

Assignee: PAYPAL, INC.

Inventors: Yu-Hsuan Kuo, Venkata Ramana Nadimpalli
Joining users to communications via voice commands

Patent number: 12260153

Abstract: Techniques for joining a device of a third user to a communication between a device of a first user and a device of a second user are described herein. For instance, two or more users may utilize respective computing devices to engage in a telephone call, a video call, an instant-messaging session, or any other type of communication in which the users communicate with each other audibly and/or visually. In some instances, a first user of the two users may issue a voice command requesting to join a device of a third user to the communication. One or more computing devices may recognize this voice command and may attempt to join a device of a third user to the communication.

Type: Grant

Filed: November 6, 2023

Date of Patent: March 25, 2025

Assignee: Amazon Technologies, Inc.

Inventors: Ty Loren Carlson, Rohan Mutagi
Listener animation

Patent number: 12254548

Abstract: A system configured to perform style-aware listener animation. By representing different listening styles (e.g., facial expressions) using an embedding space, a single model can be trained to generate unique facial animations for a number of distinct listeners. Thus, individual listening styles can be associated with a listener identifier, enabling the system to (i) animate a plurality of different listeners with unique nonverbal behavior and/or (ii) select a particular listener identifier or desired type of listener style with which to animate. This enables the model to be generalized to new listeners to generate additional listener facial responses without needing training data for each new listener. The model may process a listener representation style or listener identifier, along with input data corresponding to a speaker talking, to generate unique facial animation responsive to the speech.

Type: Grant

Filed: December 16, 2022

Date of Patent: March 18, 2025

Assignee: Amazon Technologies, Inc.

Inventors: Gourav Datta, Vivek Yadav, Yue Wu, Ayush Jaiswal, Rajiv M Reddy, Prateek Singhal, Karthik Ramakrishnan, Premkumar Natarajan
Voice control system for an implant

Patent number: 12251209

Abstract: A system for the control of a medical implant in a mammal body is provided. The system comprises a first and a second part being adapted for communication with each other, in which system: the first part is adapted for implantation in the mammal body for the control of and communication with the medical implant, and the second part is adapted to be worn on the outside of the mammal body and adapted to receive control commands from a user and to transmit these commands to the first part.

Type: Grant

Filed: September 8, 2023

Date of Patent: March 18, 2025

Inventor: Peter Forsell
Electronic control device for an avionics system for implementing a critical avionics function, method and computer program therefor

Patent number: 12252237

Abstract: An electronic control device of an avionics system for implementation of a critical avionics function, comprising: a module for receiving a voice instruction signal; a speech recognition module configured to transform the voice signal into a textual transcript; a processing module configured to associate the textual transcript with at least one action to be performed; a monitoring system comprising: a control module configured to check whether the textual transcript and/or the action to be performed is consistent if and only if: a) the textual transcript and/or the action to be performed is consistent with the expected syntax, b) the textual transcript and/or the action to be performed is consistent with the expected lexical field, and c) the textual transcript and/or the action to be performed is consistent with the current context, a module for generating an associated command only if no inconsistencies are detected.

Type: Grant

Filed: May 19, 2022

Date of Patent: March 18, 2025

Assignee: THALES

Inventors: Florence De Grancey, Sébastien Boussiron
Artificial intelligence enabled reagent-free imaging hematology analyzer

Patent number: 12254624

Abstract: The subject invention pertains to methods and systems for classifying leukocytes using artificial intelligence called AIRFIHA (artificial-intelligence enabled reagent-free imaging hematology analyzer) that can accurately classify subpopulations of leukocytes in a label-free manner. AIRFIHA can not only subtype lymphocytes into B and T cell but is capable of sorting different types of T cells subtypes. AIRFIHA is realized through training a two-step neural network using label-free images of separated leukocytes acquired from a custom-built quantitative phase microscope. Owing to its easy operation, low cost, and strong discerning capability of complex leukocyte subpopulations, AIRFIHA is clinically translatable and can also be deployed in resource-limited settings.

Type: Grant

Filed: December 9, 2021

Date of Patent: March 18, 2025

Assignee: The Chinese University of Hong Kong

Inventors: Renjie Zhou, Xin Shu, Rishikesh Pandey
Hotword detection on multiple devices

Patent number: 12254884

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for hotword detection on multiple devices are disclosed. In one aspect, a method includes the actions of receiving, by a first computing device, audio data that corresponds to an utterance. The actions further include determining a first value corresponding to a likelihood that the utterance includes a hotword. The actions further include receiving a second value corresponding to a likelihood that the utterance includes the hotword, the second value being determined by a second computing device. The actions further include comparing the first value and the second value. The actions further include based on comparing the first value to the second value, initiating speech recognition processing on the audio data.

Type: Grant

Filed: January 24, 2024

Date of Patent: March 18, 2025

Assignee: Google LLC

Inventor: Matthew Sharifi
Multi-layer keyword detection

Patent number: 12249331

Abstract: A system and method for temporarily disabling keyword detection to avoid detection of machine-generated keywords. A local device may operate two keyword detectors. The first keyword detector operates on input audio data received by a microphone to capture keywords uttered by a user. In these instances, the keyword may be detected by the first detector and the audio data may be indicated for speech processing. The system may determine output audio data responsive to the input audio data. The local device may process the output audio data to determine that it also includes the keyword. The device may then disable the first keyword detector while the output audio data is played back by an audio speaker of the local device. Thus the local device may avoid detection of a keyword originating from the output audio. The first keyword detector may be reactivated after a time interval during which the keyword might be detectable in the output audio.

Type: Grant

Filed: May 8, 2023

Date of Patent: March 11, 2025

Assignee: Amazon Technologies, Inc.

Inventors: Christopher Wayne Lockhart, Matthew Joseph Cole, Xulei Liu
Batching techniques for handling unbalanced training data for a chatbot

Patent number: 12236321

Abstract: The present disclosure relates to chatbot systems, and more particularly, to batching techniques for handling unbalanced training data when training a model such that bias is removed from the trained machine learning model when performing inference. In an embodiment, a plurality of raw utterances is obtained. A bias eliminating distribution is determined and a subset of the plurality of raw utterances is batched according to the bias-reducing distribution. The resulting unbiased training data may be input into a prediction model for training the prediction model. The trained prediction model may be obtained and utilized to predict unbiased results from new inputs received by the trained prediction model.

Type: Grant

Filed: March 30, 2021

Date of Patent: February 25, 2025

Assignee: Oracle International Corporation

Inventors: Thanh Long Duong, Mark Edward Johnson, Vishal Vishnoi, Balakota Srinivas Vinnakota, Yu-Heng Hong, Elias Luqman Jalaluddin
Using structured audio output to detect playback and/or to adapt to misaligned playback in wireless speakers

Patent number: 12236951

Abstract: Implementations are directed to determining an audio delay, of a computing device, by causing an audio data stream to be transmitted to the computing device via a wireless communication channel. The computing device causes audio output generated using the audio data stream to be rendered via speaker(s). The rendered audio output is captured via microphone(s), and the audio delay determined by comparing the captured audio output with the audio data stream. A delay audio segment can be appended to an additional audio data stream transmitted to the computing device, where the length of the delay audio segment is determined using the audio delay. A noise reduction technique can additionally or alternatively be adapted based on the audio delay. Implementations are additionally or alternatively directed to determining if an audio data stream transmitted to a computing device for rendering through speaker(s) driven by the computing device—is actually being rendered.

Type: Grant

Filed: August 14, 2023

Date of Patent: February 25, 2025

Assignee: GOOGLE LLC

Inventors: Nathaniel Nesiba, Xiang Cao
Autocomplete prediction engine providing automatic form filling from email and ticket extractions

Patent number: 12229500

Abstract: A method is provided. The method is executed by an autocomplete prediction engine implemented as a computer program within a computing environment. The autocomplete prediction engine executes automated communication mining on a communication. The method includes processing the communication to extract intents and entities related to each intent. The method includes providing the intents and the entities into forms using a language model to provide a conversational or natural language understanding of the communication.

Type: Grant

Filed: January 12, 2023

Date of Patent: February 18, 2025

Assignee: UiPath, Inc.

Inventors: Marius Cobzarenco, Arthur Wilcke, Harshil Shah, Martin Moxon
Server, terminal device, and method for home appliance management thereby

Patent number: 12231254

Abstract: A server is disclosed. The disclosed server includes a communication device for performing communication with a home appliance and a terminal device, a memory for storing state information of the home appliance and operation pattern information obtained by analyzing an operation pattern of the home appliance, and a processor for, when a query for the home appliance is received from the terminal device, generating response information in response to the received query, and controlling the communication device to transmit the generated response information to the terminal device, wherein the processor extracts a keyword included in the received query, checks at least one state item corresponding to the extracted keyword, and generates response information by using information corresponding to at least one state item in the state information and the operation pattern information.

Type: Grant

Filed: December 6, 2019

Date of Patent: February 18, 2025

Assignee: Samsung Electronics Co., Ltd.

Inventor: Youngsoo Kim
Information processor, information processing method, and program

Patent number: 12230265

Abstract: An information processor including: an operation control unit that controls a motion of an autonomous mobile body acting on the basis of recognition processing, in a case where a target sound that is a target voice for voice recognition processing is detected, the operation control unit moving the autonomous mobile body to a position, around an approach target, where an input level of a non-target sound that is not the target voice becomes lower, the approach target being determined on the basis of the target sound.

Type: Grant

Filed: September 13, 2022

Date of Patent: February 18, 2025

Assignee: SONY GROUP CORPORATION

Inventors: Ryosuke Sawata, Yuichiro Koyama
Voice control method and apparatus for device, storage medium, and electronic apparatus

Patent number: 12230273

Abstract: A voice control method and apparatus for a device, a storage medium, and an electronic apparatus are provided. The method includes: acquiring a first voice feature of first voice data collected by a cleaning device, where the first voice data correspond to a first wake-up instruction sent by a use object, and the first wake-up instruction is used to wake up at least one of the cleaning device and a base station; acquiring a second voice feature of second voice data collected by the base station, where the second voice data correspond to the first wake-up instruction; and selecting a first device to be woken up from the cleaning device and the base station according to the first and second voice features, and waking up the first device, where the first device in a wake-up state is configured to respond to a voice instruction sent by the use object.

Type: Grant

Filed: May 29, 2024

Date of Patent: February 18, 2025

Assignee: DREAME INNOVATION TECHNOLOGY (SUZHOU) CO., LTD.

Inventors: Yadong Wu, Haining Cai
System and method for controlling lamp of vehicle

Patent number: 12213228

Abstract: A system and a method for controlling a lamp of a vehicle are provided, The system includes a voice recognition device to recognize a voice signal from a user, a voice analyzing device to analyze context of the recognized voice signal to determine a lamp device to be controlled, and to determine a control intent of the user with respect to the determined lamp device, and a controller to control the lamp device, based on the determined control intent of the user.

Type: Grant

Filed: September 8, 2022

Date of Patent: January 28, 2025

Assignee: HYUNDAI MOBIS CO., LTD.

Inventor: Seong Yeon Han
Method for controlling ambient sound and electronic device for the same

Patent number: 12212944

Abstract: An electronic device includes a speaker, a sensor, a communication circuit, a processor, and a memory to store instructions. The instructions, when executed by the processor, cause a wireless audio device to, while outputting a signal for reducing an external sound through the speaker, identify, using the communication circuit, an external electronic device, identify, using the sensor, a conversation responsive to a location of the external electronic device satisfying a specified condition, responsive to identifying the conversation, stop an output of the signal for reducing the external sound for a first period of time, and responsive to identifying a specified keyword included in the conversation, prolong stopping the output of the signal for reducing the external sound for a second period of time.

Type: Grant

Filed: July 7, 2022

Date of Patent: January 28, 2025

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventor: Chulmin Lee
Voice wakeup method and system, and device

Patent number: 12211515

Abstract: A voice wakeup method includes receiving a plurality of voice wakeup messages sent by a plurality of electronic devices, where each voice wakeup message includes a distance and a wakeup energy value; determining, based on distances and wakeup energy values in the plurality of voice wakeup messages from the plurality of electronic devices, whether energy attenuation of the wakeup word emitted by the sound source complies with an attenuation law of sound energy radiated by a point source; and when determining that the energy attenuation of the wakeup word emitted by the sound source does not comply with the attenuation law of the sound energy radiated by the point source, sending a wakeup forbidding instruction to the plurality of electronic devices.

Type: Grant

Filed: December 23, 2020

Date of Patent: January 28, 2025

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventor: Xiang Chen
Method and device for interacting meeting minute, apparatus and medium

Patent number: 12211007

Abstract: An interactive method and an interactive device for the meeting minute, an apparatus and a medium are provided. The method includes receiving an interactive triggering operation of a user for the meeting minute in a meeting minute display interface, where the meeting minute display interface displays a multimedia, a meeting subtitle of the multimedia and the meeting minute; and playing the multimedia based on an associated time period of the meeting minute, and distinctively displaying an associated subtitle of the meeting minute in the meeting subtitle. According to the above technical solution, the multimedia can be associated with content related to the meeting subtitle through the interactive triggering operation of the user for the meeting minute in the meeting minute display interface, to improve interactive experience effect of the user.

Type: Grant

Filed: March 10, 2022

Date of Patent: January 28, 2025

Assignee: BEIJING ZITIAO NETWORK TECHNOLOGY CO., LTD.

Inventors: Kojung Chen, Jingsheng Yang, Xiang Zheng, Chunsai Du, Xinyun Geng
Using structured audio output to detect playback and/or to adapt to misaligned playback in wireless speakers

Patent number: 12205592

Abstract: Implementations are directed to determining an audio delay, of a computing device, by causing an audio data stream to be transmitted to the computing device via a wireless communication channel. The computing device causes audio output generated using the audio data stream to be rendered via speaker(s). The rendered audio output is captured via microphone(s), and the audio delay determined by comparing the captured audio output with the audio data stream. A delay audio segment can be appended to an additional audio data stream transmitted to the computing device, where the length of the delay audio segment is determined using the audio delay. A noise reduction technique can additionally or alternatively be adapted based on the audio delay. Implementations are additionally or alternatively directed to determining if an audio data stream transmitted to a computing device for rendering through speaker(s) driven by the computing device—is actually being rendered.

Type: Grant

Filed: August 14, 2023

Date of Patent: January 21, 2025

Assignee: GOOGLE LLC

Inventors: Nathaniel Nesiba, Xiang Cao
Body language detection and microphone control

Patent number: 12206991

Abstract: A system includes a gimbal, a shotgun microphone coupled to the gimbal, a camera, and at least one processor. The at least one processor is configured to receive data indicative of an image or video feed from the camera. The at least one processor is also configured to determine, based on the data indicative of the image or video feed, a primary human speaker among a group of humans and a location of the primary human speaker. The at least one processor is also configured to control the gimbal to point the shotgun microphone at the location of the primary human speaker.

Type: Grant

Filed: April 1, 2022

Date of Patent: January 21, 2025

Assignee: Universal City Studios LLC

Inventors: Robert Michael Jordan, Howard Mall
System for multi-perspective discourse within a dialog

Patent number: 12204854

Abstract: Techniques are described for training and/or utilizing sub-agent machine learning models to generate candidate dialog responses. In various implementations, a user-facing dialog agent (202, 302), or another component on its behalf, selects one of the candidate responses which is closest to user defined global priority objectives (318). Global priority objectives can include values (306) for a variety of dialog features such as emotion, confusion, objective-relatedness, personality, verbosity, etc. In various implementations, each machine learning model includes an encoder portion and a decoder portion. Each encoder portion and decoder portion can be a recurrent neural network (RNN) model, such as a RNN model that includes at least one memory layer, such as a long short-term memory (LSTM) layer.

Type: Grant

Filed: January 4, 2024

Date of Patent: January 21, 2025

Assignee: KONINKLIJKE PHILIPS N.V.

Inventors: Vivek Varma Datla, Sheikh Sadid Al Hasan, Aaditya Prakash, Oladimeji Feyisetan Farri, Tilak Raj Arora, Junyi Liu, Ashequl Qadir
Systems and methods for local automated speech-to-text processing

Patent number: 12205585

Abstract: Systems and methods are described herein for enabling, on a local device, a voice control system that limits the amount of data needed to be transmitted to a remote server. A data structure is built at the local device to support a local speech-to-text model by receiving a query and transmitting, to a remote server over a communication network, a request for a speech-to-text transcription of the query. The transcription is received from the remote server and stored in the data structure at the local device in association with an audio clip of the query. Metadata describing the query is used to train the local speech-to-text model to recognize future instances of the query.

Type: Grant

Filed: December 10, 2019

Date of Patent: January 21, 2025

Assignee: Adeia Guides Inc.

Inventors: Jeffry Copps Robert Jose, Aashish Goyal
Dialog-driven applications supporting alternative vocal input styles

Patent number: 12205584

Abstract: A set of alternative vocal input styles for specifying a parameter of a dialog-driven application is determined. During execution of the application, an audio prompt requesting input in one of the styles is presented. A value of the parameter is determined by applying a collection of analysis tools to vocal input obtained after the prompt is presented. A task of the application is initiated using the value.

Type: Grant

Filed: November 22, 2021

Date of Patent: January 21, 2025

Assignee: Amazon Technologies, Inc.

Inventors: John Baker, Anubhav Mishra, Bangrui Liu, Christopher Michael Hittner, Sravan Babu Bodapati, Harshal Pimpalkhute, Katrin Kirchhoff, Anuj Gautam Surana, Yilai Su, Brandon Louis Mendez, Chengshun Zhang
Electronic device and control method thereof

Patent number: 12198678

Abstract: An electronic device of the present disclosure comprises: a communication unit; a memory; and a processor for: detecting a voice section in an audio signal acquired by the electronic device; identifying whether a wake-up word stored in the memory exists in a user voice included in the detected voice section; when it is identified that the wake-up word exists in the user voice, transmitting, via the communication unit, the user voice to a server for providing a voice recognition service; and when response information for the user voice is received from the server, providing a response to the user voice on the basis of the received response information, wherein the processor identifies that the wake-up word exists in the user voice, when a part of the user voice matches the wake-up word. In particular, a method for acquiring a natural language for providing a response may use an artificial intelligence model learned according to at least one of machine learning, a neural network, and a deep learning algorithm.

Type: Grant

Filed: July 26, 2019

Date of Patent: January 14, 2025

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Changhan Kim, Bowon Kim, Jinsuk Lee, Hyeontaek Lim, Jungkwan Seo
Method for vehicle lane changing control, device, storage medium, and program product

Patent number: 12195001

Abstract: A method for vehicle lane changing control, a device, a storage medium and a program product, relating to the fields of autonomous driving, intelligent transportation, big data, cloud computing, etc. in computer technology. A specific implementation solution is: during a process of vehicle driving, when a distance between the vehicle and a front bifurcation is less than or equal to a longest lane changing operation distance, according to map data and traffic data of a road section ahead, it is determined whether a current position is a preferred position to start performing the lane changing operation, so as to determine whether performing the lane changing operation at the current position; and if determined to perform the lane changing operation at the current position, lane changing operation execution information will be issued to enable the vehicle to start performing the lane changing operation at the current position.

Type: Grant

Filed: March 24, 2022

Date of Patent: January 14, 2025

Assignee: Apollo Intelligent Connectivity (Beijing) Technology Co., Ltd.

Inventors: Manni Chen, Binglin Zhang
System and method for displaying radio communication transcription

Patent number: 12198699

Abstract: Methods and systems are provided for displaying transcriptions of radio communication transcription for an aircraft. The method comprises capturing audio signals of radio communication traffic to and from the aircraft. The captured audio signals are preprocessed to divide the signals into independent spoken utterances. Each spoken utterance is transcribed using a speech recognition decoder that utilizes an air traffic control (ATC) speech recognition model and classification data is extracted from the transcription of each spoken utterance. The transcription of each spoken utterance is logged with reference to the classification data and a textual display of the transcription is provided to a crew member of the aircraft.

Type: Grant

Filed: June 6, 2022

Date of Patent: January 14, 2025

Assignee: HONEYWELL INTERNATIONAL INC.

Inventors: Chaya Garg, Vasantha Paulraj, Robert De Mers, Roger Burgin, Jitender Kumar Agarwal, Mahesh Kumar Sampath, Mohan M. Thippeswamy, Naveen Venkatesh Prasad Nama, Rahul Pradhan, Nitish Sharma
Wakeword detection using a secondary microphone

Patent number: 12190865

Abstract: Techniques for capturing spoken user inputs while a device is prevented from capturing such spoken user inputs are described. When a first device becomes incapable of capturing spoken user inputs intended for a system, a second device, for capturing such spoken user inputs, may be identified. The second device may be identified based on the second device being connected to a same vehicle computing system as the first device. The second device may be enabled to capture spoken user inputs, intended for the system, until the first device is again able to capture such spoken user inputs.

Type: Grant

Filed: May 18, 2021

Date of Patent: January 7, 2025

Assignee: Amazon Technologies, Inc.

Inventors: Andrew Mitchell, Gabor Nagy
Drive through audio communication system with multi-lane support

Patent number: 12192130

Abstract: A communications system for a restaurant includes multiple field communications units, a portable communications device, and processing circuitry. The field communications units include a speaker and a microphone. The portable communications device is transportable by an order taker of the restaurant. The processing circuitry obtains a user input to transition the portable communications device between multiple different channels. The multiple different channels correspond to one of the multiple field communications units. The processing circuitry transitions the portable communications device to one of the multiple different channels according to the user input. The processing circuitry facilitates end-to-end bi-directional audio communication between the order taker and a customer by operating the portable communications device and the one of the multiple field communications units to exchange audio data over the one of the multiple different channels.

Type: Grant

Filed: March 15, 2024

Date of Patent: January 7, 2025

Assignee: Xenial, Inc.

Inventors: Christopher Siefken, Tushar Dabhade, Michael Roth, Israel Rivera, Arjun Wadwalkar
Voice control system for ophthalmic laser systems

Patent number: 12190874

Abstract: A voice control system for ophthalmologic laser treatment systems sets parameters for delivering laser energy based on voice commands and prevents potentially harmful parameters due to operator mistakes and misunderstood voice commands by providing incremental parameter adjustment and restricting the amount by which the parameters can be adjusted for each executed voice command. Valid voice commands include indications of which parameter to set, a value for the parameter, and whether to increase or decrease the value of the parameter. In one example, parameter values can only be increased or decreased by a certain percentage with respect to the current value. In another example, the parameters are adjusted by selecting the next highest or lowest value with respect to the current parameter value from a predetermined sequence of possible values for particular parameters. Voice control functionality can also be deactivated under certain conditions such as when it is determined that a parameter was not set.

Type: Grant

Filed: August 5, 2021

Date of Patent: January 7, 2025

Assignee: NORLASE APS

Inventors: Greg Fava, Peter Skovgaard
Complex task cognitive planning and execution system

Patent number: 12182767

Abstract: Disclosed is a system for determining sequences of operations that will automatically execute one or more tasks specified by a user. In some embodiments, the sequences of operations are based on operations that have been previously performed by users and recorded by the system. The system interprets an intention of a user based on analysis of terms used by the user to indicate a request. The system generates a sequence of operations, executable by an operating system associated with a client device that will perform one or more tasks specified or implied by the request of the user.

Type: Grant

Filed: December 15, 2017

Date of Patent: December 31, 2024

Assignee: Brain Technologies, Inc.

Inventors: Sheng Yue, Yuan Lin
Apparatus and method for controlling vehicle sound

Patent number: 12175962

Abstract: An apparatus and a method for controlling a vehicle sound are provided. The apparatus a detection device that detects driving information and drive mode setting information and a processing device electrically connected with the detection device. The processing device determines an emotional state of a driver based on at least one of the driving information or the drive mode setting information, determines a sound concept depending on the emotional state of the driver, and controls a vehicle sound depending on the sound concept.

Type: Grant

Filed: May 25, 2022

Date of Patent: December 24, 2024

Assignees: Hyundai Motor Company, Kia Corporation, Seoul National University R&DB Foundation

Inventors: Ki Chang Kim, Dong Chul Park, Myung Hwan Yun, Sung Ho Kim
Computer-based communication generation using phrases selected based on behaviors of communication recipients

Patent number: RE50381

Abstract: A method and system for generating adaptive explanations for associated recommendations is disclosed. The adaptive explanations comprise a syntactical structure and associated phrases that are selected in accordance with usage behaviors and/or inferences associated with usage behaviors. The phrases included in an adaptive explanation may be selected through application of a non-deterministic process. The adaptive explanations may be beneficially applied to recommendations that are associated with content, products, and people, including recommendations that comprise advertisements.

Type: Grant

Filed: August 11, 2022

Date of Patent: April 15, 2025

Assignee: Gula Consulting Limited Liability Company

Inventors: Steven Dennis Flinn, Naomi Felina Moneypenny

1 2 3 4 5 … next