Distributed Recognition, E.g., In Client-server Systems For Mobile Phones Or Network Applications, Etc. (epo) Patents (Class 704/E15.047)

Media playback system with concurrent voice assistance

Patent number: 12210801

Abstract: Example techniques involve invoking voice assistance for a media playback system. In some embodiments, a NMD stores in memory a set of command information comprising a listing of playback commands and associated command criteria. The NMD captures a voice input and detects inclusion, within the voice input, of one or more particular playback commands from among the playback commands in the listing. In response, the NMD selects a local voice assistant that supports (a) one or more additional playback commands relative to a cloud-based VAS and (b) fewer non-playback commands relative to the cloud-based VAS, determines, via the local voice assistant, an intent in the captured voice input, and performs a response to the determined intent. The NMD foregoes selection of the cloud-based VAS when the local voice assistant is selected.

Type: Grant

Filed: February 5, 2024

Date of Patent: January 28, 2025

Assignee: Sonos, Inc.

Inventors: Dayn Wilberding, John Tolomei
AI supported UI form text field autocorrecting its value from a live conversation

Patent number: 12190056

Abstract: Systems and methods for autofilling and autocorrecting an AI supported UI form text field via smart glasses are provided. The data autofillted and autocorrected may be identification data, including account numbers and user IDs. The smart glasses may work together with a conversation tracking application and a smart glasses UI. Methods may include prompting agents based on real-time conversational analysis with context clues and texts to assist and maintain customer eye contact. Methods may further include capturing segments of data within conversational analysis and storing the segments of data in memory on the smart glasses. Methods may further include updating smart glasses UI form text fields based on AI supported autocorrection from real-time conversation, by autofilling the segment of data in the smart glasses UI form text fields.

Type: Grant

Filed: August 1, 2023

Date of Patent: January 7, 2025

Assignee: Bank of America Corporation

Inventor: Sandeep Verma
Delta models for providing privatized speech-to-text during virtual meetings

Patent number: 12165646

Abstract: Provided herein are systems and methods for delta models for providing privatized speech-to-text during virtual meetings. In one embodiment, a system may include a non-transitory computer-readable medium; a communications interface; and a processor. The processor may be configured to execute processor-executable instructions to: join a virtual meeting. Each participant in the virtual meeting may exchange audio streams with other participants in the virtual meeting. The instructions may include receiving, from a video conference provider, a local model for speech recognition. The local model may be a copy of a centralized model. The instructions may include performing speech recognition using the local model on the audio streams.

Type: Grant

Filed: April 29, 2022

Date of Patent: December 10, 2024

Assignee: Zoom Video Communications, Inc.

Inventors: Shane Paul Springer, Alexander Waibel
Keyword-based dialogue summarizer

Patent number: 12165631

Abstract: A method of generating keyword-based dialogue summaries is provided. The method includes inputting a transcript of an audio conversation and a keyword into a machine learning model trained based on encodings representing the keyword and the transcript, generating computer-generated text different from and semantically descriptive of the transcript and semantically associated with the keyword, and outputting the computer-generated text in association with a selectable item selectable for inclusion of the computer-generated text in displayed text representing the transcript, the selectable item associated with the keyword.

Type: Grant

Filed: May 3, 2022

Date of Patent: December 10, 2024

Assignee: Microsoft Technology Licensing, LLC

Inventors: Abedelkader Asi, Royi Ronen, Roy Eisenstadt, Dean Geckt
Multiple choice decision engine for an electronic personal assistant

Patent number: 12159243

Abstract: To simplify assisting a user in their day-to-day activities, a communication for performing an action may be sent to a user in the form of a query, where the query includes the most likely set of choices for the action arranged in a group of dichotomous (e.g., yes/no) or multiple choice answers. In this manner, a user may respond to the query by simply selecting one of the dichotomous or multiple choice answers. Historical logs of past actions, responses, queries, and so forth, may be used to predict future user actions or needs, and to formulate future queries for sending to the user. These techniques may be implemented, for example, through a remote coordination server or directly through a user's personal electronics device.

Type: Grant

Filed: July 11, 2023

Date of Patent: December 3, 2024

Assignee: Telepathy Labs, Inc.

Inventors: Damien Phelan Stolarz, David Joseph Diaz, James Rossfeld, Scott Raven, Christopher O'Malley, Christopher Kurpinski
Method for performing multi-link communication in wireless communication system

Patent number: 12089140

Abstract: According to various embodiments, a multi-link device (MLD) operating in multiple links including a first link may transmit, through a first station (STA) and to a first AP of an AP multi-link device, a request frame including an information field for requesting at least one element related to a second link. The multi-link device may receive at least one element related to the second link on the basis of the request frame.

Type: Grant

Filed: July 6, 2023

Date of Patent: September 10, 2024

Assignee: LG ELECTRONICS INC.

Inventors: Namyeong Kim, Jeongki Kim, Jinsoo Choi, Sungjin Park, Taewon Song, Insun Jang
Generating sale transactions from voice data input by a user

Patent number: 12073453

Abstract: There is provided systems and method for generating sale transaction from voice data input by a user. A user device may receive voice data including a preference for purchasing an item. The user device may convert the voice data to the preferences and perform a search for a sales transaction corresponding to the preferences. The search may include parameters about the user, such as a location. The sales transaction may include purchase prices, times, locations, or other relevant data. A user may accept or decline the sales transaction with additional user data. If the user accepts the sales transaction, the sales transaction may be completed with a payment provider and a transaction history given to the user for later redemption of the item. If the user declines the sales transaction, further sale transactions with additional items may be present to the user.

Type: Grant

Filed: August 17, 2021

Date of Patent: August 27, 2024

Assignee: PAYPAL, INC.

Inventors: Hyunju Lee, Joel P. Yarbrough, Francisco Vittorio Octavio Joachin D. Barretto, Gokul G Narayana Pillai
Method and device for speech/music classification and core encoder selection in a sound codec

Patent number: 12062381

Abstract: Two-stage speech/music classification device and method classify an input sound signal and select a core encoder for encoding the sound signal. A first stage classifies the input sound signal into one of a number of final classes. A second stage extracts high-level features of the input sound signal and selects the core encoder for encoding the input sound signal in response to the extracted high-level features and the final class selected in the first stage.

Type: Grant

Filed: April 8, 2021

Date of Patent: August 13, 2024

Assignee: VOICEAGE CORPORATION

Inventor: Vladimir Malenovsky
Moment capturing system

Patent number: 12014733

Abstract: A vehicle occupant aid system is disclosed. The system may comprise a rearview assembly. Further, the rearview assembly may comprise a button. The system may further comprise one or more data capturing element. Each element may be a microphone, an imager, a location device, and/or a sensor. In some embodiments, a controller may record the data for a predetermined period of time. Further, the controller may transmit information to a remote device based upon initiation of a trigger. The information being based, at least in part, on the data. In other embodiments, the controller may operability record the data in response to a first operation of the button. Further, the controller may transmit information to a remote device based upon a second operation of the button. The information being based, at least in part, on the data recorded between the first and second operations of the button.

Type: Grant

Filed: May 27, 2021

Date of Patent: June 18, 2024

Assignee: GENTEX CORPORATION

Inventors: Thomas S. Wright, Eric P. Bigoness
Electronic apparatus and controlling method thereof

Patent number: 12008988

Abstract: An electronic apparatus and a controlling method thereof are provided. The electronic apparatus includes a microphone, a camera, a memory configured to store at least one command, and at least one processor configured to, based on a first user voice being input from a user, provide a response to the first user voice, based on an audio signal including a voice being input while the response to the first user voice is provided, analyze an image captured by the camera and determine whether there is a second user voice uttered by the user in the audio signal, and based on determining that there is the second user voice uttered by the user in the audio signal, stop providing the response to the first user voice and obtain and provide a response to the second user voice.

Type: Grant

Filed: October 7, 2020

Date of Patent: June 11, 2024

Assignee: Samsung Electronics Co., Ltd.

Inventors: Hyeontaek Lim, Sejin Kwak, Youngjin Kim
Method, apparatus, device and computer storage medium for processing voices

Patent number: 11990130

Abstract: A method, apparatus, device and computer storage medium for processing voices, which relate to the technical field of voices, are disclosed. An implementation includes: recognizing a voice request received by a first voice assistant to obtain a text request; determining information of a second voice assistant which is able to process the text request; and calling the second voice assistant to respond to the text request.

Type: Grant

Filed: May 7, 2020

Date of Patent: May 21, 2024

Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.

Inventors: Jizhou Huang, Shiqiang Ding, Changshun Hou
Apparatus and audio signal processor, for providing processed audio signal representation, audio decoder, methods and computer programs

Patent number: 11990146

Abstract: An apparatus for providing a processed audio signal representation on the basis of input audio signal representation configured to apply an un-windowing, in order to provide the processed audio signal representation on the basis of the input audio signal representation. The apparatus is configured to adapt the un-windowing in dependence on one or more signal characteristics and/or in dependence on one or more processing parameters used for a provision of the input audio signal representation.

Type: Grant

Filed: May 4, 2021

Date of Patent: May 21, 2024

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Stefan Bayer, Pallavi Maben, Emmanuel Ravelli, Guillaume Fuchs, Eleni Fotopoulou, Markus Multrus
Communication resource allocation for rateless coding

Patent number: 11990997

Abstract: This disclosure provides systems, methods and apparatus, including computer programs encoded on computer storage media, for mapping coding indices associated with a rateless coding scheme to respective sets of communication resources over which a transmitting device may perform transmissions of portions of a message corresponding to each of the coding indices. In some aspects, for example, different coding indices may correspond to different cumulative portions of the message and the transmitting device may transmit a first cumulative portion of the message corresponding to a first coding index over a first set of communication resources to which the first coding index is mapped. The transmitting device may signal an indication of such a resource mapping scheme to a receiving device and the receiving device may attempt to decode transmissions of one or more cumulative portions of the message using the indicated resource mapping scheme.

Type: Grant

Filed: November 17, 2021

Date of Patent: May 21, 2024

Assignee: QUALCOMM Incorporated

Inventors: Gideon Shlomo Kutz, David Yunusov, Tal Oved, Assaf Touboul, Amit Bar-Or Tillinger
Two-way communication support system and storage medium

Patent number: 11984137

Abstract: A two-way communication support system supports two-way communication between a second terminal device which receives, from a first terminal device, first state data changed according to the state of a first user present on a side where the first terminal device is arranged and which outputs an image and a voice indicated by the first state data and the first terminal device. The two-way communication support system includes a state identifier and an output controller. The state identifier analyzes second state data changed according to the state of a second user present on a side where the second terminal device is arranged so as to identify the state of the second user. The output controller causes the first terminal device to output analysis information indicating a result of the identification of the state of the second user performed by the state identifier.

Type: Grant

Filed: December 21, 2021

Date of Patent: May 14, 2024

Assignee: SHARP KABUSHIKI KAISHA

Inventor: Mamoru Takaya
Method and system for performing speech recognition in an electronic device

Patent number: 11967318

Abstract: The present subject matter at least describes a method and a system (300, 1200) of performing speech-recognition in an electronic device having an embedded speech recognizer. The method comprises receiving an input-audio comprising speech at a device. In real-time, at-least one speech-recognition module is selected within at least one of the device and a server for recognition of at least a portion of the received speech based on a criteria defined in terms of a) past-performance of speech-recognition modules within the device and server; b) an orator of speech; and c) a quality of service associated with at least one of the device and a networking environment thereof. Based upon the selection of the server, output of the selected speech-recognition modules within the device are selected for processing by corresponding speech-recognition modules of the server. An uttered-speech is determined within the input-audio based on output of the selected speech-recognition modules of the device or the server.

Type: Grant

Filed: December 19, 2019

Date of Patent: April 23, 2024

Assignee: Samsung Electronics Co., Ltd.

Inventors: Jithendra Vepa, Periyasamy Paramasivam, Ramya Viswanathan, Rajesh Krishna Selvaraj Krishnan
Information processing device and information processing method

Patent number: 11948564

Abstract: Provided is an information processing device including a response control unit that controls a response to a user's utterance based on a first utterance interpretation result and a second utterance interpretation result. The first utterance interpretation result is a result of natural language understanding processing for an utterance text generated by automatic speech recognition processing based on the user's utterance and the second utterance interpretation result is an interpretation result acquired based on learning data in which the first utterance interpretation result and the utterance text used to acquire the first utterance interpretation result are associated with each other. The response control unit further controls the response to the user's utterance based on the second utterance interpretation result in a case where the second utterance interpretation result is acquired based on the user's utterance before acquisition of the first utterance interpretation result.

Type: Grant

Filed: March 13, 2019

Date of Patent: April 2, 2024

Assignee: SONY CORPORATION

Inventors: Hiro Iwase, Yuhei Taki, Kunihito Sawai
Speech decoding method and apparatus, computer device, and storage medium

Patent number: 11935517

Abstract: A speech decoding method is performed by a computer device, the speech including a current audio frame and a previous audio frame. The method includes: obtaining a target token corresponding to a smallest decoding score from a first token list including first tokens obtained by decoding the previous audio frame, each first token including a state pair and a decoding score, the state pair being used for characterizing a correspondence between a first state of the first token in a first decoding network corresponding to a low-order language model and a second state of the first token in a second decoding network corresponding to a differential language model; determining pruning parameters according to the target token and an acoustic vector of the current audio frame when the current audio frame is decoded; and decoding the current audio frame according to the first token list, the pruning parameters, and the acoustic vector.

Type: Grant

Filed: March 3, 2021

Date of Patent: March 19, 2024

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Yiheng Huang, Xiaozheng Jian, Liqiang He
Integrating voice controls into applications

Patent number: 11935539

Abstract: A voice support server is used to provide voice control functionality to a third party application that does not natively support voice control functions. The voice support server implements a domain specific to the third party application that maintains a domain-specific language model (DLM) reflecting the functionality of the third party application. The DLM comprises a plurality of intent patterns corresponding to different commands and their possible variations that may be issued by the user, and maps each intent pattern to a corresponding action to be performed by the third party application. Received audio data is analyzed to determine one or more user utterances, which are transcribed and compared to the intent patterns of the DLM to identify an intent corresponding to the user utterance. The voice control module may then transmit instructions to the third party application to perform the action corresponding to the identified intent.

Type: Grant

Filed: January 24, 2020

Date of Patent: March 19, 2024

Assignee: Alan AI, Inc.

Inventors: Andrey Ryabov, Anna Miroshnichenko, Evgeny Yusov, Alex Sotnikov
Brokerless reliable totally ordered many-to-many interprocess communication on a single node that uses shared memory and multicast

Patent number: 11915065

Abstract: Examples described herein include systems and methods for brokerless reliable totally ordered many-to-many inter-process communication on a single node. A messaging protocol is provided that utilizes shared memory for one of the control plane and data plane, and multicast for the other plane. Readers and writers can store either control messages or message data in the shared memory, including in a ring buffer. Write access to portions of the shared memory can be controlled by a robust futex, which includes a locking mechanism that is crash recoverable. In general, the writers and readers can control the pace of communications and the crash of any process does not crash the overall messaging on the node.

Type: Grant

Filed: January 20, 2022

Date of Patent: February 27, 2024

Assignee: VMware, Inc.

Inventors: Rusko Atanasov, Kalin Tsvetkov
Electronic device, an electronic reference device, and related method for positioning of the electronic device

Patent number: 11906613

Abstract: An electronic device includes memory circuitry, interface circuitry, and processor circuitry. The processor circuitry is configured to transmit, to a plurality of electronic reference devices, a first signal, the first signal having a pulse width below a threshold. The processor circuitry is configured to determine, based on the received second signals and at least one predetermined time period, a time of flight of each of the second signals. The processor circuitry is configured to obtain, from the memory circuitry, reference positions of the plurality of electronic reference devices. The processor circuitry is configured to determine, based on the associations, one or more candidate positions of the electronic device. The processor circuitry is configured to determine, based on the distances, the one or more candidate positions, and the obtained reference positions, a position of the electronic device.

Type: Grant

Filed: June 3, 2021

Date of Patent: February 20, 2024

Assignee: Sony Group Corporation

Inventor: Peter Ljung
Multi-device speech processing

Patent number: 11900921

Abstract: Techniques for partially processing an input on a device and completing processing at a remote system are provided. The device may process an input using an on-device machine learning (ML) model, and determine to cease processing at an intermediary node of the (ML) model based on the output of the intermediary node. Based on the output of the intermediary node satisfying a condition, the device may use the output of the intermediary node to generate an output responsive to the input. Conversely, if the output of the intermediary node does not satisfy a condition, the device may send the output of the intermediary node to the remote system, so the remote system can use another machine learning model to complete processing with respect to the input.

Type: Grant

Filed: October 26, 2020

Date of Patent: February 13, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Rahul Gupta, Christophe Dupuy, Jacob Ryan Stolee, Clement Chung
Media playback system with concurrent voice assistance

Patent number: 11893308

Abstract: Example techniques involve invoking voice assistance for a media playback system. In some embodiments, a NMD stores in memory a set of command information comprising a listing of playback commands and associated command criteria. The NMD captures a voice input and detects inclusion, within the voice input, of one or more particular playback commands from among the playback commands in the listing. In response, the NMD selects a local voice assistant that supports (a) one or more additional playback commands relative to a cloud-based VAS and (b) fewer non-playback commands relative to the cloud-based VAS, determines, via the local voice assistant, an intent in the captured voice input, and performs a response to the determined intent. The NMD foregoes selection of the cloud-based VAS when the local voice assistant is selected.

Type: Grant

Filed: March 28, 2022

Date of Patent: February 6, 2024

Assignee: Sonos, Inc.

Inventors: Dayn Wilberding, John Tolomei
Allocation of local and remote resources for speech processing

Patent number: 11869487

Abstract: Speech processing tasks may be allocated at least partly to a local device (e.g., user computing device that receives spoken words) and at least partly to a remote device to determine one or more user commands or tasks to be performed by the local device. The remote device may be used to process speech that the local device could not process or understand, or for other reasons, such as for error checking. The local device may then execute or begin to execute locally determined tasks to reduce user-perceived latency. Meanwhile, the entire media input, or a portion thereof, may be sent to the remote device to process speech, verify the tasks and/or identify other user commands in the media input (or portion thereof).

Type: Grant

Filed: August 16, 2019

Date of Patent: January 9, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Sanjoy Ghosh, Pieter Sierd van der Meulen
Offline voice control

Patent number: 11869503

Abstract: As noted above, example techniques relate to offline voice control. A local voice input engine may process voice inputs locally when processing voice inputs via a cloud-based voice assistant service is not possible. Some techniques involve local (on-device) voice-assisted set-up of a cloud-based voice assistant service. Further example techniques involve local voice-assisted troubleshooting the cloud-based voice assistant service. Other techniques relate to interactions between local and cloud-based processing of voice inputs on a device that supports both local and cloud-based processing.

Type: Grant

Filed: December 13, 2021

Date of Patent: January 9, 2024

Assignee: Sonos, Inc.

Inventor: Connor Smith
Proactive environment-based chat information system

Patent number: 11863646

Abstract: Disclosed is the technology for computer-based “Daily Brief” service, which includes methods and corresponding systems for proactively providing push notifications for users of chat information systems. The push notifications are dynamically generated and presented to the user based on identification of one or more triggering events, which may include predetermined time/date, current geographical location, activity of peers and friends in social media associated with the user, scheduled events, appointments, meetings, emails, instant messages, and many more. The described technology improves the interaction interface between the user and chat information system.

Type: Grant

Filed: February 3, 2023

Date of Patent: January 2, 2024

Assignee: GOOGLE LLC

Inventors: Ilya Gennadyevich Gelfenbeyn, Artem Goncharuk, Ilya Andreevich Platonov, Pavel Aleksandrovich Sirotin, Olga Aleksandrovna Gelfenbeyn
Multi-user voice assistant with disambiguation

Patent number: 11830490

Abstract: Disambiguating question answering responses by receiving voice command data associated with a first user, determining a first user identity according to the first user voice command data, determining a first user activity context according to the first user voice command data, determining a first response for the first user, receiving voice command data associated with a second user, determining a second user identity according to the second user voice command data, determining a second user activity context according to the second user voice command data, determining a second response for the second user, determining a predicted ambiguity between the first response and the second response, altering the first response according to the predicted ambiguity, and providing the first response and the second response.

Type: Grant

Filed: August 11, 2021

Date of Patent: November 28, 2023

Assignee: International Business Machines Corporation

Inventors: Venkata Vara Prasad Karri, Sarbajit K. Rakshit, Sri Harsha Varada, Sampath Kumar Pulupula Venkata
Defense of targeted database attacks through dynamic honeypot database response generation

Patent number: 11824894

Abstract: Embodiments of the invention are directed to techniques that include receiving a query intended for a targeted database and determining that the query is from an unauthorized user. A response is returned to the unauthorized user generated by a model, the response being dynamically generated to fulfill the query. The model is configured to generate responses consistent with any previous responses returned to the unauthorized user.

Type: Grant

Filed: November 25, 2020

Date of Patent: November 21, 2023

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Marco Simioni, Stefano Braghin, Killian Levacher
Brokerless reliable totally ordered many-to-many interprocess communication on a single node that uses shared memory and multicast

Patent number: 11803431

Abstract: Examples described herein include systems and methods for brokerless reliable totally ordered many-to-many inter-process communication on a single node. A messaging protocol is provided that utilizes shared memory for one of the control plane and data plane, and multicast for the other plane. Readers and writers can store either control messages or message data in the shared memory, including in a ring buffer. Write access to portions of the shared memory can be controlled by a robust futex, which includes a locking mechanism that is crash recoverable. In general, the writers and readers can control the pace of communications and the crash of any process does not crash the overall messaging on the node.

Type: Grant

Filed: March 14, 2022

Date of Patent: October 31, 2023

Assignee: VMware, Inc.

Inventors: Rusko Atanasov, Kalin Tsvetkov, Viktoriya Bambaldokova
Speech-processing system

Patent number: 11790902

Abstract: A system may include first and second speech-processing systems. The first speech-processing system may process received audio data and determine that a command represented therein is associated with a second speech-processing system. The first speech-processing system may send command data to the second speech-processing system and receive response data in return. The first speech-processing system may then process the response data to determine second response data that includes an indication of the second speech-processing system and cause output of audio corresponding to the second response data.

Type: Grant

Filed: February 4, 2020

Date of Patent: October 17, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Timothy Whalin, Catherine Michelle Loo, Calvin Phuong Nguyen
Voice processing method based on artificial intelligence

Patent number: 11790893

Abstract: A voice processing method is disclosed. The voice processing method applies first and second sentence vectors extracted from first and second utterances, that are included in one dialog group and are separated from each other, to a learning model and generates an output from which at least one word having an overlapping meaning is removed. The voice processing method can be associated with an artificial intelligence module, an unmanned aerial vehicle (UAV), a robot, an augmented reality (AR) device, a virtual reality (VR) device, devices related to 5G services, and the like.

Type: Grant

Filed: September 30, 2020

Date of Patent: October 17, 2023

Assignee: LG ELECTRONICS INC.

Inventors: Kwangyong Lee, Hyun Yu, Byeongha Kim, Yejin Kim
Automated system and method to prioritize language model and ontology expansion and pruning

Patent number: 11769012

Abstract: A system and method for updating computerized language models is provided that automatically adds or deletes terms from the language model to capture trending events or products, while maximizing computer efficiencies by deleting terms that are no longer trending and use of knowledge bases, machine learning model training and evaluation corpora, analysis tools and databases.

Type: Grant

Filed: March 25, 2020

Date of Patent: September 26, 2023

Assignee: Verint Americas Inc.

Inventors: Ian Roy Beaver, Christopher James Jeffs
Systems, methods, and apparatuses for implementing a geo-demographic zoning optimization engine

Patent number: 11763404

Abstract: Systems, methods, and apparatuses for implementing a geo-demographic zoning optimization engine are disclosed.

Type: Grant

Filed: June 15, 2021

Date of Patent: September 19, 2023

Assignee: Arizona Board of Regents on behalf of Arizona State University

Inventors: Jon J. Miller, Vikash Bajaj, Srinivasa Srivatsav Kandala, Fangwu Wei, Michael Kuby, Wangshu Mu, Daoqin Tong
Multiple choice decision engine for an electronic personal assistant

Patent number: 11741385

Abstract: To simplify assisting a user in their day-to-day activities, a communication for performing an action may be sent to a user in the form of a query, where the query includes the most likely set of choices for the action arranged in a group of dichotomous (e.g., yes/no) or multiple choice answers. In this manner, a user may respond to the query by simply selecting one of the dichotomous or multiple choice answers. Historical logs of past actions, responses, queries, and so forth, may be used to predict future user actions or needs, and to formulate future queries for sending to the user. These techniques may be implemented, for example, through a remote coordination server or directly through a user's personal electronics device.

Type: Grant

Filed: July 28, 2022

Date of Patent: August 29, 2023

Assignee: Telepathy Labs, Inc

Inventors: Damien Phelan Stolarz, David Joseph Diaz, James Rossfeld, Scott Raven, Christopher O'Malley, Christopher Kurpinski
Caption service system for remote speech recognition

Patent number: 11735185

Abstract: The present invention provides a caption service system for remote speech recognition, which provides caption service for the hearing impaired. This system includes a speaker and a live broadcast equipment at A, a listener-typist and a computer at B, a hearing impaired and a live screen at C, and an automatic speech recognition (ASR) caption server at D. Connect the live broadcast equipment, the computer, the live screen and the ASR caption server with a network. The speaker's audio is sent to the automatic speech recognition (ASR) caption server to be converted into text, which is corrected by the listener-typist, and then the text caption is sent to the live screen of the hearing impaired together with the speaker's video and audio, so that the hearing impaired can see the text caption spoken by the speaker.

Type: Grant

Filed: August 19, 2021

Date of Patent: August 22, 2023

Assignee: NATIONAL YANG MING CHIAO TUNG UNIVERSITY

Inventors: Sin Horng Chen, Yuan Fu Liao, Yih Ru Wang, Shaw Hwa Hwang, Bing Chih Yao, Cheng Yu Yeh, You Shuo Chen, Yao Hsing Chung, Yen Chun Huang, Chi Jung Huang, Li Te Shen, Ning Yun Ku
Methods and systems for establishing and maintaining presence information of neighboring Bluetooth devices

Patent number: 11729596

Abstract: A communication device and method can include one or more processors operatively coupled to memory, a sensor and an output device, where the one or more processors to perform operations of identifying target person locations using internet searching and short range communication enabled devices such as Bluetooth LE devices.

Type: Grant

Filed: December 2, 2022

Date of Patent: August 15, 2023

Assignee: Staton Techiya LLC

Inventor: Steven Wayne Goldstein
Intermediate data for inter-device speech processing

Patent number: 11721347

Abstract: Some speech processing systems may handle some commands on-device rather than sending the audio data to a second device or system for processing. The first device may have limited speech processing capabilities sufficient for handling common language and/or commands, while the second device (e.g., an edge device and/or a remote system) may call on additional language models, entity libraries, skill components, etc. to perform additional tasks. An intermediate data generator may facilitate dividing speech processing operations between devices by generating a stream of data that includes a first-pass ASR output (e.g., a word or sub-word lattice) and other characteristics of the audio data such as whisper detection, speaker identification, media signatures, etc. The second device can perform the additional processing using the data stream; e.g., without using the audio data. Thus, privacy may be enhanced by processing the audio data locally without sending it to other devices/systems.

Type: Grant

Filed: June 29, 2021

Date of Patent: August 8, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Stanislaw Ignacy Pasko, Pawel Zelazko, Cagdas Bak, Eli Joshua Fidler, Michal Kowalczuk, Andrew Oberlin, Ariya Rastrow
Transcription of communications using multiple speech recognition systems

Patent number: 11710488

Abstract: A method may include obtaining audio data originating at a first device during a communication session between the first device and a second device and providing the audio data to a first speech recognition system to generate a first transcript based on the audio data and directing the first transcript to the second device. The method may also include in response to obtaining a quality indication regarding a quality of the first transcript, multiplexing the audio data to provide the audio data to a second speech recognition system to generate a second transcript based on the audio data while continuing to provide the audio data to the first speech recognition system and direct the first transcript to the second device, and in response to obtaining a transfer indication that occurs after multiplexing of the audio data, directing the second transcript to the second device instead of the first transcript.

Type: Grant

Filed: December 19, 2018

Date of Patent: July 25, 2023

Assignee: Sorenson IP Holdings, LLC

Inventors: Kenneth Boehme, Michael Holm, Shane Roylance
Shared speech processing network for multiple speech applications

Patent number: 11700484

Abstract: A device to process speech includes a speech processing network that includes an input configured to receive audio data corresponding to audio captured by one or more microphones. The speech processing network also includes one or more network layers configured to process the audio data to generate a network output. The speech processing network includes an output configured to be coupled to multiple speech application modules to enable the network output to be provided as a common input to each of the multiple speech application modules. A first speech application module corresponds to a speaker verifier, and a second speech application module corresponds to a speech recognition network.

Type: Grant

Filed: February 10, 2022

Date of Patent: July 11, 2023

Assignee: QUALCOMM Incorporated

Inventors: Lae-Hoon Kim, Sunkuk Moon, Erik Visser, Prajakt Kulkarni
Recording medium, information processing method, information processing device, and information processing system

Patent number: 11695836

Abstract: A computer program and the like are provided that are capable of causing an information processing device connected to a private network, to automatically execute operation processing of a browser. The computer program is a computer program for causing the information processing device connected to the private network, to automatically execute the operation of the browser that accesses a web server on the private network, based on an instruction from a server connected to a global network, and causes the information processing device to execute the processing of: requesting the server to establish a connection; obtaining an operation instruction related to the operation processing which is push-transmitted from the server, by using the connection; executing the operation processing of the browser based on the obtained operation instruction; obtaining an execution result of the operation processing; and outputting the obtained execution result to the server.

Type: Grant

Filed: June 18, 2020

Date of Patent: July 4, 2023

Assignee: C-RISE Ltd.

Inventors: Masanori Murai, Yutaka Mitsubayashi
Distributed identification in networked system

Patent number: 11683320

Abstract: The present disclosure is generally directed to a data processing system for customizing content in a voice activated computer network environment. With user consent, the data processing system can improve the efficiency and effectiveness of auditory data packet transmission over one or more computer networks by, for example, increasing the accuracy of the voice identification process used in the generation of customized content. The present solution can make accurate identifications while generating fewer audio identification models, which are computationally intensive to generate.

Type: Grant

Filed: April 22, 2021

Date of Patent: June 20, 2023

Assignee: GOOGLE LLC

Inventors: Victor Carbune, Thomas Deselaers, Sandro Feuz
Hybrid policy dialogue manager for intelligent personal assistants

Patent number: 11669697

Abstract: A method for providing responsive actions to user inputs in a multi-domain context includes receiving, by a speech-based user interface, a first speech input from a user and converting said first speech input into a text-based representation of the first speech input. A natural language processor processes the text-based representation to determine an intent, entity and internal state of the first speech input. The method further includes determining, by a model-based module based on the intent, entity and internal state, a first data processing policy to apply to the first speech input, wherein the first data processing policy is either a rules-based data processing policy applied by a rules-based module or a statistical model-based data processing policy applied by the model-based module. The first responsive action is generated by the determined first data processing module, and outputted via the speech-based user interface and/or a machine interface.

Type: Grant

Filed: October 23, 2019

Date of Patent: June 6, 2023

Assignee: Bayerische Motoren Werke Aktiengesellschaft

Inventors: Wangsu Hu, Jilei Tian
Selecting an optimal combination of systems for query processing

Patent number: 11650983

Abstract: A method is provided for generating a classification model configured to select an optimal execution combination for query processing. The method provides, to a processor, training queries and different execution combinations for executing the training queries. Each different execution combination involves a respective different query engine and a respective different runtime. The method extracts, from a set of Directed Acyclic Graphs (DAGs) using a set of Cost-Based Optimizers (CBOs), a set of feature vectors for each of the plurality of training queries. The method adds, by the processor to each of merged feature vectors a respective label indicative of the optimal execution combination based on actual respective execution times of the plurality of different execution combinations, to obtain a set of labels. The method trains, by the processor, the classification model by learning the set of merged feature vectors with the set of labels.

Type: Grant

Filed: December 22, 2020

Date of Patent: May 16, 2023

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventor: Tatsuhiro Chiba
Devices, systems, and methods for distributed voice processing

Patent number: 11646023

Abstract: Systems and methods for distributed voice processing are disclosed herein. In one example, the method includes detecting sound via a microphone array of a first playback device and analyzing, via a first wake-word engine of the first playback device, the detected sound. The first playback device may transmit data associated with the detected sound to a second playback device over a local area network. A second wake-word engine of the second playback device may analyze the transmitted data associated with the detected sound. The method may further include identifying that the detected sound contains either a first wake word or a second wake word based on the analysis via the first and second wake-word engines, respectively. Based on the identification, sound data corresponding to the detected sound may be transmitted over a wide area network to a remote computing device associated with a particular voice assistant service.

Type: Grant

Filed: December 14, 2020

Date of Patent: May 9, 2023

Assignee: Sonos, Inc.

Inventors: Connor Kristopher Smith, John Tolomei, Betty Lee
Systems and methods for interpreting natural language search queries

Patent number: 11594213

Abstract: Systems and methods are described herein for interpreting natural language search queries that account for contextual relevance of words of the search query that would ordinarily not be processed, including, for example, processing each word of the query. Each term is associated with a respective part of speech, and a frequency of occurrence of each term in content metadata is determined. A relevance of each term is then determined based on its respective part of speech and frequency. The natural language search query is then interpreted based on the importance or relevance of each term.

Type: Grant

Filed: March 3, 2020

Date of Patent: February 28, 2023

Assignee: ROVI GUIDES, INC.

Inventors: Jeffry Copps Robert Jose, Ajay Kumar Mishra
Moving object control system, moving object control device, and moving object control method

Patent number: 11588799

Abstract: A moving object control system includes a server, a portable terminal configured to transmit authentication information issued by the server, and a controller provided in a moving object and configured to authenticate the portable terminal according to the authentication information transmitted from the portable terminal, and when the portable terminal is authenticated, control the moving object in response to an operation signal from the portable terminal. The controller is configured to perform information communication with the server and control the moving object according to control information received from the server.

Type: Grant

Filed: July 29, 2019

Date of Patent: February 21, 2023

Assignee: TOYOTA JIDOSHA KABUSHIKI KAISHA

Inventor: Yasuhisa Fujiwara
Electronic apparatus and control method thereof

Patent number: 11587571

Abstract: An electronic apparatus includes: at least one processor configured to: receive audio of a voice input of a user; obtain, from a plurality of voice recognizers capable of recognizing the voice input, a plurality of recognition results of the received audio; and perform an operation based on a recognition result of which recognition suitability for the voice input is identified to be high, among the plurality of recognition results.

Type: Grant

Filed: September 2, 2020

Date of Patent: February 21, 2023

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventor: Chanhee Choi
Systems, methods, and apparatuses for providing assistant deep links to effectuate third-party dialog session transfers

Patent number: 11550846

Abstract: Methods, apparatus, systems, and computer-readable media are provided for transferring dialog sessions between devices using deep links. The dialog sessions can correspond to interactions, mediated by an automated assistant, between a user and a third party application. During the dialog session, a user can request that the dialog session be transferred to a different device, for example, to interact with the third party application through a different modality. In response, the automated assistant and/or the third party application can generate a link that can be transferred to the transferee device to allow the transferee device to seamlessly take over the dialog session. In this way, computational resources and electrical power can be preserved by not requiring a recipient device to re-process natural language inputs previously provided during the dialog session.

Type: Grant

Filed: May 17, 2021

Date of Patent: January 10, 2023

Assignee: GOOGLE LLC

Inventors: Justin Lewis, Scott Davies
Terminal device, server and controlling method thereof

Patent number: 11538476

Abstract: A terminal device is provided and includes a communication interface including circuitry, a display and at least one processor configured to control the communication interface to transmit a user voice including a plurality of intents to an external server, based on word use information included in the user voice and summary information regarding the user voice generated based on user-related information being received from the external server, control the display to display the received summary information, based on a user feedback regarding the summary information being input, transmit information regarding the user feedback to the external server, and based on response information regarding the user voice generated based on the user feedback being received from the external server, control the display to provide the response information.

Type: Grant

Filed: November 24, 2020

Date of Patent: December 27, 2022

Assignee: Samsung Electronics Co., Ltd.

Inventors: Sanghyuk Yoon, Heejun Song, Heejae Yu
Electronic apparatus and method for controlling voice recognition thereof

Patent number: 11538458

Abstract: Disclosed is an electronic apparatus capable of controlling voice recognition. The electronic apparatus increases a score of a category corresponding to a word included in user's utterance in a database when the instruction included in the user's utterance is present in the database. The electronic apparatus checks whether the score of the category corresponding to the word is equal to or greater than a preset value when the instruction is not present in the database. The electronic apparatus registers the instruction in the database so that the instruction is included in the category corresponding to the word when the score is equal to or greater than the preset value as the check result.

Type: Grant

Filed: September 16, 2020

Date of Patent: December 27, 2022

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventor: Heejae Kim
Voice control for remote monitoring

Patent number: 11501879

Abstract: Techniques for voice control of a patient care device are described. A patient care device receives an audio request from a user. The patient care device records the audio request. The patient care device transmits the audio request over a communication network to a speech recognition service, and in response receives, from the speech recognition service, a textual representation of the audio request. The patient care device matches the textual representation, using the computer processor, to a first command in a vocabulary of available commands, and in response performs the first command.

Type: Grant

Filed: October 1, 2019

Date of Patent: November 15, 2022

Assignee: PREVENTICE TECHNOLOGIES, INC.

Inventors: Richard M. Smith, Scott J. Burrichter, Jon P. Otterstatter

1 2 next