Speech Controlled System Patents (Class 704/275)

Systems and methods for automatically populating information in a graphical user interface using natural language processing

Patent number: 11861298

Abstract: The present disclosure relates to systems, methods, and computer-readable media for performing natural language processing on a clinical note or audio information associated with medical personnel. A computer-implemented method performed by one or more processors for populating a graphical user interface with data associated with a voice input. The method may include receiving a voice input, generating a first text based on the voice input, comparing the text against a computer model, identifying a data field in the text, selecting a form field based on the identified data field, extracting a second text based on the generated text, and populating the second text in the selected form field.

Type: Grant

Filed: October 22, 2018

Date of Patent: January 2, 2024

Assignee: TeleTracking Technologies, Inc.

Inventors: Albert Tackie, Tejashree Gharat, Vanita Kolukulri
Method to determine intended direction of a vocal command and target for vocal interaction

Patent number: 11853651

Abstract: Systems and methods are described for recognizing and responding to commands in a virtual or physical environment. A system may receive voice data and determine an intended command. The system may then determine a position and viewpoint orientation of the user to be able to determine one or more digital assets associated with the user. The system may then determine a current state associated with each digital asset of the one or more digital assets to be able to determine at least one digital asset that is configured to process the command. The system can then apply the command to at least a first digital asset of the at least one digital asset that is configured to process the command.

Type: Grant

Filed: November 10, 2022

Date of Patent: December 26, 2023

Assignee: COMCAST CABLE COMMUNICATIONS, LLC

Inventor: Mark David Francisco
Development system and method for a conversational application

Patent number: 11853729

Abstract: A method, computer program product, and computing system for enabling usage of a conversational application by a plurality of users; gathering usage data concerning usage of the conversational application by the plurality of users; defining a visual representation of the conversational application; and overlaying the usage data onto the visual representation of the conversational application to generate visual traffic flow data.

Type: Grant

Filed: December 16, 2022

Date of Patent: December 26, 2023

Assignee: Microsoft Technology Licensing, LLC

Inventors: David Ardman, Andrew Matkin, Nirvana Tikku, Abhishek Rohatgi, Marco Antonio Padron Chavez, Flaviu Gelu Negrean, Gabrielle R. Martone
Data processing for connected and autonomous vehicles

Patent number: 11847908

Abstract: A method may be implemented to prioritize and analyze data exchanged in a connected vehicle transit network. The method may include receiving, at a roadside unit, vehicle data from a connected vehicle. The method may further include prioritizing the vehicle data received from the connected vehicle based on a level of urgency, network latency or available computing resources.

Type: Grant

Filed: January 12, 2022

Date of Patent: December 19, 2023

Assignee: Bentley Systems, Incorporated

Inventors: Mark E. Pittman, Patrick B. Brown, David J. Sacharny, Victor Gill
Systems and methods for generating synthesized speech responses to voice inputs by training a neural network model based on the voice input prosodic metrics and training voice inputs

Patent number: 11842721

Abstract: The system provides a synthesized speech response to a voice input, based on the prosodic character of the voice input. The system receives the voice input and calculates at least one prosodic metric of the voice input. The at least one prosodic metric can be associated with a word, phrase, grouping thereof, or the entire voice input. The system also determines a response to the voice input, which may include the sequence of words that form the response. The system generates the synthesized speech response, by determining prosodic characteristics based on the response, and on the prosodic character of the voice input. The system outputs the synthesized speech response, which includes a more natural, relevant, or both answer to the call of the voice input. The prosodic character of the voice input and/or response may include pitch, note, duration, prominence, timbre, rate, and rhythm, for example.

Type: Grant

Filed: August 5, 2022

Date of Patent: December 12, 2023

Assignee: Rovi Guides, Inc.

Inventors: Ankur Aher, Jeffry Copps Robert Jose
Information processing system, information processing apparatus, and information processing method

Patent number: 11838460

Abstract: An information processing system includes a microphone configured to acquire sound, a display device configured to display information, and at least one controller. The at least one controller is configured to cause the display device to display a first screen at least including one setting item with a set value that can be changed by an instruction, cause the display device to display a second screen including a selection object for transition to the first screen, and acquire text data based on speech acquired through the microphone during display of the second screen and change the set value of the one setting item on the basis of the text data without causing the display device to display the first screen.

Type: Grant

Filed: April 14, 2022

Date of Patent: December 5, 2023

Assignee: Canon Kabushiki Kaisha

Inventors: Toru Takahashi, Takeshi Matsumura, Yuji Naya
Digital assistant interaction in a video communication session environment

Patent number: 11837232

Abstract: This relates to an intelligent automated assistant in a video communication session environment. An example method includes, during a video communication session between at least two user devices, and at a first user device: receiving a first user voice input; in accordance with a determination that the first user voice input represents a communal digital assistant request, transmitting a request to provide context information associated with the first user voice input to the first user device; receiving context information associated with the first user voice input; obtaining a first digital assistant response based at least on a portion of the context information received from the second user device and at least a portion of context information associated with the first user voice input that is stored on the first user device; providing the first digital assistant response to the second user device; and outputting the first digital assistant response.

Type: Grant

Filed: February 28, 2023

Date of Patent: December 5, 2023

Assignee: Apple Inc.

Inventors: Niranjan Manjunath, Willem Mattelaer, Jessica Peck, Lily Shuting Zhang
Audio processing techniques for semantic audio recognition and report generation

Patent number: 11837208

Abstract: Example methods, apparatus and articles of manufacture to determine semantic information for audio are disclosed. Example apparatus disclosed herein are to process an audio signal obtained by a media device to determine values of a plurality of features that are characteristic of the audio signal, compare the values of the plurality of features to a first template having corresponding first ranges of the plurality of features to determine a first score, the first template associated with first semantic information, compare the values of the plurality of features to a second template having corresponding second ranges of the plurality of features to determine a second score, the second template associated with second semantic information, and associate the audio signal with at least one of the first semantic information or the second semantic information based on the first score and the second score.

Type: Grant

Filed: August 16, 2021

Date of Patent: December 5, 2023

Assignee: The Nielsen Company (US), LLC

Inventors: Alan Neuhauser, John Stavropoulos
Method for processing man-machine dialogues

Patent number: 11830483

Abstract: The present disclosure discloses a method for processing man-machine dialogues, which includes: acquiring a first user voice message from a client; determining a dialogue intent corresponding to the first user voice message; determining a target duplex wake-up mode corresponding to the dialogue intent based on an intent wake-up mode table, wherein the intent-wake mode table includes duplex wake-up modes corresponding to a plurality of candidate dialogue intents respectively, and the duplex wake-up modes comprise a full-duplex wake-up mode and a half-duplex wake-up mode; and sending a wake-up mode instruction corresponding to the target duplex wake-up mode to the client, such that the client processes the first user voice message according to the target duplex wake-up mode. Using the method and apparatus for carrying out the method, the wake-up mode of the client may be switched dynamically.

Type: Grant

Filed: November 25, 2019

Date of Patent: November 28, 2023

Assignee: AI SPEECH CO., LTD.

Inventor: Xinwei Yang
Multiple speech processing system with synthesized speech styles

Patent number: 11830485

Abstract: A device having a voice-based interface activates or “wakes” when it detects an utterance that includes a wakeword; the device may be installed in a vehicle, such as an automobile. The device may distinguish between different wakewords; a different speech processing system may be associated with each wakeword, and each speech processing engine may have its own speech style and associated applications and functions.

Type: Grant

Filed: December 11, 2018

Date of Patent: November 28, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Abhay Gupta, Timothy Whalin
Multitouch data fusion

Patent number: 11816329

Abstract: A method for performing multi-touch (MT) data fusion is disclosed in which multiple touch inputs occurring at about the same time are received to generating first touch data. Secondary sense data can then be combined with the first touch data to perform operations on an electronic device. The first touch data and the secondary sense data can be time-aligned and interpreted in a time-coherent manner. The first touch data can be refined in accordance with the secondary sense data, or alternatively, the secondary sense data can be interpreted in accordance with the first touch data. Additionally, the first touch data and the secondary sense data can be combined to create a new command.

Type: Grant

Filed: October 13, 2022

Date of Patent: November 14, 2023

Assignee: Apple Inc.

Inventors: Wayne Carl Westerman, John Greer Elias
Input device and image forming apparatus

Patent number: 11818309

Abstract: An input device is mountable on an image forming apparatus including a display portion capable of displaying information and is a numerical key unit 70 at least including a numerical key portion 110 consisting of a hardware key capable of inputting information on a numerical value and a start key 131 consisting of a hardware key capable of inputting information for starting predetermined processing. The start key 131 is disposed on a front side F than the numerical key portion 110 in a state in which the numerical key unit 70 is mounted on the image forming apparatus.

Type: Grant

Filed: July 6, 2021

Date of Patent: November 14, 2023

Assignee: CANON KABUSHIKI KAISHA

Inventor: Takehito Utsunomiya
Method and system for adaptive audio modification

Patent number: 11818426

Abstract: Systems and methods for modifying audio events in video content that correspond to one or more defined audio event types. A request is received to modify audio events in video corresponding to an audio event type. Video content to be presented on a display device is obtained that includes visual and audio content. An occurrence of a defined audio event corresponding to a defined audio event type is identified in the audio content. The defined audio event is modified according to a modification operation to generate modified audio content. The modified audio content is associated with a segment of the visual content that corresponds to the occurrence of the defined audio event. The modified audio content is provided in association with the segment of visual content for display on the display device.

Type: Grant

Filed: November 14, 2019

Date of Patent: November 14, 2023

Assignee: DISH Network L.L.C.

Inventors: Neil Marten, Rebecca Albinola
Machine learning used to detect alignment and misalignment in conversation

Patent number: 11817086

Abstract: Digitized media is received that records a conversation between individuals. Cues are extracted from the digitized media that indicate properties of the conversation. The cues are entered as training data into a machine learning module to create a trained machine learning model. The trained machine learning model is used in a processor to detect other misalignments in subsequent digitized conversations.

Type: Grant

Filed: March 13, 2020

Date of Patent: November 14, 2023

Assignee: XEROX CORPORATION

Inventors: Evgeniy Bart, Margaret H. Szymanski
Systems and methods for connecting and controlling configurable lighting units

Patent number: 11815234

Abstract: In some embodiments, a configurable lighting device, connectors, controllers, and methods for layout detection are provided. The configurable lighting devices, suitably connected, form an assembly of configurable lighting devices that can be removably connected with one another and re-arranged. Connectors are provided that form mechanical and electrical connections between configurable lighting devices such that a flow of electricity and control signals may be propagated without the need for direct connection between every configurable lighting device and a controller. The controller or devices connected to the controller are configured to perform layout detection such that pleasing visualizations may be rendered across the assembly that are rendered using at least the detected layout. When the configuration of the configurable lighting devices changes, the layout detection is automatically updated.

Type: Grant

Filed: April 18, 2022

Date of Patent: November 14, 2023

Assignee: Nanogrid Limited

Inventors: Tomas Rodinger, Aliakbar Juzer Eski, Henry Chow, Arash Sadr, John Anders Ohrn, Gimmy Chu
Joining users to communications via voice commands

Patent number: 11816394

Abstract: Techniques for joining a device of a third user to a communication between a device of a first user and a device of a second user are described herein. For instance, two or more users may utilize respective computing devices to engage in a telephone call, a video call, an instant-messaging session, or any other type of communication in which the users communicate with each other audibly and/or visually. In some instances, a first user of the two users may issue a voice command requesting to join a device of a third user to the communication. One or more computing devices may recognize this voice command and may attempt to join a device of a third user to the communication.

Type: Grant

Filed: March 20, 2023

Date of Patent: November 14, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Ty Loren Carlson, Rohan Mutagi
Intelligent device arbitration and control

Patent number: 11809783

Abstract: This relates to systems and processes for using a virtual assistant to arbitrate among and/or control electronic devices. In one example process, a first electronic device samples an audio input using a microphone. The first electronic device broadcasts a first set of one or more values based on the sampled audio input. Furthermore, the first electronic device receives a second set of one or more values, which are based on the audio input, from a second electronic device. Based on the first set of one or more values and the second set of one or more values, the first electronic device determines whether to respond to the audio input or forego responding to the audio input.

Type: Grant

Filed: March 5, 2021

Date of Patent: November 7, 2023

Assignee: Apple Inc.

Inventors: Kurt Piersol, Ryan M. Orr, Daniel J. Mandel
Electronic device configured to perform action using speech recognition function and method for providing notification related to action using same

Patent number: 11810571

Abstract: A method includes receiving a designated event related to a second application while an execution screen of a first application is displayed on a display. The method also includes executing an artificial intelligent application in response to the designated event. The method further includes transmitting data related to the designated event to an external server, based on the executed artificial intelligent application. Additionally, the method includes sensing a user utterance related to the designated event for a designated period of time. The method also includes transmitting the user utterance to the external server. The method further includes receiving an action order for performing a function related to the user utterance from the external server. The method also includes executing the second application at least based on the received action order. The method further includes outputting a result of performing the function by using the second application.

Type: Grant

Filed: February 13, 2023

Date of Patent: November 7, 2023

Assignee: Samsung Electronics Co., Ltd.

Inventors: Taegu Kim, Gajin Song, Jaeyung Yeo
Dynamic adjustment of wake word acceptance tolerance thresholds in voice-controlled devices

Patent number: 11810564

Abstract: Systems and methods are provided for detecting wake words. An electronic device detects, from a microphone array, an audio signal in an environment proximate to the audio front end system. The electronic device processes the audio signal using a plurality of wake word detection engines, including dynamically adjusting how many wake word detection engines are available for processing the audio signal. The electronic device independently adjusts respective wake word detection thresholds for the plurality of wake word detection engines used to process the audio signal.

Type: Grant

Filed: March 25, 2022

Date of Patent: November 7, 2023

Assignee: Spotify AB

Inventors: Daniel Bromand, Joseph Cauteruccio, Sven Erland Fredrik Lewin
Explaining anomalous phonetic translations

Patent number: 11810558

Abstract: A method includes: receiving, by a computing device, a digital voice stream; receiving, by the computing device, converted text that represents the digital voice stream; identifying, by the computing device, an erroneously converted portion of the converted text; selecting, by the computing device, the erroneously converted portion for explainability processing; parsing, by the computing device, the erroneously converted portion into parts based on a predetermined parsing level; collecting, by the computing device, supplementary input data related to the erroneously converted portion; and determining, by the computing device and based on the supplemental input data, a reason why the erroneously converted portion was erroneously converted.

Type: Grant

Filed: May 26, 2021

Date of Patent: November 7, 2023

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Gandhi Sivakumar, Kushal S. Patel, Luke Peter Macura, Sarvesh S. Patel
Gaze detection using one or more neural networks

Patent number: 11803759

Abstract: Apparatuses, systems, and techniques are described to determine locations of objects using images including digital representations of those objects. In at least one embodiment, a gaze of one or more occupants of a vehicle is determined independently of a location of one or more sensors used to detect those occupants.

Type: Grant

Filed: October 11, 2021

Date of Patent: October 31, 2023

Assignee: Nvidia Corporation

Inventors: Feng Hu, Niranjan Avadhanam, Yuzhuo Ren, Sujay Yadawadkar, Sakthivel Sivaraman, Hairong Jiang, Siyue Wu
Method and system for asynchronous notifications for users in contextual interactive systems

Patent number: 11803400

Abstract: A terminal server of a virtual assistant system for proactively triggering notifications is disclosed. The terminal server is configured to: receive data indicative of a change of a service related state associated with a user of at least one terminal client; generate accordingly a close-ended type question; instruct a transmission of the close-ended type question to the at least one terminal client; in response to a retransmission request, received from the at least one terminal client in relation to the transmission: not perform the close-ended type question, access a storage of the service related state to generate accordingly a new close-ended type question, instruct a transmission of the new close-ended type question to the at least one terminal client, analyze a closed type answer provided by the at least one terminal client, and instruct transmission of a current response to the answer provided by the user.

Type: Grant

Filed: June 25, 2020

Date of Patent: October 31, 2023

Assignee: International Business Machines Corporation

Inventors: Offer Akrabi, Ari Volcoff, Eliezer Segev Wasserkrug, Erez Lev Meir Bilgory
Recorded media hotword trigger suppression

Patent number: 11798543

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for suppressing hotword triggers when detecting a hotword in recorded media are disclosed. In one aspect, a method includes the actions of receiving, by a computing device, audio corresponding to playback of an item of media content. The actions further include determining, by the computing device, that the audio includes an utterance of a predefined hotword and that the audio includes an audio watermark. The actions further include analyzing, by the computing device, the audio watermark. The actions further include based on analyzing the audio watermark, determining, by the computing device, whether to perform speech recognition on a portion of the audio following the predefined hotword.

Type: Grant

Filed: January 13, 2022

Date of Patent: October 24, 2023

Assignee: Google LLC

Inventor: Ricardo Antonio Garcia
Systems, methods, and apparatus for enhanced presentation remotes

Patent number: 11797102

Abstract: In accordance with some embodiments, systems, apparatus, interfaces, methods, and articles of manufacture are provided for ascertaining aspects of a presentation and/or of an audience member. A presentation remote can be used to obtain information about the presentation and provide it to the audience member.

Type: Grant

Filed: January 18, 2023

Date of Patent: October 24, 2023

Assignee: SCIENCE HOUSE LLC

Inventors: James Jorasch, Michael Werner, Geoffrey Gelman, Isaac W. Hock, Gennaro Rendino, Christopher Capobianco
Voice activated device for use with a voice-based digital assistant

Patent number: 11798547

Abstract: A voice activated device for interaction with a digital assistant is provided. The device comprises a housing, one or more processors, and memory, the memory coupled to the one or more processors and comprising instructions for automatically identifying and connecting to a digital assistant server. The device further comprises a power supply, a wireless network module, and a human-machine interface. The human-machine interface consists essentially of: at least one speaker, at least one microphone, an ADC coupled to the microphone, a DAC coupled to the at least one speaker, and zero or more additional components selected from the set consisting of: a touch-sensitive surface, one or more cameras, and one or more LEDs. The device is configured to act as an interface for speech communications between the user and a digital assistant of the user on the digital assistant server.

Type: Grant

Filed: August 6, 2020

Date of Patent: October 24, 2023

Assignee: Apple Inc.

Inventor: Kevin Milden
Real-time message delivery and update service in a proxy server network

Patent number: 11792295

Abstract: This patent document describes technology for providing real-time messaging and entity update services in a distributed proxy server network, such as a CDN. Uses include distributing real-time notifications about updates to data stored in and delivered by the network, with both high efficiency and locality of latency. The technology can be integrated into conventional caching proxy servers providing HTTP services, thereby leveraging their existing footprint in the Internet, their existing overlay network topologies and architectures, and their integration with existing traffic management components.

Type: Grant

Filed: May 20, 2022

Date of Patent: October 17, 2023

Assignee: Akamai Technologies, Inc.

Inventors: Matthew J. Stevens, Michael G. Merideth, Nil Alexandrov, Andrew F. Champagne, Brendan Coyle, Timothy Glynn, Mark A. Roman, Xin Xu
System and methods for disambiguating an ambiguous entity in a search query based on the gaze of a user

Patent number: 11792476

Abstract: Systems and methods for disambiguation of an ambiguous entity in a search query based on the gaze of a user. These systems and methods may be implemented by a media guidance application (e.g., executed by user equipment associated with the user). In some aspects, the media guidance application may monitor the gaze of the user and attempt to disambiguate the ambiguous entity based on an area of the screen the user viewed while issuing the search query. If the media guidance application receives an indication that it did not disambiguate the ambiguous entity in the search query correctly, the media guidance application may increase the area of the screen that the user viewed in order to find an additional entity. This may allow the media guidance application to quickly and accurately find the correct answer to the user's search query.

Type: Grant

Filed: June 23, 2020

Date of Patent: October 17, 2023

Assignee: Rovi Product Corporation

Inventors: Ajay Kumar Gupta, William L. Thomas, Mathew C. Burns, Gabriel C. Dalbec, Alexander W. Liston, Jonathan A. Logan, Margret B. Schmidt
Method and an equipment for configuring a service

Patent number: 11790905

Abstract: An equipment and a method for configuring a service on an equipment. A method includes receiving a first voice input from a user to configure an equipment with a service. The equipment is configured with a voice-bot to interact with the user. The method also includes validating the first voice input, initiating configuration of the service and outputting a first voice response based on the validation of the first voice input. The method includes receiving a second voice input from the user in response to the first voice response and validating the second voice input. The method includes outputting a second voice response based on the validation of the second voice input and configuring the service on the equipment based on the voice inputs from the user.

Type: Grant

Filed: December 7, 2020

Date of Patent: October 17, 2023

Assignee: CARRIER CORPORATION

Inventors: Karthikeyan Loganathan, Akil Vivek Jalisatgi
Image processing system, image processing apparatus, and image processing method

Patent number: 11785150

Abstract: An image processing system capable of managing image data using a plurality of boxes, comprises a microphone that obtains a sound, an obtaining unit that obtains a user identifier based on voice information of a user obtained via the microphone, a specifying unit that specifies one box among the plurality of boxes based on specification information including at least the user identifier, and an informing unit that informs the user of information related to the specified one box.

Type: Grant

Filed: September 8, 2021

Date of Patent: October 10, 2023

Assignee: Canon Kabushiki Kaisha

Inventor: Shintaro Okamura
Digital assistant interaction in a video communication session environment

Patent number: 11769497

Abstract: Embodiments provide a context-aware digital assistant at multiple user devices participating in a video communication session by using context information from a first user device to determine a digital assistant response at a second user device. In this manner, users participating in the video communication session may interact with the digital assistant during the video communication session as if the digital assistant is another participant in the video communication session. Embodiments further describe automatically determining candidate digital assistant tasks based on a shared transcription of user voice inputs received at user devices participating in a video communication session. In this manner, a digital assistant of a user device participating in a video communication session may proactively determine one or more tasks that a user of the user device may want the digital assistant to perform based on conversations held during the video communication session.

Type: Grant

Filed: January 26, 2021

Date of Patent: September 26, 2023

Assignee: Apple Inc.

Inventors: Niranjan Manjunath, Willem Mattelaer, Jessica Peck, Lily Shuting Zhang
System and method for temporal attention behavioral analysis of multi-modal conversations in a question and answer system

Patent number: 11769018

Abstract: Methods and systems for attention behavioral analysis for a conversational question and answer system are disclosed. A multi-modality input is selected from a plurality of multimodality conversations among two or more users. The system annotates the first modality inputs and at least one attention region in the first modality input corresponding to a set of entities and semantic relationships in a unified modality is identified by a discrete aspect of information bounded by the attention elements. The system models the representations of the multimodality inputs at different levels of granularity, which includes entity level, turn level, conversational level. The method proposed uses a network that consists of multilevel encoder-decoder architecture that is used to determine unified focalized attention, analyze and construct one or more responses for one or more turns in a conversation.

Type: Grant

Filed: November 24, 2020

Date of Patent: September 26, 2023

Assignee: Openstream Inc.

Inventor: Rajasekhar Tumuluri
Artificial intelligence apparatus

Patent number: 11769508

Abstract: Disclosed herein is an artificial intelligence apparatus including an input interface configured to receive speech data, and a processor configured to detect a non-utterance interval included in the speech data and determine presence/absence of a second utterance after the non-utterance interval according to characteristics of a first utterance before the non-utterance interval, when the non-utterance interval exceeds a set time.

Type: Grant

Filed: October 28, 2022

Date of Patent: September 26, 2023

Assignee: LG ELECTRONICS INC.

Inventor: Hansuk Shim
Systems and methods for tool integration using cross channel digital forms

Patent number: 11763074

Abstract: Embodiments of the invention are directed to a system, method, or computer program product for digital form integration and presentation. The method includes extracting one or more data input fields from a form and generating one or more user prompts to be presented to a user in order to complete at least one of the one or more entries of the one or more data input fields. The method further includes causing the transmission of at least one of the one or more user prompts to the user and receiving a prompt response from the user. The method also includes determining whether the prompt response meets one or more form requirements for a given entry and updating the form upon determination that the prompt response meets one or more form requirements for a given entry. The method further includes displaying to the user the form in an appropriate channel format.

Type: Grant

Filed: December 2, 2021

Date of Patent: September 19, 2023

Assignee: BANK OF AMERICA CORPORATION

Inventors: Indradeep Dantuluri, Charanjit S. Bagga, Muralidhar Chowdarapu, Burton M. Covnot, Sandeep Gandhi, Ryan Scott Heller, Saurabh Khanna, Silvia Adriana Krasuk, Mardochee Macxis, Walter Thomas Robinson, Rupal V. Shah, Mansoor Zafar
Electronic device and method for speech recognition processing of electronic device

Patent number: 11756575

Abstract: An electronic device and method are disclosed. The device includes a memory and speech recognition circuitry and/or a processor, which implements the method, including: receiving a first utterance, and processing the first utterance to initiate a session and generate a first response result, after the session related to the first utterance is terminated, receiving a second utterance, processing the second utterance to generate a second response result, based on the second response result, determining whether to execute follow-up utterance processing on the second utterance as if the session were active, based on determining to execute the follow-up utterance processing, reprocessing the second utterance based at least in part on the first response result related to the first utterance to generate a third response result, and outputting the third response result.

Type: Grant

Filed: February 22, 2022

Date of Patent: September 12, 2023

Assignee: Samsung Electronics Co., Ltd.

Inventors: Jisu Son, Youngbin Kim, Jungkeun Cho
Occupancy tracking using sound recognition

Patent number: 11747038

Abstract: An occupancy tracking device configured to receive a plurality of sound samples over a predetermined time period. The device is further configured to compute an audio signature for each sound sample. The audio signature includes a numerical value that uniquely identifies characteristics of an audio signal. The device is further configured to populate entries in the voice data log for the sound samples, to identify one or more clusters based on an audio signature that is associated with the populated entries, and to determine a number of clusters that are identified. The device is further configured to determine a predicted occupancy level based on the number of clusters that are identified and to control a Heating, Ventilation, and Air Conditioning (HVAC) system based on the predicted occupancy level.

Type: Grant

Filed: July 19, 2022

Date of Patent: September 5, 2023

Assignee: Lennox Industries Inc.

Inventors: Sunil Bondalapati, Prasad Mecheri Chandravihar
Voice assistant-enabled web application or web page

Patent number: 11749276

Abstract: Various embodiments discussed herein enable applications to seamlessly contribute to executing voice commands of users via voice assistant functionality. In response to receiving a user request to open an application or web page, the application can request and responsively receive a voice assistant runtime component along with the application or web page. The application, using a particular universal application interface component can compile or interpret the voice assistant runtime component from a source code format to an intermediate code format. In response to the application or web page being rendered and the detection of a key word or phrase, the application can activate voice assistant command execution functionality. The user can issue a voice command after which the application along with specific services can help execute the voice command.

Type: Grant

Filed: October 19, 2021

Date of Patent: September 5, 2023

Assignee: Microsoft Technology Licensing, LLC

Inventors: Rene Huangtian Brandel, Jason Eric Voldseth, Biao Kuang
Voice question and answer method and device, computer readable storage medium and electronic device

Patent number: 11749255

Abstract: A voice question and answer method and device, a computer readable storage medium, and an electronic device are described. The method comprises: receiving question voice information, and obtaining question text information according to the question voice information; performing at least one of general semantic analysis processing and dedicated semantic analysis processing on the question text information to generate an analysis result; and obtaining answer information according to the analysis result. The general semantic analysis is used for semantic analysis in the general field, and the dedicated semantic analysis is used for semantic analysis in the art field.

Type: Grant

Filed: March 31, 2020

Date of Patent: September 5, 2023

Assignee: BOE TECHNOLOGY GROUP CO., LTD.

Inventors: Chu Xu, Shuo Chen, Xingqun Jiang
Using corrections, of automated assistant functions, for training of on-device machine learning models

Patent number: 11741953

Abstract: Processor(s) of a client device can: receive sensor data that captures environmental attributes of an environment of the client device; process the sensor data using a machine learning model to generate a predicted output that dictates whether one or more currently dormant automated assistant functions are activated; making a decision as to whether to trigger the one or more currently dormant automated assistant functions; subsequent to making the decision, determining that the decision was incorrect; and in response to determining that the determination was incorrect, generating a gradient based on comparing the predicted output to ground truth output. In some implementations, the generated gradient is used, by processor(s) of the client device, to update weights of the on-device speech recognition model. In some implementations, the generated gradient is additionally or alternatively transmitted to a remote system for use in remote updating of global weights of a global speech recognition model.

Type: Grant

Filed: November 8, 2019

Date of Patent: August 29, 2023

Assignee: GOOGLE LLC

Inventors: Françoise Beaufays, Rajiv Mathews, Dragan Zivkovic, Kurt Partridge, Andrew Hard
Selective suppression of noises in a sound signal

Patent number: 11741983

Abstract: System and method for detecting and identifying noises in a sound signal occurring during a call on a mobile device and selectively filtering and suppressing the noises in the sound signal are provided. In the mobile device, a processor is configured to receive a sound signal, detect noises in the received sound signal, identify the noises in the received sound signal, display the identified noises in a user interface (UI), receive a selection of the displayed identified noises from the UI, and filter the received selection of the displayed identified noises from the received sound signal. The processor may use a machine learning module with a neural network to detect and identify the noises in the received sound signal.

Type: Grant

Filed: January 13, 2021

Date of Patent: August 29, 2023

Assignee: QUALCOMM Incorporated

Inventors: Vishnu Vardhan Kasilya Sudarsan, Naga Chandan Babu Gudivada, Dinesh Ramakrishnan, Rameshkumar Karuppusamy
Video integration with home assistant

Patent number: 11736760

Abstract: Methods, devices, systems, and means for video integration with a home assistant device are described herein. The home assistant device interacts with a person in a video stream by capturing, using a network-enabled outdoor video camera, a video stream of an outdoor location of a premises at which the person is present and analyzing the person appearing in the captured video stream to determine an identity of the person. Based on determining the identity of the person, the home assistant device announces a presence of the person that is outdoors and outputs instructions to the person.

Type: Grant

Filed: March 28, 2022

Date of Patent: August 22, 2023

Assignee: Google LLC

Inventors: Jessica Yuan, James Stewart, Rajeev Nongpiur, Patrick Lister, Chi Yeung Jonathan Ng
Providing contextual actions for mobile onscreen content

Patent number: 11734581

Abstract: Systems and methods provide an application programming interface to offer action suggestions to third-party applications using context data associated with the third-party. An example method includes receiving content information and context information from a source mobile application, the content information representing information to be displayed on a mobile device as part of a source mobile application administered by a third party, the context information being information specific to the third party and unavailable to a screen scraper. The method also includes predicting an action based on the content information and the context information, the action representing a deep link for a target mobile application. The method further includes providing the action to the source mobile application with a title and a thumbnail, the source mobile application using the title and thumbnail to display a selectable control that, when selected, causes the mobile device to initiate the action.

Type: Grant

Filed: May 19, 2021

Date of Patent: August 22, 2023

Assignee: GOOGLE LLC

Inventors: Ibrahim Badr, Mauricio Zuluaga, Aneto Okonkwo, Gökhan Bakir
Image projection device, medium, and image projection method

Patent number: 11736669

Abstract: An image projection device which is mounted on a vehicle or worn by a driver of the vehicle includes a control unit and a projection unit. The control unit accepts a selection for specifying a particular user, detects a current location, acquires an image showing at least a part of an appearance of the particular user, which is captured when the particular user has previously driven a particular user's vehicle at a location which is the same as or near to the current location, and projects the image onto a driver's seat of the vehicle by the projection unit.

Type: Grant

Filed: April 21, 2022

Date of Patent: August 22, 2023

Assignee: TOYOTA JIDOSHA KABUSHIKI KAISHA

Inventors: Toyokazu Nakashima, Ryuichi Kamaga, Mitsuhiro Miura, Yasuhiro Baba, Tomokazu Maya, Ryosuke Kobayashi, Genshi Kuno
Human-machine dialogue method and electronic device

Patent number: 11735163

Abstract: Disclosed is a human-computer dialogue method including determining a set number of jump topics about a target topic, and generating a topic jump map converging to the target topic based on the correlation intensions among the set number of jump topics; after an initial response to a user's dialogue request, selecting from the topic jump map a jump topic to which the user's dialogue request relates as an initial topic for a first round of recommendation; after completing a human-machine dialogue of the initial topic, determining a jump topic to jump according to the jump probability of jumping out of the initial topic to the k jump topics at the downstream level for a next round of recommendation; and gradually guiding from the initial topic to the target topic by step-by-step recommendation. A more fluent and efficient human-machine dialogue based on a clear communication goal can be realized.

Type: Grant

Filed: January 22, 2019

Date of Patent: August 22, 2023

Assignee: AI SPEECH CO., LTD.

Inventors: Min Chu, Taotao Guo, Zhongyuan Dai, Chao Yang
Methods and systems for using voice to control multiple devices

Patent number: 11727932

Abstract: Methods, apparatuses, computer-readable media, and systems for using voice (e.g., voice commands) to control a plurality of network devices via a motion sensing control device are provided. A control device can detect movement (e.g., a gesture) associated with the control device. The control device can initiate a either a direct or a proxy communication session with a remote computing device. The communication session can be established and maintained for a predefined period such that data associated with a command can be immediately transmitted to the remote computing device. Thus, data associated with the command can be transmitted over the already established communication session to the remote computing device. The remote computing device can provide a response to the control device and/or transmit a command code associated with the voice command to one or more devices intended to be controlled.

Type: Grant

Filed: March 25, 2022

Date of Patent: August 15, 2023

Assignee: Comcast Cable Communications, LLC

Inventor: Michael Rekstad
Dynamic modification of placeholder text in conversational interfaces

Patent number: 11727218

Abstract: According to one embodiment, a computer-implemented method for dynamically modifying placeholder text in a conversational interface includes: processing a conversation log reflecting a conversation between a human user and an automated agent; determining, based at least in part on the processing: one or more capabilities of the automated agent; and/or a trajectory of the conversation; and dynamically modifying placeholder text in the conversational interface based at least in part on: the one or more capabilities of the automated agent; the trajectory of the conversation; or both the one or more capabilities of the automated agent and the trajectory of the conversation. Other embodiments in the form of systems and computer program products are also disclosed.

Type: Grant

Filed: October 26, 2018

Date of Patent: August 15, 2023

Assignee: International Business Machines Corporation

Inventors: Raphael I. Arar, Robert J. Moore, Guangjie Ren, Margaret H. Szymanski, Eric Y. Liu
Transitioning between prior dialog contexts with automated assistants

Patent number: 11727220

Abstract: Techniques are described related to prior context retrieval with an automated assistant. In various implementations, instance(s) of free-form natural language input received from a user during a human-to-computer dialog session between the user and an automated assistant may be used to generate a first dialog context. The first dialog context may include intent(s) and slot value(s) associated with the intent(s). Similar operations may be performed with additional inputs to generate a second dialog context that is semantically distinct from the first dialog context. When a command is received from the user to transition the automated assistant back to the first dialog context, natural language output may be generated that conveys at least one or more of the intents of the first dialog context and one or more of the slot values of the first dialog context. This natural language output may be presented to the user.

Type: Grant

Filed: March 22, 2022

Date of Patent: August 15, 2023

Assignee: GOOGLE LLC

Inventors: Justin Lewis, Scott Davies
Acoustic identification of audio products

Patent number: 11726161

Abstract: Disclosed are techniques for a multimedia device with audio and video capturing capability to identify an audio device based on acoustic playback signal if the audio device cannot be identified from captured video. The multimedia device may assemble a list of candidate audio devices that are a possible match for the observed audio device from a database of previously recognized audio devices and may transmit commands to the candidate audio devices to play acoustic identification signals. The acoustic identification signals may be audible sound or ultrasonic tone sequences with embedded identification information unique to each audio device. The multimedia device may record and analyze the acoustic identification signals received from any of the candidate audio devices to construct metrics to select the most likely candidate for the observed audio device. The metrics may include time of flight, direction of arrival, received amplitude, direct-to-reverberant ratio (DRR) of the acoustic identification signals.

Type: Grant

Filed: September 1, 2021

Date of Patent: August 15, 2023

Assignee: Apple Inc.

Inventors: Christopher T. Eubank, Martin E. Johnson, Daniel K. Boothe, Jonathan D. Sheaffer
Systems and methods for identifying content corresponding to a language spoken in a household

Patent number: 11721321

Abstract: Systems and methods for identifying content corresponding to a language are provided. Language spoken by a first user based on verbal input received from the first user is automatically determined with voice recognition circuitry. A database of content sources is cross-referenced to identify a content source associated with a language field value that corresponds to the determined language spoken by the first user. The language field in the database identifies the language that the associated content source transmits content to a plurality of users. A representation of the identified content source is generated for display to the first user.

Type: Grant

Filed: August 23, 2021

Date of Patent: August 8, 2023

Assignee: Rovi Guides, Inc.

Inventor: Shuchita Mehra
Selecting an input mode for a virtual assistant

Patent number: 11720238

Abstract: Methods, systems, and apparatus for selecting an input mode are described. In one aspect, a method includes receiving request data specifying a request to launch a virtual assistant application from a lock screen of a mobile device. In response to receiving the request data, input signals are obtained. A selection of an input mode for the virtual assistant application is made, from candidate input modes, based on the input signals. Each candidate input mode is of an input type different from each other input type of each other candidate input mode. The input types include an image type and an audio type. The input mode of the image type receives pixel data for input to the virtual assistant application. The input mode of the audio type receives audio input for the virtual assistant application. The virtual assistant application presents content selected based on input signals received using the selected input mode.

Type: Grant

Filed: October 8, 2021

Date of Patent: August 8, 2023

Assignee: GOOGLE LLC

Inventor: Ibrahim Badr
Method and device for performing voice recognition using grammar model

Patent number: RE49762

Abstract: A method of updating speech recognition data including a language model used for speech recognition, the method including obtaining language data including at least one word; detecting a word that does not exist in the language model from among the at least one word; obtaining at least one phoneme sequence regarding the detected word; obtaining components constituting the at least one phoneme sequence by dividing the at least one phoneme sequence into predetermined unit components; determining information regarding probabilities that the respective components constituting each of the at least one phoneme sequence appear during speech recognition; and updating the language model based on the determined probability information.

Type: Grant

Filed: September 28, 2021

Date of Patent: December 19, 2023

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Chi-youn Park, Il-hwan Kim, Kyung-min Lee, Nam-hoon Kim, Jae-won Lee

prev 1 2 3 4 5 6 … next