Speech Controlled System Patents (Class 704/275)
  • Patent number: 11861298
    Abstract: The present disclosure relates to systems, methods, and computer-readable media for performing natural language processing on a clinical note or audio information associated with medical personnel. A computer-implemented method, performed by one or more processors, is provided for populating a graphical user interface with data associated with a voice input. The method may include receiving a voice input, generating a first text based on the voice input, comparing the text against a computer model, identifying a data field in the text, selecting a form field based on the identified data field, extracting a second text based on the generated text, and populating the second text in the selected form field.
    Type: Grant
    Filed: October 22, 2018
    Date of Patent: January 2, 2024
    Assignee: TeleTracking Technologies, Inc.
    Inventors: Albert Tackie, Tejashree Gharat, Vanita Kolukulri
  • Patent number: 11853651
    Abstract: Systems and methods are described for recognizing and responding to commands in a virtual or physical environment. A system may receive voice data and determine an intended command. The system may then determine a position and viewpoint orientation of the user to be able to determine one or more digital assets associated with the user. The system may then determine a current state associated with each digital asset of the one or more digital assets to be able to determine at least one digital asset that is configured to process the command. The system can then apply the command to at least a first digital asset of the at least one digital asset that is configured to process the command.
    Type: Grant
    Filed: November 10, 2022
    Date of Patent: December 26, 2023
    Assignee: COMCAST CABLE COMMUNICATIONS, LLC
    Inventor: Mark David Francisco
  • Patent number: 11853729
    Abstract: A method, computer program product, and computing system for enabling usage of a conversational application by a plurality of users; gathering usage data concerning usage of the conversational application by the plurality of users; defining a visual representation of the conversational application; and overlaying the usage data onto the visual representation of the conversational application to generate visual traffic flow data.
    Type: Grant
    Filed: December 16, 2022
    Date of Patent: December 26, 2023
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: David Ardman, Andrew Matkin, Nirvana Tikku, Abhishek Rohatgi, Marco Antonio Padron Chavez, Flaviu Gelu Negrean, Gabrielle R. Martone
  • Patent number: 11847908
    Abstract: A method may be implemented to prioritize and analyze data exchanged in a connected vehicle transit network. The method may include receiving, at a roadside unit, vehicle data from a connected vehicle. The method may further include prioritizing the vehicle data received from the connected vehicle based on a level of urgency, network latency or available computing resources.
    Type: Grant
    Filed: January 12, 2022
    Date of Patent: December 19, 2023
    Assignee: Bentley Systems, Incorporated
    Inventors: Mark E. Pittman, Patrick B. Brown, David J. Sacharny, Victor Gill
  • Patent number: 11842721
    Abstract: The system provides a synthesized speech response to a voice input, based on the prosodic character of the voice input. The system receives the voice input and calculates at least one prosodic metric of the voice input. The at least one prosodic metric can be associated with a word, phrase, grouping thereof, or the entire voice input. The system also determines a response to the voice input, which may include the sequence of words that form the response. The system generates the synthesized speech response by determining prosodic characteristics based on the response and on the prosodic character of the voice input. The system outputs the synthesized speech response, which provides an answer to the voice input that is more natural, more relevant, or both. The prosodic character of the voice input and/or response may include pitch, note, duration, prominence, timbre, rate, and rhythm, for example.
    Type: Grant
    Filed: August 5, 2022
    Date of Patent: December 12, 2023
    Assignee: Rovi Guides, Inc.
    Inventors: Ankur Aher, Jeffry Copps Robert Jose
  • Patent number: 11838460
    Abstract: An information processing system includes a microphone configured to acquire sound, a display device configured to display information, and at least one controller. The at least one controller is configured to cause the display device to display a first screen at least including one setting item with a set value that can be changed by an instruction, cause the display device to display a second screen including a selection object for transition to the first screen, and acquire text data based on speech acquired through the microphone during display of the second screen and change the set value of the one setting item on the basis of the text data without causing the display device to display the first screen.
    Type: Grant
    Filed: April 14, 2022
    Date of Patent: December 5, 2023
    Assignee: Canon Kabushiki Kaisha
    Inventors: Toru Takahashi, Takeshi Matsumura, Yuji Naya
  • Patent number: 11837232
    Abstract: This relates to an intelligent automated assistant in a video communication session environment. An example method includes, during a video communication session between at least two user devices, and at a first user device: receiving a first user voice input; in accordance with a determination that the first user voice input represents a communal digital assistant request, transmitting a request to provide context information associated with the first user voice input to the first user device; receiving context information associated with the first user voice input; obtaining a first digital assistant response based at least on a portion of the context information received from the second user device and at least a portion of context information associated with the first user voice input that is stored on the first user device; providing the first digital assistant response to the second user device; and outputting the first digital assistant response.
    Type: Grant
    Filed: February 28, 2023
    Date of Patent: December 5, 2023
    Assignee: Apple Inc.
    Inventors: Niranjan Manjunath, Willem Mattelaer, Jessica Peck, Lily Shuting Zhang
  • Patent number: 11837208
    Abstract: Example methods, apparatus and articles of manufacture to determine semantic information for audio are disclosed. Example apparatus disclosed herein are to process an audio signal obtained by a media device to determine values of a plurality of features that are characteristic of the audio signal, compare the values of the plurality of features to a first template having corresponding first ranges of the plurality of features to determine a first score, the first template associated with first semantic information, compare the values of the plurality of features to a second template having corresponding second ranges of the plurality of features to determine a second score, the second template associated with second semantic information, and associate the audio signal with at least one of the first semantic information or the second semantic information based on the first score and the second score.
    Type: Grant
    Filed: August 16, 2021
    Date of Patent: December 5, 2023
    Assignee: The Nielsen Company (US), LLC
    Inventors: Alan Neuhauser, John Stavropoulos
  • Patent number: 11830483
    Abstract: The present disclosure discloses a method for processing man-machine dialogues, which includes: acquiring a first user voice message from a client; determining a dialogue intent corresponding to the first user voice message; determining a target duplex wake-up mode corresponding to the dialogue intent based on an intent wake-up mode table, wherein the intent wake-up mode table includes duplex wake-up modes corresponding to a plurality of candidate dialogue intents respectively, and the duplex wake-up modes comprise a full-duplex wake-up mode and a half-duplex wake-up mode; and sending a wake-up mode instruction corresponding to the target duplex wake-up mode to the client, such that the client processes the first user voice message according to the target duplex wake-up mode. Using the method and apparatus for carrying out the method, the wake-up mode of the client may be switched dynamically.
    Type: Grant
    Filed: November 25, 2019
    Date of Patent: November 28, 2023
    Assignee: AI SPEECH CO., LTD.
    Inventor: Xinwei Yang
  • Patent number: 11830485
    Abstract: A device having a voice-based interface activates or “wakes” when it detects an utterance that includes a wakeword; the device may be installed in a vehicle, such as an automobile. The device may distinguish between different wakewords; a different speech processing system may be associated with each wakeword, and each speech processing engine may have its own speech style and associated applications and functions.
    Type: Grant
    Filed: December 11, 2018
    Date of Patent: November 28, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Abhay Gupta, Timothy Whalin
  • Patent number: 11816329
    Abstract: A method for performing multi-touch (MT) data fusion is disclosed in which multiple touch inputs occurring at about the same time are received to generate first touch data. Secondary sense data can then be combined with the first touch data to perform operations on an electronic device. The first touch data and the secondary sense data can be time-aligned and interpreted in a time-coherent manner. The first touch data can be refined in accordance with the secondary sense data, or alternatively, the secondary sense data can be interpreted in accordance with the first touch data. Additionally, the first touch data and the secondary sense data can be combined to create a new command.
    Type: Grant
    Filed: October 13, 2022
    Date of Patent: November 14, 2023
    Assignee: Apple Inc.
    Inventors: Wayne Carl Westerman, John Greer Elias
  • Patent number: 11818309
    Abstract: An input device is mountable on an image forming apparatus including a display portion capable of displaying information and is a numerical key unit 70 at least including a numerical key portion 110 consisting of a hardware key capable of inputting information on a numerical value and a start key 131 consisting of a hardware key capable of inputting information for starting predetermined processing. The start key 131 is disposed further toward a front side F than the numerical key portion 110 in a state in which the numerical key unit 70 is mounted on the image forming apparatus.
    Type: Grant
    Filed: July 6, 2021
    Date of Patent: November 14, 2023
    Assignee: Canon Kabushiki Kaisha
    Inventor: Takehito Utsunomiya
  • Patent number: 11818426
    Abstract: Systems and methods for modifying audio events in video content that correspond to one or more defined audio event types. A request is received to modify audio events in video corresponding to an audio event type. Video content to be presented on a display device is obtained that includes visual and audio content. An occurrence of a defined audio event corresponding to a defined audio event type is identified in the audio content. The defined audio event is modified according to a modification operation to generate modified audio content. The modified audio content is associated with a segment of the visual content that corresponds to the occurrence of the defined audio event. The modified audio content is provided in association with the segment of visual content for display on the display device.
    Type: Grant
    Filed: November 14, 2019
    Date of Patent: November 14, 2023
    Assignee: DISH Network L.L.C.
    Inventors: Neil Marten, Rebecca Albinola
  • Patent number: 11817086
    Abstract: Digitized media is received that records a conversation between individuals. Cues are extracted from the digitized media that indicate properties of the conversation. The cues are entered as training data into a machine learning module to create a trained machine learning model. The trained machine learning model is used in a processor to detect misalignments in subsequent digitized conversations.
    Type: Grant
    Filed: March 13, 2020
    Date of Patent: November 14, 2023
    Assignee: XEROX CORPORATION
    Inventors: Evgeniy Bart, Margaret H. Szymanski
  • Patent number: 11815234
    Abstract: In some embodiments, a configurable lighting device, connectors, controllers, and methods for layout detection are provided. The configurable lighting devices, suitably connected, form an assembly of configurable lighting devices that can be removably connected with one another and re-arranged. Connectors are provided that form mechanical and electrical connections between configurable lighting devices such that a flow of electricity and control signals may be propagated without the need for direct connection between every configurable lighting device and a controller. The controller or devices connected to the controller are configured to perform layout detection such that pleasing visualizations, rendered using at least the detected layout, may be displayed across the assembly. When the configuration of the configurable lighting devices changes, the layout detection is automatically updated.
    Type: Grant
    Filed: April 18, 2022
    Date of Patent: November 14, 2023
    Assignee: Nanogrid Limited
    Inventors: Tomas Rodinger, Aliakbar Juzer Eski, Henry Chow, Arash Sadr, John Anders Ohrn, Gimmy Chu
  • Patent number: 11816394
    Abstract: Techniques for joining a device of a third user to a communication between a device of a first user and a device of a second user are described herein. For instance, two or more users may utilize respective computing devices to engage in a telephone call, a video call, an instant-messaging session, or any other type of communication in which the users communicate with each other audibly and/or visually. In some instances, a first user of the two users may issue a voice command requesting to join a device of a third user to the communication. One or more computing devices may recognize this voice command and may attempt to join a device of a third user to the communication.
    Type: Grant
    Filed: March 20, 2023
    Date of Patent: November 14, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Ty Loren Carlson, Rohan Mutagi
  • Patent number: 11809783
    Abstract: This relates to systems and processes for using a virtual assistant to arbitrate among and/or control electronic devices. In one example process, a first electronic device samples an audio input using a microphone. The first electronic device broadcasts a first set of one or more values based on the sampled audio input. Furthermore, the first electronic device receives a second set of one or more values, which are based on the audio input, from a second electronic device. Based on the first set of one or more values and the second set of one or more values, the first electronic device determines whether to respond to the audio input or forgo responding to the audio input.
    Type: Grant
    Filed: March 5, 2021
    Date of Patent: November 7, 2023
    Assignee: Apple Inc.
    Inventors: Kurt Piersol, Ryan M. Orr, Daniel J. Mandel
  • Patent number: 11810571
    Abstract: A method includes receiving a designated event related to a second application while an execution screen of a first application is displayed on a display. The method also includes executing an artificial intelligence application in response to the designated event. The method further includes transmitting data related to the designated event to an external server, based on the executed artificial intelligence application. Additionally, the method includes sensing a user utterance related to the designated event for a designated period of time. The method also includes transmitting the user utterance to the external server. The method further includes receiving an action order for performing a function related to the user utterance from the external server. The method also includes executing the second application at least based on the received action order. The method further includes outputting a result of performing the function by using the second application.
    Type: Grant
    Filed: February 13, 2023
    Date of Patent: November 7, 2023
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Taegu Kim, Gajin Song, Jaeyung Yeo
  • Patent number: 11810564
    Abstract: Systems and methods are provided for detecting wake words. An electronic device detects, from a microphone array, an audio signal in an environment proximate to an audio front end system. The electronic device processes the audio signal using a plurality of wake word detection engines, including dynamically adjusting how many wake word detection engines are available for processing the audio signal. The electronic device independently adjusts respective wake word detection thresholds for the plurality of wake word detection engines used to process the audio signal.
    Type: Grant
    Filed: March 25, 2022
    Date of Patent: November 7, 2023
    Assignee: Spotify AB
    Inventors: Daniel Bromand, Joseph Cauteruccio, Sven Erland Fredrik Lewin
  • Patent number: 11810558
    Abstract: A method includes: receiving, by a computing device, a digital voice stream; receiving, by the computing device, converted text that represents the digital voice stream; identifying, by the computing device, an erroneously converted portion of the converted text; selecting, by the computing device, the erroneously converted portion for explainability processing; parsing, by the computing device, the erroneously converted portion into parts based on a predetermined parsing level; collecting, by the computing device, supplementary input data related to the erroneously converted portion; and determining, by the computing device and based on the supplementary input data, a reason why the erroneously converted portion was erroneously converted.
    Type: Grant
    Filed: May 26, 2021
    Date of Patent: November 7, 2023
    Assignee: International Business Machines Corporation
    Inventors: Gandhi Sivakumar, Kushal S. Patel, Luke Peter Macura, Sarvesh S. Patel
  • Patent number: 11803759
    Abstract: Apparatuses, systems, and techniques are described to determine locations of objects using images including digital representations of those objects. In at least one embodiment, a gaze of one or more occupants of a vehicle is determined independently of a location of one or more sensors used to detect those occupants.
    Type: Grant
    Filed: October 11, 2021
    Date of Patent: October 31, 2023
    Assignee: Nvidia Corporation
    Inventors: Feng Hu, Niranjan Avadhanam, Yuzhuo Ren, Sujay Yadawadkar, Sakthivel Sivaraman, Hairong Jiang, Siyue Wu
  • Patent number: 11803400
    Abstract: A terminal server of a virtual assistant system for proactively triggering notifications is disclosed. The terminal server is configured to: receive data indicative of a change of a service related state associated with a user of at least one terminal client; generate accordingly a close-ended type question; instruct a transmission of the close-ended type question to the at least one terminal client; in response to a retransmission request, received from the at least one terminal client in relation to the transmission: not perform the close-ended type question, access a storage of the service related state to generate accordingly a new close-ended type question, instruct a transmission of the new close-ended type question to the at least one terminal client, analyze a closed type answer provided by the at least one terminal client, and instruct transmission of a current response to the answer provided by the user.
    Type: Grant
    Filed: June 25, 2020
    Date of Patent: October 31, 2023
    Assignee: International Business Machines Corporation
    Inventors: Offer Akrabi, Ari Volcoff, Eliezer Segev Wasserkrug, Erez Lev Meir Bilgory
  • Patent number: 11798543
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for suppressing hotword triggers when detecting a hotword in recorded media are disclosed. In one aspect, a method includes the actions of receiving, by a computing device, audio corresponding to playback of an item of media content. The actions further include determining, by the computing device, that the audio includes an utterance of a predefined hotword and that the audio includes an audio watermark. The actions further include analyzing, by the computing device, the audio watermark. The actions further include, based on analyzing the audio watermark, determining, by the computing device, whether to perform speech recognition on a portion of the audio following the predefined hotword.
    Type: Grant
    Filed: January 13, 2022
    Date of Patent: October 24, 2023
    Assignee: Google LLC
    Inventor: Ricardo Antonio Garcia
  • Patent number: 11797102
    Abstract: In accordance with some embodiments, systems, apparatus, interfaces, methods, and articles of manufacture are provided for ascertaining aspects of a presentation and/or of an audience member. A presentation remote can be used to obtain information about the presentation and provide it to the audience member.
    Type: Grant
    Filed: January 18, 2023
    Date of Patent: October 24, 2023
    Assignee: SCIENCE HOUSE LLC
    Inventors: James Jorasch, Michael Werner, Geoffrey Gelman, Isaac W. Hock, Gennaro Rendino, Christopher Capobianco
  • Patent number: 11798547
    Abstract: A voice activated device for interaction with a digital assistant is provided. The device comprises a housing, one or more processors, and memory, the memory coupled to the one or more processors and comprising instructions for automatically identifying and connecting to a digital assistant server. The device further comprises a power supply, a wireless network module, and a human-machine interface. The human-machine interface consists essentially of: at least one speaker, at least one microphone, an ADC coupled to the microphone, a DAC coupled to the at least one speaker, and zero or more additional components selected from the set consisting of: a touch-sensitive surface, one or more cameras, and one or more LEDs. The device is configured to act as an interface for speech communications between the user and a digital assistant of the user on the digital assistant server.
    Type: Grant
    Filed: August 6, 2020
    Date of Patent: October 24, 2023
    Assignee: Apple Inc.
    Inventor: Kevin Milden
  • Patent number: 11792295
    Abstract: This patent document describes technology for providing real-time messaging and entity update services in a distributed proxy server network, such as a CDN. Uses include distributing real-time notifications about updates to data stored in and delivered by the network, with both high efficiency and locality of latency. The technology can be integrated into conventional caching proxy servers providing HTTP services, thereby leveraging their existing footprint in the Internet, their existing overlay network topologies and architectures, and their integration with existing traffic management components.
    Type: Grant
    Filed: May 20, 2022
    Date of Patent: October 17, 2023
    Assignee: Akamai Technologies, Inc.
    Inventors: Matthew J. Stevens, Michael G. Merideth, Nil Alexandrov, Andrew F. Champagne, Brendan Coyle, Timothy Glynn, Mark A. Roman, Xin Xu
  • Patent number: 11792476
    Abstract: Systems and methods for disambiguation of an ambiguous entity in a search query based on the gaze of a user. These systems and methods may be implemented by a media guidance application (e.g., executed by user equipment associated with the user). In some aspects, the media guidance application may monitor the gaze of the user and attempt to disambiguate the ambiguous entity based on an area of the screen the user viewed while issuing the search query. If the media guidance application receives an indication that it did not disambiguate the ambiguous entity in the search query correctly, the media guidance application may increase the area of the screen that the user viewed in order to find an additional entity. This may allow the media guidance application to quickly and accurately find the correct answer to the user's search query.
    Type: Grant
    Filed: June 23, 2020
    Date of Patent: October 17, 2023
    Assignee: Rovi Product Corporation
    Inventors: Ajay Kumar Gupta, William L. Thomas, Mathew C. Burns, Gabriel C. Dalbec, Alexander W. Liston, Jonathan A. Logan, Margret B. Schmidt
  • Patent number: 11790905
    Abstract: An equipment and a method for configuring a service on an equipment. A method includes receiving a first voice input from a user to configure an equipment with a service. The equipment is configured with a voice-bot to interact with the user. The method also includes validating the first voice input, initiating configuration of the service and outputting a first voice response based on the validation of the first voice input. The method includes receiving a second voice input from the user in response to the first voice response and validating the second voice input. The method includes outputting a second voice response based on the validation of the second voice input and configuring the service on the equipment based on the voice inputs from the user.
    Type: Grant
    Filed: December 7, 2020
    Date of Patent: October 17, 2023
    Assignee: CARRIER CORPORATION
    Inventors: Karthikeyan Loganathan, Akil Vivek Jalisatgi
  • Patent number: 11785150
    Abstract: An image processing system capable of managing image data using a plurality of boxes, comprises a microphone that obtains a sound, an obtaining unit that obtains a user identifier based on voice information of a user obtained via the microphone, a specifying unit that specifies one box among the plurality of boxes based on specification information including at least the user identifier, and an informing unit that informs the user of information related to the specified one box.
    Type: Grant
    Filed: September 8, 2021
    Date of Patent: October 10, 2023
    Assignee: Canon Kabushiki Kaisha
    Inventor: Shintaro Okamura
  • Patent number: 11769497
    Abstract: Embodiments provide a context-aware digital assistant at multiple user devices participating in a video communication session by using context information from a first user device to determine a digital assistant response at a second user device. In this manner, users participating in the video communication session may interact with the digital assistant during the video communication session as if the digital assistant is another participant in the video communication session. Embodiments further describe automatically determining candidate digital assistant tasks based on a shared transcription of user voice inputs received at user devices participating in a video communication session. In this manner, a digital assistant of a user device participating in a video communication session may proactively determine one or more tasks that a user of the user device may want the digital assistant to perform based on conversations held during the video communication session.
    Type: Grant
    Filed: January 26, 2021
    Date of Patent: September 26, 2023
    Assignee: Apple Inc.
    Inventors: Niranjan Manjunath, Willem Mattelaer, Jessica Peck, Lily Shuting Zhang
  • Patent number: 11769018
    Abstract: Methods and systems for attention behavioral analysis for a conversational question and answer system are disclosed. A multi-modality input is selected from a plurality of multimodality conversations among two or more users. The system annotates the first modality inputs, and at least one attention region in the first modality input corresponding to a set of entities and semantic relationships in a unified modality is identified by a discrete aspect of information bounded by the attention elements. The system models the representations of the multimodality inputs at different levels of granularity, including entity level, turn level, and conversational level. The proposed method uses a network consisting of a multilevel encoder-decoder architecture to determine unified focalized attention and to analyze and construct one or more responses for one or more turns in a conversation.
    Type: Grant
    Filed: November 24, 2020
    Date of Patent: September 26, 2023
    Assignee: Openstream Inc.
    Inventor: Rajasekhar Tumuluri
  • Patent number: 11769508
    Abstract: Disclosed herein is an artificial intelligence apparatus including an input interface configured to receive speech data, and a processor configured to detect a non-utterance interval included in the speech data and determine presence/absence of a second utterance after the non-utterance interval according to characteristics of a first utterance before the non-utterance interval, when the non-utterance interval exceeds a set time.
    Type: Grant
    Filed: October 28, 2022
    Date of Patent: September 26, 2023
    Assignee: LG ELECTRONICS INC.
    Inventor: Hansuk Shim
  • Patent number: 11763074
    Abstract: Embodiments of the invention are directed to a system, method, or computer program product for digital form integration and presentation. The method includes extracting one or more data input fields from a form and generating one or more user prompts to be presented to a user in order to complete at least one of the one or more entries of the one or more data input fields. The method further includes causing the transmission of at least one of the one or more user prompts to the user and receiving a prompt response from the user. The method also includes determining whether the prompt response meets one or more form requirements for a given entry and updating the form upon determination that the prompt response meets one or more form requirements for a given entry. The method further includes displaying to the user the form in an appropriate channel format.
    Type: Grant
    Filed: December 2, 2021
    Date of Patent: September 19, 2023
    Assignee: BANK OF AMERICA CORPORATION
    Inventors: Indradeep Dantuluri, Charanjit S. Bagga, Muralidhar Chowdarapu, Burton M. Covnot, Sandeep Gandhi, Ryan Scott Heller, Saurabh Khanna, Silvia Adriana Krasuk, Mardochee Macxis, Walter Thomas Robinson, Rupal V. Shah, Mansoor Zafar
  • Patent number: 11756575
    Abstract: An electronic device and method are disclosed. The device includes a memory and speech recognition circuitry and/or a processor, which implements the method, including: receiving a first utterance, and processing the first utterance to initiate a session and generate a first response result, after the session related to the first utterance is terminated, receiving a second utterance, processing the second utterance to generate a second response result, based on the second response result, determining whether to execute follow-up utterance processing on the second utterance as if the session were active, based on determining to execute the follow-up utterance processing, reprocessing the second utterance based at least in part on the first response result related to the first utterance to generate a third response result, and outputting the third response result.
    Type: Grant
    Filed: February 22, 2022
    Date of Patent: September 12, 2023
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Jisu Son, Youngbin Kim, Jungkeun Cho
  • Patent number: 11747038
    Abstract: An occupancy tracking device is configured to receive a plurality of sound samples over a predetermined time period. The device is further configured to compute an audio signature for each sound sample. The audio signature includes a numerical value that uniquely identifies characteristics of an audio signal. The device is further configured to populate entries in a voice data log for the sound samples, to identify one or more clusters based on an audio signature that is associated with the populated entries, and to determine a number of clusters that are identified. The device is further configured to determine a predicted occupancy level based on the number of clusters that are identified and to control a Heating, Ventilation, and Air Conditioning (HVAC) system based on the predicted occupancy level.
    Type: Grant
    Filed: July 19, 2022
    Date of Patent: September 5, 2023
    Assignee: Lennox Industries Inc.
    Inventors: Sunil Bondalapati, Prasad Mecheri Chandravihar
  • Patent number: 11749276
    Abstract: Various embodiments discussed herein enable applications to seamlessly contribute to executing voice commands of users via voice assistant functionality. In response to receiving a user request to open an application or web page, the application can request and responsively receive a voice assistant runtime component along with the application or web page. The application, using a particular universal application interface component, can compile or interpret the voice assistant runtime component from a source code format to an intermediate code format. In response to the application or web page being rendered and the detection of a key word or phrase, the application can activate voice assistant command execution functionality. The user can issue a voice command, after which the application, along with specific services, can help execute the voice command.
    Type: Grant
    Filed: October 19, 2021
    Date of Patent: September 5, 2023
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Rene Huangtian Brandel, Jason Eric Voldseth, Biao Kuang
  • Patent number: 11749255
    Abstract: A voice question and answer method and device, a computer readable storage medium, and an electronic device are described. The method comprises: receiving question voice information, and obtaining question text information according to the question voice information; performing at least one of general semantic analysis processing and dedicated semantic analysis processing on the question text information to generate an analysis result; and obtaining answer information according to the analysis result. The general semantic analysis is used for semantic analysis in the general field, and the dedicated semantic analysis is used for semantic analysis in the art field.
    Type: Grant
    Filed: March 31, 2020
    Date of Patent: September 5, 2023
    Assignee: BOE TECHNOLOGY GROUP CO., LTD.
    Inventors: Chu Xu, Shuo Chen, Xingqun Jiang
  • Patent number: 11741953
    Abstract: Processor(s) of a client device can: receive sensor data that captures environmental attributes of an environment of the client device; process the sensor data using a machine learning model to generate a predicted output that dictates whether one or more currently dormant automated assistant functions are activated; make a decision as to whether to trigger the one or more currently dormant automated assistant functions; subsequent to making the decision, determine that the decision was incorrect; and in response to determining that the decision was incorrect, generate a gradient based on comparing the predicted output to ground truth output. In some implementations, the generated gradient is used, by processor(s) of the client device, to update weights of the on-device speech recognition model. In some implementations, the generated gradient is additionally or alternatively transmitted to a remote system for use in remote updating of global weights of a global speech recognition model.
    Type: Grant
    Filed: November 8, 2019
    Date of Patent: August 29, 2023
    Assignee: GOOGLE LLC
    Inventors: Françoise Beaufays, Rajiv Mathews, Dragan Zivkovic, Kurt Partridge, Andrew Hard
  • Patent number: 11741983
    Abstract: System and method for detecting and identifying noises in a sound signal occurring during a call on a mobile device and selectively filtering and suppressing the noises in the sound signal are provided. In the mobile device, a processor is configured to receive a sound signal, detect noises in the received sound signal, identify the noises in the received sound signal, display the identified noises in a user interface (UI), receive a selection of the displayed identified noises from the UI, and filter the received selection of the displayed identified noises from the received sound signal. The processor may use a machine learning module with a neural network to detect and identify the noises in the received sound signal.
    Type: Grant
    Filed: January 13, 2021
    Date of Patent: August 29, 2023
    Assignee: QUALCOMM Incorporated
    Inventors: Vishnu Vardhan Kasilya Sudarsan, Naga Chandan Babu Gudivada, Dinesh Ramakrishnan, Rameshkumar Karuppusamy
  • Patent number: 11736760
    Abstract: Methods, devices, systems, and means for video integration with a home assistant device are described herein. The home assistant device interacts with a person in a video stream by capturing, using a network-enabled outdoor video camera, a video stream of an outdoor location of a premises at which the person is present and analyzing the person appearing in the captured video stream to determine an identity of the person. Based on determining the identity of the person, the home assistant device announces a presence of the person that is outdoors and outputs instructions to the person.
    Type: Grant
    Filed: March 28, 2022
    Date of Patent: August 22, 2023
    Assignee: Google LLC
    Inventors: Jessica Yuan, James Stewart, Rajeev Nongpiur, Patrick Lister, Chi Yeung Jonathan Ng
  • Patent number: 11734581
    Abstract: Systems and methods provide an application programming interface to offer action suggestions to third-party applications using context data associated with the third-party. An example method includes receiving content information and context information from a source mobile application, the content information representing information to be displayed on a mobile device as part of a source mobile application administered by a third party, the context information being information specific to the third party and unavailable to a screen scraper. The method also includes predicting an action based on the content information and the context information, the action representing a deep link for a target mobile application. The method further includes providing the action to the source mobile application with a title and a thumbnail, the source mobile application using the title and thumbnail to display a selectable control that, when selected, causes the mobile device to initiate the action.
    Type: Grant
    Filed: May 19, 2021
    Date of Patent: August 22, 2023
    Assignee: GOOGLE LLC
    Inventors: Ibrahim Badr, Mauricio Zuluaga, Aneto Okonkwo, Gökhan Bakir
  • Patent number: 11736669
    Abstract: An image projection device which is mounted on a vehicle or worn by a driver of the vehicle includes a control unit and a projection unit. The control unit accepts a selection for specifying a particular user, detects a current location, acquires an image showing at least a part of an appearance of the particular user, which is captured when the particular user has previously driven a particular user's vehicle at a location which is the same as or near to the current location, and projects the image onto a driver's seat of the vehicle by the projection unit.
    Type: Grant
    Filed: April 21, 2022
    Date of Patent: August 22, 2023
    Assignee: TOYOTA JIDOSHA KABUSHIKI KAISHA
    Inventors: Toyokazu Nakashima, Ryuichi Kamaga, Mitsuhiro Miura, Yasuhiro Baba, Tomokazu Maya, Ryosuke Kobayashi, Genshi Kuno
  • Patent number: 11735163
    Abstract: Disclosed is a human-computer dialogue method including determining a set number of jump topics about a target topic, and generating a topic jump map converging to the target topic based on the correlation intensions among the set number of jump topics; after an initial response to a user's dialogue request, selecting from the topic jump map a jump topic to which the user's dialogue request relates as an initial topic for a first round of recommendation; after completing a human-machine dialogue of the initial topic, determining a jump topic to jump according to the jump probability of jumping out of the initial topic to the k jump topics at the downstream level for a next round of recommendation; and gradually guiding from the initial topic to the target topic by step-by-step recommendation. A more fluent and efficient human-machine dialogue based on a clear communication goal can be realized.
    Type: Grant
    Filed: January 22, 2019
    Date of Patent: August 22, 2023
    Assignee: AI SPEECH CO., LTD.
    Inventors: Min Chu, Taotao Guo, Zhongyuan Dai, Chao Yang
  • Patent number: 11727932
    Abstract: Methods, apparatuses, computer-readable media, and systems for using voice (e.g., voice commands) to control a plurality of network devices via a motion sensing control device are provided. A control device can detect movement (e.g., a gesture) associated with the control device. The control device can initiate either a direct or a proxy communication session with a remote computing device. The communication session can be established and maintained for a predefined period such that data associated with a command can be immediately transmitted to the remote computing device. Thus, data associated with the command can be transmitted over the already established communication session to the remote computing device. The remote computing device can provide a response to the control device and/or transmit a command code associated with the voice command to one or more devices intended to be controlled.
    Type: Grant
    Filed: March 25, 2022
    Date of Patent: August 15, 2023
    Assignee: Comcast Cable Communications, LLC
    Inventor: Michael Rekstad
  • Patent number: 11727218
    Abstract: According to one embodiment, a computer-implemented method for dynamically modifying placeholder text in a conversational interface includes: processing a conversation log reflecting a conversation between a human user and an automated agent; determining, based at least in part on the processing: one or more capabilities of the automated agent; and/or a trajectory of the conversation; and dynamically modifying placeholder text in the conversational interface based at least in part on: the one or more capabilities of the automated agent; the trajectory of the conversation; or both the one or more capabilities of the automated agent and the trajectory of the conversation. Other embodiments in the form of systems and computer program products are also disclosed.
    Type: Grant
    Filed: October 26, 2018
    Date of Patent: August 15, 2023
    Assignee: International Business Machines Corporation
    Inventors: Raphael I. Arar, Robert J. Moore, Guangjie Ren, Margaret H. Szymanski, Eric Y. Liu
  • Patent number: 11727220
    Abstract: Techniques are described related to prior context retrieval with an automated assistant. In various implementations, instance(s) of free-form natural language input received from a user during a human-to-computer dialog session between the user and an automated assistant may be used to generate a first dialog context. The first dialog context may include intent(s) and slot value(s) associated with the intent(s). Similar operations may be performed with additional inputs to generate a second dialog context that is semantically distinct from the first dialog context. When a command is received from the user to transition the automated assistant back to the first dialog context, natural language output may be generated that conveys at least one or more of the intents of the first dialog context and one or more of the slot values of the first dialog context. This natural language output may be presented to the user.
    Type: Grant
    Filed: March 22, 2022
    Date of Patent: August 15, 2023
    Assignee: GOOGLE LLC
    Inventors: Justin Lewis, Scott Davies
  • Patent number: 11726161
    Abstract: Disclosed are techniques for a multimedia device with audio and video capturing capability to identify an audio device based on an acoustic playback signal if the audio device cannot be identified from captured video. The multimedia device may assemble a list of candidate audio devices that are a possible match for the observed audio device from a database of previously recognized audio devices and may transmit commands to the candidate audio devices to play acoustic identification signals. The acoustic identification signals may be audible sound or ultrasonic tone sequences with embedded identification information unique to each audio device. The multimedia device may record and analyze the acoustic identification signals received from any of the candidate audio devices to construct metrics to select the most likely candidate for the observed audio device. The metrics may include time of flight, direction of arrival, received amplitude, and direct-to-reverberant ratio (DRR) of the acoustic identification signals.
    Type: Grant
    Filed: September 1, 2021
    Date of Patent: August 15, 2023
    Assignee: Apple Inc.
    Inventors: Christopher T. Eubank, Martin E. Johnson, Daniel K. Boothe, Jonathan D. Sheaffer
  • Patent number: 11721321
    Abstract: Systems and methods for identifying content corresponding to a language are provided. A language spoken by a first user is automatically determined with voice recognition circuitry based on verbal input received from the first user. A database of content sources is cross-referenced to identify a content source associated with a language field value that corresponds to the determined language spoken by the first user. The language field in the database identifies the language in which the associated content source transmits content to a plurality of users. A representation of the identified content source is generated for display to the first user.
    Type: Grant
    Filed: August 23, 2021
    Date of Patent: August 8, 2023
    Assignee: Rovi Guides, Inc.
    Inventor: Shuchita Mehra
  • Patent number: 11720238
    Abstract: Methods, systems, and apparatus for selecting an input mode are described. In one aspect, a method includes receiving request data specifying a request to launch a virtual assistant application from a lock screen of a mobile device. In response to receiving the request data, input signals are obtained. A selection of an input mode for the virtual assistant application is made, from candidate input modes, based on the input signals. Each candidate input mode is of an input type different from each other input type of each other candidate input mode. The input types include an image type and an audio type. The input mode of the image type receives pixel data for input to the virtual assistant application. The input mode of the audio type receives audio input for the virtual assistant application. The virtual assistant application presents content selected based on input signals received using the selected input mode.
    Type: Grant
    Filed: October 8, 2021
    Date of Patent: August 8, 2023
    Assignee: GOOGLE LLC
    Inventor: Ibrahim Badr
  • Patent number: RE49762
    Abstract: A method of updating speech recognition data including a language model used for speech recognition, the method including obtaining language data including at least one word; detecting a word that does not exist in the language model from among the at least one word; obtaining at least one phoneme sequence regarding the detected word; obtaining components constituting the at least one phoneme sequence by dividing the at least one phoneme sequence into predetermined unit components; determining information regarding probabilities that the respective components constituting each of the at least one phoneme sequence appear during speech recognition; and updating the language model based on the determined probability information.
    Type: Grant
    Filed: September 28, 2021
    Date of Patent: December 19, 2023
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Chi-youn Park, Il-hwan Kim, Kyung-min Lee, Nam-hoon Kim, Jae-won Lee
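
The steps listed in the abstract of RE49762 above (detect words missing from the language model, obtain their phoneme sequences, divide the sequences into unit components, estimate component probabilities, update the model) can be illustrated with a small Python sketch. This is only a toy reading of the abstract, not the patented method: the function names, the `lexicon` mapping from words to phoneme lists, the fixed-size chunking, and the relative-frequency probability estimate are all assumptions introduced here for illustration.

```python
from collections import Counter


def detect_oov(words, vocabulary):
    """Return words absent from the vocabulary, in order, without duplicates."""
    seen = set()
    oov = []
    for word in words:
        if word not in vocabulary and word not in seen:
            seen.add(word)
            oov.append(word)
    return oov


def split_components(phonemes, unit=2):
    """Divide a phoneme sequence into consecutive chunks of at most `unit` phonemes.

    Fixed-size chunking is an assumption; the abstract only says
    "predetermined unit components".
    """
    return [tuple(phonemes[i:i + unit]) for i in range(0, len(phonemes), unit)]


def update_model(component_counts, words, vocabulary, lexicon, unit=2):
    """Add components of out-of-vocabulary words to the counts and return
    relative-frequency probabilities for every component seen so far.

    `lexicon` is a hypothetical word -> phoneme-list mapping standing in for
    whatever pronunciation source a real system would use.
    """
    for word in detect_oov(words, vocabulary):
        for component in split_components(lexicon[word], unit):
            component_counts[component] += 1
    total = sum(component_counts.values())
    return {comp: count / total for comp, count in component_counts.items()}


if __name__ == "__main__":
    vocabulary = {"play", "music"}
    lexicon = {"kakao": ["k", "a", "k", "a", "o"]}
    counts = Counter()
    probabilities = update_model(
        counts, ["play", "kakao", "music", "kakao"], vocabulary, lexicon
    )
    for component, p in probabilities.items():
        print(component, round(p, 3))
```

A production recognizer would likely use smoothed n-gram or neural estimates over sub-word units rather than raw relative frequencies; the sketch only shows how the listed steps fit together.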