Speech Controlled System Patents (Class 704/275)
  • Patent number: 11967314
    Abstract: Systems and methods are disclosed herein for building contextual transcripts. A computing system may receive a textual transcript of a meeting that contains a variety of statements made by various attendees of the meeting, select the first statement made during the meeting, and determine which meeting attendee made the statement. A machine learning model corresponding to the particular attendee that has been trained using previously received statements by the particular attendee may be used on the utterance to determine the tone of the utterance. That tone may be recorded within the transcript and this process may be repeated for each utterance to build a contextual transcript.
    Type: Grant
    Filed: November 2, 2021
    Date of Patent: April 23, 2024
    Assignee: Capital One Services, LLC
    Inventors: Grant Eden, Jeremy Goodsitt, Austin Walters, Anh Truong
  • Patent number: 11961518
    Abstract: Provided is a voice control technique that responds quickly even when used in a planetarium. A control device of a projector of a planetarium includes: a storage unit that stores a plurality of commands for controlling the projector, flags indicating whether or not the respective commands can be executed, and keywords associated with the respective commands; a voice acquisition unit that acquires voice data; a control unit that controls the control device; and a communication unit that communicates with the projector.
    Type: Grant
    Filed: November 12, 2019
    Date of Patent: April 16, 2024
    Assignee: KONICA MINOLTA PLANETARIUM CO., LTD.
    Inventor: Kenichi Komaba
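The storage layout named in the abstract above (commands, per-command executable flags, and associated keywords) can be pictured with a small lookup sketch. This is an illustration only, not the patented implementation: the `Command` class, the `find_command` function, and the sample keywords are hypothetical, and substring matching on a transcript is an assumed stand-in for whatever recognition the control device actually performs.

```python
from dataclasses import dataclass


@dataclass
class Command:
    name: str         # identifier of the projector command (hypothetical)
    keyword: str      # spoken keyword associated with the command
    executable: bool  # flag: may the command be executed right now?


def find_command(transcript, commands):
    """Return the name of the first executable command whose keyword
    appears in the transcript, or None if nothing matches."""
    text = transcript.lower()
    for cmd in commands:
        if cmd.executable and cmd.keyword in text:
            return cmd.name
    return None


commands = [
    Command("show_stars", "stars", True),
    Command("lamp_off", "lamp off", False),  # flagged as not executable
]
print(find_command("Please show the stars", commands))
```

Checking the flag before matching keeps disabled commands from firing even when their keyword is spoken.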
  • Patent number: 11961509
    Abstract: Methods and systems are disclosed for improving dialog management for task-oriented dialog systems. The disclosed dialog builder leverages machine teaching processing to improve development of dialog managers. In this way, the dialog builder combines the strengths of both rule-based and machine-learned approaches to allow dialog authors to: (1) import a dialog graph developed using popular dialog composers, (2) convert the dialog graph to text-based training dialogs, (3) continuously improve the trained dialogs based on log dialogs, and (4) generate a corrected dialog for retraining the machine learning.
    Type: Grant
    Filed: April 3, 2020
    Date of Patent: April 16, 2024
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Swadheen Kumar Shukla, Lars Hasso Liden, Thomas Park, Matthew David Mazzola, Shahin Shayandeh, Jianfeng Gao, Eslam Kamal Abdelreheem
  • Patent number: 11960516
    Abstract: Methods and systems are provided herein for playing back indexed conversations based on the presence of other people. When a user asks a query, the system monitors the area, determines the other users in the area, and searches its database for a conversation that addresses the query in consideration of the other users present in the area. The system filters the indexed conversations to find conversations that included all the users present and determines the best matching conversation based on the words of the query as well as the keywords from the conversation. Once the system has determined the best match conversation, the system plays back the conversation to the user.
    Type: Grant
    Filed: September 24, 2020
    Date of Patent: April 16, 2024
    Assignee: Rovi Guides, Inc.
    Inventors: Michael McCarty, Glen E. Roe
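The filter-then-rank step described in the abstract above can be sketched roughly as follows. The record layout (`participants`, `keywords`), the function name `best_conversation`, and the word-overlap score are assumptions made for illustration; the abstract does not specify how the match is scored.

```python
def best_conversation(query, present_users, conversations):
    """Keep only conversations that included every user currently present,
    then pick the one whose keywords overlap most with the query words."""
    present = set(present_users)
    query_words = set(query.lower().split())

    candidates = [c for c in conversations if present <= set(c["participants"])]

    best, best_score = None, 0
    for conv in candidates:
        score = len(query_words & set(conv["keywords"]))
        if score > best_score:  # assumed scoring: count of shared words
            best, best_score = conv, score
    return best


conversations = [
    {"id": 1, "participants": {"ann", "bob"}, "keywords": {"pizza", "dinner"}},
    {"id": 2, "participants": {"ann"}, "keywords": {"pizza", "recipe"}},
    {"id": 3, "participants": {"ann", "bob", "cy"}, "keywords": {"holiday"}},
]
match = best_conversation("what about pizza for dinner", ["ann", "bob"], conversations)
print(match["id"] if match else None)
```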
  • Patent number: 11960789
    Abstract: A device may include a processor, a receiver, and a transmitter. The transmitter may be configured to transmit an audible signal, an inaudible signal, or both. The inaudible signal may be associated with a content identifier of the audible signal. The transmitter may be configured to transmit the audible signal, the inaudible signal, or both, to a first electronic device, a second electronic device, or both. The receiver may be configured to receive a first message that includes a first input and a second message that includes a second input. The processor may be configured to determine whether the first input matches the second input. The transmitter may be further configured to transmit the first message to the first service on a condition that the first input and the second input are determined to match.
    Type: Grant
    Filed: February 17, 2021
    Date of Patent: April 16, 2024
    Assignee: ROVI GUIDES, INC.
    Inventors: David D. Shoop, Dylan M. Wondra
  • Patent number: 11960698
    Abstract: Apparatus transmits an identifier for association with a virtual area by an administering network service, generates output data from human perceptible stimulus in a physical space, transmits the output data in connection with the virtual area, receives input data associated with the virtual area, and generates human perceptible stimulus in the physical space from the input data. A persistent association is created between the apparatus and a virtual area. A respective presence is established in the virtual area for a communicant operating a client network node connected to the virtual area. A respective connection between each active pair of complementary sources and sinks of the client network node and the apparatus are administered in association with the virtual area. A client network node displays a graphical user interface, establishes the administered connections, and presents interaction controls associated with the object for interacting with communicants in the physical space.
    Type: Grant
    Filed: November 9, 2020
    Date of Patent: April 16, 2024
    Assignee: Sococo, Inc.
    Inventor: David Van Wie
  • Patent number: 11960674
    Abstract: Disclosed are a display method and a display apparatus for operation prompt information of an input control. The display method includes: during a user's interaction with the display apparatus, obtaining an operation instruction from a user; in response to the operation instruction, determining a target interaction mode for the user's interaction with the display apparatus; in response to a start instruction of an input control generated by invoking of the input control in the target interaction mode, obtaining target operation prompt information for the input control; and generating the input box on the user interface and displaying the target operation prompt information in the input box.
    Type: Grant
    Filed: September 27, 2022
    Date of Patent: April 16, 2024
    Assignee: Hisense Visual Technology Co., Ltd.
    Inventor: Xuelei Wang
  • Patent number: 11954449
    Abstract: The disclosure discloses a method for generating a conversation, an electronic device, and a storage medium. The detailed implementation includes: obtaining a current conversation and historical conversations of the current conversation; selecting multiple reference historical conversations from the historical conversations and adding the multiple reference historical conversations to a temporary conversation set; and generating reply information of the current conversation based on the current conversation and the temporary conversation set.
    Type: Grant
    Filed: September 14, 2021
    Date of Patent: April 9, 2024
    Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.
    Inventors: Fan Wang, Siqi Bao, Xinxian Huang, Hua Wu, Jingzhou He
  • Patent number: 11947875
    Abstract: An apparatus for maintaining an event listing using voice control includes a sound capturing device configured to capture acoustic data and a computing device connected to the sound capturing device and configured to receive the acoustic data, identify a voice input based on the acoustic data using a voice recognition module, wherein the voice recognition module is configured to identify a target entity and identify event activation data, obtain entity data associated with the target entity containing historical event data, generate a voice-activated command using the voice input via a command interpretation module, wherein the command interpretation module is configured to determine a maintenance operation for an event related to the target entity as a function of the event activation data and the historical event data, maintain an event listing using the voice-activated command by executing the maintenance operation, and display the event listing using a user interface.
    Type: Grant
    Filed: September 13, 2023
    Date of Patent: April 2, 2024
    Assignee: Actriv Healthcare Inc.
    Inventor: Allan Njoroge
  • Patent number: 11941883
    Abstract: This application discloses a video classification method performed by a computer device. The method includes: obtaining an image frame sequence corresponding to a to-be-classified video file; obtaining an appearance information feature sequence corresponding to the image frame sequence by using an image classification network model, the appearance information feature sequence including T appearance information features; obtaining a motion information feature sequence corresponding to the appearance information feature sequence by using a motion prediction network model, the motion information feature sequence including T motion information features, and the motion prediction network model being configured to predict the motion information features corresponding to the appearance information features; and determining a video classification result of the to-be-classified video file according to the appearance information feature sequence and the motion information feature sequence.
    Type: Grant
    Filed: April 14, 2021
    Date of Patent: March 26, 2024
    Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    Inventors: Yongyi Tang, Lin Ma, Wei Liu
  • Patent number: 11934582
    Abstract: Systems, methods, and apparatuses disclosed herein can generate vibrations in relation to, for example, synchronous with, audio that is associated with an event being hosted at a venue. These systems, methods, and apparatuses can generate the vibrations at the one or more frequencies over the one or more intervals in time to provide physical sensations to an audience within the venue as the audience is viewing the event. These physical sensations can provide new immersive experiences to the audience as the audience is viewing the event. These systems, methods, and apparatuses can be mechanically coupled, for example, attached, to seats within the venue. The vibrations generated by these systems, methods, and apparatuses can propagate through the seats onto the audience to provide the new immersive experiences to the audience as the audience is viewing the event.
    Type: Grant
    Filed: November 16, 2022
    Date of Patent: March 19, 2024
    Assignee: MSG Entertainment Group, LLC
    Inventor: Robert Anderson
  • Patent number: 11928110
    Abstract: A database dependency resolver system can identify different dependencies of a user application and integrate the identified dependencies in different execution environments of a distributed database system. The different execution environments can manage different versions of a given programming language, or other types of computational architectures (e.g., different CPU types). A database user can provide a database statement (e.g., query) that activates the different dependencies in the different environments to generate results data.
    Type: Grant
    Filed: October 31, 2022
    Date of Patent: March 12, 2024
    Assignee: Snowflake Inc.
    Inventors: Srilakshmi Chintala, Chong Han, Albert L. Hu, Nitya Kumar Sharma, Igor Zinkovsky
  • Patent number: 11929081
    Abstract: An electronic apparatus is provided. The electronic apparatus may include a microphone; a memory configured to store a wakeup word; and a processor configured to: identify, based on context information of the electronic apparatus, an occurrence of a pre-determined event; change, based on the occurrence of the pre-determined event, a first threshold value for recognizing the wakeup word; obtain, based on a first user voice input received via the microphone, a similarity value between first text information corresponding to the first user voice input and the wakeup word; and perform, based on the similarity value being greater than or equal to the first threshold value, a voice recognition function on second text information corresponding to a second user voice input received via the microphone after the first user voice input.
    Type: Grant
    Filed: June 22, 2021
    Date of Patent: March 12, 2024
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Hejung Yang, Hyungjun Lim, Jaeyoung Roh, Yeaseul Song, Hojun Jin, Jubum Han
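A minimal sketch of the event-dependent wake-word threshold described in the abstract above. Several details are assumptions, not taken from the patent: the similarity measure (Python's `difflib.SequenceMatcher` ratio), the threshold values, the choice to lower rather than raise the threshold when the event occurs, and the placeholder wake word "hello device".

```python
from difflib import SequenceMatcher

BASE_THRESHOLD = 0.90   # assumed default threshold
EVENT_THRESHOLD = 0.70  # assumed relaxed threshold after the event


def similarity(text, wake_word):
    """Similarity in [0, 1] between recognized text and the stored wake word."""
    return SequenceMatcher(None, text.lower(), wake_word.lower()).ratio()


def current_threshold(event_occurred):
    """Change the threshold when the pre-determined event has occurred."""
    return EVENT_THRESHOLD if event_occurred else BASE_THRESHOLD


def wake_word_detected(text, wake_word, event_occurred):
    """True when the similarity meets or exceeds the active threshold."""
    return similarity(text, wake_word) >= current_threshold(event_occurred)


# A near miss is rejected normally but accepted once the event has occurred.
print(wake_word_detected("yellow device", "hello device", event_occurred=False))
print(wake_word_detected("yellow device", "hello device", event_occurred=True))
```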
  • Patent number: 11929073
    Abstract: A method for selecting a speech recognition result on a computing device includes receiving a first speech recognition result determined by the computing device, receiving first features, at least some of the features being determined using the first speech recognition result, determining whether to select the first speech recognition result or to wait for a second speech recognition result determined by a cloud computing service based at least in part on the first speech recognition result and the first features.
    Type: Grant
    Filed: October 3, 2022
    Date of Patent: March 12, 2024
    Assignee: Cerence Operating Company
    Inventor: Min Tang
  • Patent number: 11929075
    Abstract: Methods, systems, and apparatus for receiving, by a voice action system, data specifying trigger terms that trigger an application to perform a voice action and a context that specifies a status of the application when the voice action can be triggered. The voice action system receives data defining a discoverability example for the voice action that comprises one or more of the trigger terms that trigger the application to perform the voice action when a status of the application satisfies the specified context. The voice action system receives a request for discoverability examples for the application from a user device having the application installed, and provides the data defining the discoverability examples to the user device in response to the request. The user device is configured to provide a notification of the one or more of the trigger terms when a status of the application satisfies the specified context.
    Type: Grant
    Filed: July 23, 2020
    Date of Patent: March 12, 2024
    Assignee: GOOGLE LLC
    Inventors: Bo Wang, Sunil Vemuri, Barnaby John James, Pravir Kumar Gupta, Nitin Mangesh Shetti
  • Patent number: 11930332
    Abstract: The disclosure relates to a method for recognizing at least one naturally emitted sound produced by a real-life sound source in an environment comprising at least one artificial sound source. The method is implemented by an audio recognition device, and it includes simultaneously obtaining a first audio signal from a first microphone located in the environment and a second audio signal from an audio acquisition device associated with the at least one artificial sound source; analyzing the first audio signal, delivering a first list of sound classes corresponding to sounds recognized in the first audio signal; analyzing the second audio signal, delivering a second list of sound classes corresponding to sounds recognized in the second audio signal; and delivering a third list of sound classes, comprising only sound classes included in the first list of sound classes which are not included in the second list of sound classes.
    Type: Grant
    Filed: November 4, 2020
    Date of Patent: March 12, 2024
    Assignee: Thomson Licensing
    Inventors: Henk Heijnen, Philippe Gilberton, Eric Gautier
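The final step of the abstract above, delivering a third list that contains only the classes in the first list that are absent from the second, amounts to an ordered set difference. The sketch below assumes sound classes are plain strings; the function name and sample labels are hypothetical.

```python
def natural_sound_classes(mic_classes, artificial_classes):
    """Return the classes recognized in the room microphone signal that were
    not also recognized in the artificial source's signal, keeping the
    original order and dropping duplicates."""
    artificial = set(artificial_classes)
    result = []
    for sound_class in mic_classes:
        if sound_class not in artificial and sound_class not in result:
            result.append(sound_class)
    return result


first_list = ["speech", "doorbell", "music"]  # from the room microphone
second_list = ["speech", "music"]             # from the artificial source (e.g. a TV)
print(natural_sound_classes(first_list, second_list))
```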
  • Patent number: 11930236
    Abstract: A content reproduction apparatus includes an outputter configured to output audio and video, a user interface configured to receive an utterance input from a user, a memory storing one or more instructions, and a processor configured to execute the one or more instructions stored in the memory. The processor is configured to control the outputter to output a first screen in which one or more objects selectable by the user's utterance are included and a focus is displayed with respect to one of the one or more objects, and to control the outputter to output utterable guide information for a next selection according to the object corresponding to the focus displayed, when the user does not provide an utterance through the user interface.
    Type: Grant
    Filed: July 29, 2021
    Date of Patent: March 12, 2024
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Byungjeong Jeon, Jina Kwon, Yuri Min, Hansol Park
  • Patent number: 11923081
    Abstract: Apparatus and associated methods relate to the determination of local environmental air quality by processing data from a local device sensing a user's respiration-vocalization. In an illustrative example, respiration-vocalization for a CPAP user may be sensed by an airflow and/or air pressure sensor. Respiratory disturbance events, such as coughing, for example, may be detected. The sensed events, converted to respiration-vocalization data, may be collected to estimate the environmental air quality and/or particle density around the user. Some examples may estimate specific allergen concentrations by correlating user respiration-vocalization data with the respiration-vocalization data from users/patients with known airborne particle sensitivities. In some embodiments, regional environmental air quality data may be compared with respiration-vocalization data to produce local environmental air quality results.
    Type: Grant
    Filed: June 2, 2022
    Date of Patent: March 5, 2024
    Assignee: Honeywell International Inc.
    Inventors: Adam Dewey McBrady, Stephan Bork
  • Patent number: 11922356
    Abstract: Methods and systems for videoconferencing include generating work quality metrics based on emotion recognition of an individual such as a call center agent. The work quality metrics allow for workforce optimization. One example method includes the steps of receiving a video including a sequence of images, detecting an individual in one or more of the images, locating feature reference points of the individual, aligning a virtual face mesh to the individual in one or more of the images based at least in part on the feature reference points, dynamically determining over the sequence of images at least one deformation of the virtual face mesh, determining that the at least one deformation refers to at least one facial emotion selected from a plurality of reference facial emotions, and generating quality metrics including at least one work quality parameter associated with the individual based on the at least one facial emotion.
    Type: Grant
    Filed: October 29, 2019
    Date of Patent: March 5, 2024
    Assignee: SNAP INC.
    Inventors: Victor Shaburov, Yurii Monastyrshyn
  • Patent number: 11922938
    Abstract: A multi-assistant speech-processing system that centrally determines multiple execution plans to respond to a user input. A central component determines whether a particular input should be processed using a requested assistant or a different assistant or should be terminated. Assistant handoff may be determined based on system policies as well as user input-specific data. A ranked list of execution options may be supplemented by augmented data corresponding to messages to a user. The system may attempt to execute plans in the ranked order until a plan succeeds.
    Type: Grant
    Filed: November 22, 2021
    Date of Patent: March 5, 2024
    Assignee: Amazon Technologies, Inc.
    Inventors: Yaser Khan, Piyush Kandpal, Ritesh Patel, Mark Lawrence, Srinivas Palla, Ashish Rangole, Jason Wang
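The last sentence of the abstract above (attempting plans in ranked order until one succeeds) can be sketched as a simple loop. The plan representation here, a name paired with a callable that returns True on success, is a hypothetical simplification; treating an exception as a failed plan is likewise an assumption.

```python
def execute_ranked_plans(ranked_plans):
    """Try each (name, action) pair in ranked order and return the name of
    the first plan whose action reports success, or None if all fail."""
    for name, action in ranked_plans:
        try:
            succeeded = action()
        except Exception:
            succeeded = False  # assumed: an error counts as a failed plan
        if succeeded:
            return name
    return None


def failing_plan():
    raise RuntimeError("requested assistant unavailable")


plans = [
    ("requested_assistant", failing_plan),
    ("handoff_assistant", lambda: True),
    ("apology_message", lambda: True),
]
print(execute_ranked_plans(plans))
```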
  • Patent number: 11923054
    Abstract: An AI based platform for processing information collected during a medical procedure. A method includes capturing images and speech during a medical procedure; processing the images using a trained classifier to identify image-based quality-of-care indicators (QIs); converting the speech into text; parsing the text into sentences; performing a search and replace on predefined text patterns in the sentences; identifying text-based QIs in the sentences; classifying sentences into sentence types using a trained model; updating sentences by integrating the image-based QIs with text-based QIs; and outputting structured data that includes sentences organized by sentence type.
    Type: Grant
    Filed: July 19, 2022
    Date of Patent: March 5, 2024
    Assignee: UTECH PRODUCTS, INC.
    Inventors: Rakesh Madan, Zohair Hussain, Manish K. Madan
  • Patent number: 11915706
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for hotword detection on multiple devices are disclosed. In one aspect, a method includes the actions of receiving, by a first computing device, audio data that corresponds to an utterance. The actions further include determining a first value corresponding to a likelihood that the utterance includes a hotword. The actions further include receiving a second value corresponding to a likelihood that the utterance includes the hotword, the second value being determined by a second computing device. The actions further include comparing the first value and the second value. The actions further include, based on comparing the first value to the second value, initiating speech recognition processing on the audio data.
    Type: Grant
    Filed: January 5, 2023
    Date of Patent: February 27, 2024
    Assignee: Google LLC
    Inventor: Matthew Sharifi
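A rough sketch of the comparison step in the abstract above. The abstract says only that speech recognition is initiated based on comparing the two values; the rule used here (the device proceeds only when its own likelihood is strictly higher than the peer's) and the function names are assumptions for illustration.

```python
def should_initiate_recognition(first_value, second_value):
    """Decide on the first device whether to start speech recognition.
    Assumed rule: proceed only if the local hotword likelihood is
    strictly greater than the value reported by the second device."""
    return first_value > second_value


def responding_device(scores):
    """Given {device_name: hotword_likelihood}, return the device that
    would respond under the assumed highest-score-wins rule."""
    return max(scores, key=scores.get)


print(should_initiate_recognition(0.85, 0.60))
print(responding_device({"phone": 0.85, "speaker": 0.60, "watch": 0.40}))
```

Under this rule, only one of several nearby devices that hear the same utterance goes on to process it.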
  • Patent number: 11907272
    Abstract: Disclosed in some examples are methods, systems, machine-readable media, and devices which provide for real-time personalized suggestions for participants in a network-based communication service. The personalized suggestions may include options for taking actions, content suggestions, and smart replies. These suggestions may be based upon the current conversation and are delivered personally to each participant.
    Type: Grant
    Filed: February 17, 2017
    Date of Patent: February 20, 2024
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Nikrouz Ghotbi, Eddie Fusaro, John Alton Price, Jeff Roger DeVries
  • Patent number: 11907390
    Abstract: Disclosed are a method and an apparatus for visual construction of a knowledge graph system. In the present disclosure, data permission of a distributed client is determined through a central server. The central server obtains a master template of a knowledge graph system and sends it to the distributed client. The distributed client receives a natural language inputted by a user and parses to generate an abstract syntax tree. The user completes customization of a subtemplate of the knowledge graph system through visual operation. The distributed client encrypts the subtemplate and then sends it to the central server. When the knowledge graph system is to be used, any knowledge concept is inputted, the central server calls and decrypts the subtemplate and then searches a database, and a tree structure knowledge graph is generated and sent to the distributed client.
    Type: Grant
    Filed: June 16, 2023
    Date of Patent: February 20, 2024
    Assignee: ZHEJIANG LAB
    Inventors: Jingsong Li, Guangyuan Deng, Tianshu Zhou, Yu Tian
  • Patent number: 11908464
    Abstract: An electronic device and a method for controlling same are provided. The present electronic device comprises: a communication unit; and a processor configured to receive multiple audio signals via the communication unit, the multiple audio signals being acquired by multiple external electronic devices which have microphones, respectively, and which are positioned at different places, via microphones thereof, the processor being configured to determine at least one audio signal including a user voice uttered by a user among the multiple audio signals and to perform voice recognition regarding an audio signal acquired from the determined audio signals on the basis of the intensity of the determined audio signals.
    Type: Grant
    Filed: November 25, 2019
    Date of Patent: February 20, 2024
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Jaesun Shin, Joonrae Cho, Jeongman Lee
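The selection logic in the abstract above (find the signals that contain the user's voice, then choose among them by intensity) might look like the sketch below. Root-mean-square amplitude as the intensity measure, the `contains_voice` callback, and the room names are assumptions; the abstract does not say how voice presence or intensity is determined.

```python
import math


def rms(samples):
    """Root-mean-square amplitude, used here as an assumed intensity measure."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def select_signal(signals, contains_voice):
    """From {device: samples}, keep signals judged to contain the user's
    voice and return the device whose signal is the most intense."""
    voiced = {dev: s for dev, s in signals.items() if contains_voice(s)}
    if not voiced:
        return None
    return max(voiced, key=lambda dev: rms(voiced[dev]))


signals = {
    "kitchen": [0.1, -0.1, 0.1],
    "living_room": [0.5, -0.4, 0.5],
    "hallway": [0.0, 0.0, 0.0],
}
# Placeholder voice detector: anything above a small energy floor.
print(select_signal(signals, lambda s: rms(s) > 0.05))
```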
  • Patent number: 11900323
    Abstract: Systems and methods to generate units of work within a collaboration environment based on video dictation are described herein. Exemplary implementations may: manage environment state information maintaining a collaboration environment; responsive to the user-initiation of video dictation sessions, obtain video information characterizing content of the video dictation sessions; responsive to detection of completion of the video dictation sessions, generate one or more units of work for the users based on the content of the video dictation sessions; responsive to detection of completion of the video dictation sessions, store the video information as part of the environment state information; and/or perform other operations.
    Type: Grant
    Filed: September 29, 2020
    Date of Patent: February 13, 2024
    Assignee: Asana, Inc.
    Inventor: Alexander Hood
  • Patent number: 11899519
    Abstract: Systems, methods, and devices with reduced power consumption in network microphone devices. In one embodiment, a network microphone device is configured to perform a method that includes (i) capturing audio content; (ii) using a first algorithm to perform a keyword detection process for determining whether the audio content includes a keyword; (iii) responsive to determining that the audio content includes the keyword, using a second, more computationally intensive algorithm to perform a wake-word detection process for determining whether the audio content includes a wake word; and (iv) responsive to performing the wake-word detection process, (a) causing a voice service corresponding to the wake word to process the audio content if the wake-word detection process confirms that the audio content includes the wake word or (b) ceasing performance of the wake-word detection process if the wake-word detection process disconfirms that the audio content includes the wake word.
    Type: Grant
    Filed: October 23, 2018
    Date of Patent: February 13, 2024
    Assignee: Sonos, Inc.
    Inventors: Nick D'Amato, Daniele Giacobello, Joachim Fainberg, Klaus Hartung
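The two-stage cascade in steps (ii) through (iv) of the abstract above can be sketched as follows. The detectors here are toy string checks standing in for real audio models, and all function names and the sample phrases are hypothetical; the point is only that the costlier check runs when, and only when, the cheap check passes.

```python
def handle_audio(audio, keyword_detector, wake_word_detector, voice_service):
    """Run the cheap keyword check first, and invoke the more expensive
    wake-word check only if the cheap check passes."""
    if not keyword_detector(audio):
        return "ignored"            # expensive stage never runs
    if wake_word_detector(audio):
        voice_service(audio)        # confirmed: hand off to the voice service
        return "forwarded"
    return "rejected"               # disconfirmed: stop wake-word processing


expensive_calls = []
forwarded = []


def cheap_check(audio):
    return "hey" in audio           # toy first-stage detector


def costly_check(audio):
    expensive_calls.append(audio)   # track how often the costly stage runs
    return audio.startswith("hey speaker")


for clip in ["play music", "hey there", "hey speaker play jazz"]:
    print(clip, "->", handle_audio(clip, cheap_check, costly_check, forwarded.append))
```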
  • Patent number: 11900932
    Abstract: A voice dialogue system includes a voice input unit which acquires a user utterance, an intention understanding unit which interprets an intention of utterance of a voice acquired by the voice input unit, a dialogue text creator which creates a text of a system utterance, and a voice output unit which outputs the system utterance as voice data. When creating a text of a system utterance, the dialogue text creator creates the text by inserting a tag in a position in the system utterance, and the intention understanding unit interprets an utterance intention of a user in accordance with whether a timing at which the user utterance is made is before or after an output of a system utterance at a position corresponding to the tag from the voice output unit.
    Type: Grant
    Filed: July 2, 2021
    Date of Patent: February 13, 2024
    Assignee: TOYOTA JIDOSHA KABUSHIKI KAISHA
    Inventors: Atsushi Ikeno, Yusuke Jinguji, Toshifumi Nishijima, Fuminori Kataoka, Hiromi Tonegawa, Norihide Umeyama
  • Patent number: 11899706
    Abstract: Systems, apparatuses, and methods for providing content using notifications with content-specific keywords are provided. In one example embodiment, a method includes identifying, by one or more computing devices, a media content item for a user of a user device. The method includes generating, by the one or more computing devices, a keyword for the user of the user device based at least in part on data associated with the media content item. The keyword is indicative of the media content item. The method includes providing, by the one or more computing devices to the user device, for display a notification indicating that the media content is available for the user. The notification includes the keyword and the keyword is viewable by the user.
    Type: Grant
    Filed: July 30, 2021
    Date of Patent: February 13, 2024
    Assignee: GOOGLE LLC
    Inventors: Justin D. Lewis, Scott Tadashi Davies
  • Patent number: 11893311
    Abstract: A method includes determining, by an assistant executing at one or more processors, a default group of actions that the assistant is configured to execute in response to receiving a particular audible command. The method includes determining, by the assistant, based on the default group of actions and a user profile associated with a particular user, a custom group of actions that the assistant is configured to execute in response to receiving the particular audible command from the particular user. The method also includes receiving, by the assistant, an indication of the particular audible command, and determining, by the assistant, whether the indication of the particular audible command originated from the particular user. The method further includes, responsive to determining that the indication of the particular audible command originated from the particular user, executing, by the assistant, each action from the custom group of actions.
    Type: Grant
    Filed: January 12, 2023
    Date of Patent: February 6, 2024
    Assignee: GOOGLE LLC
    Inventors: Vikram Aggarwal, Michael Andrew Goodman
  • Patent number: 11893526
    Abstract: Systems and methods to implement customer contact service with real-time supervisor assistance. A supervisor may oversee multiple agents in a customer contact service. A service of a computing resource service provider may monitor a plurality of audio connections at a service of a computing resource service provider, generate transcripts for the plurality of audio data, analyze the transcripts using a set of natural language processing (NLP) techniques to generate metadata, tag the transcripts with categories based at least in part on the metadata, generate information for at least a portion of the plurality of connections based on the transcripts, metadata, and categories, and provide the information to a supervisor of the agents.
    Type: Grant
    Filed: November 27, 2019
    Date of Patent: February 6, 2024
    Assignee: Amazon Technologies, Inc.
    Inventors: Swaminathan Sivasubramanian, Vasanth Philomin, Vikram Anbazhagan, Ashish Singh, Atul Deo, Anuroop Arora, Colin Thomas Davidson, Jessie Young, Yasser El-Haggan
  • Patent number: 11893545
    Abstract: The disclosure provides a method, system, and a software program product for assisting a user and/or managing tasks of the user, by a mobile secretary cloud application configured to operate in a mobile client device and cloud server network. The mobile secretary cloud application reads data from another software application and operates at least one of another application and a third application based on the read data. Further, artificial intelligence is utilized by the mobile secretary cloud application for operating another application and the third application.
    Type: Grant
    Filed: April 11, 2023
    Date of Patent: February 6, 2024
    Inventor: Mikko Vaananen
  • Patent number: 11886821
    Abstract: Automated response generation systems and methods are disclosed. The systems can include a deep learning model specially configured to apply inferencing techniques to redesign natural language querying systems for use over knowledge graphs. The disclosed systems and methods provide a model for inferencing referred to as a Hierarchical Recurrent Path Encoder (HRPE). An entity extraction and linking module as well as a data conversion and generation module process the content of a given query. The output is processed by the proposed model to generate inferred answers.
    Type: Grant
    Filed: April 16, 2021
    Date of Patent: January 30, 2024
    Assignee: Accenture Global Solutions Limited
    Inventors: Shubhashis Sengupta, Annervaz K. M., Gupta Aayushee, Sandip Sinha, Shakti Naik
  • Patent number: 11887404
    Abstract: Techniques and systems are provided for authenticating a user of a device. For example, input biometric data associated with a person can be obtained. A similarity score for the input biometric data can be determined by comparing the input biometric data to a set of templates that include reference biometric data associated with the user. The similarity score can be compared to an authentication threshold. The person is authenticated as the user when the similarity score is greater than the authentication threshold. The similarity score can also be compared to a learning threshold that is greater than the authentication threshold. A new template including features of the input biometric data is saved for the user when the similarity score is less than the learning threshold and greater than the authentication threshold.
    Type: Grant
    Filed: December 2, 2021
    Date of Patent: January 30, 2024
    Assignee: QUALCOMM Incorporated
    Inventors: Eyasu Zemene Mequanint, Shuai Zhang, Yingyong Qi, Ning Bi
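The two-threshold logic in the abstract above can be sketched as follows. This is a minimal illustration, not the patented implementation: the cosine similarity measure, the feature vectors, and the names `TemplateStore` and `authenticate` are all assumptions made for the example. The abstract only states that a similarity score is compared to an authentication threshold and to a higher learning threshold.

```python
import math
from dataclasses import dataclass, field


@dataclass
class TemplateStore:
    """Hypothetical container for a user's reference templates and thresholds."""
    auth_threshold: float
    learn_threshold: float
    templates: list = field(default_factory=list)

    def __post_init__(self):
        # The abstract requires the learning threshold to exceed the authentication threshold.
        if self.learn_threshold <= self.auth_threshold:
            raise ValueError("learn_threshold must be greater than auth_threshold")


def cosine_similarity(a, b):
    """Assumed similarity measure; the abstract does not name one."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def authenticate(store, features):
    """Return (authenticated, score) and save a new template when the score
    falls between the authentication and learning thresholds."""
    score = max(
        (cosine_similarity(features, t) for t in store.templates),
        default=0.0,
    )
    authenticated = score > store.auth_threshold
    if authenticated and score < store.learn_threshold:
        # Accepted, but different enough from existing templates to be worth keeping.
        store.templates.append(list(features))
    return authenticated, score
```

Under these assumptions, an input that matches an existing template almost exactly authenticates without adding a template, while an input that clears the authentication threshold but not the learning threshold authenticates and extends the template set.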
  • Patent number: 11887598
    Abstract: In one aspect, a network microphone device includes a plurality of microphones and is configured to capture a voice input via the one or more microphones, detect a wake word in the voice input, transmit data associated with the voice input to one or more remote computing devices associated with a voice assistant service, and receive a response from the one or more remote computing devices, the response comprising a playback command based on the voice input. The network microphone device may be configured to obtain verification information characterizing the voice input and, based on the verification information indicating that the voice input was spoken by an unverified user, functionally disable the NMD from performing the playback command.
    Type: Grant
    Filed: December 2, 2022
    Date of Patent: January 30, 2024
    Assignee: Sonos, Inc.
    Inventor: Connor Kristopher Smith
  • Patent number: 11887591
    Abstract: Embodiments herein disclose methods and systems for providing a digital assistant in a device, which can generate responses to commands from a user based on ambience of the user. On receiving a command from the user of the device to perform an action, content stored in the device can be extracted. The embodiments include determining degree of privacy and sensitivity of the content. The embodiments include determining ambience of the user based on ambient noise, location of the device, presence of other humans, emotional state of the user, application parameters, user activity, and so on. The embodiments include generating a response and revealing the response based on the determined ambience and the degree of privacy and sensitivity of the extracted content. The embodiments include facilitating dialog with the user for generating appropriate responses based on the ambience of the user.
    Type: Grant
    Filed: June 24, 2019
    Date of Patent: January 30, 2024
    Inventors: Siddhartha Mukherjee, Udit Bhargava
  • Patent number: 11882505
    Abstract: A telecommunications system, that after a communication is established by a first electronic communication device and a second electronic communication device, while the conversation is ongoing between a first person using the first electronic communication device and a second person using the second electronic communication device, responsive to content of converted text based on a plurality of words spoken, route the content to a cloud-based phone recognition and entity identification, annotation, and relevance processing resource, to enable display of information related to the content by at least one of the first electronic communication device and the second electronic communication device.
    Type: Grant
    Filed: December 7, 2022
    Date of Patent: January 23, 2024
    Assignee: Eolas Technologies Inc.
    Inventors: David C. Martin, Michael D. Doyle
  • Patent number: 11882250
    Abstract: An image scanning device may comprise a microphone for receiving a voice, an image scanning section for scanning an image, and a control section for instructing the image scanning section to start scanning after a delay time elapses after the voice. The image scanning device may be able to scan a copy sheet, without being affected by an air flow caused by a voice when instructed to scan the copy sheet with a voice instruction.
    Type: Grant
    Filed: August 31, 2021
    Date of Patent: January 23, 2024
    Assignee: Konica Minolta, Inc.
    Inventor: Keita Ishihara
  • Patent number: 11881217
    Abstract: According to one embodiment, a method, computer system, and computer program product for solution guided generation of responses for dialog systems is provided. The embodiment may include receiving, by a processor, first voice data associated with a first user utterance in conversation in a guided dialog system. The embodiment may include identifying from the first voice data a first topic of a set of topics associated with the first user utterance. The embodiment may include identifying a first solution associated with the first topic, the first solution having one or more solution segments for accomplishing a task related to the topic. The embodiment may include generating a first response for a second user based on a first solution segment of the first solution and the first voice data.
    Type: Grant
    Filed: June 30, 2021
    Date of Patent: January 23, 2024
    Assignee: International Business Machines Corporation
    Inventors: Chulaka Gunasekara, Jatin Ganhotra, Sachindra Joshi
  • Patent number: 11881219
    Abstract: Systems for voice control of medical devices in a healthcare facility are disclosed herein. The systems employ continuous speech processing software, voice recognition software, natural language processing software, and other software to permit voice control of the medical devices. Systems are also provided for distinguishing which medical device from among multiple medical devices in a patient room is the particular medical device to be controlled by voice input from a caregiver or a patient.
    Type: Grant
    Filed: August 26, 2021
    Date of Patent: January 23, 2024
    Assignee: Hill-Rom Services, Inc.
    Inventors: Timothy J. Receveur, Dan R. Tallent, Richard J. Schuman, Eric D. Agdeppa, John S. Schroder, Catherine Infantolino, Sinan Batman, Kenzi L. Mudge, John V. Harmeyer
  • Patent number: 11880667
    Abstract: This application discloses an information conversion method and apparatus, a storage medium, and an electronic apparatus.
    Type: Grant
    Filed: May 22, 2020
    Date of Patent: January 23, 2024
    Assignee: Tencent Technology (Shenzhen) Company Limited
    Inventors: Jun Xie, Mingxuan Wang, Jiangquan Huang, Jian Yao
  • Patent number: 11875087
    Abstract: Aspects of the disclosure relate to generating outputs using a digital personal assistant computing control platform and machine learning. A computing platform may receive, from a digital personal assistant computing device, a first voice command input. The computing platform may then determine, via machine learning algorithms, an identifier output indicating a user associated with the first voice command input and a location output indicating a geographic location associated with the user. The computing platform may determine, via a stored calendar, an availability output indicating availability associated with the user. Based on the identifier output, the location output, and the availability output, a charitable opportunity output indicating a charitable opportunity may be determined by the computing platform and may be transmitted to a computing device associated with the charitable opportunity.
    Type: Grant
    Filed: February 20, 2023
    Date of Patent: January 16, 2024
    Assignee: Allstate Insurance Company
    Inventors: Elizabeth C. Schreier, Jamie E. Grahn
  • Patent number: 11869505
    Abstract: Embodiments herein relate to a local assistant system responding to voice input using an ear-wearable device. The system detects a wake-up signal and receives a first voice input communicating a first query content. The system includes speech recognition circuitry to determine the first query content, speech generation circuitry, and an input database of locally-handled user inputs. If the first audio input matches one of the locally-handled user inputs, then the system takes a local responsive action. If the first audio input does not match one of the locally-handled user inputs, then the system transmits at least a portion of the first query content over a wireless network to a network resource.
    Type: Grant
    Filed: January 26, 2022
    Date of Patent: January 9, 2024
    Assignee: Starkey Laboratories, Inc.
    Inventors: Achintya Kumar Bhowmik, David Alan Fabry, Amit Shahar, Justin R. Burwinkel, Jeffrey Paul Solum, Thomas Howard Burns
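The local-versus-network routing described in the abstract above might look roughly like this. It is a sketch under stated assumptions: the function `handle_query`, the dictionary of locally handled inputs, and the `send_to_network` callback are hypothetical names, and exact string matching stands in for whatever matching the speech recognition circuitry actually performs.

```python
def handle_query(query, local_inputs, send_to_network):
    """Route a recognized query: act locally if it matches a locally handled
    input, otherwise forward it to a network resource.

    local_inputs maps normalized query text to a zero-argument action
    (hypothetical structure). send_to_network is a hypothetical callback
    standing in for the wireless transmission step.
    """
    key = query.strip().lower()
    action = local_inputs.get(key)
    if action is not None:
        return ("local", action())
    return ("network", send_to_network(query))


# Example wiring with placeholder actions.
local_inputs = {
    "volume up": lambda: "volume increased",
    "mute": lambda: "muted",
}


def fake_network(query):
    # Placeholder for a real network request.
    return f"forwarded: {query}"
```

Calling `handle_query("Volume Up", local_inputs, fake_network)` takes the local branch, while an unrecognized query such as a weather question is passed to the callback.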
  • Patent number: 11868720
    Abstract: Techniques are described for training and/or utilizing sub-agent machine learning models to generate candidate dialog responses. In various implementations, a user-facing dialog agent (202, 302), or another component on its behalf, selects one of the candidate responses which is closest to user defined global priority objectives (318). Global priority objectives can include values (306) for a variety of dialog features such as emotion, confusion, objective-relatedness, personality, verbosity, etc. In various implementations, each machine learning model includes an encoder portion and a decoder portion. Each encoder portion and decoder portion can be a recurrent neural network (RNN) model, such as a RNN model that includes at least one memory layer, such as a long short-term memory (LSTM) layer.
    Type: Grant
    Filed: January 16, 2020
    Date of Patent: January 9, 2024
    Assignee: KONINKLIJKE PHILIPS N.V.
    Inventors: Vivek Varma Datla, Sheikh Sadid Al Hasan, Aaditya Prakash, Oladimeji Feyisetan Farri, Tilak Raj Arora, Junyi Liu, Ashequl Qadir
  • Patent number: 11869496
    Abstract: The present invention provides an information processing device that processes a voice-based agent interaction, and an information processing method, and provides an information processing system. The information processing device is provided with: a communication unit that receives information related to an interaction with a user through an agent residing in a first apparatus; and a control unit that controls an external agent service. The control unit collects the information that includes at least one among an image or a voice of the user, information related to operation of the first apparatus by the user, and sensor information detected by a sensor with which the first apparatus is equipped. The control unit controls calling of the external agent service.
    Type: Grant
    Filed: April 11, 2019
    Date of Patent: January 9, 2024
    Assignee: SONY CORPORATION
    Inventors: Masahiro Hara, Shinpei Kameoka
  • Patent number: 11870936
    Abstract: A system and method for routing a call from a customer to a customer service representative at a call center is described. The method being performed by an augmented intelligence system. The method includes receiving an incoming call from a customer at the call center. The method also includes determining a match between a classification of the customer and a classification of a selected customer service representative based on a profile of the customer and a profile of the selected customer service representative. The method further includes routing the incoming call from the customer to the selected customer service representative.
    Type: Grant
    Filed: June 24, 2021
    Date of Patent: January 9, 2024
    Assignee: United Services Automobile Association (USAA)
    Inventors: Thomas Wayne Schwarz, Jr., Joel S. Hartshorn, Ruthie D. Lyle
  • Patent number: 11870942
    Abstract: Systems and methods are described to enable a device of a user to automatically join an ongoing conference, where the device is not currently joined to the conference. A first audio signature is generated based on voices of users already in the conference, and a second audio signature is generated based on an audio signal captured by a microphone of the device associated with the first user when the device associated with the first user was not joined to the conference. The first audio signature and the second audio signature are compared, and in response to determining that first audio signature matches the second audio signature, the device associated with the first user is joined to the conference.
    Type: Grant
    Filed: October 27, 2022
    Date of Patent: January 9, 2024
    Assignee: Rovi Guides, Inc.
    Inventors: Srikanth Channapragada, Vikram Makam Gupta, Pooja Srivastava
  • Patent number: 11862159
    Abstract: A system and method establishes a communication connection between a first device of a first user and a second device of a second user. Request data corresponding to a request to establish a communication connection with a second user is received, and a user profile associated with the second user is determined. One or more sensors of the second device receive input data corresponding to the environment of the second device, and an identity of the second user is determined based thereon. The communication connection is established and, based on the identity, the second device tracks movement of the second user in the environment.
    Type: Grant
    Filed: September 2, 2021
    Date of Patent: January 2, 2024
    Assignee: Amazon Technologies, Inc.
    Inventors: Shambhavi Sathyanarayana Rao, Anna Chen Santos, Tony Roy Hardie
  • Patent number: 11862148
    Abstract: Systems and methods to analyze contacts data. Contacts data may be encoded as text (e.g., chat logs), audio (e.g., audio recordings), and various other modalities. A computing resource service provider may implement a service to obtain audio data from a client, transcribe the audio data, thereby generating text, execute one or more natural language processing techniques to generate metadata associated with the text, process at least the metadata to generate an output, determine whether the output matches one or more categories, and provide the output to the client. Techniques described herein may be performed as an asynchronous workflow.
    Type: Grant
    Filed: November 27, 2019
    Date of Patent: January 2, 2024
    Assignee: Amazon Technologies, Inc.
    Inventors: Swaminathan Sivasubramanian, Vasanth Philomin, Vikram Anbazhagan, Ashish Singh, Atul Deo, Anuroop Arora, Jessie Young, Harsh Yadav, Priyanka Shirish Kale
  • Patent number: 11862158
    Abstract: A method for controlling a device includes: collecting audio data where the device is located; determining whether each target frame of the audio data is a first type signal; in response to the target frame of the audio data being the first type signal, determining an acoustic event type represented by the first type signal; and controlling the device to execute control instructions corresponding to the acoustic event type.
    Type: Grant
    Filed: July 20, 2021
    Date of Patent: January 2, 2024
    Assignee: BEIJING XIAOMI PINECONE ELECTRONICS CO., LTD.
    Inventor: Chuming Liang
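The frame-by-frame control flow in the abstract above can be sketched as follows. Everything specific here is an assumption for illustration: the energy test for a "first type signal", the zero-crossing rule used to pick an event type, the event names, and the command table are all hypothetical, since the abstract does not say how frames are detected or classified.

```python
# Hypothetical mapping from acoustic event type to a device control instruction.
COMMANDS = {"clap": "toggle_light", "knock": "pause_playback"}


def is_first_type(frame, energy_threshold=0.1):
    """Assumed detector: treat a frame as a first type signal when its
    mean energy exceeds a threshold."""
    energy = sum(s * s for s in frame) / len(frame)
    return energy > energy_threshold


def classify_event(frame):
    """Assumed classifier: many sign changes are labeled 'clap', few 'knock'."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a < 0) != (b < 0))
    return "clap" if crossings >= len(frame) // 4 else "knock"


def control(frames):
    """Return the control instructions issued for a sequence of audio frames."""
    issued = []
    for frame in frames:
        if not frame or not is_first_type(frame):
            continue  # Not a first type signal: nothing to do for this frame.
        command = COMMANDS.get(classify_event(frame))
        if command is not None:
            issued.append(command)
    return issued
```

A real system would replace both helper functions with trained models; the point of the sketch is only the order of the steps: detect, classify, then dispatch.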