Speech Controlled System Patents (Class 704/275)

Systems and methods for processing message exchanges using artificial intelligence

Patent number: 11042910

Abstract: Systems and methods for processing automated message exchanges using artificial intelligence are provided. In some embodiments, a message is generated by populating variable fields within a message template with corresponding data from a knowledge set and/or a lead data set. Lead data is the data known about the intended recipient of the message, whereas the knowledge set is contextual knowledge useful for the artificial intelligence. Once the message has been generated, the system waits for a response from the lead. Once the response is received, the AI algorithms may categorize the response and generate a corresponding confidence value for the categorization. The categorization and confidence level are utilized to determine which subsequent action the system takes. The actions consist of sending a follow-up message, sending a subsequent message in the series, requesting user input, or discontinuing messaging.

Type: Grant

Filed: January 23, 2015

Date of Patent: June 22, 2021

Assignee: CONVERSICA, INC.

Inventor: Benjamin P. Brigham
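
The decision step described in the abstract of patent 11042910 (categorize the response, attach a confidence value, then pick the next action) can be sketched roughly as follows. This is an illustration only, not the patented method: the category names, the 0.8 threshold, and the `choose_action` function are assumptions introduced here.

```python
from enum import Enum


class Action(Enum):
    FOLLOW_UP = "send follow-up message"
    NEXT_IN_SERIES = "send next message in series"
    ASK_USER = "request user input"
    STOP = "discontinue messaging"


# Hypothetical mapping from response category to action; the abstract
# does not name the categories.
CATEGORY_TO_ACTION = {
    "question": Action.FOLLOW_UP,
    "interested": Action.NEXT_IN_SERIES,
    "not_interested": Action.STOP,
}


def choose_action(category: str, confidence: float, threshold: float = 0.8) -> Action:
    """Pick the next action from a categorized response.

    Low-confidence or unknown categories are escalated to a human user.
    """
    if confidence < threshold or category not in CATEGORY_TO_ACTION:
        return Action.ASK_USER
    return CATEGORY_TO_ACTION[category]
```

Routing low-confidence categorizations to a person is one plausible reading of "requesting user input"; the abstract does not say how the confidence level is actually applied.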
Identifying relevant messages in a conversation graph

Patent number: 11042599

Abstract: According to an aspect, a method for identifying relevant messages in a conversation graph on a messaging platform includes transmitting, over a network, messages between a plurality of users on the messaging platform, generating a conversation graph based on relationships between the messages, where the conversation graph includes a plurality of messages related to a conversation, and the plurality of messages of the conversation graph include a root message and one or more reply messages connected to the root message. The method includes marking a subset of the plurality of messages of the conversation graph as relevant to the conversation, including marking a message from a user account having a credibility rating over a threshold level, and transmitting, over the network, digital information to render the subset of the plurality of messages on a user interface of a computing device.

Type: Grant

Filed: April 7, 2020

Date of Patent: June 22, 2021

Assignee: Twitter, Inc.

Inventors: Ross Cohen, Kyle Maxwell, Stuart Hood, Cara Meverden, Coleen Baik, Marcel Molina
Adjusting user interface for touchscreen and mouse/keyboard environments

Patent number: 11042272

Abstract: Aspects of the subject technology relate to dynamically adjusting a UI based on the current modality. Layout features of a UI may be determined based on an input modality of a computing device. The arrangement of the UI elements may be determined based on the layout features and respective importance scores of the UI elements. The UI elements arranged based on the arrangement may be provided for display of the computing device.

Type: Grant

Filed: July 19, 2018

Date of Patent: June 22, 2021

Assignee: Google LLC

Inventors: Thomas Deselaers, Victor Carbune, Daniel Martin Keysers
Information processing apparatus and method

Patent number: 11042615

Abstract: An information processing apparatus displays a control, based on generation instructions of the control to accept a user operation, such that information indicating biometric authentication is requested is added to the control, executes a processing request corresponding to the control when the user operation on the displayed control is detected, and transmits, to a request destination of the processing request, data based on a result of biometric authentication using biometric information read by the information processing apparatus and stored biometric information. Information related to processing corresponding to the control is displayed based on the processing request and the data based on the result of the biometric authentication.

Type: Grant

Filed: June 22, 2018

Date of Patent: June 22, 2021

Assignee: Canon Kabushiki Kaisha

Inventor: Kotaro Matsuda
Technologies for pedometric sensing in footwear

Patent number: 11035692

Abstract: Technologies for pedometric sensing in footwear include a step tracker compute device. The step tracker compute device is to receive acceleration data indicative of movement of a foot of a user, generate energy contour data indicative of energy levels over time, based on the received acceleration data, determine dynamic energy thresholds indicative of peaks in the energy contour data, and detect steps of the user based on the dynamic energy thresholds and the energy contour data to generate step data. Other embodiments are described and claimed.

Type: Grant

Filed: June 20, 2016

Date of Patent: June 15, 2021

Assignee: Intel Corporation

Inventors: Mohsin Y. Ahmed, Suraj Sindia
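
The pipeline in the abstract of patent 11035692 (acceleration data, then an energy contour, then dynamic thresholds, then detected steps) might look something like the sketch below. The windowed mean-square energy, the running-mean threshold, and all parameter values are assumptions chosen for illustration; the abstract does not specify how the contour or thresholds are computed.

```python
from typing import List, Sequence


def energy_contour(accel: Sequence[float], window: int = 4) -> List[float]:
    """Mean squared acceleration over a sliding window (one value per sample)."""
    contour = []
    for i in range(len(accel)):
        chunk = accel[max(0, i - window + 1): i + 1]
        contour.append(sum(a * a for a in chunk) / len(chunk))
    return contour


def detect_steps(contour: Sequence[float], history: int = 20, scale: float = 1.5) -> List[int]:
    """Return indices of local peaks that exceed a dynamic threshold.

    The threshold at each sample is `scale` times the mean energy of the
    preceding `history` samples, so it adapts to the recent activity level.
    """
    steps = []
    for i in range(1, len(contour) - 1):
        recent = contour[max(0, i - history): i]
        threshold = scale * sum(recent) / len(recent)
        is_peak = contour[i - 1] < contour[i] >= contour[i + 1]
        if is_peak and contour[i] > threshold:
            steps.append(i)
    return steps
```

Because the threshold follows recent energy, the same code tolerates both light walking and vigorous running without a fixed cutoff, which is the apparent motivation for "dynamic" thresholds.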
Audio message processing method and apparatus

Patent number: 11037568

Abstract: Audio message processing methods and apparatuses are provided, where a method may include a server recognizing types of communication messages transmitted between communicating counterparties; when a type of any communication message is an audio type, the server acquiring the any communication message, and converting the any communication message to corresponding text content; and upon determining that any communicating party has a conversion need for the any communication message, the server sending the text content to the any communicating party. Through technical solutions of the present disclosure, text conversion may be performed upon audio messages in advance, thereby increasing response speed for audio conversion requests of users.

Type: Grant

Filed: September 26, 2018

Date of Patent: June 15, 2021

Assignee: Alibaba Group Holding Limited

Inventors: Daping Zhang, Lili Zhang, Yixin Huang, Yun Chen, Jiandong Lai, Haohua Zhong
System and method for creation and invocation of predefined print settings via speech input

Patent number: 11036441

Abstract: A system and method for language-based multifunction peripheral control includes receiving a user selection of an electronic document via a user interface of a wireless portable data device. User selected print settings are also received via the user interface for printing the selected electronic document. The selected electronic document is then printed in accordance with the user selected print settings. The user associates a verbal shortcut with their current print settings, and the verbal instruction and settings are stored associatively. When the user wishes to print again, they can select their document and issue verbal print instructions which include their verbal shortcut. Print settings associated with this verbal shortcut are retrieved from memory and the document is printed using the print settings.

Type: Grant

Filed: January 27, 2020

Date of Patent: June 15, 2021

Assignee: Toshiba TEC Kabushiki Kaisha

Inventor: Marianne Kodimer
Outcome-oriented dialogs on a speech recognition platform

Patent number: 11037572

Abstract: A speech recognition platform configured to receive an audio signal that includes speech from a user and perform automatic speech recognition (ASR) on the audio signal to identify ASR results. The platform may identify: (i) a domain of a voice command within the speech based on the ASR results and based on context information associated with the speech or the user, and (ii) an intent of the voice command. In response to identifying the intent, the platform may perform multiple actions corresponding to this intent. The platform may select a target action to perform, and may engage in a back-and-forth dialog to obtain information for completing the target action. The action may include streaming audio to the device, setting a reminder for the user, purchasing an item on behalf of the user, making a reservation for the user or launching an application for the user.

Type: Grant

Filed: November 15, 2019

Date of Patent: June 15, 2021

Assignee: Amazon Technologies, Inc.

Inventors: Jeff Bradley Beal, Kevin Robert Charter, Ajay Gopalakrishnan, Sumedha Arvind Kshirsagar, Nishant Kumar
Self-aware visual-textual co-grounded navigation agent

Patent number: 11029694

Abstract: An agent for navigating a mobile automated system is disclosed herein. The navigation agent receives a navigation instruction and visual information for one or more observed images. The navigation agent is provided or equipped with self-awareness, which provides or supports the following abilities: identifying which direction to go or proceed by determining the part of the instruction that corresponds to the observed images (visual grounding), and identifying which part of the instruction has been completed or ongoing and which part is potentially needed for the next action selection (textual grounding). In some embodiments, the navigation agent applies regularization to ensure that the grounded instruction can correctly be used to estimate the progress made towards the navigation goal (progress monitoring).

Type: Grant

Filed: October 31, 2018

Date of Patent: June 8, 2021

Assignee: salesforce.com, inc.

Inventors: Chih-Yao Ma, Caiming Xiong
Method and device for speech processing

Patent number: 11030991

Abstract: Disclosed are a speech processing method and a speech processing device, for performing speech processing by executing artificial intelligence (AI) algorithms and/or machine learning algorithms installed thereon, thus enabling the communication between a user terminal and a server in a 5G communication environment. The speech processing method according to an embodiment of the present disclosure includes receiving a user spoken utterance, outputting a voice actor spoken utterance in a voice actor's voice having the highest degree of similarity with a user's voice by using a user-voice actor mapping learning model, the voice actor spoken utterance corresponding to the user spoken utterance, and performing speech recognition for the voice actor spoken utterance.

Type: Grant

Filed: October 4, 2019

Date of Patent: June 8, 2021

Assignee: LG ELECTRONICS INC.

Inventor: Jong Hoon Chae
Systems and methods for personifying communications

Patent number: 11024293

Abstract: Systems and methods are described for personifying communications. According to at least one embodiment, the computer-implemented method for personifying a natural-language communication includes observing a linguistic pattern of a user. The method may also include analyzing the linguistic pattern of the user and adapting the natural-language communication based at least in part on the analyzed linguistic pattern of the user. In some embodiments, observing the linguistic pattern of the user may include receiving data indicative of the linguistic pattern of the user. The data may be one of verbal data or written data. Written data may include at least one of a text message, email, social media post, or computer-readable note. Verbal data may include at least one of a recorded telephone conversation, voice command, or voice message.

Type: Grant

Filed: August 12, 2018

Date of Patent: June 1, 2021

Assignee: Vivint, Inc.

Inventors: Jefferson Lyman, Nic Brunson, Wade Shearer, Mike Warner, Stefan Walger
Method and apparatus for acquiring and processing an operation instruction

Patent number: 11024314

Abstract: Systems and methods are provided for acquiring and processing operation instructions. The systems and methods may include receiving a first operation instruction. The system may determine an application associated with the first operation instruction, and determine whether the first operation instruction satisfies one or more predetermined conditions, wherein the predetermined conditions are associated with the application. The system may also, responsive to determining that the first operation instruction satisfies the predetermined conditions, and after the second operation instruction is received, invoke the application, and transmit the second operation instruction to the application for execution. The system may also, responsive to determining that the first operation instruction does not satisfy the predetermined conditions, execute the first operation instruction as a standalone instruction.

Type: Grant

Filed: December 23, 2015

Date of Patent: June 1, 2021

Assignee: BANMA ZHIXING NETWORK (HONGKONG) CO., LIMITED

Inventors: Xiaodong Li, Hongxing Wu
Systems and methods for using image searching with voice recognition commands

Patent number: 11024305

Abstract: Embodiments described herein include systems and methods for using image searching with voice recognition commands. Embodiments of a method may include providing a user interface via a target application and receiving a user selection of an area on the user interface by a user, the area including a search image. Embodiments may also include receiving an associated voice command and associating, by the computing device, the associated voice command with the search image.

Type: Grant

Filed: February 23, 2018

Date of Patent: June 1, 2021

Assignee: Dolbey & Company, Inc.

Inventor: Curtis A. Weeks
Wakeword detection using a secondary microphone

Patent number: 11024290

Abstract: Techniques for capturing spoken user inputs while a device is prevented from capturing such spoken user inputs are described. When a first device becomes incapable of capturing spoken user inputs intended for a system, a second device, for capturing such spoken user inputs, may be identified. The second device may be identified based on the second device being connected to a same vehicle computing system as the first device. The second device may be enabled to capture spoken user inputs, intended for the system, until the first device is again able to capture such spoken user inputs.

Type: Grant

Filed: February 11, 2019

Date of Patent: June 1, 2021

Assignee: Amazon Technologies, Inc.

Inventors: Andrew Mitchell, Gabor Nagy
Electronic device for processing multi-modal input, method for processing multi-modal input and server for processing multi-modal input

Patent number: 11023201

Abstract: An electronic device is provided. The electronic device includes a housing, a touchscreen display exposed through a first portion of the housing, a microphone disposed at a second portion of the housing, a speaker disposed at a third portion of the housing, a memory disposed inside the housing, a processor disposed inside the housing, and electrically connected to the display, the microphone, the speaker, and the memory. The memory is configured to store a plurality of application programs, each of which includes a graphic user interface (GUI).

Type: Grant

Filed: January 28, 2019

Date of Patent: June 1, 2021

Assignee: Samsung Electronics Co., Ltd.

Inventors: In Jong Rhee, Ji Min Lee, Sang Ki Kang, Han Jun Ku, Sung Pa Park, Jang Seok Seo, In Wook Song, Won Ick Ahn, Kyoung Gu Woo, Ji Soo Yi, Chang Kyun Jeon, Ho Jun Jaygarl, Il Hwan Choi, Yoo Jin Hong, Ji Hyun Kim, Jae Yung Yeo
Systems and methods for maintaining a conversation

Patent number: 11018997

Abstract: Systems and methods for an interactive communications system capable of generating a response to conversational input are provided. The interactive communications system analyzes the conversational input to determine relevant topics of discussion. The interactive communications system further determines which of the relevant topics of discussion can potentially lead to an unwanted end to a conversation. The interactive communications system redirects the conversation by providing responses to the conversational input that are intended simply to avoid the unwanted end to the conversation.

Type: Grant

Filed: April 12, 2018

Date of Patent: May 25, 2021

Assignee: Disney Enterprises, Inc.

Inventors: Raymond Scanlon, Douglas Fidaleo
System and method for dynamic dialog control for contact center system

Patent number: 11019205

Abstract: A method for engaging in an automated dialog with a user that includes: retrieving a preset dialog flow that includes various blocks directing the dialog with the user; providing a prompt to the user based on a current block of the dialog flow; receiving an action from the user in response to the prompt; and retrieving a classification tree corresponding to the dialog flow. The classification tree has a plurality of nodes mapped to the blocks of the dialog flow representing user intents. The processor computes a probability for each of the nodes based on the action from the user. A particular one of the nodes is then selected based on the computed probabilities. A target block of the dialog flow is identified based on the selected node, and a response is output in response to the identified target block.

Type: Grant

Filed: March 14, 2020

Date of Patent: May 25, 2021

Inventors: Conor McGann, Ioana Grigoropol, Mariya Orshansky, Ankit Pat
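
The selection step in the abstract of patent 11019205 (compute a probability per intent node, pick a node, map it to a target block) can be illustrated with the sketch below. A flat set of nodes scored by keyword overlap stands in for the classification tree and its classifier; the node names, keywords, block names, and both functions are hypothetical.

```python
import math
from typing import Dict, Set

# Hypothetical intent nodes, each mapped to a block of the dialog flow.
NODE_KEYWORDS: Dict[str, Set[str]] = {
    "billing": {"bill", "charge", "invoice", "payment"},
    "tech_support": {"error", "broken", "crash", "reset"},
    "cancel": {"cancel", "close", "terminate"},
}
NODE_TO_BLOCK = {
    "billing": "block_billing_menu",
    "tech_support": "block_troubleshoot",
    "cancel": "block_retention_offer",
}


def node_probabilities(utterance: str) -> Dict[str, float]:
    """Softmax over keyword-overlap counts, standing in for a real classifier."""
    words = set(utterance.lower().split())
    scores = {node: len(words & kws) for node, kws in NODE_KEYWORDS.items()}
    total = sum(math.exp(s) for s in scores.values())
    return {node: math.exp(s) / total for node, s in scores.items()}


def select_target_block(utterance: str) -> str:
    """Choose the most probable node and return the dialog block mapped to it."""
    probs = node_probabilities(utterance)
    best_node = max(probs, key=probs.get)
    return NODE_TO_BLOCK[best_node]
```

For instance, `select_target_block("I have a charge on my bill")` resolves to the billing block because two billing keywords match and none of the others do.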
Method and system for implementing an elastic cloud-based voice search utilized by set-top box (STB) clients

Patent number: 11019402

Abstract: Systems and methods are described to provide voice search in an elastic cloud environment communicating with a set-top box (STB) by receiving by a voice cloud search server pulse-code modulation (PCM) audio packets transmitted from the STB; sending the PCM audio packets to a natural language processing (NLP) service for converting to text; sending the text sets to an elastic voice cloud search server for querying an electronic program guide (EPG) service, channel and program data associated with the text sets wherein the EPG service is to at least return identified channel and program data; in response to an identified return of channel and television program data, sending sets of text to a search service for performing an elastic search for related data from a plurality of different search sources and returning search results and error codes to a requester.

Type: Grant

Filed: October 17, 2019

Date of Patent: May 25, 2021

Assignee: DISH Network L.L.C.

Inventors: James Wilde, Ashok Soni, Hawk McGinty, James Shuler, Lixing Zhang, Michael Disante, Narayanan Sekhar, Xiaomei Sun, Xinhua Yang
Conversational agent

Patent number: 11010556

Abstract: A method includes converting a user's utterance to text; encapsulating the converted text in a rheme object; searching, for each of a plurality of topics, for keywords in the converted text; determining a relevancy metric for each of the plurality of topics based on such searching; selecting one or more topics based on determined relevancy metrics; comparing some or all of the converted text to names in one or more patient lists or databases; identifying a unique patient whose name is contained in the converted text; attaching an indication of the identified patient to the rheme object; effecting an action based on the selected one or more topics and the attached patient indication; and saving the topic in a conversation history with a reference to the identified patient.

Type: Grant

Filed: April 24, 2018

Date of Patent: May 18, 2021

Assignee: ALLSCRIPTS SOFTWARE, LLC

Inventors: Matthew Dreselly Thomas, William Loftus, Harry Wepuri, Arif Ogan
Inferring confidence and need for natural language processing of input data

Patent number: 11010566

Abstract: Improved data ingestion techniques are provided. A data set comprising records is received, where each record contains one or more fields. A group of fields is identified, where each of the fields has a common metadata attribute. Metrics are determined for the group based on metadata associated with each field, and weight values are assigned to each of the metrics. A natural language processing (NLP) measure and a discreteness measure are generated for the group of fields based on the metrics and the weight values. A processing workflow is selected to use when ingesting data from the group of fields into a corpus, based on comparing the NLP measure and the discreteness measure to one or more predefined thresholds, and each of the fields in the group of fields is processed using the processing workflow.

Type: Grant

Filed: May 22, 2018

Date of Patent: May 18, 2021

Assignee: International Business Machines Corporation

Inventors: Troy Biesterfeld, Andrew R Freed, Elizabeth Teresa Dettman, Jeremy J Salsman, Paul R Chmielewski
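
The scoring described in the abstract of patent 11010566 (weighted metrics combined into an NLP measure and a discreteness measure, then compared with thresholds to choose a workflow) can be sketched as below. The metric names, the weighted-average formula, the 0.6 thresholds, and the three workflow labels are assumptions for illustration; the abstract names none of them.

```python
from typing import Dict


def weighted_measure(metrics: Dict[str, float], weights: Dict[str, float]) -> float:
    """Weighted average of the metrics named in `weights` (each metric in [0, 1])."""
    total_weight = sum(weights.values())
    return sum(metrics[name] * w for name, w in weights.items()) / total_weight


def select_workflow(metrics: Dict[str, float],
                    nlp_weights: Dict[str, float],
                    discrete_weights: Dict[str, float],
                    nlp_threshold: float = 0.6,
                    discrete_threshold: float = 0.6) -> str:
    """Compare both measures with their thresholds and name a workflow."""
    nlp = weighted_measure(metrics, nlp_weights)
    discreteness = weighted_measure(metrics, discrete_weights)
    if nlp >= nlp_threshold and discreteness < discrete_threshold:
        return "nlp_pipeline"          # free text: run full NLP
    if discreteness >= discrete_threshold and nlp < nlp_threshold:
        return "structured_ingest"     # discrete values: ingest as-is
    return "manual_review"             # ambiguous: neither measure dominates
```

A group of long, highly varied text fields would score high on the NLP measure and low on discreteness, so it would be routed to the NLP pipeline under these assumed thresholds.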
Refinement of voice query interpretation

Patent number: 11003419

Abstract: A system for refinement of a voice query interpretation interprets a voice query received at a voice-enabled device to identify commands responsive to the voice query for execution at the voice-enabled device, and enables refinement of the interpretation of the voice query through a graphical user interface generated and displayed at a GUI-capable device. The graphical user interface includes a set of selectable options relating to the voice query and identifying a refinement of the interpretation of the voice query to enable control and/or adjustment of commands to be executed by the voice-enabled device. For example, if one of the selectable options is selected, then a command associated with the selected option is identified and executed by the voice-enabled device.

Type: Grant

Filed: May 24, 2019

Date of Patent: May 11, 2021

Assignee: Spotify AB

Inventors: Philip Glenny Edmonds, Matthew Joseph Kane, Joshua Pham, Eder G. Bastos, Marcus Daniel Better, Adithya Kalyan Tammavarapu, Amilcar Andrade Garcia, Chen Ye Li, Adam Jonathan Shonkoff, Aaron Paul Harmon, Christopher Phair, Ching Chuan Sung
Training of chatbots from corpus of human-to-human chats

Patent number: 11004013

Abstract: Automated (autonomous) and computer-assisted preparation of initial training patterns for an Artificial Intelligence (AI) based automated conversational agent system, such as an AI-based chatbot, includes a computer processor accessing a corpus of digital weighted conversation models representing text-based interlocutory conversations, wherein each digital weighted conversation model contains annotations and paths, and wherein each path in each digital weighted conversation model is associated with a weight; selecting a plurality of the conversations which meet at least one criteria and in which at least one path meets at least one weight threshold according to the plurality of digital weighted conversation models; converting the weights associated with the selected conversations into initial training pattern values according to at least one Artificial Intelligence (AI) based automated conversational agent system; and exporting the training pattern values to at least one Artificial Intelligence (AI) based aut

Type: Grant

Filed: January 6, 2020

Date of Patent: May 11, 2021

Assignee: DISCOURSE.AI, INC.

Inventor: Jonathan E. Eisenzopf
Alias resolving intelligent assistant computing device

Patent number: 11004446

Abstract: Intelligent assistant systems, methods and computing devices are disclosed for resolving alias identifiers. A method comprises receiving and parsing data comprising a current user input that includes an alias identifier. The data and/or other sensor data are analyzed to identify the user. Based at least on identifying the user and recognizing the alias identifier, usage pattern data comprising at least one previous user input that includes the alias identifier and corresponding context information is accessed. The usage pattern data is used to resolve the alias identifier to mean the alias identifier in an alias record of a known entity. Based at least on resolving the alias identifier, an output device is controlled to one or more of generate a message and perform an action with respect to the known entity.

Type: Grant

Filed: June 30, 2017

Date of Patent: May 11, 2021

Assignee: Microsoft Technology Licensing, LLC

Inventors: Erich-Soren Finkelstein, Han Yee Mimi Fung, Oz Solomon
System and method for learning preferences in dialogue personalization

Patent number: 11003860

Abstract: The present teaching relates to method, system, medium, and implementations for user machine dialogue. Historic dialogue data related to past dialogues are accessed and used to learn, via machine learning, expected utilities. During a dialogue involving a user and a machine agent, a representation of a shared mindset between the user and the agent is obtained to characterize the current state of the dialogue, which is then used to update the expected utilities. Continuous expected utility functions are then generated based on the updated expected utilities, wherein the continuous expected utility functions are to be used in determining how to conduct a dialogue with a user.

Type: Grant

Filed: December 27, 2018

Date of Patent: May 11, 2021

Assignee: DMAI, INC.

Inventors: Rui Fang, Changsong Liu
Interactive dialog training and communication system using artificial intelligence

Patent number: 11003863

Abstract: A system for training and deploying an artificial conversational entity using an artificial intelligence (AI) based communications system is disclosed. The system may comprise a memory storing machine readable instructions. The system may also comprise a processor to execute the machine readable instructions to receive a request via an artificial conversational entity. The processor may also transmit a response to the request based on a dialog tree generated from at least a model-based action generator and a memory-based action generator. The processor may further provide a training option to a user in the event the response is suboptimal. The processor may additionally receive a selection from the user via the training option. The selection may be associated with an optimal response.

Type: Grant

Filed: March 22, 2019

Date of Patent: May 11, 2021

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Matthew Brigham Hall, Weizhu Chen, Junyan Chen, Pengcheng He, Yu Zhao, Yi-Min Wang, Yuting Sun, Zheng Chen, Katherine Winant Osborne
Use of human input recognition to prevent contamination

Patent number: 10997444

Abstract: Embodiments of a system and method for processing and recognizing non-contact types of human input to prevent contamination are generally described herein. In example embodiments, human input is captured, recognized, and used to provide active input for control or data entry into a user interface. The human input may be provided in a variety of forms detectable by recognition techniques such as speech recognition, gesture recognition, identification recognition, and facial recognition. In one example, the human input recognition techniques are used in connection with a device cleaning workflow used to obtain data and human input during cleaning procedures while minimizing cross-contamination between the contaminated device or person and other objects or persons. In another example, the human input recognition techniques are used in connection with a device tracking workflow used to obtain data and human input while tracking interactions with and locations of the contaminated or uncontaminated device.

Type: Grant

Filed: October 3, 2019

Date of Patent: May 4, 2021

Assignee: Medivators Inc.

Inventors: Ward Sly, Stacy Lemmer, Terry Mistalski, Michael Petersen
System and method for screening a patient's foot

Patent number: 10993655

Abstract: A foot screening system configured to aid in screening a plantar surface of a foot of a user for sores, ulcers, and other signs of diseases. The foot screening system including a foot platform including a foot contacting surface configured to serve as a foot stabilizing device during a screening of a plantar surface of a foot of a user, a camera stabilizer platform configured to support a mobile computing device at a desired angle and distance from the foot platform, and a user interface configured to aid a user in capturing one or more images of the plantar surface of the foot of the user, flagging the one or more images for additional review, and uploading the one or more images to a network for access by a healthcare provider.

Type: Grant

Filed: May 4, 2020

Date of Patent: May 4, 2021

Inventor: Mark Swerdlow
Electronic device microphone listening modes

Patent number: 10993057

Abstract: A wide range, non-focused listening mode of a microphone of an electronic device can be set to be selectively less than a maximal range around the device. The microphone can be operated in the wide range, non-focused listening mode to detect a spoken trigger phrase. The microphone can then be operated in a narrow range, focused listening mode directed towards a location from which the spoken trigger phrase was detected in the wide range, non-focused listening mode.

Type: Grant

Filed: April 21, 2016

Date of Patent: April 27, 2021

Assignee: Hewlett-Packard Development Company, L.P.

Inventors: David H. Hanes, Jon R. Dory
Spoken keyword detection based utterance-level wake on intent system

Patent number: 10984783

Abstract: An embodiment of a wake-on-intent speech recognition device includes technology to detect one or more keywords in a digital representation of a spoken natural language utterance, determine an intent of the spoken natural language utterance based on the detected keywords, and provide the spoken natural language utterance to a speech recognition and interpretation system if the determined intent is to further process the spoken natural language utterance. Other embodiments are disclosed and claimed.

Type: Grant

Filed: March 27, 2019

Date of Patent: April 20, 2021

Assignee: Intel Corporation

Inventors: Wenda Chen, Jonathan Huang, Tobias Bocklet, Munir Georges
Server and method for controlling server

Patent number: 10986391

Abstract: A display apparatus and a server which implements an interactive system are disclosed. The server includes a communicator which receives text information corresponding to a user voice collected at the display apparatus from the display apparatus, and a controller which extracts an utterance component from the text information and controls so that a query to search contents is generated using the extracted utterance component and transmitted to an external server which categorizes metadata of the content under each item and stores the same, in which the controller generates the query by adding a preset item to a criteria to search a content, when a number of criteria to categorize the content under an item corresponding to the extracted utterance component is less than a preset number.

Type: Grant

Filed: June 6, 2016

Date of Patent: April 20, 2021

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Yong-wook Shin, Seung-min Shin, Sung-wook Choi, Hye-jeong Lee, Ji-hye Chung
Speaker verification using co-location information

Patent number: 10986498

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying a user in a multi-user environment. One of the methods includes receiving, by a first user device, an audio signal encoding an utterance, obtaining, by the first user device, a first speaker model for a first user of the first user device, obtaining, by the first user device for a second user of a second user device that is co-located with the first user device, a second speaker model for the second user or a second score that indicates a respective likelihood that the utterance was spoken by the second user, and determining, by the first user device, that the utterance was spoken by the first user using (i) the first speaker model and the second speaker model or (ii) the first speaker model and the second score.

Type: Grant

Filed: September 17, 2019

Date of Patent: April 20, 2021

Assignee: Google LLC

Inventors: Raziel Alvarez Guevara, Othar Hansson
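
Option (ii) in the abstract of patent 10986498, deciding from the first user's own score and the scores reported by co-located devices, could be approximated as in the sketch below. The function name, the 0.5 floor, and the rule "highest score wins" are assumptions made here; the abstract does not state how the scores are compared.

```python
from typing import Dict


def spoken_by_first_user(first_score: float,
                         colocated_scores: Dict[str, float],
                         min_score: float = 0.5) -> bool:
    """Attribute the utterance to the first user only if their likelihood
    clears a floor and beats every score reported by a co-located device."""
    best_other = max(colocated_scores.values(), default=0.0)
    return first_score >= min_score and first_score > best_other
```

Comparing against co-located users rather than a fixed threshold alone is what lets a device ignore a nearby speaker whose voice happens to score moderately well against the owner's model.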
Dialogue management system with hierarchical classification and progression

Patent number: 10984034

Abstract: A dialogue management system applies hierarchical classifiers and other natural language processing to dialogue input, and determines whether performance of an action is likely to occur. The dialogue management system may process dialogue input to assess a dialogue participant's current position in various hierarchies or other classification schemes associated with performance of a desired action. The system may then present results of the assessment to another dialogue participant or provide the results to another system. In some embodiments, the dialogue management system may automatically generate responses or questions designed to engage a dialogue participant and cause the participant to progress through the levels of a hierarchy toward performance of a desired action.

Type: Grant

Filed: October 5, 2017

Date of Patent: April 20, 2021

Assignee: Cyrano.ai, Inc.

Inventors: Scott Douglas Sandland, Daniel Paris
Intelligent personal assistant controller where a voice command specifies a target appliance based on a confidence score without requiring uttering of a wake-word

Patent number: 10979242

Abstract: Embodiments of the present disclosure pertain to a personal assistant controller. In one embodiment, the present disclosure includes a computer implemented method comprising receiving a voice audio signal in the personal assistant controller, converting the voice audio signal into a target command corresponding to one of a plurality of personal assistants, wherein different personal assistants comprise different target command protocols for executing different operations on different network enabled appliances, and sending the target command for execution by a backend system corresponding to the one of the plurality of personal assistants, and in accordance therewith, performing an operation on the backend system.

Type: Grant

Filed: June 5, 2018

Date of Patent: April 13, 2021

Assignee: SAP SE

Inventors: Alexander Ocher, Andrey Belyy, Viktor Lapitski
Mechanism for retrieval of previously captured audio

Patent number: 10976990

Abstract: In one aspect, a device-side audio handling input/output unit (DIO) of a hardware device writes audio data generated by the hardware device within a ring buffer. An input provided by a user for activation of a software program is received, and a notification that the software program is ready to accept the audio data is generated. A system-side audio handling input/output unit (SIO) additionally provides past audio data from the ring buffer to the software program. Other aspects are also described.

Type: Grant

Filed: August 28, 2019

Date of Patent: April 13, 2021

Assignee: APPLE INC.

Inventors: Jeffrey C. Moore, Richard M. Powell, Alexander C. Powers, Anthony J. Guetta
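
The ring-buffer idea in the abstract of patent 10976990 can be pictured with a minimal sketch. It is not the patented implementation: the class name AudioRingBuffer, its methods, and the three-frame capacity are hypothetical, and a Python deque stands in for whatever buffer the hardware actually uses. The point shown is only that audio written before a program is ready stays retrievable up to the buffer's capacity.

```python
from collections import deque


class AudioRingBuffer:
    """Hypothetical fixed-capacity buffer that keeps only the newest frames."""

    def __init__(self, capacity_frames: int) -> None:
        # A deque with maxlen silently drops the oldest item when full.
        self._frames = deque(maxlen=capacity_frames)

    def write(self, frame: bytes) -> None:
        """Device side: store one frame of captured audio."""
        self._frames.append(frame)

    def read_past(self) -> list:
        """System side: return the buffered (past) frames, oldest first."""
        return list(self._frames)


# The device keeps writing before any program has asked for audio.
ring = AudioRingBuffer(capacity_frames=3)
for i in range(5):
    ring.write(bytes([i]))

# Once the program is ready, it is handed the audio captured earlier;
# only the three newest frames are still in the buffer.
past_audio = ring.read_past()
```

The capacity sets how far back in time a newly activated program can reach.
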
Static analysis performing method based on voice information and device for the same

Patent number: 10978065

Abstract: A static analysis performing method based on voice information may be provided that includes: receiving voice information from a user; determining the user's intention to perform static analysis on the basis of the voice information; acquiring history information on static analysis performed in the past in accordance with the user's intention; determining a static analysis target on the basis of the history information; and performing the static analysis on the static analysis target.

Type: Grant

Filed: December 12, 2018

Date of Patent: April 13, 2021

Assignee: SURESOFT TECHNOLOGIES INC.

Inventors: Hyun Seop Bae, June Kim, Seung-uk Oh, Min Hyuk Kwon
Systems and methods for preemptively preventing interruptions from network-connected devices from occurring during media viewing

Patent number: 10979244

Abstract: Systems and methods are provided herein for preventing interruptions to a media viewing activity caused by operations performed in a household by network-connected devices. A media guidance application may determine that operations are being performed by an IoT device and may cause an interruption to media viewing. The media guidance application may prevent the interruption by extending or otherwise handling the operation.

Type: Grant

Filed: September 18, 2019

Date of Patent: April 13, 2021

Assignee: Rovi Guides, Inc.

Inventors: Maria Rocio Ramirez, Denisse Breaux, Angel Merced
Voice-controlled device switching between modes based on speech input

Patent number: 10978062

Abstract: Techniques for presenting content by a voice-controlled device are described. In an example, the voice-controlled device is operatively coupled to a presentation device and supports dual mode functionalities. In a first mode, the voice-controlled device sends content for presentation at the presentation device. In a second mode, the voice-controlled device presents the content at a presentation interface of the voice-controlled device. Based on speech input from a user indicating an issue with a content presentation in the first mode, the voice-controlled device switches to the second mode and presents a message at the presentation interface indicating that subsequent content presentations would be presented at this interface. The voice-controlled device remains in the second mode until receiving additional speech input necessitating a switch to the first mode.

Type: Grant

Filed: September 27, 2018

Date of Patent: April 13, 2021

Assignee: Amazon Technologies, Inc.

Inventors: Valere Joseph Vanderschaegen, Kazim Das, Donald L. Cantrell, Johan Le Nerriec, Joseph Pedro Tavares
Device independent text captioned telephone service

Patent number: 10972604

Abstract: A communication system and method for displaying text captions corresponding to voice communications between an assisted user's mobile wireless device and a separate hearing user's device includes at least one communication component configured to enable the appliance to communicate with a relay, a display, and a processor operably coupled to the at least one communication component and the display. The processor is configured to enable the assisted user to establish an association between the appliance and the mobile device, receive text originating at the relay, the text corresponding to a transcript of the hearing user's voice signal originating at the hearing user's device, and cause text captions corresponding to the received text to be displayed on the display.

Type: Grant

Filed: September 3, 2019

Date of Patent: April 6, 2021

Assignee: Ultratec, Inc.

Inventors: Robert M. Engelke, Kevin R. Colwell, Troy D. Vitek
Input device, electronic device, system comprising the same and control method thereof

Patent number: 10971143

Abstract: An input device which includes a sensor, a microphone, a communicator, and a processor configured to, based on an operation of a user being identified based on a value sensed through the sensor, transmit utterance intention sensing information to an electronic device, based on a command to initiate a speech recognition and feedback information being received from the electronic device according to the utterance intention sensing information transmitted to the electronic device, activate the microphone and provide a feedback according to the feedback information, and transmit a voice signal received via the microphone to the electronic device.

Type: Grant

Filed: July 26, 2018

Date of Patent: April 6, 2021

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Ki-hyun Song, Je-hwan Seo, Suk-hoon Yoon, Jong-keun Lee, Chae-young Lim, Min-sup Kim, Hyun-kyu Yun
Voice control of a media playback system

Patent number: 10971139

Abstract: An example system is configured to cause a first playback device in a first playback zone to operate in a given playback state including playback of media items identified in a playback queue associated with the first playback zone. The system is also configured to, while the first playback device is operating in the given playback state, (i) receive data corresponding to a detected voice input including an indication of (a) a command word and (b) one or more zone variable instances and (ii) determine, based on the command word and the one or more zone variable instances, an intent to transfer the given playback state to a second playback zone. The system is also configured to transfer the given playback state to the second playback zone, thereby causing a second playback device in the second playback zone to play back the media items identified in the playback queue.

Type: Grant

Filed: November 2, 2020

Date of Patent: April 6, 2021

Assignee: Sonos, Inc.

Inventors: Nicholas A. J. Millington, Keith Corbin, Mark Plagge
Speech interaction feedback method for smart TV, system and computer readable medium

Patent number: 10971145

Abstract: The present disclosure provides a speech interaction feedback method for smart TV, a system and a computer readable medium. The method comprises: collecting an audio stream of a speech query sent by a user and element information of a current interface of the smart TV; sending the audio stream and the element information of the current interface to a cloud server so that the cloud server generates an information response message carrying a target element, according to the audio stream and the element information of the current interface, wherein the target element is an element in the current interface hit by an intention of the speech query corresponding to the audio stream; receiving the response message returned by the cloud server; and, according to information of the target element in the response message, performing a preset effect display for the corresponding target element on the current interface, as interaction feedback for the speech query.

Type: Grant

Filed: November 2, 2018

Date of Patent: April 6, 2021

Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.

Inventors: Junnan Luo, Jing Li, Zhixi Chen
Methods and systems for implementing an elastic cloud based voice search using a third-party search provider

Patent number: 10972802

Abstract: Systems and methods are described to provide voice search in an elastic cloud environment communicating with a set-top box (STB) by: receiving, by a voice cloud search server, pulse-code modulation (PCM) audio packets transmitted from the STB; sending the PCM audio packets to a natural language processing (NLP) service for conversion to text; sending the text sets to an elastic voice cloud search server for querying an electronic program guide (EPG) service for channel and program data associated with the text sets, wherein the EPG service at least returns identified channel and program data; and, in response to an identified return of channel and television program data, sending sets of text to a third-party search service for performing an independent search for related data and returning search results of video and image content, which are then stripped of dynamic scripts before being returned to the STB.

Type: Grant

Filed: October 17, 2019

Date of Patent: April 6, 2021

Assignee: DISH Network L.L.C.

Inventors: James Wilde, Ashok Soni, Hawk McGinty, James Shuler, Lixing Zhang, Michael Disante, Narayanan Sekhar, Xiaomei Sun, Xinhua Yang
Audio headset system

Patent number: 10972595

Abstract: A base station for an audio headset system is provided which is able to communicate with a remote device. The base station is configured to be convertible such that it facilitates communication with remote devices over a range of different communication protocols, in accordance with varying user preference.

Type: Grant

Filed: April 13, 2019

Date of Patent: April 6, 2021

Inventor: James Clarke
Information processing apparatus with voice print authentication and program

Patent number: 10972632

Abstract: There is provided an information processing apparatus in which voice operation is enabled, the information processing apparatus including: a voice input device that accepts voice input for voice operation; and a hardware processor that: sets an inputted condition as a job; identifies a content of voice operation based on a voice inputted to the voice input device, and reflects the content of the voice operation in setting of the job; returns the job set by the hardware processor to an initial setting condition; identifies a user by performing voice print authentication; and changes a time until a setting condition of the job is returned to the initial setting condition by the hardware processor, between a case where a user who has uttered the voice inputted to the voice input device is changed in the voice print authentication and a case where the user is not changed.

Type: Grant

Filed: November 12, 2019

Date of Patent: April 6, 2021

Assignee: Konica Minolta, Inc.

Inventor: Toshihiko Otake
Systems and methods for selecting effective phrases to be presented during a conversation

Patent number: 10970485

Abstract: A conversation may be monitored in real time using a trained machine learning model to identify a desired outcome of a conversation and generate one or more phrases for accomplishing the desired outcome. A confidence score may also be determined for one or more phrases that indicates a likelihood that the one or more phrases may help accomplish the desired outcome of the conversation. In some examples, a confidence score may be based on whether an agent, a caller, or both responded unfavorably to a similar phrase used previously in another conversation. In other examples, a confidence score corresponding to one or more phrases may be based on whether a prior conversation in which one or more similar phrases was used resulted in the desired outcome being accomplished.

Type: Grant

Filed: July 31, 2020

Date of Patent: April 6, 2021

Assignee: CRESTA INTELLIGENCE INC.

Inventors: Tianlin Shi, Saurabh Misra, Motoki Dean Wu
Asynchronous virtual assistant

Patent number: 10964325

Abstract: Aspects of the subject disclosure may include, for example, obtaining an input, e.g., from a human operator, comprising a request. A number of activities are identified, e.g., by a virtual assistant, based on the request. Performance of the number of activities is facilitated, e.g., by the virtual assistant. A result is determined, e.g., by the virtual assistant, based on the performance of the number of activities, wherein a response to the request is based on the result. Other embodiments are disclosed.

Type: Grant

Filed: November 13, 2019

Date of Patent: March 30, 2021

Assignee: AT&T Intellectual Property I, L.P.

Inventor: Mazin E. Gilbert
Systems and methods for enabling topic-based verbal interaction with a virtual assistant

Patent number: 10964324

Abstract: Systems and methods are disclosed for enabling verbal interaction with an NLUI application without relying on express wake terms. The NLUI application receives an audio input comprising a plurality of terms. In response to determining that none of the terms is an express wake term pre-programmed into the NLUI application, the NLUI application determines a topic for the plurality of terms. The NLUI application then determines whether the topic is within a plurality of topics for which a response should be generated. If the determined topic of the audio input is within the plurality of topics, the NLUI application generates a response to the audio input.

Type: Grant

Filed: April 26, 2019

Date of Patent: March 30, 2021

Assignee: Rovi Guides, Inc.

Inventors: Vikram Makam Gupta, Sukanya Agarwal, Gyanveer Singh
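
The decision flow in the abstract of patent 10964324 (no express wake term, so determine a topic, then respond only if the topic is in an allowed set) can be sketched as follows. This is an illustration under stated assumptions, not the patented method: the wake phrase, the keyword tables, and the keyword-counting topic detector are all hypothetical stand-ins for whatever model a real NLUI application would use.

```python
# Hypothetical configuration; none of these values come from the patent.
WAKE_TERMS = {"hey assistant"}
TOPIC_KEYWORDS = {
    "weather": {"rain", "forecast", "sunny"},
    "music": {"song", "playlist", "album"},
}
RESPONDABLE_TOPICS = {"weather"}


def detect_topic(terms):
    """Pick the topic whose keywords match the most terms (None if no match)."""
    best_topic, best_hits = None, 0
    for topic, keywords in TOPIC_KEYWORDS.items():
        hits = sum(1 for term in terms if term in keywords)
        if hits > best_hits:
            best_topic, best_hits = topic, hits
    return best_topic


def should_respond(utterance):
    """Respond on an express wake term, or when the topic is an allowed one."""
    text = utterance.lower()
    if any(wake in text for wake in WAKE_TERMS):
        return True
    topic = detect_topic(text.split())
    return topic in RESPONDABLE_TOPICS
```

With these tables, should_respond("Will it rain tomorrow") is true without any wake phrase, while should_respond("Play that song again") is false because "music" is not among the respondable topics.
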
Controlling optically-switchable devices

Patent number: 10964320

Abstract: This disclosure relates generally to optically switchable devices, and more particularly, to methods for controlling optically switchable devices. In various embodiments, one or more optically switchable devices may be controlled via voice control and/or gesture control. The method may be implemented on a network of optically switchable devices, and may be implemented to control the optical state of a plurality of optically switchable devices on the network.

Type: Grant

Filed: April 25, 2017

Date of Patent: March 30, 2021

Assignee: View, Inc.

Inventors: Dhairya Shrivastava, Mark D. Mendenhall
Scalable dynamic class language modeling

Patent number: 10957312

Abstract: This document generally describes systems and methods for dynamically adapting speech recognition for individual voice queries of a user using class-based language models. The method may include receiving a voice query from a user that includes audio data corresponding to an utterance of the user, and context data associated with the user. One or more class models are then generated that collectively identify a first set of terms determined based on the context data, and a respective class to which the respective term is assigned for each respective term in the first set of terms. A language model that includes a residual unigram may then be accessed and processed for each respective class to insert a respective class symbol at each instance of the residual unigram that occurs within the language model. A transcription of the utterance of the user is then generated using the modified language model.

Type: Grant

Filed: December 31, 2019

Date of Patent: March 23, 2021

Assignee: Google LLC

Inventors: Justin Max Scheiner, Petar Aleksic
Image display apparatus and method of controlling the same

Patent number: 10957323

Abstract: Provided are an image display apparatus and a method of controlling the same. The image display apparatus enabling voice recognition includes: a first voice inputter which receives a user-side audio signal; an audio outputter which outputs an audio signal processed by the image display apparatus; a first voice recognizer which recognizes the user-side audio signal received through the first voice inputter; and a controller which decreases a volume of the audio signal output through the audio outputter to a predetermined level if a voice recognition start command is received.

Type: Grant

Filed: September 13, 2019

Date of Patent: March 23, 2021

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Dae Gyu Bae, Tae Hwan Cha, Ho Jeong You
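
The volume behavior in the abstract of patent 10957323 (lowering the output volume to a predetermined level when a voice recognition start command arrives) can be sketched as below. The names AudioOutput, DUCKED_VOLUME, and the two handler functions are hypothetical, and two details are assumptions that the abstract does not state: a volume already below the predetermined level is left unchanged, and the earlier volume is restored when recognition ends.

```python
from dataclasses import dataclass

DUCKED_VOLUME = 5  # hypothetical "predetermined level"


@dataclass
class AudioOutput:
    volume: int


def on_recognition_start(output: AudioOutput) -> int:
    """Lower the volume to the predetermined level; return the old volume."""
    previous = output.volume
    # Assumption: never raise the volume if it is already quieter.
    output.volume = min(output.volume, DUCKED_VOLUME)
    return previous


def on_recognition_end(output: AudioOutput, previous: int) -> None:
    """Assumption: restore the volume that was in effect before ducking."""
    output.volume = previous


tv = AudioOutput(volume=20)
saved = on_recognition_start(tv)
ducked = tv.volume
on_recognition_end(tv, saved)
```

Lowering the apparatus's own audio in this way keeps it from masking the user's voice while the recognizer is listening.
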
