Patents Examined by Michael Colucci
  • Patent number: 10656910
    Abstract: A method and system are provided. The method includes receiving, by a microphone and camera, user utterances indicative of user commands and associated user gestures for the user utterances. The method further includes parsing, by a hardware-based recognizer, sample utterances and the user utterances into verb parts and noun parts. The method also includes recognizing, by a hardware-based recognizer, the user utterances and the associated user gestures based on the sample utterances and descriptions of associated supporting gestures for the sample utterances. The recognizing step includes comparing the verb parts and the noun parts from the user utterances individually and as pairs to the verb parts and the noun parts of the sample utterances. The method additionally includes selectively performing a given one of the user commands responsive to a recognition result.
    Type: Grant
    Filed: July 24, 2018
    Date of Patent: May 19, 2020
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Jonathan Lenchner, Vinay Venkataraman
  • Patent number: 10657463
    Abstract: One embodiment provides a method comprising answering one or more incoming phone calls received at one or more pre-specified phone numbers utilizing a bot. The bot is configured to engage in a conversation with a caller initiating an incoming phone call utilizing a voice recording that impersonates a human being. The method further comprises recording each conversation the bot engages in, and classifying each recorded conversation as one of poison data or truthful training data based on content of the recorded conversation and one or more learned detection models for detecting poisoned data.
    Type: Grant
    Filed: June 27, 2019
    Date of Patent: May 19, 2020
    Assignee: International Business Machines Corporation
    Inventors: Nathalie Baracaldo Angel, Pawan R. Chowdhary, Heiko H. Ludwig, Robert J. Moore, Taiga Nakamura
  • Patent number: 10656909
    Abstract: A method and system are provided. The method includes receiving, by a microphone and camera, user utterances indicative of user commands and associated user gestures for the user utterances. The method further includes parsing, by a hardware-based recognizer, sample utterances and the user utterances into verb parts and noun parts. The method also includes recognizing, by a hardware-based recognizer, the user utterances and the associated user gestures based on the sample utterances and descriptions of associated supporting gestures for the sample utterances. The recognizing step includes comparing the verb parts and the noun parts from the user utterances individually and as pairs to the verb parts and the noun parts of the sample utterances. The method additionally includes selectively performing a given one of the user commands responsive to a recognition result.
    Type: Grant
    Filed: July 24, 2018
    Date of Patent: May 19, 2020
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Jonathan Lenchner, Vinay Venkataraman
  • Patent number: 10650055
    Abstract: A wearable sound capturing and retrieval system that includes a wearable sound capturing device that comprises a data collection device including at least one microphone configured for capturing sound data adjacent a user in at least a substantially continuous manner. The system may, for example: (1) store the captured sound data; (2) convert the captured sound data to captured textual data; (3) index data selected from: one or more segments of captured sound data and one or more segments of captured textual data; and (3) facilitate retrieval of at least a portion of the indexed data, wherein facilitating the retrieval includes (I) scanning the indexed data to identify one or more key phrases, (ii) retrieving one or more segments of indexed data that was communicated by the user at least about contemporaneously with the one or more key phrases, and (iii) saving the one or more segments of indexed data.
    Type: Grant
    Filed: February 8, 2019
    Date of Patent: May 12, 2020
    Assignee: Viesoft, Inc.
    Inventor: Anthony Vierra
  • Patent number: 10629187
    Abstract: Systems and methods are described herein for providing media guidance. Control circuitry may receive a first voice input and access a database of topics to identify a first topic associated with the first voice input. A user interface may generate a first response to the first voice input, and subsequent to generating the first response, the control circuitry may receive a second voice input. The control circuitry may determine a match between the second voice input and an interruption input such as a period of silence or a keyword or a phrase, such as “Ahh,”, “Umm,”, or “Hmm.” The user interface may generate a second response that is associated with a second topic related to the first topic. By interrupting the conversation and changing the subject from time to time, media guidance systems can appear to be more intelligent and human.
    Type: Grant
    Filed: April 9, 2019
    Date of Patent: April 21, 2020
    Assignee: Rovi Guides, Inc.
    Inventors: Charles Dawes, Walter R. Klappert
  • Patent number: 10629204
    Abstract: Utterance-based user interfaces can include activation trigger processing techniques for detecting activation triggers and causing execution of certain commands associated with particular command pattern activation triggers without waiting for output from a separate speech processing engine. The activation trigger processing techniques can also detect speech analysis patterns and selectively activate a speech processing engine.
    Type: Grant
    Filed: October 3, 2018
    Date of Patent: April 21, 2020
    Assignee: SPOTIFY AB
    Inventor: Richard Mitic
  • Patent number: 10593330
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for hotword detection on multiple devices are disclosed. In one aspect, a method includes the actions of receiving, by a first computing device, audio data that corresponds to an utterance. The actions further include determining a first value corresponding to a likelihood that the utterance includes a hotword. The actions further include receiving a second value corresponding to a likelihood that the utterance includes the hotword, the second value being determined by a second computing device. The actions further include comparing the first value and the second value. The actions further include based on comparing the first value to the second value, initiating speech recognition processing on the audio data.
    Type: Grant
    Filed: October 26, 2018
    Date of Patent: March 17, 2020
    Assignee: Google LLC
    Inventor: Matthew Sharifi
  • Patent number: 10276152
    Abstract: An audible based electronic challenge system is used to control access to resources by using a test to identify an origin of a voice. The test is based on optimized text sentences selected for their discrimination capability in identifying different speakers.
    Type: Grant
    Filed: July 19, 2016
    Date of Patent: April 30, 2019
    Assignee: J. Nicholas and Kristin Gross
    Inventor: John Nicholas Gross
  • Patent number: 10157607
    Abstract: A computer-implemented method according to one embodiment includes identifying a request for audio data, determining one or more factors associated with the request, adjusting a speed of the audio data to create adjusted audio data, based on the one or more factors, and returning the adjusted audio data in response to the request.
    Type: Grant
    Filed: October 20, 2016
    Date of Patent: December 18, 2018
    Assignee: International Business Machines Corporation
    Inventors: Inseok Hwang, Su Liu, Eric J. Rozner, Chin Ngai Sze
  • Patent number: 10157350
    Abstract: Method(s) and system(s) providing for providing context based conversations are described here. The method may include receiving user data pertaining to a user. The user data includes registration information of the user and metadata associated with the user. The method may include determining a pre-defined role of the user based on the registration information. Further, the method may include providing restricted access to a users' data repository to the user, based on the role of the user. The method includes obtaining a text input pertaining to a conversation. Based on the text input an expression is generated. Further, one of a discussion service, a learning service, and an unlearning service is invoked, based on the expression and the metadata associated with the user. Based on at least one of the invoking services and the metadata associated with the user, retrieving a response. The response is shared with the user.
    Type: Grant
    Filed: August 31, 2015
    Date of Patent: December 18, 2018
    Assignee: TATA CONSULTANCY SERVICES LIMITED
    Inventors: Sumesh M R, Anju Paul, Neethu Manuel, Sibimon Sasidharan, Keerthi Damaraju, Viju Chacko, Shampa Sarkar
  • Patent number: 10157179
    Abstract: Some embodiments include a computer-implement method of producing a flexible sentence syntax to facilitate one or more computer applications to generate and publish sentence expressions. For example, the method can include providing a developer interface to define a flexible sentence syntax that controls one or more sentences publishable by an application service. A developer of the application service can customize the flexible sentence syntax including selecting at least one of selectable tokens that is associated with another element to incorporate in the flexible sentence syntax. Based on the selected token, a computing device can generate and publish a target sentence according to the flexible sentence syntax on the application service's behalf.
    Type: Grant
    Filed: July 18, 2017
    Date of Patent: December 18, 2018
    Assignee: FACEBOOK, INC.
    Inventors: Ling Bao, Hugo Johan van Heuven, Jiangbo Miao
  • Patent number: 10146773
    Abstract: Various embodiments described herein facilitate multi-lingual communications. The systems and methods of some embodiments may enable multi-lingual communications through different modes of communications including, for example, Internet-based chat, e-mail, text-based mobile phone communications, postings to online forums, postings to online social media services, and the like. Certain embodiments may implement communications systems and methods that translate text between two or more languages (e.g., spoken), while handling/accommodating for one or more of the following in the text: specialized/domain-related jargon, abbreviations, acronyms, proper nouns, common nouns, diminutives, colloquial words or phrases, and profane words or phrases.
    Type: Grant
    Filed: November 6, 2017
    Date of Patent: December 4, 2018
    Assignee: MZ IP Holdings, LLC
    Inventors: Gabriel Leydon, Francois Orsini, Nikhil Bojja, Shailen Karur
  • Patent number: 10133728
    Abstract: The system that performs semantic parsing may automatically extract complex information from databases. Complex information may comprise nested event structures. In one example process, a processor may receive unannotated text and may access a natural-language database that includes nested events. The processor, in performing semantic parsing, may automatically generate syntactic trees that include annotations that represent the semantic information. In particular, the natural-language sentences and the database include nested event structures.
    Type: Grant
    Filed: March 20, 2015
    Date of Patent: November 20, 2018
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Hoifung Poon, Kristina Toutanova, Ankur P. Parikh
  • Patent number: 10115405
    Abstract: The present invention proposes a new method for improving the performance of a real-valued filterbank based spectral envelope adjuster. By adaptively locking the gain values for adjacent channels dependent on the sign of the channels, as defined in the application, reduced aliasing is achieved. Furthermore, the grouping of the channels during gain-calculation, gives an improved energy estimate of the real valued subband signals in the filterbank.
    Type: Grant
    Filed: October 31, 2017
    Date of Patent: October 30, 2018
    Assignee: Dolby International AB
    Inventors: Kristofer Kjoerling, Lars Villemoes
  • Patent number: 10102199
    Abstract: Representative embodiments disclose mechanisms to complete partial natural language questions. Users enter a partial question. The system comprises a plurality of indexes, one index comprising common phrases associated with natural language questions and other indexes comprising short text entries associated with documents, such as document titles. The partial question is used to search one or more of the indexes. The search yields a ranked list of results. The top k entries of the list are selected and one or more language models are created from the top k entries. Each language model comprises n-grams from the top k entries from an index and an associated probability for each n-gram. A question completion generator creates question completion suggestions by matching n-grams with the partial question, removing ungrammatical candidate suggestions, and filtering the remaining suggestions per a filtering criteria. The top N results are returned as suggestions to complete the question.
    Type: Grant
    Filed: February 24, 2017
    Date of Patent: October 16, 2018
    Inventors: Peter Richard Bailey, David Anthony Hawking, David Maxwell
  • Patent number: 10102771
    Abstract: A method and a device for learning a language and a computer readable recording medium are provided. The method includes following steps. An input voice from a voice receiver is transformed into an input sentence according to a grammar rule. Whether the input sentence is the same as a learning sentence displayed on a display is determined. If the input sentence is different from the learning sentence, an ancillary information containing at least one error word in the input sentence that is different from the learning sentence is generated.
    Type: Grant
    Filed: February 13, 2014
    Date of Patent: October 16, 2018
    Assignee: Wistron Corporation
    Inventor: Hsi-chun Hsiao
  • Patent number: 10089061
    Abstract: According to one embodiment, an electronic device includes a memory and a hardware processor. The hardware processor is in communication with the memory. The hardware processor is configured to obtain a sound file including sound data and attached data, determine a type of meeting of the sound file classified based on an utterance state of the sound data, and display the sound file based on at least one of the sound data and the attached data such that the type of meeting is visually recognizable.
    Type: Grant
    Filed: February 16, 2016
    Date of Patent: October 2, 2018
    Assignee: KABUSHIKI KAISHA TOSHIBA
    Inventor: Yusaku Kikugawa
  • Patent number: 10079013
    Abstract: A computing system is operable as virtual personal assistant (VPA) to understand relationships between different instances of natural language dialog expressed by different people in a multi-person conversational dialog session. The VPA can develop a common resource, a shared intent, which represents the VPA's semantic understanding of at least a portion of the multi-person dialog experience. The VPA can store and manipulate multiple shared intents, and can alternate between different shared intents as the multi-person conversation unfolds. With the shared intents, the computing system can generate useful action items and present the action items to one or more of the participants in the dialog session.
    Type: Grant
    Filed: November 27, 2013
    Date of Patent: September 18, 2018
    Assignee: SRI International
    Inventors: Edgar T. Kalns, Douglas A. Bercow, James F. Carpenter
  • Patent number: 10079025
    Abstract: A method of modification of audio data to improve the quality of the audio modification or reconstruction or improves the speed of such reconstruction or modification and produces more realistic audio data. Realistic audio data is audio data that is generated in natural events like talking or singing or a vehicle passing by and is not generated only by artificially constructing audio data like in a synthesizer. This will lead to audio data that will be perceived more likely as natural or unmodified audio signal when being played back to human beings. The method involves modification of some part of transformed audio data, especially phase data.
    Type: Grant
    Filed: October 18, 2016
    Date of Patent: September 18, 2018
    Assignee: Steinberg Media Technologies GmbH
    Inventors: Jean-Baptiste Jacques Guillaume Rolland, Yvan Grabit
  • Patent number: 10078632
    Abstract: An approach is provided in which an information handling system detects a multi-entity co-occurrence anomaly within a set of documents that corresponds to an amount of times that a first entity and a second entity co-occur in the set of documents. The information handling system then determines that at least one of the documents includes a title having a verb that grammatically connects the first entity to the second entity. As such, the information handling system collects document segments from the set of documents that have the first entity, the second entity, and the connecting verb. In turn, the information handling system uses the collected document segments to train a relation-based classifier.
    Type: Grant
    Filed: March 12, 2016
    Date of Patent: September 18, 2018
    Assignee: International Business Machines Corporation
    Inventors: Devin R. Harper, Pawan K. Lakshmanan, Gregory W. Schoeninger, Elliot B. Turner