Patents Examined by Daniel Abebe
  • Patent number: 10121494
    Abstract: A speech-capture device can capture audio data during wakeword monitoring and use the audio data to determine if a user is present nearby the device, even if no wakeword is spoken. Audio such as speech, human originating sounds (e.g., coughing, sneezing), or other human related noises (e.g., footsteps, doors closing) can be used to detect audio. Audio frames are individually scored as to whether a human presence is detected in the particular audio frames. The scores are then smoothed relative to nearby frames to create a decision for a particular frame. Presence information can then be sent according to a periodic schedule to a remote device to create a presence “heartbeat” that regularly identifies whether a user is detected proximate to a speech-capture device.
    Type: Grant
    Filed: March 30, 2017
    Date of Patent: November 6, 2018
    Assignee: Amazon Technologies, Inc.
    Inventors: Shiva Kumar Sundaram, Chao Wang, Shiv Naga Prasad Vitaladevuni, Spyridon Matsoukas, Arindam Mandal
  • Patent number: 10121480
    Abstract: A method for processing audio data includes determining a first common scalefactor value for representing quantized audio data in a frame. A second common scalefactor value is determined for representing the quantized audio data in the frame. A line equation common scalefactor value is determined from the first and second common scalefactor values.
    Type: Grant
    Filed: July 28, 2016
    Date of Patent: November 6, 2018
    Assignee: Intel Corporation
    Inventors: Dmitry N. Budnikov, Igor Igor Chikalov, Sergey N. Zheltov
  • Patent number: 10120866
    Abstract: Examples of the present disclosure describe systems and methods relating to conversational system user behavior identification. A user of the conversational system may be evaluated based on one or more factors. The one or more factors may be compared to an aggregated measure for a larger group of conversational system users, such that “anomalous” behavior (e.g., behavior that deviates from a normal behavior) may be identified. When a user is identified as exhibiting anomalous behavior, the conversational system may adapt its interactions with the user in order to encourage, discourage, or further observe the identified behavior. As a result, the conversational system may be able to verify a user's anomalous behavior, discourage the anomalous behavior, or take other action while interacting with the user.
    Type: Grant
    Filed: April 28, 2017
    Date of Patent: November 6, 2018
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Joseph Edwin Johnson, Jr., Emmanouil Koukoumidis, Donald Brinkman, Matthew Schuerman
  • Patent number: 10115406
    Abstract: An apparatus for decoding to obtain a reconstructed audio signal envelope includes a signal envelope reconstructor for generating the reconstructed audio signal envelope depending on one or more splitting points and an output interface for outputting the reconstructed audio signal envelope. The signal envelope reconstructor is configured to generate the reconstructed audio signal envelope such that the one or more splitting points divide the reconstructed audio signal envelope into two or more audio signal envelope portions, and to generate the reconstructed audio signal envelope such that, for each of the two or more signal envelope portions, an absolute value of its signal envelope portion value is greater than half of an absolute value of the signal envelope portion value of each of the other signal envelope portions.
    Type: Grant
    Filed: December 9, 2015
    Date of Patent: October 30, 2018
    Assignee: Fraunhofer-Gesellschaft zur foerderung der angewandten Forschung e.V
    Inventors: Tom Baeckstroem, Benjamin Schubert, Markus Multrus, Sascha Disch, Konstantin Schmidt, Grzegorz Pietrzyk
  • Patent number: 10114816
    Abstract: A dialogue complexity assessment method, system, and computer program product for introducing the notion of dialogue complexity to understand and compare dialogues in a repository, calculating the dialogue complexity, use the dialogue complexity to understand customer interactions in a variety of domains using public and proprietary data, and demonstrate the dialogue complexity usage to improve a service management operation.
    Type: Grant
    Filed: April 28, 2017
    Date of Patent: October 30, 2018
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Biplav Srivastava, Qingzi Vera Liao, Pavan Kapanipathi Bangalore
  • Patent number: 10102855
    Abstract: Methods and systems for rendering lists of instructions and performing actions associated with those lists are described herein. In some embodiments, an individual may request that a voice activated electronic device associated with their user account assist in performing a task using a list of instructions. The list of instructions may include metadata that indicates actions capable of being performed by additional Internet of Things (“IoT”) devices. When the instructions are rendered, an instructions speechlet may recognize the metadata and may cause one or more of the IoT devices to perform a particular action. Furthermore, the metadata may also correspond to content capable of being rendered by the voice activated electronic device to assist the individual in performing a particular step of the instructions.
    Type: Grant
    Filed: March 30, 2017
    Date of Patent: October 16, 2018
    Assignee: Amazon Technologies, Inc.
    Inventor: Manoj Sindhwani
  • Patent number: 10096324
    Abstract: A frame error concealment (FEC) method is provided. The method includes: selecting an FEC mode based on states of a current frame and a previous frame of the current frame in a time domain signal generated after time-frequency inverse transform processing; and performing corresponding time domain error concealment processing on the current frame based on the selected FEC mode, wherein the current frame is an error frame or the current frame is a normal frame when the previous frame is an error frame.
    Type: Grant
    Filed: January 30, 2017
    Date of Patent: October 9, 2018
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Ho-sang Sung, Nam-suk Lee
  • Patent number: 10089303
    Abstract: A computer-aided translation system includes a processor configured to generate a suggestion pool of possible translations for each sentence in a document. A translation module configured to provide a best suggestion from the suggestion pool to a user for a sentence being translated and to provide an updated best suggestion from the updated suggestion pool to the user after the receipt of a user's translation prefix input. A pool update module configured to update the suggestion pool based on the user's input of a translation prefix.
    Type: Grant
    Filed: May 4, 2016
    Date of Patent: October 2, 2018
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Libin Shen, Bowen Zhou
  • Patent number: 10089982
    Abstract: Methods, systems, and apparatus for determining that a software application installed on a user device is compatible with a new voice action, wherein the new voice action is specified by an application developer of the software application. One or more trigger terms for triggering the software application to perform the new voice action are identified. An automatic speech recognizer is biased to prefer the identified trigger terms of the new voice action over trigger terms of other voice actions. A transcription of an utterance generated by the biased automatic speech recognizer is obtained. The transcription of the utterance generated by the biased automatic speech recognizer is determined to include a particular trigger term included in the identified trigger terms. Based at least on determining that the transcription of the utterance generated by the biased automatic speech recognizer includes the particular trigger term, execution of the new voice action is triggered.
    Type: Grant
    Filed: June 8, 2017
    Date of Patent: October 2, 2018
    Assignee: GOOGLE LLC
    Inventors: Bo Wang, Sunil Vemuri, Barnaby John James, Pravir Kumar Gupta, Scott B. Huffman
  • Patent number: 10089981
    Abstract: Methods and systems for performing contact resolution are described herein. When initiating a communications session using a voice activated electronic device, a contact name may be resolved to determine an appropriate contact with which the communications session may be directed to. Contacts from an individual's contact list may be queried to determine a listing of probable contacts associated with the contact name, and contact identifiers associated with the contact may be determined. Using one or more rules for disambiguating between similar contact names, a single contact may be identified, and a communications session with that contact may be initiated.
    Type: Grant
    Filed: March 9, 2017
    Date of Patent: October 2, 2018
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Someshwaran Elangovan, Aparna Nandyal, Venkatesh Kancharla, Arun Rajendran, Sumedha Arvind Kshirsagar, Christopher Geiger Parker
  • Patent number: 10089983
    Abstract: Methods and systems for adding functionality to an account of a language processing system where the functionality is associated with a second account of a first application system is described herein. In a non-limiting embodiment, an individual may log into a first account of a language processing system and log into a second account of a first application system. While logged into both the first account and the second account, a button included within a webpage provided by the first application may be invoked. A request capable of being serviced using the first functionality may be received by the language processing system from a device associated with the first account. The language processing system may send first account data and the second account data to the first application system to facilitate an action associated with the request, thereby enabling the first functionality for the first account.
    Type: Grant
    Filed: June 8, 2017
    Date of Patent: October 2, 2018
    Assignee: Amazon Technologies, Inc.
    Inventors: Ganesh Kumar Gella, Venkata Abhinav Sidharth Bhagavatula, Robert William Serr, Yonnas Getahun Beyene
  • Patent number: 10083005
    Abstract: A user speech interface for interactive media guidance applications, such as television program guides, guides for audio services, guides for video-on-demand (VOD) services, guides for personal video recorders (PVRs), or other suitable guidance applications is provided. Voice commands may be received from a user and guidance activities may be performed in response to the voice commands.
    Type: Grant
    Filed: April 21, 2016
    Date of Patent: September 25, 2018
    Assignee: Rovi Guides, Inc.
    Inventors: M. Scott Reichardt, David M. Berezowski, Michael D. Ellis, Toby DeWeese
  • Patent number: 10083696
    Abstract: A method for determining user liveness is provided that includes calculating, by a computing device, a spectral property difference between voice biometric data captured from a user and user record voice biometric data. The user and the computing device constitute a user-computing device pair, and the voice biometric data is captured by the computing device during a verification transaction. Moreover, the method includes inputting the spectral property difference into a machine learning algorithm, calculating an output score with the machine learning algorithm, and determining the voice biometric data was captured from a live user when the output score satisfies a threshold score.
    Type: Grant
    Filed: March 7, 2017
    Date of Patent: September 25, 2018
    Assignee: DAON HOLDINGS LIMITED
    Inventor: Raphael Blouet
  • Patent number: 10083706
    Abstract: The coding efficiency of an audio codec using a controllable—switchable or even adjustable—harmonic filter tool is improved by performing the harmonicity-dependent controlling of this tool using a temporal structure measure in addition to a measure of harmonicity in order to control the harmonic filter tool. In particular, the temporal structure of the audio signal is evaluated in a manner which depends on the pitch. This enables to achieve a situation-adapted control of the harmonic filter tool so that in situations where a control made solely based on the measure of harmonicity would decide against or reduce the usage of this tool, although using the harmonic filter tool would, in that situation, increase the coding efficiency, the harmonic filter tool is applied, while in other situations where the harmonic filter tool may be inefficient or even destructive, the control reduces the appliance of the harmonic filter tool appropriately.
    Type: Grant
    Filed: January 20, 2017
    Date of Patent: September 25, 2018
    Assignee: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e. V.
    Inventors: Goran Markovic, Christian Helmrich, Emmanuel Ravelli, Manuel Jander, Stefan Doehla
  • Patent number: 10049667
    Abstract: Location-based conversational understanding may be provided. Upon receiving a query from a user, an environmental context associated with the query may be generated. The query may be interpreted according to the environmental context. The interpreted query may be executed and at least one result associated with the query may be provided to the user.
    Type: Grant
    Filed: January 7, 2016
    Date of Patent: August 14, 2018
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Larry Paul Heck, Madhusudan Chinthakunta, David Mitby, Lisa Stifelman
  • Patent number: 10043535
    Abstract: A method and device for automatically increasing the spectral bandwidth of an audio signal including generating a “mapping” (or “prediction”) matrix based on the analysis of a reference wideband signal and a reference narrowband signal, the mapping matrix being a transformation matrix to predict high frequency energy from a low frequency energy envelope, generating an energy envelope analysis of an input narrowband audio signal, generating a resynthesized noise signal by processing a random noise signal with the mapping matrix and the envelope analysis, high-pass filtering the resynthesized noise signal, and summing the high-pass filtered resynthesized noise signal with the input narrowband audio signal. Other embodiments are disclosed.
    Type: Grant
    Filed: January 15, 2014
    Date of Patent: August 7, 2018
    Assignee: Staton Techiya, LLC
    Inventors: John Usher, Dan Ellis
  • Patent number: 10026410
    Abstract: Audio signal processing enhances audio watermark embedding and detecting processes. Audio signal processes include audio classification and adapting watermark embedding and detecting based on classification. Advances in audio watermark design include adaptive watermark signal structure data protocols, perceptual models, and insertion methods. Perceptual and robustness evaluation is integrated into audio watermark embedding to optimize audio quality relative the original signal, and to optimize robustness or data capacity. These methods are applied to audio segments in audio embedder and detector configurations to support real time operation. Feature extraction and matching are also used to adapt audio watermark embedding and detecting.
    Type: Grant
    Filed: July 26, 2016
    Date of Patent: July 17, 2018
    Assignee: Digimarc Corporation
    Inventors: Aparna R. Gurijala, Yang Bai, Ravi K. Sharma, Brett A. Bradley
  • Patent number: 10026418
    Abstract: An abnormal frame detection method and apparatus are disclosed. In an embodiment the method includes obtaining a signal frame from a speech signal, and dividing the signal frame into at least two subframes; obtaining a local energy value of a subframe of the signal frame; obtaining, according to the local energy value of the subframe, a first characteristic value used to indicate a local energy trend of the signal frame; performing singularity analysis on the signal frame to obtain a second characteristic value; and determining the signal frame as an abnormal frame if the first characteristic value meets a first threshold and the second characteristic value meets a second threshold. It is implemented whether distortion occurs in a speech signal is detected.
    Type: Grant
    Filed: January 25, 2017
    Date of Patent: July 17, 2018
    Assignee: Huawei Technologies Co., Ltd.
    Inventor: Wei Xiao
  • Patent number: 10002123
    Abstract: A data processing method, program, and apparatus for identifying a document within a block of text. A block of text is tokenized into a plurality of text tokens according to at least one rule parser. Each of the plurality of text tokens is sequentially compared to a plurality of document tokens to determine if the text token matches one of the plurality of document tokens. The plurality of document tokens correspond to a plurality of documents which have been tokenized according to the one or more rule parsers. Each matched text token is filtered according to predetermined filtering criteria to generate one or more candidate text tokens. It is then determined whether sequence of candidate text tokens that occur in sequential order within the block of text match sequence of document tokens. If so, then it is determined that the document has been identified within the block of text. The document can correspond to an artist, a song names, and misspellings and aliases thereof.
    Type: Grant
    Filed: November 30, 2016
    Date of Patent: June 19, 2018
    Assignee: Spotify AB
    Inventors: Brian Whitman, Hui Cao
  • Patent number: 10002129
    Abstract: This disclosure relates generally to natural language processing, and more particularly to a system and method for extracting subject-verb-object (SVO) chunked text from an unstructured text. In one embodiment, a method is provided for extracting SVO chunked text from an unstructured text. The method comprises identifying a plurality of part of speech (PoS) tokens in the unstructured text, and determining a plurality of SVO chunked text directly from the plurality of PoS tokens using a machine learning chunker model. The machine learning chunker model is trained on a subject-verb-object (SVO) annotated training data.
    Type: Grant
    Filed: March 30, 2017
    Date of Patent: June 19, 2018
    Assignee: Wipro Limited
    Inventor: Shaun Cyprian D'Souza