Patents Examined by Shaun Roberts
  • Patent number: 9792907
    Abstract: Techniques related to key phrase detection for applications such as wake on voice are discussed. Such techniques may include updating a start state based rejection model and a key phrase model based on scores of sub-phonetic units from an acoustic model to generate a rejection likelihood score and a key phrase likelihood score and determining whether received audio input is associated with a predetermined key phrase based on the rejection likelihood score and the key phrase likelihood score.
    Type: Grant
    Filed: November 24, 2015
    Date of Patent: October 17, 2017
    Assignee: Intel IP Corporation
    Inventors: Tobias Bocklet, Joachim Hofer
  • Patent number: 9792911
    Abstract: A method of operating a speech recognition system includes converting a spoken utterance by a user into an electrical voice signal by use of a local microphone associated with a local electronic device. The electrical voice signal is transmitted to a remote voice recognizer. The remote voice recognizer is used to transcribe the electrical voice signal and to produce a confidence score. The confidence score indicates a level of confidence that the transcription of the electrical voice signal substantially matches the words of the spoken utterance. The transcription of the electrical voice signal and the confidence score are transmitted from the remote voice recognizer to the local electronic device. The electrical voice signal, the transcription of the electrical voice signal, and the confidence score are used at the local device to train a local voice recognizer.
    Type: Grant
    Filed: March 24, 2015
    Date of Patent: October 17, 2017
    Assignee: Panasonic Automotive Systems Company of America, Division of Panasonic Corporation of North America
    Inventors: Ilya Veksler, Ambuj Kumar, Naveen Reddy Korupol
  • Patent number: 9779752
    Abstract: A system for cloud acoustic enhancement is disclosed. In particular, the system may leverage metadata and cloud-computing network resources to mitigate the impact of noisy environments that may potentially interfere with user communications. In order to do so, the system may receive an audio stream including an audio signal associated with a user, and determine if the audio stream also includes an interference signal. The system may determine that the audio stream includes the interference signal if a portion of the audio stream correlates with metadata that identifies the interference signal. If the audio stream is determined to include the interference signal, the system may cancel the interference signal from the audio stream by utilizing the metadata and the cloud-computing network resources. Once the interference signal is cancelled, the system may transmit the audio stream including the audio signal associated with the user to an intended destination.
    Type: Grant
    Filed: October 31, 2014
    Date of Patent: October 3, 2017
    Assignee: AT&T Intellectual Property I, L.P.
    Inventors: Donald J. Bowen, Dimitrios B. Dimitriadis, Lusheng Ji, Horst J. Schroeter
  • Patent number: 9767808
    Abstract: A method and apparatus for suppressing vocoder noise are provided. In the method, first information and second information are received from a channel decoder, the first information indicating whether a decoded data frame has an error and the second information being a channel quality metric. Error concealment voice decoding is performed on the decoded data frame if the first information indicates that no channel decoding error has been generated and the second information is smaller than a predetermined first threshold. Normal voice decoding is performed on the decoded data frame if the first information indicates that no channel decoding error has been generated and the second information is equal to or larger than the first threshold.
    Type: Grant
    Filed: July 26, 2013
    Date of Patent: September 19, 2017
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Yong-Won Shin, Joon-Sang Ryu, Jung-In Kim
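The decision rule in the abstract above can be illustrated with a minimal sketch. This is not the patented implementation: the names `DecodeMode` and `select_decode_mode` are hypothetical, and the behavior when the frame does have a channel decoding error is an assumption, since the abstract only describes the no-error case.

```python
from enum import Enum


class DecodeMode(Enum):
    NORMAL = "normal"
    ERROR_CONCEALMENT = "error_concealment"


def select_decode_mode(frame_has_error: bool,
                       channel_quality: float,
                       threshold: float) -> DecodeMode:
    """Pick a decoding mode from the two pieces of channel-decoder information."""
    if frame_has_error:
        # Assumption: the abstract does not cover this case; concealing an
        # erroneous frame is a common choice, used here for illustration only.
        return DecodeMode.ERROR_CONCEALMENT
    if channel_quality < threshold:
        # No decoding error reported, but the quality metric is below the
        # first threshold: treat the frame as unreliable.
        return DecodeMode.ERROR_CONCEALMENT
    # No decoding error and quality at or above the threshold.
    return DecodeMode.NORMAL
```

The point of the second check is that a frame can pass channel decoding and still be unreliable, so the quality metric acts as an additional gate before normal decoding.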
  • Patent number: 9753918
    Abstract: A speech translation system and methods for cross-lingual communication that enable users to easily improve and customize content and usage of the system. The methods include, in response to receiving an utterance including a first term associated with a field, translating the utterance into a second language. In response to receiving an indication to add the first term associated with the field to a first recognition lexicon, adding the first term associated with the field and the determined translation to a first machine translation module and to a shared database for a community associated with the field of the first term associated with the field, wherein the first term associated with the field added to the shared database is accessible by the community.
    Type: Grant
    Filed: January 5, 2015
    Date of Patent: September 5, 2017
    Assignee: Facebook, Inc.
    Inventors: Alexander Waibel, Ian R. Lane
  • Patent number: 9747916
    Abstract: In a CELP-type speech coding apparatus, switching between an orthogonal search of a fixed codebook and a non-orthogonal search is performed in a practical and effective manner. The CELP-type speech coding apparatus includes a parameter quantizer that selects an adaptive codebook vector and a fixed codebook vector so as to minimize an error between a synthesized speech signal and an input speech signal. The parameter quantizer includes a fixed codebook searcher that switches between the orthogonal fixed codebook search and the non-orthogonal fixed codebook search based on a correlation value between a target vector for the fixed codebook search and the adaptive codebook vector obtained as a result of a synthesis filtering process.
    Type: Grant
    Filed: January 20, 2016
    Date of Patent: August 29, 2017
    Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
    Inventors: Hiroyuki Ehara, Takako Hori
  • Patent number: 9747896
    Abstract: In certain implementations, follow-up responses may be provided for prior natural language inputs of a user. As an example, a natural language input associated with a user may be received at a computer system. A determination of whether information sufficient for providing an adequate response to the natural language input is currently accessible to the computer system may be effectuated. A first response to the natural language input (that indicates that a follow-up response will be provided) may be provided based on a determination that information sufficient for providing an adequate response to the natural language input is not currently accessible. Information sufficient for providing an adequate response to the natural language input may be received. A second response to the natural language input may then be provided based on the received sufficient information.
    Type: Grant
    Filed: October 15, 2015
    Date of Patent: August 29, 2017
    Assignee: VoiceBox Technologies Corporation
    Inventors: Michael R. Kennewick, Jr., Michael R. Kennewick, Sr.
  • Patent number: 9741362
    Abstract: The present document relates to audio source coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR), as well as to digital effect processors, e.g. exciters, where generation of harmonic distortion adds brightness to the processed signal, and to time stretchers where a signal duration is prolonged with maintained spectral content. A system and method configured to generate a time stretched and/or frequency transposed signal from an input signal is described. The system comprises an analysis filterbank (101) configured to provide an analysis subband signal from the input signal; wherein the analysis subband signal comprises a plurality of complex valued analysis samples, each having a phase and a magnitude. Furthermore, the system comprises a subband processing unit (102) configured to determine a synthesis subband signal from the analysis subband signal using a subband transposition factor Q and a subband stretch factor S.
    Type: Grant
    Filed: August 2, 2016
    Date of Patent: August 22, 2017
    Assignee: Dolby International AB
    Inventor: Lars Villemoes
  • Patent number: 9734145
    Abstract: Disclosed are a word comfort/discomfort index prediction apparatus and method therefor. The word comfort/discomfort index prediction apparatus includes: a calculation unit calculating emotional associations between the word and one or more respective comparative words, calculating emotional associations between at least one predefined reference word and the one or more respective comparative words, and calculating an emotional similarity between the word and each of the at least one reference word; and a prediction unit predicting the comfort/discomfort index of the word by using the at least one emotional similarity and a predefined comfort/discomfort index of the at least one reference word.
    Type: Grant
    Filed: December 28, 2015
    Date of Patent: August 15, 2017
    Assignee: Foundation of Soongsil University-Industry Cooperation
    Inventors: Soowon Lee, Kangbok Lee
  • Patent number: 9721582
    Abstract: Existing post-filtering methods for microphone array speech enhancement have two common deficiencies. First, they assume that noise is either white or diffuse and cannot deal with point interferers. Second, they estimate the post-filter coefficients using only two microphones at a time, performing averaging over all the microphone pairs, yielding a suboptimal solution. The provided method describes a post-filtering solution that implements signal models which handle white noise, diffuse noise, and point interferers. The method also implements a globally optimized least-squares approach of microphones in a microphone array, providing a more optimal solution than existing conventional methods. Experimental results demonstrate the described method outperforming conventional methods in various acoustic scenarios.
    Type: Grant
    Filed: February 3, 2016
    Date of Patent: August 1, 2017
    Assignee: GOOGLE INC.
    Inventors: Yiteng Huang, Alejandro Luebs, Jan Skoglund, Willem Bastiaan Kleijn
  • Patent number: 9720901
    Abstract: A method for an automated text-evaluation service, and more particularly a method and apparatus for automatically evaluating text and returning a score which represents a degree of inappropriate language. The method is implemented in a computer infrastructure having computer executable code tangibly embodied in a computer readable storage medium having programming instructions. The programming instructions are configured to: receive an input text which comprises an unstructured message at a first computing device; process the input text according to a string-structure similarity measure which compares each word of the input text to a predefined dictionary to indicate whether there is similarity in meaning, and generate an evaluation score for each word of the input text and send the evaluation score to another computing device. The evaluation score for each input message is based on the string-structure similarity measure between each word of the input text and the predefined dictionary.
    Type: Grant
    Filed: November 19, 2015
    Date of Patent: August 1, 2017
    Assignee: KING ABDULAZIZ CITY FOR SCIENCE AND TECHNOLOGY
    Inventors: Abdulqadir A. Alaqeeli, Ibrahim Almosallam, Sakhar B. Alkhereyf, Suliman Alsowelim
  • Patent number: 9704491
    Abstract: Systems, methods and articles of manufacture for outputting an audio effect on a remote device are disclosed. Embodiments select a device from a plurality of devices within a physical environment for use in outputting an audio effect. Upon determining that transmitting the audio effect as uncompressed data to the selected device would violate a predefined performance criteria, the audio effect is modified by determining, for each of a plurality of portions of the audio effect, a respective priority. Additionally, upon determining that a first portion of the plurality of portions of the audio effect is a lower priority, relative to a second portion of the audio effect, embodiments compress the first portion of the audio effect, while the second portion of the audio effect remains uncompressed. The modified audio effect is then transmitted to the selected device for playback.
    Type: Grant
    Filed: February 11, 2015
    Date of Patent: July 11, 2017
    Assignee: Disney Enterprises, Inc.
    Inventors: Michael P. Goslin, Ron C. Fish, David J. Logan, Reek N. Havok, Susan M. Lum
  • Patent number: 9704489
    Abstract: A portable acoustic unit is adapted for insertion into an electrical receptacle. The portable acoustic unit has an integrated microphone and a wireless network interface to an automation controller. The portable acoustic unit detects spoken voice commands from users in the vicinity of the electrical receptacle. The portable acoustic unit merely plugs into a conventional electrical outlet to provide an extremely simple means of voice control through a home or business.
    Type: Grant
    Filed: November 20, 2015
    Date of Patent: July 11, 2017
    Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.
    Inventors: Nafiz Haider, Kristin Patterson, Thomas Risley, Curtis Stephenson, David Vaught, Ross Newman
  • Patent number: 9704487
    Abstract: Disclosed herein are speech recognition apparatuses, vehicles having the speech recognition apparatuses, and methods for controlling vehicles. According to an aspect, a speech recognition apparatus includes a speech input unit configured to receive a speech command from a user, a communication unit configured to receive the result of processing for speech recognition acquired by at least one user terminal located near the user, and a controller configured to compare the result of processing for speech recognition acquired from the speech command received by the speech input unit to the result of processing for speech recognition acquired by the at least one user terminal, thus processing the speech command according to the result of the comparison.
    Type: Grant
    Filed: November 30, 2015
    Date of Patent: July 11, 2017
    Assignee: Hyundai Motor Company
    Inventor: Kyu Hyung Lim
  • Patent number: 9704413
    Abstract: A method for scoring non-native speech includes receiving a speech sample spoken by a non-native speaker and performing automatic speech recognition and metric extraction on the speech sample to generate a transcript of the speech sample and a speech metric associated with the speech sample. The method further includes determining whether the speech sample is scorable or non-scorable based upon the transcript and speech metric, where the determination is based on an audio quality of the speech sample, an amount of speech of the speech sample, a degree to which the speech sample is off-topic, whether the speech sample includes speech from an incorrect language, or whether the speech sample includes plagiarized material. When the sample is determined to be non-scorable, an indication of non-scorability is associated with the speech sample. When the sample is determined to be scorable, the sample is provided to a scoring model for scoring.
    Type: Grant
    Filed: March 23, 2015
    Date of Patent: July 11, 2017
    Assignee: Educational Testing Service
    Inventors: Su-Youn Yoon, Derrick Higgins, Klaus Zechner, Shasha Xie, Je Hun Jeon, Keelan Evanini, Guangming Ling, Isaac Bejar
  • Patent number: 9704506
    Abstract: Devices, systems and methods are disclosed for reducing noise in input data by performing a hysteresis operation followed by a lateral excitation smoothing operation. For example, an audio signal may be represented as a sequence of feature vectors. A row of the sequence of feature vectors may, for example, be associated with the same harmonic of the audio signal at different points in time. To determine portions of the row that correspond to the harmonic being present, the system may compare an amplitude to a low threshold and a high threshold and select a series of data points that are above the low threshold and include at least one data point above the high threshold. The system may iteratively perform a spreading technique, spreading a center value of a center data point in a kernel to neighboring data points in the kernel, to further reduce noise.
    Type: Grant
    Filed: January 9, 2017
    Date of Patent: July 11, 2017
    Assignee: KNUEDGE, INC.
    Inventors: David C. Bradley, Yao Huang Morin
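The hysteresis step described in the abstract above (keep a run of data points that stay above the low threshold only if at least one of them exceeds the high threshold) can be sketched as follows. This is an illustrative reading of the abstract, not the patented implementation; the function name `hysteresis_mask` and the sample values are hypothetical, and the lateral excitation smoothing step is not shown.

```python
from typing import List, Sequence


def hysteresis_mask(row: Sequence[float], low: float, high: float) -> List[bool]:
    """Mark runs of values above `low` that contain at least one value above `high`."""
    n = len(row)
    mask = [False] * n
    i = 0
    while i < n:
        if row[i] > low:
            # Start of a candidate run: scan until the values drop to `low` or below.
            start = i
            has_peak = False
            while i < n and row[i] > low:
                if row[i] > high:
                    has_peak = True
                i += 1
            # Keep the whole run only if it crossed the high threshold somewhere.
            if has_peak:
                for j in range(start, i):
                    mask[j] = True
        else:
            i += 1
    return mask


# One row of amplitudes for a single harmonic over time (made-up values).
amplitudes = [0.1, 0.4, 0.9, 0.5, 0.1, 0.4, 0.45, 0.1]
selected = hysteresis_mask(amplitudes, low=0.3, high=0.8)
```

In this example the first run (indices 1 to 3) is kept because it contains 0.9, while the second run (indices 5 and 6) is dropped because it never exceeds the high threshold.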
  • Patent number: 9690781
    Abstract: A language setting system, for use by a user having a native language and carrying a personal electronic device, when using an informational device having an active language setting. As the user approaches the informational device, the informational device determines that the personal electronic device is associated with the user. The system determines the native language of the user by contacting the personal electronic device and sets the active language of the informational device to the native language. The informational device provides messages and information to the user in the native language of the user. When the user retreats from the informational device, the active language of the informational device may be reset to a default language.
    Type: Grant
    Filed: March 31, 2015
    Date of Patent: June 27, 2017
    Inventor: James E. Niles
  • Patent number: 9691398
    Abstract: The embodiments of the present invention improve conventional attenuation schemes by replacing constant attenuation with an adaptive attenuation scheme that allows more aggressive attenuation, without introducing audible change of signal frequency characteristics.
    Type: Grant
    Filed: November 16, 2016
    Date of Patent: June 27, 2017
    Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)
    Inventors: Sebastian Näslund, Volodya Grancharov, Erik Norvell
  • Patent number: 9672202
    Abstract: Various components provide options to re-format an input based on one or more contexts. The input is received that has been submitted to an application (e.g., messaging application, mobile application, word-processing application, web browser, search tool, etc.), and one or more outputs are identified that are possibilities to be provided as options for re-formatting. A respective score of each output is determined by applying a statistical model to a respective combination of the input and each output, the respective score comprising a plurality of context scores that quantify a plurality of contexts of the respective combination. Exemplary contexts include historical-user contexts, domain contexts, and general contexts. One or more suggested outputs are selected from among the one or more outputs based on the respective scores and are provided as options to re-format the input.
    Type: Grant
    Filed: March 20, 2014
    Date of Patent: June 6, 2017
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Issac Alphonso, Nick Kibre, Michael Levit, Sarangarajan Parthasarathy
  • Patent number: 9645993
    Abstract: A method and system for facilitating a semantic search based on one or more corpuses of natural language texts are provided. One or more corpuses of natural language texts are received including indexed linguistic parameters and semantic structures of lexical units. The linguistic parameters and semantic structures are generated during a preliminary syntactico-semantic analysis. Searching for text fragments satisfying a query in the one or more corpuses is performed. Relevance of the search results is estimated.
    Type: Grant
    Filed: December 27, 2013
    Date of Patent: May 9, 2017
    Assignee: ABBYY InfoPoisk LLC
    Inventors: Konstantin Zuev, Tatiana Danielyan, Elmira Rakhmatulina