Patents Examined by Yi-Sheng Wang
  • Patent number: 10014005
    Abstract: Embodiments are described for harmonicity estimation, audio classification, pitch determination and noise estimation. Measuring harmonicity of an audio signal includes calculation a log amplitude spectrum of audio signal. A first spectrum is derived by calculating each component of the first spectrum as a sum of components of the log amplitude spectrum on frequencies. In linear frequency scale, the frequencies are odd multiples of the component's frequency of the first spectrum. A second spectrum is derived by calculating each component of the second spectrum as a sum of components of the log amplitude spectrum on frequencies. In linear frequency scale, the frequencies are even multiples of the component's frequency of the second spectrum. A difference spectrum is derived subtracting the first spectrum from the second spectrum. A measure of harmonicity is generated as a monotonically increasing function of the maximum component of the difference spectrum within predetermined frequency range.
    Type: Grant
    Filed: March 21, 2013
    Date of Patent: July 3, 2018
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Xuejing Sun, Zhiwei Shuang, Shen Huang
  • Patent number: 10008216
    Abstract: Method and apparatus for reducing a size of databases required for recorded speech data.
    Type: Grant
    Filed: April 15, 2014
    Date of Patent: June 26, 2018
    Assignee: SPEECH MORPHING SYSTEMS, INC.
    Inventors: Fathy Yassa, Benjamin Reaves, Steve Pearson
  • Patent number: 9977684
    Abstract: The disclosure generally describes computer-implemented methods, software, and systems for self-learning localization services. A computer-implemented method includes: identifying, at a location remote from a first application, a request for localization of a string value associated with the first application from a source language to a target language, sending the string value to a translation request buffer in response to a determination that the localization of the string value in the target language is unavailable, and triggering, in response to satisfaction of at least one heuristic analysis, a translation process of the string value from the source language into the target language where the string value is retrieved from the translation request buffer. In some instances, the location remove from the first application is a centralized localization service accessible by remote requests from a plurality of applications.
    Type: Grant
    Filed: June 12, 2013
    Date of Patent: May 22, 2018
    Assignee: SAP SE
    Inventors: Alexey Arseniev, Felix F. Hoefer
  • Patent number: 9978362
    Abstract: A “Facet Recommender” creates conversational recommendations for facets of particular conversational topics, and optionally for things associated with those facets, from consumer reviews or other social media content. The Facet Recommender applies a machine-learned facet model and optional sentiment-model, to identify facets associated with spans or segments of the content and to determine neutral, positive, or negative consumer sentiment associated with those facets and, optionally, things associated with those facets. These facets are selected by the facet model from a list or set of manually defined or machine-learned facets for particular conversational topic types. The Facet Recommender then generates new conversational utterances (i.e., short neutral, positive or negative suggestions) about particular facets based on the sentiments associated with those facets. In various implementations, utterances are fit to one or more predefined conversational frameworks.
    Type: Grant
    Filed: September 2, 2014
    Date of Patent: May 22, 2018
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Bill Dolan, Margaret Mitchell, Jay Banerjee, Pallavi Choudhury, Susan Hendrich, Rebecca Mason, Ron Owens, Mouni Reddy, Yaxiao Song, Kristina Toutanova, Liang Xu, Xuetao Yin
  • Patent number: 9977826
    Abstract: A computerized method for generating and evaluating natural language-generated text involves receiving, in a computer, data input by a user, generating, using a natural language generation technique, multiple instances of text stories based upon both contents of a corpus and the received data; analyzing the multiple instances of text stories as a weighted combination of computed geographic scores, distance scores, information content scores, replacement scores and extra aspect scores, providing a ranked set of the generated text stories to a user, receiving a selection of one of the text stories in the ranked set, and storing the selected story.
    Type: Grant
    Filed: October 21, 2015
    Date of Patent: May 22, 2018
    Assignee: Cloudera, Inc.
    Inventors: Micha Gorelick, Hilary Mason, Grant Custer
  • Patent number: 9922655
    Abstract: A computer speech output control method, system, and non-transitory computer readable medium, include a computer speech output control system, including a computer speech output unit configured to output a computer speech, a human speech monitoring circuit configured to determine whether a human conversation is occurring, an interruption priority setting circuit configured to set a priority setting for when the human conversation can be interrupted by the computer speech, and an interruption determining circuit configured to determine whether to cause the computer speech output unit to output the computer speech based on the priority setting and a status of the human conversation.
    Type: Grant
    Filed: May 31, 2016
    Date of Patent: March 20, 2018
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Christopher J. Hardee, Steven Robert Joroff, Pamela Ann Nesbitt, Scott Edward Schneider
  • Patent number: 9892733
    Abstract: An exemplary computer system configured to user multiple automatic speech recognizers (ASRs) with a plurality of language and acoustic models to increase the accuracy of speech recognition.
    Type: Grant
    Filed: May 20, 2014
    Date of Patent: February 13, 2018
    Assignee: SPEECH MORPHING SYSTEMS, INC.
    Inventor: Fathy Yassa
  • Patent number: 9886237
    Abstract: A text-reading device includes: a visual line direction detection device for a driver; a memory that stores the visual line direction when the driver looks at a display device; a gaze determination device that determines that the driver gazes the display device when a state that the detected visual line direction coincides with the stored visual line direction continues for predetermined time or longer; a voice conversion device that outputs text information of the display device as a voice signal based on an instruction; and a reading control device that inputs the instruction when the driver gazes the display device while the display device displays the text information, and the vehicle starts to move.
    Type: Grant
    Filed: October 2, 2013
    Date of Patent: February 6, 2018
    Assignee: DENSO CORPORATION
    Inventors: Kensuke Suzuki, Yuji Shinkai
  • Patent number: 9865278
    Abstract: A frequency domain converter is configured to create a plurality of pieces of frequency domain information by individually converting a plurality of input audio signals, which is acquired at different positions, into frequency domain information. A relative value calculator is configured to calculate a relative value of time frequency components of at least one set of frequency domain information among the plurality of pieces of frequency domain information. A signal determiner is configured to determine whether or not each of the input audio signals includes an audio signal component, which is emitted from a predetermined position, based on whether or not the relative value is included in a range specified and based on a relative threshold value stored in a memory in advance.
    Type: Grant
    Filed: March 1, 2016
    Date of Patent: January 9, 2018
    Assignee: JVC KENWOOD CORPORATION
    Inventor: Masato Sugano
  • Patent number: 9852131
    Abstract: Computer-implemented techniques can include receiving a selected word in a source language, obtaining one or more parts of speech for the selected word, and for each of the one or more parts-of-speech, obtaining candidate translations of the selected word to a different target language, each candidate translation corresponding to a particular semantic meaning of the selected word. The techniques can include for each semantic meaning of the selected word: obtaining an image corresponding to the semantic meaning of the selected word, and compiling translation information including (i) the semantic meaning, (ii) a corresponding part-of-speech, (iii) the image, and (iv) at least one corresponding candidate translation. The techniques can also include outputting the translation information.
    Type: Grant
    Filed: May 18, 2015
    Date of Patent: December 26, 2017
    Assignee: GOOGLE LLC
    Inventors: Alexander Jay Cuthbert, Barak Turovsky
  • Patent number: 9805734
    Abstract: From a mixed signal in which a first signal and a second signal are mixed, the second signal is removed at low processing cost and without delay. As a result, an estimated first signal which has low residue of the second signal and low distortion is obtained. An estimated first signal is generated by subtracting a pseudo second signal which is estimated to be mixed in a first mixed signal in which a first signal and a second signal are mixed from the first mixed signal. The pseudo second signal is obtained by a first adaptive filter using a second mixed signal in which the first signal and the second signal are mixed in a different proportion from the first mixed signal. A coefficient update amount of the first adaptive filter is made smaller as compared with a case when the estimated first signal is smaller than the first mixed signal, in case the estimated first signal is larger than the first mixed signal.
    Type: Grant
    Filed: September 15, 2011
    Date of Patent: October 31, 2017
    Assignee: NEC CORPORATION
    Inventor: Akihiko Sugiyama
  • Patent number: 9792280
    Abstract: Mechanisms are provided for performing context based synonym filtering for natural language processing. Content is parsed into one or more conceptual units, wherein each conceptual unit comprises a portion of text of the content that is associated with a single concept. For each conceptual unit, a term in the conceptual unit is identified that has a synonym to be utilized during natural language processing of the content. A first measure of relatedness of the term to at least one other term in the conceptual unit is determined. A second measure of relatedness of the synonym of the term to the at least one other term in the conceptual unit is determined. A determination whether or not to utilize the synonym when performing natural language processing on the conceptual unit is made based on the first and second measures of relatedness and natural language processing on the content is performed accordingly.
    Type: Grant
    Filed: June 3, 2016
    Date of Patent: October 17, 2017
    Assignee: International Business Machines Corporation
    Inventors: Kay Mueller, Christopher M. Nolan, William G. Visotski, David E. Wilson
  • Patent number: 9767804
    Abstract: A method of utilizing a speech assistant, the speech assistant designed to provide a voice input and speech output capability, the method comprising, enabling the use of the speech assistant for communication with a user, and terminating the speech assistant when the communication is complete. The method further comprises receiving a notification from a native application associated with the communication, and activating a sub-portion of the speech assistant, to enable outputting of the notification using speech output, thereby enabling the use of speech output for periodic announcements without enabling the speech assistant.
    Type: Grant
    Filed: August 16, 2016
    Date of Patent: September 19, 2017
    Assignee: Nuance Communications, Inc.
    Inventors: Elizabeth A. Dykstra-Erickson, Jared L. Strawderman
  • Patent number: 9761227
    Abstract: Methods described herein provide functionality for automatic speech recognition (ASR). One such embodiment performs speech recognition using received speech recognition result candidates, where the received candidates were generated by performing Statistical Language Model (SLM) based speech recognition on one or more frames of audio data. In turn, such an embodiment transmits results of the speech recognition, performed using the received speech recognition result candidates, to a user device via a communications network. Results of the speech recognition are available with lower latency than pure cloud based ASR solutions.
    Type: Grant
    Filed: May 26, 2016
    Date of Patent: September 12, 2017
    Assignee: Nuance Communications, Inc.
    Inventors: Carl Benjamin Quillen, Naveen Parihar
  • Patent number: 9754591
    Abstract: Features are disclosed for performing functions in response to user requests based on contextual data regarding prior user requests. Users may engage in conversations with a computing device in order to initiate some function or obtain some information. A dialog manager may manage the conversations and store contextual data regarding one or more of the conversations. Processing and responding to subsequent conversations may benefit from the previously stored contextual data by, e.g., reducing the amount of information that a user must provide if the user has already provided the information in the context of a prior conversation. Additional information associated with performing functions responsive to user requests may be shared among applications, further improving efficiency and enhancing the user experience.
    Type: Grant
    Filed: November 18, 2013
    Date of Patent: September 5, 2017
    Assignee: Amazon Technologies, Inc.
    Inventors: Nishant Kumar, David Robert Thomas, Sumedha Arvind Kshirsagar, Vikas Jain, Jeff Bradley Beal, Ajay Gopalakrishnan, Shishir Sridhar Bharathi
  • Patent number: 9728185
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for recognizing speech using neural networks. One of the methods includes receiving an audio input; processing the audio input using an acoustic model to generate a respective phoneme score for each of a plurality of phoneme labels; processing one or more of the phoneme scores using an inverse pronunciation model to generate a respective grapheme score for each of a plurality of grapheme labels; and processing one or more of the grapheme scores using a language model to generate a respective text label score for each of a plurality of text labels.
    Type: Grant
    Filed: May 22, 2015
    Date of Patent: August 8, 2017
    Assignee: Google Inc.
    Inventors: Johan Schalkwyk, Francoise Beaufays, Hasim Sak, John Giannandrea
  • Patent number: 9716901
    Abstract: Methods and systems are provided for separating signal-correlated and signal-uncorrelated error components in quantization noise. Such separation leads to a generalization of the conventional rate-distortion optimization problem. For the commonly used assumption of a Gaussian process, a quantizer according to this principle is implemented in a straightforward manner using a dithered quantizer and appropriate pre-filters and post-filters. If the penalization of the signal-uncorrelated error component is increased over that of the signal-correlated error component, then the pre-filter emphasizes the signal spectrum more, reducing the differential entropy rate of the pre-filtered signal. Accordingly, the signal-uncorrelated noise is reduced for a given rate.
    Type: Grant
    Filed: April 3, 2013
    Date of Patent: July 25, 2017
    Assignee: Google Inc.
    Inventor: Willem Bastiaan Kleijn
  • Patent number: 9697840
    Abstract: The present document relates to methods and systems for music information retrieval (MIR). In particular, the present document relates to methods and systems for extracting a chroma vector from an audio signal. A method (900) for determining a chroma vector (100) for a block of samples of an audio signal (301) is described. The method (900) comprises receiving (901) a corresponding block of frequency coefficients derived from the block of samples of the audio signal (301) from a core encoder (412) of a spectral band replication based audio encoder (410) adapted to generate an encoded bitstream (305) of the audio signal (301) from the block of frequency coefficients; and determining (904) the chroma vector (100) for the block of samples of the audio signal (301) based on the received block of frequency coefficients.
    Type: Grant
    Filed: November 28, 2012
    Date of Patent: July 4, 2017
    Assignee: Dolby International AB
    Inventors: Arijit Biswas, Marco Fink, Michael Schug
  • Patent number: 9672207
    Abstract: A method, system, and non-transitory compute readable medium determining and discerning items with multiple meanings in a sequence of items including producing a distributed representation for each item of the sequence of items including a word vector and a context vector, partitioning the sequence of items into classes, for an item using a representative word vector of each class, calculating a cosine distance between the word vector of said item and the class representative vector, and producing a new sequence of items by modifying the distributed representation in the producing by replacing each occurrence of an item depending on the cosine distance calculated by the calculating.
    Type: Grant
    Filed: October 19, 2015
    Date of Patent: June 6, 2017
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventor: Oded Shmueli
  • Patent number: 9666211
    Abstract: There is provided an information processing apparatus including an information acquiring unit that acquires information to identify an editing point of content including a voice, on the basis of language analysis of the content, and an information output unit that outputs the acquired information.
    Type: Grant
    Filed: June 6, 2013
    Date of Patent: May 30, 2017
    Assignee: SONY CORPORATION
    Inventor: Takashi Kuwabara