Patents Examined by Yi-Sheng Wang

Harmonicity estimation, audio classification, pitch determination and noise estimation

Patent number: 10014005

Abstract: Embodiments are described for harmonicity estimation, audio classification, pitch determination and noise estimation. Measuring harmonicity of an audio signal includes calculating a log amplitude spectrum of the audio signal. A first spectrum is derived by calculating each component of the first spectrum as a sum of components of the log amplitude spectrum on frequencies. In linear frequency scale, the frequencies are odd multiples of the component's frequency of the first spectrum. A second spectrum is derived by calculating each component of the second spectrum as a sum of components of the log amplitude spectrum on frequencies. In linear frequency scale, the frequencies are even multiples of the component's frequency of the second spectrum. A difference spectrum is derived by subtracting the first spectrum from the second spectrum. A measure of harmonicity is generated as a monotonically increasing function of the maximum component of the difference spectrum within a predetermined frequency range.
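
The steps in this abstract can be sketched in a few lines of Python. This is an illustrative reading of the abstract only, not the patented implementation: the function names, the naive DFT, the `floor` constant, the choice to use an equal number of odd and even multiples per candidate bin, and the use of the identity as the monotonically increasing function are all assumptions made for the example.

```python
import cmath
import math


def log_amplitude_spectrum(samples, floor=1e-9):
    """Log amplitude spectrum via a naive DFT (an FFT would be used in practice).

    `floor` is an assumed small constant that keeps log() finite.
    """
    n = len(samples)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                  for i, x in enumerate(samples))
        spectrum.append(math.log(abs(acc) + floor))
    return spectrum


def harmonicity(log_amp, k_min, k_max):
    """Maximum of (even-multiple sum - odd-multiple sum) over bins k_min..k_max.

    Assumption: each candidate bin k uses the same number of odd and even
    multiples, so the two sums are directly comparable. The identity is used
    as the monotonically increasing function of the maximum.
    """
    best = -math.inf
    top = len(log_amp) - 1
    for k in range(k_min, k_max + 1):
        pairs = top // (2 * k)  # number of (odd, even) multiple pairs that fit
        if pairs == 0:
            continue
        first = sum(log_amp[(2 * m - 1) * k] for m in range(1, pairs + 1))   # odd multiples
        second = sum(log_amp[2 * m * k] for m in range(1, pairs + 1))        # even multiples
        best = max(best, second - first)  # difference spectrum component
    return best


# Demo: a harmonic tone (harmonics on bins 16, 32, ..., 112) versus a click,
# whose amplitude spectrum is flat.
N = 256
tone = [sum(math.sin(2 * math.pi * 16 * h * i / N) for h in range(1, 8))
        for i in range(N)]
click = [1.0] + [0.0] * (N - 1)

tone_score = harmonicity(log_amplitude_spectrum(tone), 4, 16)
click_score = harmonicity(log_amplitude_spectrum(click), 4, 16)
```

In this toy setup the tone scores far higher than the click, because at half the fundamental the even multiples land on spectral peaks while the odd multiples land between them.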

Type: Grant

Filed: March 21, 2013

Date of Patent: July 3, 2018

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Xuejing Sun, Zhiwei Shuang, Shen Huang

Method and apparatus for exemplary morphing computer system background

Patent number: 10008216

Abstract: Method and apparatus for reducing a size of databases required for recorded speech data.

Type: Grant

Filed: April 15, 2014

Date of Patent: June 26, 2018

Assignee: SPEECH MORPHING SYSTEMS, INC.

Inventors: Fathy Yassa, Benjamin Reaves, Steve Pearson

Self-learning localization service

Patent number: 9977684

Abstract: The disclosure generally describes computer-implemented methods, software, and systems for self-learning localization services. A computer-implemented method includes: identifying, at a location remote from a first application, a request for localization of a string value associated with the first application from a source language to a target language, sending the string value to a translation request buffer in response to a determination that the localization of the string value in the target language is unavailable, and triggering, in response to satisfaction of at least one heuristic analysis, a translation process of the string value from the source language into the target language where the string value is retrieved from the translation request buffer. In some instances, the location remote from the first application is a centralized localization service accessible by remote requests from a plurality of applications.
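
The request, buffer, and trigger flow in this abstract can be illustrated with a small Python sketch. Everything named here is hypothetical: the `LocalizationService` class, the injected `translate` callable, and the "buffer has reached `batch_size` entries" rule, which stands in for whatever heuristic analysis the patent actually uses.

```python
from typing import Callable, Dict, List, Optional, Tuple

Key = Tuple[str, str, str]  # (string value, source language, target language)


class LocalizationService:
    """Toy centralized service: serve known localizations, buffer unknown ones."""

    def __init__(self, translate: Callable[[str, str, str], str],
                 batch_size: int = 2) -> None:
        self._translate = translate      # hypothetical translation backend
        self._batch_size = batch_size    # assumed heuristic: flush at this size
        self._known: Dict[Key, str] = {}
        self._buffer: List[Key] = []     # the translation request buffer

    def localize(self, text: str, source: str, target: str) -> Optional[str]:
        key = (text, source, target)
        if key in self._known:
            return self._known[key]
        # Localization unavailable: send the string to the request buffer.
        if key not in self._buffer:
            self._buffer.append(key)
        # Trigger translation once the (assumed) heuristic is satisfied.
        if len(self._buffer) >= self._batch_size:
            self._flush()
        return self._known.get(key)

    def _flush(self) -> None:
        # Translate every buffered string, then empty the buffer.
        for text, source, target in self._buffer:
            self._known[(text, source, target)] = self._translate(text, source, target)
        self._buffer.clear()


# Demo with a stand-in translator that only tags the string.
service = LocalizationService(lambda text, src, tgt: f"[{tgt}] {text}", batch_size=2)
first = service.localize("Save", "en", "de")    # buffered, nothing available yet
second = service.localize("Open", "en", "de")   # buffer is full, translation runs
third = service.localize("Save", "en", "de")    # now served from the store
```

The first request returns nothing because the string is only buffered; the second fills the buffer and triggers translation, so later requests for either string are answered directly.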

Type: Grant

Filed: June 12, 2013

Date of Patent: May 22, 2018

Assignee: SAP SE

Inventors: Alexey Arseniev, Felix F. Hoefer

Facet recommendations from sentiment-bearing content

Patent number: 9978362

Abstract: A “Facet Recommender” creates conversational recommendations for facets of particular conversational topics, and optionally for things associated with those facets, from consumer reviews or other social media content. The Facet Recommender applies a machine-learned facet model and optional sentiment-model, to identify facets associated with spans or segments of the content and to determine neutral, positive, or negative consumer sentiment associated with those facets and, optionally, things associated with those facets. These facets are selected by the facet model from a list or set of manually defined or machine-learned facets for particular conversational topic types. The Facet Recommender then generates new conversational utterances (i.e., short neutral, positive or negative suggestions) about particular facets based on the sentiments associated with those facets. In various implementations, utterances are fit to one or more predefined conversational frameworks.

Type: Grant

Filed: September 2, 2014

Date of Patent: May 22, 2018

Assignee: Microsoft Technology Licensing, LLC

Inventors: Bill Dolan, Margaret Mitchell, Jay Banerjee, Pallavi Choudhury, Susan Hendrich, Rebecca Mason, Ron Owens, Mouni Reddy, Yaxiao Song, Kristina Toutanova, Liang Xu, Xuetao Yin

Computerized method of generating and analytically evaluating multiple instances of natural language-generated text

Patent number: 9977826

Abstract: A computerized method for generating and evaluating natural language-generated text involves receiving, in a computer, data input by a user, generating, using a natural language generation technique, multiple instances of text stories based upon both contents of a corpus and the received data; analyzing the multiple instances of text stories as a weighted combination of computed geographic scores, distance scores, information content scores, replacement scores and extra aspect scores, providing a ranked set of the generated text stories to a user, receiving a selection of one of the text stories in the ranked set, and storing the selected story.

Type: Grant

Filed: October 21, 2015

Date of Patent: May 22, 2018

Assignee: Cloudera, Inc.

Inventors: Micha Gorelick, Hilary Mason, Grant Custer

System, method, and recording medium for controlling dialogue interruptions by a speech output device

Patent number: 9922655

Abstract: A computer speech output control method, system, and non-transitory computer readable medium, include a computer speech output control system, including a computer speech output unit configured to output a computer speech, a human speech monitoring circuit configured to determine whether a human conversation is occurring, an interruption priority setting circuit configured to set a priority setting for when the human conversation can be interrupted by the computer speech, and an interruption determining circuit configured to determine whether to cause the computer speech output unit to output the computer speech based on the priority setting and a status of the human conversation.

Type: Grant

Filed: May 31, 2016

Date of Patent: March 20, 2018

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Christopher J. Hardee, Steven Robert Joroff, Pamela Ann Nesbitt, Scott Edward Schneider

Method and apparatus for an exemplary automatic speech recognition system

Patent number: 9892733

Abstract: An exemplary computer system configured to use multiple automatic speech recognizers (ASRs) with a plurality of language and acoustic models to increase the accuracy of speech recognition.

Type: Grant

Filed: May 20, 2014

Date of Patent: February 13, 2018

Assignee: SPEECH MORPHING SYSTEMS, INC.

Inventor: Fathy Yassa

Text-reading device and text-reading method

Patent number: 9886237

Abstract: A text-reading device includes: a visual line direction detection device for a driver; a memory that stores the visual line direction when the driver looks at a display device; a gaze determination device that determines that the driver gazes at the display device when a state in which the detected visual line direction coincides with the stored visual line direction continues for a predetermined time or longer; a voice conversion device that outputs text information of the display device as a voice signal based on an instruction; and a reading control device that inputs the instruction when the driver gazes at the display device while the display device displays the text information, and the vehicle starts to move.

Type: Grant

Filed: October 2, 2013

Date of Patent: February 6, 2018

Assignee: DENSO CORPORATION

Inventors: Kensuke Suzuki, Yuji Shinkai

Audio signal processing device, audio signal processing method, and audio signal processing program

Patent number: 9865278

Abstract: A frequency domain converter is configured to create a plurality of pieces of frequency domain information by individually converting a plurality of input audio signals, which is acquired at different positions, into frequency domain information. A relative value calculator is configured to calculate a relative value of time frequency components of at least one set of frequency domain information among the plurality of pieces of frequency domain information. A signal determiner is configured to determine whether or not each of the input audio signals includes an audio signal component, which is emitted from a predetermined position, based on whether or not the relative value is included in a range specified and based on a relative threshold value stored in a memory in advance.

Type: Grant

Filed: March 1, 2016

Date of Patent: January 9, 2018

Assignee: JVC KENWOOD CORPORATION

Inventor: Masato Sugano

Techniques for providing visual translation cards including contextually relevant definitions and examples

Patent number: 9852131

Abstract: Computer-implemented techniques can include receiving a selected word in a source language, obtaining one or more parts of speech for the selected word, and for each of the one or more parts-of-speech, obtaining candidate translations of the selected word to a different target language, each candidate translation corresponding to a particular semantic meaning of the selected word. The techniques can include for each semantic meaning of the selected word: obtaining an image corresponding to the semantic meaning of the selected word, and compiling translation information including (i) the semantic meaning, (ii) a corresponding part-of-speech, (iii) the image, and (iv) at least one corresponding candidate translation. The techniques can also include outputting the translation information.

Type: Grant

Filed: May 18, 2015

Date of Patent: December 26, 2017

Assignee: GOOGLE LLC

Inventors: Alexander Jay Cuthbert, Barak Turovsky

Signal processing device, signal processing method and signal processing program for noise cancellation

Patent number: 9805734

Abstract: From a mixed signal in which a first signal and a second signal are mixed, the second signal is removed at low processing cost and without delay. As a result, an estimated first signal which has low residue of the second signal and low distortion is obtained. An estimated first signal is generated by subtracting a pseudo second signal which is estimated to be mixed in a first mixed signal in which a first signal and a second signal are mixed from the first mixed signal. The pseudo second signal is obtained by a first adaptive filter using a second mixed signal in which the first signal and the second signal are mixed in a different proportion from the first mixed signal. When the estimated first signal is larger than the first mixed signal, the coefficient update amount of the first adaptive filter is made smaller than when the estimated first signal is smaller than the first mixed signal.
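
The update rule described here can be illustrated with a short Python sketch. The abstract does not name the adaptation algorithm, so a plain LMS update is assumed. Comparing signals by absolute sample value, the `reduced_scale` factor, and all function and parameter names are likewise assumptions for illustration, not details taken from the patent.

```python
import math
import random


def cancel_noise(primary, reference, taps=4, step=0.05, reduced_scale=0.1):
    """Subtract an adaptively filtered reference from the primary input.

    primary   : first mixed signal (mostly the wanted first signal plus noise)
    reference : second mixed signal (mostly the unwanted second signal)
    Returns (estimated first signal, final filter weights).
    """
    weights = [0.0] * taps
    history = [0.0] * taps          # most recent reference samples, newest first
    estimates = []
    for x1, x2 in zip(primary, reference):
        history = [x2] + history[:-1]
        pseudo = sum(w * h for w, h in zip(weights, history))  # pseudo second signal
        estimate = x1 - pseudo                                  # estimated first signal
        # Assumed reading of the abstract: shrink the coefficient update when
        # the estimate is larger in magnitude than the mixed input.
        mu = step * reduced_scale if abs(estimate) > abs(x1) else step
        weights = [w + mu * estimate * h for w, h in zip(weights, history)]
        estimates.append(estimate)
    return estimates, weights


# Demo: a quiet sine (first signal) buried in noise that leaks in with gain 0.8.
random.seed(0)
n = 3000
noise = [random.uniform(-1.0, 1.0) for _ in range(n)]
wanted = [0.1 * math.sin(2 * math.pi * 0.01 * i) for i in range(n)]
mixed = [s + 0.8 * v for s, v in zip(wanted, noise)]

estimated, final_weights = cancel_noise(mixed, noise)
```

In the demo the reference is the noise itself, so the first filter weight should settle near the leakage gain of 0.8 while the remaining weights stay near zero.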

Type: Grant

Filed: September 15, 2011

Date of Patent: October 31, 2017

Assignee: NEC CORPORATION

Inventor: Akihiko Sugiyama

Context based synonym filtering for natural language processing systems

Patent number: 9792280

Abstract: Mechanisms are provided for performing context based synonym filtering for natural language processing. Content is parsed into one or more conceptual units, wherein each conceptual unit comprises a portion of text of the content that is associated with a single concept. For each conceptual unit, a term in the conceptual unit is identified that has a synonym to be utilized during natural language processing of the content. A first measure of relatedness of the term to at least one other term in the conceptual unit is determined. A second measure of relatedness of the synonym of the term to the at least one other term in the conceptual unit is determined. A determination whether or not to utilize the synonym when performing natural language processing on the conceptual unit is made based on the first and second measures of relatedness and natural language processing on the content is performed accordingly.

Type: Grant

Filed: June 3, 2016

Date of Patent: October 17, 2017

Assignee: International Business Machines Corporation

Inventors: Kay Mueller, Christopher M. Nolan, William G. Visotski, David E. Wilson

Reducing speech session resource use in a speech assistant

Patent number: 9767804

Abstract: A method of utilizing a speech assistant, the speech assistant designed to provide a voice input and speech output capability, the method comprising, enabling the use of the speech assistant for communication with a user, and terminating the speech assistant when the communication is complete. The method further comprises receiving a notification from a native application associated with the communication, and activating a sub-portion of the speech assistant, to enable outputting of the notification using speech output, thereby enabling the use of speech output for periodic announcements without enabling the speech assistant.

Type: Grant

Filed: August 16, 2016

Date of Patent: September 19, 2017

Assignee: Nuance Communications, Inc.

Inventors: Elizabeth A. Dykstra-Erickson, Jared L. Strawderman

Method and system for hybrid decoding for enhanced end-user privacy and low latency

Patent number: 9761227

Abstract: Methods described herein provide functionality for automatic speech recognition (ASR). One such embodiment performs speech recognition using received speech recognition result candidates, where the received candidates were generated by performing Statistical Language Model (SLM) based speech recognition on one or more frames of audio data. In turn, such an embodiment transmits results of the speech recognition, performed using the received speech recognition result candidates, to a user device via a communications network. Results of the speech recognition are available with lower latency than pure cloud based ASR solutions.

Type: Grant

Filed: May 26, 2016

Date of Patent: September 12, 2017

Assignee: Nuance Communications, Inc.

Inventors: Carl Benjamin Quillen, Naveen Parihar

Dialog management context sharing

Patent number: 9754591

Abstract: Features are disclosed for performing functions in response to user requests based on contextual data regarding prior user requests. Users may engage in conversations with a computing device in order to initiate some function or obtain some information. A dialog manager may manage the conversations and store contextual data regarding one or more of the conversations. Processing and responding to subsequent conversations may benefit from the previously stored contextual data by, e.g., reducing the amount of information that a user must provide if the user has already provided the information in the context of a prior conversation. Additional information associated with performing functions responsive to user requests may be shared among applications, further improving efficiency and enhancing the user experience.

Type: Grant

Filed: November 18, 2013

Date of Patent: September 5, 2017

Assignee: Amazon Technologies, Inc.

Inventors: Nishant Kumar, David Robert Thomas, Sumedha Arvind Kshirsagar, Vikas Jain, Jeff Bradley Beal, Ajay Gopalakrishnan, Shishir Sridhar Bharathi

Recognizing speech using neural networks

Patent number: 9728185

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for recognizing speech using neural networks. One of the methods includes receiving an audio input; processing the audio input using an acoustic model to generate a respective phoneme score for each of a plurality of phoneme labels; processing one or more of the phoneme scores using an inverse pronunciation model to generate a respective grapheme score for each of a plurality of grapheme labels; and processing one or more of the grapheme scores using a language model to generate a respective text label score for each of a plurality of text labels.

Type: Grant

Filed: May 22, 2015

Date of Patent: August 8, 2017

Assignee: Google Inc.

Inventors: Johan Schalkwyk, Francoise Beaufays, Hasim Sak, John Giannandrea

Quantization with distinct weighting of coherent and incoherent quantization error

Patent number: 9716901

Abstract: Methods and systems are provided for separating signal-correlated and signal-uncorrelated error components in quantization noise. Such separation leads to a generalization of the conventional rate-distortion optimization problem. For the commonly used assumption of a Gaussian process, a quantizer according to this principle is implemented in a straightforward manner using a dithered quantizer and appropriate pre-filters and post-filters. If the penalization of the signal-uncorrelated error component is increased over that of the signal-correlated error component, then the pre-filter emphasizes the signal spectrum more, reducing the differential entropy rate of the pre-filtered signal. Accordingly, the signal-uncorrelated noise is reduced for a given rate.

Type: Grant

Filed: April 3, 2013

Date of Patent: July 25, 2017

Assignee: Google Inc.

Inventor: Willem Bastiaan Kleijn

Enhanced chroma extraction from an audio codec

Patent number: 9697840

Abstract: The present document relates to methods and systems for music information retrieval (MIR). In particular, the present document relates to methods and systems for extracting a chroma vector from an audio signal. A method (900) for determining a chroma vector (100) for a block of samples of an audio signal (301) is described. The method (900) comprises receiving (901) a corresponding block of frequency coefficients derived from the block of samples of the audio signal (301) from a core encoder (412) of a spectral band replication based audio encoder (410) adapted to generate an encoded bitstream (305) of the audio signal (301) from the block of frequency coefficients; and determining (904) the chroma vector (100) for the block of samples of the audio signal (301) based on the received block of frequency coefficients.

Type: Grant

Filed: November 28, 2012

Date of Patent: July 4, 2017

Assignee: Dolby International AB

Inventors: Arijit Biswas, Marco Fink, Michael Schug

System, method, and recording medium for determining and discerning items with multiple meanings

Patent number: 9672207

Abstract: A method, system, and non-transitory computer readable medium for determining and discerning items with multiple meanings in a sequence of items, including producing a distributed representation for each item of the sequence of items including a word vector and a context vector, partitioning the sequence of items into classes, for an item using a representative word vector of each class, calculating a cosine distance between the word vector of said item and the class representative vector, and producing a new sequence of items by modifying the distributed representation in the producing by replacing each occurrence of an item depending on the cosine distance calculated by the calculating.
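
The cosine-distance step can be illustrated with a minimal Python sketch. The abstract does not say how occurrences are replaced, so the relabeling rule below (append the nearest class name when the distance is within `max_distance`) is purely an assumption, as are the function names, the per-occurrence vectors, and the toy data.

```python
import math


def cosine_distance(a, b):
    """1 - cosine similarity; returns 1.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 1.0
    return 1.0 - dot / (norm_a * norm_b)


def relabel(occurrences, class_vectors, max_distance=0.5):
    """Replace each (item, vector) occurrence with a class-tagged item.

    occurrences   : list of (item, vector) pairs, one per occurrence (assumed input)
    class_vectors : mapping of class name to its representative vector
    An occurrence is tagged with its nearest class only if the cosine distance
    is at most `max_distance` (an assumed threshold); otherwise it is kept.
    """
    new_sequence = []
    for item, vector in occurrences:
        nearest = min(class_vectors,
                      key=lambda name: cosine_distance(vector, class_vectors[name]))
        if cosine_distance(vector, class_vectors[nearest]) <= max_distance:
            new_sequence.append(f"{item}_{nearest}")
        else:
            new_sequence.append(item)
    return new_sequence


# Demo with two hypothetical classes and two occurrences of the same item.
classes = {"finance": [1.0, 0.0], "river": [0.0, 1.0]}
sequence = [("bank", [1.0, 0.1]), ("bank", [0.1, 1.0])]
relabeled = relabel(sequence, classes)
```

Here the two occurrences of "bank" sit close to different class vectors, so they come out as two distinct items in the new sequence.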

Type: Grant

Filed: October 19, 2015

Date of Patent: June 6, 2017

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventor: Oded Shmueli

Information processing apparatus, information processing method, display control apparatus, and display control method

Patent number: 9666211

Abstract: There is provided an information processing apparatus including an information acquiring unit that acquires information to identify an editing point of content including a voice, on the basis of language analysis of the content, and an information output unit that outputs the acquired information.

Type: Grant

Filed: June 6, 2013

Date of Patent: May 30, 2017

Assignee: SONY CORPORATION

Inventor: Takashi Kuwabara
