Patents Examined by Qi Han

Apparatus and method for processing an audio signal for speech enhancement using a feature extraction

Patent number: 9064498

Abstract: An apparatus for processing an audio signal to obtain control information for a speech enhancement filter has a feature extractor for extracting at least one feature per frequency band of a plurality of frequency bands of a short-time spectral representation of a plurality of short-time spectral representations, where the at least one feature represents a spectral shape of the short-time spectral representation in the frequency band. The apparatus additionally has a feature combiner for combining the at least one feature for each frequency band using combination parameters to obtain the control information for the speech enhancement filter for a time portion of the audio signal. The feature combiner can use a neural network regression method, which is based on combination parameters determined in a training phase for the neural network.

Type: Grant

Filed: February 2, 2011

Date of Patent: June 23, 2015

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Christian Uhle, Oliver Hellmuth, Bernhard Grill, Falko Ridderbusch
Hybrid coded audio data streaming apparatus and method

Patent number: 9059727

Abstract: An audio coding system in which a plurality of quantization methods are selectable for application to components of a streamed audio signal to achieve a target frame size that is determined by comparing an achieved bit rate against a target bit rate. Based on the target frame size, the system calculates a bit allocation for signal components and compares the bit allocation to the dynamic range of the signal components. Depending on the outcome of the comparison, the system may select to quantize or not quantize a signal component. The system employs lossless coding techniques, but is capable of introducing lossy coding by quantization in order to meet the target bit rate.
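
The decision flow in the abstract above can be sketched as follows. This is a minimal illustration and not the patented method: the proportional frame-size update, the bit-width measure of dynamic range, and the right-shift quantizer are all assumptions, and every function name is hypothetical.

```python
def next_frame_size(current_size, achieved_rate, target_rate):
    """Scale the frame budget by the ratio of target to achieved bit rate (assumed rule)."""
    return max(1, current_size * target_rate // achieved_rate)


def dynamic_range_bits(samples):
    """Bits needed for the largest magnitude in the component, plus one sign bit."""
    peak = max(abs(s) for s in samples)
    return peak.bit_length() + 1


def plan_component(samples, allocated_bits):
    """Compare the bit allocation with the dynamic range and pick a coding mode.

    Returns ("lossless", 0) when the allocation covers the dynamic range,
    otherwise ("quantized", shift) where shift is the number of dropped low bits.
    """
    needed = dynamic_range_bits(samples)
    if allocated_bits >= needed:
        return ("lossless", 0)
    return ("quantized", needed - allocated_bits)


def quantize(samples, shift):
    """Lossy step (assumed): drop the lowest `shift` bits of each sample."""
    return [s >> shift for s in samples]


if __name__ == "__main__":
    component = [3, -7, 2]
    print(next_frame_size(1000, 320000, 256000))
    print(plan_component(component, 4))
    mode, shift = plan_component(component, 2)
    print(mode, quantize(component, shift))
```

A component is only quantized when its allocation falls short of its dynamic range, so the coder stays lossless whenever the bit budget allows.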

Type: Grant

Filed: May 3, 2012

Date of Patent: June 16, 2015

Assignee: Cambridge Silicon Radio Limited

Inventors: Neil Smyth, David Trainor
Disambiguating input based on context

Patent number: 9053706

Abstract: In one implementation, a computer-implemented method includes receiving, at a mobile computing device, ambiguous user input that indicates more than one of a plurality of commands; and determining a current context associated with the mobile computing device that indicates where the mobile computing device is currently located. The method can further include disambiguating the ambiguous user input by selecting a command from the plurality of commands based on the current context associated with the mobile computing device; and causing output associated with performance of the selected command to be provided by the mobile computing device.

Type: Grant

Filed: July 20, 2011

Date of Patent: June 9, 2015

Assignee: Google Inc.

Inventors: John Nicholas Jitkoff, Michael J. LeBeau
Speaker recognition from telephone calls

Patent number: 9043207

Abstract: The present invention relates to a method for speaker recognition, comprising the steps of obtaining and storing speaker information for at least one target speaker; obtaining a plurality of speech samples from a plurality of telephone calls from at least one unknown speaker; classifying the speech samples according to the at least one unknown speaker thereby providing speaker-dependent classes of speech samples; extracting speaker information for the speech samples of each of the speaker-dependent classes of speech samples; combining the extracted speaker information for each of the speaker-dependent classes of speech samples; comparing the combined extracted speaker information for each of the speaker-dependent classes of speech samples with the stored speaker information for the at least one target speaker to obtain at least one comparison result; and determining whether one of the at least one unknown speakers is identical with the at least one target speaker based on the at least one comparison result.

Type: Grant

Filed: November 12, 2009

Date of Patent: May 26, 2015

Assignee: Agnitio S.L.

Inventors: Johan Nikolaas Langehoven Brummer, Luis Buera Rodriguez, Marta Garcia Gomar
Methods and arrangements for loudness and sharpness compensation in audio codecs

Patent number: 9031835

Abstract: A method of improving perceived loudness and sharpness of a reconstructed speech signal delimited by a predetermined bandwidth comprises providing (S10) the speech signal and separating (S20) the provided signal into at least a first and a second signal portion. Subsequently, the first signal portion is adapted (S30) to emphasize at least a predetermined frequency or frequency interval within the first bandwidth portion. Finally, the second signal portion is reconstructed (S40) based on at least the first signal portion, and the adapted first signal portion and the reconstructed second signal portion are combined (S50) to provide a reconstructed speech signal with an overall improved perceived loudness and sharpness.

Type: Grant

Filed: June 29, 2010

Date of Patent: May 12, 2015

Assignee: Telefonaktiebolaget L M Ericsson (publ)

Inventors: Volodya Grancharov, Sigurdur Sverrisson
Spectral envelope coding of energy attack signal

Patent number: 9020815

Abstract: MDCT- or FFT-based audio coding algorithms often suffer from a problem, here called spectral pre-echoes, when coding an energy attack signal. This invention presents several ways to avoid the spectral pre-echoes that appear in the decoded signal segment before the energy attack point. The spectral envelope before the attack point can be improved by performing spectrum smoothing, by replacing the segment that has spectral pre-echoes, or by filtering the segment with a combined filter obtained from LPC analysis.

Type: Grant

Filed: May 7, 2013

Date of Patent: April 28, 2015

Assignee: Huawei Technologies Co., Ltd.

Inventor: Yang Gao
Method and apparatus for detecting a sentiment of short messages

Patent number: 9015033

Abstract: A method, computer readable medium and apparatus for detecting a sentiment for a short message are disclosed. For example, the method receives the short message, and obtains an abstraction of the short message. The method then determines the sentiment of the short message based upon the abstraction.

Type: Grant

Filed: October 26, 2010

Date of Patent: April 21, 2015

Assignee: AT&T Intellectual Property I, L.P.

Inventors: Luciano De Andrade Barbosa, Junlan Feng
Methods and devices for generating an action item summary

Patent number: 9015034

Abstract: Methods and devices for generating an action item summary are described. In one example embodiment, the present application describes a processor-implemented method. The method includes: receiving a request for creation of an action item, the action item comprising a record of a proposed future action; obtaining context information associated with the action item; storing the action item and context information; and generating a sentence describing the action item based on the context information associated with the action item.

Type: Grant

Filed: May 15, 2012

Date of Patent: April 21, 2015

Assignee: BlackBerry Limited

Inventors: Carl Magnus Borg, Lars Johan Anders Olsson, John Cross Neumann
Handheld electronic device with text disambiguation

Patent number: 9015028

Abstract: In view of the foregoing, an improved handheld electronic device includes a keypad in the form of a reduced QWERTY keyboard and is enabled with disambiguation software. As a user enters keystrokes, the device provides output in the form of a default output and a number of variants from which a user can choose. The output is based largely upon the frequency, i.e., the likelihood that a user intended a particular output, but various features of the device provide additional variants that are not based solely on frequency and rather are provided by various logic structures resident on the device. The device enables editing during text entry, and when initiating an activity session on a word such as during editing, the display outputs variants of the entire word being edited, rather than providing as variants only those parts of a word that are being edited. The device also provides a learning function that allows the disambiguation function to adapt to provide a customized experience for the user.

Type: Grant

Filed: February 19, 2010

Date of Patent: April 21, 2015

Assignee: BlackBerry Limited

Inventors: Vadim Fux, Michael G. Elizarov, Sergey V. Kolomiets
Device and method for a bandwidth extension of an audio signal

Patent number: 8996362

Abstract: For a bandwidth extension of an audio signal, in a signal spreader the audio signal is temporally spread by a spread factor greater than 1. The temporally spread audio signal is then supplied to a decimator to decimate the temporally spread version by a decimation factor matched to the spread factor. The band generated by this decimation operation is extracted and distorted, and finally combined with the audio signal to obtain a bandwidth extended audio signal. A phase vocoder in the filterbank implementation or transformation implementation may be used for signal spreading.

Type: Grant

Filed: January 20, 2009

Date of Patent: March 31, 2015

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Frederik Nagel, Sascha Disch, Max Neuendorf
Apparatus and method for determining a plurality of local center of gravity frequencies of a spectrum of an audio signal

Patent number: 8996363

Abstract: An apparatus for determining a plurality of local center-of-gravity frequencies of a spectrum of an audio signal includes an offset determiner, a frequency determiner and an iteration controller. The offset determiner determines an offset frequency for each iteration start frequency of a plurality of iteration start frequencies based on the spectrum of the audio signal, wherein a number of discrete sample values of the spectrum is larger than a number of iteration start frequencies. The frequency determiner determines a new plurality of iteration start frequencies by increasing or reducing each iteration start frequency of the plurality of iteration start frequencies by the corresponding determined offset frequency. The iteration controller provides the new plurality of iteration start frequencies to the offset determiner for further iteration or provides the plurality of local center-of-gravity frequencies, if a predefined termination condition is fulfilled.
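
The iteration described in the abstract above can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented apparatus: the offset is taken as the distance from each start position to the center of gravity of a fixed window around it, positions are spectral bin indices rather than frequencies in Hz, and the termination condition is a tolerance on the largest offset. The names `local_cogs`, `half_width`, `max_iter`, and `tol` are hypothetical.

```python
def local_cogs(spectrum, starts, half_width=2, max_iter=50, tol=1e-3):
    """Move each start position toward the local center of gravity of the spectrum."""
    positions = [float(s) for s in starts]
    if not positions:
        return []
    for _ in range(max_iter):
        # Offset determiner: one offset per current iteration start position.
        offsets = []
        for p in positions:
            centre = int(round(p))
            lo = max(0, centre - half_width)
            hi = min(len(spectrum) - 1, centre + half_width)
            bins = range(lo, hi + 1)
            weight = sum(spectrum[k] for k in bins)
            if weight == 0:
                offsets.append(0.0)  # no energy in the window: stay put
            else:
                cog = sum(k * spectrum[k] for k in bins) / weight
                offsets.append(cog - p)
        # Frequency determiner: shift every position by its offset.
        positions = [p + o for p, o in zip(positions, offsets)]
        # Iteration controller: stop once the positions have settled.
        if max(abs(o) for o in offsets) < tol:
            break
    return positions


if __name__ == "__main__":
    magnitudes = [0, 0, 1, 4, 1, 0, 0, 0, 1, 4, 1, 0, 0]
    print(local_cogs(magnitudes, [2, 10]))
```

With two start positions and thirteen spectral samples, the example satisfies the abstract's condition that the spectrum has more sample values than there are iteration start frequencies.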

Type: Grant

Filed: March 18, 2010

Date of Patent: March 31, 2015

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Sascha Disch, Harald Popp
Method and system for automatic domain adaptation in speech recognition applications

Patent number: 8996371

Abstract: A system and method for adapting a language model to a specific environment by receiving interactions captured in the specific environment, generating a collection of documents from documents retrieved from external resources, detecting in the collection of documents terms related to the environment that are not included in an initial language model, and adapting the initial language model to include the detected terms.

Type: Grant

Filed: March 29, 2012

Date of Patent: March 31, 2015

Assignee: Nice-Systems Ltd.

Inventors: Eyal Hurvitz, Ezra Daya, Oren Pereg, Moshe Wasserblat
Global speech user interface

Patent number: 8983838

Abstract: A global speech user interface (GSUI) comprises an input system to receive a user's spoken command, a feedback system along with a set of feedback overlays to give the user information on the progress of his spoken requests, a set of visual cues on the television screen to help the user understand what he can say, a help system, and a model for navigation among applications. The interface is extensible to make it easy to add new applications.

Type: Grant

Filed: September 17, 2013

Date of Patent: March 17, 2015

Assignee: Promptu Systems Corporation

Inventors: Adam Jordan, Scott Lynn Maddux, Tim Plowman, Victoria Stanbach, Jody Williams
Noise filler, noise filling parameter calculator, encoded audio signal representation, methods and computer program

Patent number: 8983851

Abstract: A noise filler for providing a noise-filled spectral representation of an audio signal on the basis of an input spectral representation of the audio signal has a spectral region identifier configured to identify spectral regions of the input spectral representation spaced from non-zero spectral regions of the input spectral representation by at least one intermediate spectral region, to obtain identified spectral regions, and a noise inserter configured to selectively introduce noise into the identified spectral regions to obtain the noise-filled spectral representation of the audio signal. A noise filling parameter calculator for providing a noise filling parameter on the basis of a quantized spectral representation of an audio signal has a spectral region identifier, as mentioned above, and a noise value calculator configured to selectively consider quantization errors of the identified spectral regions for a calculation of the noise filling parameter.

Type: Grant

Filed: January 11, 2011

Date of Patent: March 17, 2015

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Nikolaus Rettelbach, Bernhard Grill, Guillaume Fuchs, Stefan Geyersberger, Markus Multrus, Harald Popp, Juergen Herre, Stefan Wabnik, Gerald Schuller, Jens Hirschfeld
Method and an apparatus for processing an audio signal

Patent number: 8972270

Abstract: A method for processing an audio signal is disclosed. The method for processing an audio signal includes frequency-transforming an audio signal to generate a frequency-spectrum, deciding a weighting per band corresponding to energy per band using the frequency spectrum, receiving a masking threshold based on a psychoacoustic model, applying the weighting to the masking threshold to generate a modified masking threshold, and quantizing the audio signal using the modified masking threshold.

Type: Grant

Filed: May 25, 2009

Date of Patent: March 3, 2015

Assignees: LG Electronics Inc., Industry-Academic Cooperation Foundation, Yonsei University

Inventors: Hyen-O Oh, Chang Heon Lee, Jeongook Song, Yang Won Jung, Hong Goo Kang
Sound perception using frequency transposition by moving the envelope

Patent number: 8949113

Abstract: A method of operating an audio processing device to improve a user's perception of an input sound includes defining a critical frequency fcrit between a low frequency range and a high frequency range, receiving an input sound by the audio processing device, and analyzing the input sound in a number of frequency bands below and above the critical frequency. The method also includes defining a cut-off frequency fcut below the critical frequency fcrit, identifying a source frequency band above the cut-off frequency fcut, and extracting an envelope of the source band. Further, the method includes identifying a corresponding target band below the critical frequency fcrit, extracting a phase of the target band, and combining the envelope of the source band with the phase of the target band.
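
The final combining step in the abstract above can be sketched as follows. This is a minimal illustration that assumes each band is available as a list of complex values (for example, samples of an analytic band signal), so that the envelope is the magnitude and the phase is the complex argument. The band analysis and the choice of fcrit, fcut, source band, and target band are not shown, and the name `transpose_band` is hypothetical.

```python
import cmath


def transpose_band(source_band, target_band):
    """Combine the source band's envelope with the target band's phase, value by value."""
    if len(source_band) != len(target_band):
        raise ValueError("bands must have the same number of values")
    return [
        cmath.rect(abs(s), cmath.phase(t))  # magnitude from source, angle from target
        for s, t in zip(source_band, target_band)
    ]


if __name__ == "__main__":
    source = [2 + 0j, 0 + 3j]  # high-frequency band: supplies the envelope
    target = [0 + 1j, -1 + 0j]  # low-frequency band: supplies the phase
    for value in transpose_band(source, target):
        print(abs(value), cmath.phase(value))
```

Each output value keeps the target band's phase, so the result sits in the lower band while its level follows the source band.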

Type: Grant

Filed: April 6, 2011

Date of Patent: February 3, 2015

Assignee: Oticon A/S

Inventors: Marcus Holmberg, Thomas Kaulberg, Jan Mark de Haan
Method and apparatus for recording/replaying application execution with recorded voice recognition utterances

Patent number: 8949134

Abstract: A diagnostic tool for speech recognition applications is provided, which enables an administrator to collect multiple recorded speech sessions. The administrator can then search for various failure points common to one or more of the recorded sessions in order to get a list of all sessions that have the same failure points. The invention allows the administrator to play back the session or replay any portion of the session to see the flow of the application and the recorded utterances. The invention provides the administrator with information about how to maximize the efficiency of the application, which enables the administrator to edit the application to avoid future failure points.

Type: Grant

Filed: September 13, 2004

Date of Patent: February 3, 2015

Assignee: Avaya Inc.

Inventors: Jacob Levine, John Muller, Christopher Passaretti, Wu Chingfa
Display apparatus and voice conversion method thereof

Patent number: 8949123

Abstract: The voice conversion method of a display apparatus includes: in response to the receipt of a first video frame, detecting one or more entities from the first video frame; in response to the selection of one of the detected entities, storing the selected entity; in response to the selection of one of a plurality of previously-stored voice samples, storing the selected voice sample in connection with the selected entity; and in response to the receipt of a second video frame including the selected entity, changing a voice of the selected entity based on the selected voice sample and outputting the changed voice.

Type: Grant

Filed: April 11, 2012

Date of Patent: February 3, 2015

Assignee: Samsung Electronics Co., Ltd.

Inventors: Aditi Garg, Kasthuri Jayachand Yadlapalli
Language identification for documents containing multiple languages

Patent number: 8938384

Abstract: Multiple non-overlapping languages within a single document can be identified. In one embodiment, for each of a set of candidate languages, a set of non-overlapping languages is defined. The document is analyzed under the hypothesis that the whole document is in one language and under the hypothesis that part of the document is in one language while the rest is in a different, non-overlapping language. Language(s) of the document are identified based on comparing these competing hypotheses across a number of language pairs. In another embodiment, transitions between non-overlapping character sets are used to segment a document, and each segment is scored separately for a subset of candidate languages. Language(s) of the document are identified based on the segment scores.
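
The segmentation step of the second embodiment above can be sketched as follows. This is a minimal illustration, not the patented method: it assumes the first word of a character's Unicode name (such as LATIN or CYRILLIC) is an adequate stand-in for its character set, and it leaves out the per-segment language scoring. The function names are hypothetical.

```python
import unicodedata


def script_of(ch):
    """Rough character-set label: the first word of the Unicode name, or None for non-letters."""
    if not ch.isalpha():
        return None
    return unicodedata.name(ch, "UNKNOWN").split()[0]


def segment_by_script(text):
    """Split text wherever the character set of the letters changes."""
    segments = []
    current, buf = None, []
    for ch in text:
        label = script_of(ch)
        if label is not None and current is not None and label != current:
            # Transition between character sets: close the running segment.
            segments.append((current, "".join(buf).strip()))
            buf = []
        if label is not None:
            current = label
        buf.append(ch)  # spaces and punctuation stay with the running segment
    if buf and current is not None:
        segments.append((current, "".join(buf).strip()))
    return segments


if __name__ == "__main__":
    sample = "Hello world \u043f\u0440\u0438\u0432\u0435\u0442 \u043c\u0438\u0440"
    for label, segment in segment_by_script(sample):
        print(label, len(segment))
```

Each resulting segment could then be scored against only the candidate languages written in that character set, which is the narrowing the abstract describes.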

Type: Grant

Filed: July 16, 2012

Date of Patent: January 20, 2015

Assignee: Stratify, Inc.

Inventor: Sauraj Goswami
Automatic conversation system and conversation scenario editing device

Patent number: 8935163

Abstract: A conversation scenario editor generates/edits a conversation scenario for an automatic conversation system. The system includes a conversation device and a conversation server. The conversation device generates an input sentence through speech recognition of an utterance by a user. The conversation server determines the reply sentence based on the conversation scenario when a reply sentence to the input sentence is requested from the conversation device. The editor includes a language model generator for generating a language model to be used for the speech recognition based on the conversation scenario. According to the editor, a non-expert can generate the language model to provide an adequate conversation based on the speech recognition.

Type: Grant

Filed: August 17, 2009

Date of Patent: January 13, 2015

Assignee: Universal Entertainment Corporation

Inventors: Shengyang Huang, Hiroshi Katukura
