Pattern Matching Vocoders Patents (Class 704/221)

Noise suppression in a Mel-filtered spectral domain

Patent number: 8942975

Abstract: Techniques are described herein that suppress noise in a Mel-filtered spectral domain. For example, a window may be applied to a representation of a speech signal in a time domain. The windowed representation in the time domain may be converted to a subsequent representation of the speech signal in the Mel-filtered spectral domain. A noise suppression operation may be performed with respect to the subsequent representation to provide noise-suppressed Mel coefficients.

Type: Grant

Filed: March 22, 2011

Date of Patent: January 27, 2015

Assignee: Broadcom Corporation

Inventor: Jonas Borgstrom
Apparatus and method for encoding and reproduction of speech and audio signals

Patent number: 8930197

Abstract: A method comprising receiving at a user equipment encrypted content. The content is stored in said user equipment in an encrypted form. At least one key for decryption of said stored encrypted content is stored in the user equipment.

Type: Grant

Filed: May 9, 2008

Date of Patent: January 6, 2015

Assignee: Nokia Corporation

Inventors: Anssi Ramo, Mikko Tammi, Adriana Vasilache, Lasse Laaksonen
Coding method, coding apparatus, coding program, and recording medium therefor

Patent number: 8909521

Abstract: A lossless coding technique for near-logarithmic companded PCM that achieves high compression performance is provided. In coding, the coding method that produces the smaller code amount is selected between the prediction coding method, which performs linear prediction of samples in a frame and codes the amplitude of the prediction error, and the normalization coding method, which normalizes the amplitude of the samples in the frame and codes the normalized amplitude, and a selection code that indicates the selection result is output. The samples in the frame are coded according to the selected coding method to produce a compression code. In decoding, the compression code is decoded according to a decoding process corresponding to the coding method specified by the selection code.

Type: Grant

Filed: May 28, 2010

Date of Patent: December 9, 2014

Assignee: Nippon Telegraph and Telephone Corporation

Inventors: Takehiro Moriya, Noboru Harada, Yutaka Kamamoto
Source code adaption based on communication link quality and source coding delay

Patent number: 8898060

Abstract: Method and arrangement in a network node for adapting a property of source coding to the quality of a communication link in packet switched conversational services in a communication system. The method comprises obtaining (404) information related to the quality of a communication link. The method further comprises selecting (406) a source coding mode with an associated source coding delay, based on the obtained information and the associated source coding delay. The selected source coding mode is selected from a set of at least two source coding modes associated with different source coding delays, and is to be used when source coding voice data to be transmitted over the communication link.

Type: Grant

Filed: March 2, 2010

Date of Patent: November 25, 2014

Assignee: Telefonaktiebolaget L M Ericsson (publ)

Inventor: Stefan Bruhn
Systems, methods, and apparatus for voice activity detection

Patent number: 8898058

Abstract: Systems, methods, apparatus, and machine-readable media for voice activity detection in a single-channel or multichannel audio signal are disclosed.

Type: Grant

Filed: October 24, 2011

Date of Patent: November 25, 2014

Assignee: QUALCOMM Incorporated

Inventors: Jongwon Shin, Erik Visser, Ian Ernan Liu
Encoding device and encoding method, decoding device and decoding method, and program

Patent number: 8892429

Abstract: The present invention relates to an encoding device and an encoding method, a decoding device and a decoding method, and a program that reduce deterioration of sound quality due to encoding of audio signals. An envelope emphasis part (51) emphasizes an envelope (ENV). A noise shaping part (52) divides an emphasized envelope (D) formed by emphasis of the envelope (ENV) by a value larger than 1, and subtracts noise shaping (G) specified by information (NS) from a result of the division. A quantization part (14) sets a result of the subtraction as a quantization bit count (WL), and quantizes a normalized spectrum (S1) formed by normalization of a spectrum (S0) based on the quantization bit count (WL). A multiplexing part (53) multiplexes the information (NS), a quantized spectrum (QS) formed by quantization of the normalized spectrum (S1), and the envelope (ENV). The present invention can be applied to an encoding device encoding audio signals, for example.

Type: Grant

Filed: March 8, 2011

Date of Patent: November 18, 2014

Assignee: Sony Corporation

Inventors: Shiro Suzuki, Yuuki Matsumura, Yasuhiro Toguri, Yuuji Maeda
LOW COMPLEXITY REPETITION DETECTION IN MEDIA DATA

Publication number: 20140330556

Abstract: Low complexity detection of a time-wise position of a representative segment in media data is described. A subset of offset values is located in a set of offset values in media data using a first type of one or more types of features, which are extractable from (e.g., derivable from components of) the media data. The subset of offset values comprise values that are selected from the set of offset values based on one or more selection criteria. A set of candidate seed time points is identified based on the subset of offset values using a second type of the one or more types of features.

Type: Application

Filed: December 10, 2012

Publication date: November 6, 2014

Applicants: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Barbara Resch, Regunathan Radhakrishnan, Arijit Biswas, Jonas Engdegard
Audio signal bandwidth extension in CELP-based speech coder

Patent number: 8868432

Abstract: A method for decoding an audio signal having a bandwidth that extends beyond a bandwidth of a CELP excitation signal in an audio decoder including a CELP-based decoder element. The method includes obtaining a second excitation signal having an audio bandwidth extending beyond the audio bandwidth of the CELP excitation signal, obtaining a set of signals by filtering the second excitation signal with a set of bandpass filters, scaling the set of signals using a set of energy-based parameters, and obtaining a composite output signal by combining the scaled set of signals with a signal based on the audio signal decoded by the CELP-based decoder element.

Type: Grant

Filed: September 28, 2011

Date of Patent: October 21, 2014

Assignee: Motorola Mobility LLC

Inventors: Jonathan A. Gibbs, James P. Ashley, Udar Mittal
System enhancement of speech signals

Patent number: 8849656

Abstract: A system enhances speech by detecting a speaker's utterance through a first microphone positioned a first distance from a source of interference. A second microphone may detect the speaker's utterance at a different position. A monitoring device may estimate the power level of a first microphone signal. A synthesizer may synthesize part of the first microphone signal by processing the second microphone signal. The synthesis may occur when power level is below a predetermined level.

Type: Grant

Filed: October 14, 2011

Date of Patent: September 30, 2014

Assignee: Nuance Communications, Inc.

Inventors: Gerhard Schmidt, Mohamed Krini
Post-noise suppression processing to improve voice quality

Patent number: 8831937

Abstract: Provided are methods and systems for improving quality of speech communications. The method may be for improving quality of speech communications in a system having a speech encoder configured to encode a first audio signal using a first set of encoding parameters associated with a first noise suppressor. A method may involve receiving a second audio signal at a second noise suppressor which provides much higher quality noise suppression than the first noise suppressor. The second audio signal may be generated by a single microphone or a combination of multiple microphones. The second noise suppressor may suppress the noise in the second audio signal to generate a processed signal which may be sent to a speech encoder. A second set of encoding parameters may be provided by the second noise suppressor for use by the speech encoder when encoding the processed signal into corresponding data.

Type: Grant

Filed: November 14, 2011

Date of Patent: September 9, 2014

Assignee: Audience, Inc.

Inventors: Carlo Murgia, Scott Isabelle
Audio and speech processing with optimal bit-allocation for constant bit rate applications

Patent number: 8781822

Abstract: Methods and apparatus for audio and speech processing including generating a plurality of frames, each of the frames comprising a plurality of transform coefficients, and allocating bits to the transform coefficients in each of the frames such that at least two of the transform coefficients in the same frame have different bit allocations and the total number of the bits allocated to the transform coefficients in at least two of the frames is equal.

Type: Grant

Filed: February 2, 2010

Date of Patent: July 15, 2014

Assignee: QUALCOMM Incorporated

Inventors: Somdeb Majumdar, Amin Fazeldehkordi, Harinath Garudadri
Voiced interval command interpretation

Patent number: 8781821

Abstract: A method is disclosed for controlling a voice-activated device by interpreting a spoken command as a series of voiced and non-voiced intervals. A responsive action is then performed according to the number of voiced intervals in the command. The method is well-suited to applications having a small number of specific voice-activated response functions. Applications using the inventive method offer numerous advantages over traditional speech recognition systems including speaker universality, language independence, no training or calibration needed, implementation with simple microcontrollers, and extremely low cost. For time-critical applications such as pulsers and measurement devices, where fast reaction is crucial to catch a transient event, the method provides near-instantaneous command response, yet versatile voice control.

Type: Grant

Filed: April 30, 2012

Date of Patent: July 15, 2014

Assignee: Zanavox

Inventor: David Edward Newman
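
The interval-counting idea in the abstract above can be illustrated with a minimal sketch. It assumes the audio has already been reduced to one voiced/non-voiced flag per frame. The `ACTIONS` table and the function names are hypothetical and are not taken from the patent.

```python
from typing import Iterable, Optional

# Hypothetical mapping from the number of voiced intervals to a device action.
ACTIONS = {1: "start", 2: "stop", 3: "reset"}


def count_voiced_intervals(flags: Iterable[bool]) -> int:
    """Count runs of consecutive voiced frames (non-voiced -> voiced transitions)."""
    count = 0
    previous = False
    for voiced in flags:
        if voiced and not previous:
            count += 1  # a new voiced interval starts here
        previous = voiced
    return count


def interpret(flags: Iterable[bool]) -> Optional[str]:
    """Return the action for the command, or None if the count is unmapped."""
    return ACTIONS.get(count_voiced_intervals(flags))
```

Because only the number of voiced intervals matters in this sketch, the result does not depend on the speaker or on the language of the command, which is consistent with the advantages the abstract lists.
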
Differential dynamic content delivery with text display in dependence upon simultaneous speech

Patent number: 8781830

Abstract: Differential dynamic content delivery including providing a session document for a presentation, wherein the session document includes a session grammar and a session structured document; selecting from the session structured document a classified structural element in dependence upon user classifications of a user participant in the presentation; presenting the selected structural element to the user; streaming presentation speech to the user including individual speech from at least one user participating in the presentation; converting the presentation speech to text; detecting whether the presentation speech contains simultaneous individual speech from two or more users; and displaying the text if the presentation speech contains simultaneous individual speech from two or more users.

Type: Grant

Filed: July 2, 2013

Date of Patent: July 15, 2014

Assignee: Nuance Communications, Inc.

Inventors: William K. Bodin, Michael J. Burkhart, Daniel G. Eisenhauer, Thomas J. Watson, Daniel M. Schumacher
METHOD AND DEVICE FOR IDENTIFYING AND OBTAINING AMBE ENCODING AND DECODING RATE INFORMATION IN SDP

Publication number: 20140195224

Abstract: The present invention provides a method and a device for identifying and obtaining AMBE encoding and decoding rate information in an SDP. The method for identifying the AMBE encoding and decoding rate information in the SDP includes: setting an SDP, which specifically includes: in an m attribute line of the SDP, using a payload type PT value to describe AMBE encoding and decoding, and in an fmtp attribute of an a attribute line of the SDP, identifying, in a format field, a PT value same as that of the m attribute line, and identifying AMBE encoding and decoding rate information in a format specific parameter field; and sending the set SDP to a peer end. The method and the device for identifying and obtaining the AMBE encoding and decoding rate information in the SDP provided by the present invention can implement automatic rate switching in a bearer plane.

Type: Application

Filed: February 28, 2014

Publication date: July 10, 2014

Applicant: Huawei Technologies Co., Ltd.

Inventor: Libin ZHANG
System and method for providing particularized audible alerts

Patent number: 8767953

Abstract: A system and method of generating at least two distinctive auditory alerts upon receiving a transmission or telephone call at a device is described. Data indicative of a first plurality of sounds corresponding to a user of a device configured to receive the transmission or telephone call is accessed, such as from a memory. The first plurality of sounds is played at the device so as to identify a received transmission or telephone call being directed to the user. A telephone number, subscriber name or identifier associated with a transmitting or calling party of the transmission or telephone call is accessed. Data indicative of a second plurality of sounds designating the transmitting or calling party based on the subscriber name, telephone number or identifier is retrieved, such as from a data structure, and the second plurality of sounds is played at the device so as to identify the transmitting or calling party.

Type: Grant

Filed: January 7, 2011

Date of Patent: July 1, 2014

Assignee: Somatek

Inventor: Fitchmun I. Mark
System and method of speech compression using an inter frame parameter correlation

Patent number: 8762136

Abstract: The disclosure provides a speech encoder, decoder, speech processor and methods of encoding and decoding speech. In one embodiment, the speech encoder includes: (1) a speech frame generator configured to form a speech frame from an input speech signal, the speech frame having a length of multiple samples, (2) a speech frame processor configured to determine if the speech frame is a subsequent voiced frame of a group of consecutive voiced frames and, based thereon, perform speech analysis of the subsequent voiced frame; and (3) a speech frame coder configured to perform, if the speech frame is a subsequent voiced frame, differential coding of speech parameters of the subsequent voiced frame with respect to previous speech parameters of the previous voiced frame of the consecutive voiced frames.

Type: Grant

Filed: May 3, 2011

Date of Patent: June 24, 2014

Assignee: LSI Corporation

Inventors: Sooraj Kovoor Chathoth, Kumar U. Phani, Ganesh Guddanti
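
As a rough illustration of the differential coding described in the abstract above, the sketch below codes one integer parameter per frame. It assumes a voiced/unvoiced flag is already available for each frame; the `"abs"`/`"delta"` tags and the function names are hypothetical.

```python
from typing import List, Tuple

Frame = Tuple[bool, int]   # (is_voiced, parameter value), e.g. a pitch lag
Coded = Tuple[str, int]    # ("abs", value) or ("delta", difference)


def encode(frames: List[Frame]) -> List[Coded]:
    """Code subsequent voiced frames as differences from the previous voiced frame."""
    coded = []
    prev = None  # parameter of the previous frame if it was voiced, else None
    for voiced, value in frames:
        if voiced and prev is not None:
            coded.append(("delta", value - prev))
        else:
            coded.append(("abs", value))
        prev = value if voiced else None
    return coded


def decode(coded: List[Coded]) -> List[int]:
    """Rebuild parameter values; a delta is always relative to the previous frame."""
    values = []
    prev = 0
    for kind, number in coded:
        value = prev + number if kind == "delta" else number
        values.append(value)
        prev = value
    return values
```

Parameters of consecutive voiced frames tend to change slowly, so the differences are usually small numbers that can be represented with fewer bits than the absolute values.
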
Speech signal processing device

Patent number: 8738367

Abstract: A speech signal processing device is equipped with a power acquisition unit, a probability distribution acquisition unit, and a correspondence degree determination unit. The power acquisition unit accepts an inputted speech signal and, based on the accepted speech signal, acquires power representing the intensity of a speech sound represented by the speech signal. The probability distribution acquisition unit acquires a probability distribution using the intensity of the power acquired by the power acquisition unit as a random variable. The correspondence degree determination unit determines whether a correspondence degree representing a degree that power acquired by the power acquisition unit in a case that a predetermined reference speech signal is inputted into the power acquisition unit corresponds with predetermined reference power is higher than a predetermined reference correspondence degree, based on the probability distribution acquired by the probability distribution acquisition unit.

Type: Grant

Filed: February 18, 2010

Date of Patent: May 27, 2014

Assignee: NEC Corporation

Inventor: Tadashi Emori
Apparatus and method for encoding at least one parameter associated with a signal source

Patent number: 8725500

Abstract: Apparatus (119) for encoding at least one parameter associated with a signal source for transmission over k frames to a decoder comprises a processor (119) which is configured in operation to assign a predetermined bit pattern to n bits associated with the at least one parameter of a first frame of k frames and set the n bits associated with the at least one parameter of each of k−1 subsequent frames to values, such that the values of the n bits of the k−1 subsequent frames represent the at least one parameter. The predetermined bit pattern indicates a start of the at least one parameter.

Type: Grant

Filed: November 19, 2008

Date of Patent: May 13, 2014

Assignee: Motorola Mobility LLC

Inventors: Jonathan A Gibbs, James P Ashley, Holly L Francois, Udar Mittal
Audio encoder for encoding an audio signal having an impulse-like portion and stationary portion, encoding methods, decoder, decoding method, and encoding audio signal

Patent number: 8706480

Abstract: An audio encoder for encoding an audio signal includes an impulse extractor for extracting an impulse-like portion from the audio signal. This impulse-like portion is encoded and forwarded to an output interface. Furthermore, the audio encoder includes a signal encoder which encodes a residual signal derived from the original audio signal so that the impulse-like portion is reduced or eliminated in the residual audio signal. The output interface forwards both, the encoded signals, i.e., the encoded impulse signal and the encoded residual signal for transmission or storage. On the decoder-side, both signal portions are separately decoded and then combined to obtain a decoded audio signal.

Type: Grant

Filed: June 5, 2008

Date of Patent: April 22, 2014

Assignee: Fraunhofer-Gesellschaft zur Foerderung der Angewandten Forschung E.V.

Inventors: Juergen Herre, Ralf Geiger, Stefan Bayer, Guillaume Fuchs, Ulrich Kraemer, Nikolaus Rettelbach, Bernhard Grill
Methods and apparatus for formant-based voice synthesis

Patent number: 8706488

Abstract: In one aspect, a method of processing a voice signal to extract information to facilitate training a speech synthesis model is provided. The method comprises acts of detecting a plurality of candidate features in the voice signal, performing at least one comparison between one or more combinations of the plurality of candidate features and the voice signal, and selecting a set of features from the plurality of candidate features based, at least in part, on the at least one comparison. In another aspect, the method is performed by executing a program encoded on a computer readable medium. In another aspect, a speech synthesis model is provided by, at least in part, performing the method.

Type: Grant

Filed: February 27, 2013

Date of Patent: April 22, 2014

Assignee: Nuance Communications, Inc.

Inventors: Michael D. Edgington, Laurence Gillick, Jordan R. Cohen
Dialed digits based vocoder assignment

Patent number: 8676575

Abstract: A system and method for providing voice communications with desired characteristics based upon the intended recipient of a voice communication. An apparatus includes a list of dial strings associated with parties having desired voice communication characteristics. A dial string entered by a user and associated with an intended recipient is compared to a list of preferred dial strings to determine the characteristics of an encoded voice signal to be sent to the recipient. The apparatus can include a vocoder having different bit rate modes and a bit rate mode is selected based upon the dial string entered by a user. Dial strings can be stored at the device or on a network. The apparatus can include a mode selector to select a desired vocoder mode to generate an encoded voice signal.

Type: Grant

Filed: November 25, 2009

Date of Patent: March 18, 2014

Assignee: AT&T Mobility II LLC

Inventors: Jun Shen, Jack Denenberg, Alan MacDonald
Non-causal postfilter

Patent number: 8620645

Abstract: A decoder arrangement comprising a receiver input for parameters of frame-based coded signals and a decoder arranged to provide frames of decoded audio signals based on the parameters. The receiver input and/or the decoder is arranged to establish a time difference between the occasion when parameters of a first frame are available at the receiver input and the occasion when a decoded audio signal of the first frame is available at an output of the decoder, which time difference corresponds to at least one frame. A postfilter is connected to the output of the decoder and to the receiver input. The postfilter is arranged to provide a filtering of the frames of decoded audio signals into an output signal in response to parameters of a respective subsequent frame.

Type: Grant

Filed: December 14, 2007

Date of Patent: December 31, 2013

Assignee: Telefonaktiebolaget L M Ericsson (publ)

Inventor: Stefan Bruhn
System and method for tracking sound pitch across an audio signal using harmonic envelope

Patent number: 8620646

Abstract: A system and method may be configured to analyze audio information derived from an audio signal. The system and method may track sound pitch across the audio signal. The tracking of pitch across the audio signal may take into account change in pitch by determining at individual time sample windows in the signal duration an estimated pitch and a representation of harmonic envelope at the estimated pitch. The estimated pitch and the representation of harmonic envelope may then be implemented to determine an estimated pitch for another time sample window in the signal duration with an enhanced accuracy and/or precision.

Type: Grant

Filed: August 8, 2011

Date of Patent: December 31, 2013

Assignee: The Intellisis Corporation

Inventors: David C. Bradley, Rodney Gateau, Daniel S. Goldin, Robert N. Hilton, Nicholas K. Fisher
Audio coder/decoder with predictive coding of synthesis filter and critically-sampled time aliasing of prediction domain frames

Patent number: 8595019

Abstract: An audio encoder adapted for encoding frames of a sampled audio signal to obtain encoded frames, wherein a frame includes a number of time domain audio samples. The audio encoder includes a predictive coding analysis stage for determining information on coefficients of a synthesis filter and a prediction domain frame based on a frame of audio samples. The audio encoder further includes a time-aliasing introducing transformer for transforming overlapping prediction domain frames to the frequency domain to obtain prediction domain frame spectra, wherein the time-aliasing introducing transformer is adapted for transforming the overlapping prediction domain frames in a critically-sampled way. Moreover, the audio encoder includes a redundancy reducing encoder for encoding the prediction domain frame spectra to obtain the encoded frames based on the coefficients and the encoded prediction domain frame spectra.

Type: Grant

Filed: January 11, 2011

Date of Patent: November 26, 2013

Assignees: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V., Voiceage Corporation

Inventors: Ralf Geiger, Bernhard Grill, Bruno Bessette, Philippe Gournay, Guillaume Fuchs, Markus Multrus, Max Neuendorf, Gerald Schuller
Audio encoding method and device

Patent number: 8595017

Abstract: Audio encoding method and device comprising the transmission, in addition to the data representing a frequency-limited signal, of information relating to a temporal filter that can be applied to the entire broadened signal, both in its transmitted low-frequency part and in its reconstituted high-frequency part. The application of this filter allows the reshaping of the reconstituted high-frequency part and the correction of compression artifacts present in the transmitted low-frequency part. In this way, the application of the temporal filter, simple and inexpensive, to all or part of the reconstituted signal, makes it possible to obtain a signal of good perceived quality.

Type: Grant

Filed: December 27, 2007

Date of Patent: November 26, 2013

Assignee: Mobiclip

Inventor: Alexandre Delattre
Automatic polarity adaptation for ambient noise cancellation

Patent number: 8571226

Abstract: A sound reproducing device has a loudspeaker arranged to produce sound from an audio signal provided by an audio signal source. A microphone is positioned to pick up ambient noise and generate a microphone signal which comprises the noise. An ambient noise cancellation (ANC) system receives the microphone signal from the microphone and generates anti-noise corresponding to the ambient noise in the microphone signal. An automatic polarity adaptation (AAP) system monitors the ANC system and, when a decision criterion is fulfilled, causes a switch in polarity for the generated anti-noise.

Type: Grant

Filed: December 10, 2010

Date of Patent: October 29, 2013

Assignees: Sony Corporation, Sony Mobile Communications AB

Inventor: Peter Isberg
Postfilter for layered codecs

Patent number: 8571852

Abstract: A scalable decoder device (50) for signals representing audio comprises a primary decoder (21) connected to an input (40). The primary decoder (21) is arranged to provide a primary decoded signal (23) based on received parameters (4). A primary postfilter (31) is connected to the primary decoder (23) to provide a primary postfiltered signal (32). A secondary enhancement decoder (45) is connected to the input (40) and arranged to provide a secondary decoded enhancement signal (44). The device further comprises a combiner arrangement (55), arranged for combining the primary postfiltered signal (32) and a signal (53) based on the secondary decoded enhancement signal (44) into an output signal (6) to be provided at an output (6). The combining is made with an adaptable strength relation between contributions from the two signals. A method for decoding coded signals representing audio operates in analogy with the scalable decoder device (50).

Type: Grant

Filed: December 14, 2007

Date of Patent: October 29, 2013

Assignee: Telefonaktiebolaget L M Ericsson (publ)

Inventor: Stefan Bruhn
Voice-activity detection based on far-end and near-end statistics

Patent number: 8565127

Abstract: Methods and apparatus of managing a communication system, wherein a decision regarding a level of activity at a first end is made based at least in part on the level of activity at the second end. In one embodiment, the energy level of a first-end audio signal is measured. The first end is declared voice-active if the first-end energy level is greater than or equal to a first threshold value. The first end is declared voice-inactive if the first-end energy level is less than the first threshold value. To determine the value of the first threshold value, the energy level of a second-end audio signal is measured. If the second-end energy level is greater than or equal to a second threshold value, the second end is declared voice-active, in which case the first threshold is maintained at a relatively high level. If the second-end energy level is less than the second threshold value, the second end is declared voice-inactive, in which case the first threshold is maintained at a relatively lower level.

Type: Grant

Filed: November 16, 2010

Date of Patent: October 22, 2013

Assignee: Broadcom Corporation

Inventor: Wilfred LeBlanc
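
The threshold logic of the embodiment described in the abstract above can be sketched as follows. This is a minimal illustration that assumes the energy levels have already been measured; the function and parameter names are hypothetical.

```python
def is_voice_active(energy: float, threshold: float) -> bool:
    """An end is voice-active when its energy is greater than or equal to the threshold."""
    return energy >= threshold


def first_end_active(
    first_energy: float,
    second_energy: float,
    second_threshold: float,
    high_threshold: float,
    low_threshold: float,
) -> bool:
    """Decide first-end activity using a threshold that depends on second-end activity."""
    second_active = is_voice_active(second_energy, second_threshold)
    # While the second end is talking, the first end needs more energy
    # to be declared voice-active.
    first_threshold = high_threshold if second_active else low_threshold
    return is_voice_active(first_energy, first_threshold)
```

For example, with `high_threshold=6.0` and `low_threshold=3.0`, a first-end energy of 5.0 counts as voice-active only while the second end is inactive.
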
Encoding device, decoding device, and method thereof

Patent number: 8560328

Abstract: A decoding device is capable of flexibly calculating high-band spectrum data with a high accuracy in accordance with an encoding band selected by an upper-node layer of the encoding side. In this device: a first layer decoder decodes first layer encoded information to generate a first layer decoded signal; a second layer decoder decodes second layer encoded information to generate a second layer decoded signal; a spectrum decoder performs a band extension process by using the second layer decoded signal and the first layer decoded signal up-sampled in an up-sampler so as to generate an all-band decoded signal; and a switch outputs the first layer decoded signal or the all-band decoded signal according to the control information generated in a controller.

Type: Grant

Filed: December 14, 2007

Date of Patent: October 15, 2013

Assignee: Panasonic Corporation

Inventors: Tomofumi Yamanashi, Masahiro Oshikiri
Method and system for inserting advertisements in unified messaging solutions

Patent number: 8539362

Abstract: A method and an apparatus for inserting an included message into an e-mail message, wherein the e-mail message is transferred through a unified messaging solution have been provided. In one embodiment, the unified messaging solution detects transmission of a voice mail message as the e-mail attachment. The voice mail message is received by a system that facilitates the transfer of the e-mail message. The system associates the included message with the voice mail message. The included message is inserted into the e-mail message. The system sends the e-mail message along with the included message and the attached voice mail message to an intended user. In a preferred embodiment, the included message is an advertising message.

Type: Grant

Filed: December 16, 2011

Date of Patent: September 17, 2013

Assignee: Cisco Technology, Inc.

Inventors: Labhesh Patel, Shmuel Shaffer, Alan Gatzke, Mukul Jain
Method of Accessing a Dial-Up Service

Publication number: 20130238323

Abstract: A method of accessing a dial-up service is disclosed. An example method of providing access to a service includes receiving a first speech signal from a user to form a first utterance; recognizing the first utterance using speaker independent speaker recognition; requesting the user to enter a personal identification number; and when the personal identification number is valid, receiving a second speech signal to form a second utterance and providing access to the service.

Type: Application

Filed: April 30, 2013

Publication date: September 12, 2013

Applicant: AT&T Intellectual Property I, L.P.

Inventor: Robert Wesley Bossemeyer, Jr.
Handoffs between different voice encoder systems

Patent number: 8521520

Abstract: Provided are methods and systems of managing handoffs in a wireless communication system having different types of vocoders. Some embodiments include translating state memory of a first vocoder to a second vocoder using a state memory transcoder. The state memory may be delayed to align differences in algorithmic delays between the first vocoder and the second vocoder. In one embodiment, a speech signal may be decoded from the first vocoder, delayed, and encoded to the second vocoder. Furthermore, for a period of time during and/or adjacent to the handoff, the first vocoder may output with decreasing amplitude while the second vocoder outputs with increasing amplitude. Such techniques may be used alone or in combination.

Type: Grant

Filed: February 3, 2010

Date of Patent: August 27, 2013

Assignee: General Electric Company

Inventors: Richard Louis Zinser, Michael James Hartman, John Erik Hershey
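
The last technique in the abstract above, fading the first vocoder out while the second fades in, can be illustrated with a simple crossfade. The linear ramp is an assumption made for this sketch, since the abstract only states that one amplitude decreases while the other increases. The function name is hypothetical.

```python
from typing import List


def crossfade(first_output: List[float], second_output: List[float]) -> List[float]:
    """Blend two time-aligned sample blocks with a linear ramp (assumed shape)."""
    n = len(first_output)
    if n != len(second_output):
        raise ValueError("both blocks must have the same length")
    mixed = []
    for i in range(n):
        # Weight of the second vocoder rises from 1/(n+1) to n/(n+1).
        w = (i + 1) / (n + 1)
        mixed.append((1.0 - w) * first_output[i] + w * second_output[i])
    return mixed
```

The two blocks must already be time-aligned, which is why the abstract also mentions delaying signals to compensate for the different algorithmic delays of the two vocoders.
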
Technique for encoding/decoding of codebook indices for quantized MDCT spectrum in scalable speech and audio codecs

Patent number: 8515767

Abstract: Codebook indices for a scalable speech and audio codec may be efficiently encoded based on anticipated probability distributions for such codebook indices. A residual signal from a Code Excited Linear Prediction (CELP)-based encoding layer may be obtained, where the residual signal is a difference between an original audio signal and a reconstructed version of the original audio signal. The residual signal may be transformed at a Discrete Cosine Transform (DCT)-type transform layer to obtain a corresponding transform spectrum. The transform spectrum is divided into a plurality of spectral bands, where each spectral band has a plurality of spectral lines. A plurality of different codebooks are then selected for encoding the spectral bands, where each codebook is associated with a codebook index. A plurality of codebook indices associated with the selected codebooks are then encoded together to obtain a descriptor code that more compactly represents the codebook indices.

Type: Grant

Filed: November 3, 2008

Date of Patent: August 20, 2013

Assignee: QUALCOMM Incorporated

Inventor: Yuriy Reznik
Encoding method and decoding method, and devices, program and recording medium for the same

Patent number: 8502708

Abstract: Information that includes first information identifying integer quotients obtained by divisions using prediction residuals or integers not smaller than 0 that increase monotonically with increases in the amplitude of the prediction residuals, as dividends, and a separation parameter decided for a time segment corresponding to the prediction residuals or a mapped integer value of the separation parameter, as a modulus, and second information identifying the remainders obtained when the dividends are divided by the modulus is generated as a code corresponding to the prediction residuals, and each piece of side information that includes the separation parameter is subjected to variable length coding.

Type: Grant

Filed: December 8, 2009

Date of Patent: August 6, 2013

Assignee: Nippon Telegraph and Telephone Corporation

Inventors: Takehiro Moriya, Noboru Harada, Yutaka Kamamoto
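
The quotient/remainder split in the abstract above resembles Golomb-Rice style coding, and a small sketch may make the roles of the two pieces of information clearer. The zigzag mapping of signed residuals, the choice of `2**s` as the modulus derived from a separation parameter `s`, and all function names are assumptions for illustration. The variable length coding of the side information is not shown.

```python
def map_residual(e):
    """Fold a signed residual into a non-negative integer that grows with |e|.

    Assumption: a zigzag mapping (0, -1, 1, -2, ... -> 0, 1, 2, 3, ...).
    """
    return 2 * e if e >= 0 else -2 * e - 1


def split_residual(e, s):
    """Return (quotient, remainder) of the mapped residual for modulus 2**s."""
    u = map_residual(e)
    modulus = 1 << s  # assumed mapping of the separation parameter
    return u // modulus, u % modulus


def merge_residual(q, r, s):
    """Invert split_residual: rebuild the signed residual from (q, r)."""
    u = q * (1 << s) + r
    return u // 2 if u % 2 == 0 else -(u + 1) // 2
```

For example, with `s = 2` the residual `5` maps to `10`, which splits into quotient `2` and remainder `2`.
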
Differential dynamic content delivery with text display in dependence upon simultaneous speech

Patent number: 8504364

Abstract: Differential dynamic content delivery including providing a session document for a presentation, wherein the session document includes a session grammar and a session structured document; selecting from the session structured document a classified structural element in dependence upon user classifications of a user participant in the presentation; presenting the selected structural element to the user; streaming presentation speech to the user including individual speech from at least one user participating in the presentation; converting the presentation speech to text; detecting whether the presentation speech contains simultaneous individual speech from two or more users; and displaying the text if the presentation speech contains simultaneous individual speech from two or more users.

Type: Grant

Filed: September 14, 2012

Date of Patent: August 6, 2013

Assignee: Nuance Communications, Inc.

Inventors: William K. Bodin, Michael John Burkhart, Daniel G. Eisenhauer, Thomas James Watson, Daniel Mark Schumacher
Method and apparatus for transmitting speech data to a remote device in a distributed speech recognition system

Patent number: 8494849

Abstract: A method of transmitting speech data to a remote device in a distributed speech recognition system includes the steps of: dividing an input speech signal into frames; calculating, for each frame, a voice activity value representative of the presence of speech activity in the frame; grouping the frames into multiframes, each multiframe including a predetermined number of frames; calculating, for each multiframe, a voice activity marker representative of the number of frames in the multiframe representing speech activity; and selectively transmitting, on the basis of the voice activity marker associated with each multiframe, the multiframes to the remote device.

Type: Grant

Filed: June 20, 2005

Date of Patent: July 23, 2013

Assignee: Telecom Italia S.p.A.

Inventors: Ivano Salvatore Collotta, Donato Ettorre, Maurizio Fodrini, Pierluigi Gallo, Roberto Spagnolo
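
The grouping and selection steps listed in the abstract above can be sketched as follows. The per-frame voice activity values are taken as given booleans, and the rule "transmit when at least `min_active` frames are active" is an assumed selection criterion; the abstract only says the decision is based on the marker.

```python
def select_multiframes(vad_flags, frames_per_multiframe, min_active):
    """Return (multiframe_index, marker) for each multiframe chosen for sending.

    vad_flags: one truthy/falsy voice activity value per frame.
    marker: number of active frames in the multiframe.
    Assumption: a multiframe is sent when marker >= min_active.
    """
    selected = []
    for start in range(0, len(vad_flags), frames_per_multiframe):
        group = vad_flags[start:start + frames_per_multiframe]
        marker = sum(1 for flag in group if flag)
        if marker >= min_active:
            selected.append((start // frames_per_multiframe, marker))
    return selected
```
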
System and method for modeling speech spectra

Patent number: 8489392

Abstract: A system and method for modeling speech in such a way that both voiced and unvoiced contributions can co-exist at certain frequencies. In various embodiments, three spectral bands (or bands of up to three different types) are used. In one embodiment, the lowest band or group of bands is completely voiced, the middle band or group of bands contains both voiced and unvoiced contributions, and the highest band or group of bands is completely unvoiced. The embodiments of the present invention may be used for speech coding and other speech processing applications.

Type: Grant

Filed: September 13, 2007

Date of Patent: July 16, 2013

Assignee: Nokia Corporation

Inventors: Jani Nurminen, Sakari Himanen
Determining an upperband signal from a narrowband signal

Patent number: 8484020

Abstract: A method for determining an upperband speech signal from a narrowband speech signal is disclosed. A list of narrowband line spectral frequencies (LSFs) is determined from the narrowband speech signal. A first pair of adjacent narrowband LSFs that have a lower difference between them than every other pair of adjacent narrowband LSFs in the list is determined. A first feature that is a mean of the first pair of adjacent narrowband LSFs is determined. Upperband LSFs are determined based on at least the first feature using codebook mapping.

Type: Grant

Filed: October 22, 2010

Date of Patent: July 9, 2013

Assignee: QUALCOMM Incorporated

Inventors: Venkatesh Krishnan, Daniel J. Sinder, Ananthapadmanabhan Arasanipalai Kandhadai
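
The first feature described in the abstract above (the mean of the closest pair of adjacent narrowband LSFs) is simple to compute. The sketch below assumes the LSF list is already sorted in ascending order and uses a hypothetical function name; the codebook mapping that produces the upperband LSFs is not shown.

```python
def closest_pair_mean(lsfs):
    """Return the mean of the adjacent LSF pair with the smallest spacing.

    Assumption: lsfs is sorted ascending and holds at least two values.
    """
    if len(lsfs) < 2:
        raise ValueError("need at least two LSFs")
    # Index of the left element of the closest adjacent pair.
    best = min(range(len(lsfs) - 1), key=lambda i: lsfs[i + 1] - lsfs[i])
    return (lsfs[best] + lsfs[best + 1]) / 2.0
```
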
Sending speech packets with same and complementary subsets of fixed codebook pulses

Patent number: 8463601

Abstract: Packets of real-time information are sent with a source rate greater than zero kilobits per second, and a time or path or combined time/path diversity rate initially being zero kilobits per second. This results in a quality of service QoS, optionally measured at the sender or the receiver. When the QoS is on an unacceptable side of a threshold of acceptability, the sender sends diversity packets at an increased rate. Increasing the diversity rate while either reducing or maintaining the overall transmission rate is new. CELP-based multiple-description data partitioning sends the base or important information plus a subset of fixed excitation in one packet and sends the base or important information plus the complementary subset of fixed excitation in another packet. Reconstruction produces acceptable quality when only one of the two packets is received and better quality when both packets are received. Reconstruction provides for single and multiple lost packets.

Type: Grant

Filed: June 11, 2012

Date of Patent: June 11, 2013

Assignee: Texas Instruments Incorporated

Inventors: Krishnasamy Anandakumar, Vishu R. Viswanathan, Alan V. McCree
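
The multiple-description idea in the abstract above, where each packet carries a complementary subset of the fixed excitation, can be illustrated with a toy partition. Representing pulses as `(position, sign)` pairs, splitting them by alternating index, and the function names are all assumptions; the base information that both packets share is omitted.

```python
def split_pulses(pulses):
    """Split pulses into two complementary subsets.

    Assumption: even-indexed pulses go to packet A, odd-indexed to packet B.
    """
    return pulses[0::2], pulses[1::2]


def build_excitation(length, *subsets):
    """Build a fixed-codebook excitation vector from any received subsets."""
    excitation = [0] * length
    for subset in subsets:
        for position, sign in subset:
            excitation[position] += sign
    return excitation
```

With both subsets the full excitation is recovered; with only one, the decoder still has a sparser approximation to work with.
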
Remote audio surveillance for detection and analysis of wildlife sounds

Patent number: 8457962

Abstract: This invention provides remote audio surveillance by recording audio data via three microphones and storage on a removable digital mass storage device, operating on battery power. The housing is of a weather resistant design to withstand outdoor conditions. Recording can be done in person or recording times can be defined so that the unit will only ‘listen’ during the desired times of the day, on a day to day basis. The user does not have to be in the vicinity but simply programs the record time(s) and leaves the device in the woods. The device also has play back capabilities for any recorded audio data and can interface with personal computers via the removable digital mass storage device. In addition to the audio collection and playback capabilities, PC software will be provided with the device which will analyze the data and provide direction of sound (based upon relative amplitude of the 3 microphones) and distance of sound (based on absolute and relative recorded amplitudes).

Type: Grant

Filed: August 4, 2006

Date of Patent: June 4, 2013

Inventor: Lawrence P. Jones
Encoding device, decoding device, and method thereof

Patent number: 8452588

Abstract: It is possible to improve quality of a decoding signal in a band spread for estimating a high band from a low band of a decoding signal. A first layer encoder encodes a lower band portion below a predetermined frequency of an input signal so as to generate first layer encoded information. A first layer decoder decodes the first layer encoded information so as to generate a first layer demodulated signal. A second layer encoder divides a high band portion of an input signal, higher than a predetermined frequency, into a plurality of sub-bands and estimates each of the sub-bands from the input signal or the first layer decoded signal by using the estimation result of the sub-band adjacent to the lower band side so as to generate second encoded information including the estimation results of the sub-bands.

Type: Grant

Filed: March 13, 2009

Date of Patent: May 28, 2013

Assignee: Panasonic Corporation

Inventors: Tomofumi Yamanashi, Masahiro Oshikiri
Methods and apparatus for formant-based voice systems

Patent number: 8447592

Abstract: In one aspect, a method of processing a voice signal to extract information to facilitate training a speech synthesis model is provided. The method comprises acts of detecting a plurality of candidate features in the voice signal, performing at least one comparison between one or more combinations of the plurality of candidate features and the voice signal, and selecting a set of features from the plurality of candidate features based, at least in part, on the at least one comparison. In another aspect, the method is performed by executing a program encoded on a computer readable medium. In another aspect, a speech synthesis model is provided by, at least in part, performing the method.

Type: Grant

Filed: September 13, 2005

Date of Patent: May 21, 2013

Assignee: Nuance Communications, Inc.

Inventors: Michael D. Edgington, Laurence Gillick, Jordan R. Cohen
Methods and devices for coding and decoding the position of the last significant coefficient

Patent number: 8446301

Abstract: Methods and devices are described for entropy coding data using an entropy coder to encode quantized transform domain coefficient data. Last significant coefficient information is signaled in the bitstream using two-dimensional coordinates for the last significant coefficient. The context for bins of one of the coordinates is based, in part, upon the value of the other of the coordinates. In one case, instead of signaling last significant coefficient information, the number of non-zero coefficients is binarized and entropy encoded.

Type: Grant

Filed: April 15, 2011

Date of Patent: May 21, 2013

Assignee: Research In Motion Limited

Inventors: Dake He, Jing Wang
Audio encoding device, audio decoding device, audio encoding method, and audio decoding method

Patent number: 8447597

Abstract: In an encoding process, a CPU transforms an audio signal from the real-time domain to the frequency domain, and transforms the signal into spectra consisting of MDCT coefficients. The CPU separates the audio signal into several frequency bands, and performs bit shifting in each band such that the MDCT coefficients can be expressed with pre-configured numbers of bits. The CPU re-quantizes the MDCT coefficients at a precision differing for each band, and transmits the values acquired thereby and shift bit numbers as encoded data. Meanwhile, in a decoding process, a CPU receives encoded data and inverse re-quantizes and inverse bit shifts the data, thereby restoring the MDCT coefficients. Furthermore, the CPU transforms the data from frequency domain to the real-time domain by using the inverse MDCT, and restores and outputs the audio signal.

Type: Grant

Filed: October 1, 2007

Date of Patent: May 21, 2013

Assignee: Casio Computer Co., Ltd.

Inventor: Hiroyasu Ide
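
The per-band bit shifting mentioned in the abstract above can be sketched for integer coefficients. The sketch assumes signed integer MDCT coefficients, a non-empty band, and a shift applied to magnitudes so that positive and negative values round the same way; the function names are hypothetical and the re-quantization step is not shown.

```python
def shift_band(coeffs, bits):
    """Right-shift a band until its peak magnitude fits in `bits` signed bits.

    Returns (shifted_coeffs, shift). Assumption: coeffs is a non-empty
    list of integers.
    """
    limit = (1 << (bits - 1)) - 1
    peak = max(abs(c) for c in coeffs)
    shift = 0
    while (peak >> shift) > limit:
        shift += 1
    # Shift magnitudes, then restore the sign.
    shifted = [(abs(c) >> shift) * (1 if c >= 0 else -1) for c in coeffs]
    return shifted, shift


def unshift_band(shifted, shift):
    """Inverse bit shift; low-order bits lost in shift_band stay lost."""
    return [(abs(c) << shift) * (1 if c >= 0 else -1) for c in shifted]
```

The shift count plays the role of the "shift bit numbers" that the abstract says are transmitted alongside the coefficient values.
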
Apparatus and method for adaptive audio coding

Patent number: 8442818

Abstract: An audio encoder capable of implementing a plurality of encoding functions, wherein an adaptation controller adjusts the implementation of the encoding functions in response to feedback received by the adaptation controller during use. The adjustment may involve adapting encoding algorithms or selecting alternative encoding algorithms. The encoder may also include an operations scheduler to adjust the order in which the encoding functions are applied. The feedback may be received from internally of the encoder, for example from the currently implemented encoding functions, or from externally of the encoder. A corresponding decoder is also provided.

Type: Grant

Filed: November 16, 2009

Date of Patent: May 14, 2013

Assignee: Cambridge Silicon Radio Limited

Inventor: David Trainor
Vector quantization apparatus, vector dequantization apparatus, and the methods

Patent number: 8438020

Abstract: A vector quantizer which improves the accuracy of vector quantization in switching over a vector quantization codebook on a first stage depending on the type of feature having the correlation with a quantization target vector. In the vector quantizer, a classifier generates classification information representing a type of narrowband LSP vector having the correlation with wideband LSP (Line Spectral Pairs) of the plural types. A first codebook selects one sub-codebook corresponding to the classification information as a codebook used for the quantization of the first stage from plural sub-codebooks corresponding to each of the types of narrowband LSP vectors. A multiplier multiplies the quantization residual vector of the first stage inputted from an adder by a scaling factor corresponding to the classification information of plural scaling factors stored in a scaling factor determiner and outputs it to an adder as the quantization target of a second stage.

Type: Grant

Filed: October 10, 2008

Date of Patent: May 7, 2013

Assignee: Panasonic Corporation

Inventors: Kaoru Satoh, Toshiyuki Morii, Hiroyuki Ehara
Encoding and decoding method and device

Patent number: 8436754

Abstract: The present invention relates to information processing technologies and discloses an encoding and decoding method and device to solve the poor decoding quality problem. The technical solution of the present invention includes: encoding each sample of an input signal to generate an encoded signal of a core layer; comparing residuals of all or a part of the samples of the input signal with encoding thresholds, where the residuals are generated by core layer encoding, and performing encoding according to comparison results to generate an encoded signal of an enhancement layer; and writing the encoded signal of the core layer and the encoded signal of the enhancement layer into a bitstream to generate an encoded signal of the input signal.

Type: Grant

Filed: April 14, 2011

Date of Patent: May 7, 2013

Assignee: Huawei Technologies Co., Ltd.

Inventors: Chen Hu, Lei Miao, Zexin Liu, Longyin Chen, Qing Zhang, Herve Marcel Taddei
Audio decoding device, audio decoding method, program, and integrated circuit

Patent number: 8428953

Abstract: An audio decoding device of the present invention includes: a decoding unit decoding a stream to a spectrum coefficient, and outputting stream information when a frame included in the stream cannot be decoded; an orthogonal transformation unit transforming the spectrum coefficient to a time signal; a correction unit generating a correction time signal based on an output waveform within a reference section that is in a section that overlaps between an error frame section to which the stream information is outputted and an adjacent frame section and that is a section in the middle of the adjacent frame section, when the decoding unit outputs the stream information; and an output unit generating the output waveform by synthesizing the correction time signal and the time signal.

Type: Grant

Filed: May 20, 2008

Date of Patent: April 23, 2013

Assignee: Panasonic Corporation

Inventors: Kojiro Ono, Takeshi Norimatsu, Yoshiaki Takagi, Takashi Katayama
Distributed record server architecture for recording call sessions over a VoIP network

Patent number: 8422641

Abstract: Devices, systems, and methods for recording call sessions over a VoIP network using a distributed record server architecture are disclosed. An example recording device for recording segments of a call session includes a record server configured to receive an agent voice data stream and an external caller voice data stream from an agent telephone station, and a file repository configured to store voice data and call data associated with each recorded segment of the call session. The recording device is configured to tag recorded segments of each call session, which can be later used by a third-party application or database to check the status and/or integrity of the recorded call session.

Type: Grant

Filed: June 15, 2009

Date of Patent: April 16, 2013

Assignee: Calabrio, Inc.

Inventor: James Paul Martin, II
Voice recognition system, method, and program

Patent number: 8417518

Abstract: A voice recognition system comprises: a voice input unit that receives an input signal from a voice input element and outputs it; a voice detection unit that detects an utterance segment in the input signal; a voice recognition unit that performs voice recognition for the utterance segment; and a control unit that outputs a control signal to at least one of the voice input unit and the voice detection unit and suppresses a detection frequency if the detection frequency satisfies a predetermined condition.

Type: Grant

Filed: February 27, 2008

Date of Patent: April 9, 2013

Assignee: NEC Corporation

Inventor: Toru Iwasawa
