Time Patents (Class 704/211)

Pulse code modulation (pcm) (Class 704/212)

Zero crossing (Class 704/213)

Voiced or unvoiced (Class 704/214)

Silence decision (Class 704/215)

Correlation function (Class 704/216)

Method and system for enabling audio speed conversion

Patent number: 7363232

Abstract: The present invention provides a method and system for processing an audio signal. According to an exemplary method, an audio signal such as a digital voice signal is received and divided into one or more individual unit cycles. An audio speed conversion operation is enabled by repeating or removing one or more of the individual unit cycles. In particular, repeating one or more of the individual unit cycles decreases audio speed, and removing one or more of the individual unit cycles increases audio speed.

Type: Grant

Filed: June 29, 2001

Date of Patent: April 22, 2008

Assignee: Thomson Licensing

Inventors: Magdy Megeid, Markus Inkamp
Speech-duration detector and computer program product therefor

Publication number: 20080077400

Abstract: A speech-duration detector includes a starting-end detecting unit that detects a starting end of a first duration where the characteristic exceeds a threshold value as a starting end of a speech-duration, when the first duration continues for a first time length; a trailing-end-candidate detecting unit that detects a starting end of a second duration where the characteristic is lower than the threshold value as a candidate point for a trailing end of speech, when the second duration continues for a second time length; and a trailing-end-candidate determining unit that determines the candidate point as a trailing end of the speech-duration, when the second duration where the characteristic exceeds the threshold value does not continue for the first time length while a third time length elapses from measurement at the candidate point.

Type: Application

Filed: March 20, 2007

Publication date: March 27, 2008

Applicant: KABUSHIKI KAISHA TOSHIBA

Inventors: Koichi Yamamoto, Akinori Kawamura
Adaptive noise state update for a voice activity detector

Patent number: 7346502

Abstract: There is provided a method of updating a noise state of a voice activity detector (VAD) for indicating an active voice mode and an inactive voice mode. The method comprises receiving an input signal having a plurality of frames, determining an elapsed time since the last update of the noise state, updating the noise state of the VAD if the elapsed time exceeds a predetermined time, determining an average minimum energy based on two or more of the plurality of frames, determining a current minimum energy based on a current frame of the plurality of frames, updating the noise state of the VAD if the average minimum energy is less than the current minimum energy, and updating the noise state of the VAD if the average minimum energy is greater than the current minimum energy plus a first predetermined value.

Type: Grant

Filed: January 26, 2006

Date of Patent: March 18, 2008

Assignee: Mindspeed Technologies, Inc.

Inventors: Yang Gao, Eyal Shlomot, Adil Benyassine
METHOD AND APPARATUS FOR PROCESSING SPEECH SIGNAL DATA

Publication number: 20080059157

Abstract: Method and computing apparatus for processing speech signal data. A speech signal is divided into frames. Each frame is characterized by a frame number T representing a unique interval of time. Each speech signal is characterized by a power spectrum with respect to frame T and frequency band ?. A speech segment and a reverberation segment of the speech signal is determined. L filter coefficients W(k) (k=1, 2, . . . , L) respectively corresponding to L frames immediately preceding frame T are computed such that the L filter coefficients minimize a function ? that is a linear combination of sum of squares of a residual speech power in the reverberation segment and a sum of squares of a subtracted speech power in the speech segment. The computed L filter coefficients are stored within storage media of the computing apparatus.

Type: Application

Filed: August 7, 2007

Publication date: March 6, 2008

Inventors: Takashi Fukuda, Osamu Ichikawa, Masafumi Nishimura
Packet Loss Concealment for Sub-band Predictive Coding Based on Extrapolation of Full-band Audio Waveform

Publication number: 20080046233

Abstract: A technique for concealing the effect of a lost frame in a series of frames representing an encoded audio signal in a sub-band predictive coding system is provided. In accordance with the technique, one or more received frames in the series of frames are decoded to generate a full-band output audio signal, wherein the full-band output audio signal comprises a combination of at least a first sub-band decoded audio signal and a second sub-band decoded audio signal. The full-band output audio signal corresponding to the one or more received frames is stored. Then, a full-band output audio signal corresponding to the lost frame is synthesized, wherein synthesizing the full-band output audio signal corresponding to the lost frame comprises performing waveform extrapolation based on the stored full-band output audio signal corresponding to the one or more received frames.

Type: Application

Filed: August 15, 2007

Publication date: February 21, 2008

Applicant: BROADCOM CORPORATION

Inventors: Juin-Hwey Chen, Jes Thyssen, Robert W. Zopf
Fast wavelet estimation of weak bio-signals using novel algorithms for generating multiple additional data frames

Patent number: 7333619

Abstract: A method and apparatus for de-noising weak bio-signals having a relatively low signal to noise ratio utilizes an iterative process of wavelet de-noising a data set comprised of a new set of frames of wavelet coefficients partially generated through a cyclic shift algorithm. The method preferably operates on a data set having 2N frames, and the iteration is performed N?1 times. The resultant wavelet coefficients are then linearly averaged and an inverse discrete wavelet transform is performed to arrive at the de-noised original signal. The method is preferably carried out in a digital processor.

Type: Grant

Filed: May 30, 2006

Date of Patent: February 19, 2008

Assignees: Everest Biomedical Instruments Company, Washington University

Inventors: Elvir Causevic, Eldar Causevic, Mladen Victor Wickerhauser
Audio segmentation and classification

Patent number: 7328149

Abstract: A portion of an audio signal is separated into multiple frames from which one or more different features are extracted. These different features are used, in combination with a set of rules, to classify the portion of the audio signal into one of multiple different classifications (for example, speech, non-speech, music, environment sound, silence, etc.). In one embodiment, these different features include one or more of line spectrum pairs (LSPs), a noise frame ratio, periodicity of particular bands, spectrum flux features, and energy distribution in one or more of the bands. The line spectrum pairs are also optionally used to segment the audio signal, identifying audio classification changes as well as speaker changes when the audio signal is speech.

Type: Grant

Filed: November 29, 2004

Date of Patent: February 5, 2008

Assignee: Microsoft Corporation

Inventors: Hao Jiang, Hongjiang Zhang
SYSTEMS, METHODS, AND APPARATUS FOR GAIN FACTOR LIMITING

Publication number: 20080027718

Abstract: The range of disclosed configurations includes methods in which subbands of a speech signal are separately encoded, with the excitation of a first subband being derived from a second subband. Gain factors are calculated to indicate a time-varying relation between envelopes of the original first subband and of the synthesized first subband. The gain factors are quantized, and quantized values that exceed the pre-quantized values are re-coded.

Type: Application

Filed: December 13, 2006

Publication date: January 31, 2008

Inventors: Venkatesh Krishnan, Ananthapadmanabhan A. Kandhadai
Systems and methods for dynamically analyzing temporality in speech

Patent number: 7324944

Abstract: Systems and methods for dynamically analyzing temporality in an individual's speech in order to selectively categorize the speech fluency of the individual and/or to selectively provide speech training based on the results of the dynamic analysis. Temporal variables in one or more speech samples are dynamically quantified. The temporal variables in combination with a dynamic process, which is derived from analyses of temporality in the speech of native speakers and language learners, are used to provide a fluency score that identifies a proficiency of the individual. In some implementations, temporal variables are measured instantaneously.

Type: Grant

Filed: December 11, 2003

Date of Patent: January 29, 2008

Assignee: Brigham Young University, Technology Transfer Office

Inventors: Lynne Hansen, Joshua Rowe
Method and arrangement in a communication system

Patent number: 7321851

Abstract: The present invention relates to the decoding-/playback part of received sound data packets in systems for transmission of sound over packet switched networks. According to the invention, the lengths of received signal frames are manipulated by performing time expansion or time compression of one or more signal frames at time varying intervals and with time varying lengths of the expansion or the compression, said intervals and said lengths being determined so as to maintain a continuous flow of signal samples to be played back.

Type: Grant

Filed: February 4, 2000

Date of Patent: January 22, 2008

Assignees: Global IP Solutions (GIPS) AB, Global IP Solutions, Inc.

Inventors: Soren V. Andrsen, Willem B. Kleijn, Patrik Sörqvist
Audio Encoder, Audio Decoder and Audio Processor Having a Dynamically Variable Warping Characteristic

Publication number: 20080004869

Abstract: An audio encoder, an audio decoder or an audio processor includes a filter for generating a filtered audio signal, the filter having a variable warping characteristic, the characteristic being controllable in response to a time-varying control signal, the control signal indicating a small or no warping characteristic or a comparatively high warping characteristic. Furthermore, a controller is connected for providing the time-varying control signal, which depends on the audio signal. The filtered audio signal can be introduced to an encoding processor having different encoding algorithms, one of which is a coding algorithm adapted to a specific signal pattern. Alternatively, the filter is a post-filter receiving a decoded audio signal.

Type: Application

Filed: June 30, 2006

Publication date: January 3, 2008

Inventors: Juergen Herre, Bernhard Grill, Markus Multrus, Stefan Bayer, Ulrich Kraemer, Jens Hirschfeld, Stefan Wabnik, Gerald Schuller
Apparatus and method for concealing erased periodic signal data

Patent number: 7305338

Abstract: Circuitry and a method compensate the erasure of speech signal data or similar periodic signal data, by substitution using past periodic signal data input. After a predetermined number of latest periodic signal data have been saved, whether or not an erasure occurs is determined with every periodic signal data sequence, which is a unit of processing. When an erasure occurs, one of periodic signal data sequences saved, which lies in a determined segment to be used, is used to generate synthetic data for substitution. The position of the segment to be used is determined such that when the erasure continues over units of processing, the position sequentially varies gradually for each processing units.

Type: Grant

Filed: May 14, 2004

Date of Patent: December 4, 2007

Assignee: Oki Electric Industry Co., Ltd.

Inventors: Atsushi Tashiro, Hiromi Aoyagi, Masashi Takada
Method and apparatus for speech coding and decoding

Patent number: 7305337

Abstract: The present invention includes a method for speech encoding and decoding and a design of speech coder and decoder. The characteristic of speech encoding method relies on the type of data with high compression rate after the whole speech data is compressed. The present invention is able to lower the bit rate of the original speech from 64 Kbps to 1.6 Kbps and provide a bit rate lower than the traditional compression method. It can provide good speech quality, and attain the function of storing the maximum speech data with minimum memory. As to the speech decoding method, some random noises are appropriated added into the exciting source, so that more speech characteristics can be simulated to produce various speech sounds. In addition, the present invention also discloses a coder and a decoder designed by application specific integrated circuit, and the structural design is optimized according to the software.

Type: Grant

Filed: December 24, 2002

Date of Patent: December 4, 2007

Assignee: National Cheng Kung University

Inventors: Jhing-Fa Wang, Jia-Ching Wang, Yun-Fei Chao, Han-Chiang Chen, Ming-Chi Shih
Fast estimation of weak bio-signals using novel algorithms for generating multiple additional data frames

Patent number: 7302064

Abstract: A method and apparatus for de-noising weak bio-signals having a relatively low signal to noise ratio utilizes an iterative process of de-noising a data set comprised of a new set of frames. The method separately performs a non-linear de-noising operation on each of the component frames and combines the resultant de-noised frames to form a combined resultant de-noised input signal. The method is preferably carried out in a digital processor.

Type: Grant

Filed: January 24, 2006

Date of Patent: November 27, 2007

Assignee: Brainscope Company, Inc.

Inventors: Elvir Causevic, Eldar Causevic
Method and apparatus to prepare listener-interest-filtered works

Patent number: 7299184

Abstract: An embodiment of the present invention is a method for generating a listener-interest-filtered work for an audio or audio-visual work, which method includes steps of: (a) generating one or more average speed contours for one or more audio or audio-visual works for one or more categories of users; (b) converting the one or more average speed contours to one or more conceptual speed association data structures; and forming a listener-interest-filtered conceptual speed association data structure from the one or more conceptual speed association data structures.

Type: Grant

Filed: September 7, 2004

Date of Patent: November 20, 2007

Assignee: Enounce Incorporated

Inventor: Donald J. Hejna, Jr.
Information processing apparatus, information processing method, and program

Publication number: 20070265841

Abstract: An information processing apparatus, comprises: a lower time series data generation unit having a plurality of recurrent neural networks which learn predetermined time series data, and generate prediction time series data according to the learning result; an upper time series data generation unit having recurrent neural networks which learn error time series data that is time series data of errors raised at the time of the learning by the respective plural recurrent neural networks of the lower time series data generation unit, and generate prediction error time series data that is time series data of prediction errors according to the learning result; and a conversion unit that performs nonlinear conversion for the prediction errors generated by the upper time series data generation unit, wherein the lower time series data generation unit outputs the prediction time series data generated by the respective plural recurrent neural networks according to the prediction errors which have undergone the nonlinear c

Type: Application

Filed: May 14, 2007

Publication date: November 15, 2007

Inventors: Jun Tani, Ryunosuke Nishimoto, Masato Ito
Method for determining intensity parameters of background noise in speech pauses of voice signals

Patent number: 7277847

Abstract: A method for determining intensity characteristics of background noise during speech pauses of speech signals includes determining a proportion of speech pauses in the undisturbed source speech signal so as to define a frequency threshold. The disturbed speech signal is divided into short successive signal elements, an intensity value is determined for each of the signal elements, and a cumulative relative frequency distribution is formed from the determined intensity values of the signal elements. The cumulative relative frequency distribution is used to determine an intensity threshold value which corresponds to the defined frequency threshold. At least one intensity characteristic of the background noise during the speech pauses is determined using a region of the cumulative relative frequency distribution below the intensity threshold value.

Type: Grant

Filed: April 3, 2002

Date of Patent: October 2, 2007

Assignee: Deutsche Telekom AG

Inventor: Jens Berger
Method for making a voice activity decision

Patent number: 7254532

Abstract: The invention relates to a method for determining voice activity in a signal section of an audio signal. The result, i.e., whether voice activity is present in the section of the signal thus observed, depends upon spectral and temporal stationarity of the signal section and/or prior signal sections. In a first step, the method determines whether there is spectral stationarity in the observed signal section. In a second step, the method determines whether there is temporal stationarity in the signal section in question. The final decision as to the presence of voice activity in the signal section observed depends upon the initial values of both steps.

Type: Grant

Filed: March 16, 2001

Date of Patent: August 7, 2007

Assignee: Deutsche Telekom AG

Inventors: Alexander Kyrill Fischer, Christoph Erdmann
EFFICIENT FILTERING WITH A COMPLEX MODULATED FILTERBANK

Publication number: 20070179781

Abstract: A filter apparatus for filtering a time domain input signal to obtain a time domain output signal, which is a representation of the time domain input signal filtered using a filter characteristic having an non-uniform amplitude/frequency characteristic, comprises a complex analysis filter bank for generating a plurality of complex subband signals from the time domain input signals, a plurality of intermediate filters, wherein at least one of the intermediate filters of the plurality of the intermediate filters has a non-uniform amplitude/frequency characteristic, wherein the plurality of intermediate filters have a shorter impulse response compared to an impulse response of a filter having the filter characteristic, and wherein the non-uniform amplitude/frequency characteristics of the plurality of intermediate filters together represent the non-uniform filter characteristic, and a complex synthesis filter bank for synthesizing the output of the intermediate filters to obtain the time domain output signal.

Type: Application

Filed: September 1, 2006

Publication date: August 2, 2007

Inventor: Lars Villemoes
Method and device for analyzing a wave signal and method and apparatus for pitch detection

Patent number: 7251596

Abstract: The present invention provides a unique wave-trigon transformation (WTT) method for performing transformation process over a wave signal. The present invention also provides a pitch detecting method and apparatus for detecting pitch based on the WTT process as well as a sentence detecting method and apparatus for detecting a sentence in a sound signal based on the WTT process. The pitch detecting method and apparatus can effectively detect pitch in a sound signal. In the WTT process, an inputted wave signal (such as a sound signal) is transformed into a series of trigons, and an energy-width spectrum is formed using these trigons. For a sound signal containing voice, the distribution of trigons transformed from the sound signal has a certain pattern. By analyzing the pattern, whether a pitch is contained in the sound signal can be determined. In particular, existence of a pitch can be determined by determining and evaluating the periodicity of trigons in a candidate chained peak in the energy-width spectrum.

Type: Grant

Filed: December 23, 2002

Date of Patent: July 31, 2007

Assignee: Canon Kabushiki Kaisha

Inventors: Lianshan Zhu, Tao Yu
Adaptive time and/or frequency-based encoding mode determination apparatus and method of determining encoding mode of the apparatus

Publication number: 20070174051

Abstract: An adaptive time/frequency-based encoding mode determination apparatus including a time domain feature extraction unit to generate a time domain feature by analysis of a time domain signal of an input audio signal, a frequency domain feature extraction unit to generate a frequency domain feature corresponding to each frequency band generated by division of a frequency domain corresponding to a frame of the input audio signal into a plurality of frequency domains, by analysis of a frequency domain signal of the input audio signal, and a mode determination unit to determine any one of a time-based encoding mode and a frequency-based encoding mode, with respect to the each frequency band, by use of the time domain feature and the frequency domain feature.

Type: Application

Filed: September 21, 2006

Publication date: July 26, 2007

Applicant: SAMSUNG Electronics Co., Ltd.

Inventors: Eun Mi Oh, Ki Hyun Choo, Jung-Hoe Kim, Chang Yong Son
Speed control playback of parametric speech encoded digital audio

Patent number: 7239999

Abstract: A method of pitch corrected speed control (PCSC) playback in which a decoder rate controller receives a desired playback speed from a PCSC controller and determines the number of decoded digital audio samples stored in a buffer. The rate controller then determines the required number of execution times of a parametric speech decoder based on the desired playback speed and the number of decoded samples stored in the buffer. The parametric speech decoder is then executed the determined number of times.

Type: Grant

Filed: July 23, 2002

Date of Patent: July 3, 2007

Assignee: Intel Corporation

Inventor: Changwon D. Rhee
Method and apparatus for gradient-descent based window optimization for linear prediction analysis

Patent number: 7231344

Abstract: The shape of windows used during linear predictive analysis can be optimized through the use of gradient-descent based window optimization procedures. Window optimization may be achieved fairly precisely through the use of a primary optimization procedure, or less precisely through the use of an alternate optimization procedure. Both optimization procedures use the principle of gradient-descent to find a window sequence that will either minimize the prediction error energy or maximize the segmental prediction gain. However, the primary optimization procedure uses a Levinson-Durbin based algorithm to determine the gradient while the alternate optimization procedure uses an estimate of the gradient based on the basic definition of a derivative. These optimization procedures can be implemented as computer readable software code. Additionally, the optimization procedures may be implemented in a window optimization device which generally includes a window optimization unit and may also include an interface unit.

Type: Grant

Filed: October 29, 2002

Date of Patent: June 12, 2007

Assignee: NTT DoCoMo, Inc.

Inventor: Wai C. Chu
Method for detecting the time sequences of a fundamental frequency of an audio response unit to be synthesized

Patent number: 7219061

Abstract: Predetermined macrosegments of the fundamental frequency are determined by a neural network, and these predefined macrosegments are reproduced by fundamental-frequency sequences stored in a database. The fundamental frequency is generated on the basis of a relatively large text section which is analyzed by the neural network. Microstructures from the database are received in the fundamental frequency. The fundamental frequency thus formed is thus optimized both with regard to its macrostructure and to its microstructure. As a result, an extremely natural sound is achieved.

Type: Grant

Filed: October 24, 2000

Date of Patent: May 15, 2007

Assignee: Siemens Aktiengesellschaft

Inventors: Caglayan Erdem, Martin Holzapfel
Method and apparatus for reducing access delay in discontinuous transmission packet telephony systems

Patent number: 7197464

Abstract: A system, method and computer-readable medium are disclosed for operating a communications network. The method aspect comprises receiving an audio signal and to remove a first portion of a frame of the audio signal, and generating an overlap-added segment from (1) a first segment of the frame, the first segment being located before the first portion; and (2) a second segment of the frame, the second segment comprising an endmost portion of a terminal section of the frame. The method preferably operates in a discontinuous transmission packet telephony network having a channel access delay.

Type: Grant

Filed: July 27, 2005

Date of Patent: March 27, 2007

Assignee: AT&T Corp.

Inventors: Richard Vandervoort Cox, David A. Kapilow
Time-scale modification of data-compressed audio information

Patent number: 7143047

Abstract: A data-compressed audio waveform is temporally modified without requiring complete decompression of the audio signal. Packets of compressed audio data are first unpacked, to remove scaling that was applied in the formation of the packets. The unpacked data is then temporally modified, using one of a number of different approaches. This modification takes place while the audio information remains in a data-compressed format. New packets are then assembled from the modified data, to produce a data-compressed output stream that can be subsequently processed in a conventional manner to reproduce the desired sound. The assembly of the new packets employs a technique for inferring an auditory model from the original packets, to requantize the data in the output packets.

Type: Grant

Filed: September 17, 2004

Date of Patent: November 28, 2006

Assignee: Vulcan Patents LLC

Inventors: Michele M. Covell, Malcolm Slaney, Arthur Rothstein
Apparatus and method for changing the playback rate of recorded speech

Patent number: 7143029

Abstract: An apparatus for changing the playback rate of recorded speech includes memory storing a plurality of recorded speech messages and a plurality of feature tables. Each feature table is associated with an individual one of the speech messages and includes speech frame parameters based on the jitter states of speech frames of the associated recorded speech message. A playback module receives input specifying a recorded speech message in the memory to be played and the rate at which the recorded speech message is to be played back. In response to the input, the playback module uses a set of decision rules to modify the specified speech message based on the speech frame parameters in the feature table associated with the specified speech message and the specified playback rate, prior to playing back the specified speech message.

Type: Grant

Filed: September 9, 2004

Date of Patent: November 28, 2006

Assignee: Mitel Networks Corporation

Inventor: Moustafa Elshafei
Method for analysis of vocal jitter for near-term suicidal risk assessment

Patent number: 7139699

Abstract: Method and apparatus to measure jitter (period-to-period fluctuations in fundamental frequency) among the voices of suicidal, major depressed, and non-suicidal patients to predict near-term suicidal risk.

Type: Grant

Filed: October 5, 2001

Date of Patent: November 21, 2006

Inventors: Stephen E. Silverman, Asli Ozdas, Marilyn K. Silverman
Virtual presence

Patent number: 7120577

Abstract: A system and terminal for facilitating a “virtual presence” allows users on a communication network to simply begin speaking through other users. A system immediately detects the destination party's name, and begins routing the audio signal to a particular destination without any noticeable call set-up. Additionally, the system performs pitch corrected speed control in order to allow the detection and processing of a speech pattern without causing delay to an end user.

Type: Grant

Filed: January 9, 2003

Date of Patent: October 10, 2006

Assignee: Intel Corporation

Inventor: Howard Bubb
Linking in parametric encoding

Patent number: 7085724

Abstract: The invention relates to a linking unit 100, a parametric encoder 400 and a method for generating linking information L indicating components of consecutive extended segments sp and sc which may be linked together in order to form a sinusoidal track. The segments sp and sc approximate consecutive segments of a sinusoidal audio or speech signal s. The linking unit comprises a calculating unit 120 for generating a similarity matrix S(m,n) in response to received sinusoidal code data and an evaluating unit 140 for receiving and evaluating said similarity matrix S in order to generate said linking information by selecting those pairs of components m,n the similarity of which is maximal. According to the invention the calculating unit 120 is adapted to calculate the similarity matrix S by additionally considering information about the phase consistency between the components of the extended previous segment sp and the extended current segment sc.

Type: Grant

Filed: January 14, 2002

Date of Patent: August 1, 2006

Assignee: Koninklijke Philips Electronics N.V.

Inventors: Albertus Cornelis Den Brinker, Arnoldus Werner Johannes Oomen, Fransiscus Marinus Jozephus De Bont, Erik Gosuinus Petrus Schuijers
Speech recognition from concurrent visual and audible inputs

Patent number: 7072829

Abstract: With respect to each of codes corresponding to code vectors in a code book stored in a code book storage section, an expectation degree storage section stores an expectation degree at which observation is expected when an integrated parameter with respect to a word as a recognition target is inputted. A vector quantization section vector-quantizes the integrated parameter and outputs a series of codes of a code vector which has a shortest distance to the integrated parameter.

Type: Grant

Filed: June 10, 2002

Date of Patent: July 4, 2006

Assignee: Sony Corporation

Inventors: Tetsujiro Kondo, Norifumi Yoshiwara
System and method for concealment of data loss in digital audio transmission

Patent number: 7069208

Abstract: A system and method for the concealment of errors resulting from missing or corrupted data in the transmission of audio signals in compressed digital packet formats is disclosed. The system utilizes a circular FIFO buffer to store audio frames from the transmitted audio signal, and a beat detector, to identify the presence of beats in the audio signal. The error concealment method replaces erroneous audio frames with error-free audio frames by a process which takes into account the presence and location of the detected beats.

Type: Grant

Filed: January 24, 2001

Date of Patent: June 27, 2006

Assignee: Nokia, Corp.

Inventor: Ye Wang
Method of and system for coding and decoding sound signals

Patent number: 7069210

Abstract: Method of and system for coding a sound signal (10) as multiple independent streams of frames (14, 15) by creating frames (1,2,3,4,5,6) using sinusoidal coding and then placing frame i into stream i modulo the number of streams, method of and system for reconstructing a sound signal (23) by decoding frames from multiple streams (21, 22) in an interleaved fashion and reconstructing missing frames by using information from surrounding frames, system for recording and playing back sound signals implementing the above two methods, where under normal circumstances both streams (31, 32) of a coded signal are stored, and when capacity on the storage medium (35) is low, only one of the two streams of a coded signal is stored while one of the two streams of existing coded signals is overwritten and allowing a decoder (37) to reconstruct a sound signal by using either both or the one available stream for that sound signal.

Type: Grant

Filed: November 29, 2000

Date of Patent: June 27, 2006

Assignee: Koninklijke Philips Electronics N.V.

Inventor: Rakesh Taori
Fast wavelet estimation of weak bio-signals using novel algorithms for generating multiple additional data frames

Patent number: 7054454

Abstract: A method and apparatus for de-noising weak bio-signals having a relatively low signal to noise ratio utilizes an iterative process of wavelet de-noising a data set comprised of a new set of frames of wavelet coefficients partially generated through a cyclic shift algorithm. The method preferably operates on a data set having 2N frames, and the iteration is performed N?1 times. The resultant wavelet coefficients are then linearly averaged and an inverse discrete wavelet transform is performed to arrive at the de-noised original signal. The method is preferably carried out in a digital processor.

Type: Grant

Filed: March 29, 2002

Date of Patent: May 30, 2006

Assignee: Everest Biomedical Instruments Company

Inventors: Elvir Causevic, Eldar Causevic, Mladen Victor Wickerhauser
Fast estimation of weak bio-signals using novel algorithms for generating multiple additional data frames

Patent number: 7054453

Abstract: A method and apparatus for de-noising weak bio-signals having a relatively low signal to noise ratio utilizes an iterative process of de-noising a data set comprised of a new set of frames. The method separately performs a non-linear de-noising operation on each of the component frames and combines the resultant de-noised frames to form a combined resultant de-noised input signal. The method is preferably carried out in a digital processor.

Type: Grant

Filed: March 29, 2002

Date of Patent: May 30, 2006

Assignee: Everest Biomedical Instruments Co.

Inventors: Elvir Causevic, Eldar Causevic
Method and apparatus to determine and use audience affinity and aptitude

Patent number: 7043433

Abstract: Embodiments of the present invention provide method and apparatus for determining audience affinity and/or aptitude in portions of media works and for developing information that represent measures of the audience affinity and/or aptitude. Further embodiments of present invention provide method and apparatus for utilizing the information to create altered media works and/or to present the altered media works to an audience. One embodiment of the present invention is a method for inferring audience affinity or aptitude with regard to content or properties of portions of a media work which includes: (a) presenting the media work to an audience; (b) obtaining user input regarding presentation rates for the portions of the media work; (c) correlating content or properties of the portion with the presentation rates; and; (d) associating audience affinity or aptitude with the correlated content or properties.

Type: Grant

Filed: September 16, 1999

Date of Patent: May 9, 2006

Assignee: Enounce, Inc.

Inventor: Donald J. Hejna, Jr.
Model adaptive apparatus and model adaptive method, recording medium, and pattern recognition apparatus

Patent number: 7043425

Abstract: In order to improve recognition performance, a no-speech sound model correction section performs an adaptation of a no-speech sound model which is a sound model representing a no-speech state on the basis of input data observed in an interval immediately before a speech recognition interval for the object of speech recognition and the degree of freshness representing the recentness of the input data.

Type: Grant

Filed: March 24, 2005

Date of Patent: May 9, 2006

Assignee: Sony Corporation

Inventor: Hongchang Pao
Method and apparatus for reducing access delay in discontinuous transmission packet telephony systems

Patent number: 7016850

Abstract: Speech at the beginning of a talkspurt in a discontinuous transmission (DTX) packet telephony system is speeded up to help make up for an access delay incurred during channel allocation. Incoming speech frames are buffered, a pitch period for a current portion of the signal is estimated, and then a pitch period=s worth of the signal is cut from that portion. This is continued until the original access delay, as estimated from the time lag between the commencement of voice input for the talkspurt, and notification that a channel is available, is eliminated. The remainder of the talkspurt is then transmitted without such compression.

Type: Grant

Filed: January 25, 2001

Date of Patent: March 21, 2006

Assignee: AT&T Corp.

Inventors: Richard Vandervoort Cox, David A Kapilow
Method and system for waveform compression and expansion with time axis

Patent number: 7010491

Abstract: With the goal of presenting a waveform compression and expansion apparatus with which the sound quality of such things as musical tones that are expressed by waveforms is satisfactory following the compression and expansion of the waveforms of the musical tones etc., a method and system for waveform compression and expansion is disclosed in which all of the multiple number of band divided waveforms that comprise the original waveform which has been band divided are apportioned to at least two kinds of compression and expansion formats and form a multiple number of compressed and expanded waveforms by compression or expansion an identical amount only in the direction of the temporal axis.

Type: Grant

Filed: December 9, 1999

Date of Patent: March 7, 2006

Assignee: Roland Corporation

Inventor: Tadao Kikumoto
Method and apparatus for performing speech segmentation

Patent number: 7010481

Abstract: In a method for performing a segmentation operation upon a synthesizing speech signal and an input speech signal, a synthesized speech signal and a speech element duration signal are generated from the synthesizing speech signal A first feature parameter is extracted from the synthesized speech signal, and a second feature parameter is extracted from the input speech signal. A dynamic programming matching operation is performed upon the second feature parameter with reference to the first feature parameter and the speech element duration signal to obtain segmentation points of the input speech signal.

Type: Grant

Filed: March 27, 2002

Date of Patent: March 7, 2006

Assignee: NEC Corporation

Inventor: Takuya Takizawa
Synchronization and overlap method and system for single buffer speech compression and expansion

Patent number: 6999922

Abstract: The present invention (110) permits a user to speed up and slow down speech without changing the speakers pitch (102, 110, 112, 128, 402–416). It is a user adjustable feature to change the spoken rate to the listeners' preferred listening rate or comfort. It can be included on the phone as a customer convenience feature without changing any characteristics of the speakers voice besides the speaking rate with soft key button (202) combinations (in interconnect or normal). From the users perspective, it would seem only that the talker changed his speaking rate, and not that the speech was digitally altered in any way. The pitch and general prosody of the speaker are preserved. The following uses of the time expansion/compression feature are listed to compliment already existing technologies or applications in progress including messaging services, messaging applications and games, real-time feature to slow down the listening rate.

Type: Grant

Filed: June 27, 2003

Date of Patent: February 14, 2006

Assignee: Motorola, Inc.

Inventors: Marc Andre Boillot, John Gregory Harris, Thomas Lawrence Reinke
Time-scale modification of music signals based on polyphase filterbanks and constrained time-domain processing

Patent number: 6982377

Abstract: A time scale modification method employs separate bands obtained through an analysis polyphase filter bank with separate time-scale modification processing for the bands. The outputs are combined using a synthesis filter bank. Some constraints are imposed on the time-scale modification processing, such a limitation of the range of overlap adjustment values for bands other than the greatest energy band, to eliminate noise due to aliasing and inter-channel phase mismatch. This invention produces output quality considerably higher than conventional time-domain time-scale modification methods for general music signals with computational requirements comparable to those of conventional time-domain time-scale modification methods.

Type: Grant

Filed: December 18, 2003

Date of Patent: January 3, 2006

Assignee: Texas Instruments Incorporated

Inventors: Atsuhiro Sakurai, Steven Trautmann, Daniel L. Zelazo
Method and apparatus for performing harmonic noise weighting in digital speech coders

Patent number: 6983241

Abstract: To address the need for choosing values of harmonic noise weighting (HNW) coefficient (?p) so that the amount of harmonic noise weighting can be optimized, a method and apparatus for performing harmonic noise weighting in digital speech coders is provided herein. During operation, received speech is analyzed to determine a pitch period. HNW coefficients are then chosen based on the pitch period, and a perceptual noise weighting filter (C(z)) is determined based on the harmonic-noise weighting (HNW) coefficients (?p).

Type: Grant

Filed: October 14, 2004

Date of Patent: January 3, 2006

Assignee: Motorola, Inc.

Inventors: Udar Mittal, James P. Ashley
System and method for adapting speech playback speed to typing speed

Patent number: 6952673

Abstract: A system and method for automatically adjusting the rate at which recorded speech is played back as a typist manually transcribes the speech. The typing speed is measured and a speech playback rate determined based on the measured speed. The playback rate of the audio is then automatically increased or decreased as appropriate to match the typing speed.

Type: Grant

Filed: February 20, 2001

Date of Patent: October 4, 2005

Assignee: International Business Machines Corporation

Inventors: Arnon Amir, Michael Rodeh
Low speed speech encoding method based on Internet protocol

Patent number: 6947887

Abstract: A low speed encoding method based on Internet protocol (IP) includes the steps of determining speech characteristic parameters in TN duration, determining an optimized frame length T for successive speech data processing according to the characteristic parameters, making compressed encoding of the speech data in every T, assembling a packet of the encoded bits with TCP and UDP, again assembling a packet of the assembled bits with IP, and finally outputting the channel. The method uses a single frame, variable length frame, intra-frame adaptive low speed speech encoding method, which has the advantages of reducing the bit rate and raising transmission efficiency. The method takes an optimized length encoded frame as a unit to break the IP datagram, and therefore raises encoding and decoding quality of the speech data greatly. Informal tests show that the method can raise a MOS (mean opinion score) value from 0.1 to 0.2.

Type: Grant

Filed: February 19, 2003

Date of Patent: September 20, 2005

Assignee: Huawei Technologies Co., Ltd.

Inventors: Shengxi Pan, Yingtao Li
Model adaptive apparatus for performing adaptation of a model used in pattern recognition considering recentness of a received pattern data

Patent number: 6920421

Abstract: In order to improve recognition performance, a no-speech sound model correction section performs an adaptation of a no-speech sound model which is a sound model representing a no-speech state on the basis of input data observed in an interval immediately before a speech recognition interval for the object of speech recognition and the degree of freshness representing the recentness of the input data.

Type: Grant

Filed: December 26, 2000

Date of Patent: July 19, 2005

Assignee: Sony Corporation

Inventor: Hongchang Pao
Virtual presence

Patent number: 6898565

Abstract: A system and terminal for facilitating a “virtual presence” allows users on a communication network to simply begin speaking through other users. A system immediately detects the destination party's name, and begins routing the audio signal to a particular destination without any noticeable call set-up. Additionally, the system performs pitch corrected speed control in order to allow the detection and processing of a speech pattern without causing delay to an end user.

Type: Grant

Filed: January 6, 2003

Date of Patent: May 24, 2005

Assignee: Intel Corporation

Inventor: Howard Bubb
Reduced complexity voice activity detector

Patent number: 6876965

Abstract: A voice activity detector is disclosed for use with a radio transmitter to continuously sense the presence of speech in an audio signal. Initially, the audio signal is processed to produce a train of signal samples. Signal peaks are identified therefrom, which are used to compute respective values for a succession of quasi-pitch periods associated with the signal sample train. The quasi-pitch period values are then selectively compared with one another, in order to determine the presence or absence of a speech component.

Type: Grant

Filed: February 28, 2001

Date of Patent: April 5, 2005

Assignee: Telefonaktiebolaget LM Ericsson (publ)

Inventors: Fisseha Mekuria, Joakim Persson
Multiband phase-vocoder for the modification of audio or speech signals

Patent number: 6868377

Abstract: A method and apparatus to inexpensively and efficiently process audio and speech signals. A method for processing a signal having at least one region of interest is provided. The method begins by dividing the signal into a plurality of sub-band signals, wherein a selected sub-band signal includes the region of interest. The selected sub-band is processed by a phase vocoder to produce a vocoder output signal. Next, at least a portion of the subbands are time-aligned with the vocoder output signal. Finally, the aligned sub-band signals and the vocoder output signal are combined to form an output signal.

Type: Grant

Filed: November 23, 1999

Date of Patent: March 15, 2005

Assignee: Creative Technology Ltd.

Inventor: Jean Laroche
System for measuring velar function during speech

Patent number: 6850882

Abstract: A method of and device for the diagnosis and treatment of speech dynamically measures the functioning of the velum in the control of nasality during speech. Various components of oral and nasal airflow are separated and selectively analyzed including (i) the fundamental frequency component of each airflow during voiced speech, (ii) a plurality of voice components that cover a frequency range encompassing at least the lowest vocal tract resonance (the first formant), and (iii) the subsonic and infrasonic components of at least the nasal airflow. By comparing the nasal and oral airflow components at the voice fundamental frequency, a nasalization measure for voiced speech sounds is formed which emulates methods that compare low frequency nasal and oral airflow during voiced speech, while eliminating or greatly reducing the problems associated with comparing these low frequency airflows, and which improves upon previous methods based on measuring and comparing nasal and oral radiated sound pressure.

Type: Grant

Filed: October 23, 2000

Date of Patent: February 1, 2005

Inventor: Martin Rothenberg

prev … 5 6 7 8 9 10 11 12 next