For Storage Or Transmission Patents (Class 704/201)

Neural network (Class 704/202)

Transformation (Class 704/203)

Orthogonal functions (Class 704/204)

Frequency (Class 704/205)

Specialized information (Class 704/206)

Time (Class 704/211)

Linear prediction (Class 704/219)

Analysis by synthesis (Class 704/220)

Pattern matching vocoders (Class 704/221)

Normalizing (Class 704/224)

Gain control (Class 704/225)

Noise (Class 704/226)

Adaptive bit allocation (Class 704/229)

Quantization (Class 704/230)

Distributed voice browser

Patent number: 8000970

Abstract: The present invention can include a method of call processing using a distributed voice browser including allocating a plurality of service processors configured to interpret parsed voice markup language data and allocating a plurality of voice markup language parsers configured to retrieve and parse voice markup language data representing a telephony service. The plurality of service processors and the plurality of markup language parsers can be registered with one or more session managers. Accordingly, components of received telephony service requests can be distributed to the voice markup language parsers and the parsed voice markup language data can be distributed to the service processors.

Type: Grant

Filed: June 14, 2002

Date of Patent: August 16, 2011

Assignee: Nuance Communications, Inc.

Inventors: Thomas E. Creamer, Victor S. Moore, Glen R. Walters, Scott L. Winters
Speech recognition system and method

Patent number: 8000974

Abstract: According to the present invention, a method for integrating processes with a multi-faceted human centered interface is provided. The interface is facilitated to implement a hands free, voice driven environment to control processes and applications. A natural language model is used to parse voice initiated commands and data, and to route those voice initiated inputs to the required applications or processes. The use of an intelligent context based parser allows the system to intelligently determine what processes are required to complete a task which is initiated using natural language. A single window environment provides an interface which is comfortable to the user by preventing the occurrence of distracting windows from appearing. The single window has a plurality of facets which allow distinct viewing areas. Each facet has an independent process routing its outputs thereto. As other processes are activated, each facet can reshape itself to bring a new process into one of the viewing areas.

Type: Grant

Filed: October 18, 2010

Date of Patent: August 16, 2011

Assignee: Nuance Communications, Inc.

Inventors: Richard Grant, Pedro E. McGregor
Apparatus and method of encoding and decoding audio signal

Patent number: 7996216

Abstract: In one embodiment, at least first and second channels in a frame of the audio signal are independently subdivided into blocks if the first and second channels are not correlated with each other. At least two of the blocks have different block lengths. Furthermore, the first and second channels are correspondingly subdivided into blocks such that the lengths of the blocks into which the second channel is subdivided correspond to the lengths of the blocks into which the first channel is subdivided if the first and second channels are correlated with each other. At least two of the blocks have different block lengths.

Type: Grant

Filed: July 7, 2006

Date of Patent: August 9, 2011

Assignee: LG Electronics Inc.

Inventor: Tilman Liebchen
SYSTEMS AND METHODS FOR SPEECH EXTRACTION

Publication number: 20110191102

Abstract: In some embodiments, a processor-readable medium stores code representing instructions to cause a processor to receive an input signal having a first component and a second component. An estimate of the first component of the input signal is calculated based on an estimate of a pitch of the first component of the input signal. An estimate of the input signal is calculated based on the estimate of the first component of the input signal and an estimate of the second component of the input signal. The estimate of the first component of the input signal is modified based on a scaling function to produce a reconstructed first component of the input signal. The scaling function is a function of at least one of the input signal, the estimate of the first component of the input signal, the estimate of the second component of the input signal, or a residual signal.

Type: Application

Filed: January 31, 2011

Publication date: August 4, 2011

Applicant: UNIVERSITY OF MARYLAND, COLLEGE PARK

Inventors: Carol Espy-Wilson, Srikanth Vishnubhotla
Adaptive grouping of parameters for enhanced coding efficiency

Patent number: 7991610

Abstract: The present invention is based on the finding that parameters including a first set of parameters of a representation of a first portion of an original signal and including a second set of parameters of a representation of a second portion of the original signal can be efficiently encoded, when the parameters are arranged in a first sequence of tuples and in a second sequence of tuples, wherein the first sequence of tuples comprises tuples of parameters having two parameters from a single portion of the original signal and wherein the second sequence of tuples comprises tuples of parameters having one parameter from the first portion and one parameter from the second portion of the original signal. An efficient encoding can be achieved using a bit estimator to estimate the number of necessary bits to encode the first and the second sequence of tuples, wherein only the sequence of tuples is encoded, that results in the lower number of bits.

Type: Grant

Filed: October 5, 2005

Date of Patent: August 2, 2011

Assignee: Fraunhofer-Gesellschaft zur Foerderung der Angewandten Forschung e.V.

Inventors: Ralph Sperschneider, Jürgen Herre, Karsten Linzmeier, Johannes Hilpert
Microphone apparatus

Patent number: 7991166

Abstract: A microphone apparatus for processing and outputting an output signal of a microphone array including at least nine microphones includes a directivity function processing circuit that converts the output signal of the microphone array into a unidirectional signal and that outputs the unidirectional signal. The directivity function processing circuit expands a directivity function whose variable is an incident angle of an acoustic wave into a Fourier series up to at least third order. The variable in the expanded expression is produced from output signals of the microphones forming the microphone array.

Type: Grant

Filed: February 14, 2006

Date of Patent: August 2, 2011

Assignee: Sony Corporation

Inventors: Nobuyuki Kihara, Yoshikazu Takahashi, Yasuhiko Kato
Audio compression and decompression using integer-reversible modulated lapped transforms

Patent number: 7991622

Abstract: A “STAC Codec” provides lossless audio compression and decompression by processing an audio signal using integer-reversible modulated lapped transforms (MLT) to produce transform coefficients. Transform coefficients are then encoded using a backward-adaptive run-length Golomb-Rice (RLGR) encoder to produce losslessly compressed audio signals. In additional embodiments, further compression gains are achieved via an inter-block spectral estimation and data sorting strategy. Further, compression in the transform domain allows the bitstream to be partially decoded, using the corresponding RLGR decoder, to reconstruct the frequency-domain coefficients. These frequency-domain coefficients are then directly used to speed up various transform-domain based applications such as transcoding media to lossy or other formats, search, identification, visualization, watermarking, etc.

Type: Grant

Filed: March 20, 2007

Date of Patent: August 2, 2011

Assignee: Microsoft Corporation

Inventor: Henrique S. Malvar
MULTI-DIMENSIONAL DISAMBIGUATION OF VOICE COMMANDS

Publication number: 20110184730

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for processing voice commands. In one aspect, a method includes receiving an audio signal at a server, performing, by the server, speech recognition on the audio signal to identify one or more candidate terms that match one or more portions of the audio signal, identifying one or more possible intended actions for each candidate term, providing information for display on a client device, the information specifying the candidate terms and the actions for each candidate term, receiving from the client device an indication of an action selected by a user, where the action was selected from among the actions included in the provided information, and invoking the action selected by the user.

Type: Application

Filed: January 22, 2010

Publication date: July 28, 2011

Applicant: GOOGLE INC.

Inventors: Michael J. LeBeau, William J. Byrne, Nicholas Jitkoff, Alexander H. Gruenstein
VOICE DIALOG SYSTEM WITH REJECT AVOIDANCE PROCESS

Publication number: 20110178797

Abstract: The invention relates to a process for operating a voice dialog system and a voice dialog system which can be controlled over a telecommunications link by a communications terminal, a speech element transmitted by the communications terminal being received by a receiving unit of the voice dialog system and being analyzed for statement content in a processing unit, the speech element being filed in a memory assigned to the processing unit and after the telecommunications link is broken being analyzed by the processing unit.

Type: Application

Filed: August 6, 2009

Publication date: July 21, 2011

Inventors: Guntbert Markefka, Klaus Dieter Liedtke
Single channel EVRCx, ISLP and G.711 transcoding in packet networks

Publication number: 20110172993

Abstract: An apparatus in one example has: a receiver configured to receive an input signal in a first encoding format, the input signal having an input payload; and a transcoder operatively coupled to the receiver, the transcoder structured to transcode in a single channel the first encoding format to a second encoding format, the transcoder configured to generate an output signal in the second encoding format based on the input signal, the output signal having an output payload; and wherein the transcoder is configured to switch between providing encrypted data in the output payload and non-encrypted data in the output payload.

Type: Application

Filed: January 11, 2010

Publication date: July 14, 2011

Inventor: Alan H. Matten
METHOD FOR EMOTION COMMUNICATION BETWEEN EMOTION SIGNAL SENSING DEVICE AND EMOTION SERVICE PROVIDING DEVICE

Publication number: 20110172992

Abstract: Provided are a method for emotion communication to share a user's emotions between an emotion signal sensing device and an emotion service providing device.

Type: Application

Filed: December 29, 2010

Publication date: July 14, 2011

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventors: Hyun Soon SHIN, Sung Won Lee, Choong Seon Hong
MICROPHONE UNIT, CLOSE-TALKING TYPE SPEECH INPUT DEVICE, INFORMATION PROCESSING SYSTEM, AND METHOD FOR MANUFACTURING MICROPHONE UNIT

Publication number: 20110170726

Abstract: A microphone unit 1 of the present invention includes a case 10 having an internal space 100, a partition member 20 which is provided in the case, and at least partially composed of a vibrating membrane 30, that splits the internal space into a first space 102 and a second space 104, and an electrical signal output circuit 40 that outputs an electrical signal on the basis of vibration of the vibrating membrane. A first through hole 12 through which the first space 102 and an external space of the case are communicated with each other, and a second through hole 14 through which the second space 104 and the external space of the case are communicated with each other are formed in the case 10. In accordance with the present invention, it is possible to provide a high-quality microphone unit whose outer shape is small and which is capable of performing thorough noise cancellation.

Type: Application

Filed: March 27, 2009

Publication date: July 14, 2011

Applicant: FUNAI ELECTRIC CO., LTD.

Inventors: Rikuo Takano, Kiyoshi Sugiyama, Toshimi Fukuoka, Masatoshi Ono, Ryusuke Horibe, Fuminori Tanaka, Hideki Chouji, Takeshi Inoda
Universal container for audio data

Patent number: 7979269

Abstract: Storing audio data encoded in any of a plurality of different audio encoding formats is enabled by parametrically defining the underlying format in which the audio data is encoded, in audio format and packet table chunks. A flag can be used to manage storage of the size of the audio data portion of the file, such that premature termination of an audio recording session does not result in an unreadable corrupted file. This capability can be enabled by initially setting the flag to a value that does not correspond to a valid audio data size and that indicates that the last chunk in the file contains the audio data. State information for the audio data, to effectively denote a version of the file, and a dependency indicator for dependent metadata, may be maintained, where the dependency indicator indicates the state of the audio data on which the metadata is dependent.

Type: Grant

Filed: October 6, 2009

Date of Patent: July 12, 2011

Assignee: Apple Inc.

Inventors: William G. Stewart, James E. McCartney, Douglas S. Wyatt
AUTOMATIC VIDEO STREAM SELECTION

Publication number: 20110164105

Abstract: A handheld communication device is used to capture video streams and generate a multiplexed video stream. The handheld communication device has at least two cameras facing in two opposite directions. The handheld communication device receives a first video stream and a second video stream simultaneously from the two cameras. The handheld communication device detects a speech activity of a person captured in the video streams. The speech activity may be detected from direction of sound or lip movement of the person. Based on the detection, the handheld communication device automatically switches between the first video stream and the second video stream to generate a multiplexed video stream. The multiplexed video stream interleaves segments of the first video stream and segments of the second video stream. Other embodiments are also described and claimed.

Type: Application

Filed: January 6, 2010

Publication date: July 7, 2011

Applicant: Apple Inc.

Inventors: Jae Han Lee, E-Cheng Chang
Method and apparatus for encoding/decoding MPEG-4 BSAC audio bitstream having ancillary information

Patent number: 7974840

Abstract: A method of and an apparatus for encoding/decoding an MPEG-4 bit sliced arithmetic coding (BSAC) audio bitstream having ancillary information. A time domain audio signal is converted to a frequency domain audio signal and quantized. A number of data bits is counted and a number of available bits per layer is obtained. The number of available bits per layer is modified considering the size of ancillary information. Actual audio data is encoded in units of layers and ancillary information is embedded in the encoded bitstream. A header is decoded and a layer structure of an audio bitstream is calculated to determine the size of the ancillary information as a difference between a size of data up to a top layer and a size of a frame. The ancillary information is extracted to improve meta data and sound quality of audio contents.

Type: Grant

Filed: November 24, 2004

Date of Patent: July 5, 2011

Assignee: Samsung Electronics Co., Ltd.

Inventors: Junghoe Kim, Shihwa Lee, Sangwook Kim, Eunmi Oh, Dohyung Kim
REMOTE CONFERENCING CENTER

Publication number: 20110161074

Abstract: Certain embodiments disclosed herein relate to systems and methods for recording audio and video. In particular, in one embodiment, a method of recording audio signals is provided. The method includes recording audio signals with a plurality of distributed audio transducers to create multiple recordings of the audio signals and providing each of the multiple recordings of the audio signals to a computing device. The computing device combines each of the multiple recordings into a master recording and determines a source for each audio signal in the master recording. Additionally, the computing device stores each audio signal in separate audio files according to the determined source of each audio signal.

Type: Application

Filed: December 29, 2009

Publication date: June 30, 2011

Applicant: Apple Inc.

Inventors: Aleksandar Pance, Nicholas Vincent King
REAL-TIME VOICE RECOGNITION ON A HANDHELD DEVICE

Publication number: 20110161075

Abstract: A method and apparatus for implementation of real-time speech recognition using a handheld computing apparatus are provided. The handheld computing apparatus receives an audio signal, such as a user's voice. The handheld computing apparatus ultimately transmits the voice data to a remote or distal computing device with greater processing power and operating a speech recognition software application. The speech recognition software application processes the signal and outputs a set of instructions for implementation either by the computing device or the handheld apparatus. The instructions can include a variety of items including instructing the presentation of a textual representation of dictation, or a function or command to be executed by the handheld device (such as linking to a website, opening a file, cutting, pasting, saving, or other file menu type functionalities), or by the computing device itself.

Type: Application

Filed: December 1, 2010

Publication date: June 30, 2011

Inventor: Eric Hon-Anderson
Method and apparatus for managing speech decoders in a communication device

Patent number: 7970603

Abstract: A method and apparatus that manages speech decoders in a communication device may include detecting a change in transmission rate from a higher rate to a lower rate, decoding and shifting a first, second and third received first decoder set of frame parameters, generating a first decoder output audio frame from the previously shifted frame parameters, generating a first, second and third second decoder audio fill frame, the second decoder being a higher rate decoder than first decoder, outputting a first and second second decoder audio fill frame, combining the first decoder audio frame and the third second decoder audio fill frame with overlapping triangular windows, and outputting combined first decoder and second decoder frames to an audio buffer for subsequent transmission to a user of the communication device. In an alternative embodiment, another method may include detecting and processing a change in transmission rate from a lower rate to a higher rate.

Type: Grant

Filed: November 15, 2007

Date of Patent: June 28, 2011

Assignee: Lockheed Martin Corporation

Inventors: Richard L. Zinser, Jr., Martin W. Egan
Data reproduction device

Patent number: 7970602

Abstract: A data reproduction device is provided for achieving seamless reproduction of a stream where a validity of a bandwidth extension function is switched in the stream. The data reproduction device includes an input frequency obtainment unit analyzing header information Hdr and obtaining an input frequency FSin, which is the frequency of basic data, an output frequency determination unit performing predetermined processing based on the input frequency FSin and determining an output frequency FSout, which is the sampling frequency of a decoded frame Fdata, and a decoding unit (2003) which, if the SBR function is valid in a frame to be decoded, decodes sample data at the input frequency FSin and extends the bandwidth of the sampling frequency up to the output frequency FSout, while if the SBR function is not valid in the frame, upsamples the decoding result obtained at the input frequency FSin to the output frequency FSout.

Type: Grant

Filed: February 24, 2006

Date of Patent: June 28, 2011

Assignee: Panasonic Corporation

Inventors: Tadamasa Toma, Yoshinori Matsui, Shinya Kadono
SYSTEM AND METHOD FOR UPDATING INFORMATION IN ELECTRONIC CALENDARS

Publication number: 20110144980

Abstract: Systems and methods for updating electronic calendar information. Speech is received from a user at a vehicle telematics unit (VTU), wherein the speech is representative of information related to a particular vehicle trip. The received speech is recorded in the VTU as a voice memo, and data associated with the voice memo is communicated from the VTU to a computer running a calendaring application. The data is associated with a field of the calendaring application, and stored in association with the calendaring application field.

Type: Application

Filed: June 10, 2010

Publication date: June 16, 2011

Applicant: GENERAL MOTORS LLC

Inventor: Jeffrey P. Rysenga
Apparatus and method of encoding and decoding audio signal

Patent number: 7962332

Abstract: In one embodiment, the method includes receiving an audio data frame having at least first and second channels. The first and second channels are independently subdivided into blocks if the first and second channels are not correlated with each other. The first and second channels are decoded, and the subdivided blocks of the first and second channels are not interleaved if the first and second channels are independently subdivided.

Type: Grant

Filed: September 18, 2008

Date of Patent: June 14, 2011

Assignee: LG Electronics Inc.

Inventor: Tilman Liebchen
Apparatus and method for automatic dissection of segmented audio signals

Patent number: 7962330

Abstract: An apparatus for automatic dissection of segmented audio signals, wherein at least one information signal for identifying programs included in said audio signals and for identifying contents included in said programs. Content detection device detects programs and contents belonging to the respective programs in the information signal. Program weighting device weights each program includes in the information signal based on the contents of the respective program detected by the content detection device. Program ranking device indentifies programmers of the same category and ranking said programs based on a weighting result for each program provided by the program weighting device.

Type: Grant

Filed: November 10, 2004

Date of Patent: June 14, 2011

Assignee: Sony Deutschland GmbH

Inventors: Silke Goronzy, Thomas Kemp, Ralf Kompe, Yin Hay Lam, Krzysztof Marasek, Raquel Tato
METHOD AND SYSTEM FOR COLLECTING VOICE AND IMAGE DATA ON A REMOTE DEVICE AND COVERTING THE COMBINED DATA

Publication number: 20110135144

Abstract: A system and method to capture and analyze image data of an object, the system including: an end unit to capture the image data, the end unit including: an image capturing unit to capture the image data of the object, and a transmitting unit to transmit the captured image data; and a data analysis server to analyze the image data, the data analysis server including: a receiving unit to receive the transmitted image data, and a control unit to analyze the received image data by performing one or more machine vision functions on the received image data, and to control a transmitting of a result of the one or more machine vision functions to the end unit and/or a performing of a function according to the result of the one or more machine vision functions. Accordingly, complex image data of an object can be collected by a remote image capturing device in order to identify the object by analyzing the image data in a separate server.

Type: Application

Filed: June 29, 2010

Publication date: June 9, 2011

Applicant: Hand Held Products, Inc.

Inventors: Richard Loy Franklin, JR., John Pettinelli, Sven Powilleit
Voice transcoder

Patent number: 7957963

Abstract: First encoded voice bits are transcoded into second encoded voice bits by dividing the first encoded voice bits into one or more received frames, with each received frame containing multiple ones of the first encoded voice bits. First parameter bits for at least one of the received frames are generated by applying error control decoding to one or more of the encoded voice bits contained in the received frame, speech parameters are computed from the first parameter bits, and the speech parameters are quantized to produce second parameter bits. Finally, a transmission frame is formed by applying error control encoding to one or more of the second parameter bits, and the transmission frame is included in the second encoded voice bits.

Type: Grant

Filed: December 14, 2009

Date of Patent: June 7, 2011

Assignee: Digital Voice Systems, Inc.

Inventor: John C. Hardwick
ADAPTIVE DATA TRANSMISSION FOR A DIGITAL IN-BAND MODEM OPERATING OVER A VOICE CHANNEL

Publication number: 20110125488

Abstract: In one example, a mobile device encodes a digital bitstream using a particular set of modulation parameters to generate an audio signal that has different audio tones selected to pass through a vocoder of the mobile device. The particular set of modulation parameters is optimized for a subset of a plurality of vocoding modes without a priori knowledge of which one of the vocoding modes is currently operated by the vocoder. The mobile device conducts transmissions over the wireless telecommunications network through the vocoder using the particular set of modulation parameters, and monitors these transmissions for errors. If the errors reach a threshold, then the vocoder may be using one of the vocoding modes that are not included in the subset for which the particular set of modulation parameters is optimized, and accordingly, the modulation device switches from the particular set of modulation parameters to a different set of modulation parameters.

Type: Application

Filed: October 13, 2010

Publication date: May 26, 2011

Applicant: AIRBIQUITY INC.

Inventor: Kiley Birmingham
APPARATUS FOR ENCODING AND DECODING OF INTEGRATED SPEECH AND AUDIO

Publication number: 20110119054

Abstract: Provided is an apparatus for integrally encoding and decoding a speech signal and an audio signal. An encoding apparatus for integrally encoding a speech signal and an audio signal, may include: a module selection unit to analyze a characteristic of an input signal and to select a first encoding module for encoding a first frame of the input signal; a speech encoding unit to encode the input signal according to a selection of the module selection unit and to generate a speech bitstream; an audio encoding unit to encode the input signal according to the selection of the module selection unit and to generate an audio bitstream; and a bitstream generation unit to generate an output bitstream from the speech encoding unit or the audio encoding unit according to the selection of the module selection unit.

Type: Application

Filed: July 14, 2009

Publication date: May 19, 2011

Inventors: Tae Jin Lee, Seung Kwon Beack, Minje Kim, Dae Young Jang, Kyeongok Kang, Jin Woo Hong, Hochong Park, Young-Cheol Park
System And Method For Leaving And Transmitting Speech Messages

Publication number: 20110119053

Abstract: A system for leaving and transmitting speech messages automatically analyzes input speech of at least a reminder, fetches a plurality of tag informations, and transmits speech message to at least a message receiver, according to the transmit criterions of the reminder. A command or message parser parses the tag informations at least including at least a reminder ID, at least a transmitted command and at least a speech message. The tag informations are sent to a message composer for being synthesized into a transmitted message. A transmitting controller controls a device switch according to the reminder ID and the transmitted command, to allow the transmitted message send to the message receiver via a transmitting device.

Type: Application

Filed: March 18, 2010

Publication date: May 19, 2011

Inventors: Chih-Chung Kuo, Shih-Chieh Chien, Chung-Jen Chiu, Hsin-Chang Chang
MULTI-STREAM VOICE TRANSMISSION SYSTEM AND METHOD, AND PLAYOUT SCHEDULING MODULE

Publication number: 20110119565

Abstract: A multi-stream voice transmission system includes a transmitting terminal and a receiving terminal for transmitting and receiving first and second packet streams via first and second network channels. The receiving terminal includes a playout buffer for buffering the first and second packet streams, generates an output voice signal from the buffered packets according to a playout schedule adjusting coefficient ?, generates packet loss parameters and packet delay parameters corresponding to loss and delay experienced by the first and second packet streams, and provides the parameters to the transmitting terminal.

Type: Application

Filed: April 7, 2010

Publication date: May 19, 2011

Applicant: GEMTEK TECHNOLOGY CO., LTD.

Inventors: Yung-Le Chang, Chun-Feng Wu, Wen-Whei Chang
Sound coding device and sound coding method

Patent number: 7945447

Abstract: A sound coding device having a monaural/stereo scalable structure and capable of efficiently coding stereo sound. even when the correlation between the channel signals of a stereo signal is small. In a core layer coding block of this device, a monaural signal generating section generates a monaural signal from first and second-channel sound signal, a monaural signal coding section codes the monaural signal, and a monaural signal decoding section greatest a monaural decoded signal from monaural signal coded data and outputs it to an expansion layer coding block. In the expansion layer coding block, a first-channel prediction signal synthesizing section synthesizes a first-channel prediction signal from the monaural decoded signal and a first-channel prediction filter digitizing parameter and a second-channel prediction signal synthesizing section synthesizes a second-channel prediction signal from the monaural decoded signal and second-channel prediction filter digitizing parameter.

Type: Grant

Filed: December 26, 2005

Date of Patent: May 17, 2011

Assignee: Panasonic Corporation

Inventors: Koji Yoshida, Michiyo Goto
Cue-based audio coding/decoding

Patent number: 7941320

Abstract: Generic and specific C-to-E binaural cue coding (BCC) schemes are described, including those in which one or more of the input channels are transmitted as unmodified channels that are not downmixed at the BCC encoder and not upmixed at the BCC decoder. The specific BCC schemes described include 5-to-2, 6-to-5, 7-to-5, 6.1-to-5.1, 7.1-to-5.1, and 6.2-to-5.1, where “0.1” indicates a single low-frequency effects (LFE) channel and “0.2” indicates two LFE channels.

Type: Grant

Filed: August 27, 2009

Date of Patent: May 10, 2011

Assignee: Agere Systems, Inc.

Inventors: Frank Baumgarte, Jiashu Chen, Christof Faller
Audio decoding using variable-length codebook application ranges

Patent number: 7937271

Abstract: Provided are, among other things, systems, methods and techniques for decoding an audio signal from a frame-based bit stream. Each frame includes processing information pertaining to the frame and entropy-encoded quantization indexes representing audio data within the frame. The processing information includes: (i) code book indexes, (ii) code book application information specifying ranges of entropy-encoded quantization indexes to which the code books are to be applied, and (iii) window information. The entropy-encoded quantization indexes are decoded by applying the identified code books to the corresponding ranges of entropy-encoded quantization indexes. Subband samples are then generated by dequantizing the decoded quantization indexes, and a sequence of different window functions that were applied within a single frame of the audio data is identified based on the window information.

Type: Grant

Filed: March 21, 2007

Date of Patent: May 3, 2011

Assignee: Digital Rise Technology Co., Ltd.

Inventor: Yuli You
Scalable encoding/decoding of audio signals

Patent number: 7937272

Abstract: An audio signal is encoded by a first waveform encoder (103) to generate a first waveform based bit-stream component. A second encoder (105) encodes the audio signal to generate a second bit-stream component comprising first enhancement data and a third encoder (107) encodes the audio signal to generate a third bit-stream component comprising second enhancement data for the first waveform based bit-stream component. The first and second bit-stream components correspond to a first representation of the audio signal and the first and third bit-stream components correspond to a second representation of the audio signal. A scalable audio bit-stream is generated by a bit-stream generator (109). The different representations may be selected between by a decoder thereby allowing a flexible and scalable bit-stream to be communicated. The second encoder (105) may specifically be a waveform encoder and the third encoder (107) may specifically be a parametric encoder.

Type: Grant

Filed: January 6, 2006

Date of Patent: May 3, 2011

Assignee: Koninklijke Philips Electronics N.V.

Inventors: Arnoldus Werner Johannes Oomen, Leon Maria Van De Kerkhof
NETWORK/PEER ASSISTED SPEECH CODING

Publication number: 20110099009

Abstract: A communications network is used to transfer user attribute information about participants in a communication session to their respective communication terminals for storage and use thereon to configure a speech codec to operate in a speaker-dependent manner, thereby improving speech coding efficiency. In a network-assisted model, the user attribute information is stored on the communications network and selectively transmitted to the communication terminals while in a peer-assisted model, the user attribute information is derived by and transferred between communication terminals.

Type: Application

Filed: October 11, 2010

Publication date: April 28, 2011

Applicant: BROADCOM CORPORATION

Inventors: Robert W. Zopf, Kelly Hale
Serial transmission system, transmission device, and serial transmission method

Patent number: 7933778

Abstract: A serial transmission system has a transmission signal generator and a transmission signal receiver. The transmission signal generator generates a digital audio signal of multi-channel based on information of a sampling frequency, and serially transmits the information of the sampling frequency together with the digital audio signal. The transmission signal receiver serially receives the information of the sampling frequency and the digital audio signal from the transmission signal generator, and detects change in a transmission clock based on the information of the sampling frequency.

Type: Grant

Filed: June 14, 2007

Date of Patent: April 26, 2011

Assignee: Renesas Electronics Corporation

Inventor: Katsumasa Tanaka
SIGNAL CLASSIFYING METHOD AND APPARATUS

Publication number: 20110093260

Abstract: A signal classifying method and apparatus are disclosed. The signal classifying method includes: obtaining a spectrum fluctuation parameter of a current signal frame determined as a foreground frame, and buffering the spectrum fluctuation parameter; obtaining a spectrum fluctuation variance of the current signal frame according to spectrum fluctuation parameters of all buffered signal frames, and buffering the spectrum fluctuation variance; and calculating a ratio of signal frames whose spectrum fluctuation variance is above or equal to a first threshold to all the buffered signal frames, and determining the current signal frame as a speech frame if the ratio is above or equal to a second threshold or determining the current signal frame as a music frame if the ratio is below the second threshold.

Type: Application

Filed: December 28, 2010

Publication date: April 21, 2011

Inventors: Yuanyuan Liu, Zho Wang, Eyal Shlomot
METHOD AND SYSTEM FOR MEMORY USAGE IN REAL-TIME AUDIO SYSTEMS

Publication number: 20110087487

Abstract: System and method for encoding, transmitting and decoding audio data. Audio bit steam syntax is re-organized to allow system optimizations that work well with memory latency and memory burst operations. Multiple small entropy coding tables are stored in RAM and loaded to on-chip memory as needed. Audio prediction is pipelined in the bitstream syntax. Intra frames, independent of other frames in the bitstream, are included in the bitstream for error recovery and channel change. New algorithms are implemented in legacy syntax by including the new information in the user data space of the audio frame. The new decoder can use projection to determine where the new information is and read ahead in the stream. Audio prediction from the immediately previous frame is restricted. Audio prediction is performed across channels within a single audio frame. A variable re-order function comprises storing channels of data to DRAM in the order they are decoded and reading them out in presentation order.

Type: Application

Filed: December 16, 2010

Publication date: April 14, 2011

Inventor: Darren Neuman
Method and apparatus for processing a speech signal

Patent number: 7925499

Abstract: A method and apparatus for generating a control signal for processing a speech signal comprising the steps of: adjusting the signal relative to a threshold level; and responsive to detection of a falling edge of the signal, holding the signal level for a holding period. The technique further comprises ‘slowing’ each rising edge of the signal. The technique further comprises attenuating each falling edge of the signal. The steps are carried out on a signal representing the envelope of the speech signal.

Type: Grant

Filed: July 13, 2007

Date of Patent: April 12, 2011

Assignee: Avaya Inc.

Inventor: Kenneth Lee Thomas
SOUND MONITORING SYSTEM AND SPEECH COLLECTION SYSTEM

Publication number: 20110082690

Abstract: Monitoring accuracy degrades due to a noise in an environment where there are many sound sources other than those to be monitored. Easy initialization is required for an environment where many apparatuses operate. A sound monitoring system includes a microphone array having multiple microphones and a location-based abnormal sound monitoring section as a processing section. The location-based abnormal sound monitoring section is supplied with an input signal from the microphone array via a waveform acquisition section and a network. Using the input signal, the location-based abnormal sound monitoring section detects a temporal change in a sound source direction histogram. Based on a detected change result, the location-based abnormal sound monitoring section checks for abnormality in a sound field and outputs a monitoring result. The processing section searches for a microphone array near the sound source to be monitored.

Type: Application

Filed: September 29, 2010

Publication date: April 7, 2011

Inventors: Masahito Togami, Yohei Kawaguchi
VAMOS - DARP receiver switching for mobile receivers

Publication number: 20110082689

Abstract: Embodiments of the invention include apparatuses, systems, computer readable media, and methods for processing speech signals in a manner that enhances capacity, efficiency and hardware utilization of a communications network. A method, according to one embodiment, includes receiving speech signals, determining a subchannel power imbalance ratio of at least two subchannels, and selecting a receiver architecture for processing the speech signals in accordance with the determined subchannel power imbalance ratio.

Type: Application

Filed: October 7, 2009

Publication date: April 7, 2011

Inventors: Carsten Juncker, Morten With Pedersen
Method and device for updating status of synthesis filters

Patent number: 7921009

Abstract: A method and device for updating statuses of synthesis filters are provided. The method includes: exciting a synthesis filter corresponding to a first encoding rate by using an excitation signal of the first encoding rate, outputting reconstructed signal information, and updating status information of the synthesis filter and a synthesis filter corresponding to a second encoding rate. In the present disclosure, the status of the synthesis filter corresponding to the current rate and the statuses of the synthesis filters at other rates are updated. Thus, synchronization between the statuses of the synthesis filters corresponding to different rates at the encoding terminal may be realized, thereby facilitating the consistency of the reconstructed signals of the encoding and decoding terminals when the encoding rate is switched, and improving the quality of the reconstructed signal of the decoding terminal.

Type: Grant

Filed: September 16, 2010

Date of Patent: April 5, 2011

Assignee: Huawei Technologies Co., Ltd.

Inventor: Jinliang Dai
Scalable audio coding

Patent number: 7921007

Abstract: The invention relates to an audio encoder and decoder and methods for audio encoding and decoding. In a preferred encoder embodiment an audio signal is encoded by deterministic encoder means to form a first encoded signal part. A spectrum of the audio signal is determined and represented by an excitation pattern, i.e. spectral values corresponding to human auditory filters, as a second encoded signal part. A masking curve is also extracted based on the excitation pattern, thus improving encoding efficiency in terms of bit rate. In a preferred decoder the first encoded signal part is decoded by deterministic decoder means. A noise generator uses the decoded first signal part together with the second signal part, i.e. the excitation pattern for the original audio signal, to generate a noise signal. The noise signal is then added to the first decoded signal part to form an output audio signal. At the decoder side the masking curve is also extracted based on the second encoded signal part, i.e.

Type: Grant

Filed: July 25, 2005

Date of Patent: April 5, 2011

Assignee: Koninklijke Philips Electronics N.V.

Inventors: Steven Leonardus Josephus Dimphina Elisabeth Van de Par, Valery Stephanovich Kot, Nicolle Hanneke Van Schijndel
DATA REPRODUCTION METHOD AND DATA REPRODUCTION APPARATUS

Publication number: 20110077938

Abstract: A reproduction apparatus that reproduces compressed audio data recorded in a recording medium inserts dummy data between data to be concatenated and reproduces the data when performing a specific reproduction of the data obtained by concatenating data which are discontinuously read from the recording medium.

Type: Application

Filed: December 7, 2010

Publication date: March 31, 2011

Applicant: PANASONIC CORPORATION

Inventors: Atsutoshi Naraki, Yuuichirou Nakamura, Kaori Tajima, Satoshi Sasaki, Katsuhiro Okawara, Hiroshi Kamiki
User-defined marked locations for use in conjunction with a personal navigation device

Publication number: 20110077852

Abstract: A method of creating and storing a marked location in a portable electronic device includes receiving current position information of the portable electronic device, recording description data for a marked location when a user of the portable electronic device is located at the marked location, and storing the description data and position information corresponding to the marked location in a memory.

Type: Application

Filed: September 25, 2009

Publication date: March 31, 2011

Inventors: Mythreyi Ragavan, Takuya Otani
Receiving apparatus, sending apparatus and transmission system

Patent number: 7917237

Abstract: A sending apparatus includes a compressed music data outputting unit that outputs one of compressed music data including compressed basic music data in a first mode and compressed music data including the basic music data as well as high frequency information for extending high frequency of the basic music data in a second mode, and a formatter that transmits a compressed music data outputted by the compressed music data outputting unit first and then music information including a value indicating which mode is used for the compressed music data.

Type: Grant

Filed: June 16, 2004

Date of Patent: March 29, 2011

Assignee: Panasonic Corporation

Inventors: Akihisa Kawamura, Naoki Esima
Method and apparatus for determining a bit boundary of a repetition-coded signal

Patent number: 7917362

Abstract: A method for determining a bit boundary of a repetition-coded signal including bits each having a plurality of epochs includes (a) counting the epochs repeatedly from an initial number to a predetermined number in a predetermined time, (b) sensing sign changes in the epochs, (c) recording each sensed sign change with a weighting function to a corresponding counting number of the epoch, and (d) determining the bit boundary according to a result of step (c).

Type: Grant

Filed: April 19, 2006

Date of Patent: March 29, 2011

Assignee: MediaTek Inc.

Inventor: Jia-Horng Shieh
Methods and systems for creating and delivering group recordings

Patent number: 7903797

Abstract: Methods, systems and devices for creating special communications or recordings containing messages from family members and the like may be delivered to a recipient in the form of a keepsake type of device that can be used to play the communications when and as many times as desired.

Type: Grant

Filed: November 3, 2009

Date of Patent: March 8, 2011

Assignee: Hopechest Voices, LLC

Inventor: Hope Flammer
METHOD AND DEVICE OF BITRATE DISTRIBUTION/TRUNCATION FOR SCALABLE AUDIO CODING

Publication number: 20110046945

Abstract: Embodiments of the invention provides a method and device for assigning bitrates to a plurality of channels in a scalable audio encoding/truncation process. Different bitrates are assigned to different channels in the scalable audio encoding/truncation process.

Type: Application

Filed: January 31, 2008

Publication date: February 24, 2011

Applicant: AGENCY FOR SCIENCE, TECHNOLOGY AND RESEARCH

Inventors: Te Li, Susanto Rahardja, Haibin Huang
System and method for determining a common fundamental frequency of two harmonic signals via a distance comparison

Patent number: 7895033

Abstract: One embodiment of the present invention provides a method of determining an evidence value capturing whether two band-pass signals are harmonics of a common fundamental frequency. A further embodiment of the present invention evaluates the distance between significant points of a signal such as a sinusoidal signal. One embodiment of the present invention provides a method of determining whether two or more band-pass signals are harmonics of a fundamental frequency, comprising evaluating a first distance between a first set of two or more significant points of a first band-pass signal, evaluating a second distance between a second set of two or more significant points of a second band-pass signal, and comparing the first distance to the second distance to determine whether the first and second signals are harmonics of the fundamental frequency.

Type: Grant

Filed: May 31, 2005

Date of Patent: February 22, 2011

Assignee: Honda Research Institute Europe GmbH

Inventors: Frank Joublin, Martin Heckmann
Method and apparatus for performing packet loss or frame erasure concealment

Patent number: 7881925

Abstract: The invention concerns a method and apparatus for performing packet loss or Frame Erasure Concealment (FEC) for a speech coder that does not have a built-in or standard FEC process. A receiver with a decoder receives encoded frames of compressed speech information transmitted from an encoder. A lost frame detector at the receiver determines if an encoded frame has been lost or corrupted in transmission, or erased. If the encoded frame is not erased, the encoded frame is decoded by a decoder and a temporary memory is updated with the decoder's output. A predetermined delay period is applied and the audio frame is then output. If the lost frame detector determines that the encoded frame is erased, a FEC module applies a frame concealment process to the signal. The FEC processing produces natural sounding synthetic speech for the erased frames.

Type: Grant

Filed: March 22, 2006

Date of Patent: February 1, 2011

Inventor: David A. Kapilow
Method of controlling output of ultrasonic speaker, ultrasonic speaker system, and display device

Patent number: 7873174

Abstract: A method of controlling an output of an ultrasonic speaker that reproduces audible-frequency-band signal sounds by modulating carrier waves with audible-frequency-band signal waves output from a signal source and driving an ultrasonic transducer with the modulated waves includes: dividing the audible-frequency-band signal waves into a plurality of frequency bands; separately adjusting amplitudes of the signal waves and amplitudes of the modulated waves in the respective frequency bands; and driving a plurality of ultrasonic transducers provided corresponding to the respective frequency bands with the modulated waves generated corresponding to the respective frequency bands.

Type: Grant

Filed: February 1, 2007

Date of Patent: January 18, 2011

Assignee: Seiko Epson Corporation

Inventors: Hiroyuki Yoshino, Shinichi Miyazaki

prev … 13 14 15 16 17 18 19 20 21 … next