Patents Examined by Daniel A. Nolan

Alternate window compression/decompression method, apparatus, and system

Patent number: 6785644

Abstract: With respect to data having periodicity to be compressed, windows of the same size are set for every two sections according to an interval of peaks appearing substantially periodically and processing for sorting sample data alternately among the set windows of the same size is sequentially performed, whereby a frequency of data having periodicity is replaced with an approximately half frequency without damaging reproducibility to original data at all to make it possible to apply compression processing to data of the replaced low frequency. If this sorting processing is applied to compression processing having a characteristic that a compression ratio is not increased in a high-frequency region, it becomes possible to improve a compression ratio without damaging a quality of reproduced data by decompression at all.

Type: Grant

Filed: December 16, 2002

Date of Patent: August 31, 2004

Assignee: Yasue Sakai

Inventor: Yukio Koyanagi
Digital audio reproducing apparatus

Patent number: 6775654

Abstract: A digital audio reproducing apparatus including a receiver receiving modulated data, a demodulator demodulating the modulated data received by the receiver, an audio decoder decoding, in a unit of a frame, digital audio information contained in the modulated data demodulated by the demodulator, and an audibility corrector for effecting audibility correction on failing digital audio information contained in a frame that failed to be decoded, when the audio decoder fails to decode the digital audio information.

Type: Grant

Filed: August 31, 1999

Date of Patent: August 10, 2004

Assignees: Fujitsu Limited, FFC Limited

Inventors: Hideaki Yokoyama, Kazuhisa Matsushima, Hiroshi Okubo, Tadayoshi Katoh, Takashi Saito
Concealment of frame erasures for speech transmission and storage system and method

Patent number: 6775649

Abstract: A decoder for packetized speech with differential quantization of line spectral frequencies and fixed-codebook gain conceals erased frames with interpolation of future and past frames by reconstruct future frame predicted parameters from presumed interpolations of erased frame parameters.

Type: Grant

Filed: August 15, 2000

Date of Patent: August 10, 2004

Assignee: Texas Instruments Incorporated

Inventor: Juan-Carlos DeMartin
Application server configured for dynamically generating web pages for voice enabled web applications

Patent number: 6766298

Abstract: A unified web-based voice messaging system provides voice application control between a web browser and an application server via an hypertext transport protocol (HTTP) connection on an Internet Protocol (IP) network. The web browser receives an HTML page from the application server having an XML element that defines data for an audio operation to be performed by an executable audio resource. The application server executes the voice-enabled web application by runtime execution of extensible markup language (XML) documents that define the voice-enabled web application to be executed. The application server, in response to receiving a user request from a user, accesses a selected XML page that defines at least a part of the voice application to be executed for the user. The application server then parses the XML page, and executes the operation describer by the XML page.

Type: Grant

Filed: January 11, 2000

Date of Patent: July 20, 2004

Assignee: Cisco Technology, Inc.

Inventors: Lewis Dean Dodrill, Geetha Ravishankar, Satish Joshi, Keith M. Basil, Ryan Alan Danner, James Richard Grove, Jr., Steven J. Martin
System and method for concurrent presentation of multiple audio information sources

Patent number: 6757656

Abstract: A method for concurrent presentation of multiple audio information sources. In the method, audio information from at least two audio information sources is concurrently presented, and a user speech selection of one of the audio information sources is accepted. At least one of the audio information sources can then be reconfigured. The reconfiguration audibly distinguishes the user selected audio information source from other audio information sources.

Type: Grant

Filed: June 15, 2000

Date of Patent: June 29, 2004

Assignee: International Business Machines Corporation

Inventors: Qing Gong, James R. Lewis, Ronald E. Vanbuskirk, Huifang Wang
Codebook tables for multi-rate encoding and decoding with pre-gain and delayed-gain quantization tables

Patent number: 6757649

Abstract: A speech compression system capable of encoding a speech signal into a bitstream for subsequent decoding to generate synthesized speech is disclosed. The speech compression system optimizes the bandwidth consumed by the bitstream by balancing the desired average bit rate with the perceptual quality of the reconstructed speech. The speech compression system comprises a full-rate codec, a half-rate codec, a quarter-rate codec and an eighth-rate codec. The codecs are selectively activated based on a rate selection. In addition, the full and half-rate codecs are selectively activated based on a type classification. Each codec is selectively activated to encode and decode the speech signals at different bit rates emphasizing different aspects of the speech signal to enhance overall quality of the synthesized speech.

Type: Grant

Filed: April 8, 2003

Date of Patent: June 29, 2004

Assignee: Mindspeed Technologies Inc.

Inventors: Yang Gao, Adil Benyassine, Jes Thyssen, Eyal Shlomot, Huan-yu Su
Reassembling speech sentence fragments using associated phonetic property

Patent number: 6757653

Abstract: A method of composing messages for speech output and the improvement of the quality of reproduction of speech outputs. A series of original sentences for messages is segmented and stored as audio files with search criteria. The length, position, and transition values for the respective segments can be recorded and stored. A sentence to be reproduced is transmitted in a format corresponding to the format of the search criteria. It is determined whether the sentence to be reproduced can be fully reproduced by one segment or a succession of stored segments. The segments found in each case are examined using their entries as to how far the individual segments match as regards speech rhythm. The audio files of the segments in which the examination resulted in the pre-requisites for optimal maintaining of the natural speech rhythm are combined and output for reproduction.

Type: Grant

Filed: June 28, 2001

Date of Patent: June 29, 2004

Assignee: Nokia Mobile Phones, Ltd.

Inventors: Peter Buth, Simona Grothues, Amir Iman, Wolfgang Theimer
Information processing apparatus, information processing method and program storage medium

Patent number: 6757657

Abstract: An information processing apparatus including an image-sensing controller controlling image-sensing so as to take a picture upon detection of execution of a first operation, a word generator recognizing speech upon detection of execution of a second operation and generating a word or a phrase corresponding to the recognized voice, and a portion associating the word or a phrase with the picture. Accordingly a word, a generated phrase or the like can be easily associated with an image-sensed still picture (with ease).

Type: Grant

Filed: August 17, 2000

Date of Patent: June 29, 2004

Assignee: Sony Corporation

Inventors: Kiyonobu Kojima, Yasuhiko Kato, Shuji Yonekura, Satoshi Fujimura, Takashi Sasai, Naoki Fujisawa, Junji Ooi
Synthesis of speech from pitch prototype waveforms by time-synchronous waveform interpolation

Patent number: 6754630

Abstract: In a method of synthesizing voiced speech from pitch prototype waveforms by time-synchronous waveform interpolation (TSWI), one or more pitch prototypes is extracted from a speech signal or a residue signal. The extraction process is performed in such a way that the prototype has minimum energy at the boundary. Each prototype is circularly shifted so as to be time-synchronous with the original signal. A linear phase shift is applied to each extracted prototype relative to the previously extracted prototype so as to maximize the cross-correlation between successive extracted prototypes. A two-dimensional prototype-evolving surface is constructed by unsampling the prototypes to every sample point. The two-dimensional prototype-evolving surface is re-sampled to generate a one-dimensional, synthesized signal frame with sample points defined by piecewise continuous cubic phase contour functions computed from the pitch lags and the phase shifts added to the extracted prototypes.

Type: Grant

Filed: November 13, 1998

Date of Patent: June 22, 2004

Assignee: Qualcomm, Inc.

Inventors: Amitava Das, Eddie L. T. Choy
Detecting speech recognition errors in an embedded speech recognition system

Patent number: 6754627

Abstract: A method for processing a misrecognition error in an embedded speech recognition system during a speech recognition session can include the step of speech-to-text converting audio input in the embedded speech recognition system based on an active language model. The speech-to-text conversion can produce speech recognized text that can be presented through a user interface. A user-initiated misrecognition error notification can be detected. The audio input and a reference to the active language model can be provided to a speech recognition system training process associated with the embedded speech recognition system.

Type: Grant

Filed: March 1, 2001

Date of Patent: June 22, 2004

Assignee: International Business Machines Corporation

Inventor: Steven G. Woodward
Creating a hierarchical tree of language models for a dialog system based on prompt and dialog context

Patent number: 6754626

Abstract: The invention disclosed herein concerns a method of converting speech to text using a hierarchy of contextual models. The hierarchy of contextual models can be statistically smoothed into a language model. The method can include processing text with a plurality of contextual models. Each one of the plurality of contextual models can correspond to a node in a hierarchy of the plurality of contextual models. Also included can be identifying at least one of the contextual models relating to the text and processing subsequent user spoken utterances with the identified at least one contextual model.

Type: Grant

Filed: March 1, 2001

Date of Patent: June 22, 2004

Assignee: International Business Machines Corporation

Inventor: Mark E. Epstein
Codebook re-ordering to reduce undesired packet generation

Patent number: 6754624

Abstract: A method and apparatus for enhancing coding efficiency by reducing illegal or other undesirable packet generation while encoding a signal. The probability of generating illegal or other undesirable packets while encoding a signal is reduced by first analyzing a history of the frequency of codebook values selected while quantizing speech parameters. Codebook entries are then reordered so that the index/indices that create illegal or other undesirable packets contain the least frequently used entry/entries. Reordering multiple codebooks for various parameters further reduces the probability that an illegal or other undesirable packet will be created during signal encoding. The method and apparatus may be applied to reduce the probability of generating illegal null traffic channel data packets while encoding eighth rate speech.

Type: Grant

Filed: February 13, 2001

Date of Patent: June 22, 2004

Assignee: Qualcomm, Inc.

Inventors: Eddie-Lun Tik Choy, Arasanipalai K. Ananthapadmanabhan, Andrew P. DeJaco
Voice-actuated generation of documents containing photographic identification

Patent number: 6751589

Abstract: A preferred method for generating a document includes the steps of: providing an applicant with a visual representation, via a visual display device, of at least a portion of a document; prompting an applicant to provide first information corresponding to a first portion of the document; receiving the first information, as a first vocal response, from the applicant; converting the first vocal response to corresponding first textual data; providing the applicant with an updated visual representation, via the visual display device, of the first textual data appearing at the first portion of the document; and generating a printed document corresponding to the updated visual representation of the document. Systems and computer readable media also are provided.

Type: Grant

Filed: September 18, 2000

Date of Patent: June 15, 2004

Assignee: Hewlett-Packard Development Company, L.P.

Inventor: Gustavo M. Guillemin
Low power audio processor that multiplexes component distribution signals

Patent number: 6741961

Abstract: A low power audio processor is disclosed which includes: a bit stream processing unit for performing bit processing for an applied audio stream and for decoding the bit processed audio stream to have a format conducive to digital signal processing; a digital signal processing unit for receiving the decoded data from the bit stream processing unit to perform digital signal processing; a post processing unit for post processing audio data from the digital signal processing unit to output final audio data; and a host interface unit for interfacing with an external device to provide an audio parallel stream from the external device to the bit stream processing unit.

Type: Grant

Filed: March 14, 2001

Date of Patent: May 25, 2004

Assignee: Hyundai Electronics Industries Co., Ltd.

Inventor: Chae-Duck Lim
Data transfer system and data transfer method

Patent number: 6741964

Abstract: When recording digital data corresponding to a voice signal, a voice data recording and reproducing apparatus generates an error correction code and records this code together with the digital data in semiconductor memory. When transferring the digital data to the PC, a system control section in the voice data recording and reproducing apparatus transmits voice data including the error correction code without performing error correction. The system control section provides a lower data processing capability than that of a PC's CPU. The PC's CPU having a higher data processing capability performs error correction of the voice data by using the error correction code included in the received voice data.

Type: Grant

Filed: January 8, 2001

Date of Patent: May 25, 2004

Assignee: Olympus Optical Co., Ltd.

Inventor: Hideo Okano
Voiced speech preprocessing employing waveform interpolation or a harmonic model

Patent number: 6738739

Abstract: Voiced speech preprocessing employs waveform interpolation or a harmonic model circuit to smooth a transition region and simplify speech coding. At low bit rates, the speech is coded by a system that maintains a high perceptual quality in the transition region from a voiced (quasi-periodic) portion of the speech signal to an unvoiced (non-periodic) portion of the speech signal. Similarly, the transition region from an unvoiced portion to a voiced portion is conditioned to maintain a high perceptual quality at a low bandwidth. The transition region from one type of voiced region to another type of voiced region is also smoothed. The transition region is smoothed to create a quasi-periodic speech signal.

Type: Grant

Filed: February 15, 2001

Date of Patent: May 18, 2004

Assignee: Mindspeed Technologies, Inc.

Inventor: Yang Gao
Portrayal of talk group at a location in virtual audio space for identification in telecommunication system management

Patent number: 6735564

Abstract: A method and arrangement for managing talk groups of a telecommunication system at a dispatcher station of the telecommunications system having one or more talk groups which may consist of one or more users and which are controlled by the dispatcher at the dispatcher station. The arrangement includes a two-channel or a multichannel sound reproducing system which is configured to create an artificial acoustic space at the dispatcher station, and reproduce voices of each talk group so that the voices are heard from a certain point of the acoustic space, which allows the dispatcher to recognize the talk group to which the voice belongs on the basis of the location of the voice.

Type: Grant

Filed: December 28, 2000

Date of Patent: May 11, 2004

Assignee: Nokia Networks Oy

Inventor: Pekka Puhakainen
Encoding and decoding speech signals variably based on signal classification

Patent number: 6735567

Abstract: A speech compression system capable of encoding a speech signal into a bitstream for subsequent decoding to generate synthesized speech is disclosed. The speech compression system optimizes the bandwidth consumed by the bitstream by balancing the desired average bit rate with the perceptual quality of the reconstructed speech. The speech compression system comprises a full-rate codec, a half-rate codec, a quarter-rate codec and an eighth-rate codec. The codecs are selectively activated based on a rate selection. In addition, the full and half-rate codecs are selectively activated based on a type classification. Each codec is selectively activated to encode and decode the speech signals at different bit rates emphasizing different aspects of the speech signal to enhance overall quality of the synthesized speech.

Type: Grant

Filed: April 8, 2003

Date of Patent: May 11, 2004

Assignee: Mindspeed Technologies, Inc.

Inventors: Yang Gao, Adil Benyassine, Jes Thyssen, Eyal Shlomot, Huan-yu Su
Interactive multimedia book

Patent number: 6728681

Abstract: An interactive multimedia book provides hands-on multimedia instruction to the user in response to voiced commands. The book is implemented on a computer system and includes both text and audio/video clips. The interactive multimedia book is accessed by voiced commands and natural language queries as the primary user input. The displayed text is written in a markup language and contains hyperlinks which link the current topic with other related topics. The user may command the book to read the text and, as the text is read by the voice synthesizer, a word which is also a hyperlink will change its attributes upon being spoken. The user will be able to observe or hear this and simply utter the word which is the hyperlink to navigate to the linked topic.

Type: Grant

Filed: January 5, 2001

Date of Patent: April 27, 2004

Inventor: Charles L. Whitham
System and method for automated multimedia content indexing and retrieval

Patent number: 6714909

Abstract: The invention provides a system and method for automatically indexing and retrieving multimedia content. The method may include separating a multimedia data stream into audio, visual and text components, segmenting the audio, visual and text components based on semantic differences, identifying at least one target speaker using the audio and visual components, identifying a topic of the multimedia event using the segmented text and topic category models, generating a summary of the multimedia event based on the audio, visual and text components, the identified topic and the identified target speaker, and generating a multimedia description of the multimedia event based on the identified target speaker, the identified topic, and the generated summary.

Type: Grant

Filed: November 21, 2000

Date of Patent: March 30, 2004

Assignee: AT&T Corp.

Inventors: David Crawford Gibbon, Qian Huang, Zhu Liu, Aaron Edward Rosenberg, Behzad Shahraray

prev 1 2 3 4 5 6 … next