Patents Examined by W. R. Young

Cross-talk removal apparatus and data reproduction apparatus

Patent number: 7050371

Abstract: A pickup irradiates light beams at specified intervals in the tangential direction of the main track to be reproduced and both of the adjacent tracks on a disk 10 on which tracks are formed. A CTC unit 15 uses the delay amount ?d of the respective sample-value series corresponding to the reproduction signal RFm from the main track and reproduction signals RF1, RF2 from both adjacent tracks to correct the delay, then outputs a CTC output signal from which the cross-talk component has been removed. In addition, when adjusting the delay, the CPU 17 applies a disturbance to the control signal for the actuator of the servo-control unit 18 and changes the delay amount ?d within a specified range of change. At this time, the CPU 17 sets the delay amount ?d, which minimizes the jitter value found by the jitter detection unit 16 according to the CTC output signal, for the CTC unit 15. This makes it possible to set the optimum delay amount ?d when adjusting the delay, even when there is little steady cross-talk.

Type: Grant

Filed: March 6, 2002

Date of Patent: May 23, 2006

Assignee: Pioneer Corporation

Inventors: Shogo Miyanabe, Hiroki Kuribayashi
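
The delay-adjustment step in the abstract of patent 7050371 amounts to a one-dimensional search: try each candidate delay within the allowed range and keep the one that gives the lowest measured jitter. The following is a minimal Python sketch of that idea only. The names `best_delay`, `measure_jitter`, and `toy_jitter` are hypothetical, and the toy jitter curve stands in for the jitter detection unit; none of this is taken from the patent itself.

```python
from typing import Callable, Iterable


def best_delay(candidates: Iterable[float],
               measure_jitter: Callable[[float], float]) -> float:
    """Return the candidate delay with the lowest measured jitter."""
    return min(candidates, key=measure_jitter)


# Toy stand-in for the jitter detector (assumption): lowest jitter at 3.0.
def toy_jitter(delay: float) -> float:
    return (delay - 3.0) ** 2 + 0.5


delays = [1.0, 2.0, 3.0, 4.0, 5.0]
chosen = best_delay(delays, toy_jitter)
```

In a real drive the jitter measurement would be noisy, so a practical version would probably average several readings per candidate delay before comparing them.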
Optical scanning device

Patent number: 7050364

Abstract: An optical pickup unit for scanning an optical disk includes a first optical branch including a first radiation source for emitting a first radiation beam of a first wavelength, and a dichroic mirror (8) located in the path of the first radiation beam for reflecting the first beam towards the disk, and a second optical branch including a second radiation source for emitting a second radiation beam of a second wavelength, different to said first wavelength, and a folding mirror (28) located in the path of the second radiation beam for reflecting the second beam towards the disk. The first and second branches are stacked in the axial direction of the disk, and are arranged substantially perpendicularly when viewed along the axial direction.

Type: Grant

Filed: August 6, 2002

Date of Patent: May 23, 2006

Assignee: Koninklijke Philips Electronics N.V.

Inventors: Rene Verbecque, Petrus Theodorus Jutte
System and method for compressed domain beat detection in audio bitstreams

Patent number: 7050980

Abstract: A system and method for detecting beats in a compressed audio domain is disclosed where a beat detector functions as part of an error concealment system in an audio decoding section used in audio information transfer and audio download-streaming system terminal devices such as mobile phones. The beat detector includes a MDCT coefficient extractor, a band feature value analyzer, a confidence score calculator; and a converging and storage unit. The method provides beat detection by means of beat information obtained using both MDCT coefficients as well as window-switching information. A baseline beat position is determined using MDCT coefficients obtained from the audio bitstream which also provides a window-switching pattern. A window-switching beat position is compared with the baseline beat position and, if a predetermined condition is satisfied, the window-switching beat position is validated as a detected beat.

Type: Grant

Filed: September 28, 2001

Date of Patent: May 23, 2006

Assignee: Nokia Corp.

Inventors: Ye Wang, Miikka Vilermo
Track traverse counting on an optical disk

Patent number: 7046591

Abstract: An apparatus detects the number of tracks traversed on an optical disk. The apparatus includes a differentiation part for differentiating an output signal of a bandpass filter provided on an adjacent stage which eliminates a low DC component, a high frequency component and noise of an inputted modulation signal. A sample trigger generating circuit generates top and bottom signals in response to a data signal outputted through the differentiation part and a signal outputted through a comparator. A top hold part and a bottom hold part hold the data signal at top and bottom. A subtraction part performs a subtraction to obtain a difference between the top and bottom signals outputted from the top and bottom hold parts, thereby counting the number of tracks traversed while increasing the jumping speed at which the number of tracks traversed can be detected.

Type: Grant

Filed: June 12, 2000

Date of Patent: May 16, 2006

Assignee: CNS Co., Ltd.

Inventor: Young-San Ko
Method and system for providing automated captioning for AV signals

Patent number: 7047191

Abstract: System, method and computer-readable medium containing instructions for providing AV signals with open or closed captioning information. The system includes a speech-to-text processing system coupled to a signal separation processor and a signal combination processor for providing automated captioning for video broadcasts contained in AV signals. The method includes separating an audio signal from an AV signal, converting the audio signal to text data, encoding the original AV signal with the converted text data to produce a captioned AV signal and recording and displaying the captioned AV signal. The system may be mobile and portable and may be used in a classroom environment for producing recorded captioned lectures and used for broadcasting live, captioned lectures. Further, the system may automatically translate spoken words in a first language into words in a second language and include the translated words in the captioning information.

Type: Grant

Filed: March 6, 2001

Date of Patent: May 16, 2006

Assignee: Rochester Institute of Technology

Inventors: Jeffrey K. Lange, Robert H. Paine, Jeremiah L. Parry-Hill, Steven H. Wunrow
Real-time control of playback rates in presentations

Patent number: 7047201

Abstract: Media encoding, transmission, and playback processes and structures employ a multi-channel architecture with different audio channels corresponding to different playback rates for a presentation to be transmitted over a network. Audio frames in the various audio channels all correspond to the same amount of time in the original presentation and have frame indexes that identify in the different audio channels the frames corresponding to the same time interval in the presentation. A user can make a real-time change in playback rate causing selection of a channel corresponding to the new playback rate and a frame required for prompt and smooth transition in the playback rate of the presentation. The architecture can additionally provide channels for graphics data such as image data that are displayed according to the index of the audio, and different audio channels with the same playback rate but different compression schemes for use according to available bandwidth on the network.

Type: Grant

Filed: May 4, 2001

Date of Patent: May 16, 2006

Assignee: SSI Corporation

Inventor: Kenneth H. P. Chang
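
The shared frame index described in the abstract of patent 7047201 is what makes a rate change cheap: because frame i covers the same time interval in every channel, the player can switch channels and continue at the same index. A minimal Python sketch under that reading follows. The `play` function, the `CHANNELS` table, and the frame labels are hypothetical illustrations, not the patent's data format.

```python
from typing import Dict, List


def play(channels: Dict[float, List[str]], start_rate: float,
         rate_changes: Dict[int, float], n_frames: int) -> List[str]:
    """Pick one frame per index, switching channel when the user changes rate."""
    rate = start_rate
    played = []
    for index in range(n_frames):
        # A rate change takes effect at the frame index where it is requested.
        rate = rate_changes.get(index, rate)
        played.append(channels[rate][index])
    return played


# Hypothetical channels: frame i in every channel covers the same time interval.
CHANNELS = {
    1.0: ["n0", "n1", "n2", "n3"],
    2.0: ["f0", "f1", "f2", "f3"],
}
frames = play(CHANNELS, 1.0, {2: 2.0}, 4)
```

Here the listener starts at normal speed and asks for double speed at frame 2, so the first two frames come from the 1.0 channel and the rest from the 2.0 channel.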
Multilingual document retrieval system

Patent number: 7047182

Abstract: As a retrieval result, appropriate text of a second language is provided in response to a retrieval request by text of a first language. A first directory storing part stores a first directory structure created for a first language. A second directory storing part stores a second directory structure created for a second language. A directory relation storing part stores correspondences between directories in the first directory structure and directories in the second directory structure. A directory retrieval part receives a retrieval request by the first language from a user and decides which directory in the first directory structure the request has a high degree of relation with. A multilingual retrieval part decides documents having a high degree of relation with the retrieval request, of documents belonging to a directory in the second directory structure that corresponds to the decided directory.

Type: Grant

Filed: December 13, 2001

Date of Patent: May 16, 2006

Assignee: Fuji Xerox Co., Ltd.

Inventor: Hiroshi Masuichi
Background noise estimation method for an improved G.729 annex B compliant voice activity detection circuit

Patent number: 7043428

Abstract: A method of initializing an ITU Recommendation G.729 Annex B compliant voice activity detection (VAD) device is disclosed, having the steps of (1) determining a first set of running average background noise characteristics in accordance with Recommendation G.729B; (2) determining a second set of running average background noise characteristics; and (3) substituting the second set of running average background noise characteristics for the first set when a specific event occurs. The specific event is a divergence between the first and second sets of running average background noise characteristics.

Type: Grant

Filed: August 3, 2001

Date of Patent: May 9, 2006

Assignee: Texas Instruments Incorporated

Inventor: Dunling Li
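
The three steps in the abstract of patent 7043428 can be illustrated with two running averages, where the second replaces the first once they diverge. The Python sketch below is a simplified illustration only: it tracks a single scalar noise energy rather than the full G.729 Annex B parameter set, and the smoothing factor `alpha`, the threshold `max_gap`, and the `first_may_update` flag are assumptions introduced here, not values from the Recommendation or the patent.

```python
from typing import Optional


class NoiseTracker:
    """Two running averages of noise energy; the second can replace the first."""

    def __init__(self, alpha: float, max_gap: float) -> None:
        self.alpha = alpha        # smoothing factor (hypothetical)
        self.max_gap = max_gap    # divergence threshold (hypothetical)
        self.first: Optional[float] = None
        self.second: Optional[float] = None

    def update(self, energy: float, first_may_update: bool) -> bool:
        """Feed one frame energy; return True if a substitution happened."""
        if self.first is None or self.second is None:
            self.first = self.second = energy
            return False
        # The second average always tracks the input.
        self.second += self.alpha * (energy - self.second)
        # The first average only moves when its own update rule allows it.
        if first_may_update:
            self.first += self.alpha * (energy - self.first)
        # Divergence between the two averages triggers the substitution.
        if abs(self.first - self.second) > self.max_gap:
            self.first = self.second
            return True
        return False


tracker = NoiseTracker(alpha=0.5, max_gap=3.0)
tracker.update(10.0, first_may_update=True)
substituted = tracker.update(20.0, first_may_update=False)
```

The point of the second average in this sketch is recovery: if the first estimate stops updating while the noise floor shifts, the gap grows until the substitution resets it.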
Standardized inpatient-outpatient nomenclatures and accepting both outpatient and inpatient data to commonly accessible storage

Patent number: 7043437

Abstract: Methods and systems which provide for standardized inpatient-outpatient nomenclatures and accepting both outpatient and inpatient data to commonly accessible storage. In a first method embodiment, the first method is characterized by accepting user input identifying at least two different names for a substantially similar grouping of one or more medical criteria; and accepting user input specifying at least one of the at least two different names as forming at least a part of an outpatient-inpatient standardized nomenclature for the substantially similar grouping of one or more medical criteria. In a second method embodiment, the second method is characterized by accepting either outpatient or inpatient data to commonly accessible storage.

Type: Grant

Filed: January 3, 2002

Date of Patent: May 9, 2006

Assignee: The United States of America as represented by the Secretary of the Army

Inventors: Peter E. Nielsen, Brook A. Thomson
Model adaptive apparatus and model adaptive method, recording medium, and pattern recognition apparatus

Patent number: 7043425

Abstract: In order to improve recognition performance, a no-speech sound model correction section performs an adaptation of a no-speech sound model which is a sound model representing a no-speech state on the basis of input data observed in an interval immediately before a speech recognition interval for the object of speech recognition and the degree of freshness representing the recentness of the input data.

Type: Grant

Filed: March 24, 2005

Date of Patent: May 9, 2006

Assignee: Sony Corporation

Inventor: Hongchang Pao
Method and system for text-to-speech caching

Patent number: 7043432

Abstract: In a text-to-speech system, a method of converting text-to-speech can include receiving a text input and comparing the received text input to at least one entry in a text-to-speech cache memory. Each entry in the text-to-speech cache memory can specify a corresponding spoken output. If the text input matches one of the entries in the text-to-speech cache memory, the cached speech output specified by the matching entry can be provided.

Type: Grant

Filed: August 29, 2001

Date of Patent: May 9, 2006

Assignee: International Business Machines Corporation

Inventors: Raimo Bakis, Hari Chittaluru, Edward A. Epstein, Steven J. Friedland, Abraham Ittycheriah, Stephen G. Lawrence, Michael A. Picheny, Charles Rutherfoord, Maria E. Smith
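
The lookup described in the abstract of patent 7043432 can be sketched as a dictionary keyed by input text. This is a minimal Python illustration: `CachedSynthesizer` and `fake_tts` are hypothetical names, and the miss path (synthesize, then store the result) is an assumption added for completeness, since the abstract only describes what happens on a match.

```python
from typing import Callable, Dict


class CachedSynthesizer:
    """Return cached speech for repeated text; synthesize only on a miss."""

    def __init__(self, synthesize: Callable[[str], bytes]) -> None:
        self._synthesize = synthesize
        self._cache: Dict[str, bytes] = {}
        self.misses = 0

    def speak(self, text: str) -> bytes:
        cached = self._cache.get(text)
        if cached is not None:
            return cached              # cache hit: reuse the stored output
        self.misses += 1
        audio = self._synthesize(text)
        self._cache[text] = audio      # assumption: store the result for reuse
        return audio


# Stand-in for a real synthesizer (hypothetical): just encodes the text.
def fake_tts(text: str) -> bytes:
    return text.encode("utf-8")


tts = CachedSynthesizer(fake_tts)
first = tts.speak("hello")
second = tts.speak("hello")
```

The second call returns the stored bytes without invoking the synthesizer again, which is the saving the cache is meant to provide.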
Structured speech recognition

Patent number: 7043426

Abstract: Speech recognition methods, systems, and interfaces are used in the generation of medical reports from data in a hierarchically-organized database for the entry and searching of data in a database based on spoken utterances of a user. A workflow function facilitates a series of contexts, typically based on information in a knowledge base, that are used to establish procedural rules and word-mapping databases for each context for word-matching data entry based on spoken utterances of a user. The generation of medical reports from the entered medical data provides for searching the database generated using the speech recognition methods. The series of contexts and word-mapping databases are developed using a hierarchically-organized database representation based on knowledge regarding the relationship of data items in the main database.

Type: Grant

Filed: August 24, 2001

Date of Patent: May 9, 2006

Assignee: Cyberpulse, L.L.C.

Inventors: James Robergé, James Wolfer, Jeffrey Soble
Monte Carlo method for natural language understanding and speech recognition language models

Patent number: 7039579

Abstract: A Monte Carlo method for use with natural language understanding and speech recognition language models can include a series of steps. The steps can include identifying at least one phrase embedded in a body of text wherein the phrase can belong to a phrase class. An additional attribute corresponding to the identified phrase can be determined. The body of text can be copied and the identified phrase can be replaced with a different phrase selected from a plurality of phrases. The different phrase can belong to the phrase class and correspond to the attribute.

Type: Grant

Filed: September 14, 2001

Date of Patent: May 2, 2006

Assignee: International Business Machines Corporation

Inventors: Mark E. Epstein, Jean-Christophe Marcadet, Kevin B. Smith
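
The copy-and-replace step in the abstract of patent 7039579 can be sketched as follows. This Python example is illustrative only: the phrase class (city names), the attribute (number of words in the phrase), and the names `CITY_PHRASES` and `vary` are assumptions chosen for the example, not details from the patent.

```python
import random
from typing import Dict, List

# Hypothetical phrase class: city names grouped by an attribute (word count).
CITY_PHRASES: Dict[int, List[str]] = {
    1: ["Boston", "Chicago", "Denver"],
    2: ["New York", "San Diego"],
}


def vary(text: str, phrase: str, phrases_by_attr: Dict[int, List[str]],
         rng: random.Random) -> str:
    """Return a copy of text with phrase swapped for another of the same class."""
    attribute = len(phrase.split())          # attribute of the identified phrase
    options = [p for p in phrases_by_attr[attribute] if p != phrase]
    return text.replace(phrase, rng.choice(options), 1)


rng = random.Random(0)
variant = vary("I want to fly to Boston tomorrow", "Boston", CITY_PHRASES, rng)
```

Repeating the call many times yields many variants of the same sentence, which is how random sampling of this kind can expand a small training corpus.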
Method for the encoding of prosody for a speech encoder working at very low bit rates

Patent number: 7039584

Abstract: A speech encoding/decoding method using an encoder working at very low bit rates, comprises a learning step enabling the identification of the representatives of the speech signal; and an encoding step to segment the speech signal and determine the best representative associated with each recognized segment. The method also comprises at least one step for the encoding/decoding of at least one of the parameters of the prosody of the recognized segments, e.g., the energy, pitch, voicing, and/or length of the segments, by using a piece of information on prosody pertaining to the best representatives. The method can employ a bit rate of lower than 400 bits per second.

Type: Grant

Filed: October 18, 2001

Date of Patent: May 2, 2006

Assignee: Thales

Inventors: Philippe Gournay, Yves-Paul Nakache
Supervised automatic text generation based on word classes for language modeling

Patent number: 7035789

Abstract: A system and method is provided that randomly generates text with a given structure. The structure is taken from a number of learning examples. The structure of training examples is captured by word classification and the definition of the relationships between word classes in a given language. The text generated with this procedure is intended to replicate the information given by the original learning examples. The resulting text may be used to better model the structure of a language in a stochastic language model.

Type: Grant

Filed: September 4, 2001

Date of Patent: April 25, 2006

Assignees: Sony Corporation, Sony Electronics, Inc.

Inventors: Gustavo Hernandez Abrego, Xavier Menendez-Pidal
Compressing and using a concatenative speech database in text-to-speech systems

Patent number: 7035794

Abstract: A method and apparatus are provided for compressing and using a concatenative speech database in TTS systems to improve the quality of speech output generated by handheld TTS systems by allowing synthesis to occur on the client. According to one embodiment of the present invention, a G.723 encoder receives diphone waveforms and compresses them into diphone residuals. While compressing the diphone waveforms, the encoder generates Linear Predictive Coding (LPC) coefficients. The diphone residuals and the encoder-generated LPC coefficients are then stored in an encoder-generated compressed packet.

Type: Grant

Filed: March 30, 2001

Date of Patent: April 25, 2006

Assignee: Intel Corporation

Inventor: Sudheer Sirivara
Feature-domain concatenative speech synthesis

Patent number: 7035791

Abstract: A method for speech synthesis includes receiving an input speech signal containing a set of speech segments, and estimating spectral envelopes of the input speech signal in a succession of time intervals during each of the speech segments. The spectral envelopes are integrated over a plurality of window functions in a frequency domain so as to determine elements of feature vectors corresponding to the speech segments. An output speech signal is reconstructed by concatenating the feature vectors corresponding to a sequence of the speech segments.

Type: Grant

Filed: July 10, 2001

Date of Patent: April 25, 2006

Assignee: International Business Machines Corporation

Inventors: Dan Chazan, Ron Hoory
Method for sending multi-media messages using customizable background images

Patent number: 7035803

Abstract: A system and method of providing sender customization of multi-media messages through the use of inserted images or video. The images or video may be sender-created or predefined and available to the sender via a web server. The method relates to customizing a multi-media message created by a sender for a recipient, the multi-media message having an animated entity audibly presenting speech converted from text created by the sender. The method comprises receiving at least one image from the sender, associating each at least one image with a tag, presenting the sender with options to insert the tag associated with one of the at least one image into the sender text, and after the sender inserts the tag associated with one of the at least one images into the sender text, delivering the multi-media message with the at least one image presented as background to the animated entity according to a position of the tag associated with the at least one image in the sender text.

Type: Grant

Filed: November 2, 2001

Date of Patent: April 25, 2006

Assignee: AT&T Corp.

Inventors: Joern Ostermann, Barbara Buda, Mehmet Reha Civanlar, Eric Cosatto, Hans Peter Graf, Thomas M. Isaacson, Yann Andre LeCun
Speech recognition system including speech section detecting section

Patent number: 7035798

Abstract: A trained vector generation section 16 generates beforehand a trained vector V of unvoiced sounds. An LPC Cepstrum analysis section 18 generates a feature vector A of a voice within the non-voice period, an inner product operation section 19 calculates an inner product value VTA between the feature vector A and the trained vector V, and a threshold generation section 20 generates a threshold ?v on the basis of the inner product value VTA. Also, the LPC Cepstrum analysis section 18 generates a prediction residual power ? of the signal within the non-voice period, and the threshold generation section 22 generates a threshold THD on the basis of the prediction residual power ?.

Type: Grant

Filed: September 12, 2001

Date of Patent: April 25, 2006

Assignee: Pioneer Corporation

Inventor: Hajime Kobayashi
Speech recognition apparatus using distance based acoustic models

Patent number: 7031917

Abstract: The present invention relates to a speech recognition apparatus and a speech recognition method for speech recognition with improved accuracy. A distance calculator 47 determines the distance from a microphone 21 to a user uttering. Data indicating the determined distance is supplied to a speech recognition unit 41B. The speech recognition unit 41B has plural sets of acoustic models produced from speech data obtained by capturing speeches uttered at various distances. From those sets of acoustic models, the speech recognition unit 41B selects a set of acoustic models produced from speech data uttered at a distance closest to the distance determined by the distance calculator 47, and the speech recognition unit 41B performs speech recognition using the selected set of acoustic models.

Type: Grant

Filed: October 21, 2002

Date of Patent: April 18, 2006

Assignee: Sony Corporation

Inventor: Yasuharu Asano
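
The model selection described in the abstract of patent 7031917 is a nearest-neighbor choice over the distances at which each set of acoustic models was trained. A minimal Python sketch follows. The `MODEL_SETS` table, its distances in meters, and the string placeholders for the model sets are hypothetical.

```python
from typing import Dict


def select_model_set(model_sets: Dict[float, str], measured: float) -> str:
    """Return the model set trained at the distance closest to the measured one."""
    nearest = min(model_sets, key=lambda trained: abs(trained - measured))
    return model_sets[nearest]


# Hypothetical model sets keyed by the distance (in meters) of the training speech.
MODEL_SETS = {
    0.3: "near-field models",
    1.0: "mid-range models",
    3.0: "far-field models",
}
selected = select_model_set(MODEL_SETS, 0.8)
```

A measured distance of 0.8 m is closest to the 1.0 m training distance, so the mid-range set is used for recognition.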
