Patents Examined by V. Paul Harper

Tracking time using portable recorders and speech recognition

Patent number: 7171365

Abstract: In general, the present invention converts speech, preferably recorded on a portable recorder, to text, analyzes the text, and determines voice commands and times when the voice commands occurred. Task names are associated with voice commands and time segments. These time segments and tasks may be packaged as time increments and stored (e.g., in a file or database) for further processing. Preferably, phrase grammar rules are used when analyzing the text, as this helps to determine voice commands. Using phrase grammar rules also allows the text to contain a variety of topics, only some of which are pertinent to tracking time.

Type: Grant

Filed: February 16, 2001

Date of Patent: January 30, 2007

Assignee: International Business Machines Corporation

Inventors: James William Cooper, Donna Karen Byron
Multimedia information retrieval method, program, record medium and system

Patent number: 7167823

Abstract: Paired image information and text information correlated to each other are retrieved as information sets. Frequency information on words used in text is extracted from text information in a group of information sets, and text information features are extracted based on frequency information. Text features are used to lay out information sets in a virtual space such that similar pieces of text are located close to each other, and images are displayed at those positions. Further, important words are extracted from those words extracted from text information in a group of information sets, and those words are laid out in the virtual space in the same manner as with information sets and displayed as labels.

Type: Grant

Filed: November 27, 2002

Date of Patent: January 23, 2007

Assignee: Fujitsu Limited

Inventors: Susumu Endo, Yuusuke Uehara, Daiki Masumoto, Syuuichi Shiitani
Presentation-quality buffering process for real-time audio

Patent number: 7162418

Abstract: A buffering process for real-time digital audio is provided to effect of network “jitter” from inconsistent network packet delivery rates. The buffering algorithm is particularly useful for audio data including distinct bursts separated by silence, such as speech. The process holds incoming audio packets in a queue until either: (a) the buffer contents meet a predetermined threshold; or (b) the end packet of a burst is received. The result is that silent periods between bursts may expand or decrease relative to the original audio pattern, allowing cumulative jitter to be played out as silence. The threshold is sized such that the deviation in silence is unnoticeable by a listener. In an optional embodiment, the process periodically adjusts the threshold to adapt to network conditions.

Type: Grant

Filed: November 15, 2001

Date of Patent: January 9, 2007

Assignee: Microsoft Corporation

Inventors: Ivan J. Leichtling, Ido Ben-Shachar
Multi-channel speech enhancement system and method based on psychoacoustic masking effects

Patent number: 7158933

Abstract: The present invention is generally directed to a system and method for enhancing speech using a multi-channel noise filtering process that is based on psychoacoustic masking effects. A speech enhancement/noise reduction scheme according to the present invention is designed to satisfy the psychoacoustic masking principle and to minimize the signal total distortion by exploiting multiple microphone signals to enhance the useful speech signal at reduced level of artifacts.

Type: Grant

Filed: May 10, 2002

Date of Patent: January 2, 2007

Assignee: Siemens Corporate Research, Inc.

Inventors: Radu Victor Balan, Justinian Rosca
Noise spectrum subtraction method and system

Patent number: 7155387

Abstract: A method for reducing noise in a voice signal, and a voice operated system utilizing the same are presented. A noise component in a compressed digital signal representative of the voice signal is determined, and subtracted from the compressed digital signal.

Type: Grant

Filed: January 8, 2001

Date of Patent: December 26, 2006

Assignee: Art - Advanced Recognition Technologies Ltd.

Inventor: Amir Globerson
Transmission of voice in packet switched networks

Patent number: 7146312

Abstract: There is disclosed in a packet switched network a method of encoding speech packets into blocks, each speech packet including a speech header and a payload comprised of a speech frame, wherein at least two speech frames are encoded into a single block. Each speech frame may be associated with a different user. An EDGE network utilizes such a method.

Type: Grant

Filed: March 28, 2000

Date of Patent: December 5, 2006

Assignee: Lucent Technologies Inc.

Inventors: Christian Demetrescu, Konstantinos Samaras, Jian Jun Wu
Dynamic adjustment of noise separation in data handling, particularly voice activation

Patent number: 7146314

Abstract: Data handling dynamically responds to changing noise power conditions to separate valid data from noise. A reference power level acts as a threshold between dynamically assumed noise and valid data, and dynamically refers to the reference power level changing adaptively with the background noise. The introduction of dynamic noise control in VOX (Voice Activated Transmission) improves a VOX device operation in a noisy environment, even when the background noise profiles are changing. Processing is on a frame by frame basis for successive frames. The threshold is adaptively changed when a comparison of frame signal power to the threshold indicates speech or the absence of speech in the compared frame repeatedly and continuously for a period of time involving plural successive frames having no valid speech or noise above the threshold to correspondingly reduce or increase the threshold by changing the threshold to a value that is a function of the input signal power.

Type: Grant

Filed: December 20, 2001

Date of Patent: December 5, 2006

Assignee: Renesas Technology Corporation

Inventor: Yunbiao Wang
Methods and apparatus for audio data analysis and data mining using speech recognition

Patent number: 7133828

Abstract: The present invention provides an audio analysis intelligence tool that provides ad-hoc search capabilities using spoken words as an organized data form. The present invention provides an SQL like interface to process and search audio data and combine it with other traditional data forms.

Type: Grant

Filed: October 20, 2003

Date of Patent: November 7, 2006

Assignee: SER Solutions, Inc.

Inventors: Robert Scarano, Lawrence Mark
Recording and reproducing apparatus for use with optical recording medium having real-time, losslessly encoded data

Patent number: 7133832

Abstract: A lossless encoding apparatus encodes audio data and a lossless decoding apparatus restores the losslessly compression encoded audio data on a real-time basis, and a method therefor. The lossless encoding apparatus includes a lossless compression unit which losslessly compression encodes the audio data stored in an input buffer in units of predetermined data and outputs the encoded data in sequence, and an output buffer which stores the encoded audio data output from the lossless compression unit.

Type: Grant

Filed: October 28, 2003

Date of Patent: November 7, 2006

Assignee: Samsung Electronics Co., Ltd.

Inventor: Jae-Hoon Heo
Robust talker localization in reverberant environment

Patent number: 7130797

Abstract: A method of locating a talker in a reverberant environment comprises receiving multiple audio signals from a microphone array that include direct path audio signal and reverberation signal components. The direct path audio signal components of the multiple audio signals are detected and are used to weight the multiple audio signals. A position estimate based on the weighted audio signals is then calculated. Periods of speech activity are detected and a final position estimate is generated during the periods of speech activity.

Type: Grant

Filed: August 15, 2002

Date of Patent: October 31, 2006

Assignee: Mitel Networks Corporation

Inventors: Franck Beaucoup, Michael Tetelbaum
Method and apparatus for converting utterance representations into actions in a conversational system

Patent number: 7127402

Abstract: A conversation manager processes a spoken utterance from a user of a computer that is directed to an application program hosted on the computer. The conversation manager includes a reasoning facility which accesses goal-directed rules stored in a rules base (e.g., database). The reasoning facility also has access to a conversational record that includes a record of previous utterances and a semantic analysis for each utterance. The reasoning facility processes a representation of the utterance by using the goal-directed rules. The reasoning facility uses means-end analysis to determine the proper rules to execute, and thus the script calls to make to achieve the goal of processing the utterance. While processing the utterance, the reasoning facility attempts to resolve any ambiguities in the representation of the utterance and to fill in any missing information that is needed to achieve its goal.

Type: Grant

Filed: January 10, 2002

Date of Patent: October 24, 2006

Assignee: International Business Machines Corporation

Inventors: Steven I. Ross, Elizabeth A. Brownholtz, Jeffrey G. MacAllister
Filter banked gain control of audio in a noisy environment

Patent number: 7120579

Abstract: A method and device for boosting an input signal to overcome noise. Both the input signal, S(t), and an estimate of the noise, N(t), are bandpassed in adjacent pass bands to produce signal and noise subbands. Preferably, the input signal is delayed before being bandpassed. The power envelopes of the signal subbands are converted to signal masking functions, (70), that incorporate the phenomena of forward and backward masking. Signal masking functions whose amplitudes are below the amplitudes of their frequency neighbors are nulled. Similarly, noise subbands whose powers are below the powers of neighboring noise subbands are nulled. The surviving signal masking functions are compared to the corresponding surviving noise power envelopes to determine the degree to which the surviving signal subbands must be amplified, (78), to overcome the noise. The surviving signal subbands are so amplified and summed to provide the output signal, S?(t).

Type: Grant

Filed: July 27, 2000

Date of Patent: October 10, 2006

Assignee: Clear Audio Ltd.

Inventor: Zvi (Tsvika) Licht
Virtual presence

Patent number: 7120577

Abstract: A system and terminal for facilitating a “virtual presence” allows users on a communication network to simply begin speaking through other users. A system immediately detects the destination party's name, and begins routing the audio signal to a particular destination without any noticeable call set-up. Additionally, the system performs pitch corrected speed control in order to allow the detection and processing of a speech pattern without causing delay to an end user.

Type: Grant

Filed: January 9, 2003

Date of Patent: October 10, 2006

Assignee: Intel Corporation

Inventor: Howard Bubb
Systems, methods and computer program products for designing, deploying and managing interactive voice response (IVR) systems

Patent number: 7117158

Abstract: An IVR system may be designed by accepting designer inputs to generate, on a display screen, a flowchart of interconnected flowchart processing blocks and flowchart decision blocks that represent a process flow of processing steps and branches, respectively, in the IVR system. By allowing the designer to generate a flowchart of interconnected flowchart processing blocks and flowchart decision blocks on a single display screen, a potentially simplified graphical user interface may be provided for designing an IVR system. The flowchart of interconnected flowchart processing blocks and flowchart decision blocks may be executed, based on at least one designer input on a keypad image, to simulate or test the IVR system. A self-documenting audit trail may be provided during the design of the IVR system. These audit trails may be associated with a version of the IVR system, so that multiple versions of the system may be managed.

Type: Grant

Filed: April 25, 2002

Date of Patent: October 3, 2006

Assignee: Bilcare, Inc.

Inventors: Phyllis Marie Dyer Weldon, Edwin Bruce Shankle, III, James Arthur Klein, Jr.
Method and apparatus for performing packet loss or frame erasure concealment

Patent number: 7117156

Abstract: The invention concerns a method and apparatus for performing packet loss or Frame Erasure Concealment (FEC) for a speech coder that does not have a built-in or standard FEC process. A receiver with a decoder receives encoded frames of compressed speech information transmitted from an encoder. A lost frame detector at the receiver determines if an encoded frame has been lost or corrupted in transmission, or erased. If the encoded frame is not erased, the encoded frame is decoded by a decoder and a temporary memory is updated with the decoder's output. A predetermined delay period is applied and the audio frame is then output. If the lost frame detector determines that the encoded frame is erased, a FEC module applies a frame concealment process to the signal. The FEC processing produces natural sounding synthetic speech for the erased frames.

Type: Grant

Filed: April 19, 2000

Date of Patent: October 3, 2006

Assignee: AT&T Corp.

Inventor: David A. Kapilow
Document expansion in speech retrieval

Patent number: 7113910

Abstract: Methods of document expansion for a speech retrieval document by a recognizer. A database of vectors of automatic transcriptions of documents is accessed and the vectors are truncated by removing all terms that are not recognizable by the recognizer to create truncated vectors. Terms in the vectors are then weighted to associate the truncated vectors with the untruncated vectors. Terms not recognized by the recognizer are then added back to the weighted, truncated vectors. The retrieval effectiveness may then be measured.

Type: Grant

Filed: December 19, 2000

Date of Patent: September 26, 2006

Assignee: AT&T Corp.

Inventors: Fernando Carlos Pereira, Amitabh Kumar Singhal
Translation leveraging

Patent number: 7110937

Abstract: An application archive is searched for an existing translation for a text string in an application to be localized. The text string is associated with context information that identifies a location of the text string in the application. If an existing translation is found that matches the text string, and all, or alternately part of, the context information, the existing translation is logically linked to the text string. In one aspect, the existing translation is selected from multiple matches based on number of occurrences. In another aspect, the existing translation is submitted to a manual validation process.

Type: Grant

Filed: June 20, 2002

Date of Patent: September 19, 2006

Assignee: Siebel Systems, Inc.

Inventors: Shu Lei, Sergey Parievsky, Mark Hastings
Computer network including a computer system transmitting screen image information and corresponding speech information to another computer system

Patent number: 7103551

Abstract: A described computer network includes a first computer system and a second computer system. The first computer system transmits screen image information and corresponding speech information to the second computer system. The screen image information includes information corresponding to a screen image intended for display within the first computer system. The speech information conveys a verbal description of the screen image. When the screen image includes one or more objects (e.g., menus, dialog boxes, icons, and the like) having corresponding semantic information, the speech information includes the corresponding semantic information. The second computer system responds to the speech information by producing an output (e.g., human speech via an audio output device, a tactile output via a Braille output device, and the like). The semantic information conveyed by the output allows a visually-impaired user of the second computer system to know intended purposes of the objects.

Type: Grant

Filed: May 2, 2002

Date of Patent: September 5, 2006

Assignee: International Business Machines Corporation

Inventors: Charles J. King, Hidemasa Muta, Richard Scott Schwerdtfeger, Andrea Snow-Weaver
Customizing the speaking style of a speech synthesizer based on semantic analysis

Patent number: 7096183

Abstract: A method is provided for customizing the speaking style of a speech synthesizer. The method includes: receiving input text; determining semantic information for the input text; determining a speaking style for rendering the input text based on the semantic information; and customizing the audible speech output of the speech synthesizer based on the identified speaking style.

Type: Grant

Filed: February 27, 2002

Date of Patent: August 22, 2006

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventor: Jean-Claude Junqua
Method to expand inputs for word or document searching

Patent number: 7089188

Abstract: An electronic document searching system or word searching system which when given an input, expands the input as a function of acoustic similarity and/or word sequence occurrence frequency. Results of the system are alternative input words or phrases. The alternative input words or phrases are output from the system for further processing.

Type: Grant

Filed: March 27, 2002

Date of Patent: August 8, 2006

Assignee: Hewlett-Packard Development Company, L.P.

Inventors: Beth T. Logan, Jean-Manuel Van Thong, Pedro J. Moreno

prev 1 2 3 4 5 6 7 … next