Sound Editing Patents (Class 704/278)

Network based music playing/song accompanying service system and method

Publication number: 20030014262

Abstract: A system and method for providing a service of playing an accompaniment/musical performance is disclosed. In order to embody the system and method for providing the service of playing the accompaniment/musical performance, virtual orchestra system (VOS) files, which are converted from digital music files, e.g., musical instrument digital interface (MIDI) files, and include play order notes and sound data for each musical instrument capable of being played, are used. A server provides the VOS files through a network, e.g., a local area network (LAN), an intranet, a value added network (VAN), the Internet or a public switched telephone network. A piece of music is selected by a user through at least one client terminal. The play order note for each musical instrument is provided and the sound data for each musical instrument is played based on the play order note, thereby playing in solo or in concert. At this time, the sound of the other musical instruments is silent or used as background music.

Type: Application

Filed: June 20, 2002

Publication date: January 16, 2003

Inventor: Yun-Jong Kim
Speech recognition program mapping tool to align an audio file to verbatim text

Publication number: 20030004724

Abstract: The invention includes a method to determine time location of at least one audio segment in an original audio file comprising: (a) receiving the original audio file; (b) transcribing a current audio segment from the original audio file using speech recognition software; (c) extracting a transcribed element and a binary audio stream corresponding to the transcribed element from the speech recognition software; (d) saving an association between the transcribed element and the corresponding binary audio stream; (e) repeating (b) through (d) for each audio segment in the original audio file; (f) for each transcribed element, searching for the associated binary audio stream in the original audio file, while tracking an end time location of that search within the original audio file; and (g) inserting the end time location for each binary audio stream into the transcribed element-corresponding binary audio stream association.

Type: Application

Filed: April 5, 2002

Publication date: January 2, 2003

Inventors: Jonathan Kahn, Michael C. Huttinger, Stephen J. Scalpone
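
Steps (f) and (g) of the abstract for publication 20030004724 above can be sketched as a byte search that records where each match ends. This is a minimal illustration, not the applicants' implementation: the function name `locate_segments`, the `(text, audio_bytes)` pair layout, and the `bytes_per_second` parameter are all assumptions made here, and the audio is treated as uncompressed bytes at a constant rate.

```python
def locate_segments(original, segments, bytes_per_second):
    """Find each (text, audio_bytes) pair in `original`.

    Returns (text, end_time_in_seconds) pairs; the time is None when the
    audio bytes cannot be found. All names here are hypothetical.
    """
    results = []
    cursor = 0  # the next search resumes where the previous match ended
    for text, chunk in segments:
        start = original.find(chunk, cursor)
        if start < 0:
            results.append((text, None))
            continue
        cursor = start + len(chunk)
        # The end offset of the match, converted to a time location.
        results.append((text, cursor / bytes_per_second))
    return results


# Made-up data: 4 bytes per "second" keeps the arithmetic easy to follow.
original = b"AAAABBBBBBBBCCCC"
segments = [("hello", b"AAAA"), ("big", b"BBBBBBBB"), ("world", b"CCCC")]
timeline = locate_segments(original, segments, bytes_per_second=4)
```

Resuming each search at the previous end offset keeps repeated audio from matching an earlier position.
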
Audio reproducing apparatus

Patent number: 6484137

Abstract: An audio reproducing apparatus comprises: audio decoding means for decoding an input audio signal frame by frame; data expanding/compressing means for subjecting data in a decoded frame to time-scale modification process; a frame sequence table which contains a sequence determined according to a given speed rate in which respective frames are expanded/compressed; frame counting means for counting the number of frames of the input audio signal; and data expansion/compression control means for instructing the data expanding/compressing means to subject the frame to one of time-scale compression process, time-scale expansion process, and process without time-scale modification process, with reference to the frame sequence table based on a count value output from the frame counting means, the data expanding/compressing means subjecting the audio signal to time-scale modification process in accordance with an instruction signal from the data expansion/compression control means.

Type: Grant

Filed: October 29, 1998

Date of Patent: November 19, 2002

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Hirotsugu Taniguchi, Masayuki Misaki, Junichi Tagawa, Michio Matsumoto
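
The control path in the abstract for patent 6484137 above (frame count, table lookup, instruction) might look roughly like the sketch below. The table contents, the instruction names, and the wrap-around lookup are assumptions for illustration only; the abstract does not say how a table is derived from a speed rate, and no actual time-scale modification is performed here.

```python
# Hypothetical table for one speed setting: compress one frame in every three.
SEQUENCE_TABLE = ["compress", "none", "none"]


def instruction_for(frame_count, table):
    """Pick the instruction for a frame from its running count.

    The table is assumed to repeat, so the count wraps around its length.
    """
    return table[frame_count % len(table)]


def schedule(num_frames, table):
    """List the instruction issued for each of the first `num_frames` frames."""
    return [instruction_for(count, table) for count in range(num_frames)]


plan = schedule(5, SEQUENCE_TABLE)
```
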
Information recording medium, apparatus and method for performing after-recording on the recording medium

Patent number: 6480828

Abstract: The invention provides an information recording medium, such as an optical disk, having a large capacity and being capable of performing read/write operations at high speeds. The recording medium includes an audio stream prepared for after-recording data, and audio attribute information having bit rate information for the recorded audio stream as management information. A recorder according to the invention has a check unit for checking, in advance, the possibility of after-recording operation of the recorder to the audio stream to be after-recorded with reference to the bit rate information of the audio attribute information.

Type: Grant

Filed: September 29, 2000

Date of Patent: November 12, 2002

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Tomoyuki Okada, Kaoru Murase, Noriko Sugimoto, Kazuhiro Tsuga
Audio/video reproducing apparatus and method

Publication number: 20020152082

Abstract: An audio/video reproducing apparatus is connectable to a communications network for selectively reproducing items of audio/video material from a recording medium in response to a request received via the communications network. The audio/video reproducing apparatus may comprise a control processor operable in use to receive data representing the request for the audio/video material item via the communications network. A reproducing processor is operable in response to signals identifying the audio/video material items from the control processor to reproduce the audio/video material items. The data identifying the audio/video material items includes meta data indicative of the audio/video material items. The meta data may be one of a Unique Material Identifier (UMID), a tape ID and time codes identifying the material items.

Type: Application

Filed: December 4, 2001

Publication date: October 17, 2002

Inventors: Vincent Carl Harradine, Alan Turner, Morgan William Amos David, Michael Williams, Mark John McGrath, Andrew Kydd, Jonathan Thorpe
Method and system for providing audio playback of a multi-source document

Patent number: 6446041

Abstract: A multi-source input and playback utility that accepts inputs from various sources, transcribes the inputs as text, and plays aloud user-selected portions of the text is disclosed. The user may select a portion of the text and request audio playback thereof. The utility examines each transcribed word in the selected text. If stored audio data is associated with a given word, that audio data is retrieved and played. If no audio data is associated, then a text-to-speech entry or series of entries is retrieved and played instead.

Type: Grant

Filed: October 27, 1999

Date of Patent: September 3, 2002

Assignee: Microsoft Corporation

Inventors: Jeffrey C. Reynar, Erik Rucker, Paul Kyong Hvan Kim
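
The per-word fallback described in the abstract for patent 6446041 above can be sketched as follows. The `(text, audio)` tuple layout and the `"recorded"`/`"tts"` labels are assumptions made for this example, and no real audio or text-to-speech engine is invoked.

```python
def playback_plan(selection):
    """Decide, word by word, whether to play stored audio or fall back to TTS.

    `selection` is a list of (text, audio) pairs, where `audio` is None for
    words that have no recording (for example, typed words).
    """
    plan = []
    for text, audio in selection:
        if audio is not None:
            plan.append(("recorded", audio))  # play the stored audio data
        else:
            plan.append(("tts", text))  # synthesize this word instead
    return plan


selection = [("meet", b"\x01\x02"), ("tomorrow", None), ("noon", b"\x03")]
plan = playback_plan(selection)
```
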
Method and arrangement for search and recording of media signals

Publication number: 20020120456

Abstract: The method and system are for locating and recording time-limited signal sequences in media channels that may contain undesirable signal components, e.g., recording music in radio transmissions. The signals are continuously buffered in a memory. The user identifies a desired source material. Out of this desired source material a section may be taken as a search key. The device may also select search keys automatically. If a second instance of the search key is detected, signal sequences that in time are connected to the search keys are compared. The signal sequences that by comparison are substantially identical are identified as belonging to the same, wanted, source material. Iterating the above procedure results in a longer and higher quality segment of source material than the initial common segment.

Type: Application

Filed: October 23, 2001

Publication date: August 29, 2002

Inventors: Jakob Berg, Rickard Berg, Tomas Ahrne
Method and apparatus for encoding and decoding pause information

Publication number: 20020111812

Abstract: At an audio source, pause information is added to audio data, the combination of which is subsequently packetized. The resulting packets are transmitted to an audio destination via a network in which different packets may be subjected to varying levels of delay. At the audio destination, the pause information may be used to insert pauses at appropriate times to accommodate the occurrence of delays in packet delivery. In one embodiment, pauses are inserted based on a hierarchy of pause types. During pauses, audio filler information may be injected. In this manner, the effects of variable network delays upon reconstructed audio may be mitigated.

Type: Application

Filed: February 9, 2001

Publication date: August 15, 2002

Inventors: Dale R. Buchholz, Bashar Jano, Ira Gerson
Sound device for expansion station

Patent number: 6427136

Abstract: A sound device includes a silent state detecting unit for detecting a silent state in a sound signal supplied by a personal computer; and a sound production preventing unit for preventing a sound from being produced from the sound signal supplied by the personal computer when the silent state is detected by the silent state detecting unit. By halting the production of a sound from the sound signal supplied from the personal computer when the silent state is detected, production of noise in a silent state is prevented so that the quality of sound in the expansion station is improved.

Type: Grant

Filed: September 1, 1998

Date of Patent: July 30, 2002

Assignee: Fujitsu Limited

Inventor: Toshiro Obitsu
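
The two units in the abstract for patent 6427136 above (silent state detection, then sound production prevention) can be illustrated in software, although the patent describes a hardware device. This sketch assumes the signal arrives as blocks of integer samples and that "silent" means every sample stays within a small threshold; both are assumptions made here.

```python
def gate_silence(blocks, threshold=0):
    """Return the blocks to be sounded, with None where output is halted.

    A block counts as silent when no sample exceeds `threshold` in
    magnitude (a hypothetical criterion chosen for this sketch).
    """
    gated = []
    for block in blocks:
        silent = all(abs(sample) <= threshold for sample in block)
        gated.append(None if silent else block)
    return gated


blocks = [[0, 0, 0], [0, 5, -3], [1, -1, 0]]
gated = gate_silence(blocks, threshold=1)
```
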
Method and apparatus for directing an audio file to a speech recognition program that does not accept such files

Patent number: 6421643

Abstract: The present invention relates to a method and apparatus for directing a pre-recorded audio file to a speech recognition program that does not normally accept such files, such as IBM Corporation's Via Voice™ speech recognition program. The method includes: (a) launching the speech recognition program to accept speech as if the speech recognition program were receiving live audio from a microphone; (b) finding a mixer utility associated with the sound card; (c) opening the mixer utility, the mixer utility having settings that determine an input source and an output path; (d) changing the settings of the mixer utility to specify a line-in input source and a wave-out output path; (e) activating a microphone input of the speech recognition software; and (f) initiating a media player associated with the computer to play the pre-recorded audio file into the line-in input source. The method may additionally save and restore the original configuration settings of the mixer utility.

Type: Grant

Filed: October 29, 1999

Date of Patent: July 16, 2002

Assignee: Custom Speech USA, Inc.

Inventors: Jonathan Kahn, Charles Qin, Nicholas A. Linden, James A. Sells
Method and apparatus for synchronizing a computer-animated model with an audio wave output

Patent number: 6408274

Abstract: A computer-animated image of a video model is stored for synchronized outputting with an audio wave. When receiving the audio wave representation, the model is dynamically varied under control of the audio wave, and outputted together with the audio wave. In particular, an image parameter is associated with the model. By measuring an actual audio wave amplitude, and mapping the amplitude in a multivalued or analog manner on the image parameter, the outputting is synchronized.

Type: Grant

Filed: September 1, 1998

Date of Patent: June 18, 2002

Assignee: Koninklijke Philips Electronics N.V.

Inventor: Douglas N. Tedd
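
The amplitude-to-parameter mapping in the abstract for patent 6408274 above could be sketched like this. The choice of a "mouth opening" parameter, the 16-bit full scale, and the eight discrete levels are all assumptions for illustration; the abstract only states that the mapping is multivalued or analog.

```python
def mouth_opening(samples, full_scale=32767, levels=8):
    """Map the peak amplitude of one audio window to a discrete image parameter.

    Returns an integer from 0 (closed) to levels - 1 (fully open).
    """
    peak = max((abs(sample) for sample in samples), default=0)
    ratio = min(peak / full_scale, 1.0)  # clamp in case of overshoot
    return round(ratio * (levels - 1))


quiet = mouth_opening([0, 0, 0])
medium = mouth_opening([10000, -4000])
loud = mouth_opening([32767])
```

Computing the parameter from the same window of samples that is about to be played is what keeps the image and the sound in step.
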
Communication apparatus

Publication number: 20020072919

Abstract: A background sound sending side multiplexes and sends, in a multiplexer, uttered encoded speech data generated in a speech sending section and encoded background sound data outputted from a background sound storing section. Simultaneously, a background sound reproducing section reproduces the encoded background sound data, and the reproduced background sound signal is superposed on received speech in a receiving section and outputted from a receiver. A background sound receiving side demultiplexes, in a demultiplexer, received multiplexed data into received encoded speech data and encoded background sound data which are decoded in the receiving section and the background sound reproducing section respectively, and in the receiving section, a sound in which received speech and background sound are superposed is outputted from a receiver.

Type: Application

Filed: February 27, 2001

Publication date: June 13, 2002

Inventor: Tohru Yokoyama
Apparatus and method using speech recognition and scripts to capture, author and playback synchronized audio and video

Publication number: 20020069073

Abstract: Audio associated with a video program, such as an audio track or live or recorded commentary, may be analyzed to recognize or detect one or more predetermined sound patterns, such as words or sound effects. The recognized or detected sound patterns may be used to enhance video processing, by controlling video capture and/or delivery during editing, or to facilitate selection of clips or splice points during editing.

Type: Application

Filed: October 25, 2001

Publication date: June 6, 2002

Inventor: Peter Fasciano
Plug-in modules for digital signal processor functionalities

Publication number: 20020065568

Abstract: An electronic processing device for producing digitally processed audio-signal effects is provided. The electronic processing device comprises an audio-signal input circuitry for receiving an audio input signal from a peripheral audio device, an audio-signal output circuitry for outputting the received audio-signal, the signal comprising a throughput signal after signal processing, a digital signal processor for applying audio-signal effects to the throughput audio-signal, one or more memory slots for receiving one or more modular memory components and an input control mechanism for controlling parameters of the throughput audio-signal. The one or more modular memory components are used as storage for externally sourced audio-signal effects such that when the one or more memory components are plugged into the electronic processing device, the processing device may utilize the effects applications stored on the one or more memory components in the processing of the throughput audio-signal.

Type: Application

Filed: November 30, 2000

Publication date: May 30, 2002

Inventors: Robert Denton Silfvast, Philip J.E. Campbell, Scott Silfvast, Mark David Goodwin, Andor Izsak
System and method for associating pre-recorded audio snippets with still photographic images

Patent number: 6397184

Abstract: A system for associating a prerecorded audio snippet with a photograph, includes: an audio data base containing a plurality of audio snippets, each snippet having a corresponding identification code. A scene identification display includes an identification code associated with the scene. A camera having a sensor for sensing the identification code on the display includes a memory for storing the identification code in association with a photograph of the scene taken by the camera. The audio snippet corresponding to the identification code is retrieved from the audio data base and reproduced in conjunction with the display of the photograph.

Type: Grant

Filed: October 24, 1996

Date of Patent: May 28, 2002

Assignee: Eastman Kodak Company

Inventor: Keith A. Walker
Audio dubbing system for digital audio recorder

Patent number: 6389399

Abstract: An audio dubbing system which is composed of an MD recorder 5 capable of recording digital data obtained by converting audio signals of respective tracks from one or plural CD(s) onto MD, and a personal computer 1 for causing the MD recorder 5 to record the digital audio signals, and in which the personal computer 1 is connected to a CD-ROM drive 12 for reading the audio signals from the CD(s) as digital data and to a hard disk drive 13 for storing the digital data read by the CD-ROM drive 12.

Type: Grant

Filed: July 30, 1999

Date of Patent: May 14, 2002

Assignee: Sanyo Electric Co., Ltd.

Inventor: Naotaka Yasuda
Speech manipulation for continuous speech playback over a packet network

Patent number: 6377931

Abstract: In a speech communications network, continuous play of audio packets is achieved using a jitter buffer in a receiver. Audio packets are stored in the jitter buffer before decoding the audio packets into an audible output. When the level of stored audio packets approaches the full capacity of the jitter buffer, the rate at which the audio packets are played out of the jitter buffer is increased, signaling a compression operation in the decoder. When the level of stored audio packets approaches an empty level of the jitter buffer, the rate at which the audio packets are played out of the jitter buffer is reduced, signaling an expansion operation in the decoder. Audio packets are not modified when the level of stored audio packets is within a predetermined range. A speed controller is provided to instruct the decoder to decode the audio packets according to either a compressed, expanded or normal audio packet status.

Type: Grant

Filed: September 28, 1999

Date of Patent: April 23, 2002

Assignee: Mindspeed Technologies

Inventor: Eyal Shlomot
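
The speed controller decision in the abstract for patent 6377931 above reduces to comparing the buffer level against two thresholds. The sketch below assumes thresholds at 25% and 75% of capacity; those values, and the function and status names, are illustrative choices, not figures from the patent.

```python
def playout_status(level, capacity, low=0.25, high=0.75):
    """Choose the decoder status from the jitter buffer fill level.

    `low` and `high` bound the predetermined range in which packets are
    left unmodified (both are hypothetical defaults).
    """
    fill = level / capacity
    if fill >= high:
        return "compressed"  # buffer nearly full: play out faster
    if fill <= low:
        return "expanded"  # buffer nearly empty: play out slower
    return "normal"


statuses = [playout_status(level, capacity=10) for level in (1, 5, 9)]
```
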
Signal transformation for aural classification

Patent number: 6366887

Abstract: A method transforms non-speech input signals into temporal, spectral and redundancy patterns resembling those of human speech.

Type: Grant

Filed: January 12, 1998

Date of Patent: April 2, 2002

Assignee: The United States of America as represented by the Secretary of the Navy

Inventors: William J. Zehner, R. Lee Thompson
Editing system and method and distribution medium

Patent number: 6356701

Abstract: Temporarily storing audio data to be reproduced in the block form, temporarily storing synthesized audio data in the block form, generating and supplying a reference signal, and calculating the first address of the block of audio data, the editing system of the present invention enables identifying the buffering position of the recording signal and then matching the position to the reproduced signal, resulting in quick editing.

Type: Grant

Filed: April 2, 1999

Date of Patent: March 12, 2002

Assignee: Sony Corporation

Inventors: Seiji Tanizawa, Satoru Tobita, Hideaki Miyauchi, Kazushi Sato, Keiji Hirai
Script development systems and methods useful therefor

Patent number: 6356867

Abstract: A system for generating scripts having verbal content. The system includes a computer having a user input receiver operative to receive a user's definition of a script for at least one computer-controllable animated physical figure. The script includes a plurality of interconnected script elements each representing an action performable by the computer-controllable animated figure. The script comprises at least one verbal script element representing a verbal action performable by the computer-controllable animated figure. A graphics interface is operative to generate a pictorial image of the script as the script is generated by the user. The graphics interface includes a drag and drop facility and a flowchart generating facility.

Type: Grant

Filed: January 4, 1999

Date of Patent: March 12, 2002

Assignee: Creator Ltd.

Inventors: Oz Gabai, Jacob Gabai, Nimrod Sandlerman
Method and apparatus for accommodating primary content audio and secondary content remaining audio capability in the digital audio production process

Patent number: 6351733

Abstract: The invention enables the inclusion of voice and remaining audio information at different parts of the audio production process. In particular, the invention embodies special techniques for VRA-capable digital mastering and accommodation of VRA by those classes of audio compression formats that sustain less losses of audio data as compared to any codecs that sustain comparable net losses equal or greater than the AC3 compression format. The invention facilitates an end-listener's voice-to-remaining audio (VRA) adjustment upon the playback of digital audio media formats by focusing on new configurations of multiple parts of the entire digital audio system, thereby enabling a new technique intended to benefit audio end-users (end-listeners) who wish to control the ratio of the primary vocal/dialog content of an audio program relative to the remaining portion of the audio content in that program.

Type: Grant

Filed: May 26, 2000

Date of Patent: February 26, 2002

Assignee: Hearing Enhancement Company, LLC

Inventors: William R. Saunders, Michael A. Vaudrey
Remote control and processing of wireless digital receiver

Patent number: 6349283

Abstract: An integrated receiver mixer system is disclosed wherein the plurality of wireless receivers is remotely controlled, and retained in synchronism, via reference and control signals outputted by the system mixer. Further, pairs of receivers are connected to each other, and to the mixer, in a manner which minimizes the requisite cabling therebetween.

Type: Grant

Filed: March 5, 1999

Date of Patent: February 19, 2002

Inventor: Glenn Sanders
Method and system for synchronization of decoded audio and video by adding dummy data to compressed audio data

Patent number: 6339760

Abstract: A method for editing audio data includes the steps of creating a header portion containing at least information for indicating a start of an audio unit to be decoded and having composite elements whose values are equal to those of the audio data to which dummy data is to be added, and creating the audio data composed of the dummy data to be ignored during a decoding time. The system for editing audio data is also provided for executing the editing method.

Type: Grant

Filed: April 27, 1999

Date of Patent: January 15, 2002

Assignee: Hitachi, Ltd.

Inventors: Eriko Koda, Kei Kudou
Apparatus and method using speech recognition and scripts to capture author and playback synchronized audio and video

Patent number: 6336093

Abstract: Audio associated with a video program, such as an audio track or live or recorded commentary, may be analyzed to recognize or detect one or more predetermined sound patterns, such as words or sound effects. The recognized or detected sound patterns may be used to enhance video processing, by controlling video capture and/or delivery during editing, or to facilitate selection of clips or splice points during editing.

Type: Grant

Filed: January 16, 1998

Date of Patent: January 1, 2002

Assignee: Avid Technology, Inc.

Inventor: Peter Fasciano
Sound effects affixing system and sound effects affixing method

Patent number: 6334104

Abstract: A sound effects affixing device which enables sound effects and background music to be affixed automatically in relation to inputted sentences. A keyword extraction device is provided with an onomatopoeia extraction measure, a sound source extraction measure, and a subjective words extraction measure, which extract keywords of the onomatopoeias, the sound source names, or the subjective words within inputted sentences. A sound retrieval device selects sound effects and music by these keywords, and the selected sound effects and music are outputted by an output sound control device synchronized with synthesized speech.

Type: Grant

Filed: September 3, 1999

Date of Patent: December 25, 2001

Assignee: NEC Corporation

Inventor: Sanae Hirai
Information recording and reproducing apparatus

Publication number: 20010047215

Abstract: When a point on a tablet is designated by a marker, a recording indication signal is on, and recording of sound data is started. When the marker is released from the tablet, the recording indication signal is off, the recording of the sound data is finished, and coordinate data of the point and the sound data are linked. When information, such as characters, is inputted on the tablet using the marker, coordinate data based on the input and the sound data are linked through the coordinate data of the point. When the information such as characters is designated using a marker for playback, the sound data is played back.

Type: Application

Filed: March 29, 2001

Publication date: November 29, 2001

Applicant: BROTHER KOGYO KABUSHIKI KAISHA

Inventor: Yoshiaki Komatsu
Apparatus and method using speech recognition and scripts to capture author and playback synchronized audio and video

Publication number: 20010047266

Abstract: Audio associated with a video program, such as an audio track or live or recorded commentary, may be analyzed to recognize or detect one or more predetermined sound patterns, such as words or sound effects. The recognized or detected sound patterns may be used to enhance video processing, by controlling video capture and/or delivery during editing, or to facilitate selection of clips or splice points during editing.

Type: Application

Filed: January 16, 1998

Publication date: November 29, 2001

Inventor: Peter Fasciano
Method and apparatus for synchronizing a computer-animated model with an audio wave output

Publication number: 20010041983

Abstract: A computer-animated image of a video model is stored for synchronized outputting with an audio wave. When receiving the audio wave representation, the model is dynamically varied under control of the audio wave, and outputted together with the audio wave. In particular, an image parameter is associated with the model. By measuring an actual audio wave amplitude, and mapping the amplitude in a multivalued or analog manner on the image parameter, the outputting is synchronized.

Type: Application

Filed: September 1, 1998

Publication date: November 15, 2001

Applicant: U.S. Philips Corporation

Inventor: Douglas N. Tedd
Recording apparatus, reproducing apparatus, and recording and/or reproducing apparatus

Patent number: 6311092

Abstract: An apparatus having a microphone, an analog to digital converting circuit, a semiconductor memory, an input device, and a controller. The analog to digital converting circuit converts an output signal from the microphone into a digital signal. The semiconductor memory stores the output signal from the analog to digital converting circuit, and the input device carries out input of a record start and a record end. The controller, according to the input from the input device, carries out control to start and stop writing into the semiconductor memory a digital signal from the analog to digital converting circuit. When the input device is operated and a predetermined time interval has passed, the controller starts writing the digital signal from the analog/digital conversion circuit into the semiconductor memory.

Type: Grant

Filed: September 8, 1997

Date of Patent: October 30, 2001

Assignee: Sony Corporation

Inventor: Eiichi Yamada
Method and apparatus for reproducing audio information

Publication number: 20010027399

Abstract: An audio information reproducing method and apparatus are provided in which audio information read from an audio information source is at first stored in a buffer memory, the stored audio information is then read out at a preset speed magnification, and reproduced upon receiving a reproducing speed conversion treatment. The method comprises sending a request for reading audio information to the audio information source in accordance with an amount of information accumulated in the buffer memory; reading a predetermined amount of audio information from the buffer memory in accordance with the preset speed magnification, and reproducing the predetermined amount of audio information after performing a reproducing speed conversion treatment on the audio information.

Type: Application

Filed: March 29, 2001

Publication date: October 4, 2001

Applicant: PIONEER CORPORATION

Inventors: Mitsuo Yasushi, Masatoshi Yanagidaira, Kunio Yarita
Recording and/or reproducing apparatus and recording apparatus

Patent number: 6289253

Abstract: A recording and/or reproducing apparatus includes a microphone, a semiconductor memory, an operating section and a controller. An output signal from the microphone is written in the semiconductor memory and the written signals are read out from the semiconductor memory. The operating section performs input processing for writing a digital signal outputted by an analog/digital converter, for reading out the digital signal stored in the semiconductor memory and for erasing the digital signal stored in the semiconductor memory. The control section controls the writing of the microphone output signal in the semiconductor memory based on an input from the operating section and the readout of the digital signal stored in the semiconductor memory.

Type: Grant

Filed: September 5, 1997

Date of Patent: September 11, 2001

Assignee: Sony Corporation

Inventor: Kenichi Iida
Audio storing and reproducing apparatus

Patent number: 6278900

Abstract: An audio storing and reproducing apparatus is provided that has a semiconductor chip and a control unit. The semiconductor chip includes a semiconductor memory having a plurality of storage areas, a single storage specifying input terminal, a single reproduce specifying input terminal, and a storage medium controller that controls storing of the audio data to the storage areas and readout of the audio data therefrom in accordance with the signal input to the storage specifying and reproduce specifying input terminals. The control unit includes a single storage area specifying switch, a single storage specifying switch, a single reproduce specifying switch and a control circuit.

Type: Grant

Filed: January 7, 1998

Date of Patent: August 21, 2001

Assignee: Casio Computer Co., Ltd.

Inventor: Fumikazu Aihara
Portable type voice reproducer and guide system using the reproducer

Publication number: 20010013002

Abstract: Each of control signal transmitters (21-2n) is provided for each of a plurality of guide objects and transmits a control signal for discrimination of the corresponding guide object. A control signal detector (13) starts execution thereof by operation of a start button (12). A control signal detector receives the control signals, each of which is supplied from the control signal transmitters and detects one of the control signals, which has the maximum level for use of selection of one kind of the voice-data corresponding to the detected control signal transmitter. A controller (14) has received a message class such as a language class of Japanese or English for example from a class selector (19). The controller makes and sends out a selection signal with a detected control signal and a selected language class to a voice-data take-out circuit (15).

Type: Application

Filed: February 8, 2001

Publication date: August 9, 2001

Applicant: NEC Corporation

Inventor: Yoshinobu Murai
Method and apparatus for an enhanced presentation aid

Patent number: 6272461

Abstract: A method and an apparatus for providing visual aid to a presenter involve converting the spoken words of a presenter into an electronic text format, electronically comparing the converted spoken words to electronically stored reference text to find text string matches, utilizing the text string matches between the converted spoken words and the reference text to determine a current location of the presentation with respect to the reference text, and delivering upcoming portions of the reference text to the presenter as needed to enable a continuous presentation by the presenter. A preferred presentation support system is incorporated into a portable personal computer that includes a speech recognition subsystem. The speech recognition subsystem allows a presentation to be tracked in real-time so that presentation support material can be automatically displayed to the presenter in synchronization with the in-progress presentation.

Type: Grant

Filed: March 22, 1999

Date of Patent: August 7, 2001

Assignee: Siemens Information and Communication Networks, Inc.

Inventors: Phillip C. Meredith, Christoph A. Aktas
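
The text-string matching step in the abstract for patent 6272461 above can be sketched as a search for the most recently spoken words inside the reference text. The exact-match rule, the three-word window, and the function names `locate` and `upcoming` are assumptions for this example; a practical system would need to tolerate recognition errors.

```python
def locate(reference, spoken, start=0, window=3):
    """Return the reference index just past the last `window` spoken words.

    Searches forward from `start`; returns None when no match is found.
    """
    tail = [word.lower() for word in spoken[-window:]]
    if not tail:
        return None
    ref = [word.lower() for word in reference]
    for i in range(start, len(ref) - len(tail) + 1):
        if ref[i:i + len(tail)] == tail:
            return i + len(tail)
    return None


def upcoming(reference, position, count=5):
    """Return the next `count` reference words to show the presenter."""
    return reference[position:position + count]


reference = "we choose to go to the moon in this decade".split()
spoken = ["uh", "go", "to", "the", "moon"]  # recognizer output, with a filler word
position = locate(reference, spoken)
prompt = upcoming(reference, position, count=3)
```

Matching only the trailing window lets the tracker ignore earlier filler words or misrecognitions.
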
Speeding up audio without changing pitch by comparing dominant frequencies

Patent number: 6266643

Abstract: A fast and economical method for speeding up an audio signal without changing pitch can be accomplished by eliminating unneeded information from an audio signal. First, the signal is divided into chunks (frames or subframes), on which a mathematical manipulation such as a Fourier transformation is performed to identify the amplitudes of the component sinusoids (sines and cosines). These absolute values of the sine and cosine amplitudes for each frequency are averaged together, and the highest value(s) represents the signature, or dominant frequency/frequencies. The dominant frequency/frequencies or signatures from one chunk are compared to those of the next, and when identical the latter unit is marked as redundant. The final step consists of discarding redundant chunks from the original data, thus providing a shortened signal for replay. The pitch will not change because the only modification to the original signal was the elimination of redundant data.

Type: Grant

Filed: March 3, 1999

Date of Patent: July 24, 2001

Inventors: Kenneth Canfield, Bruce deGraaf, Kathyrn deGraaf
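Illustrative sketch (not the patented implementation): the chunk-dropping idea in the abstract above can be approximated as follows. The sketch assumes a single dominant frequency per chunk, a naive discrete Fourier transform, and that a trailing partial chunk is kept unchanged; the names `dominant_bin` and `drop_redundant_chunks` are hypothetical.

```python
import cmath
import math


def dominant_bin(chunk):
    """Return the DFT bin (1..n/2) whose averaged |cosine| and |sine|
    amplitude is largest; this bin serves as the chunk's signature."""
    n = len(chunk)
    best_bin, best_score = 0, -1.0
    for k in range(1, n // 2 + 1):
        coeff = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                    for i, x in enumerate(chunk))
        score = (abs(coeff.real) + abs(coeff.imag)) / 2
        if score > best_score + 1e-9:  # small tolerance against rounding noise
            best_bin, best_score = k, score
    return best_bin


def drop_redundant_chunks(samples, chunk_size):
    """Discard every chunk whose signature equals that of the chunk before it."""
    kept = []
    previous = None
    full = len(samples) - len(samples) % chunk_size
    for start in range(0, full, chunk_size):
        chunk = samples[start:start + chunk_size]
        signature = dominant_bin(chunk)
        if signature != previous:
            kept.extend(chunk)
        previous = signature
    kept.extend(samples[full:])  # assumption: keep any trailing partial chunk
    return kept
```

For example, three identical 16-sample chunks of one tone followed by a chunk of a different tone would be shortened to one chunk of each tone.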
Methods and apparatus for automatically synchronizing electronic audio files with electronic text files

Patent number: 6260011

Abstract: Automated methods and apparatus for synchronizing audio and text data, e.g., in the form of electronic files, representing audio and text expressions of the same work or information are described. A statistical language model is generated from the text data. A speech recognition operation is then performed on the audio data using the generated language model and a speaker independent acoustic model. Silence is modeled as a word which can be recognized. The speech recognition operation produces a time indexed set of recognized words some of which may be silence. The recognized words are globally aligned with the words in the text data. Recognized periods of silence, which correspond to expected periods of silence, and are adjoined by one or more correctly recognized words are identified as points where the text and audio files should be synchronized, e.g., by the insertion of bi-directional pointers.

Type: Grant

Filed: March 20, 2000

Date of Patent: July 10, 2001

Assignee: Microsoft Corporation

Inventors: David E. Heckerman, Fileno A. Alleva, Robert L. Rounthwaite, Daniel Rosen, Mei-Yuh Hwang, Yoram Yaacovi, John L. Manferdelli
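Illustrative sketch (not the patented method): the idea of aligning recognized words with the text and synchronizing at recognized silences can be approximated with Python's `difflib.SequenceMatcher`. This is a matching-block heuristic standing in for the global alignment mentioned in the abstract above; the `<sil>` token, the `(token, start_seconds)` input shape, and the function name `sync_points` are assumptions made for this example.

```python
from difflib import SequenceMatcher

SIL = "<sil>"  # hypothetical marker for an expected or recognized silence


def sync_points(text_tokens, recognized):
    """text_tokens: words of the text with SIL where silence is expected.
    recognized: list of (token, start_seconds) from the recognizer.
    Returns (text_index, start_seconds) for each matched silence that is
    adjoined by at least one correctly recognized word."""
    rec_tokens = [tok for tok, _ in recognized]
    matcher = SequenceMatcher(a=text_tokens, b=rec_tokens, autojunk=False)
    points = []
    for block in matcher.get_matching_blocks():
        for offset in range(block.size):
            ti, ri = block.a + offset, block.b + offset
            if text_tokens[ti] != SIL:
                continue
            # A neighbor inside the same matching block was recognized correctly.
            left_ok = offset > 0 and text_tokens[ti - 1] != SIL
            right_ok = offset < block.size - 1 and text_tokens[ti + 1] != SIL
            if left_ok or right_ok:
                points.append((ti, recognized[ri][1]))
    return points
```

Each returned pair could then be stored as a pointer between a position in the text and a time in the audio.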
Apparatus for reading sounds, particularly recorded, that can be mounted on a vehicle

Publication number: 20010004716

Abstract: The invention relates to a portable apparatus or an apparatus mounted on a vehicle, for reading sounds recorded on supports, either according to an analog process on magnetically recorded supports, which is to say on cassettes, or according to a digital process on compact discs, comprising at least one reading assembly for the recordation supports and a broadcast assembly of sounds comprising an amplifier and at least one restitution element of sounds such as a loudspeaker.

Type: Application

Filed: November 29, 2000

Publication date: June 21, 2001

Inventor: Lyes Seba
Adaptive speech rate conversion without extension of input data duration, using speech interval detection

Patent number: 6236970

Abstract: A speech-rate converter slowing down input speech regularly monitors the data length of the input speech and the previously estimated extended output data length for the current rate scaling factor, computing new output data length estimates. The conversion rate is adaptively modified depending on the time lag between input and output speech so as to make input and output data lengths consistent without skipping any spoken input portions. Input signal power is monitored to discriminate speech and non-speech intervals, and the portions of input non-speech intervals exceeding a conversion-rate-dependent duration are deleted.

Type: Grant

Filed: December 22, 1998

Date of Patent: May 22, 2001

Assignee: Nippon Hoso Kyokai

Inventors: Atsushi Imai, Nobumasa Seiyama, Tohru Takagi
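Illustrative sketch (not the patented converter): the pause-trimming step in the abstract above, where the part of a non-speech interval beyond some allowed duration is deleted, might look like this. Classifying frames by a fixed power threshold and passing the allowed pause length as `max_pause_frames` are assumptions; how that length depends on the conversion rate is not specified here.

```python
def trim_pauses(frame_powers, power_threshold, max_pause_frames):
    """Return the indices of frames to keep.

    Frames with power below `power_threshold` are treated as non-speech.
    Within each non-speech run, only the first `max_pause_frames` frames
    are kept; the remainder of the run is deleted.
    """
    kept = []
    run = 0  # length of the current non-speech run
    for index, power in enumerate(frame_powers):
        if power < power_threshold:
            run += 1
            if run > max_pause_frames:
                continue  # excess pause: drop this frame
        else:
            run = 0
        kept.append(index)
    return kept


print(trim_pauses([5, 5, 0, 0, 0, 0, 5, 0, 5], power_threshold=1, max_pause_frames=2))
```

No speech frame is ever dropped, which matches the abstract's requirement that spoken input portions are not skipped.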
Continuous sound by concatenating selected digital sound segments

Patent number: 6230140

Abstract: Non-looped continuous sound made up of random sequencing of digital sound segments is generated by taking several short segments of an otherwise continuous sound and forming independent records of those short segments. The stored segments are re-assembled into a sound sequence of arbitrary length based on selecting the next sound segment according to some statistical algorithm. The selected algorithm may be simply a random or pseudo-random selection, or it may provide a probability weighting to emphasize some sound records over others, or some combination of factors also affected by external stimuli such as light, heat or operator input. Apparatus for generating random sequenced digital sound are disclosed. Another aspect of the invention is logical sequence sound in which the selection of sound segments proceeds according to a logical sequence which is programmable.

Type: Grant

Filed: June 11, 1998

Date of Patent: May 8, 2001

Inventors: Frederick E. Severson, Patrick A. Quinn
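Illustrative sketch (not the patented apparatus): the probability-weighted selection of stored segments described in the abstract above can be approximated with `random.choices` from the Python standard library. The function name `build_sequence` and its parameters are hypothetical, and segments are assumed to be non-empty lists of samples.

```python
import random


def build_sequence(segments, weights, total_samples, seed=None):
    """Concatenate randomly chosen segments until `total_samples` is reached.

    Returns (samples, order), where `order` lists the chosen segment indices.
    """
    if not segments or any(len(seg) == 0 for seg in segments):
        raise ValueError("segments must be non-empty lists of samples")
    rng = random.Random(seed)
    samples, order = [], []
    while len(samples) < total_samples:
        # Weighted pseudo-random pick of the next segment.
        index = rng.choices(range(len(segments)), weights=weights, k=1)[0]
        order.append(index)
        samples.extend(segments[index])
    return samples[:total_samples], order
```

Equal weights give plain pseudo-random sequencing; adjusting the weights at run time would be one way to let external input influence the selection.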
Reading system

Patent number: 6199042

Abstract: A reading system includes a computer and a mass storage device and software including instructions for causing a computer to accept an image file generated from optically scanning an image of a document. The software converts the image file into a converted text file that includes text information, and positional information associating the text with the position of its representation in the image file. The software records the voice of an operator of the reading machine as a series of voice samples in synchronization with a highlighting indicia applied to a displayed representation of the document and stores the series of voice samples in a data structure that associates the voice samples with displayed representation. The reading machine plays back the stored, recorded voice samples corresponding to words in the document as displayed by the monitor while highlighting is applied to the words in the displayed document.

Type: Grant

Filed: June 19, 1998

Date of Patent: March 6, 2001

Assignee: L&H Applications USA, Inc.

Inventor: Raymond C. Kurzweil
Messaging server language configuration method and apparatus

Patent number: 6192344

Abstract: A method for adding a spoken language for output generated by a messaging program including a voice messaging program and a voice messaging program running without re-compiling the messaging program includes providing the voice messaging program configured to generate an output message, providing the language server to receive the output message, to receive an ordered plurality of phrase references, to use phrase references from the ordered plurality of phrase references to identify a plurality of spoken phrases, and to output the plurality of spoken phrases, installing a set of language configuration data in a directory in the memory, the set of language configuration data configured to specify an ordered plurality of phrase references to the language server in response to the output message, installing a set of phrase files in a second directory in the memory, each phrase file in the set having an associated phrase reference and configured to store a unique spoken phrase, the set of language configuration d

Type: Grant

Filed: December 16, 1998

Date of Patent: February 20, 2001

Assignee: Altigen Communications, Inc.

Inventors: Scott Lee, Thiagarajan Rajagopalan, Chiaming Jen
System for editing digital video and audio information

Patent number: 6185538

Abstract: A system for non-linearly editing video and audio information, uses a device for recognizing speech in the audio information and for generating a character sequence, particularly an ASCII character sequence, to produce an edit decision list (EDL). The generated character sequence is displayed on the display screen of an indicator. With reference to marked parts of the character sequence displayed on the display screen of the indicator, editing data is derived for the EDL.

Type: Grant

Filed: July 29, 1998

Date of Patent: February 6, 2001

Assignee: US Philips Corporation

Inventor: Axel Schulz
Dense edit re-recording to reduce file fragmentation

Patent number: 6182200

Abstract: The present invention is a method and apparatus for re-recording audio events. Scattered audio events on a first track are determined based on a linked list. The scattered audio events are merged into a combined audio event on a second track. The combined audio event is copied on the second track to the first track.

Type: Grant

Filed: December 23, 1999

Date of Patent: January 30, 2001

Assignees: Sony Corporation, Sony Electronics, Inc.

Inventors: Roger Mather Duvall, Jeffrey Mark Claar
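Illustrative sketch (not the patented method): the three steps in the abstract above (walk a linked list of scattered events, merge them on a second track, copy the result back) might be modelled as follows. Representing tracks as Python lists, the `AudioEvent` node type, and the `dest_offset` parameter are all assumptions made for this example.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class AudioEvent:
    offset: int                      # where the event starts on the track
    length: int                      # number of samples in the event
    next: Optional["AudioEvent"] = None


def rerecord_dense(track: List[int], head: Optional[AudioEvent],
                   scratch: List[int], dest_offset: int = 0) -> Optional[AudioEvent]:
    """Merge the events reachable from `head` into one contiguous event."""
    if head is None:
        return None
    # Steps 1 and 2: follow the linked list and merge events onto the second track.
    scratch.clear()
    node = head
    while node is not None:
        scratch.extend(track[node.offset:node.offset + node.length])
        node = node.next
    # Step 3: copy the combined event back to the first track.
    track[dest_offset:dest_offset + len(scratch)] = scratch
    return AudioEvent(dest_offset, len(scratch))
```

After the call, playback needs a single contiguous read instead of one seek per scattered event.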
Distributed voice capture and recognition system

Patent number: 6178403

Abstract: A hand-held data acquisition device includes a display presenting at least one of an address book, a date book, a memo pad, a to-do list, a contact manager, an expense tracker, an e-mail client, and a project manager, at least one of which contains multiple data items. An input device operatively connected to the device is suitable to receive voice data from the user. The data acquisition device stores the voice data and associates the voice data with at least one of the data items.

Type: Grant

Filed: December 16, 1998

Date of Patent: January 23, 2001

Assignee: Sharp Laboratories of America, Inc.

Inventor: Michael J. Detlef
Capture and application of sender voice dynamics to enhance communication in a speech-to-text environment

Patent number: 6175820

Abstract: A method for providing voice dynamics of human utterances converted to and represented by text within a data processing system. A plurality of predetermined parameters for recognition and representation of dynamics in human utterances are selected. An enhanced human speech recognition software program is created implementing the predetermined parameters on a data processing system. The enhanced software program includes an ability to monitor and record human voice dynamics and provide speech-to-text recognition. The dynamics in a human utterance are captured utilizing the enhanced human speech recognition software. The human utterance is converted into a textual representation utilizing the speech-to-text ability of the software. Finally, the dynamics are merged along with the textual representation of the human utterance to produce a marked-up text document on the data processing system.

Type: Grant

Filed: January 28, 1999

Date of Patent: January 16, 2001

Assignee: International Business Machines Corporation

Inventor: Timothy Alan Dietz
Method and apparatus for selecting information signal range and editing apparatus for information signal

Patent number: 6167350

Abstract: A method for selecting a range of an information signal comprises the steps of detecting the area in which the information signal is specified among a plurality of range-specifying areas displayed on a display device and selecting the information signal range in a unit of the information signal determined in accordance with the range-specifying area in which the specification is executed. The unit of information signal to be selected is different according to which area is selected, so that at least two units of information signal are available for selection.

Type: Grant

Filed: December 12, 1997

Date of Patent: December 26, 2000

Assignee: Sony Corporation

Inventors: Akihiko Hiramatsu, Toshiyuki Yamazaki
Speech-recognition-assisted selective suppression of silent and filled speech pauses during playback of an audio recording

Patent number: 6161087

Abstract: A method for playback of speech in an audio recording. The method comprises performing full word-level recognition of the speech including recognition of silent pauses and filled pauses, suppressing playback of the filled pauses and silent pauses, alerting a listener of the audio recording to locations of suppressed filled pauses and silent pauses during playback of the audio recording, and accepting a user command to disable suppression of any filled pause or silent pause during playback of the audio recording.

Type: Grant

Filed: October 5, 1998

Date of Patent: December 12, 2000

Assignee: Lernout & Hauspie Speech Products N.V.

Inventors: Colin W. Wightman, Joan Bachenko
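Illustrative sketch (not the patented method): given word-level recognition output that labels silent and filled pauses, the playback planning described in the abstract above could be approximated as follows. The labels `"SIL"` and `"FP"`, the `(label, start, end)` tuple shape, and the `keep_pauses` set standing in for the user's "disable suppression" command are assumptions for this example.

```python
PAUSE_LABELS = {"SIL", "FP"}  # hypothetical labels: silent pause, filled pause


def plan_playback(tokens, keep_pauses=frozenset()):
    """tokens: list of (label, start_seconds, end_seconds) in time order.
    keep_pauses: indices of pauses the listener asked not to suppress.
    Returns (segments_to_play, suppressed_start_times)."""
    segments, suppressed = [], []
    for index, (label, start, end) in enumerate(tokens):
        if label in PAUSE_LABELS and index not in keep_pauses:
            suppressed.append(start)  # location where the listener can be alerted
            continue
        if segments and segments[-1][1] == start:
            segments[-1] = (segments[-1][0], end)  # extend contiguous segment
        else:
            segments.append((start, end))
    return segments, suppressed


tokens = [("so", 0.0, 0.4), ("FP", 0.4, 0.9), ("we", 0.9, 1.1),
          ("SIL", 1.1, 2.0), ("begin", 2.0, 2.5)]
print(plan_playback(tokens))
```

The list of suppressed start times is what a player could use to signal, for instance with a short tone, that a pause was removed at that point.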
Device for phonological training

Patent number: 6151577

Abstract: The subject invention concerns a system for phonological training comprising a sound reception device (1), an operating device (5) for controlling the system, interpreting and processing devices (2), and a presentation device (3). The presentation device (3) includes a display screen divided into a plurality of windows (11-17) for simultaneous presentation of a graphic reproduction of the desired sound as well as of the sound produced by the user and received by the sound reception device (1), and of an animated reproduction of speech organs. The system is adapted to reproduce the sound by field(s) (41, 42, 51, 52), the longitudinal extension of the field(s) in one direction reflecting the time during which the sound is produced and the graphic display content within each field, such as colours, shading or the like, denoting the place of formation of the sound in the oral cavity.

Type: Grant

Filed: June 25, 1999

Date of Patent: November 21, 2000

Assignee: Ewa Braun

Inventor: Ewa Braun
Method and arrangement for simultaneous recording of incoming and outgoing voice signals with compression of silence periods

Patent number: 6138091

Abstract: This invention relates to a method by means of which more than one audio signal can be recorded in compressed form in a memory element, and to a system implementing such a method. In the system according to the invention, audio signal samples are recorded only when voice is detected in the audio signals. The system according to the invention saves memory capacity required by the recording by combining the audio signal samples when voice is detected in samples of more than one audio signal. Furthermore, an audio signal is not recorded when no voice is detected in the signal. The invention also reduces the average computing capacity needed and thus power consumption, since signal combination, or mixing, is advantageously performed only when voice is detected in the samples of more than one audio signal.

Type: Grant

Filed: December 17, 1997

Date of Patent: October 24, 2000

Assignee: Nokia Mobile Phones Ltd.

Inventors: Tero Haataja, Ari Sinisalo
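Illustrative sketch (not the patented system): the recording rule in the abstract above (store nothing when no voice is present, store one signal when only one carries voice, and mix only when both do) might be expressed like this. Detecting voice by mean frame energy against a fixed threshold and mixing by simple addition are assumptions; the name `record_compressed` is hypothetical.

```python
def record_compressed(incoming, outgoing, energy_threshold):
    """incoming, outgoing: equal-length lists of frames (lists of samples).
    Returns a list of (frame_index, samples) for frames that were recorded."""

    def has_voice(frame):
        # Assumption: mean energy above a threshold counts as voice.
        return sum(s * s for s in frame) / len(frame) > energy_threshold

    recorded = []
    for index, (rx, tx) in enumerate(zip(incoming, outgoing)):
        rx_voice, tx_voice = has_voice(rx), has_voice(tx)
        if rx_voice and tx_voice:
            # Mixing is done only when both signals carry voice.
            recorded.append((index, [a + b for a, b in zip(rx, tx)]))
        elif rx_voice:
            recorded.append((index, list(rx)))
        elif tx_voice:
            recorded.append((index, list(tx)))
        # Neither signal has voice: the frame is not recorded at all.
    return recorded
```

Keeping the frame index alongside the samples is one way to let playback restore the silent gaps that were never stored.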
Apparatus and method for reproducing recorded signals by using recording medium

Patent number: 6134526

Abstract: An apparatus and a method for reproducing recorded signals by using a recording medium. Language learning is carried out using a general recorded medium, such as a cassette tape or video tape with movies or music recorded thereon, thereby improving learning efficiency. In the method for reproducing recorded signals of a recording medium in a language learning apparatus, a control section switches a first switch in accordance with a reproduction command from a reproduction key inputting section, so that the audio signals of an audio signal processing section are supplied to a speaker. Further, the control section turns on a second switch in accordance with a voice recognition command from a voice recognition key inputting section, so that the voices from a voice detecting section are supplied to and stored in a voice recognizing section.

Type: Grant

Filed: May 5, 1998

Date of Patent: October 17, 2000

Assignee: Samsung Electronics Co., Ltd.

Inventor: Yong Ho Kim
