Sound Editing Patents (Class 704/278)

User interfaces for editing audio clips

Patent number: 8843375

Abstract: Methods, systems and apparatus for editing audio clips. A computer-implemented method includes displaying in a user interface, a first audio clip including a first plurality of time instants and a second audio clip including a second plurality of time instants; displaying a first transition point identifier associated with the first audio clip to designate a portion from a beginning of the first audio clip to the first transition point identifier that is playable; displaying a second transition point identifier associated with the second audio clip to designate a portion from the second transition point identifier to an end of the second audio clip that is playable; and generating a combined audio clip comprising the portion from the beginning of the first audio clip to the first transition point identifier and the portion from the second transition point identifier to the end of the second audio clip.

Type: Grant

Filed: December 19, 2008

Date of Patent: September 23, 2014

Assignee: Apple Inc.

Inventor: Randy Ubillos
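
The combining step described in the abstract above can be illustrated with a minimal sketch. It assumes each clip is a plain list of samples and each transition point is a sample index; the function name `combine_clips` and its parameters are hypothetical and do not come from the patent.

```python
def combine_clips(first, second, first_transition, second_transition):
    """Join the start of `first` (up to its transition point) with the
    tail of `second` (from its transition point to the end)."""
    if not 0 <= first_transition <= len(first):
        raise ValueError("first_transition lies outside the first clip")
    if not 0 <= second_transition <= len(second):
        raise ValueError("second_transition lies outside the second clip")
    # Playable portion of clip 1, followed by playable portion of clip 2.
    return first[:first_transition] + second[second_transition:]


clip_a = [1, 2, 3, 4]
clip_b = [5, 6, 7, 8]
combined = combine_clips(clip_a, clip_b, 3, 2)  # [1, 2, 3, 7, 8]
```

A real editor would likely crossfade around the transition points; this sketch only shows the hard cut that the abstract's wording implies.
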
Hierarchical quick note to allow dictated code phrases to be transcribed to standard clauses

Patent number: 8831940

Abstract: A dictation system that allows using trainable code phrases is provided. The dictation system operates by receiving audio and recognizing the audio as text. The text/audio may contain code phrases that are identified by a comparator that matches the text/audio and replaces the code phrase with a standard clause that is associated with the code phrase. The database or memory containing the code phrases is loaded with matched standard clauses that may be identified to provide a hierarchical system such that certain code phrases may have multiple meanings depending on the user.

Type: Grant

Filed: March 21, 2011

Date of Patent: September 9, 2014

Assignee: NVOQ Incorporated

Inventors: Charles Corfield, Brian Marquette, David Mondragon, Rebecca Heins
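
One way to picture the hierarchical lookup in the abstract above is a per-user table layered over a shared one. The sketch below assumes recognition has already produced text; the table layout, the phrase "quick note normal exam", the user name, and the function `expand_code_phrases` are all hypothetical illustrations, not the patented implementation.

```python
import re

# Hypothetical clause tables: a shared level plus per-user overrides.
STANDARD_CLAUSES = {
    "global": {
        "quick note normal exam": "The physical examination was within normal limits.",
    },
    "user:dr_smith": {
        "quick note normal exam": "Examination unremarkable; no acute findings.",
    },
}


def expand_code_phrases(text, user, clauses=STANDARD_CLAUSES):
    """Replace each code phrase in `text` with its standard clause,
    letting the user's own table override the shared one."""
    merged = dict(clauses.get("global", {}))
    merged.update(clauses.get(f"user:{user}", {}))
    # Try longer phrases first so a short phrase cannot split a long one.
    for phrase in sorted(merged, key=len, reverse=True):
        pattern = re.compile(re.escape(phrase), re.IGNORECASE)
        # A callable replacement avoids backslash handling in the clause text.
        text = pattern.sub(lambda match, clause=merged[phrase]: clause, text)
    return text
```

The same spoken phrase therefore expands differently for `dr_smith` than for a user with no overrides, which is the "multiple meanings depending on the user" behavior the abstract describes.
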
Customized audio data for verifying the authenticity of a service provider

Patent number: 8825487

Abstract: A method and a system for identity authentication are presented. In one example embodiment, audio data (e.g. a sound wave) may be received from a user. The audio data may be used to establish an identity of a first entity to the user. The audio data may be stored at a storage location; and be presented to the user to establish the identity of the first entity when the first entity participates in an electronic communication with the user. In another example embodiment, a server (e.g., a web client or client application server) may present a plurality of audio data instances to a user; receive the user selection of selected audio data from the plurality of audio data instances; responsive to the user selection, the server may communicate, via a network, the selected audio data to another server. The selected audio data may be used as an identity authentication.

Type: Grant

Filed: December 18, 2006

Date of Patent: September 2, 2014

Assignee: eBay Inc.

Inventor: Yihong Zhang
Mobile replacement-dialogue recording system

Patent number: 8802957

Abstract: A mobile replacement-dialogue recording system enables the creation of replacement-dialogue items by mobile users not at a media recording studio. Studio-users prepare guide media video, audio and text data which are made available to mobile users through a media server. A mobile user's mobile replacement-dialogue recording device obtains guide media and allows the user to view the guide media in rehearsal mode. The mobile replacement-dialogue recording device then records the mobile user's dialogue performance while presenting the mobile user with synchronized guide media. The mobile user can review, delete, and rerecord the resulting potential replacement dialogue, as well as create feedback media characterizing the replacement dialogue. Selected replacement dialogue items can be transmitted to the media server. A studio-module can then obtain the selected replacement dialogue items and feedback media from the media server so that they may be used in media-replacement.

Type: Grant

Filed: September 3, 2010

Date of Patent: August 12, 2014

Assignee: Boardwalk Technology Group, LLC

Inventors: Sean C Barker, Gary A Randall, Timothy Scott Bogart
Out-of-Band Notification of Muting During Voice Activity

Publication number: 20140222437

Abstract: Apparatus having corresponding methods and computer-readable media comprise: a muter configured to pass or block an audio signal; a voice activity detector configured to detect voice activity in the audio signal; and a vibrator configured to produce a mechanical vibration responsive to the contemporaneous occurrence of i) the voice activity detector detecting the voice activity in the audio signal; and ii) the muter being configured to block the audio signal.

Type: Application

Filed: February 1, 2013

Publication date: August 7, 2014

Applicant: PLANTRONICS, INC.

Inventors: Joe Burton, Shantanu Sarkar, Michael Gjerstad, Richard A. Dunning, JR.
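
The trigger condition in the abstract above is a simple conjunction: vibrate only while voice activity is detected and the muter is blocking the signal. The sketch below assumes an energy-threshold voice activity detector, which is an illustrative stand-in; the abstract does not say how voice activity is detected, and the names `detect_voice`, `should_vibrate`, and the threshold value are hypothetical.

```python
def detect_voice(frame, threshold=0.02):
    """Crude energy-based voice activity check (an assumed detector)."""
    if not frame:
        return False
    energy = sum(sample * sample for sample in frame) / len(frame)
    return energy > threshold


def should_vibrate(voice_active, muted):
    """Vibrate only when the user is speaking while the signal is blocked."""
    return voice_active and muted


speech_frame = [0.5, -0.5, 0.4]
quiet_frame = [0.0, 0.001, -0.001]
notify = should_vibrate(detect_voice(speech_frame), muted=True)  # True
```
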
Systems and methods for editing telecom web applications through a voice interface

Patent number: 8788272

Abstract: Systems and associated methods for editing telecom web applications through a voice interface are described. Systems and methods provide for editing telecom web applications over a connection, as for example accessed via a standard phone, using speech and/or DTMF inputs. The voice based editing includes exposing an editing interface to a user for a telecom web application that is editable, dynamically generating a voice-based interface for a given user for accomplishing editing tasks, and modifying the telecom web application to reflect the editing commands entered by the user.

Type: Grant

Filed: November 17, 2010

Date of Patent: July 22, 2014

Assignee: International Business Machines Corporation

Inventors: Sheetal K. Agarwal, Arun Kumar, Priyanka Manwani
System and method using feedback speech analysis for improving speaking ability

Patent number: 8756057

Abstract: A speech analysis system and method for analyzing speech. The system includes: a voice recognition system for converting inputted speech to text; an analytics system for generating feedback information by analyzing the inputted speech and text; and a feedback system for outputting the feedback information.

Type: Grant

Filed: November 2, 2005

Date of Patent: June 17, 2014

Assignee: Nuance Communications, Inc.

Inventors: Steven Michael Miller, Anne R. Sand
Multi-take compositing of digital media assets

Patent number: 8751022

Abstract: Methods, graphical user interfaces, computer apparatus and computer readable medium for producing media content are disclosed. For example, a user of a computing device can utilize the methods, graphical user interfaces, computer apparatus, and computer readable medium to edit the media content. In one embodiment, the media content pertains to media tracks, such as audio or video tracks. The media content can be a plurality of individual media tracks that can be segmented and the resulting segments from different media tracks can be combined into a composite media track.

Type: Grant

Filed: April 14, 2007

Date of Patent: June 10, 2014

Assignee: Apple Inc.

Inventor: Aaron Eppolito
Method and system for enhancing a speech database

Patent number: 8744851

Abstract: A system, method and computer readable medium that enhances a speech database for speech synthesis is disclosed. The method may include labeling audio files in a primary speech database, identifying segments in the labeled audio files that have varying pronunciations based on language differences, identifying replacement segments in a secondary speech database, enhancing the primary speech database by substituting the identified secondary speech database segments for the corresponding identified segments in the primary speech database, and storing the enhanced primary speech database for use in speech synthesis.

Type: Grant

Filed: August 13, 2013

Date of Patent: June 3, 2014

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Alistair Conkie, Ann K Syrdal
Encoder, decoder, encoding method, and decoding method

Patent number: 8737435

Abstract: An encoder includes a precoder for encoding an input information object according to a preset encoding scheme and storing the encoded information object in a precoder buffer, a sample number/address generation unit for generating a sample number of each sample and an address, which corresponds to each bit of each sample and the address of the precoder buffer, a multiplexer for selecting a bit of the precoder buffer corresponding to the address generated by the sample number/address generation module, a sampling buffer for storing a bit of each sample output from the multiplexer, a control packet generation module for generating a control packet including information on the sample number generated by the sample number/address generation module, a packet assembling unit for assembling the sample stored in the sampling buffer with the control packet generated by the control data generation module, and a modulation module for modulating the packet output from the packet assembling unit into a sound signal accordi

Type: Grant

Filed: May 18, 2010

Date of Patent: May 27, 2014

Assignee: Samsung Electronics Co., Ltd.

Inventors: Hee-Won Jung, Seung-Gun Park, Gi-Sang Lee, Jun-Ho Koh, Sang-Mook Lee, Sergey Zhidkov
Messaging translation service application servers and methods for use in message translations

Patent number: 8738358

Abstract: A method for message translation and a Messaging Translation Service Application Server (MTS AS) are provided for translating messages exchanged with, and among, social network services such as Facebook™ and Twitter™. According to the invention, a message written in a first language by a user is received by a first social media network, which further obtains from other social media network(s) information related to a language used therein. Then, the first social media network requests translation of the message from the first language into the language used by the other social network systems, and further sends the translated message to the other social network systems.

Type: Grant

Filed: December 24, 2010

Date of Patent: May 27, 2014

Assignee: Telefonaktiebolaget L M Ericsson (Publ)

Inventors: Zhongwen Zhu, Patrick Parent
System and method for winding audio content using a voice activity detection algorithm

Patent number: 8731914

Abstract: A system and method for locating a preferable playback start location after a winding or rewinding action in an audio playing device. In response to an adjustment of the playing location for audio content to a desired playing position, the system determines whether at least one non-speech or silent period of at least a predetermined duration exists within the vicinity of the desired playing position. If at least one such non-speech or silent period exists within the vicinity of the desired playing position, the system adjusts the playing position to fall within one of the at least one non-speech period or silent period.

Type: Grant

Filed: November 15, 2005

Date of Patent: May 20, 2014

Assignee: Nokia Corporation

Inventors: Janne Vainio, Hannu J. Mikkola, Jari M. Makinen
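
The position adjustment in the abstract above can be sketched as follows, assuming a voice activity detector has already produced a list of non-speech periods as `(start, end)` pairs in seconds. The function name `snap_to_silence` and the default `vicinity` and `min_duration` values are hypothetical; picking the closest qualifying period is one possible policy, not necessarily the patent's.

```python
def snap_to_silence(desired, silences, vicinity=2.0, min_duration=0.3):
    """Return a playback position inside a nearby silent period,
    or `desired` unchanged if no suitable period is close enough."""
    best = None  # (distance, position) of the closest candidate so far
    for start, end in silences:
        if end - start < min_duration:
            continue  # period too short to count
        # Closest point of this period to the desired position.
        nearest = min(max(desired, start), end)
        distance = abs(nearest - desired)
        if distance <= vicinity and (best is None or distance < best[0]):
            best = (distance, nearest)
    return desired if best is None else best[1]


periods = [(3.0, 3.1), (11.0, 11.5), (20.0, 21.0)]
position = snap_to_silence(10.0, periods)  # 11.0
```
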
Computer-implemented system and method for identifying and masking special information within recorded speech

Patent number: 8731938

Abstract: A computer-implemented system and method for identifying and masking special information within recorded speech is provided. A field for entry of special information is identified. Movement of a pointer device along a trajectory towards the field is also identified. A correlation of the pointer device movement and entry of the special information is determined based on a location of the trajectory in relation to the field. A threshold is applied to the correlation. The special information is received as verbal speech. A recording of the special information is rendered unintelligible when the threshold is satisfied.

Type: Grant

Filed: April 26, 2013

Date of Patent: May 20, 2014

Assignee: Intellisist, Inc.

Inventor: G. Kevin Doren
Systems, methods and automated technologies for translating words into music and creating music pieces

Patent number: 8731943

Abstract: Systems, methods and computer program products are provided for translating a natural language into music. Through systematic parsing, music compositions can be created. These compositions can be created by one or more persons who do not speak the same natural language.

Type: Grant

Filed: February 5, 2010

Date of Patent: May 20, 2014

Assignee: Little Wing World LLC

Inventors: Nicolle Ruetz, David Warhol
Recording and/or reproducing apparatus and recording apparatus

Patent number: 8725281

Abstract: A recording and/or reproducing apparatus includes a microphone, a semiconductor memory, an operating section and a controller. An output signal from the microphone is written in the semiconductor memory and the written signals are read out from the semiconductor memory. The operating section performs input processing for writing a digital signal outputted by an analog/digital converter, reading out the digital signal stored in the semiconductor memory and for erasing the digital signal stored in the semiconductor memory. The control section controls the writing of the microphone output signal in the semiconductor memory based on an input from the operating section and the readout of the digital signal stored in the semiconductor memory.

Type: Grant

Filed: October 16, 2012

Date of Patent: May 13, 2014

Assignee: Sony Corporation

Inventor: Kenichi Iida
Indexing digitized speech with words represented in the digitized speech

Patent number: 8706490

Abstract: Indexing digitized speech with words represented in the digitized speech, with a multimodal digital audio editor operating on a multimodal device supporting modes of user interaction, the modes of user interaction including a voice mode and one or more non-voice modes, the multimodal digital audio editor operatively coupled to an ASR engine, including providing by the multimodal digital audio editor to the ASR engine digitized speech for recognition; receiving in the multimodal digital audio editor from the ASR engine recognized user speech including a recognized word, also including information indicating where, in the digitized speech, representation of the recognized word begins; and inserting by the multimodal digital audio editor the recognized word, in association with the information indicating where, in the digitized speech, representation of the recognized word begins, into a speech recognition grammar, the speech recognition grammar voice enabling user interface commands of the multimodal digital au

Type: Grant

Filed: August 7, 2013

Date of Patent: April 22, 2014

Assignee: Nuance Communications, Inc.

Inventors: Charles W. Cross, Frank L. Jania
Real-time versioning of device-bound content

Patent number: 8700409

Abstract: Subject matter described herein relates to providing to a mobile device a version of content (e.g., music, video, text message, live call, etc.) that is consistent with a user's filter setting. That is, a user is allowed to specify content elements (e.g., words or images) that are proscribed from being presented on the mobile device, and the user's preferences are stored by a mobile telecommunications network. When the network receives content to be provided to the mobile device, the network edits the content in real time to prevent proscribed elements from being presented on the mobile device.

Type: Grant

Filed: November 1, 2010

Date of Patent: April 15, 2014

Assignee: Sprint Communications Company L.P.

Inventors: Carl J. Persson, Jeremy Richard Breau, Eric Eugene Miller, Sei Yen Ng
Automatic realtime speech impairment correction

Patent number: 8682678

Abstract: Automatic correcting of user's speech impairment in speech may include obtaining the audio signal of a given user's speech, and analyzing the obtained audio signal to identify artifacts caused by the user's impairment. The obtained audio signal may be modified by eliminating the identified artifacts from it. The modified audio signal may be provided, e.g., to be played or broadcast or transmitted.

Type: Grant

Filed: March 14, 2012

Date of Patent: March 25, 2014

Assignee: International Business Machines Corporation

Inventors: Peter K. Malkin, Sharon M. Trewin
World stage for pitch-corrected vocal performances

Patent number: 8682653

Abstract: Techniques have been developed to facilitate the capture of vocal performances on handheld or other portable computing devices and, in some cases, the pitch-correction and mixing of such vocal performances with backing tracks for audible rendering on such devices. Captivating visual animations and/or facilities for listener comment and ranking are provided in association with an audible rendering of a performance, e.g., a vocal performance captured and pitch-corrected at another similarly configured mobile device and mixed with backing instrumentals and/or vocals. Geocoding of captured vocal performances and/or listener feedback may facilitate animations or display artifacts in ways that are suggestive of a performance or endorsement emanating from a particular geographic locale on a user manipulable globe. In this way, implementations of the described functionality can transform otherwise mundane mobile devices into social instruments that foster a unique sense of global connectivity and community.

Type: Grant

Filed: September 4, 2010

Date of Patent: March 25, 2014

Assignee: Smule, Inc.

Inventors: Spencer Salazar, Rebecca A. Fiebrink, Ge Wang, Mattias Ljungström, Jeffrey C. Smith, Jeannie Yang
Apparatus and method for improving communication sound quality in mobile terminal

Patent number: 8682657

Abstract: An apparatus and a method for improving communication sound quality in a mobile terminal in order to remove a neighboring noise that occurs together with a user's voice signal in a mobile terminal by discriminating signals occurring at different distances using two microphones and removing a noise. The mobile terminal preferably includes a first microphone, a second microphone, and a voice processor. The first microphone receives a voice signal occurring at a closer distance from the mobile terminal and a voice signal occurring at a longer distance from the mobile terminal. The second microphone receives only a voice signal occurring at the long distance. The voice processor discriminates between the signal occurring at the long distance and the signal occurring at the close distance by receiving voice signals received via the first microphone and the second microphone at different phases.

Type: Grant

Filed: May 13, 2011

Date of Patent: March 25, 2014

Assignee: Samsung Electronics Co., Ltd.

Inventors: Ji-Hyuk Lim, Jang-Young Ryu, Dong-Seon Lee
Systems and methods for classifying sports video

Patent number: 8682654

Abstract: Disclosed are systems, methods, and computer readable media having programs for classifying sports video. In one embodiment, a method includes: extracting, from an audio stream of a video clip, a plurality of key audio components contained therein; and classifying, using at least one of the plurality of key audio components, a sport type contained in the video clip. In one embodiment, a computer readable medium having a computer program for classifying sports video includes: logic configured to extract a plurality of key audio components from a video clip; and logic configured to classify a sport type corresponding to the video clip.

Type: Grant

Filed: April 25, 2006

Date of Patent: March 25, 2014

Assignee: Cyberlink Corp.

Inventors: Ming-Jun Chen, Jiun-Fu Chen, Shih-Min Tang, Ho-Chao Huang
Method and apparatus for interaction or discourse analytics

Patent number: 8676586

Abstract: A method and apparatus for analyzing and segmenting a vocal interaction captured in a test audio source, the test audio source captured within an environment. The method and apparatus first use text and acoustic features extracted from the interaction with tagging information, for constructing a model. Then, at production time, text and acoustic features are extracted from the interactions, and by applying the model, tagging information is retrieved for the interaction, enabling analysis, flow visualization or further processing of the interaction.

Type: Grant

Filed: September 16, 2008

Date of Patent: March 18, 2014

Assignee: Nice Systems LTD

Inventors: Moshe Wasserblat, Oren Pereg, Yuval Lubowich
Web-based audio transcription tool

Patent number: 8676590

Abstract: A computer-implemented technique for transcribing audio data includes generating, along a vertical axis on a display of a client device, an image representing audio content. The technique further includes receiving, from a user of the client device, a selection of a portion of the image; and generating, via an audio module of the client device, an audio output corresponding to the selected portion of the image. The technique further includes receiving, from the user, a selection indicating a position along the vertical axis on the display to enter a text portion representing the audio output, wherein the position is aligned to the selected portion of the image. The technique further includes receiving, from the user, the text portion representing the audio output; and displaying, on the display, the text portion at the position, wherein the text portion extends along a horizontal axis on the display.

Type: Grant

Filed: September 26, 2012

Date of Patent: March 18, 2014

Assignee: Google Inc.

Inventors: Jeffrey Scott Sorensen, Masayuki Nanzawa, Ravindran Rajakumar
Editing telecom web applications through a voice interface

Patent number: 8676589

Abstract: Systems and associated methods for editing telecom web applications through a voice interface are described. Systems and methods provide for editing telecom web applications over a connection, as for example accessed via a standard phone, using speech and/or DTMF inputs. The voice based editing includes exposing an editing interface to a user for a telecom web application that is editable, dynamically generating a voice-based interface for a given user for accomplishing editing tasks, and modifying the telecom web application to reflect the editing commands entered by the user.

Type: Grant

Filed: August 28, 2012

Date of Patent: March 18, 2014

Assignee: International Business Machines Corporation

Inventors: Sheetal K. Agarwal, Arun Kumar, Priyanka Manwani
Method and apparatus for masking speech in a private environment

Patent number: 8670986

Abstract: A speech masking apparatus includes a microphone and a speaker. The microphone can detect a human voice. The speaker can output a masking language which can include phonemes resembling human speech. At least one component of the masking language can have a pitch, a volume, a theme, and/or a phonetic content substantially matching a pitch, a volume, a theme, and/or a phonetic content of the voice.

Type: Grant

Filed: March 6, 2013

Date of Patent: March 11, 2014

Assignee: Medical Privacy Solutions, LLC

Inventors: Babak Arvanaghi, Joel Fechter
System and method for audio snippet generation from a subset of music tracks

Patent number: 8666749

Abstract: The disclosure includes a system and method for generating audio snippets from a subset of audio tracks. In some embodiments an audio snippet is an audio summary of a group or collection of songs.

Type: Grant

Filed: January 17, 2013

Date of Patent: March 4, 2014

Assignee: Google Inc.

Inventors: Amarnag Subramanya, Jennifer Gillenwater, Garth Griffin, Fernando Pereira, Douglas Eck
Method for dynamic learning of individual voice patterns

Patent number: 8655660

Abstract: The present invention is a system and method for generating a personal voice font including monitoring voice segments automatically from phone conversations of a user by a voice learning processor to generate a personalized voice font (PVF) and delivering the PVF to a server.

Type: Grant

Filed: February 10, 2009

Date of Patent: February 18, 2014

Assignee: International Business Machines Corporation

Inventors: Zsolt Szalai, Philippe Bazot, Bernard Pucci, Joel Vitale
Automated communication integrator

Patent number: 8639513

Abstract: An apparatus includes a plurality of applications and an integrator having a voice recognition module configured to identify at least one voice command from a user. The integrator is configured to integrate information from a remote source into at least one of the plurality of applications based on the identified voice command. A method includes analyzing speech from a first user of a first mobile device having a plurality of applications, identifying a voice command based on the analyzed speech using a voice recognition module, and incorporating information from the remote source into at least one of a plurality of applications based on the identified voice command.

Type: Grant

Filed: August 5, 2009

Date of Patent: January 28, 2014

Assignee: Verizon Patent and Licensing Inc.

Inventor: Robert Edward Opaluch
Identifying and generating audio cohorts based on audio data input

Patent number: 8626505

Abstract: A computer implemented method, system, and/or computer program product generates an audio cohort. Audio data from a set of audio sensors is received by an audio analysis engine. The audio data, which is associated with a plurality of objects, comprises a set of audio patterns. The audio data is processed to identify audio attributes associated with the plurality of objects to form digital audio data. This digital audio data comprises metadata that describes the audio attributes of the set of objects. A set of audio cohorts is generated using the audio attributes associated with the digital audio data and cohort criteria, where each audio cohort in the set of audio cohorts is a cohort of accompanied customers in a store, and where processing the audio data identifies a type of zoological creature that is accompanying each of the accompanied customers.

Type: Grant

Filed: September 6, 2012

Date of Patent: January 7, 2014

Assignee: International Business Machines Corporation

Inventors: Robert L. Angell, Robert R. Friedlander, James R. Kraemer
Automatic marking method for karaoke vocal accompaniment

Patent number: 8626497

Abstract: An automatic marking method for Karaoke vocal accompaniment is provided. In the method, pitch, beat position and volume of a singer are compared with the original pitch, beat position and volume of the theme of a song to generate a score of pitch, a score of beat and a score of emotion respectively, so as to obtain a weighted total score in a weighted marking method. By using the method, the pitch, beat position and volume error of each section of the song sung by the singer can be exactly worked out, and a pitch curve and a volume curve can be displayed, so that the singer can learn which part is sung incorrectly and which part needs to be enhanced. The present invention also has the advantages of dual effects of teaching and entertainment, high practicability and technical advancement.

Type: Grant

Filed: April 7, 2009

Date of Patent: January 7, 2014

Inventor: Wen-Hsin Lin
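
The weighted marking described in the abstract above can be sketched as three per-feature scores combined by weights. Everything specific here is an assumption for illustration: the linear error-to-score mapping, the tolerance values, the default weights, and the names `section_score` and `weighted_total` are hypothetical and not taken from the patent.

```python
def section_score(sung, reference, tolerance):
    """Map the mean absolute error between sung and reference values
    to a 0-100 score, falling linearly to 0 at `tolerance`."""
    if not sung or len(sung) != len(reference):
        raise ValueError("sung and reference must be non-empty and equal length")
    error = sum(abs(s - r) for s, r in zip(sung, reference)) / len(sung)
    return max(0.0, 100.0 * (1.0 - error / tolerance))


def weighted_total(pitch, beat, emotion, weights=(0.5, 0.3, 0.2)):
    """Combine pitch, beat, and emotion (volume-based) scores."""
    w_pitch, w_beat, w_emotion = weights
    total = w_pitch * pitch + w_beat * beat + w_emotion * emotion
    return total / (w_pitch + w_beat + w_emotion)


pitch_score = section_score([61, 63], [60, 62], tolerance=2.0)      # pitch in semitones
beat_score = section_score([0.0, 1.0], [0.0, 1.0], tolerance=0.25)  # beat times in seconds
emotion_score = section_score([0.5, 0.7], [0.5, 0.7], tolerance=0.5)  # relative volume
total = weighted_total(pitch_score, beat_score, emotion_score)
```

Scoring each section of the song separately with `section_score` would give the per-section error feedback the abstract mentions.
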
Insertion of sounds into audio content according to pattern

Patent number: 8626493

Abstract: Sounds are inserted into audio content according to a pattern. A library stores humanly perceptible voice sounds. Pattern control information is received that is associated with a device recording the audio content. A pattern is retrieved and washing machine sounds are inserted into the audio content according to the pattern. The humanly perceptible voice sounds are inserted into the audio content according to the pattern to generate a signed audio recording.

Type: Grant

Filed: April 26, 2013

Date of Patent: January 7, 2014

Assignee: AT&T Intellectual Property I, L.P.

Inventor: Steven N. Tischer
Automatic realtime speech impairment correction

Patent number: 8620670

Abstract: Automatic correcting of user's speech impairment in speech may include obtaining the audio signal of a given user's speech, and analyzing the obtained audio signal to identify artifacts caused by the user's impairment. The obtained audio signal may be modified by eliminating the identified artifacts from it. The modified audio signal may be provided, e.g., to be played or broadcast or transmitted.

Type: Grant

Filed: September 12, 2012

Date of Patent: December 31, 2013

Assignee: International Business Machines Corporation

Inventors: Peter K. Malkin, Sharon M. Trewin
System for controlling digital effects in live performances with vocal improvisation

Patent number: 8620661

Abstract: A system for controlling digital effects in live performances with vocal improvisation is described. The system features a controller that utilizes several switches attached to clothing that is worn by an artist during a live performance. The switches activate a digital vocal processor unit that provides a dual mode, multi-channel phrase looping capability wherein individual channels can be selected for recording and replay during the performance. This combination of features allows a sequence of digital audio and video effects to be controlled by the artist during a performance while maintaining the freedom of movement desired to enhance the performance.

Type: Grant

Filed: February 28, 2011

Date of Patent: December 31, 2013

Inventor: Momilani Ramstrum
Automatic detection of audio advertisements

Patent number: 8606585

Abstract: A method, apparatus, and computer-readable medium for editing a data stream based on a corpus are provided. The data stream includes stream words. A sequence includes a predetermined number of sequential words of the stream words. The method, apparatus, and computer-readable medium determine whether the sequence exists in the corpus at least at a predetermined minimum frequency. When the sequence exists in the corpus at least at the predetermined minimum frequency, the sequence is edited in the data stream.

Type: Grant

Filed: September 17, 2010

Date of Patent: December 10, 2013

Assignee: AT&T Intellectual Property I, L.P.

Inventors: Ilya Dan Melamed, Yeon-Jun Kim
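
The frequency test in the abstract above can be sketched with word n-grams. The sketch assumes the "edit" is removal of the matched words, which suits the advertisement use case in the title but is only one possible edit; the names `ngrams` and `edit_stream` and the defaults `n=3` and `min_count=2` are hypothetical.

```python
from collections import Counter


def ngrams(words, n):
    """All runs of n consecutive words, as tuples."""
    return [tuple(words[i:i + n]) for i in range(len(words) - n + 1)]


def edit_stream(stream_words, corpus_words, n=3, min_count=2):
    """Drop every stream word covered by an n-word sequence that occurs
    in the corpus at least `min_count` times."""
    counts = Counter(ngrams(corpus_words, n))
    drop = [False] * len(stream_words)
    for i, gram in enumerate(ngrams(stream_words, n)):
        if counts[gram] >= min_count:
            for j in range(i, i + n):
                drop[j] = True
    return [word for word, dropped in zip(stream_words, drop) if not dropped]


corpus = "buy one get one free today buy one get one free now".split()
stream = "news at ten buy one get one free weather next".split()
cleaned = edit_stream(stream, corpus)  # ['news', 'at', 'ten', 'weather', 'next']
```
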
Method and apparatus for canceling vocal signal from audio signal

Patent number: 8583444

Abstract: Provided is a method of canceling a vocal signal, wherein the method includes obtaining a difference signal between two audio signals; and smoothing the frequency of the difference signal. Also provided is a device for canceling a vocal signal, the device including a subtracter which obtains a difference signal between two audio signals; and a frequency smoothing unit which smoothes a frequency of the difference signal.

Type: Grant

Filed: October 12, 2010

Date of Patent: November 12, 2013

Assignee: Samsung Electronics Co., Ltd.

Inventor: Jun-ho Lee
Recording and reproducing apparatus

Patent number: 8583443

Abstract: Disclosed is a recording and reproducing apparatus comprising: an apparatus main body; and a remote controller to perform remote control of the apparatus main body, wherein the remote controller comprises: a key operating section to receive a key operation by a user; a sound information inputting section to input sound information; and a transmitting section to transmit sound data based on the sound information to the apparatus main body, and the apparatus main body comprises: a recording section to record input content data on a recording medium; a reproducing section to reproduce the content data; a receiving section to receive the sound data; a sound information recording section to record the sound data so as to be associated with a piece of the content data; and a sound information outputting section to reproduce the sound data to output the reproduced sound data.

Type: Grant

Filed: April 10, 2008

Date of Patent: November 12, 2013

Assignee: Funai Electric Co., Ltd.

Inventor: Masayuki Misawa
Multipurpose media players

Patent number: 8577683

Abstract: Disclosed are Multipurpose Media Players that enable users to create transcriptions, closed captions, and/or logs of digitized recordings, that enable the presentation of transcripts, closed captions, logs, and digitized recordings in a correlated manner to users, that enable users to compose one or more scenes of a production, and that enable users to compose storyboards for a production. The multipurpose media players can be embodied within Internet browser environments; thereby providing high availability of the multipurpose players across software platforms, networks, and physical locations.

Type: Grant

Filed: June 15, 2012

Date of Patent: November 5, 2013

Assignee: Thomas Majchrowski & Associates, Inc.

Inventor: Keri DeWitt
Encoding and decoding speech signals

Patent number: 8571039

Abstract: A method and apparatus for transmitting an audio signal over a communication channel comprising encoding the audio signal with an encoder 204 using a first sampling rate, filtering the audio signal using a first cut off frequency, the first cut off frequency being chosen in dependence upon the first sampling rate, and transmitting the encoded and filtered audio signal over the communication channel. The presence of a condition in which the sampling rate of the encoder 204 is to be switched to a second sampling rate at a switching time is determined and if the condition has been determined to be present, the cut off frequency used in the filtering step is gradually changed from the first cut off frequency to a second cut off frequency, the second cut off frequency being chosen in dependence upon the second sampling rate, such that the audio bandwidth of the transmitted signal changes gradually when the sampling rate is switched to the second sampling rate.

Type: Grant

Filed: June 23, 2010

Date of Patent: October 29, 2013

Assignee: Skype

Inventors: Stefan Strommer, Karsten Vandborg Sorensen, Soren Skak Jensen, Koen Vos, Jon Bergenheim
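
The gradual change of cut-off frequency described in the abstract above could be scheduled roughly as follows. This is a sketch under stated assumptions: the cut-off is taken to be a fixed fraction of the sampling rate, and the transition is linear over a fixed number of steps. Neither choice comes from the abstract, and `cutoff_for_rate` and `cutoff_schedule` are hypothetical names.

```python
def cutoff_for_rate(sample_rate_hz, fraction=0.45):
    """Assumed rule: the cut-off is a fixed fraction of the sampling rate,
    kept below the Nyquist limit of 0.5."""
    return sample_rate_hz * fraction


def cutoff_schedule(first_rate_hz, second_rate_hz, steps):
    """Return steps + 1 cut-off values that move linearly from the cut-off
    for the first sampling rate to the cut-off for the second one."""
    if steps < 1:
        raise ValueError("steps must be at least 1")
    start = cutoff_for_rate(first_rate_hz)
    end = cutoff_for_rate(second_rate_hz)
    return [start + (end - start) * k / steps for k in range(steps + 1)]


# Example: switching the encoder from 16 kHz to 8 kHz over four filter updates.
for value in cutoff_schedule(16000, 8000, 4):
    print(round(value))
```

Each value would be handed to the filter in turn, so the audio bandwidth narrows or widens in small steps rather than jumping at the switching time.
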
Apparatus and method for generating avatar based video message

Patent number: 8566101

Abstract: An apparatus and method for generating an avatar based video message are provided. The apparatus and method are capable of generating an avatar based video message based on speech of a user. The avatar based video message apparatus and method displays information that corresponds to input user speech. The avatar based video message apparatus and method edits the input user speech according to a user input signal with reference to the displayed information, generates avatar animation according to the edited speech, and generates an avatar based video message based on the edited speech and the avatar animation.

Type: Grant

Filed: April 5, 2010

Date of Patent: October 22, 2013

Assignee: Samsung Electronics Co., Ltd.

Inventors: Ick-sang Han, Jeong-mi Cho
System and method for synchronizing sound and manually transcribed text

Patent number: 8560327

Abstract: A method for synchronizing sound data and text data, said text data being obtained by manual transcription of said sound data during playback of the latter. The proposed method comprises the steps of repeatedly querying said sound data and said text data to obtain a current time position corresponding to a currently played sound datum and a currently transcribed text datum, respectively, correcting said current time position by applying a time correction value in accordance with a transcription delay, and generating at least one association datum indicative of a synchronization association between said corrected time position and said currently transcribed text datum. Thus, the proposed method achieves cost-effective synchronization of sound and text in connection with the manual transcription of sound data.

Type: Grant

Filed: August 18, 2006

Date of Patent: October 15, 2013

Assignee: Nuance Communications, Inc.

Inventors: Andreas Neubacher, Miklos Papai
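
The correction step in the abstract above can be sketched as below. The sketch assumes the transcriptionist lags the audio by a constant delay, so the delay is subtracted from the queried playback position; the sign, the constant value, and the names `associate` and `delay_s` are assumptions for illustration.

```python
def associate(events, delay_s=1.5):
    """Pair each transcribed text item with a corrected playback position.

    events: iterable of (playback_position_s, text) pairs, one per query.
    delay_s: assumed constant transcription delay in seconds.
    """
    links = []
    for position_s, text in events:
        # Assumption: typing lags the audio, so shift the position back,
        # but never before the start of the recording.
        corrected_s = max(0.0, position_s - delay_s)
        links.append((corrected_s, text))
    return links


polled = [(2.0, "hello"), (3.5, "world")]
print(associate(polled))
```
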
Method and apparatus for segmenting a multimedia program based upon audio events

Patent number: 8560319

Abstract: The present invention provides for a method and apparatus for segmenting a multi-media program based upon audio events. In an embodiment, a method of classifying an audio stream is provided. This method includes receiving an audio stream, sampling the audio stream at a predetermined rate, and then combining a predetermined number of samples into a clip. A plurality of features are then determined for the clip and are analyzed using a linear approximation algorithm. The clip is then characterized based upon the results of the analysis conducted with the linear approximation algorithm.

Type: Grant

Filed: January 15, 2008

Date of Patent: October 15, 2013

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Qian Huang, Zhu Liu
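
The clip-based classification in the abstract above might be sketched like this. The two features (mean energy and zero-crossing rate), the linear score standing in for the "linear approximation algorithm", and the labels are all assumptions chosen for illustration; the abstract does not name them.

```python
def make_clips(samples, clip_size):
    """Group consecutive samples into non-overlapping clips of clip_size."""
    last_start = len(samples) - clip_size
    return [samples[i:i + clip_size] for i in range(0, last_start + 1, clip_size)]


def features(clip):
    """Return [mean energy, zero-crossing rate] for one clip (assumed features)."""
    if len(clip) < 2:
        raise ValueError("a clip needs at least two samples")
    energy = sum(x * x for x in clip) / len(clip)
    crossings = sum(1 for a, b in zip(clip, clip[1:]) if (a < 0) != (b < 0))
    return [energy, crossings / (len(clip) - 1)]


def classify(clip, weights, bias, labels=("class_a", "class_b")):
    """Characterize a clip by the sign of a linear score over its features."""
    score = bias + sum(w * f for w, f in zip(weights, features(clip)))
    return labels[0] if score >= 0 else labels[1]


stream = [1, -1, 1, -1, 1, 1, 1, 1]
for clip in make_clips(stream, 4):
    print(classify(clip, weights=[0.0, 1.0], bias=-0.5))
```

In practice the weights and bias would be fitted to labeled clips; here they are hand-picked so that a rapidly alternating clip and a flat clip land on opposite sides of the threshold.
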
Content Customization

Publication number: 20130262127

Abstract: A content processing service may analyze an item of original content and identify several objects, attributes of those objects, and relationships between those objects present in the item of original content. The content processing service may also analyze a source graph, such as a social graph or supplemental graph, and identify several objects, attributes of those objects, and relationships between objects present in the source graph. The content processing service may customize the item of original content by selecting an original object and selecting a source graph object. One or more of the attributes or relationships of the selected original object in the item of original content may be replaced by one or more of the attributes or relationships of the selected source graph object. To customize items of audio content, audio content associated with the source graph object may replace audio content associated with the target graph object.

Type: Application

Filed: March 29, 2012

Publication date: October 3, 2013

Inventors: Douglas S. Goldstein, Ajay Arora, Douglas Hwang, Guy A. Story, Jr., Shirley C. Yang
Stretching/shrinking selected portions of a signal

Patent number: 8538761

Abstract: Techniques are described to allow a user of a signal editing tool to “stretch” or “shrink” a selected portion of a recorded signal to change the length of the selected portion of the signal relative to a particular domain, without stretching or shrinking other parts of the signal. In the context of audio signals, techniques are provided to allow a user to “time stretch” an audio signal file to change the duration of the stretched portion of the audio. The user may select how the change affects the total duration of the audio signal. Options are provided for “shifting” the non-selected portion of the signal, or for not shifting the non-selected portion of the signal. When the non-selected portion is not shifted, the signal editing tool automatically generates audio for the gap (for shrinking operations), and automatically deletes audio that overlaps with the stretched portion (for stretching operations).

Type: Grant

Filed: August 1, 2005

Date of Patent: September 17, 2013

Assignee: Apple Inc.

Inventors: Christopher J. Moulios, Nikhil M. Bhatt
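
The shift and no-shift options in the abstract above can be sketched on a plain list of samples. Two simplifications are assumptions: the selection is resized by naive linear interpolation (which also changes pitch, unlike a real time-stretch), and the gap left by a no-shift shrink is filled with silence as a placeholder for generated audio. The function names are hypothetical.

```python
def resample(segment, new_len):
    """Resize a segment to new_len samples by linear interpolation."""
    if new_len <= 0 or not segment:
        return []
    if new_len == 1 or len(segment) == 1:
        return [segment[0]] * new_len
    scale = (len(segment) - 1) / (new_len - 1)
    out = []
    for i in range(new_len):
        pos = i * scale
        lo = int(pos)
        hi = min(lo + 1, len(segment) - 1)
        frac = pos - lo
        out.append(segment[lo] * (1 - frac) + segment[hi] * frac)
    return out


def stretch_selection(signal, start, end, factor, shift=True):
    """Stretch or shrink signal[start:end] by factor.

    shift=True: the rest of the signal moves, so the total length changes.
    shift=False: the rest stays in place; a shrink leaves a gap (filled with
    silence here) and a stretch overwrites the audio it now overlaps.
    """
    old_len = end - start
    new_len = round(old_len * factor)
    body = resample(signal[start:end], new_len)
    head, tail = signal[:start], signal[end:]
    if shift:
        return head + body + tail
    if new_len <= old_len:
        return head + body + [0.0] * (old_len - new_len) + tail
    return head + body + tail[new_len - old_len:]


audio = [0, 1, 2, 3, 4, 5, 6, 7]
print(stretch_selection(audio, 2, 6, 0.5, shift=True))
print(stretch_selection(audio, 2, 6, 0.5, shift=False))
```
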
Insertion of Sounds Into Audio Content According to Pattern

Publication number: 20130238342

Abstract: Sounds are inserted into audio content according to a pattern. A library stores humanly perceptible voice sounds. Pattern control information is received that is associated with a device recording the audio content. A pattern is retrieved, and the humanly perceptible voice sounds are inserted into the audio content according to the pattern to generate a signed audio recording.

Type: Application

Filed: April 26, 2013

Publication date: September 12, 2013

Applicant: AT&T Intellectual Property I, L.P.

Inventor: Steven N. Tischer
Audible post-it system

Patent number: 8532996

Abstract: An audible post-it system includes a post-it note printed with an index and an optical reading and recording device having an optical module, a switch, a storage device, an audio recording device, an audio playing device and a processor. The optical reading and recording device reads an image of the index. When the optical reading and recording device is in a recording state, the processor receives the image of the index and obtains the index, then receives a digital audio outputted by the audio recording device to match the index with the digital audio, and stores the digital audio based on the index. When the optical reading and recording device is in a playing state, the processor receives the image of an index and retrieves the index, then reads a digital audio based on the index, and sends the digital audio to the audio playing device for playing.

Type: Grant

Filed: October 21, 2010

Date of Patent: September 10, 2013

Assignee: GeneralPlus Technology, Inc.

Inventor: Ching-Fu Hung
Method and apparatus for sculpting synthesized speech

Patent number: 8527281

Abstract: Methods and systems for sculpting synthesized speech using a graphic user interface are disclosed. An operator enters a stream of text that is used to produce a stream of target phonetic-units. The stream of target phonetic-units is then submitted to a unit-selection process to produce a stream of selected phonetic-units, each selected phonetic-unit derived from a database of sample phonetic-units. After the stream of sample phonetic-units is selected, an operator can remove various selected phonetic-units from the stream of selected phonetic-units, prune the sample phonetic-database and edit various cost functions using the graphic user interface. The edited speech information can then be submitted to the unit-selection process to produce a second stream of selected phonetic-units.

Type: Grant

Filed: June 29, 2012

Date of Patent: September 3, 2013

Assignee: Nuance Communications, Inc.

Inventors: Peter Rutten, Paul A. Taylor
Method for sending multi-media messages with customized audio

Patent number: 8521533

Abstract: A system and method of creating a customized multi-media message to a recipient is disclosed. The multi-media message is created by a sender and contains an animated entity that delivers an audible message. The sender chooses the animated entity from a plurality of animated entities. The system receives a text message from the sender and receives a sender audio message associated with the text message. The sender audio message is associated with the chosen animated entity to create the multi-media message. The multi-media message is delivered by the animated entity using as the voice the sender audio message wherein the mouth movements of the animated entity conform to the sender audio message.

Type: Grant

Filed: February 28, 2007

Date of Patent: August 27, 2013

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Joern Ostermann, Mehmet Reha Civanlar, Barbara Buda, Claudio Lande
Method for segmenting audio signals

Patent number: 8521529

Abstract: An input signal is converted to a feature-space representation. The feature-space representation is projected onto a discriminant subspace using a linear discriminant analysis transform to enhance the separation of feature clusters. Dynamic programming is used to find global changes to derive optimal cluster boundaries. The cluster boundaries are used to identify the segments of the audio signal.

Type: Grant

Filed: April 18, 2005

Date of Patent: August 27, 2013

Assignee: Creative Technology Ltd

Inventors: Michael M. Goodwin, Jean Laroche
Biochemical analyzer having microprocessing apparatus with expandable voice capacity

Patent number: 8521535

Abstract: A biochemical analyzer having a microprocessing apparatus with expandable voice capacity is characterized in that a driving module is installed in a data processor and a voice carrier is replaceable. Thereby, increase or decrease of voice files can be easily done by replacing the current voice carrier with an alternative voice carrier storing desired voice files, without the need of replacing the driving module together with the voice carrier, thereby saving costs and reducing processing procedures.

Type: Grant

Filed: November 10, 2010

Date of Patent: August 27, 2013

Inventor: Chun-Yu Chen
Selective feedback for text recognition systems

Patent number: 8515751

Abstract: This specification describes technologies relating to recognition of text in various media. In general, one aspect of the subject matter described in this specification can be embodied in methods that include receiving an input signal including data representing one or more words and passing the input signal to a text recognition system that generates a recognized text string based on the input signal. The methods may further include receiving the recognized text string from the text recognition system. The methods may further include presenting the recognized text string to a user and receiving a corrected text string based on input from the user. The methods may further include checking if an edit distance between the corrected text string and the recognized text string is below a threshold. If the edit distance is below the threshold, the corrected text string may be passed to the text recognition system for training purposes.

Type: Grant

Filed: September 26, 2012

Date of Patent: August 20, 2013

Assignee: Google Inc.

Inventors: Luca Zanolin, Marcus A. Foster, Richard Z. Cohen
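
The threshold test in the abstract above can be sketched as follows. It assumes a character-level Levenshtein distance and an arbitrary threshold of 3; the abstract does not specify the distance measure or the threshold value, and `select_for_training` is a hypothetical name.

```python
def edit_distance(a, b):
    """Levenshtein distance between two strings (insert, delete, substitute)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # delete ca
                           cur[j - 1] + 1,             # insert cb
                           prev[j - 1] + (ca != cb)))  # substitute or match
        prev = cur
    return prev[-1]


def select_for_training(recognized, corrected, threshold=3):
    """Return True when the correction is close enough to the recognized
    text to be passed back to the recognizer as training data."""
    return edit_distance(recognized, corrected) < threshold


print(select_for_training("recognize speech", "recognise speech"))
print(select_for_training("kitten", "sitting"))
```

A small distance suggests the user fixed a recognition error, while a large one suggests the user rewrote the text, which would make a poor training signal.
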
