Patents Examined by Daniel D Abebe
  • Patent number: 8170872
    Abstract: Embodiments of the present invention address deficiencies of the art in respect to chat transcript generation for instant messaging and provide a method, system and computer program product for emotional state transcription for chat sessions. In an embodiment of the invention, a method for emotional state transcription for chat sessions can be provided. The method can include initializing a chat session in an instant messenger, engaging in an audio conversation through the instant messenger, collecting emotion meta-data for the audio conversation and mapping the emotion meta-data to emoticons, and combining a speech recognized form of the audio conversation with the emoticons and text from the chat session into a chat transcript. The method further can include computing a milleau for the chat session from the emotion meta-data and incorporating the milleau for the chat session in the transcript.
    Type: Grant
    Filed: December 4, 2007
    Date of Patent: May 1, 2012
    Assignee: International Business Machines Corporation
    Inventor: Ruthie D. Lyle
  • Patent number: 8170884
    Abstract: Systems provide an audio/visual output to each of a plurality of listeners in a manner that permits individualized audio adjustment, wherein audio comprises a first signal that is substantially voice and a second signal that is substantially other than voice. The systems may include a video device, a storage medium, and a transmitter that transmits the first and second signals to a plurality of personal listening devices. Each of the plurality of personal listening devices may include first and second receivers, first and second adjustment devices, an audio signal combining device, and one or more transducers, wherein the systems permit each of the plurality of listeners to adjust the first and second signals independently of other ones of the plurality of listeners in an audience.
    Type: Grant
    Filed: January 8, 2008
    Date of Patent: May 1, 2012
    Assignee: Akiba Electronics Institute LLC
    Inventors: Michael A. Vaudrey, William R. Saunders
  • Patent number: 8170865
    Abstract: A speech recognition device and a method thereof are adapted to recognize a Chinese word. The speech recognition device includes a lexicon model, a language model, a speech recognition module, and a parsing module. The lexicon model keeps a plurality of words. The speech recognition module performs a speech recognition processing on a voice signal conforming to a syntax structure of Chinese word description. The speech recognition processing searches words related to the Chinese word description from the lexicon model according to a feature of the Chinese word description, and produces a literal word series in digital data form by referring a syntax combination probability. The language model based on the syntax structure of Chinese word description provides the syntax combination probability according to combination relations between the searched words. The parsing module analyzes the syntax structure of the literal word series for retrieving the Chinese word.
    Type: Grant
    Filed: September 16, 2008
    Date of Patent: May 1, 2012
    Assignee: DELTA Electronics, Inc.
    Inventors: Liang-Sheng Huang, Chao-Jen Huang, Jia-Lin Shen
  • Patent number: 8165875
    Abstract: A voice enhancement logic improves the perceptual quality of a processed voice. The voice enhancement system includes a noise detector and a noise attenuator. The noise detector detects a wind buffet and a continuous noise by modeling the wind buffet. The noise attenuator dampens the wind buffet to improve the intelligibility of an unvoiced, a fully voiced, or a mixed voice segment.
    Type: Grant
    Filed: October 12, 2010
    Date of Patent: April 24, 2012
    Assignee: QNX Software Systems Limited
    Inventors: Phillip A. Hetherington, Xueman Li, Pierre Zakarauskas
  • Patent number: 8165876
    Abstract: A method and apparatus for capturing data in a workstation, wherein a large number of data associated with a sample which is viewed, by a user, through an optical device, such as a microscope, is to be entered in a computer related file. The optical device can be moved to a data-sampling position utilizing voice commands. A pointer can then be moved to an appropriate place in the file to receive the data relating to the data-sampling position. Data can be then entered in the appropriate position utilizing a voice command. The steps of moving the pointer and entering the data can then be repeated until all data is provided with respect to the data-sampling positions.
    Type: Grant
    Filed: September 4, 2006
    Date of Patent: April 24, 2012
    Assignee: Nuance Communications, Inc.
    Inventors: Ossama Emam, Khaled Gamal
  • Patent number: 8155957
    Abstract: An automated transcription system includes an housing on a PC, and a portable electronic device including a mechanism for creating and managing a plurality of predetermined templates with a plurality of headings and sub-headings that are automatically populated in real time as a user speaks an audio message. The portable electronic device further includes a mechanism for converting and displaying the audio message to a text message on the portable electronic device and thereby enabling a user to read, edit and print the text message. Such an audio message converting and displaying mechanism includes an LCD screen, a microphone for receiving the audio message when the user speaks, and a data transfer interface.
    Type: Grant
    Filed: March 7, 2008
    Date of Patent: April 10, 2012
    Inventor: LuAnn C. Takens
  • Patent number: 8145478
    Abstract: An audio signal band expanding apparatus (100a) includes a harmonic generator (3) that receives an input audio signal having a predetermined band and generates, based on the input audio signal, harmonic signals, and an adder (2) that adds the harmonic signals generated by the harmonic generator (3) to the input audio signal. The harmonic generator (3) simulates the input-output characteristics of a predetermined amplifier or that of a device to generate the harmonic signals from the input audio signal.
    Type: Grant
    Filed: May 12, 2006
    Date of Patent: March 27, 2012
    Assignee: Panasonic Corporation
    Inventor: Kazuya Iwata
  • Patent number: 8140339
    Abstract: A sign language recognition apparatus and method is provided for translating hand gestures into speech or written text. The apparatus includes a number of sensors on the hand, arm and shoulder to measure dynamic and static gestures. The sensors are connected to a microprocessor to search a library of gestures and generate output signals that can then be used to produce a synthesized voice or written text. The apparatus includes sensors such as accelerometers on the fingers and thumb and two accelerometers on the back of the hand to detect motion and orientation of the hand. Sensors are also provided on the back of the hand or wrist to detect forearm rotation, an angle sensor to detect flexing of the elbow, two sensors on the upper arm to detect arm elevation and rotation, and a sensor on the upper arm to detect arm twist. The sensors transmit the data to the microprocessor to determine the shape, position and orientation of the hand relative to the body of the user.
    Type: Grant
    Filed: July 21, 2009
    Date of Patent: March 20, 2012
    Assignee: The George Washington University
    Inventor: Jose L. Hernandez-Rebollar
  • Patent number: 8135584
    Abstract: According to the invention, an excitation signal is generated as a result of sampled excitation values in order to excite an audio synthesis filter, the generated sampled excitation values being continuously stored in an adaptive codebook. A noise generator is provided which continuously generates random sampled values. A sequence of the stored sampled excitation values is selected from the adaptive codebook based on a fed audio fundamental frequency parameter by means of which a time gap between the sequence that is to be selected and the actual time reference is predefined. The excitation signal is generated by mixing the selected sequence with a random sequence encompassing actual random sampled valued of the noise generator.
    Type: Grant
    Filed: January 31, 2006
    Date of Patent: March 13, 2012
    Assignee: Siemens Enterprise Communications GmbH & Co. KG
    Inventors: Bernd Geiser, Peter Jax, Stefan Schandl, Herve Taddei
  • Patent number: 8131550
    Abstract: An apparatus for providing improved voice conversion includes a sub-feature generator and a transformation element. The sub-feature generator may be configured to define sub-feature units with respect to a feature of source speech. The transformation element may be configured to perform voice conversion of the source speech to target speech based on the conversion of the sub-feature units to corresponding target speech sub-feature units using a conversion model trained with respect to converting training source speech sub-feature units to training target speech sub-feature units.
    Type: Grant
    Filed: October 4, 2007
    Date of Patent: March 6, 2012
    Assignee: Nokia Corporation
    Inventors: Jani Nurminen, Elina Helander
  • Patent number: 8126697
    Abstract: Systems and methods are provided for establishing a communication session between a first and second communication device. A request may be received for a communication session from the first communication device, the request indicating a language for media streams transmitted to, and received from, the first communication device. A communication session invitation specifying a plurality of languages supported for the communication session may be transmitted to the second communication device. A response may be received, from the second communication device, indicating a language for media streams transmitted to, and received from, the second communication device. The communication session, between the first and second communications devices may be established such that the media streams transmitted to, and received from, the first and second communications device are in a language indicated by the first and second communications devices respectively.
    Type: Grant
    Filed: October 10, 2007
    Date of Patent: February 28, 2012
    Assignee: Nextel Communications Inc.
    Inventors: Arun Manroa, Zheng Cai
  • Patent number: 8121834
    Abstract: A method of modifying acoustic characteristics of an original audio signal as a function of modification instructions relating at least to the fundamental frequency and the spectral envelope of the original signal.
    Type: Grant
    Filed: March 12, 2008
    Date of Patent: February 21, 2012
    Assignee: France Telecom
    Inventors: Olivier Rosec, Didier Cadic
  • Patent number: 8121846
    Abstract: A user speech interface for interactive media guidance applications, such as television program guides, guides for audio services, guides for video-on-demand (VOD) services, guides for personal video recorders (PVRs), or other suitable guidance applications is provided. Voice commands may be received from a user and guidance activities may be performed in response to the voice commands.
    Type: Grant
    Filed: July 5, 2006
    Date of Patent: February 21, 2012
    Assignee: United Video Properties, Inc.
    Inventors: M. Scott Reichardt, David M. Berezowski, Michael D. Ellis, Toby DeWeese
  • Patent number: 8117029
    Abstract: Provided is a transmission apparatus for matching sound quality measurement sections of a variable bandwidth multi-codec. The apparatus includes a measurement section setting unit setting a measurement section, which is to be measured for sound quality, in units of time; a first conversion unit converting the measurement section into a measurement section in units of samples; and an information synthesis unit synthesizing information regarding the measurement section in units of samples with a digital original sound and outputting the synthesis result. In addition, provided is a method of matching a measurement section of a reference sound, based on which the end-to-end sound quality measurement of the variable bandwidth multi-codec is performed, and a measurement section of a sound produced by the variable bandwidth multi-codec in a real-time Internet multimedia service. Therefore, distortion of measurement results due to un-matching measurement sections can be reduced.
    Type: Grant
    Filed: October 30, 2007
    Date of Patent: February 14, 2012
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Dae-Ho Kim, Tae-Gyu Kang, Ki-Jong Koo, Do Young Kim, Hae Won Jung
  • Patent number: 8117037
    Abstract: An adaptive advertising apparatus and associated methods. In one embodiment, the apparatus comprises a computer readable medium having at least one computer program disposed thereon, the at least one program being configured to adaptively present (e.g., display) advertising-related content (e.g., audio, video, images, etc.) that is contextually related to inputs provided via an input device such as a for example touch-screen display device. In one variant, the at least one program analyzes user input to determine a context of the input, and selects advertising related to the context for presentation to the user.
    Type: Grant
    Filed: February 24, 2010
    Date of Patent: February 14, 2012
    Inventor: Robert F. Gazdzinski
  • Patent number: 8112276
    Abstract: A voice recognition apparatus 10 carries out voice recognition of an inputted voice with reference to a voice recognition dictionary, and outputs a voice recognition result. In this voice recognition apparatus, a plurality of voice recognition dictionaries 23-1 to 23-N are provided according to predetermined classification items.
    Type: Grant
    Filed: August 16, 2006
    Date of Patent: February 7, 2012
    Assignee: Mitsubishi Electric Corporation
    Inventors: Yuki Sumiyoshi, Reiko Okada
  • Patent number: 8112279
    Abstract: A method of building an audio description of a particular product of a class of products includes providing a plurality of human voice recordings, wherein each of the human voice recordings includes audio corresponding to an attribute value common to many of the products. The method also includes automatically obtaining attribute values of the particular product, wherein the attribute values reside electronically. The method also includes automatically applying a plurality of rules for selecting a subset of the human voice recordings that correspond to the obtained attribute values and automatically stitching the selected subset of human voice recordings together to provide a voiceover product description of the particular product. A similar method is used to build an audio description of a particular process.
    Type: Grant
    Filed: August 15, 2008
    Date of Patent: February 7, 2012
    Assignee: Dealer Dot Com, Inc.
    Inventors: Jamie M Addessi, Mark Paul Bonfigli, Richard F Gibbs, Jr., Christopher Nathaniel Scott
  • Patent number: 8108221
    Abstract: A mixed lossless audio compression has application to a unified lossy and lossless audio compression scheme that combines lossy and lossless audio compression within a same audio signal. The mixed lossless compression codes a transition frame between lossy and lossless coding frames to produce seamless transitions. The mixed lossless coding performs a lapped transform and inverse lapped transform to produce an appropriately windowed and folded pseudo-time domain frame, which can then be losslessly coded. The mixed lossless coding also can be applied for frames that exhibit poor lossy compression performance.
    Type: Grant
    Filed: May 18, 2009
    Date of Patent: January 31, 2012
    Assignee: Microsoft Corporation
    Inventors: Wei-Ge Chen, Chao He
  • Patent number: 8108222
    Abstract: An encoding device (200) includes an MDCT unit (202) that transforms an input signal in a time domain into a frequency spectrum including a lower frequency spectrum, a BWE encoding unit (204) that generates extension data which specifies a higher frequency spectrum at a higher frequency than the lower frequency spectrum, and an encoded data stream generating unit (205) that encodes to output the lower frequency spectrum obtained by the MDCT unit (202) and the extension data obtained by the BWE encoding unit (204). The BWE encoding unit (204) generates as the extension data (i) a first parameter which specifies a lower subband which is to be copied as the higher frequency spectrum from among a plurality of the lower subbands which form the lower frequency spectrum obtained by the MDCT unit (202) and (ii) a second parameter which specifies a gain of the lower subband after being copied.
    Type: Grant
    Filed: July 15, 2010
    Date of Patent: January 31, 2012
    Assignee: Panasonic Corporation
    Inventors: Mineo Tsushima, Takeshi Norimatsu, Kosuke Nishio, Naoya Tanaka
  • Patent number: 8108220
    Abstract: The invention enables the inclusion of voice and remaining audio information at different parts of the audio production process. In particular, the invention embodies special techniques for VRA-capable digital mastering, accommodation of PCPV/PCA and/or SCRA signals in audio CODECs, VRA-capable encoders and decoders, and VRA in DVD and other digital audio file formats. The invention facilitates an end-listener's voice-to-remaining audio (VRA) adjustment upon the playback of digital audio media formats by focusing on new configurations of multiple parts of the entire digital audio system, thereby enabling a new technique intended to benefit audio end-users (end-listeners) who wish to control the ratio of the primary vocal/dialog content of an audio program relative to the remaining portion of the audio content in that program.
    Type: Grant
    Filed: September 4, 2007
    Date of Patent: January 31, 2012
    Assignee: Akiba Electronics Institute LLC
    Inventors: William R. Saunders, Michael A. Vaudrey