Modification Of At Least One Characteristic Of Speech Waves (epo) Patents (Class 704/E21.001)

E Subclasses

Speech enhancement, e.g., noise reduction, echo cancellation, etc. (epo) (Class 704/E21.002)

Time compression or expansion (epo) (Class 704/E21.017)

Suppression or repetition of time signal segments (EPO) (Class 704/E21.018)

Transformation of speech into a nonaudible representation, e.g., speech visualization, speech processing for tactile aids, etc. (epo) (Class 704/E21.019)

Synchronization of speech with image or synthesis of the lips movement from speech, e.g., for "talking heads," etc.(EPO) (Class 704/E21.02)

Substituting or Replacing Components in Sound Based on Steganographic Encoding

Publication number: 20110046959

Abstract: The present disclosure relates to various methods and systems to provide substitute sound (e.g., audio). One claim includes an apparatus comprising: electronic memory for storing identifying information obtained from steganographically encoded sound; an electronic processor programmed for: providing the identifying information to a remote computer, the remote computer including substitute sound corresponding to the identifying information; providing format information to the remote computer, the format information identifying a format in which the substitute sound should be formatted prior to communication of the substitute sound; and controlling receipt of substitute sound corresponding to the identifying information. Of course, other apparatus, methods and combinations are provided as well.

Type: Application

Filed: August 10, 2010

Publication date: February 24, 2011

Inventors: Douglas B. Evans, William Y. Conwell
ENCODER, DECODER, AND THE METHODS THEREFOR

Publication number: 20110046946

Abstract: Provided is an encoder which can decode a high-quality stereo signal while keeping the amount of information in the bit allocation information to a minimum when a scalable coding technique is used for a stereo signal. In the encoder, a principal component analysis (PCA) converter (101) PCA converts the left signal and the right signal of the stereo signal and generates the main signal of the first layer and the sub-signal of the first layer. In the first layer to the M-th layer (where M is a natural number, 2 or greater), an adaptive residual encoder (102-m) (where m is a natural number from 1 to M) compares the importance of the main signal of the m-th layer and the importance of the sub-signal of the m-th layer, selects the signal having the higher importance, encodes the selected signal, and generates the encoded data of the m-th layer.

Type: Application

Filed: May 29, 2009

Publication date: February 24, 2011

Applicant: PANASONIC CORPORATION

Inventors: Zongxian Liu, Kok Seng Chong
METHOD AND DEVICE OF BITRATE DISTRIBUTION/TRUNCATION FOR SCALABLE AUDIO CODING

Publication number: 20110046945

Abstract: Embodiments of the invention provides a method and device for assigning bitrates to a plurality of channels in a scalable audio encoding/truncation process. Different bitrates are assigned to different channels in the scalable audio encoding/truncation process.

Type: Application

Filed: January 31, 2008

Publication date: February 24, 2011

Applicant: AGENCY FOR SCIENCE, TECHNOLOGY AND RESEARCH

Inventors: Te Li, Susanto Rahardja, Haibin Huang
Method and system for voice communication

Publication number: 20110040565

Abstract: A method and a system for voice communication, especially for a user who has voice or speaking problems, are disclosed. The method requires a communication sheet and a digital voice signal processing device. The communication sheet comprises a plurality of communication units and a plurality of function units for a user to click with the digital voice signal processing device. The plurality of function units comprise a whole sentence unit, and the method comprises a method for performing a function of emitting the sound of a whole sentence, which comprises the following steps: receiving sounds of words selected by the user; searching a voice file according to each of the sounds of words; receiving a command generated by the user's clicking the whole sentence unit; and playing voice files in order.

Type: Application

Filed: February 19, 2010

Publication date: February 17, 2011

Inventors: Chih-Kang Yang, Shu-Hua Guo, Kuo-Ping Yang, Ho-Hsin Liao, Chun-Kai Wang, Sin-Chen Lin, Kun-Yi Hua, Ming-Hsiang Cheng, Chih-Long Chang
ENTROPY CODING USING ESCAPE CODES TO SWITCH BETWEEN PLURAL CODE TABLES

Publication number: 20110035225

Abstract: An audio encoder performs adaptive entropy encoding of audio data. For example, an audio encoder switches between variable dimension vector Huffman coding of direct levels of quantized audio data and run-level coding of run lengths and levels of quantized audio data. The encoder can use, for example, context-based arithmetic coding for coding run lengths and levels. The encoder can determine when to switch between coding modes by counting consecutive coefficients having a predominant value (e.g., zero). An audio decoder performs corresponding adaptive entropy decoding.

Type: Application

Filed: October 19, 2010

Publication date: February 10, 2011

Applicant: Microsoft Corporation

Inventors: Sanjeev Mehrotra, Wei-Ge Chen
METHODS AND APPARATUS TO ENHANCE HEALTHCARE INFORMATION ANALYSES

Publication number: 20110029325

Abstract: Methods and apparatus to enhance healthcare information analyses are disclosed herein.

Type: Application

Filed: July 28, 2009

Publication date: February 3, 2011

Applicant: General Electric Company, a New York Corporation

Inventors: Emil Markov Georgiev, Erik Paul Kemper
SPEECH RECOGNITION SYSTEM AND METHOD

Publication number: 20110029316

Abstract: According to the present invention, a method for integrating processes with a multi-faceted human centered interface is provided. The interface is facilitated to implement a hands free, voice driven environment to control processes and applications. A natural language model is used to parse voice initiated commands and data, and to route those voice initiated inputs to the required applications or processes. The use of an intelligent context based parser allows the system to intelligently determine what processes are required to complete a task which is initiated using natural language. A single window environment provides an interface which is comfortable to the user by preventing the occurrence of distracting windows from appearing. The single window has a plurality of facets which allow distinct viewing areas. Each facet has an independent process routing its outputs thereto. As other processes are activated, each facet can reshape itself to bring a new process into one of the viewing areas.

Type: Application

Filed: October 18, 2010

Publication date: February 3, 2011

Applicant: Nuance Communications, Inc.

Inventors: Richard Grant, Pedro E. McGregor
Video Game System with Mixing of Independent Pre-Encoded Digital Audio Bitstreams

Publication number: 20110028215

Abstract: A computer-implemented method of encoding audio includes accessing a plurality of independent audio source streams, each of which includes a sequence of source frames. Respective source frames of each sequence include respective pluralities of pulse-code modulated audio samples. Each of the plurality of independent audio source streams is separately encoded to generate a plurality of independent encoded streams, each of which corresponds to a respective independent audio source stream. The encoding includes, for respective source frames, converting respective pluralities of pulse-code modulated audio samples to respective pluralities of floating-point frequency samples that are divided into a plurality of frequency bands. An instruction to mix the plurality of independent encoded streams is received; in response, respective floating-point frequency samples of the independent encoded streams are combined. An output bitstream is generated that includes the combined respective floating-point frequency samples.

Type: Application

Filed: July 31, 2009

Publication date: February 3, 2011

Inventors: Stefan Herr, Ulrich Sigmund
Visual similarity

Publication number: 20110022394

Abstract: Methods and apparatus, including computer program products, for visual similarity. A method includes receiving a stream of video content, generating interpretations of the received video content using speech/natural language processing (NLP), associating the interpretations of the received video content with images extracted from video content based on timeline, and using the interpretations to obtain interpretations of other images or other video content.

Type: Application

Filed: July 23, 2010

Publication date: January 27, 2011

Inventor: Thomas Wide
METHOD, SYSTEM AND USER INTERFACE FOR AUTOMATICALLY CREATING AN ATMOSPHERE, PARTICULARLY A LIGHTING ATMOSPHERE, BASED ON A KEYWORD INPUT

Publication number: 20110022396

Abstract: The invention relates to the automatic creation of an atmosphere, particularly a lighting atmosphere, based on a keyword input such as a keyword typed or spoken by a user. A basic idea of the invention is to enable a user of an atmosphere creation system such as a lighting system to automatically create a specific atmosphere by simply using a keyword which is input to the system. The keyword, for example “eat”, “read”, “relax”, “sunny”, “cool”, “party”, “Christmas”, “beach”, may be spoken or typed by the user and may enable the user to find and explore numerous atmospheres in an interactive and playful way in embodiments of the invention. Finding atmosphere elements related to the keyword may be done in various ways according to embodiments of the invention. The invention allows also a non expert in designing or creating atmosphere scenes to control the creation of a desired atmosphere in an atmosphere creation system.

Type: Application

Filed: April 21, 2008

Publication date: January 27, 2011

Applicant: KONINKLIJKE PHILIPS ELECTRONICS N.V.

Inventors: Bartel Marinus Van De Sluis, Elmo Marcus Attila Diederiks
VOICE SYNTHESIS AND PROCESSING

Publication number: 20110010179

Abstract: A method and an apparatus for voice synthesis and processing have been presented. In one exemplary method, a first audio recording of a human speech in a natural language is received. Then speech analysis synthesis algorithm is applied to the first audio recording to synthesize a second audio recording from the first audio recording such that the second audio recording sounds humanistic and consistent, but unintelligible.

Type: Application

Filed: July 13, 2009

Publication date: January 13, 2011

Inventor: Devang K. Naik
USE OF MULTIPLE SPEECH RECOGNITION SOFTWARE INSTANCES

Publication number: 20110010170

Abstract: A wireless communication device is disclosed that accepts recorded audio data from an end-user. The audio data can be in the form of a command requesting user action. Likewise, the audio data can be converted into a text file. The audio data is reduced to a digital file in a format that is supported by the device hardware, such as a .wav, .mp3, .vnf file, or the like. The digital file is sent via secured or unsecured wireless communication to one or more server computers for further processing. In accordance with an important aspect of the invention, the system evaluates the confidence level of the of the speech recognition process. If the confidence level is high, the system automatically builds the application command or creates the text file for transmission to the communication device.

Type: Application

Filed: September 17, 2010

Publication date: January 13, 2011

Inventors: Stephen S. Burns, Mickey W. Kowitz
METHOD AND APPARATUS FOR OPERATING A DEVICE IN A VEHICLE WITH A VOICE CONTROLLER

Publication number: 20110007006

Abstract: In a method for operating a device in a vehicle, an operating step is selected and a function associated with the operating step is performed. The operation of selecting the operating step automatically activates a voice controller for input following the operating step. An apparatus for operating a device in a vehicle has a display apparatus for displaying operating steps, a first operating unit for haptic input for selecting an operating step, a voice control unit for voice input for selecting an operating step, and a control unit which is coupled to the display apparatus, the first operating unit and the voice control unit and can be used to generate control signals for performing a function associated with a selected operating step. The control unit is designed in such a manner that the operation of selecting an operating step automatically activates the voice control unit for input following the operating step.

Type: Application

Filed: October 31, 2008

Publication date: January 13, 2011

Inventors: Lorenz Bohrer, Christof Bobzin
Creating Animations

Publication number: 20110007078

Abstract: Animation creation is described, for example, to enable children to create, record and play back stories. In an embodiment, one or more children are able to create animation components such as characters and backgrounds using a multi-touch panel display together with an image capture device. For example, a graphical user interface is provided at the multi-touch panel display to enable the animation components to be edited. In an example, children narrate a story whilst manipulating animation components using the multi-touch display panel and the sound and visual display is recorded. In embodiments image analysis is carried out automatically and used to autonomously modify story components during a narration. In examples, various types of handheld view-finding frames are provided for use with the image capture device. In embodiments saved stories can be restored from memory and retold from any point with different manipulations and narration.

Type: Application

Filed: July 10, 2009

Publication date: January 13, 2011

Applicant: Microsoft Corporation

Inventors: Xiang Cao, John Helmes, Abigail Sellen, Sian Elizabeth Lindley
REMOTE CONTROL OF HOST APPLICATION USING MOTION AND VOICE COMMANDS

Publication number: 20110001699

Abstract: A remote control microdisplay device that uses hand and head movement and voice commands to control the parameters of a field of view for the microdisplay within a larger virtual display area associated with a host application.

Type: Application

Filed: May 5, 2010

Publication date: January 6, 2011

Applicant: Kopin Corporation

Inventors: Jeffrey J. Jacobsen, Christopher Parkinson, Stephen A. Pombo
Medical Code Lookup Interface

Publication number: 20100328235

Abstract: Systems and methods for displaying a graphical user interface within a medical system for medical code lookup, comprising: (1) a first layer displaying at least one anatomical image for selecting at least one anatomical part and display of another layer, and layer selection regions for display of another layer; (2) a second layer displaying a secondary anatomical image including either a selectable cross section image, a selectable contents type image, a three dimensional image, or selectable regions similar to buttons; (3) a third layer displaying selectable classification regions corresponding to a classification group of medical codes relevant to selected sections from the secondary anatomical image; (4) a fourth layer displaying a results code set for selected classification regions, including selectable medical codes from each respective classification group; (5) navigation regions to display other layers; and (6) a medical codes display region for displaying selected codes from the results code set.

Type: Application

Filed: September 25, 2009

Publication date: December 30, 2010

Inventor: Frederick Charles Taute
ENCODING DEVICE, DECODING DEVICE, AND METHOD THEREOF

Publication number: 20100332221

Abstract: It is possible to improve quality of a decoding signal in a band spread for estimating a high band from a low band of a decoding signal. A first layer encoding unit (202) encodes a lower band portion below a predetermined frequency of an input signal so as to generate first layer encoded information. A first layer decoding unit (203) decodes the first layer encoded information so as to generate a first layer demodulated signal. A second layer encoding unit (206) divides a high band portion higher than a predetermined frequency of an input signal into a plurality of sub-bands and estimates each of the sub-bands from the input signal or the first layer decoded signal by using the estimation result of the sub-band adjacent to the lower band side so as to generate second encoded information including the estimation results of the sub-bands.

Type: Application

Filed: March 13, 2009

Publication date: December 30, 2010

Applicant: PANASONIC CORPORATION

Inventors: Tomofumi Yamanashi, Masahiro Oshikiri
ENCODING AND DECODING APPARATUSES FOR HIGH QUALITY MULTI-CHANNEL AUDIO CODEC

Publication number: 20100324915

Abstract: Provided is an encoding apparatus for a High Quality Multi-channel Audio Codec (HQMAC) and a decoding apparatus for the HQMAC. The encoding/decoding apparatuses for the HQMAC may perform a High Quality Multi-channel Audio Codec-Channel Based (HQMAC-CB) encoding or an HQMAC-CB decoding in accordance with characteristics of inputted audio signals to provide compatibility with a lower channel.

Type: Application

Filed: June 23, 2010

Publication date: December 23, 2010

Applicant: Electronic and Telecommunications Research Institute

Inventors: Jeongil Seo, Jae-Hyoun Yoo, Kyeongok Kang
CVSD DECODER STATE UPDATE AFTER PACKET LOSS

Publication number: 20100324911

Abstract: A system and method is described for updating the state of an audio decoder, such as a CVSD decoder, after a packet loss has occurred. In response to the loss of a packet, the system and method encodes audio samples produced by a packet loss concealment (PLC) algorithm and effectively passes the encoded audio samples through the audio decoder in lieu of the contents of the lost packet. This operation brings the state of the audio decoder into better synchronization with the state of a remote audio encoder, thereby reducing or minimizing the degrading effect of the packet loss on the perceived quality of an output audio signal produced by a voice processing system that includes the audio decoder.

Type: Application

Filed: April 7, 2008

Publication date: December 23, 2010

Applicant: BROADCOM CORPORATION

Inventors: Mickael Jougit, Laurent Pilati, Mohammad Zad-Issa
GENERIC ON-CHIP HOMING AND RESIDENT, REAL-TIME BIT EXACT TESTS

Publication number: 20100325302

Abstract: Details of media encoding and decoding devices which support generic homing sequences, and methods for operating such devices are disclosed. The use of generic homing sequences may permit an embodiment of the disclosed invention to support real-time, bit-exact testing of existing and future media encoding and decoding devices. An embodiment of the present invention may permit the initialization of encoding and decoding algorithms to a known state, enabling bit-exact testing of a large group of devices using these algorithms, including those whose specifications do not support such functionality. This capability may permit the full-speed, bit-exact, testing, of both locally and remotely situated media encoders and decoders.

Type: Application

Filed: August 9, 2010

Publication date: December 23, 2010

Applicant: BROADCOM CORPORATION

Inventors: Darwin Rambo, Phil Houghton
ENHANCED TELECOMMUNICATION SYSTEM

Publication number: 20100324884

Abstract: A communication system for dynamically translating a verbal communication from a first language to a second language is disclosed. The system includes a communication device operably connected to a translation module which possesses the capability of converting a verbal communication in the first language into a verbal communication in the second language and a transcript in the second language. The system also includes an operator terminal operably connected to the translation module via a communication link. The operator terminal possesses the capability of generating and transmitting a verbal and/or written communication in the second language to the translation module wherein the verbal and/or written communication is dynamically translated into a verbal communication in the first language.

Type: Application

Filed: June 26, 2007

Publication date: December 23, 2010

Inventor: Therese M. Jeffrey
Method and Apparatus for Configuring Web-based data for Distribution to Users Accessing a Voice Portal System

Publication number: 20100318365

Abstract: In a system for developing and deploying a voice application using Web-based data as source data over a communications network to one or more recipients, a method for organizing, editing, and prioritizing the Web-based data before dialog creation is provided. The method includes harvesting the Web-based data source in the form of its original structure; generating an object tree representing the logical structure and content type of the harvested, Web-based data source; manipulating the object tree generated to a desired hierarchal structure and content; creating a voice application template in VXML and populating the template with the manipulated object tree; and creating a voice application capable of accessing the Web-based data source according to the constraints of the template.

Type: Application

Filed: April 7, 2010

Publication date: December 16, 2010

Applicant: Apptera, Inc.

Inventors: Michael S. Yuen, Leo Chiu
METHOD AND AN APPARATUS FOR PROCESSING A SIGNAL

Publication number: 20100312551

Abstract: Disclosed is a method of processing a signal, which includes receiving at least one of a first signal and a second signal, receiving mode information, and coding the at least one of the first signal and the second signal using at least one of a first coding scheme and a second coding scheme according to the mode information, wherein the mode information is information for indicating that a prescribed mode corresponds to which one of at least three modes.

Type: Application

Filed: October 15, 2008

Publication date: December 9, 2010

Applicants: LG Electronics Inc., Industry-Academic Cooperation Foundation, Yonsei University

Inventors: Hyen-O Oh, Hong Goo Kang, Chang Heon Lee, Sang Wook Shin, Yang Won Jung
NAVIGATION SYSTEM WITH SPEECH PROCESSING MECHANISM AND METHOD OF OPERATION THEREOF

Publication number: 20100312469

Abstract: A method of operation of a navigation system includes: receiving a single utterance of a spoken input; generating a search region from the spoken input with a region language model; and generating a location identifier based on a sub-region search grammar and the search region for displaying on a device.

Type: Application

Filed: June 5, 2009

Publication date: December 9, 2010

Applicant: TELENAV, INC.

Inventor: Hong Chen
AUDIO DEVICE FOR ADHERING TO FOLDABLE APPARATUS

Publication number: 20100305950

Abstract: A first audio playback device securable with a second folding or foldable device that adapts the second device to carry and playback a recorded message, and a second audio play device that is adherable/securable with any item or device to allow audio record/playback via actuation of push buttons or the like. In certain forms, the item or device may be a picture frame or may be a photo print or the like that becomes part of the audio playback device assembly.

Type: Application

Filed: July 22, 2010

Publication date: December 2, 2010

Inventor: Bart Vantieghem
Pitch Or Periodicity Estimation

Publication number: 20100305944

Abstract: A method of estimating a pitch period of a first portion of a signal wherein the first portion overlaps a previous portion. The method comprises computing a first autocorrelation value for part of the first portion not overlapping the previous portion. The method further comprises retrieving a stored second autocorrelation value for part of the first portion overlapping the previous portion, the second autocorrelation value having been computed during estimation of a pitch period of the previous portion. The method further comprises forming a combined autocorrelation value using the first and second autocorrelation values, and selecting the estimated pitch period in dependence on the combined autocorrelation value.

Type: Application

Filed: May 28, 2009

Publication date: December 2, 2010

Applicant: Cambridge Silicon Radio Limited

Inventor: Xuejing Sun
Hybrid Permanent/Reversible Dynamic Range Control System

Publication number: 20100286988

Abstract: A technique for controlling audio dynamic range in a manner that can be permanent, reversible, or anywhere in between, and can accomplish this goal in the baseband PCM or encoded domains.

Type: Application

Filed: May 6, 2010

Publication date: November 11, 2010

Inventors: Tim J. Carroll, Leif Claesson
Method of Pairing a Portable Device with a Communications Module of a Vehicular, Hands-Free Telephone System

Publication number: 20100279612

Abstract: A method of pairing Bluetooth™ enabled devices including a portable phone with a Bluetooth™ communications module of a vehicular, hands-free telephone system includes using vocal communications to prompt an operator of the phone to enter a given PIN number into the phone. The presence of any Bluetooth™ enabled devices within the vicinity of the communications module is searched. Vocal communications are used to prompt the operator to vocally state a name for the phone and to vocally state a pairing priority to be assigned to the phone. If the assigned pairing priority is not assigned to another Bluetooth™ enabled device, then the name and the pairing priority are associated with the phone. Communications between the communications module and the phone are then enabled if the phone has the highest pairing priority amongst all of the Bluetooth™ enabled devices present within the vicinity of the communications module.

Type: Application

Filed: July 16, 2010

Publication date: November 4, 2010

Applicant: LEAR CORPORATION

Inventors: Jody K. Harwood, Jason G. Bauman, Kenan Robert Rudnick
Photo Management Using Expression-Based Voice Commands

Publication number: 20100280829

Abstract: A system and method are provided for photo management using expression-based voice commands. The method interfaces a photo-image discovery device, having no dedicated display, to a display monitor. Expression-based user voice prompt are received and used to access a photo-image in storage at a storage site. The accessed photo-image is then presented on the display monitor. The photo-image in storage at the storage site can be accessed to perform an operation such as: selecting a storage site, selecting a photo-image, transforming a selected photo-image, converting a file format of a selected photo-image, and selecting a delivery option. In one aspect, a menu of photo-image user prompt options are presented on the display monitor, originating from the photo discovery device, and the expression-based user voice prompts are received in response to the presented menu.

Type: Application

Filed: May 5, 2009

Publication date: November 4, 2010

Inventors: Paramesh Gopi, Vinay Ravuri, Dimitry Vaysburg, Prodyut Hazarika
METHOD AND APPARATUS FOR CODING OR DECODING WIDEBAND SPEECH

Publication number: 20100250245

Abstract: A wideband speech coding method comprising identifying whether an input speech signal is a narrowband signal or a wideband signal, and coding the input speech signal by controlling a predetermined parameter of a wideband speech coding process based on the identification result.

Type: Application

Filed: March 31, 2010

Publication date: September 30, 2010

Inventor: Kimio Miseki
METHOD AND APPARATUS FOR CODING OR DECODING WIDEBAND SPEECH

Publication number: 20100250262

Abstract: A wideband speech coding method comprising identifying whether an input speech signal is a narrowband signal or a wideband signal, and coding the input speech signal by controlling a predetermined parameter of a wideband speech coding process based on the identification result.

Type: Application

Filed: March 31, 2010

Publication date: September 30, 2010

Inventor: Kimio MISEKI
SYSTEM AND MEHOD FOR PROVIDING INTERACTIVE CONTENT

Publication number: 20100251283

Abstract: A system and method for providing interactive content are disclosed. In one embodiment, the method comprises receiving, in a vehicle via a wireless broadcast, audiovisual content and user-requestable content, rendering the audiovisual content, receiving user input indicative of request for the user-requestable content, and rendering, in response to receiving the user input, the user-requestable content. The user-requestable content can include, but is not limited to, additional audiovisual content, climate control data, and navigation data.

Type: Application

Filed: March 31, 2009

Publication date: September 30, 2010

Applicant: QUALCOMM Incorporated

Inventor: Allen W. Smith
Service Oriented Speech Recognition for In-Vehicle Automated Interaction and In-Vehicle User Interfaces Requiring Minimal Cognitive Driver Processing for Same

Publication number: 20100250243

Abstract: A system and method for implementing a server-based speech recognition system for multi-modal automated interaction in a vehicle includes receiving, by a vehicle driver, audio prompts by an on-board human-to-machine interface and a response with speech to complete tasks such as creating and sending text messages, web browsing, navigation, etc. This service-oriented architecture is utilized to call upon specialized speech recognizers in an adaptive fashion. The human-to-machine interface enables completion of a text input task while driving a vehicle in a way that minimizes the frequency of the driver's visual and mechanical interactions with the interface, thereby eliminating unsafe distractions during driving conditions. After the initial prompting, the typing task is followed by a computerized verbalization of the text. Subsequent interface steps can be visual in nature, or involve only sound.

Type: Application

Filed: March 23, 2010

Publication date: September 30, 2010

Inventors: Thomas Barton Schalk, Leonel Saenz, Barry Burch
METHOD AND APPARATUS FOR CODING OR DECODING WIDEBAND SPEECH

Publication number: 20100250263

Abstract: A wideband speech coding method comprising identifying whether an input speech signal is a narrowband signal or a wideband signal, and coding the input speech signal by controlling a predetermined parameter of a wideband speech coding process based on the identification result.

Type: Application

Filed: March 31, 2010

Publication date: September 30, 2010

Inventor: Kimio MISEKI
Mechanism for Providing User Guidance and Latency Concealment for Automatic Speech Recognition Systems

Publication number: 20100248786

Abstract: Audio input to a user device is captured in a buffer and played back to the user while being sent to and recognized by an automatic speech recognition (ASR) system. Overlapping the playback with the speech recognition processing masks a portion of the true latency of the ASR system thus improving the user's perception of the ASR system's responsiveness. Further, upon hearing the playback, the user is intuitively guided to self-correct for any defects in the captured audio.

Type: Application

Filed: March 30, 2010

Publication date: September 30, 2010

Inventor: Laurent Charriere
VOICE QUALITY EDIT DEVICE AND VOICE QUALITY EDIT METHOD

Publication number: 20100250257

Abstract: This invention includes: a voice quality feature database (101) holding voice quality features; a speaker attribute database (106) holding, for each voice quality feature, an identifier enabling a user to expect a voice quality of the voice quality feature; a weight setting unit (103) setting a weight for each acoustic feature of a voice quality; a scaling unit (105) calculating display coordinates of each voice quality feature based on the acoustic features in the voice quality feature and the weights set by the weight setting unit (103); a display unit (107) displaying the identifier of each voice quality feature on the calculated display coordinates; a position input unit (108) receiving designated coordinates; and a voice quality mix unit (110) (i) calculating a distance between (1) the received designated coordinates and (2) the display coordinates of each of a part or all of the voice quality features, and (ii) mixing the acoustic features of the part or all of the voice quality features together based

Type: Application

Filed: June 4, 2008

Publication date: September 30, 2010

Inventors: Yoshifumi Hirose, Takahiro Kamai
Low-Complexity Spectral Analysis/Synthesis Using Selectable Time Resolution

Publication number: 20100250265

Abstract: The signal processing is based on the concept of using a time-domain aliased (12, TDA) frame as a basis for time segmentation (14) and spectral analysis (16), performing segmentation in time based on the time-domain aliased frame and performing spectral analysis based on the resulting time segments. The time resolution of the overall ?segmented? time-to-frequency transform can thus be changed by simply adapting the time segmentation to obtain a suitable number of time segments based on which spectral analysis is applied. The overall set of spectral coefficients, obtained for all the segments, provides a selectable time-frequency tiling of the original signal frame.

Type: Application

Filed: August 25, 2008

Publication date: September 30, 2010

Applicant: Telefonaktiebolaget L M Ericsson (publ)

Inventor: Anisse Taleb
SPEECH SIGNAL EVALUATION APPARATUS, STORAGE MEDIUM STORING SPEECH SIGNAL EVALUATION PROGRAM, AND SPEECH SIGNAL EVALUATION METHOD

Publication number: 20100250246

Abstract: A speech signal evaluation apparatus includes: an acquisition unit that acquires, as a first frame, a speech signal of a specified length from speech signals; a first detection unit that detects, on the basis of a speech condition, whether the first frame is voiced or unvoiced; a variation calculation unit that, when the first frame is unvoiced, calculates a variation in a spectrum associated with the first frame on the basis of a spectrum of the first frame and a spectrum of a second frame that is unvoiced and precedes the first frame in time; and a second detection unit that detects, on the basis of a non-stationary condition based on the variation in spectrum, whether the variation of the first frame satisfies the non-stationary condition.

Type: Application

Filed: March 24, 2010

Publication date: September 30, 2010

Applicant: FUJITSU LIMITED

Inventor: Chikako MATSUMOTO
Apparatus and method for encoding and decoding multi-channel signal

Publication number: 20100241436

Abstract: Provided are an encoding apparatus and a decoding apparatus of a multi-channel signal. The encoding apparatus of the multi-channel signal may process a phase parameter associated with phase information between a plurality of channels constituting the multi-channel signal, based on a characteristic of the multi-channel signal. The encoding apparatus may generate an encoded bitstream with respect to the multi-channel signal using the processed phase parameter and a mono signal extracted from the multi-channel signal.

Type: Application

Filed: March 17, 2010

Publication date: September 23, 2010

Applicant: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Jung-Hoe Kim, Eun Mi OH
SYSTEM AND METHOD FOR SHORT-RANGE COMMUNICATION FOR A VEHICLE

Publication number: 20100240307

Abstract: A system for communicating information and/or instructions includes a transmitter and receiver configured for wireless communication with a user interface. The information and/or instructions may be audibly communicated to the user. The system includes voice activation and controls. The wireless communication may utilize Bluetooth communication protocol.

Type: Application

Filed: August 24, 2007

Publication date: September 23, 2010

Inventors: Michael J. Sims, Carl L. Shearer, Steven L. Geerlings, Todd R. Witkowski, Paul S. Vanlente, Ted W. Ringold, James E. Trainor
Apparatus for efficiently mixing narrowband and wideband voice data and a method therefor

Publication number: 20100241435

Abstract: A voice mixing apparatus decodes input encoded narrowband voice data and encoded voice data for narrowband region of input encoded wideband voice data, and detects a speaker in accordance with the decoded voice signals of the entire narrowband. When encoded voice data from a speaker is included in the narrowband, a signal in a region outside the narrowband of the expanded data is encoded. When the data is included in the wideband, encoded voice data of the region outside the narrowband is extracted for output. When the destination terminal is compatible with the encoded narrowband voice data, the narrowband voice signal mixed is encoded and output. When the destination terminal is compatible with wideband, the narrowband voice signal mixed is encoded for the narrowband region, and the voice data of the speaker is used as the encoded voice data for the region outside the narrowband.

Type: Application

Filed: February 3, 2010

Publication date: September 23, 2010

Applicant: OKI ELECTRIC INDUSTRY CO., LTD.

Inventors: Hiromi Aoyagi, Shinji Usuba
SYSTEMS, METHODS, AND SOFTWARE FOR PROVIDING WAYFINDING ORIENTATION AND WAYFINDING DATA TO BLIND TRAVELERS

Publication number: 20100241350

Abstract: To support the independence and mobility of blind pedestrians, the present inventors devised, among other things, free systems, methods, and software for providing narrative blind-ready wayfinding information. One exemplary system receives user input identifying a starting landmark and ending landmark in a particular selected geographic region, such as a city, university campus, government building, shopping mall, or airport. The system then searches a database for the corresponding narrative wayfinding instructions, and outputs them in the form of text or audio to guide a blind pedestrian from the starting landmark to the ending landmark. In the exemplary system, blind users select the geographic region as well as the starting and ending landmark from a voice-driven telephonic menu system and receive audible wayfinding instruction via mobile telephone. In some embodiments, the system also provides access to voice-driven restaurant menus.

Type: Application

Filed: March 18, 2010

Publication date: September 23, 2010

Inventors: Joseph Cioffi, Philip Agee
DIGITAL CAMERA FOR RECORDING STILL IMAGE WITH SPEECH

Publication number: 20100238304

Abstract: An image pickup method includes determining a start timing and an end timing of obtaining the speech to have a photographing timing of the still image taken by the image pickup unit therebetween, in accordance with a period in which the speech stored in the temporary speech storing unit satisfies a predetermined condition, and cutting out the speech stored in the temporary speech storing unit for a period from the start timing to the end timing determined, and storing the cut speech in the storing unit in association with the still image taken by the image pickup unit.

Type: Application

Filed: March 17, 2010

Publication date: September 23, 2010

Applicant: Casio Computer Co., Ltd.

Inventor: Akira MIYATA
Compressed Audio Information

Publication number: 20100238173

Abstract: Compressed entertainment content such as audio or video or both includes additional aspects and operations associated their way. The compressed audio may be used to signal computers such as a telephone or reminder for an appointment. A melody line may be extracted from the audio, or the audio may be used exactly as it is. Another aspect stores traders within the entertainment content such as in MP3. Those traders are used to trigger the system to retrieve other parts of the content to be displayed at the same time that that particular part of the MP3 is being play. The content may include video or text, or maybe links to other content such as broadband content four times sensitive content. Another aspect describes encryption which is keyed to the disk ID to prevent playing oven illegally copied disk. Another aspect reads a specified amount of information then spins down the disk to conserve battery power.

Type: Application

Filed: June 4, 2010

Publication date: September 23, 2010

Inventor: Scott C. Harris
Tagging Video Content

Publication number: 20100235865

Abstract: Areas of a video are marked with information about the areas at the marking. For example, an actor's shoes, and other clothing can be marked. That clothing is selected to get more information about the clothing.

Type: Application

Filed: March 11, 2010

Publication date: September 16, 2010

Applicant: Ubiquity Holdings

Inventors: Connie Jordan, Christopher Carmichael
Accessing Materials Via Voice and a Menu

Publication number: 20100235894

Abstract: A computer implemented method for accessing materials for a meeting may include receiving a call from a meeting participant by a system, wherein the meeting participant calls a prearranged teleconference number to participate in the meeting. The method may also include validating participation of the meeting participant in the meeting by the system. The method may further include providing access to an appropriate set of materials to the meeting participant based on a predetermined attribute associated with the meeting participant.

Type: Application

Filed: March 16, 2009

Publication date: September 16, 2010

Inventors: Lloyd W. Allen, JR., Jana H. Jenkins, Steven M. Miller
DIGITAL AUDIO SIGNAL COMPRESSION METHOD AND APPARATUS

Publication number: 20100235174

Abstract: Compression of audio signal data is described herein. In various embodiments, the compression of each unit of the audio signal data includes the employment of a distribution substantially representative of a subblock of residual data of the unit of audio signal data, to reduce the amount of data having to be transmitted to transmit the unit of audio signal data to a recipient.

Type: Application

Filed: May 24, 2010

Publication date: September 16, 2010

Inventor: Yuriy A. Reznik
Method and Apparatus for Audio Coding

Publication number: 20100223061

Abstract: In accordance with an example embodiment of the present invention, there is provided an apparatus for encoding an audio signal in two or more encoding stages, the audio signal comprising a set of frequency components. The apparatus comprises a frequency component selection unit configured to select a number of frequency components from the set for encoding in a current encoding stage, the selected frequency components being components of the set that have not been encoded to a non-zero value in a preceding encoding stage; and an encoding unit configured to encode at least one of the selected frequency components to a non-zero value using a number of bits less than or equal to a predetermined number of bits allocated for the current encoding stage.

Type: Application

Filed: February 27, 2009

Publication date: September 2, 2010

Applicant: NOKIA CORPORATION

Inventor: Juha Petteri Ojanpera
SYSTEM AND METHOD FOR PROCESSING MULTI-MODAL DEVICE INTERACTIONS IN A NATURAL LANGUAGE VOICE SERVICES ENVIRONMENT

Publication number: 20100217604

Abstract: A system and method for processing multi-modal device interactions in a natural language voice services environment may be provided. In particular, one or more multi-modal device interactions may be received in a natural language voice services environment that includes one or more electronic devices. The multi-modal device interactions may include a non-voice interaction with at least one of the electronic devices or an application associated therewith, and may further include a natural language utterance relating to the non-voice interaction. Context relating to the non-voice interaction and the natural language utterance may be extracted and combined to determine an intent of the multi-modal device interaction, and a request may then be routed to one or more of the electronic devices based on the determined intent of the multi-modal device interaction.

Type: Application

Filed: February 20, 2009

Publication date: August 26, 2010

Applicant: VoiceBox Technologies, Inc.

Inventors: Larry Baldwin, Chris Weider
ADAPTIVE INFORMATION PRESENTATION APPARATUS AND METHODS

Publication number: 20100217657

Abstract: An adaptive information presentation apparatus and associated methods. In one embodiment, the apparatus comprises a computer readable medium having at least one computer program disposed thereon, the at least one program being configured to adaptively present (e.g., display or play out via an audio system) information that is related or in response to inputs provided via an input device such as a for example touch-screen display device. In one variant, the at least one program analyzes user input to determine a context of the input, and selects advertising related to the context for presentation to the user.

Type: Application

Filed: February 24, 2010

Publication date: August 26, 2010

Inventor: Robert F. Gazdzinski

prev … 4 5 6 7 8 9 10 11 12 … next