Modification Of At Least One Characteristic Of Speech Waves (epo) Patents (Class 704/E21.001)
  • Publication number: 20110046959
    Abstract: The present disclosure relates to various methods and systems to provide substitute sound (e.g., audio). One claim includes an apparatus comprising: electronic memory for storing identifying information obtained from steganographically encoded sound; an electronic processor programmed for: providing the identifying information to a remote computer, the remote computer including substitute sound corresponding to the identifying information; providing format information to the remote computer, the format information identifying a format in which the substitute sound should be formatted prior to communication of the substitute sound; and controlling receipt of substitute sound corresponding to the identifying information. Of course, other apparatus, methods and combinations are provided as well.
    Type: Application
    Filed: August 10, 2010
    Publication date: February 24, 2011
    Inventors: Douglas B. Evans, William Y. Conwell
  • Publication number: 20110046946
    Abstract: Provided is an encoder which can decode a high-quality stereo signal while keeping the amount of information in the bit allocation information to a minimum when a scalable coding technique is used for a stereo signal. In the encoder, a principal component analysis (PCA) converter (101) PCA converts the left signal and the right signal of the stereo signal and generates the main signal of the first layer and the sub-signal of the first layer. In the first layer to the M-th layer (where M is a natural number, 2 or greater), an adaptive residual encoder (102-m) (where m is a natural number from 1 to M) compares the importance of the main signal of the m-th layer and the importance of the sub-signal of the m-th layer, selects the signal having the higher importance, encodes the selected signal, and generates the encoded data of the m-th layer.
    Type: Application
    Filed: May 29, 2009
    Publication date: February 24, 2011
    Applicant: PANASONIC CORPORATION
    Inventors: Zongxian Liu, Kok Seng Chong
  • Publication number: 20110046945
    Abstract: Embodiments of the invention provides a method and device for assigning bitrates to a plurality of channels in a scalable audio encoding/truncation process. Different bitrates are assigned to different channels in the scalable audio encoding/truncation process.
    Type: Application
    Filed: January 31, 2008
    Publication date: February 24, 2011
    Applicant: AGENCY FOR SCIENCE, TECHNOLOGY AND RESEARCH
    Inventors: Te Li, Susanto Rahardja, Haibin Huang
  • Publication number: 20110040565
    Abstract: A method and a system for voice communication, especially for a user who has voice or speaking problems, are disclosed. The method requires a communication sheet and a digital voice signal processing device. The communication sheet comprises a plurality of communication units and a plurality of function units for a user to click with the digital voice signal processing device. The plurality of function units comprise a whole sentence unit, and the method comprises a method for performing a function of emitting the sound of a whole sentence, which comprises the following steps: receiving sounds of words selected by the user; searching a voice file according to each of the sounds of words; receiving a command generated by the user's clicking the whole sentence unit; and playing voice files in order.
    Type: Application
    Filed: February 19, 2010
    Publication date: February 17, 2011
    Inventors: Chih-Kang Yang, Shu-Hua Guo, Kuo-Ping Yang, Ho-Hsin Liao, Chun-Kai Wang, Sin-Chen Lin, Kun-Yi Hua, Ming-Hsiang Cheng, Chih-Long Chang
  • Publication number: 20110035225
    Abstract: An audio encoder performs adaptive entropy encoding of audio data. For example, an audio encoder switches between variable dimension vector Huffman coding of direct levels of quantized audio data and run-level coding of run lengths and levels of quantized audio data. The encoder can use, for example, context-based arithmetic coding for coding run lengths and levels. The encoder can determine when to switch between coding modes by counting consecutive coefficients having a predominant value (e.g., zero). An audio decoder performs corresponding adaptive entropy decoding.
    Type: Application
    Filed: October 19, 2010
    Publication date: February 10, 2011
    Applicant: Microsoft Corporation
    Inventors: Sanjeev Mehrotra, Wei-Ge Chen
  • Publication number: 20110029325
    Abstract: Methods and apparatus to enhance healthcare information analyses are disclosed herein.
    Type: Application
    Filed: July 28, 2009
    Publication date: February 3, 2011
    Applicant: General Electric Company, a New York Corporation
    Inventors: Emil Markov Georgiev, Erik Paul Kemper
  • Publication number: 20110029316
    Abstract: According to the present invention, a method for integrating processes with a multi-faceted human centered interface is provided. The interface is facilitated to implement a hands free, voice driven environment to control processes and applications. A natural language model is used to parse voice initiated commands and data, and to route those voice initiated inputs to the required applications or processes. The use of an intelligent context based parser allows the system to intelligently determine what processes are required to complete a task which is initiated using natural language. A single window environment provides an interface which is comfortable to the user by preventing the occurrence of distracting windows from appearing. The single window has a plurality of facets which allow distinct viewing areas. Each facet has an independent process routing its outputs thereto. As other processes are activated, each facet can reshape itself to bring a new process into one of the viewing areas.
    Type: Application
    Filed: October 18, 2010
    Publication date: February 3, 2011
    Applicant: Nuance Communications, Inc.
    Inventors: Richard Grant, Pedro E. McGregor
  • Publication number: 20110028215
    Abstract: A computer-implemented method of encoding audio includes accessing a plurality of independent audio source streams, each of which includes a sequence of source frames. Respective source frames of each sequence include respective pluralities of pulse-code modulated audio samples. Each of the plurality of independent audio source streams is separately encoded to generate a plurality of independent encoded streams, each of which corresponds to a respective independent audio source stream. The encoding includes, for respective source frames, converting respective pluralities of pulse-code modulated audio samples to respective pluralities of floating-point frequency samples that are divided into a plurality of frequency bands. An instruction to mix the plurality of independent encoded streams is received; in response, respective floating-point frequency samples of the independent encoded streams are combined. An output bitstream is generated that includes the combined respective floating-point frequency samples.
    Type: Application
    Filed: July 31, 2009
    Publication date: February 3, 2011
    Inventors: Stefan Herr, Ulrich Sigmund
  • Publication number: 20110022394
    Abstract: Methods and apparatus, including computer program products, for visual similarity. A method includes receiving a stream of video content, generating interpretations of the received video content using speech/natural language processing (NLP), associating the interpretations of the received video content with images extracted from video content based on timeline, and using the interpretations to obtain interpretations of other images or other video content.
    Type: Application
    Filed: July 23, 2010
    Publication date: January 27, 2011
    Inventor: Thomas Wide
  • Publication number: 20110022396
    Abstract: The invention relates to the automatic creation of an atmosphere, particularly a lighting atmosphere, based on a keyword input such as a keyword typed or spoken by a user. A basic idea of the invention is to enable a user of an atmosphere creation system such as a lighting system to automatically create a specific atmosphere by simply using a keyword which is input to the system. The keyword, for example “eat”, “read”, “relax”, “sunny”, “cool”, “party”, “Christmas”, “beach”, may be spoken or typed by the user and may enable the user to find and explore numerous atmospheres in an interactive and playful way in embodiments of the invention. Finding atmosphere elements related to the keyword may be done in various ways according to embodiments of the invention. The invention allows also a non expert in designing or creating atmosphere scenes to control the creation of a desired atmosphere in an atmosphere creation system.
    Type: Application
    Filed: April 21, 2008
    Publication date: January 27, 2011
    Applicant: KONINKLIJKE PHILIPS ELECTRONICS N.V.
    Inventors: Bartel Marinus Van De Sluis, Elmo Marcus Attila Diederiks
  • Publication number: 20110010179
    Abstract: A method and an apparatus for voice synthesis and processing have been presented. In one exemplary method, a first audio recording of a human speech in a natural language is received. Then speech analysis synthesis algorithm is applied to the first audio recording to synthesize a second audio recording from the first audio recording such that the second audio recording sounds humanistic and consistent, but unintelligible.
    Type: Application
    Filed: July 13, 2009
    Publication date: January 13, 2011
    Inventor: Devang K. Naik
  • Publication number: 20110010170
    Abstract: A wireless communication device is disclosed that accepts recorded audio data from an end-user. The audio data can be in the form of a command requesting user action. Likewise, the audio data can be converted into a text file. The audio data is reduced to a digital file in a format that is supported by the device hardware, such as a .wav, .mp3, .vnf file, or the like. The digital file is sent via secured or unsecured wireless communication to one or more server computers for further processing. In accordance with an important aspect of the invention, the system evaluates the confidence level of the of the speech recognition process. If the confidence level is high, the system automatically builds the application command or creates the text file for transmission to the communication device.
    Type: Application
    Filed: September 17, 2010
    Publication date: January 13, 2011
    Inventors: Stephen S. Burns, Mickey W. Kowitz
  • Publication number: 20110007006
    Abstract: In a method for operating a device in a vehicle, an operating step is selected and a function associated with the operating step is performed. The operation of selecting the operating step automatically activates a voice controller for input following the operating step. An apparatus for operating a device in a vehicle has a display apparatus for displaying operating steps, a first operating unit for haptic input for selecting an operating step, a voice control unit for voice input for selecting an operating step, and a control unit which is coupled to the display apparatus, the first operating unit and the voice control unit and can be used to generate control signals for performing a function associated with a selected operating step. The control unit is designed in such a manner that the operation of selecting an operating step automatically activates the voice control unit for input following the operating step.
    Type: Application
    Filed: October 31, 2008
    Publication date: January 13, 2011
    Inventors: Lorenz Bohrer, Christof Bobzin
  • Publication number: 20110007078
    Abstract: Animation creation is described, for example, to enable children to create, record and play back stories. In an embodiment, one or more children are able to create animation components such as characters and backgrounds using a multi-touch panel display together with an image capture device. For example, a graphical user interface is provided at the multi-touch panel display to enable the animation components to be edited. In an example, children narrate a story whilst manipulating animation components using the multi-touch display panel and the sound and visual display is recorded. In embodiments image analysis is carried out automatically and used to autonomously modify story components during a narration. In examples, various types of handheld view-finding frames are provided for use with the image capture device. In embodiments saved stories can be restored from memory and retold from any point with different manipulations and narration.
    Type: Application
    Filed: July 10, 2009
    Publication date: January 13, 2011
    Applicant: Microsoft Corporation
    Inventors: Xiang Cao, John Helmes, Abigail Sellen, Sian Elizabeth Lindley
  • Publication number: 20110001699
    Abstract: A remote control microdisplay device that uses hand and head movement and voice commands to control the parameters of a field of view for the microdisplay within a larger virtual display area associated with a host application.
    Type: Application
    Filed: May 5, 2010
    Publication date: January 6, 2011
    Applicant: Kopin Corporation
    Inventors: Jeffrey J. Jacobsen, Christopher Parkinson, Stephen A. Pombo
  • Publication number: 20100328235
    Abstract: Systems and methods for displaying a graphical user interface within a medical system for medical code lookup, comprising: (1) a first layer displaying at least one anatomical image for selecting at least one anatomical part and display of another layer, and layer selection regions for display of another layer; (2) a second layer displaying a secondary anatomical image including either a selectable cross section image, a selectable contents type image, a three dimensional image, or selectable regions similar to buttons; (3) a third layer displaying selectable classification regions corresponding to a classification group of medical codes relevant to selected sections from the secondary anatomical image; (4) a fourth layer displaying a results code set for selected classification regions, including selectable medical codes from each respective classification group; (5) navigation regions to display other layers; and (6) a medical codes display region for displaying selected codes from the results code set.
    Type: Application
    Filed: September 25, 2009
    Publication date: December 30, 2010
    Inventor: Frederick Charles Taute
  • Publication number: 20100332221
    Abstract: It is possible to improve quality of a decoding signal in a band spread for estimating a high band from a low band of a decoding signal. A first layer encoding unit (202) encodes a lower band portion below a predetermined frequency of an input signal so as to generate first layer encoded information. A first layer decoding unit (203) decodes the first layer encoded information so as to generate a first layer demodulated signal. A second layer encoding unit (206) divides a high band portion higher than a predetermined frequency of an input signal into a plurality of sub-bands and estimates each of the sub-bands from the input signal or the first layer decoded signal by using the estimation result of the sub-band adjacent to the lower band side so as to generate second encoded information including the estimation results of the sub-bands.
    Type: Application
    Filed: March 13, 2009
    Publication date: December 30, 2010
    Applicant: PANASONIC CORPORATION
    Inventors: Tomofumi Yamanashi, Masahiro Oshikiri
  • Publication number: 20100324915
    Abstract: Provided is an encoding apparatus for a High Quality Multi-channel Audio Codec (HQMAC) and a decoding apparatus for the HQMAC. The encoding/decoding apparatuses for the HQMAC may perform a High Quality Multi-channel Audio Codec-Channel Based (HQMAC-CB) encoding or an HQMAC-CB decoding in accordance with characteristics of inputted audio signals to provide compatibility with a lower channel.
    Type: Application
    Filed: June 23, 2010
    Publication date: December 23, 2010
    Applicant: Electronic and Telecommunications Research Institute
    Inventors: Jeongil Seo, Jae-Hyoun Yoo, Kyeongok Kang
  • Publication number: 20100324911
    Abstract: A system and method is described for updating the state of an audio decoder, such as a CVSD decoder, after a packet loss has occurred. In response to the loss of a packet, the system and method encodes audio samples produced by a packet loss concealment (PLC) algorithm and effectively passes the encoded audio samples through the audio decoder in lieu of the contents of the lost packet. This operation brings the state of the audio decoder into better synchronization with the state of a remote audio encoder, thereby reducing or minimizing the degrading effect of the packet loss on the perceived quality of an output audio signal produced by a voice processing system that includes the audio decoder.
    Type: Application
    Filed: April 7, 2008
    Publication date: December 23, 2010
    Applicant: BROADCOM CORPORATION
    Inventors: Mickael Jougit, Laurent Pilati, Mohammad Zad-Issa
  • Publication number: 20100325302
    Abstract: Details of media encoding and decoding devices which support generic homing sequences, and methods for operating such devices are disclosed. The use of generic homing sequences may permit an embodiment of the disclosed invention to support real-time, bit-exact testing of existing and future media encoding and decoding devices. An embodiment of the present invention may permit the initialization of encoding and decoding algorithms to a known state, enabling bit-exact testing of a large group of devices using these algorithms, including those whose specifications do not support such functionality. This capability may permit the full-speed, bit-exact, testing, of both locally and remotely situated media encoders and decoders.
    Type: Application
    Filed: August 9, 2010
    Publication date: December 23, 2010
    Applicant: BROADCOM CORPORATION
    Inventors: Darwin Rambo, Phil Houghton
  • Publication number: 20100324884
    Abstract: A communication system for dynamically translating a verbal communication from a first language to a second language is disclosed. The system includes a communication device operably connected to a translation module which possesses the capability of converting a verbal communication in the first language into a verbal communication in the second language and a transcript in the second language. The system also includes an operator terminal operably connected to the translation module via a communication link. The operator terminal possesses the capability of generating and transmitting a verbal and/or written communication in the second language to the translation module wherein the verbal and/or written communication is dynamically translated into a verbal communication in the first language.
    Type: Application
    Filed: June 26, 2007
    Publication date: December 23, 2010
    Inventor: Therese M. Jeffrey
  • Publication number: 20100318365
    Abstract: In a system for developing and deploying a voice application using Web-based data as source data over a communications network to one or more recipients, a method for organizing, editing, and prioritizing the Web-based data before dialog creation is provided. The method includes harvesting the Web-based data source in the form of its original structure; generating an object tree representing the logical structure and content type of the harvested, Web-based data source; manipulating the object tree generated to a desired hierarchal structure and content; creating a voice application template in VXML and populating the template with the manipulated object tree; and creating a voice application capable of accessing the Web-based data source according to the constraints of the template.
    Type: Application
    Filed: April 7, 2010
    Publication date: December 16, 2010
    Applicant: Apptera, Inc.
    Inventors: Michael S. Yuen, Leo Chiu
  • Publication number: 20100312551
    Abstract: Disclosed is a method of processing a signal, which includes receiving at least one of a first signal and a second signal, receiving mode information, and coding the at least one of the first signal and the second signal using at least one of a first coding scheme and a second coding scheme according to the mode information, wherein the mode information is information for indicating that a prescribed mode corresponds to which one of at least three modes.
    Type: Application
    Filed: October 15, 2008
    Publication date: December 9, 2010
    Applicants: LG Electronics Inc., Industry-Academic Cooperation Foundation, Yonsei University
    Inventors: Hyen-O Oh, Hong Goo Kang, Chang Heon Lee, Sang Wook Shin, Yang Won Jung
  • Publication number: 20100312469
    Abstract: A method of operation of a navigation system includes: receiving a single utterance of a spoken input; generating a search region from the spoken input with a region language model; and generating a location identifier based on a sub-region search grammar and the search region for displaying on a device.
    Type: Application
    Filed: June 5, 2009
    Publication date: December 9, 2010
    Applicant: TELENAV, INC.
    Inventor: Hong Chen
  • Publication number: 20100305950
    Abstract: A first audio playback device securable with a second folding or foldable device that adapts the second device to carry and playback a recorded message, and a second audio play device that is adherable/securable with any item or device to allow audio record/playback via actuation of push buttons or the like. In certain forms, the item or device may be a picture frame or may be a photo print or the like that becomes part of the audio playback device assembly.
    Type: Application
    Filed: July 22, 2010
    Publication date: December 2, 2010
    Inventor: Bart Vantieghem
  • Publication number: 20100305944
    Abstract: A method of estimating a pitch period of a first portion of a signal wherein the first portion overlaps a previous portion. The method comprises computing a first autocorrelation value for part of the first portion not overlapping the previous portion. The method further comprises retrieving a stored second autocorrelation value for part of the first portion overlapping the previous portion, the second autocorrelation value having been computed during estimation of a pitch period of the previous portion. The method further comprises forming a combined autocorrelation value using the first and second autocorrelation values, and selecting the estimated pitch period in dependence on the combined autocorrelation value.
    Type: Application
    Filed: May 28, 2009
    Publication date: December 2, 2010
    Applicant: Cambridge Silicon Radio Limited
    Inventor: Xuejing Sun
  • Publication number: 20100286988
    Abstract: A technique for controlling audio dynamic range in a manner that can be permanent, reversible, or anywhere in between, and can accomplish this goal in the baseband PCM or encoded domains.
    Type: Application
    Filed: May 6, 2010
    Publication date: November 11, 2010
    Inventors: Tim J. Carroll, Leif Claesson
  • Publication number: 20100279612
    Abstract: A method of pairing Bluetooth™ enabled devices including a portable phone with a Bluetooth™ communications module of a vehicular, hands-free telephone system includes using vocal communications to prompt an operator of the phone to enter a given PIN number into the phone. The presence of any Bluetooth™ enabled devices within the vicinity of the communications module is searched. Vocal communications are used to prompt the operator to vocally state a name for the phone and to vocally state a pairing priority to be assigned to the phone. If the assigned pairing priority is not assigned to another Bluetooth™ enabled device, then the name and the pairing priority are associated with the phone. Communications between the communications module and the phone are then enabled if the phone has the highest pairing priority amongst all of the Bluetooth™ enabled devices present within the vicinity of the communications module.
    Type: Application
    Filed: July 16, 2010
    Publication date: November 4, 2010
    Applicant: LEAR CORPORATION
    Inventors: Jody K. Harwood, Jason G. Bauman, Kenan Robert Rudnick
  • Publication number: 20100280829
    Abstract: A system and method are provided for photo management using expression-based voice commands. The method interfaces a photo-image discovery device, having no dedicated display, to a display monitor. Expression-based user voice prompt are received and used to access a photo-image in storage at a storage site. The accessed photo-image is then presented on the display monitor. The photo-image in storage at the storage site can be accessed to perform an operation such as: selecting a storage site, selecting a photo-image, transforming a selected photo-image, converting a file format of a selected photo-image, and selecting a delivery option. In one aspect, a menu of photo-image user prompt options are presented on the display monitor, originating from the photo discovery device, and the expression-based user voice prompts are received in response to the presented menu.
    Type: Application
    Filed: May 5, 2009
    Publication date: November 4, 2010
    Inventors: Paramesh Gopi, Vinay Ravuri, Dimitry Vaysburg, Prodyut Hazarika
  • Publication number: 20100250245
    Abstract: A wideband speech coding method comprising identifying whether an input speech signal is a narrowband signal or a wideband signal, and coding the input speech signal by controlling a predetermined parameter of a wideband speech coding process based on the identification result.
    Type: Application
    Filed: March 31, 2010
    Publication date: September 30, 2010
    Inventor: Kimio Miseki
  • Publication number: 20100250262
    Abstract: A wideband speech coding method comprising identifying whether an input speech signal is a narrowband signal or a wideband signal, and coding the input speech signal by controlling a predetermined parameter of a wideband speech coding process based on the identification result.
    Type: Application
    Filed: March 31, 2010
    Publication date: September 30, 2010
    Inventor: Kimio MISEKI
  • Publication number: 20100251283
    Abstract: A system and method for providing interactive content are disclosed. In one embodiment, the method comprises receiving, in a vehicle via a wireless broadcast, audiovisual content and user-requestable content, rendering the audiovisual content, receiving user input indicative of request for the user-requestable content, and rendering, in response to receiving the user input, the user-requestable content. The user-requestable content can include, but is not limited to, additional audiovisual content, climate control data, and navigation data.
    Type: Application
    Filed: March 31, 2009
    Publication date: September 30, 2010
    Applicant: QUALCOMM Incorporated
    Inventor: Allen W. Smith
  • Publication number: 20100250243
    Abstract: A system and method for implementing a server-based speech recognition system for multi-modal automated interaction in a vehicle includes receiving, by a vehicle driver, audio prompts by an on-board human-to-machine interface and a response with speech to complete tasks such as creating and sending text messages, web browsing, navigation, etc. This service-oriented architecture is utilized to call upon specialized speech recognizers in an adaptive fashion. The human-to-machine interface enables completion of a text input task while driving a vehicle in a way that minimizes the frequency of the driver's visual and mechanical interactions with the interface, thereby eliminating unsafe distractions during driving conditions. After the initial prompting, the typing task is followed by a computerized verbalization of the text. Subsequent interface steps can be visual in nature, or involve only sound.
    Type: Application
    Filed: March 23, 2010
    Publication date: September 30, 2010
    Inventors: Thomas Barton Schalk, Leonel Saenz, Barry Burch
  • Publication number: 20100250263
    Abstract: A wideband speech coding method comprising identifying whether an input speech signal is a narrowband signal or a wideband signal, and coding the input speech signal by controlling a predetermined parameter of a wideband speech coding process based on the identification result.
    Type: Application
    Filed: March 31, 2010
    Publication date: September 30, 2010
    Inventor: Kimio MISEKI
  • Publication number: 20100248786
    Abstract: Audio input to a user device is captured in a buffer and played back to the user while being sent to and recognized by an automatic speech recognition (ASR) system. Overlapping the playback with the speech recognition processing masks a portion of the true latency of the ASR system thus improving the user's perception of the ASR system's responsiveness. Further, upon hearing the playback, the user is intuitively guided to self-correct for any defects in the captured audio.
    Type: Application
    Filed: March 30, 2010
    Publication date: September 30, 2010
    Inventor: Laurent Charriere
  • Publication number: 20100250257
    Abstract: This invention includes: a voice quality feature database (101) holding voice quality features; a speaker attribute database (106) holding, for each voice quality feature, an identifier enabling a user to expect a voice quality of the voice quality feature; a weight setting unit (103) setting a weight for each acoustic feature of a voice quality; a scaling unit (105) calculating display coordinates of each voice quality feature based on the acoustic features in the voice quality feature and the weights set by the weight setting unit (103); a display unit (107) displaying the identifier of each voice quality feature on the calculated display coordinates; a position input unit (108) receiving designated coordinates; and a voice quality mix unit (110) (i) calculating a distance between (1) the received designated coordinates and (2) the display coordinates of each of a part or all of the voice quality features, and (ii) mixing the acoustic features of the part or all of the voice quality features together based
    Type: Application
    Filed: June 4, 2008
    Publication date: September 30, 2010
    Inventors: Yoshifumi Hirose, Takahiro Kamai
  • Publication number: 20100250265
    Abstract: The signal processing is based on the concept of using a time-domain aliased (12, TDA) frame as a basis for time segmentation (14) and spectral analysis (16), performing segmentation in time based on the time-domain aliased frame and performing spectral analysis based on the resulting time segments. The time resolution of the overall ?segmented? time-to-frequency transform can thus be changed by simply adapting the time segmentation to obtain a suitable number of time segments based on which spectral analysis is applied. The overall set of spectral coefficients, obtained for all the segments, provides a selectable time-frequency tiling of the original signal frame.
    Type: Application
    Filed: August 25, 2008
    Publication date: September 30, 2010
    Applicant: Telefonaktiebolaget L M Ericsson (publ)
    Inventor: Anisse Taleb
  • Publication number: 20100250246
    Abstract: A speech signal evaluation apparatus includes: an acquisition unit that acquires, as a first frame, a speech signal of a specified length from speech signals; a first detection unit that detects, on the basis of a speech condition, whether the first frame is voiced or unvoiced; a variation calculation unit that, when the first frame is unvoiced, calculates a variation in a spectrum associated with the first frame on the basis of a spectrum of the first frame and a spectrum of a second frame that is unvoiced and precedes the first frame in time; and a second detection unit that detects, on the basis of a non-stationary condition based on the variation in spectrum, whether the variation of the first frame satisfies the non-stationary condition.
    Type: Application
    Filed: March 24, 2010
    Publication date: September 30, 2010
    Applicant: FUJITSU LIMITED
    Inventor: Chikako MATSUMOTO
  • Publication number: 20100241436
    Abstract: Provided are an encoding apparatus and a decoding apparatus of a multi-channel signal. The encoding apparatus of the multi-channel signal may process a phase parameter associated with phase information between a plurality of channels constituting the multi-channel signal, based on a characteristic of the multi-channel signal. The encoding apparatus may generate an encoded bitstream with respect to the multi-channel signal using the processed phase parameter and a mono signal extracted from the multi-channel signal.
    Type: Application
    Filed: March 17, 2010
    Publication date: September 23, 2010
    Applicant: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Jung-Hoe Kim, Eun Mi OH
  • Publication number: 20100240307
    Abstract: A system for communicating information and/or instructions includes a transmitter and receiver configured for wireless communication with a user interface. The information and/or instructions may be audibly communicated to the user. The system includes voice activation and controls. The wireless communication may utilize Bluetooth communication protocol.
    Type: Application
    Filed: August 24, 2007
    Publication date: September 23, 2010
    Inventors: Michael J. Sims, Carl L. Shearer, Steven L. Geerlings, Todd R. Witkowski, Paul S. Vanlente, Ted W. Ringold, James E. Trainor
  • Publication number: 20100241435
    Abstract: A voice mixing apparatus decodes input encoded narrowband voice data and encoded voice data for narrowband region of input encoded wideband voice data, and detects a speaker in accordance with the decoded voice signals of the entire narrowband. When encoded voice data from a speaker is included in the narrowband, a signal in a region outside the narrowband of the expanded data is encoded. When the data is included in the wideband, encoded voice data of the region outside the narrowband is extracted for output. When the destination terminal is compatible with the encoded narrowband voice data, the narrowband voice signal mixed is encoded and output. When the destination terminal is compatible with wideband, the narrowband voice signal mixed is encoded for the narrowband region, and the voice data of the speaker is used as the encoded voice data for the region outside the narrowband.
    Type: Application
    Filed: February 3, 2010
    Publication date: September 23, 2010
    Applicant: OKI ELECTRIC INDUSTRY CO., LTD.
    Inventors: Hiromi Aoyagi, Shinji Usuba
  • Publication number: 20100241350
    Abstract: To support the independence and mobility of blind pedestrians, the present inventors devised, among other things, free systems, methods, and software for providing narrative blind-ready wayfinding information. One exemplary system receives user input identifying a starting landmark and ending landmark in a particular selected geographic region, such as a city, university campus, government building, shopping mall, or airport. The system then searches a database for the corresponding narrative wayfinding instructions, and outputs them in the form of text or audio to guide a blind pedestrian from the starting landmark to the ending landmark. In the exemplary system, blind users select the geographic region as well as the starting and ending landmark from a voice-driven telephonic menu system and receive audible wayfinding instruction via mobile telephone. In some embodiments, the system also provides access to voice-driven restaurant menus.
    Type: Application
    Filed: March 18, 2010
    Publication date: September 23, 2010
    Inventors: Joseph Cioffi, Philip Agee
  • Publication number: 20100238304
    Abstract: An image pickup method includes determining a start timing and an end timing of obtaining the speech to have a photographing timing of the still image taken by the image pickup unit therebetween, in accordance with a period in which the speech stored in the temporary speech storing unit satisfies a predetermined condition, and cutting out the speech stored in the temporary speech storing unit for a period from the start timing to the end timing determined, and storing the cut speech in the storing unit in association with the still image taken by the image pickup unit.
    Type: Application
    Filed: March 17, 2010
    Publication date: September 23, 2010
    Applicant: Casio Computer Co., Ltd.
    Inventor: Akira MIYATA
  • Publication number: 20100238173
    Abstract: Compressed entertainment content such as audio or video or both includes additional aspects and operations associated their way. The compressed audio may be used to signal computers such as a telephone or reminder for an appointment. A melody line may be extracted from the audio, or the audio may be used exactly as it is. Another aspect stores traders within the entertainment content such as in MP3. Those traders are used to trigger the system to retrieve other parts of the content to be displayed at the same time that that particular part of the MP3 is being play. The content may include video or text, or maybe links to other content such as broadband content four times sensitive content. Another aspect describes encryption which is keyed to the disk ID to prevent playing oven illegally copied disk. Another aspect reads a specified amount of information then spins down the disk to conserve battery power.
    Type: Application
    Filed: June 4, 2010
    Publication date: September 23, 2010
    Inventor: Scott C. Harris
  • Publication number: 20100235865
    Abstract: Areas of a video are marked with information about the areas at the marking. For example, an actor's shoes, and other clothing can be marked. That clothing is selected to get more information about the clothing.
    Type: Application
    Filed: March 11, 2010
    Publication date: September 16, 2010
    Applicant: Ubiquity Holdings
    Inventors: Connie Jordan, Christopher Carmichael
  • Publication number: 20100235894
    Abstract: A computer implemented method for accessing materials for a meeting may include receiving a call from a meeting participant by a system, wherein the meeting participant calls a prearranged teleconference number to participate in the meeting. The method may also include validating participation of the meeting participant in the meeting by the system. The method may further include providing access to an appropriate set of materials to the meeting participant based on a predetermined attribute associated with the meeting participant.
    Type: Application
    Filed: March 16, 2009
    Publication date: September 16, 2010
    Inventors: Lloyd W. Allen, JR., Jana H. Jenkins, Steven M. Miller
  • Publication number: 20100235174
    Abstract: Compression of audio signal data is described herein. In various embodiments, the compression of each unit of the audio signal data includes the employment of a distribution substantially representative of a subblock of residual data of the unit of audio signal data, to reduce the amount of data having to be transmitted to transmit the unit of audio signal data to a recipient.
    Type: Application
    Filed: May 24, 2010
    Publication date: September 16, 2010
    Inventor: Yuriy A. Reznik
  • Publication number: 20100223061
    Abstract: In accordance with an example embodiment of the present invention, there is provided an apparatus for encoding an audio signal in two or more encoding stages, the audio signal comprising a set of frequency components. The apparatus comprises a frequency component selection unit configured to select a number of frequency components from the set for encoding in a current encoding stage, the selected frequency components being components of the set that have not been encoded to a non-zero value in a preceding encoding stage; and an encoding unit configured to encode at least one of the selected frequency components to a non-zero value using a number of bits less than or equal to a predetermined number of bits allocated for the current encoding stage.
    Type: Application
    Filed: February 27, 2009
    Publication date: September 2, 2010
    Applicant: NOKIA CORPORATION
    Inventor: Juha Petteri Ojanpera
  • Publication number: 20100217604
    Abstract: A system and method for processing multi-modal device interactions in a natural language voice services environment may be provided. In particular, one or more multi-modal device interactions may be received in a natural language voice services environment that includes one or more electronic devices. The multi-modal device interactions may include a non-voice interaction with at least one of the electronic devices or an application associated therewith, and may further include a natural language utterance relating to the non-voice interaction. Context relating to the non-voice interaction and the natural language utterance may be extracted and combined to determine an intent of the multi-modal device interaction, and a request may then be routed to one or more of the electronic devices based on the determined intent of the multi-modal device interaction.
    Type: Application
    Filed: February 20, 2009
    Publication date: August 26, 2010
    Applicant: VoiceBox Technologies, Inc.
    Inventors: Larry Baldwin, Chris Weider
  • Publication number: 20100217657
    Abstract: An adaptive information presentation apparatus and associated methods. In one embodiment, the apparatus comprises a computer readable medium having at least one computer program disposed thereon, the at least one program being configured to adaptively present (e.g., display or play out via an audio system) information that is related or in response to inputs provided via an input device such as a for example touch-screen display device. In one variant, the at least one program analyzes user input to determine a context of the input, and selects advertising related to the context for presentation to the user.
    Type: Application
    Filed: February 24, 2010
    Publication date: August 26, 2010
    Inventor: Robert F. Gazdzinski