Word Boundary Detection (epo) Patents (Class 704/E15.006)
  • Patent number: 8350683
    Abstract: A voice acquisition system for a vehicle includes an interior rearview mirror assembly attached at an inner portion of the windshield of a vehicle equipped with the interior rearview mirror assembly. The interior rearview mirror assembly includes at least two microphones for receiving audio signals within a cabin of the vehicle and generating an output indicative of the audio signals. A control is in the vehicle and is responsive to the output from the at least one microphone. The control at least partially distinguishes vocal signals from non-vocal signals present in the output. The at least two microphones provide sound capture for at least one of a hands free cell phone system, an audio recording system and a wireless communication system.
    Type: Grant
    Filed: August 15, 2011
    Date of Patent: January 8, 2013
    Assignee: Donnelly Corporation
    Inventors: Jonathan E. DeLine, Niall R. Lynam, Ralph A. Spooner, Phillip A. March
  • Publication number: 20110218798
    Abstract: Techniques implemented as systems, methods, and apparatuses, including computer program products, for obfuscating sensitive content in an audio source representative of an interaction between a contact center caller and a contact center agent. The techniques include performing, by an analysis engine of a contact center system, a context-sensitive content analysis of the audio source to identify each audio source segment that includes content determined by the analysis engine to be sensitive content based on its context; and processing, by an obfuscation engine of the contact center system, one or more identified audio source segments to generate corresponding altered audio source segments each including obfuscated sensitive content.
    Type: Application
    Filed: March 5, 2010
    Publication date: September 8, 2011
    Applicant: Nexdia Inc.
    Inventor: Marsal Gavalda
  • Patent number: 8004392
    Abstract: A voice acquisition system for a vehicle includes an interior rearview mirror assembly. The mirror assembly may include a microphone for receiving audio signals within a cabin of the vehicle and generating an output indicative of these audio signals. The microphone may provide sound capture for a hands free cell phone system, an audio recording system and/or an emergency communication system. The system may include a control that is responsive to the output from the microphone and that distinguishes vocal signals from non-vocal signals present in the output. The microphone may provide sound capture for at least one accessory of the equipped vehicle, and the accessory may be responsive to a vocal signal captured by the microphone. The interior rearview mirror assembly may include at least one accessory, such as an antenna, a video device, a security system status indicator, a tire pressure indicator display and/or a loudspeaker.
    Type: Grant
    Filed: December 19, 2008
    Date of Patent: August 23, 2011
    Assignee: Donnelly Corporation
    Inventors: Jonathan E. DeLine, Niall R. Lynam, Ralph A. Spooner, Phillip A. March
  • Publication number: 20110166858
    Abstract: A method for recognizing speech involves presenting an utterance to a speech recognition system and determining, via the speech recognition system, that the utterance contains a particular expression, where the particular expression is capable of being associated with at least two different meanings. The method further involves splitting the utterance into a plurality of speech frames, where each frame is assigned a predetermined time segment and a frame number, and indexing the utterance to i) a predetermined frame number, or ii) a predetermined time segment. The indexing of the utterance identifies that one of the frames includes the particular expression. Then the frame including the particular expression is re-presented to the speech recognition system to verify that the particular expression was actually recited in the utterance.
    Type: Application
    Filed: January 6, 2010
    Publication date: July 7, 2011
    Applicant: GENERAL MOTORS LLC
    Inventor: Uma Arun
  • Publication number: 20110029311
    Abstract: There is provided a voice processing device. The device includes: score calculation unit configured to calculate a score indicating compatibility of a voice signal input on the basis of an utterance of a user with each of plural pieces of intention information indicating each of a plurality of intentions; intention selection unit configured to select the intention information indicating the intention of the utterance of the user among the plural pieces of intention information on the basis of the score calculated by the score calculation unit; and intention reliability calculation unit configured to calculate the reliability with respect to the intention information selected by the intention selection unit on the basis of the score calculated by the score calculation unit.
    Type: Application
    Filed: June 17, 2010
    Publication date: February 3, 2011
    Applicant: Sony Corporation
    Inventors: Katsuki MINAMINO, Hitoshi Honda, Yoshinori Maeda, Hiroaki Ogawa
  • Patent number: 7742914
    Abstract: A method of reducing noise in an audio signal, comprising the steps of: using a furrow filter to select spectral components that are narrow in frequency but relatively broad in time; using a bar filter to select spectral components that are broad in frequency but relatively narrow in time; analyzing the relative energy distribution between the output of the furrow and bar filters to determine the optimal proportion of spectral components for the output signal; and reconstructing the audio signal to generate the output signal. A second pair of time-frequency filters may be used to further improve intelligibility of the output signal. The temporal relationship between the furrow filter output and the bar filter output may be monitored so that the fricative components are allowed primarily at boundaries between intervals with no voiced signal present and intervals with voice components. A noise reduction system for an audio signal.
    Type: Grant
    Filed: March 7, 2005
    Date of Patent: June 22, 2010
    Inventors: Daniel A. Kosek, Robert Crawford Maher
  • Publication number: 20090240496
    Abstract: According to one aspect of the invention, a speech recognizer includes: an audio data acquiring portion configured to acquire audio data via a microphone; a speech section detecting portion configured to detect a talking start time and a talking end time based on the audio data; a spoken word identifying portion configured to identify the audio in a speech section from the talking start time to the talking end time; and a noise suppressing portion configured to suppress a generation of a noise from an electrical noise source for the speech section.
    Type: Application
    Filed: March 16, 2009
    Publication date: September 24, 2009
    Applicant: KABUSHIKI KAISHA TOSHIBA
    Inventors: Daisuke Yamamoto, Hiroshi Sugiyama, Toshiyuki Koga, Kaoru Suzuki
  • Publication number: 20090063150
    Abstract: Sentence boundaries in noisy conversational transcription data are automatically identified. Noise and transcription symbols are removed, and a training set is formed with sentence boundaries marked based on long silences or on manual markings in the transcribed data. Frequencies of head and tail n-grams that occur at the beginning and ending of sentences are determined from the training set. N-grams that occur a significant number of times in the middle of sentences in relation to their occurrences at the beginning or ending of sentences are filtered out. A boundary is marked before every head n-gram and after every tail n-gram occurring in the conversational data and remaining after filtering. Turns are identified. A boundary is marked after each turn, unless the turn ends with an impermissible tail word or is an incomplete turn. The marked boundaries in the conversational data identify sentence boundaries.
    Type: Application
    Filed: August 27, 2007
    Publication date: March 5, 2009
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Tetsuya Nasukawa, Diwakar Punjani, Shourya Roy, L. Venkata Subramaniam, Hironori Takeuchi
  • Publication number: 20080172222
    Abstract: A very low bit rate communication system. In preferred embodiments, an off-the-shelf module is adapted to convert a speaker's voice to text. A processor is provided to separate the text into individual words. The processor is programmed with a dictionary which provides pre-assigned specific 14-bit numeric values to each word in the dictionary (words used more frequently may be assigned shorter codes). The processor creates a numeric stream from 14-bit numeric values and this numeric stream is then transmitted to a receiver. Typical speech contains 4 words/second, so bit rates as low as 50 bits/second may be achieved with this technique. At the receiving end, the stream of received 14-bit numeric values, representing the speaker's words, are looked up in a dictionary identical to that at the transmitting end and the text of the words reconstructed. Text-to-speech techniques common to the industry are then used to regenerate the speech.
    Type: Application
    Filed: January 12, 2007
    Publication date: July 17, 2008
    Inventor: Paul Johnson
  • Publication number: 20080133239
    Abstract: Provided are an apparatus and method for recognizing continuous speech using search space restriction based on phoneme recognition. In the apparatus and method, a search space can be primarily reduced by restricting connection words to be shifted at a boundary between words based on the phoneme recognition result. In addition, the search space can be secondarily reduced by rapidly calculating a degree of similarity between the connection word to be shifted and the phoneme recognition result using a phoneme code and shifting the corresponding phonemes to only connection words having degrees of similarity equal to or higher than a predetermined reference value. Therefore, the speed and performance of the speech recognition process can be improved in various speech recognition services.
    Type: Application
    Filed: December 4, 2007
    Publication date: June 5, 2008
    Inventors: Hyung Bae Jeon, Jun Park, Seung Hi Kim, Kyu Woong Hwang