Patents Examined by Talivaldis Ivars Smit
  • Patent number: 8364482
    Abstract: A system and method is provided for obtaining a message type identifier embedded in a vocoder packet via a speech codec (in-band) such as found in a wireless communication network. The vocoder packet is received and decoded. The decoded vocoder packet is filtered until a synchronization signal is detected, with the filtering comprising correlating the decoded vocoder packet with a predetermined sequence to generate the synchronization signal. The polarity of the synchronization signal is determined, and the message type identifier is derived based on the polarity of the detected synchronization signal. A first polarity identifies a first message type, and a second polarity identifies a second message type.
    Type: Grant
    Filed: June 28, 2010
    Date of Patent: January 29, 2013
    Assignee: QUALCOMM Incorporated
    Inventors: Christian Sgraja, Christoph A Joetten, Marc W Werner, Christian Pietsch
  • Patent number: 8359198
    Abstract: A method of pre-processing an audio signal transmitted to a user terminal via a communication network and an apparatus using the method are provided. The method of pre-processing the audio signal may prevent deterioration of a sound quality of the audio signal transmitted to the user terminal by pre-processing the audio signal, and by enabling a codec module, encoding the audio signal, to determine the audio signal as a speech signal. The method of pre-processing may include separating the audio signal into channels, measuring the channel energy for each of the channels, selecting a specific channel energy, and amplifying the specific channel energy. The method may include encoding an audio signal using a speech codec and/or decoding an encoded audio signal using the speech codec.
    Type: Grant
    Filed: March 21, 2012
    Date of Patent: January 22, 2013
    Assignee: Intel Corporation
    Inventors: Jae Woong Jeong, Seop Hyeong Park, Jong Kyu Ryu
  • Patent number: 8352261
    Abstract: A communication system includes at least one transmitting device and at least one receiving device, one or more network systems for connecting the transmitting device to the receiving device, and an automatic speech recognition (“ASR”) system, including an ASR engine. A user speaks an utterance into the transmitting device, and the recorded speech audio is sent to the ASR engine. The ASR engine returns intermediate transcription results to the transmitting device, which displays the intermediate transcription results in real-time to the user. The intermediate transcription results are also correlated by utterance fragment to final transcription results and displayed to the user. The user may use the information thus presented to make decisions as to whether to edit the final transcription results or to speak the utterance again, thereby repeating the process. The intermediate transcription results may also be used by the user to edit the final transcription results.
    Type: Grant
    Filed: March 9, 2009
    Date of Patent: January 8, 2013
    Assignee: Canyon IP Holdings, LLC
    Inventors: James Richard Terrell, II, Marc White
  • Patent number: 8352276
    Abstract: Systems and methods are provided for using automatic speech recognition to analyze a voice interaction and verify compliance of an agent reading a script to a client during the voice interaction. In one aspect of the invention, a method may include conducting the voice interaction between the agent and a client, wherein the agent follows the script via a plurality of panels. From there, the voice interaction is evaluated via the plurality of panels employing panel-by-panel playback with an automatic speech recognition component adapted to analyze the voice interaction. As such, it may be determined, via generating a score using confidence level thresholds of an automatic speech recognition component such that confidence level thresholds are assigned to each of the plurality of panels and evaluating the score against at least one of a static standard and a varying standard, whether the agent has adequately followed the script.
    Type: Grant
    Filed: July 3, 2012
    Date of Patent: January 8, 2013
    Assignee: West Corporation
    Inventors: Mark J. Pettay, Jill M Vacek
  • Patent number: 8346555
    Abstract: The present invention discloses a speech processing solution that utilizes an original speech recognition grammar in a speech recognition system to perform speech recognition operations for multiple recognition instances. Instance data associated with the recognition operations can be stored. A replacement grammar can be automatically generated from the stored instance data, where the replacement grammar is a statistical language model grammar. The original speech recognition grammar, which can be a grammar-based language model grammar or a statistical language model grammar, can be selectively replaced with the replacement grammar. For example when tested performance for the replacement grammar is better than that for the original grammar, the replacement grammar can replace the original grammar.
    Type: Grant
    Filed: August 22, 2006
    Date of Patent: January 1, 2013
    Assignee: Nuance Communications, Inc.
    Inventor: Brent D. Metz
  • Patent number: 8346561
    Abstract: A voice activatable system for providing the correct spelling of a spoken word is disposed in an elongated body of a writing instrument such as a ball point pen. The system includes a microphone the output of which is fed to an amplifier analog to a digital converter and from there to a speech recognition program, the output of the speech recognition program is fed to a computer, namely a word processor/controller that includes a data base. The output of the speech recognition is compared with the digital library of words and when a match is found, it is amplified and fed to digital to analog connector. The output of the digital/analog computer is fed to a speaker that repeats the word with the correct pronunciation followed by a correct spelling of the word. The system includes a battery for powering the system as well as an on/off switch and a repeat button for repeating information from the system.
    Type: Grant
    Filed: February 23, 2010
    Date of Patent: January 1, 2013
    Inventor: Fawzi Q. Behbehani
  • Patent number: 8335690
    Abstract: Grammars for interactive voice response systems using natural language understanding can be created using information which is available on websites. These grammars can be created in automated manners and can have various tuning measures applied to obtain optimal results when deployed in a customer contact environment. These grammars can allow a variety of statements to be appropriately handled by the system.
    Type: Grant
    Filed: January 17, 2012
    Date of Patent: December 18, 2012
    Assignee: Convergys Customer Management Delaware LLC
    Inventors: Dhananjay Bansal, Nancy Gardner, Chang-Quin Shu, Kristie Goss, Matthew Yuschik, Sunil Issar, Woosung Kim, Jayant Naik
  • Patent number: 8332229
    Abstract: The invention provides for the encoding of surround sound produced by any coincident microphone techniques with coincident-to-virtual microphone signal matrixing. An encoding scheme provides significantly lower computational demand, by deriving the spatial parameters and output downmixes from the coincident microphone array signals and the coincident-to-surround channel-coefficients matrix, instead of the multi-channel signals.
    Type: Grant
    Filed: March 16, 2009
    Date of Patent: December 11, 2012
    Assignee: STMicroelectronics Asia Pacific Pte. Ltd.
    Inventors: Samsudin, Sapna George
  • Patent number: 8326604
    Abstract: A dictionary for compressing and decompressing textual data has a number of keys. Each key is associated with an identifier. The keys include static word or phrase keys, where each static word or phrase key lists one or more unchanging words in a particular order. The keys further include dynamic phrase keys, where each dynamic phrase key lists a number of words and one or more placeholders in a particular order, and each placeholder denotes a place where a word or phrase other than the words of the dynamic phrase key is to be inserted. At least one of the dynamic phrase keys may identify one or more of the words by identifiers for corresponding static words or phrase keys. At least one of the static word or phrase keys may identify one or more of the words by identifiers for corresponding other static words or phrase keys.
    Type: Grant
    Filed: April 24, 2008
    Date of Patent: December 4, 2012
    Assignee: International Business Machines Corporation
    Inventors: Umesh Kumar B. Balegar, Rohit Shetty
  • Patent number: 8326605
    Abstract: A dictionary for compressing and decompressing textual data has a number of keys. Each key is associated with an identifier. The keys include static word or phrase keys, where each static word or phrase key lists one or more unchanging words in a particular order. The keys further include dynamic phrase keys, where each dynamic phrase key lists a number of words and one or more placeholders in a particular order, and each placeholder denotes a place where a word or phrase other than the words of the dynamic phrase key is to be inserted. At least one of the dynamic phrase keys may identify one or more of the words by identifiers for corresponding static words or phrase keys. At least one of the static word or phrase keys may identify one or more of the words by identifiers for corresponding other static words or phrase keys.
    Type: Grant
    Filed: April 24, 2008
    Date of Patent: December 4, 2012
    Assignee: International Business Machines Incorporation
    Inventors: Umesh Kumar B. Balegar, Rohit Shetty
  • Patent number: 8321216
    Abstract: Packet loss concealment (PLC) systems and methods are described that use time-warping to merge a concealment signal generated to replace one or more bad frames of an audio signal with a received signal representing one or more subsequent good frames of the audio signal in a manner that avoids signal discontinuity and audible artifacts resulting therefrom. Prediction-based PLC systems and methods are also described that use time-warping to conceal the loss of one or more frames containing a transition region in a manner that will not result in an audible artifact.
    Type: Grant
    Filed: February 23, 2010
    Date of Patent: November 27, 2012
    Assignee: Broadcom Corporation
    Inventor: Robert W. Zopf
  • Patent number: 8321224
    Abstract: A text-to-speech system adapted to operate on text in a first language including sections in a second language, includes a grapheme/phoneme transcriptor for converting the sections in the second language into phonemes of the second language; a mapping module configured for mapping at least part of the phonemes of the second language onto sets of phonemes of the first language; and a speech-synthesis module adapted to be fed with a resulting stream of phonemes including the sets of phonemes of the first language resulting from mapping and the stream of phonemes of the first language representative of the text, and to generate a speech signal from the resulting stream of phonemes.
    Type: Grant
    Filed: January 10, 2012
    Date of Patent: November 27, 2012
    Assignee: Loquendo S.p.A.
    Inventors: Leonardo Badino, Claudia Barolo, Silvia Quazza
  • Patent number: 8321203
    Abstract: A method of generating information on relationships between characters of a content includes dividing a text extracted from the content into one or more predetermined units, determining one or more dominant relationships between characters of the content by comparing the divided units with relationship keyword information in which keywords contained in categories are defined, wherein the categories represent one or more relationships between the characters, and generating information on the relationships between the characters in accordance with the determined dominant relationships. The dominant relationships are determined by matching the divided units of text to the categories with reference to the relationship keyword information, counting the number of divided units of text corresponding to each of the categories, and determining the relationship represented by the category measured by the highest number of divided units of text.
    Type: Grant
    Filed: April 21, 2008
    Date of Patent: November 27, 2012
    Assignee: SAMSUNG Electronics Co., Ltd.
    Inventor: Ju-hee Seo
  • Patent number: 8315861
    Abstract: A wideband speech decoding apparatus has means for producing an excitation signal from coded data, means for producing a synthesis filter, and means for decoding a speech signal from the excitation signal and the synthesis filter. The wideband speech decoding apparatus comprises acquisition means for acquiring identification information which identifies the speech signal to be decoded is narrowband. The wideband speech decoding apparatus further comprises control means for controlling decoding means based on the identification information.
    Type: Grant
    Filed: March 12, 2012
    Date of Patent: November 20, 2012
    Assignee: Kabushiki Kaisha Toshiba
    Inventor: Kimio Miseki
  • Patent number: 8315874
    Abstract: A voice user interface authoring tool is configured to use categorized example caller responses, from which callflow paths, automatic speech recognition, and natural language processing control files can be generated automatically within a single, integrated authoring user interface. A voice user interface (VUI) design component allows an author to create an application incorporating various types of action nodes, including Prompt/Response Processing (PRP) nodes. At runtime, the system uses the information from each PRP node to prompt a user to say something, and to process the user's response in order to extract its meaning. An Automatic Speech Recognition/Natural Language Processing (ASR/NLP) Control Design component allows the author to associate sample inputs with each possible meaning, and automatically generates the necessary ASR and NLP runtime control files.
    Type: Grant
    Filed: April 11, 2006
    Date of Patent: November 20, 2012
    Assignee: Microsoft Corporation
    Inventors: William F. Barton, Michelle S. Spina, David G. Ollason, Julian J. Odell
  • Patent number: 8311802
    Abstract: Embodiments consistent with the invention include a method of creating a document on a computing device and a computer-readable storage medium. The method includes: receiving input text in the computing device to initiate a document creation process, the computing device including a first portion of font data for a first language, the first portion including less than all of the font data for the first language; based on the input text, determining whether the first portion is sufficient to create the document on the computing device; loading a second portion of the font data to the computing device from a data storage location if the first portion is not sufficient; and creating the document using at least one of the first portion and the second portion.
    Type: Grant
    Filed: October 3, 2011
    Date of Patent: November 13, 2012
    Assignee: VeriSign, Inc.
    Inventor: Devendra Kalra
  • Patent number: 8306820
    Abstract: A is recognized using a predefinable vocabulary that is partitioned in sections of phonetically similar words. In a recognition process, first oral input is associated with one of the sections, then the oral input is determined from the vocabulary of the associated section.
    Type: Grant
    Filed: October 4, 2005
    Date of Patent: November 6, 2012
    Assignee: Siemens Aktiengesellschaft
    Inventor: Niels Kunstmann
  • Patent number: 8306819
    Abstract: Techniques for enhanced automatic speech recognition are described. An enhanced ASR system may be operative to generate an error correction function. The error correction function may represent a mapping between a supervised set of parameters and an unsupervised training set of parameters generated using a same set of acoustic training data, and apply the error correction function to an unsupervised testing set of parameters to form a corrected set of parameters used to perform speaker adaptation. Other embodiments are described and claimed.
    Type: Grant
    Filed: March 9, 2009
    Date of Patent: November 6, 2012
    Assignee: Microsoft Corporation
    Inventors: Chaojun Liu, Yifan Gong
  • Patent number: 8301439
    Abstract: A method of encoding a low bit-rate audio signal includes quantizing and encoding a plurality of low frequency sub-bands of an audio signal in a frequency domain, generating a codebook of codevectors using sub-bands of the audio signal spectrum, detecting an envelope of another frequency sub-band of the audio signal and quantizing and losslessly-encoding the detected envelope, selecting a codevector most similar to the higher frequency sub-band spectrum from the generated codebook's codevectors and determining its codebook codevector index, and generating a bit stream.
    Type: Grant
    Filed: July 12, 2006
    Date of Patent: October 30, 2012
    Assignee: Samsung Electronics Co., Ltd
    Inventors: Junghoe Kim, Eunmi Oh, Boris Kudryashov, Konstantin Osipov
  • Patent number: 8301448
    Abstract: The invention involves the loading and unloading of dynamic section grammars and language models in a speech recognition system. The values of the sections of the structured document are either determined in advance from a collection of documents of the same domain, document type, and speaker; or collected incrementally from documents of the same domain, document type, and speaker; or added incrementally to an already existing set of values. Speech recognition in the context of the given field is constrained to the contents of these dynamic values. If speech recognition fails or produces a poor match within this grammar or section language model, speech recognition against a larger, more general vocabulary that is not constrained to the given section is performed.
    Type: Grant
    Filed: March 29, 2006
    Date of Patent: October 30, 2012
    Assignee: Nuance Communications, Inc.
    Inventors: Alwin B. Carus, Larissa Lapshina, Raghu Vemula