Patents Examined by Talivaldis Ivars Smit

System and method for obtaining a message type identifier through an in-band modem

Patent number: 8364482

Abstract: A system and method is provided for obtaining a message type identifier embedded in a vocoder packet via a speech codec (in-band) such as found in a wireless communication network. The vocoder packet is received and decoded. The decoded vocoder packet is filtered until a synchronization signal is detected, with the filtering comprising correlating the decoded vocoder packet with a predetermined sequence to generate the synchronization signal. The polarity of the synchronization signal is determined, and the message type identifier is derived based on the polarity of the detected synchronization signal. A first polarity identifies a first message type, and a second polarity identifies a second message type.

Type: Grant

Filed: June 28, 2010

Date of Patent: January 29, 2013

Assignee: QUALCOMM Incorporated

Inventors: Christian Sgraja, Christoph A Joetten, Marc W Werner, Christian Pietsch
Pre-processing and speech codec encoding of ring-back audio signals transmitted over a communication network to a subscriber terminal

Patent number: 8359198

Abstract: A method of pre-processing an audio signal transmitted to a user terminal via a communication network and an apparatus using the method are provided. The method of pre-processing the audio signal may prevent deterioration of a sound quality of the audio signal transmitted to the user terminal by pre-processing the audio signal, and by enabling a codec module, encoding the audio signal, to determine the audio signal as a speech signal. The method of pre-processing may include separating the audio signal into channels, measuring the channel energy for each of the channels, selecting a specific channel energy, and amplifying the specific channel energy. The method may include encoding an audio signal using a speech codec and/or decoding an encoded audio signal using the speech codec.

Type: Grant

Filed: March 21, 2012

Date of Patent: January 22, 2013

Assignee: Intel Corporation

Inventors: Jae Woong Jeong, Seop Hyeong Park, Jong Kyu Ryu
Use of intermediate speech transcription results in editing final speech transcription results

Patent number: 8352261

Abstract: A communication system includes at least one transmitting device and at least one receiving device, one or more network systems for connecting the transmitting device to the receiving device, and an automatic speech recognition (“ASR”) system, including an ASR engine. A user speaks an utterance into the transmitting device, and the recorded speech audio is sent to the ASR engine. The ASR engine returns intermediate transcription results to the transmitting device, which displays the intermediate transcription results in real-time to the user. The intermediate transcription results are also correlated by utterance fragment to final transcription results and displayed to the user. The user may use the information thus presented to make decisions as to whether to edit the final transcription results or to speak the utterance again, thereby repeating the process. The intermediate transcription results may also be used by the user to edit the final transcription results.

Type: Grant

Filed: March 9, 2009

Date of Patent: January 8, 2013

Assignee: Canyon IP Holdings, LLC

Inventors: James Richard Terrell, II, Marc White
Script compliance and agent feedback

Patent number: 8352276

Abstract: Systems and methods are provided for using automatic speech recognition to analyze a voice interaction and verify compliance of an agent reading a script to a client during the voice interaction. In one aspect of the invention, a method may include conducting the voice interaction between the agent and a client, wherein the agent follows the script via a plurality of panels. From there, the voice interaction is evaluated via the plurality of panels employing panel-by-panel playback with an automatic speech recognition component adapted to analyze the voice interaction. As such, it may be determined, via generating a score using confidence level thresholds of an automatic speech recognition component such that confidence level thresholds are assigned to each of the plurality of panels and evaluating the score against at least one of a static standard and a varying standard, whether the agent has adequately followed the script.

Type: Grant

Filed: July 3, 2012

Date of Patent: January 8, 2013

Assignee: West Corporation

Inventors: Mark J. Pettay, Jill M Vacek
Automatic grammar tuning using statistical language model generation

Patent number: 8346555

Abstract: The present invention discloses a speech processing solution that utilizes an original speech recognition grammar in a speech recognition system to perform speech recognition operations for multiple recognition instances. Instance data associated with the recognition operations can be stored. A replacement grammar can be automatically generated from the stored instance data, where the replacement grammar is a statistical language model grammar. The original speech recognition grammar, which can be a grammar-based language model grammar or a statistical language model grammar, can be selectively replaced with the replacement grammar. For example when tested performance for the replacement grammar is better than that for the original grammar, the replacement grammar can replace the original grammar.

Type: Grant

Filed: August 22, 2006

Date of Patent: January 1, 2013

Assignee: Nuance Communications, Inc.

Inventor: Brent D. Metz
Voice activatable system for providing the correct spelling of a spoken word

Patent number: 8346561

Abstract: A voice activatable system for providing the correct spelling of a spoken word is disposed in an elongated body of a writing instrument such as a ball point pen. The system includes a microphone the output of which is fed to an amplifier analog to a digital converter and from there to a speech recognition program, the output of the speech recognition program is fed to a computer, namely a word processor/controller that includes a data base. The output of the speech recognition is compared with the digital library of words and when a match is found, it is amplified and fed to digital to analog connector. The output of the digital/analog computer is fed to a speaker that repeats the word with the correct pronunciation followed by a correct spelling of the word. The system includes a battery for powering the system as well as an on/off switch and a repeat button for repeating information from the system.

Type: Grant

Filed: February 23, 2010

Date of Patent: January 1, 2013

Inventor: Fawzi Q. Behbehani
Method and system for creating natural language understanding grammars

Patent number: 8335690

Abstract: Grammars for interactive voice response systems using natural language understanding can be created using information which is available on websites. These grammars can be created in automated manners and can have various tuning measures applied to obtain optimal results when deployed in a customer contact environment. These grammars can allow a variety of statements to be appropriately handled by the system.

Type: Grant

Filed: January 17, 2012

Date of Patent: December 18, 2012

Assignee: Convergys Customer Management Delaware LLC

Inventors: Dhananjay Bansal, Nancy Gardner, Chang-Quin Shu, Kristie Goss, Matthew Yuschik, Sunil Issar, Woosung Kim, Jayant Naik
Low complexity MPEG encoding for surround sound recordings

Patent number: 8332229

Abstract: The invention provides for the encoding of surround sound produced by any coincident microphone techniques with coincident-to-virtual microphone signal matrixing. An encoding scheme provides significantly lower computational demand, by deriving the spatial parameters and output downmixes from the coincident microphone array signals and the coincident-to-surround channel-coefficients matrix, instead of the multi-channel signals.

Type: Grant

Filed: March 16, 2009

Date of Patent: December 11, 2012

Assignee: STMicroelectronics Asia Pacific Pte. Ltd.

Inventors: Samsudin, Sapna George
Dictionary for textual data compression and decompression

Patent number: 8326604

Abstract: A dictionary for compressing and decompressing textual data has a number of keys. Each key is associated with an identifier. The keys include static word or phrase keys, where each static word or phrase key lists one or more unchanging words in a particular order. The keys further include dynamic phrase keys, where each dynamic phrase key lists a number of words and one or more placeholders in a particular order, and each placeholder denotes a place where a word or phrase other than the words of the dynamic phrase key is to be inserted. At least one of the dynamic phrase keys may identify one or more of the words by identifiers for corresponding static words or phrase keys. At least one of the static word or phrase keys may identify one or more of the words by identifiers for corresponding other static words or phrase keys.

Type: Grant

Filed: April 24, 2008

Date of Patent: December 4, 2012

Assignee: International Business Machines Corporation

Inventors: Umesh Kumar B. Balegar, Rohit Shetty
Dictionary for textual data compression and decompression

Patent number: 8326605

Abstract: A dictionary for compressing and decompressing textual data has a number of keys. Each key is associated with an identifier. The keys include static word or phrase keys, where each static word or phrase key lists one or more unchanging words in a particular order. The keys further include dynamic phrase keys, where each dynamic phrase key lists a number of words and one or more placeholders in a particular order, and each placeholder denotes a place where a word or phrase other than the words of the dynamic phrase key is to be inserted. At least one of the dynamic phrase keys may identify one or more of the words by identifiers for corresponding static words or phrase keys. At least one of the static word or phrase keys may identify one or more of the words by identifiers for corresponding other static words or phrase keys.

Type: Grant

Filed: April 24, 2008

Date of Patent: December 4, 2012

Assignee: International Business Machines Incorporation

Inventors: Umesh Kumar B. Balegar, Rohit Shetty
Time-warping of audio signals for packet loss concealment avoiding audible artifacts

Patent number: 8321216

Abstract: Packet loss concealment (PLC) systems and methods are described that use time-warping to merge a concealment signal generated to replace one or more bad frames of an audio signal with a received signal representing one or more subsequent good frames of the audio signal in a manner that avoids signal discontinuity and audible artifacts resulting therefrom. Prediction-based PLC systems and methods are also described that use time-warping to conceal the loss of one or more frames containing a transition region in a manner that will not result in an audible artifact.

Type: Grant

Filed: February 23, 2010

Date of Patent: November 27, 2012

Assignee: Broadcom Corporation

Inventor: Robert W. Zopf
Text-to-speech method and system, computer program product therefor

Patent number: 8321224

Abstract: A text-to-speech system adapted to operate on text in a first language including sections in a second language, includes a grapheme/phoneme transcriptor for converting the sections in the second language into phonemes of the second language; a mapping module configured for mapping at least part of the phonemes of the second language onto sets of phonemes of the first language; and a speech-synthesis module adapted to be fed with a resulting stream of phonemes including the sets of phonemes of the first language resulting from mapping and the stream of phonemes of the first language representative of the text, and to generate a speech signal from the resulting stream of phonemes.

Type: Grant

Filed: January 10, 2012

Date of Patent: November 27, 2012

Assignee: Loquendo S.p.A.

Inventors: Leonardo Badino, Claudia Barolo, Silvia Quazza
Apparatus and method of generating information on relationship between characters in content

Patent number: 8321203

Abstract: A method of generating information on relationships between characters of a content includes dividing a text extracted from the content into one or more predetermined units, determining one or more dominant relationships between characters of the content by comparing the divided units with relationship keyword information in which keywords contained in categories are defined, wherein the categories represent one or more relationships between the characters, and generating information on the relationships between the characters in accordance with the determined dominant relationships. The dominant relationships are determined by matching the divided units of text to the categories with reference to the relationship keyword information, counting the number of divided units of text corresponding to each of the categories, and determining the relationship represented by the category measured by the highest number of divided units of text.

Type: Grant

Filed: April 21, 2008

Date of Patent: November 27, 2012

Assignee: SAMSUNG Electronics Co., Ltd.

Inventor: Ju-hee Seo
Wideband speech decoding apparatus for producing excitation signal, synthesis filter, lower-band speech signal, and higher-band speech signal, and for decoding coded narrowband speech

Patent number: 8315861

Abstract: A wideband speech decoding apparatus has means for producing an excitation signal from coded data, means for producing a synthesis filter, and means for decoding a speech signal from the excitation signal and the synthesis filter. The wideband speech decoding apparatus comprises acquisition means for acquiring identification information which identifies the speech signal to be decoded is narrowband. The wideband speech decoding apparatus further comprises control means for controlling decoding means based on the identification information.

Type: Grant

Filed: March 12, 2012

Date of Patent: November 20, 2012

Assignee: Kabushiki Kaisha Toshiba

Inventor: Kimio Miseki
Voice user interface authoring tool

Patent number: 8315874

Abstract: A voice user interface authoring tool is configured to use categorized example caller responses, from which callflow paths, automatic speech recognition, and natural language processing control files can be generated automatically within a single, integrated authoring user interface. A voice user interface (VUI) design component allows an author to create an application incorporating various types of action nodes, including Prompt/Response Processing (PRP) nodes. At runtime, the system uses the information from each PRP node to prompt a user to say something, and to process the user's response in order to extract its meaning. An Automatic Speech Recognition/Natural Language Processing (ASR/NLP) Control Design component allows the author to associate sample inputs with each possible meaning, and automatically generates the necessary ASR and NLP runtime control files.

Type: Grant

Filed: April 11, 2006

Date of Patent: November 20, 2012

Assignee: Microsoft Corporation

Inventors: William F. Barton, Michelle S. Spina, David G. Ollason, Julian J. Odell
Text creating and editing method and computer-readable storage medium with dynamic data loading

Patent number: 8311802

Abstract: Embodiments consistent with the invention include a method of creating a document on a computing device and a computer-readable storage medium. The method includes: receiving input text in the computing device to initiate a document creation process, the computing device including a first portion of font data for a first language, the first portion including less than all of the font data for the first language; based on the input text, determining whether the first portion is sufficient to create the document on the computing device; loading a second portion of the font data to the computing device from a data storage location if the first portion is not sufficient; and creating the document using at least one of the first portion and the second portion.

Type: Grant

Filed: October 3, 2011

Date of Patent: November 13, 2012

Assignee: VeriSign, Inc.

Inventor: Devendra Kalra
Method for speech recognition using partitioned vocabulary

Patent number: 8306820

Abstract: A is recognized using a predefinable vocabulary that is partitioned in sections of phonetically similar words. In a recognition process, first oral input is associated with one of the sections, then the oral input is determined from the vocabulary of the associated section.

Type: Grant

Filed: October 4, 2005

Date of Patent: November 6, 2012

Assignee: Siemens Aktiengesellschaft

Inventor: Niels Kunstmann
Enhanced automatic speech recognition using mapping between unsupervised and supervised speech model parameters trained on same acoustic training data

Patent number: 8306819

Abstract: Techniques for enhanced automatic speech recognition are described. An enhanced ASR system may be operative to generate an error correction function. The error correction function may represent a mapping between a supervised set of parameters and an unsupervised training set of parameters generated using a same set of acoustic training data, and apply the error correction function to an unsupervised testing set of parameters to form a corrected set of parameters used to perform speaker adaptation. Other embodiments are described and claimed.

Type: Grant

Filed: March 9, 2009

Date of Patent: November 6, 2012

Assignee: Microsoft Corporation

Inventors: Chaojun Liu, Yifan Gong
Method and apparatus to encode/decode low bit-rate audio signal by approximiating high frequency envelope with strongly correlated low frequency codevectors

Patent number: 8301439

Abstract: A method of encoding a low bit-rate audio signal includes quantizing and encoding a plurality of low frequency sub-bands of an audio signal in a frequency domain, generating a codebook of codevectors using sub-bands of the audio signal spectrum, detecting an envelope of another frequency sub-band of the audio signal and quantizing and losslessly-encoding the detected envelope, selecting a codevector most similar to the higher frequency sub-band spectrum from the generated codebook's codevectors and determining its codebook codevector index, and generating a bit stream.

Type: Grant

Filed: July 12, 2006

Date of Patent: October 30, 2012

Assignee: Samsung Electronics Co., Ltd

Inventors: Junghoe Kim, Eunmi Oh, Boris Kudryashov, Konstantin Osipov
System and method for applying dynamic contextual grammars and language models to improve automatic speech recognition accuracy

Patent number: 8301448

Abstract: The invention involves the loading and unloading of dynamic section grammars and language models in a speech recognition system. The values of the sections of the structured document are either determined in advance from a collection of documents of the same domain, document type, and speaker; or collected incrementally from documents of the same domain, document type, and speaker; or added incrementally to an already existing set of values. Speech recognition in the context of the given field is constrained to the contents of these dynamic values. If speech recognition fails or produces a poor match within this grammar or section language model, speech recognition against a larger, more general vocabulary that is not constrained to the given section is performed.

Type: Grant

Filed: March 29, 2006

Date of Patent: October 30, 2012

Assignee: Nuance Communications, Inc.

Inventors: Alwin B. Carus, Larissa Lapshina, Raghu Vemula

prev … 2 3 4 5 6 7 8 9 10 … next