Patents Examined by Greg A Borsetti
  • Patent number: 7904290
    Abstract: Disclosed are a method, a system and a computer program for translating an application simulation into a plurality of languages. The method comprises the steps of creating a first simulation having a sequence of frames, in a first language; adding elements in said first language to said sequence of frames; and creating a second simulation having a sequence of frames in a second language. The method comprises the further steps of exporting said elements including frame sequence number, position of each of said elements, and settings to a document; translating said elements in said document into said second language; and automatically placing the translated elements on said frames of said second simulation, using said sequence numbers, said position, and said settings.
    Type: Grant
    Filed: December 17, 2004
    Date of Patent: March 8, 2011
    Assignee: International Business Machines Corporation
    Inventor: Bradley K. Wells
  • Patent number: 7904297
    Abstract: Representation-neutral dialogue systems and methods (“RNDS”) are described that include multi-application, multi-device spoken-language dialogue systems based on the information-state update approach. The RNDS includes representation-neutral core components of a dialogue system that provide scripted domain-specific extensions to routines such as dialogue move modeling and reference resolution, easy substitution of specific semantic representations and associated routines, and clean interfaces to external components for language-understanding (i.e., speech-recognition and parsing) and language-generation, and to domain-specific knowledge sources. The RNDS also resolves multi-device dialogue by evaluating and selecting among candidate dialogue moves based on features at multiple levels. Multiple sources of information are combined, multiple speech recognition and parsing hypotheses tested, and multiple device and moves considered to choose the highest scoring hypothesis overall.
    Type: Grant
    Filed: December 8, 2005
    Date of Patent: March 8, 2011
    Assignee: Robert Bosch GmbH
    Inventors: Danilo Mirkovic, Lawrence Cavedon, Matthew Purver, Florin Ratiu, Tobias Scheideck, Fuliang Weng, Qi Zhang, Kui Xu
  • Patent number: 7904296
    Abstract: An approach to wordspotting (180) using query data from one or more spoken instance of a query (140). The query data is processed to determining a representation of the query (160) that defines multiple sequences of subword (130) units each representing the query. Then putative instances of the query (190) are located in input data from an audio signal using the determined representation of the query.
    Type: Grant
    Filed: July 22, 2004
    Date of Patent: March 8, 2011
    Assignee: Nexidia Inc.
    Inventor: Robert W. Morris
  • Patent number: 7899761
    Abstract: Disclosed herein are a system and method for trend prediction of signals in a time series using a Markov model. The method includes receiving a plurality of data series and input parameters, where the input parameters include a time step parameter, preprocessing the plurality of data series according to the input parameters, to form binned and classified data series, and processing the binned and classified data series. The processing includes initializing a Markov model for trend prediction, and training the Markov model for trend prediction of the binned and classified data series to form a trained Markov model. The method further includes deploying the trained Markov model for trend prediction, including outputting trend predictions. The method develops an architecture for the Markov model from the data series and the input parameters, and disposes the Markov model, having the architecture, for trend prediction.
    Type: Grant
    Filed: April 25, 2005
    Date of Patent: March 1, 2011
    Assignee: GM Global Technology Operations LLC
    Inventors: Shubha Kadambe, Leandro G. Barajas, Youngkwan Cho, Pulak Bandyopadhyay
  • Patent number: 7877255
    Abstract: A method for automatic speech recognition includes determining for an input signal a plurality scores representative of certainties that the input signal is associated with corresponding states of a speech recognition model, using the speech recognition model and the determined scores to compute an average signal, computing a difference value representative of a difference between the input signal and the average signal, and processing the input signal in accordance with the difference value.
    Type: Grant
    Filed: March 31, 2006
    Date of Patent: January 25, 2011
    Assignee: Voice Signal Technologies, Inc.
    Inventor: Igor Zlokarnik
  • Patent number: 7869999
    Abstract: A system and method for generating synthetic speech, which operates in a computer implemented Text-To-Speech system. The system comprises at least a speaker database that has been previously created from user recordings, a Front-End system to receive an input text and a Text-To-Speech engine. The Front-End system generates multiple phonetic transcriptions for each word of the input text, and the TTS engine uses a cost function to select which phonetic transcription is the more appropriate for searching the speech segments within the speaker database to be concatenated and synthesized.
    Type: Grant
    Filed: August 10, 2005
    Date of Patent: January 11, 2011
    Assignee: Nuance Communications, Inc.
    Inventors: Christel Amato, Hubert Crepy, Stephane Revelin, Claire Waast-Richard
  • Patent number: 7860709
    Abstract: The invention relates to a method for supporting an encoding of an audio signal, wherein at least one section of the audio signal is to be encoded with a coding model that allows the use of different coding frame lengths. In order to enable a simple selection of the respectively best suited coding frame length, it is proposed that at least one control parameter is determined based on signal characteristics of the audio signal. The control parameter is then used for limiting the options of possible coding frame lengths for the at least one section. The invention relates equally to a module 10,11 in which this method is implemented, to a device 1 and a system comprising such a module 10,11, and to a software program product including a software code for realizing the proposed method.
    Type: Grant
    Filed: May 13, 2005
    Date of Patent: December 28, 2010
    Assignee: Nokia Corporation
    Inventor: Jari Mäkinen
  • Patent number: 7856355
    Abstract: In one embodiment, distortion in a received speech signal is estimated using at least one model trained based on subjective quality assessment data. A speech quality assessment for the received speech signal is then determined based on the estimated distortion.
    Type: Grant
    Filed: July 5, 2005
    Date of Patent: December 21, 2010
    Assignee: Alcatel-Lucent USA Inc.
    Inventor: Doh-Suk Kim
  • Patent number: 7848923
    Abstract: Provided is a method for converting a dimension of a vector. The vector dimension conversion method for vector quantization includes the steps of: extracting a specific parameter having a pitch period from an input speech signal and then generating a vector of a dimension that varies according to the pitch period; dividing an entire frequency domain of the generated vector of the variable dimension into at least two frequency domains; and converting the vector of the variable dimension into vectors of mutually different fixed dimensions according to the divided frequency domains. Thereby, not only an error due to the vector dimension conversion is suppressed but codebook memory required for the vector quantization is effectively reduced.
    Type: Grant
    Filed: April 24, 2006
    Date of Patent: December 7, 2010
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Kyung Jin Byun, Ik Soo Eo, Hee Bum Jung
  • Patent number: 7848928
    Abstract: A method for implementing speech focus in a speech processing system can include the step of establishing a default focus receiver as a first entity to request speech focus of a speech processing system having multiple applications that share speech resources based upon speech focus. An event occurrence can be detected. An event handler of the default speech receiver can previously define behavior for the event occurrence and where default system behavior can be implemented within the speech processing system for the event occurrence. The default system behavior can be utilized when speech focus is not assigned during the event occurrence. Responsive to the event occurrence, at least one programmatic action can be performed in accordance with machine readable instructions of the event handler. The default system behavior is not implemented responsive to the event occurrence.
    Type: Grant
    Filed: August 10, 2005
    Date of Patent: December 7, 2010
    Assignee: Nuance Communications, Inc.
    Inventors: Lisa Abbott, Daniel E. Badt, John W. Eckhart, Harvey M. Ruback, Steven G. Woodward
  • Patent number: 7844464
    Abstract: Techniques are disclosed for facilitating the process of proofreading draft transcripts of spoken audio streams. In general, proofreading of a draft transcript is facilitated by playing back the corresponding spoken audio stream with an emphasis on those regions in the audio stream that are highly relevant or likely to have been transcribed incorrectly. Regions may be emphasized by, for example, playing them back more slowly than regions that are of low relevance and likely to have been transcribed correctly. Emphasizing those regions of the audio stream that are most important to transcribe correctly and those regions that are most likely to have been transcribed incorrectly increases the likelihood that the proofreader will accurately correct any errors in those regions, thereby improving the overall accuracy of the transcript.
    Type: Grant
    Filed: July 22, 2005
    Date of Patent: November 30, 2010
    Assignee: Multimodal Technologies, Inc.
    Inventors: Kjell Schubert, Juergen Fritsch, Michael Finke, Detlef Koll
  • Patent number: 7844457
    Abstract: Methods are disclosed for automatic accent labeling without manually labeled data. The methods are designed to exploit accent distribution between function and content words.
    Type: Grant
    Filed: February 20, 2007
    Date of Patent: November 30, 2010
    Assignee: Microsoft Corporation
    Inventors: YiNing Chen, Frank Kao-ping Soong, Min Chu
  • Patent number: 7831428
    Abstract: A speech segment is indexed by identifying at least two alternative word sequences for the speech segment. For each word in the alternative sequences, information is placed in an entry for the word in the index. Speech units are eliminated from entries in the index based on a comparison of a probability that the word appears in the speech segment and a threshold value.
    Type: Grant
    Filed: November 9, 2005
    Date of Patent: November 9, 2010
    Assignee: Microsoft Corporation
    Inventors: Ciprian I. Chelba, Alejandro Acero, Jorge F. Silva Sanchez
  • Patent number: 7818168
    Abstract: A method of measuring the degree of enhancement made to a voice signal by receiving the voice signal, identifying formant regions in the voice signal, computing stationarity for each identified formant region, enhancing the voice signal, identifying formant regions in the enhanced voice signal that correspond to those identified in the received voice signal, computing stationarity for each formant region identified in the enhanced voice signal, comparing corresponding stationarity results for the received and enhanced voice signals, and calculating at least one user-definable statistic of the comparison results as the degree of enhancement made to the received voice signal.
    Type: Grant
    Filed: December 1, 2006
    Date of Patent: October 19, 2010
    Assignee: The United States of America as represented by the Director, National Security Agency
    Inventor: Adolf Cusmariu
  • Patent number: 7813932
    Abstract: An apparatus and method encode audio data, and an apparatus and method decode encoded audio data. An audio data encoding apparatus includes: a scalable encoding unit dividing audio data into a plurality of layers, representing the audio data in predetermined numbers of bits in each of the plurality of layers, and encoding a lower layer prior to encoding an upper layer and an upper bit of each layer prior to encoding a lower bit of each layer; an SBR encoding unit generating spectral band replication (SBR) data that has information with respect to audio data in a frequency band of frequencies equal to or greater than a predetermined frequency among the audio data to be encoded, and encoding the SBR data; and a bitstream production unit generating a bitstream using the encoded SBR data and the encoded audio data corresponding to a predetermined bitrate.
    Type: Grant
    Filed: April 14, 2006
    Date of Patent: October 12, 2010
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Miyoung Kim, Sangwook Kim, Donyung Kim, Shihwa Lee, Junghoe Kim
  • Patent number: 7809568
    Abstract: An index for searching spoken documents having speech data and text meta-data is created by obtaining probabilities of occurrence of words and positional information of the words of the speech data and combining it with at least positional information of the words in the text meta-data. A single index can be created because the speech data and the text meta-data are treated the same and considered only different categories.
    Type: Grant
    Filed: November 8, 2005
    Date of Patent: October 5, 2010
    Assignee: Microsoft Corporation
    Inventors: Alejandro Acero, Ciprian I. Chelba, Jorge F. Silva Sanchez
  • Patent number: 7809554
    Abstract: An apparatus, method, and medium for detecting a voiced sound and an unvoiced sound. The apparatus includes a blocking unit for dividing an input signal into block units; a parameter calculator for calculating a first parameter to determine the voiced sound and a second parameter to determine the unvoiced sound by using a slope and spectral flatness measure (SFM) of a mel-scaled filter bank spectrum of an input signal existing in a block; and a determiner for determining a voiced sound zone and an unvoiced sound zone in the block by comparing the first and second parameters to predetermined threshold values.
    Type: Grant
    Filed: February 7, 2005
    Date of Patent: October 5, 2010
    Assignee: Samsung Electronics Co., Ltd.
    Inventor: Kwangcheol Oh
  • Patent number: 7809551
    Abstract: A system for retrieving documents related to a concept from a text corpus includes a set of stored semantic classes which are combinable to express the concept each class including a set of keywords, each set of keywords including at least one keyword. Syntactic rules are applied to identified text portions which include one or more of the keywords. A rule is satisfied when keywords from the first and second semantic classes are in any one of a plurality of syntactic relationships. A concept matching module identifies text portions within the text corpus which include one or more of the keywords, for applying the syntactic rules to the text portions, and for identifying those text portions which satisfy at least one of the rules. Documents to be retrieved may include at least one of the identified text portions.
    Type: Grant
    Filed: July 1, 2005
    Date of Patent: October 5, 2010
    Assignee: Xerox Corporation
    Inventors: Ágnes Sándor, Aaron Kaplan
  • Patent number: 7805301
    Abstract: A reliable full covariance matrix estimation algorithm for pattern unit's state output distribution in pattern recognition system is discussed. An intermediate hierarchical tree structure is built to relate models for product units. Full covariance matrices of pattern unit's state output distribution are estimated based on all the related nodes in the tree.
    Type: Grant
    Filed: July 1, 2005
    Date of Patent: September 28, 2010
    Assignee: Microsoft Corporation
    Inventors: Ye Tian, Frank Kao-Ping Soong, Jian-Lai Zhou
  • Patent number: 7805296
    Abstract: An audio data processing device including: a first processor; and a second processor which is connected to the first processor wherein the first processor includes: an audio data acquisition which acquires audio data of digital data; an omitting section which omits a bit corresponding to low volume which is hard to be heard by human ears from the audio data; and a transmitter which transmits the audio data in which the bit corresponding to the low volume is omitted by the omitting section from the first processor to the second processor; wherein the second processor includes: a receiver which receives the audio data transmitted from the first processor; and a reproduction data generator which generates audio reproduction data necessary to reproduce the audio data based on the received audio data.
    Type: Grant
    Filed: October 27, 2005
    Date of Patent: September 28, 2010
    Assignee: Seiko Epson Corporation
    Inventors: Tatsuya Ichikawa, Mahesh Inamdar, Anand Kumar, Aditya S. Chikodi, Kazuto Mogami