Patents Examined by Greg A Borsetti

Speech detection using order statistics

Patent number: 8380494

Abstract: The method and system disclosed herein reduce the total bandwidth requirement for communication in a voice over Internet protocol application. Sample [101] and convert [102] the analog input audio signal into digital signals and derive sampled frames [103]. Compute spacings of order statistics [104]. Measure the entropy for each of the sampled frames [105]. Set a threshold for entropy [106]. Mark the audio frames as active speech frames or inactive speech frames [107]. Mark an audio frame as an inactive speech frame when the entropy is greater than the threshold, and mark the audio frame as an active speech frame when the entropy is less than the threshold [107]. Transmit only the active speech frames [108].

Type: Grant

Filed: January 24, 2007

Date of Patent: February 19, 2013

Assignee: P.E.S. Institute of Technology

Inventors: Muralishankar Rangarao, Vijay Satyanarayana Rao, Venkatesha Prasad Rangarao, Shankar Hebbale Narasimhiah
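
The entropy-threshold step in the abstract of patent 8380494 can be illustrated with a small sketch. The abstract does not say which entropy estimator is built from the spacings of order statistics, so the Vasicek-style m-spacing estimator below is an assumption, as are the function names `spacing_entropy` and `mark_frames`, the default window `m`, and the threshold value used in the example.

```python
import math


def spacing_entropy(frame, m=None, eps=1e-12):
    """Estimate the entropy of one frame from spacings of its order statistics.

    Assumption: a Vasicek-style m-spacing estimator; the patent abstract
    does not name the exact estimator.
    """
    n = len(frame)
    if n < 2:
        raise ValueError("frame must contain at least two samples")
    if m is None:
        m = max(1, int(round(math.sqrt(n))))  # a common heuristic, assumed here
    xs = sorted(frame)  # order statistics of the frame
    total = 0.0
    for i in range(n):
        upper = xs[min(i + m, n - 1)]  # clamp indices at the frame edges
        lower = xs[max(i - m, 0)]
        total += math.log(max((n / (2.0 * m)) * (upper - lower), eps))
    return total / n


def mark_frames(frames, threshold):
    """Return True for active frames (entropy below the threshold)."""
    return [spacing_entropy(frame) < threshold for frame in frames]


# Two synthetic frames: a widely spread one and a tightly concentrated one.
wide = [i / 100 for i in range(-100, 100)]
narrow = [i / 10000 for i in range(-100, 100)]
active_flags = mark_frames([wide, narrow], threshold=-1.0)
```

Following the abstract, only frames flagged `True` would be transmitted. Choosing the threshold for real audio is outside the scope of this sketch.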
Semantically-driven extraction of relations between named entities

Patent number: 8370128

Abstract: A system and method of developing rules for text processing enable retrieval of instances of named entities in a predetermined semantic relation (such as the DATE and PLACE of an EVENT) by extracting patterns from text strings in which attested examples of named entities satisfying the semantic relation occur. The patterns are generalized to form rules which can be added to the existing rules of a syntactic parser and subsequently applied to text to find candidate instances of other named entities in the predetermined semantic relation.

Type: Grant

Filed: September 30, 2008

Date of Patent: February 5, 2013

Assignee: Xerox Corporation

Inventors: Caroline Brun, Caroline Hagege
Method for reduction of aliasing introduced by spectral envelope adjustment in real-valued filterbanks

Patent number: 8346566

Abstract: The present invention proposes a new method for improving the performance of a real-valued filterbank based spectral envelope adjuster. By adaptively locking the gain values for adjacent channels dependent on the sign of the channels, as defined in the application, reduced aliasing is achieved. Furthermore, the grouping of the channels during gain calculation gives an improved energy estimate of the real-valued subband signals in the filterbank.

Type: Grant

Filed: August 31, 2010

Date of Patent: January 1, 2013

Assignee: Dolby International AB

Inventors: Kristofer Kjoerling, Lars Villemoes
Speech recognition using channel verification

Patent number: 8346554

Abstract: A method for automatic speech recognition includes determining for an input signal a plurality of scores representative of certainties that the input signal is associated with corresponding states of a speech recognition model, using the speech recognition model and the determined scores to compute an average signal, computing a difference value representative of a difference between the input signal and the average signal, and processing the input signal in accordance with the difference value.

Type: Grant

Filed: September 15, 2010

Date of Patent: January 1, 2013

Assignee: Nuance Communications, Inc.

Inventor: Igor Zlokarnik
Viterbi decoder and speech recognition method using same using non-linear filter for observation probabilities

Patent number: 8332222

Abstract: A Viterbi decoder includes: an observation vector sequence generator for generating an observation vector sequence by converting an input speech to a sequence of observation vectors; a local optimal state calculator for obtaining a partial state sequence having a maximum similarity up to a current observation vector as an optimal state; an observation probability calculator for obtaining, as a current observation probability, a probability for observing the current observation vector in the optimal state; a buffer for storing therein a specific number of previous observation probabilities; a non-linear filter for calculating a filtered probability by using the previous observation probabilities stored in the buffer and the current observation probability; and a maximum likelihood calculator for calculating a partial maximum likelihood by using the filtered probability.

Type: Grant

Filed: July 21, 2009

Date of Patent: December 11, 2012

Assignee: Electronics and Telecommunications Research Institute

Inventors: Hoon Chung, Jeon Gue Park, Yunkeun Lee, Ho-Young Jung, Hyung-Bae Jeon, Jeom Ja Kang, Sung Joo Lee, Euisok Chung, Ji Hyun Wang, Byung Ok Kang, Ki-young Park, Jong Jin Kim
Data extraction system, terminal, server, programs, and media for extracting data via a morphological analysis

Patent number: 8321198

Abstract: This invention provides a terminal searching for web pages on the web and extracting the prescribed data from the web pages and a server verifying and accumulating the extracted data. The prescribed data can be extracted from the web pages on the web in a manner that the process relating to the data extraction is distributed between the terminal and the server. Therefore, necessary processes up to the data extraction are distributed, and the burden placed on each apparatus can be lessened. Further, new data not formerly found in the web pages can be found and extracted from the web pages that have been updated or newly made.

Type: Grant

Filed: October 27, 2005

Date of Patent: November 27, 2012

Assignee: Kabushiki Kaisha Square Enix

Inventor: Kengo Nakajima
Assisted multi-modal dialogue

Patent number: 8311835

Abstract: Controls are provided for a web server to generate client side markups that include recognition and/or audible prompting. The controls are organized in collections to obtain information pertaining to different topics. Each collection of controls creates a separate dialog. In this manner, the collections can be selectively specified to execute the corresponding dialog.

Type: Grant

Filed: August 29, 2003

Date of Patent: November 13, 2012

Assignee: Microsoft Corporation

Inventor: Renaud J. Lecoeuche
Voice emphasizing device and voice emphasizing method

Patent number: 8311831

Abstract: A voice emphasizing device emphasizes in a speech a “strained rough voice” at a position where a speaker or user of the speech intends to generate emphasis or musical expression. Thereby, the voice emphasizing device can provide the position with emphasis of anger, excitement, tension, or an animated way of speaking, or musical expression of Enka (Japanese ballad), blues, rock, or the like. As a result, rich vocal expression can be achieved. The voice emphasizing device includes: an emphasis utterance section detection unit (12) detecting, from an input speech waveform, an emphasis section that is a time duration having a waveform intended by the speaker or user to be converted; and a voice emphasizing unit (13) increasing fluctuation of an amplitude envelope of the waveform in the detected emphasis section.

Type: Grant

Filed: September 29, 2008

Date of Patent: November 13, 2012

Assignee: Panasonic Corporation

Inventors: Yumiko Kato, Takahiro Kamai, Masakatsu Hoshimi
Keyword spotting using a phoneme-sequence index

Patent number: 8311828

Abstract: In some aspects, a wordspotter is used to locate occurrences in an audio corpus of each of a set of predetermined subword units, which may be phoneme sequences. To locate a query (e.g., a keyword or phrase) in the audio corpus, constituent subword units in the query are identified and then locations of those subwords are determined based on the locations of those subword units determined earlier by the wordspotter, for example, using a pre-built inverted index that maps subword units to their locations.

Type: Grant

Filed: August 27, 2008

Date of Patent: November 13, 2012

Assignee: Nexidia Inc.

Inventors: Jon A. Arrowood, Robert W. Morris, Mark Finlay, Scott A. Judy
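
The inverted-index lookup mentioned in the abstract of patent 8311828 can be sketched roughly as follows. This is a simplified illustration, not the patented method: it assumes the corpus is already available as a phoneme list, that subword units are fixed-length phoneme n-grams, and that locations are integer phoneme offsets rather than scored time stamps from a wordspotter. The names `build_index` and `locate` are hypothetical.

```python
from collections import defaultdict


def build_index(phonemes, n=3):
    """Map every phoneme n-gram (the assumed subword unit) to its positions."""
    index = defaultdict(list)
    for pos in range(len(phonemes) - n + 1):
        index[tuple(phonemes[pos:pos + n])].append(pos)
    return index


def locate(query, index, n=3):
    """Return corpus offsets where all n-grams of the query line up."""
    if len(query) < n:
        raise ValueError("query must contain at least n phonemes")
    candidates = None
    for k in range(len(query) - n + 1):
        unit = tuple(query[k:k + n])
        # A unit found at position pos implies a query start at pos - k.
        starts = {pos - k for pos in index.get(unit, ())}
        candidates = starts if candidates is None else candidates & starts
        if not candidates:
            return []
    return sorted(candidates)


corpus = "k ae t s ae t k ae t".split()
index = build_index(corpus)
hits = locate("k ae t".split(), index)
```

The index is built once over the corpus, so each query only touches the position lists of its own subword units instead of rescanning the audio.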
Voice activity detection system and method

Patent number: 8311813

Abstract: Discrimination between at least two classes of events in an input signal is carried out in the following way. A set of frames containing an input signal is received, and at least two different feature vectors are determined for each of said frames. Said at least two different feature vectors are classified using respective sets of preclassifiers trained for said at least two classes of events. Values for at least one weighting factor are determined based on outputs of said preclassifiers for each of said frames. A combined feature vector is calculated for each of said frames by applying said at least one weighting factor to said at least two different feature vectors. Said combined feature vector is classified using a set of classifiers trained for said at least two classes of events.

Type: Grant

Filed: October 26, 2007

Date of Patent: November 13, 2012

Assignee: International Business Machines Corporation

Inventor: Zica Valsan
Transform coder and transform coding method

Patent number: 8311818

Abstract: A transform coding apparatus includes an input scale factor calculating section that calculates an input scale factor having a predetermined number of scale factors associated with an input spectrum as an element, and a codebook that stores a plurality of scale factor candidates having a predetermined number of elements and outputs one scale factor candidate. The transform coding apparatus also includes an error calculating section that calculates an error on a per element basis, a weighted error calculating section that determines a weight on a per element basis and calculates a sum of products of the error and the weight to calculate a weighted error, and a searching section that searches for a scale factor candidate that minimizes the weighted error in the codebook.

Type: Grant

Filed: February 7, 2012

Date of Patent: November 13, 2012

Assignee: Panasonic Corporation

Inventors: Masahiro Oshikiri, Tomofumi Yamanashi
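
The weighted codebook search in the abstract of patent 8311818 can be sketched as below. The abstract does not define the per-element error or how the weights are determined, so the squared difference and the example rule that penalizes candidates exceeding the input scale factor are assumptions. The names `weighted_error`, `search_codebook`, and `overshoot_weight` are hypothetical.

```python
def weighted_error(target, candidate, weight_fn):
    """Sum of per-element error times per-element weight."""
    total = 0.0
    for t, c in zip(target, candidate):
        error = (t - c) ** 2  # assumed per-element error measure
        total += weight_fn(t, c) * error
    return total


def search_codebook(target, codebook, weight_fn):
    """Return the index of the candidate with the smallest weighted error."""
    return min(
        range(len(codebook)),
        key=lambda i: weighted_error(target, codebook[i], weight_fn),
    )


def overshoot_weight(t, c):
    """Hypothetical rule: penalize candidates larger than the input more."""
    return 2.0 if c > t else 1.0


input_scale_factors = [1.0, 2.0]
codebook = [[1.5, 2.0], [0.5, 2.0], [3.0, 3.0]]
best = search_codebook(input_scale_factors, codebook, overshoot_weight)
```

With uniform weights the first two candidates tie. The asymmetric weight breaks the tie in favor of the candidate that undershoots, which shows how a per-element weight can change the search result.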
Data detection in a sequence of tokens using decision tree reductions

Patent number: 8311806

Abstract: An apparatus for processing a sequence of tokens to detect predetermined data, wherein each said token has a token type, and the predetermined data has a structure that comprises a predetermined sequence of token types, including at least one optional token type. The apparatus comprises a processor arranged to: provide a tree for detecting the predetermined data, the tree comprising a plurality of states, each said state being linked with at least one other state by a respective condition, the arrangement of linked states forming a plurality of paths; and compare the token types of the sequence of tokens to respective conditions in the tree to match the sequence of tokens to one or more paths in the tree, wherein the predetermined data can be detected without using an epsilon reduction to take account of said at least one optional token type.

Type: Grant

Filed: September 29, 2008

Date of Patent: November 13, 2012

Assignee: Apple Inc.

Inventors: Olivier Bonnet, Frederic de Jaeger, Romain Goyet
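
One way to picture matching optional token types without epsilon transitions, as described in the abstract of patent 8311806, is to expand every optional choice when the tree is built, so that matching only follows explicit token-type conditions. This is an illustrative reading of the abstract, not the claimed reduction procedure. The pattern format and the names `expand`, `build_tree`, and `match_at` are hypothetical.

```python
from itertools import product

END = None  # marker key for "a complete pattern ends at this state"


def expand(pattern):
    """List every concrete token-type sequence for (token_type, optional) pairs."""
    choices = [((t,), ()) if optional else ((t,),) for t, optional in pattern]
    return [sum(combo, ()) for combo in product(*choices)]


def build_tree(pattern):
    """Build a nested-dict tree whose edges are token-type conditions."""
    root = {}
    for path in expand(pattern):
        node = root
        for token_type in path:
            node = node.setdefault(token_type, {})
        node[END] = True
    return root


def match_at(tree, token_types, start=0):
    """Return the length of the longest match at start, or None if no match."""
    node, best, i = tree, None, start
    while True:
        if END in node:
            best = i - start
        if i >= len(token_types) or token_types[i] not in node:
            return best
        node = node[token_types[i]]
        i += 1


address = [("NUMBER", False), ("STREET", False), ("UNIT", True), ("CITY", False)]
tree = build_tree(address)
length = match_at(tree, ["NUMBER", "STREET", "CITY"])
```

Expanding optional elements up front trades a larger tree for a matcher that never needs to skip a state, which is the trade-off this sketch is meant to show.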
Technology for supporting modification of messages displayed by program

Patent number: 8311801

Abstract: A method, system and computer program product for improving the efficiency of changing or modifying a message displayed by a program. A memory unit stores a message read and displayed by the execution of a program, associating it with a language in which the message is written. An execution unit reads from the memory unit and displays the message corresponding to a language set by a user by executing the program. An editing unit edits the message stored in the memory unit and stores the edited message into the memory unit, associating it with a different language from that of the unedited message. A setting unit changes the language of the message displayed by the execution unit, where the execution unit reads from the memory unit and displays the message corresponding to the language changed by the setting unit thereby displaying the edited message instead of the unedited message.

Type: Grant

Filed: July 24, 2008

Date of Patent: November 13, 2012

Assignee: International Business Machines Corporation

Inventors: Nozomu Aoyama, Shinkichi Hamada, Shinsaku Kudomi, Yuko Ito
Emotion recognition apparatus

Patent number: 8204747

Abstract: An emotion recognition apparatus performs accurate and stable speech-based emotion recognition, irrespective of individual, regional, and language differences of prosodic information.

Type: Grant

Filed: May 21, 2007

Date of Patent: June 19, 2012

Assignee: Panasonic Corporation

Inventors: Yumiko Kato, Takahiro Kamai, Yoshihisa Nakatoh, Yoshifumi Hirose
Multi language exchange system

Patent number: 8180625

Abstract: The invention discloses a multi language exchange system which includes a communication device having an input screen on which a message in a first language is to be translated into a second language. The input screen displays a programmable grid or list having at least one sentence or phrase formed by the grid or list in the first language. Each grid element of the at least one sentence or phrase contains a word or words in the at least one sentence or phrase. Each grid element of the at least one sentence or phrase has a sequence based on the order in which the word or words of the respective grid element would appear in the translation of the at least one sentence or phrase in the second language. The user follows the sequence to allow the at least one sentence or phrase to be translated in the correct order.

Type: Grant

Filed: November 14, 2006

Date of Patent: May 15, 2012

Inventor: Fumitaka Noda
Sequential speech recognition with two unequal ASR systems

Patent number: 8180641

Abstract: Sequential speech recognition using two unequal automatic speech recognition (ASR) systems may be provided. The system may provide two sets of vocabulary data. A determination may be made as to whether entries in one set of vocabulary data are likely to be confused with entries in the other set of vocabulary data. If confusion is likely, a decoy entry from one set of the vocabulary data may be placed in the other set of vocabulary data so that more efficient and accurate speech recognition processing may take place.

Type: Grant

Filed: September 29, 2008

Date of Patent: May 15, 2012

Assignee: Microsoft Corporation

Inventors: Michael Levit, Shuangyu Chang, Bruce Melvin Buntschuh
Method and system for accent correction

Patent number: 8175882

Abstract: A method for task execution improvement includes: generating a baseline model for executing a task; recording a user executing a task; comparing the baseline model to the user's execution of the task; and providing feedback to the user based on the differences in the user's execution and the baseline model.

Type: Grant

Filed: January 25, 2008

Date of Patent: May 8, 2012

Assignee: International Business Machines Corporation

Inventors: Sara H. Basson, Dimitiri Kanevsky, Edward E. Kelley, Bhuvana Ramabhadran
Multichannel audio coding

Patent number: 8170882

Abstract: Multiple channels of audio are combined either to a monophonic composite signal or to multiple channels of audio along with related auxiliary information from which multiple channels of audio are reconstructed, including improved downmixing of multiple audio channels to a monophonic audio signal or to multiple audio channels and improved decorrelation of multiple audio channels derived from a monophonic audio channel or from multiple audio channels. Aspects of the disclosed invention are usable in audio encoders, decoders, encode/decode systems, downmixers, upmixers, and decorrelators.

Type: Grant

Filed: July 31, 2007

Date of Patent: May 1, 2012

Assignee: Dolby Laboratories Licensing Corporation

Inventor: Mark Franklin Davis
Geodesic search and retrieval system and method of semi-structured databases

Patent number: 8155949

Abstract: A knowledge-based decision support system that allows for communication and learning to occur using natural language is presented. The system has a capability to automatically extract features from the natural language using symmetric reductions and random search. The iterative generalization of the rule base and checking of the resultant base against a case base from which the generalizations are induced is also provided. The decision support system can be used to search semi-structured databases and automatically learns new knowledge and search control knowledge where it is most needed based on the pattern of previous rule firings.

Type: Grant

Filed: October 1, 2008

Date of Patent: April 10, 2012

Assignee: The United States of America as represented by the Secretary of the Navy

Inventor: Stuart H Rubin
Subtitle generation and retrieval combining document processing with voice processing

Patent number: 8155969

Abstract: An apparatus for retrieving a character string includes: storage for storing first text data obtained by recognizing a voice in a presentation, second text data extracted from document data used in the presentation, and associated information of the first text data and the second text data. The apparatus also includes a retrieval unit for retrieving, by use of the associated information, the character string from text data composed from the first text data and the second text data.

Type: Grant

Filed: August 11, 2009

Date of Patent: April 10, 2012

Assignee: International Business Machines Corporation

Inventors: Kohtaroh Miyamoto, Noriko Nagishi, Kenichi Arakawa
