Patents Examined by Justin W. Rider

Disambiguation systems and methods for use in generating grammars

Patent number: 8010343

Abstract: A method and system for addressing disambiguation issues in interactive applications by creating a disambiguation system for generating complex grammars that includes homonym detection and grouping, and provides optimization feedback that eliminates time-consuming and repetitive iterative steps during the grammar generation portion of the interactive application configuration.

Type: Grant

Filed: December 15, 2005

Date of Patent: August 30, 2011

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, Brent D. Metz
Surgical system controlling apparatus and surgical system controlling method

Patent number: 8010368

Abstract: In this invention, a voice recognition engine 110 outputs to a controlling section 103 a matching state of a voice input signal as an error code. Then, the controlling section 103 determines the matching state based on the error code in the error determination section 105 and outputs to a voice synthesizing engine 113 guidance data according to the matching state based on a timing control by a guidance timing controlling section 107. According to such a configuration, this invention improves operatability by voice operation, while reducing a risk of erroneous recognition by maintaining a predetermined matching rate.

Type: Grant

Filed: March 19, 2007

Date of Patent: August 30, 2011

Assignee: Olympus Medical Systems Corp.

Inventor: Masahide Yamaki
Methods and systems for sample rate conversion

Patent number: 8005667

Abstract: Methods and systems for sample rate conversion convert a sampled signal to a higher data rate signal. Conversion pulses are received, having a conversion rate that is higher than the sample rate of the sampled signal. Sample points are then reconstructed from the sampled signal, in real time, on either side of a conversion pulse. An interpolation is performed between the reconstructed sample points, at the time of the conversion pulse. The interpolation results are outputted in real time. The process is repeated for additional conversion pulses. The outputted interpolated amplitudes form the higher data rate signal having a data rate equal to the conversion rate. Sample rate conversion is thus performed in real time according to the higher data rate clock, rather than with fixed ratios. As a result, when the higher data rate clock is affected by, for example, jitter or other frequency variations, the higher data rate samples immediately track the lower data rate samples.

Type: Grant

Filed: August 4, 2008

Date of Patent: August 23, 2011

Assignee: Broadcom Corporation

Inventor: Hoang Nhu
Inferring switching conditions for switching between modalities in a speech application environment extended for interactive text exchanges

Patent number: 8000969

Abstract: The disclosed solution includes a method for dynamically switching modalities based upon inferred conditions in a dialogue session involving a speech application. The method establishes a dialogue session between a user and the speech application. During the dialogue session, the user interacts using an original modality and a second modality. The speech application interacts using a speech modality only. A set of conditions indicative of interaction problems using the original modality can be inferred. Responsive to the inferring step, the original modality can be changed to the second modality. A modality transition to the second modality can be transparent the speech application and can occur without interrupting the dialogue session. The original modality and the second modality can be different modalities; one including a text exchange modality and another including a speech modality.

Type: Grant

Filed: December 19, 2006

Date of Patent: August 16, 2011

Assignee: Nuance Communications, Inc.

Inventors: William V. Da Palma, Baiju D. Mandalia, Victor S. Moore, Wendi L. Nusbickel
Method and apparatus for estimating degree of similarity between voices

Patent number: 7996213

Abstract: A similarity degree estimation method is performed by two processes. In a first process, an inter-band correlation matrix is created from spectral data of an input voice such that the spectral data are divided into a plurality of discrete bands which are separated from each other with spaces therebetween along a frequency axis, a plurality of envelope components of the spectral data are obtained from the plurality of the discrete bands, and elements of the inter-band correlation matrix are correlation values between the respective envelope components of the input voice. In a second process, a degree of similarity is calculated between a pair of input voices to be compared with each other by using respective inter-band correlation matrices obtained for the pair of the input voices through the inter-band correlation matrix creation process.

Type: Grant

Filed: March 20, 2007

Date of Patent: August 9, 2011

Assignee: Yamaha Corporation

Inventors: Mikio Tohyama, Michiko Kazama, Satoru Goto, Takehiko Kawahara, Yasuo Yoshioka
Noise-canceling device for voice communication terminal using configurable multiple digital filters

Patent number: 7983908

Abstract: A noise-canceling device of a voice communication terminal that removes noise elements included in received voice signals. The device comprises: a digital filter array that exhibits filter qualities in response to a coefficient setting signal showing each supplied arrays of filter coefficients, and includes a first-stage filter that receives the received voice signals as well as multiple later-stage filters connected thereto in a straight line; a filter qualities designator that generates input designation that designates each qualities of the multiple digital filters forming the digital filters array; and a filter coefficient setter that retains multiple arrays of filter coefficients, extracts a filter coefficient array corresponding to the designation input from among the multiple filter coefficient arrays, and supplies to each multiple digital filters.

Type: Grant

Filed: March 15, 2007

Date of Patent: July 19, 2011

Assignee: Oki Electric Industry Co., Ltd.

Inventors: Hiroshi Kuboki, Kenichi Kurihara
Method and apparatus for preparing a document to be read by text-to-speech reader

Patent number: 7953601

Abstract: There is disclosed a method and system for preparing a document to be read by a text-to-speech reader. The method can include identifying two or more voice types available to the text-to-speech reader, identifying the text elements within the document, grouping related text elements together, and classifying the text elements according to voice types available to the text-to-speech reader. The method of grouping the related text elements together can include syntactic and intelligent clustering. The classification of text elements can include performing latent semantic analysis on the text elements and characteristics of the available voice types.

Type: Grant

Filed: December 19, 2008

Date of Patent: May 31, 2011

Assignee: Nuance Communications, Inc.

Inventor: John B. Pickering
Active labeling for spoken language understanding

Patent number: 7949525

Abstract: A spoken language understanding method and system are provided. The method includes classifying a set of labeled candidate utterances based on a previously trained classifier, generating classification types for each candidate utterance, receiving confidence scores for the classification types from the trained classifier, sorting the classified utterances based on an analysis of the confidence score of each candidate utterance compared to a respective label of the candidate utterance, and rechecking candidate utterances according to the analysis. The system includes modules configured to control a processor in the system to perform the steps of the method.

Type: Grant

Filed: June 16, 2009

Date of Patent: May 24, 2011

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Dilek Z. Hakkani-Tur, Mazin G. Rahim, Gokhan Tur
Apparatus, method, and computer program product for processing voice in speech

Patent number: 7949523

Abstract: A speech processing apparatus includes a rule storing unit that stores therein a rule that correlates one another causes of errors in speech recognition, responding methods each of which is used when an error has occurred during the speech recognition, and responding users each of whom is one of a plurality of users and serving as a target of a response; a detecting unit that detects a cause of an error that has occurred during the recognition of the speech; a method selecting unit that selects one of the responding methods that is correlated with the detected cause of the error from the rule storing unit; a user selecting unit that selects one of the responding users that is correlated with the detected cause of the error from the rule storing unit; and an executing unit that executes the response by the selected responding method to the selected responding user.

Type: Grant

Filed: March 14, 2007

Date of Patent: May 24, 2011

Assignee: Kabushiki Kaisha Toshiba

Inventor: Kazunori Imoto
User authentication system, fraudulent user determination method and computer program product

Patent number: 7949535

Abstract: A system and method is provided for easily detecting a fraudulent user who attempts to obtain authentication using voice reproduced by a reproducer. A personal computer is provided with an audio data obtaining portion for picking up ambient sound around a person as a target of user authentication using voice authentication technology during a period before the person utters, and a fraud determination portion for calculating an intensity level showing intensity of the picked-up ambient sound per predetermined time for each of sections into which the period is divided and for determining that the person is a fraudulent user who attempts to obtain authentication using reproduced voice, when, of two of the calculated intensity levels, the intensity level of the later section is larger than a sum of the intensity level of the earlier section and a predetermined value.

Type: Grant

Filed: July 26, 2006

Date of Patent: May 24, 2011

Assignee: Fujitsu Limited

Inventors: Toshiyuki Ohta, Maiko Hirahara, Kazunari Hirakawa
Cue-based audio coding/decoding

Patent number: 7941320

Abstract: Generic and specific C-to-E binaural cue coding (BCC) schemes are described, including those in which one or more of the input channels are transmitted as unmodified channels that are not downmixed at the BCC encoder and not upmixed at the BCC decoder. The specific BCC schemes described include 5-to-2, 6-to-5, 7-to-5, 6.1-to-5.1, 7.1-to-5.1, and 6.2-to-5.1, where “0.1” indicates a single low-frequency effects (LFE) channel and “0.2” indicates two LFE channels.

Type: Grant

Filed: August 27, 2009

Date of Patent: May 10, 2011

Assignee: Agere Systems, Inc.

Inventors: Frank Baumgarte, Jiashu Chen, Christof Faller
Global boundary-centric feature extraction and associated discontinuity metrics

Patent number: 7930172

Abstract: Portions from time-domain speech segments are extracted. Feature vectors that represent the portions in a vector space are created. The feature vectors incorporate phase information of the portions. A distance between the feature vectors in the vector space is determined. In one aspect, the feature vectors are created by constructing a matrix W from the portions and decomposing the matrix W. In one aspect, decomposing the matrix W comprises extracting global boundary-centric features from the portions. In one aspect, the portions include at least one pitch period. In another aspect, the portions include centered pitch periods.

Type: Grant

Filed: December 8, 2009

Date of Patent: April 19, 2011

Assignee: Apple Inc.

Inventor: Jerome R. Bellegarda
Computer-implemented tool for creation of speech application code and associated functional specification

Patent number: 7930182

Abstract: A machine-implemented method of building a speech application includes generating a graphical user interface to enable a user to create and edit a speech application, and receiving user inputs directed to the graphical user interface, where the user inputs specify a set of dialog flows representing the speech application. The method further includes, based on the user inputs, automatically generating executable code representing the speech application and a functional specification document describing the speech application.

Type: Grant

Filed: March 14, 2006

Date of Patent: April 19, 2011

Assignee: Nuance Communications, Inc.

Inventors: Julian Sinai, James E. White, Richard B. Unger, R. Douglas Sharp, James M. Riseman, Eylon Stroh
Systems and methods for voice control of a medical device

Patent number: 7921017

Abstract: The invention is generally directed to systems and methods for medical care, and more particularly to systems and methods for voice control of a medical device. A first embodiment includes a voice controlled surgical system, such as a phacoemulsification system, a microphone coupled to the surgical system, and a voice controlled computer interface coupled with the surgical system. The voice controlled interface is configured to receive a request to invoke a voice command via the microphone, to listen for a voice command upon receipt of a valid request to invoke a voice command, and to forward a valid voice command upon receipt of the valid voice command to the surgical system for execution.

Type: Grant

Filed: July 20, 2006

Date of Patent: April 5, 2011

Assignee: Abbott Medical Optics Inc

Inventors: Michael J. Claus, James W. Staggs
Method and device for updating status of synthesis filters

Patent number: 7921009

Abstract: A method and device for updating statuses of synthesis filters are provided. The method includes: exciting a synthesis filter corresponding to a first encoding rate by using an excitation signal of the first encoding rate, outputting reconstructed signal information, and updating status information of the synthesis filter and a synthesis filter corresponding to a second encoding rate. In the present disclosure, the status of the synthesis filter corresponding to the current rate and the statuses of the synthesis filters at other rates are updated. Thus, synchronization between the statuses of the synthesis filters corresponding to different rates at the encoding terminal may be realized, thereby facilitating the consistency of the reconstructed signals of the encoding and decoding terminals when the encoding rate is switched, and improving the quality of the reconstructed signal of the decoding terminal.

Type: Grant

Filed: September 16, 2010

Date of Patent: April 5, 2011

Assignee: Huawei Technologies Co., Ltd.

Inventor: Jinliang Dai
Language converter with enhanced search capability

Patent number: 7917351

Abstract: A weighted search program is disclosed. The weighted search program may be integrated into a translation program, or the weighted search program may be used independently with an available search engine. When integrated with the translation program, setting and weighting may be combined in a single search. In one embodiment, the weighting would be used in conjunction with a Pin Yin translation program so that a user could set some terms, and allocate a search weight to the remaining terms. The invention may be applied independently in Internet searching so that a user can apply weights to multiple elements of a search term.

Type: Grant

Filed: February 20, 2009

Date of Patent: March 29, 2011

Assignee: International Business Machines Corporation

Inventors: Yen-Fu Chen, John W. Dunsmoir, Hari Shankar
Talking book

Patent number: 7912723

Abstract: A combination of a book and a voice phonation apparatus comprising a book having a plurality of pages, at least one of which is carrying a plurality of printed words and a plurality of specific codes associated with said words, said printed words being divided into a plurality of specific segments; and a voice phonation apparatus forming an integral part of said book and comprising (i) a housing with a plurality of switches and a plurality of keys, (ii) a voice output unit, and (iii) a control unit having a memory for storing data representing spoken words and connected to said switches, said keys, and said voice output unit.

Type: Grant

Filed: November 21, 2006

Date of Patent: March 22, 2011

Inventor: Ping Qu
Methods, storage medium and apparatus for encoding and decoding sound signals from multiple channels

Patent number: 7912731

Abstract: A method for encoding sound signals on multiple channels includes extracting an arbitrary number of sine waves from each of the sound signals. The sine waves include at least a first sine wave, extracted from a first one of the channels and having first-channel information, and a second sine wave, extracted from a second one of the channels and having second-channel information. Using the first-channel information and one of the second-channel information and sine wave information corresponding to a predetermined sine wave, one of the second-channel information and the sine wave information corresponding to the predetermined sine wave is selected as a to-be-correlated object for encoding in a correlation with the first-channel information.

Type: Grant

Filed: May 12, 2003

Date of Patent: March 22, 2011

Assignee: Sony Corporation

Inventors: Minoru Tsuji, Shiro Suzuki, Keisuke Toyama
Method and system for text retrieval for computer-assisted item creation

Patent number: 7912722

Abstract: A tool, method, and system for use in the development of sentence-based test items are disclosed. The tool may include a user interface that may include a database selection field, a sentence pattern entry field, an option pane, and an output pane. The tool may search a database for one or more sentences and may generate one or more responses to the one or more sentences. The one or more sentences and one or more responses may be used to produce the sentence-based test items. The tool may allow test items to be developed more quickly and easily than manual test item authoring. Accordingly, test item development costs may be lowered and test security may be enhanced.

Type: Grant

Filed: January 10, 2006

Date of Patent: March 22, 2011

Assignee: Educational Testing Service

Inventor: Derrick Higgins
Audio comparison using phoneme matching

Patent number: 7912724

Abstract: Audio comparison using phoneme matching is described, including evaluating audio data associated with a file, identifying a sequence of phonemes in the audio data, associating the file with a product category based on a match indicating the sequence of phonemes is substantially similar to another sequence of phonemes, the file being stored, and accessing the file when a request associated with the product category is detected.

Type: Grant

Filed: January 18, 2007

Date of Patent: March 22, 2011

Assignee: Adobe Systems Incorporated

Inventor: James Moorer

1 2 3 4 5 … next