Patents by Inventor Hsiao-Wuen Hon

Hsiao-Wuen Hon has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Dynamic pronunciation support for Japanese and Chinese speech recognition training

Publication number: 20040236581

Abstract: A speech recognition training system for Kanji-based languages is provided. The system loads a pronunciation aid for each and every ideograph in the training speech, but does not in fact display an ideograph until the training system recognizes a pronunciation difficulty. Once a pronunciation difficulty is identified, the associated pronunciation aid (rubi) for the troubling ideograph is displayed.

Type: Application

Filed: May 1, 2003

Publication date: November 25, 2004

Applicant: Microsoft Corporation

Inventors: Yun-Cheng Ju, Hsiao-Wuen Hon, Kazuhiro Senju
Speech recognition method and apparatus utilizing segment models

Patent number: 6782362

Abstract: A method and apparatus determine the likelihood of a sequence of words based in part on a segment model. The segment model includes trajectory expressions formed as the product of a polynomial matrix and a generation matrix. The likelihood of the sequence of words is based in part on a segment probability derived by subtracting the trajectory expressions from a feature vector matrix that contains a sequence of feature vectors for a segment of speech. Aspects of the method and apparatus also include training the segment model using such a segment probability.

Type: Grant

Filed: April 27, 2000

Date of Patent: August 24, 2004

Assignee: Microsoft Corporation

Inventors: Hsiao-Wuen Hon, Kuansan Wang
Web server controls for web enabled recognition and/or audible prompting

Publication number: 20040113908

Abstract: Web server controls are provided for generating client side markups with recognition and/or audible prompting. Three approaches are disclosed for implementation of the controls.

Type: Application

Filed: April 28, 2003

Publication date: June 17, 2004

Inventors: Francisco M. Galanes, Hsiao-Wuen Hon, James D. Jacoby, Renaud J. Lecoeuche, Stephen F. Potter, Susan M. Warren
Application abstraction with dialog purpose

Publication number: 20040073431

Abstract: Controls are provided for a web server to generate client side markups that include recognition and/or audible prompting. The controls comprise elements of a dialog such as a question, answer, confirmation, command or statement. A module forms a dialog by making use of the information carried in the controls.

Type: Application

Filed: April 28, 2003

Publication date: April 15, 2004

Inventors: Francisco M. Galanes, Hsiao-Wuen Hon, James D. Jacoby, Renaud J. Lecoeuche, Stephen F. Potter
Temporal pattern recognition method and apparatus utilizing segment and frame-based models

Patent number: 6662158

Abstract: A method and apparatus is provided for identifying patterns from a series of feature vectors representing a time-varying signal. The method and apparatus use both a frame-based model and a segment model in a unified framework. The frame-based model determines the probability of an individual feature vector given a frame state. The segment model determines the probability of sub-sequences of feature vectors given a single segment state. The probabilities from the frame-based model and the segment model are then combined to form a single path score that is indicative of the probability of a sequence of patterns. Another aspect of the invention is the use of a frame-based model and a segment model to segment feature vectors during model training. Under this aspect of the invention, the frame-based model and the segment model are used together to identify probabilities associated with different segmentations.

Type: Grant

Filed: April 27, 2000

Date of Patent: December 9, 2003

Assignee: Microsoft Corporation

Inventors: Hsiao-Wuen Hon, Kuansan Wang
Multi-modal entry of ideogrammatic languages

Publication number: 20030212563

Abstract: A method for inputting ideograms into a computer system includes receiving phonetic information related to a desired ideogram to be entered and forming a candidate list of possible ideograms as a function of the phonetic information received. Stroke information, comprising one or more strokes in the desired ideogram, is received in order to obtain the desired ideogram from the candidate list.

Type: Application

Filed: May 8, 2002

Publication date: November 13, 2003

Inventors: Yun-Cheng Ju, Hsiao-Wuen Hon
Web server controls for web enabled recognition and/or audible prompting

Publication number: 20030200080

Abstract: Web server controls are provided for generating client side markups with recognition and/or audible prompting. Three approaches are disclosed for implementation of the controls.

Type: Application

Filed: October 21, 2001

Publication date: October 23, 2003

Inventors: Francisco M. Galanes, Hsiao-Wuen Hon, James D. Jacoby, Renaud J. Lecoueche, Stephen F. Potter, Susan M. Warren
Speech recognition method and apparatus utilizing multi-unit models

Patent number: 6629073

Abstract: A speech recognition method and system utilize an acoustic model that is capable of providing probabilities for both a large acoustic unit and an acoustic sub-unit. Each of these probabilities describes the likelihood of a set of feature vectors from a series of feature vectors representing a speech signal. The large acoustic unit is formed from a plurality of acoustic sub-units. At least one sub-unit probability and at least on large unit probability from the acoustic model are used by a decoder to generate a score for a sequence of hypothesized words. When combined, the acoustic sub-units associated with all of the sub-unit probabilities used to determine the score span fewer than all of the feature vectors in the series of feature vectors. An overlapping decoding technique is also provided.

Type: Grant

Filed: April 27, 2000

Date of Patent: September 30, 2003

Assignee: Microsoft Corporation

Inventors: Hsiao-Wuen Hon, Kuansan Wang
Application abstraction with dialog purpose

Publication number: 20030130854

Abstract: Controls are provided for a web server to generate client side markups that include recognition and/or audible prompting. The controls comprise elements of a dialog such as a question, answer, confirmation, command or statement. A module forms a dialog by making use of the information carried in the controls.

Type: Application

Filed: October 21, 2001

Publication date: July 10, 2003

Inventors: Francisco M. Galanes, Hsiao-Wuen Hon, James D. Jacoby, Renaud J. Lecoueche, Stephen F. Potter
Predictive keyboard

Patent number: 6573844

Abstract: Predictive keyboards, such as predictive soft keyboards, are disclosed. In one embodiment, a computer-implemented method predicts at least one key to be entered next within a sequence of keys. The method displays a soft keyboard where the predicted keys are displayed on the soft keyboard differently than the other keys on the keyboard. For example, the predicted keys may be larger in size on the soft keyboard as compared to the other keys. This makes the predicted keys more easily typed by a user as compared to the other keys.

Type: Grant

Filed: January 18, 2000

Date of Patent: June 3, 2003

Assignee: Microsoft Corporation

Inventors: Daniel Venolia, Joshua Goodman, Xuedong Huang, Hsiao-Wuen Hon
Confidence measure system using a near-miss pattern

Patent number: 6571210

Abstract: A method and system of performing confidence measure in a speech recognition system includes receiving an utterance of input speech and creating a near-miss pattern or a near-miss list of possible word entries for the utterance. Each word entry includes an associated value of probability that the utterance corresponds to the word entry. The near-miss list of possible word entries is compared with corresponding stored near-miss confidence templates. Each word in the vocabulary (or keyword list) of near-miss confidence template, which includes a list of word entries and each word entry in each list includes an associated value. Confidence measure for a particular hypothesis word is performed based on the comparison of the values in the near-miss list of possible word entries with the values of the corresponding near-miss confidence template.

Type: Grant

Filed: November 13, 1998

Date of Patent: May 27, 2003

Assignee: Microsoft Corporation

Inventors: Hsiao-Wuen Hon, Asela J. R. Gunawardana
Web enabled recognition architecture

Publication number: 20030009517

Abstract: A server/client system for processing data includes a network having a web server with information accessible remotely. A client device includes a microphone and a rendering component such as a speaker or display. The client device is configured to obtain the information from the web server and record input data associated with fields contained in the information. The client device is adapted to send the input data to a remote location with an indication of a grammar to use for recognition. A recognition server receives the input data and the indication of the grammar. The recognition server returns data indicative of what was recognized to at least one of the client and the web server.

Type: Application

Filed: September 20, 2001

Publication date: January 9, 2003

Inventors: Kuansan Wang, Hsiao-Wuen Hon
Proofreading with text to speech feedback

Patent number: 6490563

Abstract: A computer implemented system and method of proofreading text in a computer system includes receiving text from a user into a text editing module. At least a portion of the text is converted to an audio signal upon the detection of an indicator, the indicator defining a boundary in the text by either being embodied therein or comprising delays in receiving text. The audio signal is played through a speaker to the user to provide feedback.

Type: Grant

Filed: August 17, 1998

Date of Patent: December 3, 2002

Assignee: Microsoft Corporation

Inventors: Hsiao-Wuen Hon, Dong Li, Xuedong Huang, Yun-Chen Ju, Xianghui Sean Zhang
Markup language extensions for web enabled recognition

Publication number: 20020178182

Abstract: A markup language for execution on a client device in a client/server system includes extensions for recognition.

Type: Application

Filed: September 20, 2001

Publication date: November 28, 2002

Inventors: Kuansan Wang, Hsiao-Wuen Hon
Markup language extensions for web enabled recognition

Publication number: 20020169806

Abstract: A markup language for execution on a client device in a client/server system includes extensions for recognition.

Type: Application

Filed: April 5, 2002

Publication date: November 14, 2002

Inventors: Kuansan Wang, Hsiao-Wuen Hon
Servers for web enabled speech recognition

Publication number: 20020165719

Abstract: A markup language for execution on a client device in a client/server system includes instructions to unify at least one of recognition-related events, GUI events and telephony events on non-display, voice input based client device and a multimodal based client for a web server interacting with each of the client devices. A recognition server for receiving data indicative of inputted data provided to a client device and an indication of a grammar to use for recognition is also provided.

Type: Application

Filed: September 20, 2001

Publication date: November 7, 2002

Inventors: Kuansan Wang, Hsiao-Wuen Hon
PROOFREADING WITH TEXT TO SPEECH FEEDBACK

Publication number: 20010044724

Abstract: A computer implemented system and method of proofreading text in a computer system includes receiving text from a user into a text editing module. At least a portion of the text is converted to an audio signal. The audio signal is played through a speaker to the user to provide feedback.

Type: Application

Filed: August 17, 1998

Publication date: November 22, 2001

Inventors: HSIAO-WUEN HON, DONG LI, XUEDONG HUANG, YUN-CHEN JU, XIANGHUI SEAN ZHANG
CONFIDENCE MEASURE SYSTEM USING A NEAR-MISS PATTERN

Publication number: 20010018654

Abstract: A method and system of performing confidence measure in a speech recognition system includes receiving an utterance of input speech and creating a near-miss pattern or a near-miss list of possible word entries for the utterance. Each word entry includes an associated value of probability that the utterance corresponds to the word entry. The near-miss list of possible word entries is compared with corresponding stored near-miss confidence templates. Each word in the vocabulary (or keyword list) of near-miss confidence template, which includes a list of word entries and each word entry in each list includes an associated value. Confidence measure for a particular hypothesis word is performed based on the comparison of the values in the near-miss list of possible word entries with the values of the corresponding near-miss confidence template.

Type: Application

Filed: November 13, 1998

Publication date: August 30, 2001

Inventors: HSIAO-WUEN HON, ASELA J.R. GUNAWARDANA
Text-to-speech using clustered context-dependent phoneme-based units

Patent number: 6163769

Abstract: A text-to-speech system includes a storage device for storing a clustered set of context-dependent phoneme-based units of a target speaker. In one embodiment, decision trees are used wherein each decision tree based context-dependent phoneme-based unit is arranged based on context of at least one immediately preceding and succeeding phoneme. At least one of the context-dependent phoneme-based units represents other non-stored context-dependent phoneme units of similar sound due to similar contexts. A text analyzer obtains a string of phonetic symbols representative of text to be converted to speech. A concatenation module selects stored decision tree based context-dependent phoneme-based units from the set decision tree based context-dependent phoneme-based units based on the context of the phonetic symbols and synthesizes the selected phoneme-based units to generate speech corresponding to the text.

Type: Grant

Filed: October 2, 1997

Date of Patent: December 19, 2000

Assignee: Microsoft Corporation

Inventors: Alejandro Acero, Hsiao-Wuen Hon, Xuedong D. Huang
Method and system for dynamically adjusted training for speech recognition

Patent number: 5963903

Abstract: A method and system for dynamically selecting words for training a speech recognition system. The speech recognition system models each phoneme using a hidden Markov model and represents each word as a sequence of phonemes. The training system ranks each phoneme for each frame according to the probability that the corresponding codeword will be spoken as part of the phoneme. The training system collects spoken utterances for which the corresponding word is known. The training system then aligns the codewords of each utterance with the phoneme that it is recognized to be part of. The training system then calculates an average rank for each phoneme using the aligned codewords for the aligned frames. Finally, the training system selects words for training that contain phonemes with a low rank.

Type: Grant

Filed: June 28, 1996

Date of Patent: October 5, 1999

Assignee: Microsoft Corporation

Inventors: Hsiao-Wuen Hon, Xuedong D. Huang, Mei-Yuh Hwang, Li Jiang, Yun-Cheng Ju, Milind V. Mahajan, Michael J. Rozak

prev 1 2 3 4 next