Patents by Inventor Huakai LIAO

Huakai LIAO has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

METHOD AND SYSTEM OF DETECTING AND IMPROVING REAL-TIME MISPRONUNCIATION OF WORDS

Publication number: 20240013790

Abstract: A method and system for enhancing pronunciation during a speech, the method including receiving audio data, the audio data including a speech, performing at least one of acoustic scoring and language scoring on the speech, determining a pronunciation score of one or more words of the speech based on the acoustic scoring and the language scoring, determining that the pronunciation score for the word does not satisfy a threshold score, responsive to determining that the pronunciation score does satisfy the threshold score, identifying the word as mispronounced, and responsive to identifying the word as mispronounced, outputting the word and the pronunciation score thereof.

Type: Application

Filed: May 28, 2021

Publication date: January 11, 2024

Applicant: Microsoft Technology Licensing, LLC

Inventors: Runnan LI, Sheng ZHAO, Amit SRIVASTAVA, Huakai LIAO, Ana PARRA, Tapan BOHRA, Akshay MALLIPEDDI, Siliang KANG, Lisha MA, Yinhe WEI
APPLICATION SOFTWARE AND SERVICES WITH REGISTER CLASSIFICATION CAPABILITIES

Publication number: 20230395064

Abstract: A computing apparatus comprises one or more computer readable storage media, one or more processors operatively coupled with the one or more computer readable storage media, and program instructions stored on the one or more computer readable storage media. The program instructions, when executed by the one or more processors, direct the computing apparatus to at least generate an audio recording of speech, extract features from the audio recording indicative of vocal patterns in the speech, determine a register classification of the speech based at least on the features, and display an indication of the register classification in a user interface.

Type: Application

Filed: June 7, 2022

Publication date: December 7, 2023

Inventors: Huakai LIAO, Ana PARRA, Gaurav Vinayak TENDOLKAR, Amit SRIVASTAVA, Siliang KANG
Speaking technique improvement assistant

Patent number: 11341331

Abstract: An intelligent speech assistant receives information collected while a user is speaking. The information can comprise speech data, vision data, or both, where the speech data is from the user speaking and the vision data is of the user while speaking. The assistant evaluates the speech data against a script which can contain information that the user should speak, information that the user should not speak, or both. The assistant collects instances where the user utters phrases that match the script or instances where the user utters phrases that do not match the script, depending on whether phases should or should not be spoken. The assistant evaluates vision data to identify gestures, facial expressions, and/or emotions of the user. Instances where the gestures, facial expressions, and/or emotions are not appropriate to the context are flagged. Real-time prompts and/or a summary is presented to the user as feedback.

Type: Grant

Filed: October 4, 2019

Date of Patent: May 24, 2022

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Huakai Liao, Priyanka Vikram Sinha, Kevin Dara Khieu, Derek Martin Johnson, Siliang Kang, Huey-Ru Tsai, Amit Srivastava
SPEAKING TECHNIQUE IMPROVEMENT ASSISTANT

Publication number: 20210103635

Abstract: An intelligent speech assistant receives information collected while a user is speaking. The information can comprise speech data, vision data, or both, where the speech data is from the user speaking and the vision data is of the user while speaking. The assistant evaluates the speech data against a script which can contain information that the user should speak, information that the user should not speak, or both. The assistant collects instances where the user utters phrases that match the script or instances where the user utters phrases that do not match the script, depending on whether phases should or should not be spoken. The assistant evaluates vision data to identify gestures, facial expressions, and/or emotions of the user. Instances where the gestures, facial expressions, and/or emotions are not appropriate to the context are flagged. Real-time prompts and/or a summary is presented to the user as feedback.

Type: Application

Filed: October 4, 2019

Publication date: April 8, 2021

Inventors: Huakai LIAO, Priyanka Vikram SINHA, Kevin Dara KHIEU, Derek Martin JOHNSON, Siliang KANG, Huey-Ru TSAI, Amit SRIVASTAVA
PERSONALIZED PROACTIVE PANE POP-UP

Publication number: 20210097133

Abstract: A system and method for personalizing a display of a recommendation in a user interface element of an application is described. The system accesses application activities of a user of the application. A user preference is formed based on the application activities. The system identifies a context of a current activity of the application and generates a content recommendation in the application based on the context of the current activity of the application and the user preference.

Type: Application

Filed: September 27, 2019

Publication date: April 1, 2021

Inventors: Huakai Liao, Debapriya Pal, Sun Mao, Erik Thomas Oveson, Huitian Jiao, Daniel M Cheung, Derek Martin Johnson, Bogdan Popp
Method and System of Providing Speech Rehearsal Assistance

Publication number: 20210065582

Abstract: A method and system for speech rehearsal assistant during a presentation rehearsal includes receiving audio data from a speech rehearsal session over a network, receiving a transcript for the audio data, the transcript including a plurality of words spoken during the speech rehearsal session, calculating a real time speaking rate for the speech rehearsal session, determining if the speaking rate is within a threshold range, detecting utterance of a filler phrase or sound during the speech rehearsal session using at least in part a machine learning model trained for identifying filler phrases and sounds in a text, and upon determining the speaking rate falls outside the threshold range or detecting the utterance of the filler phrase or sound, enabling real time display of a notification on a display device.

Type: Application

Filed: September 4, 2019

Publication date: March 4, 2021

Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Huakai LIAO, Kevin Dara KHIEU, Amit SRIVASTAVA, Huey-Ru TSAI, Gregory Alexander DEPAUL, John Christian LEONE, Hemin Kiran Merchant

METHOD AND SYSTEM OF DETECTING AND IMPROVING REAL-TIME MISPRONUNCIATION OF WORDS

APPLICATION SOFTWARE AND SERVICES WITH REGISTER CLASSIFICATION CAPABILITIES

Speaking technique improvement assistant

SPEAKING TECHNIQUE IMPROVEMENT ASSISTANT

PERSONALIZED PROACTIVE PANE POP-UP

Method and System of Providing Speech Rehearsal Assistance