Patents by Inventor Huakai LIAO

Huakai LIAO has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240013790
    Abstract: A method and system for enhancing pronunciation during a speech, the method including receiving audio data, the audio data including a speech, performing at least one of acoustic scoring and language scoring on the speech, determining a pronunciation score of one or more words of the speech based on the acoustic scoring and the language scoring, determining that the pronunciation score for the word does not satisfy a threshold score, responsive to determining that the pronunciation score does not satisfy the threshold score, identifying the word as mispronounced, and responsive to identifying the word as mispronounced, outputting the word and the pronunciation score thereof.
    Type: Application
    Filed: May 28, 2021
    Publication date: January 11, 2024
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Runnan LI, Sheng ZHAO, Amit SRIVASTAVA, Huakai LIAO, Ana PARRA, Tapan BOHRA, Akshay MALLIPEDDI, Siliang KANG, Lisha MA, Yinhe WEI
  • Publication number: 20230395064
    Abstract: A computing apparatus comprises one or more computer readable storage media, one or more processors operatively coupled with the one or more computer readable storage media, and program instructions stored on the one or more computer readable storage media. The program instructions, when executed by the one or more processors, direct the computing apparatus to at least generate an audio recording of speech, extract features from the audio recording indicative of vocal patterns in the speech, determine a register classification of the speech based at least on the features, and display an indication of the register classification in a user interface.
    Type: Application
    Filed: June 7, 2022
    Publication date: December 7, 2023
    Inventors: Huakai LIAO, Ana PARRA, Gaurav Vinayak TENDOLKAR, Amit SRIVASTAVA, Siliang KANG
  • Patent number: 11341331
    Abstract: An intelligent speech assistant receives information collected while a user is speaking. The information can comprise speech data, vision data, or both, where the speech data is from the user speaking and the vision data is of the user while speaking. The assistant evaluates the speech data against a script which can contain information that the user should speak, information that the user should not speak, or both. The assistant collects instances where the user utters phrases that match the script or instances where the user utters phrases that do not match the script, depending on whether phrases should or should not be spoken. The assistant evaluates vision data to identify gestures, facial expressions, and/or emotions of the user. Instances where the gestures, facial expressions, and/or emotions are not appropriate to the context are flagged. Real-time prompts and/or a summary are presented to the user as feedback.
    Type: Grant
    Filed: October 4, 2019
    Date of Patent: May 24, 2022
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Huakai Liao, Priyanka Vikram Sinha, Kevin Dara Khieu, Derek Martin Johnson, Siliang Kang, Huey-Ru Tsai, Amit Srivastava
  • Publication number: 20210103635
    Abstract: An intelligent speech assistant receives information collected while a user is speaking. The information can comprise speech data, vision data, or both, where the speech data is from the user speaking and the vision data is of the user while speaking. The assistant evaluates the speech data against a script which can contain information that the user should speak, information that the user should not speak, or both. The assistant collects instances where the user utters phrases that match the script or instances where the user utters phrases that do not match the script, depending on whether phrases should or should not be spoken. The assistant evaluates vision data to identify gestures, facial expressions, and/or emotions of the user. Instances where the gestures, facial expressions, and/or emotions are not appropriate to the context are flagged. Real-time prompts and/or a summary are presented to the user as feedback.
    Type: Application
    Filed: October 4, 2019
    Publication date: April 8, 2021
    Inventors: Huakai LIAO, Priyanka Vikram SINHA, Kevin Dara KHIEU, Derek Martin JOHNSON, Siliang KANG, Huey-Ru TSAI, Amit SRIVASTAVA
  • Publication number: 20210097133
    Abstract: A system and method for personalizing a display of a recommendation in a user interface element of an application is described. The system accesses application activities of a user of the application. A user preference is formed based on the application activities. The system identifies a context of a current activity of the application and generates a content recommendation in the application based on the context of the current activity of the application and the user preference.
    Type: Application
    Filed: September 27, 2019
    Publication date: April 1, 2021
    Inventors: Huakai Liao, Debapriya Pal, Sun Mao, Erik Thomas Oveson, Huitian Jiao, Daniel M Cheung, Derek Martin Johnson, Bogdan Popp
  • Publication number: 20210065582
    Abstract: A method and system for speech rehearsal assistance during a presentation rehearsal includes receiving audio data from a speech rehearsal session over a network, receiving a transcript for the audio data, the transcript including a plurality of words spoken during the speech rehearsal session, calculating a real time speaking rate for the speech rehearsal session, determining if the speaking rate is within a threshold range, detecting utterance of a filler phrase or sound during the speech rehearsal session using at least in part a machine learning model trained for identifying filler phrases and sounds in a text, and upon determining the speaking rate falls outside the threshold range or detecting the utterance of the filler phrase or sound, enabling real time display of a notification on a display device.
    Type: Application
    Filed: September 4, 2019
    Publication date: March 4, 2021
    Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Huakai LIAO, Kevin Dara KHIEU, Amit SRIVASTAVA, Huey-Ru TSAI, Gregory Alexander DEPAUL, John Christian LEONE, Hemin Kiran MERCHANT
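
The steps named in the abstract of publication 20210065582 (compute a speaking rate from a timed transcript, check it against a threshold range, detect fillers, raise a notification) can be illustrated with a minimal sketch. This is not the method claimed in the application. The threshold range, the sliding window, the `Word` type, and every function name below are hypothetical, and a plain word-list lookup stands in for the trained machine learning model that the abstract describes.

```python
from dataclasses import dataclass

# Hypothetical values: the abstract gives neither a threshold range nor a filler list.
MIN_WPM = 110.0
MAX_WPM = 170.0
FILLERS = {"um", "uh", "er"}


@dataclass
class Word:
    text: str
    start_s: float  # time the word began, in seconds from session start


def speaking_rate_wpm(words, now_s, window_s=10.0):
    """Words per minute over the most recent window of the transcript."""
    recent = [w for w in words if now_s - window_s <= w.start_s <= now_s]
    return len(recent) * 60.0 / window_s


def find_fillers(words):
    """Stand-in for the trained filler model: a plain word-list lookup."""
    return [w.text for w in words if w.text.lower().strip(".,!?") in FILLERS]


def notifications(words, now_s, window_s=10.0):
    """Messages to display when the pace is out of range or fillers occur."""
    notes = []
    rate = speaking_rate_wpm(words, now_s, window_s)
    if rate < MIN_WPM:
        notes.append(f"Pace is slow ({rate:.0f} wpm)")
    elif rate > MAX_WPM:
        notes.append(f"Pace is fast ({rate:.0f} wpm)")
    fillers = find_fillers(words)
    if fillers:
        notes.append("Filler words: " + ", ".join(fillers))
    return notes
```

Calling `notifications(words, now_s)` with the transcript received so far returns an empty list when the pace is in range and no filler is found. A real system would presumably call it repeatedly as new transcript words arrive.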