Patents by Inventor YUEH-TUNG WU

YUEH-TUNG WU has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11881224
    Abstract: The present invention provides a multilingual speech recognition and translation method for a conference. The conference includes at least one attendee, and the method includes: receiving, at a server, at least one piece of audio data and at least one piece of video data generated by at least one terminal apparatus; analyzing the video data to generate a video recognition result related to an attendance, and an ethnic of the attendee and a body movement, and a facial movement of the attendee when talking; generating at least one language family recognition result according to the video recognition result and the audio data, and obtaining a plurality of audio segments corresponding to the attendee; performing speech recognition on and translating the audio segments; and displaying a translation result on the terminal apparatus. The method further determines a quantity of conference attendees according to their respective distances from their device microphones.
    Type: Grant
    Filed: August 5, 2021
    Date of Patent: January 23, 2024
    Assignee: PEGATRON CORPORATION
    Inventors: Yueh-Tung Wu, Jun-Ying Li
  • Publication number: 20220076679
    Abstract: The present invention provides a multilingual speech recognition and translation method for a conference. The conference includes at least one attendee, and the method includes: receiving, at a server , at least one piece of audio data and at least one piece of video data generated by at least one terminal apparatus; analyzing the video data to generate a video recognition result related to an attendance, and an ethnic of the attendee and a body movement, and a facial movement of the attendee when talking; generating at least one language family recognition result according to the video recognition result and the audio data, and obtaining a plurality of audio segments corresponding to the attendee; performing speech recognition on and translating the audio segments; and displaying a translation result on the terminal apparatus.
    Type: Application
    Filed: August 5, 2021
    Publication date: March 10, 2022
    Inventors: YUEH-TUNG WU, JUN-YING LI