Patents by Inventor YUEH-TUNG WU

YUEH-TUNG WU has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Multilingual speech recognition and translation method and related system for a conference which determines quantity of attendees according to their distances from their microphones

Patent number: 11881224

Abstract: The present invention provides a multilingual speech recognition and translation method for a conference. The conference includes at least one attendee, and the method includes: receiving, at a server, at least one piece of audio data and at least one piece of video data generated by at least one terminal apparatus; analyzing the video data to generate a video recognition result related to an attendance, an ethnicity of the attendee, and a body movement and a facial movement of the attendee when talking; generating at least one language family recognition result according to the video recognition result and the audio data, and obtaining a plurality of audio segments corresponding to the attendee; performing speech recognition on and translating the audio segments; and displaying a translation result on the terminal apparatus. The method further determines a quantity of conference attendees according to their respective distances from their device microphones.

Type: Grant

Filed: August 5, 2021

Date of Patent: January 23, 2024

Assignee: PEGATRON CORPORATION

Inventors: Yueh-Tung Wu, Jun-Ying Li

MULTILINGUAL SPEECH RECOGNITION AND TRANSLATION METHOD AND RELATED SYSTEM

Publication number: 20220076679

Abstract: The present invention provides a multilingual speech recognition and translation method for a conference. The conference includes at least one attendee, and the method includes: receiving, at a server, at least one piece of audio data and at least one piece of video data generated by at least one terminal apparatus; analyzing the video data to generate a video recognition result related to an attendance, an ethnicity of the attendee, and a body movement and a facial movement of the attendee when talking; generating at least one language family recognition result according to the video recognition result and the audio data, and obtaining a plurality of audio segments corresponding to the attendee; performing speech recognition on and translating the audio segments; and displaying a translation result on the terminal apparatus.

Type: Application

Filed: August 5, 2021

Publication date: March 10, 2022

Inventors: YUEH-TUNG WU, JUN-YING LI
