Patents by Inventor Yide ZOU

Yide ZOU has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Personalized batch and streaming speech-to-text transcription of audio

Patent number: 12198681

Abstract: Techniques for personalized batch and streaming speech-to-text transcription of audio reduce the error rate of automatic speech recognition (ASR) systems in transcribing rare and out-of-vocabulary words. The techniques achieve personalization of connectionist temporal classification (CT) models by using adaptive boosting to perform biasing at the level of sub-words. In addition to boosting, the techniques encompass a phone alignment network to bias sub-word predictions towards rare long-tail words and out-of-vocabulary words. A technical benefit of the techniques is that the accuracy of speech-to-text transcription of rare and out-of-vocabulary words in a custom vocabulary by automatic speech recognition (ASR) system can be improved without having to train the ASR system on the custom vocabulary. Instead, the techniques allow the same ASR system trained on a base vocabulary to realize the accuracy improvements for different custom vocabularies spanning different domains.

Type: Grant

Filed: September 30, 2022

Date of Patent: January 14, 2025

Assignee: Amazon Technologies, Inc.

Inventors: Monica Lakshmi Sunkara, Srikanth Ronanki, Sravan Babu Bodapati, Jeffrey John Farris, Katrin Kirchhoff, Vivek Govindan, Yide Zou, Mohit Narendra Gupta, Silviu Mihai Burz
Systems and/or methods for accelerating facial feature vector matching with supervised machine learning

Patent number: 10747989

Abstract: A face recognition neural network generates small dimensional facial feature vectors for faces in received images. A multi-task learning classifier network receives generated vectors and outputs classifications for the corresponding images. The classifications are built by respective sub-networks each comprising classification layers arranged in levels. The layers in at least some levels receive as input intermediate outputs from an immediately upstream classification layer in the same sub-network and in each other sub-network. After training, a resultant facial feature vector corresponding to an input image is received from the neural network. Resultant classifiers for the input image are received from the classifier network. A database is searched for a subset of reference facial feature vectors having associated classification features matching the resultant classifiers.

Type: Grant

Filed: August 21, 2018

Date of Patent: August 18, 2020

Assignee: Software AG

Inventors: Yide Zou, Kevin Muench
SYSTEMS AND/OR METHODS FOR ACCELERATING FACIAL FEATURE VECTOR MATCHING WITH SUPERVISED MACHINE LEARNING

Publication number: 20200065563

Abstract: A face recognition neural network generates small dimensional facial feature vectors for faces in received images. A multi-task learning classifier network receives generated vectors and outputs classifications for the corresponding images. The classifications are built by respective sub-networks each comprising classification layers arranged in levels. The layers in at least some levels receive as input intermediate outputs from an immediately upstream classification layer in the same sub-network and in each other sub-network. After training, a resultant facial feature vector corresponding to an input image is received from the neural network. Resultant classifiers for the input image are received from the classifier network. A database is searched for a subset of reference facial feature vectors having associated classification features matching the resultant classifiers.

Type: Application

Filed: August 21, 2018

Publication date: February 27, 2020

Inventors: Yide ZOU, Kevin MUENCH

Personalized batch and streaming speech-to-text transcription of audio

Systems and/or methods for accelerating facial feature vector matching with supervised machine learning

SYSTEMS AND/OR METHODS FOR ACCELERATING FACIAL FEATURE VECTOR MATCHING WITH SUPERVISED MACHINE LEARNING