Patents by Inventor Pedro Chen

Pedro Chen has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Using speech recognition to improve cross-language speech synthesis

Patent number: 11990117

Abstract: A method for training a speech recognition model includes obtaining a multilingual text-to-speech (TTS) model. The method also includes generating a native synthesized speech representation for an input text sequence in a first language that is conditioned on speaker characteristics of a native speaker of the first language. The method also includes generating a cross-lingual synthesized speech representation for the input text sequence in the first language that is conditioned on speaker characteristics of a native speaker of a different second language. The method also includes generating a first speech recognition result for the native synthesized speech representation and a second speech recognition result for the cross-lingual synthesized speech representation. The method also includes determining a consistent loss term based on the first speech recognition result and the second speech recognition result and updating parameters of the speech recognition model based on the consistent loss term.

Type: Grant

Filed: October 20, 2021

Date of Patent: May 21, 2024

Assignee: Google LLC

Inventors: Zhehuai Chen, Bhuvana Ramabhadran, Andrew Rosenberg, Yu Zhang, Pedro J. Moreno Mengibar

MULTI-DIALECT AND MULTILINGUAL SPEECH RECOGNITION

Publication number: 20240161732

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer-readable media, for speech recognition using multi-dialect and multilingual models. In some implementations, audio data indicating audio characteristics of an utterance is received. Input features determined based on the audio data are provided to a speech recognition model that has been trained to output scores indicating the likelihood of linguistic units for each of multiple different languages or dialects. The speech recognition model can be one that has been trained using cluster adaptive training. Output that the speech recognition model generated in response to receiving the input features determined based on the audio data is received. A transcription of the utterance generated based on the output of the speech recognition model is provided.

Type: Application

Filed: January 20, 2024

Publication date: May 16, 2024

Applicant: Google LLC

Inventors: Zhifeng Chen, Bo Li, Eugene Weinstein, Yonghui Wu, Pedro J. Moreno Mengibar, Ron J. Weiss, Khe Chai Sim, Tara N. Sainath, Patrick An Phu Nguyen

METHOD, ELECTRONIC DEVICE, AND COMPUTER PROGRAM PRODUCT FOR MONITORING AUTHENTICATION BASED ON RADAR

Publication number: 20240134026

Abstract: A method in an illustrative embodiment includes determining that a first object authenticated by an electronic device is accessing the electronic device. The method further includes, in response to a second object being detected within a detection range using a radar of the electronic device, determining, based on a detected signal, that the second object is a person. The method further includes determining a distance and an angle between the second object and the electronic device based on an azimuth signal in the detected signal. The method further includes, in response to determining that the distance is less than a distance threshold and the angle is less than an angle threshold, determining, based on the biological feature signal, whether the second object is trustworthy. The method further includes deauthenticating the first object in response to determining that the second object is untrustworthy.
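The decision flow in this abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the claimed implementation: the `Detection` record, the `should_deauthenticate` function, the default thresholds, and the use of the absolute angle are all hypothetical, and the radar signal processing that would produce these values is not modeled.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Detection:
    """Hypothetical summary of one radar detection (all fields are assumptions)."""
    is_person: bool      # result of classifying the detected signal
    distance_m: float    # distance between the second object and the device
    angle_deg: float     # angle between the second object and the device
    is_trusted: bool     # result of checking the biological feature signal


def should_deauthenticate(
    first_object_authenticated: bool,
    detection: Optional[Detection],
    distance_threshold_m: float = 1.5,   # assumed value, not from the abstract
    angle_threshold_deg: float = 30.0,   # assumed value, not from the abstract
) -> bool:
    """Return True when the authenticated first object should be deauthenticated."""
    # Monitoring only applies while an authenticated object is using the device.
    if not first_object_authenticated or detection is None:
        return False
    # The second object must be recognized as a person.
    if not detection.is_person:
        return False
    # Both the distance and the angle must be below their thresholds.
    close_enough = detection.distance_m < distance_threshold_m
    in_view = abs(detection.angle_deg) < angle_threshold_deg
    if not (close_enough and in_view):
        return False
    # Deauthenticate only if the nearby person is judged untrustworthy.
    return not detection.is_trusted
```

For example, an untrusted person detected 0.8 m away at 10 degrees would trigger deauthentication under these assumed thresholds, while the same person at 3 m would not.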

Type: Application

Filed: November 10, 2022

Publication date: April 25, 2024

Inventors: Pedro Fernandez, Qiang Chen, Zhen Jia

Method, electronic device, and computer program product for using virtual desktop

Patent number: 11941438

Abstract: Embodiments of the present disclosure provide a method, an electronic device, and a computer program product for using a virtual desktop. A method in one embodiment includes receiving, at a first edge node in a plurality of edge nodes, an instruction from a first set of input devices in a plurality of peripheral devices. The instruction is for use of a first virtual desktop deployed on the first edge node. The method further includes: using the first virtual desktop based on the instruction by using resources at the first edge node. The method further includes: sending data to an output device in the plurality of peripheral devices, wherein the data is associated with the use of the first virtual desktop. The solution for using a virtual desktop of the present application enables the use of a virtual desktop using resources at an edge node without requiring a client.

Type: Grant

Filed: August 10, 2021

Date of Patent: March 26, 2024

Assignee: EMC IP Holding Company LLC

Inventors: Pedro Fernandez Orellana, Qiang Chen

METHOD, ELECTRONIC DEVICE, AND COMPUTER PROGRAM PRODUCT FOR VIDEO PROCESSING

Publication number: 20240095878

Abstract: Embodiments of the present disclosure relate to a method, an electronic device, and a computer program product for video processing. The method includes: converting, based on a sample frame in a first video having a first resolution as well as a template, a first group of image frames in the first video that correspond to the sample frame to a second group of image frames that are more similar to the template. The method also includes: converting the second group of image frames to a third group of image frames having a higher resolution to generate a higher-resolution video. In this manner, low-resolution image frames can be first converted to image frames that are more suitable for resolution conversion, so that a high-resolution video of a higher quality can be obtained when reconstructing the high-resolution video.

Type: Application

Filed: October 10, 2022

Publication date: March 21, 2024

Inventors: Qiang Chen, Pedro Fernandez

Consistency prediction on streaming sequence models

Patent number: 11929060

Abstract: A method for training a speech recognition model includes receiving a set of training utterance pairs each including a non-synthetic speech representation and a synthetic speech representation of a same corresponding utterance. At each of a plurality of output steps for each training utterance pair in the set of training utterance pairs, the method also includes determining a consistent loss term for the corresponding training utterance pair based on a first probability distribution over possible non-synthetic speech recognition hypotheses generated for the corresponding non-synthetic speech representation and a second probability distribution over possible synthetic speech recognition hypotheses generated for the corresponding synthetic speech representation. The first and second probability distributions are generated for output by the speech recognition model.
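As a rough illustration of the per-step consistency loss described in this abstract, the sketch below compares the two probability distributions at each output step and averages the result. The abstract does not name a divergence measure; Kullback-Leibler divergence is used here purely as an assumption, and the function names `kl_divergence` and `consistency_loss` are hypothetical.

```python
import math
from typing import Sequence


def kl_divergence(p: Sequence[float], q: Sequence[float], eps: float = 1e-12) -> float:
    """KL(p || q) for two discrete distributions over the same hypotheses."""
    if len(p) != len(q):
        raise ValueError("distributions must cover the same hypotheses")
    # Terms with p_i == 0 contribute nothing; eps guards against log(0).
    return sum(
        pi * math.log((pi + eps) / (qi + eps))
        for pi, qi in zip(p, q)
        if pi > 0
    )


def consistency_loss(
    non_synthetic_steps: Sequence[Sequence[float]],
    synthetic_steps: Sequence[Sequence[float]],
) -> float:
    """Average the per-step divergence over all output steps of one utterance pair.

    Each argument holds one distribution per output step, as produced by the
    speech recognition model for the non-synthetic and synthetic
    representations of the same utterance.
    """
    if len(non_synthetic_steps) != len(synthetic_steps):
        raise ValueError("both sequences must have the same number of output steps")
    if not non_synthetic_steps:
        raise ValueError("at least one output step is required")
    per_step = [
        kl_divergence(p, q)
        for p, q in zip(non_synthetic_steps, synthetic_steps)
    ]
    return sum(per_step) / len(per_step)
```

Under this assumption the loss is zero when the model assigns identical distributions to both representations at every step and grows as the two diverge, which is the behavior a consistency term is meant to penalize.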

Type: Grant

Filed: February 8, 2021

Date of Patent: March 12, 2024

Assignee: Google LLC

Inventors: Zhehuai Chen, Andrew Rosenberg, Bhuvana Ramabhadran, Pedro Jose Moreno Mengibar

Method, device, and computer program product for video processing

Patent number: 11928855

Abstract: Embodiments of the disclosure include a method, a device, and a computer program product for video processing. This method includes: selecting frames having features of a first type from a first instance of a video as a first candidate set, the first instance having a first resolution; generating a set of training frames based at least on the first candidate set; acquiring a set of corresponding frames for the set of training frames in a second instance of the video, the second instance having a second resolution lower than the first resolution; and determining, using the set of training frames and the set of corresponding frames, a conversion parameter for conversion from the second resolution to a third resolution. This solution provides a smaller-scale and higher-quality training set for the training of a video conversion model, thus improving the quality of training while saving computational resources and increasing training speed.

Type: Grant

Filed: January 10, 2022

Date of Patent: March 12, 2024

Assignee: Dell Products L.P.

Inventors: Pedro Fernandez Orellana, Qiang Chen, Zhen Jia

SYSTEMS AND METHODS FOR PROVIDING SUBSEQUENT PAYMENT OPTIONS FOR IDENTIFIED ELIGIBLE USERS

Publication number: 20160335717

Abstract: Systems, methods, and non-transitory computer-readable media can determine that a user has requested a service that involves payment. Information associated with the user can be analyzed. It can be determined, based on the information associated with the user, that the user is eligible for a subsequent payment option. The subsequent payment option can be provided to the user as a line of credit applicable toward payment for the service.

Type: Application

Filed: May 11, 2015

Publication date: November 17, 2016

Inventors: Aaron Patrick O'Brien, Luis Medina, John Stephen Anderson, Pedro Chen, Marius Mircea Lazer

Wearable device with graphical user interface

Patent number: D1026900

Type: Grant

Filed: May 20, 2022

Date of Patent: May 14, 2024

Assignee: Apple Inc.

Inventors: Jody Akana, Molly Anderson, Bartley K. Andre, Shota Aoyagi, Marine C. Bataille, Kevin Will Chen, Abidur Rahman Chowdhury, Andrew Patrick Clymer, Clara Geneviève Marine Courtaigne, Markus Diebel, Alexandre B. Girard, Jonathan Gomez Garcia, Aurelio Guzmán, M. Evans Hankey, Anne-Marie Heck, Moises Hernandez Hernandez, Richard P. Howarth, Julian Jaede, Duncan Robert Kerr, Kainoa Kwon-Perez, Nicolas Pedro Lylyk, Aaron Mathew Melim, Peter Russell-Clarke, Benjamin Andrew Shaffer, Clement Tissandier