Patents by Inventor Zhifeng Chen
Zhifeng Chen has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11138392
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for machine translation using neural networks. In some implementations, a text in one language is translated into a second language using a neural network model. The model can include an encoder neural network comprising a plurality of bidirectional recurrent neural network layers. The encoding vectors are processed using a multi-headed attention module configured to generate multiple attention context vectors for each encoding vector. A decoder neural network generates a sequence of decoder output vectors using the attention context vectors. The decoder output vectors can represent distributions over various language elements of the second language, allowing a translation of the text into the second language to be determined based on the sequence of decoder output vectors.
Type: Grant
Filed: July 25, 2019
Date of Patent: October 5, 2021
Assignee: Google LLC
Inventors: Zhifeng Chen, Macduff Richard Hughes, Yonghui Wu, Michael Schuster, Xu Chen, Llion Owen Jones, Niki J. Parmar, George Foster, Orhan Firat, Ankur Bapna, Wolfgang Macherey, Melvin Jose Johnson Premkumar
-
Publication number: 20210295858
Abstract: Methods, systems, and computer program products for generating, from an input character sequence, an output sequence of audio data representing the input character sequence. The output sequence of audio data includes a respective audio output sample for each of a number of time steps. One example method includes, for each of the time steps: generating a mel-frequency spectrogram for the time step by processing a representation of a respective portion of the input character sequence using a decoder neural network; generating a probability distribution over a plurality of possible audio output samples for the time step by processing the mel-frequency spectrogram for the time step using a vocoder neural network; and selecting the audio output sample for the time step from the possible audio output samples in accordance with the probability distribution.
Type: Application
Filed: April 5, 2021
Publication date: September 23, 2021
Inventors: Yonghui Wu, Jonathan Shen, Ruoming Pang, Ron J. Weiss, Michael Schuster, Navdeep Jaitly, Zongheng Yang, Zhifeng Chen, Yu Zhang, Yuxuan Wang, Russell John Wyatt Skerry-Ryan, Ryan M. Rifkin, Ioannis Agiomyrgiannakis
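Illustrative sketch (not taken from the patent text): the final step of the abstract above, selecting an audio output sample in accordance with a probability distribution, can be pictured as drawing from a categorical distribution. The function name, the use of integer sample indices, and the three-level distribution in the usage note are assumptions made for illustration only; the vocoder network that would produce the probabilities is not modeled here.

```python
import random
from typing import Sequence


def select_audio_sample(probabilities: Sequence[float], rng: random.Random) -> int:
    """Draw the index of one audio output sample from a categorical distribution.

    `probabilities[i]` is the (hypothetical) probability the vocoder assigned
    to the i-th possible audio output sample for the current time step.
    """
    if not probabilities:
        raise ValueError("probabilities must not be empty")
    candidates = range(len(probabilities))
    # random.Random.choices performs weighted sampling with replacement.
    return rng.choices(candidates, weights=probabilities, k=1)[0]
```

For example, select_audio_sample([0.0, 1.0, 0.0], random.Random(0)) always returns index 1, because all of the probability mass sits on that sample.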
-
Publication number: 20210279465
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for processing data generated by a sensing system that rotationally senses an environment. In one aspect, a method comprises partitioning a predetermined period of time into a plurality of sub-periods, wherein the predetermined period of time is a period of time for which data generated by the sensing system constitutes a complete rotational sensing of the environment; for each sub-period: receiving current data generated by the sensing system during the sub-period and characterizing a respective partial scene of the environment; processing the current data using an object detection neural network to generate a current object detection output that is specific to the respective partial scene of the environment.
Type: Application
Filed: March 6, 2020
Publication date: September 9, 2021
Inventors: Jonathon Shlens, Vijay Vasudevan, Jiquan Ngiam, Wei Han, Zhifeng Chen, Brandon Chauloon Yang, Benjamin James Caine, Zhengdong Zhang, Christoph Sprunk, Ouais Alsharif, Junhua Mao, Chen Wu
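Illustrative sketch (not taken from the patent text): the partitioning step in the abstract above can be pictured as cutting one full rotation period into equal sub-periods and running a detector on each partial scene as it arrives. Equal-length sub-periods, the function names, and the `detector` callback are assumptions for illustration; the abstract does not say how the sub-periods are sized.

```python
from typing import Callable, List, Sequence, Tuple, TypeVar

Scene = TypeVar("Scene")
Detection = TypeVar("Detection")


def partition_period(period_s: float, num_sub_periods: int) -> List[Tuple[float, float]]:
    """Split one full rotation period into equal (start, end) sub-periods, in seconds."""
    if period_s <= 0 or num_sub_periods <= 0:
        raise ValueError("period_s and num_sub_periods must be positive")
    step = period_s / num_sub_periods
    return [(i * step, (i + 1) * step) for i in range(num_sub_periods)]


def detect_per_sub_period(
    partial_scenes: Sequence[Scene],
    detector: Callable[[Scene], Detection],
) -> List[Detection]:
    """Run the (hypothetical) detector once per partial scene, in arrival order."""
    return [detector(scene) for scene in partial_scenes]
```

With a 1.0 second rotation split four ways, partition_period(1.0, 4) yields quarter-second windows, so a detection output becomes available after each quarter turn rather than only after the full sweep.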
-
Patent number: 11113480
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for neural machine translation. One of the systems includes an encoder neural network comprising: an input forward long short-term memory (LSTM) layer configured to process each input token in the input sequence in a forward order to generate a respective forward representation of each input token, an input backward LSTM layer configured to process each input token in a backward order to generate a respective backward representation of each input token and a plurality of hidden LSTM layers configured to process a respective combined representation of each of the input tokens in the forward order to generate a respective encoded representation of each of the input tokens; and a decoder subsystem configured to receive the respective encoded representations and to process the encoded representations to generate an output sequence.
Type: Grant
Filed: September 25, 2017
Date of Patent: September 7, 2021
Assignee: Google LLC
Inventors: Mohammad Norouzi, Zhifeng Chen, Yonghui Wu, Michael Schuster, Quoc V. Le
-
Patent number: 11107463
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer-readable storage media, for speech recognition using attention-based sequence-to-sequence models. In some implementations, audio data indicating acoustic characteristics of an utterance is received. A sequence of feature vectors indicative of the acoustic characteristics of the utterance is generated. The sequence of feature vectors is processed using a speech recognition model that has been trained using a loss function that uses N-best lists of decoded hypotheses, the speech recognition model including an encoder, an attention module, and a decoder. The encoder and decoder each include one or more recurrent neural network layers. A sequence of output vectors representing distributions over a predetermined set of linguistic units is obtained. A transcription for the utterance is obtained based on the sequence of output vectors. Data indicating the transcription of the utterance is provided.
Type: Grant
Filed: August 1, 2019
Date of Patent: August 31, 2021
Assignee: Google LLC
Inventors: Rohit Prakash Prabhavalkar, Tara N. Sainath, Yonghui Wu, Patrick An Phu Nguyen, Zhifeng Chen, Chung-Cheng Chiu, Anjuli Patricia Kannan
-
Patent number: 11107457
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating speech from text. One of the systems includes one or more computers and one or more storage devices storing instructions that when executed by one or more computers cause the one or more computers to implement: a sequence-to-sequence recurrent neural network configured to: receive a sequence of characters in a particular natural language, and process the sequence of characters to generate a spectrogram of a verbal utterance of the sequence of characters in the particular natural language; and a subsystem configured to: receive the sequence of characters in the particular natural language, and provide the sequence of characters as input to the sequence-to-sequence recurrent neural network to obtain as output the spectrogram of the verbal utterance of the sequence of characters in the particular natural language.
Type: Grant
Filed: November 26, 2019
Date of Patent: August 31, 2021
Assignee: Google LLC
Inventors: Samuel Bengio, Yuxuan Wang, Zongheng Yang, Zhifeng Chen, Yonghui Wu, Ioannis Agiomyrgiannakis, Ron J. Weiss, Navdeep Jaitly, Ryan M. Rifkin, Robert Andrew James Clark, Quoc V. Le, Russell J. Ryan, Ying Xiao
-
Publication number: 20210235126
Abstract: Described herein are methods and systems associated with viewing condition adaption of multimedia content. A method for receiving multimedia content with a device from a network may include determining a viewing parameter, transmitting a request for the multimedia content to the network, whereby the request may be based on the viewing parameter, and receiving the multimedia content from the network, whereby the multimedia content may be processed at a rate according to the viewing parameter. The viewing parameter may include at least one of: a user viewing parameter, a device viewing parameter, or a content viewing parameter. The method may further include receiving a multimedia presentation description (MPD) file from the network. The MPD file may include information relating to the rate of the multimedia content and information relating to the rate may include a descriptor relating to the viewing parameter, whereby the descriptor may be required or optional.
Type: Application
Filed: April 8, 2021
Publication date: July 29, 2021
Applicant: Vid Scale, Inc.
Inventors: Yuriy Reznik, Eduardo Asbun, Zhifeng Chen, Yan Ye, Eldad M. Zeira, Ariela Zeira, Naresh Soni, Hang Liu
-
Publication number: 20210217404
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech synthesis. The methods, systems, and apparatus include actions of obtaining an audio representation of speech of a target speaker, obtaining input text for which speech is to be synthesized in a voice of the target speaker, generating a speaker vector by providing the audio representation to a speaker encoder engine that is trained to distinguish speakers from one another, generating an audio representation of the input text spoken in the voice of the target speaker by providing the input text and the speaker vector to a spectrogram generation engine that is trained using voices of reference speakers to generate audio representations, and providing the audio representation of the input text spoken in the voice of the target speaker for output.
Type: Application
Filed: May 17, 2019
Publication date: July 15, 2021
Applicant: Google LLC
Inventors: Ye Jia, Zhifeng Chen, Yonghui Wu, Jonathan Shen, Ruoming Pang, Ron J. Weiss, Ignacio Lopez Moreno, Fei Ren, Yu Zhang, Quan Wang, Patrick Nguyen
-
Publication number: 20210209315
Abstract: The present disclosure provides systems and methods that train and use machine-learned models such as, for example, sequence-to-sequence models, to perform direct and text-free speech-to-speech translation. In particular, aspects of the present disclosure provide an attention-based sequence-to-sequence neural network which can directly translate speech from one language into speech in another language, without relying on an intermediate text representation.
Type: Application
Filed: March 7, 2020
Publication date: July 8, 2021
Inventors: Ye Jia, Zhifeng Chen, Yonghui Wu, Melvin Johnson, Fadi Biadsy, Ron Weiss, Wolfgang Macherey
-
Publication number: 20210188951
Abstract: The present invention relates to monoclonal antibodies which have high anti-RSV neutralizing titers. The invention further provides for isolated nucleic acids encoding the antibodies of the invention and host cells transformed therewith. The invention yet further provides for diagnostic, prophylactic and therapeutic methods employing the antibodies and nucleic acids of the invention, particularly as a passive immunotherapy agent in infants and the elderly.
Type: Application
Filed: December 8, 2020
Publication date: June 24, 2021
Applicant: Merck Sharp & Dohme Corp.
Inventors: Kalpit A. Vora, Kara S. Cox, Aimin Tang, Zhifeng Chen, Daniel DiStefano, Lan Zhang, Hua-Poo Su
-
Patent number: 11008380
Abstract: The present invention relates to monoclonal antibodies which have high anti-RSV neutralizing titers. The invention further provides for isolated nucleic acids encoding the antibodies of the invention and host cells transformed therewith. The invention yet further provides for diagnostic, prophylactic and therapeutic methods employing the antibodies and nucleic acids of the invention, particularly as a passive immunotherapy agent in infants and the elderly.
Type: Grant
Filed: June 7, 2019
Date of Patent: May 18, 2021
Assignee: Merck Sharp & Dohme Corp.
Inventors: Kalpit A. Vora, Kara S. Cox, Aimin Tang, Zhifeng Chen, Daniel DiStefano, Lan Zhang, Hua-Poo Su
-
Publication number: 20210114611
Abstract: The present invention relates to a system for effectively identifying pressing line of vehicle and giving an early prompt, comprising an image acquisition module, a lane line extraction module, a distance calculation module and an early-warning judgment module. The image acquisition module acquires front images through an optical camera. The lane line extraction module processes the front images to extract lane lines in each of the images. The distance calculation module calculates a distance between the optical camera and each of left and right lane lines, and calculates the distance between the vehicle and each of the left and right lane lines through the position of the camera in the vehicle and vehicle dimensions. Then, the early-warning judgment module judges whether or not to give a driver an early-warning prompt.
Type: Application
Filed: January 17, 2019
Publication date: April 22, 2021
Applicant: FU ZHOU UNIVERSITY
Inventors: Zhifeng CHEN, Ente GUO, Zhenjia FAN, Chenhao PEI, Yanan CHEN, Liqin HUANG, Lin PAN
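Illustrative sketch (not taken from the patent text): the distance calculation and early-warning judgment described in the abstract above can be pictured with simple lateral geometry. It assumes all distances are lateral offsets in meters, that the camera position is given as its offset from the vehicle's left side, and that the warning rule is a fixed threshold; the function names and the 0.25 m default are hypothetical.

```python
from typing import Tuple


def vehicle_to_lane_distances(
    cam_to_left_m: float,
    cam_to_right_m: float,
    cam_offset_from_left_side_m: float,
    vehicle_width_m: float,
) -> Tuple[float, float]:
    """Convert camera-to-lane-line distances into vehicle-side-to-lane-line distances."""
    # The left side of the vehicle is closer to the left line by the camera's offset.
    left = cam_to_left_m - cam_offset_from_left_side_m
    # The right side is closer to the right line by the rest of the vehicle width.
    right = cam_to_right_m - (vehicle_width_m - cam_offset_from_left_side_m)
    return left, right


def should_warn(left_m: float, right_m: float, threshold_m: float = 0.25) -> bool:
    """Assumed rule: warn when either side is closer to its lane line than the threshold."""
    return min(left_m, right_m) < threshold_m
```

For instance, with the camera 0.75 m from the left side of a 1.75 m wide vehicle, camera-to-line distances of 1.5 m (left) and 2.0 m (right) give vehicle-to-line distances of 0.75 m and 1.0 m, and no warning at a 0.25 m threshold.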
-
Patent number: 10971170
Abstract: Methods, systems, and computer program products for generating, from an input character sequence, an output sequence of audio data representing the input character sequence. The output sequence of audio data includes a respective audio output sample for each of a number of time steps. One example method includes, for each of the time steps: generating a mel-frequency spectrogram for the time step by processing a representation of a respective portion of the input character sequence using a decoder neural network; generating a probability distribution over a plurality of possible audio output samples for the time step by processing the mel-frequency spectrogram for the time step using a vocoder neural network; and selecting the audio output sample for the time step from the possible audio output samples in accordance with the probability distribution.
Type: Grant
Filed: August 8, 2018
Date of Patent: April 6, 2021
Assignee: Google LLC
Inventors: Yonghui Wu, Jonathan Shen, Ruoming Pang, Ron J. Weiss, Michael Schuster, Navdeep Jaitly, Zongheng Yang, Zhifeng Chen, Yu Zhang, Yuxuan Wang, Russell John Wyatt Skerry-Ryan, Ryan M. Rifkin, Ioannis Agiomyrgiannakis
-
Patent number: 10947159
Abstract: Provided are a granulated blast-furnace slag activator and a method of manufacturing the same. The granulated blast-furnace slag activator includes, in percent by weight, the following raw materials: 62% to 95% of gypsum and 5% to 38% of high belite sulfoaluminate cement clinker. Also provided is a method of manufacturing cement by mixing the granulated blast-furnace slag activator with granulated blast-furnace slag at a certain ratio.
Type: Grant
Filed: March 21, 2019
Date of Patent: March 16, 2021
Assignee: TANGSHAN POLAR BEAR BUILDING MATERIALS CO., LTD.
Inventors: Jian Zhou, Zhifeng Chen, Zhenqiu Zhang, Zhongxi Ge, Shujuan Zhang, Qiao Chen, Chengjian Liu
-
Patent number: 10951732
Abstract: A service processing method is applied to a system including a first distributed node and at least two second distributed nodes to ensure correct service processing. The first distributed node communicates with a controller using the second distributed nodes. The service processing method performed by the first distributed node, includes obtaining a first operation request, where the first operation request includes a first service object, allocating a first identification code to the first operation request according to a preset rule, where the first identification code identifies a processing sequence of the first operation request for the first service object, and sending the first operation request and the first identification code to the controller using a second distributed node such that the controller determines, according to the preset rule and the first identification code, whether the first operation request needs to be processed.
Type: Grant
Filed: March 26, 2018
Date of Patent: March 16, 2021
Assignee: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Zhifeng Chen
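Illustrative sketch (not taken from the patent text): the abstract above does not state what the preset rule is. One simple rule that fits the description is a per-service-object counter that increases with each request, with the controller discarding any request whose code is not newer than the last one it processed for that object. The class names, method names, and this particular rule are all assumptions for illustration.

```python
from collections import defaultdict
from typing import Dict


class FirstNode:
    """Allocates an increasing identification code per service object (assumed rule)."""

    def __init__(self) -> None:
        self._last_allocated: Dict[str, int] = defaultdict(int)

    def allocate_code(self, service_object: str) -> int:
        self._last_allocated[service_object] += 1
        return self._last_allocated[service_object]


class Controller:
    """Processes a request only if its code is newer than the last one processed."""

    def __init__(self) -> None:
        self._last_processed: Dict[str, int] = {}

    def should_process(self, service_object: str, code: int) -> bool:
        if code <= self._last_processed.get(service_object, 0):
            return False  # stale or duplicate request, e.g. delayed on another path
        self._last_processed[service_object] = code
        return True
```

Under this rule, if two requests for the same service object travel through different second distributed nodes and arrive out of order, the controller processes the newer one and skips the older one when it finally arrives.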
-
Publication number: 20210042620
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training giant neural networks. One of the methods includes obtaining data specifying a partitioning of the neural network into N composite layers that form a sequence of composite layers, wherein each composite layer comprises a distinct plurality of layers from the multiple network layers of the neural network; obtaining data assigning each of the N composite layers to one or more computing devices from a set of N computing devices; partitioning a mini-batch of training examples into a plurality of micro-batches; and training the neural network, comprising: performing a forward pass through the neural network until output activations have been computed for each micro-batch for a final composite layer in the sequence, and performing a backward pass through the neural network until output gradients have been computed for each micro-batch for the first composite layer in the sequence.
Type: Application
Filed: August 10, 2020
Publication date: February 11, 2021
Inventors: Zhifeng Chen, Yanping Huang, Youlong Cheng, HyoukJoong Lee, Dehao Chen, Jiquan Ngiam
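Illustrative sketch (not taken from the patent text): the micro-batch partitioning and forward pass in the abstract above can be pictured in plain Python. Composite layers are modeled as ordinary callables and run one after another on a single machine; a real pipeline would place each composite layer on its own device and overlap micro-batches, and the backward pass is omitted. The function names and the near-equal split are assumptions for illustration.

```python
from typing import Callable, List, Sequence


def split_into_micro_batches(mini_batch: Sequence, num_micro_batches: int) -> List[list]:
    """Split a mini-batch into contiguous micro-batches of near-equal size."""
    if num_micro_batches <= 0:
        raise ValueError("num_micro_batches must be positive")
    base, extra = divmod(len(mini_batch), num_micro_batches)
    micro_batches, start = [], 0
    for i in range(num_micro_batches):
        size = base + (1 if i < extra else 0)  # first `extra` chunks get one more item
        if size == 0:
            continue  # fewer examples than requested micro-batches
        micro_batches.append(list(mini_batch[start:start + size]))
        start += size
    return micro_batches


def forward_pass(composite_layers: Sequence[Callable], micro_batches: Sequence[list]) -> List[list]:
    """Run each micro-batch through every composite layer in sequence order."""
    outputs = []
    for micro_batch in micro_batches:
        activations = micro_batch
        for layer in composite_layers:
            activations = layer(activations)
        outputs.append(activations)  # output activations of the final composite layer
    return outputs
```

Splitting ten examples four ways gives micro-batches of sizes 3, 3, 2, and 2; smaller micro-batches are what would let later composite layers start work before earlier ones have finished the whole mini-batch.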
-
Publication number: 20210012089
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing point cloud data representing a sensor measurement of a scene captured by one or more sensors to generate an object detection output that identifies locations of one or more objects in the scene. When deployed within an on-board system of a vehicle, the object detection output that is generated can be used to make autonomous driving decisions for the vehicle with enhanced accuracy.
Type: Application
Filed: July 8, 2020
Publication date: January 14, 2021
Inventors: Jonathon Shlens, Patrick An Phu Nguyen, Benjamin James Caine, Jiquan Ngiam, Wei Han, Brandon Chauloon Yang, Yuning Chai, Pei Sun, Yin Zhou, Xi Yi, Ouais Alsharif, Zhifeng Chen, Vijay Vasudevan
-
Patent number: 10888939
Abstract: A miter saw that includes a base, a worktable arranged on the base and defining a worktable plane, and a cutting head formed with or connected to an operating member operable by a user. The cutting head further includes a circular saw blade operative to rotate around a first axis and a motor operative to drive the circular saw blade. A fence is arranged on the worktable. The cutting head is further connected to a first guiding member configured for guiding chips to be discharged. The fence is formed with a guiding portion. The cutting head is operative to rotate around a second axis parallel to the worktable plane and, when the cutting head rotates around the second axis, the guiding portion is operative to guide the first guiding member to cross the fence.
Type: Grant
Filed: September 17, 2019
Date of Patent: January 12, 2021
Assignee: Nanjing Chervon Industry Co., Ltd.
Inventors: Zhifeng Chen, Yinglu Ai, Guigong Ni
-
Publication number: 20200410396
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for performing machine learning tasks. One method includes receiving (i) a model input, and (ii) data identifying a first machine learning task to be performed on the model input to generate a first type of model output for the model input; augmenting the model input with an identifier for the first machine learning task to generate an augmented model input; and processing the augmented model input using a machine learning model, wherein the machine learning model has been trained on training data to perform a plurality of machine learning tasks including the first machine learning task, and wherein the machine learning model has been configured through training to process the augmented model input to generate a machine learning model output of the first type for the model input.
Type: Application
Filed: July 13, 2020
Publication date: December 31, 2020
Inventors: Zhifeng Chen, Michael Schuster, Melvin Jose Johnson Premkumar, Yonghui Wu, Quoc V. Le, Maxim Krikun, Thorsten Brants
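Illustrative sketch (not taken from the patent text): the augmentation step in the abstract above can be pictured as prepending a task-identifier token to a tokenized input before it reaches a single shared model. The angle-bracket token format, the function name, and the example task identifier are assumptions for illustration; the abstract does not specify how the identifier is encoded or where it is placed.

```python
from typing import List, Sequence


def augment_with_task(tokens: Sequence[str], task_id: str) -> List[str]:
    """Return a new token list with a task-identifier token placed first."""
    if not task_id:
        raise ValueError("task_id must be a non-empty string")
    # The "<...>" wrapper is a hypothetical convention that keeps the
    # identifier distinct from ordinary input tokens.
    return [f"<{task_id}>"] + list(tokens)
```

For example, augment_with_task(["Hello", "world"], "2es") returns ["<2es>", "Hello", "world"]; a model trained on many tasks could then use that leading token to decide which type of output to produce.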
-
Patent number: 10880349
Abstract: Quality-based optimizations of a delivery process of streaming content may be enabled. The optimization may take the form of quality-based switching. To enable quality-based switching in a streaming client, the client may have access to information about the quality of an encoded segment and/or sub-segment. Quality-related information may include any number of added quality metrics relating to an encoded segment and/or sub-segment of an encoded video stream. The addition of quality-related information may be accomplished by including the quality-related information in a manifest file, including the quality-related information in segment indices stored in a segment index file, and/or providing additional files with quality-related segment information and providing a link to the information from an MPD file. Upon receiving the quality-related information, the client may request and receive a stream that has a lower bitrate, thereby saving bandwidth while retaining quality of the streaming content.
Type: Grant
Filed: November 13, 2018
Date of Patent: December 29, 2020
Assignee: VID SCALE, Inc.
Inventors: Yuriy Reznik, Eduardo Asbun, Zhifeng Chen, Rahul Vanam
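Illustrative sketch (not taken from the patent text): quality-based switching as described in the abstract above can be pictured as a client choosing, for the next segment, the lowest-bitrate encoding that still meets a target quality. The `Representation` type, the single scalar quality value (for example a PSNR figure in dB), and the fallback rules are assumptions for illustration; reading the quality information out of a manifest or segment index is not modeled.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Representation:
    bitrate_kbps: int
    quality: float  # hypothetical per-segment quality metric, higher is better


def pick_representation(
    representations: Sequence[Representation],
    bandwidth_kbps: int,
    target_quality: float,
) -> Optional[Representation]:
    """Pick the cheapest affordable encoding that meets the target quality."""
    if not representations:
        return None
    affordable = [r for r in representations if r.bitrate_kbps <= bandwidth_kbps]
    if not affordable:
        # Nothing fits the bandwidth: fall back to the lowest bitrate available.
        return min(representations, key=lambda r: r.bitrate_kbps)
    good_enough = [r for r in affordable if r.quality >= target_quality]
    if good_enough:
        # Quality target met: save bandwidth by taking the lowest bitrate that meets it.
        return min(good_enough, key=lambda r: r.bitrate_kbps)
    # Target not reachable: take the best quality the bandwidth allows.
    return max(affordable, key=lambda r: r.quality)
```

A bitrate-only client with 2500 kbps available would fetch the 2000 kbps encoding of a segment; with the quality information, the same client can see that a 1000 kbps encoding already meets the target and request that one instead.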