Patents by Inventor Shaoxiong YANG

Shaoxiong YANG has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Method and apparatus for generating animation, electronic device, and computer readable medium

Patent number: 11948236

Abstract: The present disclosure discloses a method and apparatus for generating animation. An implementation of the method may include: processing a to-be-processed material to generate a normalized text; analyzing the normalized text to generate a Chinese pinyin sequence of the normalized text; generating a reference audio based on the to-be-processed material; and obtaining a animation of facial expressions corresponding to the timing sequence of the reference audio based on the Chinese pinyin sequence and the reference audio.

Type: Grant

Filed: November 16, 2021

Date of Patent: April 2, 2024

Assignee: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventors: Shaoxiong Yang, Yang Zhao, Chen Zhao
Methods and apparatuses for generating model and generating 3D animation, devices and storage mediums

Patent number: 11836836

Abstract: Methods and apparatuses for generating a model and generating a 3D animation, devices, and storage mediums are provided. The method for generating a model may include: acquiring a preset sample set; acquiring pre-established generative adversarial nets, the generative adversarial nets including a generator and a discriminator; and performing training steps as follows: selecting a sample from the sample set; extracting a sample audio feature from the sample audio of the sample; inputting the sample audio feature into the generator to obtain a pseudo 3D mesh vertex sequence of the sample; inputting the pseudo 3D mesh vertex sequence and the real 3D mesh vertex sequence of the sample into the discriminator to discriminate authenticity of 3D mesh vertices; and in response to determining that the generative adversarial nets meet a training completion condition, obtaining a trained generator as a model for generating a 3D animation.

Type: Grant

Filed: November 15, 2021

Date of Patent: December 5, 2023

Assignee: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor: Shaoxiong Yang
Method and device for generating avatar, electronic equipment, medium and product

Patent number: 11830236

Abstract: Provided are a method and a device for generating an avatar, an electronic equipment, a medium and a product. In the method, a to-be-detected face image of a current user is acquired. The to-be-detected face image is analyzed and at least one original component of the to-be-detected face image is obtained. Each original component of the at least one original component of the to-be-detected face image is matched with each candidate component in a component set corresponding to the each original component, and a target component corresponding to the each original component of the to-be-detected face image is obtained. The target component corresponding to the each original component of the to-be-detected face image is assembled into a personalized avatar of the current user.

Type: Grant

Filed: November 23, 2021

Date of Patent: November 28, 2023

Assignee: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor: Shaoxiong Yang
Method, apparatus, and device for identifying human body and computer readable storage medium

Patent number: 11790483

Abstract: Provided are a method, an apparatus, and a device for identifying human body, including: acquiring a first original picture captured; adjusting a resolution according to the acquired picture to obtain a target picture; processing the target picture based on a preset model for human body feature point detection to determine whether the target picture includes human body information; if the target picture includes the human body information, determining human body area information in the original picture according to the human body information and inputting the human body area information into a filter, enabling that the filter determines target human body area information according to the human body area information; acquiring a next original picture captured; and determining a possible human body area in the next original picture according to the target human body area information, and performing the step of adjusting the resolution according to the possible human body area.

Type: Grant

Filed: September 6, 2019

Date of Patent: October 17, 2023

Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.

Inventors: Chen Zhao, Shaoxiong Yang, Xiaoyin Zhang
Method for translating image and method for training image translation model

Patent number: 11526971

Abstract: The present disclosure provides a computer-implemented method for translating an image and a computer-implemented method for training an image translation model. In the computer-implemented method for translating an image, an image translation request carrying an original image is obtained. The original image is processed to generate a pre-translated image, a mask image and a deformation parameter. The original image is deformed based on the deformation parameter to obtain a deformed image. The deformed image, the pre-translated image and the mask image are merged to generate a target translated image.

Type: Grant

Filed: November 30, 2020

Date of Patent: December 13, 2022

Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventors: Shaoxiong Yang, Chen Zhao
METHOD FOR ANIMATION SYNTHESIS, ELECTRONIC DEVICE AND STORAGE MEDIUM

Publication number: 20220375456

Abstract: A method for animation synthesis includes: obtaining an audio stream to be processed and a syllable sequence, wherein both the audio stream and the syllable sequence correspond to the same text and each syllable in the syllable sequence is pinyin of each character of the text; obtaining a phoneme information sequence of the audio stream by performing phoneme detection on the audio stream, wherein each piece of phoneme information in the phoneme information sequence comprises a phoneme category and a pronunciation time period; determining a pronunciation time period corresponding to each syllable in the syllable sequence based on the syllable sequence, phoneme categories and pronunciation time periods in the phoneme information sequence; and generating an animation video corresponding to the audio stream based on the pronunciation time period corresponding to each syllable in the syllable sequence and an animation frame sequence corresponding to each syllable.

Type: Application

Filed: June 30, 2022

Publication date: November 24, 2022

Inventors: Shaoxiong YANG, Chen ZHAO
Method for translating image, method for training image translation model

Patent number: 11508044

Abstract: A method for translating an image, a method for training an image translation model, and related electronic devices are proposed. In the method for translating an image, an image translation request carrying an original image is obtained. A down-sampled image is generated by down sampling the original image. A pre-translated image, a mask image, and deformation parameters are generated based on the down-sampled image. A size of the pre-translated image and a size of the mask image are the same as a size of the original image. A deformed image is obtained by deforming original image based on the deformation parameters. The deformed image, the pre-translated image and the mask image are fused to generate a target translation image.

Type: Grant

Filed: December 9, 2020

Date of Patent: November 22, 2022

Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventors: Shaoxiong Yang, Chen Zhao
Method for processing automobile image data, apparatus, and readable storage medium

Patent number: 11449707

Abstract: Provided are a method for processing automobile image data, an apparatus, and a readable storage medium. The method includes training a preset adversarial network according to automobile image sample data obtained by collecting and preset random noise information, to obtain a trained adversarial network for generating automobile image data; and inputting the automobile image sample data into the trained adversarial network, and generating the automobile image data including a variety of the random noise information, according to a preset generation target, wherein the automobile image data are configured to train an automobile image recognition neural network to perform automobile image recognition. Therefore, the automobile image data that may be used to train an automobile image recognition neural network are generated by using the adversarial network, which amplifies the amount of data, and reduces the cost of obtaining data.

Type: Grant

Filed: July 9, 2019

Date of Patent: September 20, 2022

Assignee: Baidu Online Network Technology (Beijing) Co., Ltd.

Inventors: Shaoxiong Yang, Chen Zhao
METHOD AND APPARATUS FOR GENERATING ANIMATION, ELECTRONIC DEVICE, AND COMPUTER READABLE MEDIUM

Publication number: 20220180584

Abstract: The present disclosure discloses a method and apparatus for generating animation. An implementation of the method may include: processing a to-be-processed material to generate a normalized text; analyzing the normalized text to generate a Chinese pinyin sequence of the normalized text; generating a reference audio based on the to-be-processed material; and obtaining a animation of facial expressions corresponding to the timing sequence of the reference audio based on the Chinese pinyin sequence and the reference audio.

Type: Application

Filed: November 16, 2021

Publication date: June 9, 2022

Inventors: Shaoxiong Yang, Yang Zhao, Chen Zhao
METHOD AND DEVICE FOR GENERATING AVATAR, ELECTRONIC EQUIPMENT, MEDIUM AND PRODUCT

Publication number: 20220084307

Abstract: Provided are a method and a device for generating an avatar, an electronic equipment, a medium and a product. In the method, a to-be-detected face image of a current user is acquired. The to-be-detected face image is analyzed and at least one original component of the to-be-detected face image is obtained. Each original component of the at least one original component of the to-be-detected face image is matched with each candidate component in a component set corresponding to the each original component, and a target component corresponding to the each original component of the to-be-detected face image is obtained. The target component corresponding to the each original component of the to-be-detected face image is assembled into a personalized avatar of the current user.

Type: Application

Filed: November 23, 2021

Publication date: March 17, 2022

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor: Shaoxiong YANG
METHODS AND APPARATUSES FOR GENERATING MODEL AND GENERATING 3D ANIMATION, DEVICES AND STORAGE MEDIUMS

Publication number: 20220076470

Abstract: Methods and apparatuses for generating a model and generating a 3D animation, devices, and storage mediums are provided. The method for generating a model may include: acquiring a preset sample set; acquiring pre-established generative adversarial nets, the generative adversarial nets including a generator and a discriminator; and performing training steps as follows: selecting a sample from the sample set; extracting a sample audio feature from the sample audio of the sample; inputting the sample audio feature into the generator to obtain a pseudo 3D mesh vertex sequence of the sample; inputting the pseudo 3D mesh vertex sequence and the real 3D mesh vertex sequence of the sample into the discriminator to discriminate authenticity of 3D mesh vertices; and in response to determining that the generative adversarial nets meet a training completion condition, obtaining a trained generator as a model for generating a 3D animation.

Type: Application

Filed: November 15, 2021

Publication date: March 10, 2022

Inventor: Shaoxiong YANG
METHOD FOR TRANSLATING IMAGE AND METHOD FOR TRAINING IMAGE TRANSLATION MODEL

Publication number: 20210374924

Abstract: The present disclosure provides a computer-implemented method for translating an image and a computer-implemented method for training an image translation model. In the computer-implemented method for translating an image, an image translation request carrying an original image is obtained. The original image is processed to generate a pre-translated image, a mask image and a deformation parameter. The original image is deformed based on the deformation parameter to obtain a deformed image. The deformed image, the pre-translated image and the mask image are merged to generate a target translated image.

Type: Application

Filed: November 30, 2020

Publication date: December 2, 2021

Inventors: Shaoxiong YANG, Chen ZHAO
METHOD FOR TRANSLATING IMAGE, METHOD FOR TRAINING IMAGE TRANSLATION MODEL

Publication number: 20210374920

Abstract: A method for translating an image, a method for training an image translation model, and related electronic devices are proposed. In the method for translating an image, an image translation request carrying an original image is obtained. A down-sampled image is generated by down sampling the original image. A pre-translated image, a mask image, and deformation parameters are generated based on the down-sampled image. A size of the pre-translated image and a size of the mask image are the same as a size of the original image. A deformed image is obtained by deforming original image based on the deformation parameters. The deformed image, the pre-translated image and the mask image are fused to generate a target translation image.

Type: Application

Filed: December 9, 2020

Publication date: December 2, 2021

Inventors: Shaoxiong YANG, Chen ZHAO
Gesture recognition method, device, electronic device, and storage medium

Patent number: 10983596

Abstract: The present disclosure provides a gesture recognition method, a device, an electronic device, and a storage medium. The method includes: sequentially performing a recognition process on each image from a target video using a preset recognition model of palm orientation to determine a probability of containing a palm image in each image and a palm normal vector corresponding to each image; determining a group of target images from the target video based on the probability of containing the palm image in each image; and determining a target gesture corresponding to the target video based on the palm normal vector corresponding to each target image in the group of target images.

Type: Grant

Filed: February 14, 2020

Date of Patent: April 20, 2021

Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventors: Chen Zhao, Shaoxiong Yang, Yuan Gao
GESTURE RECOGNITION METHOD, DEVICE, ELECTRONIC DEVICE, AND STORAGE MEDIUM

Publication number: 20200301514

Abstract: The present disclosure provides a gesture recognition method, a device, an electronic device, and a storage medium. The method includes: sequentially performing a recognition process on each image from a target video using a preset recognition model of palm orientation to determine a probability of containing a palm image in each image and a palm normal vector corresponding to each image; determining a group of target images from the target video based on the probability of containing the palm image in each image; and determining a target gesture corresponding to the target video based on the palm normal vector corresponding to each target image in the group of target images.

Type: Application

Filed: February 14, 2020

Publication date: September 24, 2020

Inventors: Chen ZHAO, Shaoxiong YANG, Yuan GAO
METHOD, APPARATUS, AND DEVICE FOR IDENTIFYING HUMAN BODY AND COMPUTER READABLE STORAGE MEDIUM

Publication number: 20200134305

Abstract: Provided are a method, an apparatus, and a device for identifying human body, including: acquiring a first original picture captured; adjusting a resolution according to the acquired picture to obtain a target picture; processing the target picture based on a preset model for human body feature point detection to determine whether the target picture includes human body information; if the target picture includes the human body information, determining human body area information in the original picture according to the human body information and inputting the human body area information into a filter, enabling that the filter determines target human body area information according to the human body area information; acquiring a next original picture captured; and determining a possible human body area in the next original picture according to the target human body area information, and performing the step of adjusting the resolution according to the possible human body area.

Type: Application

Filed: September 6, 2019

Publication date: April 30, 2020

Inventors: Chen ZHAO, Shaoxiong YANG, Xiaoyin ZHANG
Method for Processing Automobile Image Data, Apparatus, and Readable Storage Medium

Publication number: 20190332894

Abstract: Provided are a method for processing automobile image data, an apparatus, and a readable storage medium. The method includes training a preset adversarial network according to automobile image sample data obtained by collecting and preset random noise information, to obtain a trained adversarial network for generating automobile image data; and inputting the automobile image sample data into the trained adversarial network, and generating the automobile image data including a variety of the random noise information, according to a preset generation target, wherein the automobile image data are configured to train an automobile image recognition neural network to perform automobile image recognition. Therefore, the automobile image data that may be used to train an automobile image recognition neural network are generated by using the adversarial network, which amplifies the amount of data, and reduces the cost of obtaining data.

Type: Application

Filed: July 9, 2019

Publication date: October 31, 2019

Applicant: Baidu Online Network Technology (Beijing) Co., Ltd.

Inventors: Shaoxiong YANG, Chen ZHAO