Patents Assigned to DEEPBRAIN AI INC.

Apparatus and method for generating speech synthesis image

Patent number: 12651394

Abstract: An apparatus for generating a speech synthesis image includes a first global geometric transformation predictor to receive a source image and a target image including the same person and predict a global geometric transformation for a global motion of the person between the source image and the target image, a local geometric transformation predictor to predict a local geometric transformation for a local motion of the person based on preset input data, a geometric transformation combiner to calculate a full motion geometric transformation for a full motion of the person by combining the global geometric transformation and the local geometric transformation, an optical flow predictor to calculate an optical flow between the source image and the target image based on the source image and the full motion geometric transformation, and an image generator to reconstruct the target image based on the source image and the optical flow.

Type: Grant

Filed: March 15, 2022

Date of Patent: June 9, 2026

Assignee: DEEPBRAIN AI INC.

Inventors: Gyeong Su Chae, Guem Buel Hwang
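
The combiner described in the abstract above can be illustrated with a minimal sketch. It assumes, purely for illustration, that both transformations are 2D affine maps stored as 3x3 homogeneous matrices and that "combining" means matrix composition with the local motion applied first; the abstract specifies neither. The names `matmul3`, `combine`, and `apply_transform` are hypothetical and do not come from the patent.

```python
from typing import List, Tuple

Matrix = List[List[float]]


def matmul3(a: Matrix, b: Matrix) -> Matrix:
    """Multiply two 3x3 matrices."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
        for i in range(3)
    ]


def combine(global_t: Matrix, local_t: Matrix) -> Matrix:
    """Compose a full-motion transform.

    Assumption: the local motion is applied first, then the global motion.
    """
    return matmul3(global_t, local_t)


def apply_transform(t: Matrix, point: Tuple[float, float]) -> Tuple[float, float]:
    """Apply a homogeneous 2D affine transform to a point."""
    x, y = point
    return (
        t[0][0] * x + t[0][1] * y + t[0][2],
        t[1][0] * x + t[1][1] * y + t[1][2],
    )


# Hypothetical global motion: translate by (10, 5).
global_t = [[1.0, 0.0, 10.0], [0.0, 1.0, 5.0], [0.0, 0.0, 1.0]]
# Hypothetical local motion: scale by 2 about the origin.
local_t = [[2.0, 0.0, 0.0], [0.0, 2.0, 0.0], [0.0, 0.0, 1.0]]

full_t = combine(global_t, local_t)
print(apply_transform(full_t, (1.0, 1.0)))  # (12.0, 7.0)
```

In this toy setup the point is first scaled by the local transform and then shifted by the global one; a learned system would predict these transforms rather than hard-code them.
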
Apparatus and method for generating speech synthesis image

Patent number: 12573119

Abstract: An apparatus according to an embodiment is a speech synthesis image generating apparatus based on machine learning. The apparatus includes a first global geometric transformation predictor to receive a source image and a target image including the same person, and predict a global geometric transformation for a global motion of the person between the source image and the target image based on the source image and the target image, a local geometric transformation predictor to predict a local geometric transformation for a local motion of the person between the source image and the target image based on preset input data, a geometric transformation combiner to calculate a full motion geometric transformation for a full motion of the person by combining the global geometric transformation and the local geometric transformation, and an image generator to reconstruct the target image based on the source image and the full motion geometric transformation.

Type: Grant

Filed: March 15, 2022

Date of Patent: March 10, 2026

Assignee: DEEPBRAIN AI INC.

Inventors: Gyeong Su Chae, Guem Buel Hwang
Speech image providing method and computing device for performing the same

Patent number: 12536727

Abstract: A computing device according to a disclosed embodiment includes one or more processors, a memory storing one or more programs executed by the one or more processors, a standby state image generating module configured to generate a standby state image in which a person is in a standby state, an interpolation image generating module configured to generate an interpolation image set for interpolation between the standby state image and a pre-stored speech preparation image, and an image playback module configured to generate a connection image for connecting the standby state image and a speech state image based on the interpolation image set when the speech state image is generated.

Type: Grant

Filed: July 9, 2021

Date of Patent: January 27, 2026

Assignee: DEEPBRAIN AI INC.

Inventor: Doo Hyun Kim
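
As a rough illustration of the interpolation image set mentioned in the abstract above, the sketch below blends a standby frame toward a speech preparation frame. It assumes a plain linear cross-fade over frames stored as flat lists of pixel intensities; the abstract does not state which interpolation method is used, and `interpolation_set` is a hypothetical name.

```python
from typing import List

Frame = List[float]


def interpolation_set(standby: Frame, prep: Frame, steps: int) -> List[Frame]:
    """Return `steps` intermediate frames between `standby` and `prep`.

    Assumption: linear blending with evenly spaced weights, excluding
    both endpoints.
    """
    if len(standby) != len(prep):
        raise ValueError("frames must have the same number of pixels")
    if steps < 1:
        raise ValueError("steps must be at least 1")
    frames = []
    for i in range(1, steps + 1):
        w = i / (steps + 1)  # blend weight toward the preparation frame
        frames.append([(1 - w) * s + w * p for s, p in zip(standby, prep)])
    return frames


# Two-pixel toy frames.
frames = interpolation_set([0.0, 100.0], [100.0, 0.0], steps=3)
print(frames)  # [[25.0, 75.0], [50.0, 50.0], [75.0, 25.0]]
```
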
Apparatus and method for generating speech synthesis image

Patent number: 12437773

Abstract: An apparatus for generating a speech synthesis image according to a disclosed embodiment is an apparatus for generating a speech synthesis image based on machine learning, the apparatus including a first global geometric transformation predictor configured to be trained to receive each of a source image and a target image including the same person, and predict a global geometric transformation for a global motion of the person between the source image and the target image based on the source image and the target image, a local feature tensor predictor configured to be trained to predict a feature tensor for a local motion of the person based on input target image-related information, and an image generator configured to be trained to reconstruct the target image based on the global geometric transformation, the source image, and the feature tensor for the local motion.

Type: Grant

Filed: March 15, 2022

Date of Patent: October 7, 2025

Assignee: DEEPBRAIN AI INC.

Inventors: Gyeong Su Chae, Guem Buel Hwang
Method for providing speech video and computing device for executing the method

Patent number: 12367892

Abstract: In a method of providing a speech video according to an embodiment, a standby state video in which a person is in a standby state is reproduced, a speech state video in which the person is in a speech state is generated based on a source of speech content, the standby state video being reproduced is returned to a reference frame of the standby state video based on a back motion image, and a synthesized speech video is generated by synthesizing the returned reference frame and the speech state video.

Type: Grant

Filed: February 14, 2024

Date of Patent: July 22, 2025

Assignee: DEEPBRAIN AI INC.

Inventor: Doohyun Kim
Device and method for generating speech video along with landmark

Patent number: 12347197

Abstract: A speech video generation device according to an embodiment includes a first encoder, which receives an input of a person background image that is a video part in a speech video of a predetermined person, and extracts an image feature vector from the person background image, a second encoder, which receives an input of a speech audio signal that is an audio part in the speech video, and extracts a voice feature vector from the speech audio signal, a combining unit, which generates a combined vector by combining the image feature vector output from the first encoder and the voice feature vector output from the second encoder, a first decoder, which reconstructs the speech video of the person using the combined vector as an input, and a second decoder, which predicts a landmark of the speech video using the combined vector as an input.

Type: Grant

Filed: December 15, 2020

Date of Patent: July 1, 2025

Assignee: DEEPBRAIN AI INC.

Inventor: Gyeongsu Chae
Apparatus and method for generating speech synthesis image

Patent number: 12322016

Abstract: An apparatus for generating a speech synthesis image according to a disclosed embodiment is an apparatus for generating a speech synthesis image based on machine learning, the apparatus including a first global geometric transformation predictor configured to be trained to receive each of a source image and a target image including the same person, and predict a global geometric transformation for a global motion of the person between the source image and the target image based on the source image and the target image, a local feature tensor predictor configured to be trained to predict a feature tensor for a local motion of the person based on preset input data, and an image generator configured to be trained to reconstruct the target image based on the global geometric transformation, the source image, and the feature tensor for the local motion.

Type: Grant

Filed: March 15, 2022

Date of Patent: June 3, 2025

Assignee: DEEPBRAIN AI INC.

Inventors: Gyeong Su Chae, Guem Buel Hwang
Speech image providing method and computing device for performing the same

Patent number: 12243550

Abstract: A speech image providing method according to an embodiment includes generating a standby state image in which a person is in a standby state, generating a plurality of back-motion images at a preset frame interval from the standby state image for image interpolation between a preset reference frame of the standby state image, generating a speech state image in which a person is in a speech state based on a source of speech content, returning the standby state image being played to the reference frame based on the plurality of back-motion images of the standby state image, based on a point of time when the generating of the speech state image is completed, and generating a synthetic speech image in combination with frames of the speech state image from the reference frame.

Type: Grant

Filed: July 9, 2021

Date of Patent: March 4, 2025

Assignee: DEEPBRAIN AI INC.

Inventor: Doo Hyun Kim
Apparatus and method for generating lip sync image

Patent number: 12236943

Abstract: An apparatus for generating a lip sync image according to a disclosed embodiment has one or more processors and a memory which stores one or more programs executed by the one or more processors. The apparatus includes a first artificial neural network model configured to generate an utterance match synthesis image by using a person background image and an utterance match audio signal corresponding to the person background image as an input, and generate an utterance mismatch synthesis image by using the person background image and an utterance mismatch audio signal not corresponding to the person background image as an input, and a second artificial neural network model configured to output classification values for an input pair in which an image and a voice match and an input pair in which an image and a voice do not match by using the input pairs as an input.

Type: Grant

Filed: June 8, 2021

Date of Patent: February 25, 2025

Assignee: DEEPBRAIN AI INC.

Inventors: Guem Buel Hwang, Gyeong Su Chae
Device and method for synthesizing image capable of improving image quality

Patent number: 12236558

Abstract: An image synthesis device according to a disclosed embodiment has one or more processors and a memory which stores one or more programs executed by the one or more processors. The image synthesis device includes a first artificial neural network provided to learn each of a first task of using a damaged image as an input to output a restored image and a second task of using an original image as an input to output a reconstructed image, and a second artificial neural network connected to an output layer of the first artificial neural network, and trained to use the reconstructed image output from the first artificial neural network as an input and improve the image quality of the reconstructed image.

Type: Grant

Filed: June 8, 2021

Date of Patent: February 25, 2025

Assignee: DEEPBRAIN AI INC.

Inventors: Gyeong Su Chae, Guem Buel Hwang
Device and method for generating speech video

Patent number: 12205342

Abstract: A speech video generation device according to an embodiment includes a first encoder that receives an input of a first person background image of a predetermined person partially hidden by a first mask, and extracts a first image feature vector from the first person background image, a second encoder, which receives an input of a second person background image of the person partially hidden by a second mask, and extracts a second image feature vector from the second person background image, a third encoder, which receives an input of a speech audio signal of the person, and extracts a voice feature vector from the speech audio signal, a combining unit, which generates a combined vector of the first image feature vector, the second image feature vector, and the voice feature vector, and a decoder, which reconstructs a speech video of the person using the combined vector as an input.

Type: Grant

Filed: December 15, 2020

Date of Patent: January 21, 2025

Assignee: DEEPBRAIN AI INC.

Inventors: Gyeongsu Chae, Guembuel Hwang
Method and device for generating speech moving image

Patent number: 12205212

Abstract: A device which generates a speech moving image includes a first encoder, a second encoder, a combination unit, and an image reconstruction unit. The first encoder receives a person background image in which a portion related to speech of a person that is a video part of the speech moving image of the person is covered with a mask, extracts an image feature vector from the person background image, and compresses the extracted image feature vector. The second encoder receives a speech audio signal that is an audio part of the speech moving image, extracts a voice feature vector from the speech audio signal, and compresses the extracted voice feature vector. The combination unit generates a combination vector of the compressed image feature vector and the compressed voice feature vector. The image reconstruction unit reconstructs the speech moving image of the person with the combination vector as an input.

Type: Grant

Filed: December 8, 2020

Date of Patent: January 21, 2025

Assignee: DEEPBRAIN AI INC.

Inventors: Gyeongsu Chae, Guembuel Hwang
Learning method for generating lip sync image based on machine learning and lip sync image generation device for performing same

Patent number: 12198713

Abstract: A lip sync image generation device based on machine learning according to a disclosed embodiment includes an image synthesis model, which is an artificial neural network model, and which uses a person background image and an utterance audio signal as an input to generate a lip sync image, and a lip sync discrimination model, which is an artificial neural network model, and which discriminates the degree of match between the lip sync image generated by the image synthesis model and the utterance audio signal input to the image synthesis model.

Type: Grant

Filed: June 17, 2021

Date of Patent: January 14, 2025

Assignee: DEEPBRAIN AI INC.

Inventor: Gyeong Su Chae
Device and method for synthesizing image capable of improving image quality

Patent number: 12190480

Abstract: An image synthesis device according to a disclosed embodiment has one or more processors and a memory which stores one or more programs executed by the one or more processors. The image synthesis device includes a first artificial neural network model provided to learn each of a first task of using a damaged image as an input to output a restored image and a second task of using an original image as an input to output a reconstructed image, and a second artificial neural network model trained to use the reconstructed image output from the first artificial neural network model as an input and improve the image quality of the reconstructed image.

Type: Grant

Filed: June 8, 2021

Date of Patent: January 7, 2025

Assignee: DEEPBRAIN AI INC.

Inventors: Gyeong Su Chae, Guem Buel Hwang
Apparatus and method for generating lip sync image

Patent number: 12190903

Abstract: An apparatus for generating a lip sync image according to a disclosed embodiment has one or more processors and a memory which stores one or more programs executed by the one or more processors. The apparatus includes a first artificial neural network model configured to generate an utterance synthesis image by using a person background image and an utterance audio signal corresponding to the person background image as an input, and generate a silence synthesis image by using only the person background image as an input, and a second artificial neural network model configured to output, from a preset utterance maintenance image and the first artificial neural network model, classification values for the preset utterance maintenance image and the silence synthesis image by using the silence synthesis image as an input.

Type: Grant

Filed: June 3, 2021

Date of Patent: January 7, 2025

Assignee: DEEPBRAIN AI INC.

Inventors: Guem Buel Hwang, Gyeong Su Chae
Method and device for generating speech video using audio signal

Patent number: 12148431

Abstract: A device according to an embodiment has one or more processors and a memory storing one or more programs executable by the one or more processors. The device includes a first encoder configured to receive a person background image corresponding to a video part of a speech video of a person and extract an image feature vector from the person background image, a second encoder configured to receive a speech audio signal corresponding to an audio part of the speech video and extract a voice feature vector from the speech audio signal, a combiner configured to generate a combined vector by combining the image feature vector output from the first encoder and the voice feature vector output from the second encoder, and a decoder configured to reconstruct the speech video of the person using the combined vector as an input.

Type: Grant

Filed: June 19, 2020

Date of Patent: November 19, 2024

Assignee: DEEPBRAIN AI INC.

Inventors: Gyeongsu Chae, Guembuel Hwang, Sungwoo Park, Seyoung Jang
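
The combiner in the abstract above takes an image feature vector and a voice feature vector and produces a single combined vector. A minimal sketch follows, assuming the combination is a simple concatenation; the abstract does not say how the vectors are combined, and `combine_features` and the toy values are hypothetical.

```python
from typing import List, Sequence


def combine_features(image_vec: Sequence[float], voice_vec: Sequence[float]) -> List[float]:
    """Build a combined vector from the two encoder outputs.

    Assumption: concatenation, image features first, voice features second.
    """
    return list(image_vec) + list(voice_vec)


# Toy encoder outputs standing in for learned feature vectors.
image_features = [0.1, 0.2, 0.3]
voice_features = [0.9, 0.8]

combined = combine_features(image_features, voice_features)
print(len(combined))  # 5
```

A decoder would then take `combined` as its input to reconstruct the speech video; that part is omitted here because it depends on the trained model.
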
Learning device and method for generating image

Patent number: 12131441

Abstract: A learning device for generating an image according to an embodiment disclosed is a computing device including one or more processors and a memory storing one or more programs executed by the one or more processors. The learning device includes a first machine learning model that generates a mask for masking a portion related to speech in a person basic image with the person basic image as an input, and generates a person background image by synthesizing the person basic image and the mask.

Type: Grant

Filed: December 1, 2020

Date of Patent: October 29, 2024

Assignee: DEEPBRAIN AI INC.

Inventors: Gyeongsu Chae, Guembuel Hwang
Neural network-based key point training apparatus and method

Patent number: 12112571

Abstract: A neural network-based key point training apparatus according to an embodiment disclosed includes a key point model trained to extract key points from an input image and an image reconstruction model trained to reconstruct the input image with the key points output by the key point model as the input.

Type: Grant

Filed: December 1, 2020

Date of Patent: October 8, 2024

Assignee: DEEPBRAIN AI INC.

Inventors: Gyeongsu Chae, Guembuel Hwang
Kiosk

Patent number: D1090705

Type: Grant

Filed: February 27, 2023

Date of Patent: August 26, 2025

Assignee: DEEPBRAIN AI INC.

Inventor: Boo Won Park
Kiosk

Patent number: D1091528

Type: Grant

Filed: February 27, 2023

Date of Patent: September 2, 2025

Assignee: DEEPBRAIN AI INC.

Inventor: Boo Won Park
