Patents Assigned to DEEPBRAIN AI INC.
-
Patent number: 12651394Abstract: An apparatus for generating a speech synthesis image includes a first global geometric transformation predictor to receive a source image and a target image including the same person and predict a global geometric transformation for a global motion of the person between the source image and the target image, a local geometric transformation predictor to predict a local geometric transformation for a local motion of the person based on preset input data, a geometric transformation combiner to calculate a full motion geometric transformation for a full motion of the person by combining the global geometric transformation and the local geometric transformation, an optical flow predictor to calculate an optical flow between the source image and the target image based on the source image and the full motion geometric transformation, and an image generator to reconstruct the target image based on the source image and the optical flow.Type: GrantFiled: March 15, 2022Date of Patent: June 9, 2026Assignee: DEEPBRAIN AI INC.Inventors: Gyeong Su Chae, Guem Buel Hwang
-
Patent number: 12573119Abstract: An apparatus according to an embodiment is a speech synthesis image generating apparatus based on machine learning. The apparatus includes a first global geometric transformation predictor to receive a source image and a target image including the same person, and predict a global geometric transformation for a global motion of the person between the source image and the target image based on the source image and the target image, a local geometric transformation predictor to predict a local geometric transformation for a local motion of the person between the source image and the target image based on preset input data, a geometric transformation combiner to calculate a full motion geometric transformation for a full motion of the person by combining the global geometric transformation and the local geometric transformation, and an image generator to reconstruct the target image based on the source image and the full motion geometric transformation.Type: GrantFiled: March 15, 2022Date of Patent: March 10, 2026Assignee: DEEPBRAIN AI INC.Inventors: Gyeong Su Chae, Guem Buel Hwang
-
Patent number: 12536727Abstract: A computing device according to an embodiment disclosed includes one or more processors and a memory storing one or more programs executed by the one or more processors, and a standby state image generating module configured to generate a standby state image in which a person is in a standby state, an interpolation image generating module configured to generate an interpolation image set for interpolation between the standby state image and a pre-stored speech preparation image, and an image playback module configured to generate a connection image for connecting the standby state image and a speech state image based on the interpolation image set when the speech state image is generated.Type: GrantFiled: July 9, 2021Date of Patent: January 27, 2026Assignee: DEEPBRAIN AI INC.Inventor: Doo Hyun Kim
-
Patent number: 12437773Abstract: An apparatus for generating a speech synthesis image according to a disclosed embodiment is an apparatus for generating a speech synthesis image based on machine learning, the apparatus including a first global geometric transformation predictor configured to be trained to receive each of a source image and a target image including the same person, and predict a global geometric transformation for a global motion of the person between the source image and the target image based on the source image and the target image, a local feature tensor predictor configured to be trained to predict a feature tensor for a local motion of the person based on input target image-related information, and an image generator configured to be trained to reconstruct the target image based on the global geometric transformation, the source image, and the feature tensor for the local motion.Type: GrantFiled: March 15, 2022Date of Patent: October 7, 2025Assignee: DEEPBRAIN AI INC.Inventors: Gyeong Su Chae, Guem Buel Hwang
-
Patent number: 12367892Abstract: In a method of providing a speech video according to an embodiment, a standby state video in which a person in a video is in a standby state is reproduced, a speech state video in which a person in a video is in a speech state based on a source of speech content is generated, the standby state video being reproduced to a reference frame of the standby state video being reproduced based on a back motion image is returned, and a synthesized speech video by synthesizing the returned reference frame and the speech state video is generated.Type: GrantFiled: February 14, 2024Date of Patent: July 22, 2025Assignee: DEEPBRAIN AI INC.Inventor: Doohyun Kim
-
Patent number: 12347197Abstract: A speech video generation device according to an embodiment includes a first encoder, which receives an input of a person background image that is a video part in a speech video of a predetermined person, and extracts an image feature vector from the person background image, a second encoder, which receives an input of a speech audio signal that is an audio part in the speech video, and extracts a voice feature vector from the speech audio signal, a combining unit, which generates a combined vector by combining the image feature vector output from the first encoder and the voice feature vector output from the second encoder, a first decoder, which reconstructs the speech video of the person using the combined vector as an input, and a second decoder, which predicts a landmark of the speech video using the combined vector as an input.Type: GrantFiled: December 15, 2020Date of Patent: July 1, 2025Assignee: DEEPBRAIN AI INC.Inventor: Gyeongsu Chae
-
Patent number: 12322016Abstract: An apparatus for generating a speech synthesis image according to a disclosed embodiment is an apparatus for generating a speech synthesis image based on machine learning, the apparatus including a first global geometric transformation predictor configured to be trained to receive each of a source image and a target image including the same person, and predict a global geometric transformation for a global motion of the person between the source image and the target image based on the source image and the target image, a local feature tensor predictor configured to be trained to predict a feature tensor for a local motion of the person based on preset input data, and an image generator configured to be trained to reconstruct the target image based on the global geometric transformation, the source image, and the feature tensor for the local motion.Type: GrantFiled: March 15, 2022Date of Patent: June 3, 2025Assignee: DEEPBRAIN AI INC.Inventors: Gyeong Su Chae, Guem Buel Hwang
-
Patent number: 12243550Abstract: A speech image providing method according to an embodiment includes generating a standby state image in which a person is in a standby state, generating a plurality of back-motion images at a preset frame interval from the standby state image for image interpolation between a preset reference frame of the standby state image, generating a speech state image in which a person is in a speech state based on a source of speech content, returning the standby state image being played to the reference frame based on the plurality of back-motion images of the standby state image, based on a point of time when the generating of the speech state image is completed, and generating a synthetic speech image in combination with frames of the speech state image from the reference frame.Type: GrantFiled: July 9, 2021Date of Patent: March 4, 2025Assignee: DEEPBRAIN AI INC.Inventor: Doo Hyun Kim
-
Patent number: 12236943Abstract: An apparatus for generating a lip sync image according to disclosed embodiment has one or more processors and a memory which stores one or more programs executed by the one or more processors. The apparatus includes a first artificial neural network model configured to generate an utterance match synthesis image by using a person background image and an utterance match audio signal corresponding to the person background image as an input, and generate an utterance mismatch synthesis image by using the person background image and an utterance mismatch audio signal not corresponding to the person background image as an input, and a second artificial neural network model configured to output classification values for an input pair in which an image and a voice match and an input pair in which an image and a voice do not match by using the input pairs as an input.Type: GrantFiled: June 8, 2021Date of Patent: February 25, 2025Assignee: DEEPBRAIN AI INC.Inventors: Guem Buel Hwang, Gyeong Su Chae
-
Patent number: 12236558Abstract: An image synthesis device according to a disclosed embodiment has one or more processors and a memory which stores one or more programs executed by the one or more processors. The image synthesis device includes a first artificial neural network provided to learn each of a first task of using a damaged image as an input to output a restored image and a second task of using an original image as an input to output a reconstructed image, and a second artificial neural network connected to an output layer of the first artificial neural network, and trained to use the reconstructed image output from the first artificial neural network as an input and improve the image quality of the reconstructed image.Type: GrantFiled: June 8, 2021Date of Patent: February 25, 2025Assignee: DEEPBRAIN AI INC.Inventors: Gyeong Su Chae, Guem Buel Hwang
-
Patent number: 12205342Abstract: A speech video generation device according to an embodiment includes a first encoder that receives an input of a first person background image of a predetermined person partially hidden by a first mask, and extracts a first image feature vector from the first person background image, a second encoder, which receives an input of a second person background image of the person partially hidden by a second mask, and extracts a second image feature vector from the second person background image, a third encoder, which receives an input of a speech audio signal of the person, and extracts a voice feature vector from the speech audio signal, a combining unit, which generates a combined vector of the first image feature vector, the second image feature vector, and the voice feature vector, and a decoder, which reconstructs a speech video of the person using the combined vector as an input.Type: GrantFiled: December 15, 2020Date of Patent: January 21, 2025Assignee: DEEPBRAIN AI INC.Inventors: Gyeongsu Chae, Guembuel Hwang
-
Patent number: 12205212Abstract: A device which generates a speech moving image includes a first encoder, a second encoder, a combination unit, and an image reconstruction unit. The first encoder receives a person background image in which a portion related to speech of a person that is a video part of the speech moving image of the person is covered with a mask, extracts an image feature vector from the person background image, and compresses the extracted image feature vector. The second encoder receives a speech audio signal that is an audio part of the speech moving image, extracts a voice feature vector from the speech audio signal, and compresses the extracted voice feature vector. The combination unit generates a combination vector of the compressed image feature vector and the compressed voice feature vector. The image reconstruction unit reconstructs the speech moving image of the person with the combination as an input.Type: GrantFiled: December 8, 2020Date of Patent: January 21, 2025Assignee: DEEPBRAIN AI INC.Inventors: Gyeongsu Chae, Guembuel Hwang
-
Patent number: 12198713Abstract: A lip sync image generation device based on machine learning according to a disclosed embodiment includes an image synthesis model, which is an artificial neural network model, and which uses a person background image and an utterance audio signal as an input to generate a lip sync image, and a lip sync discrimination model, which is an artificial neural network model, and which discriminates the degree of match between the lip sync image generated by the image synthesis model and the utterance audio signal input to the image synthesis model.Type: GrantFiled: June 17, 2021Date of Patent: January 14, 2025Assignee: DEEPBRAIN AI INC.Inventor: Gyeong Su Chae
-
Patent number: 12190480Abstract: An image synthesis device according to a disclosed embodiment is an image synthesis device has one or more processors and a memory which stores one or more programs executed by the one or more processors. The image synthesis device includes a first artificial neural network model provided to learn each of a first task of using a damaged image as an input to output a restored image and a second task of using an original image as an input to output a reconstructed image, and a second artificial neural network model trained to use the reconstructed image output from the first artificial neural network model as an input and improve the image quality of the reconstructed image.Type: GrantFiled: June 8, 2021Date of Patent: January 7, 2025Assignee: DEEPBRAIN AI INC.Inventors: Gyeong Su Chae, Guem Buel Hwang
-
Patent number: 12190903Abstract: An apparatus for generating a lip sync image according to a disclosed embodiment has one or more processors and a memory which stores one or more programs executed by the one or more processors. The apparatus includes a first artificial neural network model configured to generate an utterance synthesis image by using a person background image and an utterance audio signal corresponding to the person background image as an input, and generate a silence synthesis image by using only the person background image as an input, and a second artificial neural network model configured to output, from a preset utterance maintenance image and the first artificial neural network model, classification values for the preset utterance maintenance image and the silence synthesis image by using the silence synthesis image as an input.Type: GrantFiled: June 3, 2021Date of Patent: January 7, 2025Assignee: DEEPBRAIN AI INC.Inventors: Guem Buel Hwang, Gyeong Su Chae
-
Patent number: 12148431Abstract: A device according to an embodiment has one or more processors and a memory storing one or more programs executable by the one or more processors. The device includes a first encoder configured to receive a person background image corresponding to a video part of a speech video of a person and extract an image feature vector from the person background image, a second encoder configured to receive a speech audio signal corresponding to an audio part of the speech video and extract a voice feature vector from the speech audio signal, a combiner configured to generate a combined vector by combining the image feature vector output from the first encoder and the voice feature vector output from the second encoder, and a decoder configured to reconstruct the speech video of the person using the combined vector as an input.Type: GrantFiled: June 19, 2020Date of Patent: November 19, 2024Assignee: DEEPBRAIN AI INC.Inventors: Gyeongsu Chae, Guembuel Hwang, Sungwoo Park, Seyoung Jang
-
Patent number: 12131441Abstract: A learning device for generating an image according to an embodiment disclosed is a computing device including one or more processors and a memory storing one or more programs executed by the one or more processors. The learning device includes a first machine learning model that generates a mask for masking a portion related to speech in a person basic image with the person basic image as an input, and generates a person background image by synthesizing the person basic image and the mask.Type: GrantFiled: December 1, 2020Date of Patent: October 29, 2024Assignee: DEEPBRAIN AI INC.Inventors: Gyeongsu Chae, Guembuel Hwang
-
Patent number: 12112571Abstract: A neural network-based key point training apparatus according to an embodiment disclosed includes a key point model trained to extract key points from an input image and an image reconstruction model trained to reconstruct the input image with the key points output by the key point model as the input.Type: GrantFiled: December 1, 2020Date of Patent: October 8, 2024Assignee: DEEPBRAIN AI INC.Inventors: Gyeongsu Chae, Guembuel Hwang
-
Patent number: D1090705Type: GrantFiled: February 27, 2023Date of Patent: August 26, 2025Assignee: DEEPBRAIN AI INC.Inventor: Boo Won Park
-
Patent number: D1091528Type: GrantFiled: February 27, 2023Date of Patent: September 2, 2025Assignee: DEEPBRAIN AI INC.Inventor: Boo Won Park