Patents by Inventor Inbar Mosseri

Inbar Mosseri has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Audio-visual speech separation

Patent number: 11894014

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: obtaining, for each frame in a stream of frames from a video in which faces of one or more speakers have been detected, a respective per-frame face embedding of the face of each speaker; processing, for each speaker, the per-frame face embeddings of the face of the speaker to generate visual features for the face of the speaker; obtaining a spectrogram of an audio soundtrack for the video; processing the spectrogram to generate an audio embedding for the audio soundtrack; combining the visual features for the one or more speakers and the audio embedding for the audio soundtrack to generate an audio-visual embedding for the video; determining a respective spectrogram mask for each of the one or more speakers; and determining a respective isolated speech spectrogram for each speaker.

Type: Grant

Filed: September 22, 2022

Date of Patent: February 6, 2024

Assignee: Google LLC

Inventors: Inbar Mosseri, Michael Rubinstein, Ariel Ephrat, William Freeman, Oran Lang, Kevin William Wilson, Tali Dekel, Avinatan Hassidim
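The last two steps of the abstract above (a spectrogram mask per speaker, then an isolated speech spectrogram per speaker) can be illustrated with a minimal sketch. It assumes real-valued multiplicative masks applied element-wise to a magnitude spectrogram; the abstract does not say how the mask is applied, and the function name `apply_masks`, the speaker keys, and the toy values are hypothetical.

```python
from typing import Dict, List

# A spectrogram is modelled as rows of time frames, each holding
# per-frequency magnitudes (an assumption made for this sketch).
Spectrogram = List[List[float]]


def apply_masks(mixture: Spectrogram,
                masks: Dict[str, Spectrogram]) -> Dict[str, Spectrogram]:
    """Return one isolated spectrogram per speaker via element-wise masking."""
    isolated: Dict[str, Spectrogram] = {}
    for speaker, mask in masks.items():
        isolated[speaker] = [
            [m * x for m, x in zip(mask_row, mix_row)]
            for mask_row, mix_row in zip(mask, mixture)
        ]
    return isolated


# Toy 2x2 mixture and two hypothetical per-speaker masks.
mixture = [[2.0, 4.0], [6.0, 8.0]]
masks = {
    "speaker_a": [[1.0, 0.5], [0.0, 0.25]],
    "speaker_b": [[0.0, 0.5], [1.0, 0.75]],
}
isolated = apply_masks(mixture, masks)
```

In this toy case the two masks sum to one in every cell, so the isolated spectrograms add back up to the mixture; nothing in the abstract requires that property.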

FACIAL IMAGE EDITING AND ENHANCEMENT USING A PERSONALIZED PRIOR

Publication number: 20230325998

Abstract: Systems and methods for identifying a personalized prior within a generative model's latent vector space based on a set of images of a given subject. In some examples, the present technology may further include using the personalized prior to confine the inputs of a generative model to a latent vector space associated with the given subject, such that when the model is tasked with editing an image of the subject (e.g., to perform inpainting to fill in masked areas, improve resolution, or deblur the image), the subject's identifying features will be reflected in the images the model produces.

Type: Application

Filed: June 14, 2023

Publication date: October 12, 2023

Inventors: Kfir Aberman, Yotam Nitzan, Orly Liba, Yael Pritch Knaan, Qiurui He, Inbar Mosseri, Yossi Gandelsman, Michal Yarom

AUDIO-VISUAL HEARING AID

Publication number: 20230267942

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: receiving, by a user device, a first indication of one or more first speakers visible in a current view recorded by a camera of the user device; in response, generating a respective isolated speech signal for each of the one or more first speakers that isolates speech of the first speaker in the current view and sending the isolated speech signals for each of the one or more first speakers to a listening device operatively coupled to the user device; receiving, by the user device, a second indication of one or more second speakers visible in the current view recorded by the camera of the user device; and, in response, generating and sending a respective isolated speech signal for each of the one or more second speakers to the listening device.

Type: Application

Filed: October 1, 2020

Publication date: August 24, 2023

Inventors: Anatoly Efros, Noam Etzion-Rosenberg, Tal Remez, Oran Lang, Inbar Mosseri, Israel Or Weinstein, Benjamin Schlesinger, Michael Rubinstein, Ariel Ephrat, Yukun Zhu, Stella Laurenzo, Amit Pitaru, Yossi Matias

Facial image editing and enhancement using a personalized prior

Patent number: 11721007

Abstract: Systems and methods for identifying a personalized prior within a generative model's latent vector space based on a set of images of a given subject. In some examples, the present technology may further include using the personalized prior to confine the inputs of a generative model to a latent vector space associated with the given subject, such that when the model is tasked with editing an image of the subject (e.g., to perform inpainting to fill in masked areas, improve resolution, or deblur the image), the subject's identifying features will be reflected in the images the model produces.

Type: Grant

Filed: November 8, 2022

Date of Patent: August 8, 2023

Assignee: Google LLC

Inventors: Kfir Aberman, Yotam Nitzan, Orly Liba, Yael Pritch Knaan, Qiurui He, Inbar Mosseri, Yossi Gandelsman, Michal Yarom

FACIAL IMAGE EDITING AND ENHANCEMENT USING A PERSONALIZED PRIOR

Publication number: 20230222636

Abstract: Systems and methods for identifying a personalized prior within a generative model's latent vector space based on a set of images of a given subject. In some examples, the present technology may further include using the personalized prior to confine the inputs of a generative model to a latent vector space associated with the given subject, such that when the model is tasked with editing an image of the subject (e.g., to perform inpainting to fill in masked areas, improve resolution, or deblur the image), the subject's identifying features will be reflected in the images the model produces.

Type: Application

Filed: November 8, 2022

Publication date: July 13, 2023

Inventors: Kfir Aberman, Yotam Nitzan, Orly Liba, Yael Pritch Knaan, Qiurui He, Inbar Mosseri, Yossi Gandelsman, Michal Yarom

AUDIO-VISUAL SPEECH SEPARATION

Publication number: 20230122905

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: obtaining, for each frame in a stream of frames from a video in which faces of one or more speakers have been detected, a respective per-frame face embedding of the face of each speaker; processing, for each speaker, the per-frame face embeddings of the face of the speaker to generate visual features for the face of the speaker; obtaining a spectrogram of an audio soundtrack for the video; processing the spectrogram to generate an audio embedding for the audio soundtrack; combining the visual features for the one or more speakers and the audio embedding for the audio soundtrack to generate an audio-visual embedding for the video; determining a respective spectrogram mask for each of the one or more speakers; and determining a respective isolated speech spectrogram for each speaker.

Type: Application

Filed: September 22, 2022

Publication date: April 20, 2023

Inventors: Inbar Mosseri, Michael Rubinstein, Ariel Ephrat, William Freeman, Oran Lang, Kevin William Wilson, Tali Dekel, Avinatan Hassidim

Deep Saliency Prior

Publication number: 20230015117

Abstract: Techniques for tuning an image editing operator for reducing a distractor in raw image data are presented herein. The image editing operator can access the raw image data and a mask. The mask can indicate a region of interest associated with the raw image data. The image editing operator can process the raw image data and the mask to generate processed image data. Additionally, a trained saliency model can process at least the processed image data within the region of interest to generate a saliency map that provides saliency values. Moreover, a saliency loss function can compare the saliency values provided by the saliency map for the processed image data within the region of interest to one or more target saliency values. Subsequently, the one or more parameter values of the image editing operator can be modified based at least in part on the saliency loss function.

Type: Application

Filed: July 1, 2022

Publication date: January 19, 2023

Inventors: Kfir Aberman, David Edward Jacobs, Kai Jochen Kohlhoff, Michael Rubinstein, Yossi Gandelsman, Junfeng He, Inbar Mosseri, Yael Pritch Knaan
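The saliency loss described in the abstract above compares saliency values inside the region of interest with target values. A minimal sketch of one such comparison follows, assuming a mean squared difference and a single scalar target; the abstract only says the values are compared, so the squared-error form, the function name `saliency_loss`, and the toy numbers are assumptions.

```python
from typing import List


def saliency_loss(saliency_map: List[List[float]],
                  mask: List[List[int]],
                  target: float) -> float:
    """Mean squared gap between saliency and target inside the masked region."""
    diffs = [
        (s - target) ** 2
        for s_row, m_row in zip(saliency_map, mask)
        for s, m in zip(s_row, m_row)
        if m  # only pixels inside the region of interest contribute
    ]
    # An empty region contributes no loss.
    return sum(diffs) / len(diffs) if diffs else 0.0


# Toy saliency map for a processed image; the mask marks the top row
# as the region of interest (the distractor to be suppressed).
saliency_map = [[0.5, 1.0], [0.0, 0.25]]
mask = [[1, 1], [0, 0]]
loss = saliency_loss(saliency_map, mask, target=0.0)
```

In a tuning loop of the kind the abstract describes, a value like `loss` would drive the update of the image editing operator's parameters; that update step is not shown here.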

Audio-visual speech separation

Patent number: 11456005

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: obtaining, for each frame in a stream of frames from a video in which faces of one or more speakers have been detected, a respective per-frame face embedding of the face of each speaker; processing, for each speaker, the per-frame face embeddings of the face of the speaker to generate visual features for the face of the speaker; obtaining a spectrogram of an audio soundtrack for the video; processing the spectrogram to generate an audio embedding for the audio soundtrack; combining the visual features for the one or more speakers and the audio embedding for the audio soundtrack to generate an audio-visual embedding for the video; determining a respective spectrogram mask for each of the one or more speakers; and determining a respective isolated speech spectrogram for each speaker.

Type: Grant

Filed: November 21, 2018

Date of Patent: September 27, 2022

Assignee: Google LLC

Inventors: Inbar Mosseri, Michael Rubinstein, Ariel Ephrat, William Freeman, Oran Lang, Kevin William Wilson, Tali Dekel, Avinatan Hassidim

Generating cartoon images from photos

Patent number: 10853987

Abstract: A system and method for generating cartoon images from photos are described. The method includes receiving an image of a user, determining a template for a cartoon avatar, determining an attribute needed for the template, processing the image with a classifier trained for classifying the attribute included in the image, determining a label generated by the classifier for the attribute, determining a cartoon asset for the attribute based on the label, and rendering the cartoon avatar personifying the user using the cartoon asset.

Type: Grant

Filed: December 3, 2019

Date of Patent: December 1, 2020

Assignee: Google LLC

Inventors: Aaron Sarna, Dilip Krishnan, Forrester Cole, Inbar Mosseri
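The method in the abstract above reads as a pipeline: template, required attributes, classifier label per attribute, asset per label, rendered avatar. A minimal sketch of that flow follows. The attribute names, the asset catalog, the stub classifiers, and the function `build_avatar` are all hypothetical, and rendering is reduced to returning the chosen asset names.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical template: the attributes this avatar style needs.
TEMPLATE_ATTRIBUTES: List[str] = ["hair_color", "glasses"]

# Hypothetical asset catalog keyed by (attribute, classifier label).
ASSETS: Dict[Tuple[str, str], str] = {
    ("hair_color", "brown"): "hair_brown.svg",
    ("hair_color", "black"): "hair_black.svg",
    ("glasses", "yes"): "glasses_round.svg",
    ("glasses", "no"): "no_glasses.svg",
}


def build_avatar(image: bytes,
                 classifiers: Dict[str, Callable[[bytes], str]]) -> Dict[str, str]:
    """Pick one cartoon asset per template attribute from classifier labels."""
    avatar: Dict[str, str] = {}
    for attribute in TEMPLATE_ATTRIBUTES:
        label = classifiers[attribute](image)        # label for this attribute
        avatar[attribute] = ASSETS[(attribute, label)]  # asset for that label
    return avatar


# Stub classifiers stand in for trained models in this sketch.
stub_classifiers = {
    "hair_color": lambda image: "brown",
    "glasses": lambda image: "yes",
}
avatar = build_avatar(b"raw image bytes", stub_classifiers)
```

Keeping the label-to-asset mapping in a lookup table separates the classifiers from the artwork, so either side could change without touching the other; whether the patented system is organized this way is not stated in the abstract.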

AUDIO-VISUAL SPEECH SEPARATION

Publication number: 20200335121

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: obtaining, for each frame in a stream of frames from a video in which faces of one or more speakers have been detected, a respective per-frame face embedding of the face of each speaker; processing, for each speaker, the per-frame face embeddings of the face of the speaker to generate visual features for the face of the speaker; obtaining a spectrogram of an audio soundtrack for the video; processing the spectrogram to generate an audio embedding for the audio soundtrack; combining the visual features for the one or more speakers and the audio embedding for the audio soundtrack to generate an audio-visual embedding for the video; determining a respective spectrogram mask for each of the one or more speakers; and determining a respective isolated speech spectrogram for each speaker.

Type: Application

Filed: November 21, 2018

Publication date: October 22, 2020

Inventors: Inbar Mosseri, Michael Rubinstein, Ariel Ephrat, William Freeman, Oran Lang, Kevin William Wilson, Tali Dekel, Avinatan Hassidim

GENERATING CARTOON IMAGES FROM PHOTOS

Publication number: 20200175740

Abstract: A system and method for generating cartoon images from photos are described. The method includes receiving an image of a user, determining a template for a cartoon avatar, determining an attribute needed for the template, processing the image with a classifier trained for classifying the attribute included in the image, determining a label generated by the classifier for the attribute, determining a cartoon asset for the attribute based on the label, and rendering the cartoon avatar personifying the user using the cartoon asset.

Type: Application

Filed: December 3, 2019

Publication date: June 4, 2020

Applicant: Google LLC

Inventors: Aaron Sarna, Dilip Krishnan, Forrester Cole, Inbar Mosseri

Generating cartoon images from photos

Patent number: 10529115

Abstract: A system and method for generating cartoon images from photos are described. The method includes receiving an image of a user, determining a template for a cartoon avatar, determining an attribute needed for the template, processing the image with a classifier trained for classifying the attribute included in the image, determining a label generated by the classifier for the attribute, determining a cartoon asset for the attribute based on the label, and rendering the cartoon avatar personifying the user using the cartoon asset.

Type: Grant

Filed: March 14, 2018

Date of Patent: January 7, 2020

Assignee: Google LLC

Inventors: Aaron Sarna, Dilip Krishnan, Forrester Cole, Inbar Mosseri

GENERATING CARTOON IMAGES FROM PHOTOS

Publication number: 20180268595

Abstract: A system and method for generating cartoon images from photos are described. The method includes receiving an image of a user, determining a template for a cartoon avatar, determining an attribute needed for the template, processing the image with a classifier trained for classifying the attribute included in the image, determining a label generated by the classifier for the attribute, determining a cartoon asset for the attribute based on the label, and rendering the cartoon avatar personifying the user using the cartoon asset.

Type: Application

Filed: March 14, 2018

Publication date: September 20, 2018

Inventors: Aaron Sarna, Dilip Krishnan, Forrester Cole, Inbar Mosseri

SYSTEM AND METHOD FOR INCREMENTAL BACKUP STORAGE

Publication number: 20110196840

Abstract: A system and method for incremental backup storage. The systems and methods include one or more incremental backup servers comprising a current version store configured to store data corresponding to a most recent backup and a historical version store corresponding to a prior backup to said most recent backup. One or more incremental backup clients are configured to communicate with one or more incremental backup servers. The one or more incremental backup clients each comprise a localized data set. The incremental backup server is configured to back up the localized data set of the one or more incremental backup clients.

Type: Application

Filed: February 8, 2010

Publication date: August 11, 2011

Inventors: Yoram Barzilai, Inbar Mosseri, Paul N. Weinberg
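The split between a current version store and a historical version store in the abstract above can be sketched as follows. This is a minimal in-memory illustration, assuming the client's localized data set is a mapping from path to content and that only changed items are moved into history; the class name `IncrementalBackupServer` and its method are hypothetical, and deletions are not handled.

```python
from typing import Dict, List


class IncrementalBackupServer:
    """Toy server with a current version store and a historical version store."""

    def __init__(self) -> None:
        self.current: Dict[str, str] = {}           # most recent backup
        self.historical: Dict[str, List[str]] = {}  # prior versions per path

    def back_up(self, dataset: Dict[str, str]) -> List[str]:
        """Store new or changed items and return the paths that were written."""
        changed: List[str] = []
        for path, content in dataset.items():
            if path in self.current and self.current[path] == content:
                continue  # unchanged since the most recent backup
            if path in self.current:
                # Keep the superseded version before overwriting it.
                self.historical.setdefault(path, []).append(self.current[path])
            self.current[path] = content
            changed.append(path)
        return changed


server = IncrementalBackupServer()
first = server.back_up({"a.txt": "v1", "b.txt": "v1"})
second = server.back_up({"a.txt": "v2", "b.txt": "v1"})
```

After the second call only `a.txt` is rewritten, and its earlier content sits in the historical store while `b.txt` is left untouched.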