Patents by Inventor Yuanzhen Li

Yuanzhen Li has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Video Diffusion Model

Publication number: 20250238905

Abstract: Provided is a video generation model for performing text-to-video (T2V) or other video generation techniques. The proposed model reduces the computational costs associated with video generation. In particular, unlike traditional T2V methods, the disclosed technology can generate the full temporal duration of a video clip at once, bypassing the need for extensive computation. As one example, a machine-learned denoising diffusion model can simultaneously process a plurality of noisy inputs that correspond to various timestamps spanning the temporal dimension of a video to simultaneously generate synthetic frames for the video that match the timestamps.

Type: Application

Filed: January 22, 2025

Publication date: July 24, 2025

Inventors: Inbar Mosseri, Omer Bar Tal, Hila Chefer-Livshen, Omer Tov, Charles Irwin Herrmann, Rony Paiss, Shiran Elyahu Zada, Ariel Ephrat, Junhwa Hur, Guanghui Liu, Amit Raj, Yuanzhen Li, Michael Rubinstein, Tomer Michaeli, Oliver Wang, Deqing Sun, Tali Dekel
SPHERICAL DESIGNER ELECTROMAGNETIC SURFACE PLASMON OPEN RESONATOR

Publication number: 20250226564

Abstract: A spherical designer electromagnetic surface plasmon open resonator is provided. The open resonator includes a resonator inner core and a resonator outer shell. The resonator inner core is located in an inner center of the resonator outer shell, and the resonator inner core and the resonator outer shell are coaxial. The disclosure provides a device that implements the superscattering function for incident waves in all incident directions and in all polarization directions, and a spherical open resonator is implemented. The scattering cross section of the disclosure can be more than five times greater than that of a metal sphere of the same size, and the operating frequency can be flexibly designed. By utilizing the characteristic that the scattering cross section of the spherical designer electromagnetic surface plasmon resonator is much greater than its own geometric cross section, the electromagnetic superscattering device can be implemented.

Type: Application

Filed: August 30, 2023

Publication date: July 10, 2025

Applicant: ZHEJIANG UNIVERSITY

Inventors: Fei GAO, Yuanzhen LI, Baile ZHANG, Hongsheng CHEN
DIFFUSION-GUIDED THREE-DIMENSIONAL RECONSTRUCTION

Publication number: 20240412458

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for editing images based on decoder-based accumulative score sampling (DASS) losses.

Type: Application

Filed: June 12, 2024

Publication date: December 12, 2024

Inventors: Varun Jampani, Chun-Han Yao, Amit Raj, Wei-Chih Hung, Ming-Hsuan Yang, Michael Rubinstein, Yuanzhen Li
DISPLAY RESPONSIVE COMMUNICATION SYSTEM AND METHOD

Publication number: 20240404154

Abstract: A multimedia communication system and computer-implemented method for transmitting auxiliary display content to an end-user communication device to be rendered on a display device with a special effect to emphasize an image included in the auxiliary display content, comprising analyzing image data included in an auxiliary display content to detect an object image or a background image, determining a special effect based on the analysis of the image data, applying the special effect to the auxiliary display content to modify display properties for the auxiliary display content such that the object image is emphasized or pops out, and sending the auxiliary display content with modified display properties to the end-user communication device. The special effect can comprise a non-customization special effect, a simple foreground special effect or a selective foreground special effect.

Type: Application

Filed: August 14, 2024

Publication date: December 5, 2024

Applicant: Google LLC

Inventors: Mikaël Bonnevie, Yuanzhen Li, Ce Liu
System And Method For Generating Digital Content Including Portions Of Captured Images

Publication number: 20240394840

Abstract: Using artificial intelligence (AI), imagery may be created for content in response to verbal or textual input. The imagery includes an object, such as a product, and a quality of the image is improved using pre-processing techniques before the image is generated and post-processing techniques after the image is generated. The pre-processing may include upscaling the object in the original image, segmenting the object from its background in the captured image, and adding an outline or border stroke to the object. The post-processing techniques may include removing the object from the AI-generated background while keeping shadows and other effects in place, blurring portions of the AI-generated background where the object will be positioned, removing the outline from the object, and re-positioning the object in the AI-generated background with the outline removed.

Type: Application

Filed: May 21, 2024

Publication date: November 28, 2024

Inventors: Elchonon Zeav Lapin, Xibing Yang, Amit Handa, Apurv Suman, Siddhant Mittal, Ashish Dilipchand Bora, Thorne Wolfenbarger, Naga Sreenivas Meruva, Yudong Sun, Rahul Guin, Arie Sharon, Beatriz Alessio Robles Orozco, Yuanzhen Li, Zhongyue Zheng, Mohammad Izadi
Optimizing Generative Machine-Learned Models for Subject-Driven Text-to-3D Generation

Publication number: 20240320912

Abstract: A fractional training process can be performed with a plurality of training images on an instance of a machine-learned generative image model to obtain a partially trained instance of the model. A fractional optimization process can be performed with the partially trained instance on an instance of a machine-learned three-dimensional (3D) implicit representation model to obtain a partially optimized instance of the model. Based on the plurality of training images, pseudo multi-view subject images can be generated with the partially optimized instance of the 3D implicit representation model and a fully trained instance of the generative image model. The partially trained instance of the model can be trained with a set of training data. The partially optimized instance of the machine-learned 3D implicit representation model can be trained with the machine-learned multi-view image model.

Type: Application

Filed: March 20, 2024

Publication date: September 26, 2024

Inventors: Yuanzhen Li, Amit Raj, Varun Jampani, Benjamin Joseph Mildenhall, Benjamin Michael Poole, Jonathan Tilton Barron, Kfir Aberman, Michael Niemeyer, Michael Rubinstein, Nataniel Ruiz Gutierrez, Shiran Elyahu Zada, Srinivas Kaza
FLEXIBLE IMAGE ASPECT RATIO USING MACHINE LEARNING

Publication number: 20240311960

Abstract: To adjust an aspect ratio of an image to match the aspect ratio of a display area for presenting the image, a computing device receives an image having a first aspect ratio, and obtains a second aspect ratio for a display area of a display in which to present the image, where the second aspect ratio is different from the first aspect ratio. The computing device extends the image to include one or more additional features which were not included in the image. Additionally, the computing device automatically crops the extended image around an identified region of interest by selecting a portion of the extended image that has an aspect ratio which matches the second aspect ratio of the display area, and provides the cropped image for presentation within the display area of the display.

Type: Application

Filed: May 20, 2022

Publication date: September 19, 2024

Inventors: Xiao Feng, Yuanzhen LI, Yihui Wang, Omer Gimenez, Han Xu, Mengjie Wang, Huiwen Chang, AJ Maschinot, Dilip Krishnan
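
The cropping step described in the abstract above (selecting a portion of the extended image whose aspect ratio matches the display area, around a region of interest) can be illustrated with a minimal sketch. The function name `crop_to_aspect`, its parameters, and the center-then-clamp strategy are assumptions made for illustration; the abstract does not specify how the window is chosen, and the image-extension step is not modeled here.

```python
def crop_to_aspect(width, height, target_ratio, roi_center):
    """Return (left, top, crop_w, crop_h) for the largest window that has
    the target aspect ratio (width / height), placed as close to the region
    of interest as the image bounds allow. Hypothetical helper."""
    if width / height > target_ratio:
        # Image is wider than the target: keep full height, trim width.
        crop_h = height
        crop_w = round(height * target_ratio)
    else:
        # Image is taller than the target: keep full width, trim height.
        crop_w = width
        crop_h = round(width / target_ratio)
    cx, cy = roi_center
    # Center the window on the region of interest, then clamp it
    # so that it stays inside the image.
    left = min(max(round(cx - crop_w / 2), 0), width - crop_w)
    top = min(max(round(cy - crop_h / 2), 0), height - crop_h)
    return left, top, crop_w, crop_h


# A 1600x900 image cropped to a square display area, with the
# region of interest near the right edge.
window = crop_to_aspect(1600, 900, 1.0, (1500, 450))
```

Clamping keeps the window inside the image even when the region of interest sits near a border, at the cost of the region no longer being exactly centered.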
Display responsive communication system and method

Patent number: 12086913

Abstract: A multimedia communication system and computer-implemented method for transmitting auxiliary display content to an end-user communication device to be rendered on a display device with a special effect to emphasize an image included in the auxiliary display content, comprising a processor and a transmitter. The processor can be arranged to analyze image data included in an auxiliary display content, detect an object image or a background image in the auxiliary display content based on the analysis of the image data, determine a special effect based on the analysis of the image data, and apply the special effect to the auxiliary display content to modify display properties for the auxiliary display content such that the object image is emphasized or pops-out. The transmitter can be arranged to send the auxiliary display content with modified display properties to an end-user communication device.

Type: Grant

Filed: October 6, 2022

Date of Patent: September 10, 2024

Inventors: Mikaël Bonnevie, Yuanzhen Li, Ce Liu
PERSONALIZED TEXT-TO-IMAGE DIFFUSION MODEL

Publication number: 20240296596

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a text-to-image model so that the text-to-image model generates images that each depict a variable instance of an object class when the object class without the unique identifier is provided as a text input, and that generates images that each depict a same subject instance of the object class when the unique identifier is provided as the text input.

Type: Application

Filed: August 23, 2023

Publication date: September 5, 2024

Inventors: Kfir Aberman, Nataniel Ruiz Gutierrez, Michael Rubinstein, Yuanzhen Li, Yael Pritch Knaan, Varun Jampani
Automated product identification within hosted and streamed videos

Patent number: 12026201

Abstract: Automated product identification within hosted and streamed videos is performed based on video content of a video received at an online video platform and text content associated with the video. First embeddings representative of one or more first candidate products are determined based on video content of the video, such as one or more frames selected from within the video. Second embeddings representative of one or more second candidate products are determined based on text content associated with the video, such as a title, description, or transcript of the video. A product candidate index is produced based on the second embeddings. A product identification representative of a product featured in the video is determined based on a comparison of the first embeddings against entries of the product candidate index, such as including by a nearest neighbor search responsive to the comparison. An indication of the product identification is then output at the online video platform.

Type: Grant

Filed: May 31, 2021

Date of Patent: July 2, 2024

Assignee: GOOGLE LLC

Inventor: Yuanzhen Li
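
The comparison described in the abstract above (embeddings from video frames matched against an index built from text-derived embeddings, by nearest neighbor search) might be sketched as follows. Cosine similarity, the brute-force search, the dictionary-based index, and all names here are assumptions for illustration; the abstract does not state which distance measure or index structure is used.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def identify_product(frame_embeddings, candidate_index):
    """Brute-force nearest neighbor search (hypothetical helper).

    frame_embeddings: vectors derived from video frames.
    candidate_index: dict mapping a product name to a vector derived
    from text such as a title, description, or transcript.
    Returns the (name, score) pair with the highest similarity.
    """
    best_name, best_score = None, -1.0
    for frame_vec in frame_embeddings:
        for name, text_vec in candidate_index.items():
            score = cosine_similarity(frame_vec, text_vec)
            if score > best_score:
                best_name, best_score = name, score
    return best_name, best_score


# Toy two-dimensional embeddings; real embeddings would come from
# trained models and have many more dimensions.
index = {"kettle": [1.0, 0.0], "toaster": [0.0, 1.0]}
frames = [[0.9, 0.1], [0.2, 0.3]]
name, score = identify_product(frames, index)
```

Restricting the index to candidates drawn from the text content keeps the search space small, which is why a brute-force scan is adequate for a sketch like this one.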
PRODUCT IDENTIFICATION IN MEDIA ITEMS

Publication number: 20240037145

Abstract: A method includes obtaining first data including a first identifier of a first product determined in association with a content item based on first metadata of the content item. The method further includes obtaining a first confidence value associated with the first product and the content item. The method further includes obtaining second data including a second identifier of the first product and a second confidence value. The method further includes providing the first data and the second data to a trained machine learning model. The method further includes obtaining a third confidence value from the trained machine learning model associated with the first product. The method further includes adjusting second metadata of the content item in view of the third confidence value.

Type: Application

Filed: August 1, 2022

Publication date: February 1, 2024

Inventors: Marco Ziccardi, Min-hsuan Tsai, Wei-Hong Chuang, Rahul Sunil Bhalerao, Ye Xia, Madhuri Shanbhogue, Mojtaba Seyedhosseini, Mike Krainin, Andrei Kapishnikov, Yuanzhen Li
DISPLAY RESPONSIVE COMMUNICATION SYSTEM AND METHOD

Publication number: 20230021805

Abstract: A multimedia communication system and computer-implemented method for transmitting auxiliary display content to an end-user communication device to be rendered on a display device with a special effect to emphasize an image included in the auxiliary display content, comprising a processor and a transmitter. The processor can be arranged to analyze image data included in an auxiliary display content, detect an object image or a background image in the auxiliary display content based on the analysis of the image data, determine a special effect based on the analysis of the image data, and apply the special effect to the auxiliary display content to modify display properties for the auxiliary display content such that the object image is emphasized or pops-out. The transmitter can be arranged to send the auxiliary display content with modified display properties to an end-user communication device.

Type: Application

Filed: October 6, 2022

Publication date: January 26, 2023

Inventors: Mikaël Bonnevie, Yuanzhen Li, Ce Liu
AUTOMATED PRODUCT IDENTIFICATION WITHIN HOSTED AND STREAMED VIDEOS

Publication number: 20220382808

Abstract: Automated product identification within hosted and streamed videos is performed based on video content of a video received at an online video platform and text content associated with the video. First embeddings representative of one or more first candidate products are determined based on video content of the video, such as one or more frames selected from within the video. Second embeddings representative of one or more second candidate products are determined based on text content associated with the video, such as a title, description, or transcript of the video. A product candidate index is produced based on the second embeddings. A product identification representative of a product featured in the video is determined based on a comparison of the first embeddings against entries of the product candidate index, such as including by a nearest neighbor search responsive to the comparison. An indication of the product identification is then output at the online video platform.

Type: Application

Filed: May 31, 2021

Publication date: December 1, 2022

Inventor: Yuanzhen Li
Display responsive communication system and method

Patent number: 11481941

Abstract: A multimedia communication system and computer-implemented method for transmitting auxiliary display content to an end-user communication device to be rendered on a display device with a special effect to emphasize an image included in the auxiliary display content, comprising a processor and a transmitter. The processor can be arranged to analyze image data included in an auxiliary display content, detect an object image or a background image in the auxiliary display content based on the analysis of the image data, determine a special effect based on the analysis of the image data, and apply the special effect to the auxiliary display content to modify display properties for the auxiliary display content such that the object image is emphasized or pops-out. The transmitter can be arranged to send the auxiliary display content with modified display properties to an end-user communication device.

Type: Grant

Filed: September 1, 2020

Date of Patent: October 25, 2022

Assignee: GOOGLE LLC

Inventors: Mikaël Bonnevie, Yuanzhen Li, Ce Liu
DISPLAY RESPONSIVE COMMUNICATION SYSTEM AND METHOD

Publication number: 20220036613

Abstract: A multimedia communication system and computer-implemented method for transmitting auxiliary display content to an end-user communication device to be rendered on a display device with a special effect to emphasize an image included in the auxiliary display content, comprising a processor and a transmitter. The processor can be arranged to analyze image data included in an auxiliary display content, detect an object image or a background image in the auxiliary display content based on the analysis of the image data, determine a special effect based on the analysis of the image data, and apply the special effect to the auxiliary display content to modify display properties for the auxiliary display content such that the object image is emphasized or pops-out. The transmitter can be arranged to send the auxiliary display content with modified display properties to an end-user communication device.

Type: Application

Filed: September 1, 2020

Publication date: February 3, 2022

Inventors: Mikaël Bonnevie, Yuanzhen Li, Ce Liu
Systems and methods for mask generation for an image or video

Patent number: 8437573

Abstract: Methods and systems for mask generation for an image or video are disclosed. In one embodiment, the method for creating a mask associated with an image includes receiving a first input of pixels of the image to be included in the mask. The method further includes receiving a second input of pixels of the image to not be included in the mask. The method also includes creating a classifier to classify each pixel of the image as included in the mask, not included in the mask, or unsure. The method also includes training the classifier using the first input of pixels and the second input of pixels. The method further includes classifying each pixel of the image as included in the mask, not included in the mask, or unsure.

Type: Grant

Filed: November 21, 2007

Date of Patent: May 7, 2013

Assignee: Adobe Systems Incorporated

Inventors: Aseem Agarwala, Yuanzhen Li
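
The three-way labeling described in the abstract above (train a classifier on pixels marked as included and not included, then label every pixel as included, not included, or unsure) can be illustrated with a deliberately simple stand-in. The nearest-mean color classifier, the `margin` parameter, and the function names are assumptions for illustration only; the abstract does not say which kind of classifier the patent uses.

```python
def train_classifier(include_pixels, exclude_pixels):
    """Learn one mean RGB color per class from the two user inputs.
    Hypothetical stand-in for the classifier named in the abstract."""
    def mean(pixels):
        n = len(pixels)
        return tuple(sum(p[i] for p in pixels) / n for i in range(3))
    return mean(include_pixels), mean(exclude_pixels)


def classify_pixel(model, pixel, margin=0.2):
    """Label a pixel 'included', 'not included', or 'unsure'."""
    fg_mean, bg_mean = model
    d_fg = sum((a - b) ** 2 for a, b in zip(pixel, fg_mean)) ** 0.5
    d_bg = sum((a - b) ** 2 for a, b in zip(pixel, bg_mean)) ** 0.5
    total = d_fg + d_bg
    if total == 0:
        return "unsure"
    # Relative preference in [-1, 1]; positive values mean the pixel
    # is closer to the mean color of the included pixels.
    preference = (d_bg - d_fg) / total
    if preference > margin:
        return "included"
    if preference < -margin:
        return "not included"
    return "unsure"


# First input: reddish pixels to include. Second input: bluish pixels to exclude.
model = train_classifier([(250, 0, 0), (230, 10, 10)],
                         [(0, 0, 250), (10, 10, 230)])
labels = [classify_pixel(model, p)
          for p in [(255, 0, 0), (0, 0, 255), (122, 5, 122)]]
```

The explicit "unsure" band is what distinguishes this from a plain binary mask: pixels whose evidence is weak are deferred rather than forced into one class.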
Method and apparatus for acquiring HDR flash images

Patent number: 7454136

Abstract: A method generates a high dynamic range image by first acquiring a set of images of a scene illuminated by different lighting conditions. The set of images are then combined to generate a high dynamic range image.

Type: Grant

Filed: July 28, 2005

Date of Patent: November 18, 2008

Assignee: Mitsubishi Electric Research Laboratories, Inc.

Inventors: Ramesh Raskar, Amit Agrawal, Shree K. Nayar, Yuanzhen Li
Method for estimating camera settings adaptively

Patent number: 7403707

Abstract: A camera is configured to adaptively determine camera settings. The camera includes a plurality of sensor elements configured to acquire a current image of a scene according to a current set of camera settings. A number of sensor elements having a set of desirable properties is measured. Then, a next set of camera settings that maximize an overall number of sensor elements having the set of desirable properties is determined to acquire a next better image.

Type: Grant

Filed: July 28, 2005

Date of Patent: July 22, 2008

Assignee: Mitsubishi Electric Research Laboratories, Inc.

Inventors: Ramesh Raskar, Amit Agrawal, Shree K. Nayar, Yuanzhen Li
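
The selection rule in the abstract above (count the sensor elements that have desirable properties, then pick the next settings that maximize that count) might look like the following sketch. Treating "desirable" as "neither underexposed nor saturated", the linear clipped sensor response, the single exposure parameter, and the thresholds are all assumptions for illustration; the abstract does not define these.

```python
def well_exposed_count(radiances, exposure, low=0.05, high=0.95):
    """Count simulated sensor elements whose value lies in [low, high].

    Assumes a linear sensor response clipped at 1.0 (saturation).
    """
    count = 0
    for r in radiances:
        value = min(r * exposure, 1.0)
        if low <= value <= high:
            count += 1
    return count


def next_exposure(radiances, candidate_exposures):
    """Pick the candidate exposure with the most well-exposed elements."""
    return max(candidate_exposures,
               key=lambda e: well_exposed_count(radiances, e))


# Hypothetical scene radiances spanning a wide dynamic range.
scene = [0.02, 0.04, 0.06, 0.4, 20.0]
best = next_exposure(scene, [0.1, 1.0, 10.0])
```

In this toy scene no single exposure captures every element well, which is the situation in which an adaptive, multi-image acquisition becomes useful.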
Method and apparatus for acquiring HDR flash images

Publication number: 20070025717

Abstract: A method generates a high dynamic range image by first acquiring a set of images of a scene illuminated by different lighting conditions. The set of images are then combined to generate a high dynamic range image.

Type: Application

Filed: July 28, 2005

Publication date: February 1, 2007

Inventors: Ramesh Raskar, Amit Agrawal, Shree Nayar, Yuanzhen Li
Method for estimating camera settings adaptively

Publication number: 20070025720

Abstract: A camera is configured to adaptively determine camera settings. The camera includes a plurality of sensor elements configured to acquire a current image of a scene according to a current set of camera settings. A number of sensor elements having a set of desirable properties is measured. Then, a next set of camera settings that maximize an overall number of sensor elements having the set of desirable properties is determined to acquire a next better image.

Type: Application

Filed: July 28, 2005

Publication date: February 1, 2007

Inventors: Ramesh Raskar, Amit Agrawal, Shree Nayar, Yuanzhen Li