Patents by Inventor Yuanzhen Li
Yuanzhen Li has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20250238905
Abstract: Provided is a video generation model for performing text-to-video (T2V) or other video generation techniques. The proposed model reduces the computational costs associated with video generation. In particular, unlike traditional T2V methods, the disclosed technology can generate the full temporal duration of a video clip at once, bypassing the need for extensive computation. As one example, a machine-learned denoising diffusion model can simultaneously process a plurality of noisy inputs that correspond to various timestamps spanning the temporal dimension of a video to simultaneously generate synthetic frames for the video that match the timestamps.
Type: Application
Filed: January 22, 2025
Publication date: July 24, 2025
Inventors: Inbar Mosseri, Omer Bar Tal, Hila Chefer-Livshen, Omer Tov, Charles Irwin Herrmann, Rony Paiss, Shiran Elyahu Zada, Ariel Ephrat, Junhwa Hur, Guanghui Liu, Amit Raj, Yuanzhen Li, Michael Rubinstein, Tomer Michaeli, Oliver Wang, Deqing Sun, Tali Dekel
-
Publication number: 20250226564
Abstract: A spherical designer electromagnetic surface plasmon open resonator is provided. The open resonator includes a resonator inner core and a resonator outer shell. The resonator inner core is located in an inner center of the resonator outer shell, and the resonator inner core and the resonator outer shell are coaxial. The disclosure provides a device that implements the superscattering function for incident waves in all incident directions and in all polarization directions, and a spherical open resonator is implemented. The scattering cross section of the disclosure can be more than five times greater than that of a metal sphere of the same size, and the operating frequency can be flexibly designed. By utilizing the characteristic that the scattering cross section of the spherical designer electromagnetic surface plasmon resonator is much greater than its own geometric cross section, the electromagnetic super-scattering device can be implemented.
Type: Application
Filed: August 30, 2023
Publication date: July 10, 2025
Applicant: ZHEJIANG UNIVERSITY
Inventors: Fei GAO, Yuanzhen LI, Baile ZHANG, Hongsheng CHEN
-
Publication number: 20240412458
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for editing images based on decoder-based accumulative score sampling (DASS) losses.
Type: Application
Filed: June 12, 2024
Publication date: December 12, 2024
Inventors: Varun Jampani, Chun-Han Yao, Amit Raj, Wei-Chih Hung, Ming-Hsuan Yang, Michael Rubinstein, Yuanzhen Li
-
Publication number: 20240404154
Abstract: A multimedia communication system and computer-implemented method for transmitting auxiliary display content to an end-user communication device to be rendered on a display device with a special effect to emphasize an image included in the auxiliary display content, comprising analyzing image data included in an auxiliary display content to detect an object image or a background image, determining a special effect based on the analysis of the image data, applying the special effect to the auxiliary display content to modify display properties for the auxiliary display content such that the object image is emphasized or pops out, and sending the auxiliary display content with modified display properties to the end-user communication device. The special effect can comprise a non-customization special effect, a simple foreground special effect or a selective foreground special effect.
Type: Application
Filed: August 14, 2024
Publication date: December 5, 2024
Applicant: Google LLC
Inventors: Mikaël Bonnevie, Yuanzhen Li, Ce Liu
-
Publication number: 20240394840
Abstract: Using artificial intelligence (AI), imagery may be created for content in response to verbal or textual input. The imagery includes an object, such as a product, and a quality of the image is improved using pre-processing techniques before the image is generated and post-processing techniques after the image is generated. The pre-processing may include upscaling the object in the original image, segmenting the object from its background in the captured image, and adding an outline or border stroke to the object. The post-processing techniques may include removing the object from the AI-generated background while keeping shadows and other effects in place, blurring portions of the AI-generated background where the object will be positioned, removing the outline from the object, and re-positioning the object in the AI-generated background with the outline removed.
Type: Application
Filed: May 21, 2024
Publication date: November 28, 2024
Inventors: Elchonon Zeav Lapin, Xibing Yang, Amit Handa, Apurv Suman, Siddhant Mittal, Ashish Dilipchand Bora, Thorne Wolfenbarger, Naga Sreenivas Meruva, Yudong Sun, Rahul Guin, Arie Sharon, Beatriz Alessio Robles Orozco, Yuanzhen Li, Zhongyue Zheng, Mohammad Izadi
-
Publication number: 20240320912
Abstract: A fractional training process can be performed with training images on an instance of a machine-learned generative image model to obtain a partially trained instance of the model. A fractional optimization process can be performed with the partially trained instance on an instance of a machine-learned three-dimensional (3D) implicit representation model to obtain a partially optimized instance of the model. Based on the plurality of training images, pseudo multi-view subject images can be generated with the partially optimized instance of the 3D implicit representation model and a fully trained instance of the generative image model. The partially trained instance of the model can be trained with a set of training data. The partially optimized instance of the machine-learned 3D implicit representation model can be trained with the machine-learned multi-view image model.
Type: Application
Filed: March 20, 2024
Publication date: September 26, 2024
Inventors: Yuanzhen Li, Amit Raj, Varun Jampani, Benjamin Joseph Mildenhall, Benjamin Michael Poole, Jonathan Tilton Barron, Kfir Aberman, Michael Niemeyer, Michael Rubinstein, Nataniel Ruiz Gutierrez, Shiran Elyahu Zada, Srinivas Kaza
-
Publication number: 20240311960
Abstract: To adjust an aspect ratio of an image to match the aspect ratio of a display area for presenting the image, a computing device receives an image having a first aspect ratio, and obtains a second aspect ratio for a display area of a display in which to present the image, where the second aspect ratio is different from the first aspect ratio. The computing device extends the image to include one or more additional features which were not included in the image. Additionally, the computing device automatically crops the extended image around an identified region of interest by selecting a portion of the extended image that has an aspect ratio which matches the second aspect ratio of the display area, and provides the cropped image for presentation within the display area of the display.
Type: Application
Filed: May 20, 2022
Publication date: September 19, 2024
Inventors: Xiao Feng, Yuanzhen LI, Yihui Wang, Omer Gimenez, Han Xu, Mengjie Wang, Huiwen Chang, AJ Maschinot, Dilip Krishnan
-
Patent number: 12086913
Abstract: A multimedia communication system and computer-implemented method for transmitting auxiliary display content to an end-user communication device to be rendered on a display device with a special effect to emphasize an image included in the auxiliary display content, comprising a processor and a transmitter. The processor can be arranged to analyze image data included in an auxiliary display content, detect an object image or a background image in the auxiliary display content based on the analysis of the image data, determine a special effect based on the analysis of the image data, and apply the special effect to the auxiliary display content to modify display properties for the auxiliary display content such that the object image is emphasized or pops-out. The transmitter can be arranged to send the auxiliary display content with modified display properties to an end-user communication device.
Type: Grant
Filed: October 6, 2022
Date of Patent: September 10, 2024
Inventors: Mikaël Bonnevie, Yuanzhen Li, Ce Liu
-
Publication number: 20240296596
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a text-to-image model so that the text-to-image model generates images that each depict a variable instance of an object class when the object class without the unique identifier is provided as a text input, and that generates images that each depict a same subject instance of the object class when the unique identifier is provided as the text input.
Type: Application
Filed: August 23, 2023
Publication date: September 5, 2024
Inventors: Kfir Aberman, Nataniel Ruiz Gutierrez, Michael Rubinstein, Yuanzhen Li, Yael Pritch Knaan, Varun Jampani
-
Patent number: 12026201
Abstract: Automated product identification within hosted and streamed videos is performed based on video content of a video received at an online video platform and text content associated with the video. First embeddings representative of one or more first candidate products are determined based on video content of the video, such as one or more frames selected from within the video. Second embeddings representative of one or more second candidate products are determined based on text content associated with the video, such as a title, description, or transcript of the video. A product candidate index is produced based on the second embeddings. A product identification representative of a product featured in the video is determined based on a comparison of the first embeddings against entries of the product candidate index, such as including by a nearest neighbor search responsive to the comparison. An indication of the product identification is then output at the online video platform.
Type: Grant
Filed: May 31, 2021
Date of Patent: July 2, 2024
Assignee: GOOGLE LLC
Inventor: Yuanzhen Li
-
Publication number: 20240037145
Abstract: A method includes obtaining first data including a first identifier of a first product determined in association with a content item based on first metadata of the content item. The method further includes obtaining a first confidence value associated with the first product and the content item. The method further includes obtaining second data including a second identifier of the first product and a second confidence value. The method further includes providing the first data and the second data to a trained machine learning model. The method further includes obtaining a third confidence value from the trained machine learning model associated with the first product. The method further includes adjusting second metadata of the content item in view of the third confidence value.
Type: Application
Filed: August 1, 2022
Publication date: February 1, 2024
Inventors: Marco Ziccardi, Min-hsuan Tsai, Wei-Hong Chuang, Rahul Sunil Bhalerao, Ye Xia, Madhuri Shanbhogue, Mojtaba Seyedhosseini, Mike Krainin, Andrei Kapishnikov, Yuanzhen Li
-
Publication number: 20230021805
Abstract: A multimedia communication system and computer-implemented method for transmitting auxiliary display content to an end-user communication device to be rendered on a display device with a special effect to emphasize an image included in the auxiliary display content, comprising a processor and a transmitter. The processor can be arranged to analyze image data included in an auxiliary display content, detect an object image or a background image in the auxiliary display content based on the analysis of the image data, determine a special effect based on the analysis of the image data, and apply the special effect to the auxiliary display content to modify display properties for the auxiliary display content such that the object image is emphasized or pops-out. The transmitter can be arranged to send the auxiliary display content with modified display properties to an end-user communication device.
Type: Application
Filed: October 6, 2022
Publication date: January 26, 2023
Inventors: Mikaël Bonnevie, Yuanzhen Li, Ce Liu
-
Publication number: 20220382808
Abstract: Automated product identification within hosted and streamed videos is performed based on video content of a video received at an online video platform and text content associated with the video. First embeddings representative of one or more first candidate products are determined based on video content of the video, such as one or more frames selected from within the video. Second embeddings representative of one or more second candidate products are determined based on text content associated with the video, such as a title, description, or transcript of the video. A product candidate index is produced based on the second embeddings. A product identification representative of a product featured in the video is determined based on a comparison of the first embeddings against entries of the product candidate index, such as including by a nearest neighbor search responsive to the comparison. An indication of the product identification is then output at the online video platform.
Type: Application
Filed: May 31, 2021
Publication date: December 1, 2022
Inventor: Yuanzhen Li
-
Patent number: 11481941
Abstract: A multimedia communication system and computer-implemented method for transmitting auxiliary display content to an end-user communication device to be rendered on a display device with a special effect to emphasize an image included in the auxiliary display content, comprising a processor and a transmitter. The processor can be arranged to analyze image data included in an auxiliary display content, detect an object image or a background image in the auxiliary display content based on the analysis of the image data, determine a special effect based on the analysis of the image data, and apply the special effect to the auxiliary display content to modify display properties for the auxiliary display content such that the object image is emphasized or pops-out. The transmitter can be arranged to send the auxiliary display content with modified display properties to an end-user communication device.
Type: Grant
Filed: September 1, 2020
Date of Patent: October 25, 2022
Assignee: GOOGLE LLC
Inventors: Mikaël Bonnevie, Yuanzhen Li, Ce Liu
-
Publication number: 20220036613
Abstract: A multimedia communication system and computer-implemented method for transmitting auxiliary display content to an end-user communication device to be rendered on a display device with a special effect to emphasize an image included in the auxiliary display content, comprising a processor and a transmitter. The processor can be arranged to analyze image data included in an auxiliary display content, detect an object image or a background image in the auxiliary display content based on the analysis of the image data, determine a special effect based on the analysis of the image data, and apply the special effect to the auxiliary display content to modify display properties for the auxiliary display content such that the object image is emphasized or pops-out. The transmitter can be arranged to send the auxiliary display content with modified display properties to an end-user communication device.
Type: Application
Filed: September 1, 2020
Publication date: February 3, 2022
Inventors: Mikaël Bonnevie, Yuanzhen Li, Ce Liu
-
Patent number: 8437573
Abstract: Methods and systems for mask generation for an image or video are disclosed. In one embodiment, the method for creating a mask associated with an image includes receiving a first input of pixels of the image to be included in the mask. The method further includes receiving a second input of pixels of the image to not be included in the mask. The method also includes creating a classifier to classify each pixel of the image as included in the mask, not included in the mask, or unsure. The method also includes training the classifier using the first input of pixels and the second input of pixels. The method further includes classifying each pixel of the image as included in the mask, not included in the mask, or unsure.
Type: Grant
Filed: November 21, 2007
Date of Patent: May 7, 2013
Assignee: Adobe Systems Incorporated
Inventors: Aseem Agarwala, Yuanzhen Li
-
Patent number: 7454136
Abstract: A method generates a high dynamic range image by first acquiring a set of images of a scene illuminated by different lighting conditions. The set of images are then combined to generate a high dynamic range image.
Type: Grant
Filed: July 28, 2005
Date of Patent: November 18, 2008
Assignee: Mitsubishi Electric Research Laboratories, Inc.
Inventors: Ramesh Raskar, Amit Agrawal, Shree K. Nayar, Yuanzhen Li
-
Patent number: 7403707
Abstract: A camera is configured to adaptively determine camera settings. The camera includes a plurality of sensor elements configured to acquire a current image of a scene according to a current set of camera settings. A number of sensor elements having a set of desirable properties is measured. Then, a next set of camera settings that maximize an overall number of sensor elements having the set of desirable properties is determined to acquire a next better image.
Type: Grant
Filed: July 28, 2005
Date of Patent: July 22, 2008
Assignee: Mitsubishi Electric Research Laboratories, Inc.
Inventors: Ramesh Raskar, Amit Agrawal, Shree K. Nayar, Yuanzhen Li
-
Publication number: 20070025717
Abstract: A method generates a high dynamic range image by first acquiring a set of images of a scene illuminated by different lighting conditions. The set of images are then combined to generate a high dynamic range image.
Type: Application
Filed: July 28, 2005
Publication date: February 1, 2007
Inventors: Ramesh Raskar, Amit Agrawal, Shree Nayar, Yuanzhen Li
-
Publication number: 20070025720
Abstract: A camera is configured to adaptively determine camera settings. The camera includes a plurality of sensor elements configured to acquire a current image of a scene according to a current set of camera settings. A number of sensor elements having a set of desirable properties is measured. Then, a next set of camera settings that maximize an overall number of sensor elements having the set of desirable properties is determined to acquire a next better image.
Type: Application
Filed: July 28, 2005
Publication date: February 1, 2007
Inventors: Ramesh Raskar, Amit Agrawal, Shree Nayar, Yuanzhen Li