Patents by Inventor Yuke Zhu

Yuke Zhu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Data selection based on uncertainty quantification

Patent number: 11941899

Abstract: Apparatuses, systems, and techniques generate poses of an object based on image data of the object obtained from a first viewpoint of the object and a second viewpoint of the object. The poses can be evaluated to determine a portion of the image data usable by an estimator to generate a pose of the object.

Type: Grant

Filed: May 26, 2021

Date of Patent: March 26, 2024

Assignee: NVIDIA Corporation

Inventors: Jonathan Tremblay, Fabio Tozeto Ramos, Yuke Zhu, Anima Anandkumar, Guanya Shi
Data selection based on uncertainty quantification

Patent number: 11931909

Abstract: Apparatuses, systems, and techniques generate poses of an object based on data of the object observed from a first viewpoint and a second viewpoint. The poses can be evaluated to determine a portion of the data usable by an estimator to generate a pose of the object.

Type: Grant

Filed: May 26, 2021

Date of Patent: March 19, 2024

Assignee: NVIDIA Corporation

Inventors: Jonathan Tremblay, Fabio Tozeto Ramos, Yuke Zhu, Anima Anandkumar, Guanya Shi
PERFORMING VISUAL RELATIONAL REASONING

Publication number: 20240078423

Abstract: A vision transformer (ViT) is a deep learning model that performs one or more vision processing tasks. ViTs may be modified to include a global task that clusters images with the same concept together to produce semantically consistent relational representations, as well as a local task that guides the ViT to discover object-centric semantic correspondence across images. A database of concepts and associated features may be created and used to train the global and local tasks, which may then enable the ViT to perform visual relational reasoning faster, without supervision, and outside of a synthetic domain.

Type: Application

Filed: August 22, 2022

Publication date: March 7, 2024

Inventors: Xiaojian Ma, Weili Nie, Zhiding Yu, Huaizu Jiang, Chaowei Xiao, Yuke Zhu, Anima Anandkumar
PERFORMING VISUAL RELATIONAL REASONING

Publication number: 20240062534

Abstract: A vision transformer (ViT) is a deep learning model that performs one or more vision processing tasks. ViTs may be modified to include a global task that clusters images with the same concept together to produce semantically consistent relational representations, as well as a local task that guides the ViT to discover object-centric semantic correspondence across images. A database of concepts and associated features may be created and used to train the global and local tasks, which may then enable the ViT to perform visual relational reasoning faster, without supervision, and outside of a synthetic domain.

Type: Application

Filed: August 22, 2022

Publication date: February 22, 2024

Inventors: Xiaojian Ma, Weili Nie, Zhiding Yu, Huaizu Jiang, Chaowei Xiao, Yuke Zhu, Anima Anandkumar
Automated content curation and communication

Patent number: 11895068

Abstract: Systems, devices, methods, media, and instructions for automated image processing and content curation are described. In one embodiment a server computer system receives a plurality of content communications from a plurality of client devices, each content communication comprising an associated piece of content and a corresponding metadata. Each content communication is processed to determine associated context values for each piece of content, each associated context value comprising at least one content value generated by machine vision processing of the associated piece of content. A first content collection is automatically generated based on context values, and a set of user accounts are associated with the collection. An identifier associated with the first content collection is published to user devices associated with user accounts. In various additional embodiments, different content values, image processing operations, and content selection operations are used to curate content collections.

Type: Grant

Filed: July 12, 2021

Date of Patent: February 6, 2024

Assignee: Snap Inc.

Inventors: Jianchao Yang, Yuke Zhu, Ning Xu, Kevin Dechau Tang, Jia Li
REINFORCEMENT AND IMITATION LEARNING FOR A TASK

Publication number: 20230330848

Abstract: A neural network control system for controlling an agent to perform a task in a real-world environment, operates based on both image data and proprioceptive data describing the configuration of the agent. The training of the control system includes both imitation learning, using datasets generated from previous performances of the task, and reinforcement learning, based on rewards calculated from control data output by the control system.

Type: Application

Filed: April 25, 2023

Publication date: October 19, 2023

Inventors: Saran Tunyasuvunakool, Yuke Zhu, Joshua Merel, János Kramár, Ziyu Wang, Nicolas Manfred Otto Heess
ACTION-CONDITIONAL IMPLICIT DYNAMICS OF DEFORMABLE OBJECTS

Publication number: 20230290057

Abstract: One or more machine learning models (MLMs) may learn implicit 3D representations of geometry of an object and of dynamics of the object from performing an action on the object. Implicit neural representations may be used to reconstruct high-fidelity full geometry of the object and predict a flow-based dynamics field from one or more images, which may provide a partial view of the object. Correspondences between locations of an object may be learned based at least on distances between the locations on a surface corresponding to the object, such as geodesic distances. The distances may be incorporated into a contrastive learning loss function to train one or more MLMs to learn correspondences between locations of the object, such as a correspondence embedding field. The correspondences may be used to evaluate state changes when evaluating one or more actions that may be performed on the object.

Type: Application

Filed: March 10, 2022

Publication date: September 14, 2023

Inventors: Yuke Zhu, Bokui Shen, Christopher Bongsoo Choy, Animashree Anandkumar
FINE-TUNING POLICIES TO FACILITATE CHAINING

Publication number: 20230280726

Abstract: A manipulation task may include operations performed by one or more manipulation entities on one or more objects. This manipulation task may be broken down into a plurality of sequential sub-tasks (policies). These policies may be fine-tuned so that a terminal state distribution of a given policy matches an initial state distribution of another policy that immediately follows the given policy within the plurality of policies. The fine-tuned plurality of policies may then be chained together and implemented within a manipulation environment.

Type: Application

Filed: March 1, 2022

Publication date: September 7, 2023

Inventors: Yuke Zhu, Anima Anandkumar, Youngwoon Lee
Methods and Systems to Remotely Operate Robotic Devices

Publication number: 20230226696

Abstract: Methods and systems to remotely operate robotic devices are provided. A number of embodiments allow users to remotely operate robotic devices using generalized consumer devices (e.g., cell phones). Additional embodiments provide for a platform to allow communication between consumer devices and the robotic devices. Further embodiments allow for training robotic devices to operate autonomously by training the robotic device with machine learning algorithms using data collected from scalable methods of controlling robotic devices.

Type: Application

Filed: November 2, 2020

Publication date: July 20, 2023

Applicant: The Board of Trustees of the Leland Stanford Junior University

Inventors: Ajay U. Mandlekar, Yuke Zhu, Animesh Garg, Silvio Savarese, Fei-Fei Li
Automated image processing and content curation

Patent number: 11637797

Abstract: Systems, devices, methods, media, and instructions for automated image processing and content curation are described. In one embodiment a server computer system receives a content message from a first content source, and analyzes the content message to determine one or more quality scores and one or more content values associated with the content message. The server computer system analyzes the content message with a plurality of content collections of the database to identify a match between at least one of the one or more content values and a topic associated with at least a first content collection of the one or more content collections and automatically adds the content message to the first content collection based at least in part on the match. In various embodiments, different content values, image processing operations, and content selection operations are used to curate content collections.

Type: Grant

Filed: August 9, 2021

Date of Patent: April 25, 2023

Assignee: Snap Inc.

Inventors: Jianchao Yang, Yuke Zhu, Ning Xu, Kevin Dechau Tang, Jia Li
CONTENT NAVIGATION WITH AUTOMATED CURATION

Publication number: 20230063920

Abstract: Systems, devices, methods, media, and instructions for automated image processing and content curation are described. In one embodiment a server computer system communicates at least a portion of a first content collection to a first client device, and receives a first selection communication in response, the first selection communication identifying a first piece of content of the first plurality of pieces of content. The server analyzes analyzing the first piece of content to identify a set of context values for the first piece of content, and accesses accessing a second content collection comprising pieces of content sharing at least a portion of the set of context values of the first piece of content. In various embodiments, different content values, image processing operations, and content selection operations are used to curate the content collections.

Type: Application

Filed: October 24, 2022

Publication date: March 2, 2023

Inventors: Jianchao Yang, Yuke Zhu, Ning Xu, Kevin Dechau Tang, Jia Li
DATA SELECTION BASED ON UNCERTAINTY QUANTIFICATION

Publication number: 20220383019

Abstract: Apparatuses, systems, and techniques generate poses of an object based on image data of the object obtained from a first viewpoint of the object and a second viewpoint of the object. The poses can be evaluated to determine a portion of the image data usable by an estimator to generate a pose of the object.

Type: Application

Filed: May 26, 2021

Publication date: December 1, 2022

Inventors: Jonathan Tremblay, Fabio Tozeto Ramos, Yuke Zhu, Anima Anandkumar, Guanya Shi
DATA SELECTION BASED ON UNCERTAINTY QUANTIFICATION

Publication number: 20220379484

Abstract: Apparatuses, systems, and techniques generate poses of an object based on data of the object observed from a first viewpoint and a second viewpoint. The poses can be evaluated to determine a portion of the data usable by an estimator to generate a pose of the object.

Type: Application

Filed: May 26, 2021

Publication date: December 1, 2022

Inventors: Jonathan Tremblay, Fabio Tozeto Ramos, Yuke Zhu, Anima Anandkumar, Guanya Shi
Content navigation with automated curation

Patent number: 11483268

Abstract: Systems, devices, methods, media, and instructions for automated image processing and content curation are described. In one embodiment a server computer system communicates at least a portion of a first content collection to a first client device, and receives a first selection communication in response, the first selection communication identifying a first piece of content of the first plurality of pieces of content. The server analyzes analyzing the first piece of content to identify a set of context values for the first piece of content, and accesses accessing a second content collection comprising pieces of content sharing at least a portion of the set of context values of the first piece of content. In various embodiments, different content values, image processing operations, and content selection operations are used to curate the content collections.

Type: Grant

Filed: July 1, 2020

Date of Patent: October 25, 2022

Assignee: Snap Inc.

Inventors: Jianchao Yang, Yuke Zhu, Ning Xu, Kevin Dechau Tang, Jia Li
USING NEURAL NETWORKS TO PERFORM OBJECT DETECTION, INSTANCE SEGMENTATION, AND SEMANTIC CORRESPONDENCE FROM BOUNDING BOX SUPERVISION

Publication number: 20220261593

Abstract: Apparatuses, systems, and techniques to train one or more neural networks. In at least one embodiment, one or more neural networks are trained to perform segmentation tasks based at least in part on training data comprising bounding box annotations.

Type: Application

Filed: February 16, 2021

Publication date: August 18, 2022

Inventors: Zhiding Yu, Shiyi Lan, Chris Choy, Subhashree Radhakrishnan, Guilin Liu, Yuke Zhu, Anima Anandkumar
MACHINE LEARNING MODEL FOR TASK AND MOTION PLANNING

Publication number: 20220126445

Abstract: Apparatuses, systems, and techniques are described that solve task and motion planning problems. In at least one embodiment, a task and motion planning problem is modeled using a geometric scene graph that records positions and orientations of objects within a playfield, and a symbolic scene graph that represents states of objects within context of a task to be solved. In at least one embodiment, task planning is performed using symbolic scene graph, and motion planning is performed using a geometric scene graph.

Type: Application

Filed: October 28, 2020

Publication date: April 28, 2022

Inventors: Yuke Zhu, Yifeng Zhu, Stanley Thomas Birchfield, Jonathan Tremblay
ONLINE TASK INFERENCE FOR COMPOSITIONAL TASKS WITH CONTEXT ADAPTATION

Publication number: 20220036179

Abstract: One embodiment of a method for performing a task includes generating a first posterior distribution of a global latent context variable for the task based on a pool of contexts sampled from one or more previous episodes of the task. The method also includes generating a second posterior distribution of a local latent context variable for a current time step in a current episode of the task based on one or more recent contexts sampled at one or more previous time steps of the current episode. The method further includes causing an agent to perform an action related to carrying out the task based on the first posterior distribution, the second posterior distribution, and a current state associated with the current time step.

Type: Application

Filed: July 31, 2020

Publication date: February 3, 2022

Inventors: Animesh GARG, Hongyu REN, Yuke ZHU, Anima ANANDKUMAR
AUTOMATED CONTENT CURATION AND COMMUNICATION

Publication number: 20220038402

Abstract: Systems, devices, methods, media, and instructions for automated image processing and content curation are described. In one embodiment a server computer system receives a plurality of content communications from a plurality of client devices, each content communication comprising an associated piece of content and a corresponding metadata. Each content communication is processed to determine associated context values for each piece of content, each associated context value comprising at least one content value generated by machine vision processing of the associated piece of content. A first content collection is automatically generated based on context values, and a set of user accounts are associated with the collection. An identifier associated with the first content collection is published to user devices associated with user accounts. In various additional embodiments, different content values, image processing operations, and content selection operations are used to curate content collections.

Type: Application

Filed: July 12, 2021

Publication date: February 3, 2022

Inventors: Jianchao Yang, Yuke Zhu, Ning Xu, Kevin Dechau Tang, Jia Li
AUTOMATED IMAGE PROCESSING AND CONTENT CURATION

Publication number: 20220027405

Abstract: Systems, devices, methods, media, and instructions for automated image processing and content curation are described. In one embodiment a server computer system receives a content message from a first content source, and analyzes the content message to determine one or more quality scores and one or more content values associated with the content message. The server computer system analyzes the content message with a plurality of content collections of the database to identify a match between at least one of the one or more content values and a topic associated with at least a first content collection of the one or more content collections and automatically adds the content message to the first content collection based at least in part on the match. In various embodiments, different content values, image processing operations, and content selection operations are used to curate content collections.

Type: Application

Filed: August 9, 2021

Publication date: January 27, 2022

Inventors: Jianchao Yang, Yuke Zhu, Ning Xu, Kevin Dechau Tang, Jia Li
Automated image processing and content curation

Patent number: 11088977

Abstract: Systems, devices, methods, media, and instructions for automated image processing and content curation are described. In one embodiment a server computer system receives a content message from a first content source, and analyzes the content message to determine one or more quality scores and one or more content values associated with the content message. The server computer system analyzes the content message with a plurality of content collections of the database to identify a match between at least one of the one or more content values and a topic associated with at least a first content collection of the one or more content collections and automatically adds the content message to the first content collection based at least in part on the match. In various embodiments, different content values, image processing operations, and content selection operations are used to curate content collections.

Type: Grant

Filed: July 8, 2019

Date of Patent: August 10, 2021

Assignee: Snap Inc.

Inventors: Jianchao Yang, Yuke Zhu, Ning Xu, Kevin Dechau Tang, Jia Li

1 2 next