Patents by Inventor Weiyu LIU

Weiyu LIU has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 12223949
    Abstract: A robotic system is provided for performing rearrangement tasks guided by a natural language instruction. The system can include a number of neural networks used to determine a selected rearrangement of the objects in accordance with the natural language instruction. A target object predictor network processes a point cloud of the scene and the natural language instruction to identify a set of query objects that are to-be-rearranged. A language conditioned prior network processes the point cloud, natural language instruction, and the set of query objects to sample a distribution of rearrangements to generate a number of sets of pose offsets for the set of query objects. A discriminator network then processes the samples to generate scores for the samples. The samples may be refined until a score for at least one of the sample generated by the discriminator network is above a threshold value.
    Type: Grant
    Filed: September 7, 2022
    Date of Patent: February 11, 2025
    Assignee: NVIDIA Corporation
    Inventors: Christopher Jason Paxton, Weiyu Liu, Tucker Ryer Hermans, Dieter Fox
  • Publication number: 20240386733
    Abstract: In various examples, 3D object knowledge can be developed to extract diverse knowledge from large language models, and a part-grounding model can be trained to ground part semantics in terms of local shape features and spatial relations between parts. For example, knowledge that “the opening part of a mug that affords the pouring action is located on the top of the mug body and is often circular” can be grounded by identifying a previously unknown “opening” part based on its spatial relation to the known “body” part and its circular shape. A robotic system, for example, may use a model to identify an unlabeled part of a 3D object in imaging data. The model may be generated using natural language descriptions of relationships between parts of 3D objects, with descriptions generated using a language model that produces text in response to queries related to spatial relationships between the parts.
    Type: Application
    Filed: May 18, 2023
    Publication date: November 21, 2024
    Applicant: NVIDIA Corporation
    Inventors: Animesh GARG, Dieter FOX, Tucker Ryer HERMANS, Weiyu LIU
  • Publication number: 20230073154
    Abstract: A robotic system is provided for performing rearrangement tasks guided by a natural language instruction. The system can include a number of neural networks used to determine a selected rearrangement of the objects in accordance with the natural language instruction. A target object predictor network processes a point cloud of the scene and the natural language instruction to identify a set of query objects that are to-be-rearranged. A language conditioned prior network processes the point cloud, natural language instruction, and the set of query objects to sample a distribution of rearrangements to generate a number of sets of pose offsets for the set of query objects. A discriminator network then processes the samples to generate scores for the samples. The samples may be refined until a score for at least one of the sample generated by the discriminator network is above a threshold value.
    Type: Application
    Filed: September 7, 2022
    Publication date: March 9, 2023
    Inventors: Christopher Jason Paxton, Weiyu Liu, Tucker Ryer Hermans, Dieter Fox
  • Publication number: 20090172763
    Abstract: A method for supporting multi audio tracks in wireless communication field, which uses multi stream media servers to share the assignment for supporting multi audio tracks. One stream media server receives one video data and multi audio data, but outputs only one determinate audio data; or one stream media server receives one video data and one audio data from multi audio data. User can select the required language by using portal website, and then connect to the stream media server for obtaining one video data and one audio data. The invention also provides a stream media server and a system for supporting multi audio tracks.
    Type: Application
    Filed: February 27, 2009
    Publication date: July 2, 2009
    Applicant: HUAWEI TECHNOLOGIES CO., LTD.
    Inventor: Weiyu LIU