Patents by Inventor Hung Le

Hung Le has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20260111179
    Abstract: Embodiments described herein provide a code generation framework that explores a code search space of code generation tasks through a tree-based structure. Specifically, the code generation framework comprises a Thinker model, a Solver model, and a Debugger model to implement strategy-planning, solution implementation, and solution improving correspondingly. posing comprehensive roles needed for code generation.
    Type: Application
    Filed: June 6, 2025
    Publication date: April 23, 2026
    Inventors: Jierui Li, Hung Le, Doyen Sahoo
  • Publication number: 20250365481
    Abstract: Embodiments described herein provide a video generation framework built on a decoupled multimodal cross-attention module to simultaneously condition the generation on both an input image and a text input. The video generation may thus be conditioned on the visual appearance of a target object reflected in the input image. In this way, zero-shot video generation may be achieved with little fine-tuning efforts.
    Type: Application
    Filed: August 11, 2025
    Publication date: November 27, 2025
    Inventors: Junhao Zhang, Dongxu Li, Hung Le, Caiming Xiong, Doyen Sahoo
  • Publication number: 20250362890
    Abstract: Embodiments described herein provide a method of jointly generating a code output. A first language model (LM) generates a code output in response to a task description. Second and third LMs generate critiques based on the task description and the generated code. The second LM may critique the accuracy of the generated code, and the third LM may critique the safety of the generated code (e.g., susceptibility to hacks). The first LM may revise the generated code based on the critiques. The revised code may be executed, and based on the results of the execution, the first LM may revise the code again. The process of critiques, revisions, and execution may be repeated. The final generated code is output to a user (e.g., in a programming environment).
    Type: Application
    Filed: October 18, 2024
    Publication date: November 27, 2025
    Inventors: Hung Le, Doyen Sahoo, Yingbo Zhou, Caiming Xiong, Silvio Savarese
  • Patent number: 12430584
    Abstract: Embodiments described herein provide a reinforcement learning based framework engaging pretrained language models (LMs) for program synthesis tasks. Specifically, the framework adopts a training strategy that optimizes pretrained LMs for program synthesis tasks in an actor-critic approach.
    Type: Grant
    Filed: August 26, 2022
    Date of Patent: September 30, 2025
    Assignee: Salesforce, Inc.
    Inventors: Hung Le, Yue Wang, Akhilesh Deepak Gotmare, Chu Hong Hoi
  • Patent number: 12430585
    Abstract: Embodiments described herein provide a reinforcement learning based framework engaging pretrained language models (LMs) for program synthesis tasks. Specifically, the framework adopts a training strategy that optimizes pretrained LMs for program synthesis tasks in an actor-critic approach.
    Type: Grant
    Filed: August 26, 2022
    Date of Patent: September 30, 2025
    Assignee: Salesforce, Inc.
    Inventors: Hung Le, Yue Wang, Akhilesh Deepak Gotmare, Chu Hong Hoi
  • Patent number: 12413829
    Abstract: Embodiments described herein provide a video generation framework built on a decoupled multimodal cross-attention module to simultaneously condition the generation on both an input image and a text input. The video generation may thus be conditioned on the visual appearance of a target object reflected in the input image. In this way, zero-shot video generation may be achieved with little fine-tuning efforts.
    Type: Grant
    Filed: January 31, 2024
    Date of Patent: September 9, 2025
    Assignee: Salesforce, Inc.
    Inventors: Junhao Zhang, Dongxu Li, Hung Le, Caiming Xiong, Doyen Sahoo
  • Publication number: 20250175679
    Abstract: Embodiments described herein provide a video generation framework built on a decoupled multimodal cross-attention module to simultaneously condition the generation on both an input image and a text input. The video generation may thus be conditioned on the visual appearance of a target object reflected in the input image. In this way, zero-shot video generation may be achieved with little fine-tuning efforts.
    Type: Application
    Filed: January 31, 2024
    Publication date: May 29, 2025
    Inventors: Junhao Zhang, Dongxu Li, Hung Le, Caiming Xiong, Doyen Sahoo
  • Publication number: 20250103300
    Abstract: The embodiments are directed to generating source code for a program from a problem description. One or more pre-trained code large language models (LLMs) generate sub-modules from a problem description in a natural language. The sub-modules are filtered based on testing criteria and encoded into sub-module encodings in an embedding space. The sub-module encodings are clustered into multiple clusters. A subset of sub-modules encoding that are close to the centroids of the clusters are selected. The sub-set of sub-modules is decoded into representative sub-modules. The problem description is augmented with the representative sub-modules and fed into one or more pre-trained code LLMs and new sub-modules are generated. The iterations continue until a program is generated from the representative sub-modules.
    Type: Application
    Filed: January 26, 2024
    Publication date: March 27, 2025
    Inventors: Hung Le, Hailin Chen, Amrita Saha, Akash Gokul, Doyen Sahoo, Shafiq Rayhan Joty
  • Publication number: 20240289606
    Abstract: Embodiments described herein provide a mixture of encoder-decoder Transformer framework for multi-task pretraining and flexible finetuning for both code understanding and generation tasks. Specifically, the framework is built on multimodal encoder and decoder modules. During pre-training, the encoder-decoder framework is trained with multiple learning objectives, including a diverse set of self-supervised tasks over two major stages of pretraining on unimodal and bimodal data.
    Type: Application
    Filed: February 24, 2023
    Publication date: August 29, 2024
    Inventors: Yue Wang, Hung Le, Akhilesh Deepak Gotmare, Junnan Li, Chu Hong Hoi
  • Publication number: 20240125259
    Abstract: A broadband resonator for a fuel cell compressor is disclosed. The resonator having a resonator insert having a tubular pipe surrounded by a plurality of disc-shaped walls separating the resonator insert receiving chamber into one or more individual resonator chambers. The tubular pipe is positioned eccentrically within a resonator insert receiving chamber. An entrained water removal system is formed in the resonator, preventing entrained water from accumulating in the resonator chambers which would result in an undesirable detuning of the designed amplitude frequency spectrum response.
    Type: Application
    Filed: October 12, 2022
    Publication date: April 18, 2024
    Inventors: Hung LE, Eric GALLAGHER, Hoa LE
  • Patent number: 11946398
    Abstract: A broadband resonator for a fuel cell compressor is disclosed. The resonator having a resonator insert having a tubular pipe surrounded by a plurality of disc-shaped walls separating the resonator insert receiving chamber into one or more individual resonator chambers. The tubular pipe is positioned eccentrically within a resonator insert receiving chamber. An entrained water removal system is formed in the resonator, preventing entrained water from accumulating in the resonator chambers which would result in an undesirable detuning of the designed amplitude frequency spectrum response.
    Type: Grant
    Filed: October 12, 2022
    Date of Patent: April 2, 2024
    Assignee: MANN+HUMMEL GmbH
    Inventors: Hung Le, Eric Gallagher, Hoa Le
  • Publication number: 20230376841
    Abstract: Embodiments described herein provide a reinforcement learning based framework engaging pretrained language models (LMs) for program synthesis tasks. Specifically, the framework adopts a training strategy that optimizes pretrained LMs for program synthesis tasks in an actor-critic approach.
    Type: Application
    Filed: August 26, 2022
    Publication date: November 23, 2023
    Inventors: Hung Le, Yue Wang, Akhilesh Deepak Gotmare, Chu Hong Hoi
  • Publication number: 20230376840
    Abstract: Embodiments described herein provide a reinforcement learning based framework engaging pretrained language models (LMs) for program synthesis tasks. Specifically, the framework adopts a training strategy that optimizes pretrained LMs for program synthesis tasks in an actor-critic approach.
    Type: Application
    Filed: August 26, 2022
    Publication date: November 23, 2023
    Inventors: Hung Le, Yue Wang, Akhilesh Deepak Gotmare, Chu Hong Hoi
  • Patent number: 11568000
    Abstract: A method for dialog state tracking includes decoding, by a fertility decoder, encoded dialog information associated with a dialog to generate fertilities for generating dialog states of the dialog. Each dialog state includes one or more domains. Each domain includes one or more slots. Each slot includes one or more slot tokens. The method further includes generating an input sequence to a state decoder based on the fertilities. A total number of each slot token in the input sequence is based on a corresponding fertility. The method further includes encoding, by a state encoder, the input sequence to the state decoder, and decoding, by the state decoder, the encoded input sequence to generate a complete sequence of the dialog states.
    Type: Grant
    Filed: January 7, 2020
    Date of Patent: January 31, 2023
    Assignee: SALESFORCE.COM, INC.
    Inventors: Hung Le, Chu Hong Hoi
  • Patent number: 11487999
    Abstract: A system and method for generating a response in a video grounded dialogue are provided. A video-grounded dialogue neural network language model receives video input and text input. The text input includes a dialogue history between the model and a human user and a current utterance by the user. Encoded video input is generated using video encoding layers. Encoded text input is generated using text encoding layers. The encoded video input and the encoded text input are concatenated in to a single input sequence. A generative pre-trained transformer model generates the response to the current utterance from the singe input sequence.
    Type: Grant
    Filed: April 28, 2020
    Date of Patent: November 1, 2022
    Assignee: Salesforce.com, Inc.
    Inventors: Hung Le, Chu Hong Hoi
  • Patent number: 11288438
    Abstract: Systems and methods are provided for performing a video-grounded dialogue task by a neural network model using bi-directional spatial-temporal reasoning. According to some embodiments, the systems and methods implement a dual network architecture or framework. This framework includes one network or reasoning module that learns dependencies between text and video in the direction of spatial?temporal, and another network or reasoning module that learns in the direction of temporal?spatial. The output of the multimodal reasoning modules may be combined to learn dependencies between language features in dialogues. The result joint representation is used as a contextual feature to the decoding components which allow the model to semantically generate meaningful responses to the users. In some embodiments, pointer networks are extended to the video-grounded dialogue task to allow the model to point to specific tokens from multiple source sequences to generate responses.
    Type: Grant
    Filed: February 4, 2020
    Date of Patent: March 29, 2022
    Assignee: salesforce.com, inc.
    Inventors: Hung Le, Chu Hong Hoi
  • Publication number: 20210174162
    Abstract: A system and method for generating a response in a video grounded dialogue are provided. A video-grounded dialogue neural network language model receives video input and text input. The text input includes a dialogue history between the model and a human user and a current utterance by the user. Encoded video input is generated using video encoding layers. Encoded text input is generated using text encoding layers. The encoded video input and the encoded text input are concatenated in to a single input sequence. A generative pre-trained transformer model generates the response to the current utterance from the singe input sequence.
    Type: Application
    Filed: April 28, 2020
    Publication date: June 10, 2021
    Inventors: Hung LE, Chu Hong HOI
  • Publication number: 20210150118
    Abstract: Systems and methods are provided for performing a video-grounded dialogue task by a neural network model using bi-directional spatial-temporal reasoning. According to some embodiments, the systems and methods implement a dual network architecture or framework. This framework includes one network or reasoning module that learns dependencies between text and video in the direction of spatial?temporal, and another network or reasoning module that learns in the direction of temporal?spatial. The output of the multimodal reasoning modules may be combined to learn dependencies between language features in dialogues. The result joint representation is used as a contextual feature to the decoding components which allow the model to semantically generate meaningful responses to the users. In some embodiments, pointer networks are extended to the video-grounded dialogue task to allow the model to point to specific tokens from multiple source sequences to generate responses.
    Type: Application
    Filed: February 4, 2020
    Publication date: May 20, 2021
    Inventors: Hung LE, Chu Hong HOI
  • Publication number: 20210089588
    Abstract: A method for dialog state tracking includes decoding, by a fertility decoder, encoded dialog information associated with a dialog to generate fertilities for generating dialog states of the dialog. Each dialog state includes one or more domains. Each domain includes one or more slots. Each slot includes one or more slot tokens. The method further includes generating an input sequence to a state decoder based on the fertilities. A total number of each slot token in the input sequence is based on a corresponding fertility. The method further includes encoding, by a state encoder, the input sequence to the state decoder, and decoding, by the state decoder, the encoded input sequence to generate a complete sequence of the dialog states.
    Type: Application
    Filed: January 7, 2020
    Publication date: March 25, 2021
    Inventors: Hung LE, Chu Hong HOI
  • Patent number: 10949205
    Abstract: A computer system includes a dispatch routing network to dispatch a plurality of instructions, and a processor in signal communication with the dispatch routing network. The processor determines a move instruction from the plurality of instructions to move data produced by an older second instruction, and copies a splice target file (STF) tag from a source register of the move instruction to a destination register of the move instruction without physically copying data in a slice target register and without assigning a new STF tag destination to the move instruction.
    Type: Grant
    Filed: December 20, 2018
    Date of Patent: March 16, 2021
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Joshua Bowman, Dung Q. Nguyen, Hung Le, Brian Thompto, Maureen A. Delaney, Cliff Kucharski, Steven J Battle