Patents by Inventor Yaguang Li

Yaguang Li has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20250086405
    Abstract: Some implementations relate to generating a training and/or evaluation dataset with LLM prompts (e.g., derived from user queries) based on a prompt complexity. An input prompt, for example derived from a user query, is received. The input prompt is decomposed into a prompt tree comprising a plurality of nodes. The plurality of nodes comprise: a plurality of leaf nodes corresponding to simple sub-prompts of the input query; a plurality of branch nodes of sub-prompts each corresponding to multiple simple sub-prompts; and a root node corresponding to the input prompt. A prompt complexity is determined based on a path length of the prompt tree. The prompt complexity is compared to a threshold complexity. If the prompt complexity is above the threshold complexity, the input prompt is included in a set of training prompts and/or a set of evaluation prompts.
    Type: Application
    Filed: October 5, 2023
    Publication date: March 13, 2025
    Inventors: Swaroop Mishra, Ragha Kotikalapudi, Obaid Sarvana, Sahitya Potluri, YaGuang Li, Taylor Bos, Steven Zheng, Hanzhao Lin, Chenkai Kuang, Heng-Tze Cheng, Ed H. Chi, Quoc Le
  • Publication number: 20250053751
    Abstract: Implementations relate to generating multi-modal response(s) through utilization of large language model(s) (LLM(s)). Processor(s) of a system can: receive natural language (NL) based input, generate a multi-modal response that is responsive to the NL based output, and cause the multi-modal response to be rendered. In some implementations, and in generating the multi-modal response, the processor(s) can process, using a LLM, LLM input (e.g., that includes at least the NL based input) to generate LLM output, and determine, based on the LLM output, textual content for inclusion in the multi-modal response and multimedia content for inclusion in the multi-modal response. In some implementations, the multimedia content can be obtained based on a multimedia content tag that is included in the LLM output and that is indicative of the multimedia content. In various implementations, the multimedia content can be interleaved between segments of the textual content.
    Type: Application
    Filed: January 16, 2024
    Publication date: February 13, 2025
    Inventors: Oscar Akerlund, Evgeny Sluzhaev, Golnaz Ghiasi, Thang Luong, Yifeng Lu, Igor Petrovski, Agoston Weisz, Wei Yu, Rakesh Shivanna, Michael Andrew Goodman, Apoorv Kulshreshtha, Yu Du, Amin Ghafouri, Sanil Jain, Dustin Tran, Vikas Peswani, YaGuang Li
  • Publication number: 20250045534
    Abstract: Implementations relate to a method implemented by one or more processors, the method including: receiving natural language (NL) based input associated with a client device; generating, using a large language model (LLM) and based on processing the NL based input, LLM output; determining, based on the LLM output, a sequence of LLM responses, the sequence of LLM responses including at least one intermediate LLM response and a final LLM response. In some implementations, the method may further include causing the final LLM response to be rendered at the client device. In additional or alternative implementations, the method may further include storing, as an instance of training data for fine-tuning the LLM or an additional LLM, the NL based input along with the final LLM response.
    Type: Application
    Filed: October 10, 2023
    Publication date: February 6, 2025
    Inventors: Swaroop Mishra, Ragha Kotikalapudi, Sahitya Potluri, Taylor Bos, YaGuang Li, Hanzhao Lin, Steven Zheng, Yu Du, Chen Zhu, Chenkai Kuang, Xinying Song, Heng-Tze Cheng, Ed H. Chi, Quoc Le
  • Publication number: 20240428006
    Abstract: Implementations relate to asymmetric quantization of large language models (LLMs). Processor(s) of a system can: obtain a trained LLM, wherein the trained LLM includes a plurality of layers, each layer comprising a respective plurality of weights; for each layer of the plurality of layers: calculate an optimal clipping range for the respective plurality of weights, and clip one or more weights of the respective plurality of weights that lie outside of the optimal clipping range to produce a clipped layer; quantize the LLM to generate a quantized LLM, wherein the instructions to quantize include instructions to map weights of the plurality of clipped layers of the LLM from continuous values to discrete values; and provide the quantized LLM for downstream processing.
    Type: Application
    Filed: June 20, 2023
    Publication date: December 26, 2024
    Inventors: Jian Li, Zhifeng Chen, Yanping Huang, Yuanzhong Xu, Tao Wang, YaGuang Li
  • Publication number: 20240330334
    Abstract: Implementations relate to reducing latency in generating and/or rendering a given stream of natural language (NL) based output generated using a large language model (LLM). Processor(s) of a system can: receive NL based input associated with a client device, generate the stream of NL based output utilizing the LLM that is responsive to the NL based input and that is for a given dialog context of an ongoing dialog, and cause the stream of NL based output to be rendered at the client device. Notably, the processor(s) can employ attribute classifier(s) and a multi-objective scorer to implement a blockwise controlled decoding technique in generating the stream of NL based output utilizing the LLM. By implementing the blockwise controlled decoding technique in generating the stream of NL based output utilizing the LLM, the processor(s) can reduce latency in generating and/or of the stream of NL based output generated utilizing the LLM.
    Type: Application
    Filed: July 25, 2023
    Publication date: October 3, 2024
    Inventors: Sidharth Mudgal, Ahmad Beirami, Jilin Chen, Alex Beutel, Harish Ganapathy, YaGuang Li, Tao Wang, Yanping Huang, Trevor Strohman
  • Patent number: 11907674
    Abstract: Implementations relate to generating multi-modal response(s) through utilization of large language model(s) (LLM(s)). Processor(s) of a system can: receive natural language (NL) based input, generate a multi-modal response that is responsive to the NL based output, and cause the multi-modal response to be rendered. In some implementations, and in generating the multi-modal response, the processor(s) can process, using a LLM, LLM input (e.g., that includes at least the NL based input) to generate LLM output, and determine, based on the LLM output, textual content for inclusion in the multi-modal response and multimedia content for inclusion in the multi-modal response. In some implementations, the multimedia content can be obtained based on a multimedia content tag that is included in the LLM output and that is indicative of the multimedia content. In various implementations, the multimedia content can be interleaved between segments of the textual content.
    Type: Grant
    Filed: September 20, 2023
    Date of Patent: February 20, 2024
    Assignee: GOOGLE LLC
    Inventors: Oscar Akerlund, Evgeny Sluzhaev, Golnaz Ghiasi, Thang Luong, Yifeng Lu, Igor Petrovski, Ágoston Weisz, Wei Yu, Rakesh Shivanna, Michael Andrew Goodman, Apoorv Kulshreshtha, Yu Du, Amin Ghafouri, Sanil Jain, Dustin Tran, Vikas Peswani, YaGuang Li
  • Patent number: 11758203
    Abstract: Devices, computer-readable media, and methods for making a cache admission decision regarding a video chunk are described. For instance, a processing system including at least one processor may obtain a request for a first chunk of a first video, determine that the first chunk is not stored in a cache, and apply, in response to the determining that the first chunk is not stored in the cache, a classifier to predict whether the first chunk will be re-requested within a time horizon, where the classifier is trained in accordance with a set of features associated with a plurality of chunks of a plurality of videos. When it is predicted via the classifier that the first chunk will be re-requested within the time horizon, the processing system may store the first chunk in the cache.
    Type: Grant
    Filed: December 13, 2019
    Date of Patent: September 12, 2023
    Assignees: AT&T Intellectual Property I, L.P., University of Southern California
    Inventors: Shuai Hao, Subhabrata Sen, Emir Halepovic, Zahaib Akhtar, Ramesh Govindan, Yaguang Li
  • Publication number: 20230230088
    Abstract: Provided are methodology and system countering fraudulent document and/or image use when authentication of a transaction based on a given document or image use is required. Additionally provided is a manner of machine learning adapting the methodology for implementation thereof.
    Type: Application
    Filed: January 6, 2022
    Publication date: July 20, 2023
    Applicant: Socure, Inc.
    Inventors: Pablo Ysrrael ABREU, Feng Xiao, Yaguang Li, Yiwen Hua
  • Publication number: 20220383145
    Abstract: A method for regression and time series forecasting includes obtaining a set of hierarchical time series, each time series in the set of hierarchical time series including a plurality of time series data values. The method includes determining, using the set of hierarchical time series, a basis regularization of the set of hierarchical time series and an embedding regularization of the set of hierarchical time series. The method also includes training a model using the set of hierarchical time series and a loss function based on the basis regularization and the embedding regularization. The method includes forecasting, using the trained model and one of the time series in the set of hierarchical time series, an expected time series data value in the one of the time series.
    Type: Application
    Filed: May 25, 2022
    Publication date: December 1, 2022
    Applicant: Google LLC
    Inventors: Rajat Sen, Shuxin Nie, Yaguang Li, Abhimanyu Das, Nicolas Loeff, Ananda Theertha Suresh, Pranjal Awasthi, Biswajit Paria
  • Publication number: 20210185368
    Abstract: Devices, computer-readable media, and methods for making a cache admission decision regarding a video chunk are described. For instance, a processing system including at least one processor may obtain a request for a first chunk of a first video, determine that the first chunk is not stored in a cache, and apply, in response to the determining that the first chunk is not stored in the cache, a classifier to predict whether the first chunk will be re-requested within a time horizon, where the classifier is trained in accordance with a set of features associated with a plurality of chunks of a plurality of videos. When it is predicted via the classifier that the first chunk will be re-requested within the time horizon, the processing system may store the first chunk in the cache.
    Type: Application
    Filed: December 13, 2019
    Publication date: June 17, 2021
    Inventors: Shuai Hao, Subhabrata Sen, Emir Halepovic, Zahaib Akhtar, Ramesh Govindan, Yaguang Li
  • Patent number: 9736442
    Abstract: A device, system and method for content-adaptive resolution-enhancement is provided. A plurality of subframe streams are generated from a video stream, each of the plurality of subframe streams comprising a lower resolution version of the video stream, pixel-shifted from one another. A plurality of output subframe streams are generated from the plurality of subframe streams in a one-to-one relationship by: applying a plurality of video enhancement filters to each of the plurality of subframe streams, each of the plurality of video enhancement filters for enhancing different features of the video stream; and, combining one or more resulting enhanced subframe streams into a respective output subframe stream based on data in one or more regions of the video stream. One or more projectors are controlled to project the plurality of output subframe streams to combine the plurality of output subframe streams into a higher resolution projected video stream.
    Type: Grant
    Filed: August 29, 2016
    Date of Patent: August 15, 2017
    Assignee: CHRISTIE DIGITAL SYSTEMS USA, INC.
    Inventors: Alexander Wong, Yaguang Li, Mark Lamm, Hicham Sekkati
  • Patent number: RE47845
    Abstract: A device, system and method for content-adaptive resolution-enhancement is provided. A plurality of subframe streams are generated from a video stream, each of the plurality of subframe streams comprising a lower resolution version of the video stream, pixel-shifted from one another. A plurality of output subframe streams are generated from the plurality of subframe streams in a one-to-one relationship by: applying a plurality of video enhancement filters to each of the plurality of subframe streams, each of the plurality of video enhancement filters for enhancing different features of the video stream; and, combining one or more resulting enhanced subframe streams into a respective output subframe stream based on data in one or more regions of the video stream. One or more projectors are controlled to project the plurality of output subframe streams to combine the plurality of output subframe streams into a higher resolution projected video stream.
    Type: Grant
    Filed: November 23, 2017
    Date of Patent: February 4, 2020
    Assignee: CHRISTIE DIGITAL SYSTEMS USA, INC.
    Inventors: Alexander Wong, Yaguang Li, Mark Lamm, Hicham Sekkati