Patents by Inventor Yaguang Li

Yaguang Li has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

PROMPT COMPLEXITY FOR LARGE LANGUAGE MODELS

Publication number: 20250086405

Abstract: Some implementations relate to generating a training and/or evaluation dataset with LLM prompts (e.g., derived from user queries) based on a prompt complexity. An input prompt, for example derived from a user query, is received. The input prompt is decomposed into a prompt tree comprising a plurality of nodes. The plurality of nodes comprise: a plurality of leaf nodes corresponding to simple sub-prompts of the input query; a plurality of branch nodes of sub-prompts each corresponding to multiple simple sub-prompts; and a root node corresponding to the input prompt. A prompt complexity is determined based on a path length of the prompt tree. The prompt complexity is compared to a threshold complexity. If the prompt complexity is above the threshold complexity, the input prompt is included in a set of training prompts and/or a set of evaluation prompts.

Type: Application

Filed: October 5, 2023

Publication date: March 13, 2025

Inventors: Swaroop Mishra, Ragha Kotikalapudi, Obaid Sarvana, Sahitya Potluri, YaGuang Li, Taylor Bos, Steven Zheng, Hanzhao Lin, Chenkai Kuang, Heng-Tze Cheng, Ed H. Chi, Quoc Le
GENERATING MULTI-MODAL RESPONSE(S) THROUGH UTILIZATION OF LARGE LANGUAGE MODEL(S)

Publication number: 20250053751

Abstract: Implementations relate to generating multi-modal response(s) through utilization of large language model(s) (LLM(s)). Processor(s) of a system can: receive natural language (NL) based input, generate a multi-modal response that is responsive to the NL based output, and cause the multi-modal response to be rendered. In some implementations, and in generating the multi-modal response, the processor(s) can process, using a LLM, LLM input (e.g., that includes at least the NL based input) to generate LLM output, and determine, based on the LLM output, textual content for inclusion in the multi-modal response and multimedia content for inclusion in the multi-modal response. In some implementations, the multimedia content can be obtained based on a multimedia content tag that is included in the LLM output and that is indicative of the multimedia content. In various implementations, the multimedia content can be interleaved between segments of the textual content.

Type: Application

Filed: January 16, 2024

Publication date: February 13, 2025

Inventors: Oscar Akerlund, Evgeny Sluzhaev, Golnaz Ghiasi, Thang Luong, Yifeng Lu, Igor Petrovski, Agoston Weisz, Wei Yu, Rakesh Shivanna, Michael Andrew Goodman, Apoorv Kulshreshtha, Yu Du, Amin Ghafouri, Sanil Jain, Dustin Tran, Vikas Peswani, YaGuang Li
EFFICIENT TRAINING AND UTILIZATION OF LARGE LANGUAGE MODELS

Publication number: 20250045534

Abstract: Implementations relate to a method implemented by one or more processors, the method including: receiving natural language (NL) based input associated with a client device; generating, using a large language model (LLM) and based on processing the NL based input, LLM output; determining, based on the LLM output, a sequence of LLM responses, the sequence of LLM responses including at least one intermediate LLM response and a final LLM response. In some implementations, the method may further include causing the final LLM response to be rendered at the client device. In additional or alternative implementations, the method may further include storing, as an instance of training data for fine-tuning the LLM or an additional LLM, the NL based input along with the final LLM response.

Type: Application

Filed: October 10, 2023

Publication date: February 6, 2025

Inventors: Swaroop Mishra, Ragha Kotikalapudi, Sahitya Potluri, Taylor Bos, YaGuang Li, Hanzhao Lin, Steven Zheng, Yu Du, Chen Zhu, Chenkai Kuang, Xinying Song, Heng-Tze Cheng, Ed H. Chi, Quoc Le
LARGE LANGUAGE MODEL (LLM) QUANTIZATION

Publication number: 20240428006

Abstract: Implementations relate to asymmetric quantization of large language models (LLMs). Processor(s) of a system can: obtain a trained LLM, wherein the trained LLM includes a plurality of layers, each layer comprising a respective plurality of weights; for each layer of the plurality of layers: calculate an optimal clipping range for the respective plurality of weights, and clip one or more weights of the respective plurality of weights that lie outside of the optimal clipping range to produce a clipped layer; quantize the LLM to generate a quantized LLM, wherein the instructions to quantize include instructions to map weights of the plurality of clipped layers of the LLM from continuous values to discrete values; and provide the quantized LLM for downstream processing.

Type: Application

Filed: June 20, 2023

Publication date: December 26, 2024

Inventors: Jian Li, Zhifeng Chen, Yanping Huang, Yuanzhong Xu, Tao Wang, YaGuang Li
BLOCKWISE CONTROLLED DECODING OF NATURAL LANGUAGE (NL) BASED OUTPUT GENERATED USING A LARGE LANGUAGE MODEL (LLM) TO REDUCE LATENCY IN RENDERING THEREOF

Publication number: 20240330334

Abstract: Implementations relate to reducing latency in generating and/or rendering a given stream of natural language (NL) based output generated using a large language model (LLM). Processor(s) of a system can: receive NL based input associated with a client device, generate the stream of NL based output utilizing the LLM that is responsive to the NL based input and that is for a given dialog context of an ongoing dialog, and cause the stream of NL based output to be rendered at the client device. Notably, the processor(s) can employ attribute classifier(s) and a multi-objective scorer to implement a blockwise controlled decoding technique in generating the stream of NL based output utilizing the LLM. By implementing the blockwise controlled decoding technique in generating the stream of NL based output utilizing the LLM, the processor(s) can reduce latency in generating and/or of the stream of NL based output generated utilizing the LLM.

Type: Application

Filed: July 25, 2023

Publication date: October 3, 2024

Inventors: Sidharth Mudgal, Ahmad Beirami, Jilin Chen, Alex Beutel, Harish Ganapathy, YaGuang Li, Tao Wang, Yanping Huang, Trevor Strohman
Generating multi-modal response(s) through utilization of large language model(s)

Patent number: 11907674

Abstract: Implementations relate to generating multi-modal response(s) through utilization of large language model(s) (LLM(s)). Processor(s) of a system can: receive natural language (NL) based input, generate a multi-modal response that is responsive to the NL based output, and cause the multi-modal response to be rendered. In some implementations, and in generating the multi-modal response, the processor(s) can process, using a LLM, LLM input (e.g., that includes at least the NL based input) to generate LLM output, and determine, based on the LLM output, textual content for inclusion in the multi-modal response and multimedia content for inclusion in the multi-modal response. In some implementations, the multimedia content can be obtained based on a multimedia content tag that is included in the LLM output and that is indicative of the multimedia content. In various implementations, the multimedia content can be interleaved between segments of the textual content.

Type: Grant

Filed: September 20, 2023

Date of Patent: February 20, 2024

Assignee: GOOGLE LLC

Inventors: Oscar Akerlund, Evgeny Sluzhaev, Golnaz Ghiasi, Thang Luong, Yifeng Lu, Igor Petrovski, Ágoston Weisz, Wei Yu, Rakesh Shivanna, Michael Andrew Goodman, Apoorv Kulshreshtha, Yu Du, Amin Ghafouri, Sanil Jain, Dustin Tran, Vikas Peswani, YaGuang Li
Adaptive bitrate video cache

Patent number: 11758203

Abstract: Devices, computer-readable media, and methods for making a cache admission decision regarding a video chunk are described. For instance, a processing system including at least one processor may obtain a request for a first chunk of a first video, determine that the first chunk is not stored in a cache, and apply, in response to the determining that the first chunk is not stored in the cache, a classifier to predict whether the first chunk will be re-requested within a time horizon, where the classifier is trained in accordance with a set of features associated with a plurality of chunks of a plurality of videos. When it is predicted via the classifier that the first chunk will be re-requested within the time horizon, the processing system may store the first chunk in the cache.

Type: Grant

Filed: December 13, 2019

Date of Patent: September 12, 2023

Assignees: AT&T Intellectual Property I, L.P., University of Southern California

Inventors: Shuai Hao, Subhabrata Sen, Emir Halepovic, Zahaib Akhtar, Ramesh Govindan, Yaguang Li
Method and System of Predictive Document Verification and Machine Learning Therefor

Publication number: 20230230088

Abstract: Provided are methodology and system countering fraudulent document and/or image use when authentication of a transaction based on a given document or image use is required. Additionally provided is a manner of machine learning adapting the methodology for implementation thereof.

Type: Application

Filed: January 6, 2022

Publication date: July 20, 2023

Applicant: Socure, Inc.

Inventors: Pablo Ysrrael ABREU, Feng Xiao, Yaguang Li, Yiwen Hua
Regression and Time Series Forecasting

Publication number: 20220383145

Abstract: A method for regression and time series forecasting includes obtaining a set of hierarchical time series, each time series in the set of hierarchical time series including a plurality of time series data values. The method includes determining, using the set of hierarchical time series, a basis regularization of the set of hierarchical time series and an embedding regularization of the set of hierarchical time series. The method also includes training a model using the set of hierarchical time series and a loss function based on the basis regularization and the embedding regularization. The method includes forecasting, using the trained model and one of the time series in the set of hierarchical time series, an expected time series data value in the one of the time series.

Type: Application

Filed: May 25, 2022

Publication date: December 1, 2022

Applicant: Google LLC

Inventors: Rajat Sen, Shuxin Nie, Yaguang Li, Abhimanyu Das, Nicolas Loeff, Ananda Theertha Suresh, Pranjal Awasthi, Biswajit Paria
ADAPTIVE BITRATE VIDEO CACHE

Publication number: 20210185368

Abstract: Devices, computer-readable media, and methods for making a cache admission decision regarding a video chunk are described. For instance, a processing system including at least one processor may obtain a request for a first chunk of a first video, determine that the first chunk is not stored in a cache, and apply, in response to the determining that the first chunk is not stored in the cache, a classifier to predict whether the first chunk will be re-requested within a time horizon, where the classifier is trained in accordance with a set of features associated with a plurality of chunks of a plurality of videos. When it is predicted via the classifier that the first chunk will be re-requested within the time horizon, the processing system may store the first chunk in the cache.

Type: Application

Filed: December 13, 2019

Publication date: June 17, 2021

Inventors: Shuai Hao, Subhabrata Sen, Emir Halepovic, Zahaib Akhtar, Ramesh Govindan, Yaguang Li
Device, system and method for content-adaptive resolution-enhancement

Patent number: 9736442

Abstract: A device, system and method for content-adaptive resolution-enhancement is provided. A plurality of subframe streams are generated from a video stream, each of the plurality of subframe streams comprising a lower resolution version of the video stream, pixel-shifted from one another. A plurality of output subframe streams are generated from the plurality of subframe streams in a one-to-one relationship by: applying a plurality of video enhancement filters to each of the plurality of subframe streams, each of the plurality of video enhancement filters for enhancing different features of the video stream; and, combining one or more resulting enhanced subframe streams into a respective output subframe stream based on data in one or more regions of the video stream. One or more projectors are controlled to project the plurality of output subframe streams to combine the plurality of output subframe streams into a higher resolution projected video stream.

Type: Grant

Filed: August 29, 2016

Date of Patent: August 15, 2017

Assignee: CHRISTIE DIGITAL SYSTEMS USA, INC.

Inventors: Alexander Wong, Yaguang Li, Mark Lamm, Hicham Sekkati
Device, system and method for content-adaptive resolution-enhancement

Patent number: RE47845

Abstract: A device, system and method for content-adaptive resolution-enhancement is provided. A plurality of subframe streams are generated from a video stream, each of the plurality of subframe streams comprising a lower resolution version of the video stream, pixel-shifted from one another. A plurality of output subframe streams are generated from the plurality of subframe streams in a one-to-one relationship by: applying a plurality of video enhancement filters to each of the plurality of subframe streams, each of the plurality of video enhancement filters for enhancing different features of the video stream; and, combining one or more resulting enhanced subframe streams into a respective output subframe stream based on data in one or more regions of the video stream. One or more projectors are controlled to project the plurality of output subframe streams to combine the plurality of output subframe streams into a higher resolution projected video stream.

Type: Grant

Filed: November 23, 2017

Date of Patent: February 4, 2020

Assignee: CHRISTIE DIGITAL SYSTEMS USA, INC.

Inventors: Alexander Wong, Yaguang Li, Mark Lamm, Hicham Sekkati