Patents by Inventor Zhao SONG

Zhao SONG has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Online inference and learning for nonsymmetric determinantal point processes

Patent number: 12288237

Abstract: Embodiments provide systems, methods, and computer storage media for a Nonsymmetric Determinantal Point Process (NDPPs) for compatible set recommendations in a setting where data representing entities (e.g., items) arrives in a stream. A stream representing compatible sets of entities is received and used to update a latent representation of the entities and a compatibility distribution indicating likelihood of compatibility of subsets of the entities. The probability distribution is accessed in a single sequential pass to predict a compatible complete set of entities that completes an incomplete set of entities. The predicted complete compatible set is provided a recommendation for entities that complete the incomplete set of entities.

Type: Grant

Filed: May 12, 2022

Date of Patent: April 29, 2025

Assignee: Adobe Inc.

Inventors: Ryan A. Rossi, Aravind Reddy Talla, Zhao Song, Anup Rao, Tung Mai, Nedim Lipka, Gang Wu, Eunyee Koh
Actively-learned context modeling for image compression

Patent number: 12219180

Abstract: Embodiments described herein provide methods and systems for facilitating actively-learned context modeling. In one embodiment, a subset of data is selected from a training dataset corresponding with an image to be compressed, the subset of data corresponding with a subset of data of pixels of the image. A context model is generated using the selected subset of data. The context model is generally in the form of a decision tree having a set of leaf nodes. Entropy values corresponding with each leaf node of the set of leaf nodes are determined. Each entropy value indicates an extent of diversity of context associated with the corresponding leaf node. Additional data from the training dataset is selected based on the entropy values corresponding with the leaf nodes. The updated subset of data is used to generate an updated context model for use in performing compression of the image.

Type: Grant

Filed: May 20, 2022

Date of Patent: February 4, 2025

Assignee: Adobe Inc.

Inventors: Gang Wu, Yang Li, Stefano Petrangeli, Viswanathan Swaminathan, Haoliang Wang, Ryan A. Rossi, Zhao Song
Assisted database anomaly mitigation

Patent number: 12130788

Abstract: An anomalous period of operation of a database management system is detected by analyzing a time series of data points indicating the number of database queries pending processing by the system. Conditions associated with execution of the pending database queries are recorded and analyzed to identify conditions correlated with the anomalous period of operation. A recommendation for tuning the database is generated based on analysis of the conditions.

Type: Grant

Filed: September 30, 2021

Date of Patent: October 29, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Vikramank Yogendra Singh, Zhao Song, Balakrishnan Narayanaswamy, Maxym Kharchenko, Jeremiah C Wilton, Vijay Gopal Joshi, Joshua Tobey Oberwetter, Kyle Henderson Hailey
SYSTEMS AND METHODS FOR LEARNING AT AN EDGE DEVICE

Publication number: 20240273378

Abstract: Systems and methods for distributed machine learning are provided. According to one aspect, a method for distributed machine learning includes obtaining, by an edge device, a static machine learning model from a hub device, computing, by the edge device, an objective function for a dynamic machine learning model based on a relationship between the dynamic machine learning model and the static machine learning model, and updating, by the edge device, the dynamic machine learning model based on the objective function.

Type: Application

Filed: February 2, 2023

Publication date: August 15, 2024

Inventors: Saayan Mitra, Arash Givchi, Xiang Chen, Somdeb Sarkhel, Ryan A. Rossi, Zhao Song
Active control system for data stream allocation

Patent number: 12047273

Abstract: A control system facilitates active management of a streaming data system. Given historical data traffic for each data stream processed by a streaming data system, the control system uses a machine learning model to predict future data traffic for each data stream. The control system selects a matching between data streams and servers for a future time that minimizes a cost comprising a switching cost and a server imbalance cost based on the predicted data traffic for the future time. In some configurations, the matching is selected using a planning window comprising a number of future time steps dynamically selected based on uncertainty associated with the predicted data traffic. Given the selected matching, the control system may manage the streaming data system by causing data streams to be moved between servers based on the matching.

Type: Grant

Filed: February 14, 2022

Date of Patent: July 23, 2024

Assignee: ADOBE INC.

Inventors: Georgios Theocharous, Kai Wang, Zhao Song, Sridhar Mahadevan
GENERATIVE GRAPH MODELING FRAMEWORK

Publication number: 20240152799

Abstract: Systems and methods for data augmentation are described. Embodiments of the present disclosure receive a dataset that includes a plurality of nodes and a plurality of edges, wherein each of the plurality of edges connects two of the plurality of nodes; compute a first nonnegative matrix representing a homophilous cluster affinity; compute a second nonnegative matrix representing a heterophilous cluster affinity; compute a probability of an additional edge based on the dataset using a machine learning model that represents a homophilous cluster and a heterophilous cluster based on the first nonnegative matrix and the second nonnegative matrix; and generate an augmented dataset including the plurality of nodes, the plurality of edges, and the additional edge.

Type: Application

Filed: October 31, 2022

Publication date: May 9, 2024

Inventors: Sudhanshu Chanpuriya, Ryan A. Rossi, Nedim Lipka, Anup Bandigadi Rao, Tung Mai, Zhao Song
SEGMENT SIZE ESTIMATION

Publication number: 20240144307

Abstract: One aspect of systems and methods for segment size estimation includes identifying a segment of users for a first time period based on time series data, wherein the time series data includes a series of interactions between users and a content channel and wherein the segment includes a portion of the users interacting with the content channel during the first time period; computing a segment return value for a second time period based on the time series data by computing a first subset and a second subset of the segment, wherein the first subset includes users that interact with the content channel greater than a threshold number of times during a range of the time series data and the second subset comprises a complement of the first subset with respect to the segment; and providing customized content to a user in the segment based on the segment return value.

Type: Application

Filed: October 18, 2022

Publication date: May 2, 2024

Inventors: Tung Mai, Ritwik Sinha, Trevor Hyrum Paulsen, Xiang Chen, William Brandon George, Nate Purser, Zhao Song
Speech denoising via discrete representation learning

Patent number: 11875809

Abstract: Developed and presented herein are embodiments of a new end-to-end approach for audio denoising, from a synthesis perspective. Instead of explicitly modelling the noise component in the input signal, embodiments directly synthesize the denoised audio from a generative model (or vocoder), as in text-to-speech systems. In one or more embodiments, to generate the phonetic contents for the autoregressive generative model, it is learned via a variational autoencoder with discrete latent representations. Furthermore, in one or more embodiments, a new matching loss is presented for the denoising purpose, which is masked on when the corresponding latent codes differ. As compared against other method on test datasets, embodiments achieve competitive performance and can be trained from scratch.

Type: Grant

Filed: October 1, 2020

Date of Patent: January 16, 2024

Assignee: Baidu USA LLC

Inventors: Zhao Song, Wei Ping
ACTIVELY-LEARNED CONTEXT MODELING FOR IMAGE COMPRESSION

Publication number: 20230379507

Abstract: Embodiments described herein provide methods and systems for facilitating actively-learned context modeling. In one embodiment, a subset of data is selected from a training dataset corresponding with an image to be compressed, the subset of data corresponding with a subset of data of pixels of the image. A context model is generated using the selected subset of data. The context model is generally in the form of a decision tree having a set of leaf nodes. Entropy values corresponding with each leaf node of the set of leaf nodes are determined. Each entropy value indicates an extent of diversity of context associated with the corresponding leaf node. Additional data from the training dataset is selected based on the entropy values corresponding with the leaf nodes. The updated subset of data is used to generate an updated context model for use in performing compression of the image.

Type: Application

Filed: May 20, 2022

Publication date: November 23, 2023

Inventors: Gang Wu, Yang Li, Stefano Petrangeli, Viswanathan Swaminathan, Haoliang Wang, Ryan A. Rossi, Zhao Song
ONLINE INFERENCE AND LEARNING FOR NONSYMMETRIC DETERMINANTAL POINT PROCESSES

Publication number: 20230368265

Abstract: Embodiments provide systems, methods, and computer storage media for a Nonsymmetric Determinantal Point Process (NDPPs) for compatible set recommendations in a setting where data representing entities (e.g., items) arrives in a stream. A stream representing compatible sets of entities is received and used to update a latent representation of the entities and a compatibility distribution indicating likelihood of compatibility of subsets of the entities. The probability distribution is accessed in a single sequential pass to predict a compatible complete set of entities that completes an incomplete set of entities. The predicted complete compatible set is provided a recommendation for entities that complete the incomplete set of entities.

Type: Application

Filed: May 12, 2022

Publication date: November 16, 2023

Inventors: Ryan A. Rossi, Aravind Reddy Talla, Zhao Song, Anup Rao, Tung Mai, Nedim Lipka, Gang Wu, Anup Rao
METHOD FOR RECONSTRUCTING THREE-DIMENSIONAL OBJECT COMBINING STRUCTURED LIGHT AND PHOTOMETRY AND TERMINAL DEVICE

Publication number: 20230298189

Abstract: The present application is applicable to the technical field of computer vision, and provides a method for reconstructing a three-dimensional object combining structured light and photometry and a terminal device, wherein the method comprises: acquiring N first images, wherein each first image is obtained by shooting after a coded pattern having a coding stripe sequence is projected to a three-dimensional object, and N is a positive integer; determining structured light depth information of the three-dimensional object based on the N first images; acquiring M second images, wherein the M second images are obtained by shooting after P light sources are respectively projected to the three-dimensional object from different directions, and M and P are positive integers; determining photometric information of the three-dimensional object based on the M second images; and reconstructing the three-dimensional object based on the structured light depth information and the photometric information.

Type: Application

Filed: November 17, 2020

Publication date: September 21, 2023

Applicant: SHENZHEN INSTITUTES OF ADVANCED TECHNOLOGY CHINESE ACADEMY OF SCIENCES

Inventors: Zhan SONG, Zhao SONG
SYSTEM AND METHOD FOR SECURE AND ROBUST DISTRIBUTED DEEP LEARNING

Publication number: 20230289473

Abstract: According to various embodiments, a method for encrypting image data for a neural network are disclosed. The method includes mixing the image data with other datapoints to form mixed data; and applying a pixel-wise random mask to the mixed data to form encrypted data. According to various embodiments, a method for encrypting text data for a neural network for natural language processing is disclosed. The method includes encoding each text datapoint via a pretrained text encoder to form encoded datapoints; mixing the encoded datapoints with other encoded datapoints to form mixed data; applying a random mask to the mixed data to form encrypted data; and incorporating the encrypted data into training a classifier of the neural network and fine-tuning the text encoder.

Type: Application

Filed: June 17, 2021

Publication date: September 14, 2023

Applicant: The Trustees of Princeton University

Inventors: Sanjeev ARORA, Kai LI, Yangsibo HUANG, Zhao SONG, Danqi CHEN
SYSTEM AND METHODS FOR RESOURCE ALLOCATION

Publication number: 20230281680

Abstract: Systems and methods for resource allocation are described. The systems and methods include receiving utilization data for computing resources shared by a plurality of users, updating a pricing agent using a reinforcement learning model based on the utilization data, identifying resource pricing information using the pricing agent, and allocating the computing resources to the plurality of users based on the resource pricing information.

Type: Application

Filed: March 1, 2022

Publication date: September 7, 2023

Inventors: Michail Mamakos, Sridhar Mahadevan, Viswanathan Swaminathan, Mariette Philippe Souppe, Ritwik Sinha, Saayan Mitra, Zhao Song
ACTIVE CONTROL SYSTEM FOR DATA STREAM ALLOCATION

Publication number: 20230261966

Abstract: A control system facilitates active management of a streaming data system. Given historical data traffic for each data stream processed by a streaming data system, the control system uses a machine learning model to predict future data traffic for each data stream. The control system selects a matching between data streams and servers for a future time that minimizes a cost comprising a switching cost and a server imbalance cost based on the predicted data traffic for the future time. In some configurations, the matching is selected using a planning window comprising a number of future time steps dynamically selected based on uncertainty associated with the predicted data traffic. Given the selected matching, the control system may manage the streaming data system by causing data streams to be moved between servers based on the matching.

Type: Application

Filed: February 14, 2022

Publication date: August 17, 2023

Inventors: Georgios Theocharous, Kai Wang, Zhao Song, Sridhar Mahadevan
Small-footprint flow-based models for raw audio

Patent number: 11521592

Abstract: WaveFlow is a small-footprint generative flow for raw audio, which may be directly trained with maximum likelihood. WaveFlow handles the long-range structure of waveform with a dilated two-dimensional (2D) convolutional architecture, while modeling the local variations using expressive autoregressive functions. WaveFlow may provide a unified view of likelihood-based models for raw audio, including WaveNet and WaveGlow, which may be considered special cases. It generates high-fidelity speech, while synthesizing several orders of magnitude faster than existing systems since it uses only a few sequential steps to generate relatively long waveforms. WaveFlow significantly reduces the likelihood gap that has existed between autoregressive models and flow-based models for efficient synthesis. Its small footprint with 5.91M parameters makes it 15 times smaller than some existing models. WaveFlow can generate 22.05 kHz high-fidelity audio 42.

Type: Grant

Filed: August 5, 2020

Date of Patent: December 6, 2022

Assignee: Baidu USA LLC

Inventors: Wei Ping, Kainan Peng, Kexin Zhao, Zhao Song
SPEECH DENOISING VIA DISCRETE REPRESENTATION LEARNING

Publication number: 20220108712

Abstract: Developed and presented herein are embodiments of a new end-to-end approach for audio denoising, from a synthesis perspective. Instead of explicitly modelling the noise component in the input signal, embodiments directly synthesize the denoised audio from a generative model (or vocoder), as in text-to-speech systems. In one or more embodiments, to generate the phonetic contents for the autoregressive generative model, it is learned via a variational autoencoder with discrete latent representations. Furthermore, in one or more embodiments, a new matching loss is presented for the denoising purpose, which is masked on when the corresponding latent codes differ. As compared against other method on test datasets, embodiments achieve competitive performance and can be trained from scratch.

Type: Application

Filed: October 1, 2020

Publication date: April 7, 2022

Applicant: Baidu USA LLC

Inventors: Zhao SONG, Wei PING
Parallel neural text-to-speech

Patent number: 11017761

Abstract: Presented herein are embodiments of a non-autoregressive sequence-to-sequence model that converts text to an audio representation. Embodiment are fully convolutional, and a tested embodiment obtained about 46.7 times speed-up over a prior model at synthesis while maintaining comparable speech quality using a WaveNet vocoder. Interestingly, a tested embodiment also has fewer attention errors than the autoregressive model on challenging test sentences. In one or more embodiments, the first fully parallel neural text-to-speech system was built by applying the inverse autoregressive flow (IAF) as the parallel neural vocoder. System embodiments can synthesize speech from text through a single feed-forward pass. Also disclosed herein are embodiments of a novel approach to train the IAF from scratch as a generative model for raw waveform, which avoids the need for distillation from a separately trained WaveNet.

Type: Grant

Filed: October 16, 2019

Date of Patent: May 25, 2021

Assignee: Baidu USA LLC

Inventors: Kainan Peng, Wei Ping, Zhao Song, Kexin Zhao
SMALL-FOOTPRINT FLOW-BASED MODELS FOR RAW AUDIO

Publication number: 20210090547

Abstract: WaveFlow is a small-footprint generative flow for raw audio, which may be directly trained with maximum likelihood. WaveFlow handles the long-range structure of waveform with a dilated two-dimensional (2D) convolutional architecture, while modeling the local variations using expressive autoregressive functions. WaveFlow may provide a unified view of likelihood-based models for raw audio, including WaveNet and WaveGlow, which may be considered special cases. It generates high-fidelity speech, while synthesizing several orders of magnitude faster than existing systems since it uses only a few sequential steps to generate relatively long waveforms. WaveFlow significantly reduces the likelihood gap that has existed between autoregressive models and flow-based models for efficient synthesis. Its small footprint with 5.91M parameters makes it 15 times smaller than some existing models. WaveFlow can generate 22.05 kHz high-fidelity audio 42.

Type: Application

Filed: August 5, 2020

Publication date: March 25, 2021

Applicant: Baidu USA LLC

Inventors: Wei PING, Kainan PENG, Kexin ZHAO, Zhao SONG
PARALLEL NEURAL TEXT-TO-SPEECH

Publication number: 20200066253

Abstract: Presented herein are embodiments of a non-autoregressive sequence-to-sequence model that converts text to an audio representation. Embodiment are fully convolutional, and a tested embodiment obtained about 46.7 times speed-up over a prior model at synthesis while maintaining comparable speech quality using a WaveNet vocoder. Interestingly, a tested embodiment also has fewer attention errors than the autoregressive model on challenging test sentences. In one or more embodiments, the first fully parallel neural text-to-speech system was built by applying the inverse autoregressive flow (IAF) as the parallel neural vocoder. System embodiments can synthesize speech from text through a single feed-forward pass. Also disclosed herein are embodiments of a novel approach to train the IAF from scratch as a generative model for raw waveform, which avoids the need for distillation from a separately trained WaveNet.

Type: Application

Filed: October 16, 2019

Publication date: February 27, 2020

Applicant: Baidu USA LLC

Inventors: Kainan PENG, Wei PING, Zhao SONG, Kexin ZHAO
MODEL ESTIMATION DEVICE, MODEL ESTIMATION METHOD, AND MODEL ESTIMATION PROGRAM

Publication number: 20200042872

Abstract: A parameter estimation unit 81 estimates parameters of a neural network model that maximize the lower limit of a log marginal likelihood related to observation value data and hidden layer nodes. A variational probability estimation unit 82 estimates parameters of the variational probability of nodes that maximize the lower limit of the log marginal likelihood. A node deletion determination unit 83 determines nodes to be deleted on the basis of the variational probability of which the parameters have been estimated, and deletes nodes determined to correspond to the nodes to be deleted. A convergence determination unit 84 determines the convergence of the neural network model on the basis of the change in the variational probability.

Type: Application

Filed: August 16, 2017

Publication date: February 6, 2020

Applicant: NEC CORPORATION

Inventors: Yusuke MURAOKA, Ryohei FUJIMAKI, Zhao SONG