Patents by Inventor Yukun Zhu

Yukun Zhu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

MEDIA TREND DETECTION AND MAINTENANCE AT A CONTENT SHARING PLATFORM

Publication number: 20250111675

Abstract: Methods and systems for media trend detection and maintenance are provided herein. A set of media items each having common media characteristics is identified. A set of pose values is determined for each respective media item of the set of media items. Each pose value is associated with a particular predefined pose for objects depicted by the set of media items. A set of distance scores is calculated. Each distance score represents a distance between the respective set of pose values determined for a media item and a respective set of pose values determined for an additional media item. A coherence score is determined for the set of media items based on the calculated set of distance scores. Responsive to a determination that the coherence score satisfies one or more coherence criteria, a determination is made that the set of media items corresponds to a media trend of a platform.

Type: Application

Filed: September 27, 2024

Publication date: April 3, 2025

Inventors: Hui Miao, Chun-Te Chu, Mingyan Gao, Huanfen Yao, Ting Liu, Long Zhao, Liangzhe Yuan, Yukun Zhu, Vinay Kumar Bettadapura, Ye Jin
AUDIO-VISUAL HEARING AID

Publication number: 20240428816

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: receiving, by a user device, a first indication of one or more first speakers visible in a current view recorded by a camera of the user device, in response, generating a respective isolated speech signal for each of the one or more first speakers that isolates speech of the first speaker in the current view and sending the isolated speech signals for each of the one or more first speakers to a listening device operatively coupled to the user device, receiving, by the user device, a second indication of one or more second speakers visible in the current view recorded by the camera of the user device, and in response generating and sending a respective isolated speech signal for each of the one or more second speakers to the listening device.

Type: Application

Filed: August 7, 2024

Publication date: December 26, 2024

Inventors: Anatoly Efros, Noam Etzion-Rosenberg, Tal Remez, Oran Lang, Inbar Mosseri, Israel Or Weinstein, Benjamin Schlesinger, Michael Rubinstein, Ariel Ephrat, Yukun Zhu, Stella Laurenzo, Amit Pitaru, Yossi Matias
ELECTRIC POWER STABILITY CONTROL SIMULATION MODEL CONSTRUCTION METHOD AND APPARATUS, AND ELECTRIC POWER STABILITY CONTROL SIMULATION SYSTEM

Publication number: 20240386159

Abstract: An electric power stability control simulation model construction method and apparatus, and an electric power stability control simulation system. According to an external interface function extracted from a stability control device program, a corresponding callback interface function is generated by using the external interface function, and an electric power stability control simulation model is constructed by means of the callback interface function in combination with the correspondence between the callback interface function and the external interface function. When the electric power stability control simulation model is executed to perform a simulation test, the corresponding external interface function can be called by means of the callback interface function, so that the function simulation of the corresponding stability control device program can be realized.

Type: Application

Filed: May 10, 2022

Publication date: November 21, 2024

Applicant: ELECTRIC POWER RESEARCH INSTITUTE, CHINA SOUTHERN POWER GRID

Inventors: Yihua ZHU, Qi GUO, Dongxu CHANG, Binjiang HU, Mingkang WU, Jie ZHANG, Yukun ZHU, Chengxiang LI, Hengdao GUO
Audio-visual hearing aid

Patent number: 12073844

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: receiving, by a user device, a first indication of one or more first speakers visible in a current view recorded by a camera of the user device, in response, generating a respective isolated speech signal for each of the one or more first speakers that isolates speech of the first speaker in the current view and sending the isolated speech signals for each of the one or more first speakers to a listening device operatively coupled to the user device, receiving, by the user device, a second indication of one or more second speakers visible in the current view recorded by the camera of the user device, and in response generating and sending a respective isolated speech signal for each of the one or more second speakers to the listening device.

Type: Grant

Filed: October 1, 2020

Date of Patent: August 27, 2024

Assignee: Google LLC

Inventors: Anatoly Efros, Noam Etzion-Rosenberg, Tal Remez, Oran Lang, Inbar Mosseri, Israel Or Weinstein, Benjamin Schlesinger, Michael Rubinstein, Ariel Ephrat, Yukun Zhu, Stella Laurenzo, Amit Pitaru, Yossi Matias
Safe and privacy preserving video representation

Patent number: 11949724

Abstract: A computing system and method that can be used for safe and privacy preserving video representations of participants in a videoconference. In particular, the present disclosure provides a general pipeline for generating reconstructions of videoconference participants based on semantic statuses and/or activity statuses of the participants. The systems and methods of the present disclosure allow for videoconferences that convey necessary or meaningful information of participants through presentation of generalized representations of participants while filtering unnecessary or unwanted information from the representations by leveraging machine-learning models.

Type: Grant

Filed: August 27, 2021

Date of Patent: April 2, 2024

Assignee: GOOGLE LLC

Inventors: Colvin Pitts, Yukun Zhu, Xuhui Jia
Multi-Stage Machine Learning Model Synthesis for Efficient Inference

Publication number: 20230297852

Abstract: Example implementations of the present disclosure combine efficient model design and dynamic inference. With a standalone lightweight model, the unnecessary computation on easy examples is avoided and the information extracted by the lightweight model also guide the synthesis of a specialist network from the basis models. With extensive experiments on ImageNet it is shown that a proposed example BasisNet is particularly effective for image classification and a BasisNet-MV3 achieves 80.3% top-1 accuracy with 290 M MAdds without early termination.

Type: Application

Filed: July 29, 2021

Publication date: September 21, 2023

Inventors: Li Zhang, Andrew Gerald Howard, Brendan Wesley Jou, Yukun Zhu, Mingda Zhang, Andrey Zhmoginov
GENERATING PANOPTIC SEGMENTATION LABELS

Publication number: 20230281824

Abstract: Methods, systems, and apparatus for generating a panoptic segmentation label for a sensor data sample. In one aspect, a system comprises one or more computers configured to obtain a sensor data sample characterizing a scene in an environment. The one or more computers obtain a 3D bounding box annotation at each time point for a point cloud characterizing the scene at the time point. The one or more computers obtain, for each camera image and each time point, annotation data identifying object instances depicted in the camera image, and the one or more computers generate a panoptic segmentation label for the sensor data sample characterizing the scene in the environment.

Type: Application

Filed: March 7, 2023

Publication date: September 7, 2023

Inventors: Jieru Mei, Hang Yan, Liang-Chieh Chen, Siyuan Qiao, Yukun Zhu, Alex Zihao Zhu, Xinchen Yan, Henrik Kretzschmar
AUDIO-VISUAL HEARING AID

Publication number: 20230267942

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: receiving, by a user device, a first indication of one or more first speakers visible in a current view recorded by a camera of the user device, in response, generating a respective isolated speech signal for each of the one or more first speakers that isolates speech of the first speaker in the current view and sending the isolated speech signals for each of the one or more first speakers to a listening device operatively coupled to the user device, receiving, by the user device, a second indication of one or more second speakers visible in the current view recorded by the camera of the user device, and in response generating and sending a respective isolated speech signal for each of the one or more second speakers to the listening device.

Type: Application

Filed: October 1, 2020

Publication date: August 24, 2023

Inventors: Anatoly Efros, Noam Etzion-Rosenberg, Tal Remez, Oran Lang, Inbar Mosseri, Israel Or Weinstein, Benjamin Schlesinger, Michael Rubinstein, Ariel Ephrat, Yukun Zhu, Stella Laurenzo, Amit Pitaru, Yossi Matias
Training a Restoration Model for Balanced Generation and Reconstruction

Publication number: 20230222628

Abstract: Systems and methods for training a restoration model can leverage training for two sub-tasks to train the restoration model to generate realistic and identity-preserved outputs. The systems and methods can balance the training of the generation task and the reconstruction task to ensure the generated outputs preserve the identity of the original subject while generating realistic outputs. The systems and methods can further leverage a feature quantization model and skip connections to improve the model output and overall training.

Type: Application

Filed: January 11, 2022

Publication date: July 13, 2023

Inventors: Yang Zhao, Yu-Chuan Su, Chun-Te Chu, Yandong Li, Marius Renn, Yukun Zhu, Xuhui Jia, Bradley Ray Green
Self-Supervised Learning of Photo Quality Using Implicitly Preferred Photos in Temporal Clusters

Publication number: 20230113131

Abstract: The present disclosure is directed to systems and methods for performing automated labeling of images. Labeled images can be used to train machine-learned models to infer image attributes such as quality for suggesting user actions.

Type: Application

Filed: March 5, 2020

Publication date: April 13, 2023

Inventors: Shawn Ryan O'Banion, Wenhuan Wei, Yukun Zhu
FLEXIBLE FRAMEWORK FOR JOINT REPRESENTATION LEARNING AND UNKNOWN CATEGORY DISCOVERY

Publication number: 20230103872

Abstract: Systems and methods for providing deep learning models capable of performing joint representation learning and new category discovery on a mixture of labeled and unlabeled data, which may include single- and multi-modal data. In some examples, a flexible end-to-end framework uses unified contrastive learning on labeled and unlabeled data based on both instance discrimination and category discrimination, and further uses Winner-Take-All hashing to generate a pseudo-label based on the similarity between each pair of unlabeled data points that can be used to train the model to generate clustering assignments for each unlabeled data point. In some examples, the unified contrastive learning may be further based on cross-modal discrimination.

Type: Application

Filed: October 4, 2021

Publication date: April 6, 2023

Inventors: Xuhui Jia, Yukun Zhu, Bradley Green, Kai Han
Safe and Privacy Preserving Video Representation

Publication number: 20230064328

Abstract: A computing system and method that can be used for safe and privacy preserving video representations of participants in a videoconference. In particular, the present disclosure provides a general pipeline for generating reconstructions of videoconference participants based on semantic statuses and/or activity statuses of the participants. The systems and methods of the present disclosure allow for videoconferences that convey necessary or meaningful information of participants through presentation of generalized representations of participants while filtering unnecessary or unwanted information from the representations by leveraging machine-learning models.

Type: Application

Filed: August 27, 2021

Publication date: March 2, 2023

Inventors: Colvin Pitts, Yukun Zhu, Xuhui Jia
Method and device for quantum key fusion-based virtual power plant security communication and medium

Patent number: 11233639

Abstract: A method for quantum key fusion-based virtual power plant security communication includes: identity authentication, performing identity authentication between a client and a server in a virtual power plant based on a communication requirement to acquire a root key; key distribution: generating a key encryption key and a message authentication key based on the acquired root key and performing negotiation on a data encryption key to obtain the data encryption key; and data encryption: encrypting to-be-encrypted data using the data encryption key, and implementing communication of the data. During the identity authentication or the key distribution, negotiation on a quantum key is performed by a quantum key server, and the quantum key obtained by the negotiation is used for implementing the identity authentication or used as the data encryption key. A device for quantum key fusion-based virtual power plant security communication and a computer storage medium are provided.

Type: Grant

Filed: August 24, 2018

Date of Patent: January 25, 2022

Assignees: BEIJING GUODIAN TONG NETWORK TECHNOLOGY CO., LTD, NORTH CHINA ELECTRIC POWER UNIVERSITY, STATE GRID CORPORATION OF CHINA

Inventors: Wei Deng, Wenzhao Wu, Zhuozhi Yu, Yefeng Zhang, Bingyang Han, Man Leng, Yonghong Ma, Jinglun Zhang, Runze Wu, Wenwei Chen, Nanxiang Li, Yukun Zhu
NEURAL ARCHITECTURE SEARCH FOR DENSE IMAGE PREDICTION TASKS

Publication number: 20210081796

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for determining neural network architectures. One of the methods includes obtaining training data for a dense image prediction task; and determining an architecture for a neural network configured to perform the dense image prediction task, comprising: searching a space of candidate architectures to identify one or more best performing architectures using the training data, wherein each candidate architecture in the space of candidate architectures comprises (i) the same first neural network backbone that is configured to receive an input image and to process the input image to generate a plurality of feature maps and (ii) a different dense prediction cell configured to process the plurality of feature maps and to generate an output for the dense image prediction task; and determining the architecture for the neural network based on the best performing candidate architectures.

Type: Application

Filed: November 30, 2020

Publication date: March 18, 2021

Inventors: Barret Zoph, Jonathon Shlens, Yukun Zhu, Maxwell Donald Collins, Liang-Chieh Chen, Gerhard Florian Schroff, Hartwig Adam, Georgios Papandreou
Neural architecture search for dense image prediction tasks

Patent number: 10853726

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for determining neural network architectures. One of the methods includes obtaining training data for a dense image prediction task; and determining an architecture for a neural network configured to perform the dense image prediction task, comprising: searching a space of candidate architectures to identify one or more best performing architectures using the training data, wherein each candidate architecture in the space of candidate architectures comprises (i) the same first neural network backbone that is configured to receive an input image and to process the input image to generate a plurality of feature maps and (ii) a different dense prediction cell configured to process the plurality of feature maps and to generate an output for the dense image prediction task; and determining the architecture for the neural network based on the best performing candidate architectures.

Type: Grant

Filed: May 29, 2019

Date of Patent: December 1, 2020

Assignee: Google LLC

Inventors: Barret Zoph, Jonathon Shlens, Yukun Zhu, Maxwell Donald Emmet Collins, Liang-Chieh Chen, Gerhard Florian Schroff, Hartwig Adam, Georgios Papandreou
METHOD AND DEVICE FOR QUANTUM KEY FUSION-BASED VIRTUAL POWER PLANT SECURITY COMMUNICATION AND MEDIUM

Publication number: 20190394031

Abstract: A method for quantum key fusion-based virtual power plant security communication includes: identity authentication, performing identity authentication between a client and a server in a virtual power plant based on a communication requirement to acquire a root key; key distribution: generating a key encryption key and a message authentication key based on the acquired root key and performing negotiation on a data encryption key to obtain the data encryption key; and data encryption: encrypting to-be-encrypted data using the data encryption key, and implementing communication of the data. During the identity authentication or the key distribution, negotiation on a quantum key is performed by a quantum key server, and the quantum key obtained by the negotiation is used for implementing the identity authentication or used as the data encryption key. A device for quantum key fusion-based virtual power plant security communication and a computer storage medium are provided.

Type: Application

Filed: August 24, 2018

Publication date: December 26, 2019

Applicants: BEIJING GUODIAN TONG NETWORK TECHNOLOGY CO., LTD, NORTH CHINA ELECTRIC POWER UNIVERSITY, STATE GRID CORPORATION OF CHINA

Inventors: Wei DENG, Wenzhao WU, Zhuozhi YU, Yefeng ZHANG, Bingyang HAN, Man LENG, Yonghong MA, Jinglun ZHANG, Runze WU, Wenwei CHEN, Nanxiang LI, Yukun ZHU
NEURAL ARCHITECTURE SEARCH FOR DENSE IMAGE PREDICTION TASKS

Publication number: 20190370648

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for determining neural network architectures. One of the methods includes obtaining training data for a dense image prediction task; and determining an architecture for a neural network configured to perform the dense image prediction task, comprising: searching a space of candidate architectures to identify one or more best performing architectures using the training data, wherein each candidate architecture in the space of candidate architectures comprises (i) the same first neural network backbone that is configured to receive an input image and to process the input image to generate a plurality of feature maps and (ii) a different dense prediction cell configured to process the plurality of feature maps and to generate an output for the dense image prediction task; and determining the architecture for the neural network based on the best performing candidate architectures.

Type: Application

Filed: May 29, 2019

Publication date: December 5, 2019

Inventors: Barret Zoph, Jonathon Shlens, Yukun Zhu, Maxwell Donald Emmet Collins, Liang-Chieh Chen, Gerhard Florian Schroff, Hartwig Adam, Georgios Papandreou