Patents by Inventor Yukun Zhu
Yukun Zhu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20250111675Abstract: Methods and systems for media trend detection and maintenance are provided herein. A set of media items each having common media characteristics is identified. A set of pose values is determined for each respective media item of the set of media items. Each pose value is associated with a particular predefined pose for objects depicted by the set of media items. A set of distance scores is calculated. Each distance score represents a distance between the respective set of pose values determined for a media item and a respective set of pose values determined for an additional media item. A coherence score is determined for the set of media items based on the calculated set of distance scores. Responsive to a determination that the coherence score satisfies one or more coherence criteria, a determination is made that the set of media items corresponds to a media trend of a platform.Type: ApplicationFiled: September 27, 2024Publication date: April 3, 2025Inventors: Hui Miao, Chun-Te Chu, Mingyan Gao, Huanfen Yao, Ting Liu, Long Zhao, Liangzhe Yuan, Yukun Zhu, Vinay Kumar Bettadapura, Ye Jin
-
Publication number: 20240428816Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: receiving, by a user device, a first indication of one or more first speakers visible in a current view recorded by a camera of the user device, in response, generating a respective isolated speech signal for each of the one or more first speakers that isolates speech of the first speaker in the current view and sending the isolated speech signals for each of the one or more first speakers to a listening device operatively coupled to the user device, receiving, by the user device, a second indication of one or more second speakers visible in the current view recorded by the camera of the user device, and in response generating and sending a respective isolated speech signal for each of the one or more second speakers to the listening device.Type: ApplicationFiled: August 7, 2024Publication date: December 26, 2024Inventors: Anatoly Efros, Noam Etzion-Rosenberg, Tal Remez, Oran Lang, Inbar Mosseri, Israel Or Weinstein, Benjamin Schlesinger, Michael Rubinstein, Ariel Ephrat, Yukun Zhu, Stella Laurenzo, Amit Pitaru, Yossi Matias
-
Publication number: 20240386159Abstract: An electric power stability control simulation model construction method and apparatus, and an electric power stability control simulation system. According to an external interface function extracted from a stability control device program, a corresponding callback interface function is generated by using the external interface function, and an electric power stability control simulation model is constructed by means of the callback interface function in combination with the correspondence between the callback interface function and the external interface function. When the electric power stability control simulation model is executed to perform a simulation test, the corresponding external interface function can be called by means of the callback interface function, so that the function simulation of the corresponding stability control device program can be realized.Type: ApplicationFiled: May 10, 2022Publication date: November 21, 2024Applicant: ELECTRIC POWER RESEARCH INSTITUTE, CHINA SOUTHERN POWER GRIDInventors: Yihua ZHU, Qi GUO, Dongxu CHANG, Binjiang HU, Mingkang WU, Jie ZHANG, Yukun ZHU, Chengxiang LI, Hengdao GUO
-
Patent number: 12073844Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: receiving, by a user device, a first indication of one or more first speakers visible in a current view recorded by a camera of the user device, in response, generating a respective isolated speech signal for each of the one or more first speakers that isolates speech of the first speaker in the current view and sending the isolated speech signals for each of the one or more first speakers to a listening device operatively coupled to the user device, receiving, by the user device, a second indication of one or more second speakers visible in the current view recorded by the camera of the user device, and in response generating and sending a respective isolated speech signal for each of the one or more second speakers to the listening device.Type: GrantFiled: October 1, 2020Date of Patent: August 27, 2024Assignee: Google LLCInventors: Anatoly Efros, Noam Etzion-Rosenberg, Tal Remez, Oran Lang, Inbar Mosseri, Israel Or Weinstein, Benjamin Schlesinger, Michael Rubinstein, Ariel Ephrat, Yukun Zhu, Stella Laurenzo, Amit Pitaru, Yossi Matias
-
Patent number: 11949724Abstract: A computing system and method that can be used for safe and privacy preserving video representations of participants in a videoconference. In particular, the present disclosure provides a general pipeline for generating reconstructions of videoconference participants based on semantic statuses and/or activity statuses of the participants. The systems and methods of the present disclosure allow for videoconferences that convey necessary or meaningful information of participants through presentation of generalized representations of participants while filtering unnecessary or unwanted information from the representations by leveraging machine-learning models.Type: GrantFiled: August 27, 2021Date of Patent: April 2, 2024Assignee: GOOGLE LLCInventors: Colvin Pitts, Yukun Zhu, Xuhui Jia
-
Publication number: 20230297852Abstract: Example implementations of the present disclosure combine efficient model design and dynamic inference. With a standalone lightweight model, the unnecessary computation on easy examples is avoided and the information extracted by the lightweight model also guide the synthesis of a specialist network from the basis models. With extensive experiments on ImageNet it is shown that a proposed example BasisNet is particularly effective for image classification and a BasisNet-MV3 achieves 80.3% top-1 accuracy with 290 M MAdds without early termination.Type: ApplicationFiled: July 29, 2021Publication date: September 21, 2023Inventors: Li Zhang, Andrew Gerald Howard, Brendan Wesley Jou, Yukun Zhu, Mingda Zhang, Andrey Zhmoginov
-
Publication number: 20230281824Abstract: Methods, systems, and apparatus for generating a panoptic segmentation label for a sensor data sample. In one aspect, a system comprises one or more computers configured to obtain a sensor data sample characterizing a scene in an environment. The one or more computers obtain a 3D bounding box annotation at each time point for a point cloud characterizing the scene at the time point. The one or more computers obtain, for each camera image and each time point, annotation data identifying object instances depicted in the camera image, and the one or more computers generate a panoptic segmentation label for the sensor data sample characterizing the scene in the environment.Type: ApplicationFiled: March 7, 2023Publication date: September 7, 2023Inventors: Jieru Mei, Hang Yan, Liang-Chieh Chen, Siyuan Qiao, Yukun Zhu, Alex Zihao Zhu, Xinchen Yan, Henrik Kretzschmar
-
Publication number: 20230267942Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: receiving, by a user device, a first indication of one or more first speakers visible in a current view recorded by a camera of the user device, in response, generating a respective isolated speech signal for each of the one or more first speakers that isolates speech of the first speaker in the current view and sending the isolated speech signals for each of the one or more first speakers to a listening device operatively coupled to the user device, receiving, by the user device, a second indication of one or more second speakers visible in the current view recorded by the camera of the user device, and in response generating and sending a respective isolated speech signal for each of the one or more second speakers to the listening device.Type: ApplicationFiled: October 1, 2020Publication date: August 24, 2023Inventors: Anatoly Efros, Noam Etzion-Rosenberg, Tal Remez, Oran Lang, Inbar Mosseri, Israel Or Weinstein, Benjamin Schlesinger, Michael Rubinstein, Ariel Ephrat, Yukun Zhu, Stella Laurenzo, Amit Pitaru, Yossi Matias
-
Publication number: 20230222628Abstract: Systems and methods for training a restoration model can leverage training for two sub-tasks to train the restoration model to generate realistic and identity-preserved outputs. The systems and methods can balance the training of the generation task and the reconstruction task to ensure the generated outputs preserve the identity of the original subject while generating realistic outputs. The systems and methods can further leverage a feature quantization model and skip connections to improve the model output and overall training.Type: ApplicationFiled: January 11, 2022Publication date: July 13, 2023Inventors: Yang Zhao, Yu-Chuan Su, Chun-Te Chu, Yandong Li, Marius Renn, Yukun Zhu, Xuhui Jia, Bradley Ray Green
-
Publication number: 20230113131Abstract: The present disclosure is directed to systems and methods for performing automated labeling of images. Labeled images can be used to train machine-learned models to infer image attributes such as quality for suggesting user actions.Type: ApplicationFiled: March 5, 2020Publication date: April 13, 2023Inventors: Shawn Ryan O'Banion, Wenhuan Wei, Yukun Zhu
-
Publication number: 20230103872Abstract: Systems and methods for providing deep learning models capable of performing joint representation learning and new category discovery on a mixture of labeled and unlabeled data, which may include single- and multi-modal data. In some examples, a flexible end-to-end framework uses unified contrastive learning on labeled and unlabeled data based on both instance discrimination and category discrimination, and further uses Winner-Take-All hashing to generate a pseudo-label based on the similarity between each pair of unlabeled data points that can be used to train the model to generate clustering assignments for each unlabeled data point. In some examples, the unified contrastive learning may be further based on cross-modal discrimination.Type: ApplicationFiled: October 4, 2021Publication date: April 6, 2023Inventors: Xuhui Jia, Yukun Zhu, Bradley Green, Kai Han
-
Publication number: 20230064328Abstract: A computing system and method that can be used for safe and privacy preserving video representations of participants in a videoconference. In particular, the present disclosure provides a general pipeline for generating reconstructions of videoconference participants based on semantic statuses and/or activity statuses of the participants. The systems and methods of the present disclosure allow for videoconferences that convey necessary or meaningful information of participants through presentation of generalized representations of participants while filtering unnecessary or unwanted information from the representations by leveraging machine-learning models.Type: ApplicationFiled: August 27, 2021Publication date: March 2, 2023Inventors: Colvin Pitts, Yukun Zhu, Xuhui Jia
-
Method and device for quantum key fusion-based virtual power plant security communication and medium
Patent number: 11233639Abstract: A method for quantum key fusion-based virtual power plant security communication includes: identity authentication, performing identity authentication between a client and a server in a virtual power plant based on a communication requirement to acquire a root key; key distribution: generating a key encryption key and a message authentication key based on the acquired root key and performing negotiation on a data encryption key to obtain the data encryption key; and data encryption: encrypting to-be-encrypted data using the data encryption key, and implementing communication of the data. During the identity authentication or the key distribution, negotiation on a quantum key is performed by a quantum key server, and the quantum key obtained by the negotiation is used for implementing the identity authentication or used as the data encryption key. A device for quantum key fusion-based virtual power plant security communication and a computer storage medium are provided.Type: GrantFiled: August 24, 2018Date of Patent: January 25, 2022Assignees: BEIJING GUODIAN TONG NETWORK TECHNOLOGY CO., LTD, NORTH CHINA ELECTRIC POWER UNIVERSITY, STATE GRID CORPORATION OF CHINAInventors: Wei Deng, Wenzhao Wu, Zhuozhi Yu, Yefeng Zhang, Bingyang Han, Man Leng, Yonghong Ma, Jinglun Zhang, Runze Wu, Wenwei Chen, Nanxiang Li, Yukun Zhu -
Publication number: 20210081796Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for determining neural network architectures. One of the methods includes obtaining training data for a dense image prediction task; and determining an architecture for a neural network configured to perform the dense image prediction task, comprising: searching a space of candidate architectures to identify one or more best performing architectures using the training data, wherein each candidate architecture in the space of candidate architectures comprises (i) the same first neural network backbone that is configured to receive an input image and to process the input image to generate a plurality of feature maps and (ii) a different dense prediction cell configured to process the plurality of feature maps and to generate an output for the dense image prediction task; and determining the architecture for the neural network based on the best performing candidate architectures.Type: ApplicationFiled: November 30, 2020Publication date: March 18, 2021Inventors: Barret Zoph, Jonathon Shlens, Yukun Zhu, Maxwell Donald Collins, Liang-Chieh Chen, Gerhard Florian Schroff, Hartwig Adam, Georgios Papandreou
-
Patent number: 10853726Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for determining neural network architectures. One of the methods includes obtaining training data for a dense image prediction task; and determining an architecture for a neural network configured to perform the dense image prediction task, comprising: searching a space of candidate architectures to identify one or more best performing architectures using the training data, wherein each candidate architecture in the space of candidate architectures comprises (i) the same first neural network backbone that is configured to receive an input image and to process the input image to generate a plurality of feature maps and (ii) a different dense prediction cell configured to process the plurality of feature maps and to generate an output for the dense image prediction task; and determining the architecture for the neural network based on the best performing candidate architectures.Type: GrantFiled: May 29, 2019Date of Patent: December 1, 2020Assignee: Google LLCInventors: Barret Zoph, Jonathon Shlens, Yukun Zhu, Maxwell Donald Emmet Collins, Liang-Chieh Chen, Gerhard Florian Schroff, Hartwig Adam, Georgios Papandreou
-
METHOD AND DEVICE FOR QUANTUM KEY FUSION-BASED VIRTUAL POWER PLANT SECURITY COMMUNICATION AND MEDIUM
Publication number: 20190394031Abstract: A method for quantum key fusion-based virtual power plant security communication includes: identity authentication, performing identity authentication between a client and a server in a virtual power plant based on a communication requirement to acquire a root key; key distribution: generating a key encryption key and a message authentication key based on the acquired root key and performing negotiation on a data encryption key to obtain the data encryption key; and data encryption: encrypting to-be-encrypted data using the data encryption key, and implementing communication of the data. During the identity authentication or the key distribution, negotiation on a quantum key is performed by a quantum key server, and the quantum key obtained by the negotiation is used for implementing the identity authentication or used as the data encryption key. A device for quantum key fusion-based virtual power plant security communication and a computer storage medium are provided.Type: ApplicationFiled: August 24, 2018Publication date: December 26, 2019Applicants: BEIJING GUODIAN TONG NETWORK TECHNOLOGY CO., LTD, NORTH CHINA ELECTRIC POWER UNIVERSITY, STATE GRID CORPORATION OF CHINAInventors: Wei DENG, Wenzhao WU, Zhuozhi YU, Yefeng ZHANG, Bingyang HAN, Man LENG, Yonghong MA, Jinglun ZHANG, Runze WU, Wenwei CHEN, Nanxiang LI, Yukun ZHU -
Publication number: 20190370648Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for determining neural network architectures. One of the methods includes obtaining training data for a dense image prediction task; and determining an architecture for a neural network configured to perform the dense image prediction task, comprising: searching a space of candidate architectures to identify one or more best performing architectures using the training data, wherein each candidate architecture in the space of candidate architectures comprises (i) the same first neural network backbone that is configured to receive an input image and to process the input image to generate a plurality of feature maps and (ii) a different dense prediction cell configured to process the plurality of feature maps and to generate an output for the dense image prediction task; and determining the architecture for the neural network based on the best performing candidate architectures.Type: ApplicationFiled: May 29, 2019Publication date: December 5, 2019Inventors: Barret Zoph, Jonathon Shlens, Yukun Zhu, Maxwell Donald Emmet Collins, Liang-Chieh Chen, Gerhard Florian Schroff, Hartwig Adam, Georgios Papandreou