Patents by Inventor Qiyong Liu

Qiyong Liu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

SPATIAL AUDIO IN VIRTUAL CONFERENCE MINGLING

Publication number: 20240147177

Abstract: One example method includes presenting, by a client device, a view of a virtual conference hosted by a virtual conference provider, the virtual conference including a plurality of participants, the client device associated with a participant of the plurality of participants, the view including a plurality of groupings of participants within a virtual conference area, each grouping associated with a different meeting or sub-meeting of the virtual conference; assign a location within the virtual conference area to the participant; receiving, at the client device from the conference provider, one or more audio streams associated with one or more audio sources within the plurality of groupings, the one or more audio streams provided by one or more remote client devices; determining a first location within the virtual conference area of a first audio source of the one or more audio sources; generating a plurality of spatialized audio streams based on the first location of the first audio source, the location of th

Type: Application

Filed: October 28, 2022

Publication date: May 2, 2024

Inventors: Zhaofeng Jia, Qiyong Liu, Mengfan Zhang
MUSIC COLLABORATION USING VIRTUAL CONFERENCING

Publication number: 20240129685

Abstract: One example method for music collaboration using virtual conferencing includes receiving, by a client device, audio streams associated with a plurality of musicians in a virtual conference, each musician assigned to a virtual position within a virtual space established by the virtual conference, the client device associated with a participant in the virtual conference, the participant having a participant virtual position within the virtual space; determining relative virtual positions of each musician of at least a subset of the plurality of musicians in the virtual conference with respect to the participant virtual position; generating a plurality of spatialized audio streams based on the relative virtual positions of the respective musicians and the respective audio streams; and outputting the spatialized audio streams.

Type: Application

Filed: October 17, 2022

Publication date: April 18, 2024

Applicant: Zoom Video Communications, Inc.

Inventors: Zhaofeng Jia, Qiyong Liu, Mengfan Zhang, Xiangming Zhu
ONE-SHOT ACOUSTIC ECHO GENERATION NETWORK

Publication number: 20240087556

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for generating echo recordings. The system receives, by an autoencoder, an audio signal representation that represents an audio signal and a target echo embedding that comprises information about a target room. The autoencoder comprises an encoder and a decoder. The system generates, by the encoder, a content embedding and an estimated echo embedding. The system generates, by the decoder, an echo recording representation based on the content embedding and the target echo embedding.

Type: Application

Filed: November 13, 2023

Publication date: March 14, 2024

Applicant: Zoom Video Communications, Inc.

Inventors: Zhaofeng Jia, Yang Liu, Qiyong Liu
ACOUSTIC FENCE

Publication number: 20240071356

Abstract: For online audio/video conferencing applications deployed in an open office environment, using shared conference devices, it can be advantageous to define an acoustic fence. A non-participant audio received from outside the acoustic fence can be considered noise and filtered out before transmission of an audio signal to a far end recipient. Three suppression stages are used to filter the non-participant audio. The first suppression stage uses beamformers for suppression. The second suppression stage is mask-based, and the third suppression stage is reference-based. The three suppression stages filter out non-participant audio signals, having a wide range of frequencies.

Type: Application

Filed: August 29, 2022

Publication date: February 29, 2024

Inventors: Zhenghang Gu, Zhaofeng Jia, Qiyong Liu, Ye Wang, Zexian Wu, Chunyu Zhang
ENHANCING GROUP SOUND REACTIONS

Publication number: 20240056529

Abstract: Systems and methods for enhancing group sound during a networked conference are provided. A server computer establishes a networked conference among a plurality of computer devices. The server computer receives one or more group sound indicators from one or more computer devices of the plurality of computer devices within a selected time interval. In response to determining that the total number of the one or more computer devices corresponding to the one or more group sound indicators is equal to or greater than a selected threshold, the server computer transmits to the plurality of computer devices a control signal identifying a group sound corresponding to the one or more group sound indicators. The server computer causes the plurality of computer devices to reproduce the group sound identified in the control signal.

Type: Application

Filed: October 24, 2023

Publication date: February 15, 2024

Applicant: Zoom Video Communications, Inc.

Inventors: Oded Gal, Lin Han, Qiyong Liu
Reference picture selection and coding type decision processing based on scene contents

Patent number: 11881945

Abstract: An adaptive screen encoding method comprising: using a computer, creating and storing, in computer memory, a plurality of conditions for use by a server configured to determine which of picture coding type to select; detecting a current picture by a sender for a content type including textual content, graphical content, and natural image content; determining a percentage of static macroblocks corresponding to the current picture; selecting the picture coding type based on the content type, the plurality of conditions, and the percentage of static macroblocks, wherein the method is performed by one or more special-purpose computing devices.

Type: Grant

Filed: February 2, 2022

Date of Patent: January 23, 2024

Assignee: Zoom Video Communications, Inc.

Inventors: Jing Wu, Zhaofeng Jia, Bo Ling, Qiyong Liu
One-shot acoustic echo generation network

Patent number: 11847999

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for generating echo recordings. The system receives, by an autoencoder, an audio signal representation that represents an audio signal and a target echo embedding that comprises information about a target room. The autoencoder comprises an encoder and a decoder. The system generates, by the encoder, a content embedding and an estimated echo embedding. The system generates, by the decoder, an echo recording representation based on the content embedding and the target echo embedding.

Type: Grant

Filed: October 27, 2021

Date of Patent: December 19, 2023

Assignee: Zoom Video Communications, Inc.

Inventors: Zhaofeng Jia, Yang Liu, Qiyong Liu
Enhancing group sound reactions

Patent number: 11818301

Abstract: Enhancing group sound during a networked conference is disclosed. In an embodiment, a method includes generating an audio signal at a user equipment (UE), detecting a group sound in the audio signal, generating a group sound indicator that identifies the detected group sound, and transmitting the group sound indicator to a network server. The method also includes receiving, from the network server, a control signal that identifies a selected group sound, and reproducing, at the UE, the selected group sound identified by the control signal.

Type: Grant

Filed: October 6, 2022

Date of Patent: November 14, 2023

Assignee: Zoom Video Communications, Inc.

Inventors: Qiyong Liu, Oded Gal, Lin Han
PROVIDING SPATIAL AUDIO IN VIRTUAL CONFERENCES

Publication number: 20230353678

Abstract: One example method for providing spatial audio in virtual conference includes receiving, at a client device from a conference provider, an audio stream associated with an audio source, the audio stream provided by a remote client device, the client device and the remote client device participating in a virtual conference hosted by the conference provider, the client device associated with a user; determining a location of the audio source in the virtual conference with respect to the user's head; generating a plurality of spatialized audio streams based on the locations of the audio source and the audio stream; and outputting the spatialized audio streams.

Type: Application

Filed: August 29, 2022

Publication date: November 2, 2023

Applicant: Zoom Video Communications, Inc.

Inventors: Zhaofeng JIA, Rui Li, Qiyong Liu, Mengfan Zhang
DYNAMIC NOISE AND SPEECH REMOVAL

Publication number: 20230282225

Abstract: Online audio and video conference applications can utilize a noise removal module to eliminate unwanted audio from a participant’s speech. A noise removal module can rely on differentiating between human speech versus other audio to filter out noise. However, in some conference environments, participant and non-participant human speech can be present. Artificial intelligence models can be trained to detect both noise and non-participant audio, based on a variety of factors. The models can label captured audio and various noise removal modules can filter noise based on the output of the models.

Type: Application

Filed: March 22, 2022

Publication date: September 7, 2023

Inventors: Jiachuan Deng, Cheng-Lun Hu, Zhaofeng Jia, Qiyong Liu, Qi Yang
CONTEXT SIMILARITY DETECTOR FOR ARTIFICIAL INTELLIGENCE

Publication number: 20230222360

Abstract: Artificial intelligence models are trained with training datasets of known input/output values. Test datasets are used to evaluate the trained artificial intelligence models. Context mismatch between the training dataset and the test dataset can slow down the development of artificial intelligence models. The described systems and methods can identify context similar datasets for the purpose of training and testing an artificial intelligence model. In one embodiment, a context similarity detector can ingest and combine a training dataset and a test dataset and generate a context similarity score for the two. If the score is above a threshold, the datasets are similar, and the relevant artificial intelligence model can be trained with one and tested with the other.

Type: Application

Filed: January 28, 2022

Publication date: July 13, 2023

Inventors: Qiyong Liu, Yang Liu, Saisamarth Rajesh Phaye
INTELLIGENT NOISE SUPPRESSION FOR AUDIO SIGNALS WITHIN A COMMUNICATION PLATFORM

Publication number: 20230206938

Abstract: Methods and systems provide users of a communication platform with intelligent, real-time noise suppression for audio signals broadcasted in a communication session. The system receives an input audio signal from an audio capture device; processes the input audio signal to provide a second version of the audio signal with noise suppression based on DSP techniques; transmits the second version of the audio signal to a communication platform for real-time streaming; classifies, via a machine learning algorithm, whether the second version of the audio signal contains noise beyond a noise threshold; based on a classification that the second version of the audio signal contains noise beyond the noise threshold, processes the second version of the audio signal to provide a third version of the audio signal with noise suppression based on AI techniques; and transmits the third version of the audio signal to the communication platform.

Type: Application

Filed: February 28, 2023

Publication date: June 29, 2023

Inventors: Jiachuan Deng, Qiyong Liu, Chuanfei Wang, Xiuyu Xu
ENHANCING MUSICAL SOUND DURING A NETWORKED CONFERENCE

Publication number: 20230124470

Abstract: Dynamic adjustment of audio characteristics for enhancing musical sound during a networked conference is disclosed. In an embodiment, a method is provided for sound enhancement performed by a device coupled to a network. The method includes receiving an audio signal to be transmitted over the network, detecting when musical content is present in the audio signal, processing the audio signal to enhance voice characteristics to generate an enhanced audio signal when the musical content is not detected, processing the audio signal to enhance music characteristic to generate the enhanced audio signal when the musical content is detected, and transmitting the enhanced audio signal over the network.

Type: Application

Filed: December 16, 2022

Publication date: April 20, 2023

Inventors: Qiyong Liu, Jiachuan Deng, Yuhui Chen, Oded Gal
AUDIO SUPER RESOLUTION

Publication number: 20230110255

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media relate to a method for audio super resolution. The system receives an audio signal. When the sampling rate of the audio signal is below a sampling rate threshold or the frequency range of the audio signal is below a frequency range threshold, the audio signal is input to an audio super resolution model comprising a machine learning model. The audio signal is processed by the audio super resolution model to generate a synthetic audio signal with a wider frequency range than the frequency range of the audio signal.

Type: Application

Filed: October 31, 2021

Publication date: April 13, 2023

Inventors: Yuhui Chen, Zhaofeng Jia, Qiyong Liu, Zhengwei Wei
Intelligent noise suppression for audio signals within a communication platform

Patent number: 11621016

Abstract: Methods and systems provide users of a communication platform with intelligent, real-time noise suppression for audio signals broadcasted in a communication session. The system receives an input audio signal from an audio capture device; processes the input audio signal to provide a second version of the audio signal with noise suppression based on DSP techniques; transmits the second version of the audio signal to a communication platform for real-time streaming; classifies, via a machine learning algorithm, whether the second version of the audio signal contains noise beyond a noise threshold; based on a classification that the second version of the audio signal contains noise beyond the noise threshold, processes the second version of the audio signal to provide a third version of the audio signal with noise suppression based on AI techniques; and transmits the third version of the audio signal to the communication platform.

Type: Grant

Filed: July 31, 2021

Date of Patent: April 4, 2023

Assignee: Zoom Video Communications, Inc.

Inventors: Jiachuan Deng, Qiyong Liu, Chuanfei Wang, Xiuyu Xu
ONE-SHOT ACOUSTIC ECHO GENERATION NETWORK

Publication number: 20230100986

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for generating echo recordings. The system receives, by an autoencoder, an audio signal representation that represents an audio signal and a target echo embedding that comprises information about a target room. The autoencoder comprises an encoder and a decoder. The system generates, by the encoder, a content embedding and an estimated echo embedding. The system generates, by the decoder, an echo recording representation based on the content embedding and the target echo embedding.

Type: Application

Filed: October 27, 2021

Publication date: March 30, 2023

Inventors: Zhaofeng Jia, Yang Liu, Qiyong Liu
TARGET SPEAKER MODE

Publication number: 20230095526

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media relate to a method for target speaker extraction. A target speaker extraction system receives an audio frame of an audio signal. A multi-speaker detection model analyzes the audio frame to determine whether the audio frame includes only a single-speaker or multiple speakers. When the audio frame includes only a single-speaker, the system inputs the audio frame to a target speaker VAD model to suppress speech in the audio frame from a non-target speaker based on comparing the audio frame to a voiceprint of a target speaker. When the audio frame includes multiple speakers, the system inputs the audio frame to a speech separation model to separate the voice of the target speaker from a voice mixture in the audio frame.

Type: Application

Filed: October 31, 2021

Publication date: March 30, 2023

Inventors: Yuhui Chen, Qiyong Liu, Zhengwei Wei, Yangbin Zeng
REAL-TIME LOW-COMPLEXITY ECHO CANCELLATION

Publication number: 20230096565

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media relate to a method for acoustic echo cancellation. The system inputs one or more signal representations into an acoustic echo cancellation network comprising one or more network blocks to generate a mask, each network block comprising one or more convolutional blocks, each convolutional block comprising one or more neural networks. The system combines the mask and a near-end audio signal representation to generate an echo-cancelled audio signal representation. The system generates an echo-cancelled audio signal based on the echo-cancelled audio signal representation.

Type: Application

Filed: October 27, 2021

Publication date: March 30, 2023

Inventors: Zhaofeng Jia, Yang Liu, Qiyong Liu
ENHANCING GROUP SOUND REACTIONS

Publication number: 20230102108

Abstract: Enhancing group sound during a networked conference is disclosed. In an embodiment, a method includes generating an audio signal at a user equipment (UE), detecting a group sound in the audio signal, generating a group sound indicator that identifies the detected group sound, and transmitting the group sound indicator to a network server. The method also includes receiving, from the network server, a control signal that identifies a selected group sound, and reproducing, at the UE, the selected group sound identified by the control signal.

Type: Application

Filed: October 6, 2022

Publication date: March 30, 2023

Inventors: Qiyong Liu, Oded Gal, Lin Han
INTELLIGENT NOISE SUPPRESSION FOR AUDIO SIGNALS WITHIN A COMMUNICATION PLATFORM

Publication number: 20230032785

Abstract: Methods and systems provide users of a communication platform with intelligent, real-time noise suppression for audio signals broadcasted in a communication session. The system receives an input audio signal from an audio capture device; processes the input audio signal to provide a second version of the audio signal with noise suppression based on DSP techniques; transmits the second version of the audio signal to a communication platform for real-time streaming; classifies, via a machine learning algorithm, whether the second version of the audio signal contains noise beyond a noise threshold; based on a classification that the second version of the audio signal contains noise beyond the noise threshold, processes the second version of the audio signal to provide a third version of the audio signal with noise suppression based on AI techniques; and transmits the third version of the audio signal to the communication platform.

Type: Application

Filed: July 31, 2021

Publication date: February 2, 2023

Inventors: Jiachuan Deng, Qiyong Liu, Chuanfei Wang, Irina Xu

1 2 next