Patents by Inventor Qiyong Liu
Qiyong Liu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20250150537Abstract: Systems and methods for enhancing group sound during a networked conference are provided. A computer device accesses audio data, detects a first group sound in the audio data, and generates a first group sound identifier that identifies the first group sound. The computer device is one of a plurality of computer devices connected to the networked conference. The computer device transmits the first group sound identifier to a network server and receives a control signal from the network server. The network server receives multiple group sound identifiers from the plurality of computer devices and generates the control signal based on the multiple group sound identifiers. The multiple group sound identifiers include the first group sound identifier and a second group identifier. The control signal includes the second group sound identifier. The computer device reproduces a second group sound based on the second group sound identifier.Type: ApplicationFiled: January 13, 2025Publication date: May 8, 2025Applicant: Zoom Video Communications, Inc.Inventors: Oded Gal, Lin Han, Qiyong Liu
-
Patent number: 12272345Abstract: For online audio/video conferencing applications deployed in an open office environment, using shared conference devices, it can be advantageous to define an acoustic fence. A non-participant audio received from outside the acoustic fence can be considered noise and filtered out before transmission of an audio signal to a far end recipient. Three suppression stages are used to filter the non-participant audio. The first suppression stage uses beamformers for suppression. The second suppression stage is mask-based, and the third suppression stage is reference-based. The three suppression stages filter out non-participant audio signals, having a wide range of frequencies.Type: GrantFiled: August 29, 2022Date of Patent: April 8, 2025Assignee: Zoom Communications, Inc.Inventors: Zhenghang Gu, Zhaofeng Jia, Qiyong Liu, Ye Wang, Zexian Wu, Chunyu Zhang
-
Publication number: 20250078852Abstract: Audio enhancement of musical content is performed by a device coupled to a network. The device receives an audio signal to be transmitted over the network, and detects when musical content is present in the audio signal based on a content probability threshold. The device disables noise suppression for the audio signal and applies a linear filter to cancel echo for the audio signal. The device disables gain control for the audio signal and encodes the audio signal using a codec designed for music.Type: ApplicationFiled: November 18, 2024Publication date: March 6, 2025Inventors: Qiyong Liu, Jiachuan Deng, Yuhui Chen, Oded Gal
-
Patent number: 12231869Abstract: One example method includes presenting, by a client device, a view of a virtual conference hosted by a virtual conference provider, the virtual conference including a plurality of participants, the client device associated with a participant of the plurality of participants, the view including a plurality of groupings of participants within a virtual conference area, each grouping associated with a different meeting or sub-meeting of the virtual conference; assign a location within the virtual conference area to the participant; receiving, at the client device from the conference provider, one or more audio streams associated with one or more audio sources within the plurality of groupings, the one or more audio streams provided by one or more remote client devices; determining a first location within the virtual conference area of a first audio source of the one or more audio sources; generating a plurality of spatialized audio streams based on the first location of the first audio source, the location of thType: GrantFiled: October 28, 2022Date of Patent: February 18, 2025Assignee: Zoom Video Communications, Inc.Inventors: Zhaofeng Jia, Qiyong Liu, Mengfan Zhang
-
Patent number: 12219098Abstract: Systems and methods for enhancing group sound during a networked conference are provided. A server computer establishes a networked conference among a plurality of computer devices. The server computer receives one or more group sound indicators from one or more computer devices of the plurality of computer devices within a selected time interval. In response to determining that the total number of the one or more computer devices corresponding to the one or more group sound indicators is equal to or greater than a selected threshold, the server computer transmits to the plurality of computer devices a control signal identifying a group sound corresponding to the one or more group sound indicators. The server computer causes the plurality of computer devices to reproduce the group sound identified in the control signal.Type: GrantFiled: October 24, 2023Date of Patent: February 4, 2025Assignee: Zoom Video Communications, Inc.Inventors: Oded Gal, Lin Han, Qiyong Liu
-
Patent number: 12217761Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media relate to a method for target speaker extraction. A target speaker extraction system receives an audio frame of an audio signal. A multi-speaker detection model analyzes the audio frame to determine whether the audio frame includes only a single-speaker or multiple speakers. When the audio frame includes only a single-speaker, the system inputs the audio frame to a target speaker VAD model to suppress speech in the audio frame from a non-target speaker based on comparing the audio frame to a voiceprint of a target speaker. When the audio frame includes multiple speakers, the system inputs the audio frame to a speech separation model to separate the voice of the target speaker from a voice mixture in the audio frame.Type: GrantFiled: October 31, 2021Date of Patent: February 4, 2025Assignee: Zoom Video Communications, Inc.Inventors: Yuhui Chen, Qiyong Liu, Zhengwei Wei, Yangbin Zeng
-
Patent number: 12183357Abstract: Dynamic adjustment of audio characteristics for enhancing musical sound during a networked conference is disclosed. In an embodiment, a method is provided for sound enhancement performed by a device coupled to a network. The method includes receiving an audio signal to be transmitted over the network, detecting when musical content is present in the audio signal, processing the audio signal to enhance voice characteristics to generate an enhanced audio signal when the musical content is not detected, processing the audio signal to enhance music characteristic to generate the enhanced audio signal when the musical content is detected, and transmitting the enhanced audio signal over the network.Type: GrantFiled: December 16, 2022Date of Patent: December 31, 2024Assignee: Zoom Video Communications, Inc.Inventors: Qiyong Liu, Jiachuan Deng, Yuhui Chen, Oded Gal
-
Publication number: 20240251039Abstract: Example methods and systems provide hybrid DSP-AI acoustic echo cancellation for virtual conferences. A digital signal processing (DSP)-based linear acoustic echo cancelation (AEC) can be performed on an input audio signal to filter out linear echo present in the input audio signal and generate a first filtered audio signal. A level of nonlinear echo present in the first filtered audio signal can then be determined. When the level of nonlinear echo satisfies a threshold, an artificial intelligence (AI)-based nonlinear AEC can be performed on the first filtered audio signal to generate an AI-filtered audio signal. When the level of nonlinear echo does not satisfy the threshold, a DSP-based nonlinear AEC can be performed on the first filtered audio signal to generate a second filtered audio signal.Type: ApplicationFiled: January 20, 2023Publication date: July 25, 2024Applicant: Zoom Video Communications, IncInventors: Jiachuan DENG, Cheng Lun Hu, Zhaofeng Jia, Qiyong Liu, Wei Wang, Yueguan Wang
-
Publication number: 20240212702Abstract: Various embodiments of an apparatus, method(s), system(s) and computer program product(s) described herein are directed to a Denoise Engine. The Denoise Engine collects segments of voice content of a first user account from audio data associated with a virtual meeting. The audio data further includes additional types of audio content. The Denoise Engine identifies an audio embedding model. The Denoise Engine receives a speaker embedding generated by the audio embedding model. The speaker embedding based on the collected segments of voice content. The Denoise Engine generates personalized denoised voice content of the first user account for the virtual meeting by applying the speaker embedding to the audio data associated with a virtual meeting.Type: ApplicationFiled: December 23, 2022Publication date: June 27, 2024Inventors: Jiachuan Deng, Cheng Lun Hu, Zhaofeng Jia, Qiyong Liu, Zhengwei Wei, Da-Yi Wu
-
Publication number: 20240195530Abstract: Scrolling motion is detected within a video stream to output an indication of a scrolling motion vector for use in encoding a current picture of the video stream. A first line of pixels within a motion region of the current picture is identified. A second line of pixels matching the first line of pixels is identified within a last played picture of the video stream. The scrolling motion vector is determined based on a comparison of lines of pixels nearby the second line of pixels within the last played picture. The indication of the scrolling motion vector is then output for use in encoding the current picture.Type: ApplicationFiled: December 7, 2023Publication date: June 13, 2024Inventors: Jing Wu, Zhaofeng Jia, Bo Ling, Qiyong Liu
-
Publication number: 20240147177Abstract: One example method includes presenting, by a client device, a view of a virtual conference hosted by a virtual conference provider, the virtual conference including a plurality of participants, the client device associated with a participant of the plurality of participants, the view including a plurality of groupings of participants within a virtual conference area, each grouping associated with a different meeting or sub-meeting of the virtual conference; assign a location within the virtual conference area to the participant; receiving, at the client device from the conference provider, one or more audio streams associated with one or more audio sources within the plurality of groupings, the one or more audio streams provided by one or more remote client devices; determining a first location within the virtual conference area of a first audio source of the one or more audio sources; generating a plurality of spatialized audio streams based on the first location of the first audio source, the location of thType: ApplicationFiled: October 28, 2022Publication date: May 2, 2024Inventors: Zhaofeng Jia, Qiyong Liu, Mengfan Zhang
-
Publication number: 20240129685Abstract: One example method for music collaboration using virtual conferencing includes receiving, by a client device, audio streams associated with a plurality of musicians in a virtual conference, each musician assigned to a virtual position within a virtual space established by the virtual conference, the client device associated with a participant in the virtual conference, the participant having a participant virtual position within the virtual space; determining relative virtual positions of each musician of at least a subset of the plurality of musicians in the virtual conference with respect to the participant virtual position; generating a plurality of spatialized audio streams based on the relative virtual positions of the respective musicians and the respective audio streams; and outputting the spatialized audio streams.Type: ApplicationFiled: October 17, 2022Publication date: April 18, 2024Applicant: Zoom Video Communications, Inc.Inventors: Zhaofeng Jia, Qiyong Liu, Mengfan Zhang, Xiangming Zhu
-
Publication number: 20240087556Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for generating echo recordings. The system receives, by an autoencoder, an audio signal representation that represents an audio signal and a target echo embedding that comprises information about a target room. The autoencoder comprises an encoder and a decoder. The system generates, by the encoder, a content embedding and an estimated echo embedding. The system generates, by the decoder, an echo recording representation based on the content embedding and the target echo embedding.Type: ApplicationFiled: November 13, 2023Publication date: March 14, 2024Applicant: Zoom Video Communications, Inc.Inventors: Zhaofeng Jia, Yang Liu, Qiyong Liu
-
Publication number: 20240071356Abstract: For online audio/video conferencing applications deployed in an open office environment, using shared conference devices, it can be advantageous to define an acoustic fence. A non-participant audio received from outside the acoustic fence can be considered noise and filtered out before transmission of an audio signal to a far end recipient. Three suppression stages are used to filter the non-participant audio. The first suppression stage uses beamformers for suppression. The second suppression stage is mask-based, and the third suppression stage is reference-based. The three suppression stages filter out non-participant audio signals, having a wide range of frequencies.Type: ApplicationFiled: August 29, 2022Publication date: February 29, 2024Inventors: Zhenghang Gu, Zhaofeng Jia, Qiyong Liu, Ye Wang, Zexian Wu, Chunyu Zhang
-
Publication number: 20240056529Abstract: Systems and methods for enhancing group sound during a networked conference are provided. A server computer establishes a networked conference among a plurality of computer devices. The server computer receives one or more group sound indicators from one or more computer devices of the plurality of computer devices within a selected time interval. In response to determining that the total number of the one or more computer devices corresponding to the one or more group sound indicators is equal to or greater than a selected threshold, the server computer transmits to the plurality of computer devices a control signal identifying a group sound corresponding to the one or more group sound indicators. The server computer causes the plurality of computer devices to reproduce the group sound identified in the control signal.Type: ApplicationFiled: October 24, 2023Publication date: February 15, 2024Applicant: Zoom Video Communications, Inc.Inventors: Oded Gal, Lin Han, Qiyong Liu
-
Patent number: 11881945Abstract: An adaptive screen encoding method comprising: using a computer, creating and storing, in computer memory, a plurality of conditions for use by a server configured to determine which of picture coding type to select; detecting a current picture by a sender for a content type including textual content, graphical content, and natural image content; determining a percentage of static macroblocks corresponding to the current picture; selecting the picture coding type based on the content type, the plurality of conditions, and the percentage of static macroblocks, wherein the method is performed by one or more special-purpose computing devices.Type: GrantFiled: February 2, 2022Date of Patent: January 23, 2024Assignee: Zoom Video Communications, Inc.Inventors: Jing Wu, Zhaofeng Jia, Bo Ling, Qiyong Liu
-
Patent number: 11847999Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for generating echo recordings. The system receives, by an autoencoder, an audio signal representation that represents an audio signal and a target echo embedding that comprises information about a target room. The autoencoder comprises an encoder and a decoder. The system generates, by the encoder, a content embedding and an estimated echo embedding. The system generates, by the decoder, an echo recording representation based on the content embedding and the target echo embedding.Type: GrantFiled: October 27, 2021Date of Patent: December 19, 2023Assignee: Zoom Video Communications, Inc.Inventors: Zhaofeng Jia, Yang Liu, Qiyong Liu
-
Patent number: 11818301Abstract: Enhancing group sound during a networked conference is disclosed. In an embodiment, a method includes generating an audio signal at a user equipment (UE), detecting a group sound in the audio signal, generating a group sound indicator that identifies the detected group sound, and transmitting the group sound indicator to a network server. The method also includes receiving, from the network server, a control signal that identifies a selected group sound, and reproducing, at the UE, the selected group sound identified by the control signal.Type: GrantFiled: October 6, 2022Date of Patent: November 14, 2023Assignee: Zoom Video Communications, Inc.Inventors: Qiyong Liu, Oded Gal, Lin Han
-
Publication number: 20230353678Abstract: One example method for providing spatial audio in virtual conference includes receiving, at a client device from a conference provider, an audio stream associated with an audio source, the audio stream provided by a remote client device, the client device and the remote client device participating in a virtual conference hosted by the conference provider, the client device associated with a user; determining a location of the audio source in the virtual conference with respect to the user's head; generating a plurality of spatialized audio streams based on the locations of the audio source and the audio stream; and outputting the spatialized audio streams.Type: ApplicationFiled: August 29, 2022Publication date: November 2, 2023Applicant: Zoom Video Communications, Inc.Inventors: Zhaofeng JIA, Rui Li, Qiyong Liu, Mengfan Zhang
-
Publication number: 20230282225Abstract: Online audio and video conference applications can utilize a noise removal module to eliminate unwanted audio from a participant’s speech. A noise removal module can rely on differentiating between human speech versus other audio to filter out noise. However, in some conference environments, participant and non-participant human speech can be present. Artificial intelligence models can be trained to detect both noise and non-participant audio, based on a variety of factors. The models can label captured audio and various noise removal modules can filter noise based on the output of the models.Type: ApplicationFiled: March 22, 2022Publication date: September 7, 2023Inventors: Jiachuan Deng, Cheng-Lun Hu, Zhaofeng Jia, Qiyong Liu, Qi Yang