Patents by Inventor Zhaofeng Jia

Zhaofeng Jia has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240146876
    Abstract: Various embodiments of an apparatus, method(s), system(s) and computer program product(s) described herein are directed to a Visualization Engine. The Visualization Engine receives audio data associated with a user account accessing a virtual meeting via a communications environment client software application. The Visualization Engine detects presence of a pre-selected type(s) of audio event(s) in the received audio data. The Visualization Engine generates a visualization representative of at least one attribute of the detected audio event(s). During playback of the audio data in the virtual meeting, the Visualization Engine renders the visualization within the communications environment client software application of the user account.
    Type: Application
    Filed: November 1, 2022
    Publication date: May 2, 2024
    Inventors: Zhaofeng Jia, Yuhui Chen
  • Publication number: 20240147177
    Abstract: One example method includes presenting, by a client device, a view of a virtual conference hosted by a virtual conference provider, the virtual conference including a plurality of participants, the client device associated with a participant of the plurality of participants, the view including a plurality of groupings of participants within a virtual conference area, each grouping associated with a different meeting or sub-meeting of the virtual conference; assign a location within the virtual conference area to the participant; receiving, at the client device from the conference provider, one or more audio streams associated with one or more audio sources within the plurality of groupings, the one or more audio streams provided by one or more remote client devices; determining a first location within the virtual conference area of a first audio source of the one or more audio sources; generating a plurality of spatialized audio streams based on the first location of the first audio source, the location of th
    Type: Application
    Filed: October 28, 2022
    Publication date: May 2, 2024
    Inventors: Zhaofeng Jia, Qiyong Liu, Mengfan Zhang
  • Publication number: 20240129685
    Abstract: One example method for music collaboration using virtual conferencing includes receiving, by a client device, audio streams associated with a plurality of musicians in a virtual conference, each musician assigned to a virtual position within a virtual space established by the virtual conference, the client device associated with a participant in the virtual conference, the participant having a participant virtual position within the virtual space; determining relative virtual positions of each musician of at least a subset of the plurality of musicians in the virtual conference with respect to the participant virtual position; generating a plurality of spatialized audio streams based on the relative virtual positions of the respective musicians and the respective audio streams; and outputting the spatialized audio streams.
    Type: Application
    Filed: October 17, 2022
    Publication date: April 18, 2024
    Applicant: Zoom Video Communications, Inc.
    Inventors: Zhaofeng Jia, Qiyong Liu, Mengfan Zhang, Xiangming Zhu
  • Publication number: 20240107230
    Abstract: Example methods and systems provide automatic audio equalization for online conferences. A target, ideal, audio frequency response for use in online conferencing audio can be designed and preset for reference, and the equalizer can operate in real time to reach the target frequency response for any input. The equalizer can selectively apply per-band equalization to the audio signal as needed to adjust an energy value for each of multiple frequency bands to produce an output audio signal that that can be, as an example, routed through meeting servers or other infrastructure to other conference participants. The equalization can compensate for conditions local to a speaker that would otherwise adversely affect audio quality.
    Type: Application
    Filed: September 26, 2022
    Publication date: March 28, 2024
    Applicant: Zoom Video Communications, Inc.
    Inventors: Yuhui Chen, Zhaofeng Jia
  • Publication number: 20240098185
    Abstract: Techniques for modification of far end audio signals are provided. In an example method, a computing device establishes a conference meeting including a first user equipment (UE) among a plurality of UE. The computing device then receives far end audio (FEA) data from the first UE. The computing device generates first modified FEA comprising determining a time-domain sum-of-absolute-difference (SAD) from the FEA data and determines a difference between the first modified FEA and a predefined value. Responsive to the difference exceeding a predefined threshold, the computing device determines a second modified FEA comprising determining a frequency-domain SAD from the FEA data and stores the second modified FEA in a buffer. The second modified is then output from the buffer.
    Type: Application
    Filed: November 28, 2023
    Publication date: March 21, 2024
    Inventors: Zhaofeng Jia, Huipin Zhang
  • Publication number: 20240087556
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for generating echo recordings. The system receives, by an autoencoder, an audio signal representation that represents an audio signal and a target echo embedding that comprises information about a target room. The autoencoder comprises an encoder and a decoder. The system generates, by the encoder, a content embedding and an estimated echo embedding. The system generates, by the decoder, an echo recording representation based on the content embedding and the target echo embedding.
    Type: Application
    Filed: November 13, 2023
    Publication date: March 14, 2024
    Applicant: Zoom Video Communications, Inc.
    Inventors: Zhaofeng Jia, Yang Liu, Qiyong Liu
  • Publication number: 20240071356
    Abstract: For online audio/video conferencing applications deployed in an open office environment, using shared conference devices, it can be advantageous to define an acoustic fence. A non-participant audio received from outside the acoustic fence can be considered noise and filtered out before transmission of an audio signal to a far end recipient. Three suppression stages are used to filter the non-participant audio. The first suppression stage uses beamformers for suppression. The second suppression stage is mask-based, and the third suppression stage is reference-based. The three suppression stages filter out non-participant audio signals, having a wide range of frequencies.
    Type: Application
    Filed: August 29, 2022
    Publication date: February 29, 2024
    Inventors: Zhenghang Gu, Zhaofeng Jia, Qiyong Liu, Ye Wang, Zexian Wu, Chunyu Zhang
  • Publication number: 20240037371
    Abstract: One example method includes receiving, by a machine learning (“ML”) model of a conference client application, audio signals received from a microphone of a client device, the client device connected to a virtual meeting via the conference client application, the virtual meeting hosted by a virtual conference provider; determining, by the ML model, a plurality of candidate reactions associated with the audio signals, the ML comprising a plurality of convolutional neural network (“CNN”) layers and at least one fully connected layer; selecting a reaction from the plurality of candidate reactions; and transmitting the reaction to the virtual conference provider.
    Type: Application
    Filed: July 26, 2022
    Publication date: February 1, 2024
    Applicant: Zoom Video Communications, Inc.
    Inventors: Yuhui Chen, Qiang Gao, Zhaofeng Jia, Rongrong Liu
  • Patent number: 11881945
    Abstract: An adaptive screen encoding method comprising: using a computer, creating and storing, in computer memory, a plurality of conditions for use by a server configured to determine which of picture coding type to select; detecting a current picture by a sender for a content type including textual content, graphical content, and natural image content; determining a percentage of static macroblocks corresponding to the current picture; selecting the picture coding type based on the content type, the plurality of conditions, and the percentage of static macroblocks, wherein the method is performed by one or more special-purpose computing devices.
    Type: Grant
    Filed: February 2, 2022
    Date of Patent: January 23, 2024
    Assignee: Zoom Video Communications, Inc.
    Inventors: Jing Wu, Zhaofeng Jia, Bo Ling, Qiyong Liu
  • Patent number: 11870940
    Abstract: An apparatus and/or method discloses a video conference with enhanced audio quality using high-fidelity audio sharing (“HAS”). In one embodiment, a network connection between a first user equipment (“UE”) and a second UE is established via a communication network for providing an interactive real-time meeting. After sending a first calibration audio signal from the first UE to the second UE, a second calibration audio signal is retuned from the second UE to the first UE according to the first calibration audio signal. Upon identifying a far end audio (“FEA”) delay based on the first calibration audio signal and the second calibration audio signal, a first mixed audio data containing the first shared audio data and first FEA data is fetched from an audio buffer. The first FEA data is subsequently removed or extracted from the mixed audio data in response to the FEA delay.
    Type: Grant
    Filed: June 7, 2021
    Date of Patent: January 9, 2024
    Assignee: Zoom Video Communications, Inc.
    Inventors: Zhaofeng Jia, Huipin Zhang
  • Patent number: 11847999
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for generating echo recordings. The system receives, by an autoencoder, an audio signal representation that represents an audio signal and a target echo embedding that comprises information about a target room. The autoencoder comprises an encoder and a decoder. The system generates, by the encoder, a content embedding and an estimated echo embedding. The system generates, by the decoder, an echo recording representation based on the content embedding and the target echo embedding.
    Type: Grant
    Filed: October 27, 2021
    Date of Patent: December 19, 2023
    Assignee: Zoom Video Communications, Inc.
    Inventors: Zhaofeng Jia, Yang Liu, Qiyong Liu
  • Publication number: 20230353678
    Abstract: One example method for providing spatial audio in virtual conference includes receiving, at a client device from a conference provider, an audio stream associated with an audio source, the audio stream provided by a remote client device, the client device and the remote client device participating in a virtual conference hosted by the conference provider, the client device associated with a user; determining a location of the audio source in the virtual conference with respect to the user's head; generating a plurality of spatialized audio streams based on the locations of the audio source and the audio stream; and outputting the spatialized audio streams.
    Type: Application
    Filed: August 29, 2022
    Publication date: November 2, 2023
    Applicant: Zoom Video Communications, Inc.
    Inventors: Zhaofeng JIA, Rui Li, Qiyong Liu, Mengfan Zhang
  • Publication number: 20230282225
    Abstract: Online audio and video conference applications can utilize a noise removal module to eliminate unwanted audio from a participant’s speech. A noise removal module can rely on differentiating between human speech versus other audio to filter out noise. However, in some conference environments, participant and non-participant human speech can be present. Artificial intelligence models can be trained to detect both noise and non-participant audio, based on a variety of factors. The models can label captured audio and various noise removal modules can filter noise based on the output of the models.
    Type: Application
    Filed: March 22, 2022
    Publication date: September 7, 2023
    Inventors: Jiachuan Deng, Cheng-Lun Hu, Zhaofeng Jia, Qiyong Liu, Qi Yang
  • Publication number: 20230110255
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media relate to a method for audio super resolution. The system receives an audio signal. When the sampling rate of the audio signal is below a sampling rate threshold or the frequency range of the audio signal is below a frequency range threshold, the audio signal is input to an audio super resolution model comprising a machine learning model. The audio signal is processed by the audio super resolution model to generate a synthetic audio signal with a wider frequency range than the frequency range of the audio signal.
    Type: Application
    Filed: October 31, 2021
    Publication date: April 13, 2023
    Inventors: Yuhui Chen, Zhaofeng Jia, Qiyong Liu, Zhengwei Wei
  • Publication number: 20230100986
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for generating echo recordings. The system receives, by an autoencoder, an audio signal representation that represents an audio signal and a target echo embedding that comprises information about a target room. The autoencoder comprises an encoder and a decoder. The system generates, by the encoder, a content embedding and an estimated echo embedding. The system generates, by the decoder, an echo recording representation based on the content embedding and the target echo embedding.
    Type: Application
    Filed: October 27, 2021
    Publication date: March 30, 2023
    Inventors: Zhaofeng Jia, Yang Liu, Qiyong Liu
  • Publication number: 20230096565
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media relate to a method for acoustic echo cancellation. The system inputs one or more signal representations into an acoustic echo cancellation network comprising one or more network blocks to generate a mask, each network block comprising one or more convolutional blocks, each convolutional block comprising one or more neural networks. The system combines the mask and a near-end audio signal representation to generate an echo-cancelled audio signal representation. The system generates an echo-cancelled audio signal based on the echo-cancelled audio signal representation.
    Type: Application
    Filed: October 27, 2021
    Publication date: March 30, 2023
    Inventors: Zhaofeng Jia, Yang Liu, Qiyong Liu
  • Publication number: 20220255666
    Abstract: An adaptive screen encoding method comprising: using a computer, creating and storing, in computer memory, a plurality of conditions for use by a server configured to determine which of picture coding type to select; detecting a current picture by a sender for a content type including textual content, graphical content, and natural image content; determining a percentage of static macroblocks corresponding to the current picture; selecting the picture coding type based on the content type, the plurality of conditions, and the percentage of static macroblocks, wherein the method is performed by one or more special-purpose computing devices.
    Type: Application
    Filed: February 2, 2022
    Publication date: August 11, 2022
    Inventors: Jing Wu, Zhaofeng Jia, Bo Ling, Qiyong Liu
  • Patent number: 11277227
    Abstract: An adaptive screen encoding method comprising: using a computer, creating and storing, in computer memory, a plurality of conditions for use by a server configured to determine which of picture coding type to select; detecting a current picture by a sender for a content type including textual content, graphical content, and natural image content; determining a percentage of static macroblocks corresponding to the current picture; selecting the picture coding type based on the content type, the plurality of conditions, and the percentage of static macroblocks, wherein the method is performed by one or more special-purpose computing devices.
    Type: Grant
    Filed: February 21, 2020
    Date of Patent: March 15, 2022
    Assignee: Zoom Video Communications, Inc.
    Inventors: Jing Wu, Zhaofeng Jia, Bo Ling, Qiyong Liu
  • Publication number: 20210297534
    Abstract: An apparatus and/or method discloses a video conference with enhanced audio quality using high-fidelity audio sharing (“HAS”). In one embodiment, a network connection between a first user equipment (“UE”) and a second UE is established via a communication network for providing an interactive real-time meeting. After sending a first calibration audio signal from the first UE to the second UE, a second calibration audio signal is retuned from the second UE to the first UE according to the first calibration audio signal. Upon identifying a far end audio (“FEA”) delay based on the first calibration audio signal and the second calibration audio signal, a first mixed audio data containing the first shared audio data and first FEA data is fetched from an audio buffer. The first FEA data is subsequently removed or extracted from the mixed audio data in response to the FEA delay.
    Type: Application
    Filed: June 7, 2021
    Publication date: September 23, 2021
    Applicant: Zoom Video Communications, Inc.
    Inventors: Zhaofeng Jia, Hulpin Zhang
  • Patent number: 11039015
    Abstract: An apparatus and/or method discloses a video conference with enhanced audio quality using high-fidelity audio sharing (“HAS”). In one embodiment, a network connection between a first user equipment (“UE”) and a second UE is established via a communication network for providing an interactive real-time meeting. After sending a first calibration audio signal from the first UE to the second UE, a second calibration audio signal is retuned from the second UE to the first UE according to the first calibration audio signal. Upon identifying a far end audio (“FEA”) delay based on the first calibration audio signal and the second calibration audio signal, a first mixed audio data containing the first shared audio data and first FEA data is fetched from an audio buffer. The first FEA data is subsequently removed or extracted from the mixed audio data in response to the FEA delay.
    Type: Grant
    Filed: March 19, 2020
    Date of Patent: June 15, 2021
    Assignee: Zoom Video Communications, Inc.
    Inventors: Zhaofeng Jia, Huipin Zhang