Patents by Inventor Yuanxing MA

Yuanxing MA has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

METHODS, APPARATUS AND SYSTEMS FOR USER GENERATED CONTENT CAPTURE AND ADAPTIVE RENDERING

Publication number: 20250218450

Abstract: Methods of processing audio data relating to user generated content are described. One method includes obtaining the audio data; applying frame-wise audio enhancement to the audio data; generating metadata for the enhanced audio data, based on one or more processing parameters of the frame-wise audio enhancement; and outputting the enhanced audio data together with the metadata. Another method includes obtaining the audio data and metadata for the audio data, wherein the metadata comprises first metadata indicative of one or more processing parameters of a previous frame-wise audio enhancement of the audio data; applying restore processing to the audio data, using the one or more processing parameters, to at least partially reverse the previous frame-wise audio enhancement; and applying frame-wise audio enhancement or editing processing to the restored raw audio data. Further described are corresponding apparatus, programs, and computer-readable storage media.

Type: Application

Filed: April 3, 2023

Publication date: July 3, 2025

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Yuanxing MA, Zhiwei SHUANG, Yang LIU
METHOD AND AUDIO PROCESSING SYSTEM FOR WIND NOISE SUPPRESSION

Publication number: 20250191601

Abstract: The present disclosure relates to a method and system (1) for suppressing wind noise. The method comprises obtaining an input audio signal (100, 100?) comprising a plurality of consecutive audio signal segments (101, 102, 103, 101?, 102?, 103?) and suppressing wind noise in the input audio signal with a wind noise suppressor module (20) to generate a wind noise reduced audio signal. The method further comprises sing a neural network (10) trained to predict a set of gains for reducing noise in the input audio signal (100, 100?) given samples of the input audio signal (100, 100?), wherein a noise reduced audio signal is formed by applying said set of gains to the input audio signal (100, 100?) and mixing the wind noise reduced audio signal and the noise reduced audio signal with a mixer (30) to obtain an output audio signal with suppressed wind noise.

Type: Application

Filed: March 8, 2023

Publication date: June 12, 2025

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Qingyuan Bin, Yuanxing Ma, Zhiwei Shuang
DETECTION AND ENHANCEMENT OF SPEECH IN BINAURAL RECORDINGS

Publication number: 20250078858

Abstract: Disclosed herein are method, systems, and computer-program products for segmenting a binaural recording of speech into parts containing self-speech and parts containing external speech, and processing each category with different settings, to obtain an enhanced overall presentation. The segmentation is based on a combination of: i) feature-based frame-by-frame classification, and ii) detecting dissimilarity by statistical methods. The segmentation information is then used by a speech enhancement chain, where independent settings are used to process the self- and external speech parts.

Type: Application

Filed: January 12, 2022

Publication date: March 6, 2025

Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB

Inventors: Giulio CENGARLE, Yuanxing MA
SOURCE SEPARATION AND REMIXING IN SIGNAL PROCESSING

Publication number: 20250046328

Abstract: The present disclosure relates to a method and audio processing system (1) for performing source separation. The method comprises obtaining (S1) an audio signal (Sin) including a mixture of speech content and noise content, determining (S2a, S2b, S2c), from the audio signal, speech content (formula A), stationary noise content (formula C) and non-speech content (formula B). The stationary noise content (formula C) is a true subset of the non-speech content (formula B) and the method further comprises determining (S3), based on a difference between the stationary noise content (formula C) and the non-speech content (formula B) a non-stationary noise content formula D), obtaining (S5) a set of weighting factors and forming (S6) a processed audio signal based on a combination of the speech content (formula A), the stationary noise content (formula C), and the non-stationary noise content (formula D) weighted with their respective weighting factor.

Type: Application

Filed: October 26, 2022

Publication date: February 6, 2025

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Jundai SUN, Zhiwei SHUANG, Yuanxing MA
METHOD FOR NEURAL NETWORK TRAINING WITH MULTIPLE SUPERVISORS

Publication number: 20250045585

Abstract: The present disclosure relates to a method for designing a processor (20) and a computer implemented neural network. The method comprises obtaining input data and corresponding ground truth target data and providing the input data to a processor (20) for outputting a first prediction of target data given the input data. The method further comprises providing the latent variables output by a processor module (21: 1, 21: 2, . . . 21: n?1) to a supervisor module (22: 1, 22: 2, 22: 3, . . . 22: n?1) which outputs a second prediction of target data based on latent variables and determining a first and second loss measure by comparing the predictions of target data with the ground truth target data. The method further comprises training the processor (20) and the supervisor module (22: 1, 22: 2, 22: 3, . . . 22: n?1) based on the first and second loss measure and adjusting the processor by at least one of removing, replacing and adding a processor module.

Type: Application

Filed: December 8, 2022

Publication date: February 6, 2025

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Jundai SUN, Lie LU, Zhiwei SHUANG, Yuanxing MA
AUDIO ENHANCEMENT FOR MOBILE CAPTURE

Publication number: 20250008284

Abstract: A system for real-time monitoring of user-generated audio content for audio anomaly and a related method are disclosed. In some embodiments, the system is programmed to receive, in real time, audio data generated by a first mobile device, such as a smartphone. The system is programed to determine, in real time, whether an audio anomaly has occurred from the audio data. The system is programmed to cause, in real time, a presentation of an alert to a second mobile device, which could be the same smartphone, in response to detecting an occurrence of audio anomaly.

Type: Application

Filed: September 7, 2022

Publication date: January 2, 2025

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Kai Li, Hao Luo, Lei Gan, Xu Li, Weiwei Wen, Yuanxing Ma
IMPROVING PERCEPTUAL QUALITY OF DEREVERBERATION

Publication number: 20240170001

Abstract: A method for reverberation suppression may involve receiving an input audio signal. The method may involve calculating an initial reverberation suppression gain for the input audio signal for at least one frame of the input audio signal. The method may involve calculating at least one adjusted reverberation suppression gain, where the at least one adjusted reverberation suppression gain adjusts at least one of: 1) a reverberation suppression decay based on a reverberation intensity detected in the input audio signal; 2) gains applied to different frequency bands of the input audio signal based on an amount of room resonance detected in the input audio signal; or 3) a loudness of the input audio signal based on a direct part of the input audio signal. The method may involve generating an output audio signal by applying the at least one adjusted reverberation suppression gain to the input audio signal.

Type: Application

Filed: March 9, 2022

Publication date: May 23, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Yuanxing MA, Kai LI
CONTEXT AWARE AUDIO PROCESSING

Publication number: 20240170004

Abstract: Embodiments are disclosed for context aware audio processing. In an embodiment, an audio processing method comprises: receiving, with one or more sensors of a device, environment information about an audio recording captured by the device; detecting, with at least one processor of the device, a context of the audio recording based on the audio recording and the environment information; determining, with the at least one processor, a model based on the context; processing, with the at least one processor, the audio recording based on the model to produce a processed audio recording with suppressed noise; determining, with the at least one processor, an audio processing profile based on the context; and combining, with the at least one processor, the audio recording and the processed audio recording based on the audio processing profile.

Type: Application

Filed: April 28, 2022

Publication date: May 23, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Zhiwei SHUANG, Yuanxing MA, Yang LIU
DEREVERBERATION BASED ON MEDIA TYPE

Publication number: 20240170002

Abstract: A method for reverberation suppression may involve receiving an input audio signal. The method may involve classifying a media type of the input audio signal as one of a group comprising at least: 1) speech; 2) music; or 3) speech over music. The method may involve determining whether to perform dereverberation on the input audio signal based at least on a determination that the media type of the input audio signal has been classified as speech. The method may involve generating an output audio signal by performing dereverberation on the input audio signal in response to determining that dereverberation is to be performed on the input audio signal.

Type: Application

Filed: March 10, 2022

Publication date: May 23, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Kai LI, Shaofan YANG, Yuanxing MA
CONTEXT AWARE SOUNDSCAPE CONTROL

Publication number: 20240155289

Abstract: Embodiments are disclosed for context aware soundscape control. In an embodiment, an audio processing method comprises: capturing, using a first set of microphones on a mobile device, a first audio signal from an audio scene; capturing, using a second set of microphones on a pair of earbuds, a second audio signal from the audio scene; capturing, using a camera on the mobile device, a video signal from a video scene; generating, with at least one processor, a processed audio signal from the first audio signal and the second audio signal, the processed audio signal generated with adaptive soundscape control based on context information; and combining, with the at least one processor, the processed audio signal and the captured video signal as multimedia output.

Type: Application

Filed: April 28, 2022

Publication date: May 9, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Zhiwei SHUANG, Yuanxing MA, Yang LIU
PERCEPTUAL ENHANCEMENT FOR BINAURAL AUDIO RECORDING

Publication number: 20240080608

Abstract: A method of audio processing includes capturing a binaural audio signal, calculating noise reduction gains using a machine learning model, and generating a modified binaural audio signal. The method may further including performing various corrections to the audio to account for video captured by different cameras such as a front camera and a rear camera. The method may further include performing smooth switching of the binaural audio when switching between the front camera and the rear camera. In this manner, noise may be reduced in the binaural audio, and the user perception of the combined video and binaural audio may be improved.

Type: Application

Filed: December 14, 2021

Publication date: March 7, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Yuanxing MA, Zhiwei SHUANG, Yang LIU
METHOD AND DEVICE FOR PROCESSING A BINAURAL RECORDING

Publication number: 20230360662

Abstract: The present invention relates to a method and device for processing a first and a second audio signal representing an input binaural audio signal acquired by a binaural recording device. The present invention further relates to a method for rendering a binaural audio signal on a speaker system. The method for processing a binaural signal comprising extracting audio information from the first audio signal, computing a band gain for reducing noise in the first audio signal and applying the band gains to respective frequency bands of the first audio signal in accordance with a dynamic scaling factor, to provide a first output audio signal. Wherein the dynamic scaling factor has a value between zero and one and is selected so as to reduce quality degradation for the first audio signal.

Type: Application

Filed: September 15, 2021

Publication date: November 9, 2023

Applicants: Dolby Laboratories Licensing Corporation, Dolby International AB

Inventors: Zhiwei Shuang, Yuanxing Ma, Yang Liu, Ziyu Yang, Giulio Cengarle
ADAPTING SIBILANCE DETECTION BASED ON DETECTING SPECIFIC SOUNDS IN AN AUDIO SIGNAL

Publication number: 20220383889

Abstract: A method is disclosed herein for adapting parameters of a sibilance detector. Time-frequency features are extracted from an audio signal being received and. Based on those time-frequency features, a determination is made of whether the audio signal includes a short-term feature or a long-term feature. In accordance with determining that the audio signal includes the short-term feature or the long-term feature, one or more parameters of a sibilance detector for detecting sibilance in the audio signal are adapted. Sibilance in the audio signal, is detected using the sibilance detector with the one or more adapted parameters.

Type: Application

Filed: July 16, 2020

Publication date: December 1, 2022

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Yuanxing Ma, Kai Li, Qianqian Fang
Distortion reducing multi-band compressor with dynamic thresholds based on scene switch analyzer guided distortion audibility model

Patent number: 11282533

Abstract: The present application relates to a method, system, and computer program product of dynamically adjusting thresholds of a compressor responsive to an input audio signal. A scene switch analyzer receives an input audio signal having a plurality of frequency band components. The scene switch analyzer determines whether a scene switch has occurred in the input audio signal. The frequency band components of the input audio signal are processed. In response to determine that scene switch has not occurred, a distortion audibility system applies slow smoothing to compressor thresholds of the frequency band components. In response to determine that scene switch has occurred, the distortion audibility system applies fast smoothing or no smoothing to the compressor thresholds of the frequency band components.

Type: Grant

Filed: September 26, 2019

Date of Patent: March 22, 2022

Assignee: Dolby Laboratories Licensing Corporation

Inventor: Yuanxing Ma
DISTORTION REDUCING MULTI-BAND COMPRESSOR WITH DYNAMIC THRESHOLDS BASED ON SCENE SWITCH ANALYZER GUIDED DISTORTION AUDIBILITY MODEL

Publication number: 20210343308

Abstract: The present application relates to a method, system, and computer program product of dynamically adjusting thresholds of a compressor responsive to an input audio signal. A scene switch analyzer receives an input audio signal having a plurality of frequency band components. The scene switch analyzer determines whether a scene switch has occurred in the input audio signal. The frequency band components of the input audio signal are processed. In response to determine that scene switch has not occurred, a distortion audibility system applies slow smoothing to compressor thresholds of the frequency band components. In response to determine that scene switch has occurred, the distortion audibility system applies fast smoothing or no smoothing to the compressor thresholds of the frequency band components.

Type: Application

Filed: September 26, 2019

Publication date: November 4, 2021

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor: Yuanxing MA