Patents by Inventor Chuxiang SHANG

Chuxiang SHANG has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 12159649
    Abstract: According to the embodiments of the disclosure, a multimedia processing method, device, electronic device, and storage medium are provided by obtaining a first multimedia resource; determining an initial text content corresponding to the first multimedia resource by performing speech recognition on audio data of the first multimedia resource, the audio data of the first multimedia resource comprises speech data of the initial text content; determining an invalid text content in the initial text content, the invalid text content is semantically non-informative; determining a first playing position of speech data of the invalid text content in the first multimedia resource; and cropping the first multimedia resource based on the first playing position to obtain a second multimedia resource, wherein audio data of the second multimedia resource comprises speech data of a target text content but does not comprise the speech data of the invalid text content.
    Type: Grant
    Filed: December 11, 2023
    Date of Patent: December 3, 2024
    Assignee: LEMON INC.
    Inventors: Xin Zheng, Conghui Zhu, Rui Xia, Chuxiang Shang, Dejian Zhong, Yongsen Jiang, Ming Tu, Lelai Deng
  • Publication number: 20240284100
    Abstract: An audio denoising method and device, an apparatus, a computer-readable storage medium, and a program product. The method includes: obtaining audio data to be denoised; estimating amplitude time-frequency mask of the audio data to be denoised by using a preset real-valued network model to obtain a first-order enhanced amplitude spectrum corresponding to the audio data to be denoised; estimating complex time-frequency masking of the audio data to be denoised by using a preset complex-valued network model; and determining denoising resulted audio data corresponding to the audio data to be denoised by combining the first-order enhanced amplitude spectrum with the complex time-frequency mask.
    Type: Application
    Filed: September 9, 2022
    Publication date: August 22, 2024
    Inventors: Xiaofeng SHU, Yehang ZHU, Chuxiang SHANG, Yanjie CHEN
  • Publication number: 20240105234
    Abstract: According to the embodiments of the disclosure, a multimedia processing method, device, electronic device, and storage medium are provided by obtaining a first multimedia resource; determining an initial text content corresponding to the first multimedia resource by performing speech recognition on audio data of the first multimedia resource, the audio data of the first multimedia resource comprises speech data of the initial text content; determining an invalid text content in the initial text content, the invalid text content is semantically non-informative; determining a first playing position of speech data of the invalid text content in the first multimedia resource; and cropping the first multimedia resource based on the first playing position to obtain a second multimedia resource, wherein audio data of the second multimedia resource comprises speech data of a target text content but does not comprise the speech data of the invalid text content.
    Type: Application
    Filed: December 11, 2023
    Publication date: March 28, 2024
    Inventors: Xin Zheng, Conghui Zhu, Rui Xia, Chuxiang Shang, Dejian Zhong, Yongsen Jiang, Ming Tu, Lelai Deng
  • Patent number: 11822854
    Abstract: The present disclosure relates to an automatic volume adjustment method and apparatus, a medium, and a device, which belong to the field of computer technologies, and can adjust the playback volume of audio or video. The automatic volume adjustment method includes: acquiring, in a case that a terminal does not output loudspeaker sound, a noise signal outside the terminal; determining noise energy based on the noise signal; and adjusting playback volume of audio or video on the terminal based on the noise energy.
    Type: Grant
    Filed: August 17, 2022
    Date of Patent: November 21, 2023
    Assignee: BEIJING BYTEDANCE NETWORK TECHNOLOGY CO., LTD.
    Inventors: Hequn Bai, Chuxiang Shang
  • Publication number: 20230289622
    Abstract: Provided in the present disclosure are a volume recommendation method and apparatus, a device, and a storage medium, relating to the technical field of artificial intelligence, the method comprising: acquiring features corresponding to the playback operation of any audio/video file by a user, the features reflecting the playback habits of the user; inputting the features into a user volume recommendation model and, after processing by the volume recommendation model, outputting a recommended volume for the user; the volume recommendation model is a machine learning model obtained by performing training on the basis of the corresponding relationship between features and volume settings in the historical audio/video playback behaviour of the user. The present disclosure can effectively reduce volume discomfort, enhancing the user experience.
    Type: Application
    Filed: August 10, 2021
    Publication date: September 14, 2023
    Inventor: Chuxiang SHANG