Patents by Inventor Wei-Lien Hsu

Wei-Lien Hsu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Noise management during an online conference session

Patent number: 12266337

Abstract: Managing noise during an online conference session includes obtaining audio data from an endpoint participating in an online conference session. The audio data is derived from audio captured at the endpoint that includes musical sounds. The audio data is processed to identify a portion of the audio data in which a decibel level of the musical sounds is stable for a period of time. Non-musical noise present, if any, in the audio data with the musical sounds is identified and the non-musical noise is attenuated from the audio data to generate noise-reduced musical audio data. The noise-reduced musical audio data is transmitted for play out at one or more other endpoints participating in the online conference session.

Type: Grant

Filed: July 11, 2024

Date of Patent: April 1, 2025

Assignee: CISCO TECHNOLOGY, INC.

Inventors: Bjørn Winsvold, Eric Yi-hua Chen, Wei-Lien Hsu, Pi-Hsin Liu
NOISE MANAGEMENT DURING AN ONLINE CONFERENCE SESSION

Publication number: 20240363095

Abstract: Managing noise during an online conference session includes obtaining audio data from an endpoint participating in an online conference session. The audio data is derived from audio captured at the endpoint that includes musical sounds. The audio data is processed to identify a portion of the audio data in which a decibel level of the musical sounds is stable for a period of time. Non-musical noise present, if any, in the audio data with the musical sounds is identified and the non-musical noise is attenuated from the audio data to generate noise-reduced musical audio data. The noise-reduced musical audio data is transmitted for play out at one or more other endpoints participating in the online conference session.

Type: Application

Filed: July 11, 2024

Publication date: October 31, 2024

Inventors: Bjørn Winsvold, Eric Yi-hua Chen, Wei-Lien Hsu, Pi-Hsin Liu
Noise management during an online conference session

Patent number: 12039966

Abstract: Managing noise during an online conference session includes obtaining audio data from an endpoint participating in an online conference session. The audio data is derived from audio captured at the endpoint that includes musical sounds. The audio data is processed to identify a portion of the audio data in which a decibel level of the musical sounds is stable for a period of time. Non-musical noise present, if any, in the audio data with the musical sounds is identified and the non-musical noise is attenuated from the audio data to generate noise-reduced musical audio data. The noise-reduced musical audio data is transmitted for play out at one or more other endpoints participating in the online conference session.

Type: Grant

Filed: April 19, 2023

Date of Patent: July 16, 2024

Assignee: CISCO TECHNOLOGY, INC.

Inventors: Bjørn Winsvold, Eric Yi-hua Chen, Wei-Lien Hsu, Pi-Hsin Liu
NOISE MANAGEMENT DURING AN ONLINE CONFERENCE SESSION

Publication number: 20230252970

Abstract: Managing noise during an online conference session includes obtaining audio data from an endpoint participating in an online conference session. The audio data is derived from audio captured at the endpoint that includes musical sounds. The audio data is processed to identify a portion of the audio data in which a decibel level of the musical sounds is stable for a period of time. Non-musical noise present, if any, in the audio data with the musical sounds is identified and the non-musical noise is attenuated from the audio data to generate noise-reduced musical audio data. The noise-reduced musical audio data is transmitted for play out at one or more other endpoints participating in the online conference session.

Type: Application

Filed: April 19, 2023

Publication date: August 10, 2023

Inventors: Bjørn Winsvold, Eric Yi-hua Chen, Wei-Lien Hsu, Pi-Hsin Liu
Noise management during an online conference session

Patent number: 11688384

Abstract: Managing noise during an online conference session includes obtaining audio data from an endpoint participating in an online conference session. The audio data is derived from audio captured at the endpoint that includes musical sounds. The audio data is processed to identify a portion of the audio data in which a decibel level of the musical sounds is stable for a period of time. Non-musical noise present, if any, in the audio data with the musical sounds is identified and the non-musical noise is attenuated from the audio data to generate noise-reduced musical audio data. The noise-reduced musical audio data is transmitted for play out at one or more other endpoints participating in the online conference session.

Type: Grant

Filed: August 14, 2020

Date of Patent: June 27, 2023

Assignee: CISCO TECHNOLOGY, INC.

Inventors: Bjørn Winsvold, Eric Yi-hua Chen, Wei-Lien Hsu, Pi-Hsin Liu
NOISE MANAGEMENT DURING AN ONLINE CONFERENCE SESSION

Publication number: 20220051652

Abstract: Managing noise during an online conference session includes obtaining audio data from an endpoint participating in an online conference session. The audio data is derived from audio captured at the endpoint that includes musical sounds. The audio data is processed to identify a portion of the audio data in which a decibel level of the musical sounds is stable for a period of time. Non-musical noise present, if any, in the audio data with the musical sounds is identified and the non-musical noise is attenuated from the audio data to generate noise-reduced musical audio data. The noise-reduced musical audio data is transmitted for play out at one or more other endpoints participating in the online conference session.

Type: Application

Filed: August 14, 2020

Publication date: February 17, 2022

Inventors: Bjørn Winsvold, Eric Yi-hua Chen, Wei-Lien Hsu, Pi-Hsin Liu
Acoustic echo cancellation based sub band domain active speaker detection for audio and video conferencing applications

Patent number: 11245788

Abstract: Systems, methods, and devices are disclosed for detecting an active speaker in a two-way conference. Real time audio in one or more sub band domains are analyzed according to an echo canceller model. Based on the analyzed real time audio, one or more audio metrics are determined from output from an acoustic echo cancellation linear filter. The one or more audio metrics are weighted based on a priority, and a speaker status is determined based on the weighted one or more audio metrics being analyzed according to an active speaker detection model. For an active speaker status, one or more residual echo or noise is removed from the real time audio based on the one or more audio metrics.

Type: Grant

Filed: August 26, 2020

Date of Patent: February 8, 2022

Assignee: CISCO TECHNOLOGY, INC.

Inventors: Fuling Liu, Eric Chen, Wei Li, Wei-Lien Hsu
ACOUSTIC ECHO CANCELLATION BASED SUB BAND DOMAIN ACTIVE SPEAKER DETECTION FOR AUDIO AND VIDEO CONFERENCING APPLICATIONS

Publication number: 20200396329

Abstract: Systems, methods, and devices are disclosed for detecting an active speaker in a two-way conference. Real time audio in one or more sub band domains are analyzed according to an echo cancellor model. Based on the analyzed real time audio, one or more audio metrics are determined from output from an acoustic echo cancellation linear filter. The one or more audio metrics are weighted based on a priority, and a speaker status is determined based on the weighted one or more audio metrics being analyzed according to an active speaker detection model. For an active speaker status, one or more residual echo or noise is removed from the real time audio based on the one or more audio metrics.

Type: Application

Filed: August 26, 2020

Publication date: December 17, 2020

Inventors: Fuling Liu, Eric Chen, Wei Li, Wei-Lien Hsu
Noise mitigation using machine learning

Patent number: 10867616

Abstract: This disclosure relates to solutions for eliminating undesired audio artifacts, such as background noises, on an audio channel. A process for implementing the technology can include receiving a set of audio segments, analyzing the segments using a first ML model to identify a first probability of unwanted background noises in the segments, and if the first probability exceeds a threshold, analyzing the segments using a second ML model to determine a second probability that the one or more background features exist in the segments. In some aspects, the process can include attenuating audio artifacts in the segments, if the second probability exceeds a second threshold. In some implementations, dynamic time stretching and shrinking can be applied to the noise attenuation. Systems and machine-readable media are also provided.

Type: Grant

Filed: October 10, 2019

Date of Patent: December 15, 2020

Assignee: CISCO TECHNOLOGY, INC.

Inventors: Eric Chen, Asbjørn Therkelsen, Espen Moberg, Wei-Lien Hsu
Acoustic echo cancellation based sub band domain active speaker detection for audio and video conferencing applications

Patent number: 10771621

Abstract: Systems, methods, and devices are disclosed for detecting an active speaker in a two-way conference. Real time audio in one or more sub band domains are analyzed according to an echo cancellor model. Based on the analyzed real time audio, one or more audio metrics are determined from output from an acoustic echo cancellation linear filter. The one or more audio metrics are weighted based on a priority, and a speaker status is determined based on the weighted one or more audio metrics being analyzed according to an active speaker detection model. For an active speaker status, one or more residual echo or noise is removed from the real time audio based on the one or more audio metrics.

Type: Grant

Filed: April 2, 2018

Date of Patent: September 8, 2020

Assignee: CISCO TECHNOLOGY, INC.

Inventors: Fuling Liu, Eric Chen, Wei Li, Wei-Lien Hsu
NOISE MITIGATION USING MACHINE LEARNING

Publication number: 20200043509

Abstract: This disclosure relates to solutions for eliminating undesired audio artifacts, such as background noises, on an audio channel. A process for implementing the technology can include receiving a set of audio segments, analyzing the segments using a first ML model to identify a first probability of unwanted background noises in the segments, and if the first probability exceeds a threshold, analyzing the segments using a second ML model to determine a second probability that the one or more background features exist in the segments. In some aspects, the process can include attenuating audio artifacts in the segments, if the second probability exceeds a second threshold. In some implementations, dynamic time stretching and shrinking can be applied to the noise attenuation. Systems and machine-readable media are also provided.

Type: Application

Filed: October 10, 2019

Publication date: February 6, 2020

Inventors: Eric Chen, Asbjørn Therkelsen, Espen Moberg, Wei-Lien Hsu
Noise mitigation using machine learning

Patent number: 10446170

Abstract: This disclosure relates to solutions for eliminating undesired audio artifacts, such as background noises, on an audio channel. A process for implementing the technology can include receiving a set of audio segments, analyzing the segments using a first ML model to identify a first probability of unwanted background noises in the segments, and if the first probability exceeds a threshold, analyzing the segments using a second ML model to determine a second probability that the one or more background features exist in the segments. In some aspects, the process can include attenuating audio artifacts in the segments, if the second probability exceeds a second threshold. In some implementations, dynamic time stretching and shrinking can be applied to the noise attenuation. Systems and machine-readable media are also provided.

Type: Grant

Filed: June 19, 2018

Date of Patent: October 15, 2019

Assignee: CISCO TECHNOLOGY, INC.

Inventors: Eric Chen, Asbjørn Therkelsen, Espen Moberg, Wei-Lien Hsu
ACOUSTIC ECHO CANCELLATION BASED SUB BAND DOMAIN ACTIVE SPEAKER DETECTION FOR AUDIO AND VIDEO CONFERENCING APPLICATIONS

Publication number: 20190132452

Abstract: Systems, methods, and devices are disclosed for detecting an active speaker in a two-way conference. Real time audio in one or more sub band domains are analyzed according to an echo cancellor model. Based on the analyzed real time audio, one or more audio metrics are determined from output from an acoustic echo cancellation linear filter. The one or more audio metrics are weighted based on a priority, and a speaker status is determined based on the weighted one or more audio metrics being analyzed according to an active speaker detection model. For an active speaker status, one or more residual echo or noise is removed from the real time audio based on the one or more audio metrics.

Type: Application

Filed: April 2, 2018

Publication date: May 2, 2019

Inventors: Fuling Liu, Eric Chen, Wei Li, Wei-Lien Hsu
Methods and apparatus for transcoding digital video

Patent number: 9674523

Abstract: Methods and apparatus for transcoding digital video data are disclosed. In an embodiment, a transcoder (300) decodes a digital video block (304) using a first coding scheme, such as 8×8 MPEG-2/4, to produce domain transformed data (306) and a motion vector (308). The transcoder (300) then estimates an energy level of each sub-block in the digital video block (304) in the frequency domain (as opposed to the spatial domain), thereby reducing or eliminating the need for motion compensation. For each sub-block with an estimated energy level below a desired threshold (e.g., likely an all-zero sub-block), the transcoder (300) transcodes the sub-block by converting the motion vector (308) from the first coding scheme (e.g., MPEG-2/4) to the second coding scheme (e.g., H.264) (e.g., convert 8×8 MPEG-2/4 vector to 4×4 H.264 vector or reuse the MPEG-2/4 vector if all four sub-blocks are AZB and coding in H.264 as an 8×8 block). The transcoded sub-block may then be used (e.g., stored or transmitted).

Type: Grant

Filed: November 14, 2012

Date of Patent: June 6, 2017

Assignee: Advanced Micro Devices, Inc.

Inventor: Wei-Lien Hsu
METHODS AND APPARATUS FOR TRANSCODING DIGITAL VIDEO DATA

Publication number: 20140133573

Abstract: Methods and apparatus for transcoding digital video data are disclosed. In an embodiment, a transcoder (300) decodes a digital video block (304) using a first coding scheme, such as 8×8 MPEG-2/4, to produce domain transformed data (306) and a motion vector (308). The transcoder (300) then estimates an energy level of each sub-block in the digital video block (304) in the frequency domain (as opposed to the spatial domain), thereby reducing or eliminating the need for motion compensation. For each sub-block with an estimated energy level below a desired threshold (e.g., likely an all-zero sub-block), the transcoder (300) transcodes the sub-block by converting the motion vector (308) from the first coding scheme (e.g., MPEG-2/4) to the second coding scheme (e.g., H.264) (e.g., convert 8×8 MPEG-2/4 vector to 4×4 H.264 vector or reuse the MPEG-2/4 vector if all four sub-blocks are AZB and coding in H.264 as an 8×8 block). The transcoded sub-block may then be used (e.g., stored or transmitted).

Type: Application

Filed: November 14, 2012

Publication date: May 15, 2014

Applicant: ADVANCED MICRO DEVICES, INC.

Inventor: Wei-Lien Hsu
Method and System for Encoding Video Frames Using a Plurality of Processors

Publication number: 20120027091

Abstract: Methods and system provide for the encoding of video frames using a plurality of processors. In one example, a first processor provides a location of a plurality of non-stationary pixels in a current frame by comparing pixel data in the current frame with corresponding pixel data in a previous frame for use by a second processor. The first processor also provides pixel data describing substantially only non-stationary pixels in the current frame, for use by the second processor. The second processor calculates motion vector data for the plurality of non-stationary pixels based on the non-stationary pixel location information and the pixel data describing substantially only non-stationary pixels. The first processor encodes the current frame using the motion vector data for the plurality of non-stationary pixels from the second processor.

Type: Application

Filed: July 28, 2010

Publication date: February 2, 2012

Inventor: Wei-Lien Hsu
Two-dimensional discrete cosine transform using SIMD instructions

Patent number: 6973469

Abstract: A method is disclosed for performing a discrete cosine transform (DCT) using a microprocessor having an instruction set that includes SIMD floating point instructions. In one embodiment, the method includes: (1) receiving a block of integer data having C columns and R rows; and (2) for each row, (a) loading the row data into registers; (b) converting the row data into floating point form so that the registers each hold two floating point row data values; and (c) using SIMD floating point instructions to perform weighted-rotation operations on the values in the registers. Suitable SIMD floating point instructions include the pswap, pfmul, and pfpnacc instructions. For the row-DCT, the data values are preferably ordered in the registers so as to permit the use of these instructions. For the column-DCT, two columns are preferably processed in parallel using SIMD instructions to improve computational efficiency.

Type: Grant

Filed: February 1, 2001

Date of Patent: December 6, 2005

Assignee: Advanced Micro Devices, Inc.

Inventors: Wei-Lien Hsu, David Horton
Method for enabling the user of a local computer operating in an instant messenger environment to make a voice communications link

Publication number: 20050141439

Abstract: A method for making a voice communications link is implemented using local and remote computers, each of which operates in an instant messenger environment and is connected to a data network. The local computer is equipped with a call forwarding device for connecting to a telephone network. The method includes: a) in response to an incoming phone call placed by a calling party and received by the local computer through the call forwarding device, enabling activation of the instant messenger function of the local computer for paging the remote computer; and b) in response to an acknowledge message issued by the remote computer to accept establishment of the voice communications link and received by the local computer through the data network, activating the instant messenger function of the local and remote computers for establishing the voice communications link.

Type: Application

Filed: December 15, 2004

Publication date: June 30, 2005

Inventors: Wei-Lien Hsu, Pao-Chu Chiao
Two-dimensional inverse discrete cosine transform using SIMD instructions

Patent number: 6907438

Abstract: A method is disclosed for performing an inverse discrete cosine transform (IDCT) using a microprocessor having an instruction set that includes SIMD floating-point instructions. In one embodiment, the method includes: (1) receiving a block of integer data having C columns and R rows; and (2) for each row, (a) loading the row data into registers; (b) converting the row data into floating-point form so that the registers each hold two floating-point row data values; and (c) using SIMD floating-point instructions to perform weighted-rotation operations on the values in the registers. Suitable SIMD floating-point instructions include the pswap, pfmul, and pfpnacc instructions. For the row-IDCT, the data values are preferably ordered in the registers so as to permit the use of these instructions. For the column-IDCT, two columns are preferably processed concurrently using SIMD instructions to improve computational efficiency.

Type: Grant

Filed: March 2, 2001

Date of Patent: June 14, 2005

Assignee: Advanced Micro Devices, Inc.

Inventors: David Horton, Yi Liu, Wei-Lien Hsu
Efficient finite length POW10 calculation for MPEG audio encoding

Patent number: 6882976

Abstract: An efficient finite length POW10 calculation for MPEG audio encoding. A method for encoding an audio input signal includes storing a plurality of predetermined tonal values corresponding to a plurality of predetermined power levels. The method also includes receiving a plurality of input values each representative of a power level of a spectral component of the audio input signal at a corresponding frequency sub-band and accessing at least one corresponding tonal value of the plurality of predetermined tonal values. The method further includes generating an encoded output signal representative of the audio input signal by using at least one corresponding tonal value for each of the plurality of input values. Further, the storing of the plurality of predetermined tonal values is performed prior to the receiving of the plurality of input values.

Type: Grant

Filed: February 28, 2001

Date of Patent: April 19, 2005

Assignee: Advanced Micro Devices, Inc.

Inventors: Wei-Lien Hsu, Travis Wheatley

1 2 next