Patents by Inventor Wei-Lien Hsu

Wei-Lien Hsu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20230252970
    Abstract: Managing noise during an online conference session includes obtaining audio data from an endpoint participating in an online conference session. The audio data is derived from audio captured at the endpoint that includes musical sounds. The audio data is processed to identify a portion of the audio data in which a decibel level of the musical sounds is stable for a period of time. Non-musical noise present, if any, in the audio data with the musical sounds is identified and the non-musical noise is attenuated from the audio data to generate noise-reduced musical audio data. The noise-reduced musical audio data is transmitted for play out at one or more other endpoints participating in the online conference session.
    Type: Application
    Filed: April 19, 2023
    Publication date: August 10, 2023
    Inventors: Bjørn Winsvold, Eric Yi-hua Chen, Wei-Lien Hsu, Pi-Hsin Liu
  • Patent number: 11688384
    Abstract: Managing noise during an online conference session includes obtaining audio data from an endpoint participating in an online conference session. The audio data is derived from audio captured at the endpoint that includes musical sounds. The audio data is processed to identify a portion of the audio data in which a decibel level of the musical sounds is stable for a period of time. Non-musical noise present, if any, in the audio data with the musical sounds is identified and the non-musical noise is attenuated from the audio data to generate noise-reduced musical audio data. The noise-reduced musical audio data is transmitted for play out at one or more other endpoints participating in the online conference session.
    Type: Grant
    Filed: August 14, 2020
    Date of Patent: June 27, 2023
    Assignee: CISCO TECHNOLOGY, INC.
    Inventors: Bjørn Winsvold, Eric Yi-hua Chen, Wei-Lien Hsu, Pi-Hsin Liu
  • Publication number: 20220051652
    Abstract: Managing noise during an online conference session includes obtaining audio data from an endpoint participating in an online conference session. The audio data is derived from audio captured at the endpoint that includes musical sounds. The audio data is processed to identify a portion of the audio data in which a decibel level of the musical sounds is stable for a period of time. Non-musical noise present, if any, in the audio data with the musical sounds is identified and the non-musical noise is attenuated from the audio data to generate noise-reduced musical audio data. The noise-reduced musical audio data is transmitted for play out at one or more other endpoints participating in the online conference session.
    Type: Application
    Filed: August 14, 2020
    Publication date: February 17, 2022
    Inventors: Bjørn Winsvold, Eric Yi-hua Chen, Wei-Lien Hsu, Pi-Hsin Liu
  • Patent number: 11245788
    Abstract: Systems, methods, and devices are disclosed for detecting an active speaker in a two-way conference. Real time audio in one or more sub band domains are analyzed according to an echo canceller model. Based on the analyzed real time audio, one or more audio metrics are determined from output from an acoustic echo cancellation linear filter. The one or more audio metrics are weighted based on a priority, and a speaker status is determined based on the weighted one or more audio metrics being analyzed according to an active speaker detection model. For an active speaker status, one or more residual echo or noise is removed from the real time audio based on the one or more audio metrics.
    Type: Grant
    Filed: August 26, 2020
    Date of Patent: February 8, 2022
    Assignee: CISCO TECHNOLOGY, INC.
    Inventors: Fuling Liu, Eric Chen, Wei Li, Wei-Lien Hsu
  • Publication number: 20200396329
    Abstract: Systems, methods, and devices are disclosed for detecting an active speaker in a two-way conference. Real time audio in one or more sub band domains are analyzed according to an echo cancellor model. Based on the analyzed real time audio, one or more audio metrics are determined from output from an acoustic echo cancellation linear filter. The one or more audio metrics are weighted based on a priority, and a speaker status is determined based on the weighted one or more audio metrics being analyzed according to an active speaker detection model. For an active speaker status, one or more residual echo or noise is removed from the real time audio based on the one or more audio metrics.
    Type: Application
    Filed: August 26, 2020
    Publication date: December 17, 2020
    Inventors: Fuling Liu, Eric Chen, Wei Li, Wei-Lien Hsu
  • Patent number: 10867616
    Abstract: This disclosure relates to solutions for eliminating undesired audio artifacts, such as background noises, on an audio channel. A process for implementing the technology can include receiving a set of audio segments, analyzing the segments using a first ML model to identify a first probability of unwanted background noises in the segments, and if the first probability exceeds a threshold, analyzing the segments using a second ML model to determine a second probability that the one or more background features exist in the segments. In some aspects, the process can include attenuating audio artifacts in the segments, if the second probability exceeds a second threshold. In some implementations, dynamic time stretching and shrinking can be applied to the noise attenuation. Systems and machine-readable media are also provided.
    Type: Grant
    Filed: October 10, 2019
    Date of Patent: December 15, 2020
    Assignee: CISCO TECHNOLOGY, INC.
    Inventors: Eric Chen, Asbjørn Therkelsen, Espen Moberg, Wei-Lien Hsu
  • Patent number: 10771621
    Abstract: Systems, methods, and devices are disclosed for detecting an active speaker in a two-way conference. Real time audio in one or more sub band domains are analyzed according to an echo cancellor model. Based on the analyzed real time audio, one or more audio metrics are determined from output from an acoustic echo cancellation linear filter. The one or more audio metrics are weighted based on a priority, and a speaker status is determined based on the weighted one or more audio metrics being analyzed according to an active speaker detection model. For an active speaker status, one or more residual echo or noise is removed from the real time audio based on the one or more audio metrics.
    Type: Grant
    Filed: April 2, 2018
    Date of Patent: September 8, 2020
    Assignee: CISCO TECHNOLOGY, INC.
    Inventors: Fuling Liu, Eric Chen, Wei Li, Wei-Lien Hsu
  • Publication number: 20200043509
    Abstract: This disclosure relates to solutions for eliminating undesired audio artifacts, such as background noises, on an audio channel. A process for implementing the technology can include receiving a set of audio segments, analyzing the segments using a first ML model to identify a first probability of unwanted background noises in the segments, and if the first probability exceeds a threshold, analyzing the segments using a second ML model to determine a second probability that the one or more background features exist in the segments. In some aspects, the process can include attenuating audio artifacts in the segments, if the second probability exceeds a second threshold. In some implementations, dynamic time stretching and shrinking can be applied to the noise attenuation. Systems and machine-readable media are also provided.
    Type: Application
    Filed: October 10, 2019
    Publication date: February 6, 2020
    Inventors: Eric Chen, Asbjørn Therkelsen, Espen Moberg, Wei-Lien Hsu
  • Patent number: 10446170
    Abstract: This disclosure relates to solutions for eliminating undesired audio artifacts, such as background noises, on an audio channel. A process for implementing the technology can include receiving a set of audio segments, analyzing the segments using a first ML model to identify a first probability of unwanted background noises in the segments, and if the first probability exceeds a threshold, analyzing the segments using a second ML model to determine a second probability that the one or more background features exist in the segments. In some aspects, the process can include attenuating audio artifacts in the segments, if the second probability exceeds a second threshold. In some implementations, dynamic time stretching and shrinking can be applied to the noise attenuation. Systems and machine-readable media are also provided.
    Type: Grant
    Filed: June 19, 2018
    Date of Patent: October 15, 2019
    Assignee: CISCO TECHNOLOGY, INC.
    Inventors: Eric Chen, Asbjørn Therkelsen, Espen Moberg, Wei-Lien Hsu
  • Publication number: 20190132452
    Abstract: Systems, methods, and devices are disclosed for detecting an active speaker in a two-way conference. Real time audio in one or more sub band domains are analyzed according to an echo cancellor model. Based on the analyzed real time audio, one or more audio metrics are determined from output from an acoustic echo cancellation linear filter. The one or more audio metrics are weighted based on a priority, and a speaker status is determined based on the weighted one or more audio metrics being analyzed according to an active speaker detection model. For an active speaker status, one or more residual echo or noise is removed from the real time audio based on the one or more audio metrics.
    Type: Application
    Filed: April 2, 2018
    Publication date: May 2, 2019
    Inventors: Fuling Liu, Eric Chen, Wei Li, Wei-Lien Hsu
  • Patent number: 9674523
    Abstract: Methods and apparatus for transcoding digital video data are disclosed. In an embodiment, a transcoder (300) decodes a digital video block (304) using a first coding scheme, such as 8×8 MPEG-2/4, to produce domain transformed data (306) and a motion vector (308). The transcoder (300) then estimates an energy level of each sub-block in the digital video block (304) in the frequency domain (as opposed to the spatial domain), thereby reducing or eliminating the need for motion compensation. For each sub-block with an estimated energy level below a desired threshold (e.g., likely an all-zero sub-block), the transcoder (300) transcodes the sub-block by converting the motion vector (308) from the first coding scheme (e.g., MPEG-2/4) to the second coding scheme (e.g., H.264) (e.g., convert 8×8 MPEG-2/4 vector to 4×4 H.264 vector or reuse the MPEG-2/4 vector if all four sub-blocks are AZB and coding in H.264 as an 8×8 block). The transcoded sub-block may then be used (e.g., stored or transmitted).
    Type: Grant
    Filed: November 14, 2012
    Date of Patent: June 6, 2017
    Assignee: Advanced Micro Devices, Inc.
    Inventor: Wei-Lien Hsu
  • Publication number: 20140133573
    Abstract: Methods and apparatus for transcoding digital video data are disclosed. In an embodiment, a transcoder (300) decodes a digital video block (304) using a first coding scheme, such as 8×8 MPEG-2/4, to produce domain transformed data (306) and a motion vector (308). The transcoder (300) then estimates an energy level of each sub-block in the digital video block (304) in the frequency domain (as opposed to the spatial domain), thereby reducing or eliminating the need for motion compensation. For each sub-block with an estimated energy level below a desired threshold (e.g., likely an all-zero sub-block), the transcoder (300) transcodes the sub-block by converting the motion vector (308) from the first coding scheme (e.g., MPEG-2/4) to the second coding scheme (e.g., H.264) (e.g., convert 8×8 MPEG-2/4 vector to 4×4 H.264 vector or reuse the MPEG-2/4 vector if all four sub-blocks are AZB and coding in H.264 as an 8×8 block). The transcoded sub-block may then be used (e.g., stored or transmitted).
    Type: Application
    Filed: November 14, 2012
    Publication date: May 15, 2014
    Applicant: ADVANCED MICRO DEVICES, INC.
    Inventor: Wei-Lien Hsu
  • Publication number: 20120027091
    Abstract: Methods and system provide for the encoding of video frames using a plurality of processors. In one example, a first processor provides a location of a plurality of non-stationary pixels in a current frame by comparing pixel data in the current frame with corresponding pixel data in a previous frame for use by a second processor. The first processor also provides pixel data describing substantially only non-stationary pixels in the current frame, for use by the second processor. The second processor calculates motion vector data for the plurality of non-stationary pixels based on the non-stationary pixel location information and the pixel data describing substantially only non-stationary pixels. The first processor encodes the current frame using the motion vector data for the plurality of non-stationary pixels from the second processor.
    Type: Application
    Filed: July 28, 2010
    Publication date: February 2, 2012
    Inventor: Wei-Lien Hsu
  • Patent number: 6973469
    Abstract: A method is disclosed for performing a discrete cosine transform (DCT) using a microprocessor having an instruction set that includes SIMD floating point instructions. In one embodiment, the method includes: (1) receiving a block of integer data having C columns and R rows; and (2) for each row, (a) loading the row data into registers; (b) converting the row data into floating point form so that the registers each hold two floating point row data values; and (c) using SIMD floating point instructions to perform weighted-rotation operations on the values in the registers. Suitable SIMD floating point instructions include the pswap, pfmul, and pfpnacc instructions. For the row-DCT, the data values are preferably ordered in the registers so as to permit the use of these instructions. For the column-DCT, two columns are preferably processed in parallel using SIMD instructions to improve computational efficiency.
    Type: Grant
    Filed: February 1, 2001
    Date of Patent: December 6, 2005
    Assignee: Advanced Micro Devices, Inc.
    Inventors: Wei-Lien Hsu, David Horton
  • Publication number: 20050141439
    Abstract: A method for making a voice communications link is implemented using local and remote computers, each of which operates in an instant messenger environment and is connected to a data network. The local computer is equipped with a call forwarding device for connecting to a telephone network. The method includes: a) in response to an incoming phone call placed by a calling party and received by the local computer through the call forwarding device, enabling activation of the instant messenger function of the local computer for paging the remote computer; and b) in response to an acknowledge message issued by the remote computer to accept establishment of the voice communications link and received by the local computer through the data network, activating the instant messenger function of the local and remote computers for establishing the voice communications link.
    Type: Application
    Filed: December 15, 2004
    Publication date: June 30, 2005
    Inventors: Wei-Lien Hsu, Pao-Chu Chiao
  • Patent number: 6907438
    Abstract: A method is disclosed for performing an inverse discrete cosine transform (IDCT) using a microprocessor having an instruction set that includes SIMD floating-point instructions. In one embodiment, the method includes: (1) receiving a block of integer data having C columns and R rows; and (2) for each row, (a) loading the row data into registers; (b) converting the row data into floating-point form so that the registers each hold two floating-point row data values; and (c) using SIMD floating-point instructions to perform weighted-rotation operations on the values in the registers. Suitable SIMD floating-point instructions include the pswap, pfmul, and pfpnacc instructions. For the row-IDCT, the data values are preferably ordered in the registers so as to permit the use of these instructions. For the column-IDCT, two columns are preferably processed concurrently using SIMD instructions to improve computational efficiency.
    Type: Grant
    Filed: March 2, 2001
    Date of Patent: June 14, 2005
    Assignee: Advanced Micro Devices, Inc.
    Inventors: David Horton, Yi Liu, Wei-Lien Hsu
  • Patent number: 6882976
    Abstract: An efficient finite length POW10 calculation for MPEG audio encoding. A method for encoding an audio input signal includes storing a plurality of predetermined tonal values corresponding to a plurality of predetermined power levels. The method also includes receiving a plurality of input values each representative of a power level of a spectral component of the audio input signal at a corresponding frequency sub-band and accessing at least one corresponding tonal value of the plurality of predetermined tonal values. The method further includes generating an encoded output signal representative of the audio input signal by using at least one corresponding tonal value for each of the plurality of input values. Further, the storing of the plurality of predetermined tonal values is performed prior to the receiving of the plurality of input values.
    Type: Grant
    Filed: February 28, 2001
    Date of Patent: April 19, 2005
    Assignee: Advanced Micro Devices, Inc.
    Inventors: Wei-Lien Hsu, Travis Wheatley
  • Patent number: 6847684
    Abstract: In a multimedia compression system such as is used in compressing data in a video stream, “zero blocks” from an MPEG block array are identified prior to encoding and subjecting blocks to the process of quantized discrete cosine transform (DCT). A zero block as understood herein is an MPEG video data block that results when an unencoded block of video data is identical to an adjacent un-encoded block of video data, i.e., when there is little or no change between consecutive frames of video. The efficiency of the compression system is increased in this invention by not having to perform the functions of DCT, quantization, dequantization or inverse DCT on the identified zero blocks, but simply directly encoding zero blocks.
    Type: Grant
    Filed: June 1, 2000
    Date of Patent: January 25, 2005
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventor: Wei-Lien Hsu
  • Patent number: 6823087
    Abstract: The invention, in a first aspect, is a method for mitigating edge effects in a decompressed video image. The method comprises first reads an N×N group of pixels defining a vertical edge between two blocks in a video frame row by row into N registers, wherein N is a predetermined number defining the length of a filter. The content of the N registers is then transposed and then filtered in the filter. The filtered content of the N registers is then transposed and stored back from where it was read. In other aspects, the invention is a program storage device encoded with instructions that, when executed by a computer, perform such a method; a computer programmed to perform such a method; and a computing system capable of performing such a method.
    Type: Grant
    Filed: May 15, 2001
    Date of Patent: November 23, 2004
    Assignee: Advanced Micro Devices, Inc.
    Inventors: Yi Liu, Wei-Lien Hsu, Frank Gorishek
  • Patent number: 6757330
    Abstract: An improved form of half pixel accuracy motion estimation/compensation using the MPEG recommended half pixel approach resides in method and system. In the inventive method, an interpolated reference image is created before coding so that it can be preloaded into a cache memory and used whenever needed, without having to create it each time. To avoid redundant processing during accessing of subsampled interpolated data, the interpolated image using half pixel method is partitioned into four areas. The four areas are defined based on where in a 2×2 square region the pixels fall.
    Type: Grant
    Filed: June 1, 2000
    Date of Patent: June 29, 2004
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventor: Wei-Lien Hsu