Patents by Inventor Wei-Lien Hsu
Wei-Lien Hsu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20230252970Abstract: Managing noise during an online conference session includes obtaining audio data from an endpoint participating in an online conference session. The audio data is derived from audio captured at the endpoint that includes musical sounds. The audio data is processed to identify a portion of the audio data in which a decibel level of the musical sounds is stable for a period of time. Non-musical noise present, if any, in the audio data with the musical sounds is identified and the non-musical noise is attenuated from the audio data to generate noise-reduced musical audio data. The noise-reduced musical audio data is transmitted for play out at one or more other endpoints participating in the online conference session.Type: ApplicationFiled: April 19, 2023Publication date: August 10, 2023Inventors: Bjørn Winsvold, Eric Yi-hua Chen, Wei-Lien Hsu, Pi-Hsin Liu
-
Patent number: 11688384Abstract: Managing noise during an online conference session includes obtaining audio data from an endpoint participating in an online conference session. The audio data is derived from audio captured at the endpoint that includes musical sounds. The audio data is processed to identify a portion of the audio data in which a decibel level of the musical sounds is stable for a period of time. Non-musical noise present, if any, in the audio data with the musical sounds is identified and the non-musical noise is attenuated from the audio data to generate noise-reduced musical audio data. The noise-reduced musical audio data is transmitted for play out at one or more other endpoints participating in the online conference session.Type: GrantFiled: August 14, 2020Date of Patent: June 27, 2023Assignee: CISCO TECHNOLOGY, INC.Inventors: Bjørn Winsvold, Eric Yi-hua Chen, Wei-Lien Hsu, Pi-Hsin Liu
-
Publication number: 20220051652Abstract: Managing noise during an online conference session includes obtaining audio data from an endpoint participating in an online conference session. The audio data is derived from audio captured at the endpoint that includes musical sounds. The audio data is processed to identify a portion of the audio data in which a decibel level of the musical sounds is stable for a period of time. Non-musical noise present, if any, in the audio data with the musical sounds is identified and the non-musical noise is attenuated from the audio data to generate noise-reduced musical audio data. The noise-reduced musical audio data is transmitted for play out at one or more other endpoints participating in the online conference session.Type: ApplicationFiled: August 14, 2020Publication date: February 17, 2022Inventors: Bjørn Winsvold, Eric Yi-hua Chen, Wei-Lien Hsu, Pi-Hsin Liu
-
Patent number: 11245788Abstract: Systems, methods, and devices are disclosed for detecting an active speaker in a two-way conference. Real time audio in one or more sub band domains are analyzed according to an echo canceller model. Based on the analyzed real time audio, one or more audio metrics are determined from output from an acoustic echo cancellation linear filter. The one or more audio metrics are weighted based on a priority, and a speaker status is determined based on the weighted one or more audio metrics being analyzed according to an active speaker detection model. For an active speaker status, one or more residual echo or noise is removed from the real time audio based on the one or more audio metrics.Type: GrantFiled: August 26, 2020Date of Patent: February 8, 2022Assignee: CISCO TECHNOLOGY, INC.Inventors: Fuling Liu, Eric Chen, Wei Li, Wei-Lien Hsu
-
Publication number: 20200396329Abstract: Systems, methods, and devices are disclosed for detecting an active speaker in a two-way conference. Real time audio in one or more sub band domains are analyzed according to an echo cancellor model. Based on the analyzed real time audio, one or more audio metrics are determined from output from an acoustic echo cancellation linear filter. The one or more audio metrics are weighted based on a priority, and a speaker status is determined based on the weighted one or more audio metrics being analyzed according to an active speaker detection model. For an active speaker status, one or more residual echo or noise is removed from the real time audio based on the one or more audio metrics.Type: ApplicationFiled: August 26, 2020Publication date: December 17, 2020Inventors: Fuling Liu, Eric Chen, Wei Li, Wei-Lien Hsu
-
Patent number: 10867616Abstract: This disclosure relates to solutions for eliminating undesired audio artifacts, such as background noises, on an audio channel. A process for implementing the technology can include receiving a set of audio segments, analyzing the segments using a first ML model to identify a first probability of unwanted background noises in the segments, and if the first probability exceeds a threshold, analyzing the segments using a second ML model to determine a second probability that the one or more background features exist in the segments. In some aspects, the process can include attenuating audio artifacts in the segments, if the second probability exceeds a second threshold. In some implementations, dynamic time stretching and shrinking can be applied to the noise attenuation. Systems and machine-readable media are also provided.Type: GrantFiled: October 10, 2019Date of Patent: December 15, 2020Assignee: CISCO TECHNOLOGY, INC.Inventors: Eric Chen, Asbjørn Therkelsen, Espen Moberg, Wei-Lien Hsu
-
Patent number: 10771621Abstract: Systems, methods, and devices are disclosed for detecting an active speaker in a two-way conference. Real time audio in one or more sub band domains are analyzed according to an echo cancellor model. Based on the analyzed real time audio, one or more audio metrics are determined from output from an acoustic echo cancellation linear filter. The one or more audio metrics are weighted based on a priority, and a speaker status is determined based on the weighted one or more audio metrics being analyzed according to an active speaker detection model. For an active speaker status, one or more residual echo or noise is removed from the real time audio based on the one or more audio metrics.Type: GrantFiled: April 2, 2018Date of Patent: September 8, 2020Assignee: CISCO TECHNOLOGY, INC.Inventors: Fuling Liu, Eric Chen, Wei Li, Wei-Lien Hsu
-
Publication number: 20200043509Abstract: This disclosure relates to solutions for eliminating undesired audio artifacts, such as background noises, on an audio channel. A process for implementing the technology can include receiving a set of audio segments, analyzing the segments using a first ML model to identify a first probability of unwanted background noises in the segments, and if the first probability exceeds a threshold, analyzing the segments using a second ML model to determine a second probability that the one or more background features exist in the segments. In some aspects, the process can include attenuating audio artifacts in the segments, if the second probability exceeds a second threshold. In some implementations, dynamic time stretching and shrinking can be applied to the noise attenuation. Systems and machine-readable media are also provided.Type: ApplicationFiled: October 10, 2019Publication date: February 6, 2020Inventors: Eric Chen, Asbjørn Therkelsen, Espen Moberg, Wei-Lien Hsu
-
Patent number: 10446170Abstract: This disclosure relates to solutions for eliminating undesired audio artifacts, such as background noises, on an audio channel. A process for implementing the technology can include receiving a set of audio segments, analyzing the segments using a first ML model to identify a first probability of unwanted background noises in the segments, and if the first probability exceeds a threshold, analyzing the segments using a second ML model to determine a second probability that the one or more background features exist in the segments. In some aspects, the process can include attenuating audio artifacts in the segments, if the second probability exceeds a second threshold. In some implementations, dynamic time stretching and shrinking can be applied to the noise attenuation. Systems and machine-readable media are also provided.Type: GrantFiled: June 19, 2018Date of Patent: October 15, 2019Assignee: CISCO TECHNOLOGY, INC.Inventors: Eric Chen, Asbjørn Therkelsen, Espen Moberg, Wei-Lien Hsu
-
Publication number: 20190132452Abstract: Systems, methods, and devices are disclosed for detecting an active speaker in a two-way conference. Real time audio in one or more sub band domains are analyzed according to an echo cancellor model. Based on the analyzed real time audio, one or more audio metrics are determined from output from an acoustic echo cancellation linear filter. The one or more audio metrics are weighted based on a priority, and a speaker status is determined based on the weighted one or more audio metrics being analyzed according to an active speaker detection model. For an active speaker status, one or more residual echo or noise is removed from the real time audio based on the one or more audio metrics.Type: ApplicationFiled: April 2, 2018Publication date: May 2, 2019Inventors: Fuling Liu, Eric Chen, Wei Li, Wei-Lien Hsu
-
Patent number: 9674523Abstract: Methods and apparatus for transcoding digital video data are disclosed. In an embodiment, a transcoder (300) decodes a digital video block (304) using a first coding scheme, such as 8×8 MPEG-2/4, to produce domain transformed data (306) and a motion vector (308). The transcoder (300) then estimates an energy level of each sub-block in the digital video block (304) in the frequency domain (as opposed to the spatial domain), thereby reducing or eliminating the need for motion compensation. For each sub-block with an estimated energy level below a desired threshold (e.g., likely an all-zero sub-block), the transcoder (300) transcodes the sub-block by converting the motion vector (308) from the first coding scheme (e.g., MPEG-2/4) to the second coding scheme (e.g., H.264) (e.g., convert 8×8 MPEG-2/4 vector to 4×4 H.264 vector or reuse the MPEG-2/4 vector if all four sub-blocks are AZB and coding in H.264 as an 8×8 block). The transcoded sub-block may then be used (e.g., stored or transmitted).Type: GrantFiled: November 14, 2012Date of Patent: June 6, 2017Assignee: Advanced Micro Devices, Inc.Inventor: Wei-Lien Hsu
-
Publication number: 20140133573Abstract: Methods and apparatus for transcoding digital video data are disclosed. In an embodiment, a transcoder (300) decodes a digital video block (304) using a first coding scheme, such as 8×8 MPEG-2/4, to produce domain transformed data (306) and a motion vector (308). The transcoder (300) then estimates an energy level of each sub-block in the digital video block (304) in the frequency domain (as opposed to the spatial domain), thereby reducing or eliminating the need for motion compensation. For each sub-block with an estimated energy level below a desired threshold (e.g., likely an all-zero sub-block), the transcoder (300) transcodes the sub-block by converting the motion vector (308) from the first coding scheme (e.g., MPEG-2/4) to the second coding scheme (e.g., H.264) (e.g., convert 8×8 MPEG-2/4 vector to 4×4 H.264 vector or reuse the MPEG-2/4 vector if all four sub-blocks are AZB and coding in H.264 as an 8×8 block). The transcoded sub-block may then be used (e.g., stored or transmitted).Type: ApplicationFiled: November 14, 2012Publication date: May 15, 2014Applicant: ADVANCED MICRO DEVICES, INC.Inventor: Wei-Lien Hsu
-
Publication number: 20120027091Abstract: Methods and system provide for the encoding of video frames using a plurality of processors. In one example, a first processor provides a location of a plurality of non-stationary pixels in a current frame by comparing pixel data in the current frame with corresponding pixel data in a previous frame for use by a second processor. The first processor also provides pixel data describing substantially only non-stationary pixels in the current frame, for use by the second processor. The second processor calculates motion vector data for the plurality of non-stationary pixels based on the non-stationary pixel location information and the pixel data describing substantially only non-stationary pixels. The first processor encodes the current frame using the motion vector data for the plurality of non-stationary pixels from the second processor.Type: ApplicationFiled: July 28, 2010Publication date: February 2, 2012Inventor: Wei-Lien Hsu
-
Patent number: 6973469Abstract: A method is disclosed for performing a discrete cosine transform (DCT) using a microprocessor having an instruction set that includes SIMD floating point instructions. In one embodiment, the method includes: (1) receiving a block of integer data having C columns and R rows; and (2) for each row, (a) loading the row data into registers; (b) converting the row data into floating point form so that the registers each hold two floating point row data values; and (c) using SIMD floating point instructions to perform weighted-rotation operations on the values in the registers. Suitable SIMD floating point instructions include the pswap, pfmul, and pfpnacc instructions. For the row-DCT, the data values are preferably ordered in the registers so as to permit the use of these instructions. For the column-DCT, two columns are preferably processed in parallel using SIMD instructions to improve computational efficiency.Type: GrantFiled: February 1, 2001Date of Patent: December 6, 2005Assignee: Advanced Micro Devices, Inc.Inventors: Wei-Lien Hsu, David Horton
-
Publication number: 20050141439Abstract: A method for making a voice communications link is implemented using local and remote computers, each of which operates in an instant messenger environment and is connected to a data network. The local computer is equipped with a call forwarding device for connecting to a telephone network. The method includes: a) in response to an incoming phone call placed by a calling party and received by the local computer through the call forwarding device, enabling activation of the instant messenger function of the local computer for paging the remote computer; and b) in response to an acknowledge message issued by the remote computer to accept establishment of the voice communications link and received by the local computer through the data network, activating the instant messenger function of the local and remote computers for establishing the voice communications link.Type: ApplicationFiled: December 15, 2004Publication date: June 30, 2005Inventors: Wei-Lien Hsu, Pao-Chu Chiao
-
Patent number: 6907438Abstract: A method is disclosed for performing an inverse discrete cosine transform (IDCT) using a microprocessor having an instruction set that includes SIMD floating-point instructions. In one embodiment, the method includes: (1) receiving a block of integer data having C columns and R rows; and (2) for each row, (a) loading the row data into registers; (b) converting the row data into floating-point form so that the registers each hold two floating-point row data values; and (c) using SIMD floating-point instructions to perform weighted-rotation operations on the values in the registers. Suitable SIMD floating-point instructions include the pswap, pfmul, and pfpnacc instructions. For the row-IDCT, the data values are preferably ordered in the registers so as to permit the use of these instructions. For the column-IDCT, two columns are preferably processed concurrently using SIMD instructions to improve computational efficiency.Type: GrantFiled: March 2, 2001Date of Patent: June 14, 2005Assignee: Advanced Micro Devices, Inc.Inventors: David Horton, Yi Liu, Wei-Lien Hsu
-
Patent number: 6882976Abstract: An efficient finite length POW10 calculation for MPEG audio encoding. A method for encoding an audio input signal includes storing a plurality of predetermined tonal values corresponding to a plurality of predetermined power levels. The method also includes receiving a plurality of input values each representative of a power level of a spectral component of the audio input signal at a corresponding frequency sub-band and accessing at least one corresponding tonal value of the plurality of predetermined tonal values. The method further includes generating an encoded output signal representative of the audio input signal by using at least one corresponding tonal value for each of the plurality of input values. Further, the storing of the plurality of predetermined tonal values is performed prior to the receiving of the plurality of input values.Type: GrantFiled: February 28, 2001Date of Patent: April 19, 2005Assignee: Advanced Micro Devices, Inc.Inventors: Wei-Lien Hsu, Travis Wheatley
-
Patent number: 6847684Abstract: In a multimedia compression system such as is used in compressing data in a video stream, “zero blocks” from an MPEG block array are identified prior to encoding and subjecting blocks to the process of quantized discrete cosine transform (DCT). A zero block as understood herein is an MPEG video data block that results when an unencoded block of video data is identical to an adjacent un-encoded block of video data, i.e., when there is little or no change between consecutive frames of video. The efficiency of the compression system is increased in this invention by not having to perform the functions of DCT, quantization, dequantization or inverse DCT on the identified zero blocks, but simply directly encoding zero blocks.Type: GrantFiled: June 1, 2000Date of Patent: January 25, 2005Assignee: Hewlett-Packard Development Company, L.P.Inventor: Wei-Lien Hsu
-
Patent number: 6823087Abstract: The invention, in a first aspect, is a method for mitigating edge effects in a decompressed video image. The method comprises first reads an N×N group of pixels defining a vertical edge between two blocks in a video frame row by row into N registers, wherein N is a predetermined number defining the length of a filter. The content of the N registers is then transposed and then filtered in the filter. The filtered content of the N registers is then transposed and stored back from where it was read. In other aspects, the invention is a program storage device encoded with instructions that, when executed by a computer, perform such a method; a computer programmed to perform such a method; and a computing system capable of performing such a method.Type: GrantFiled: May 15, 2001Date of Patent: November 23, 2004Assignee: Advanced Micro Devices, Inc.Inventors: Yi Liu, Wei-Lien Hsu, Frank Gorishek
-
Patent number: 6757330Abstract: An improved form of half pixel accuracy motion estimation/compensation using the MPEG recommended half pixel approach resides in method and system. In the inventive method, an interpolated reference image is created before coding so that it can be preloaded into a cache memory and used whenever needed, without having to create it each time. To avoid redundant processing during accessing of subsampled interpolated data, the interpolated image using half pixel method is partitioned into four areas. The four areas are defined based on where in a 2×2 square region the pixels fall.Type: GrantFiled: June 1, 2000Date of Patent: June 29, 2004Assignee: Hewlett-Packard Development Company, L.P.Inventor: Wei-Lien Hsu