Patents by Inventor Haohai Sun

Haohai Sun has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 9686605
    Abstract: A video conference endpoint detects a face and determines a face angle of the detected face relative to a reference direction based on images captured with a camera. The endpoint determines an angle of arrival of sound (i.e., a sound angle) received at a microphone array that transduces the sound relative to the reference direction based on the transduced sound and a sound speed parameter indicative of a speed of sound in air. The endpoint compares the face angle against the sound angle, and adjusts the sound speed parameter so as to reduce the angle difference if the compare indicates an angle difference greater than zero between the face and sound angles.
    Type: Grant
    Filed: May 20, 2014
    Date of Patent: June 20, 2017
    Assignee: Cisco Technology, Inc.
    Inventor: Haohai Sun
  • Publication number: 20170171396
    Abstract: A system and method for joint acoustic echo control and adaptive array processing, comprising the decomposition of a captured sound field into N sub-sound fields, applying linear echo cancellation to each sub-sound field, selecting L sub-sound fields from the N sub-sound fields, performing L channel adaptive array processing utilizing the L selected sub-sound fields, and applying non-linear audio echo cancellation.
    Type: Application
    Filed: November 21, 2016
    Publication date: June 15, 2017
    Inventor: Haohai Sun
  • Patent number: 9485574
    Abstract: Systems, processes, devices, apparatuses, algorithms and computer readable medium for suppressing spatial interference using a dual microphone array for receiving, from a first microphone and a second microphone that are separated by a predefined distance, and that are configured to receive source signals, respective first and second microphone signals based on received source signals. A phase difference between the first and the second microphone signals is calculated based on the predefined distance. An angular distance between directions of arrival of the source signals and a desired capture direction is calculated based on the phase difference. Directional-filter coefficients are calculated based on the angular distance. Undesired source signals are filtered from an output based on the directional-filter coefficients.
    Type: Grant
    Filed: November 6, 2015
    Date of Patent: November 1, 2016
    Assignee: Cisco Technology, Inc.
    Inventors: Haohai Sun, Espen Moberg
  • Patent number: 9385779
    Abstract: A processing system can include a tracking microphone array; audio tracker circuitry connected to the tracking microphone array to track an audio source based on an audio input from the array; communication microphones; and a processor. The processor can include audio circuitry to receive an audio input from the communication microphones and process the audio input to apply one or more of acoustic echo cancellation (AEC) and acoustic echo suppression (AES) processing to the audio input. The processor can further include calculating circuitry to calculate a ratio of signal power after and before the AEC and/or the AES processing, and control circuitry to generate an acoustic echo presence indication based on the ratio calculated by the calculating circuitry. The processor can transmit, via transmitting circuitry, the acoustic echo presence indication to an audio tracking device via a data communication channel between the processor and the audio tracker.
    Type: Grant
    Filed: December 5, 2013
    Date of Patent: July 5, 2016
    Assignee: Cisco Technology, Inc.
    Inventor: Haohai Sun
  • Patent number: 9338544
    Abstract: A video conference endpoint determines a position of a best audio pick-up region for placement of a sound source relative to a microphone having a receive pattern configured to capture sound signals from the best region. The endpoint captures an image of a scene that encompasses the best region and displays the image of the scene. The endpoint generates an image representative of the best region and displays the generated image representative of the best region as an overlay of the scene image.
    Type: Grant
    Filed: June 3, 2014
    Date of Patent: May 10, 2016
    Assignee: Cisco Technology, Inc.
    Inventors: Haohai Sun, Bjørn Winsvold, Per Waagø, Amund Holmboe Basmo, Øystein Bredvei
  • Publication number: 20160066092
    Abstract: Systems, processes, devices, apparatuses, algorithms and computer readable medium for suppressing spatial interference using a dual microphone array for receiving, from a first microphone and a second microphone that are separated by a predefined distance, and that are configured to receive source signals, respective first and second microphone signals based on received source signals. A phase difference between the first and the second microphone signals is calculated based on the predefined distance. An angular distance between directions of arrival of the source signals and a desired capture direction is calculated based on the phase difference. Directional-filter coefficients are calculated based on the angular distance. Undesired source signals are filtered from an output based on the directional-filter coefficients.
    Type: Application
    Filed: November 6, 2015
    Publication date: March 3, 2016
    Inventors: Haohai Sun, Espen Moberg
  • Patent number: 9226062
    Abstract: A telepresence video conference endpoint device includes spaced-apart microphone arrays each configured to transduce sound into corresponding sound signals. A processor receives the sound signals from the arrays and determines a direction-of-arrival (DOA) of sound at each array based on the set of sound signals from that array, determines if each array is blocked or unblocked based on the DOA determined for that array, selects an array among the arrays based on whether each array is determined to be blocked or unblocked, and performs subsequent sound processing based on one or more of the sound signals from the selected array.
    Type: Grant
    Filed: March 18, 2014
    Date of Patent: December 29, 2015
    Assignee: Cisco Technology, Inc.
    Inventors: Haohai Sun, Espen Moberg, Bjørn Winsvold
  • Patent number: 9225937
    Abstract: Presented herein are techniques for controlling the level of ultrasound pairing signals generated in a teleconferencing environment. The levels of ultrasound pairing signals transmitted in a meeting room are adjusted automatically based on the ultrasound signal levels received at one or more of the sound receiving devices that can communicate with a teleconferencing endpoint in the meeting room.
    Type: Grant
    Filed: November 19, 2013
    Date of Patent: December 29, 2015
    Assignee: Cisco Technology, Inc.
    Inventors: Haohai Sun, Bjorn Winsvold
  • Patent number: 9215543
    Abstract: A processing system can include a processor that includes circuitry. The circuitry can be configured to: receive far-end and near-end audio signals; detect silence events and voice activities from the audio signals; determine whether an audio event in the audio signals is an interference event or a speaker event based on the detected silence events and voice activities, and further based on localized acoustic source data and faces or motion detected from an image; and generate a mute or unmute indication based on whether the audio event is the interference event or the speaker event. The system can include a near-end microphone array to output the near-end audio signals, one or more far-end microphones to output the far-end audio signals, and one or more cameras to capture the image of the environment.
    Type: Grant
    Filed: December 3, 2013
    Date of Patent: December 15, 2015
    Assignee: Cisco Technology, Inc.
    Inventors: Haohai Sun, Bjorn Winsvold
  • Patent number: 9210499
    Abstract: Systems, processes, devices, apparatuses, algorithms and computer readable medium for suppressing spatial interference using a dual microphone array for receiving, from a first microphone and a second microphone that are separated by a predefined distance, and that are configured to receive source signals, respective first and second microphone signals based on received source signals. A phase difference between the first and the second microphone signals is calculated based on the predefined distance. An angular distance between directions of arrival of the source signals and a desired capture direction is calculated based on the phase difference. Directional-filter coefficients are calculated based on the angular distance. Undesired source signals are filtered from an output based on the directional-filter coefficients.
    Type: Grant
    Filed: December 13, 2012
    Date of Patent: December 8, 2015
    Assignee: Cisco Technology, Inc.
    Inventors: Haohai Sun, Espen Moberg
  • Publication number: 20150350769
    Abstract: A video conference endpoint determines a position of a best audio pick-up region for placement of a sound source relative to a microphone having a receive pattern configured to capture sound signals from the best region. The endpoint captures an image of a scene that encompasses the best region and displays the image of the scene. The endpoint generates an image representative of the best region and displays the generated image representative of the best region as an overlay of the scene image.
    Type: Application
    Filed: June 3, 2014
    Publication date: December 3, 2015
    Applicant: Cisco Technology, Inc.
    Inventors: Haohai Sun, Bjørn Winsvold, Per Waagø, Amund Holmboe Basmo, Øystein Bredvei
  • Publication number: 20150341719
    Abstract: A video conference endpoint detects a face and determines a face angle of the detected face relative to a reference direction based on images captured with a camera. The endpoint determines an angle of arrival of sound (i.e., a sound angle) received at a microphone array that transduces the sound relative to the reference direction based on the transduced sound and a sound speed parameter indicative of a speed of sound in air. The endpoint compares the face angle against the sound angle, and adjusts the sound speed parameter so as to reduce the angle difference if the compare indicates an angle difference greater than zero between the face and sound angles.
    Type: Application
    Filed: May 20, 2014
    Publication date: November 26, 2015
    Applicant: Cisco Technology, Inc.
    Inventor: Haohai Sun
  • Publication number: 20150271593
    Abstract: A telepresence video conference endpoint device includes spaced-apart microphone arrays each configured to transduce sound into corresponding sound signals. A processor receives the sound signals from the arrays and determines a direction-of-arrival (DOA) of sound at each array based on the set of sound signals from that array, determines if each array is blocked or unblocked based on the DOA determined for that array, selects an array among the arrays based on whether each array is determined to be blocked or unblocked, and performs subsequent sound processing based on one or more of the sound signals from the selected array.
    Type: Application
    Filed: March 18, 2014
    Publication date: September 24, 2015
    Applicant: Cisco Technology, Inc.
    Inventors: Haohai Sun, Espen Moberg, Bjørn Winsvold
  • Publication number: 20150156598
    Abstract: A processing system can include a processor that includes circuitry. The circuitry can be configured to: receive far-end and near-end audio signals; detect silence events and voice activities from the audio signals; determine whether an audio event in the audio signals is an interference event or a speaker event based on the detected silence events and voice activities, and further based on localized acoustic source data and faces or motion detected from an image; and generate a mute or unmute indication based on whether the audio event is the interference event or the speaker event. The system can include a near-end microphone array to output the near-end audio signals, one or more far-end microphones to output the far-end audio signals, and one or more cameras to capture the image of the environment.
    Type: Application
    Filed: December 3, 2013
    Publication date: June 4, 2015
    Applicant: Cisco Technology, Inc.
    Inventors: Haohai Sun, Bjorn Winsvold
  • Publication number: 20150109404
    Abstract: Presented herein are techniques for controlling the level of ultrasound pairing signals generated in a teleconferencing environment. The levels of ultrasound pairing signals transmitted in a meeting room are adjusted automatically based on the ultrasound signal levels received at one or more of the sound receiving devices that can communicate with a teleconferencing endpoint in the meeting room.
    Type: Application
    Filed: November 19, 2013
    Publication date: April 23, 2015
    Applicant: Cisco Technology, Inc.
    Inventors: Haohai Sun, Bjorn Winsvold
  • Publication number: 20150110282
    Abstract: A processing system can include a tracking microphone array; audio tracker circuitry connected to the tracking microphone array to track an audio source based on an audio input from the array; communication microphones; and a processor. The processor can include audio circuitry to receive an audio input from the communication microphones and process the audio input to apply one or more of acoustic echo cancellation (AEC) and acoustic echo suppression (AES) processing to the audio input. The processor can further include calculating circuitry to calculate a ratio of signal power after and before the AEC and/or the AES processing, and control circuitry to generate an acoustic echo presence indication based on the ratio calculated by the calculating circuitry. The processor can transmit, via transmitting circuitry, the acoustic echo presence indication to an audio tracking device via a data communication channel between the processor and the audio tracker.
    Type: Application
    Filed: December 5, 2013
    Publication date: April 23, 2015
    Applicant: Cisco Technology, Inc.
    Inventor: Haohai Sun
  • Publication number: 20140169576
    Abstract: Systems, processes, devices, apparatuses, algorithms and computer readable medium for suppressing spatial interference using a dual microphone array for receiving, from a first microphone and a second microphone that are separated by a predefined distance, and that are configured to receive source signals, respective first and second microphone signals based on received source signals. A phase difference between the first and the second microphone signals is calculated based on the predefined distance. An angular distance between directions of arrival of the source signals and a desired capture direction is calculated based on the phase difference. Directional-filter coefficients are calculated based on the angular distance. Undesired source signals are filtered from an output based on the directional-filter coefficients.
    Type: Application
    Filed: December 13, 2012
    Publication date: June 19, 2014
    Applicant: Cisco Technology, Inc.
    Inventors: Haohai Sun, Espen Moberg
  • Publication number: 20120093344
    Abstract: A method of forming a beampattern in a beamformer of the type in which the beamformer receives input signals from a sensor array, decomposes the input signals into the spherical harmonics domain, applies weighting coefficients to the spherical harmonics and combines them to form an output signal, wherein the weighting coefficients are optimized for a given set of input parameters by convex optimization. Formulations are provided for forming second order cone programming constraints for multiple main lobe generation, uniform and non-uniform side lobe control, automatic null steering, robustness and white noise gain.
    Type: Application
    Filed: April 9, 2010
    Publication date: April 19, 2012
    Applicant: NTNU Technology Transfer AS
    Inventors: Haohai Sun, Shefeng Yan, U. Peter Svensson
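
Illustrative sketch: the dual-microphone abstracts above (for example, patent number 9210499) describe a chain from phase difference to angular distance to directional-filter coefficients. The Python below is one possible reading of that chain for a single frequency, not the patented method. It assumes a far-field plane wave, a speed of sound of 343 m/s, and angles measured from broadside. The function names and the raised-cosine roll-off are hypothetical choices made only for illustration.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s in air; an assumed value, not taken from the abstracts


def arrival_angle(phase_diff, freq_hz, spacing_m, c=SPEED_OF_SOUND):
    """Angle of arrival (radians from broadside) implied by a phase difference.

    Uses the far-field relation phase_diff = 2*pi*f*d*sin(theta)/c.
    """
    s = c * phase_diff / (2.0 * math.pi * freq_hz * spacing_m)
    s = max(-1.0, min(1.0, s))  # clamp so noisy inputs stay in asin's domain
    return math.asin(s)


def directional_gain(angle, desired, beamwidth):
    """Hypothetical directional-filter coefficient in [0, 1].

    Full gain within `beamwidth` radians of the desired capture direction,
    raised-cosine roll-off down to zero at twice the beamwidth.
    """
    distance = abs(angle - desired)  # angular distance to the capture direction
    if distance <= beamwidth:
        return 1.0
    if distance >= 2.0 * beamwidth:
        return 0.0
    return 0.5 * (1.0 + math.cos(math.pi * (distance - beamwidth) / beamwidth))


if __name__ == "__main__":
    # Example: 5 cm spacing, 1 kHz component, source placed 30 degrees off broadside.
    phase = 2.0 * math.pi * 1000.0 * 0.05 * math.sin(math.radians(30.0)) / SPEED_OF_SOUND
    theta = arrival_angle(phase, 1000.0, 0.05)
    gain = directional_gain(theta, desired=0.0, beamwidth=math.radians(20.0))
    print(math.degrees(theta), gain)
```

In a real system the gain would be computed per frequency bin and applied to the microphone spectrum. The clamp before `asin` matters because measured phase differences can exceed the physically possible range when the signal is noisy.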