Patents by Inventor Hannes Muesch

Hannes Muesch has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20190156852
    Abstract: Methods for echo estimation or echo management (echo suppression or cancellation) on an input audio signal, with at least one of adaptation of a sparse prediction filter set, modification (for example, truncation) of adapted prediction filter impulse responses, generation of a composite impulse response from adapted prediction filter impulse responses, or use of echo estimation and/or echo management resources in a manner determined at least in part by classification of the input audio signal as being (or not being) echo free. Other aspects are systems configured to perform any embodiment of any of the methods.
    Type: Application
    Filed: June 7, 2017
    Publication date: May 23, 2019
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Dong SHI, Kai LI, Hannes MUESCH, David GUNAWAN, Paul HOLMBERG, Glenn N. DICKINS
  • Publication number: 20190081902
    Abstract: Disclosed is an apparatus and method operative to receive packets of media from a network including a receiver unit operative to receive the packets from the network, a jitter buffer data structure for receiving the packets in an ordered queue, the jitter buffer data structure having a tail into which the packets are input; a plurality of heads defining points in the jitter buffer data structure from which the ordered queue of packets are to be played back, the heads comprise an adjustable actual playback head coupled to an actual playback unit and at least one prototype head, each prototype head having associated therewith a target latency a processor having decision logic operable to determine a cost of achieving the associated target latency for each prototype head, wherein the decision logic compares the costs determined for each prototype head to identify a particular target latency and head location for the actual playback head of the buffer and a playback unit coupled to the processor for actual playba
    Type: Application
    Filed: March 16, 2017
    Publication date: March 14, 2019
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Richard J. CARTWRIGHT, Hannes MUESCH
  • Publication number: 20190057713
    Abstract: A method for hybrid speech enhancement which employs parametric-coded enhancement (or blend of parametric-coded and waveform-coded enhancement) under some signal conditions and waveform-coded enhancement (or a different blend of parametric-coded and waveform-coded enhancement) under other signal conditions. Other aspects are methods for generating a bitstream indicative of an audio program including speech and other content, such that hybrid speech enhancement can be performed on the program, a decoder including a buffer which stores at least one segment of an encoded audio bitstream generated by any embodiment of the inventive method, and a system or device (e.g., an encoder or decoder) configured (e.g., programmed) to perform any embodiment of the inventive method. At least some of speech enhancement operations are performed by a recipient audio decoder with Mid/Side speech enhancement metadata generated by an upstream audio encoder.
    Type: Application
    Filed: October 22, 2018
    Publication date: February 21, 2019
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Jeroen KOPPENS, Hannes MUESCH
  • Patent number: 10141004
    Abstract: A method for hybrid speech enhancement which employs parametric-coded enhancement (or blend of parametric-coded and waveform-coded enhancement) under some signal conditions and waveform-coded enhancement (or a different blend of parametric-coded and waveform-coded enhancement) under other signal conditions. Other aspects are methods for generating a bitstream indicative of an audio program including speech and other content, such that hybrid speech enhancement can be performed on the program, a decoder including a buffer which stores at least one segment of an encoded audio bitstream generated by any embodiment of the inventive method, and a system or device (e.g., an encoder or decoder) configured (e.g., programmed) to perform any embodiment of the inventive method. At least some of speech enhancement operations are performed by a recipient audio decoder with Mid/Side speech enhancement metadata generated by an upstream audio encoder.
    Type: Grant
    Filed: August 27, 2014
    Date of Patent: November 27, 2018
    Assignees: Dolby Laboratories Licensing Corporation, Dolby International AB
    Inventors: Jeroen Koppens, Hannes Muesch
  • Patent number: 10057707
    Abstract: Various disclosed implementations involve processing and/or playback of a recording of a conference involving a plurality of conference participants. Some implementations involve receiving or determining conversational dynamics data. One or more variables of a cost function may be based, at least in part, on the conversational dynamics data. The cost function may be a spatial optimization cost function of a vector describing a virtual conference participant position for each of the conference participants in a virtual acoustic space. The virtual acoustic space may be determined relative to a listener's head. The virtual conference participant positions may be assigned according to a solution of the cost function.
    Type: Grant
    Filed: February 3, 2016
    Date of Patent: August 21, 2018
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Richard J. Cartwright, Hannes Muesch
  • Publication number: 20180167581
    Abstract: Systems and methods are described for determining orientation of an external audio device in a video conference, which may be used to provide congruent multimodal representation for a video conference. A camera of a video conferencing system may be used to detect a potential location of an external audio device within a room in which the video conferencing system is providing a video conference. Within the detected potential location, a visual pattern associated with the external audio device may be identified. Using the identified visual pattern, the video conferencing system may estimate an orientation of the external audio device, the orientation being used by the video conferencing system to provide spatial audio video congruence to a far end audience.
    Type: Application
    Filed: December 12, 2017
    Publication date: June 14, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Erwin GOESNAR, Hannes MUESCH, David GUNAWAN, Michael ECKERT, Glenn N. DICKINS
  • Patent number: 9985855
    Abstract: Described are: a method, an apparatus, and a tangible computer-readable storage medium comprising instructions to instruct one or more processors to carry out a method. One set of methods is for the transmit side of a communication link and another set of methods is for the receive side. A transmit side method includes assigning one of a set of classifications to media, e.g., voice/audio packets transmitted in a sequence, different classifications impacting differently a measure of perceptual quality calculated at the receive side if packets of the respective classifications are lost. A present packet is sent to the receive side containing the classification of a previous packet.
    Type: Grant
    Filed: June 26, 2013
    Date of Patent: May 29, 2018
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Glenn N. Dickins, Hannes Muesch
  • Publication number: 20180033453
    Abstract: According to one aspect, a method for detecting voice activity is disclosed, the method including receiving a frame of an input audio signal, the input audio signal having an sample rate; dividing the frame into a plurality of subbands based on the sample rate, the plurality of subbands including at least a lowest subband and a highest subband; filtering the lowest subband with a moving average filter to reduce an energy of the lowest subband; estimating a noise level for each of the plurality of subbands; calculating a signal to noise ratio value for each of the plurality of subbands; and determining a speech activity level of the frame based on an average of the calculated signal to noise ratio values and a weighted average of an energy of each of the plurality of subbands. Other aspects include audio decoders that decode audio that was encoded using the methods described herein.
    Type: Application
    Filed: October 12, 2017
    Publication date: February 1, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: Hannes Muesch
  • Patent number: 9881635
    Abstract: A method and system for filtering a multi-channel audio signal having a speech channel and at least one non-speech channel, to improve intelligibility of speech determined by the signal. In typical embodiments, the method includes steps of determining at least one attenuation control value indicative of a measure of similarity between speech-related content determined by the speech channel and speech-related content determined by the non-speech channel, and attenuating the non-speech channel in response to the at least one attenuation control value. Typically, the attenuating step includes scaling of a raw attenuation control signal (e.g., a ducking gain control signal) for the non-speech channel in response to the at least one attenuation control value. Some embodiments are a general or special purpose processor programmed with software or firmware and/or otherwise configured to perform filtering in accordance the invention.
    Type: Grant
    Filed: November 16, 2015
    Date of Patent: January 30, 2018
    Assignee: Dolby Laboratories Licensing Corporation
    Inventor: Hannes Muesch
  • Publication number: 20180027351
    Abstract: Various disclosed implementations involve processing and/or playback of a recording of a conference involving a plurality of conference participants. Some implementations involve receiving or determining conversational dynamics data. One or more variables of a cost function may be based, at least in part, on the conversational dynamics data. The cost function may be a spatial optimization cost function of a vector describing a virtual conference participant position for each of the conference participants in a virtual acoustic space. The virtual acoustic space may be determined relative to a listener's head. The virtual conference participant positions may be assigned according to a solution of the cost function.
    Type: Application
    Filed: February 3, 2016
    Publication date: January 25, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Richard J. CARTWRIGHT, Hannes MUESCH
  • Patent number: 9818433
    Abstract: According to one aspect, a method for detecting voice activity is disclosed, the method including receiving a frame of an input audio signal, the input audio signal having an sample rate; dividing the frame into a plurality of subbands based on the sample rate, the plurality of subbands including at least a lowest subband and a highest subband; filtering the lowest subband with a moving average filter to reduce an energy of the lowest subband; estimating a noise level for each of the plurality of subbands; calculating a signal to noise ratio value for each of the plurality of subbands; and determining a speech activity level of the frame based on an average of the calculated signal to noise ratio values and a weighted average of an energy of each of the plurality of subbands. Other aspects include audio decoders that decode audio that was encoded using the methods described herein.
    Type: Grant
    Filed: July 11, 2016
    Date of Patent: November 14, 2017
    Assignee: Dolby Laboratories Licensing Corporation
    Inventor: Hannes Muesch
  • Publication number: 20170324931
    Abstract: Example embodiments disclosed herein relate to spatial congruency adjustment. A method for adjusting spatial congruency in a video conference is disclosed. The method includes detecting spatial congruency between a visual scene captured by a video endpoint device and an auditory scene captured by an audio endpoint device that is positioned in relation to the video endpoint device, the spatial congruency being a degree of alignment between the auditory scene and the visual scene, comparing the detected spatial congruency with a predefined threshold and in response to the detected spatial congruency being below the threshold, adjusting the spatial congruency. Corresponding system and computer program products are also disclosed.
    Type: Application
    Filed: November 17, 2015
    Publication date: November 9, 2017
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Xuejing SUN, Dong SHI, Shen HUANG, Kai LI, Hannes MUESCH, Glenn N. DICKINS, Gary SPITTLE
  • Publication number: 20170272375
    Abstract: Disclosed is a method and apparatus operative to process packets of media received from a network including a receiver unit operative, a jitter buffer data structure and a playback head defining a point in the jitter buffer data structure from which the ordered queue of packets are to be played back, and at least one prototype head. Each prototype head having a predetermined latency assigned thereto and defining a point in the jitter buffer data structure from which the ordered queue of packets is being played back containing said latency a processor operable to determine a measure of conversational quality associated with the ordered queue of packets being played back by each prototype head. Also described is a head selector operable to compare the measures of conversational quality associated with the ordered queue of packets being played back by each prototype head to select the prototype head with the highest measure of conversational quality and a playback unit coupled to the playback head.
    Type: Application
    Filed: March 16, 2017
    Publication date: September 21, 2017
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Hannes MUESCH, Richard J. CARTWRIGHT
  • Patent number: 9558744
    Abstract: An audio processing apparatus and an audio processing method are described. In one embodiment, the audio processing apparatus include an audio masker separator for separating from a first audio signal an audio material comprising a sound other than stationary noise and utterance meaningful in semantics, as an audio masker candidate. The apparatus also includes a first context analyzer for obtaining statistics regarding contextual information of detected audio masker candidates, and a masker library builder for building a masker library or updating an existing masker library by adding, based on the statistics, at least one audio masker candidate as an audio masker into the masker library, wherein audio maskers in the maker library are used to be inserted into a target position in a second audio signal to conceal defects in the second audio signal.
    Type: Grant
    Filed: November 27, 2013
    Date of Patent: January 31, 2017
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Xuejing Sun, Shen Huang, Poppy Crum, Hannes Muesch, Glenn N. Dickins, Michael Eckert
  • Patent number: 9552845
    Abstract: Metadata comprising a set of gain values for creating a dominance effect is automatically generated. Automatically generating the metadata includes receiving multiple audio streams and a dominance criterion for at least one of the audio streams. A set of gains is computed for one or more audio streams based on the dominance criterion for the at least one audio stream and metadata is generated with the set of gains.
    Type: Grant
    Filed: October 5, 2010
    Date of Patent: January 24, 2017
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Jeffrey C. Riedmiller, Regunathan Radhakrishnan, Hannes Muesch
  • Publication number: 20160322068
    Abstract: According to one aspect, a method for detecting voice activity is disclosed, the method including receiving a frame of an input audio signal, the input audio signal having an sample rate; dividing the frame into a plurality of subbands based on the sample rate, the plurality of subbands including at least a lowest subband and a highest subband; filtering the lowest subband with a moving average filter to reduce an energy of the lowest subband; estimating a noise level for each of the plurality of subbands; calculating a signal to noise ratio value for each of the plurality of subbands; and determining a speech activity level of the frame based on an average of the calculated signal to noise ratio values and a weighted average of an energy of each of the plurality of subbands. Other aspects include audio decoders that decode audio that was encoded using the methods described herein.
    Type: Application
    Filed: July 11, 2016
    Publication date: November 3, 2016
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: Hannes Muesch
  • Patent number: 9418680
    Abstract: According to one aspect, a method for detecting voice activity is disclosed, the method including receiving a frame of an input audio signal, the input audio signal having an sample rate; dividing the frame into a plurality of subbands based on the sample rate, the plurality of subbands including at least a lowest subband and a highest subband; filtering the lowest subband with a moving average filter to reduce an energy of the lowest subband; estimating a noise level for each of the plurality of subbands; calculating a signal to noise ratio value for each of the plurality of subbands; and determining a speech activity level of the frame based on an average of the calculated signal to noise ratio values and a weighted average of an energy of each of the plurality of subbands. Other aspects include audio decoders that decode audio that was encoded using the methods described herein.
    Type: Grant
    Filed: May 1, 2015
    Date of Patent: August 16, 2016
    Assignee: Dolby Laboratories Licensing Corporation
    Inventor: Hannes Muesch
  • Publication number: 20160225387
    Abstract: A method for hybrid speech enhancement which employs parametric-coded enhancement (or blend of parametric-coded and waveform-coded enhancement) under some signal conditions and waveform-coded enhancement (or a different blend of parametric-coded and waveform-coded enhancement) under other signal conditions. Other aspects are methods for generating a bitstream indicative of an audio program including speech and other content, such that hybrid speech enhancement can be performed on the program, a decoder including a buffer which stores at least one segment of an encoded audio bitstream generated by any embodiment of the inventive method, and a system or device (e.g., an encoder or decoder) configured (e.g., programmed) to perform any embodiment of the inventive method. At least some of speech enhancement operations are performed by a recipient audio decoder with Mid/Side speech enhancement metadata generated by an upstream audio encoder.
    Type: Application
    Filed: August 27, 2014
    Publication date: August 4, 2016
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Jeroen KOPPENS, Hannes MUESCH
  • Patent number: 9368128
    Abstract: The invention relates to audio signal processing. More specifically, the invention relates to enhancing multichannel audio, such as television audio, by applying a gain to the audio that has been smoothed between portions of the audio. The invention relates to methods, apparatus for performing such methods, and to software stored on a computer-readable medium for causing a computer to perform such methods.
    Type: Grant
    Filed: January 26, 2015
    Date of Patent: June 14, 2016
    Assignee: Dolby Laboratories Licensing Corporation
    Inventor: Hannes Muesch
  • Publication number: 20160071527
    Abstract: A method and system for filtering a multi-channel audio signal having a speech channel and at least one non-speech channel, to improve intelligibility of speech determined by the signal. In typical embodiments, the method includes steps of determining at least one attenuation control value indicative of a measure of similarity between speech-related content determined by the speech channel and speech-related content determined by the non-speech channel, and attenuating the non-speech channel in response to the at least one attenuation control value. Typically, the attenuating step includes scaling of a raw attenuation control signal (e.g., a ducking gain control signal) for the non-speech channel in response to the at least one attenuation control value. Some embodiments are a general or special purpose processor programmed with software or firmware and/or otherwise configured to perform filtering in accordance the invention.
    Type: Application
    Filed: November 16, 2015
    Publication date: March 10, 2016
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventor: Hannes Muesch