Patents by Inventor Hannes Muesch
Hannes Muesch has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20230121651Abstract: Methods for echo estimation or echo management (echo suppression or cancellation) on an input audio signal, with at least one of adaptation of a sparse prediction filter set, modification (for example, truncation) of adapted prediction filter impulse responses, generation of a composite impulse response from adapted prediction filter impulse responses, or use of echo estimation and/or echo management resources in a manner determined at least in part by classification of the input audio signal as being (or not being) echo free. Other aspects are systems configured to perform any embodiment of any of the methods.Type: ApplicationFiled: December 15, 2022Publication date: April 20, 2023Applicant: Dolby Laboratories Licensing CorporationInventors: Dong SHI, Kai LI, Hannes MUESCH, David GUNAWAN, Paul HOLMBERG, Glenn N. DICKINS
-
Patent number: 11538486Abstract: Methods for echo estimation or echo management (echo suppression or cancellation) on an input audio signal, with at least one of adaptation of a sparse prediction filter set, modification (for example, truncation) of adapted prediction filter impulse responses, generation of a composite impulse response from adapted prediction filter impulse responses, or use of echo estimation and/or echo management resources in a manner determined at least in part by classification of the input audio signal as being (or not being) echo free. Other aspects are systems configured to perform any embodiment of any of the methods.Type: GrantFiled: October 20, 2020Date of Patent: December 27, 2022Assignee: Dolby Laboratories Licensing CorporationInventors: Dong Shi, Kai Li, Hannes Muesch, David Gunawan, Paul Holmberg, Glenn N. Dickins
-
Publication number: 20210104254Abstract: Methods for echo estimation or echo management (echo suppression or cancellation) on an input audio signal, with at least one of adaptation of a sparse prediction filter set, modification (for example, truncation) of adapted prediction filter impulse responses, generation of a composite impulse response from adapted prediction filter impulse responses, or use of echo estimation and/or echo management resources in a manner determined at least in part by classification of the input audio signal as being (or not being) echo free. Other aspects are systems configured to perform any embodiment of any of the methods.Type: ApplicationFiled: October 20, 2020Publication date: April 8, 2021Applicant: DOLBY LABORATORIES LICENSING CORPORATIONInventors: Dong SHI, Kai LI, Hannes MUESCH, David GUNAWAN, Paul HOLMBERG, Glenn N. DICKINS
-
Patent number: 10812401Abstract: Disclosed is an apparatus and method operative to receive packets of media from a network including a receiver unit operative to receive the packets from the network, a jitter buffer data structure for receiving the packets in an ordered queue, the jitter buffer data structure having a tail into which the packets are input; a plurality of heads defining points in the jitter buffer data structure from which the ordered queue of packets are to be played back, the heads comprise an adjustable actual playback head coupled to an actual playback unit and at least one prototype head, each prototype head having associated therewith a target latency a processor having decision logic operable to determine a cost of achieving the associated target latency for each prototype head, wherein the decision logic compares the costs determined for each prototype head to identify a particular target latency and head location for the actual playback head of the buffer and a playback unit coupled to the processor for actual playbaType: GrantFiled: March 16, 2017Date of Patent: October 20, 2020Assignee: Dolby Laboratories Licensing CorporationInventors: Richard J. Cartwright, Hannes Muesch
-
Patent number: 10811027Abstract: Methods for echo estimation or echo management (echo suppression or cancellation) on an input audio signal, with at least one of adaptation of a sparse prediction filter set, modification (for example, truncation) of adapted prediction filter impulse responses, generation of a composite impulse response from adapted prediction filter impulse responses, or use of echo estimation and/or echo management resources in a manner determined at least in part by classification of the input audio signal as being (or not being) echo free. Other aspects are systems configured to perform any embodiment of any of the methods.Type: GrantFiled: June 7, 2017Date of Patent: October 20, 2020Assignee: Dolby Laboratories Licensing CorporationInventors: Dong Shi, Kai Li, Hannes Muesch, David Gunawan, Paul Holmberg, Glenn N. Dickins
-
Patent number: 10812759Abstract: Systems and methods are described for determining orientation of an external audio device in a video conference, which may be used to provide congruent multimodal representation for a video conference. A camera of a video conferencing system may be used to detect a potential location of an external audio device within a room in which the video conferencing system is providing a video conference. Within the detected potential location, a visual pattern associated with the external audio device may be identified. Using the identified visual pattern, the video conferencing system may estimate an orientation of the external audio device, the orientation being used by the video conferencing system to provide spatial audio video congruence to a far end audience.Type: GrantFiled: July 22, 2019Date of Patent: October 20, 2020Assignee: Dolby Laboratories Licensing CorporationInventors: Erwin Goesnar, Hannes Muesch, David Gunawan, Michael Eckert, Glenn N. Dickins
-
Patent number: 10607629Abstract: A method for hybrid speech enhancement which employs parametric-coded enhancement (or blend of parametric-coded and waveform-coded enhancement) under some signal conditions and waveform-coded enhancement (or a different blend of parametric-coded and waveform-coded enhancement) under other signal conditions. Other aspects are methods for generating a bitstream indicative of an audio program including speech and other content, such that hybrid speech enhancement can be performed on the program, a decoder including a buffer which stores at least one segment of an encoded audio bitstream generated by any embodiment of the inventive method, and a system or device (e.g., an encoder or decoder) configured (e.g., programmed) to perform any embodiment of the inventive method. At least some of speech enhancement operations are performed by a recipient audio decoder with Mid/Side speech enhancement metadata generated by an upstream audio encoder.Type: GrantFiled: October 22, 2018Date of Patent: March 31, 2020Assignees: Dolby Laboratories Licensing Corporation, Dolby International ABInventors: Jeroen Koppens, Hannes Muesch
-
Patent number: 10586557Abstract: According to one aspect, a method for determining voice activity is disclosed, the method including receiving a frame of an input audio signal, the input audio signal having a sample rate, and spitting the audio signal into a plurality of subbands, the plurality of subbands including at least a lowest subband and a highest subband. The method further comprises filtering the lowest subband to reduce an energy of the lowest subband, estimating a noise level for at least some of the plurality of subbands, and computing a signal-to-noise ratio for at least some of the plurality of subbands. The method also includes determining a speech activity level based at least in part on the computed signal-to-noise ratios and an average of an energy of at least some of the plurality of subbands.Type: GrantFiled: July 19, 2019Date of Patent: March 10, 2020Assignee: Dolby Laboratories Licensing CorporationInventor: Hannes Muesch
-
Publication number: 20190342521Abstract: Systems and methods are described for determining orientation of an external audio device in a video conference, which may be used to provide congruent multimodal representation for a video conference. A camera of a video conferencing system may be used to detect a potential location of an external audio device within a room in which the video conferencing system is providing a video conference. Within the detected potential location, a visual pattern associated with the external audio device may be identified. Using the identified visual pattern, the video conferencing system may estimate an orientation of the external audio device, the orientation being used by the video conferencing system to provide spatial audio video congruence to a far end audience.Type: ApplicationFiled: July 22, 2019Publication date: November 7, 2019Applicant: Dolby Laboratories Licensing CorporationInventors: Erwin GOESNAR, Hannes MUESCH, David GUNAWAN, Michael ECKERT, Glenn N. DICKINS
-
Publication number: 20190341069Abstract: According to one aspect, a method for determining voice activity is disclosed, the method including receiving a frame of an input audio signal, the input audio signal having a sample rate, and spitting the audio signal into a plurality of subbands, the plurality of subbands including at least a lowest subband and a highest subband. The method further comprises filtering the lowest subband to reduce an energy of the lowest subband, estimating a noise level for at least some of the plurality of subbands, and computing a signal-to-noise ratio for at least some of the plurality of subbands. The method also includes determining a speech activity level based at least in part on the computed signal-to-noise ratios and an average of an energy of at least some of the plurality of subbands.Type: ApplicationFiled: July 19, 2019Publication date: November 7, 2019Applicant: Dolby Laboratories Licensing CorporationInventor: Hannes Muesch
-
Patent number: 10439951Abstract: Disclosed is a method and apparatus operative to process packets of media received from a network including a receiver unit operative, a jitter buffer data structure and a playback head defining a point in the jitter buffer data structure from which the ordered queue of packets are to be played back, and at least one prototype head. Each prototype head having a predetermined latency assigned thereto and defining a point in the jitter buffer data structure from which the ordered queue of packets is being played back containing said latency a processor operable to determine a measure of conversational quality associated with the ordered queue of packets being played back by each prototype head. Also described is a head selector operable to compare the measures of conversational quality associated with the ordered queue of packets being played back by each prototype head to select the prototype head with the highest measure of conversational quality and a playback unit coupled to the playback head.Type: GrantFiled: March 16, 2017Date of Patent: October 8, 2019Assignee: Dolby Laboratories Licensing CorporationInventors: Hannes Muesch, Richard J. Cartwright
-
Patent number: 10418052Abstract: According to one aspect, a method for detecting voice activity is disclosed, the method including receiving a frame of an input audio signal, the input audio signal having an sample rate; dividing the frame into a plurality of subbands based on the sample rate, the plurality of subbands including at least a lowest subband and a highest subband; filtering the lowest subband with a moving average filter to reduce an energy of the lowest subband; estimating a noise level for each of the plurality of subbands; calculating a signal to noise ratio value for each of the plurality of subbands; and determining a speech activity level of the frame based on an average of the calculated signal to noise ratio values and a weighted average of an energy of each of the plurality of subbands. Other aspects include audio decoders that decode audio that was encoded using the methods described herein.Type: GrantFiled: October 12, 2017Date of Patent: September 17, 2019Assignee: Dolby Laboratories Licensing CorporationInventor: Hannes Muesch
-
Patent number: 10362270Abstract: Systems and methods are described for determining orientation of an external audio device in a video conference, which may be used to provide congruent multimodal representation for a video conference. A camera of a video conferencing system may be used to detect a potential location of an external audio device within a room in which the video conferencing system is providing a video conference. Within the detected potential location, a visual pattern associated with the external audio device may be identified. Using the identified visual pattern, the video conferencing system may estimate an orientation of the external audio device, the orientation being used by the video conferencing system to provide spatial audio video congruence to a far end audience.Type: GrantFiled: December 12, 2017Date of Patent: July 23, 2019Assignee: Dolby Laboratories Licensing CorporationInventors: Erwin Goesnar, Hannes Muesch, David Gunawan, Michael Eckert, Glenn N. Dickins
-
Publication number: 20190156852Abstract: Methods for echo estimation or echo management (echo suppression or cancellation) on an input audio signal, with at least one of adaptation of a sparse prediction filter set, modification (for example, truncation) of adapted prediction filter impulse responses, generation of a composite impulse response from adapted prediction filter impulse responses, or use of echo estimation and/or echo management resources in a manner determined at least in part by classification of the input audio signal as being (or not being) echo free. Other aspects are systems configured to perform any embodiment of any of the methods.Type: ApplicationFiled: June 7, 2017Publication date: May 23, 2019Applicant: Dolby Laboratories Licensing CorporationInventors: Dong SHI, Kai LI, Hannes MUESCH, David GUNAWAN, Paul HOLMBERG, Glenn N. DICKINS
-
Publication number: 20190081902Abstract: Disclosed is an apparatus and method operative to receive packets of media from a network including a receiver unit operative to receive the packets from the network, a jitter buffer data structure for receiving the packets in an ordered queue, the jitter buffer data structure having a tail into which the packets are input; a plurality of heads defining points in the jitter buffer data structure from which the ordered queue of packets are to be played back, the heads comprise an adjustable actual playback head coupled to an actual playback unit and at least one prototype head, each prototype head having associated therewith a target latency a processor having decision logic operable to determine a cost of achieving the associated target latency for each prototype head, wherein the decision logic compares the costs determined for each prototype head to identify a particular target latency and head location for the actual playback head of the buffer and a playback unit coupled to the processor for actual playbaType: ApplicationFiled: March 16, 2017Publication date: March 14, 2019Applicant: Dolby Laboratories Licensing CorporationInventors: Richard J. CARTWRIGHT, Hannes MUESCH
-
Publication number: 20190057713Abstract: A method for hybrid speech enhancement which employs parametric-coded enhancement (or blend of parametric-coded and waveform-coded enhancement) under some signal conditions and waveform-coded enhancement (or a different blend of parametric-coded and waveform-coded enhancement) under other signal conditions. Other aspects are methods for generating a bitstream indicative of an audio program including speech and other content, such that hybrid speech enhancement can be performed on the program, a decoder including a buffer which stores at least one segment of an encoded audio bitstream generated by any embodiment of the inventive method, and a system or device (e.g., an encoder or decoder) configured (e.g., programmed) to perform any embodiment of the inventive method. At least some of speech enhancement operations are performed by a recipient audio decoder with Mid/Side speech enhancement metadata generated by an upstream audio encoder.Type: ApplicationFiled: October 22, 2018Publication date: February 21, 2019Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL ABInventors: Jeroen KOPPENS, Hannes MUESCH
-
Patent number: 10141004Abstract: A method for hybrid speech enhancement which employs parametric-coded enhancement (or blend of parametric-coded and waveform-coded enhancement) under some signal conditions and waveform-coded enhancement (or a different blend of parametric-coded and waveform-coded enhancement) under other signal conditions. Other aspects are methods for generating a bitstream indicative of an audio program including speech and other content, such that hybrid speech enhancement can be performed on the program, a decoder including a buffer which stores at least one segment of an encoded audio bitstream generated by any embodiment of the inventive method, and a system or device (e.g., an encoder or decoder) configured (e.g., programmed) to perform any embodiment of the inventive method. At least some of speech enhancement operations are performed by a recipient audio decoder with Mid/Side speech enhancement metadata generated by an upstream audio encoder.Type: GrantFiled: August 27, 2014Date of Patent: November 27, 2018Assignees: Dolby Laboratories Licensing Corporation, Dolby International ABInventors: Jeroen Koppens, Hannes Muesch
-
Patent number: 10057707Abstract: Various disclosed implementations involve processing and/or playback of a recording of a conference involving a plurality of conference participants. Some implementations involve receiving or determining conversational dynamics data. One or more variables of a cost function may be based, at least in part, on the conversational dynamics data. The cost function may be a spatial optimization cost function of a vector describing a virtual conference participant position for each of the conference participants in a virtual acoustic space. The virtual acoustic space may be determined relative to a listener's head. The virtual conference participant positions may be assigned according to a solution of the cost function.Type: GrantFiled: February 3, 2016Date of Patent: August 21, 2018Assignee: Dolby Laboratories Licensing CorporationInventors: Richard J. Cartwright, Hannes Muesch
-
Publication number: 20180167581Abstract: Systems and methods are described for determining orientation of an external audio device in a video conference, which may be used to provide congruent multimodal representation for a video conference. A camera of a video conferencing system may be used to detect a potential location of an external audio device within a room in which the video conferencing system is providing a video conference. Within the detected potential location, a visual pattern associated with the external audio device may be identified. Using the identified visual pattern, the video conferencing system may estimate an orientation of the external audio device, the orientation being used by the video conferencing system to provide spatial audio video congruence to a far end audience.Type: ApplicationFiled: December 12, 2017Publication date: June 14, 2018Applicant: Dolby Laboratories Licensing CorporationInventors: Erwin GOESNAR, Hannes MUESCH, David GUNAWAN, Michael ECKERT, Glenn N. DICKINS
-
Patent number: 9985855Abstract: Described are: a method, an apparatus, and a tangible computer-readable storage medium comprising instructions to instruct one or more processors to carry out a method. One set of methods is for the transmit side of a communication link and another set of methods is for the receive side. A transmit side method includes assigning one of a set of classifications to media, e.g., voice/audio packets transmitted in a sequence, different classifications impacting differently a measure of perceptual quality calculated at the receive side if packets of the respective classifications are lost. A present packet is sent to the receive side containing the classification of a previous packet.Type: GrantFiled: June 26, 2013Date of Patent: May 29, 2018Assignee: Dolby Laboratories Licensing CorporationInventors: Glenn N. Dickins, Hannes Muesch