Noise Patents (Class 704/226)
  • Patent number: 11404061
    Abstract: A computer includes a processor and a memory storing instructions executable by the processor to receive sensor data of an occupant of a vehicle, identify a type of mask worn by the occupant based on the sensor data, select a sound filter according to the type of mask from a plurality of sound filters stored in the memory, receive sound data, apply the selected sound filter to the sound data, and perform an operation using the filtered sound data.
    Type: Grant
    Filed: January 11, 2021
    Date of Patent: August 2, 2022
    Assignee: FORD GLOBAL TECHNOLOGIES, LLC
    Inventors: Scott Andrew Amman, Cynthia M. Neubecker, Joshua Wheeler, Pietro Buttolo, Brian Bennie, John Robert Van Wiemeersch
  • Patent number: 11398241
    Abstract: Techniques for improving microphone noise suppression are provided. A system for noise-suppression may include a beam selector component that applies logic to select a beam most likely corresponding to a direction of a noise source and keeps the beam selection steady rather than switching the beam too often to avoid processing complications. The selected beam may be used as a reference in an adaptive filter which outputs a noise estimate. The noise estimate and raw microphone data may be used to adapt the adaptive filter. A parallel filter which adapts after a time delay may be applied to the reference in order to prevent interference. An attenuation factor may be used to scale the noise estimate based on noise diffuseness, signal quality, and/or a gain limit. The scaled noise estimate may be subtracted from microphone input data to produce output audio data with improved signal quality and maintained signal coherence.
    Type: Grant
    Filed: March 31, 2021
    Date of Patent: July 26, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Mohamed Mansour, Shobha Devi Kuruba Buchannagari
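The reference-driven adaptive filter described in the abstract of patent 11398241 can be illustrated with a minimal sketch. This is not the patented implementation: it assumes a normalized LMS (NLMS) update, which the abstract does not name, and the function name `cancel_noise` and the parameters `taps`, `mu`, and `attenuation` are hypothetical. Beam selection and the delayed parallel filter are omitted; `ref` stands in for the selected beam.

```python
def cancel_noise(mic, ref, taps=4, mu=0.5, attenuation=1.0, eps=1e-8):
    """Subtract a scaled adaptive noise estimate from microphone samples.

    mic: raw microphone samples; ref: noise reference samples (assumed to be
    the selected beam). NLMS is an assumption made for this sketch.
    """
    weights = [0.0] * taps
    history = [0.0] * taps          # most recent reference samples, newest first
    output = []
    for d, x in zip(mic, ref):
        history = [x] + history[:-1]
        # The adaptive filter maps the reference to a noise estimate.
        noise_est = sum(w * h for w, h in zip(weights, history))
        # Adapt using the noise estimate and the raw microphone sample.
        err = d - noise_est
        norm = sum(h * h for h in history) + eps
        weights = [w + mu * err * h / norm for w, h in zip(weights, history)]
        # Scale the estimate by the attenuation factor before subtracting.
        output.append(d - attenuation * noise_est)
    return output
```

With `attenuation=0.0` the input passes through unchanged, which is one way to picture how a gain limit could protect signal coherence when the noise estimate is not trusted.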
  • Patent number: 11398239
    Abstract: A process for compressing an audio speech signal utilizes ASR processing to generate a corresponding text representation and, depending on confidence in the corresponding text representation, selectively applies more, less, or no compression to the audio signal. The result is a compressed audio signal, with corresponding text, that is compact and well suited for searching, analytics, or additional ASR processing.
    Type: Grant
    Filed: December 2, 2020
    Date of Patent: July 26, 2022
    Assignee: Medallia, Inc.
    Inventor: David Garrod
  • Patent number: 11393485
    Abstract: A method of enhancing a distorted signal having a speech component and a noise component, with a processing device having memory with stored training information T, comprising a step of removing noise in the spectrum domain according to a noise and speech model to obtain a clear signal spectrum, wherein the training information T comprises dictionaries of compact spectra prototypes of speech and noise, speech gains and noise gains forming together composite states and it comprises probabilities of state history. The dictionaries of compact spectra prototypes comprise compact spectra prototypes of a first dimension P1 and of a second dimension P2, where the second dimension P2 is higher than or equal to the first dimension P1.
    Type: Grant
    Filed: November 13, 2018
    Date of Patent: July 19, 2022
    Assignee: MED-EL ELEKTROMEDIZINISCHE GERÄTE GMBH
    Inventor: Marcin Kuropatwinski
  • Patent number: 11380302
    Abstract: A method for multi-channel voice activity detection includes receiving a sequence of input frames characterizing streaming multi-channel audio captured by an array of microphones. Each channel of the streaming multi-channel audio includes respective audio features captured by a separate dedicated microphone. The method also includes determining, using a location fingerprint model, a location fingerprint indicating a location of a source of the multi-channel audio relative to the user device based on the respective audio features of each channel of the multi-channel audio. The method also includes generating an output from an application-specific classifier. The first score indicates a likelihood that the multi-channel audio corresponds to a particular audio type that the particular application is configured to process.
    Type: Grant
    Filed: October 22, 2020
    Date of Patent: July 5, 2022
    Assignee: Google LLC
    Inventors: Nolan Andrew Miller, Ramin Mehran
  • Patent number: 11373667
    Abstract: Systems and methods for processing an audio signal include an audio input operable to receive an input signal comprising a time-domain, single-channel audio signal, a subband analysis block operable to transform the input signal to a frequency domain input signal comprising a plurality of k-spaced under-sampled subband signals, a reverberation reduction block operable to reduce reverberation effect, including late reverberation, in the plurality of k-spaced under-sampled subband signals, a noise reduction block operable to reduce background noise from the plurality of k-spaced under-sampled subband signals, and a subband synthesis block operable to transform the subband signals to the time-domain, thereby producing an enhanced output signal.
    Type: Grant
    Filed: April 19, 2018
    Date of Patent: June 28, 2022
    Assignee: SYNAPTICS INCORPORATED
    Inventors: Saeed Mosayyebpour Kaskari, Francesco Nesta, Trausti Thormundsson, Thomas Aaron Gulliver
  • Patent number: 11363147
    Abstract: Operations related to performing gain operations with respect to a receive-path signal of a first device may be performed. The operations may include obtaining the receive-path signal, which includes an echo speech signal and a receive speech signal originating at a second device. In addition, the operations may include identifying a portion of the receive-path signal that includes, at a particular time, a first frequency component that corresponds to the echo speech signal and a second frequency component that corresponds to the receive speech signal in which the first frequency component is different from the second frequency component. Moreover, the operations may include attenuating the first frequency component of the portion while avoiding attenuating the second frequency component of the portion based on the first frequency component corresponding to the echo speech signal and the second frequency component corresponding to the receive speech signal.
    Type: Grant
    Filed: September 25, 2018
    Date of Patent: June 14, 2022
    Assignee: Sorenson IP Holdings, LLC
    Inventor: Mark Adams
  • Patent number: 11355108
    Abstract: Aspects of the present disclosure relate to voice command filtering. Communication is established with a voice controlled device located at a location. Data indicating blocked directions is received from the voice controlled device. A voice command is received. A determination is made that the voice command is received from a blocked direction indicated in the data. The received voice command is then ignored.
    Type: Grant
    Filed: August 20, 2019
    Date of Patent: June 7, 2022
    Assignee: International Business Machines Corporation
    Inventors: Eunjin Lee, Daniel Thomas Cunnington, Giacomo Giuseppe Chiarella, John Jesse Wood
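The blocked-direction check in the abstract of patent 11355108 can be sketched as below. The abstract does not say how directions are represented; this sketch assumes each blocked direction is an angular sector in degrees, and the names `in_sector` and `handle_command` are hypothetical.

```python
def in_sector(angle, start, end):
    """Return True if angle lies in the sector from start to end (degrees).

    Sectors may wrap through 0, e.g. (350, 10) covers 350..360 and 0..10.
    """
    angle, start, end = angle % 360, start % 360, end % 360
    if start <= end:
        return start <= angle <= end
    return angle >= start or angle <= end


def handle_command(command, direction_deg, blocked_sectors):
    """Ignore (return None for) a command arriving from a blocked direction."""
    if any(in_sector(direction_deg, s, e) for s, e in blocked_sectors):
        return None
    return command
```

For example, with `blocked_sectors=[(350, 10)]`, a command arriving from 5 degrees is dropped while one arriving from 180 degrees is passed through.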
  • Patent number: 11341987
    Abstract: In a general aspect, an apparatus for detecting speech can include a signal conditioning stage that receives a signal corresponding with acoustic energy, filters the received signal to produce a speech-band signal, calculates a first sequence of energy values for the received signal and calculates a second sequence of energy values for the speech-band signal. The apparatus can also include a detection stage including a plurality of speech and noise differentiators. The detection stage can be configured to receive the first and second sequences of energy values and, based on the first sequence of energy values and the second sequence of energy values, provide, for each speech and noise differentiator of the plurality of speech and noise differentiators, a respective speech-detection indication signal.
    Type: Grant
    Filed: April 4, 2019
    Date of Patent: May 24, 2022
    Assignee: SEMICONDUCTOR COMPONENTS INDUSTRIES, LLC
    Inventors: Pejman Dehghani, Robert L. Brennan
  • Patent number: 11341983
    Abstract: A system is provided. The system comprises at least one artificial neural network configured to: receive an audio signal; for a time period, determine if at least one human voice audio spectrum is in the audio signal; for the time period, identify at least one human voice audio power spectrum; for the time period, extract each of the at least one identified human voice audio power spectrum; remove artifacts from each extracted human voice audio power spectrum to synthesize an estimation of an original human voice prior to its distortion; and transmit the synthesized estimation of an original human voice.
    Type: Grant
    Filed: September 17, 2018
    Date of Patent: May 24, 2022
    Assignee: Honeywell International Inc.
    Inventors: Akash Nandi, Showvik Chakraborty
  • Patent number: 11322156
    Abstract: With recent real-world applications of speaker and speech recognition systems, robust features for degraded speech have become a necessity. In general, degraded speech results in poor performance of any speech-based system. This poor performance can be attributed to the feature extraction functionality of a speech-based system, which takes an input speech file and converts it into a representation called a feature. Embodiments of the present disclosure provide systems and methods that compute the distance between each degraded speech feature extracted from an input speech signal and each clean speech feature stored in a memory of the system to obtain a set of matched clean speech features, wherein at least a subset of the clean speech features is dynamically selected based on a pre-defined threshold and the computed distance, thereby computing statistics for the dynamically selected clean speech feature set for use in at least one of a speech recognition system and a speaker recognition system.
    Type: Grant
    Filed: December 26, 2019
    Date of Patent: May 3, 2022
    Assignee: Tata Consultancy Services Limited
    Inventors: Ashish Panda, Sunilkumar Kopparapu, Sonal Sunil Joshi
  • Patent number: 11322134
    Abstract: An artificial intelligence (AI) device may acquire a probability that a received speech signal is classified as a noise signal, calculate a confidence level of a first model for determining to which phoneme the speech signal belongs, based on the speech signal, determine a weight of the first model based on the probability and the confidence level of the first model, and output a speech recognition result of the speech signal using the determined weight of the first model.
    Type: Grant
    Filed: April 22, 2020
    Date of Patent: May 3, 2022
    Assignee: LG ELECTRONICS INC.
    Inventor: Hwansik Yun
  • Patent number: 11315576
    Abstract: Methods and systems for advanced stereo processing of an audio signal are disclosed. The methods and systems include selecting a coding mode of either transform coding or linear predictive coding and performing advanced stereo processing when in the selected coding mode. Both encoding and decoding operations are provided.
    Type: Grant
    Filed: June 6, 2019
    Date of Patent: April 26, 2022
    Assignee: Dolby International AB
    Inventors: Heiko Purnhagen, Pontus Carlsson, Kristofer Kjoerling
  • Patent number: 11315542
    Abstract: An active noise control device includes: a control target signal extractor for extracting a signal component of a control target frequency from an error signal as a control target signal which is a complex-valued signal having a real part and an imaginary part; a control signal generator for generating a control signal for controlling a control actuator, by signal-processing the control target signal through a control filter; and a control filter coefficient updater for successively and adaptively updating the coefficient of the control filter.
    Type: Grant
    Filed: March 29, 2021
    Date of Patent: April 26, 2022
    Assignee: Honda Motor Co., Ltd.
    Inventors: Xun Wang, Toshio Inoue
  • Patent number: 11317202
    Abstract: Methods and devices for processing and voice operated control are provided. The method can include performing a non-difference comparison between a first received sound and a second received sound, determining if speech exists based on the comparison, and transmitting or providing a decision that the speech is present to at least one among the device, a cell phone, a media player, or a portable computing device. Other embodiments are disclosed.
    Type: Grant
    Filed: February 26, 2020
    Date of Patent: April 26, 2022
    Assignee: Staton Techiya, LLC
    Inventors: John Usher, Steven Goldstein, Marc Boillot
  • Patent number: 11295759
    Abstract: Systems and methods are provided for measuring the distortion and muffling caused by a face mask. For example, in one embodiment a simulated voice source produces a sound. The sound is then acoustically coupled to a simulated vocal tract and a face mask. A microphone receives sound and produces a signal and an analyzer receives the signal from the microphone. A manikin head or other facial structure may also simulate fitting of the face mask onto a face. The analyzer may further produce a quantitative assessment of the distortion and muffling of the face mask, for example, by comparing at least one spectrum obtained with the face mask and at least one spectrum obtained without the face mask.
    Type: Grant
    Filed: January 30, 2021
    Date of Patent: April 5, 2022
    Assignee: Acoustic Mask LLC
    Inventor: Martin Rothenberg
  • Patent number: 11295543
    Abstract: Embodiments of the present disclosure relate to object detection in an image. In an embodiment, a computer-implemented method is disclosed. According to the method, image data representing a scene is obtained and sound distribution information related to the scene is obtained. A detection strategy to be applied in object detection is determined based on the sound distribution information. The object detection is performed on the image data by applying the detection strategy. In other embodiments, a system and a computer program product are disclosed.
    Type: Grant
    Filed: March 31, 2020
    Date of Patent: April 5, 2022
    Assignee: International Business Machines Corporation
    Inventors: Wen Liu, Shi Lei Zhang, Qin Shi, Songfang Huang
  • Patent number: 11290814
    Abstract: A method, apparatus, and computer-readable storage medium that modulate a composition of an audio output in accordance with a noise level of an environment. For instance, the present disclosure describes a method for modulating an audio output of a microphone array, comprising receiving two or more audio signals from two or more microphone capsules in the microphone array, each audio signal comprising an electrical noise of a corresponding microphone capsule and a response to acoustic stimuli in an environment perceived by the microphone capsule, estimating an acoustic contribution level of the environment based on the received audio signals, and determining, by processing circuitry, a composition of the audio output of the microphone array based on the estimated acoustic contribution level of the environment, the composition being based on at least a relationship between acoustic noise and directivity indices of each of a plurality of beamformers.
    Type: Grant
    Filed: December 15, 2020
    Date of Patent: March 29, 2022
    Assignee: Valeo North America, Inc.
    Inventors: Brandon Hook, Daniel Soberal
  • Patent number: 11270720
    Abstract: A method includes selecting a frame of an audio signal. The method further includes determining a first power spectral density (PSD) distribution of the frame. The method further includes generating a first reference PSD distribution indicating an estimate of background noise in the frame based on a non-linear weight, a second reference PSD distribution of a previous frame of the audio signal, and a second PSD distribution of the previous frame. The method further includes determining whether voice activity is detected in the frame based on the first PSD distribution of the frame and the first reference PSD distribution.
    Type: Grant
    Filed: December 30, 2019
    Date of Patent: March 8, 2022
    Assignee: TEXAS INSTRUMENTS INCORPORATED
    Inventors: Aditya Sunil Joglekar, Charles Kasimer Sestok, IV, David Patrick Magee
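The recursion in the abstract of patent 11270720 (a reference PSD built from the previous reference PSD, the previous frame's PSD, and a non-linear weight) can be sketched as follows. The abstract does not give the weight function or the decision rule, so both are assumptions here: the weight shrinks when the previous frame's PSD far exceeds the reference, and voice is declared when the mean excess over the reference passes a dB threshold. All function and parameter names are hypothetical, and PSD values are assumed to be strictly positive.

```python
import math


def update_reference(prev_ref, prev_psd, a_max=0.5):
    """Blend the previous frame's PSD into the background-noise reference.

    The per-bin weight a is non-linear in the ratio prev_psd / prev_ref
    (an assumed form): loud frames, likely speech, barely move the reference.
    """
    ref = []
    for r, p in zip(prev_ref, prev_psd):
        ratio = p / r
        a = a_max / (1.0 + ratio * ratio)
        ref.append((1.0 - a) * r + a * p)
    return ref


def detect_voice(psd, ref, threshold_db=6.0):
    """Declare voice when the frame's mean excess over the reference is large."""
    excess_db = [10.0 * math.log10(p / r) for p, r in zip(psd, ref)]
    return sum(excess_db) / len(excess_db) > threshold_db


def run_vad(psd_frames, initial_ref):
    """Return one voice/no-voice decision per frame of PSD values."""
    ref, prev, decisions = list(initial_ref), None, []
    for psd in psd_frames:
        if prev is not None:
            ref = update_reference(ref, prev)
        decisions.append(detect_voice(psd, ref))
        prev = psd
    return decisions
```

Because the weight collapses for loud frames, a short burst of speech leaves the reference close to the noise level, so the frame after the burst is again classified as noise.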
  • Patent number: 11259119
    Abstract: Methods, systems, and devices for signal processing are described. Generally, as provided for by the described techniques, a wearable device may receive an input audio signal from one or more outer microphones, an input audio signal from one or more inner microphones, and a bone conduction signal from a bone conduction sensor based on the input audio signals. The wearable device may filter the bone conduction signal based on a set of frequencies of the input audio signals, such as a low frequency portion of the input audio signals. For example, the wearable device may apply a filter to the bone conduction signal that accounts for an error in the input audio signals. The wearable device may add a gain to the filtered bone conduction signal and may equalize the filtered bone conduction signal based on the gain. The wearable device may output an audio signal to a speaker.
    Type: Grant
    Filed: October 6, 2020
    Date of Patent: February 22, 2022
    Assignee: QUALCOMM Incorporated
    Inventors: Lae-Hoon Kim, Rogerio Guedes Alves, Jacob Jon Bean, Erik Visser
  • Patent number: 11252497
    Abstract: Headphones providing a fully natural interface are described. According to one aspect of such headphones, the headphones comprise a microphone configured for capturing an ambient sound, a speaker configured for playing audio signals, a command interface configured for receiving one or more external control commands, and a control unit having an ambient sound monitoring function. The control unit captures the ambient sound through the microphone, and automatically causes the headphones to enter an interactive mode when a preset sound of interest is detected in the ambient sound. The control unit controls the headphones to output an interactive reminder in the interactive mode, and the interactive reminder comprises one or more of a visual reminder, a tactile reminder and an auditory reminder. Thus, interaction between the user and the ambience can be realized in a fully natural interface manner according to user preferences.
    Type: Grant
    Filed: August 9, 2020
    Date of Patent: February 15, 2022
    Assignee: Nanjing Zgmicro Company Limited
    Inventors: David Xiaodong Yang, Tiffany Xiao Chen
  • Patent number: 11245788
    Abstract: Systems, methods, and devices are disclosed for detecting an active speaker in a two-way conference. Real time audio in one or more sub band domains is analyzed according to an echo canceller model. Based on the analyzed real time audio, one or more audio metrics are determined from output from an acoustic echo cancellation linear filter. The one or more audio metrics are weighted based on a priority, and a speaker status is determined based on the weighted one or more audio metrics being analyzed according to an active speaker detection model. For an active speaker status, one or more residual echo or noise is removed from the real time audio based on the one or more audio metrics.
    Type: Grant
    Filed: August 26, 2020
    Date of Patent: February 8, 2022
    Assignee: CISCO TECHNOLOGY, INC.
    Inventors: Fuling Liu, Eric Chen, Wei Li, Wei-Lien Hsu
  • Patent number: 11237630
    Abstract: Invoking one or more previously dormant functions of an automated assistant in response to detecting, based on processing of vision data from one or more vision components: (1) a particular gesture (e.g., of one or more “invocation gestures”) of a user; and/or (2) detecting that a gaze of the user is directed at an assistant device that provides an automated assistant interface (graphical and/or audible) of the automated assistant. For example, the previously dormant function(s) can be invoked in response to detecting the particular gesture, detecting that the gaze of the user is directed at an assistant device for at least a threshold amount of time, and optionally that the particular gesture and the directed gaze of the user co-occur or occur within a threshold temporal proximity of one another.
    Type: Grant
    Filed: December 3, 2020
    Date of Patent: February 1, 2022
    Assignee: Google LLC
    Inventors: Yuan Yuan, Kenneth Mixter, Tuan Nguyen
  • Patent number: 11227587
    Abstract: The present disclosure relates to a method, apparatus, and computer-readable storage medium that modulate an audio output of a microphone array in order to isolate speech of a talker in a vehicle. For instance, the present disclosure describes a method for modulating an audio output of a microphone array, comprising receiving two or more audio signals from two or more microphone capsules in the microphone array, each audio signal comprising a response to acoustic stimuli in an environment perceived by a respective microphone capsule of the microphone array, estimating an acoustic noise contribution level of the environment based on the received audio signals, estimating a voice contribution level of the environment based on the received audio signals, and determining, by processing circuitry, a direct voice contribution level of the environment based on the estimated acoustic noise contribution level of the environment and the estimated voice contribution level of the environment.
    Type: Grant
    Filed: December 23, 2019
    Date of Patent: January 18, 2022
    Assignee: Peiker acustic GmbH
    Inventor: Brandon Hook
  • Patent number: 11227621
    Abstract: The present disclosure provides new variants of non-negative matrix factorization suitable for separating desired audio content from undesired audio content. In certain embodiments, a multi-dimensional non-negative representation of an audio signal is decomposed into desired content and undesired content by performing convolutional non-negative matrix factorization (CNMF) on multiple layers, each layer having a respective non-negative matrix representation. In certain embodiments, the desired content is represented by a first dictionary and the undesired content is represented by a second dictionary, and sparsity is imposed on activations of basic elements of the first or the second dictionary, wherein a degree of sparsity is controlled by setting a minimum number of components with significant activations of the first or second dictionary, respectively.
    Type: Grant
    Filed: September 16, 2019
    Date of Patent: January 18, 2022
    Assignee: Dolby International AB
    Inventors: Pedro Jafeth Villasana Tinajero, Stanislaw Gorlow
  • Patent number: 11223916
    Abstract: A method operates a hearing device. The hearing device has a microphone by which ambient sound is picked up and is converted into an input signal that has a wanted component and a noise component. A stationarity of the input signal is determined. A signal-to-noise ratio of the input signal is determined on a basis of a scaling factor. The scaling factor is determined on a basis of the stationarity, namely on a basis of a function that indicates the scaling factor on a basis of the stationarity of the input signal. A corresponding hearing device implements such a method.
    Type: Grant
    Filed: September 18, 2020
    Date of Patent: January 11, 2022
    Assignee: Sivantos Pte. Ltd.
    Inventors: Homayoun Kamkar-Parsi, Jens Hain
  • Patent number: 11217076
    Abstract: Tampering with an audio/video (A/V) recording and communication device is detected based on audio data captured by a microphone and/or video data captured by a camera of the A/V recording and communication device. The detection of the tampering may be based on, for example, processing of the audio and/or video data. Additional data may be collected and/or other actions taken in response to detection of the tampering.
    Type: Grant
    Filed: January 28, 2019
    Date of Patent: January 4, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: James Siminoff, Michael Jason Mitura
  • Patent number: 11215472
    Abstract: An information providing device to be mounted on a vehicle. The vehicle includes control means and control means for transmitting, when the vehicle is controlled by automatic driving, a control signal for outputting route guidance information for informing a driver of the vehicle about a specified route by mechanical speech, and a control signal for outputting automatic driving guidance information for informing the driver of the vehicle about a route of the vehicle driven by automatic driving by a sound effect.
    Type: Grant
    Filed: October 3, 2019
    Date of Patent: January 4, 2022
    Assignee: TOYOTA JIDOSHA KABUSHIKI KAISHA
    Inventors: Akira Iijima, Hironobu Sugimoto, Hiroaki Sakakibara
  • Patent number: 11200890
    Abstract: Aspects of the present disclosure relate to distinguishing voice commands. One or more stored blocked directions of background voice noise from one or more audio output devices for a location of a voice command device are accessed. A voice input is received at the voice command device at the location and a determination is made that the voice input is received from a blocked direction. A status of an audio output device is queried to determine whether it is emitting audio. In response to a determination that the audio output device is currently emitting audio, an audio file is obtained from the audio output device, the audio file corresponding to a time when the voice input was received. The obtained audio file is compared with the received voice input. The received voice input is ignored if there is a substantial match with the obtained audio file.
    Type: Grant
    Filed: May 1, 2018
    Date of Patent: December 14, 2021
    Assignee: International Business Machines Corporation
    Inventors: Jack Dunning, Daniel T. Cunnington, Eunjin Lee, Giacomo G. Chiarella, John J. Wood
  • Patent number: 11183197
    Abstract: A method for processing audio data includes obtaining a first noise frame of an audio signal, wherein the first noise frame includes a first low-band signal and a first high-band signal, obtaining a first low-band parameter corresponding to the first low-band signal and a first high-band parameter corresponding to the first high-band signal, encoding a first silence insertion descriptor (SID) corresponding to the first noise frame to comprise the first low-band parameter and the first high-band parameter, obtaining a second noise frame of the audio signal, wherein the second noise frame includes a second low-band signal and a second high-band signal, where the first noise frame is prior to the second noise frame in the audio signal, and determining whether a second SID corresponding to the second noise frame should comprise a second high-band parameter of the second high-band signal.
    Type: Grant
    Filed: November 27, 2019
    Date of Patent: November 23, 2021
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventor: Zhe Wang
  • Patent number: 11170766
    Abstract: A system has multiple audio-enabled devices that communicate with one another over an open microphone mode of communication. When a user says a trigger word, the nearest device validates the trigger word and opens a communication channel with another device. As the user talks, the device receives the speech and generates an audio signal representation that includes the user speech and may additionally include other background or interfering sound from the environment. The device transmits the audio signal to the other device as part of a conversation, while continually analyzing the audio signal to detect when the user stops talking. This analysis may include watching for a lack of speech in the audio signal for a period of time, or an abrupt change in context of the speech (indicating the speech is from another source), or canceling noise or other interfering sound to isolate whether the user is still speaking.
    Type: Grant
    Filed: February 25, 2019
    Date of Patent: November 9, 2021
    Assignee: Amazon Technologies, Inc.
    Inventors: Ty Loren Carlson, Rohan Mutagi
  • Patent number: 11170799
    Abstract: Embodiments of the present disclosure set forth a method of decomposing an audio signal into a set of sub-band signals and detecting a set of signal energy values, where each signal energy value is associated with a sub-band signal. The method also includes generating a noise reduction threshold based on at least one sub-band signal, and, for each sub-band signal, comparing the associated signal energy value to the noise reduction threshold. Based on determining that at least one sub-band signal is associated with a signal energy value below the noise reduction threshold, the method includes attenuating the at least one sub-band signal to generate a set of attenuated sub-band signals. The method also includes combining at least one sub-band signal included in the set of sub-band signals with at least one attenuated sub-band signal included in the set of attenuated sub-band signals to generate an output audio signal.
    Type: Grant
    Filed: February 13, 2019
    Date of Patent: November 9, 2021
    Assignee: Harman International Industries, Incorporated
    Inventor: James Lambrick
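The sub-band gating steps in the abstract of patent 11170799 can be sketched on a single frame as below. Several choices are assumptions not found in the abstract: the sub-bands are formed by grouping DFT bins (a naive O(n²) DFT keeps the sketch dependency-free), the threshold is a fixed fraction of the strongest band's energy, and the names `gate_subbands`, `rel_threshold`, and `atten` are hypothetical.

```python
import cmath
import math


def dft(x):
    """Naive discrete Fourier transform; adequate for short frames."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(spectrum):
    """Inverse of dft(); returns the real part for real-valued signals."""
    n = len(spectrum)
    return [(sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n)
                 for k in range(n)) / n).real for t in range(n)]


def gate_subbands(frame, num_bands=4, rel_threshold=0.1, atten=0.1):
    """Attenuate low-energy sub-bands and recombine them with the rest."""
    n = len(frame)
    spectrum = dft(frame)
    half = n // 2 + 1                       # non-negative-frequency bins
    band_of = [k * num_bands // half for k in range(half)]
    # Signal energy value for each sub-band.
    energy = [0.0] * num_bands
    for k in range(half):
        energy[band_of[k]] += abs(spectrum[k]) ** 2
    # Threshold derived from the strongest sub-band (assumed rule).
    threshold = rel_threshold * max(energy)
    gains = [atten if e < threshold else 1.0 for e in energy]
    shaped = list(spectrum)
    for k in range(half):
        g = gains[band_of[k]]
        shaped[k] = spectrum[k] * g
        if k != 0 and k != n - k:           # keep conjugate symmetry
            shaped[n - k] = spectrum[n - k] * g
    return idft(shaped), energy, gains
```

Bands at or above the threshold keep unit gain, so the output is the sum of untouched and attenuated sub-bands, mirroring the combining step in the abstract.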
  • Patent number: 11169773
    Abstract: A voice-interaction system enables mobile workers to capture measurements, observations and complete inspections using their voice as they move about, leaving one or both hands and eyes free to safely and effectively focus on work tasks. The system has the flexibility to recognize highly specialized vocabulary, prompting for and error-checking utterances that are unique to an industry, company, government agency, user or specific task. This data is saved and formatted to be viewed, listened to, or input into a structured data-base for further use.
    Type: Grant
    Filed: July 12, 2017
    Date of Patent: November 9, 2021
    Assignee: TekWear, LLC
    Inventors: John David Swansey, Bruce Wayne Rasa, Bruce Balentine, Erick Christian Kobres
  • Patent number: 11164593
    Abstract: A hearing device includes: an input module for provision of a first input signal; a processor configured to provide an electrical output signal based on the first input signal; a receiver configured to provide an audio output signal; and a controller comprising a speech intelligibility estimator configured to determine a speech intelligibility indicator indicative of speech intelligibility based on the first input signal, wherein the controller is configured to control the processor based on the speech intelligibility indicator; wherein the speech intelligibility estimator comprises a decomposition module configured to decompose the first input signal into a first representation of the first input signal in a frequency domain, wherein the first representation comprises one or more elements representative of the first input signal; and wherein the decomposition module comprises one or more characterization blocks for characterizing the one or more elements of the first representation in the frequency domain.
    Type: Grant
    Filed: June 19, 2018
    Date of Patent: November 2, 2021
    Assignee: GN Hearing A/S
    Inventors: Charlotte Sørensen, Jesper B. Boldt, Angeliki Xenaki, Mathew Shaji Kavalekalam, Mads G Christensen
  • Patent number: 11159258
    Abstract: The invention addresses the problem of recovering an unknown signal from multiple records of brief duration which are presumed to contain the signal at mutually random delays in a background of independent noise. The scenario is relevant to many applications, among which are the recovery of weak transients from large arrays of sensors and the identification of recurring patterns through a comparison of sequential intervals within a single record of longer duration. A simple and practical approach is provided by solving this problem through higher-order spectra. Applying the method to the third-order spectrum, the bispectrum, leads to filters derived from cross bicoherence.
    Type: Grant
    Filed: July 3, 2019
    Date of Patent: October 26, 2021
    Assignee: University of Iowa Research Foundation
    Inventor: Christopher K. Kovach
  • Patent number: 11146607
    Abstract: A system and method are described for smart noise cancellation. In some implementations, the system may receive audio data describing an audio signal, which the system may use to determine a set of frames of the audio signal. Spectral analysis, which may include a signal-to-noise ratio estimate, may be performed on the one or more frames of the audio. In some instances, the system may identify a noise frame from among the one or more frames based on the spectral analysis, and may reduce noise in the one or more frames based on the noise frame and the spectral analysis on the one or more frames of the audio signal.
    Type: Grant
    Filed: May 31, 2019
    Date of Patent: October 12, 2021
    Assignee: DIALPAD, INC.
    Inventors: Qian-Yu Tang, John Rector, Kathiravan Ramaswami
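    Illustrative sketch (not taken from the patent): the abstract above describes picking a noise frame by spectral analysis and using it to reduce noise in the other frames. The Python below swaps in plain power spectral subtraction as a stand-in. The function names, the "lowest-energy frame is noise" rule, and the `floor` parameter are all assumptions made for illustration; frames are assumed to be precomputed per-bin power spectra.

```python
import math


def frame_energy(power_spectrum):
    """Total power of one frame (sum over frequency bins)."""
    return sum(power_spectrum)


def pick_noise_frame(frames):
    """Return the index of the frame assumed to contain only noise.

    Assumption: the lowest-energy frame is noise-only.
    """
    return min(range(len(frames)), key=lambda i: frame_energy(frames[i]))


def estimate_snr_db(frame, noise):
    """Rough SNR estimate of a frame against the chosen noise frame, in dB."""
    noise_e = max(frame_energy(noise), 1e-12)
    signal_e = max(frame_energy(frame) - noise_e, 1e-12)
    return 10.0 * math.log10(signal_e / noise_e)


def spectral_subtract(frames, noise, floor=0.05):
    """Subtract the noise power from every bin, keeping a small spectral floor."""
    return [
        [max(p - n, floor * p) for p, n in zip(frame, noise)]
        for frame in frames
    ]


frames = [[4.0, 9.0], [1.0, 1.0], [5.0, 3.0]]
noise_index = pick_noise_frame(frames)
cleaned = spectral_subtract(frames, frames[noise_index])
```

    The floor keeps a fraction of each bin's original power so that fully subtracted bins do not drop to zero, which is a common way to limit "musical noise" artifacts.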
  • Patent number: 11138986
    Abstract: A method for filtering a sound signal acquired by a voice recognition system is proposed wherein the filter used to delete the unwanted sound signal is based on a noise model created from a history of preceding acquisitions of the sound signal. The method is based on the history for predicting the characteristics of the signal in the course of acquisition and relies on this prediction to establish the noise model used by the filtering function.
    Type: Grant
    Filed: September 19, 2019
    Date of Patent: October 5, 2021
    Assignee: SAGEMCOM BROADBAND SAS
    Inventor: Jérôme Berger
  • Patent number: 11087774
    Abstract: A log spectral envelope sequence $L_0, L_1, \ldots, L_{N-1}$ and an envelope code for the log spectral envelope sequence $L_0, L_1, \ldots, L_{N-1}$ are obtained. The log spectral envelope sequence $L_0, L_1, \ldots, L_{N-1}$ is an integer value sequence corresponding to binary logarithms of respective sample values of a spectral envelope sequence and is an integer value sequence whose total sum is 0. For a quantized spectral sequence $\hat{X}_0, \hat{X}_1, \ldots, \hat{X}_{N-1}$, a smoothed spectral sequence $\tilde{X}_0, \tilde{X}_1, \ldots, \tilde{X}_{N-1}$ is obtained by: for $\hat{X}_k$ with $L_k$ being a positive value, adopting $\hat{X}_k$ with $L_k$ digits from its least significant digit removed as $\tilde{X}_k$; for $\hat{X}_k$ with $L_k$ being a negative value, adopting $\hat{X}_k$ with $-L_k$ digits added to its least significant digit in accordance with a predefined rule as $\tilde{X}_k$; and when $L_k$ is 0, adopting $\hat{X}_k$ as $\tilde{X}_k$.
    Type: Grant
    Filed: April 24, 2018
    Date of Patent: August 10, 2021
    Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Ryosuke Sugiura, Yutaka Kamamoto, Takehiro Moriya
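    Illustrative sketch (not taken from the patent): the smoothing rule in the abstract above can be read as a per-coefficient bit shift. The Python below assumes the "digits" are binary digits (the envelope values are binary logarithms), that the shift acts on the magnitude with the sign kept, and that the unspecified "predefined rule" for added digits is simply appending zeros. All three are assumptions, as are the function names.

```python
def smooth_coefficient(x_hat, l_k):
    """Shift one quantized coefficient by its log-envelope value.

    l_k > 0: drop l_k least significant binary digits.
    l_k < 0: append -l_k binary digits (zeros, by assumption).
    l_k == 0: keep the coefficient unchanged.
    """
    sign = -1 if x_hat < 0 else 1
    magnitude = abs(x_hat)
    if l_k > 0:
        magnitude >>= l_k
    elif l_k < 0:
        magnitude <<= -l_k
    return sign * magnitude


def smooth_sequence(x_hat, log_envelope):
    """Apply the per-coefficient rule across the whole quantized spectrum."""
    if len(x_hat) != len(log_envelope):
        raise ValueError("sequences must have the same length")
    return [smooth_coefficient(x, l) for x, l in zip(x_hat, log_envelope)]


# The envelope values sum to 0, as the abstract requires.
smoothed = smooth_sequence([40, -40, 5, 3], [2, 2, 0, -4])
```

    Because the shifts are integer operations, large coefficients lose precision where the envelope is high and small ones gain headroom where it is low, which flattens the spectrum before coding.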
  • Patent number: 11081120
    Abstract: A method for encoded-sound determination performed by a computer includes: executing a first process that includes obtaining information indicating intensities of sound signals, the intensities being calculated from the sound signals and corresponding to frequencies; and executing a second process that includes determining whether or not the sound signals are signals of encoded sound, based on whether or not the intensities of the sound signals in predetermined frequency bands that are adjacent to each other in a frequency direction have a difference that is larger than or equal to a predetermined threshold.
    Type: Grant
    Filed: March 22, 2019
    Date of Patent: August 3, 2021
    Assignee: FUJITSU LIMITED
    Inventors: Akira Kamano, Masanao Suzuki, Nobuyuki Washio, Yohei Kishi
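    Illustrative sketch (not taken from the patent): the decision rule in the abstract above compares intensities in adjacent frequency bands against a threshold. The Python below assumes per-bin intensities in dB, equal-width bands, and an absolute difference test. The band size, the threshold value and the function names are hypothetical.

```python
def band_means(intensities_db, band_size):
    """Average the per-bin intensities over consecutive equal-width bands."""
    return [
        sum(intensities_db[i:i + band_size]) / band_size
        for i in range(0, len(intensities_db) - band_size + 1, band_size)
    ]


def looks_encoded(intensities_db, band_size=4, threshold_db=30.0):
    """Flag a spectrum whose adjacent bands differ by at least threshold_db."""
    bands = band_means(intensities_db, band_size)
    return any(abs(a - b) >= threshold_db for a, b in zip(bands, bands[1:]))


# A sharp drop between adjacent bands versus a gentle roll-off.
sharp_cutoff = [-20.0] * 8 + [-90.0] * 8
gentle_slope = [-20.0 - 2.0 * i for i in range(16)]
```

    The intuition behind such a test is that lossy codecs often discard content above a cutoff frequency, leaving a step in the spectrum that natural recordings rarely show.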
  • Patent number: 11069365
    Abstract: A mechanism is described for facilitating wind detection and wind noise reduction in computing environments according to one embodiment. An apparatus of embodiments, as described herein, includes wind detection logic to detect wind associated with the apparatus including a wearable computing device, wherein the wind is detected based on samples from multiple microphones and extraction and use of multiple features including spectral sub-band centroid (SSC) features and coherence features; and decision and execution logic to reduce wind noise associated with the detected wind.
    Type: Grant
    Filed: March 30, 2018
    Date of Patent: July 20, 2021
    Assignee: INTEL CORPORATION
    Inventors: Swarnendu Kar, Anthony Rhodes
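    Illustrative sketch (not taken from the patent): one of the features named in the abstract above is the spectral sub-band centroid (SSC), the power-weighted mean frequency within each sub-band. The Python below computes it from a power spectrum. The band edges, the midpoint fallback for empty bands and the function name are assumptions, and the coherence features and the detection decision are not shown.

```python
def subband_centroids(power, freqs, edges):
    """Power-weighted mean frequency for each sub-band [edges[i], edges[i+1])."""
    centroids = []
    for lo, hi in zip(edges, edges[1:]):
        idx = [i for i, f in enumerate(freqs) if lo <= f < hi]
        total = sum(power[i] for i in idx)
        if total > 0:
            centroid = sum(freqs[i] * power[i] for i in idx) / total
        else:
            # Assumption: fall back to the band midpoint when the band is silent.
            centroid = (lo + hi) / 2.0
        centroids.append(centroid)
    return centroids


freqs = [0.0, 100.0, 200.0, 300.0]
power = [1.0, 3.0, 0.0, 0.0]
centroids = subband_centroids(power, freqs, [0.0, 200.0, 400.0])
```

    Wind noise tends to concentrate energy at low frequencies, so centroids pulled toward the lower edge of each band are one plausible cue a detector could use.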
  • Patent number: 11057705
    Abstract: A method includes generating, by a noise generator of an apparatus, a noise signal stream. The method includes outputting, via an audio output device of the apparatus, the noise signal stream towards an audio input device of a personal communication device. The noise signal stream travels from the audio output device towards the audio input device via an audio-sealing pathway. The method includes receiving a recorded representation of the noise signal stream from the personal communication device. The recorded representation of the noise signal stream is a function of a transformation of the noise signal stream, by the audio-sealing pathway, and characterizes an audio transfer function of the audio-sealing pathway. The method includes determining, by the controller of the apparatus, whether or not the audio-sealing pathway satisfies an operational criterion based on the recorded representation of the noise signal stream and a reference transfer function associated with the audio-sealing pathway.
    Type: Grant
    Filed: March 23, 2020
    Date of Patent: July 6, 2021
    Assignee: PPIP, LLC
    Inventors: Michael Fong, Neric Hsin-wu Fong, Teddy David Thomas
  • Patent number: 11056128
    Abstract: An apparatus for processing an audio signal includes an audio signal analyzer and a filter. The audio signal analyzer is configured to analyze an audio signal to determine a plurality of noise suppression filter values for a plurality of bands of the audio signal, wherein the analyzer is configured to determine a noise suppression filter value so that a noise suppression filter value is greater than or equal to a minimum noise suppression filter value and so that the minimum noise suppression value depends on a characteristic of the audio signal. The filter is configured for filtering the audio signal, wherein the filter is adjusted based on the noise suppression filter values.
    Type: Grant
    Filed: April 11, 2019
    Date of Patent: July 6, 2021
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Anthony Lombard, Bernhard Birzer, Dirk Mahne, Edwin Mabande, Fabian Kuech, Emanuel Habets, Paolo Annibale
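    Illustrative sketch (not taken from the patent): the abstract above describes per-band suppression gains bounded below by a minimum that depends on a characteristic of the signal. The Python below assumes that characteristic is the mean estimated noise power, and picks the minimum so that residual noise lands near a target level. The Wiener-style gain, the target value, the `lower` clamp and the function names are hypothetical.

```python
import math


def minimum_gain(noise_power, target_noise_power=1e-4, lower=0.05):
    """Signal-dependent gain floor: louder noise allows deeper suppression."""
    mean_noise = sum(noise_power) / len(noise_power)
    if mean_noise <= 0:
        return 1.0  # nothing to suppress
    return max(lower, min(1.0, math.sqrt(target_noise_power / mean_noise)))


def suppression_gains(signal_power, noise_power, **floor_options):
    """Per-band gains, each greater than or equal to the adaptive minimum."""
    g_min = minimum_gain(noise_power, **floor_options)
    gains = []
    for s, n in zip(signal_power, noise_power):
        wiener = max(0.0, 1.0 - n / s) if s > 0 else 0.0
        gains.append(max(g_min, wiener))
    return gains


gains = suppression_gains([1.0, 0.01], [0.01, 0.01])
```

    A floor that tracks the noise level keeps the residual noise roughly constant instead of applying the same maximum attenuation in quiet and loud conditions.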
  • Patent number: 11056130
    Abstract: The present disclosure provides a speech enhancement method and apparatus, a device and a storage medium. The method includes: acquiring a first speech signal and a second speech signal; obtaining a signal to noise ratio of the first speech signal; determining, according to the signal to noise ratio of the first speech signal, a fusion coefficient of filtered signals corresponding to the first speech signal and the second speech signal; and performing, according to the fusion coefficient, speech fusion processing on the filtered signals corresponding to the first speech signal and the second speech signal to obtain an enhanced speech signal. Thereby, it is realized that a fusion coefficient of speech signals of a non-air conduction speech sensor and an air conduction speech sensor is adaptively adjusted according to environment noise, thereby improving the signal quality after speech fusion, and improving the effect of speech enhancement.
    Type: Grant
    Filed: October 23, 2019
    Date of Patent: July 6, 2021
    Assignee: SHENZHEN GOODIX TECHNOLOGY CO., LTD.
    Inventors: Hu Zhu, Xinshan Wang, Guoliang Li, Duan Zeng, Hongjing Guo
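    Illustrative sketch (not taken from the patent): the abstract above derives a fusion coefficient from the signal-to-noise ratio of the first speech signal and uses it to blend two filtered signals. The Python below assumes the first signal comes from the air conduction sensor, maps its SNR to a weight with a logistic curve, and blends sample by sample. The curve, its `midpoint_db` and `slope` parameters, and the function names are hypothetical.

```python
import math


def fusion_coefficient(snr_db, midpoint_db=10.0, slope=0.5):
    """Map the first signal's SNR (dB) to a weight in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-slope * (snr_db - midpoint_db)))


def fuse(first, second, k):
    """Blend two equally long filtered signals with weight k on the first."""
    if len(first) != len(second):
        raise ValueError("signals must have the same length")
    return [k * a + (1.0 - k) * b for a, b in zip(first, second)]


k = fusion_coefficient(10.0)
fused = fuse([1.0, 1.0], [0.0, 0.0], k)
```

    With this mapping, a clean first signal dominates the output, while in heavy noise the weight shifts toward the second signal.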
  • Patent number: 11037556
    Abstract: Method and apparatus are disclosed for speech recognition for vehicle voice commands. An example vehicle includes a microphone to collect a signal including a voice command, memory, and a controller. The controller is configured to determine an initial identification by feeding the signal into a first automatic speech recognition (ASR) engine and determine habits by feeding user history into a habits engine. The controller also is configured to identify the voice command by feeding the signal, the initial identification, and the habits into a second ASR engine. The controller also is configured to perform a vehicle function based on the voice command.
    Type: Grant
    Filed: July 17, 2018
    Date of Patent: June 15, 2021
    Assignee: Ford Global Technologies, LLC
    Inventors: Ranjani Rangarajan, John Edward Huber, Leah N. Busch, Joshua Wheeler, Scott Andrew Amman
  • Patent number: 11031020
    Abstract: A speech/audio bitstream decoding method includes acquiring a speech/audio decoding parameter of a current speech/audio frame, where the foregoing current speech/audio frame is a redundant decoded frame or a speech/audio frame previous to the foregoing current speech/audio frame is a redundant decoded frame, performing post processing on the acquired speech/audio decoding parameter according to speech/audio parameters of X speech/audio frames, where the foregoing X speech/audio frames include M speech/audio frames previous to the foregoing current speech/audio frame and/or N speech/audio frames next to the foregoing current speech/audio frame, and recovering a speech/audio signal using the post-processed speech/audio decoding parameter of the foregoing current speech/audio frame. The technical solutions of the speech/audio bitstream decoding method help improve quality of an output speech/audio signal.
    Type: Grant
    Filed: March 19, 2019
    Date of Patent: June 8, 2021
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Xingtao Zhang, Zexin Liu, Lei Miao
  • Patent number: 11025462
    Abstract: Systems and methods are disclosed herein for modifying modulated signals for transmission. The system receives a modulated signal comprising a speech signal and a carrier wave and generates first and second spectral signals by converting the modulated signal and the carrier wave, respectively, from the time domain to the frequency domain. The system then determines spectral bands for the first and second spectral signals. For each spectral band, the system calculates a weighted spectral band value based on a magnitude of the first spectral signal within the spectral band and generates a modified spectral signal by modifying the second spectral signal with the weighted spectral band value. The system then converts the modified spectral signal from the frequency domain to the time domain and transmits the converted modified spectral signal to a server.
    Type: Grant
    Filed: January 30, 2020
    Date of Patent: June 1, 2021
    Assignee: Rovi Guides, Inc.
    Inventor: Jeffry Copps
  • Patent number: 11024330
    Abstract: A signal processing apparatus includes a detection unit configured to perform a voice detection process on each of a plurality of audio signals captured by a plurality of microphones arranged at mutually different positions, a determination unit configured to determine a degree of similarity between two or more of the plurality of audio signals in which voice is detected by the detection unit, and a suppression unit configured to perform a process of suppressing the voice contained in at least one of the two or more audio signals, in response to a determination by the determination unit that the degree of similarity between the two or more audio signals is less than a threshold.
    Type: Grant
    Filed: April 16, 2019
    Date of Patent: June 1, 2021
    Assignee: CANON KABUSHIKI KAISHA
    Inventor: Masanobu Funakoshi
  • Patent number: 11024324
    Abstract: Disclosed herein is a method for RNN-based noise reduction in a real-time conference, comprising: framing and windowing a speech signal to obtain a logarithmic spectrum of the speech signal, feeding the logarithmic spectrum into the RNN model to determine a noise reduction suppression coefficient, and then obtaining the denoised speech signal by applying the noise reduction suppression coefficient to the logarithmic spectrum of the original signal, thereby enabling use of the RNN noise reduction method in real-time conferences. In the present disclosure, only the logarithmic spectrum of the current frame needs to be input to the RNN model for estimation. The RNN model of the present disclosure requires little input information and no heavy preprocessing of the received speech signal, which in turn reduces computation burden, increases response speed, and enhances real-time performance.
    Type: Grant
    Filed: August 22, 2018
    Date of Patent: June 1, 2021
    Assignee: YEALINK (XIAMEN) NETWORK TECHNOLOGY CO., LTD.
    Inventor: Yuanxun Kang
  • Patent number: 11024323
    Abstract: An encoder for providing an audio stream on the basis of a transform-domain representation of an input audio signal includes a quantization error calculator configured to determine a multi-band quantization error over a plurality of frequency bands of the input audio signal for which separate band gain information is available. The encoder also includes an audio stream provider for providing the audio stream such that the audio stream includes information describing an audio content of the frequency bands and information describing the multi-band quantization error. A decoder for providing a decoded representation of an audio signal on the basis of an encoded audio stream representing spectral components of frequency bands of the audio signal includes a noise filler for introducing noise into spectral components of a plurality of frequency bands to which separate frequency band gain information is associated on the basis of a common multi-band noise intensity value.
    Type: Grant
    Filed: July 7, 2017
    Date of Patent: June 1, 2021
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Nikolaus Rettelbach, Bernhard Grill, Guillaume Fuchs, Stefan Geyersberger, Markus Multrus, Harald Popp, Juergen Herre, Stefan Wabnik, Gerald Schuller, Jens Hirschfeld
  • Patent number: 11017799
    Abstract: The present disclosure discloses a method for processing a voice in the interior environment of a vehicle, an electronic device and a storage medium. The method includes the following. A reference audio is acquired, and the reference audio is recorded to obtain a recorded audio. A pure voice is acquired. Noise data for each part or period of the recorded audio satisfying a target signal-to-noise ratio condition pertaining to that part is selected from the recorded audio, and the noise data is superimposed on the pure voice to obtain a noisy voice. The noisy voice and the reference audio are input to an acoustic echo canceller (AEC) module as input data. The AEC module is configured to perform an echo cancellation operation on the input data to obtain training data having AEC residual noise.
    Type: Grant
    Filed: October 15, 2018
    Date of Patent: May 25, 2021
    Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.
    Inventor: Hui Song
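    Illustrative sketch (not taken from the patent): the abstract above selects noise from a recording so that a target signal-to-noise ratio condition holds, then superimposes it on a clean voice. The Python below assumes the condition is "SNR within a tolerance of a target value", scans the recording in non-overlapping segments the length of the voice, and assumes the voice is not silent. The tolerance, the segmenting scheme and the function names are hypothetical, and the echo cancellation stage is not shown.

```python
import math


def power(samples):
    """Mean squared amplitude of a list of samples."""
    return sum(v * v for v in samples) / len(samples)


def snr_db(voice, noise):
    """SNR in dB of a voice segment against a noise segment."""
    return 10.0 * math.log10(power(voice) / power(noise))


def select_noise(voice, recorded, target_db, tolerance_db=3.0):
    """Return the first recorded segment whose SNR against the voice is near target_db."""
    n = len(voice)
    for start in range(0, len(recorded) - n + 1, n):
        segment = recorded[start:start + n]
        if power(segment) > 0 and abs(snr_db(voice, segment) - target_db) <= tolerance_db:
            return segment
    return None  # no segment satisfies the condition


def superimpose(voice, noise):
    """Add the selected noise to the clean voice sample by sample."""
    return [v + n for v, n in zip(voice, noise)]


voice = [1.0, -1.0, 1.0, -1.0]
recorded = [1.0, 1.0, -1.0, -1.0, 0.1, -0.1, 0.1, -0.1]
noise = select_noise(voice, recorded, target_db=20.0)
noisy = superimpose(voice, noise)
```

    Selecting real recorded segments, rather than rescaling one noise clip, keeps the noise character of the environment at each level intact.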