Patents by Inventor Gordon Wichern

Gordon Wichern has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240170003
    Abstract: An audio processing system and method for processing audio is disclosed. The audio processing system collects an input audio signal indicative of degraded measurements of a target audio waveform. The input audio signal is restored with recursive restoration that recursively restores the input audio signal until a termination condition is met. A current iteration of the recursive restoration applies a restoration operator configured to restore a degraded audio signal conditioned on a current level of severity of degradation and degrades the degraded audio signal deterministically with a level of severity less than the current level of severity. A target signal estimate indicative of enhanced measurements of the audio waveform is generated as output.
    Type: Application
    Filed: October 23, 2023
    Publication date: May 23, 2024
    Applicant: Mitsubishi Electric Research Laboratories, Inc.
    Inventors: Jonathan Le Roux, François G. Germain, Gordon Wichern, Hao Yen
  • Patent number: 11978476
    Abstract: A system and method for detecting anomalous sound are disclosed. The method includes receiving a spectrogram of an audio signal with elements defined by values in a time-frequency domain of the spectrogram. Each of the values corresponds to an element of the spectrogram that is identified by a coordinate in the time-frequency domain. The time-frequency domain of the spectrogram is partitioned into a context region and a target region. The context region and the target region are processed by a neural network using an attentive neural process to recover values of the spectrogram for elements with coordinates in the target region. The recovered values of the elements of the target region are compared with values of elements of the partitioned target region. An anomaly score is determined based on the comparison. The anomaly score is used for performing a control action.
    Type: Grant
    Filed: September 19, 2021
    Date of Patent: May 7, 2024
    Assignee: Mitsubishi Electric Research Laboratories, Inc.
    Inventors: Gordon Wichern, Ankush Chakrabarty, Zhong-Qiu Wang, Jonathan Le Roux
  • Publication number: 20240055012
    Abstract: A system and method for reverberation reduction is disclosed. A first Deep Neural Network (DNN) produces a first estimate of a target direct-path signal from a mixture of acoustic signals that include the target direct-path signal and a reverberation of the target direct-path signal. A filter modeling a room impulse response (RIR) for the first estimate is estimated. The filter when applied to the first estimate of the target direct-path signal generates a result closest to a residual between the mixture of the acoustic signals and the first estimate of the target direct-path signal according to a distance function. The estimated filter is used for modeling the RIR.
    Type: Application
    Filed: August 15, 2022
    Publication date: February 15, 2024
    Inventors: Zhong-Qiu Wang, Gordon Wichern, Jonathan Le Roux
  • Publication number: 20240003737
    Abstract: A system and a method for detecting anomalous sound are disclosed. The method includes receiving an audio signal from a sound source in a recording environment. The sound source and the recording environment are characterized by a set of attributes including a first attribute pertaining to a first attribute type and a second attribute pertaining to a second attribute type. A multi-head neural network is trained to extract from the received audio signal a first embedding vector indicative of the first attribute type and a second embedding vector indicative of the second attribute type. The first embedding vector is compared with a first set of embedding vectors to classify attributes of the first attribute type and the second embedding vector is compared with a second set of embedding vectors to classify attributes of the second attribute type, to determine a result of anomaly detection.
    Type: Application
    Filed: March 23, 2023
    Publication date: January 4, 2024
    Inventors: Gordon Wichern, Satvik Venkatesh, Aswin Shanmugam Subramanian, Jonathan Le Roux
  • Patent number: 11790930
    Abstract: A system and method for reverberation reduction is disclosed. A first Deep Neural Network (DNN) produces a first estimate of a target direct-path signal from a mixture of acoustic signals that include the target direct-path signal and a reverberation of the target direct-path signal. A filter modeling a room impulse response (RIR) for the first estimate is estimated. The filter when applied to the first estimate of the target direct-path signal generates a result closest to a residual between the mixture of the acoustic signals and the first estimate of the target direct-path signal according to a distance function. A mixture with reduced reverberation of the target direct-path signal is obtained by removing the result of applying the filter to the first estimate of the target direct-path signal from the received mixture. A second DNN produces a second estimate of the target direct-path signal from the mixture with reduced reverberation.
    Type: Grant
    Filed: March 10, 2022
    Date of Patent: October 17, 2023
    Assignee: Mitsubishi Electric Research Laboratories, Inc.
    Inventors: Zhong-Qiu Wang, Gordon Wichern, Jonathan Le Roux
  • Publication number: 20230326478
    Abstract: Embodiments of the present disclosure disclose a system and method for extraction of a target sound signal. The system collects a mixture of sound signals. The system selects a query identifying the target sound signal to be extracted from the mixture of sound signals, the query comprising one or more identifiers. Each identifier is present in a predetermined set of one or more identifiers and defines at least one of mutually inclusive and mutually exclusive characteristics of the mixture of sound signals. The system determines one or more logical operators connecting the extracted one or more identifiers. The system transforms the one or more identifiers and the extracted logical operators into a digital representation. The system executes a neural network trained to extract the target sound signal by mixing the digital representation with intermediate outputs of intermediate layers of the neural network.
    Type: Application
    Filed: October 9, 2022
    Publication date: October 12, 2023
    Inventors: Gordon Wichern, Efthymios Tzinis, Aswin Shanmugam Subramanian, Jonathan Le Roux
  • Publication number: 20230306980
    Abstract: A system and method for low-latency audio signal enhancement is provided. An input mixture of audio signals is partitioned into a sequence of overlapping frames by using a first sliding window method. The first sliding window method comprises a first window function having a first width associated with a window of the corresponding frame and a shift length associated with shifting of the window of the first sliding window method. Each frame is then processed using a first DNN, a frequency domain causal linear filter and a second DNN, to generate final enhanced overlapping frames for each of the processed frames. The final enhanced overlapping frames are then combined using a second sliding window method associated with a second window function having a second width less than the first width and the same shift length as the first sliding window method.
    Type: Application
    Filed: October 10, 2022
    Publication date: September 28, 2023
    Inventors: Zhong-Qiu Wang, Gordon Wichern, Jonathan Le Roux
  • Patent number: 11756551
    Abstract: An audio processing system is provided. The audio processing system comprises an input interface configured to accept an audio signal. Further, the audio processing system comprises a memory configured to store a neural network trained to determine different types of attributes of multiple concurrent audio events of different origins, wherein the types of attributes include time-dependent and time-agnostic attributes of speech and non-speech audio events. Further, the audio processing system comprises a processor configured to process the audio signal with the neural network to produce metadata of the audio signal, the metadata including one or multiple attributes of one or multiple audio events in the audio signal.
    Type: Grant
    Filed: October 7, 2020
    Date of Patent: September 12, 2023
    Assignee: Mitsubishi Electric Research Laboratories, Inc.
    Inventors: Niko Moritz, Gordon Wichern, Takaaki Hori, Jonathan Le Roux
  • Publication number: 20230283950
    Abstract: Embodiments of the present disclosure disclose a system and method for localization of a target sound event. The system collects a first digital representation of an acoustic mixture of sounds of a plurality of sound events, by using an acoustic sensor. The system receives a second digital representation of a sound corresponding to the target sound event. Further, the first digital representation and the second digital representation are processed by a neural network to produce a localization information indicative of a location of an origin of the target sound event with respect to a location of the acoustic sensor.
    Type: Application
    Filed: March 7, 2022
    Publication date: September 7, 2023
    Applicant: Mitsubishi Electric Research Laboratories, Inc.
    Inventors: Gordon Wichern, Olga Slizovskaia, Jonathan Le Roux
  • Publication number: 20230196149
    Abstract: A controller and a method for optimizing a controlled operation of a system performing a task are provided. The method for optimizing the controlled operation of the system comprises accessing a probabilistic distribution of a performance function trained to provide a relationship between different combinations of control parameters for controlling the system and their corresponding costs of operation, and selecting a combination of control parameters from the different combinations of control parameters, such that the selected combination of control parameters has the largest likelihood of being optimal at the probabilistic distribution of the performance function. The method further comprises controlling the system using the selected combination of the control parameters and modifying the probabilistic distribution of the performance function conditioned on the selected combination of the control parameters and the corresponding cost of operation.
    Type: Application
    Filed: December 10, 2021
    Publication date: June 22, 2023
    Applicant: Mitsubishi Electric Research Laboratories, Inc.
    Inventors: Ankush Chakrabarty, Sicheng Zhan, Christopher Laughman, Gordon Wichern
  • Publication number: 20230086355
    Abstract: A system and method for detecting anomalous sound are disclosed. The method includes receiving a spectrogram of an audio signal with elements defined by values in a time-frequency domain of the spectrogram. Each of the values corresponds to an element of the spectrogram that is identified by a coordinate in the time-frequency domain. The time-frequency domain of the spectrogram is partitioned into a context region and a target region. The context region and the target region are processed by a neural network using an attentive neural process to recover values of the spectrogram for elements with coordinates in the target region. The recovered values of the elements of the target region are compared with values of elements of the partitioned target region. An anomaly score is determined based on the comparison. The anomaly score is used for performing a control action.
    Type: Application
    Filed: September 19, 2021
    Publication date: March 23, 2023
    Inventors: Gordon Wichern, Ankush Chakrabarty, Zhong-Qiu Wang, Jonathan Le Roux
  • Patent number: 11579598
    Abstract: A system for controlling an operation of a machine including a plurality of actuators assisting one or multiple tools to perform one or multiple tasks, in response to receiving an acoustic mixture of signals generated by the tool performing a task and by the plurality of actuators actuating the tool, submit the acoustic mixture of signals into a neural network trained to separate from the acoustic mixture a signal generated by the tool performing the task from signals generated by the actuators actuating the tool to extract the signal generated by the tool performing the task from the acoustic mixture of signals, analyze the extracted signal to produce a state of performance of the task, and execute a control action selected according to the state of performance of the task.
    Type: Grant
    Filed: October 17, 2019
    Date of Patent: February 14, 2023
    Assignee: Mitsubishi Electric Research Laboratories, Inc.
    Inventors: Gordon Wichern, Jonathan Le Roux, Fatemeh Pishdadian
  • Publication number: 20230042468
    Abstract: A system and method for reverberation reduction is disclosed. A first Deep Neural Network (DNN) produces a first estimate of a target direct-path signal from a mixture of acoustic signals that include the target direct-path signal and a reverberation of the target direct-path signal. A filter modeling a room impulse response (RIR) for the first estimate is estimated. The filter when applied to the first estimate of the target direct-path signal generates a result closest to a residual between the mixture of the acoustic signals and the first estimate of the target direct-path signal according to a distance function. A mixture with reduced reverberation of the target direct-path signal is obtained by removing the result of applying the filter to the first estimate of the target direct-path signal from the received mixture. A second DNN produces a second estimate of the target direct-path signal from the mixture with reduced reverberation.
    Type: Application
    Filed: March 10, 2022
    Publication date: February 9, 2023
    Applicant: Mitsubishi Electric Research Laboratories, Inc.
    Inventors: Zhong-Qiu Wang, Gordon Wichern, Jonathan Le Roux
  • Publication number: 20220335179
    Abstract: A system and a method for calibrating a model of thermal dynamics of thermal state in an environment of a building conditioned by an operation of a heating, ventilating, and air-conditioning (HVAC) system is provided. The method includes receiving values of the control inputs to the actuators of the HVAC system and values of the thermal state at locations of the environment caused by the operation of the HVAC system according to the values of the control inputs, and computing a probabilistic surrogate model iteratively, using a Bayesian optimization, until a termination condition is met. The method further comprises outputting, when the termination condition is met, an optimal combination of the different parameters of the model of thermal dynamics having the largest likelihood of being a global minimum at the probabilistic surrogate model according to an acquisition function of the first two order moments of the calibration errors.
    Type: Application
    Filed: April 7, 2021
    Publication date: October 20, 2022
    Applicant: Mitsubishi Electric Research Laboratories, Inc.
    Inventors: Ankush Chakrabarty, Christopher Laughman, Gordon Wichern, Emilio Maddalena
  • Patent number: 11475908
    Abstract: The audio processing system includes a memory to store a neural network trained to process an audio mixture to output estimation of at least a subset of a set of audio sources present in the audio mixture. The audio sources are subject to hierarchical constraints enforcing a parent-children hierarchy on the set of audio sources, such that a parent audio source includes a mixture of its one or multiple children audio sources. The subset includes a parent audio source and at least one of its children audio sources. The system further comprises a processor to process a received input audio mixture using the neural network to estimate the subset of audio sources and their mutual relationships according to the parent-children hierarchy. The system further includes an output interface configured to render the extracted audio sources and their mutual relationships.
    Type: Grant
    Filed: October 7, 2020
    Date of Patent: October 18, 2022
    Assignee: Mitsubishi Electric Research Laboratories, Inc.
    Inventors: Gordon Wichern, Jonathan Le Roux, Ethan Manilow
  • Patent number: 11469731
    Abstract: Some embodiments of the invention are directed to enabling a user to easily identify the frequency range(s) at which sound masking occurs, and addressing the masking, if desired. In this respect, the extent to which a first stem is masked by one or more second stems in a frequency range may depend not only on the absolute value of the energy of the second stem(s) in the frequency range, but also on the relative energy of the first stem with respect to the second stem(s) in the frequency range. Accordingly, some embodiments are directed to modeling sound masking as a function of the energy of the stem being masked and of the relative energy of the masked stem with respect to the masking stem(s) in the frequency range, such as by modeling sound masking as loudness loss, a value indicative of the reduction in loudness of a stem of interest caused by the presence of one or more other stems in a frequency range.
    Type: Grant
    Filed: March 5, 2021
    Date of Patent: October 11, 2022
    Assignee: iZotope, Inc.
    Inventors: James McClellan, Gordon Wichern, Hannah Robertson, Aaron Wishnick, Alexey Lukin, Matthew Hines, Nicholas LaPenn
  • Publication number: 20220108698
    Abstract: An audio processing system is provided. The audio processing system comprises an input interface configured to accept an audio signal. Further, the audio processing system comprises a memory configured to store a neural network trained to determine different types of attributes of multiple concurrent audio events of different origins, wherein the types of attributes include time-dependent and time-agnostic attributes of speech and non-speech audio events. Further, the audio processing system comprises a processor configured to process the audio signal with the neural network to produce metadata of the audio signal, the metadata including one or multiple attributes of one or multiple audio events in the audio signal.
    Type: Application
    Filed: October 7, 2020
    Publication date: April 7, 2022
    Applicant: Mitsubishi Electric Research Laboratories, Inc.
    Inventors: Niko Moritz, Gordon Wichern, Takaaki Hori, Jonathan Le Roux
  • Publication number: 20220101869
    Abstract: The audio processing system includes a memory to store a neural network trained to process an audio mixture to output estimation of at least a subset of a set of audio sources present in the audio mixture. The audio sources are subject to hierarchical constraints enforcing a parent-children hierarchy on the set of audio sources, such that a parent audio source includes a mixture of its one or multiple children audio sources. The subset includes a parent audio source and at least one of its children audio sources. The system further comprises a processor to process a received input audio mixture using the neural network to estimate the subset of audio sources and their mutual relationships according to the parent-children hierarchy. The system further includes an output interface configured to render the extracted audio sources and their mutual relationships.
    Type: Application
    Filed: October 7, 2020
    Publication date: March 31, 2022
    Applicant: Mitsubishi Electric Research Laboratories, Inc.
    Inventors: Gordon Wichern, Jonathan Le Roux, Ethan Manilow
  • Publication number: 20210234524
    Abstract: Some embodiments of the invention are directed to enabling a user to easily identify the frequency range(s) at which sound masking occurs, and addressing the masking, if desired. In this respect, the extent to which a first stem is masked by one or more second stems in a frequency range may depend not only on the absolute value of the energy of the second stem(s) in the frequency range, but also on the relative energy of the first stem with respect to the second stem(s) in the frequency range. Accordingly, some embodiments are directed to modeling sound masking as a function of the energy of the stem being masked and of the relative energy of the masked stem with respect to the masking stem(s) in the frequency range, such as by modeling sound masking as loudness loss, a value indicative of the reduction in loudness of a stem of interest caused by the presence of one or more other stems in a frequency range.
    Type: Application
    Filed: March 5, 2021
    Publication date: July 29, 2021
    Applicant: iZotope, Inc.
    Inventors: James McClellan, Gordon Wichern, Hannah Robertson, Aaron Wishnick, Alexey Lukin, Matthew Hines, Nicholas LaPenn
  • Publication number: 20210116894
    Abstract: A system for controlling an operation of a machine including a plurality of actuators assisting one or multiple tools to perform one or multiple tasks, in response to receiving an acoustic mixture of signals generated by the tool performing a task and by the plurality of actuators actuating the tool, submit the acoustic mixture of signals into a neural network trained to separate from the acoustic mixture a signal generated by the tool performing the task from signals generated by the actuators actuating the tool to extract the signal generated by the tool performing the task from the acoustic mixture of signals, analyze the extracted signal to produce a state of performance of the task, and execute a control action selected according to the state of performance of the task.
    Type: Application
    Filed: October 17, 2019
    Publication date: April 22, 2021
    Applicant: Mitsubishi Electric Research Laboratories, Inc.
    Inventors: Gordon Wichern, Jonathan Le Roux, Fatemeh Pishdadian
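
The anomalous-sound abstracts listed above (patent 11978476 and publication 20230086355) describe partitioning a spectrogram into a context region and a target region, recovering the target values from the context, and scoring the difference. The following is only a minimal sketch of that comparison step, not the patented method. The abstracts do not say how the comparison is computed, so the mean squared error used here is an assumption. The function names `anomaly_score` and `mean_of_context` are hypothetical, and `mean_of_context` is a trivial stand-in for the neural network and attentive neural process named in the abstract.

```python
from statistics import mean


def anomaly_score(spectrogram, target_coords, recover):
    """Mean squared error between recovered and observed target values.

    Assumption: the abstract only says the values are "compared";
    mean squared error is one simple way to do that.
    """
    target = set(target_coords)
    # Context region: every time-frequency coordinate outside the target region.
    context = {
        (t, f): value
        for t, row in enumerate(spectrogram)
        for f, value in enumerate(row)
        if (t, f) not in target
    }
    # Recover target values from the context only.
    recovered = recover(context, target_coords)
    errors = [
        (recovered[(t, f)] - spectrogram[t][f]) ** 2
        for (t, f) in target_coords
    ]
    return mean(errors)


def mean_of_context(context, target_coords):
    """Hypothetical stand-in for the neural network: predict the context mean."""
    fill = mean(context.values())
    return {coord: fill for coord in target_coords}


# Toy 2x3 "spectrograms" indexed as [time][frequency].
normal = [[1.0, 1.0, 1.0], [1.0, 1.0, 1.0]]
spiky = [[1.0, 1.0, 1.0], [1.0, 9.0, 1.0]]
target = [(1, 1)]

normal_score = anomaly_score(normal, target, mean_of_context)
spiky_score = anomaly_score(spiky, target, mean_of_context)
```

In this toy setup the unexpected value at coordinate (1, 1) yields a larger score than the uniform spectrogram. A real system would threshold such a score to decide on the control action mentioned in the abstract.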