Patents by Inventor Hadis Nosrati

Hadis Nosrati has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20250061914
    Abstract: An apparatus and method of pre-conditioning audio for machine perception. Machine perception differs from human perception, and different processing parameters are used for machine perception applications (e.g., speech to text processing) as compared to those used for human perception applications (e.g., voice communications). These different parameters may result in pre-conditioned audio that is worsened for human perception yet improved for machine perception.
    Type: Application
    Filed: August 30, 2024
    Publication date: February 20, 2025
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Hadis Nosrati, Glenn N. Dickins, Nicholas Luke Appleton
  • Publication number: 20250006170
    Abstract: Some disclosed methods involve receiving microphone signals from a microphone system, including signals corresponding to one or more sounds detected by the microphone system. Some methods may involve determining, via a trained neural network, a filtering scheme for the microphone signals, the filtering scheme including one or more filtering processes. The trained neural network may be configured to implement one or more subband-domain adaptive filter management modules. Some methods may involve applying the filtering scheme to the microphone signals, to produce enhanced microphone signals.
    Type: Application
    Filed: November 1, 2022
    Publication date: January 2, 2025
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Hadis Nosrati, Benjamin John Southwell
  • Publication number: 20250006208
    Abstract: Some disclosed methods involve receiving audio data of at least a first audio data type and a second audio data type, including audio signals and associated spatial data indicating intended perceived spatial positions for the audio signals, determining at least a first feature type from the audio data and applying a positional encoding process to the audio data, to produce encoded audio data. The encoded audio data may include representations of at least the spatial data and the first feature type in first embedding vectors of an embedding dimension. Some methods may involve training a neural network, based on the encoded audio data, to transform audio data from an input audio data type having an input spatial data type to a transformed audio data type having a transformed spatial data type. Some methods may involve training a neural network to identify an input audio data type.
    Type: Application
    Filed: November 3, 2022
    Publication date: January 2, 2025
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Brenton James Potter, Hadis Nosrati
  • Patent number: 12080317
    Abstract: An apparatus and method of pre-conditioning audio for machine perception. Machine perception differs from human perception, and different processing parameters are used for machine perception applications (e.g., speech to text processing) as compared to those used for human perception applications (e.g., voice communications). These different parameters may result in pre-conditioned audio that is worsened for human perception yet improved for machine perception.
    Type: Grant
    Filed: August 27, 2020
    Date of Patent: September 3, 2024
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Hadis Nosrati, Glenn N. Dickins, Nicholas Luke Appleton
  • Publication number: 20220319532
    Abstract: An apparatus and method of pre-conditioning audio for machine perception. Machine perception differs from human perception, and different processing parameters are used for machine perception applications (e.g., speech to text processing) as compared to those used for human perception applications (e.g., voice communications). These different parameters may result in pre-conditioned audio that is worsened for human perception yet improved for machine perception.
    Type: Application
    Filed: August 27, 2020
    Publication date: October 6, 2022
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Hadis Nosrati, Glenn N. Dickins, Nicholas Luke Appleton
  • Patent number: 10872602
    Abstract: Computer-implemented methods for training an acoustic model for a far-field utterance processing system are provided. The acoustic model may be configured to map an input audio signal into linguistic or paralinguistic units. The training may involve imparting far-field acoustic characteristics upon near-field training vectors that include a plurality of near-microphone utterance signals. Imparting the far-field acoustic characteristics may involve generating a plurality of simulated room impulse responses, convolving one or more of the simulated room impulse responses with the near-field training vectors, to produce a plurality of simulated far-field utterance signals and saving the results of the training in one or more non-transitory memory devices corresponding with the acoustic model. Generating simulated room impulse responses may involve simulating room reverberation times but not simulating early reflections from room surfaces.
    Type: Grant
    Filed: May 2, 2019
    Date of Patent: December 22, 2020
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Hadis Nosrati, David S. McGrath, Richard J. Cartwright
  • Publication number: 20190362711
    Abstract: Computer-implemented methods for training an acoustic model for a far-field utterance processing system are provided. The acoustic model may be configured to map an input audio signal into linguistic or paralinguistic units. The training may involve imparting far-field acoustic characteristics upon near-field training vectors that include a plurality of near-microphone utterance signals. Imparting the far-field acoustic characteristics may involve generating a plurality of simulated room impulse responses, convolving one or more of the simulated room impulse responses with the near-field training vectors, to produce a plurality of simulated far-field utterance signals and saving the results of the training in one or more non-transitory memory devices corresponding with the acoustic model. Generating simulated room impulse responses may involve simulating room reverberation times but not simulating early reflections from room surfaces.
    Type: Application
    Filed: May 2, 2019
    Publication date: November 28, 2019
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Hadis Nosrati, David S. McGrath, Richard J. Cartwright
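
The far-field training abstracts above (patent 10872602 and publication 20190362711) describe convolving near-microphone utterances with simulated room impulse responses that model reverberation time but not early reflections. The following is a minimal illustrative sketch of that general idea, not the patented method. It assumes a common textbook model in which the impulse response is Gaussian noise under an exponential decay envelope. The function names `simulated_rir` and `simulate_far_field`, the 16 kHz sample rate, and the unit-energy normalization are all assumptions made for illustration.

```python
import numpy as np


def simulated_rir(rt60_s, sample_rate=16000, rng=None):
    """Return a noise-based impulse response with a given reverberation time.

    Assumed model (not taken from the patent): Gaussian noise shaped by an
    exponential envelope whose energy falls by 60 dB after rt60_s seconds.
    No discrete early reflections are modeled.
    """
    rng = np.random.default_rng() if rng is None else rng
    n = int(rt60_s * sample_rate)
    t = np.arange(n) / sample_rate
    # A 60 dB energy drop at t = rt60_s is a factor of 10**-3 in amplitude.
    envelope = 10.0 ** (-3.0 * t / rt60_s)
    rir = rng.standard_normal(n) * envelope
    # Normalize to unit energy so the output level stays comparable.
    return rir / np.sqrt(np.sum(rir ** 2))


def simulate_far_field(near_field, rir):
    """Convolve a near-microphone signal with an impulse response."""
    return np.convolve(near_field, rir, mode="full")
```

In a training pipeline of this kind, one would typically draw a range of `rt60_s` values, generate one impulse response per value, and convolve each near-field utterance with one or more of them to obtain simulated far-field examples.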