Patents by Inventor Patrick A. NAYLOR
Patrick A. NAYLOR has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12165668Abstract: A method of performing at least de-reverberation and noise-reduction of an input sound signal of at least one input channel includes: performing, using at least one filter element, at least one of de-reverberation and noise-reduction of the input sound signal to generate a clean output sound signal; and determining, by a non-intrusive measure (NIM) estimation element, at least one non-intrusive measure (NIM) from the sound signal, wherein the at least one NIM includes at least one of voice activity detection (VAD) posterior, reverberation time, clarity index, direct-to-reverberant ratio (DRR), and signal-to-noise ratio (SNR); the de-reverberation is achieved by applying at least one channel shortening (CS) filter component of the at least one filter element in conjunction with the at least one NIM; and the noise reduction is performed in combination with the de-reverberation by the channel shortening (CS) filter component.Type: GrantFiled: February 18, 2022Date of Patent: December 10, 2024Assignee: Microsoft Technology Licensing, LLCInventors: Dushyant Sharma, James Fosburgh, Patrick Naylor
-
Patent number: 12154541Abstract: A method, computer program product, and computing system for receiving feature-based voice data associated with a first acoustic domain. One or more reverberation-based augmentations may be performed on at least a portion of the feature-based voice data, thus defining reverberation-augmented feature-based voice data.Type: GrantFiled: March 10, 2021Date of Patent: November 26, 2024Assignee: Microsoft Technology Licensing, LLCInventors: Dushyant Sharma, Patrick A. Naylor, James W. Fosburgh, Do Yeong Kim
-
Patent number: 12149914Abstract: A method, computer program product, and computing system for obtaining machine vision encounter information using one or more machine vision systems. Audio encounter information may be obtained using a plurality of audio acquisition devices of an audio recording system. The audio encounter information may be encoded using an audio codec. The encoding of the audio encounter information by the audio codec may be adapted based upon, at least in part, the machine vision encounter information.Type: GrantFiled: February 11, 2022Date of Patent: November 19, 2024Assignee: Microsoft Technology Licensing, LLCInventors: Dushyant Sharma, Patrick A. Naylor, Uwe Helmut Jost
-
Patent number: 12142293Abstract: There is provided a speech processing system that includes a neural encoder module. A processor that receives an audio signal; and the memory that contains instructions that control said processor to perform operations that process speech. In an implementation, a front end module can include a Neural Spatial RTF Estimator and a neural spatial and residual encoder (NSRE) configured accept as inputs a spectral encoded reference channel stream to output Neural Transfer Functions (NTFs). In another implementation, a front end module encodes and outputs a Ch1 bitstream; computes a plurality of relative transfer functions (RTFs) for an N-Channel signal and outputs an N?1 RTFs or an RTF codebook ids and computes and processes an N?1 residual stream; and a back end module comprising a neural encoder module configured to accept the RTFs and output an encoded speech signal comprising an embedding that comprises features extracted from RTFs.Type: GrantFiled: June 30, 2022Date of Patent: November 12, 2024Assignee: Microsoft Technology Licensing, LLC.Inventors: Dushyant Sharma, Patrick Naylor, Daniel T. Jones
-
Patent number: 12143798Abstract: A method, computer program product, and computing system for generating a plurality of acoustic relative transfer functions associated with a plurality of audio acquisition devices of an audio recording system deployed in an acoustic environment. At least a pair of the plurality of acoustic relative transfer functions from time frames may be compared. A change in the acoustic environment may be detected based upon, at least in part, the comparison of the plurality of acoustic relative transfer functions from at least the pair of time frames.Type: GrantFiled: February 11, 2022Date of Patent: November 12, 2024Assignee: Microsoft Technology Licensing, LLCInventors: Dushyant Sharma, Patrick A. Naylor, Uwe Helmut Jost
-
Patent number: 12112741Abstract: A method, computer program product, and computing system for defining a model representative of a plurality of acoustic variations to a speech signal, thus defining a plurality of time-varying spectral modifications. The plurality of time-varying spectral modifications may be applied to a reference signal using a filtering operation, thus generating a time-varying spectrally-augmented signal.Type: GrantFiled: February 18, 2021Date of Patent: October 8, 2024Assignee: Microsoft Technology Licensing, LLCInventors: Patrick A. Naylor, Dushyant Sharma, Uwe Helmut Jost, William F Ganong, III
-
Patent number: 12114147Abstract: A method, computer program product, and computing system for generating a plurality of acoustic relative transfer functions between a plurality of audio acquisition devices of an audio recording system based upon, at least in part, one or more of a predefined speech processing application and a predefined acoustic environment. An acoustic relative transfer function codebook may be generated using the plurality of acoustic relative transfer functions. One or more channels from the plurality of audio acquisition devices of the audio recording system may be encoded using the acoustic relative transfer function codebook.Type: GrantFiled: February 11, 2022Date of Patent: October 8, 2024Assignee: Microsoft Technology Licensing, LLCInventors: Dushyant Sharma, Patrick A. Naylor, Uwe Helmut Jost
-
Publication number: 20240330495Abstract: A method, computer program product, and computing system for receiving input content that may include sensitive information for processing by a machine learning model; processing the input content with the machine learning model to generate output content; processing the output content to determine if the output content includes any sensitive information; and if the output content includes any sensitive information, scrutinizing the input content to determine if the input content supports the inclusion of the sensitive information in the output content.Type: ApplicationFiled: March 27, 2023Publication date: October 3, 2024Inventors: Uwe Helmut Jost, Patrick A. Naylor, Dushyant Sharma, Ljubomir Milanovic, William F. Ganong, III
-
Publication number: 20240323630Abstract: A method, computer program product, and computing system for encoding audio encounter information of a reference audio acquisition device of a plurality of audio acquisition devices of an audio recording system, thus defining encoded reference audio encounter information. Location information may be estimated, via a machine vision system, for an acoustic source within an acoustic environment. One or more acoustic relative transfer functions may be selected from a plurality of acoustic relative transfer functions for the plurality of audio acquisition devices of the audio recording system based upon, at least in part, the location information. The encoded reference audio encounter information and a representation of the selected one or more acoustic relative transfer function may be transmitted.Type: ApplicationFiled: May 28, 2024Publication date: September 26, 2024Inventors: Dushyant Sharma, Patrick A. Naylor, Uwe Helmut Jost
-
Patent number: 12073818Abstract: A method, computer program product, and computing system for receiving feature-based voice data. One or more data augmentation characteristics may be received. One or more augmentations of the feature-based voice data may be generated, via a machine learning model, based upon, at least in part, the feature-based voice data and the one or more data augmentation characteristics.Type: GrantFiled: March 10, 2021Date of Patent: August 27, 2024Assignee: Microsoft Technology Licensing, LLCInventors: Dushyant Sharma, Patrick A. Naylor, James W. Fosburgh, Do Yeong Kim
-
Patent number: 12062016Abstract: A method, computer program product, and computing system for obtaining encounter information of a patient encounter, wherein the encounter information includes machine vision encounter information; and processing the encounter information to generate an encounter transcript.Type: GrantFiled: February 23, 2022Date of Patent: August 13, 2024Assignee: Microsoft Technology Licensing, LLCInventors: Daniel Paulino Almendro Barreda, Dushyant Sharma, Joel Praveen Pinto, Uwe Helmut Jost, Patrick A. Naylor
-
Patent number: 12014722Abstract: A method, computer program product, and computing system for receiving feature-based voice data associated with a first acoustic domain. One or more gain-based augmentations may be performed on at least a portion of the feature-based voice data, thus defining gain-augmented feature-based voice data.Type: GrantFiled: March 10, 2021Date of Patent: June 18, 2024Assignee: Microsoft Technology Licensing, LLCInventors: Dushyant Sharma, Patrick A. Naylor, James W. Fosburgh
-
Patent number: 11997469Abstract: A method, computer program product, and computing system for encoding audio encounter information of a reference audio acquisition device of a plurality of audio acquisition devices of an audio recording system, thus defining encoded reference audio encounter information. Location information may be estimated, via a machine vision system, for an acoustic source within an acoustic environment. One or more acoustic relative transfer functions may be selected from a plurality of acoustic relative transfer functions for the plurality of audio acquisition devices of the audio recording system based upon, at least in part, the location information. The encoded reference audio encounter information and a representation of the selected one or more acoustic relative transfer function may be transmitted.Type: GrantFiled: February 11, 2022Date of Patent: May 28, 2024Assignee: Microsoft Technology Licensing, LLCInventors: Dushyant Sharma, Patrick A. Naylor, Uwe Helmut Jost
-
Patent number: 11967305Abstract: A method, computer program product, and computing system for generating a three-dimensional model of at least a portion of a three-dimensional space incorporating an ACI system via a video recording subsystem of an ACI calibration platform; and generating one or more audio calibration signals for receipt by an audio recording system included within the ACI system via an audio generation subsystem of the ACI calibration platform.Type: GrantFiled: June 15, 2022Date of Patent: April 23, 2024Assignee: Microsoft Technology Licensing, LLCInventors: Dushyant Sharma, Patrick A. Naylor, Joel Praveen Pinto, Daniel Paulino Almendro Barreda
-
Patent number: 11961504Abstract: A method, computer program product, and computing system for receiving feature-based voice data associated with a first acoustic domain. One or more rate-based augmentations may be performed on at least a portion of the feature-based voice data, thus defining rate-based augmented feature-based voice data.Type: GrantFiled: March 10, 2021Date of Patent: April 16, 2024Assignee: Microsoft Technology Licensing, LLCInventors: Dushyant Sharma, Patrick A. Naylor
-
Patent number: 11950081Abstract: A method, computer program product, and computing system for generating a plurality of acoustic relative transfer functions for a plurality of audio acquisition devices of an audio recording system deployed in an acoustic environment. The plurality of acoustic relative transfer functions may be encoded into a first embedding of acoustic relative transfer functions and at least a second embedding of acoustic relative transfer functions. Information may be extracted from at least the first embedding of acoustic relative transfer functions.Type: GrantFiled: February 11, 2022Date of Patent: April 2, 2024Assignee: MICROSOFT TECHNOLOGY LICENSING, LLCInventors: Dushyant Sharma, Patrick A. Naylor, Uwe Helmut Jost
-
Patent number: 11924624Abstract: A method, computer program product, and computing system for selecting a reference audio acquisition device from a plurality of audio acquisition devices of an audio recording system. Audio encounter information of the reference microphone may be encoded, thus defining encoded reference audio encounter information. A plurality of acoustic relative transfer functions between the reference microphone and the plurality of audio acquisition devices of the audio recording system may be generated. The encoded reference audio encounter information and a representation of the plurality of acoustic relative transfer functions may be transmitted.Type: GrantFiled: February 11, 2022Date of Patent: March 5, 2024Assignee: Microsoft Technology Licensing, LLCInventors: Dushyant Sharma, Patrick A. Naylor, Uwe Helmut Jost
-
Publication number: 20240005946Abstract: There is provided a speech processing system that includes a neural encoder module. A processor that receives an audio signal; and the memory that contains instructions that control said processor to perform operations that process speech. In an implementation, a front end module can include a Neural Spatial RTF Estimator and a neural spatial and residual encoder (NSRE) configured accept as inputs a spectral encoded reference channel stream to output Neural Transfer Functions (NTFs). In another implementation, a front end module encodes and outputs a Ch1 bitstream; computes a plurality of relative transfer functions (RTFs) for an N-Channel signal and outputs an N?1 RTFs or an RTF codebook ids and computes and processes an N?1 residual stream; and a back end module comprising a neural encoder module configured to accept the RTFs and output an encoded speech signal comprising an embedding that comprises features extracted from RTFs.Type: ApplicationFiled: June 30, 2022Publication date: January 4, 2024Applicant: NUANCE COMMUNICATIONS, INC.Inventors: Dushyant SHARMA, Patrick NAYLOR, Daniel T. JONES
-
Patent number: 11837228Abstract: A method, computer program product, and computing system for receiving a speech signal from each microphone of a plurality of microphones, thus defining a plurality of signals. One or more noise signals associated with microphone self-noise may be received. One or more self-noise-based augmentations may be performed on the plurality of signals based upon, at least in part, the one or more noise signals associated with microphone self-noise, thus defining one or more self-noise-based augmented signals.Type: GrantFiled: May 7, 2021Date of Patent: December 5, 2023Assignee: Nuance Communications, Inc.Inventors: Dushyant Sharma, Patrick A. Naylor, Rong Gong, Stanislav Kruchinin, Ljubomir Milanovic
-
Patent number: 11835625Abstract: A method of performing distance estimation between a first recording device at a first location and a second recording device at a second location includes: estimating acoustic relative transfer function (RTF) between the first recording device and the second recording device for a sound signal, e.g., by applying an improved proportionate normalized least mean square (IPNLMS) filter; and estimating the distance between the first recording device and the second recording device based on the RTF. The at least one acoustic feature extracted from the RTF estimated between the first recording device and the second recording device includes at least one of clarity index, direct-to-reverberant ratio (DRR), and reverberation time. A distributed-gradient-boosting algorithm with regression trees is used in combination with signal-to-reverberation ratio (SRR) and the at least one acoustic feature extracted from the RTF to estimate the distance between the first recording device and the second recording device.Type: GrantFiled: March 15, 2022Date of Patent: December 5, 2023Assignee: Microsoft Technology Licensing, LLC.Inventors: Francesco Nespoli, Patrick Naylor, Daniel Barreda