Patents by Inventor Turaj Zakizadeh SHABESTARY

Turaj Zakizadeh SHABESTARY has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Methods And Systems For Detecting And Processing Speech Signals

Publication number: 20230169979

Abstract: Provided are methods, systems, and apparatuses for detecting, processing, and responding to audio signals, including speech signals, within a designated area or space. A platform for multiple media devices connected via a network is configured to process speech, such as voice commands, detected at the media devices, and respond to the detected speech by causing the media devices to simultaneously perform one or more requested actions. The platform is capable of scoring the quality of a speech request, handling speech requests from multiple end points of the platform using a centralized processing approach, a de-centralized processing approach, or a combination thereof, and also manipulating partial processing of speech requests from multiple end points into a coherent whole when necessary.

Type: Application

Filed: January 24, 2023

Publication date: June 1, 2023

Applicant: Google LLC

Inventors: Jay Pierre Civelli, Mikhal Shemer, Turaj Zakizadeh Shabestary, David Tapuska
STFT-Based Echo Muter

Publication number: 20230079828

Abstract: A method for Short-Time Fourier Transform-based echo muting includes receiving a microphone signal including acoustic echo captured by a microphone and corresponding to audio content from an acoustic speaker, and receiving a reference signal including a sequence of frames representing the audio content. For each frame in the sequence of frames, the method includes processing the respective frame using an acoustic echo canceler configured to receive the respective frame as input and generate a respective output signal frame that cancels the acoustic echo from the respective frame, and determining, using a Double-talk Detector (DTD), based on the respective frame and the respective output signal frame, whether the respective frame includes a double-talk frame or an echo-only frame. For each respective frame that includes the echo-only frame, the method includes muting the respective output signal frame; for each respective frame that includes the double-talk frame, the method includes performing speech processing on the respective output signal frame.

Type: Application

Filed: December 11, 2021

Publication date: March 16, 2023

Applicant: Google LLC

Inventors: Turaj Zakizadeh Shabestary, Arun Narayanan
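
The per-frame muting decision described in the abstract above can be sketched roughly as follows. This is a minimal illustration, not the method claimed in the application: the energy-ratio double-talk detector, the `threshold` value, and every function name are assumptions introduced here, and the acoustic echo canceler is assumed to have already produced the output frames.

```python
from typing import List, Sequence


def frame_energy(frame: Sequence[float]) -> float:
    """Sum of squared samples in one frame."""
    return sum(x * x for x in frame)


def is_double_talk(mic_frame: Sequence[float],
                   aec_out_frame: Sequence[float],
                   threshold: float = 0.1) -> bool:
    """Hypothetical detector (an assumption, not the DTD in the application).

    If a large share of the microphone energy survives echo cancellation,
    near-end speech is probably present, so the frame is treated as
    double-talk. Otherwise it is treated as echo-only.
    """
    mic_energy = frame_energy(mic_frame)
    if mic_energy == 0.0:
        return False
    return frame_energy(aec_out_frame) / mic_energy > threshold


def mute_echo_only(mic_frames: Sequence[Sequence[float]],
                   aec_out_frames: Sequence[Sequence[float]],
                   threshold: float = 0.1) -> List[List[float]]:
    """Zero out echo-only output frames and pass double-talk frames through."""
    result = []
    for mic, out in zip(mic_frames, aec_out_frames):
        if is_double_talk(mic, out, threshold):
            result.append(list(out))          # keep for speech processing
        else:
            result.append([0.0] * len(out))   # mute residual echo
    return result
```

Only the frames that pass through unmuted would then be handed to downstream speech processing.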
Methods and systems for detecting and processing speech signals

Patent number: 11568874

Abstract: Provided are methods, systems, and apparatuses for detecting, processing, and responding to audio signals, including speech signals, within a designated area or space. A platform for multiple media devices connected via a network is configured to process speech, such as voice commands, detected at the media devices, and respond to the detected speech by causing the media devices to simultaneously perform one or more requested actions. The platform is capable of scoring the quality of a speech request, handling speech requests from multiple end points of the platform using a centralized processing approach, a de-centralized processing approach, or a combination thereof, and also manipulating partial processing of speech requests from multiple end points into a coherent whole when necessary.

Type: Grant

Filed: December 8, 2020

Date of Patent: January 31, 2023

Assignee: Google LLC

Inventors: Jay Pierre Civelli, Mikhal Shemer, Turaj Zakizadeh Shabestary, David Tapuska
SELECTIVE ADAPTATION AND UTILIZATION OF NOISE REDUCTION TECHNIQUE IN INVOCATION PHRASE DETECTION

Publication number: 20220392441

Abstract: Techniques are described for selectively adapting and/or selectively utilizing a noise reduction technique in detection of one or more features of a stream of audio data frames. For example, various techniques are directed to selectively adapting and/or utilizing a noise reduction technique in detection of an invocation phrase in a stream of audio data frames, detection of voice characteristics in a stream of audio data frames (e.g., for speaker identification), etc. Utilization of described techniques can result in more robust and/or more accurate detections of features of a stream of audio data frames in various situations, such as in environments with strong background noise. In various implementations, described techniques are implemented in combination with an automated assistant, and feature(s) detected utilizing techniques described herein are utilized to adapt the functionality of the automated assistant.

Type: Application

Filed: August 12, 2022

Publication date: December 8, 2022

Inventors: Christopher Hughes, Yiteng Huang, Turaj Zakizadeh Shabestary, Taylor Applebaum
Selective adaptation and utilization of noise reduction technique in invocation phrase detection

Patent number: 11417324

Abstract: Techniques are described for selectively adapting and/or selectively utilizing a noise reduction technique in detection of one or more features of a stream of audio data frames. For example, various techniques are directed to selectively adapting and/or utilizing a noise reduction technique in detection of an invocation phrase in a stream of audio data frames, detection of voice characteristics in a stream of audio data frames (e.g., for speaker identification), etc. Utilization of described techniques can result in more robust and/or more accurate detections of features of a stream of audio data frames in various situations, such as in environments with strong background noise. In various implementations, described techniques are implemented in combination with an automated assistant, and feature(s) detected utilizing techniques described herein are utilized to adapt the functionality of the automated assistant.

Type: Grant

Filed: May 28, 2020

Date of Patent: August 16, 2022

Assignee: GOOGLE LLC

Inventors: Christopher Hughes, Yiteng Huang, Turaj Zakizadeh Shabestary, Taylor Applebaum
Multi-channel echo cancellation with scenario memory

Patent number: 11417351

Abstract: According to an aspect, a method for multi-channel echo cancellation includes receiving a microphone signal and a multi-channel loudspeaker driving signal. The multi-channel loudspeaker driving signal includes a first driving signal that drives a first loudspeaker, and a second driving signal that drives a second loudspeaker. The first driving signal is substantially the same as the second driving signal. The microphone signal includes a near-end signal with echo. The method includes determining a unique solution for acoustic transfer functions for a present acoustic scenario based on the microphone signal and the multi-channel loudspeaker driving signal. The acoustic transfer functions include first and second acoustic transfer functions. The unique solution is determined based on time-frequency transforms of observations from the present acoustic scenario and at least one previous acoustic scenario.

Type: Grant

Filed: June 26, 2018

Date of Patent: August 16, 2022

Assignee: GOOGLE LLC

Inventors: Willem Bastiaan Kleijn, Turaj Zakizadeh Shabestary
Methods And Systems For Detecting And Processing Speech Signals

Publication number: 20210090574

Abstract: Provided are methods, systems, and apparatuses for detecting, processing, and responding to audio signals, including speech signals, within a designated area or space. A platform for multiple media devices connected via a network is configured to process speech, such as voice commands, detected at the media devices, and respond to the detected speech by causing the media devices to simultaneously perform one or more requested actions. The platform is capable of scoring the quality of a speech request, handling speech requests from multiple end points of the platform using a centralized processing approach, a de-centralized processing approach, or a combination thereof, and also manipulating partial processing of speech requests from multiple end points into a coherent whole when necessary.

Type: Application

Filed: December 8, 2020

Publication date: March 25, 2021

Applicant: Google LLC

Inventors: Jay Pierre Civelli, Mikhal Shemer, Turaj Zakizadeh Shabestary, David Tapuska
Methods and systems for detecting and processing speech signals

Patent number: 10878820

Abstract: Provided are methods, systems, and apparatuses for detecting, processing, and responding to audio signals, including speech signals, within a designated area or space. A platform for multiple media devices connected via a network is configured to process speech, such as voice commands, detected at the media devices, and respond to the detected speech by causing the media devices to simultaneously perform one or more requested actions. The platform is capable of scoring the quality of a speech request, handling speech requests from multiple end points of the platform using a centralized processing approach, a de-centralized processing approach, or a combination thereof, and also manipulating partial processing of speech requests from multiple end points into a coherent whole when necessary.

Type: Grant

Filed: February 20, 2019

Date of Patent: December 29, 2020

Assignee: Google LLC

Inventors: Jay Pierre Civelli, Mikhal Shemer, Turaj Zakizadeh Shabestary, David Tapuska
Echo cancellation for keyword spotting

Patent number: 10861479

Abstract: Techniques of performing linear acoustic echo cancellation include performing a phase correction operation on the estimate of the echo signal based on a clock drift between a capture of an input microphone signal and a playout of a loudspeaker signal. Along these lines, the existence of the clock drift, i.e., a small difference in the sampling rates of the input microphone signal and the loudspeaker signal, can cause processing circuitry in a device configured to perform LAEC operations to generate a filter based on the magnitudes of the short-term Fourier transforms (STFTs) of the input microphone signal and the loudspeaker signal. Such a filter is real-valued and results in a positive estimate of the acoustic echo signal included in the input microphone signal. The phase of this estimate may then be aligned with the phase of the input microphone signal.

Type: Grant

Filed: October 10, 2019

Date of Patent: December 8, 2020

Assignee: GOOGLE LLC

Inventors: Turaj Zakizadeh Shabestary, Willem Bastiaan Kleijn, Jan Skoglund
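
The idea in the abstract above (a real-valued filter built from STFT magnitudes, followed by aligning the phase of the echo estimate with the microphone signal) can be sketched for a single frequency bin. This is an illustrative sketch under stated assumptions, not the patented implementation: the least-squares gain over magnitudes and both function names are hypothetical choices made here.

```python
import cmath
from typing import Sequence


def magnitude_gain(mic_bins: Sequence[complex],
                   spk_bins: Sequence[complex]) -> float:
    """Real-valued, non-negative gain fitted on STFT magnitudes only.

    Assumption: a least-squares fit of |mic| against |loudspeaker| over
    several frames of one frequency bin. Because only magnitudes are
    used, the fit is unaffected by phase rotation caused by clock drift.
    """
    num = sum(abs(y) * abs(x) for y, x in zip(mic_bins, spk_bins))
    den = sum(abs(x) ** 2 for x in spk_bins)
    return num / den if den > 0.0 else 0.0


def phase_aligned_echo(mic_bin: complex, spk_bin: complex, gain: float) -> complex:
    """Echo estimate with magnitude gain * |spk_bin| and the phase of mic_bin."""
    magnitude = gain * abs(spk_bin)
    return cmath.rect(magnitude, cmath.phase(mic_bin))
```

Subtracting the phase-aligned estimate from the microphone bin then gives the echo-reduced bin for that frame.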
SELECTIVE ADAPTATION AND UTILIZATION OF NOISE REDUCTION TECHNIQUE IN INVOCATION PHRASE DETECTION

Publication number: 20200294496

Abstract: Techniques are described for selectively adapting and/or selectively utilizing a noise reduction technique in detection of one or more features of a stream of audio data frames. For example, various techniques are directed to selectively adapting and/or utilizing a noise reduction technique in detection of an invocation phrase in a stream of audio data frames, detection of voice characteristics in a stream of audio data frames (e.g., for speaker identification), etc. Utilization of described techniques can result in more robust and/or more accurate detections of features of a stream of audio data frames in various situations, such as in environments with strong background noise. In various implementations, described techniques are implemented in combination with an automated assistant, and feature(s) detected utilizing techniques described herein are utilized to adapt the functionality of the automated assistant.

Type: Application

Filed: May 28, 2020

Publication date: September 17, 2020

Inventors: Christopher Hughes, Yiteng Huang, Turaj Zakizadeh Shabestary, Taylor Applebaum
Selective adaptation and utilization of noise reduction technique in invocation phrase detection

Patent number: 10706842

Abstract: Techniques are described for selectively adapting and/or selectively utilizing a noise reduction technique in detection of one or more features of a stream of audio data frames. Various techniques are directed to selectively adapting and/or utilizing a noise reduction technique in detection of an invocation phrase in a stream of audio data frames, detection of voice characteristics in a stream of audio data frames (e.g., for speaker identification), etc. Utilization of described techniques can result in more robust and/or more accurate detections of features of a stream of audio data frames in various situations, such as in environments with strong background noise. In various implementations, described techniques are implemented in combination with an automated assistant, and feature(s) detected utilizing techniques described herein are utilized to adapt the functionality of the automated assistant.

Type: Grant

Filed: January 14, 2019

Date of Patent: July 7, 2020

Assignee: GOOGLE LLC

Inventors: Christopher Hughes, Yiteng Huang, Turaj Zakizadeh Shabestary, Taylor Applebaum
ECHO CANCELLATION FOR KEYWORD SPOTTING

Publication number: 20200152220

Abstract: Techniques of performing linear acoustic echo cancellation include performing a phase correction operation on the estimate of the echo signal based on a clock drift between a capture of an input microphone signal and a playout of a loudspeaker signal. Along these lines, the existence of the clock drift, i.e., a small difference in the sampling rates of the input microphone signal and the loudspeaker signal, can cause processing circuitry in a device configured to perform LAEC operations to generate a filter based on the magnitudes of the short-term Fourier transforms (STFTs) of the input microphone signal and the loudspeaker signal. Such a filter is real-valued and results in a positive estimate of the acoustic echo signal included in the input microphone signal. The phase of this estimate may then be aligned with the phase of the input microphone signal.

Type: Application

Filed: October 10, 2019

Publication date: May 14, 2020

Inventors: Turaj Zakizadeh Shabestary, Willem Bastiaan Kleijn, Jan Skoglund
SELECTIVE ADAPTATION AND UTILIZATION OF NOISE REDUCTION TECHNIQUE IN INVOCATION PHRASE DETECTION

Publication number: 20200066263

Abstract: Techniques are described for selectively adapting and/or selectively utilizing a noise reduction technique in detection of one or more features of a stream of audio data frames. For example, various techniques are directed to selectively adapting and/or utilizing a noise reduction technique in detection of an invocation phrase in a stream of audio data frames, detection of voice characteristics in a stream of audio data frames (e.g., for speaker identification), etc. Utilization of described techniques can result in more robust and/or more accurate detections of features of a stream of audio data frames in various situations, such as in environments with strong background noise. In various implementations, described techniques are implemented in combination with an automated assistant, and feature(s) detected utilizing techniques described herein are utilized to adapt the functionality of the automated assistant.

Type: Application

Filed: January 14, 2019

Publication date: February 27, 2020

Inventors: Christopher Hughes, Yiteng Huang, Turaj Zakizadeh Shabestary, Taylor Applebaum
MULTI-CHANNEL ECHO CANCELLATION WITH SCENARIO MEMORY

Publication number: 20190392853

Abstract: According to an aspect, a method for multi-channel echo cancellation includes receiving a microphone signal and a multi-channel loudspeaker driving signal. The multi-channel loudspeaker driving signal includes a first driving signal that drives a first loudspeaker, and a second driving signal that drives a second loudspeaker. The first driving signal is substantially the same as the second driving signal. The microphone signal includes a near-end signal with echo. The method includes determining a unique solution for acoustic transfer functions for a present acoustic scenario based on the microphone signal and the multi-channel loudspeaker driving signal. The acoustic transfer functions include first and second acoustic transfer functions. The unique solution is determined based on time-frequency transforms of observations from the present acoustic scenario and at least one previous acoustic scenario.

Type: Application

Filed: June 26, 2018

Publication date: December 26, 2019

Inventors: Willem Bastiaan Kleijn, Turaj Zakizadeh Shabestary
Echo cancellation for keyword spotting

Patent number: 10490203

Abstract: Techniques of performing linear acoustic echo cancellation include performing a phase correction operation on the estimate of the echo signal based on a clock drift between a capture of an input microphone signal and a playout of a loudspeaker signal. Along these lines, the existence of the clock drift, i.e., a small difference in the sampling rates of the input microphone signal and the loudspeaker signal, can cause processing circuitry in a device configured to perform LAEC operations to generate a filter based on the magnitudes of the short-term Fourier transforms (STFTs) of the input microphone signal and the loudspeaker signal. Such a filter is real-valued and results in a positive estimate of the acoustic echo signal included in the input microphone signal. The phase of this estimate may then be aligned with the phase of the input microphone signal.

Type: Grant

Filed: December 18, 2017

Date of Patent: November 26, 2019

Assignee: GOOGLE LLC

Inventors: Turaj Zakizadeh Shabestary, Willem Bastiaan Kleijn, Jan Skoglund
Methods And Systems For Detecting And Processing Speech Signals

Publication number: 20190189128

Abstract: Provided are methods, systems, and apparatuses for detecting, processing, and responding to audio signals, including speech signals, within a designated area or space. A platform for multiple media devices connected via a network is configured to process speech, such as voice commands, detected at the media devices, and respond to the detected speech by causing the media devices to simultaneously perform one or more requested actions. The platform is capable of scoring the quality of a speech request, handling speech requests from multiple end points of the platform using a centralized processing approach, a de-centralized processing approach, or a combination thereof, and also manipulating partial processing of speech requests from multiple end points into a coherent whole when necessary.

Type: Application

Filed: February 20, 2019

Publication date: June 20, 2019

Applicant: Google LLC

Inventors: Jay Pierre Civelli, Mikhal Shemer, Turaj Zakizadeh Shabestary, David Tapuska
Methods and systems for detecting and processing speech signals

Patent number: 10255920

Abstract: Provided are methods, systems, and apparatuses for detecting, processing, and responding to audio signals, including speech signals, within a designated area or space. A platform for multiple media devices connected via a network is configured to process speech, such as voice commands, detected at the media devices, and respond to the detected speech by causing the media devices to simultaneously perform one or more requested actions. The platform is capable of scoring the quality of a speech request, handling speech requests from multiple end points of the platform using a centralized processing approach, a de-centralized processing approach, or a combination thereof, and also manipulating partial processing of speech requests from multiple end points into a coherent whole when necessary.

Type: Grant

Filed: June 16, 2017

Date of Patent: April 9, 2019

Assignee: Google LLC

Inventors: Jay Pierre Civelli, Mikhal Shemer, Turaj Zakizadeh Shabestary, David Tapuska
Methods and systems for detecting and processing speech signals

Patent number: 10249303

Abstract: Provided are methods, systems, and apparatuses for detecting, processing, and responding to audio signals, including speech signals, within a designated area or space. A platform for multiple media devices connected via a network is configured to process speech, such as voice commands, detected at the media devices, and respond to the detected speech by causing the media devices to simultaneously perform one or more requested actions. The platform is capable of scoring the quality of a speech request, handling speech requests from multiple end points of the platform using a centralized processing approach, a de-centralized processing approach, or a combination thereof, and also manipulating partial processing of speech requests from multiple end points into a coherent whole when necessary.

Type: Grant

Filed: June 14, 2017

Date of Patent: April 2, 2019

Assignee: Google LLC

Inventors: Jay Pierre Civelli, Mikhal Shemer, Turaj Zakizadeh Shabestary, David Tapuska
Methods and systems for detecting and processing speech signals

Patent number: 10163443

Abstract: Provided are methods, systems, and apparatuses for detecting, processing, and responding to audio signals, including speech signals, within a designated area or space. A platform for multiple media devices connected via a network is configured to process speech, such as voice commands, detected at the media devices, and respond to the detected speech by causing the media devices to simultaneously perform one or more requested actions. The platform is capable of scoring the quality of a speech request, handling speech requests from multiple end points of the platform using a centralized processing approach, a de-centralized processing approach, or a combination thereof, and also manipulating partial processing of speech requests from multiple end points into a coherent whole when necessary.

Type: Grant

Filed: June 16, 2017

Date of Patent: December 25, 2018

Assignee: Google LLC

Inventors: Jay Pierre Civelli, Mikhal Shemer, Turaj Zakizadeh Shabestary, David Tapuska
Methods and systems for detecting and processing speech signals

Patent number: 10163442

Abstract: Provided are methods, systems, and apparatuses for detecting, processing, and responding to audio signals, including speech signals, within a designated area or space. A platform for multiple media devices connected via a network is configured to process speech, such as voice commands, detected at the media devices, and respond to the detected speech by causing the media devices to simultaneously perform one or more requested actions. The platform is capable of scoring the quality of a speech request, handling speech requests from multiple end points of the platform using a centralized processing approach, a de-centralized processing approach, or a combination thereof, and also manipulating partial processing of speech requests from multiple end points into a coherent whole when necessary.

Type: Grant

Filed: May 17, 2017

Date of Patent: December 25, 2018

Assignee: Google LLC

Inventors: Jay Pierre Civelli, Mikhal Shemer, Turaj Zakizadeh Shabestary, David Tapuska
