Patents by Inventor Aleksandar Kracun

Aleksandar Kracun has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Detecting and suppressing voice queries

Patent number: 11341969

Abstract: A computing system receives requests from client devices to process voice queries that have been detected in local environments of the client devices. The system identifies that a value that is based on a number of requests to process voice queries received by the system during a specified time interval satisfies one or more criteria. In response, the system triggers analysis of at least some of the requests received during the specified time interval to trigger analysis of at least some received requests to determine a set of requests that each identify a common voice query. The system can generate an electronic fingerprint that indicates a distinctive model of the common voice query. The fingerprint can then be used to detect an illegitimate voice query identified in a request from a client device at a later time.

Type: Grant

Filed: May 27, 2020

Date of Patent: May 24, 2022

Assignee: Google LLC

Inventors: Alexander H. Gruenstein, Aleksandar Kracun, Matthew Sharifi
Navigation with Dynamic Regrouping Points

Publication number: 20220120573

Abstract: The present disclosure is directed to a system and method for providing dynamic grouping and regrouping for users in a joint positional tracking session. The method includes receiving positional data associated with a first user and at least one other user in the plurality of users in the joint positional tracking session. The method includes determining that a separation parameter associated with the first user has exceeded a threshold separation value, the separation parameter associated with the first user representing a distance between the first user and one other user in the plurality of users. The method includes automatically generating navigational data for reducing the separation parameter between the first user and one other user in the joint positional tracking session to below the threshold separation value. The method includes transmitting the navigational data to at least the first user in the joint positional tracking session.

Type: Application

Filed: June 25, 2019

Publication date: April 21, 2022

Inventors: Matthew Sharifi, Aleksandar Kracun
END-TO-END SPEECH CONVERSION

Publication number: 20220122579

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for end to end speech conversion are disclosed. In one aspect, a method includes the actions of receiving first audio data of a first utterance of one or more first terms spoken by a user. The actions further include providing the first audio data as an input to a model that is configured to receive first given audio data in a first voice and output second given audio data in a synthesized voice without performing speech recognition on the first given audio data. The actions further include receiving second audio data of a second utterance of the one or more first terms spoken in the synthesized voice. The actions further include providing, for output, the second audio data of the second utterance of the one or more first terms spoken in the synthesized voice.

Type: Application

Filed: November 26, 2019

Publication date: April 21, 2022

Applicant: Google LLC

Inventors: Fadi Biadsy, Ron J. Weiss, Aleksandar Kracun, Pedro J. Moreno Mengibar
Hotword-aware speech synthesis

Patent number: 11308934

Abstract: A method includes receiving text input data for conversion into synthesized speech and determining, using a hotword-aware model trained to detect a presence of a hotword assigned to a user device, whether a pronunciation of the text input data includes the hotword. The hotword is configured to initiate a wake-up process on the user device for processing the hotword and/or one or more other terms following the hotword in the audio input data. When the pronunciation of the text input data includes the hotword, the method also includes generating an audio output signal from the text input data and providing the audio output signal to an audio output device to output the audio output signal. The audio output signal when captured by an audio capture device of the user device, configured to prevent initiation of the wake-up process on the user device.

Type: Grant

Filed: June 25, 2018

Date of Patent: April 19, 2022

Assignee: Google LLC

Inventors: Matthew Sharifi, Aleksandar Kracun
Voice Query QoS Based On Client-Computed Content Metadata

Publication number: 20220093104

Abstract: A method includes receiving an automated speech recognition (ASR) request from a user device that includes a speech input captured by the user device and content metadata associated with the speech input. The content metadata is generated by the user device. The method also includes determining a priority score for the ASR request based on the content metadata associated with the speech input and caching the ASR request in a pre-processing backlog of pending ASR requests each having a corresponding priority score. The pending ASR requests in the pre-processing backlog are ranked in order of the priority scores. The method also includes providing, from the pre-processing backlog, one or more of the pending ASR requests to a backend-side ASR module, wherein pending ASR requests associated with higher priority scores are processed before pending ASR requests associated with lower priority scores.

Type: Application

Filed: February 6, 2019

Publication date: March 24, 2022

Applicant: Google LLC

Inventors: Matthew Sharifi, Aleksandar Kracun
Hotword-Aware Speech Synthesis

Publication number: 20210366459

Abstract: A method includes receiving text input data for conversion into synthesized speech and determining, using a hotword-aware model trained to detect a presence of a hotword assigned to a user device, whether a pronunciation of the text input data includes the hotword. The hotword is configured to initiate a wake-up process on the user device for processing the hotword and/or one or more other terms following the hotword in the audio input data. When the pronunciation of the text input data includes the hotword, the method also includes generating an audio output signal from the text input data and providing the audio output signal to an audio output device to output the audio output signal. The audio output signal when captured by an audio capture device of the user device, configured to prevent initiation of the wake-up process on the user device.

Type: Application

Filed: August 5, 2021

Publication date: November 25, 2021

Applicant: Google LLC

Inventors: Matthew Sharifi, Aleksandar Kracun
SPEAKER DIARIZATION

Publication number: 20210295824

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speaker diarization are disclosed. In one aspect a method includes the actions of receiving audio data corresponding to an utterance. The actions further include determining that the audio data includes an utterance of a predefined hotword spoken by a first speaker. The actions further include identifying a first portion of the audio data that includes speech from the first speaker. The actions further include identifying a second portion of the audio data that includes speech from a second, different speaker. The actions further include transmitting the first portion of the audio data that includes speech from the first speaker and suppressing transmission of the second portion of the audio data that includes speech from the second, different speaker.

Type: Application

Filed: April 5, 2021

Publication date: September 23, 2021

Applicant: Google LLC

Inventors: Aleksandar Kracun, Richard Cameron Rose
Speaker diarization

Patent number: 10978070

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speaker diarization are disclosed. In one aspect, a method includes the actions of receiving audio data corresponding to an utterance. The actions further include determining that the audio data includes an utterance of a predefined hotword spoken by a first speaker. The actions further include identifying a first portion of the audio data that includes speech from the first speaker. The actions further include identifying a second portion of the audio data that includes speech from a second, different speaker. The actions further include transmitting the first portion of the audio data that includes speech from the first speaker and suppressing transmission of the second portion of the audio data that includes speech from the second, different speaker.

Type: Grant

Filed: August 27, 2019

Date of Patent: April 13, 2021

Inventors: Aleksandar Kracun, Richard Cameron Rose
CONTEXTUAL HOTWORDS

Publication number: 20210043210

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for contextual hotwords are disclosed. In one aspect, a method, during a boot process of a computing device, includes the actions of determining, by a computing device, a context associated with the computing device. The actions further include, based on the context associated with the computing device, determining a hotword. The actions further include, after determining the hotword, receiving audio data that corresponds to an utterance. The actions further include determining that the audio data includes the hotword. The actions further include, in response to determining that the audio data includes the hotword, performing an operation associated with the hotword.

Type: Application

Filed: October 12, 2020

Publication date: February 11, 2021

Applicant: Google LLC

Inventors: Christopher Thaddeus Hughes, Ignacio Lopez Moreno, Aleksandar Kracun
Contextual hotwords

Patent number: 10839803

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for contextual hotwords are disclosed. In one aspect, a method, during a boot process of a computing device, includes the actions of determining, by a computing device, a context associated with the computing device. The actions further include, based on the context associated with the computing device, determining a hotword. The actions further include, after determining the hotword, receiving audio data that corresponds to an utterance. The actions further include determining that the audio data includes the hotword. The actions further include, in response to determining that the audio data includes the hotword, performing an operation associated with the hotword.

Type: Grant

Filed: March 25, 2019

Date of Patent: November 17, 2020

Assignee: Google LLC

Inventors: Christopher Thaddeus Hughes, Ignacio Lopez Moreno, Aleksandar Kracun
SPEAKER DIARIZATION

Publication number: 20200098374

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speaker diarization are disclosed. In one aspect, a method includes the actions of receiving audio data corresponding to an utterance. The actions further include determining that the audio data includes an utterance of a predefined hotword spoken by a first speaker. The actions further include identifying a first portion of the audio data that includes speech from the first speaker. The actions further include identifying a second portion of the audio data that includes speech from a second, different speaker. The actions further include transmitting the first portion of the audio data that includes speech from the first speaker and suppressing transmission of the second portion of the audio data that includes speech from the second, different speaker.

Type: Application

Filed: August 27, 2019

Publication date: March 26, 2020

Inventors: Aleksandar Kracun, Richard Cameron Rose
CONTEXTUAL HOTWORDS

Publication number: 20190287528

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for contextual hotwords are disclosed. In one aspect, a method, during a boot process of a computing device, includes the actions of determining, by a computing device, a context associated with the computing device. The actions further include, based on the context associated with the computing device, determining a hotword. The actions further include, after determining the hotword, receiving audio data that corresponds to an utterance. The actions further include determining that the audio data includes the hotword. The actions further include, in response to determining that the audio data includes the hotword, performing an operation associated with the hotword.

Type: Application

Filed: March 25, 2019

Publication date: September 19, 2019

Inventors: Christopher Thaddeus Hughes, Ignacio Lopez Moreno, Aleksandar Kracun
Speaker diarization

Patent number: 10403288

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speaker diarization are disclosed. In one aspect, a method includes the actions of receiving audio data corresponding to an utterance. The actions further include determining that the audio data includes an utterance of a predefined hotword spoken by a first speaker. The actions further include identifying a first portion of the audio data that includes speech from the first speaker. The actions further include identifying a second portion of the audio data that includes speech from a second, different speaker. The actions further include transmitting the first portion of the audio data that includes speech from the first speaker and suppressing transmission of the second portion of the audio data that includes speech from the second, different speaker.

Type: Grant

Filed: October 17, 2017

Date of Patent: September 3, 2019

Assignee: Google LLC

Inventors: Aleksandar Kracun, Richard Cameron Rose
Contextual hotwords

Patent number: 10276161

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for contextual hotwords are disclosed. In one aspect, a method, during a boot process of a computing device, includes the actions of determining, by a computing device, a context associated with the computing device. The actions further include, based on the context associated with the computing device, determining a hotword. The actions further include, after determining the hotword, receiving audio data that corresponds to an utterance. The actions further include determining that the audio data includes the hotword. The actions further include, in response to determining that the audio data includes the hotword, performing an operation associated with the hotword.

Type: Grant

Filed: December 27, 2016

Date of Patent: April 30, 2019

Assignee: Google LLC

Inventors: Christopher Thaddeus Hughes, Ignacio Lopez Moreno, Aleksandar Kracun
SPEAKER DIARIZATION

Publication number: 20190115029

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speaker diarization are disclosed. In one aspect, a method includes the actions of receiving audio data corresponding to an utterance. The actions further include determining that the audio data includes an utterance of a predefined hotword spoken by a first speaker. The actions further include identifying a first portion of the audio data that includes speech from the first speaker. The actions further include identifying a second portion of the audio data that includes speech from a second, different speaker. The actions further include transmitting the first portion of the audio data that includes speech from the first speaker and suppressing transmission of the second portion of the audio data that includes speech from the second, different speaker.

Type: Application

Filed: October 17, 2017

Publication date: April 18, 2019

Inventors: Aleksandar Kracun, Richard Cameron Rose
Detecting and suppressing voice queries

Patent number: 10170112

Abstract: A computing system receives requests from client devices to process voice queries that have been detected in local environments of the client devices. The system identifies that a value that is based on a number of requests to process voice queries received by the system during a specified time interval satisfies one or more criteria. In response, the system triggers analysis of at least some of the requests received during the specified time interval to trigger analysis of at least some received requests to determine a set of requests that each identify a common voice query. The system can generate an electronic fingerprint that indicates a distinctive model of the common voice query. The fingerprint can then be used to detect an illegitimate voice query identified in a request from a client device at a later time.

Type: Grant

Filed: May 11, 2017

Date of Patent: January 1, 2019

Assignee: Google LLC

Inventors: Alexander H. Gruenstein, Aleksandar Kracun, Matthew Sharifi
DETECTING AND SUPPRESSING VOICE QUERIES

Publication number: 20180330728

Abstract: A computing system receives requests from client devices to process voice queries that have been detected in local environments of the client devices. The system identifies that a value that is based on a number of requests to process voice queries received by the system during a specified time interval satisfies one or more criteria. In response, the system triggers analysis of at least some of the requests received during the specified time interval to trigger analysis of at least some received requests to determine a set of requests that each identify a common voice query. The system can generate an electronic fingerprint that indicates a distinctive model of the common voice query. The fingerprint can then be used to detect an illegitimate voice query identified in a request from a client device at a later time.

Type: Application

Filed: May 11, 2017

Publication date: November 15, 2018

Inventors: Alexander H. Gruenstein, Aleksandar Kracun, Matthew Sharifi
CONTEXTUAL HOTWORDS

Publication number: 20180182390

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for contextual hotwords are disclosed. In one aspect, a method, during a boot process of a computing device, includes the actions of determining, by a computing device, a context associated with the computing device. The actions further include, based on the context associated with the computing device, determining a hotword. The actions further include, after determining the hotword, receiving audio data that corresponds to an utterance. The actions further include determining that the audio data includes the hotword. The actions further include, in response to determining that the audio data includes the hotword, performing an operation associated with the hotword.

Type: Application

Filed: December 27, 2016

Publication date: June 28, 2018

Inventors: Christopher Thaddeus Hughes, Ignacio Lopez Moreno, Aleksandar Kracun

prev 1 2