Patents by Inventor STANISLAW IGNACY PASKO

STANISLAW IGNACY PASKO has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Speech interface device with caching component

Patent number: 11887604

Abstract: A speech interface device is configured to receive response data from a remote speech processing system for responding to user speech. This response data may be enhanced with information such as a remote ASR result(s) and a remote NLU result(s). The response data from the remote speech processing system may include one or more cacheable status indicators associated with the NLU result(s) and/or remote directive data, which indicate whether the remote NLU result(s) and/or the remote directive data are individually cacheable. A caching component of the speech interface device allows for caching at least some of this cacheable remote speech processing information, and using the cached information locally on the speech interface device when responding to user speech in the future. This allows for responding to user speech, even when the speech interface device is unable to communicate with a remote speech processing system over a wide area network.
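The caching behavior described in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the class names, the field names, and the choice to key the cache on normalized ASR text are all assumptions made for illustration. The sketch stores only those parts of a remote response whose cacheable indicator is set, and reads them back later without contacting the remote system.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class RemoteResponse:
    """Hypothetical shape of the response data sent by the remote system."""
    asr_text: str              # remote ASR result
    nlu_intent: str            # remote NLU result
    directive: str             # remote directive data
    nlu_cacheable: bool        # cacheable status indicator for the NLU result
    directive_cacheable: bool  # cacheable status indicator for the directive


@dataclass
class CacheEntry:
    nlu_intent: Optional[str] = None
    directive: Optional[str] = None


class ResponseCache:
    """Keeps cacheable remote results, keyed by normalized ASR text (an assumption)."""

    def __init__(self) -> None:
        self._entries: Dict[str, CacheEntry] = {}

    @staticmethod
    def _key(text: str) -> str:
        # Lowercase and collapse whitespace so small transcript differences still match.
        return " ".join(text.lower().split())

    def store(self, response: RemoteResponse) -> None:
        entry = CacheEntry()
        # Each part is cached individually, according to its own indicator.
        if response.nlu_cacheable:
            entry.nlu_intent = response.nlu_intent
        if response.directive_cacheable:
            entry.directive = response.directive
        if entry.nlu_intent is not None or entry.directive is not None:
            self._entries[self._key(response.asr_text)] = entry

    def lookup(self, local_asr_text: str) -> Optional[CacheEntry]:
        # Used later, for example when the wide area network is unavailable.
        return self._entries.get(self._key(local_asr_text))


cache = ResponseCache()
cache.store(RemoteResponse("Turn on the kitchen light", "TurnOnIntent",
                           "power_on:kitchen_light", True, True))
cache.store(RemoteResponse("What time is it", "GetTimeIntent",
                           "speak:10:42", True, False))
```

In this sketch the time query keeps its NLU result but not its directive, since a spoken time goes stale, while the light command keeps both.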

Type: Grant

Filed: September 2, 2022

Date of Patent: January 30, 2024

Assignee: Amazon Technologies, Inc.

Inventor: Stanislaw Ignacy Pasko

INTERMEDIATE DATA FOR INTER-DEVICE SPEECH PROCESSING

Publication number: 20240029743

Abstract: Some speech processing systems may handle some commands on-device rather than sending the audio data to a second device or system for processing. The first device may have limited speech processing capabilities sufficient for handling common language and/or commands, while the second device (e.g., an edge device and/or a remote system) may call on additional language models, entity libraries, skill components, etc. to perform additional tasks. An intermediate data generator may facilitate dividing speech processing operations between devices by generating a stream of data that includes a first-pass ASR output (e.g., a word or sub-word lattice) and other characteristics of the audio data such as whisper detection, speaker identification, media signatures, etc. The second device can perform the additional processing using the data stream; e.g., without using the audio data. Thus, privacy may be enhanced by processing the audio data locally without sending it to other devices/systems.
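As a rough illustration of the idea in this abstract, the Python sketch below models one unit of intermediate data as a small lattice plus a few audio characteristics, and shows a second device working from that data alone. The type names, the arc format, the log-score convention, and the `second_pass` function are assumptions for illustration; the abstract does not specify any of them.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple

# One lattice arc: (start_node, end_node, token, log_score). Hypothetical format.
LatticeArc = Tuple[int, int, str, float]


@dataclass
class IntermediateData:
    """What the first device sends instead of audio (field names are assumptions)."""
    lattice: List[LatticeArc]             # first-pass ASR output
    is_whisper: bool = False              # whisper detection
    speaker_id: Optional[str] = None      # speaker identification
    media_signature: Optional[str] = None


def best_path(arcs: List[LatticeArc]) -> List[str]:
    """Highest-scoring token sequence from node 0 to the last node.

    Assumes every arc goes from a lower node id to a higher one.
    """
    if not arcs:
        return []
    final = max(end for _, end, _, _ in arcs)
    best: Dict[int, Tuple[float, List[str]]] = {0: (0.0, [])}
    # Sorting by start node guarantees a node's best score is settled before it is extended.
    for start, end, token, score in sorted(arcs):
        if start not in best:
            continue
        candidate = best[start][0] + score
        if end not in best or candidate > best[end][0]:
            best[end] = (candidate, best[start][1] + [token])
    return best.get(final, (0.0, []))[1]


def second_pass(data: IntermediateData) -> Dict[str, object]:
    """Runs on the second device; it never receives the audio itself."""
    return {
        "text": " ".join(best_path(data.lattice)),
        "speaker": data.speaker_id,
        "respond_quietly": data.is_whisper,
    }


frame = IntermediateData(
    lattice=[(0, 1, "play", -0.1), (0, 1, "pay", -0.9),
             (1, 2, "jazz", -0.2), (1, 2, "jas", -1.5)],
    is_whisper=True,
    speaker_id="speaker-1",
)
result = second_pass(frame)
```

A real second device would presumably rescore the lattice with larger language models rather than simply take the best first-pass path; the sketch only shows that the data stream is sufficient input.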

Type: Application

Filed: June 6, 2023

Publication date: January 25, 2024

Inventors: Stanislaw Ignacy Pasko, Pawel Zelazko, Cagdas Bak, Eli Joshua Fidler, Michal Kowalczuk, Andrew Oberlin, Ariya Rastrow

Architecture for a hub configured to control a second device while a connection to a remote system is unavailable

Patent number: 11822857

Abstract: A hub is configured to provide voice control without assistance from a remote system, which allows the hub to provide a user with the ability to control second devices in an environment by issuing voice commands, even when the hub is unable to communicate with the remote system over a wide area network (e.g., the Internet). The hub is also configured to execute rules without assistance from the remote system, which allows the hub to execute rules, even when the hub is unable to communicate with the remote system over a wide area network (e.g., the Internet).

Type: Grant

Filed: July 9, 2020

Date of Patent: November 21, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Mark Aran Aiken, Stanislaw Ignacy Pasko, Olusanya Temitope Soyannwo, Vibhav Hemant Salgaonkar, Adam Barry Fineberg, Roger Robert Webster, Makarand Damle, Rohan Mutagi, Philip Alexander Lee

DEVICE ARBITRATION BY MULTIPLE SPEECH PROCESSING SYSTEMS

Publication number: 20230297327

Abstract: A device can perform device arbitration, even when the device is unable to communicate with a remote system over a wide area network (e.g., the Internet). Upon detecting a wakeword in an utterance, the device can wait a period of time for data to arrive at the device, which, if received, indicates to the device that another speech interface device in the environment detected an utterance. If the device receives data prior to the period of time lapsing, the device can determine the earliest-occurring wakeword based on multiple wakeword occurrence times, and may designate whichever device that detected the wakeword first as the designated device to perform an action with respect to the user speech. To account for differences in sound capture latency between speech interface devices, a pre-calculated time offset value can be applied to wakeword occurrence time(s) during device arbitration.
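The arbitration steps in this abstract can be sketched in Python as follows. This is a simplified illustration under stated assumptions, not the claimed method: the event type, the use of an in-process queue to stand in for messages from other devices, and the convention of subtracting the offset from the reported time are all hypothetical.

```python
import queue
import time
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class WakewordEvent:
    device_id: str
    wakeword_time: float  # seconds, as reported by the detecting device


def collect_peer_events(inbox: queue.Queue, wait_s: float) -> List[WakewordEvent]:
    """Wait up to wait_s seconds for wakeword events from other devices."""
    deadline = time.monotonic() + wait_s
    events: List[WakewordEvent] = []
    while True:
        remaining = deadline - time.monotonic()
        if remaining <= 0:
            break
        try:
            events.append(inbox.get(timeout=remaining))
        except queue.Empty:
            break
    return events


def arbitrate(local: WakewordEvent,
              peers: List[WakewordEvent],
              offsets: Dict[str, float]) -> str:
    """Return the id of the device with the earliest offset-adjusted wakeword time."""

    def adjusted(event: WakewordEvent) -> float:
        # Assumption: the pre-calculated offset is that device's capture latency,
        # so subtracting it approximates when the sound actually occurred.
        return event.wakeword_time - offsets.get(event.device_id, 0.0)

    # On a tie, min() keeps the first candidate, which is the local device.
    return min([local, *peers], key=adjusted).device_id


inbox: queue.Queue = queue.Queue()
inbox.put(WakewordEvent("bedroom", 10.030))
local_event = WakewordEvent("kitchen", 10.020)
peer_events = collect_peer_events(inbox, wait_s=0.05)

winner_raw = arbitrate(local_event, peer_events, offsets={})
winner_adjusted = arbitrate(local_event, peer_events, offsets={"bedroom": 0.025})
```

With raw timestamps the kitchen device appears to have heard the wakeword first; once the bedroom device's assumed 25 ms capture latency is removed, the bedroom device is designated instead.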

Type: Application

Filed: November 23, 2022

Publication date: September 21, 2023

Inventor: Stanislaw Ignacy Pasko

Intermediate data for inter-device speech processing

Patent number: 11721347

Abstract: Some speech processing systems may handle some commands on-device rather than sending the audio data to a second device or system for processing. The first device may have limited speech processing capabilities sufficient for handling common language and/or commands, while the second device (e.g., an edge device and/or a remote system) may call on additional language models, entity libraries, skill components, etc. to perform additional tasks. An intermediate data generator may facilitate dividing speech processing operations between devices by generating a stream of data that includes a first-pass ASR output (e.g., a word or sub-word lattice) and other characteristics of the audio data such as whisper detection, speaker identification, media signatures, etc. The second device can perform the additional processing using the data stream; e.g., without using the audio data. Thus, privacy may be enhanced by processing the audio data locally without sending it to other devices/systems.

Type: Grant

Filed: June 29, 2021

Date of Patent: August 8, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Stanislaw Ignacy Pasko, Pawel Zelazko, Cagdas Bak, Eli Joshua Fidler, Michal Kowalczuk, Andrew Oberlin, Ariya Rastrow

Device arbitration by multiple speech processing systems

Patent number: 11513766

Abstract: A device can perform device arbitration, even when the device is unable to communicate with a remote system over a wide area network (e.g., the Internet). Upon detecting a wakeword in an utterance, the device can wait a period of time for data to arrive at the device, which, if received, indicates to the device that another speech interface device in the environment detected an utterance. If the device receives data prior to the period of time lapsing, the device can determine the earliest-occurring wakeword based on multiple wakeword occurrence times, and may designate whichever device that detected the wakeword first as the designated device to perform an action with respect to the user speech. To account for differences in sound capture latency between speech interface devices, a pre-calculated time offset value can be applied to wakeword occurrence time(s) during device arbitration.

Type: Grant

Filed: June 8, 2020

Date of Patent: November 29, 2022

Assignee: Amazon Technologies, Inc.

Inventor: Stanislaw Ignacy Pasko

Speech interface device with caching component

Patent number: 11437041

Abstract: A speech interface device is configured to receive response data from a remote speech processing system for responding to user speech. This response data may be enhanced with information such as a remote ASR result(s) and a remote NLU result(s). The response data from the remote speech processing system may include one or more cacheable status indicators associated with the NLU result(s) and/or remote directive data, which indicate whether the remote NLU result(s) and/or the remote directive data are individually cacheable. A caching component of the speech interface device allows for caching at least some of this cacheable remote speech processing information, and using the cached information locally on the speech interface device when responding to user speech in the future. This allows for responding to user speech, even when the speech interface device is unable to communicate with a remote speech processing system over a wide area network.

Type: Grant

Filed: September 11, 2020

Date of Patent: September 6, 2022

Assignee: Amazon Technologies, Inc.

Inventor: Stanislaw Ignacy Pasko

Utilization of natural language understanding (NLU) models

Patent number: 11132509

Abstract: A speech interface device is configured to perform natural language understanding (NLU) processing in a manner that optimizes the use of resources on the speech interface device. In an example process, a domain classifier(s) is used to generate domain classifier scores associated with multiple candidate domains, and the candidate domains can then be evaluated, one candidate domain at a time, in accordance with the domain classifier scores (e.g., starting with a highest scoring candidate domain). For each candidate domain undergoing the evaluation, input data is processed by that domain's NLU model(s), and, as soon as a domain-specific NLU model(s) produces an NLU result with a confidence score that satisfies a threshold confidence score, the evaluation can be stopped for any remaining candidate domains.
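The example process in this abstract amounts to a ranked loop with an early exit, which the Python sketch below illustrates. The function name, the result tuple, the toy domain models, and the use of `>=` to mean "satisfies the threshold" are assumptions made for illustration only.

```python
from typing import Callable, Dict, List, Optional, Tuple

NluResult = Tuple[str, float]          # (intent, confidence score)
NluModel = Callable[[str], NluResult]  # a domain-specific NLU model


def evaluate_domains(text: str,
                     domain_scores: Dict[str, float],
                     models: Dict[str, NluModel],
                     threshold: float) -> Tuple[Optional[NluResult], List[str]]:
    """Evaluate candidate domains one at a time, highest classifier score first.

    Returns the first result whose confidence satisfies the threshold, plus the
    list of domains whose models were actually run.
    """
    evaluated: List[str] = []
    ranked = sorted(domain_scores, key=lambda d: domain_scores[d], reverse=True)
    for domain in ranked:
        evaluated.append(domain)
        result = models[domain](text)
        if result[1] >= threshold:
            # Stop here: the remaining domain models never run, saving resources.
            return result, evaluated
    return None, evaluated


# Toy stand-ins for domain-specific NLU models.
toy_models: Dict[str, NluModel] = {
    "smart_home": lambda text: ("TurnOnIntent", 0.92),
    "music": lambda text: ("PlayMusicIntent", 0.40),
    "weather": lambda text: ("GetWeatherIntent", 0.10),
}
toy_scores = {"music": 0.3, "smart_home": 0.6, "weather": 0.1}

accepted, ran = evaluate_domains("turn on the light", toy_scores, toy_models, 0.8)
rejected, ran_all = evaluate_domains("turn on the light", toy_scores, toy_models, 0.95)
```

In the first call only the smart home model runs, because its result already satisfies the threshold. In the second call no model reaches 0.95, so all three run in score order and no result is returned.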

Type: Grant

Filed: December 3, 2018

Date of Patent: September 28, 2021

Assignee: Amazon Technologies, Inc.

Inventors: Stanislaw Ignacy Pasko, Ross William McGowan, Aliaksei Kuzmin, Rui Liu

HYBRID SPEECH INTERFACE DEVICE

Publication number: 20210241775

Abstract: A speech interface device is configured with “hybrid” capabilities, which allows the speech interface device to perform actions in response to user speech, even when the speech interface device is unable to communicate with a remote system over a wide area network (e.g., the Internet). A hybrid request selector of the speech interface device sends audio data representing user speech to both a remote speech processing system and a local speech processing component executing on the speech interface device, and then waits for a response from either or both components. The local speech processing component may start execution based on the audio data and subsequently suspend the execution until further instruction from the hybrid request selector. The hybrid request selector can then determine which response to use, and, depending on which response is chosen, may instruct the local speech processing component to either continue or terminate the suspended execution.
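The flow in this abstract, where audio goes to both processors and the local one pauses until the selector decides, can be sketched in Python with threads. This is an illustrative sketch only: the class and function names, the timeout, and the rule "prefer the remote response whenever it arrives" are assumptions, since the abstract does not say how the selector chooses.

```python
import threading
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Optional


class LocalSpeechProcessor:
    """Starts work on the audio, then suspends until the selector decides."""

    def __init__(self) -> None:
        self._decision = threading.Event()
        self._proceed = False

    def process(self, audio: bytes) -> Optional[str]:
        plan = f"local-plan({len(audio)} bytes)"  # stand-in for local ASR/NLU
        self._decision.wait()                     # suspended execution
        return plan if self._proceed else None    # continue or terminate

    def resume(self, proceed: bool) -> None:
        self._proceed = proceed
        self._decision.set()


def handle_utterance(audio: bytes,
                     remote: Callable[[bytes], str],
                     timeout_s: float = 0.5) -> Optional[str]:
    """Hypothetical hybrid request selector."""
    local = LocalSpeechProcessor()
    with ThreadPoolExecutor(max_workers=2) as pool:
        local_future = pool.submit(local.process, audio)
        remote_future = pool.submit(remote, audio)
        try:
            remote_response: Optional[str] = remote_future.result(timeout=timeout_s)
        except Exception:
            # Covers both a timeout and a failed remote call (e.g. no network).
            remote_response = None
        # Assumed policy: use the remote response if there is one, else go local.
        local.resume(proceed=remote_response is None)
        local_response = local_future.result()
        # Note: a remote call that hangs past the timeout still delays pool
        # shutdown here; real code would need proper cancellation.
    return remote_response if remote_response is not None else local_response


def remote_ok(audio: bytes) -> str:
    return "remote:play jazz"


def remote_offline(audio: bytes) -> str:
    raise ConnectionError("wide area network unavailable")


online_result = handle_utterance(b"\x00\x01\x02\x03", remote_ok)
offline_result = handle_utterance(b"\x00\x01\x02\x03", remote_offline)
```

When the remote system answers, the suspended local execution is terminated and the remote response is used; when the remote call fails, the local execution is told to continue and its result is used instead.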

Type: Application

Filed: April 19, 2021

Publication date: August 5, 2021

Inventors: Stanislaw Ignacy Pasko, Michal Papierski, Maciej Makowski, Marcin Fuszara

Hybrid speech interface device

Patent number: 10984799

Abstract: A speech interface device is configured with “hybrid” capabilities, which allows the speech interface device to perform actions in response to user speech, even when the speech interface device is unable to communicate with a remote system over a wide area network (e.g., the Internet). A hybrid request selector of the speech interface device sends audio data representing user speech to both a remote speech processing system and a local speech processing component executing on the speech interface device, and then waits for a response from either or both components. The local speech processing component may start execution based on the audio data and subsequently suspend the execution until further instruction from the hybrid request selector. The hybrid request selector can then determine which response to use, and, depending on which response is chosen, may instruct the local speech processing component to either continue or terminate the suspended execution.

Type: Grant

Filed: March 23, 2018

Date of Patent: April 20, 2021

Assignee: Amazon Technologies, Inc.

Inventors: Stanislaw Ignacy Pasko, Michal Papierski, Maciej Makowski, Marcin Fuszara

Architecture for a Hub Configured to Control a Second Device While a Connection to a Remote System is Unavailable

Publication number: 20200409657

Abstract: A hub is configured to provide voice control without assistance from a remote system, which allows the hub to provide a user with the ability to control second devices in an environment by issuing voice commands, even when the hub is unable to communicate with the remote system over a wide area network (e.g., the Internet). The hub is also configured to execute rules without assistance from the remote system, which allows the hub to execute rules, even when the hub is unable to communicate with the remote system over a wide area network (e.g., the Internet).

Type: Application

Filed: July 9, 2020

Publication date: December 31, 2020

Inventors: Mark Aran Aiken, Stanislaw Ignacy Pasko, Olusanya Temitope Soyannwo, Vibhav Hemant Salgaonkar, Adam Barry Fineberg, Roger Robert Webster, Makarand Damle, Rohan Mutagi, Philip Alexander Lee

DEVICE ARBITRATION BY MULTIPLE SPEECH PROCESSING SYSTEMS

Publication number: 20200301661

Abstract: A device can perform device arbitration, even when the device is unable to communicate with a remote system over a wide area network (e.g., the Internet). Upon detecting a wakeword in an utterance, the device can wait a period of time for data to arrive at the device, which, if received, indicates to the device that another speech interface device in the environment detected an utterance. If the device receives data prior to the period of time lapsing, the device can determine the earliest-occurring wakeword based on multiple wakeword occurrence times, and may designate whichever device that detected the wakeword first as the designated device to perform an action with respect to the user speech. To account for differences in sound capture latency between speech interface devices, a pre-calculated time offset value can be applied to wakeword occurrence time(s) during device arbitration.

Type: Application

Filed: June 8, 2020

Publication date: September 24, 2020

Inventor: Stanislaw Ignacy Pasko

Speech interface device with caching component

Patent number: 10777203

Abstract: A speech interface device is configured to receive response data from a remote speech processing system for responding to user speech. This response data may be enhanced with information such as a remote ASR result(s) and a remote NLU result(s). The response data from the remote speech processing system may include one or more cacheable status indicators associated with the NLU result(s) and/or remote directive data, which indicate whether the remote NLU result(s) and/or the remote directive data are individually cacheable. A caching component of the speech interface device allows for caching at least some of this cacheable remote speech processing information, and using the cached information locally on the speech interface device when responding to user speech in the future. This allows for responding to user speech, even when the speech interface device is unable to communicate with a remote speech processing system over a wide area network.

Type: Grant

Filed: March 23, 2018

Date of Patent: September 15, 2020

Assignee: Amazon Technologies, Inc.

Inventor: Stanislaw Ignacy Pasko

Architecture for a hub configured to control a second device while a connection to a remote system is unavailable

Patent number: 10713007

Abstract: A hub is configured to provide voice control without assistance from a remote system, which allows the hub to provide a user with the ability to control second devices in an environment by issuing voice commands, even when the hub is unable to communicate with the remote system over a wide area network (e.g., the Internet). The hub is also configured to execute rules without assistance from the remote system, which allows the hub to execute rules, even when the hub is unable to communicate with the remote system over a wide area network (e.g., the Internet).

Type: Grant

Filed: December 12, 2017

Date of Patent: July 14, 2020

Assignee: Amazon Technologies, Inc.

Inventors: Mark Aran Aiken, Stanislaw Ignacy Pasko, Olusanya Temitope Soyannwo, Vibhav Hemant Salgaonkar, Adam Barry Fineberg, Roger Robert Webster, Makarand Damle, Rohan Mutagi, Philip Alexander Lee

Device arbitration by multiple speech processing systems

Patent number: 10679629

Abstract: A device can perform device arbitration, even when the device is unable to communicate with a remote system over a wide area network (e.g., the Internet). Upon detecting a wakeword in an utterance, the device can wait a period of time for data to arrive at the device, which, if received, indicates to the device that another speech interface device in the environment detected an utterance. If the device receives data prior to the period of time lapsing, the device can determine the earliest-occurring wakeword based on multiple wakeword occurrence times, and may designate whichever device that detected the wakeword first as the designated device to perform an action with respect to the user speech. To account for differences in sound capture latency between speech interface devices, a pre-calculated time offset value can be applied to wakeword occurrence time(s) during device arbitration.

Type: Grant

Filed: April 9, 2018

Date of Patent: June 9, 2020

Assignee: Amazon Technologies, Inc.

Inventor: Stanislaw Ignacy Pasko

DEVICE ARBITRATION BY MULTIPLE SPEECH PROCESSING SYSTEMS

Publication number: 20190311720

Abstract: A device can perform device arbitration, even when the device is unable to communicate with a remote system over a wide area network (e.g., the Internet). Upon detecting a wakeword in an utterance, the device can wait a period of time for data to arrive at the device, which, if received, indicates to the device that another speech interface device in the environment detected an utterance. If the device receives data prior to the period of time lapsing, the device can determine the earliest-occurring wakeword based on multiple wakeword occurrence times, and may designate whichever device that detected the wakeword first as the designated device to perform an action with respect to the user speech. To account for differences in sound capture latency between speech interface devices, a pre-calculated time offset value can be applied to wakeword occurrence time(s) during device arbitration.

Type: Application

Filed: April 9, 2018

Publication date: October 10, 2019

Inventor: Stanislaw Ignacy Pasko

HYBRID SPEECH INTERFACE DEVICE

Publication number: 20190295552

Abstract: A speech interface device is configured with “hybrid” capabilities, which allows the speech interface device to perform actions in response to user speech, even when the speech interface device is unable to communicate with a remote system over a wide area network (e.g., the Internet). A hybrid request selector of the speech interface device sends audio data representing user speech to both a remote speech processing system and a local speech processing component executing on the speech interface device, and then waits for a response from either or both components. The local speech processing component may start execution based on the audio data and subsequently suspend the execution until further instruction from the hybrid request selector. The hybrid request selector can then determine which response to use, and, depending on which response is chosen, may instruct the local speech processing component to either continue or terminate the suspended execution.

Type: Application

Filed: March 23, 2018

Publication date: September 26, 2019

Inventors: Stanislaw Ignacy Pasko, Michal Papierski, Maciej Makowski, Marcin Fuszara

ARCHITECTURE FOR A HUB CONFIGURED TO CONTROL A SECOND DEVICE WHILE A CONNECTION TO A REMOTE SYSTEM IS UNAVAILABLE

Publication number: 20190179610

Abstract: A hub is configured to provide voice control without assistance from a remote system, which allows the hub to provide a user with the ability to control second devices in an environment by issuing voice commands, even when the hub is unable to communicate with the remote system over a wide area network (e.g., the Internet). The hub is also configured to execute rules without assistance from the remote system, which allows the hub to execute rules, even when the hub is unable to communicate with the remote system over a wide area network (e.g., the Internet).

Type: Application

Filed: December 12, 2017

Publication date: June 13, 2019

Inventors: Mark Aran Aiken, Stanislaw Ignacy Pasko, Olusanya Temitope Soyannwo, Vibhav Hemant Salgaonkar, Adam Barry Fineberg, Roger Robert Webster, Makarand Damle, Rohan Mutagi, Philip Alexander Lee