Patents by Inventor Michal Kowalczuk

Michal Kowalczuk has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Language switching on a speech interface device

Patent number: 12170086

Abstract: A speech interface device is configured to switch between languages, at the request of a user, in order to locally process utterances spoken in different languages, even in instances when a remote system is unavailable to, slower than, or otherwise less preferred than the speech interface device. For example, a user can request to set the language setting of the speech interface device to a second language, different from a first language to which the language setting of the device is currently set. Based on this user request, a local speech processing component of the device may load a language model(s) associated with the second language. The speech interface can also output voice prompts in the second language to manage the user's experience while a language update is in progress on the speech interface device.

Type: Grant

Filed: November 12, 2021

Date of Patent: December 17, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Ashwin Venkatesh Raman, Bruno Dufour, Sasi Kiran Vepanjeri Lokanadha Reddy, Michal Kowalczuk, Maciej Grabon, Maciej Makowski, Fabian Andreas Bumberger
INTERMEDIATE DATA FOR INTER-DEVICE SPEECH PROCESSING

Publication number: 20240029743

Abstract: Some speech processing systems may handle some commands on-device rather than sending the audio data to a second device or system for processing. The first device may have limited speech processing capabilities sufficient for handling common language and/or commands, while the second device (e.g., an edge device and/or a remote system) may call on additional language models, entity libraries, skill components, etc. to perform additional tasks. An intermediate data generator may facilitate dividing speech processing operations between devices by generating a stream of data that includes a first-pass ASR output (e.g., a word or sub-word lattice) and other characteristics of the audio data such as whisper detection, speaker identification, media signatures, etc. The second device can perform the additional processing using the data stream; e.g., without using the audio data. Thus, privacy may be enhanced by processing the audio data locally without sending it to other devices/systems.

Type: Application

Filed: June 6, 2023

Publication date: January 25, 2024

Inventors: Stanislaw Ignacy Pasko, Pawel Zelazko, Cagdas Bak, Eli Joshua Fidler, Michal Kowalczuk, Andrew Oberlin, Ariya Rastrow
Intermediate data for inter-device speech processing

Patent number: 11721347

Abstract: Some speech processing systems may handle some commands on-device rather than sending the audio data to a second device or system for processing. The first device may have limited speech processing capabilities sufficient for handling common language and/or commands, while the second device (e.g., an edge device and/or a remote system) may call on additional language models, entity libraries, skill components, etc. to perform additional tasks. An intermediate data generator may facilitate dividing speech processing operations between devices by generating a stream of data that includes a first-pass ASR output (e.g., a word or sub-word lattice) and other characteristics of the audio data such as whisper detection, speaker identification, media signatures, etc. The second device can perform the additional processing using the data stream; e.g., without using the audio data. Thus, privacy may be enhanced by processing the audio data locally without sending it to other devices/systems.

Type: Grant

Filed: June 29, 2021

Date of Patent: August 8, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Stanislaw Ignacy Pasko, Pawel Zelazko, Cagdas Bak, Eli Joshua Fidler, Michal Kowalczuk, Andrew Oberlin, Ariya Rastrow
Language switching on a speech interface device

Patent number: 11176934

Abstract: A speech interface device is configured to switch between languages, at the request of a user, in order to locally process utterances spoken in different languages, even in instances when a remote system is unavailable to, slower than, or otherwise less preferred than the speech interface device. For example, a user can request to set the language setting of the speech interface device to a second language, different from a first language to which the language setting of the device is currently set. Based on this user request, a local speech processing component of the device may load a language model(s) associated with the second language. The speech interface can also output voice prompts in the second language to manage the user's experience while a language update is in progress on the speech interface device.

Type: Grant

Filed: March 22, 2019

Date of Patent: November 16, 2021

Assignee: AMAZON TECHNOLOGIES, INC.

Inventors: Ashwin Venkatesh Raman, Bruno Dufour, Sasi Kiran Vepanjeri Lokanadha Reddy, Michal Kowalczuk, Maciej Grabon, Maciej Makowski, Fabian Andreas Bumberger

Language switching on a speech interface device

INTERMEDIATE DATA FOR INTER-DEVICE SPEECH PROCESSING

Intermediate data for inter-device speech processing

Language switching on a speech interface device