Patents by Inventor Michal T. Kaszczuk

Michal T. Kaszczuk has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Single interface for local and remote speech synthesis

Patent number: 9595255

Abstract: Features are disclosed for providing a consistent interface for local and distributed text to speech (TTS) systems. Some portions of the TTS system, such as voices and TTS engine components, may be installed on a client device, and some may be present on a remote system accessible via a network link. Determinations can be made regarding which TTS system components to implement on the client device and which to implement on the remote server. The consistent interface facilitates connecting to or otherwise employing the TTS system through use of the same methods and techniques regardless of the which TTS system configuration is implemented.

Type: Grant

Filed: February 13, 2015

Date of Patent: March 14, 2017

Assignee: Amazon Technologies, Inc.

Inventors: Michal T. Kaszczuk, Lukasz M. Osowski
Automated text to speech voice development

Patent number: 9196240

Abstract: A group of users may be presented with text and a synthesized speech recording of the text. The users can listen to the synthesized speech recording and submit feedback regarding errors or other issues with the synthesized speech. A system of one or more computing devices can analyze the feedback, modify the voice or language rules, and recursively test the modifications. The modifications may be determined through the use of machine learning algorithms or other automated processes.

Type: Grant

Filed: December 19, 2012

Date of Patent: November 24, 2015

Assignee: IVONA Software Sp. z.o.o.

Inventors: Michal T. Kaszczuk, Lukasz M. Osowski
Generating personalized audio programs from text content

Patent number: 9190049

Abstract: Features are disclosed for generating text-to-speech (TTS) audio programs from textual content received from multiple sources. A TTS system may assemble an audio program from several individual audio presentations of user-selected network-accessible content. Users may configure the TTS system to retrieve personal content as well as publically accessible content. The audio program may include segues, introductions, summaries, and the like. Voices may be selected for individual content items based on user selections or on characteristics of the content or content source.

Type: Grant

Filed: December 19, 2012

Date of Patent: November 17, 2015

Assignee: IVONA Software Sp. z.o.o.

Inventors: Michal T. Kaszczuk, Lukasz M. Osowski
Distributed speech unit inventory for TTS systems

Patent number: 9159314

Abstract: In a text-to-speech (TTS) system, a database including sample speech units for unit selection may be configured for use by a local device. The local unit database may be created from a more comprehensive unit database. The local unit database may include units which provide sufficient TTS results for frequently input text. Speech synthesis may then be performed by concatenating locally available units with units from a remote device including the comprehensive unit database. Aspects of the speech synthesis may be performed by the remote device and/or the local device.

Type: Grant

Filed: January 14, 2013

Date of Patent: October 13, 2015

Assignee: AMAZON TECHNOLOGIES, INC.

Inventors: Lukasz M. Osowski, Michal T. Kaszczuk
SINGLE INTERFACE FOR LOCAL AND REMOTE SPEECH SYNTHESIS

Publication number: 20150262571

Abstract: Features are disclosed for providing a consistent interface for local and distributed text to speech (TTS) systems. Some portions of the TTS system, such as voices and TTS engine components, may be installed on a client device, and some may be present on a remote system accessible via a network link. Determinations can be made regarding which TTS system components to implement on the client device and which to implement on the remote server. The consistent interface facilitates connecting to or otherwise employing the TTS system through use of the same methods and techniques regardless of the which TTS system configuration is implemented.

Type: Application

Filed: February 13, 2015

Publication date: September 17, 2015

Inventors: Michal T. Kaszczuk, Lukasz M. Osowski
Hybrid compression of text-to-speech voice data

Patent number: 9064489

Abstract: Recorded or synthesized speech segments of text-to-speech (TTS) systems may be compressed though the use of both time domain compression and perceptual compression techniques. The twice-compressed recording may be separated into speech segments corresponding to words or subword units for use in a TTS system. The compression rate of time domain compression, and the ratio of time domain compression to perceptual compression, may be modified for any speech segment. The compression amount or ratio may be determined based on linguistic or acoustic features of the word or subword unit that the speech segment represents. Differing compression amounts and ratios may be applied to portions of a single speech segment.

Type: Grant

Filed: December 19, 2012

Date of Patent: June 23, 2015

Assignee: IVONA Software Sp. z o.o.

Inventors: Michal T. Kaszczuk, Lukasz M. Osowski
Single interface for local and remote speech synthesis

Patent number: 8959021

Abstract: Features are disclosed for providing a consistent interface for local and distributed text to speech (TTS) systems. Some portions of the TTS system, such as voices and TTS engine components, may be installed on a client device, and some may be present on a remote system accessible via a network link. Determinations can be made regarding which TTS system components to implement on the client device and which to implement on the remote server. The consistent interface facilitates connecting to or otherwise employing the TTS system through use of the same methods and techniques regardless of the which TTS system configuration is implemented.

Type: Grant

Filed: December 19, 2012

Date of Patent: February 17, 2015

Assignee: IVONA Software Sp. z.o.o.

Inventors: Michal T. Kaszczuk, Lukasz M. Osowski
HYBRID COMPRESSION OF TEXT-TO-SPEECH VOICE DATA

Publication number: 20140122060

Abstract: Recorded or synthesized speech segments of text-to-speech (TTS) systems may be compressed though the use of both time domain compression and perceptual compression techniques. The twice-compressed recording may be separated into speech segments corresponding to words or subword units for use in a TTS system. The compression rate of time domain compression, and the ratio of time domain compression to perceptual compression, may be modified for any speech segment. The compression amount or ratio may be determined based on linguistic or acoustic features of the word or subword unit that the speech segment represents. Differing compression amounts and ratios may be applied to portions of a single speech segment.

Type: Application

Filed: December 19, 2012

Publication date: May 1, 2014

Applicant: IVONA SOFTWARE SP. Z O.O.

Inventors: Michal T. Kaszczuk, Lukasz M. Osowski
AUTOMATED TEXT TO SPEECH VOICE DEVELOPMENT

Publication number: 20140122081

Abstract: A group of users may be presented with text and a synthesized speech recording of the text. The users can listen to the synthesized speech recording and submit feedback regarding errors or other issues with the synthesized speech. A system of one or more computing devices can analyze the feedback, modify the voice or language rules, and recursively test the modifications. The modifications may be determined through the use of machine learning algorithms or other automated processes.

Type: Application

Filed: December 19, 2012

Publication date: May 1, 2014

Applicant: IVONA SOFTWARE Sp. z.o.o.

Inventors: Michal T. Kaszczuk, Lukasz M. Osowski
GENERATING PERSONALIZED AUDIO PROGRAMS FROM TEXT CONTENT

Publication number: 20140122079

Abstract: Features are disclosed for generating text-to-speech (TTS) audio programs from textual content received from multiple sources. A TTS system may assemble an audio program from several individual audio presentations of user-selected network-accessible content. Users may configure the TTS system to retrieve personal content as well as publically accessible content. The audio program may include segues, introductions, summaries, and the like. Voices may be selected for individual content items based on user selections or on characteristics of the content or content source.

Type: Application

Filed: December 19, 2012

Publication date: May 1, 2014

Applicant: IVONA Software Sp. z.o.o.

Inventors: Michal T. Kaszczuk, Lukasz M. Osowski
SINGLE INTERFACE FOR LOCAL AND REMOTE SPEECH SYNTHESIS

Publication number: 20140122080

Abstract: Features are disclosed for providing a consistent interface for local and distributed text to speech (TTS) systems. Some portions of the TTS system, such as voices and TTS engine components, may be installed on a client device, and some may be present on a remote system accessible via a network link. Determinations can be made regarding which TTS system components to implement on the client device and which to implement on the remote server. The consistent interface facilitates connecting to or otherwise employing the TTS system through use of the same methods and techniques regardless of the which TTS system configuration is implemented.

Type: Application

Filed: December 19, 2012

Publication date: May 1, 2014

Applicant: IVONA Software Sp. z.o.o.

Inventors: Michal T. Kaszczuk, Lukasz M. Osowski