Patents by Inventor Michal T. Kaszczuk
Michal T. Kaszczuk has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 9595255Abstract: Features are disclosed for providing a consistent interface for local and distributed text to speech (TTS) systems. Some portions of the TTS system, such as voices and TTS engine components, may be installed on a client device, and some may be present on a remote system accessible via a network link. Determinations can be made regarding which TTS system components to implement on the client device and which to implement on the remote server. The consistent interface facilitates connecting to or otherwise employing the TTS system through use of the same methods and techniques regardless of the which TTS system configuration is implemented.Type: GrantFiled: February 13, 2015Date of Patent: March 14, 2017Assignee: Amazon Technologies, Inc.Inventors: Michal T. Kaszczuk, Lukasz M. Osowski
-
Patent number: 9196240Abstract: A group of users may be presented with text and a synthesized speech recording of the text. The users can listen to the synthesized speech recording and submit feedback regarding errors or other issues with the synthesized speech. A system of one or more computing devices can analyze the feedback, modify the voice or language rules, and recursively test the modifications. The modifications may be determined through the use of machine learning algorithms or other automated processes.Type: GrantFiled: December 19, 2012Date of Patent: November 24, 2015Assignee: IVONA Software Sp. z.o.o.Inventors: Michal T. Kaszczuk, Lukasz M. Osowski
-
Patent number: 9190049Abstract: Features are disclosed for generating text-to-speech (TTS) audio programs from textual content received from multiple sources. A TTS system may assemble an audio program from several individual audio presentations of user-selected network-accessible content. Users may configure the TTS system to retrieve personal content as well as publically accessible content. The audio program may include segues, introductions, summaries, and the like. Voices may be selected for individual content items based on user selections or on characteristics of the content or content source.Type: GrantFiled: December 19, 2012Date of Patent: November 17, 2015Assignee: IVONA Software Sp. z.o.o.Inventors: Michal T. Kaszczuk, Lukasz M. Osowski
-
Patent number: 9159314Abstract: In a text-to-speech (TTS) system, a database including sample speech units for unit selection may be configured for use by a local device. The local unit database may be created from a more comprehensive unit database. The local unit database may include units which provide sufficient TTS results for frequently input text. Speech synthesis may then be performed by concatenating locally available units with units from a remote device including the comprehensive unit database. Aspects of the speech synthesis may be performed by the remote device and/or the local device.Type: GrantFiled: January 14, 2013Date of Patent: October 13, 2015Assignee: AMAZON TECHNOLOGIES, INC.Inventors: Lukasz M. Osowski, Michal T. Kaszczuk
-
Publication number: 20150262571Abstract: Features are disclosed for providing a consistent interface for local and distributed text to speech (TTS) systems. Some portions of the TTS system, such as voices and TTS engine components, may be installed on a client device, and some may be present on a remote system accessible via a network link. Determinations can be made regarding which TTS system components to implement on the client device and which to implement on the remote server. The consistent interface facilitates connecting to or otherwise employing the TTS system through use of the same methods and techniques regardless of the which TTS system configuration is implemented.Type: ApplicationFiled: February 13, 2015Publication date: September 17, 2015Inventors: Michal T. Kaszczuk, Lukasz M. Osowski
-
Patent number: 9064489Abstract: Recorded or synthesized speech segments of text-to-speech (TTS) systems may be compressed though the use of both time domain compression and perceptual compression techniques. The twice-compressed recording may be separated into speech segments corresponding to words or subword units for use in a TTS system. The compression rate of time domain compression, and the ratio of time domain compression to perceptual compression, may be modified for any speech segment. The compression amount or ratio may be determined based on linguistic or acoustic features of the word or subword unit that the speech segment represents. Differing compression amounts and ratios may be applied to portions of a single speech segment.Type: GrantFiled: December 19, 2012Date of Patent: June 23, 2015Assignee: IVONA Software Sp. z o.o.Inventors: Michal T. Kaszczuk, Lukasz M. Osowski
-
Patent number: 8959021Abstract: Features are disclosed for providing a consistent interface for local and distributed text to speech (TTS) systems. Some portions of the TTS system, such as voices and TTS engine components, may be installed on a client device, and some may be present on a remote system accessible via a network link. Determinations can be made regarding which TTS system components to implement on the client device and which to implement on the remote server. The consistent interface facilitates connecting to or otherwise employing the TTS system through use of the same methods and techniques regardless of the which TTS system configuration is implemented.Type: GrantFiled: December 19, 2012Date of Patent: February 17, 2015Assignee: IVONA Software Sp. z.o.o.Inventors: Michal T. Kaszczuk, Lukasz M. Osowski
-
Publication number: 20140122060Abstract: Recorded or synthesized speech segments of text-to-speech (TTS) systems may be compressed though the use of both time domain compression and perceptual compression techniques. The twice-compressed recording may be separated into speech segments corresponding to words or subword units for use in a TTS system. The compression rate of time domain compression, and the ratio of time domain compression to perceptual compression, may be modified for any speech segment. The compression amount or ratio may be determined based on linguistic or acoustic features of the word or subword unit that the speech segment represents. Differing compression amounts and ratios may be applied to portions of a single speech segment.Type: ApplicationFiled: December 19, 2012Publication date: May 1, 2014Applicant: IVONA SOFTWARE SP. Z O.O.Inventors: Michal T. Kaszczuk, Lukasz M. Osowski
-
Publication number: 20140122081Abstract: A group of users may be presented with text and a synthesized speech recording of the text. The users can listen to the synthesized speech recording and submit feedback regarding errors or other issues with the synthesized speech. A system of one or more computing devices can analyze the feedback, modify the voice or language rules, and recursively test the modifications. The modifications may be determined through the use of machine learning algorithms or other automated processes.Type: ApplicationFiled: December 19, 2012Publication date: May 1, 2014Applicant: IVONA SOFTWARE Sp. z.o.o.Inventors: Michal T. Kaszczuk, Lukasz M. Osowski
-
Publication number: 20140122079Abstract: Features are disclosed for generating text-to-speech (TTS) audio programs from textual content received from multiple sources. A TTS system may assemble an audio program from several individual audio presentations of user-selected network-accessible content. Users may configure the TTS system to retrieve personal content as well as publically accessible content. The audio program may include segues, introductions, summaries, and the like. Voices may be selected for individual content items based on user selections or on characteristics of the content or content source.Type: ApplicationFiled: December 19, 2012Publication date: May 1, 2014Applicant: IVONA Software Sp. z.o.o.Inventors: Michal T. Kaszczuk, Lukasz M. Osowski
-
Publication number: 20140122080Abstract: Features are disclosed for providing a consistent interface for local and distributed text to speech (TTS) systems. Some portions of the TTS system, such as voices and TTS engine components, may be installed on a client device, and some may be present on a remote system accessible via a network link. Determinations can be made regarding which TTS system components to implement on the client device and which to implement on the remote server. The consistent interface facilitates connecting to or otherwise employing the TTS system through use of the same methods and techniques regardless of the which TTS system configuration is implemented.Type: ApplicationFiled: December 19, 2012Publication date: May 1, 2014Applicant: IVONA Software Sp. z.o.o.Inventors: Michal T. Kaszczuk, Lukasz M. Osowski