Patents by Inventor Martin Reber
Martin Reber has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20240086793
Abstract: An omni-channel, intelligent, proactive virtual agent system and method of use are provided by which a user may engage in a conversation with the agent to interact with structured and unstructured data of an enterprise that is stored in a domain-specific world model for the enterprise.
Type: Application
Filed: August 18, 2023
Publication date: March 14, 2024
Inventors: Stephen Brown, Martin Reber, Vijeta Avijeet, Josselyn Boudet
-
Publication number: 20230368775
Abstract: A method, computer program product, and computer system for text-to-speech synthesis is disclosed. Synthetic speech data for an input text may be generated. The synthetic speech data may be compared to recorded reference speech data corresponding to the input text. Based on, at least in part, the comparison of the synthetic speech data to the recorded reference speech data, at least one feature indicative of at least one difference between the synthetic speech data and the recorded reference speech data may be extracted. A speech gap filling model may be generated based on, at least in part, the at least one feature extracted. A speech output may be generated based on, at least in part, the speech gap filling model.
Type: Application
Filed: July 3, 2023
Publication date: November 16, 2023
Applicant: TELEPATHY LABS, INC.
Inventors: Piero Perucci, Martin Reber, Vijeta Avijeet
-
Publication number: 20230351999
Abstract: A technique improves training and speech quality of a text-to-speech (TTS) system having an artificial intelligence, such as a neural network. The TTS system is organized as a front-end subsystem and a back-end subsystem. The front-end subsystem is configured to provide analysis and conversion of text into input vectors, each having at least a base frequency, f0, a phenome duration, and a phoneme sequence that is processed by a signal generation unit of the back-end subsystem. The signal generation unit includes the neural network interacting with a pre-existing knowledgebase of phenomes to generate audible speech from the input vectors. The technique applies an error signal from the neural network to correct imperfections of the pre-existing knowledgebase of phenomes to generate audible speech signals. A back-end training system is configured to train the signal generation unit by applying psychoacoustic principles to improve quality of the generated audible speech signal.
Type: Application
Filed: July 3, 2023
Publication date: November 2, 2023
Inventors: Martin Reber, Vijeta Avijeet
-
Patent number: 11775891
Abstract: An omni-channel, intelligent, proactive virtual agent system and method of use are provided by which a user may engage in a conversation with the agent to interact with structured and unstructured data of an enterprise that is stored in a domain-specific world model for the enterprise.
Type: Grant
Filed: July 31, 2018
Date of Patent: October 3, 2023
Assignee: Telepathy Labs, Inc.
Inventors: Stephen Brown, Martin Reber, Vijeta Avijeet, Josselyn Boudett
-
Patent number: 11741942
Abstract: A method, computer program product, and computer system for text-to-speech synthesis is disclosed. Synthetic speech data for an input text may be generated. The synthetic speech data may be compared to recorded reference speech data corresponding to the input text. Based on, at least in part, the comparison of the synthetic speech data to the recorded reference speech data, at least one feature indicative of at least one difference between the synthetic speech data and the recorded reference speech data may be extracted. A speech gap filling model may be generated based on, at least in part, the at least one feature extracted. A speech output may be generated based on, at least in part, the speech gap filling model.
Type: Grant
Filed: August 3, 2022
Date of Patent: August 29, 2023
Assignee: Telepathy Labs, Inc
Inventors: Piero Perucci, Martin Reber, Vijeta Avijeet
-
Patent number: 11735161
Abstract: A technique improves training and speech quality of a text-to-speech (TTS) system having an artificial intelligence, such as a neural network. The TTS system is organized as a front-end subsystem and a back-end subsystem. The front-end subsystem is configured to provide analysis and conversion of text into input vectors, each having at least a base frequency, f0, a phenome duration, and a phoneme sequence that is processed by a signal generation unit of the back-end subsystem. The signal generation unit includes the neural network interacting with a pre-existing knowledgebase of phenomes to generate audible speech from the input vectors. The technique applies an error signal from the neural network to correct imperfections of the pre-existing knowledgebase of phenomes to generate audible speech signals. A back-end training system is configured to train the signal generation unit by applying psychoacoustic principles to improve quality of the generated audible speech signals.
Type: Grant
Filed: January 31, 2022
Date of Patent: August 22, 2023
Assignee: Telepathy Labs, Inc
Inventors: Martin Reber, Vijeta Avijeet
-
Publication number: 20220375452
Abstract: A method, computer program product, and computer system for text-to-speech synthesis is disclosed. Synthetic speech data for an input text may be generated. The synthetic speech data may be compared to recorded reference speech data corresponding to the input text. Based on, at least in part, the comparison of the synthetic speech data to the recorded reference speech data, at least one feature indicative of at least one difference between the synthetic speech data and the recorded reference speech data may be extracted. A speech gap filling model may be generated based on, at least in part, the at least one feature extracted. A speech output may be generated based on, at least in part, the speech gap filling model.
Type: Application
Filed: August 3, 2022
Publication date: November 24, 2022
Inventors: Piero Perucci, Martin Reber, Vijeta Avijeet
-
Patent number: 11450307
Abstract: A method, computer program product, and computer system for text-to-speech synthesis is disclosed. Synthetic speech data for an input text may be generated. The synthetic speech data may be compared to recorded reference speech data corresponding to the input text. Based on, at least in part, the comparison of the synthetic speech data to the recorded reference speech data, at least one feature indicative of at least one difference between the synthetic speech data and the recorded reference speech data may be extracted. A speech gap filling model may be generated based on, at least in part, the at least one feature extracted. A speech output may be generated based on, at least in part, the speech gap filling model.
Type: Grant
Filed: March 27, 2019
Date of Patent: September 20, 2022
Assignee: TELEPATHY LABS, INC.
Inventors: Piero Perucci, Martin Reber, Vijeta Avijeet
-
Publication number: 20220148564
Abstract: A technique improves training and speech quality of a text-to-speech (TTS) system having an artificial intelligence, such as a neural network. The TTS system is organized as a front-end subsystem and a back-end subsystem. The front-end subsystem is configured to provide analysis and conversion of text into input vectors, each having at least a base frequency, f0, a phenome duration, and a phoneme sequence that is processed by a signal generation unit of the back-end subsystem. The signal generation unit includes the neural network interacting with a pre-existing knowledgebase of phenomes to generate audible speech from the input vectors. The technique applies an error signal from the neural network to correct imperfections of the pre-existing knowledgebase of phenomes to generate audible speech signals. A back-end training system is configured to train the signal generation unit by applying psychoacoustic principles to improve quality of the generated audible speech signals.
Type: Application
Filed: January 31, 2022
Publication date: May 12, 2022
Inventors: Martin Reber, Vijeta Avijeet
-
Patent number: 11244670
Abstract: A technique proves training and speech quality of a text-to-speech (TTS) system having an artificial intelligence, such as a neural network. The TTS system is organized as a front-end subsystem and a back-end subsystem. The front-end subsystem is configured to provide analysis and conversion of text into input vectors, each having at least a base frequency, f0, a phenome duration, and a phoneme sequence that is processed by a signal generation unit of the back-end subsystem. The signal generation unit includes the neural network interacting with a pre-existing knowledgebase of phenomes to generate audible speech from the input vectors. The technique applies an error signal from the neural network to correct imperfections of the pre-existing knowledgebase of phenomes to generate audible speech signals. A back-end training system is configured to train the signal generation unit by applying psychoacoustic principles to improve quality of the generated audible speech signals.
Type: Grant
Filed: June 20, 2019
Date of Patent: February 8, 2022
Assignee: TELEPATHY LABS, INC.
Inventors: Martin Reber, Vijeta Avijeet
-
Patent number: 11244669
Abstract: A technique improves training and speech quality of a text-to-speech (TTS) system having an artificial intelligence, such as a neural network. The TTS system is organized as a front-end subsystem and a back-end subsystem. The front-end subsystem is configured to provide analysis and conversion of text into input vectors, each having at least a base frequency, f0, a phenome duration, and a phoneme sequence that is processed by a signal generation unit of the back-end subsystem. The signal generation unit includes the neural network interacting with a pre-existing knowledgebase of phenomes to generate audible speech from the input vectors. The technique applies an error signal from the neural network to correct imperfections of the pre-existing knowledgebase of phenomes to generate audible speech signals. A back-end training system is configured to train the signal generation unit by applying psychoacoustic principles to improve quality of the generated audible speech signals.
Type: Grant
Filed: June 20, 2019
Date of Patent: February 8, 2022
Assignee: TELEPATHY LABS, INC.
Inventors: Martin Reber, Vijeta Avijeet
-
Publication number: 20210366460
Abstract: A method, computer program product, and computer system for text-to-speech synthesis is disclosed. Synthetic speech data for an input text may be generated. The synthetic speech data may be compared to recorded reference speech data corresponding to the input text. Based on, at least in part, the comparison of the synthetic speech data to the recorded reference speech data, at least one feature indicative of at least one difference between the synthetic speech data and the recorded reference speech data may be extracted. A speech gap filling model may be generated based on, at least in part, the at least one feature extracted. A speech output may be generated based on, at least in part, the speech gap filling model.
Type: Application
Filed: March 27, 2019
Publication date: November 25, 2021
Inventors: Piero Perucci, Martin Reber, Vijeta Avijeet
-
Publication number: 20210312355
Abstract: A method, computer program product, and virtual agent system for an organization. The virtual agent system may include one or more processors and one or more memories configured to perform operations. The operations may include loading at least one model related to one or more processes of the organization where the model may be based on the structure information and one or more of procedures and protocols related to the organization.
Type: Application
Filed: August 9, 2019
Publication date: October 7, 2021
Inventor: Martin Reber
-
Publication number: 20190304434
Abstract: A technique improves training and speech quality of a text-to-speech (TTS) system having an artificial intelligence, such as a neural network. The TTS system is organized as a front-end subsystem and a back-end subsystem. The front-end subsystem is configured to provide analysis and conversion of text into input vectors, each having at least a base frequency, f0, a phenome duration, and a phoneme sequence that is processed by a signal generation unit of the back-end subsystem. The signal generation unit includes the neural network interacting with a pre-existing knowledgebase of phenomes to generate audible speech from the input vectors. The technique applies an error signal from the neural network to correct imperfections of the pre-existing knowledgebase of phenomes to generate audible speech signals. A back-end training system is configured to train the signal generation unit by applying psychoacoustic principles to improve quality of the generated audible speech signals.
Type: Application
Filed: June 20, 2019
Publication date: October 3, 2019
Inventors: Martin Reber, Vijeta Avijeet
-
Publication number: 20190304435
Abstract: A technique proves training and speech quality of a text-to-speech (TTS) system having an artificial intelligence, such as a neural network. The TTS system is organized as a front-end subsystem and a back-end subsystem. The front-end subsystem is configured to provide analysis and conversion of text into input vectors, each having at least a base frequency, f0, a phenome duration, and a phoneme sequence that is processed by a signal generation unit of the back-end subsystem. The signal generation unit includes the neural network interacting with a pre-existing knowledgebase of phenomes to generate audible speech from the input vectors. The technique applies an error signal from the neural network to correct imperfections of the pre-existing knowledgebase of phenomes to generate audible speech signals. A back-end training system is configured to train the signal generation unit by applying psychoacoustic principles to improve quality of the generated audible speech signals.
Type: Application
Filed: June 20, 2019
Publication date: October 3, 2019
Inventors: Martin Reber, Vijeta Avijeet
-
Patent number: 10373605
Abstract: A technique improves training and speech quality of a text-to-speech (TTS) system having an artificial intelligence, such as a neural network. The TTS system is organized as a front-end subsystem and a back-end subsystem. The front-end subsystem is configured to provide analysis and conversion of text into input vectors, each having at least a base frequency, f0, a phenome duration, and a phoneme sequence that is processed by a signal generation unit of the back-end subsystem. The signal generation unit includes the neural network interacting with a pre-existing knowledgebase of phenomes to generate audible speech from the input vectors. The technique applies an error signal from the neural network to correct imperfections of the pre-existing knowledgebase of phenomes to generate audible speech signals. A back-end training system is configured to train the signal generation unit by applying psychoacoustic principles to improve quality of the generated audible speech signals.
Type: Grant
Filed: June 29, 2018
Date of Patent: August 6, 2019
Assignee: Telepathy Labs, Inc.
Inventors: Martin Reber, Vijeta Avijeet
-
Patent number: 10319364
Abstract: A technique improves training and speech quality of a text-to-speech (TTS) system having an artificial intelligence, such as a neural network. The TTS system is organized as a front-end subsystem and a back-end subsystem. The front-end subsystem is configured to provide analysis and conversion of text into input vectors, each having at least a base frequency, f0, a phenome duration, and a phoneme sequence that is processed by a signal generation unit of the back-end subsystem. The signal generation unit includes the neural network interacting with a pre-existing knowledgebase of phenomes to generate audible speech from the input vectors. The technique applies an error signal from the neural network to correct imperfections of the pre-existing knowledgebase of phenomes to generate audible speech signals. Speech signal specific modelling techniques in combination with applied psychoacoustic principles drive training efficiency of neural networks with positive impact on quality of generated speech signals.
Type: Grant
Filed: May 17, 2018
Date of Patent: June 11, 2019
Assignee: Telepathy Labs, Inc.
Inventors: Martin Reber, Vijeta Avijeet
-
Publication number: 20190042988
Abstract: An omni-channel, intelligent, proactive virtual agent system and method of use are provided by which a user may engage in a conversation with the agent to interact with structured and unstructured data of an enterprise that is stored in a domain-specific world model for the enterprise.
Type: Application
Filed: July 31, 2018
Publication date: February 7, 2019
Inventors: Stephen Brown, Martin Reber, Vijeta Avijeet, Josselyn Boudett
-
Publication number: 20180336881
Abstract: A technique improves training and speech quality of a text-to-speech (TTS) system having an artificial intelligence, such as a neural network. The TTS system is organized as a front-end subsystem and a back-end subsystem. The front-end subsystem is configured to provide analysis and conversion of text into input vectors, each having at least a base frequency, f0, a phenome duration, and a phoneme sequence that is processed by a signal generation unit of the back-end subsystem. The signal generation unit includes the neural network interacting with a pre-existing knowledgebase of phenomes to generate audible speech from the input vectors. The technique applies an error signal from the neural network to correct imperfections of the pre-existing knowledgebase of phenomes to generate audible speech signals. Speech signal specific modelling techniques in combination with applied psychoacoustic principles drive training efficiency of neural networks with positive impact on quality of generated speech signals.
Type: Application
Filed: May 17, 2018
Publication date: November 22, 2018
Inventors: Martin Reber, Vijeta Avijeet
-
Publication number: 20180336882
Abstract: A technique improves training and speech quality of a text-to-speech (TTS) system having an artificial intelligence, such as a neural network. The TTS system is organized as a front-end subsystem and a back-end subsystem. The front-end subsystem is configured to provide analysis and conversion of text into input vectors, each having at least a base frequency, f0, a phenome duration, and a phoneme sequence that is processed by a signal generation unit of the back-end subsystem. The signal generation unit includes the neural network interacting with a pre-existing knowledgebase of phenomes to generate audible speech from the input vectors. The technique applies an error signal from the neural network to correct imperfections of the pre-existing knowledgebase of phenomes to generate audible speech signals. A back-end training system is configured to train the signal generation unit by applying psychoacoustic principles to improve quality of the generated audible speech signals.
Type: Application
Filed: June 29, 2018
Publication date: November 22, 2018
Inventors: Martin Reber, Vijeta Avijeet