Patents by Inventor Mark Charles Beutnagel

Mark Charles Beutnagel has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11335320
    Abstract: Systems, methods, and computer-readable storage media for intelligent caching of concatenative speech units for use in speech synthesis. A system configured to practice the method can identify speech units that are required for synthesizing speech. The system can request from a server the text-to-speech unit needed to synthesize the speech. The system can then synthesize speech using text-to-speech units already stored and a received text-to-speech unit from the server.
    Type: Grant
    Filed: June 23, 2020
    Date of Patent: May 17, 2022
    Assignee: AT&T Intellectual Property I, L.P.
    Inventors: Benjamin J. Stern, Mark Charles Beutnagel, Alistair D. Conkie, Horst J. Schroeter, Amanda Joy Stent
  • Publication number: 20200320973
    Abstract: Systems, methods, and computer-readable storage media for intelligent caching of concatenative speech units for use in speech synthesis. A system configured to practice the method can identify speech units that are required for synthesizing speech. The system can request from a server the text-to-speech unit needed to synthesize the speech. The system can then synthesize speech using text-to-speech units already stored and a received text-to-speech unit from the server.
    Type: Application
    Filed: June 23, 2020
    Publication date: October 8, 2020
    Inventors: Benjamin J. STERN, Mark Charles BEUTNAGEL, Alistair D. CONKIE, Horst J. SCHROETER, Amanda Joy STENT
  • Patent number: 10699694
    Abstract: Systems, methods, and computer-readable storage media for intelligent caching of concatenative speech units for use in speech synthesis. A system configured to practice the method can identify speech units that are required for synthesizing speech. The system can request from a server the text-to-speech unit needed to synthesize the speech. The system can then synthesize speech using text-to-speech units already stored and a received text-to-speech unit from the server.
    Type: Grant
    Filed: November 19, 2018
    Date of Patent: June 30, 2020
    Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.
    Inventors: Benjamin J. Stern, Mark Charles Beutnagel, Alistair D. Conkie, Horst J. Schroeter, Amanda Joy Stent
  • Publication number: 20190088249
    Abstract: Systems, methods, and computer-readable storage media for intelligent caching of concatenative speech units for use in speech synthesis. A system configured to practice the method can identify, in a local cache of text-to-speech units for a text-to-speech voice an absent text-to-speech unit which is not in the local cache. The system can request from a server the absent text-to-speech unit. The system can then synthesize speech using the text-to-speech units and a received text-to-speech unit from the server.
    Type: Application
    Filed: November 19, 2018
    Publication date: March 21, 2019
    Inventors: Benjamin J. STERN, Mark Charles BEUTNAGEL, Alistair D. CONKIE, Horst J. SCHROETER, Amanda Joy STENT
  • Patent number: 10134383
    Abstract: Systems, methods, and computer-readable storage media for intelligent caching of concatenative speech units for use in speech synthesis. A system configured to practice the method can identify, in a local cache of text-to-speech units for a text-to-speech voice an absent text-to-speech unit which is not in the local cache. The system can request from a server the absent text-to-speech unit. The system can then synthesize speech using the text-to-speech units and a received text-to-speech unit from the server.
    Type: Grant
    Filed: September 8, 2017
    Date of Patent: November 20, 2018
    Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.
    Inventors: Benjamin J. Stern, Mark Charles Beutnagel, Alistair D. Conkie, Horst J. Schroeter, Amanda Joy Stent
  • Patent number: 9978360
    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for detecting and correcting abnormal stress patterns in unit-selection speech synthesis. A system practicing the method detects incorrect stress patterns in selected acoustic units representing speech to be synthesized, and corrects the incorrect stress patterns in the selected acoustic units to yield corrected stress patterns. The system can further synthesize speech based on the corrected stress patterns. In one aspect, the system also classifies the incorrect stress patterns using a machine learning algorithm such as a classification and regression tree, adaptive boosting, support vector machine, and maximum entropy. In this way a text-to-speech unit selection speech synthesizer can produce more natural sounding speech with suitable stress patterns regardless of the stress of units in a unit selection database.
    Type: Grant
    Filed: February 22, 2016
    Date of Patent: May 22, 2018
    Assignee: NUANCE COMMUNICATIONS, INC.
    Inventors: Yeon-Jun Kim, Mark Charles Beutnagel, Alistair D. Conkie, Ann K. Syrdal
  • Publication number: 20180047384
    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for reducing latency in web-browsing TTS systems without the use of a plug-in or Flash® module. A system configured according to the disclosed methods allows the browser to send prosodically meaningful sections of text to a web server. A TTS server then converts intonational phrases of the text into audio and responds to the browser with the audio file. The system saves the audio file in a cache, with the file indexed by a unique identifier. As the system continues converting text into speech, when identical text appears the system uses the cached audio corresponding to the identical text without the need for re-synthesis via the TTS server.
    Type: Application
    Filed: October 23, 2017
    Publication date: February 15, 2018
    Inventors: Alistair D. CONKIE, Mark Charles BEUTNAGEL, Taniya MISHRA
  • Publication number: 20170372692
    Abstract: Systems, methods, and computer-readable storage media for intelligent caching of concatenative speech units for use in speech synthesis. A system configured to practice the method can identify, in a local cache of text-to-speech units for a text-to-speech voice an absent text-to-speech unit which is not in the local cache. The system can request from a server the absent text-to-speech unit. The system can then synthesize speech using the text-to-speech units and a received text-to-speech unit from the server.
    Type: Application
    Filed: September 8, 2017
    Publication date: December 28, 2017
    Inventors: Benjamin J. STERN, Mark Charles BEUTNAGEL, Alistair D. CONKIE, Horst J. SCHROETER, Amanda Joy STENT
  • Publication number: 20170358292
    Abstract: A speech synthesis process can record concatenation costs of unit sequential pairs to a concatenation cost database for speech synthesis by synthesizing speech from a text, identifying an acoustic unit sequential pair in the speech, searching for a concatenation cost for the acoustic unit sequential pair in a database using a hash table for the database, and when the concatenation cost is not found in the database, assigning a default value as the concatenation cost for the acoustic unit sequential pair.
    Type: Application
    Filed: June 26, 2017
    Publication date: December 14, 2017
    Inventors: Mark Charles BEUTNAGEL, Mehryar MOHRI, Michael Dennis RILEY
  • Patent number: 9799323
    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for reducing latency in web-browsing TTS systems without the use of a plug-in or Flash® module. A system configured according to the disclosed methods allows the browser to send prosodically meaningful sections of text to a web server. A TTS server then converts intonational phrases of the text into audio and responds to the browser with the audio file. The system saves the audio file in a cache, with the file indexed by a unique identifier. As the system continues converting text into speech, when identical text appears the system uses the cached audio corresponding to the identical text without the need for re-synthesis via the TTS server.
    Type: Grant
    Filed: December 14, 2015
    Date of Patent: October 24, 2017
    Assignee: Nuance Communications, Inc.
    Inventors: Alistair D. Conkie, Mark Charles Beutnagel, Taniya Mishra
  • Patent number: 9761218
    Abstract: Systems, methods, and computer-readable storage media for intelligent caching of concatenative speech units for use in speech synthesis. A system configured to practice the method can identify, in a local cache of text-to-speech units for a text-to-speech voice an absent text-to-speech unit which is not in the local cache. The system can request from a server the absent text-to-speech unit. The system can then synthesize speech using the text-to-speech units and a received text-to-speech unit from the server.
    Type: Grant
    Filed: November 30, 2015
    Date of Patent: September 12, 2017
    Assignee: AT&T Intellectual Property I, L.P.
    Inventors: Benjamin J. Stern, Mark Charles Beutnagel, Alistair D. Conkie, Horst J. Schroeter, Amanda Joy Stent
  • Patent number: 9691376
    Abstract: A speech synthesis process can record concatenation costs of unit sequential pairs to a concatenation cost database for speech synthesis by synthesizing speech from a text, identifying an acoustic unit sequential pair in the speech, searching for a concatenation cost for the acoustic unit sequential pair in a database using a hash table for the database, and when the concatenation cost is not found in the database, assigning a default value as the concatenation cost for the acoustic unit sequential pair.
    Type: Grant
    Filed: December 8, 2015
    Date of Patent: June 27, 2017
    Assignee: Nuance Communications, Inc.
    Inventors: Mark Charles Beutnagel, Mehryar Mohri, Michael Dennis Riley
  • Patent number: 9412359
    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for generating speech. One variation of the method is from a server side, and another variation of the method is from a client side. The server side method, as implemented by a network-based automatic speech processing system, includes first receiving, from a network client independent of knowledge of internal operations of the system, a request to generate a text-to-speech voice. The request can include speech samples, transcriptions of the speech samples, and metadata describing the speech samples. The system extracts sound units from the speech samples based on the transcriptions and generates an interactive demonstration of the text-to-speech voice based on the sound units, the transcriptions, and the metadata, wherein the interactive demonstration hides a back end processing implementation from the network client. The system provides access to the interactive demonstration to the network client.
    Type: Grant
    Filed: April 13, 2015
    Date of Patent: August 9, 2016
    Assignee: AT&T Intellectual Property I, L.P.
    Inventors: Mark Charles Beutnagel, Alistair D. Conkie, Yeon-Jun Kim, Horst Juergen Schroeter
  • Publication number: 20160171970
    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for detecting and correcting abnormal stress patterns in unit-selection speech synthesis. A system practicing the method detects incorrect stress patterns in selected acoustic units representing speech to be synthesized, and corrects the incorrect stress patterns in the selected acoustic units to yield corrected stress patterns. The system can further synthesize speech based on the corrected stress patterns. In one aspect, the system also classifies the incorrect stress patterns using a machine learning algorithm such as a classification and regression tree, adaptive boosting, support vector machine, and maximum entropy. In this way a text-to-speech unit selection speech synthesizer can produce more natural sounding speech with suitable stress patterns regardless of the stress of units in a unit selection database.
    Type: Application
    Filed: February 22, 2016
    Publication date: June 16, 2016
    Inventors: Yeon-Jun KIM, Mark Charles BEUTNAGEL, Alistair D. CONKIE, Ann K. Syrdal
  • Publication number: 20160098985
    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for reducing latency in web-browsing TTS systems without the use of a plug-in or Flash® module. A system configured according to the disclosed methods allows the browser to send prosodically meaningful sections of text to a web server. A TTS server then converts intonational phrases of the text into audio and responds to the browser with the audio file. The system saves the audio file in a cache, with the file indexed by a unique identifier. As the system continues converting text into speech, when identical text appears the system uses the cached audio corresponding to the identical text without the need for re-synthesis via the TTS server.
    Type: Application
    Filed: December 14, 2015
    Publication date: April 7, 2016
    Inventors: Alistair D. CONKIE, Mark Charles BEUTNAGEL, Taniya MISHRA
  • Publication number: 20160093288
    Abstract: A speech synthesis can record concatenation costs of most common acoustic unit sequential pairs to a concatenation cost database for speech synthesis by synthesizing speech from a text, identifying a most common acoustic unit sequential pair in the speech, assigning a concatenation cost to the most common acoustic sequential pair, and recording the concatenation cost of the most common acoustic sequential pair to a concatenation cost database.
    Type: Application
    Filed: December 8, 2015
    Publication date: March 31, 2016
    Inventors: Mark Charles BEUTNAGEL, Mehryar MOHRI, Michael Dennis RILEY
  • Publication number: 20160086598
    Abstract: Systems, methods, and computer-readable storage media for intelligent caching of concatenative speech units for use in speech synthesis. A system configured to practice the method can identify a speech synthesis context, and determine, based on a local cache of text-to-speech units for a text-to-speech voice and based on the speech synthesis context, additional text-to-speech units which are not in the local cache. The system can request from a server the additional text-to-speech units, and store the additional text-to-speech units in the local cache. The system can then synthesize speech using the text-to-speech units and the additional text-to-speech units in the local cache. The system can prune the cache as the context changes, based on availability of local storage, or after synthesizing the speech. The local cache can store a core set of text-to-speech units associated with the text-to-speech voice that cannot be pruned from the local cache.
    Type: Application
    Filed: November 30, 2015
    Publication date: March 24, 2016
    Inventors: Benjamin J. STERN, Mark Charles BEUTNAGEL, Alistair D. CONKIE, Horst J. SCHROETER, Amanda Joy STENT
  • Patent number: 9269348
    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for detecting and correcting abnormal stress patterns in unit-selection speech synthesis. A system practicing the method detects incorrect stress patterns in selected acoustic units representing speech to be synthesized, and corrects the incorrect stress patterns in the selected acoustic units to yield corrected stress patterns. The system can further synthesize speech based on the corrected stress patterns. In one aspect, the system also classifies the incorrect stress patterns using a machine learning algorithm such as a classification and regression tree, adaptive boosting, support vector machine, and maximum entropy. In this way a text-to-speech unit selection speech synthesizer can produce more natural sounding speech with suitable stress patterns regardless of the stress of units in a unit selection database.
    Type: Grant
    Filed: February 23, 2015
    Date of Patent: February 23, 2016
    Assignee: AT&T Intellectual Property I, L.P.
    Inventors: Yeon-Jun Kim, Mark Charles Beutnagel, Alistair D. Conkie, Ann K. Syrdal
  • Patent number: 9240180
    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for reducing latency in web-browsing TTS systems without the use of a plug-in or Flash® module. A system configured according to the disclosed methods allows the browser to send prosodically meaningful sections of text to a web server. A TTS server then converts intonational phrases of the text into audio and responds to the browser with the audio file. The system saves the audio file in a cache, with the file indexed by a unique identifier. As the system continues converting text into speech, when identical text appears the system uses the cached audio corresponding to the identical text without the need for re-synthesis via the TTS server.
    Type: Grant
    Filed: December 1, 2011
    Date of Patent: January 19, 2016
    Assignee: AT&T Intellectual Property I, L.P.
    Inventors: Alistair D. Conkie, Mark Charles Beutnagel, Taniya Mishra
  • Patent number: 9236044
    Abstract: A speech synthesis system can record concatenation costs of most common acoustic unit sequential pairs to a concatenation cost database for speech synthesis by synthesizing speech from a text, identifying a most common acoustic unit sequential pair in the speech, assigning a concatenation cost to the most common acoustic sequential pair, and recording the concatenation cost of the most common acoustic sequential pair to a concatenation cost database.
    Type: Grant
    Filed: July 18, 2014
    Date of Patent: January 12, 2016
    Assignee: AT&T Intellectual Property II, L.P.
    Inventors: Mark Charles Beutnagel, Mehryar Mohri, Michael Dennis Riley