Patents by Inventor Maria E. Smith

Maria E. Smith has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

System for tuning synthesized speech

Patent number: 8849669

Abstract: An embodiment of the invention is a software tool used to convert text, speech synthesis markup language (SSML), and/or extended SSML to synthesized audio. Provisions are provided to create, view, play, and edit the synthesized speech, including editing pitch and duration targets, speaking type, paralinguistic events, and prosody. Prosody can be provided by way of a sample recording. Users can interact with the software tool by way of a graphical user interface (GUI). The software tool can produce synthesized audio file output in many file formats.

Type: Grant

Filed: April 3, 2013

Date of Patent: September 30, 2014

Assignee: Nuance Communications, Inc.

Inventors: Raimo Bakis, Ellen Marie Eide, Roberto Pieraccini, Maria E. Smith, Jie Z. Zeng
DYNAMIC GENERATION OF POLICY ENFORCEMENT RULES AND ACTIONS FROM POLICY ATTACHMENT SEMANTICS

Publication number: 20140229596

Abstract: At least one defined service policy to be enforced by a policy enforcement point (PEP) is obtained. The obtained at least one defined service policy is parsed to identify at least one set of enforceable policy provisions and the at least one set of enforceable policy provisions is identified. Each set of enforceable policy provisions includes a policy subject, a policy domain, and at least one assertion as the enforceable policy provisions within the at least one defined service policy. At least one runtime processing rule including at least one processing action usable by the PEP is created to enforce the policy subject, the policy domain, and the at least one assertion of each identified at least one set of enforceable policy provisions.

Type: Application

Filed: March 25, 2014

Publication date: August 14, 2014

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Thomas C. Burke, Mario E. De Armas, Oswaldo Gago, Gaurang Shah, Maria E. Smith
APPLYING POLICY ATTACHMENT SERVICE LEVEL MANAGEMENT (SLM) SEMANTICS WITHIN A PEERED POLICY ENFORCEMENT DEPLOYMENT

Publication number: 20140229594

Abstract: A peer policy object named with a policy enforcement metric context identifier is created at a first policy enforcement point (PEP) platform of a group of peered PEP platforms. The peer policy object uniquely identifies a registered service policy to be enforced during runtime as a shared runtime policy enforcement activity by the group of peered PEP platforms within a peered policy enforcement deployment. The peer policy object named with the policy enforcement metric context identifier is deployed to at least one other peered PEP platform within the peered policy enforcement deployment to initiate the shared runtime policy enforcement activity by the group of peered PEP platforms. At least one runtime policy enforcement metric associated with runtime enforcement of the registered service policy is shared with the at least one other peered PEP platform during runtime using the policy enforcement metric context identifier.

Type: Application

Filed: February 12, 2013

Publication date: August 14, 2014

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Thomas C. Burke, Mario E. De Armas, Oswaldo Gago, Gaurang Shah, Maria E. Smith
POLICY ASSERTION LINKING TO PROCESSING RULE CONTEXTS FOR POLICY ENFORCEMENT

Publication number: 20140229595

Abstract: A selectable policy enforcement mode of a policy enforcement point (PEP) is configured that, based upon a selected policy enforcement mode, selectively modifies an input message processing context. The selectively modified input message processing context is used to invoke policy enforcement rules from multiple policy domains to implement one of independent policy enforcement and sequential cascaded policy enforcement of the policy enforcement rules from the multiple policy domains. The selection of the policy enforcement mode is detected. The input message processing context used to invoke the policy enforcement rules is selectively modified based upon on the selected policy enforcement mode. The policy enforcement rules from the multiple policy domains are enforced during runtime based upon the selected policy enforcement mode and the modified input message processing context used to invoke the policy enforcement rules.

Type: Application

Filed: February 12, 2013

Publication date: August 14, 2014

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Thomas C. Burke, Mario E. De Armas, Oswaldo Gago, Gaurang Shah, Maria E. Smith
DYNAMIC GENERATION OF POLICY ENFORCEMENT RULES AND ACTIONS FROM POLICY ATTACHMENT SEMANTICS

Publication number: 20140229593

Abstract: At least one defined service policy to be enforced by a policy enforcement point (PEP) is obtained. The obtained at least one defined service policy is parsed to identify at least one set of enforceable policy provisions and the at least one set of enforceable policy provisions is identified. Each set of enforceable policy provisions includes a policy subject, a policy domain, and at least one assertion as the enforceable policy provisions within the at least one defined service policy. At least one runtime processing rule including at least one processing action usable by the PEP is created to enforce the policy subject, the policy domain, and the at least one assertion of each identified at least one set of enforceable policy provisions.

Type: Application

Filed: February 12, 2013

Publication date: August 14, 2014

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Thomas C. Burke, Mario E. De Armas, Oswaldo Gago, Gaurang Shah, Maria E. Smith
SYSTEM FOR TUNING SYNTHESIZED SPEECH

Publication number: 20140058734

Abstract: An embodiment of the invention is a software tool used to convert text, speech synthesis markup language (SSML), and/or extended SSML to synthesized audio. Provisions are provided to create, view, play, and edit the synthesized speech, including editing pitch and duration targets, speaking type, paralinguistic events, and prosody. Prosody can be provided by way of a sample recording. Users can interact with the software tool by way of a graphical user interface (GUI). The software tool can produce synthesized audio file output in many file formats.

Type: Application

Filed: April 3, 2013

Publication date: February 27, 2014

Inventors: Raimo Bakis, Ellen Marie Eide, Roberto Pieraccini, Maria E. Smith, Jie Z. Zeng
Disambiguating text that is to be converted to speech using configurable lexeme based rules

Patent number: 8538743

Abstract: A software language including language constructs for disambiguating text that is to be converted to speech using configurable lexeme based rules. The language can include at least one conditional statement and a significance indicator. The conditional statement can define a sense of usage for a lexeme. The significance indicator can define a criteria for selecting an associated sense of usage. The language can also include an action expression that is associated with a conditional statement that defines a set of programmatic actions to be executed upon a selection of the associated usage sense. The conditional statement can include a context range specification that defines a scope of an input string for examination when evaluating the conditional statement. Further, the conditional statement can include a directive that represents a defined condition of the lexeme or the text surrounding the lexeme.

Type: Grant

Filed: March 21, 2007

Date of Patent: September 17, 2013

Assignee: Nuance Communications, Inc.

Inventors: Oswaldo Gago, Steven M. Hancock, Maria E. Smith
System for tuning synthesized speech

Patent number: 8438032

Abstract: An embodiment of the invention is a software tool used to convert text, speech synthesis markup language (SSML), and or extended SSML to synthesized audio. Provisions are provided to create, view, play, and edit the synthesized speech including editing pitch and duration targets, speaking type, paralinguistic events, and prosody. Prosody can be provided by way of a sample recording. Users can interact with the software tool by way of a graphical user interface (GUI). The software tool can produce synthesized audio file output in many file formats.

Type: Grant

Filed: January 9, 2007

Date of Patent: May 7, 2013

Assignee: Nuance Communications, Inc.

Inventors: Raimo Bakis, Ellen M. Eide, Roberto Pieraccini, Maria E. Smith, Jie Zeng
Interactive debugging and tuning of methods for CTTS voice building

Patent number: 7853452

Abstract: A method, a system, and an apparatus for identifying and correcting sources of problems in synthesized speech which is generated using a concatenative text-to-speech (CTTS) technique. The method can include the step of displaying a waveform corresponding to synthesized speech generated from concatenated phonetic units. The synthesized speech can be generated from text input received from a user. The method further can include the step of displaying parameters corresponding to at least one of the phonetic units. The method can include the step of displaying the original recordings containing selected phonetic units. An editing input can be received from the user and the parameters can be adjusted in accordance with the editing input.

Type: Grant

Filed: December 3, 2008

Date of Patent: December 14, 2010

Assignee: Nuance Communications, Inc.

Inventors: Philip Gleason, Maria E. Smith, Mahesh Viswanathan, Jie Zeng
Method, apparatus and computer program providing a multi-speaker database for concatenative text-to-speech synthesis

Patent number: 7716052

Abstract: A method, apparatus and a computer program product to generate an audible speech word that corresponds to text. The method includes providing a text word and, in response to the text word, processing pre-recorded speech segments that are derived from a plurality of speakers to selectively concatenate together speech segments based on at least one cost function to form audio data for generating an audible speech word that corresponds to the text word. A data structure is also provided for use in a concatenative text-to-speech system that includes a plurality of speech segments derived from a plurality of speakers, where each speech segment includes an associated attribute vector each of which is comprised of at least one attribute vector element that identifies the speaker from which the speech segment was derived.

Type: Grant

Filed: April 7, 2005

Date of Patent: May 11, 2010

Assignee: Nuance Communications, Inc.

Inventors: Andrew S. Aaron, Ellen M. Eide, Wael M. Hamza, Michael A. Picheny, Charles T. Rutherfoord, Zhi Wei Shuang, Maria E. Smith
INTERACTIVE DEBUGGING AND TUNING OF METHODS FOR CTTS VOICE BUILDING

Publication number: 20090083037

Abstract: A method, a system, and an apparatus for identifying and correcting sources of problems in synthesized speech which is generated using a concatenative text-to-speech (CTTS) technique. The method can include the step of displaying a waveform corresponding to synthesized speech generated from concatenated phonetic units. The synthesized speech can be generated from text input received from a user. The method further can include the step of displaying parameters corresponding to at least one of the phonetic units. The method can include the step of displaying the original recordings containing selected phonetic units. An editing input can be received from the user and the parameters can be adjusted in accordance with the editing input.

Type: Application

Filed: December 3, 2008

Publication date: March 26, 2009

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Philip Gleason, Maria E. Smith, Mahesh Viswanathan, Jie Zeng
Interactive debugging and tuning method for CTTS voice building

Patent number: 7487092

Abstract: A method, a system, and an apparatus for identifying and correcting sources of problems in synthesized speech which is generated using a concatenative text-to-speech (CTTS) technique. The method can include the step of displaying a waveform corresponding to synthesized speech generated from concatenated phonetic units. The synthesized speech can be generated from text input received from a user. The method further can include the step of displaying parameters corresponding to at least one of the phonetic units. The method can include the step of displaying the original recordings containing selected phonetic units. An editing input can be received from the user and the parameters can be adjusted in accordance with the editing input.

Type: Grant

Filed: October 17, 2003

Date of Patent: February 3, 2009

Assignee: International Business Machines Corporation

Inventors: Philip Gleason, Maria E. Smith, Mahesh Viswanathan, Jie Z. Zeng
Speech segment clustering and ranking

Patent number: 7475016

Abstract: A system, method, and apparatus for identifying problematic speech segments is provided. The system includes a clustering module for generating a first cluster of one or more consecutive speech segments if the consecutive speech segments satisfy a predetermined filtering test, and for generating a second cluster comprising at least one different consecutive speech segment selected from the ordered sequence if the at least one different consecutive speech segment satisfies the predetermined filtering test. The system also includes a combining module for combining the first and second clusters as well as the at least one intervening consecutive speech segment to form an aggregated cluster if the aggregated cluster satisfies a predetermined combining criterion. The system can further include a ranking module for ranking aggregated clusters, the ranking reflecting a relative severity of misalignments among problematic speech segments.

Type: Grant

Filed: December 15, 2004

Date of Patent: January 6, 2009

Assignee: International Business Machines Corporation

Inventors: Maria E. Smith, Jie Z. Zeng
Object specific language extension interface for a multi-level data structure

Patent number: 7464065

Abstract: A computerized method (300) and software product (200) is provided for querying and modifying a Multi-Level Data Structure (106) stored in a Text-to-Speech (100) engine of a data processing system having a Central Processing Unit (202), a processing system memory (203), and an operating system (201), using an application program written in an interpretive programming language. The method includes the steps of initializing (302) by means of the CPU implementing a set of commands, a data processing environment for processing the application program, processing (306) the application program, where the processing includes identifying a marked command that encapsulates a DPMS program, and upon identifying a marked command, operating (318) on the MLDS using a DPMS interpreter for producing a result from the MLDS, the result available to the application program during execution of the application program.

Type: Grant

Filed: November 21, 2005

Date of Patent: December 9, 2008

Assignee: International Business Machines Corporation

Inventors: Philip Gleason, Steven M. Hancock, Maria E. Smith
DISAMBIGUATING TEXT THAT IS TO BE CONVERTED TO SPEECH USING CONFIGURABLE LEXEME BASED RULES

Publication number: 20080235004

Abstract: A software language including language constructs for disambiguating text that is to be converted to speech using configurable lexeme based rules. The language can include at least one conditional statement and a significance indicator. The conditional statement can define a sense of usage for a lexeme. The significance indicator can define a criteria for selecting an associated sense of usage. The language can also include an action expression that is associated with a conditional statement that defines a set of programmatic actions to be executed upon a selection of the associated usage sense. The conditional statement can include a context range specification that defines a scope of an input string for examination when evaluating the conditional statement. Further, the conditional statement can include a directive that represents a defined condition of the lexeme or the text surrounding the lexeme.

Type: Application

Filed: March 21, 2007

Publication date: September 25, 2008

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: OSWALDO GAGO, STEVEN M. HANCOCK, MARIA E. SMITH
SYSTEM FOR TUNING SYNTHESIZED SPEECH

Publication number: 20080167875

Abstract: An embodiment of the invention is a software tool used to convert text, speech synthesis markup language (SSML), and or extended SSML to synthesized audio. Provisions are provided to create, view, play, and edit the synthesized speech including editing pitch and duration targets, speaking type, paralinguistic events, and prosody. Prosody can be provided by way of a sample recording. Users can interact with the software tool by way of a graphical user interface (GUI). The software tool can produce synthesized audio file output in many file formats.

Type: Application

Filed: January 9, 2007

Publication date: July 10, 2008

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Raimo Bakis, Ellen M. Eide, Roberto Pieraccini, Maria E. Smith, Jie Zeng
Method for detecting misaligned phonetic units for a concatenative text-to-speech voice

Patent number: 7280967

Abstract: A method of filtering phonetic units to be used within a concatenative text-to-speech (CTTS) voice. Initially, a normality threshold can be established. At least one phonetic unit that has been automatically extracted from a speech corpus in order to construct the CTTS voice can be received. An abnormality index can be calculated for the phonetic unit. Then, the abnormality index can be compared to the established normality threshold. If the abnormality index exceeds the normality threshold, the phonetic unit can be marked as a suspect phonetic unit. If the abnormality index does not exceed the normality threshold, the phonetic unit can be marked as a verified phonetic unit. The concatenative text-to-speech voice can be built using the verified phonetic units.

Type: Grant

Filed: July 30, 2003

Date of Patent: October 9, 2007

Assignee: International Business Machines Corporation

Inventors: Philip Gleason, Maria E. Smith, Jie Z. Zeng
Method for preserving contextual accuracy in an extendible speech recognition language model

Patent number: 7103533

Abstract: A method of generating language model statistics for a new word added to a language model incorporating at least one class file containing contextually related words. The method can include the following steps: First, language model statistics can be computed based on references to at least one incorporated class file. Second, a new word can be substituted for each reference to a selected class file. Additionally, the language model statistics can be re-computed based on the new word having been substituted for the reference. Third, the re-computed language model statistics can be displayed in a user interface and modifications can be accepted to the re-computed language model statistics through the user interface. Fourth, the language model statistics can be further re-computed based on the modifications. In consequence, the language model statistics are re-computed for the new word without introducing contextual inaccuracies in the language model.

Type: Grant

Filed: February 21, 2001

Date of Patent: September 5, 2006

Assignee: International Business Machines Corporation

Inventors: James R. Lewis, Kerry A. Ortega, C. Thomas Rutherfoord, Maria E. Smith
Method and system for text-to-speech caching

Patent number: 7043432

Abstract: In a text-to-speech system, a method of converting text-to-speech can include receiving a text input and comparing the received text input to at least one entry in a text-to-speech cache memory. Each entry in the text-to-speech cache memory can specify a corresponding spoken output. If the text input matches one of the entries in the text-to-speech cache memory, the cached speech output specified by the matching entry can be provided.

Type: Grant

Filed: August 29, 2001

Date of Patent: May 9, 2006

Assignee: International Business Machines Corporation

Inventors: Raimo Bakis, Hari Chittaluru, Edward A. Epstein, Steven J. Friedland, Abraham Ittycheriah, Stephen G. Lawrence, Michael A. Picheny, Charles Rutherfoord, Maria E. Smith
Method and apparatus for recognizing from here to here voice command structures in a finite grammar speech recognition system

Patent number: 6745165

Abstract: A method and system uses a finite state command grammar coordinated with application scripting to recognize voice command structures for performing an event from an initial location to a new location. The method involves a series of steps, including: recognizing an enabling voice command specifying the event to be performed from the initial location; determining a functional expression for the enabling voice command defined by one or more actions and objects; storing the action and object in a memory location; receiving input specifying the new location; recognizing an activating voice command for performing the event up to the new location; retrieving the stored action and object from the memory location; and performing the event from the initial location to the new location according to the retrieved action and object. Preferably, the enabling-activating command is phrased as “from here . . . to here”.

Type: Grant

Filed: June 16, 1999

Date of Patent: June 1, 2004

Assignee: International Business Machines Corporation

Inventors: James R. Lewis, Kerry A. Ortega, Maria E. Smith, Thomas A. Kist, Linda M. Boyer

prev 1 2 3 next