Patents by Inventor Mehryar Mohri

Mehryar Mohri has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

CONCATENATION COST IN SPEECH SYNTHESIS FOR ACOUSTIC UNIT SEQUENTIAL PAIR USING HASH TABLE AND DEFAULT CONCATENATION COST

Publication number: 20170358292

Abstract: A speech synthesis process can record concatenation costs of unit sequential pairs to a concatenation cost database for speech synthesis by synthesizing speech from a text, identifying an acoustic unit sequential pair in the speech, searching for a concatenation cost for the acoustic unit sequential pair in a database using a hash table for the database, and when the concatenation cost is not found in the database, assigning a default value as the concatenation cost for the acoustic unit sequential pair.

Type: Application

Filed: June 26, 2017

Publication date: December 14, 2017

Inventors: Mark Charles BEUTNAGEL, Mehryar MOHRI, Michael Dennis RILEY
Concatenation cost in speech synthesis for acoustic unit sequential pair using hash table and default concatenation cost

Patent number: 9691376

Abstract: A speech synthesis process can record concatenation costs of unit sequential pairs to a concatenation cost database for speech synthesis by synthesizing speech from a text, identifying an acoustic unit sequential pair in the speech, searching for a concatenation cost for the acoustic unit sequential pair in a database using a hash table for the database, and when the concatenation cost is not found in the database, assigning a default value as the concatenation cost for the acoustic unit sequential pair.

Type: Grant

Filed: December 8, 2015

Date of Patent: June 27, 2017

Assignee: Nuance Communications, Inc.

Inventors: Mark Charles Beutnagel, Mehryar Mohri, Michael Dennis Riley
Recording Concatenation Costs of Most Common Acoustic Unit Sequential Pairs to a Concatenation Cost Database for Speech Synthesis

Publication number: 20160093288

Abstract: A speech synthesis can record concatenation costs of most common acoustic unit sequential pairs to a concatenation cost database for speech synthesis by synthesizing speech from a text, identifying a most common acoustic unit sequential pair in the speech, assigning a concatenation cost to the most common acoustic sequential pair, and recording the concatenation cost of the most common acoustic sequential pair to a concatenation cost database.

Type: Application

Filed: December 8, 2015

Publication date: March 31, 2016

Inventors: Mark Charles BEUTNAGEL, Mehryar MOHRI, Michael Dennis RILEY
Recording concatenation costs of most common acoustic unit sequential pairs to a concatenation cost database for speech synthesis

Patent number: 9236044

Abstract: A speech synthesis system can record concatenation costs of most common acoustic unit sequential pairs to a concatenation cost database for speech synthesis by synthesizing speech from a text, identifying a most common acoustic unit sequential pair in the speech, assigning a concatenation cost to the most common acoustic sequential pair, and recording the concatenation cost of the most common acoustic sequential pair to a concatenation cost database.

Type: Grant

Filed: July 18, 2014

Date of Patent: January 12, 2016

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Mark Charles Beutnagel, Mehryar Mohri, Michael Dennis Riley
Minimum bayesian risk methods for automatic speech recognition

Patent number: 9123333

Abstract: A hypothesis space of a search graph may be determined. The hypothesis space may include n hypothesis-space transcriptions of an utterance, each selected from a search graph that includes t>n transcriptions of the utterance. An evidence space of the search graph may also be determined. The evidence space may include m evidence-space transcriptions of the utterance that are randomly selected from the search graph, where t>m. For each particular hypothesis-space transcription in the hypothesis space, an expected word error rate may be calculated by comparing the particular hypothesis-space transcription to each of the evidence-space transcriptions. Based on the expected word error rates, a lowest expected word error rate may be obtained, and the particular hypothesis-space transcription that is associated with the lowest expected word error rate may be provided.

Type: Grant

Filed: February 20, 2013

Date of Patent: September 1, 2015

Assignee: Google Inc.

Inventors: Antoine Amarilli, Mehryar Mohri, Cyril Allauzen
SPEECH SYNTHESIS FROM ACOUSTIC UNITS WITH DEFAULT VALUES OF CONCATENATION COST

Publication number: 20140330567

Abstract: A speech synthesis system can select recorded speech fragments, or acoustic units, from a very large database of acoustic units to produce artificial speech. When a pair of acoustic units in the database does not have an associated concatenation cost, the system assigns a default concatenation cost. The system then synthesizes speech, identifies the acoustic unit sequential pairs generated and their respective concatenation costs, and stores those concatenation costs likely to occur.

Type: Application

Filed: July 18, 2014

Publication date: November 6, 2014

Inventors: Mark Charles Beutnagel, Mehryar Mohri, Michael Dennis Riley
Speech synthesis from acoustic units with default values of concatenation cost

Patent number: 8788268

Abstract: A speech synthesis system can select recorded speech fragments, or acoustic units, from a very large database of acoustic units to produce artificial speech. When a pair of acoustic units in the database does not have an associated concatenation cost, the system assigns a default concatenation cost. The system then synthesizes speech, identifies the acoustic unit sequential pairs generated and their respective concatenation costs, and stores those concatenation costs likely to occur.

Type: Grant

Filed: November 19, 2012

Date of Patent: July 22, 2014

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Mark Charles Beutnagel, Mehryar Mohri, Michael Dennis Riley
Systems and methods for generating weighted finite-state automata representing grammars

Patent number: 8543383

Abstract: A context-free grammar can be represented by a weighted finite-state transducer. This representation can be used to efficiently compile that grammar into a weighted finite-state automaton that accepts the strings allowed by the grammar with the corresponding weights. The rules of a context-free grammar are input. A finite-state automaton is generated from the input rules. Strongly connected components of the finite-state automaton are identified. An automaton is generated for each strongly connected component. A topology that defines a number of states, and that uses active ones of the non-terminal symbols of the context-free grammar as the labels between those states, is defined. The topology is expanded by replacing a transition, and its beginning and end states, with the automaton that includes, as a state, the symbol used as the label on that transition. The topology can be fully expanded or dynamically expanded as required to recognize a particular input string.

Type: Grant

Filed: October 28, 2011

Date of Patent: September 24, 2013

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Mehryar Mohri, Mark-Jan Nederhof
Systems and methods for determining the N-best strings

Patent number: 8527273

Abstract: Systems and methods for identifying the N-best strings of a weighted automaton. A potential for each state of an input automaton to a set of destination states of the input automaton is first determined. Then, the N-best paths are found in the result of an on-the-fly determinization of the input automaton. Only the portion of the input automaton needed to identify the N-best paths is determinized. As the input automaton is determinized, a potential for each new state of the partially determinized automaton is determined and is used in identifying the N-best paths of the determinized automaton, which correspond exactly to the N-best strings of the input automaton.

Type: Grant

Filed: July 30, 2012

Date of Patent: September 3, 2013

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Mehryar Mohri, Michael Dennis Riley
SYSTEMS AND METHODS FOR DETERMINING THE N-BEST STRINGS

Publication number: 20120296648

Abstract: Systems and methods for identifying the N-best strings of a weighted automaton. A potential for each state of an input automaton to a set of destination states of the input automaton is first determined. Then, the N-best paths are found in the result of an on-the-fly determinization of the input automaton. Only the portion of the input automaton needed to identify the N-best paths is determinized. As the input automaton is determinized, a potential for each new state of the partially determinized automaton is determined and is used in identifying the N-best paths of the determinized automaton, which correspond exactly to the N-best strings of the input automaton.

Type: Application

Filed: July 30, 2012

Publication date: November 22, 2012

Applicant: AT&T Corp.

Inventors: Mehryar Mohri, Michael Dennis Riley
Methods and apparatus for rapid acoustic unit selection from a large speech corpus

Patent number: 8315872

Abstract: A speech synthesis system can select recorded speech fragments, or acoustic units, from a very large database of acoustic units to produce artificial speech. The selected acoustic units are chosen to minimize a combination of target and concatenation costs for a given sentence. However, as concatenation costs, which are measures of the mismatch between sequential pairs of acoustic units, are expensive to compute, processing can be greatly reduced by pre-computing and caching the concatenation costs. Unfortunately, the number of possible sequential pairs of acoustic units makes such caching prohibitive. However, statistical experiments reveal that while about 85% of the acoustic units are typically used in common speech, less than 1% of the possible sequential pairs of acoustic units occur in practice.

Type: Grant

Filed: November 29, 2011

Date of Patent: November 20, 2012

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Mark Charles Beutnagel, Mehryar Mohri, Michael Dennis Riley
System and method of epsilon removal of weighted automata and transducers

Patent number: 8290774

Abstract: An improved -removal method is disclosed that computes for any input weighted automaton A with -transitions an equivalent weighted automaton B with no -transitions. The method comprises two main steps. The first step comprises computing for each state “p” of the automaton A its -closure. The second step in the method comprises modifying the outgoing transitions of each state “p” by removing those labeled with . The method next comprises adding to the set of transitions leaving the state “p” non--transitions leaving each state “q” in the set of states reachable from “p” via a path labeled with with their weights pre--multiplied by the -distance from state “p” to state “q” in the automaton A. State “p” is a final state if some state “q” within the set of states reachable from “p” via a path labeled with is final and the final weight ? ? [ p ] = ? q ? ? ? ? ? e ? [ p ] ? F ? ( d ? [ p , q ] ? ? ? ? [ q ] ) .

Type: Grant

Filed: July 20, 2001

Date of Patent: October 16, 2012

Assignee: AT&T Intellectual Property II, L.P.

Inventor: Mehryar Mohri
Systems and methods for determining the N-best strings

Patent number: 8234115

Abstract: Systems and methods for identifying the N-best strings of a weighted automaton. A potential for each state of an input automaton to a set of destination states of the input automaton is first determined. Then, the N-best paths are found in the result of an on-the-fly determinization of the input automaton. Only the portion of the input automaton needed to identify the N-best paths is determinized. As the input automaton is determinized, a potential for each new state of the partially determinized automaton is determined and is used in identifying the N-best paths of the determinized automaton, which correspond exactly to the N-best strings of the input automaton.

Type: Grant

Filed: November 21, 2002

Date of Patent: July 31, 2012

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Mehryar Mohri, Michael Dennis Riley
METHODS AND APPARATUS FOR RAPID ACOUSTIC UNIT SELECTION FROM A LARGE SPEECH CORPUS

Publication number: 20120136663

Abstract: A speech synthesis system can select recorded speech fragments, or acoustic units, from a very large database of acoustic units to produce artificial speech. The selected acoustic units are chosen to minimize a combination of target and concatenation costs for a given sentence. However, as concatenation costs, which are measures of the mismatch between sequential pairs or acoustic units, are expensive to compute, processing can be greatly reduced by pre-computing and aching the concatenation costs. The number of possible sequential pairs of acoustic units makes such caching prohibitive. Statistical experiments reveal that while about 85% of the acoustic units are typically used in common speech, less than 1% of the possible sequential pairs or acoustic units occur in practice. The system synthesizes a large body of speech, identifies the acoustic unit sequential pairs generated and their respective concatenation costs, and stores those concatenation costs likely to occur.

Type: Application

Filed: November 29, 2011

Publication date: May 31, 2012

Applicant: AT&T Intellectual Property II, L.P.

Inventors: Mark Charles Beutnagel, Mehryar Mohri, Michael Dennis Riley
SYSTEMS AND METHODS FOR GENERATING WEIGHTED FINITE-STATE AUTOMATA REPRESENTING GRAMMARS

Publication number: 20120046939

Abstract: A context-free grammar can be represented by a weighted finite-state transducer. This representation can be used to efficiently compile that grammar into a weighted finite-state automaton that accepts the strings allowed by the grammar with the corresponding weights. The rules of a context-free grammar are input. A finite-state automaton is generated from the input rules. Strongly connected components of the finite-state automaton are identified. An automaton is generated for each strongly connected component. A topology that defines a number of states, and that uses active ones of the non-terminal symbols of the context-free grammar as the labels between those states, is defined. The topology is expanded by replacing a transition, and its beginning and end states, with the automaton that includes, as a state, the symbol used as the label on that transition. The topology can be fully expanded or dynamically expanded as required to recognize a particular input string.

Type: Application

Filed: October 28, 2011

Publication date: February 23, 2012

Applicant: AT&T Intellectual Property II, L.P.

Inventors: Mehryar Mohri, Mark-Jan Nederhof
Methods and apparatus for rapid acoustic unit selection from a large speech corpus

Patent number: 8086456

Abstract: A speech synthesis system can select recorded speech fragments, or acoustic units, from a very large database of acoustic units to produce artificial speech. The selected acoustic units are chosen to minimize a combination of target and concatenation costs for a given sentence. However, as concatenation costs, which are measures of the mismatch between sequential pairs of acoustic units, are expensive to compute, processing can be greatly reduced by pre-computing and caching the concatenation costs. Unfortunately, the number of possible sequential pairs of acoustic units makes such caching prohibitive. A method for constructing an efficient concatenation cost database is provided by synthesizing a large body of speech, identifying the acoustic unit sequential pairs generated and their respective concatenation costs. By constructing a concatenation cost database in this fashion, the processing power required at run-time is greatly reduced with negligible effect on speech quality.

Type: Grant

Filed: July 20, 2010

Date of Patent: December 27, 2011

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Mark Charles Beutnagel, Mehryar Mohri, Michael Dennis Riley
Systems and methods for generating weighted finite-state automata representing grammars

Patent number: 8050908

Abstract: A context-free grammar can be represented by a weighted finite-state transducer. This representation can be used to efficiently compile that grammar into a weighted finite-state automaton that accepts the strings allowed by the grammar with the corresponding weights. The rules of a context-free grammar are input. A finite-state automaton is generated from the input rules. Strongly connected components of the finite-state automaton are identified. An automaton is generated for each strongly connected component. A topology that defines a number of states, and that uses active ones of the non-terminal symbols of the context-free grammar as the labels between those states, is defined. The topology is expanded by replacing a transition, and its beginning and end states, with the automaton that includes, as a state, the symbol used as the label on that transition. The topology can be fully expanded or dynamically expanded as required to recognize a particular input string.

Type: Grant

Filed: June 6, 2008

Date of Patent: November 1, 2011

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Mehryar Mohri, Mark-Jan Nederhof
Methods and Apparatus for Rapid Acoustic Unit Selection From a Large Speech Corpus

Publication number: 20100286986

Abstract: A speech synthesis system can select recorded speech fragments, or acoustic units, from a very large database of acoustic units to produce artificial speech. The selected acoustic units are chosen to minimize a combination of target and concatenation costs for a given sentence. However, as concatenation costs, which are measures of the mismatch between sequential pairs of acoustic units, are expensive to compute, processing can be greatly reduced by pre-computing and aching the concatenation costs. Unfortunately, the number of possible sequential pairs of acoustic units makes such caching prohibitive. However, statistical experiments reveal that while about 85% of the acoustic units are typically used in common speech, less than 1% of the possible sequential pairs of acoustic units occur in practice.

Type: Application

Filed: July 20, 2010

Publication date: November 11, 2010

Applicant: AT&T Intellectual Property II, L.P. via transfer from AT&T Corp.

Inventors: Mark Charles Beutnagel, Mehryar Mohri, Michael Dennis Riley
Systems and methods for determining the determinizability of finite-state automata and transducers

Patent number: 7783485

Abstract: Finite-state transducers and weighted finite-state automata may not be determinizable. The twins property can be used to characterize the determinizability of such devices. For a weighted finite-state automaton or transducer, that weighted finite-state automaton or transducer and its inverse are intersected or composed, respectively. The resulting device is checked to determine if it has the cycle-identity property. If not, the original weighted finite-state automaton or transducer is not determinizable. For a weighted or unweighted finite-state transducer, that device is checked to determine if it is functional. If not, that device is not determinizable. That device is then composed with its inverse. The composed device is checked to determine if every edge in the composed device having a cycle-accessible end state meets at least one of a number of conditions. If so, the original device has the twins property. If the original device has the twins property, then it is determinizable.

Type: Grant

Filed: June 29, 2007

Date of Patent: August 24, 2010

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Cyril Allauzen, Mehryar Mohri
Methods and apparatus for rapid acoustic unit selection from a large speech corpus

Patent number: 7761299

Abstract: A speech synthesis system can select recorded speech fragments, or acoustic units, from a large database of acoustic units to produce artificial speech. The selected acoustic units are chosen to minimize a combination of target and concatenation costs for a given sentence. Concatenation costs are expensive to compute. Processing is reduced by pre-computing and caching the concatenation costs. The number of possible sequential pairs of acoustic units makes such caching prohibitive. A method for constructing an efficient concatenation cost database is provided by synthesizing a large body of speech, identifying the acoustic unit sequential pairs generated and their respective concatenation costs, and storing those concatenation costs likely to occur.

Type: Grant

Filed: March 27, 2008

Date of Patent: July 20, 2010

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Mark Charles Beutnagel, Mehryar Mohri, Michael Dennis Riley

1 2 3 next