Patents by Inventor Dragos Stefan Munteanu

Dragos Stefan Munteanu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 8943080
    Abstract: Systems, computer programs, and methods for identifying parallel documents and/or fragments in a bilingual collection are provided. The method for identifying parallel sub-sentential fragments in a bilingual collection comprises translating a source document from a bilingual collection. The method further includes querying a target library associated with the bilingual collection using the translated source document, and identifying one or more target documents based on the query. Subsequently, a source sentence associated with the source document is aligned to one or more target sentences associated with the one or more target documents. Finally, the method includes determining whether a source fragment associated with the source sentence comprises a parallel translation of a target fragment associated with the one or more target sentences.
    Type: Grant
    Filed: December 5, 2006
    Date of Patent: January 27, 2015
    Assignee: University of Southern California
    Inventors: Daniel Marcu, Dragos Stefan Munteanu
  • Patent number: 8296127
    Abstract: A translation training device which extracts from two nonparallel corpora a set of parallel sentences. The system finds parameters between different sentences or phrases, in order to find parallel sentences. The parallel sentences are then used for training a data-driven machine translation system. The process can be applied repetitively until sufficient data is collected or until the performance of the translation system stops improving.
    Type: Grant
    Filed: March 22, 2005
    Date of Patent: October 23, 2012
    Assignee: University of Southern California
    Inventors: Daniel Marcu, Dragos Stefan Munteanu
  • Patent number: 8234106
    Abstract: A machine translation system may use non-parallel monolingual corpora to generate a translation lexicon. The system may identify identically spelled words in the two corpora, and use them as a seed lexicon. The system may use various clues, e.g., context and frequency, to identify and score other possible translation pairs, using the seed lexicon as a basis. An alternative system may use a small bilingual lexicon in addition to non-parallel corpora to learn translations of unknown words and to generate a parallel corpus.
    Type: Grant
    Filed: October 8, 2009
    Date of Patent: July 31, 2012
    Assignee: University of Southern California
    Inventors: Daniel Marcu, Kevin Knight, Dragos Stefan Munteanu, Philipp Koehn
  • Publication number: 20100042398
    Abstract: A machine translation system may use non-parallel monolingual corpora to generate a translation lexicon. The system may identify identically spelled words in the two corpora, and use them as a seed lexicon. The system may use various clues, e.g., context and frequency, to identify and score other possible translation pairs, using the seed lexicon as a basis. An alternative system may use a small bilingual lexicon in addition to non-parallel corpora to learn translations of unknown words and to generate a parallel corpus.
    Type: Application
    Filed: October 8, 2009
    Publication date: February 18, 2010
    Inventors: Daniel Marcu, Kevin Knight, Dragos Stefan Munteanu, Philipp Koehn
  • Patent number: 7620538
    Abstract: A machine translation system may use non-parallel monolingual corpora to generate a translation lexicon. The system may identify identically spelled words in the two corpora, and use them as a seed lexicon. The system may use various clues, e.g., context and frequency, to identify and score other possible translation pairs, using the seed lexicon as a basis. An alternative system may use a small bilingual lexicon in addition to non-parallel corpora to learn translations of unknown words and to generate a parallel corpus.
    Type: Grant
    Filed: March 26, 2003
    Date of Patent: November 17, 2009
    Assignee: University of Southern California
    Inventors: Daniel Marcu, Kevin Knight, Dragos Stefan Munteanu, Philipp Koehn
  • Publication number: 20030204400
    Abstract: A machine translation system may use non-parallel monolingual corpora to generate a translation lexicon. The system may identify identically spelled words in the two corpora, and use them as a seed lexicon. The system may use various clues, e.g., context and frequency, to identify and score other possible translation pairs, using the seed lexicon as a basis. An alternative system may use a small bilingual lexicon in addition to non-parallel corpora to learn translations of unknown words and to generate a parallel corpus.
    Type: Application
    Filed: March 26, 2003
    Publication date: October 30, 2003
    Inventors: Daniel Marcu, Kevin Knight, Dragos Stefan Munteanu, Philipp Koehn
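
The seed-lexicon idea described in several of the abstracts above (identically spelled words as a seed, then context as a clue for scoring other pairs) can be illustrated with a small Python sketch. This is not the patented method: the function names, the `min_length` filter, the fixed context window, and the use of cosine similarity are all assumptions chosen for illustration, and the frequency clue mentioned in the abstracts is left out.

```python
from collections import Counter


def seed_lexicon(source_tokens, target_tokens, min_length=4):
    """Map each word spelled identically in both corpora to itself.

    The min_length filter is an assumption added here to skip short
    function words that may coincide by accident.
    """
    shared = set(source_tokens) & set(target_tokens)
    return {word: word for word in shared if len(word) >= min_length}


def context_vector(tokens, word, seed_words, window=2):
    """Count seed words occurring within `window` tokens of `word`."""
    counts = Counter()
    for i, token in enumerate(tokens):
        if token != word:
            continue
        for neighbor in tokens[max(0, i - window): i + window + 1]:
            if neighbor in seed_words and neighbor != word:
                counts[neighbor] += 1
    return counts


def context_score(vec_a, vec_b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(vec_a[key] * vec_b[key] for key in vec_a)
    norm_a = sum(value * value for value in vec_a.values())
    norm_b = sum(value * value for value in vec_b.values())
    norm = (norm_a * norm_b) ** 0.5
    return dot / norm if norm else 0.0


# Toy, hypothetical corpora (not taken from the patents).
english = "the hotel restaurant serves dinner".split()
german = "das hotel restaurant serviert abendessen".split()

seed = seed_lexicon(english, german)
score = context_score(
    context_vector(english, "serves", seed),
    context_vector(german, "serviert", seed),
)
```

Because the seed entries are spelled the same in both languages, the two context vectors share their keys and can be compared directly. A candidate pair whose neighbors include the same seed words scores higher than one that shares none.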