Patents by Inventor Salim Estephan Roukos

Salim Estephan Roukos has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 8768686
    Abstract: A method of identifying and using side information available to statistical machine translation systems within an enterprise setting, the method including extracting user-specific interaction and non-interaction-based information from at least one corresponding database within the enterprise for each of a plurality of users, aggregating the user-specific interaction and non-interaction based information from a plurality of users, by using a processor on a computer, to tune and adapt background translation and language models, and updating all relevant models within the enterprise after user activity based on the tuned and adapted translation and language models.
    Type: Grant
    Filed: May 13, 2010
    Date of Patent: July 1, 2014
    Assignee: International Business Machines Corporation
    Inventors: Ruhi Sarikaya, Jiri Navratil, Bhuvana Ramabhadran, David Eubensky, Salim Estephan Roukos
  • Publication number: 20110282648
    Abstract: A method of identifying and using side information available to statistical machine translation systems within an enterprise setting, the method including extracting user-specific interaction and non-interaction-based information from at least one corresponding database within the enterprise for each of a plurality of users, aggregating the user-specific interaction and non-interaction based information from a plurality of users, by using a processor on a computer, to tune and adapt background translation and language models, and updating all relevant models within the enterprise after user activity based on the tuned and adapted translation and language models.
    Type: Application
    Filed: May 13, 2010
    Publication date: November 17, 2011
    Applicant: International Business Machines Corporation
    Inventors: Ruhi Sarikaya, Jiri Navratil, Bhuvana Ramabhadran, David Eubenski, Salim Estephan Roukos
  • Publication number: 20080222511
    Abstract: Methods and apparatus are provided for annotating documents with one or more of entities, events and relations. Documents are annotated by presenting the document to a user; presenting the user with a list of possible entity types, wherein the list of possible entity types is configurable; and obtaining at least one mention annotation that associates a selected phrase in the document with one of the possible entity types. The selected phrase can be presented to the user, for example, based on one or more presentation rules associated with the associated entity type. The method can be implemented, for example, in a client-server configuration where a browser communicates with a remote server.
    Type: Application
    Filed: April 2, 2008
    Publication date: September 11, 2008
    Applicant: International Business Machines Corporation
    Inventors: Nandakishore Kambhatla, Salim Estephan Roukos
  • Patent number: 6073095
    Abstract: A fast vocabulary independent method for spotting words in speech utilizes a preprocessing step and a coarse-to-detailed search strategy for spotting a word/phone sequence in speech. The preprocessing includes a Viterbi-beam phone level decoding using a tree-based phone language model. The coarse search matches phone-ngrams to identify regions of speech as putative word hits, and the detailed search performs an acoustic match at the putative hits with a model of the given word included in the vocabulary of the recognizer.
    Type: Grant
    Filed: October 15, 1997
    Date of Patent: June 6, 2000
    Assignee: International Business Machines Corporation
    Inventors: Satyanarayana Dharanipragada, Ellen Marie Eide, Salim Estephan Roukos
  • Patent number: 5991710
    Abstract: A system for translating a first word set in a source language into a second word set in a target language, the system comprising: input means for inputting the first word set into the system; tagging means for tagging the first word set input to the system so as to at least substantially reduce non-essential variability in the first word set; translation means including a single a posteriori conditional probability model and a target candidate store for storing target language candidate word sets, wherein the translation means employs the single model to evaluate the target language candidate word sets in order to select the target language candidate word set having a best score with respect to the first word set; and output means for outputting the best scoring target language candidate word set as the second word set in the target language.
    Type: Grant
    Filed: May 20, 1997
    Date of Patent: November 23, 1999
    Assignee: International Business Machines Corporation
    Inventors: Kishore Ananda Papineni, Salim Estephan Roukos, Robert Todd Ward
  • Patent number: 5987404
    Abstract: The invention proposes using statistical methods to do natural language understanding. The key notion is that there are "strings" of words in the natural language, that correspond to a single semantic concept. One can then define an alignment between an entire semantic meaning (consisting of a set of semantic concepts), and the English. This is modeled using P(E,A.vertline.S). One can model p(S) separately. This allows each parameter to be modeled using many different statistical models.
    Type: Grant
    Filed: January 29, 1996
    Date of Patent: November 16, 1999
    Assignee: International Business Machines Corporation
    Inventors: Stephen Andrew Della Pietra, Mark Edward Epstein, Martin Franz, Joshua David Sherer Koppelman, Salim Estephan Roukos, Robert Todd Ward
  • Patent number: 5953701
    Abstract: A method of gender dependent speech recognition includes the steps of identifying phone state models common to both genders, identifying gender specific phone state models, identifying a gender of a speaker and recognizing acoustic data from the speaker.
    Type: Grant
    Filed: January 22, 1998
    Date of Patent: September 14, 1999
    Assignee: International Business Machines Corporation
    Inventors: Chalapathy Venkata Neti, Salim Estephan Roukos
  • Patent number: 5835888
    Abstract: A statistical language model for inflected languages, having very large vocabularies, is generated by splitting words into stems, prefixes and endings, and deriving trigrams for the stems, ending and prefixes. The statistical dependence of endings and prefixes from each stem is also obtained, and the resulting language model is a weighted sum of these scores.
    Type: Grant
    Filed: June 10, 1996
    Date of Patent: November 10, 1998
    Assignee: International Business Machines Corporation
    Inventors: Dimitri Kanevsky, Salim Estephan Roukos, Jan Sedivy