Patents by Inventor Juan Miguel Pino

Juan Miguel Pino has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11227110
    Abstract: Embodiments are disclosed for transliterating text entries across different script systems. A method according to some embodiments includes steps of: receiving an input string in a first script system input using a keyboard; segmenting, using a probabilistic model, the input string into phonemes that correspond to characters or sets of characters in a second script system; converting the phonemes in the first script system into the characters or sets of characters in the second script system, the characters or sets of characters forming a word or a word prefix in the second script system; and outputting the word or the word prefix in the second script system.
    Type: Grant
    Filed: March 27, 2020
    Date of Patent: January 18, 2022
    Assignee: FACEBOOK, INC.
    Inventors: Juan Miguel Pino, Stanislav Funiak, Mridul Malpani, Gaurav Lochan
  • Patent number: 10810380
    Abstract: Embodiments are disclosed for transliteration based on a machine translation model training pipeline. A method according to some embodiments includes steps of: training a probabilistic model for transliteration from a first script system to a second script system using a machine translation model training pipeline; segmenting, using the probabilistic model, an input string in the first script system into phonemes that correspond to characters in the second script system; converting the phonemes in the first script system into the characters in the second script system, the characters forming a word or a word prefix in the second script system; and outputting the word or the word prefix in the second script system.
    Type: Grant
    Filed: December 21, 2016
    Date of Patent: October 20, 2020
    Assignee: FACEBOOK, INC.
    Inventors: Juan Miguel Pino, Madhu Ramanathan
  • Patent number: 10643028
    Abstract: Embodiments are disclosed for transliterating text entries across different script systems. A method according to some embodiments includes steps of: receiving an input string in a first script system input using a keyboard; segmenting, using a probabilistic model, the input string into phonemes that correspond to characters or sets of characters in a second script system; converting the phonemes in the first script system into the characters or sets of characters in the second script system, the characters or sets of characters forming a word or a word prefix in the second script system; and outputting the word or the word prefix in the second script system.
    Type: Grant
    Filed: July 19, 2019
    Date of Patent: May 5, 2020
    Assignee: FACEBOOK, INC.
    Inventors: Juan Miguel Pino, Stanislav Funiak, Mridul Malpani, Gaurav Lochan
  • Patent number: 10489507
    Abstract: In one embodiment, a method includes identifying a plurality of dyslexic users on an online social network. The plurality of dyslexic users may be identified based on content objects posted by these users over a particular time period, where the content objects may include one or more of word-level errors or sentence-level errors. A machine-learning model may be trained for text correction using a corpus of social network data, which may include at least the content objects with one or more of word-level errors or sentence-level errors, and a corresponding set of corrected content objects. A text string including one or more errors may be received from a client system associated with a first user. The text string may be transformed into a vector representation using an encoder of the machine-learning model. A corrected text string may be generated from the vector representation using a decoder of the machine-learning model.
    Type: Grant
    Filed: January 2, 2018
    Date of Patent: November 26, 2019
    Assignee: Facebook, Inc.
    Inventors: Xian Li, Irina-Elena Veliche, Debnil Sur, Shaomei Wu, Amit Bahl, Juan Miguel Pino
  • Patent number: 10474751
    Abstract: Technology is disclosed for building correction models that correct natural language snippets. Correction models can include rules comprising pairs of word sequences identified from viable correction snippet pairs, where a first sequence of words in the pair should be replaced with a second sequence of words in the pair. Viable correction snippet pairs can be identified from among pairs of language snippets, such as a post to a social media website and a subsequent update to that post. Viable corrections can be the snippet pairs that both have no more unaligned words than a word alignment threshold and have no aligned word pair with a character edit difference above an edit distance threshold. In some implementations, word alignments can be found by aligning all the characters between a pair of language snippets, and identifying aligned words as those that have at least one aligned letter in common.
    Type: Grant
    Filed: January 11, 2018
    Date of Patent: November 12, 2019
    Assignee: FACEBOOK, INC.
    Inventors: Juan Miguel Pino, Matthias Gerhard Eck, Rui Andre Augusto Ferreira
  • Patent number: 10402489
    Abstract: Embodiments are disclosed for transliterating text entries across different script systems. A method according to some embodiments includes steps of: receiving an input string in a first script system input using a keyboard; segmenting, using a probabilistic model, the input string into phonemes that correspond to characters or sets of characters in a second script system; converting the phonemes in the first script system into the characters or sets of characters in the second script system, the characters or sets of characters forming a word or a word prefix in the second script system; and outputting the word or the word prefix in the second script system.
    Type: Grant
    Filed: December 21, 2016
    Date of Patent: September 3, 2019
    Assignee: FACEBOOK, INC.
    Inventors: Juan Miguel Pino, Stanislav Funiak, Mridul Malpani, Gaurav Lochan
  • Patent number: 10394960
    Abstract: Embodiments are disclosed for transliteration decoding using a tree structure. A method according to some embodiments includes steps of: generating a tree structure for an input string in a first script system, the tree structure including nodes representing segments of the input string; identifying segmentation candidates for the input string based on paths of the tree structure, the segmentation candidates segmenting the input string into character groups; selecting a segmentation candidate based on probabilities of the segmentation candidates predicted by a probabilistic model; segmenting the input string into character groups that correspond to characters in a second script system; decoding the character groups in the first script system into the characters in the second script system, the characters forming a word or a word prefix in the second script system; and outputting the word or the word prefix in the second script system.
    Type: Grant
    Filed: December 21, 2016
    Date of Patent: August 27, 2019
    Assignee: FACEBOOK, INC.
    Inventors: Juan Miguel Pino, Stanislav Funiak, Mridul Malpani, Gaurav Lochan
  • Publication number: 20190205372
    Abstract: In one embodiment, a method includes identifying a plurality of dyslexic users on an online social network. The plurality of dyslexic users may be identified based on content objects posted by these users over a particular time period, where the content objects may include one or more of word-level errors or sentence-level errors. A machine-learning model may be trained for text correction using a corpus of social network data, which may include at least the content objects with one or more of word-level errors or sentence-level errors, and a corresponding set of corrected content objects. A text string including one or more errors may be received from a client system associated with a first user. The text string may be transformed into a vector representation using an encoder of the machine-learning model. A corrected text string may be generated from the vector representation using a decoder of the machine-learning model.
    Type: Application
    Filed: January 2, 2018
    Publication date: July 4, 2019
    Inventors: Xian Li, Irina-Elena Veliche, Debnil Sur, Shaomei Wu, Amit Bahl, Juan Miguel Pino
  • Publication number: 20190018837
    Abstract: Technology is disclosed for building correction models that correct natural language snippets. Correction models can include rules comprising pairs of word sequences identified from viable correction snippet pairs, where a first sequence of words in the pair should be replaced with a second sequence of words in the pair. Viable correction snippet pairs can be identified from among pairs of language snippets, such as a post to a social media website and a subsequent update to that post. Viable corrections can be the snippet pairs that both have no more unaligned words than a word alignment threshold and have no aligned word pair with a character edit difference above an edit distance threshold. In some implementations, word alignments can be found by aligning all the characters between a pair of language snippets, and identifying aligned words as those that have at least one aligned letter in common.
    Type: Application
    Filed: January 11, 2018
    Publication date: January 17, 2019
    Inventors: Juan Miguel Pino, Matthias Gerhard Eck, Rui Andre Augusto Ferreira
  • Publication number: 20180173695
    Abstract: Embodiments are disclosed for transliterating text entries across different script systems. A method according to some embodiments includes steps of: receiving an input string in a first script system input using a keyboard; segmenting, using a probabilistic model, the input string into phonemes that correspond to characters or sets of characters in a second script system; converting the phonemes in the first script system into the characters or sets of characters in the second script system, the characters or sets of characters forming a word or a word prefix in the second script system; and outputting the word or the word prefix in the second script system.
    Type: Application
    Filed: December 21, 2016
    Publication date: June 21, 2018
    Inventors: Juan Miguel Pino, Stanislav Funiak, Mridul Malpani, Guarav Lochan
  • Publication number: 20180174572
    Abstract: Embodiments are disclosed for transliteration based on a machine translation model training pipeline. A method according to some embodiments includes steps of: training a probabilistic model for transliteration from a first script system to a second script system using a machine translation model training pipeline; segmenting, using the probabilistic model, an input string in the first script system into phonemes that correspond to characters in the second script system; converting the phonemes in the first script system into the characters in the second script system, the characters forming a word or a word prefix in the second script system; and outputting the word or the word prefix in the second script system.
    Type: Application
    Filed: December 21, 2016
    Publication date: June 21, 2018
    Inventors: Juan Miguel Pino, Madhu Ramanathan
  • Publication number: 20180173689
    Abstract: Embodiments are disclosed for transliteration decoding using a tree structure. A method according to some embodiments includes steps of: generating a tree structure for an input string in a first script system, the tree structure including nodes representing segments of the input string; identifying segmentation candidates for the input string based on paths of the tree structure, the segmentation candidates segmenting the input string into character groups; selecting a segmentation candidate based on probabilities of the segmentation candidates predicted by a probabilistic model; segmenting the input string into character groups that correspond to characters in a second script system; decoding the character groups in the first script system into the characters in the second script system, the characters forming a word or a word prefix in the second script system; and outputting the word or the word prefix in the second script system.
    Type: Application
    Filed: December 21, 2016
    Publication date: June 21, 2018
    Inventors: Juan Miguel Pino, Stanislav Funiak, Mridul Malpani, Guarav Lochan
  • Patent number: 10002125
    Abstract: Specialized language processing engines can use author-specific or reader-specific language models to improve language processing results by selecting phrases most likely to be used by an author or by tailoring output to language with which the reader is familiar. Language models that are author-specific can be generated by identifying characteristics of an author or author type such as age, gender, and location. An author-specific language model can be built using, as training data, language items written by users with the identified characteristics. Language models that are reader-specific can be generated using, as training data, language items written by or viewed by that reader. When implementing a specialized machine translation engine, multiple possible translations can be generated. An author-specific language model or a reader-specific language model can provide scores for possible translations, which can be used to select the best translation.
    Type: Grant
    Filed: December 28, 2015
    Date of Patent: June 19, 2018
    Assignee: FACEBOOK, INC.
    Inventors: Juan Miguel Pino, Ying Zhang, Matthias Gerhard Eck
  • Patent number: 9904672
    Abstract: Technology is disclosed for building correction models that correct natural language snippets. Correction models can include rules comprising pairs of word sequences identified from viable correction snippet pairs, where a first sequence of words in the pair should be replaced with a second sequence of words in the pair. Viable correction snippet pairs can be identified from among pairs of language snippets, such as a post to a social media website and a subsequent update to that post. Viable corrections can be the snippet pairs that both have no more unaligned words than a word alignment threshold and have no aligned word pair with a character edit difference above an edit distance threshold. In some implementations, word alignments can be found by aligning all the characters between a pair of language snippets, and identifying aligned words as those that have at least one aligned letter in common.
    Type: Grant
    Filed: June 30, 2015
    Date of Patent: February 27, 2018
    Assignee: Facebook, Inc.
    Inventors: Juan Miguel Pino, Matthias Gerhard Eck, Rui Andre Augusto Ferreira
  • Publication number: 20170185583
    Abstract: Specialized language processing engines can use author-specific or reader-specific language models to improve language processing results by selecting phrases most likely to be used by an author or by tailoring output to language with which the reader is familiar. Language models that are author-specific can be generated by identifying characteristics of an author or author type such as age, gender, and location. An author-specific language model can be built using, as training data, language items written by users with the identified characteristics. Language models that are reader-specific can be generated using, as training data, language items written by or viewed by that reader. When implementing a specialized machine translation engine, multiple possible translations can be generated. An author-specific language model or a reader-specific language model can provide scores for possible translations, which can be used to select the best translation.
    Type: Application
    Filed: December 28, 2015
    Publication date: June 29, 2017
    Inventors: Juan Miguel Pino, Ying Zhang, Matthias Gerhard Eck
  • Publication number: 20170004121
    Abstract: Technology is disclosed for building correction models that correct natural language snippets. Correction models can include rules comprising pairs of word sequences identified from viable correction snippet pairs, where a first sequence of words in the pair should be replaced with a second sequence of words in the pair. Viable correction snippet pairs can be identified from among pairs of language snippets, such as a post to a social media website and a subsequent update to that post. Viable corrections can be the snippet pairs that both have no more unaligned words than a word alignment threshold and have no aligned word pair with a character edit difference above an edit distance threshold. In some implementations, word alignments can be found by aligning all the characters between a pair of language snippets, and identifying aligned words as those that have at least one aligned letter in common.
    Type: Application
    Filed: June 30, 2015
    Publication date: January 5, 2017
    Inventors: Juan Miguel Pino, Matthias Gerhard Eck, Rui Andre Augusto Ferreira