Patents by Inventor Juan Miguel Pino

Juan Miguel Pino has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Transliteration of text entry across scripts

Patent number: 11227110

Abstract: Embodiments are disclosed for transliterating text entries across different script systems. A method according to some embodiments includes steps of: receiving an input string in a first script system input using a keyboard; segmenting, using a probabilistic model, the input string into phonemes that correspond to characters or sets of characters in a second script system; converting the phonemes in the first script system into the characters or sets of characters in the second script system, the characters or sets of characters forming a word or a word prefix in the second script system; and outputting the word or the word prefix in the second script system.

Type: Grant

Filed: March 27, 2020

Date of Patent: January 18, 2022

Assignee: FACEBOOK, INC.

Inventors: Juan Miguel Pino, Stanislav Funiak, Mridul Malpani, Gaurav Lochan
Transliteration using machine translation pipeline

Patent number: 10810380

Abstract: Embodiments are disclosed for transliteration based on a machine translation model training pipeline. A method according to some embodiments includes steps of: training a probabilistic model for transliteration from a first script system to a second script system using a machine translation model training pipeline; segmenting, using the probabilistic model, an input string in the first script system into phonemes that correspond to characters in the second script system; converting the phonemes in the first script system into the characters in the second script system, the characters forming a word or a word prefix in the second script system; and outputting the word or the word prefix in the second script system.

Type: Grant

Filed: December 21, 2016

Date of Patent: October 20, 2020

Assignee: FACEBOOK, INC.

Inventors: Juan Miguel Pino, Madhu Ramanathan
Transliteration of text entry across scripts

Patent number: 10643028

Abstract: Embodiments are disclosed for transliterating text entries across different script systems. A method according to some embodiments includes steps of: receiving an input string in a first script system input using a keyboard; segmenting, using a probabilistic model, the input string into phonemes that correspond to characters or sets of characters in a second script system; converting the phonemes in the first script system into the characters or sets of characters in the second script system, the characters or sets of characters forming a word or a word prefix in the second script system; and outputting the word or the word prefix in the second script system.

Type: Grant

Filed: July 19, 2019

Date of Patent: May 5, 2020

Assignee: FACEBOOK, INC.

Inventors: Juan Miguel Pino, Stanislav Funiak, Mridul Malpani, Gaurav Lochan
Text correction for dyslexic users on an online social network

Patent number: 10489507

Abstract: In one embodiment, a method includes identifying a plurality of dyslexic users on an online social network. The plurality of dyslexic users may be identified based on content objects posted by these users over a particular time period, where the content objects may include one or more of word-level errors or sentence-level errors. A machine-learning model may be trained for text correction using a corpus of social network data, which may include at least the content objects with one or more of word-level errors or sentence-level errors, and a corresponding set of corrected content objects. A text string including one or more errors may be received from a client system associated with a first user. The text string may be transformed into a vector representation using an encoder of the machine-learning model. A corrected text string may be generated from the vector representation using a decoder of the machine-learning model.

Type: Grant

Filed: January 2, 2018

Date of Patent: November 26, 2019

Assignee: Facebook, Inc.

Inventors: Xian Li, Irina-Elena Veliche, Debnil Sur, Shaomei Wu, Amit Bahl, Juan Miguel Pino
Machine-translation based corrections

Patent number: 10474751

Abstract: Technology is disclosed for building correction models that correct natural language snippets. Correction models can include rules comprising pairs of word sequences identified from viable correction snippet pairs, where a first sequence of words in the pair should be replaced with a second sequence of words in the pair. Viable correction snippet pairs can be identified from among pairs of language snippets, such as a post to a social media website and a subsequent update to that post. Viable corrections can be the snippet pairs that both have no more unaligned words than a word alignment threshold and have no aligned word pair with a character edit difference above an edit distance threshold. In some implementations, word alignments can be found by aligning all the characters between a pair of language snippets, and identifying aligned words as those that have at least one aligned letter in common.

Type: Grant

Filed: January 11, 2018

Date of Patent: November 12, 2019

Assignee: FACEBOOK, INC.

Inventors: Juan Miguel Pino, Matthias Gerhard Eck, Rui Andre Augusto Ferreira
Transliteration of text entry across scripts

Patent number: 10402489

Abstract: Embodiments are disclosed for transliterating text entries across different script systems. A method according to some embodiments includes steps of: receiving an input string in a first script system input using a keyboard; segmenting, using a probabilistic model, the input string into phonemes that correspond to characters or sets of characters in a second script system; converting the phonemes in the first script system into the characters or sets of characters in the second script system, the characters or sets of characters forming a word or a word prefix in the second script system; and outputting the word or the word prefix in the second script system.

Type: Grant

Filed: December 21, 2016

Date of Patent: September 3, 2019

Assignee: FACEBOOK, INC.

Inventors: Juan Miguel Pino, Stanislav Funiak, Mridul Malpani, Gaurav Lochan
Transliteration decoding using a tree structure

Patent number: 10394960

Abstract: Embodiments are disclosed for transliteration decoding using a tree structure. A method according to some embodiments includes steps of: generating a tree structure for an input string in a first script system, the tree structure including nodes representing segments of the input string; identifying segmentation candidates for the input string based on paths of the tree structure, the segmentation candidates segmenting the input string into character groups; selecting a segmentation candidate based on probabilities of the segmentation candidates predicted by a probabilistic model; segmenting the input string into character groups that correspond to characters in a second script system; decoding the character groups in the first script system into the characters in the second script system, the characters forming a word or a word prefix in the second script system; and outputting the word or the word prefix in the second script system.

Type: Grant

Filed: December 21, 2016

Date of Patent: August 27, 2019

Assignee: FACEBOOK, INC.

Inventors: Juan Miguel Pino, Stanislav Funiak, Mridul Malpani, Gaurav Lochan
Text Correction for Dyslexic Users on an Online Social Network

Publication number: 20190205372

Abstract: In one embodiment, a method includes identifying a plurality of dyslexic users on an online social network. The plurality of dyslexic users may be identified based on content objects posted by these users over a particular time period, where the content objects may include one or more of word-level errors or sentence-level errors. A machine-learning model may be trained for text correction using a corpus of social network data, which may include at least the content objects with one or more of word-level errors or sentence-level errors, and a corresponding set of corrected content objects. A text string including one or more errors may be received from a client system associated with a first user. The text string may be transformed into a vector representation using an encoder of the machine-learning model. A corrected text string may be generated from the vector representation using a decoder of the machine-learning model.

Type: Application

Filed: January 2, 2018

Publication date: July 4, 2019

Inventors: Xian Li, Irina-Elena Veliche, Debnil Sur, Shaomei Wu, Amit Bahl, Juan Miguel Pino
MACHINE-TRANSLATION BASED CORRECTIONS

Publication number: 20190018837

Abstract: Technology is disclosed for building correction models that correct natural language snippets. Correction models can include rules comprising pairs of word sequences identified from viable correction snippet pairs, where a first sequence of words in the pair should be replaced with a second sequence of words in the pair. Viable correction snippet pairs can be identified from among pairs of language snippets, such as a post to a social media website and a subsequent update to that post. Viable corrections can be the snippet pairs that both have no more unaligned words than a word alignment threshold and have no aligned word pair with a character edit difference above an edit distance threshold. In some implementations, word alignments can be found by aligning all the characters between a pair of language snippets, and identifying aligned words as those that have at least one aligned letter in common.

Type: Application

Filed: January 11, 2018

Publication date: January 17, 2019

Inventors: Juan Miguel Pino, Matthias Gerhard Eck, Rui Andre Augusto Ferreira
TRANSLITERATION DECODING USING A TREE STRUCTURE

Publication number: 20180173689

Abstract: Embodiments are disclosed for transliteration decoding using a tree structure. A method according to some embodiments includes steps of: generating a tree structure for an input string in a first script system, the tree structure including nodes representing segments of the input string; identifying segmentation candidates for the input string based on paths of the tree structure, the segmentation candidates segmenting the input string into character groups; selecting a segmentation candidate based on probabilities of the segmentation candidates predicted by a probabilistic model; segmenting the input string into character groups that correspond to characters in a second script system; decoding the character groups in the first script system into the characters in the second script system, the characters forming a word or a word prefix in the second script system; and outputting the word or the word prefix in the second script system.

Type: Application

Filed: December 21, 2016

Publication date: June 21, 2018

Inventors: Juan Miguel Pino, Stanislav Funiak, Mridul Malpani, Guarav Lochan
TRANSLITERATION USING MACHINE TRANSLATION PIPELINE

Publication number: 20180174572

Abstract: Embodiments are disclosed for transliteration based on a machine translation model training pipeline. A method according to some embodiments includes steps of: training a probabilistic model for transliteration from a first script system to a second script system using a machine translation model training pipeline; segmenting, using the probabilistic model, an input string in the first script system into phonemes that correspond to characters in the second script system; converting the phonemes in the first script system into the characters in the second script system, the characters forming a word or a word prefix in the second script system; and outputting the word or the word prefix in the second script system.

Type: Application

Filed: December 21, 2016

Publication date: June 21, 2018

Inventors: Juan Miguel Pino, Madhu Ramanathan
TRANSLITERATION OF TEXT ENTRY ACROSS SCRIPTS

Publication number: 20180173695

Abstract: Embodiments are disclosed for transliterating text entries across different script systems. A method according to some embodiments includes steps of: receiving an input string in a first script system input using a keyboard; segmenting, using a probabilistic model, the input string into phonemes that correspond to characters or sets of characters in a second script system; converting the phonemes in the first script system into the characters or sets of characters in the second script system, the characters or sets of characters forming a word or a word prefix in the second script system; and outputting the word or the word prefix in the second script system.

Type: Application

Filed: December 21, 2016

Publication date: June 21, 2018

Inventors: Juan Miguel Pino, Stanislav Funiak, Mridul Malpani, Guarav Lochan
Language model personalization

Patent number: 10002125

Abstract: Specialized language processing engines can use author-specific or reader-specific language models to improve language processing results by selecting phrases most likely to be used by an author or by tailoring output to language with which the reader is familiar. Language models that are author-specific can be generated by identifying characteristics of an author or author type such as age, gender, and location. An author-specific language model can be built using, as training data, language items written by users with the identified characteristics. Language models that are reader-specific can be generated using, as training data, language items written by or viewed by that reader. When implementing a specialized machine translation engine, multiple possible translations can be generated. An author-specific language model or a reader-specific language model can provide scores for possible translations, which can be used to select the best translation.

Type: Grant

Filed: December 28, 2015

Date of Patent: June 19, 2018

Assignee: FACEBOOK, INC.

Inventors: Juan Miguel Pino, Ying Zhang, Matthias Gerhard Eck
Machine-translation based corrections

Patent number: 9904672

Abstract: Technology is disclosed for building correction models that correct natural language snippets. Correction models can include rules comprising pairs of word sequences identified from viable correction snippet pairs, where a first sequence of words in the pair should be replaced with a second sequence of words in the pair. Viable correction snippet pairs can be identified from among pairs of language snippets, such as a post to a social media website and a subsequent update to that post. Viable corrections can be the snippet pairs that both have no more unaligned words than a word alignment threshold and have no aligned word pair with a character edit difference above an edit distance threshold. In some implementations, word alignments can be found by aligning all the characters between a pair of language snippets, and identifying aligned words as those that have at least one aligned letter in common.

Type: Grant

Filed: June 30, 2015

Date of Patent: February 27, 2018

Assignee: Facebook, Inc.

Inventors: Juan Miguel Pino, Matthias Gerhard Eck, Rui Andre Augusto Ferreira
LANGUAGE MODEL PERSONALIZATION

Publication number: 20170185583

Abstract: Specialized language processing engines can use author-specific or reader-specific language models to improve language processing results by selecting phrases most likely to be used by an author or by tailoring output to language with which the reader is familiar. Language models that are author-specific can be generated by identifying characteristics of an author or author type such as age, gender, and location. An author-specific language model can be built using, as training data, language items written by users with the identified characteristics. Language models that are reader-specific can be generated using, as training data, language items written by or viewed by that reader. When implementing a specialized machine translation engine, multiple possible translations can be generated. An author-specific language model or a reader-specific language model can provide scores for possible translations, which can be used to select the best translation.

Type: Application

Filed: December 28, 2015

Publication date: June 29, 2017

Inventors: Juan Miguel Pino, Ying Zhang, Matthias Gerhard Eck
MACHINE-TRANSLATION BASED CORRECTIONS

Publication number: 20170004121

Abstract: Technology is disclosed for building correction models that correct natural language snippets. Correction models can include rules comprising pairs of word sequences identified from viable correction snippet pairs, where a first sequence of words in the pair should be replaced with a second sequence of words in the pair. Viable correction snippet pairs can be identified from among pairs of language snippets, such as a post to a social media website and a subsequent update to that post. Viable corrections can be the snippet pairs that both have no more unaligned words than a word alignment threshold and have no aligned word pair with a character edit difference above an edit distance threshold. In some implementations, word alignments can be found by aligning all the characters between a pair of language snippets, and identifying aligned words as those that have at least one aligned letter in common.

Type: Application

Filed: June 30, 2015

Publication date: January 5, 2017

Inventors: Juan Miguel Pino, Matthias Gerhard Eck, Rui Andre Augusto Ferreira