Patents by Inventor Jean-Marc Coursimault
Jean-Marc Coursimault has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 9576052Abstract: Methods and systems for dynamically training a web crawler. The web crawler maintains one or more categories each comprising a set of words. The method includes selecting at least one hyperlink in response to a query received from a user. The method further includes determining a hyperlink score for the at least one hyperlink based on a category score associated with each of one or more categories. The category score associated with each of the one or more categories is updated based at least in part on the hyperlink score. The updated category score is compared with the hyperlink score to select a category from the one or more categories. The set of words associated with the category is updated based on content of a web page pointed by the at least one hyperlink.Type: GrantFiled: July 16, 2013Date of Patent: February 21, 2017Assignee: XEROX CORPORATIONInventors: Nidhi Singh, Jean-Marc Coursimault, Herve Poirier, Nicolas Monet
-
Patent number: 9552355Abstract: A system and a method for phrase-based translation are disclosed. The method includes receiving source language text to be translated into target language text. One or more dynamic bi-phrases are generated, based on the source text and the application of one or more rules, which may be based on user descriptions. A dynamic feature value is associated with each of the dynamic bi-phrases. For a sentence of the source text, static bi-phrases are retrieved from a bi-phrase table, each of the static bi-phrases being associated with one or more values of static features. Any of the dynamic bi-phrases which each cover at least one word of the source text are also retrieved, which together form a set of active bi-phrases. Translation hypotheses are generated using active bi-phrases from the set and scored with a translation scoring model which takes into account the static and dynamic feature values of the bi-phrases used in the respective hypothesis. A translation, based on the hypothesis scores, is then output.Type: GrantFiled: May 20, 2010Date of Patent: January 24, 2017Assignee: XEROX CORPORATIONInventors: Marc Dymetman, Wilker Ferreira Aziz, Nicola Cancedda, Jean-Marc Coursimault, Vassilina Nikoulina, Lucia Specia
-
Patent number: 9189557Abstract: A web page identified by a URL stored in a downloads queue is downloaded, and hyperlinks in the downloaded web page are identified. Each hyperlink is screened by parsing the hyperlink (optionally only the URL of the hyperlink) to identify features comprising character strings, computing for each feature values for one or more meta-features indicative of the hyperlinked web page being in a target language, aggregating the meta-feature values to generate a score for the hyperlink, and adding the URL of the hyperlink to the downloads queue conditional upon the score satisfying a screening criterion. The downloading, identifying, and screening are iteratively repeated to perform web crawling, and an index of web pages in the target language is constructed based on analysis of content of the downloaded web pages. The meta-features may include a transliterated target word meta-feature, a language code meta-feature, a country code meta-feature, or so forth.Type: GrantFiled: March 11, 2013Date of Patent: November 17, 2015Assignee: XEROX CORPORATIONInventors: Nidhi Singh, Jean-Marc Coursimault, Nicolas Monet, Herve Poirer
-
Publication number: 20150026152Abstract: Methods and systems for dynamically training a web crawler. The web crawler maintains one or more categories each comprising a set of words. The method includes selecting at least one hyperlink in response to a query received from a user. The method further includes determining a hyperlink score for the at least one hyperlink based on a category score associated with each of one or more categories. The category score associated with each of the one or more categories is updated based at least in part on the hyperlink score. The updated category score is compared with the hyperlink score to select a category from the one or more categories. The set of words associated with the category is updated based on content of a web page pointed by the at least one hyperlink.Type: ApplicationFiled: July 16, 2013Publication date: January 22, 2015Inventors: Nidhi Singh, Jean-Marc Coursimault, Herve Poirier, Nicolas Monet
-
Publication number: 20140258261Abstract: A web page identified by a URL stored in a downloads queue is downloaded, and hyperlinks in the downloaded web page are identified. Each hyperlink is screened by parsing the hyperlink (optionally only the URL of the hyperlink) to identify features comprising character strings, computing for each feature values for one or more meta-features indicative of the hyperlinked web page being in a target language, aggregating the meta-feature values to generate a score for the hyperlink, and adding the URL of the hyperlink to the downloads queue conditional upon the score satisfying a screening criterion. The downloading, identifying, and screening are iteratively repeated to perform web crawling, and an index of web pages in the target language is constructed based on analysis of content of the downloaded web pages. The meta-features may include a transliterated target word meta-feature, a language code meta-feature, a country code meta-feature, or so forth.Type: ApplicationFiled: March 11, 2013Publication date: September 11, 2014Applicant: Xerox CorporationInventors: Nidhi Singh, Jean-Marc Coursimault, Nicolas Monet, Herve Poirer
-
Publication number: 20110288852Abstract: A system and a method for phrase-based translation are disclosed. The method includes receiving source language text to be translated into target language text. One or more dynamic bi-phrases are generated, based on the source text and the application of one or more rules, which may be based on user descriptions. A dynamic feature value is associated with each of the dynamic bi-phrases. For a sentence of the source text, static bi-phrases are retrieved from a bi-phrase table, each of the static bi-phrases being associated with one or more values of static features. Any of the dynamic bi-phrases which each cover at least one word of the source text are also retrieved, which together form a set of active bi-phrases. Translation hypotheses are generated using active bi-phrases from the set and scored with a translation scoring model which takes into account the static and dynamic feature values of the bi-phrases used in the respective hypothesis. A translation, based on the hypothesis scores, is then output.Type: ApplicationFiled: May 20, 2010Publication date: November 24, 2011Applicant: Xerox CorporationInventors: Marc Dymetman, Wilker Ferreira Aziz, Nicola Cancedda, Jean-Marc Coursimault, Vassilina Nikoulina, Lucia Specia
-
Publication number: 20110270771Abstract: A system and method are disclosed for determining when to escalate from one support level to a higher support level in assisting a user resolve a problem with a device. The method includes storing information on past users' interactions with a self help server during prior self help sessions, the self help server providing a first remote support level for troubleshooting a problem with a device. Information is stored on a current user's interactions with the self help server during a current self help session for troubleshooting the current user's problem with a device. A determination of when to provide for escalation to a higher remote support level for troubleshooting the current user's problem with the device is based on the stored information on the past users' interactions and the current user's interactions.Type: ApplicationFiled: May 3, 2010Publication date: November 3, 2011Applicant: Xerox CorporationInventors: Jean-Marc Coursimault, Maria Antonietta Grasso, Frederic Roulland, Stefania Castellani