Patents by Inventor Caroline Brun

Caroline Brun has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 7788084
    Abstract: A parser for parsing text includes a tokenizing module which divides the text into an ordered sequence of linguistic tokens. A morphological module associates parts of speech with the linguistic tokens. A detection module identifies candidate titles of creative works, such as works of art. A filtering module filters the candidate titles of works to exclude citations of direct speech from the candidate titles of works. A comparison module compares any remaining candidate titles of works with titles of works in an associated knowledge base. The comparison module annotates the text when a match is found.
    Type: Grant
    Filed: September 19, 2006
    Date of Patent: August 31, 2010
    Assignee: Xerox Corporation
    Inventors: Caroline Brun, Caroline Hagège
  • Patent number: 7788085
    Abstract: String replacement is performed in text using linguistic processing. The linguistic processing identifies the existence of direct or indirect links between the string to be replaced and other strings in the text. Morphological, syntactic, anaphoric, or semantic inconsistencies, which are introduced in strings with the identified direct or indirect links to the string that is to be replaced are detected and corrected.
    Type: Grant
    Filed: December 17, 2004
    Date of Patent: August 31, 2010
    Assignee: Xerox Corporation
    Inventors: Caroline Brun, Herve Dejean, Caroline Hagege
  • Patent number: 7717712
    Abstract: A method for testing a language learner's ability to create semantically coherent grammatical text in a language, comprising generating text having at least one active region and inactive regions; displaying the text in a graphical user interface on a display unit, wherein at least one active region comprises a key word or phrase; identifying at least one active region in the graphical user interface; selecting at least one active region to display a menu of linguistic choices comprised of at least one grammatically correct linguistic choice and at least one grammatically incorrect linguistic choice; selecting one of the linguistic choices; and displaying an error message when at least one grammatically incorrect linguistic choice is selected.
    Type: Grant
    Filed: December 19, 2003
    Date of Patent: May 18, 2010
    Assignee: Xerox Corporation
    Inventors: Caroline Brun, Marc Dymetman
  • Publication number: 20100082331
    Abstract: A system and method of developing rules for text processing enable retrieval of instances of named entities in a predetermined semantic relation (such as the DATE and PLACE of an EVENT) by extracting patterns from text strings in which attested examples of named entities satisfying the semantic relation occur. The patterns are generalized to form rules which can be added to the existing rules of a syntactic parser and subsequently applied to text to find candidate instances of other named entities in the predetermined semantic relation.
    Type: Application
    Filed: September 30, 2008
    Publication date: April 1, 2010
    Applicant: Xerox Corporation
    Inventors: Caroline Brun, Caroline Hagege
  • Publication number: 20090204596
    Abstract: A computer implemented system and method for processing text are disclosed. Partially processed text, in which named entities have been extracted by a standard named entity system, is processed to identify attributive relations between a named entity or proper noun and a corresponding attribute. A concept for the attribute is identified and, in the case of a named entity, compared with the named entity's context, enabling a confirmation or conflict between the two to be determined. In the case of a proper name, the attribute's context can be associated with the proper name, allowing the proper name to be recognized as a new named entity.
    Type: Application
    Filed: February 8, 2008
    Publication date: August 13, 2009
    Applicant: Xerox Corporation
    Inventors: Caroline Brun, Caroline Hagege
  • Publication number: 20080319978
    Abstract: A method for named entity resolution includes parsing an input text string to identify a context in which an identified named entity of the input text string is used. The identified context is compared with at least one stored context in which the named entity in the stored context is associated with a class of named entity, the named entity class being selected from a plurality of classes, at least one of the plurality of classes corresponding to a metonymic use of a respective named entity. A named entity class is assigned to the identified named entity from the plurality of named entity classes, based on at least one of the identified context and the comparison.
    Type: Application
    Filed: August 29, 2007
    Publication date: December 25, 2008
    Inventors: Caroline Brun, Maud Ehrmann, Guillaume Jacquet
  • Patent number: 7386550
    Abstract: Named entities in a document are identified. Each named entity is classified as either anonymous or public based on analysis including at least syntactic analysis of one or more portions of the document containing the named entity. In one suitable approach, each named person entity is classified by default as anonymous, and each named entity that is not a named person is classified by default as public. Named entities are selectively re-classified based on evidence contained in the document indicating that the default classification is incorrect. The classification of a named entity as either anonymous or public is propagated to multiple occurrences of that named entity in the document Those named entities classified as anonymous are anonymized.
    Type: Grant
    Filed: August 12, 2005
    Date of Patent: June 10, 2008
    Assignee: Xerox Corporation
    Inventor: Caroline Brun
  • Patent number: 7383171
    Abstract: A method and apparatus converts input data such as short notes into a global text realization to provide semantically-coherent grammatical text. In various exemplary embodiments, an individual inputs short notes into a computer system, the computer system associates local text realizations with the short notes. Subsequently, the user may select the appropriate local text realizations, which may be converted to semantic representations and to semantically coherent grammatical text or a global text realization.
    Type: Grant
    Filed: December 5, 2003
    Date of Patent: June 3, 2008
    Assignee: Xerox Corporation
    Inventors: Marc Dymetman, Caroline Brun, Aurelien Max
  • Publication number: 20080071519
    Abstract: A parser for parsing text includes a tokenizing module which divides the text into an ordered sequence of linguistic tokens. A morphological module associates parts of speech with the linguistic tokens. A detection module identifies candidate titles of creative works, such as works of art. A filtering module filters the candidate titles of works to exclude citations of direct speech from the candidate titles of works. A comparison module compares any remaining candidate titles of works with titles of works in an associated knowledge base. The comparison module annotates the text when a match is found.
    Type: Application
    Filed: September 19, 2006
    Publication date: March 20, 2008
    Inventors: Caroline Brun, Caroline Hagege
  • Publication number: 20070168430
    Abstract: An email organizer operates in conjunction with an email system (20) and a natural language processor (42, 44). An action deadline detector (50) detects action deadlines contained in email messages (30) based on syntactic information about the email messages provided by the natural language processor. A scorer (56) assigns priority scores to the email messages based at least on the action deadlines and a current date (58).
    Type: Application
    Filed: November 23, 2005
    Publication date: July 19, 2007
    Inventors: Caroline Brun, Caroline Hagege
  • Publication number: 20070038437
    Abstract: Named entities in a document are identified. Each named entity is classified as either anonymous or public based on analysis including at least syntactic analysis of one or more portions of the document containing the named entity. In one suitable approach, each named person entity is classified by default as anonymous, and each named entity that is not a named person is classified by default as public. Named entities are selectively re-classified based on evidence contained in the document indicating that the default classification is incorrect. The classification of a named entity as either anonymous or public is propagated to multiple occurrences of that named entity in the document Those named entities classified as anonymous are anonymized.
    Type: Application
    Filed: August 12, 2005
    Publication date: February 15, 2007
    Inventor: Caroline Brun
  • Publication number: 20060136352
    Abstract: String replacement is performed in text using linguistic processing. The linguistic processing identifies the existence of direct or indirect links between the string to be replaced and other strings in the text. Morphological, syntactic, anaphoric, or semantic inconsistencies, which are introduced in strings with the identified direct or indirect links to the string that is to be replaced are detected and corrected.
    Type: Application
    Filed: December 17, 2004
    Publication date: June 22, 2006
    Inventors: Caroline Brun, Herve Dejean, Caroline Hagega
  • Publication number: 20060136223
    Abstract: A bilingual authoring apparatus includes a user interface (20) for inputting partially translated text including a text portion in a source language and surrounding or adjacent text in a target language. A bilingual dictionary (34) associates words and phrases in the target language and words and phrases in a source language. A context sensitive translation tool (30, 32, 38) communicates with the user interface, receives the partially translated text, and provides at least one proposed translation in the target language of the text portion in the source language. The at least one proposed translation in the target language is derived from the bilingual dictionary based on contextual analysis of at least a portion of the partially translated text.
    Type: Application
    Filed: December 21, 2004
    Publication date: June 22, 2006
    Inventors: Caroline Brun, Marc Dymetman, Frederique Segond
  • Publication number: 20060136196
    Abstract: A linguistic rewriting rule for use in linguistic processing of an ordered sequence of linguistic tokens includes a token pattern recognition rule that matches the ordered sequence of linguistic tokens with a syntactical pattern. The token pattern recognition rule incorporates a character pattern recognition rule to match characters contained in an ambiguous portion of the ordered sequence of linguistic tokens with a character pattern defining a corresponding portion of the syntactical pattern.
    Type: Application
    Filed: December 21, 2004
    Publication date: June 22, 2006
    Inventors: Caroline Brun, Caroline Hagege, Claude Roux
  • Publication number: 20050137847
    Abstract: A method for testing a language learner's ability to create semantically coherent grammatical text in a language, comprising generating text having at least one active region and inactive regions; displaying the text in a graphical user interface on a display unit, wherein at least one active region comprises a key word or phrase; identifying at least one active region in the graphical user interface; selecting at least one active region to display a menu of linguistic choices comprised of at least one grammatically correct linguistic choice and at least one grammatically incorrect linguistic choice; selecting one of the linguistic choices; and displaying an error message when at least one grammatically incorrect linguistic choice is selected.
    Type: Application
    Filed: December 19, 2003
    Publication date: June 23, 2005
    Applicant: XEROX CORPORATION
    Inventors: Caroline Brun, Marc Dymetman
  • Publication number: 20050138556
    Abstract: Normalized output texts, such as rundowns or summaries, from raw texts belonging to a given domain are produced. The normalized output text may be generated in different languages and may take into account a user's interest. To this end, linguistic resources associated with a model of the domain are used both for input text analysis and output text generation.
    Type: Application
    Filed: December 18, 2003
    Publication date: June 23, 2005
    Inventors: Caroline Brun, Jean-Pierre Chanod, Caroline Hagege
  • Publication number: 20050125219
    Abstract: A method and apparatus converts input data such as short notes into a global text realization to provide semantically-coherent grammatical text. In various exemplary embodiments, an individual inputs short notes into a computer system, the computer system associates local text realizations with the short notes. Subsequently, the user may select the appropriate local text realizations, which may be converted to semantic representations and to semantically coherent grammatical text or a global text realization.
    Type: Application
    Filed: December 5, 2003
    Publication date: June 9, 2005
    Applicant: XEROX CORPORATION
    Inventors: Marc Dymetman, Caroline Brun, Aurelien Max
  • Patent number: 6405162
    Abstract: In semantically disambiguating words, where more than one disambiguation applies to the context in which a word occurs, a rule can be selected based on the type of information from which it was obtained. The rules can be derived from different types of information in a corpus such as a dictionary, and rules can be selected in accordance with a prioritization of the types of information.
    Type: Grant
    Filed: September 23, 1999
    Date of Patent: June 11, 2002
    Assignee: Xerox Corporation
    Inventors: Frédérique Segond, Caroline Brun