Patents by Inventor Ágnes Sándor

Ágnes Sándor has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 10120864
    Abstract: A method for categorizing an issue includes, for each of a plurality of categories of issue, providing at least one discourse pattern for identify text sequences that meet the discourse pattern. At least one of the discourse patterns specifies that an instance of a domain term in a domain term vocabulary be present in the text sequence for the pattern to be met. An issue is received which includes a text sequence. The text sequence is categorized based on which, if any, of the discourse patterns are met by the text sequence of the received issue. Information based on the categorization of the text sequence is output.
    Type: Grant
    Filed: March 29, 2016
    Date of Patent: November 6, 2018
    Assignee: Conduent Business Services LLC
    Inventors: Ágnes Sandor, Nikolaos Lagos, Caroline Brun, Ngoc Phuoc An Vo
  • Publication number: 20170286396
    Abstract: A method for categorizing an issue includes, for each of a plurality of categories of issue, providing at least one discourse pattern for identify text sequences that meet the discourse pattern. At least one of the discourse patterns specifies that an instance of a domain term in a domain term vocabulary be present in the text sequence for the pattern to be met. An issue is received which includes a text sequence. The text sequence is categorized based on which, if any, of the discourse patterns are met by the text sequence of the received issue. Information based on the categorization of the text sequence is output.
    Type: Application
    Filed: March 29, 2016
    Publication date: October 5, 2017
    Applicant: Xerox Corporation
    Inventors: Ágnes Sandor, Nikolaos Lagos, Caroline Brun, Ngoc Phuoc An Vo
  • Publication number: 20160343086
    Abstract: The present disclosure relates to a computer-implemented method, device, and computer-readable storage medium used for contextual linking information in a financial report. The method can include obtaining portions of the financial report; detecting one or more line items in the portions of the financial report based on one or more properties of the one or more line items; detecting one or more section headers in the portions of the financial report based on one or more properties of the one or more section headers; parsing, by a processor, the one or more line items and the one or more section headers that are detected; and linking the one or more line items to the one or more section headers based on the parsing.
    Type: Application
    Filed: May 19, 2015
    Publication date: November 24, 2016
    Inventors: Anirban Mondal, Agnes Sandor, Diana Nicoleta Popa, Anna Stavrianou, Denys Proux
  • Patent number: 9400779
    Abstract: A system and method for classifying comments are disclosed. The method includes receiving a collection of comments. Each of the comments in the collection includes text in a natural language and is associated with a previously-submitted idea submission which includes a description of an idea. The method further includes natural language processing each of the comments to identify dependencies (syntactic and/or semantic relations between text elements) in at least a part of the comment. Based on the identified dependencies, the comments are each automatically classified into one (or more) of a plurality of comment classes. The comment classes may include a first class for reaction to the content of the idea, a second class for expression of a commenter's judgment of an idea's value, and a third class for reaction to an idea generation process in which the associated idea submission is made. Information based on the assigned comment classes is output.
    Type: Grant
    Filed: June 6, 2013
    Date of Patent: July 26, 2016
    Assignee: XEROX CORPORATION
    Inventors: Gregorio Convertino, Agnes Sandor
  • Publication number: 20140365206
    Abstract: A computer-implemented system and method provide for identifying the core of an idea. The method includes receiving an idea submission which includes a textual description of an idea. The textual description of the idea is natural language processed to identify dependencies (syntactic and/or semantic relations between text elements) in at least a part of the textual description. Provision is made for identifying directive illocutionary acts in the textual description, based on the identified dependencies. The method further includes providing for identifying an idea core of the idea submission, based on an identified directive illocutionary act, where present, and outputting information based on the identified idea core.
    Type: Application
    Filed: June 6, 2013
    Publication date: December 11, 2014
    Inventors: Gregorio Convertino, Agnes Sandor
  • Publication number: 20140365207
    Abstract: A system and method for classifying comments are disclosed. The method includes receiving a collection of comments. Each of the comments in the collection includes text in a natural language and is associated with a previously-submitted idea submission which includes a description of an idea. The method further includes natural language processing each of the comments to identify dependencies (syntactic and/or semantic relations between text elements) in at least a part of the comment. Based on the identified dependencies, the comments are each automatically classified into one (or more) of a plurality of comment classes. The comment classes may include a first class for reaction to the content of the idea, a second class for expression of a commenter's judgment of an idea's value, and a third class for reaction to an idea generation process in which the associated idea submission is made. Information based on the assigned comment classes is output.
    Type: Application
    Filed: June 6, 2013
    Publication date: December 11, 2014
    Inventors: Gregorio Convertino, Agnes Sandor
  • Publication number: 20140163951
    Abstract: A machine translation method includes receiving a source text string and identifying any named entities. The identified named entities may be processed to exclude common nouns and function words. Features are extracted from the source text string relating to the identified named entities. Based on the extracted features, a protocol is selected for translating the source text string. A first translation protocol includes forming a reduced source string from the source text string in which the named entity is replaced by a placeholder, translating the reduced source string by machine translation to generate a translated reduced target string, while processing the named entity separately to be incorporated into the translated reduced target string. A second translation protocol includes translating the source text string by machine translation, without replacing the named entity with the placeholder. The target text string produced by the selected protocol is output.
    Type: Application
    Filed: December 7, 2012
    Publication date: June 12, 2014
    Applicant: XEROX CORPORATION
    Inventors: Vassilina Nikoulina, Agnes Sandor
  • Publication number: 20130346402
    Abstract: A method, system and a computer program for identifying unexplored research avenues in a plurality of publications is provided. Citation maps for the plurality of publications are generated. The initial set of publications is filtered on the basis of the citation maps and resulting set of publications are ranked according to their prestige value. Natural language processing means are used to perform context matching in order to identify set of sentences in the publications. Paragraphs containing the set of sentences are displayed to a user along with pointers to the respective publication.
    Type: Application
    Filed: June 26, 2012
    Publication date: December 26, 2013
    Applicant: XEROX CORPORATION
    Inventors: Anna Stavrianou, Agnes Sandor
  • Patent number: 8554542
    Abstract: A system and method are provided for processing an input document which enable assessment of the coherence of an abstract of the document. The method includes storing the document in memory and, for each sentence of the abstract, comparing the sentence with sentences of a main body of the document using textual entailment techniques to identify whether the sentence of the abstract entails a sentence in the main body of the document. Links can then be generated between the entailing sentences of the abstract and the corresponding entailed sentences of the document. The document and generated links are output. The links enable the coherence of the abstract to be evaluated, either manually or automatically, using an evaluation component of the system.
    Type: Grant
    Filed: May 5, 2010
    Date of Patent: October 8, 2013
    Assignee: Xerox Corporation
    Inventors: Ágnès Sandor, Guillaume Jacquet
  • Patent number: 8086557
    Abstract: A system and method for providing a factuality assessment of a retrieved information source's statement are disclosed. The method includes receiving a user's query which identifies an information source whose statements are to be retrieved, retrieving documents which refer to the information source, mapping statements in the retrieved documents to their authors, identifying as information source statements, the mapped statements that are mapped to an author which is compatible with the information source, and for at least one of the information source's statements, assessing a factuality of the information source's statement according to the information source.
    Type: Grant
    Filed: April 22, 2008
    Date of Patent: December 27, 2011
    Assignee: Xerox Corporation
    Inventors: Salah Ait-Mokhtar, Aude Rebotier, Agnes Sandor
  • Publication number: 20110276322
    Abstract: Aspects of the exemplary embodiment relate to a system and method for processing a document which enables assessment of the coherence of an abstract of the document. The method includes storing the document in memory and, for each sentence of the abstract, comparing the sentence with sentences of a main body of the document using textual entailment techniques to identify whether the sentence of the abstract entails a sentence in the main body of the document. Links can then be generated between the entailing sentences of the abstract and the corresponding entailed sentences of the document. The document and generated links are output. The links enable the coherence of the abstract to be evaluated, either manually or automatically, using an evaluation component of the system.
    Type: Application
    Filed: May 5, 2010
    Publication date: November 10, 2011
    Applicant: Xerox Corporation
    Inventors: Ágnès Sandor, Guillaume Jacquet
  • Publication number: 20110271173
    Abstract: A system and a method for filling a form are provided which take as input a user's data file, which is configured for use in filling in forms, and an image of an original form to be filled in using the user's personal data. Form filling rules encoded in the image are decoded and used to determine values of a plurality of fields of the form by applying the decoded rules to the user's data. The plurality of fields of the form are filled with the determined values to generate an at least partially filled form, which is then output, e.g., to a printer or a display. The exemplary system and method are able to operate independently of the language used in the text of the form, have the capability of filling in previously unseen forms, and are particularly suited to filling in paper forms.
    Type: Application
    Filed: May 3, 2010
    Publication date: November 3, 2011
    Applicant: Xerox Corporation
    Inventors: Salah Aït-Mokhtar, Ágnes Sándor
  • Patent number: 7809551
    Abstract: A system for retrieving documents related to a concept from a text corpus includes a set of stored semantic classes which are combinable to express the concept each class including a set of keywords, each set of keywords including at least one keyword. Syntactic rules are applied to identified text portions which include one or more of the keywords. A rule is satisfied when keywords from the first and second semantic classes are in any one of a plurality of syntactic relationships. A concept matching module identifies text portions within the text corpus which include one or more of the keywords, for applying the syntactic rules to the text portions, and for identifying those text portions which satisfy at least one of the rules. Documents to be retrieved may include at least one of the identified text portions.
    Type: Grant
    Filed: July 1, 2005
    Date of Patent: October 5, 2010
    Assignee: Xerox Corporation
    Inventors: Ágnes Sándor, Aaron Kaplan
  • Patent number: 7689411
    Abstract: A method for developing a system for retrieving text related to a selected concept within a text corpus includes identifying a set of semantic classes which express the concept and identifying a set of keywords for each of the semantic classes to be used in text searching in a text corpus. Each set of keywords includes at least one keyword. A plurality of syntactic rules are established which are to be applied to retrieved text which includes keywords. Each of the syntactic rules identifies a first of the semantic classes and a second of the semantic classes. A rule is satisfied when a keyword from the first of the semantic classes is in a syntactic relationship with a keyword from the second of the semantic classes. The syntactic relationship can be any one of a plurality of syntactic relationships.
    Type: Grant
    Filed: July 1, 2005
    Date of Patent: March 30, 2010
    Assignee: Xerox Corporation
    Inventors: Ágnes Sándor, Aaron Kaplan
  • Publication number: 20090265304
    Abstract: A system and method for providing a factuality assessment of a retrieved information source's statement are disclosed. The method includes receiving a user's query which identifies an information source whose statements are to be retrieved, retrieving documents which refer to the information source, mapping statements in the retrieved documents to their authors, identifying as information source statements, the mapped statements that are mapped to an author which is compatible with the information source, and for at least one of the information source's statements, assessing a factuality of the information source's statement according to the information source.
    Type: Application
    Filed: April 22, 2008
    Publication date: October 22, 2009
    Applicant: Xerox Corporation
    Inventors: Salah Ait-Mokhtar, Aude Rebotier, Agnes Sandor
  • Publication number: 20070005343
    Abstract: A method for developing a system for retrieving text related to a selected concept within a text corpus includes identifying a set of semantic classes which express the concept and identifying a set of keywords for each of the semantic classes to be used in text searching in a text corpus. Each set of keywords includes at least one keyword. A plurality of syntactic rules are established which are to be applied to retrieved text which includes keywords. Each of the syntactic rules identifies a first of the semantic classes and a second of the semantic classes. A rule is satisfied when a keyword from the first of the semantic classes is in a syntactic relationship with a keyword from the second of the semantic classes. The syntactic relationship can be any one of a plurality of syntactic relationships.
    Type: Application
    Filed: July 1, 2005
    Publication date: January 4, 2007
    Inventors: Agnes Sandor, Aaron Kaplan
  • Publication number: 20070005344
    Abstract: A system for retrieving documents related to a concept from a text corpus includes a set of stored semantic classes which are combinable to express the concept each class including a set of keywords, each set of keywords including at least one keyword. Syntactic rules are applied to identified text portions which include one or more of the keywords. A rule is satisfied when keywords from the first and second semantic classes are in any one of a plurality of syntactic relationships. A concept matching module identifies text portions within the text corpus which include one or more of the keywords, for applying the syntactic rules to the text portions, and for identifying those text portions which satisfy at least one of the rules. Documents to be retrieved may include at least one of the identified text portions.
    Type: Application
    Filed: July 1, 2005
    Publication date: January 4, 2007
    Inventors: Agnes Sandor, Aaron Kaplan