Patents by Inventor Michel Decary

Michel Decary has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20180121545
    Abstract: The disclosed methods, systems, and apparatus use Natural Language Processing (NLP) in conjunction with a world model and cognitive frames to semantically analyze, understand, rank, store, and retrieve digital text. The goal is to improve the relevance, usefulness and efficiency of information search. The world model represents things existing in the real world whereas cognitive frames specify possible user interaction with such a world. Using NLP in conjunction with a world model and cognitive frames to understand text is an advancement in automated text analysis. It addresses three serious shortcomings of the existing search technology: the inadequate measure of the meaningful content in web pages; a poor understanding of users' goals and tasks in their search and, the irrelevant search results. The disclosed methods have led to the successful implementation of a full-scale semantic search engine in medicine, and they are applicable and adaptable to other disciplines.
    Type: Application
    Filed: September 17, 2017
    Publication date: May 3, 2018
    Applicant: Cogilex R&D inc.
    Inventors: Mei Chen, Michel Decary
  • Patent number: 7356761
    Abstract: Computer method and apparatus determines content type of contents of a subject Web page. A predefined set of potential content types is first provided. For each potential content type, there are one or more tests having test results that enable quantitative evaluation of the contents of the subject Web page. A respective probability of each potential content type being detected in some contents of the subject Web page is determined. A Bayesian network combines the test results to provide indications of the types of contents detected on the subject Web page. A confidence level per detected content type is also provided. A database stores the determined probabilities and confidence levels, and thus provides a cross reference between Web pages and respective content types of contents found on the Web pages.
    Type: Grant
    Filed: January 24, 2001
    Date of Patent: April 8, 2008
    Assignee: Zoom Information, Inc.
    Inventors: Kosmas Karadimitriou, Jonathan Stern, Michel Decary, Jeremy W. Rothman-Shore
  • Publication number: 20070027672
    Abstract: Computer method and apparatus for extracting information from a Web page is disclosed. The invention apparatus is formed of an extractor coupled to receive Web pages from a source. The extractor uses natural language processing to extract desired information from the Web page. A storage subsystem receives from the extractor the extracted desired information and stores the extracted desired information in a database. The invention method for extracting data from a Web page includes the computer implemented steps of (i) using natural language processing, finding possible formal names on a given Web page, (ii) using pattern matching, searching the given Web page for formal names not found by the natural language processing, and (iii) refining a combined set of the found formal names to produce a working set of people and organization names extracted from the given Web page. The refining includes determining aliases of respective people and organization names, so as to effectively reduce duplicate names.
    Type: Application
    Filed: May 18, 2006
    Publication date: February 1, 2007
    Inventors: Michel Decary, Jonathan Stern, Kosmas Karadimitriou, Jeremy Rothman-Shore
  • Patent number: 7065483
    Abstract: Computer method and apparatus for extracting information from a Web page is disclosed. The invention apparatus is formed of an extractor coupled to receive Web pages from a source. The extractor uses natural language processing to extract desired information from the Web page. A storage subsystem receives from the extractor the extracted desired information and stores the extracted desired information in a database. The invention method for extracting data from a Web page includes the computer implemented steps of (i) using natural language processing, finding possible formal names on a given Web page, (ii) using pattern matching, searching the given Web page for formal names not found by the natural language processing, and (iii) refining a combined set of the found formal names to produce a working set of people and organization names extracted from the given Web page. The refining includes determining aliases of respective people and organization names, so as to effectively reduce duplicate names.
    Type: Grant
    Filed: July 20, 2001
    Date of Patent: June 20, 2006
    Assignee: Zoom Information, Inc.
    Inventors: Michel Decary, Jonathan Stern, Kosmas Karadimitriou, Jeremy W. Rothman-Shore
  • Patent number: 7054886
    Abstract: A database is formed and maintained by computer-automated means extracting information from a global computer network. The database contains information about people and organizations. The present invention method provides continual updates to the information stored in the database by the people named in the database and by the automated means. Integrity of the automatically extracted information is maintained. A link from the invention database to a third party data system provides updates in the information in the database to be communicated to the third party data system for updating and maintaining data of the third party data system. The database may serve as an email communication clearinghouse where senders do not need to know the email address of a person named in the database but rather leaves messages through that person's record in the database. Targeted advertising to a named person is provided during his accessing the database.
    Type: Grant
    Filed: July 27, 2001
    Date of Patent: May 30, 2006
    Assignee: Zoom Information, Inc.
    Inventors: Jonathan Stern, Jeremy W. Rothman-Shore, Kosmas Karadimitriou, Michel Decary
  • Patent number: 6983282
    Abstract: Computer processing method and apparatus for searching and retrieving Web pages to collect people and organization information are disclosed. A Web site of potential interest is accessed. A subset of Web pages from the accessed site are determined for processing. According to types of contents found on a subject Web page, extraction of people and organization information is enabled. Internal links of a Web site are collected and recorded in a links-to-visit table. To avoid duplicate processing of Web sites, unique identifiers or Web site signatures are utilized. Respective time thresholds (time-outs) for processing a Web site and for processing a Web page are employed.
    Type: Grant
    Filed: March 30, 2001
    Date of Patent: January 3, 2006
    Assignee: Zoom Information, Inc.
    Inventors: Jonathan Stern, Kosmas Karadimitriou, Jeremy W. Rothman-Shore, Michel Decary
  • Patent number: 6965857
    Abstract: A method and apparatus for deriving information from a textual representation of a sentence are provided, the sentence having a plurality of words. The apparatus comprises an input, a processing unit and an output. The input is for receiving data elements indicative of the textual representation of the sentence. The processing unit is operative for processing the textual representation of the sentence to derive an information record on the basis of a set of information extraction rules, the information record being indicative of a semantic representation of at least part of the sentence. The information record is then released at the output. A computer readable medium comprising a program element suitable for execution by a computing apparatus for deriving information from a textual representation of a sentence is also provided.
    Type: Grant
    Filed: June 2, 2000
    Date of Patent: November 15, 2005
    Assignee: Cogilex Recherches & Developpement Inc.
    Inventor: Michel Décary
  • Patent number: 6778986
    Abstract: Computer method and apparatus identifies content owner of a Web site. A collecting step or element collects candidate names from the subject Web site. For each candidate name, a test module (or testing step) runs tests that provide quantitative/statistical evaluation of the candidate name being the content owner name of the subject Web site. The test results are combined mathematically, such as by a Bayesian network, into an indication of content owner name.
    Type: Grant
    Filed: November 1, 2000
    Date of Patent: August 17, 2004
    Assignee: Eliyon Technologies Corporation
    Inventors: Jonathan Stern, Kosmas Karadimitriou, Michel Decary, Jeremy W. Rothman-Shore
  • Patent number: 6618717
    Abstract: Computer method and apparatus identifies content owner of a Web site. A collecting step or element collects candidate names from the subject Web site. For each candidate name, a test module (or testing step) runs tests that provide quantitative/statistical evaluation of the candidate name being the content owner name of the subject Web site. The test results are combined mathematically, such as by a Bayesian network, into an indication of content owner name. The determined indications of content owner name correlated with respective Web site are stored in a database or other storage means.
    Type: Grant
    Filed: November 1, 2000
    Date of Patent: September 9, 2003
    Assignee: Eliyon Technologies Corporation
    Inventors: Kosmas Karadimitriou, Jonathan Stern, Michel Decary, Jeremy W. Rothman-Shore
  • Publication number: 20020138525
    Abstract: Computer method and apparatus determines content type of contents of a subject Web page. A predefined set of potential content types is first provided. For each potential content type, there are one or more tests having test results that enable quantitative evaluation of the contents of the subject Web page. A respective probability of each potential content type being detected in some contents of the subject Web page is determined. A Bayesian network combines the test results to provide indications of the types of contents detected on the subject Web page. A confidence level per detected content type is also provided. A database stores the determined probabilities and confidence levels, and thus provides a cross reference between Web pages and respective content types of contents found on the Web pages.
    Type: Application
    Filed: January 24, 2001
    Publication date: September 26, 2002
    Applicant: Eliyon Technologies Corporation
    Inventors: Kosmas Karadimitriou, Jonathan Stern, Michel Decary, Jeremy W. Rothman-Shore
  • Publication number: 20020091688
    Abstract: Computer method and apparatus for extracting information from a Web page is disclosed. The invention apparatus is formed of an extractor coupled to receive Web pages from a source. The extractor uses natural language processing to extract desired information from the Web page. A storage subsystem receives from the extractor the extracted desired information and stores the extracted desired information in a database. The invention method for extracting data from a Web page includes the computer implemented steps of (i) using natural language processing, fmding possible formal names on a given Web page, (ii) using pattern matching, searching the given Web page for formal names not found by the natural language processing, and (iii) refining a combined set of the found formal names to produce a working set of people and organization names extracted from the given Web page. The refining includes determining aliases of respective people and organization names, so as to effectively reduce duplicate names.
    Type: Application
    Filed: July 20, 2001
    Publication date: July 11, 2002
    Applicant: Eliyon Technologies Corporation
    Inventors: Michel Decary, Jonathan Stern, Kosmas Karadimitriou, Jeremy W. Rothman-Shore
  • Publication number: 20020059251
    Abstract: A database is formed and maintained by computer-automated means extracting information from a global computer network. The database contains information about people and organizations. The present invention method provides continual updates to the information stored in the database by the people named in the database and by the automated means. Integrity of the automatically extracted information is maintained. A link from the invention database to a third party data system provides updates in the information in the database to be communicated to the third party data system for updating and maintaining data of the third party data system. The database may serve as an email communication clearinghouse where senders do not need to know the email address of a person named in the database but rather leaves messages through that person's record in the database. Targeted advertising to a named person is provided during his accessing the database.
    Type: Application
    Filed: July 27, 2001
    Publication date: May 16, 2002
    Applicant: Eliyon Technologies Corporation
    Inventors: Jonathan Stern, Jeremy W. Rothman-Shore, Kosmas Karadimitriou, Michel Decary
  • Publication number: 20020052928
    Abstract: Computer processing means and method for searching and retrieving Web pages to collect people and organization information are disclosed. A Web site of potential interest is accessed. A subset of Web pages from the accessed site are determined for processing. According to types of contents found on a subject Web page, extraction of people and organization information is enabled. Internal links of a Web site are collected and recorded in a links-to-visit table. To avoid duplicate processing of Web sites, unique identifiers or Web site signatures are utilized. Respective time thresholds (time-outs) for processing a Web site and for processing a Web page are employed. A database is maintained for storing indications of domain URL's, names of respective owners of the URL's as identified from the corresponding Web sites, type of each Web site, processing frequencies, dates of last processings, outcomes of last processings, size of each domain and number of data items found in last processing of each Web site.
    Type: Application
    Filed: March 30, 2001
    Publication date: May 2, 2002
    Applicant: Eliyon Technologies Corporation
    Inventors: Jonathan Stern, Kosmas Karadimitriou, Jeremy W. Rothman-Shore, Michel Decary
  • Publication number: 20020032740
    Abstract: A computer automated method and system mines from a global computer network, information about people and organizations. The system includes automated crawling means, a distributor controlling the crawling means processing, an extractor storing extracted information of interest in a database, an integrator and post-processor. The integrater resolves duplicate information and combines related information in the database. The post-processor analyzes the database contents and generates (by deduction) missing information. Also disclosed is a method and means for generating a business email address of a person named in the database, from the respective organization named in the database for the subject person.
    Type: Application
    Filed: July 30, 2001
    Publication date: March 14, 2002
    Applicant: Eliyon Technologies Corporation
    Inventors: Jonathan Stern, Jeremy W. Rothman-Shore, Kosmas Karadimitriou, Michel Decary