Taxonomy Discovery Patents (Class 707/777)
  • Patent number: 7974984
    Abstract: A system and method may include retrieving a first taxonomy comprising at least one first category and one or more second taxonomies, at least one second category being associated with at least one of the one or more second taxonomies. The system and method may further include creating a new taxonomy by merging the first taxonomy with the second taxonomy based on a comparison of a first category profile of the at least one first category with a second category profile of the at least one second category.
    Type: Grant
    Filed: April 19, 2007
    Date of Patent: July 5, 2011
    Assignee: Mobile Content Networks, Inc.
    Inventor: Phyllis Reuther
  • Patent number: 7970767
    Abstract: One or more classification algorithms are applied to at least one natural language document in order to extract both attributes and values of a given product. Supervised classification algorithms, semi-supervised classification algorithms, unsupervised classification algorithms or combinations of such classification algorithms may be employed for this purpose. The at least one natural language document may be obtained via a public communication network. Two or more attributes (or two or more values) thus identified may be merged to form one or more attribute phrases or value phrases. Once attributes and values have been extracted in this manner, association or linking operations may be performed to establish attribute-value pairs that are descriptive of the product. In a presently preferred embodiment, an (unsupervised) algorithm is used to generate seed attributes and values which can then support a supervised or semi-supervised classification algorithm.
    Type: Grant
    Filed: April 30, 2007
    Date of Patent: June 28, 2011
    Assignee: Accenture Global Services Limited
    Inventors: Katharina Probst, Rayid Ghani, Andrew E. Fano, Marko Krema, Yan Liu
  • Patent number: 7970764
    Abstract: A system, method and computer program product for navigating categorized information, including (a) a two-dimensional map displayed to a user on a screen, the map showing search terms relating to a subject matter, where the display of the search terms corresponds to relationship between the terms, and wherein a manner of display of the terms corresponds to their relative importance to the subject matter; and (b) a neural network underlying the map, wherein the manner of display and a selection of the search terms is derived from the neural network. The manner of display includes font color, font size, font transparency, distance between search terms and positioning of the search terms within the map. Positioning of a cursor over one of the search terms rearranges the search terms on the map to correspond to an increased relevance of the one of the search terms, while the cursor is over the one of the search terms.
    Type: Grant
    Filed: May 4, 2009
    Date of Patent: June 28, 2011
    Assignee: Dranias Development LLC
    Inventor: Alexander V. Ershov
  • Patent number: 7962512
    Abstract: A federated system and methods and mechanisms of implementing and using such a system is disclosed. In some embodiments, one or more mappings are created between a taxonomy view at a node and one or more taxonomies of one or more data sources. The one or more data sources can then be accessed via the taxonomy view. In other embodiments, one or more mappings are created between content from different data sources and content from those data sources are merged using the one or more mappings.
    Type: Grant
    Filed: April 16, 2010
    Date of Patent: June 14, 2011
    Assignee: Cadence Design Systems, Inc.
    Inventors: Steven Sholtis, Terry LeClair, Kenneth Jerome Henderson
  • Patent number: 7930288
    Abstract: Systems, methods, and other embodiments associated with extracting knowledge from application data and maintaining an ontology based on the extracted knowledge are described. One example system includes a mapping logic to store mappings between application objects and ontology classes and an information extraction (IE) logic that accesses the mapping logic to identify application data to process based on the mappings. The application data may be stored in application data repositories belonging to an enterprise and may be characterized by the application object. Having identified application data to process, the IE logic may locate data in the application data repositories and selectively manipulate an ontology based on selected application data elements.
    Type: Grant
    Filed: February 28, 2007
    Date of Patent: April 19, 2011
    Assignee: Oracle International Corp.
    Inventors: Joaquin A. Delgado, Muralidhar Krishnaprasad, Ciya Liao
  • Patent number: 7895233
    Abstract: A first subset of attributes of documents responsive to at least one restriction may be displayed. Thereafter, a selection of a graphical user interface element associated with one of the attribute in the first subset may be received resulting in a display of a window comprising an alphanumeric input element. A key word search query may be received in the input element so that a second subset of attributes of documents responsive to the at least one restriction, the attribute associated with the selected graphical user interface element, and the key word search query may be displayed. Related methods, apparatuses, computer program products, and computer systems are also described.
    Type: Grant
    Filed: December 28, 2005
    Date of Patent: February 22, 2011
    Assignee: SAP AG
    Inventors: Achim Weigel, Darin Krasle, Hans-Juergen Richstein
  • Patent number: 7885918
    Abstract: A method and system is provided for managing business taxonomy. The system comprises an indexing engine for indexing content of source business oriented metadata. The indexing engine has a content scanner for reading the business oriented metadata, defining taxonomy of the business oriented metadata, and building a content index of the business oriented metadata including a subject index representing the taxonomy of the business oriented metadata. The system also comprises an index store for storing the content index of the business oriented metadata, and a taxonomy engine for providing taxonomy services to users using the content index.
    Type: Grant
    Filed: July 28, 2006
    Date of Patent: February 8, 2011
    Assignee: International Business Machines Corporation
    Inventor: Craig Statchuk
  • Patent number: 7882128
    Abstract: Methods and apparatus, including computer program products, implementing and using techniques for pattern detection in input data containing several transactions, each transaction having at least one item. Filter conditions for interesting patterns are received, and a first set of filter conditions applicable in connection with generation of candidate patterns is determined. An evaluated candidate pattern is selected as a parent candidate pattern, and evaluation information about the parent candidate pattern is maintained. Child candidate patterns are generated by extending the parent candidate pattern and taking into account the first set of filter conditions. The child candidate patterns are evaluated with respect to the input data together in sets of similar candidate patterns and based on the evaluation information about the parent candidate pattern. At least one child candidate pattern successfully passing the evaluation step is recursively used as a parent candidate pattern.
    Type: Grant
    Filed: February 6, 2007
    Date of Patent: February 1, 2011
    Assignee: International Business Machines Corporation
    Inventors: Toni Bollinger, Ansgar Dorneich, Christoph Lingenfelder
  • Patent number: 7877407
    Abstract: A database server contains pointers to useful information, such as on the World Wide Web. Users of the server may have hypertext links added automatically into documents they submit. Users may additionally contribute to the link database, thereby extending it, and may add additional qualifying information pertaining to the links.
    Type: Grant
    Filed: February 15, 2007
    Date of Patent: January 25, 2011
    Inventor: Julius O. Smith, III
  • Publication number: 20100274809
    Abstract: A process is disclosed for retrieving information in large heterogeneous data bases, wherein information retrieval through visual querying/browsing is supported by dynamic taxonomies; the process comprises the steps of: initially showing (F1) a complete taxonomy for the retrieval; refining (F2) the retrieval through a selection of subsets of interest, where the refining is performed by selecting concepts in the taxonomy and combining them through boolean operations; showing (F3) a reduced taxonomy for the selected set; and further refining (F4) the retrieval through an iterative execution of the refining and showing steps.
    Type: Application
    Filed: July 1, 2010
    Publication date: October 28, 2010
    Inventor: Giovanni Sacco
  • Patent number: 7822769
    Abstract: Systems and methods are provided for analysis of financial and business information based on interactive data, such as XBRL data. According to one embodiment, a method is provided for mapping extended taxonomy elements to base taxonomy elements. A list of base taxonomy elements is displayed on a display device. A taxonomy map is displayed on the display device. The taxonomy map includes information regarding one or more extended taxonomy elements of a reporting entity that are not mapped to any base taxonomy elements. Responsive to one or more user input events corresponding to a selection of a base taxonomy element and corresponding to a request to map an extended taxonomy element to the selected base taxonomy element, the compatibility of the selected base taxonomy element with the extended taxonomy element is validated. If the compatibility is affirmed, then an association is formed between the extended taxonomy element and the selected base taxonomy element.
    Type: Grant
    Filed: August 30, 2007
    Date of Patent: October 26, 2010
    Assignee: Rivet Software, Inc.
    Inventors: Michael L. Rohan, Rob Blake, Emily Huang
  • Patent number: 7818342
    Abstract: A computer program product that is tangibly embodied in an information carrier is described. The computer program product includes instructions that, when executed, perform operations for tracking data elements that are used in electronic documents. The method includes identifying an instance of a data element in a first electronic document comprising one or more data elements, modifying stored information based on the identification of the data element, periodically retrieving the stored information specifying a number of times a data element is used during a time interval, applying a calculation process to the information to determine a usage trend for the data element, and providing a visual display on a display device that shows an identifier for the data element and the usage trend.
    Type: Grant
    Filed: November 21, 2005
    Date of Patent: October 19, 2010
    Assignee: SAP AG
    Inventor: Gunther Stuhec
  • Patent number: 7809727
    Abstract: A system and method for clustering unstructured documents is provided. Documents having terms with frequencies of occurrence that satisfy upper and lower edge conditions are selected. Concepts are generated for the selected documents. The selected documents are grouped into clusters of the documents. A weight for each of the clusters is evaluated. A similarity value is determined from the frequencies of occurrence for at least one of the terms from the concepts and the cluster weights for each selected document. Each selected document is assigned into one such cluster based on the similarity value of the selected document.
    Type: Grant
    Filed: December 24, 2007
    Date of Patent: October 5, 2010
    Assignee: FTI Technology LLC
    Inventors: Dan Gallivan, Kenji Kawai
  • Patent number: 7797314
    Abstract: A method and computer program product for receiving a search result set including one or more search results, and defining one or more ranking cues based upon, at least in part, ancillary user data. The one or more search results are ranked based upon, at least in part, the one or more ranking cues. The one or more ranked search results are provided to a search user.
    Type: Grant
    Filed: December 31, 2007
    Date of Patent: September 14, 2010
    Assignee: International Business Machines Corporation
    Inventors: Edith Helen Stern, Patrick Joseph O'Sullivan, Robert Cameron Weir, Barry E. Willner
  • Patent number: 7792838
    Abstract: Improved information processing techniques for measuring similarity between instances in an ontology are disclosed. For example, a method of measuring similarity between instances in an ontology for use in an information retrieval system includes the following steps. A set of instances from the ontology is obtained. At least one of the following similarity metrics for the set of instances is computed: (i) a first metric that measures similarity between instances in the set of instances with respect to ontology concepts to which the instances belong; (ii) a second metric which measures similarity between instances in the set of instances where the instances are subjects in statements involving a given ontology property; and (iii) a third metric which measures similarity between instances in the set of instances where the instances are objects in statements involving a given ontology property.
    Type: Grant
    Filed: March 29, 2007
    Date of Patent: September 7, 2010
    Assignee: International Business Machines Corporation
    Inventors: Anand Ranganathan, Royi Ronen
  • Patent number: 7792786
    Abstract: A method and analytics tools for locating experts with specific sets of expertise are disclosed, the method including providing a collection of documents P0; generating categories representing fields of expertise derived from the collection of documents P0; refining the taxonomy of the categories by applying user domain knowledge; extracting structured fields from the collection of documents P0; constructing a contingency table having a first axis defined by the extracted structured fields and a second axis defined by the categories; and using the contingency table to identify a set of experts having a related expertise. The method may also include a network graph analysis that aids visualization of the relationship between people and expertise.
    Type: Grant
    Filed: February 13, 2007
    Date of Patent: September 7, 2010
    Assignee: International Business Machines Corporation
    Inventors: Ying Chen, Jeffrey Thomas Kreulen, Ana Lelescu, James J. Rhodes, William Scott Spangler
  • Patent number: 7783668
    Abstract: A search system and method are provided that uses taxonomies, entities, facets, and ontologies to provide a user with a more comprehensive set of search results in response to a query. The search system has an indexing engine that performs one or more indexing steps that permit the search engine to return a comprehensive set of search results. For example, the indexing engine may index a document according to a set of synsets so that the search engine may use the synsets, during retrieval of results to a query, to return a more comprehensive set of search results.
    Type: Grant
    Filed: January 16, 2008
    Date of Patent: August 24, 2010
    Assignee: Convera Corporation
    Inventors: Claude Vogel, Paul Gardner, Jr., Eric Germundson, Joshua Michael Powers, Joel Wayne Robertson, Jon Michael Van Winkle
  • Patent number: 7774360
    Abstract: Described is a technology by which an intermediate taxonomy is processed (e.g., offline) with respect to a target taxonomy to determine relationship values between categories represented in the intermediate taxonomy and the target taxonomy. The relationship values are used to construct a bridging classifier for use in online query processing to relate queries to categories in the target taxonomy. The relation is based on each target category's relationship to one or more categories that were represented in the intermediate taxonomy. Further, only a relevant subset of the categories represented in the intermediate taxonomy may be chosen for use in the bridging classifier, e.g., based on relative probability scores and/or mutual information scores computed between the categories represented in the intermediate taxonomy and categories in the target taxonomy.
    Type: Grant
    Filed: May 1, 2007
    Date of Patent: August 10, 2010
    Assignee: Microsoft Corporation
    Inventors: Jian-Tao Sun, Dou Shen, Qiang Yang, Zheng Chen
  • Patent number: 7769768
    Abstract: Folders of data from disparate application programs are organized in conformity to a reference taxonomy. A reference taxonomy for representing an organization of stored data found in multiple disparate application programs is created. Each application taxonomy from the multiple disparate application programs is compared to the reference taxonomy by a user. If the user decides to use the reference taxonomy, then the reference taxonomy replaces the application taxonomy for each of the multiple disparate application programs.
    Type: Grant
    Filed: June 28, 2007
    Date of Patent: August 3, 2010
    Assignee: International Business Machines Corporation
    Inventors: Timothy N Holloway, Graham D Wallis
  • Patent number: 7730085
    Abstract: The present invention is directed to a system, method and computer program for automatically extracting and mining relations and related entities from unstructured text. A method in accordance with an embodiment of the invention includes: extracting relations and related entities from unstructured text data, representing the extracted information into a graph, and manipulating the resulting graph to gain more insight into the information it contains. The extraction of relations and related entities is performed first by automatically inducting pattern and second by applying these induced patterns to unstructured text data. For each relation and entity, several features are extracted in order to build a graph whose nodes are entities and edges are relations.
    Type: Grant
    Filed: November 8, 2006
    Date of Patent: June 1, 2010
    Assignee: International Business Machines Corporation
    Inventors: Hany M. Hassan, Hala Mostafa
  • Publication number: 20100131507
    Abstract: A dynamic classification dictionary is built for use in profiling and targeting users for additional relevant content. Behavioral data is gathered from user activity, and user documents and actions are categorized. Author-generated document classification information is analyzed and assigned a first taxonomic noun to characterize the document. User-generated tags characterizing a portion of the document are assigned a second taxonomic noun. Search terms that resulted in the user accessing the document are identified and assigned a third taxonomic noun. Attributes related to the manner in which the document was accessed are evaluated and assigned a fourth taxonomic noun. The document is processed using pattern rules to extract a fifth taxonomic noun. The taxonomic nouns are aggregated into a composite set of taxonomic nouns, and the dynamic classification dictionary is build by storing the composite set of taxonomic nouns.
    Type: Application
    Filed: January 29, 2010
    Publication date: May 27, 2010
    Applicant: CBS INTERACTIVE, INC.
    Inventors: Tushar PRADHAN, Thomas OSBORNE, John POTTER
  • Patent number: 7716229
    Abstract: A method and system to generate variants, including misspells from query log context usage are provided. Usage context obtained from the query logs is utilized to facilitate similarity determination. A Similarity Graph generation process generates a Similarity Graph, which is transformed to provide variants having varying edit distances. The transformed Similarity Graph is loaded into a hash table and provides query corrections in a search engine or related terms when bidding on keyword in an advertising system.
    Type: Grant
    Filed: April 14, 2006
    Date of Patent: May 11, 2010
    Assignee: Microsoft Corporation
    Inventors: Abhinai Srivastava, Lee Wang, Ying Li
  • Publication number: 20100077001
    Abstract: A search system and method for uncovering unexpected links between different concepts related to a user's query during a search comprise a semantic indexing server, which builds a faceted classification index of text objects, and a query server, which receives and analyzes the user's query. A query thus processed is then sent from the query server to the semantic indexing server through an interface in order to perform a search in the faceted classification index. The search system and method further comprise a result handler, which provides the user with a search result set comprising a list of unexpected links and a list of result elements. The list of unexpected links corresponds to filters which allow the user to narrow down or refine the original query.
    Type: Application
    Filed: March 26, 2009
    Publication date: March 25, 2010
    Inventors: Claude Vogel, Alkis Papadopoullos, Jean Pierre Lahargue