Patents by Inventor John Christopher Shafer

John Christopher Shafer has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 9158813
    Abstract: A structured query may specify attribute values for attributes. An estimate of the number of items that will match the structured query if it is applied to a structured database is determined. If the estimated number of items is below a threshold, the structured query may be relaxed to form new candidate structured queries. The number of candidate queries may be determined based on a desired running time. Each of the candidate structured queries may be determined by changing one or more attribute values of the attributes of the structured query. Estimates of the number of items each of the candidate structured queries will match is determined, and the candidate structured query that has the highest matching estimation is used to query the database. The matching results may be output.
    Type: Grant
    Filed: June 9, 2010
    Date of Patent: October 13, 2015
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Alexandros Ntoulas, Sreenivas Gollapudi, Samuel Ieong, Stelios Paparizos, John Christopher Shafer
  • Publication number: 20110307517
    Abstract: A structured query may specify attribute values for attributes. An estimate of the number of items that will match the structured query if it is applied to a structured database is determined. If the estimated number of items is below a threshold, the structured query may be relaxed to form new candidate structured queries. The number of candidate queries may be determined based on a desired running time. Each of the candidate structured queries may be determined by changing one or more attribute values of the attributes of the structured query. Estimates of the number of items each of the candidate structured queries will match is determined, and the candidate structured query that has the highest matching estimation is used to query the database. The matching results may be output.
    Type: Application
    Filed: June 9, 2010
    Publication date: December 15, 2011
    Applicant: Microsoft Corporation
    Inventors: Alexandros Ntoulas, Sreenivas Gollapudi, Samuel Ieong, Stelios Paparizos, John Christopher Shafer
  • Publication number: 20110307504
    Abstract: A user submits an unstructured query that is analyzed to determine a mapping from attributes to attribute values. One or more matching items from a structured data set are determined based on the attribute values of attributes associated with the items. The matching items are displayed. One or more refinement attributes are displayed, each with one or more attribute values. The attribute values in the refinements that correspond to the attribute values of the query are shown as selected. If the user selects any of the refinement attributes, the query is revised to incorporate the attribute values of the selected refinements. New matching items are determined using the revised structured query. The revised structured query and the new matching items are displayed. This process can be iterated, by modification of the query or the refinements. The matching items, the selected refinement attribute values and the query are synchronized.
    Type: Application
    Filed: June 9, 2010
    Publication date: December 15, 2011
    Applicant: MICROSOFT CORPORATION
    Inventors: Rakesh Agrawal, John Christopher Shafer, Fabian Martin Suchanek
  • Patent number: 6633885
    Abstract: A system and method for exporing a web-accessible database includes providing a GUI that a user can manipulate to quickly modify the results of a query to expand or contract the results set, without requiring additional querying. Attribute controls can be manipulated to impose restrictions on the results set, including by designating example records the attributes of which are used to restrict the records displayed to the user. Only records that can be displayed are instantiated, to further increase the speed of the system.
    Type: Grant
    Filed: January 4, 2000
    Date of Patent: October 14, 2003
    Assignee: International Business Machines Corporation
    Inventors: Rakesh Agrawal, John Christopher Shafer
  • Patent number: 6592627
    Abstract: A user can easily organize computerized document folders by associating a few sample documents in the document database with each folder. The present invention learns folder profiles based on the sample documents and moves the remaining documents into the folders accordingly. In this way, the user can construct new folders, or rearrange existing folders, or cause the computer to automatically rearrange and maintain the folders. This is particularly useful for managing a database of perhaps thousands of emails.
    Type: Grant
    Filed: June 10, 1999
    Date of Patent: July 15, 2003
    Assignee: International Business Machines Corporation
    Inventors: Rakesh Agrawal, Roberto Javier Bayardo, Dimitrios Gunopulos, Ching-Tien Howard Ho, Sunita Sarawagi, John Christopher Shafer, Ramakrishnan Srikant
  • Patent number: 6138115
    Abstract: A method and system are disclosed for generating a decision-tree classifier in parallel in a multi-processor system, from a training set of records. The method comprises the steps of: partitioning the records among the processors, each processor generating an attribute list for each attribute, and the processors cooperatively generating a decision tree by repeatedly partitioning the records using the attribute lists. For each node, each processor determines its best split test and, along with other processors, selects the best overall split for the records at that node. Preferably, the gini-index and class histograms are used in determining the best splits. Also, each processor builds a hash table using the attribute list of the split attribute and shares it with other processors. The hash tables are used for splitting the remaining attribute lists.
    Type: Grant
    Filed: February 5, 1999
    Date of Patent: October 24, 2000
    Assignee: International Business Machines Corporation
    Inventors: Rakesh Agrawal, Manish Mehta, John Christopher Shafer
  • Patent number: 5884320
    Abstract: A method and system for performing spatial proximity joins on high-dimensional points representing data objects of a database in parallel in a multiprocessor system. The method comprises the steps of: partitioning the data points among the processors; creating index structures for the data points of the processors in parallel; assigning the join operations to the processors using the index structures; and simultaneously redistributing and joining the data points in the processors in parallel based on a predetermined joining condition. An efficient data structure, .epsilon.-K-D-B tree, is used to provide fast access to the high-dimensional points and to minimize system storage requirements. The invention achieves fast response time and requires minimum storage space by having structurally identical indices among the processors, assigning workload based on the join costs, and redistributing the data points among the processors while joining the data whenever possible.
    Type: Grant
    Filed: August 20, 1997
    Date of Patent: March 16, 1999
    Assignee: International Business Machines Corporation
    Inventors: Rakesh Agrawal, John Christopher Shafer
  • Patent number: 5870735
    Abstract: A method and system are disclosed for generating a decision-tree classifier in parallel in a multi-processor system, from a training set of records. The method comprises the steps of: partitioning the records among the processors, each processor generating an attribute list for each attribute, and the processors cooperatively generating a decision tree by repeatedly partitioning the records using the attribute lists. For each node, each processor determines its best split test and, along with other processors, selects the best overall split for the records at that node. Preferably, the gini-index and class histograms are used in determining the best splits. Also, each processor builds a hash table using the attribute list of the split attribute and shares it with other processors. The hash tables are used for splitting the remaining attribute lists.
    Type: Grant
    Filed: May 1, 1996
    Date of Patent: February 9, 1999
    Assignee: International Business Machines Corporation
    Inventors: Rakesh Agrawal, Manish Mehta, John Christopher Shafer
  • Patent number: 5842200
    Abstract: A multiprocessor including a plurality of processing systems is disclosed for discovering consumer purchasing tendencies. Each processing system of the multiprocessor identifies consumer transaction itemsets that are stored in a database that is distributed among the processing systems and which appear in the database a user-defined minimum number of times, referred to as minimum support. Then, the system discovers association rules in the itemsets by comparing the ratio of the number of times each of the large itemsets appears in the database to the number of times particular subsets of the itemset appear in the database. When the ratio exceeds a predetermined minimum confidence value, the system outputs an association rule which is representative of purchasing tendencies of consumers.
    Type: Grant
    Filed: July 11, 1995
    Date of Patent: November 24, 1998
    Assignee: International Business Machines Corporation
    Inventors: Rakesh Agrawal, John Christopher Shafer
  • Patent number: 5799311
    Abstract: A method and system are disclosed for generating a decision-tree classifier from a training set of records, independent of the system memory size. The method comprises the steps of: generating an attribute list for each attribute of the records, sorting the attribute lists for numeric attributes, and generating a decision tree by repeatedly partitioning the records using the attribute lists. For each node, split points are evaluated to determine the best split test for partitioning the records at the node. Preferably, a gini index and class histograms are used in determining the best splits. The gini index indicates how well a split point separates the records while the class histograms reflect the class distribution of the records at the node. Also, a hash table is built as the attribute list of the split attribute is divided among the child nodes, which is then used for splitting the remaining attribute lists of the node.
    Type: Grant
    Filed: May 8, 1996
    Date of Patent: August 25, 1998
    Assignee: International Business Machines Corporation
    Inventors: Rakesh Agrawal, Manish Mehta, John Christopher Shafer