Patents by Inventor William W. Cohen

William W. Cohen has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 8538972
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining similarity measures for objects in a dataset that include contextual associations of the objects with contexts. In one aspect, a method includes calculating a similarity measure for any two objects that include a common feature f based, in part, on the likelihood that the two object representations in the dataset that both include f will we associated with distinct contexts, and the likelihood that the two objects in the dataset that both include f will be associated with the same context.
    Type: Grant
    Filed: June 26, 2012
    Date of Patent: September 17, 2013
    Assignee: Google Inc.
    Inventor: William W. Cohen
  • Patent number: 8234285
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining similarity measures for objects in a dataset that include contextual associations of the objects with contexts. In one aspect, a method includes calculating a similarity measure for any two objects that include a common feature f based, in part, on the likelihood that the two object representations in the dataset that both include f will we associated with distinct contexts, and the likelihood that the two objects in the dataset that both include f will be associated with the same context.
    Type: Grant
    Filed: July 21, 2009
    Date of Patent: July 31, 2012
    Assignee: Google Inc.
    Inventor: William W. Cohen
  • Patent number: 6516308
    Abstract: A method and apparatus is provided for producing a general data extraction procedure capable of extracting data from data sources on a network regardless of data format. The general data extraction procedure is determined from a plurality of pairs of data from the network, each pair including a data source and a program which accurately extracts data from the data source. The pairs of data are processed by a learning system to learn a general program for extracting data from new data sources.
    Type: Grant
    Filed: May 10, 2000
    Date of Patent: February 4, 2003
    Assignee: AT&T Corp.
    Inventor: William W. Cohen
  • Patent number: 6418432
    Abstract: An information retrieval system finds information in a Distributed Information System (DIS), e.g. the Internet using query learning and meta search for adding documents to resource directories contained in the DIS. A selection means generates training data characterized as positive and negative examples of a particular class of data residing in the DIS. A learning means generates from the training data at least one query that can be submitted to any one of a plurality of search engines for searching the DIS to find “new” items of the particular class. An evaluation means determines and verifies that the new item(s) is a new subset of the particular class and adds or updates the particular class in the resource directory.
    Type: Grant
    Filed: July 24, 1998
    Date of Patent: July 9, 2002
    Assignee: AT&T Corporation
    Inventors: William W Cohen, Yoram Singer
  • Patent number: 6295533
    Abstract: A system and method are provided for answering queries concerning information stored in a set of collections. Each collection includes a structured entity, and each structured entity includes a field. A query is received that specifies a subset of the set of collections and a logical constraint between fields that includes a requirement that a first field match a second field. The probability that the first field matches the second field is determined automatically based upon the contents of the fields. A collection of lists is generated in response to the query, where each list includes members of the subset of collections specified in the query, and where each list has an estimate of the probability that the members of the list satisfies the logical constraint specified in the query.
    Type: Grant
    Filed: February 24, 1998
    Date of Patent: September 25, 2001
    Assignee: AT&T Corp.
    Inventor: William W. Cohen
  • Publication number: 20010013035
    Abstract: A system and method are provided for answering queries concerning information stored in a set of collections. Each collection includes a structured entity, and each structured entity includes a field. A query is received that specifies a subset of the set of collections and a logical constraint between fields that includes a requirement that a first field match a second field. The probability that the first field matches the second field is determined automatically based upon the contents of the fields. A collection of lists is generated in response to the query, where each list includes members of the subset of collections specified in the query, and where each list has an estimate of the probability that the members of the list satisfies the logical constraint specified in the query.
    Type: Application
    Filed: February 24, 1998
    Publication date: August 9, 2001
    Inventor: WILLIAM W. COHEN
  • Patent number: 5719692
    Abstract: Efficient techniques for inducing rules used in classifying data items on a noisy data set. The prior-art IREP technique, which produces a set of classification rules by inducing each rule and then pruning it and continuing thus until a stopping condition is reached, is improved with a new rule-value metric for stopping pruning and with a stopping condition which depends on the description length of the rule set. The rule set which results from the improved IREP technique is then optimized by pruning rules from the set to minimize the description length and further optimized by making a replacement rule and a modified rule for each rule and using the description length to determine whether to use the replacement rule, the modified rule, or the original rule in the rule set. Further improvement is achieved by inducing rules for data items not covered by the original set and then pruning these rules.
    Type: Grant
    Filed: July 7, 1995
    Date of Patent: February 17, 1998
    Assignee: Lucent Technologies Inc.
    Inventor: William W. Cohen
  • Patent number: 5642472
    Abstract: Apparatus and methods which employ a machine learning system to "learn" the specification for a program from a trace of an execution of the program on a set of test problems. The program is instrumented to produce the trace. Performance is improved by means of a declarative bias which expresses knowledge of the user about the program and constrains the learning system to produce only specifications which are consistent with the declarative bias. The apparatus and methods of the preferred embodiment are employed to learn specifications of views in a data base for a telephone switching system from traces produced by executing the programs which produce the views. Techniques for producing more than one specification and for dealing with views which involve conversions are also disclosed.
    Type: Grant
    Filed: May 20, 1994
    Date of Patent: June 24, 1997
    Assignee: Lucent Technologies Inc.
    Inventor: William W. Cohen
  • Patent number: 5627945
    Abstract: The invention permits various types of background knowledge for a concept learning system to be represented in a single formal structure known as an antecedent description grammar. A user formulates background knowledge for a learning problem into such a grammar, which then becomes an input to a learning system, together with training data representing the concept to be learned. The learning system, constrained by the grammar, then uses the training data to generate a hypothesis for the concept to be learned. Such hypothesis is in the form of a set of logic clauses known as Horn clauses.
    Type: Grant
    Filed: December 1, 1995
    Date of Patent: May 6, 1997
    Assignee: Lucent Technologies Inc.
    Inventor: William W. Cohen
  • Patent number: 5481650
    Abstract: The invention permits various types of background knowledge for a concept learning system to be represented in a single formal structure known as an antecedent description grammar. A user formulates background knowledge for a learning problem into such a grammar, which then becomes an input to a learning system, together with training data representing the concept to be learned. The learning system, constrained by the grammar, then uses the training data to generate a hypothesis for the concept to be learned. Such hypothesis is in the form of a set of logic clauses known as Horn clauses.
    Type: Grant
    Filed: October 7, 1994
    Date of Patent: January 2, 1996
    Assignee: AT&T Corp.
    Inventor: William W. Cohen