Patents by Inventor David E. Heckerman

David E. Heckerman has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 7562064
    Abstract: The invention provides systems and methods that can be used for targeted advertising. The system determines where to present impressions, such as advertisements, to maximize an expected utility subject to one or more constraints, which can include quotas and minimum utilities for groups of one or more impression. The traditional measure of utility in web-based advertising is click-though rates, but the present invention provides a broader definition of utility, including measures of sales, profits, or brand awareness, for example. This broader definition permits advertisements to be allocated more in accordance with the actual interests of advertisers.
    Type: Grant
    Filed: August 14, 2006
    Date of Patent: July 14, 2009
    Assignee: Microsoft Corporation
    Inventors: David Maxwell Chickering, David E. Heckerman
  • Patent number: 7558832
    Abstract: The subject invention provides for a feedback loop system and method that facilitate classifying items in connection with spam prevention in server and/or client-based architectures. The invention makes uses of a machine-learning approach as applied to spam filters, and in particular, randomly samples incoming email messages so that examples of both legitimate and junk/spam mail are obtained to generate sets of training data. Users which are identified as spam-fighters are asked to vote on whether a selection of their incoming email messages is individually either legitimate mail or junk mail. A database stores the properties for each mail and voting transaction such as user information, message properties and content summary, and polling results for each message to generate training data for machine learning systems. The machine learning systems facilitate creating improved spam filter(s) that are trained to recognize both legitimate mail and spam mail and to distinguish between them.
    Type: Grant
    Filed: May 2, 2007
    Date of Patent: July 7, 2009
    Assignee: Microsoft Corporation
    Inventors: Robert L. Rounthwaite, Joshua T. Goodman, David E. Heckerman, John D. Mehr, Nathan D. Howell, Micah C. Rupersburg, Dean A. Slawson
  • Publication number: 20090106172
    Abstract: The claimed subject matter provides systems and/or methods that determines a number of non-spurious arcs associated with a learned graphical model. The system can include devices and mechanisms that utilize learning algorithms and datasets to generate learned graphical models and graphical models associated with null permutations of the datasets, ascertaining the average number of arcs associated with the graphical models associated with null permutations of the datasets, enumerating the total number of arcs affiliated with the learned graphical model, and presenting a ratio of the average number of arcs to the total number of arcs, the ratio indicative of the number of non-spurious arcs associated the learned graphical model.
    Type: Application
    Filed: October 17, 2007
    Publication date: April 23, 2009
    Applicant: MICROSOFT CORPORATION
    Inventors: David E. Heckerman, Jennifer Listgarten, Carl M. Kadie
  • Publication number: 20090089082
    Abstract: The claimed subject matter provides a system and/or a method that facilitates dynamically providing a question to ask a medical professional during an appointment. An interface can receive a portion of medical data. A counselor component can generate a question based on the portion of medical data, wherein the question is generated to elicit an answer from a medical professional during an appointment. Moreover, the counselor component can dynamically generate a second question directed toward the medical professional based upon at least one of the answer or a value of information (VOI) computation.
    Type: Application
    Filed: September 28, 2007
    Publication date: April 2, 2009
    Applicant: MICROSOFT CORPORATION
    Inventors: David E. Heckerman, Pablo Argon, Behrooz Chitsaz, Hong L. Choing, James R. Hamilton, Nuria M. Oliver, Vladimir G. Sadovsky, Chris Demetrios Karkanias, Hubert Van Hoof, Oren Rosenbloom
  • Publication number: 20090088726
    Abstract: Provided are systems and/or methods that facilitate sensing, detecting, or treatment of a condition or need of a living body using a genetically engineered symbiotic agent.
    Type: Application
    Filed: September 28, 2007
    Publication date: April 2, 2009
    Applicant: MICROSOFT CORPORATION
    Inventors: Eric J. Horvitz, Steven Bathiche, David E. Heckerman, Chris Demetrios Karkanias
  • Patent number: 7483947
    Abstract: Architecture for detecting and removing obfuscating clutter from the subject and/or body of a message, e.g., e-mail, prior to filtering of the message, to identify junk messages commonly referred to as spam. The technique utilizes the powerful features built into an HTML rendering engine to strip the HTML instructions for all non-substantive aspects of the message. Pre-processing includes pre-rendering of the message into a final format, which final format is that which is displayed by the rendering engine to the user. The final format message is then converted to a text-only format to remove graphics, color, non-text decoration, and spacing that cannot be rendered as ASCII-style or Unicode-style characters. The result is essentially to reduce each message to its common denominator essentials so that the junk mail filter can view each message on an equal basis.
    Type: Grant
    Filed: May 2, 2003
    Date of Patent: January 27, 2009
    Assignee: Microsoft Corporation
    Inventors: Bryan T. Starbuck, Robert L. Rounthwaite, David E. Heckerman, Joshua T. Goodman
  • Patent number: 7472102
    Abstract: Targeted delivery of items with inventory management using a cluster-based approach or a rule-based approach is disclosed. An example of items is advertisements. Each item is allocated to one or more clusters. The allocation is made based on a predetermined criterion accounting for at least a quota for each item and possibly a constraint for each cluster. The former can refer to the number of times an item must be shown. The latter can refer to the number of times a given group of web pages is likely to be visited by users, and hence is the number of times items can be shown in a given cluster. The invention is not limited to any particular definition of what constitutes a cluster or item.
    Type: Grant
    Filed: October 29, 1999
    Date of Patent: December 30, 2008
    Assignee: Microsoft Corporation
    Inventors: David E. Heckerman, D. Maxwell Chickering, Daniel Rosen
  • Publication number: 20080312095
    Abstract: Systems and methodologies for efficient vaccine design are disclosed herein. A methodology for efficient vaccine design in accordance with one or more embodiments disclosed herein may be operable to receive a graph having vertices corresponding to epitope sequences present in the pathogen population, weights for respective vertices corresponding to respective frequencies with which corresponding epitope sequences appear in the pathogen population, and directed edges that connect vertices that correspond to overlapping epitope sequences. Such a methodology may also be operable to determine a candidate vaccine sequence of overlapping epitope sequences by identifying a path though the graph corresponding to a series of connected vertices and directed edges that maximizes the total weight of the vertices in the path for a desired vaccine sequence length.
    Type: Application
    Filed: June 18, 2007
    Publication date: December 18, 2008
    Applicant: MICROSOFT CORPORATION
    Inventors: Darko Kirovski, David E. Heckerman, Nebojsa Jojic
  • Publication number: 20080294465
    Abstract: The methods/systems described herein facilitate large-scale data collection and aggregation. One exemplary system that facilitates large-scale reporting of health-related data comprises a data collection component, a database and an aggregation component. The data collection component can collect health-related data on a large-scale from a non-selected population. The database can store at least some of the health-related data. The aggregation component can facilitate automatically ascertaining at least one pattern from the health-related data at least in part by applying one or more statistical, data-mining or machine-learning techniques to the database.
    Type: Application
    Filed: July 28, 2008
    Publication date: November 27, 2008
    Applicant: MICROSOFT CORPORATION
    Inventors: Craig J. Mundie, David E. Heckerman, Nebojsa Jojic, Randy J. Hinrichs
  • Patent number: 7454705
    Abstract: Visualizing Internet web traffic is disclosed. In one embodiment, a number of windows are displayed, corresponding to a number of clusters into which users have been partitioned based on similar web browsing behavior. The windows are ordered from the cluster having the greatest number of users to the cluster having the least number of users. Each window has one or more rows, where each row corresponds to a user within the cluster. Each row has an ordered number of visible units, such as blocks, where each block corresponds to a web page visited by the user. The blocks can be color coded by the type of web page they represent. In one embodiment, the corresponding cluster models for the clusters are alternatively displayed in the windows.
    Type: Grant
    Filed: April 22, 2004
    Date of Patent: November 18, 2008
    Assignee: Microsoft Corporation
    Inventors: Igor Cadez, David E. Heckerman, Christopher A. Meek, Steven J. White
  • Patent number: 7409371
    Abstract: A model is constructed for an initial subset of the data using a first parameter estimation algorithm. The model may be evaluated, for example, by applying the model to a holdout data set of the data. If the model is not acceptable, additional data is added to the data subset and the first parameter estimation algorithm is repeated for the aggregate data subset. An appropriate subset of the data exists when the first parameter estimation algorithm produces an acceptable model. The appropriate subset of the data may then be employed by a second parameter estimation algorithm, which may be a more accurate version of the first algorithm or a different algorithm altogether, to build a statistical model to characterize the data.
    Type: Grant
    Filed: June 4, 2001
    Date of Patent: August 5, 2008
    Assignee: Microsoft Corporation
    Inventors: David E. Heckerman, Christopher A. Meek, Bo Thiesson
  • Patent number: 7406453
    Abstract: The methods/systems described herein facilitate large-scale data collection and aggregation. One exemplary system that facilitates large-scale reporting of health-related data comprises a data collection component, a database and an aggregation component. The data collection component can collect health-related data on a large-scale from a non-selected population. The database can store at least some of the health-related data. The aggregation component can facilitate automatically ascertaining at least one pattern from the health-related data at least in part by applying one or more statistical, data-mining or machine-learning techniques to the database.
    Type: Grant
    Filed: November 4, 2005
    Date of Patent: July 29, 2008
    Assignee: Microsoft Corporation
    Inventors: Craig J. Mundie, David E. Heckerman, Nebojsa Jojic, Randy J. Hinrichs
  • Publication number: 20080172215
    Abstract: Epitope prediction models are described herein. By way of example, a system for predicting epitope information relating to a epitope can include a classification model (e.g., logistic regression model). The trained classification model can illustratively operatively execute one ore logistic functions on received protein data, and incorporate one or more of hidden binary variables and shift variables that when processed represent the identification (e.g., prediction) of one or more desired epitopes. The classification model can be configured to predict the epitope information by processing data including various features of an epitope, MHC, MHC supertype, and Boolean combinations thereof.
    Type: Application
    Filed: December 21, 2007
    Publication date: July 17, 2008
    Applicant: MICROSOFT CORPORATION
    Inventors: David E. Heckerman, Carl M. Kadie, Jennifer Listgarten, Noah Aaron Zaitlen, Nebojsa Jojic
  • Publication number: 20080172209
    Abstract: Statistical models for identifying associations are described herein. By way of example, a system for identifying associations between variables can include a model builder and an association identifier. The model builder can receive observations about the variables and generate a null model and a non-null model. The association identifier can assess the strength of the association between the variables by determining how much the non-null model better explains the observed data than the null model. Additionally or alternatively, the structure of the observed data can be inferred simultaneously with the statistical model.
    Type: Application
    Filed: January 12, 2007
    Publication date: July 17, 2008
    Applicant: MICROSOFT CORPORATION
    Inventors: David E. Heckerman, Jonathan M. Carlson, Carl M. Kadie
  • Publication number: 20080172351
    Abstract: Computer-executable instructions for identifying associations are described herein. By way of example, a method for facilitating developing a treatment can include employing computer-executable instructions stored on one or more computer-readable media to determine correlations and utilizing at least some of the determined correlations to develop a treatment.
    Type: Application
    Filed: April 4, 2007
    Publication date: July 17, 2008
    Applicant: MICROSOFT CORPORATION
    Inventors: David E. Heckerman, Jonathan M. Carlson, Carl M. Kadie
  • Patent number: 7389201
    Abstract: The system and method of the present invention automatically extracts the top k recommendations of objects, such as topics, items, products, books, movies, food, drinks, etc., from a local probabilistic recommendation system. Unlike prior systems, the present invention accomplishes the extraction of the top k recommendations of objects without examining a probability for every object that can be recommended. Further, the system and method of the present invention is capable of being implemented using probabilistic recommendation systems based on any conventional type of probabilistic distribution or machine learning technique, including, for example, decision trees and Bayesian networks.
    Type: Grant
    Filed: May 30, 2001
    Date of Patent: June 17, 2008
    Assignee: Microsoft Corporation
    Inventors: David Maxwell Chickering, David E. Heckerman, Robert Rounthwaite
  • Patent number: 7370002
    Abstract: Advertisement response probabilities are utilized to alter advertisement scores. A plurality of possible advertisements is accessed from, for example, an advertisement database or advertisement pipeline. A response probability for each advertisement is determined. A response probability may be a probability that a user will “click,” or otherwise select an advertisement. Advertisements may be associated with probabilistic prediction models that take advertisement recipient attribute values as inputs and provide a probability distribution as output. A score associated with each of the possible advertisements is altered based on the response probability for each of the advertisements. Statistical prediction is used to determine how scores are to be altered. Advertisements with response probabilities less than a mean probability may have associated scores decreased. Conversely, advertisements with response probabilities greater than a mean probability may have associated scores increased.
    Type: Grant
    Filed: June 5, 2002
    Date of Patent: May 6, 2008
    Assignee: Microsoft Corporation
    Inventors: David E. Heckerman, Martin Luo, Guy Shani, Mahbubul Alam Ali
  • Patent number: 7346471
    Abstract: Data slices of historical time series are leveraged to facilitate in more accurately predicting like data slices of future time series. Different predictive models are employed to detect outliers in different data slices to enhance the accuracy of the predictions. The data slices can be temporal and/or non-temporal attributes of a data set represented by the historical time series. In this manner, for example, a historical time series for a network location can be sliced temporally into one hour time periods as a function of a day, a week, a month, a year, etc. Outliers detected in these data slices can then be mitigated utilizing the predictive time series model by replacing the outlier with the expected value. The mitigated historical time series can then be employed in a predictive model to predict future web traffic for the network location (and advertising revenue values) with a substantial increase in accuracy.
    Type: Grant
    Filed: September 2, 2005
    Date of Patent: March 18, 2008
    Assignee: Microsoft Corporation
    Inventors: David M. Chickering, Ashis Kumar Roy, Lawrence Andrew Koch, David E. Heckerman
  • Patent number: 7333998
    Abstract: A system that incorporates an interactive graphical user interface for visualizing clusters (categories) and segments (summarized clusters) of data. Specifically, the system automatically categorizes incoming case data into clusters, summarizes those clusters into segments, determines similarity measures for the segments, scores the selected segments through the similarity measures, and then forms and visually depicts hierarchical organizations of those selected clusters. The system also automatically and dynamically reduces, as necessary, a depth of the hierarchical organization, through elimination of unnecessary hierarchical levels and inter-nodal links, based on similarity measures of segments or segment groups. Attribute/value data that tends to meaningfully characterize each segment is also scored, rank ordered based on normalized scores, and then graphically displayed.
    Type: Grant
    Filed: March 24, 2004
    Date of Patent: February 19, 2008
    Assignee: Microsoft Corporation
    Inventors: David E. Heckerman, Paul S. Bradley, David M. Chickering, Christopher A. Meek
  • Patent number: 7251636
    Abstract: The present invention leverages scalable learning methods to efficiently obtain a Bayesian network for a set of variables of which the total ordering in a domain is known. Certain criteria are employed to generate a Bayesian network which is then evaluated and utilized as a guide to generate another Bayesian network for the set of variables. Successive iterations are performed utilizing a prior Bayesian network as a guide until a stopping criterion is reached, yielding a best-effort Bayesian network for the set of variables.
    Type: Grant
    Filed: December 10, 2003
    Date of Patent: July 31, 2007
    Assignee: Microsoft Corporation
    Inventors: David M. Chickering, David E. Heckerman