Patents by Inventor Christopher A. Meek

Christopher A. Meek has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20050267717
    Abstract: Determining the near-optimal block size for incremental-type expectation maximization (EM) algorithms is disclosed. Block size is determined based on the novel insight that the speed increase resulting from using an incremental-type EM algorithm as opposed to the standard EM algorithm is roughly the same for a given range of block sizes. Furthermore, this block size can be determined by an initial version of the EM algorithm that does not reach convergence. For a current block size, the speed increase is determined, and if the speed increase is the greatest determined so far, the current block size is set as the target block size. This process is repeated for new block sizes, until no new block sizes can be determined.
    Type: Application
    Filed: July 8, 2005
    Publication date: December 1, 2005
    Applicant: Microsoft Corporation
    Inventors: Bo Thiesson, Christopher Meek, David Heckerman
  • Publication number: 20050234960
    Abstract: The present invention leverages machine learning techniques to provide automatic generation of conditioning variables for constructing a data perspective for a given target variable. The present invention determines and analyzes the best target variable predictors for a given target variable, employing them to facilitate the conveying of information about the target variable to a user. It automatically discretizes continuous and discrete variables utilized as target variable predictors to establish their granularity. In other instances of the present invention, a complexity and/or utility parameter can be specified to facilitate generation of the data perspective via analyzing a best target variable predictor versus the complexity of the conditioning variable(s) and/or utility. The present invention can also adjust the conditioning variables (i.e.
    Type: Application
    Filed: April 14, 2004
    Publication date: October 20, 2005
    Applicant: Microsoft Corporation
    Inventors: David Chickering, Bo Thiesson, Carl Kadie, David Heckerman, Christopher Meek, Allan Folting, Eric Vigesaa
  • Publication number: 20050192936
    Abstract: Systems and methods are described that facilitate predictive web-crawling in a computer environment. Aspects of the invention provide for predictive, utility-based, and decision theoretic probability assessments of changes in subsets of web pages, enhancing web-crawling ability and ensuring that web page information is maintained in a fresh state. Additionally, the invention facilitates selective crawling of pages with a high probability of change.
    Type: Application
    Filed: February 12, 2004
    Publication date: September 1, 2005
    Inventors: Christopher Meek, Carl Kadie
  • Patent number: 6922660
    Abstract: Determining the near-optimal block size for incremental-type expectation maximization (EM) algorithms is disclosed. Block size is determined based on the novel insight that the speed increase resulting from using an incremental-type EM algorithm as opposed to the standard EM algorithm is roughly the same for a given range of block sizes. Furthermore, this block size can be determined by an initial version of the EM algorithm that does not reach convergence. For a current block size, the speed increase is determined, and if the speed increase is the greatest determined so far, the current block size is set as the target block size. This process is repeated for new block sizes, until no new block sizes can be determined.
    Type: Grant
    Filed: December 1, 2000
    Date of Patent: July 26, 2005
    Assignee: Microsoft Corporation
    Inventors: Bo Thiesson, Christopher A. Meek, David E. Heckerman
  • Publication number: 20050149052
    Abstract: A driver and method are presented for driving fasteners inside of an orthopaedic implant.
    Type: Application
    Filed: December 31, 2003
    Publication date: July 7, 2005
    Inventor: Christopher Meek
  • Publication number: 20050071766
    Abstract: The present invention provides systems and methods for obtaining information from a networked system utilizing a distributed web crawler. The distributed nature of clients of a server is leveraged to provide fast and accurate web crawling data. Information gathered by a server's web crawler is compared to data retrieved by clients of the server to update the crawler's data. In one instance of the present invention, data comparison is achieved by utilizing information disseminated via a search engine results page. In another instance of the present invention, data validation is accomplished by client dictionaries, emanating from a server, that summarize web crawler data. The present invention also facilitates data analysis by providing a means to resist spoofing of a web crawler to increase data accuracy.
    Type: Application
    Filed: September 25, 2003
    Publication date: March 31, 2005
    Inventors: Eric Brill, Christopher Meek
  • Publication number: 20050049987
    Abstract: The present invention provides collaborative filtering systems and methods employing default scores of decision graphs/trees to quickly create a top-n prediction list that can efficiently determine a user's interest in items. In one instance of the present invention, the list is refined by utilizing a variable maximum score algorithm and/or an invalidation list algorithm to insert items that score above an inclusion threshold set by a last item in the top-n prediction list. In another instance of the present invention, an invalidation list for a decision graph and/or decision tree is utilized to create a top-n prediction list. An algorithm employing default scores is then utilized to refine the top-n prediction list to insert items with default scores above an inclusion threshold set by a last item in the top-n prediction list.
    Type: Application
    Filed: September 3, 2003
    Publication date: March 3, 2005
    Inventors: Christopher Meek, David Chickering, Christopher Weare, Pradeep Gupta
  • Publication number: 20040267596
    Abstract: The present invention provides collaborative filtering systems and methods employing statistical smoothing to provide quickly creatable models that can efficiently predict probability that a user likes an item and/or similarities between items. Smoothing is accomplished by utilizing statistical methods such as support cutoff, single and multiple prior on counts, and prior on measure of association and the like. By improving model-based collaborative filtering with such techniques, performance is increased with regard to product-to-product recommendations. The present invention also provides improvements over systems based on dependency nets (DN) in both areas of quality of recommendations and speed of model creation. It can also be complementary to DN to improve the value of an existing collaborative filtering system's overall efficiency. It is also employable with low frequency user preference data.
    Type: Application
    Filed: June 25, 2003
    Publication date: December 30, 2004
    Inventors: Jesper B. Lind, Carl M. Kadie, Christopher A. Meek, David E. Heckerman
  • Publication number: 20040260664
    Abstract: The present invention utilizes a cross-prediction scheme to predict values of discrete and continuous time observation data, wherein conditional variance of each continuous time tube variable is fixed to a small positive value. By allowing cross-predictions in an ARMA based model, values of continuous and discrete observations in a time series are accurately predicted. The present invention accomplishes this by extending an ARMA model such that a first time series “tube” is utilized to facilitate or “cross-predict” values in a second time series tube to form an “ARMAxp” model. In general, in the ARMAxp model, the distribution of each continuous variable is a decision graph having splits only on discrete variables and having linear regressions with continuous regressors at all leaves, and the distribution of each discrete variable is a decision graph having splits only on discrete variables and having additional distributions at all leaves.
    Type: Application
    Filed: June 17, 2003
    Publication date: December 23, 2004
    Inventors: Bo Thiesson, Christopher A. Meek, David M. Chickering, David E. Heckerman
  • Publication number: 20040254903
    Abstract: The present invention leverages approximations of distributions to provide tractable variational approximations, based on at least one continuous variable, for inference utilization in Bayesian networks where local distributions are decision-graphs. These tractable approximations are employed in lieu of exact inferences that are normally NP-hard to solve. By utilizing Jensen's inequality applied to logarithmic distributions composed of a generalized sum including an introduced arbitrary conditional distribution, a means is acquired to resolve a tightly bound likelihood distribution. The means includes application of Mean-Field Theory, approximations of conditional probability distributions, and/or other means that allow for a tractable variational approximation to be achieved.
    Type: Application
    Filed: June 10, 2003
    Publication date: December 16, 2004
    Inventors: David E. Heckerman, Christopher A. Meek, David M. Chickering
  • Publication number: 20040236576
    Abstract: The present invention utilizes a discriminative density model selection method to provide an optimized density model subset employable in constructing a classifier. By allowing multiple alternative density models to be considered for each class in a multi-class classification system and then developing an optimal configuration comprised of a single density model for each class, the classifier can be tuned to exhibit a desired characteristic such as, for example, high classification accuracy, low cost, and/or a balance of both. In one instance of the present invention, error graph, junction tree, and min-sum propagation algorithms are utilized to obtain an optimization from discriminatively selected density models.
    Type: Application
    Filed: May 20, 2003
    Publication date: November 25, 2004
    Inventors: Bo Thiesson, Christopher A. Meek
  • Publication number: 20040234128
    Abstract: The present invention utilizes generic and user-specific features of handwriting samples to provide adaptive handwriting recognition with a minimum level of user-specific enrollment data. By allowing generic and user-specific classifiers to facilitate in a recognition process, the features of a specific user's handwriting can be exploited to quickly ascertain characteristics of handwriting characters not yet entered by the user. Thus, new characters can be recognized without requiring a user to first enter that character as enrollment or “training” data. In one instance of the present invention, processing of generic features is accomplished by a generic classifier trained on multiple users. In another instance of the present invention, a user-specific classifier is employed to modify a generic classifier's classification as required to provide user-specific handwriting recognition.
    Type: Application
    Filed: May 21, 2003
    Publication date: November 25, 2004
    Inventors: Bo Thiesson, Christopher A. Meek
  • Patent number: 6807537
    Abstract: One aspect of the invention is the construction of mixtures of Bayesian networks. Another aspect of the invention is the use of such mixtures of Bayesian networks to perform inferencing. A mixture of Bayesian networks (MBN) consists of plural hypothesis-specific Bayesian networks (HSBNs) having possibly hidden and observed variables. A common external hidden variable is associated with the MBN, but is not included in any of the HSBNs. The number of HSBNs in the MBN corresponds to the number of states of the common external hidden variable, and each HSBN is based upon the hypothesis that the common external hidden variable is in a corresponding one of those states. In one mode of the invention, the MBN having the highest MBN score is selected for use in performing inferencing.
    Type: Grant
    Filed: December 4, 1997
    Date of Patent: October 19, 2004
    Assignee: Microsoft Corporation
    Inventors: Bo Thiesson, Christopher A. Meek, David Maxwell Chickering, David Earl Heckerman
  • Publication number: 20040196311
    Abstract: Visualizing Internet web traffic is disclosed. In one embodiment, a number of windows are displayed, corresponding to a number of clusters into which users have been partitioned based on similar web browsing behavior. The windows are ordered from the cluster having the greatest number of users to the cluster having the least number of users. Each window has one or more rows, where each row corresponds to a user within the cluster. Each row has an ordered number of visible units, such as blocks, where each block corresponds to a web page visited by the user. The blocks can be color coded by the type of web page they represent. In one embodiment, the corresponding cluster models for the clusters are alternatively displayed in the windows.
    Type: Application
    Filed: April 22, 2004
    Publication date: October 7, 2004
    Applicant: Microsoft Corporation
    Inventors: Igor Cadez, David E. Heckerman, Christopher A. Meek, Steven J. White
  • Publication number: 20040181554
    Abstract: A system that incorporates an interactive graphical user interface for visualizing clusters (categories) and segments (summarized clusters) of data. Specifically, the system automatically categorizes incoming case data into clusters, summarizes those clusters into segments, determines similarity measures for the segments, scores the selected segments through the similarity measures, and then forms and visually depicts hierarchical organizations of those selected clusters. The.system also automatically and dynamically reduces, as necessary, a depth of the hierarchical organization, through elimination of unnecessary hierarchical levels and inter-nodal links, based on similarity measures of segments or segment groups. Attribute/value data that tends to meaningfully characterize each segment is also scored, rank ordered based on normalized scores, and then graphically displayed.
    Type: Application
    Filed: March 24, 2004
    Publication date: September 16, 2004
    Inventors: David E. Heckerman, Paul S. Bradley, David M. Chickering, Christopher A. Meek
  • Publication number: 20040177123
    Abstract: Improving object organization by presenting controlling attribute-specific lists is disclosed. For example, the object can be an email and the controlling attribute the sender of the email. Sender-specific lists are dynamically maintained and can include the most recent folders into which email have been moved. When a current email is selected, or when the user otherwise so indicates, a sender-specific list for the sender of the current email is displayed to the user. The user can select one of the folders from the list into which to move the current email. Besides email, the object can be a file, such that the controlling attribute can be the creator of the file.
    Type: Application
    Filed: March 12, 2004
    Publication date: September 9, 2004
    Inventor: Christopher A. Meek
  • Patent number: 6771289
    Abstract: Visualizing Internet web traffic is disclosed. In one embodiment, a number of windows are displayed, corresponding to a number of clusters into which users have been partitioned based on similar web browsing behavior. The windows are ordered from the cluster having the greatest number of users to the cluster having the least number of users. Each window has one or more rows, where each row corresponds to a user within the cluster. Each row has an ordered number of visible units, such as blocks, where each block corresponds to a web page visited by the user. The blocks can be color coded by the type of web page they represent. In one embodiment, the corresponding cluster models for the clusters are alternatively displayed in the windows.
    Type: Grant
    Filed: March 2, 2000
    Date of Patent: August 3, 2004
    Assignee: Microsoft Corporation
    Inventors: Igor Cadez, David E. Heckerman, Christopher A. Meek, Steven J. White
  • Patent number: 6742003
    Abstract: A system that incorporates an interactive graphical user interface for visualizing clusters (categories) and segments (summarized clusters) of data. Specifically, the system automatically categorizes incoming case data into clusters, summarizes those clusters into segments, determines similarity measures for the segments, scores the selected segments through the similarity measures, and then forms and visually depicts hierarchical organizations of those selected clusters. The system also automatically and dynamically reduces, as necessary, a depth of the hierarchical organization, through elimination of unnecessary hierarchical levels and inter-nodal links, based on similarity measures of segments or segment groups. Attribute/value data that tends to meaningfully characterize each segment is also scored, rank ordered based on normalized scores, and then graphically displayed.
    Type: Grant
    Filed: April 30, 2001
    Date of Patent: May 25, 2004
    Assignee: Microsoft Corporation
    Inventors: David E. Heckerman, Paul S. Bradley, David M. Chickering, Christopher A. Meek
  • Patent number: 6732155
    Abstract: Improving object organization by presenting controlling attribute-specific lists is disclosed. For example, the object can be an email and the controlling attribute the sender of the email. Sender-specific lists are dynamically maintained and can include the most recent folders into which email have been moved. When a current email is selected, or when the user otherwise so indicates, a sender-specific list for the sender of the current email is displayed to the user. The user can select one of the folders from the list into which to move the current email. Besides email, the object can be a file, such that the controlling attribute can be the creator of the file.
    Type: Grant
    Filed: December 1, 2000
    Date of Patent: May 4, 2004
    Assignee: Microsoft Corporation
    Inventor: Christopher A. Meek
  • Patent number: 6728690
    Abstract: A training system for a classifier utilizes both a back-propagation system to iteratively modify parameters of functions which provide raw output indications of desired categories, wherein the parameters are modified based on a weighted decay, and a probability determining system with further parameters that are determined during iterative training. A margin error metric may be combined with weight decay, and a sigmoid is used to calibrate the raw outputs to probability percentages for each category. A method of training such a system involves gathering a training set of inputs and desired corresponding outputs. Classifier parameters are then initialized and an error margin is calculated with respect to the classifier parameters. A weight decay is then used to adjust the parameters.
    Type: Grant
    Filed: November 23, 1999
    Date of Patent: April 27, 2004
    Assignee: Microsoft Corporation
    Inventors: Christopher A. Meek, John C. Platt