Patents by Inventor Dengyong Zhou

Dengyong Zhou has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20130282632
    Abstract: A spam detection system is disclosed. The system includes a classifier training component that receives a first set of training pages labeled as normal pages and a second set of training pages labeled as spam pages. The training component trains a web page classifier based on both the first set of training pages and the second set of training pages. A spam detector then receives unlabeled web pages and uses the web page classifier to classify the unlabeled web pages as spam pages or normal pages.
    Type: Application
    Filed: June 19, 2013
    Publication date: October 24, 2013
    Inventors: Dengyong Zhou, Christopher J. C. Burges, Tao Tao
  • Patent number: 8494998
    Abstract: A spam detection system is disclosed. The system includes a classifier training component that receives a first set of training pages labeled as normal pages and a second set of training pages labeled as spam pages. The training component trains a web page classifier based on both the first set of training pages and the second set of training pages. A spam detector then receives unlabeled web pages and uses the web page classifier to classify the unlabeled web pages as spam pages or normal pages.
    Type: Grant
    Filed: April 1, 2011
    Date of Patent: July 23, 2013
    Assignee: Microsoft Corporation
    Inventors: Dengyong Zhou, Christopher J. C. Burges, Tao Tao
  • Patent number: 8484016
    Abstract: A method is described herein that includes acts of receiving a selection of a first phrase in a first language and executing a random walk over a computer-implemented multipartite graph, wherein the multipartite graph includes a first set of nodes that are representative of phrases in the first language, a second set of nodes that are representative of phrases in a second language, and edges between nodes that are representative of relationships between the respective phrases. The random walk includes traversals over edges of the graph between nodes. The method also includes the act of indicating that a second phrase in the first language is a paraphrase of the first phrase based at least in part upon the random walk.
    Type: Grant
    Filed: May 28, 2010
    Date of Patent: July 9, 2013
    Assignee: Microsoft Corporation
    Inventors: Christopher John Brockett, Stanley Kok, Dengyong Zhou
  • Publication number: 20120166366
    Abstract: The claimed subject matter provides a method for hierarchical classification. The method includes receiving a hierarchical structure with a first level comprising a parent node and a sibling node. The structure also includes a second level comprising two child nodes. The method further includes receiving training examples. Each training example may be associated with a class of the parent node, the sibling node, or the two child nodes. The method also includes generating a first classifier for the first level. The first classifier includes a first hyperplane distinguishing the parent and sibling nodes. A first vector is normal to the first hyperplane. Additionally, the method includes generating a second classifier for the second level. The second classifier includes a second hyperplane distinguishing the two child nodes. A second vector is normal to the second hyperplane. An orthogonality of the second vector in relation to the first vector is maximized.
    Type: Application
    Filed: December 22, 2010
    Publication date: June 28, 2012
    Applicant: MICROSOFT CORPORATION
    Inventors: Dengyong Zhou, Lin Xiao, Mingrui Wu
  • Patent number: 8156129
    Abstract: A system described herein includes an analyzer component that analyzes queries submitted by users and corresponding URLs selected by the users, wherein the queries include a first query and a second query, and wherein the analyzer component determines that the first query and the second query are substantially similar queries. The system additionally includes a correlator component that, responsive to the analyzer component determining that the first query and the second query are substantially similar, generates correlation data that indicates that the first and second queries are substantially similar.
    Type: Grant
    Filed: January 15, 2009
    Date of Patent: April 10, 2012
    Assignee: Microsoft Corporation
    Inventors: Dengyong Zhou, Christopher J. C. Burges, Robert L. Rounthwaite
  • Publication number: 20110295589
    Abstract: A method is described herein that includes acts of receiving a selection of a first phrase in a first language and executing a random walk over a computer-implemented multipartite graph, wherein the multipartite graph includes a first set of nodes that are representative of phrases in the first language, a second set of nodes that are representative of phrases in a second language, and edges between nodes that are representative of relationships between the respective phrases. The random walk includes traversals over edges of the graph between nodes. The method also includes the act of indicating that a second phrase in the first language is a paraphrase of the first phrase based at least in part upon the random walk.
    Type: Application
    Filed: May 28, 2010
    Publication date: December 1, 2011
    Applicant: Microsoft Corporation
    Inventors: Christopher John Brockett, Stanley Kok, Dengyong Zhou
  • Publication number: 20110282816
    Abstract: A spam detection system is disclosed. The system includes a classifier training component that receives a first set of training pages labeled as normal pages and a second set of training pages labeled as spam pages. The training component trains a web page classifier based on both the first set of training pages and the second set of training pages. A spam detector then receives unlabeled web pages and uses the web page classifier to classify the unlabeled web pages as spam pages or normal pages.
    Type: Application
    Filed: April 1, 2011
    Publication date: November 17, 2011
    Applicant: MICROSOFT CORPORATION
    Inventors: Dengyong Zhou, Christopher J. C. Burges, Tao Tao
  • Patent number: 7941391
    Abstract: A collection of web pages is considered as a directed graph in which the pages themselves are nodes and the hyperlinks between the pages are directed edges in the graph. A trusted entity identifies training examples for spam pages and normal pages. A random walk is conducted through the directed graph that includes the collection of web pages and the stationary probabilities, and transitional probabilities, among the nodes in the directed graph are obtained. A classifier training component estimates a classification function that changes slowly on densely connected subgraphs within the directed graph. The classification function assigns a value to each of the nodes in the directed graph and identifies them as spam or normal pages based upon whether the value meets a given function threshold value.
    Type: Grant
    Filed: September 14, 2007
    Date of Patent: May 10, 2011
    Assignee: Microsoft Corporation
    Inventors: Dengyong Zhou, Christopher J. C. Burges, Tao Tao
  • Patent number: 7788254
    Abstract: A collection of web pages is modeled as a directed graph, in which the nodes of the graph are the web pages and directed edges are hyperlinks. Web pages can also be represented by content, or by other features, to obtain a similarity graph over the web pages, where nodes again denote the web pages and the link or edge between each pair of nodes is weighted by a corresponding similarity between those two nodes. A random walk is defined for each graph, and a mixture of the random walks is obtained for the set of graphs. The collection of web pages is then analyzed based on the mixture to obtain a web page analysis result. The web page analysis result can be, for example, clustering of the web pages to discover web communities, classifying or categorizing the web pages, or spam detection indicating whether a given web page is spam or content.
    Type: Grant
    Filed: September 14, 2007
    Date of Patent: August 31, 2010
    Assignee: Microsoft Corporation
    Inventors: Christopher J. C. Burges, Dengyong Zhou
  • Publication number: 20100185649
    Abstract: A system described herein includes an analyzer component that analyzes queries submitted by users and corresponding URLs selected by the users, wherein the queries include a first query and a second query, and wherein the analyzer component determines that the first query and the second query are substantially similar queries. The system additionally includes a correlator component that, responsive to the analyzer component determining that the first query and the second query are substantially similar, generates correlation data that indicates that the first and second queries are substantially similar.
    Type: Application
    Filed: January 15, 2009
    Publication date: July 22, 2010
    Applicant: Microsoft Corporation
    Inventors: Dengyong Zhou, Christopher J. C. Burges, Robert L. Rounthwaite
  • Publication number: 20080275902
    Abstract: A collection of web pages is modeled as a directed graph, in which the nodes of the graph are the web pages and directed edges are hyperlinks. Web pages can also be represented by content, or by other features, to obtain a similarity graph over the web pages, where nodes again denote the web pages and the link or edge between each pair of nodes is weighted by a corresponding similarity between those two nodes. A random walk is defined for each graph, and a mixture of the random walks is obtained for the set of graphs. The collection of web pages is then analyzed based on the mixture to obtain a web page analysis result. The web page analysis result can be, for example, clustering of the web pages to discover web communities, classifying or categorizing the web pages, or spam detection indicating whether a given web page is spam or content.
    Type: Application
    Filed: September 14, 2007
    Publication date: November 6, 2008
    Applicant: Microsoft Corporation
    Inventors: Christopher J. C. Burges, Dengyong Zhou
  • Publication number: 20080275833
    Abstract: A collection of web pages is considered as a directed graph in which the pages themselves are nodes and the hyperlinks between the pages are directed edges in the graph. A trusted entity identifies training examples for spam pages and normal pages. A random walk is conducted through the directed graph that includes the collection of web pages and the stationary probabilities, and transitional probabilities, among the nodes in the directed graph are obtained. A classifier training component estimates a classification function that changes slowly on densely connected subgraphs within the directed graph. The classification function assigns a value to each of the nodes in the directed graph and identifies them as spam or normal pages based upon whether the value meets a given function threshold value.
    Type: Application
    Filed: September 14, 2007
    Publication date: November 6, 2008
    Applicant: Microsoft Corporation
    Inventors: Dengyong Zhou, Christopher J. C. Burges, Tao Tao
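
Several of the abstracts above describe a random walk over a directed graph of web pages, from which transition and stationary probabilities are obtained before pages are scored and thresholded as spam or normal. The Python sketch below is one rough illustration of that general idea on a toy graph; it is not the claimed method of any listed patent. The page names, the `teleport` and `alpha` parameters, the requirement that every page has at least one out-link, and the simple label-smoothing rule are all assumptions made for the example.

```python
# Toy link graph (hypothetical pages): page -> pages it links to.
# Assumption: every page has at least one out-link and every target is a key.
links = {
    "a": ["b", "c"], "b": ["a", "c"], "c": ["a", "b", "x"],
    "x": ["y", "z"], "y": ["x", "z"], "z": ["x", "y"],
}
# Labels from a trusted source: +1 = normal, -1 = spam (illustrative only).
labels = {"a": 1, "x": -1}


def transition_probabilities(links):
    """P[u][v] = 1 / outdegree(u) for each hyperlink u -> v."""
    return {u: {v: 1.0 / len(outs) for v in outs} for u, outs in links.items()}


def stationary_probabilities(P, teleport=0.01, iters=200):
    """Power iteration for the stationary distribution of the walk.

    A small uniform teleport step is an added assumption so that the
    iteration converges on graphs that are not strongly connected.
    """
    n = len(P)
    pi = {u: 1.0 / n for u in P}
    for _ in range(iters):
        nxt = {u: teleport / n for u in P}
        for u, row in P.items():
            for v, p in row.items():
                nxt[v] += (1.0 - teleport) * pi[u] * p
        pi = nxt
    return pi


def smooth_labels(P, labels, alpha=0.8, iters=100):
    """Stand-in scoring rule: each page's score is pulled toward the
    average score of the pages it links to, and toward its own label."""
    f = {u: float(labels.get(u, 0)) for u in P}
    for _ in range(iters):
        f = {
            u: alpha * sum(p * f[v] for v, p in P[u].items())
            + (1.0 - alpha) * labels.get(u, 0)
            for u in P
        }
    return f


P = transition_probabilities(links)
pi = stationary_probabilities(P)
f = smooth_labels(P, labels)
# Threshold the score at 0 (an assumed threshold value).
verdict = {u: ("spam" if score < 0 else "normal") for u, score in f.items()}
```

Scores vary little within each densely linked cluster, so unlabeled pages tend to inherit the label of the cluster they sit in. A faithful implementation would follow the claims and description of the corresponding patent rather than this simplified smoothing rule.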