Patents by Inventor Benyu Zhang

Benyu Zhang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 7711735
    Abstract: Described is a behavioral targeting technology for online advertising, by which an original attribute is uniformly expanded. Users that meet an original attribute are aggregated into a mid-result used to determine similarity relative to candidate attribute types. The most similar candidate attributes are selected for the expanded attribute. A URL/URL pattern suggestion technology is provided, with similarity computed from users/URLs visited by the users. URLs are separated into URL tree nodes, for calculating the number of users who have visited each URL and the number of users who have visited the URL on a sub-tree whose root is the node. URL/URL patterns are output based on similarity. Domains are also suggested based on user-visits. Similarities between pairs of domains may be computed (e.g., offline), with an output for a given domain provided in based on its similarity with each other domain.
    Type: Grant
    Filed: May 15, 2007
    Date of Patent: May 4, 2010
    Assignee: Microsoft Corporation
    Inventors: Min Wu, Chenxi Lin, Benyu Zhang, Zheng Chen, Jian Wang
  • Patent number: 7707129
    Abstract: Embodiments of the invention relate to improvements to the support vector machine (SVM) classification model. When text data is significantly unbalanced (i.e., positive and negative labeled data are in disproportion), the classification quality of standard SVM deteriorates. Embodiments of the invention are directed to a weighted proximal SVM (WPSVM) model that achieves substantially the same accuracy as the traditional SVM model while requiring significantly less computational time. A weighted proximal SVM (WPSVM) model in accordance with embodiments of the invention may include a weight for each training error and a method for estimating the weights, which automatically solves the unbalanced data problem.
    Type: Grant
    Filed: March 20, 2006
    Date of Patent: April 27, 2010
    Assignee: Microsoft Corporation
    Inventors: Dong Zhuang, Benyu Zhang, Zheng Chen, Hua-Jun Zeng, Jian Wang
  • Patent number: 7698339
    Abstract: A method and system for calculating the significance of a sentence within a document is provided. The summarization system calculates the significance of the sentences of a document and selects the most significant sentences as the summary of the document. The summarization system calculates the significance of a sentence based on the “important” words of the document that are contained within the sentence. The summarization system calculates the importance of words of the document using various scoring techniques and then combines the scores to classify a word as important or not important. The summarization system can then be used to identify significant sentences of the document based on the important words that a sentence contains and select significant sentences as a summary of the document.
    Type: Grant
    Filed: August 13, 2004
    Date of Patent: April 13, 2010
    Assignee: Microsoft Corporation
    Inventors: Benyu Zhang, Wei-Ying Ma, Zheng Chen, Hua-Jun Zeng, Dou Shen
  • Patent number: 7693908
    Abstract: Techniques for analyzing and modeling the frequency of queries are provided by a query analysis system. A query analysis system analyzes frequencies of a query over time to determine whether the query is time-dependent or time-independent. The query analysis system forecasts the frequency of time-dependent queries based on their periodicities. The query analysis system forecasts the frequency of time-independent queries based on causal relationships with other queries. To forecast the frequency of time-independent queries, the query analysis system analyzes the frequency of a query over time to identify significant increases in the frequency, which are referred to as “query events” or “events.” The query analysis system forecasts frequencies of time-independent queries based on queries with events that tend to causally precede events of the query to be forecasted.
    Type: Grant
    Filed: June 28, 2007
    Date of Patent: April 6, 2010
    Assignee: Microsoft Corporation
    Inventors: Ning Liu, Jun Yan, Benyu Zhang, Zheng Chen, Jian Wang
  • Patent number: 7693823
    Abstract: Techniques for analyzing and modeling the frequency of queries are provided by a query analysis system. A query analysis system analyzes frequencies of a query over time to determine whether the query is time-dependent or time-independent. The query analysis system forecasts the frequency of time-dependent queries based on their periodicities. The query analysis system forecasts the frequency of time-independent queries based on causal relationships with other queries. To forecast the frequency of time-independent queries, the query analysis system analyzes the frequency of a query over time to identify significant increases in the frequency, which are referred to as “query events” or “events.” The query analysis system forecasts frequencies of time-independent queries based on queries with events that tend to causally precede events of the query to be forecasted.
    Type: Grant
    Filed: June 28, 2007
    Date of Patent: April 6, 2010
    Assignee: Microsoft Corporation
    Inventors: Ning Liu, Jun Yan, Benyu Zhang, Zheng Chen, Jian Wang
  • Patent number: 7689622
    Abstract: Techniques for analyzing and modeling the frequency of queries are provided by a query analysis system. A query analysis system analyzes frequencies of a query over time to determine whether the query is time-dependent or time-independent. The query analysis system forecasts the frequency of time-dependent queries based on their periodicities. The query analysis system forecasts the frequency of time-independent queries based on causal relationships with other queries. To forecast the frequency of time-independent queries, the query analysis system analyzes the frequency of a query over time to identify significant increases in the frequency, which are referred to as “query events” or “events.” The query analysis system forecasts frequencies of time-independent queries based on queries with events that tend to causally precede events of the query to be forecasted.
    Type: Grant
    Filed: June 28, 2007
    Date of Patent: March 30, 2010
    Assignee: Microsoft Corporation
    Inventors: Ning Liu, Jun Yan, Benyu Zhang, Zheng Chen, Jian Wang
  • Patent number: 7689585
    Abstract: Systems and methods for related term suggestion are described. In one aspect, relationships among respective ones of two or more multi-type data objects are identified. The respective ones of the multi-type data objects include at least one object of a first type and at least one object of a second type that is different from the first type. The multi-type data objects are iteratively clustered in view of respective ones of the relationships to generate reinforced clusters.
    Type: Grant
    Filed: April 15, 2004
    Date of Patent: March 30, 2010
    Assignee: Microsoft Corporation
    Inventors: Hua-Jun Zeng, Benyu Zhang, Zheng Chen, Wei-Ying Ma, Li Li, Ying Li, Tarek Najm
  • Patent number: 7685100
    Abstract: Techniques for analyzing and modeling the frequency of queries are provided by a query analysis system. A query analysis system analyzes frequencies of a query over time to determine whether the query is time-dependent or time-independent. The query analysis system forecasts the frequency of time-dependent queries based on their periodicities. The query analysis system forecasts the frequency of time-independent queries based on causal relationships with other queries. To forecast the frequency of time-independent queries, the query analysis system analyzes the frequency of a query over time to identify significant increases in the frequency, which are referred to as “query events” or “events.” The query analysis system forecasts frequencies of time-independent queries based on queries with events that tend to causally precede events of the query to be forecasted.
    Type: Grant
    Filed: June 28, 2007
    Date of Patent: March 23, 2010
    Assignee: Microsoft Corporation
    Inventors: Ning Liu, Jun Yan, Benyu Zhang, Zheng Chen, Jian Wang
  • Patent number: 7685099
    Abstract: Techniques for analyzing and modeling the frequency of queries are provided by a query analysis system. A query analysis system analyzes frequencies of a query over time to determine whether the query is time-dependent or time-independent. The query analysis system forecasts the frequency of time-dependent queries based on their periodicities. The query analysis system forecasts the frequency of time-independent queries based on causal relationships with other queries. To forecast the frequency of time-independent queries, the query analysis system analyzes the frequency of a query over time to identify significant increases in the frequency, which are referred to as “query events” or “events.” The query analysis system forecasts frequencies of time-independent queries based on queries with events that tend to causally precede events of the query to be forecasted.
    Type: Grant
    Filed: June 28, 2007
    Date of Patent: March 23, 2010
    Assignee: Microsoft Corporation
    Inventors: Ning Liu, Jun Yan, Benyu Zhang, Zheng Chen, Jian Wang
  • Publication number: 20100057798
    Abstract: A method and system for adapting search results of a query to the information needs of the user submitting the query is provided. A search system analyzes click-through triplets indicating that a user submitted a query and that the user selected a document from the results of the query. To overcome the large size and sparseness of the click-through data, the search system when presented with an input triplet comprising a user, a query, and a document determines a probability that the user will find the input document important by smoothing the click-through triplets. The search system then orders documents of the result based on the probability of their importance to the input user.
    Type: Application
    Filed: November 11, 2009
    Publication date: March 4, 2010
    Applicant: Microsoft Corporation
    Inventors: Benyu Zhang, Gui-Rong Xue, Hua-Jun Zeng, Wei-Ying Ma, Xue-Mei Jiang, Zheng Chen
  • Patent number: 7664735
    Abstract: A method and system for ranking documents of search results based on information richness and diversity of topics. A ranking system determines the information richness of each document within a search result. The ranking system groups documents of a search result based on their relatedness, meaning that they are directed to similar topics. The ranking system ranks the documents to ensure that the highest ranking documents may include at least one document covering each topic, that is, one document from each of the groups. The ranking system selects the document from each group that has the highest information richness of the documents within the group. When the documents are presented to a user in rank order, the user will likely find on the first page of the search result documents that cover a variety of topics, rather than just a single popular topic.
    Type: Grant
    Filed: April 30, 2004
    Date of Patent: February 16, 2010
    Assignee: Microsoft Corporation
    Inventors: Benyu Zhang, Zheng Chen, Hua-Jun Zeng, Wei-Ying Ma
  • Patent number: 7653627
    Abstract: Systems and methods for analyzing the content of online conversations using data mining technologies are provided. Methods and systems for utilizing the results of data mining technology implementation to retrieve relevant advertising content and/or other relevant information for display in association with the content of an online conversation are also provided.
    Type: Grant
    Filed: May 13, 2005
    Date of Patent: January 26, 2010
    Assignee: Microsoft Corporation
    Inventors: Ying Li, Li Li, Tarek Najm, Xianfang Wang, Hongbin Gao, Hua-Jun Zeng, Benyu Zhang, Zheng Chen
  • Publication number: 20090313706
    Abstract: A method and system for detecting whether an outgoing communication contains confidential information or other target information is provided. The detection system is provided with a collection of documents that contain confidential information, referred to as “confidential documents.” When the detection system is provided with an outgoing communication, it compares the content of the outgoing communication to the content of the confidential documents. If the outgoing communication contains confidential information, then the detection system may prevent the outgoing communication from being sent outside the organization. The detection system detects confidential information based on the similarity between the content of an outgoing communication and the content of confidential documents that are known to contain confidential information.
    Type: Application
    Filed: July 27, 2009
    Publication date: December 17, 2009
    Applicant: Microsoft Corporation
    Inventors: Benyu Zhang, Hua-Jun Zeng, Wei-Ying Ma, Zheng Chen
  • Patent number: 7634471
    Abstract: Extraction of semantic information and the generation of semantic attributes allows for improved organization and management of data. Semantic attributes are automatically generated and eliminate the need for manual entry of attribute information. A semantic file network may further be constructed based on similarities between files that are based on the semantic attribute information. Semantic links representing a semantic relationship may be built between similar or relevant files. In addition, user operations and user operation patterns may also be considered in building the file network. Semantic attributes and information may further facilitate browsing the file systems as well as improve the accuracy and speed of queries.
    Type: Grant
    Filed: March 30, 2006
    Date of Patent: December 15, 2009
    Assignee: Microsoft Corporation
    Inventors: Zheng Chen, Lei Li, Chenxi Lin, Qiaoling Liu, Jian Wang, Benyu Zhang
  • Patent number: 7630976
    Abstract: A method and system for adapting search results of a query to the information needs of the user submitting the query is provided. A search system analyzes click-through triplets indicating that a user submitted a query and that the user selected a document from the results of the query. To overcome the large size and sparseness of the click-through data, the search system when presented with an input triplet comprising a user, a query, and a document determines a probability that the user will find the input document important by smoothing the click-through triplets. The search system then orders documents of the result based on the probability of their importance to the input user.
    Type: Grant
    Filed: May 10, 2005
    Date of Patent: December 8, 2009
    Assignee: Microsoft Corporation
    Inventors: Benyu Zhang, Gui-Rong Xue, Hua-Jun Zeng, Wei-Ying Ma, Xue-Mei Jiang, Zheng Chen
  • Patent number: 7627542
    Abstract: A method and system are provided for identifying groups in large-scale networks. The large-scale networks include a collection of nodes and edges that may represent relationships between entities or individuals. The large-scale network is split into a number of fractions satisfying an edge threshold. In turn, the nodes in each fraction are merged to generate one or more clusters based on a specified similarity metric. The large-scale network is recursively split and clustered until distinct groups are identified.
    Type: Grant
    Filed: September 22, 2006
    Date of Patent: December 1, 2009
    Assignee: Microsoft Corporation
    Inventors: Jeremy Tantrum, Heng Zhang, Teresa B. Mah, Benyu Zhang, Abhinai Srivastava
  • Patent number: 7624130
    Abstract: Extraction of semantic information and the generation of semantic attributes allows for improved organization and management of data. Semantic attributes are automatically generated and eliminate the need for manual entry of attribute information. A semantic file network may further be constructed based on similarities between files that are based on the semantic attribute information. Semantic links representing a semantic relationship may be built between similar or relevant files. In addition, user operations and user operation patterns may also be considered in building the file network. Semantic attributes and information may further facilitate browsing the file systems as well as improve the accuracy and speed of queries.
    Type: Grant
    Filed: March 30, 2006
    Date of Patent: November 24, 2009
    Assignee: Microsoft Corporation
    Inventors: Zheng Chen, Lei Li, Chenxi Lin, Qiaoling Liu, Jian Wang, Benyu Zhang
  • Patent number: 7617176
    Abstract: A clustering architecture that dynamically groups the search result documents into clusters labeled by phrases extracted from the search result snippets. Documents related to the same topic usually share a common vocabulary. The words are first clustered based on their co-occurrences and each cluster forms a potentially interesting topic. Keywords are chosen and then clustered by counting co-occurrences of pairs of keywords. Documents are assigned to relevant topics based on the feature vectors of the clusters.
    Type: Grant
    Filed: July 13, 2004
    Date of Patent: November 10, 2009
    Assignee: Microsoft Corporation
    Inventors: Hua-Jun Zeng, Qicai He, Guimei Liu, Zheng Chen, Benyu Zhang, Wei-Ying Ma
  • Patent number: 7594013
    Abstract: A method of creating a personal home page containing information of interest assembled from various web sites. The method includes the partitioning of web pages into web blocks. Users may collect various web blocks from different web pages and utilize those web blocks to define the dynamic personal homepage. In addition, the web blocks may be tracked to update content in the personal home page based on corresponding changes in the original web page.
    Type: Grant
    Filed: May 24, 2005
    Date of Patent: September 22, 2009
    Assignee: Microsoft Corporation
    Inventors: Jian Wang, Hua-Jun Zeng, Chenxi Lin, Zheng Chen, Benyu Zhang, Bing Sun
  • Patent number: 7594277
    Abstract: A method and system for detecting whether an outgoing communication contains confidential information or other target information is provided. The detection system is provided with a collection of documents that contain confidential information, referred to as “confidential documents.” When the detection system is provided with an outgoing communication, it compares the content of the outgoing communication to the content of the confidential documents. If the outgoing communication contains confidential information, then the detection system may prevent the outgoing communication from being sent outside the organization. The detection system detects confidential information based on the similarity between the content of an outgoing communication and the content of confidential documents that are known to contain confidential information.
    Type: Grant
    Filed: June 30, 2004
    Date of Patent: September 22, 2009
    Assignee: Microsoft Corporation
    Inventors: Benyu Zhang, Hua-Jun Zeng, Wei-Ying Ma, Zheng Chen