Patents by Inventor Benyu Zhang
Benyu Zhang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 7917587
Abstract: A method and system for calculating the importance of persons based on interpersonal relationships and prioritizing communications based on importance of participants in the communications is provided. A prioritization system identifies relationships between persons and identifies the importance of a person to other persons based on these relationships. After the prioritization system identifies the importance of persons, the prioritization system can prioritize communications based on the importance of the senders or recipients.
Type: Grant
Filed: July 30, 2004
Date of Patent: March 29, 2011
Assignee: Microsoft Corporation
Inventors: Hua-Jun Zeng, Zheng Chen, Benyu Zhang, Wei-Ying Ma
-
Patent number: 7873904
Abstract: Systems and methods are described for an Internet visualization system and related user interfaces. In one implementation, the system analyzes Internet search logs to determine most popular search queries across the world at a current time. A user interface displays a keyword of each of the most popular queries in a single visual display that relates each query to a geographical location of greatest popularity. The system can also filter queries according to demographics. In one implementation the user interface provides a 3-dimensional Internet visualization that adopts an ocean or seascape theme. The ocean floor displays a map of the world, and query bubbles rise from geographical locations on the map. The size and duration of each query bubble denotes the relative popularity of a given query.
Type: Grant
Filed: January 10, 2008
Date of Patent: January 18, 2011
Assignee: Microsoft Corporation
Inventors: Min Wang, Weizhu Chen, Benyu Zhang, Zheng Chen, Jian Wang
-
Patent number: 7870132
Abstract: The claimed subject matter is directed to constructing query hierarchies in response to a query request. To construct a query hierarchy, a list of related candidate queries is generated in response to the received query request. The list of related candidate queries is generated by determining the relative coverage of information shared by the candidate queries and the query request. Relationships between the submitted query request and the candidate queries in the list are determined based upon the extent of relative coverage of information shared by the candidate queries and the query request. A query hierarchy is then constructed to reflect the determined relationships between the query request and the candidate queries.
Type: Grant
Filed: January 28, 2008
Date of Patent: January 11, 2011
Assignee: Microsoft Corporation
Inventors: Weizhu Chen, Benyu Zhang, Zheng Chen, Jian Wang, Dou Shen
-
Patent number: 7861149
Abstract: Computer-readable media having computer-executable instructions and apparatuses provide a keyphrase navigation map (KNM) for a document page. Keyphrases are extracted from the document page. Keyphrase clusters are subsequently formed by a measure of relevancy, and a salient keyphrase is determined for each cluster. A thumbnail is formed with tags corresponding to the salient keyphrases. A selected tag is expanded with associated keyphrases. An associated keyphrase may be further selected in order to facilitate the navigation of the document page. The displayed tags on the thumbnail are positioned in accordance with locations of associated keyphrases in the document page.
Type: Grant
Filed: March 9, 2006
Date of Patent: December 28, 2010
Assignee: Microsoft Corporation
Inventors: Min Wang, Benyu Zhang, Hua-Jun Zeng, Jian Wang, Shiguang Liu, Zheng Chen
-
Patent number: 7849089
Abstract: A method and system for adapting search results of a query to the information needs of the user submitting the query is provided. A search system analyzes click-through triplets indicating that a user submitted a query and that the user selected a document from the results of the query. To overcome the large size and sparseness of the click-through data, the search system when presented with an input triplet comprising a user, a query, and a document determines a probability that the user will find the input document important by smoothing the click-through triplets. The search system then orders documents of the result based on the probability of their importance to the input user.
Type: Grant
Filed: November 11, 2009
Date of Patent: December 7, 2010
Assignee: Microsoft Corporation
Inventors: Benyu Zhang, Gui-Rong Xue, Hua-Jun Zeng, Wei-Ying Ma, Xue-Mei Jiang, Zheng Chen
-
Patent number: 7844449
Abstract: A scalable two-pass probabilistic latent semantic analysis (PLSA) methodology is disclosed that may perform more efficiently, and in some cases more accurately, than traditional PLSA, especially where large and/or sparse data sets are provided for analysis. The improved methodology can greatly reduce the storage and/or computational costs of training a PLSA model. In the first pass of the two-pass methodology, objects are clustered into groups, and PLSA is performed on the groups instead of the original individual objects. In the second pass, the conditional probability of a latent class, given an object, is obtained. This may be done by extending the training results of the first pass. During the second pass, the most likely latent classes for each object are identified.
Type: Grant
Filed: March 30, 2006
Date of Patent: November 30, 2010
Assignee: Microsoft Corporation
Inventors: Chenxi Lin, Jie Han, Guirong Xue, Hua-Jun Zeng, Benyu Zhang, Zheng Chen, Jian Wang
-
Patent number: 7822752
Abstract: Described is an efficient retrieval mechanism that quickly locates documents (e.g., corresponding to online advertisements) based on query term discrimination. A topmost subset (e.g., two) of search terms is selected according to their ranked importance, e.g., as ranked by inverted document frequency. The topmost terms are then used to narrow the number of rows of an inverted query index that are searched to find document identifiers and associated scores, such as computed offline by a BM25 algorithm. For example, for each document identifier of each important term, a fast search within each of the narrowed subset of rows (that also contain that document identifier) may be performed by comparing document identifiers to jump a pointer within each other row, followed by a binary search to locate a particular document. The scores of the set of particular documents may then be used to rank their relative importance for returning as results.
Type: Grant
Filed: May 18, 2007
Date of Patent: October 26, 2010
Assignee: Microsoft Corporation
Inventors: Chenxi Lin, Lei Ji, Huajun Zeng, Benyu Zhang, Zheng Chen, Jian Wang
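The retrieval flow in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the function names, the smoothed IDF formula, and the index layout (each term mapped to a row of `(doc_id, score)` pairs sorted by `doc_id`) are assumptions made for illustration, and the stored scores are placeholders standing in for values computed offline (for example by BM25). The pointer-jumping step is simplified to a plain binary search per row.

```python
import math
from bisect import bisect_left


def idf(term, index, num_docs):
    """Smoothed inverse document frequency: rarer terms score higher (assumed formula)."""
    df = len(index.get(term, []))
    return math.log((num_docs + 1) / (df + 1))


def find_score(postings, doc_id):
    """Binary-search a row sorted by doc_id; return its stored score, or None if absent."""
    pos = bisect_left(postings, (doc_id,))
    if pos < len(postings) and postings[pos][0] == doc_id:
        return postings[pos][1]
    return None


def search(query_terms, index, num_docs, top_k=2):
    """Rank documents that contain all of the top_k most discriminative query terms."""
    # Select the topmost terms by IDF; less discriminative terms are ignored.
    ranked = sorted(query_terms, key=lambda t: idf(t, index, num_docs), reverse=True)
    important = [t for t in ranked[:top_k] if t in index]
    if not important:
        return []
    # Walk the shortest row and probe the other narrowed rows by binary search.
    important.sort(key=lambda t: len(index[t]))
    first, rest = important[0], important[1:]
    results = []
    for doc_id, score in index[first]:
        total = score
        for term in rest:
            other = find_score(index[term], doc_id)
            if other is None:
                break  # document is missing from one row, so skip it
            total += other
        else:
            results.append((doc_id, total))
    # Highest combined score first.
    return sorted(results, key=lambda r: r[1], reverse=True)
```

Walking the shortest row first keeps the number of binary-search probes small, which is the main reason for narrowing the query to its most discriminative terms.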
-
Patent number: 7818330
Abstract: Described is a technology by which blocks of web pages may be selected, such as for building a user-personalized web page containing selected blocks. A selection mechanism, such as a browser toolbar add-on, provides a user interface for selecting blocks, and records information about selected blocks. A block tracking mechanism (e.g., a daemon program) uses the information to locate selected blocks of the web pages, including when the web page containing the block is updated with respect to content and/or layout. The block tracking mechanism may update a local gadget that, when invoked, such as by browsing to a particular web page, shows updated versions of the block on a personalized web page. Blocks may be efficiently located by processing trees representing web pages into reduced trees, and then by performing a minimum distance mapping algorithm on the reduced trees.
Type: Grant
Filed: May 9, 2007
Date of Patent: October 19, 2010
Assignee: Microsoft Corporation
Inventors: Min Wu, Chenxi Lin, Benyu Zhang, Huajun Zeng, Zheng Chen, Jian Wang
-
Patent number: 7788131
Abstract: Seed keywords are leveraged to provide expanded keywords that are then associated with relevant advertisers. Instances can also include locating potential advertisers based on the expanded keywords. Inverse lookup techniques are employed to determine which keywords are associated with an advertiser. Filtering can then be employed to eliminate inappropriate keywords for that advertiser. The keywords are then automatically revealed to the advertiser for consideration as relevant search terms for their advertisements. In this manner, revenue for a search engine and/or for an advertiser can be substantially enhanced through the automatic expansion of relevant search terms. Advertisers also benefit by having larger and more relevant search term selections automatically available to them, saving them both time and money.
Type: Grant
Filed: December 15, 2005
Date of Patent: August 31, 2010
Assignee: Microsoft Corporation
Inventors: Shuzhen Nong, Ying Li, Tarek Najm, Li Li, Hua-Jun Zeng, Zheng Chen, Benyu Zhang
-
Patent number: 7779001
Abstract: The described systems, methods and data structures are directed to ranking Web pages with hierarchical considerations. The hierarchical structures and the linking relationships of the World Wide Web are used to provide a page importance ranking for Web searches. The linking relationships are aggregated to a high level node at each of the hierarchical structures. A link graph analysis is performed on the aggregated linking relationships to determine the importance of each node. The importance of each node may be propagated to pages associated with that node. For each page, the importance of that page and the importance of the node associated with the page are used to calculate the page importance ranking.
Type: Grant
Filed: October 29, 2004
Date of Patent: August 17, 2010
Assignee: Microsoft Corporation
Inventors: Hua-Jun Zeng, Zheng Chen, Benyu Zhang, Wei-Ying Ma, Guirong Xue
-
Patent number: 7774340
Abstract: A system for calculating the importance of web pages is provided. The web pages are organized hierarchically into collections. The system calculates the importance of each collection based on inter-collection links from a web page in one collection to a web page in another collection. The system then calculates the importance of web pages in the collections with a high calculated importance based on links between the web pages in those collections using, for example, a conventional page rank algorithm. The system may also calculate the importance of web pages in each collection with a low calculated importance separately based on the links between the web pages in the collection using, for example, a conventional page rank algorithm.
Type: Grant
Filed: June 30, 2004
Date of Patent: August 10, 2010
Assignee: Microsoft Corporation
Inventors: Benyu Zhang, Hua-Jun Zeng, Wei-Ying Ma, Zheng Chen
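The two-level ranking this abstract describes can be sketched roughly as follows. This is an illustration under stated assumptions, not the patented system: the function names are hypothetical, the "conventional page rank algorithm" is represented by a textbook power iteration, and the cutoff between "high" and "low" collection importance (the mean collection rank) is an arbitrary choice, since the abstract does not specify one.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Textbook power-iteration PageRank; `links` maps a node to the nodes it links to."""
    nodes = set(links) | {t for targets in links.values() for t in targets}
    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        new_rank = {node: (1.0 - damping) / n for node in nodes}
        for node in nodes:
            targets = links.get(node, [])
            if targets:
                share = damping * rank[node] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # Dangling node: spread its rank evenly over every node.
                share = damping * rank[node] / n
                for other in nodes:
                    new_rank[other] += share
        rank = new_rank
    return rank


def collection_links(page_links, collection_of):
    """Aggregate page-level links into links between different collections."""
    aggregated = {c: [] for c in set(collection_of.values())}
    for src, targets in page_links.items():
        for dst in targets:
            a, b = collection_of[src], collection_of[dst]
            if a != b:
                aggregated[a].append(b)
    return aggregated


def rank_pages(page_links, collection_of):
    """Rank collections first, then rank pages within high and low collections."""
    coll_rank = pagerank(collection_links(page_links, collection_of))
    cutoff = sum(coll_rank.values()) / len(coll_rank)  # assumed cutoff: mean rank
    high = {c for c, r in coll_rank.items() if r >= cutoff}

    def subgraph(pages):
        # Keep only the links that stay inside the given set of pages.
        return {p: [t for t in page_links.get(p, []) if t in pages] for p in pages}

    page_rank = {}
    # Pages of all high-importance collections are ranked together.
    high_pages = {p for p, c in collection_of.items() if c in high}
    if high_pages:
        page_rank.update(pagerank(subgraph(high_pages)))
    # Each low-importance collection is ranked separately.
    for c in set(collection_of.values()) - high:
        pages = {p for p, owner in collection_of.items() if owner == c}
        page_rank.update(pagerank(subgraph(pages)))
    return coll_rank, page_rank
```

Because the high and low groups are ranked in separate runs, the resulting page scores are comparable only within the same run; combining them into a single ordering would need an extra step that this sketch leaves out.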
-
Patent number: 7747618
Abstract: A system for augmenting click-through data with latent information present in the click-through data for use in generating search results that are better tailored to the information needs of a user submitting a query is provided. The augmentation system creates a three-dimensional matrix with the dimensions of users, queries, and documents. The augmentation system then performs a three-order singular value decomposition of the three-dimensional matrix to generate a three-dimensional core singular value matrix and a left singular matrix for each dimension. The augmentation system finally multiplies the three-dimensional core singular value matrix by the left singular matrices to generate an augmented three-dimensional matrix that explicitly contains the information that was latent in the un-augmented three-dimensional matrix.
Type: Grant
Filed: September 8, 2005
Date of Patent: June 29, 2010
Assignee: Microsoft Corporation
Inventors: Hua-Jun Zeng, Jian-Tao Sun, Wei-Ying Ma, Zheng Chen, Benyu Zhang, Huan Liu
-
Patent number: 7711735
Abstract: Described is a behavioral targeting technology for online advertising, by which an original attribute is uniformly expanded. Users that meet an original attribute are aggregated into a mid-result used to determine similarity relative to candidate attribute types. The most similar candidate attributes are selected for the expanded attribute. A URL/URL pattern suggestion technology is provided, with similarity computed from users/URLs visited by the users. URLs are separated into URL tree nodes, for calculating the number of users who have visited each URL and the number of users who have visited the URL on a sub-tree whose root is the node. URL/URL patterns are output based on similarity. Domains are also suggested based on user-visits. Similarities between pairs of domains may be computed (e.g., offline), with an output for a given domain provided based on its similarity with each other domain.
Type: Grant
Filed: May 15, 2007
Date of Patent: May 4, 2010
Assignee: Microsoft Corporation
Inventors: Min Wu, Chenxi Lin, Benyu Zhang, Zheng Chen, Jian Wang
-
Patent number: 7707129
Abstract: Embodiments of the invention relate to improvements to the support vector machine (SVM) classification model. When text data is significantly unbalanced (i.e., positive and negative labeled data are in disproportion), the classification quality of standard SVM deteriorates. Embodiments of the invention are directed to a weighted proximal SVM (WPSVM) model that achieves substantially the same accuracy as the traditional SVM model while requiring significantly less computational time. A weighted proximal SVM (WPSVM) model in accordance with embodiments of the invention may include a weight for each training error and a method for estimating the weights, which automatically solves the unbalanced data problem.
Type: Grant
Filed: March 20, 2006
Date of Patent: April 27, 2010
Assignee: Microsoft Corporation
Inventors: Dong Zhuang, Benyu Zhang, Zheng Chen, Hua-Jun Zeng, Jian Wang
-
Patent number: 7698339
Abstract: A method and system for calculating the significance of a sentence within a document is provided. The summarization system calculates the significance of the sentences of a document and selects the most significant sentences as the summary of the document. The summarization system calculates the significance of a sentence based on the “important” words of the document that are contained within the sentence. The summarization system calculates the importance of words of the document using various scoring techniques and then combines the scores to classify a word as important or not important. The summarization system can then be used to identify significant sentences of the document based on the important words that a sentence contains and select significant sentences as a summary of the document.
Type: Grant
Filed: August 13, 2004
Date of Patent: April 13, 2010
Assignee: Microsoft Corporation
Inventors: Benyu Zhang, Wei-Ying Ma, Zheng Chen, Hua-Jun Zeng, Dou Shen
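A minimal sketch of the approach in this abstract follows. It is illustrative only: the abstract does not name the scoring techniques, so the two used here (relative word frequency and presence in the first sentence), their weights, the threshold, and the small stopword list are all assumptions, as are the function names.

```python
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption, not taken from the patent).
STOPWORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "it", "that",
             "for", "on", "with", "as", "are", "was", "by"}


def split_sentences(text):
    """Split on whitespace that follows sentence-ending punctuation."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(text):
    """Lowercase word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def important_words(sentences, threshold=0.5):
    """Combine two word scores and classify each word as important or not."""
    tokens = [[w for w in tokenize(s) if w not in STOPWORDS] for s in sentences]
    freq = Counter(w for sentence in tokens for w in sentence)
    if not freq:
        return set()
    max_freq = max(freq.values())
    lead = set(tokens[0])
    important = set()
    for word, count in freq.items():
        frequency_score = count / max_freq               # assumed scorer 1
        position_score = 1.0 if word in lead else 0.0    # assumed scorer 2
        combined = 0.7 * frequency_score + 0.3 * position_score
        if combined >= threshold:
            important.add(word)
    return important


def summarize(text, max_sentences=2):
    """Return the sentences holding the most important words, in document order."""
    sentences = split_sentences(text)
    keywords = important_words(sentences)
    scored = [(len(keywords & set(tokenize(s))), i) for i, s in enumerate(sentences)]
    top = sorted(scored, key=lambda item: (-item[0], item[1]))[:max_sentences]
    return [sentences[i] for _, i in sorted(top, key=lambda item: item[1])]
```

Classifying words first and scoring sentences second keeps the two steps independent, so a different word scorer can be swapped in without touching the sentence selection.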
-
Patent number: 7693823
Abstract: Techniques for analyzing and modeling the frequency of queries are provided by a query analysis system. A query analysis system analyzes frequencies of a query over time to determine whether the query is time-dependent or time-independent. The query analysis system forecasts the frequency of time-dependent queries based on their periodicities. The query analysis system forecasts the frequency of time-independent queries based on causal relationships with other queries. To forecast the frequency of time-independent queries, the query analysis system analyzes the frequency of a query over time to identify significant increases in the frequency, which are referred to as “query events” or “events.” The query analysis system forecasts frequencies of time-independent queries based on queries with events that tend to causally precede events of the query to be forecasted.
Type: Grant
Filed: June 28, 2007
Date of Patent: April 6, 2010
Assignee: Microsoft Corporation
Inventors: Ning Liu, Jun Yan, Benyu Zhang, Zheng Chen, Jian Wang
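Two of the steps named in this abstract, telling periodic (time-dependent) queries apart and spotting "query events", can be sketched as below. The abstract does not say how either is computed, so everything here is an assumption: periodicity is tested with a simple autocorrelation at a given lag, and an event is flagged when a count exceeds the mean of a trailing window by `k` standard deviations. Function names, the window size, and both thresholds are hypothetical.

```python
from statistics import mean, pstdev


def autocorrelation(series, lag):
    """Autocorrelation of a frequency series at the given lag (0.0 if undefined)."""
    m = mean(series)
    variance = sum((x - m) ** 2 for x in series)
    if variance == 0 or lag <= 0 or lag >= len(series):
        return 0.0
    covariance = sum((series[i] - m) * (series[i + lag] - m)
                     for i in range(len(series) - lag))
    return covariance / variance


def is_time_dependent(series, lag, threshold=0.5):
    """Treat a query as time-dependent when it repeats strongly at the given lag."""
    return autocorrelation(series, lag) >= threshold


def find_events(series, window=7, k=3.0):
    """Return indices where the frequency jumps well above the trailing window."""
    events = []
    for i in range(window, len(series)):
        history = series[i - window:i]
        baseline, spread = mean(history), pstdev(history)
        # Floor the spread at 1.0 so a perfectly flat history still needs a real jump.
        if series[i] > baseline + k * max(spread, 1.0):
            events.append(i)
    return events
```

For example, `find_events` applied to daily counts would mark the day a query suddenly spikes, and the event lists of two queries could then be compared to see which tends to precede the other.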
-
Patent number: 7693908
Abstract: Techniques for analyzing and modeling the frequency of queries are provided by a query analysis system. A query analysis system analyzes frequencies of a query over time to determine whether the query is time-dependent or time-independent. The query analysis system forecasts the frequency of time-dependent queries based on their periodicities. The query analysis system forecasts the frequency of time-independent queries based on causal relationships with other queries. To forecast the frequency of time-independent queries, the query analysis system analyzes the frequency of a query over time to identify significant increases in the frequency, which are referred to as “query events” or “events.” The query analysis system forecasts frequencies of time-independent queries based on queries with events that tend to causally precede events of the query to be forecasted.
Type: Grant
Filed: June 28, 2007
Date of Patent: April 6, 2010
Assignee: Microsoft Corporation
Inventors: Ning Liu, Jun Yan, Benyu Zhang, Zheng Chen, Jian Wang
-
Patent number: 7689585
Abstract: Systems and methods for related term suggestion are described. In one aspect, relationships among respective ones of two or more multi-type data objects are identified. The respective ones of the multi-type data objects include at least one object of a first type and at least one object of a second type that is different from the first type. The multi-type data objects are iteratively clustered in view of respective ones of the relationships to generate reinforced clusters.
Type: Grant
Filed: April 15, 2004
Date of Patent: March 30, 2010
Assignee: Microsoft Corporation
Inventors: Hua-Jun Zeng, Benyu Zhang, Zheng Chen, Wei-Ying Ma, Li Li, Ying Li, Tarek Najm
-
Patent number: 7689622
Abstract: Techniques for analyzing and modeling the frequency of queries are provided by a query analysis system. A query analysis system analyzes frequencies of a query over time to determine whether the query is time-dependent or time-independent. The query analysis system forecasts the frequency of time-dependent queries based on their periodicities. The query analysis system forecasts the frequency of time-independent queries based on causal relationships with other queries. To forecast the frequency of time-independent queries, the query analysis system analyzes the frequency of a query over time to identify significant increases in the frequency, which are referred to as “query events” or “events.” The query analysis system forecasts frequencies of time-independent queries based on queries with events that tend to causally precede events of the query to be forecasted.
Type: Grant
Filed: June 28, 2007
Date of Patent: March 30, 2010
Assignee: Microsoft Corporation
Inventors: Ning Liu, Jun Yan, Benyu Zhang, Zheng Chen, Jian Wang
-
Patent number: 7685100
Abstract: Techniques for analyzing and modeling the frequency of queries are provided by a query analysis system. A query analysis system analyzes frequencies of a query over time to determine whether the query is time-dependent or time-independent. The query analysis system forecasts the frequency of time-dependent queries based on their periodicities. The query analysis system forecasts the frequency of time-independent queries based on causal relationships with other queries. To forecast the frequency of time-independent queries, the query analysis system analyzes the frequency of a query over time to identify significant increases in the frequency, which are referred to as “query events” or “events.” The query analysis system forecasts frequencies of time-independent queries based on queries with events that tend to causally precede events of the query to be forecasted.
Type: Grant
Filed: June 28, 2007
Date of Patent: March 23, 2010
Assignee: Microsoft Corporation
Inventors: Ning Liu, Jun Yan, Benyu Zhang, Zheng Chen, Jian Wang