Patents by Inventor Benyu Zhang

Benyu Zhang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

IDENTIFICATION OF SIMILAR QUERIES BASED ON OVERALL AND PARTIAL SIMILARITY OF TIME SERIES

Publication number: 20090006365

Abstract: Techniques for identifying similar queries based on their overall similarity and partial similarity of time series of frequencies of the queries are provided. To identify queries that are similar to a target query, the query analysis system generates, for each query, an overall similarity score for that query and the target query based on the time series of the query and the target query. The query analysis system also generates, for each query, partial similarity scores for the query and the target query based on various time sub-series of the overall time series of the queries. The query analysis system then identifies queries as being similar to the target query based on the overall similarity scores and the partial similarity scores of the queries.

Type: Application

Filed: June 28, 2007

Publication date: January 1, 2009

Applicant: Microsoft Corporation

Inventors: Ning Liu, Jun Yan, Benyu Zhang, Zheng Chen, Jian Wang
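
As a rough illustration of the overall-plus-partial similarity idea in the abstract above, the sketch below scores two query-frequency series over the whole series and over fixed-length sub-series. The use of Pearson correlation, the non-overlapping window of length 4, and the function names are assumptions made for illustration; the abstract does not specify the similarity measure or how the sub-series are chosen.

```python
from math import sqrt

def pearson(a, b):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0
    return cov / sqrt(var_a * var_b)

def similarity_scores(query_series, target_series, window=4):
    """Return (overall score, partial scores over non-overlapping windows)."""
    overall = pearson(query_series, target_series)
    partial = [
        pearson(query_series[i:i + window], target_series[i:i + window])
        for i in range(0, len(query_series) - window + 1, window)
    ]
    return overall, partial

# Hypothetical weekly frequencies for a target query and a candidate query.
target = [1, 2, 3, 4, 10, 8, 6, 4]
query = [2, 4, 6, 8, 1, 2, 3, 4]
overall, partial = similarity_scores(query, target)
```

In this made-up example the two series move together in the first window and in opposite directions in the second, which the partial scores expose even though the overall score alone would hide it.
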
FORECASTING SEARCH QUERIES BASED ON TIME DEPENDENCIES

Publication number: 20090006313

Abstract: Techniques for analyzing and modeling the frequency of queries are provided by a query analysis system. A query analysis system analyzes frequencies of a query over time to determine whether the query is time-dependent or time-independent. The query analysis system forecasts the frequency of time-dependent queries based on their periodicities. The query analysis system forecasts the frequency of time-independent queries based on causal relationships with other queries. To forecast the frequency of time-independent queries, the query analysis system analyzes the frequency of a query over time to identify significant increases in the frequency, which are referred to as “query events” or “events.” The query analysis system forecasts frequencies of time-independent queries based on queries with events that tend to causally precede events of the query to be forecasted.

Type: Application

Filed: June 28, 2007

Publication date: January 1, 2009

Applicant: Microsoft Corporation

Inventors: Ning Liu, Jun Yan, Benyu Zhang, Zheng Chen, Jian Wang
IDENTIFICATION OF EVENTS OF SEARCH QUERIES

Publication number: 20090006294

Abstract: Techniques for analyzing and modeling the frequency of queries are provided by a query analysis system. A query analysis system analyzes frequencies of a query over time to determine whether the query is time-dependent or time-independent. The query analysis system forecasts the frequency of time-dependent queries based on their periodicities. The query analysis system forecasts the frequency of time-independent queries based on causal relationships with other queries. To forecast the frequency of time-independent queries, the query analysis system analyzes the frequency of a query over time to identify significant increases in the frequency, which are referred to as “query events” or “events.” The query analysis system forecasts frequencies of time-independent queries based on queries with events that tend to causally precede events of the query to be forecasted.

Type: Application

Filed: June 28, 2007

Publication date: January 1, 2009

Applicant: Microsoft Corporation

Inventors: Ning Liu, Jun Yan, Benyu Zhang, Zheng Chen, Jian Wang
User segment suggestion for online advertising

Publication number: 20080288491

Abstract: Described is a behavioral targeting technology for online advertising, by which an original attribute is uniformly expanded. Users that meet an original attribute are aggregated into a mid-result used to determine similarity relative to candidate attribute types. The most similar candidate attributes are selected for the expanded attribute. A URL/URL pattern suggestion technology is provided, with similarity computed from users/URLs visited by the users. URLs are separated into URL tree nodes, for calculating the number of users who have visited each URL and the number of users who have visited the URL on a sub-tree whose root is the node. URL/URL patterns are output based on similarity. Domains are also suggested based on user-visits. Similarities between pairs of domains may be computed (e.g., offline), with an output for a given domain provided based on its similarity with each other domain.

Type: Application

Filed: May 15, 2007

Publication date: November 20, 2008

Applicant: Microsoft Corporation

Inventors: Min Wu, Chenxi Lin, Benyu Zhang, Zheng Chen, Jian Wang
Ranking online advertisement using product and seller reputation

Publication number: 20080288481

Abstract: Described is a technology by which online advertisements for returning with a query response are ranked according to reputation. The reputation may correspond to a product or service and/or seller reputation. In one example, a set of relevant advertisement items are located and ranked using reputation data as a factor. For example, for each item, a ranking value is based on a mathematical combination of a product reputation score, a seller reputation score and a relevance score, with the items ranked by their computed values. The scores may be weighted differently. The reputation data may be mined from a review source, such as customer reviews available on the web. In one example implementation, a 3-gram model that considers terms in the review along with the two terms preceding each term is used to analyze the reviews to determine whether each review is positive or negative with respect to the reputation.

Type: Application

Filed: May 15, 2007

Publication date: November 20, 2008

Applicant: Microsoft Corporation

Inventors: Huajun Zeng, Chenxi Lin, Dingyi Han, Benyu Zhang, Zheng Chen, Jian Wang
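
The "mathematical combination" of scores mentioned in the abstract above could take many forms. A minimal sketch, assuming a simple weighted sum, is shown below; the weights, field names, and sample values are hypothetical and are not taken from the application.

```python
def rank_ads(ads, w_product=0.3, w_seller=0.3, w_relevance=0.4):
    """Sort ads by a weighted sum of product reputation, seller reputation, and relevance."""
    def score(ad):
        return (w_product * ad["product_reputation"]
                + w_seller * ad["seller_reputation"]
                + w_relevance * ad["relevance"])
    # Highest combined score first.
    return sorted(ads, key=score, reverse=True)

# Hypothetical advertisement items with scores in [0, 1].
ads = [
    {"id": "a", "product_reputation": 0.9, "seller_reputation": 0.2, "relevance": 0.5},
    {"id": "b", "product_reputation": 0.6, "seller_reputation": 0.8, "relevance": 0.7},
    {"id": "c", "product_reputation": 0.1, "seller_reputation": 0.1, "relevance": 0.9},
]
ranked_ids = [ad["id"] for ad in rank_ads(ads)]
```

With these sample weights, the most relevant item ("c") still ranks last because its reputation scores are low, which is the effect of treating reputation as a ranking factor.
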
Efficient retrieval algorithm by query term discrimination

Publication number: 20080288483

Abstract: Described is an efficient retrieval mechanism that quickly locates documents (e.g., corresponding to online advertisements) based on query term discrimination. A topmost subset (e.g., two) of search terms is selected according to their ranked importance, e.g., as ranked by inverted document frequency. The topmost terms are then used to narrow the number of rows of an inverted query index that are searched to find document identifiers and associated scores, such as computed offline by a BM25 algorithm. For example, for each document identifier of each important term, a fast search within each of the narrowed subset of rows (that also contain that document identifier) may be performed by comparing document identifiers to jump a pointer within each other row, followed by a binary search to locate a particular document. The scores of the set of particular documents may then be used to rank their relative importance for returning as results.

Type: Application

Filed: May 18, 2007

Publication date: November 20, 2008

Applicant: Microsoft Corporation

Inventors: Chenxi Lin, Lei Ji, Huajun Zeng, Benyu Zhang, Zheng Chen, Jian Wang
Ranking online advertisements using retailer and product reputations

Publication number: 20080288348

Abstract: A method for ranking online advertisements using retailer reputation and product reputation. In one implementation, a query may be received. Advertisements may be selected by determining a level of relevance between the query and each advertisement and selecting the advertisements with a level of relevance above a pre-determined level of relevance. A predicted reputation for a retailer and a predicted reputation for a product may be retrieved for each of the selected advertisements. The selected advertisements may then be ranked based on the predicted reputation for the retailer and the predicted reputation of the product. The ranking of the selected advertisements may be accomplished by calculating a ranking score for each selected advertisement based on the retailer predicted reputation and the product predicted reputation. The selected advertisements may then be displayed according to the ranking.

Type: Application

Filed: May 15, 2007

Publication date: November 20, 2008

Applicant: Microsoft Corporation

Inventors: Huajun Zeng, Chenxi Lin, Dingyi Han, Benyu Zhang, Zheng Chen, Jian Wang
Block tracking mechanism for web personalization

Publication number: 20080281834

Abstract: Described is a technology by which blocks of web pages may be selected, such as for building a user-personalized web page containing selected blocks. A selection mechanism, such as a browser toolbar add-on, provides a user interface for selecting blocks, and records information about selected blocks. A block tracking mechanism (e.g., a daemon program) uses the information to locate selected blocks of the web pages, including when the web page containing the block is updated with respect to content and/or layout. The block tracking mechanism may update a local gadget that, when invoked, such as by browsing to a particular web page, shows updated versions of the block on a personalized web page. Blocks may be efficiently located by processing trees representing web pages into reduced trees, and then by performing a minimum distance mapping algorithm on the reduced trees.

Type: Application

Filed: May 9, 2007

Publication date: November 13, 2008

Applicant: Microsoft Corporation

Inventors: Min Wu, Chenxi Lin, Benyu Zhang, Huajun Zeng, Zheng Chen, Jian Wang
Internet Visualization System and Related User Interfaces

Publication number: 20080256444

Abstract: Systems and methods are described for an Internet visualization system and related user interfaces. In one implementation, the system analyzes Internet search logs to determine most popular search queries across the world at a current time. A user interface displays a keyword of each of the most popular queries in a single visual display that relates each query to a geographical location of greatest popularity. The system can also filter queries according to demographics. In one implementation the user interface provides a 3-dimensional Internet visualization that adopts an ocean or seascape theme. The ocean floor displays a map of the world, and query bubbles rise from geographical locations on the map. The size and duration of each query bubble denotes the relative popularity of a given query.

Type: Application

Filed: January 10, 2008

Publication date: October 16, 2008

Applicant: Microsoft Corporation

Inventors: Min Wang, Weizhu Chen, Benyu Zhang, Zheng Chen, Jian Wang
Method and system for ranking messages of discussion threads

Patent number: 7437382

Abstract: A method and system for ranking messages of discussion threads based on relationships between messages and authors is provided. The ranking system defines an equation for attributes of a message and an author. The equations define the attribute values and are based on relationships between the attribute and the attributes associated with the same type of object, and different types of objects. The ranking system iteratively calculates the attribute values for the objects using the equations until the attribute values converge on a solution. The ranking system then ranks the messages based on attribute values.

Type: Grant

Filed: May 16, 2005

Date of Patent: October 14, 2008

Assignee: Microsoft Corporation

Inventors: Benyu Zhang, Zheng Chen, Wensi Xi, Hua-Jun Zeng, Wei-Ying Ma
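
The iterate-until-convergence step described in the abstract above can be sketched as follows. The two score equations used here (an author's score is the mean score of that author's messages; a message's score is a base value plus damped endorsements from the authors who replied to it) are hypothetical stand-ins chosen so the iteration converges; the patent's actual equations are not given in the abstract.

```python
def rank_messages(author_of, reply_to, damping=0.85, tol=1e-9, max_iter=1000):
    """Iterate hypothetical message/author score equations until they converge."""
    messages = list(author_of)
    authors = set(author_of.values())
    # Number of replies each author has written (used to split the author's score).
    replies_written = {a: 0 for a in authors}
    for reply in reply_to:
        replies_written[author_of[reply]] += 1

    msg_score = {m: 1.0 for m in messages}
    for _ in range(max_iter):
        # Author score: mean score of the author's messages.
        totals = {a: [0.0, 0] for a in authors}
        for m in messages:
            totals[author_of[m]][0] += msg_score[m]
            totals[author_of[m]][1] += 1
        author_score = {a: s / n for a, (s, n) in totals.items()}

        # Message score: base value plus endorsements from authors who replied to it.
        new_score = {m: 1.0 - damping for m in messages}
        for reply, parent in reply_to.items():
            a = author_of[reply]
            new_score[parent] += damping * author_score[a] / replies_written[a]

        delta = max(abs(new_score[m] - msg_score[m]) for m in messages)
        msg_score = new_score
        if delta < tol:
            break
    return sorted(messages, key=msg_score.get, reverse=True)

# Hypothetical thread: m2 and m3 reply to m1, and m4 replies to m2.
author_of = {"m1": "alice", "m2": "bob", "m3": "carol", "m4": "alice"}
reply_to = {"m2": "m1", "m3": "m1", "m4": "m2"}
ranked = rank_messages(author_of, reply_to)
```

The damping factor below 1 keeps each update a contraction, so the scores settle to a fixed point regardless of the starting values.
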
Term suggestion for multi-sense query

Patent number: 7428529

Abstract: Systems and methods for related term suggestion are described. In one aspect, term clusters are generated as a function of calculated similarity of term vectors, each term vector having been generated from search results associated with a set of high frequency of occurrence (FOO) historical queries previously submitted to a search engine. Responsive to receiving a term/phrase from an entity, the term/phrase is evaluated in view of terms/phrases in the term clusters to identify one or more related term suggestions.

Type: Grant

Filed: April 15, 2004

Date of Patent: September 23, 2008

Assignee: Microsoft Corporation

Inventors: Hua-Jun Zeng, Benyu Zhang, Zheng Chen, Wei-Ying Ma, Li Li, Ying Li, Tarek Najm
Efficient Retrieval Algorithm by Query Term Discrimination

Publication number: 20080215574

Abstract: An exemplary method for use in information retrieval includes, for each of a plurality of terms, selecting a predetermined number of top scoring documents for the term to form a corresponding document set for the term; receiving a plurality of terms, optionally as a query; ranking the plurality of terms for importance based at least in part on the document sets for the plurality of terms where the ranking comprises using an inverse document frequency algorithm; selecting a number of ranked terms based on importance where each selected, ranked term comprises its corresponding document set wherein each document in a respective document set comprises a document identification number; forming a union set based on the document sets associated with the selected number of ranked terms; and, for a document identification number in the union set, scanning a document set corresponding to an unselected term for a matching document identification number. Various other exemplary systems, methods, devices, etc. are also disclosed.

Type: Application

Filed: February 27, 2008

Publication date: September 4, 2008

Applicant: Microsoft Corporation

Inventors: Chenxi Lin, Lei Ji, HuaJun Zeng, Benyu Zhang, Zheng Chen, Jian Wang
WEBPAGE BLOCK TRACKING GADGET

Publication number: 20080215997

Abstract: An exemplary web browser system includes a selection module for selecting a webpage block and recording information about a selected webpage block; a tracking module for tracking changes to a selected webpage block based at least in part on the recorded information for that webpage block; and a display module for displaying a selected webpage block wherein the tracking module updates the display module as to changes to the selected webpage block. Various other exemplary systems, methods, devices are also disclosed.

Type: Application

Filed: February 27, 2008

Publication date: September 4, 2008

Applicant: Microsoft Corporation

Inventors: Min Wu, Chenxi Lin, Benyu Zhang, HuaJun Zeng, Zheng Chen, Jian Wang
Diverse Topic Phrase Extraction

Publication number: 20080208840

Abstract: Systems and methods for implementing diverse topic phrase extraction are disclosed. According to one implementation, multiple word candidate phrases are extracted from a corpus and weighed. One or more documents are re-weighed to identify less obvious candidate topics using latent semantic analysis (LSA). Phrase diversification is then used to remove redundancy and select informative and distinct topic phrases.

Type: Application

Filed: September 21, 2007

Publication date: August 28, 2008

Applicant: Microsoft Corporation

Inventors: Benyu Zhang, Jilin Chen, Zheng Chen, HuaJun Zeng, Jian Wang
Method and system for classifying display pages using summaries

Patent number: 7392474

Abstract: A method and system for classifying display pages based on automatically generated summaries of display pages. A web page classification system uses a web page summarization system to generate summaries of web pages. The summary of a web page may include the sentences of the web page that are most closely related to the primary topic of the web page. The summarization system may combine the benefits of multiple summarization techniques to identify the sentences of a web page that represent the primary topic of the web page. Once the summary is generated, the classification system may apply conventional classification techniques to the summary to classify the web page. The classification system may use conventional classification techniques such as a Naïve Bayesian classifier or a support vector machine to identify the classifications of a web page based on the summary generated by the summarization system.

Type: Grant

Filed: April 30, 2004

Date of Patent: June 24, 2008

Assignee: Microsoft Corporation

Inventors: Zheng Chen, Dou Shen, Benyu Zhang, Hua-Jun Zeng, Wei-Ying Ma
HIERARCHICAL CLUSTERING OF LARGE-SCALE NETWORKS

Publication number: 20080126523

Abstract: A method and system are provided for identifying groups in large-scale networks. The large-scale networks include a collection of nodes and edges that may represent relationships between entities or individuals. The large-scale network is split into a number of fractions satisfying an edge threshold. In turn, the nodes in each fraction are merged to generate one or more clusters based on a specified similarity metric. The large-scale network is recursively split and clustered until distinct groups are identified.

Type: Application

Filed: September 22, 2006

Publication date: May 29, 2008

Applicant: MICROSOFT CORPORATION

Inventors: Jeremy Tantrum, Heng Zhang, Teresa B. Mah, Benyu Zhang, Abhinai Srivastava
DEMOGRAPHIC PREDICTION USING A SOCIAL LINK NETWORK

Publication number: 20080126411

Abstract: A system, method, computer-readable media, and related techniques are disclosed for predicting demographic information of a user. A social link network is created and a search request for demographic information related to a first user within the social link network is received. The requested demographic information based on the demographic information of other users connected to the first user within the social link network is provided.

Type: Application

Filed: September 26, 2006

Publication date: May 29, 2008

Applicant: MICROSOFT CORPORATION

Inventors: Dong Zhuang, Benyu Zhang, Heng Zhang, Jeremy Tantrum, Teresa B. Mah, Hua-Jun Zeng, Zheng Chen, Jian Wang
VISUALIZATION APPLICATION FOR MINING OF SOCIAL NETWORKS

Publication number: 20080104225

Abstract: A social network visualization and mining system that includes a visualization application for mining social networks of users in an online social network. This visualization can be used to mine the social network for additional information and intelligence. The social network is displayed in graphical form, such as a node-link graph, with a center node representing the social network of a user being examined, and secondary nodes representing the primary user's friends. Lines represent links between the primary user and his friends, while various visualization features such as line thickness, line color, and text size are used to easily identify the type of relationship between users. The system also includes a topics visualization module, which builds and displays a social network based on a certain topic or keyword that is entered by the application user. A demographic prediction module examines a user's social network to predict demographics of users.

Type: Application

Filed: November 1, 2006

Publication date: May 1, 2008

Applicant: Microsoft Corporation

Inventors: Heng Zhang, Benyu Zhang, Teresa Mah, Dong Zhuang, Jeremy Tantrum, Ying Li
DETERMINING RELEVANCE OF A TERM TO CONTENT USING A COMBINED MODEL

Publication number: 20080103886

Abstract: A method and system for generating and using a combined model to identify whether a bid term is relevant to an advertisement is provided. A relevance system trains a combined model that includes an initial model and a decision tree model that are trained using features that represent relationships between bid terms and advertisements. The relevance system trains the initial model to map initial model features to a modeled relevance. The relevance system trains the decision tree model to map the decision tree features and the modeled relevance to a final relevance. The trained initial model and decision tree model represent the combined model. The relevance system then uses the combined model to determine the relevance of bid terms to advertisements.

Type: Application

Filed: October 27, 2006

Publication date: May 1, 2008

Applicant: Microsoft Corporation

Inventors: Hua Li, Zheng Chen, Benyu Zhang, Hua-Jun Zeng, Jian Wang
Clustering based text classification

Patent number: 7366705

Abstract: Systems and methods for clustering-based text classification are described. In one aspect, text is clustered as a function of labeled data to generate cluster(s). The text includes the labeled data and unlabeled data. Expanded labeled data is then generated as a function of the cluster(s). The expanded labeled data includes the labeled data and at least a portion of unlabeled data. Discriminative classifier(s) are then trained based on the expanded labeled data and remaining ones of the unlabeled data.

Type: Grant

Filed: August 16, 2004

Date of Patent: April 29, 2008

Assignee: Microsoft Corporation

Inventors: Hua-Jun Zeng, Xuanhui Wang, Zheng Chen, Benyu Zhang, Wei-Ying Ma
