Patents by Inventor Wei-Ying Ma

Wei-Ying Ma has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 7627571
    Abstract: A method and system for identifying explanatory text for a referenced web page based on a reference to the referenced web page contained in a repeated pattern of a referencing web page is provided. An anchor explanatory text (“AET”) system uses the hierarchical organization of the web page to identify a repeated pattern of hierarchical elements that contain references to other display pages. After the AET system identifies a repeated pattern, it identifies the dominant reference or anchor within each occurrence of the pattern. The AET system uses the explanatory text surrounding a dominant anchor as a description of the referenced web page.
    Type: Grant
    Filed: March 31, 2006
    Date of Patent: December 1, 2009
    Assignee: Microsoft Corporation
    Inventors: Feng Jing, Kefeng Deng, Lei Zhang, Wei-Ying Ma
  • Patent number: 7624081
    Abstract: A community mining system analyzes objects of different types and relationships between the objects of different types to identify communities. The relationships between the objects have an associated time. The community mining system extracts various features related to objects of a designated type from the relationships between objects of different types that represent the evolution of the features over time. The community mining system collects training data that indicates extracted features associated with members of the communities. The community mining system then classifies an object of the designated type as being within the community based on closeness of the features of the object to the features of the training data.
    Type: Grant
    Filed: March 28, 2006
    Date of Patent: November 24, 2009
    Assignee: Microsoft Corporation
    Inventors: Qiankun Zhao, Tie-Yan Liu, Wei-Ying Ma
  • Publication number: 20090281906
    Abstract: An exemplary method includes defining a vocabulary for emotions; extracting descriptions for songs; generating distributions for the songs in an emotion space based at least in part on the vocabulary and the extracted descriptions; extracting salient words from a document; generating a distribution for the document in an emotion space based at least in part on the vocabulary and the extracted salient words; and matching the distribution for the document to one or more of the distributions for the songs. Various other exemplary methods, devices, systems, etc., are also disclosed.
    Type: Application
    Filed: May 7, 2008
    Publication date: November 12, 2009
    Applicant: Microsoft Corporation
    Inventors: Rui Cai, Lei Zhang, Wei-Ying Ma
  • Publication number: 20090282032
    Abstract: A method and system for generating a search result for a query of hierarchically organized documents based on retrieval of subtrees that are key resources for topic distillation is provided. The retrieval system may identify documents relevant to a query using conventional searching techniques. The retrieval system then calculates a subtree feature for subtrees that have an identified document as their root. After the retrieval system calculates the subtree feature for the subtrees, the retrieval system may generate a subtree relevance score for each subtree based on its subtree feature. The retrieval system may then order the identified documents based on their corresponding subtree relevances.
    Type: Application
    Filed: July 17, 2009
    Publication date: November 12, 2009
    Applicant: Microsoft Corporation
    Inventors: Tie-Yan Liu, Tao Qin, Wei-Ying Ma
  • Publication number: 20090277322
    Abstract: An exemplary method includes providing a music collection of a particular scale, determining a distance parameter for locality sensitive hashing based at least in part on the scale of the music collection and constructing an index for the music collection. Another exemplary method includes providing a song, extracting snippets from the song, analyzing time-varying timbre characteristics of the snippets and constructing one or more queries based on the analyzing. Such exemplary methods may be implemented by a portable device configured to maintain an index, to perform searches based on selected songs or portions of songs and to generate playlists from search results. Other exemplary methods, devices, systems, etc., are also disclosed.
    Type: Application
    Filed: May 7, 2008
    Publication date: November 12, 2009
    Applicant: Microsoft Corporation
    Inventors: Rui Cai, Lei Zhang, Wei-Ying Ma
  • Patent number: 7617194
    Abstract: A method and system for ranking importance of vertices of a directed graph based on links between the vertices and some prior knowledge of importance of the vertices is provided. A ranking system inputs an indication of the vertices along with an indication of the links between the vertices as the directed graph. The ranking system generates a transition-probability matrix that represents the probability of transitioning from vertex to vertex. The ranking system then generates a ranking of the vertices based on the links between the vertices represented by the stationary distribution of the transition-probability matrix that is minimally perturbed to satisfy the prior knowledge, which may be a partial ranking of the vertices.
    Type: Grant
    Filed: December 29, 2006
    Date of Patent: November 10, 2009
    Assignee: Microsoft Corporation
    Inventors: Tie-Yan Liu, Tao Qin, Wei-Ying Ma
  • Patent number: 7617176
    Abstract: A clustering architecture that dynamically groups the search result documents into clusters labeled by phrases extracted from the search result snippets. Documents related to the same topic usually share a common vocabulary. The words are first clustered based on their co-occurrences and each cluster forms a potentially interesting topic. Keywords are chosen and then clustered by counting co-occurrences of pairs of keywords. Documents are assigned to relevant topics based on the feature vectors of the clusters.
    Type: Grant
    Filed: July 13, 2004
    Date of Patent: November 10, 2009
    Assignee: Microsoft Corporation
    Inventors: Hua-Jun Zeng, Qicai He, Guimei Liu, Zheng Chen, Benyu Zhang, Wei-Ying Ma
  • Patent number: 7613995
    Abstract: Vision-based document segmentation identifies one or more portions of semantic content of a document. The one or more portions are identified by identifying a plurality of visual blocks in the document, and detecting one or more separators between the visual blocks of the plurality of visual blocks. A content structure for the document is constructed based at least in part on the plurality of visual blocks and the one or more separators, and the content structure identifies the one or more portions of semantic content of the document. The content structure obtained using the vision-based document segmentation can optionally be used during document retrieval.
    Type: Grant
    Filed: January 9, 2006
    Date of Patent: November 3, 2009
    Assignee: Microsoft Corporation
    Inventors: Ji-Rong Wen, Shipeng Yu, Deng Cai, Wei-Ying Ma
  • Patent number: 7609898
    Abstract: A method and system for transmitting an image progressively is provided. The transmission system identifies a first region and a second region of the image. The transmission system also identifies a first resolution and a second resolution. The transmission system then transmits the image by transmitting, in the following order, the first region in the first resolution, the second region in the first resolution, the first region in the second resolution, and the second region in the second resolution. The transmission system may identify the regions based on the likelihood of being the focus of user attention.
    Type: Grant
    Filed: October 1, 2004
    Date of Patent: October 27, 2009
    Assignee: Microsoft Corporation
    Inventors: Wei-Ying Ma, Xing Xie
  • Publication number: 20090265363
    Abstract: Described is a technology by which forum web pages are processed into clusters for classification purposes, including by determining repetitive regions between pages and associating pages that have similar repetitive regions into a common cluster. Patterns corresponding to the regions are determined, and a feature set based at least in part on those patterns (e.g., pattern frequency) is extracted from the page. The feature set of a page is compared against the feature set of another page to determine similarity therewith, e.g., via a feature space distance computation that is evaluated against a threshold distance.
    Type: Application
    Filed: April 16, 2008
    Publication date: October 22, 2009
    Applicant: MICROSOFT CORPORATION
    Inventors: Wei Lai, Rui Cai, Jiangming Yang, Lei Zhang, Wei-Ying Ma
  • Patent number: 7606875
    Abstract: Methods and systems for determining the serving area of a web resource by address, by query content, and by business category are provided. A location system may determine the serving area of a web resource based on addresses of users who access the web resource. The location system may determine the serving area for a web site (or other web resource) based on query terms that resulted in a click-through to the web site. The location system may determine the serving area of a web site (or other web resource) based on the business category of the web site and a “provider location” associated with the web site.
    Type: Grant
    Filed: March 28, 2006
    Date of Patent: October 20, 2009
    Assignee: Microsoft Corporation
    Inventors: Xing Xie, Lee Wang, Qi Zhang, Wei-Ying Ma
  • Patent number: 7607082
    Abstract: Categorizing page block functionality to improve document layout for browsing is described. In one aspect, document content is analyzed with respect to multiple block function criteria. Results of this analysis are used to assign a respective block function to blocks of the document content. These block function assignments are used to generate one or more customized document layouts for browsing by a user.
    Type: Grant
    Filed: September 26, 2005
    Date of Patent: October 20, 2009
    Assignee: Microsoft Corporation
    Inventors: Xing Xie, Gengxin Miao, Guomao Xin, Ruihua Song, Ji-Rong Wen, Wei-Ying Ma
  • Patent number: 7599931
    Abstract: A crawling system crawls a web site initially in a pattern detection phase and subsequently in a pattern usage phase. The pattern detection phase attempts to identify patterns of references to pages that contain informational content of interest and patterns of references to pages that contain little informational content of interest. During the pattern usage phase, the crawling system crawls the web site. When the crawling system encounters a reference contained on an accessed page, the crawling system determines whether the reference matches a reference pattern. If the reference matches a reference pattern associated with pages that contain informational content of interest, the crawling system accesses the referenced page. If, however, the reference matches a reference pattern of pages with little informational content, then the crawling system discards that reference without accessing the referenced page.
    Type: Grant
    Filed: March 3, 2006
    Date of Patent: October 6, 2009
    Assignee: Microsoft Corporation
    Inventors: Bin Shi, Gu Xu, Wei-Ying Ma
  • Patent number: 7596597
    Abstract: A method and system for recommending potential contacts to a target user is provided. A recommendation system identifies users who are related to the target user through no more than a maximum degree of separation. The recommendation system identifies the users by starting with the contacts of the target user and identifying users who are contacts of the target user's contacts, contacts of those contacts, and so on. The recommendation system then ranks the identified users, who are potential contacts for the target user, based on a likelihood that the target user will want to have a direct relationship with the identified users. The recommendation system then presents to the target user a ranking of the users who have not been filtered out.
    Type: Grant
    Filed: August 31, 2006
    Date of Patent: September 29, 2009
    Assignee: Microsoft Corporation
    Inventors: Dafan Liu, Hua Chen, Wei-Ying Ma
  • Patent number: 7594277
    Abstract: A method and system for detecting whether an outgoing communication contains confidential information or other target information is provided. The detection system is provided with a collection of documents that contain confidential information, referred to as “confidential documents.” When the detection system is provided with an outgoing communication, it compares the content of the outgoing communication to the content of the confidential documents. If the outgoing communication contains confidential information, then the detection system may prevent the outgoing communication from being sent outside the organization. The detection system detects confidential information based on the similarity between the content of an outgoing communication and the content of confidential documents that are known to contain confidential information.
    Type: Grant
    Filed: June 30, 2004
    Date of Patent: September 22, 2009
    Assignee: Microsoft Corporation
    Inventors: Benyu Zhang, Hua-Jun Zeng, Wei-Ying Ma, Zheng Chen
  • Patent number: 7590603
    Abstract: A method and system for classifying messages of a discussion thread as questions is provided. A classification system generates a classifier to classify messages of discussion threads as question messages or non-question messages. The system trains the classifier using the feature vectors and input classifications derived from a training set of discussion threads. After the classifier is trained, the classification system uses the classifier to classify messages within a corpus of discussion threads as question or non-question messages. To classify a message, the classification system generates a feature vector for the messages and submits that feature vector to the classifier. The classifier generates a score for the message indicating a likelihood that the message is a question message.
    Type: Grant
    Filed: October 1, 2004
    Date of Patent: September 15, 2009
    Assignee: Microsoft Corporation
    Inventors: Benyu Zhang, Zheng Chen, Hua-Jun Zeng, Wei-Ying Ma
  • Patent number: 7590267
    Abstract: A method and system for detecting faces at different views within images that allocates the computational effort based on a prior probability associated with a view is provided. A face detection system determines whether an image contains a face using detectors that are adapted to detect faces at various views and a filter that filters out windows of the image that are provided to a detector based on a prior probability associated with the view of the detector. Each view has an associated prior probability that a face from a collection of real-life home photographs will be at that view. The face detection system allocates increasing computational effort to a detector as the prior probability of its view increases.
    Type: Grant
    Filed: May 31, 2005
    Date of Patent: September 15, 2009
    Assignee: Microsoft Corporation
    Inventors: Rong Xiao, Lei Zhang, Yan-Feng Sun, Wei-Ying Ma
  • Publication number: 20090228452
    Abstract: A method and system for identifying information about people is provided. The information system identifies groups of people that have relationships based on their relationships to documents or more generally to objects. The information system initially is provided with an indication of which people have which relationships to which documents. The information system then identifies clusters of people based on having a relationship to the same objects. The information system may also identify clusters of related objects associated with a cluster of people. When a user wants to identify information about a person, the user can provide the name of that person to the information system. The information system then can retrieve and display the names of the other people who are in the same cluster as the person.
    Type: Application
    Filed: March 17, 2009
    Publication date: September 10, 2009
    Applicant: Microsoft Corporation
    Inventors: Benyu Zhang, Wei-Ying Ma, Gu Xu, Hongbin Gao, Zheng Chen, Randy Hinrichs, Hua-Jun Zeng
  • Patent number: 7584100
    Abstract: A method and system for clustering documents based on generalized sentence patterns of the topics of the documents is provided. A generalized sentence patterns (“GSP”) system identifies a “sentence” that describes the topic of a document. To cluster documents, the GSP system generates a “generalized sentence” form of the sentence that describes the topic of each document. The generalized sentence is an abstraction of the words of the sentence. The GSP system identifies clusters of documents based on the patterns of their generalized sentences. The GSP system clusters documents when the generalized sentence representations of their topics have a similar pattern.
    Type: Grant
    Filed: June 30, 2004
    Date of Patent: September 1, 2009
    Assignee: Microsoft Corporation
    Inventors: Benyu Zhang, Wei-Ying Ma, Zheng Chen, Hua-Jun Zeng
  • Patent number: 7584181
    Abstract: An implicit links enhancement system and method for search engines that generates implicit links obtained from mining user access logs to facilitate enhanced local searching of web sites and intranets. The implicit links search enhancement system and method includes extracting implicit links by mining users' access patterns and then using a modified link analysis algorithm to re-rank search results obtained from traditional search engines. More specifically, the implicit links search enhancement method includes extracting implicit links from a user access log, generating an implicit links graph from the extracted implicit links, and computing page rankings using the implicit links graph. The implicit links are extracted from the log using a two-item sequential pattern mining technique. Search results obtained from a search engine are re-ranked based on an implicit links analysis performed using an updated implicit links graph, a modified re-ranking formula, and at least one re-ranking technique.
    Type: Grant
    Filed: September 30, 2003
    Date of Patent: September 1, 2009
    Assignee: Microsoft Corporation
    Inventors: Hua-Jun Zeng, Gui-Rong Xue, Zheng Chen, Wei-Ying Ma