Patents by Inventor Wei-Ying Ma

Wei-Ying Ma has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

AIR QUALITY INFERENCE USING MULTIPLE DATA SOURCES

Publication number: 20160125307

Abstract: The use of data from multiple data source provides inferred air quality indices with respect to a particular pollutant for multiple areas without the addition of air quality monitor stations to those areas. Labeled air quality index data for a pollutant in a region may be obtained from one or more air quality monitor stations. Spatial features for the region may be extracted from spatially-related data for the region. The spatially-related data may include information on fixed infrastructures in the region. Likewise, temporal features for the region may be extracted from temporally-related data for the region that changes over time. A co-training based learning framework may be further applied to co-train a spatial classifier and a temporal classifier based at least on the labeled air quality index data, the spatial features for the region, and the temporal features for the region.

Type: Application

Filed: June 5, 2013

Publication date: May 5, 2016

Inventors: Yu Zheng, Xing Xie, Wei-Ying Ma, Hsiao-Wuen Hon, Eric I-Chao Chang
Method and system for mining information based on relationships

Patent number: 9195942

Abstract: A method and system for identifying information about people is provided. The information system identifies groups of people that have relationships based on their relationships to documents or more generally to objects. The information system initially is provided with an indication of which people have which relationships to which documents. The information system then identifies clusters of people based on having a relationship to the same objects. The information system may also identify clusters of related objects associated with a cluster of people. When a user wants to identify information about a person, the user can provide the name of that person to the information system. The information system then can retrieve and display the names of the other people who are in the same cluster as the person.

Type: Grant

Filed: March 17, 2009

Date of Patent: November 24, 2015

Assignee: Microsoft Technology Licensing, LLC

Inventors: Benyu Zhang, Wei-Ying Ma, Gu Xu, Hongbin Gao, Zheng Chen, Randy Hinrichs, Hua-Jun Zeng
Detecting spatial outliers in a location entity dataset

Patent number: 9063226

Abstract: Disclosed herein are one or more embodiments that arrange a plurality of location entities into a hierarchy of location descriptors. One or more of the disclosed embodiments may determine whether one of the location entities is a spatial outlier based at least in part on presence of one or more other location entities within a predetermined distance of the one location entity. Also, the other location entities and the one location entity may share a location descriptor.

Type: Grant

Filed: January 14, 2009

Date of Patent: June 23, 2015

Assignee: Microsoft Technology Licensing, LLC

Inventors: Yu Zheng, Jianqiao Feng, Xing Xie, Wei-Ying Ma
Augmenting a training set for document categorization

Patent number: 9058382

Abstract: A method and system for augmenting a training set used to train a classifier of documents is provided. The augmentation system augments a training set with training data derived from features of documents based on a document hierarchy. The training data of the initial training set may be derived from the root documents of the hierarchies of documents. The augmentation system generates additional training data that includes an aggregate feature that represents the overall characteristics of a hierarchy of documents, rather than just the root document. After the training data is generated, the augmentation system augments the initial training set with the newly generated training data.

Type: Grant

Filed: October 20, 2008

Date of Patent: June 16, 2015

Assignee: Microsoft Technology Licensing, LLC

Inventors: Tie-Yan Liu, Wei-Ying Ma
Identification of duplicates within an image space

Patent number: 8995771

Abstract: Implementations for identifying duplicate images in an image space are described. An image space is partitioned into a plurality of coarse clusters based on signatures of the images within the image space. The signatures are determined from compact descriptors of the images. Refined clusters that include one or more images of an individual coarse cluster are created based on pair-wise comparisons of the compact descriptors of images in the coarse cluster, and the refined clusters are identified as sets of duplicate images. The refined clusters are grown by searching in similar coarse clusters for images to add to the refined clusters.

Type: Grant

Filed: April 30, 2012

Date of Patent: March 31, 2015

Assignee: Microsoft Technology Licensing, LLC

Inventors: Lei Zhang, Xin-Jing Wang, Wei-Ying Ma
Method and system for detecting when an outgoing communication contains certain content

Patent number: 8782805

Abstract: A method and system for detecting whether an outgoing communication contains confidential information or other target information is provided. The detection system is provided with a collection of documents that contain confidential information, referred to as “confidential documents.” When the detection system is provided with an outgoing communication, it compares the content of the outgoing communication to the content of the confidential documents. If the outgoing communication contains confidential information, then the detection system may prevent the outgoing communication from being sent outside the organization. The detection system detects confidential information based on the similarity between the content of an outgoing communication and the content of confidential documents that are known to contain confidential information.

Type: Grant

Filed: July 27, 2009

Date of Patent: July 15, 2014

Assignee: Microsoft Corporation

Inventors: Benyu Zhang, Hua-Jun Zeng, Wei-Ying Ma, Zheng Chen
Web forum crawling using skeletal links

Patent number: 8700600

Abstract: A method and system for identifying informative links of a web site for use in crawling the web site is provided. A forum crawler analyzes sample web pages of a web forum to identify informative links and then crawls the web forum by following links determined to be informative and not following other links. The forum crawler system determines whether links are informative based on whether they are part of the overall structure of the web site or are used to select sequential information that has been split onto multiple web pages.

Type: Grant

Filed: January 17, 2012

Date of Patent: April 15, 2014

Assignee: Microsoft Corporation

Inventors: Lei Zhang, Wei-Ying Ma, Wei Lai, Jiangming Yang, Rui Cai
Selecting advertisements based on serving area and map area

Patent number: 8666821

Abstract: Methods and systems for selecting advertisements to present to a user of a computing device are provided. An advertisement system selects advertisements to display to a user based on the serving area of candidate advertisements. The advertisement system selects those candidate advertisements whose serving area encompasses the user's current location. The advertisement system may also select candidate advertisements to present to a user based on a map area currently being displayed to the user. The advertisement system may filter the candidate advertisements based on the provider location being within the map area that is currently being displayed to the user.

Type: Grant

Filed: August 28, 2006

Date of Patent: March 4, 2014

Assignee: Microsoft Corporation

Inventors: Xing Xie, Xianfang Wang, Ying Li, Wei-Ying Ma, Lee Wang
Music recommendation using emotional allocation modeling

Patent number: 8650094

Abstract: An exemplary method includes defining a vocabulary for emotions; extracting descriptions for songs; generating distributions for the songs in an emotion space based at least in part on the vocabulary and the extracted descriptions; extracting salient words from a document; generating a distribution for the document in an emotion space based at least in part on the vocabulary and the extracted salient words; and matching the distribution for the document to one or more of the distributions for the songs. Various other exemplary methods, devices, systems, etc., are also disclosed.

Type: Grant

Filed: May 7, 2008

Date of Patent: February 11, 2014

Assignee: Microsoft Corporation

Inventors: Rui Cai, Lei Zhang, Wei-Ying Ma
Scoring relevance of a document based on image text

Patent number: 8645370

Abstract: A method and system for determining relevance of a document having text and images to a text string is provided. A scoring system identifies image text associated with an image of the document. The scoring system calculates an image score indicating relevance of the image text to the text string. The image score may be used in many applications, such as searching, summary generation, and document classification, image search, and image classification.

Type: Grant

Filed: December 17, 2010

Date of Patent: February 4, 2014

Assignee: Microsoft Corporation

Inventors: Qing Yu, Shuming Shi, Zhiwei Li, Ji-Rong Wen, Wei-Ying Ma
Ranking advertisement(s) based upon advertisement feature(s)

Patent number: 8620912

Abstract: While browsing, a user may interact with a wide variety of images. The user may upload and share images taken with a digital camera and/or search for image using a search engine. Because images are rich in contextual information, it may be advantageous to provide additional information, such as adjacent market advertising based upon matching advertisements with contextual information of the images. Accordingly, a query image may be used to retrieve a video frame set. The video frame set may be expanded with related video frames corresponding to adjacent markets. The expanded video frame set may be grouped into clusters of similar frames. The clusters may be used to rank advertisements based upon how similar the advertisements are to the clusters and/or video frames within the clusters. In this way, one or more ranked advertisements may be presented with the query image.

Type: Grant

Filed: June 16, 2010

Date of Patent: December 31, 2013

Assignee: Microsoft Corporation

Inventors: Xin-Jing Wang, Lei Zhang, Wei-Ying Ma
Long-Query Retrieval

Publication number: 20130346416

Abstract: Described herein is a technology that facilitates efficient large-scale similarity-based retrieval. In several embodiments documents, images, and/or other multimedia files are compactly represented and efficiently indexed to enable robust search using a long-query in a large-scale corpus. As described herein, these techniques include performing decomposition of a file, e.g., an image, a document containing an image, or a document-like representation of an image. The techniques use dimension reduction to obtain three parts, low-dimensional representations (major semantics), file specific terms (minor semantics), and background words, representing the major semantics in a feature vector and the minor semantics as keywords. Using the techniques described, file vectors are matched in a topic model and the results ranked based on the keywords.

Type: Application

Filed: December 3, 2012

Publication date: December 26, 2013

Applicant: Microsoft Corporation

Inventors: Zhiwei Li, Lei Zhang, Rui Cai, Wei-Ying Ma, Heung-Yeung Shum
Topic distillation via subsite retrieval

Patent number: 8612453

Abstract: A method and system for generating a search result for a query of hierarchically organized documents based on retrieval of subtrees that are key resources for topic distillation is provided. The retrieval system may identify documents relevant to a query using conventional searching techniques. The retrieval system then calculates a subtree feature for subtrees that have an identified document as their root. After the retrieval system calculates the subtree feature for the subtrees, the retrieval system may generate a subtree relevance score for each subtree based on its subtree feature. The retrieval system may then order the identified documents based on their corresponding subtree relevances.

Type: Grant

Filed: July 17, 2009

Date of Patent: December 17, 2013

Assignee: Microsoft Corporation

Inventors: Tie-Yan Liu, Tao Qin, Wei-Ying Ma
IDENTIFICATION OF DUPLICATES WITHIN AN IMAGE SPACE

Publication number: 20130287302

Abstract: Implementations for identifying duplicate images in an image space are described. An image space is partitioned into a plurality of coarse clusters based on signatures of the images within the image space. The signatures are determined from compact descriptors of the images. Refined clusters that include one or more images of an individual coarse cluster are created based on pair-wise comparisons of the compact descriptors of images in the coarse cluster, and the refined clusters are identified as sets of duplicate images. The refined clusters are grown by searching in similar coarse clusters for images to add to the refined clusters.

Type: Application

Filed: April 30, 2012

Publication date: October 31, 2013

Applicant: MICROSOFT CORPORATION

Inventors: Lei Zhang, Xin-Jing Wang, Wei-Ying Ma
Dual cross-media relevance model for image annotation

Patent number: 8571850

Abstract: A dual cross-media relevance model (DCMRM) is used for automatic image annotation. In contrast to the traditional relevance models which calculate the joint probability of words and images over a training image database, the DCMRM model estimates the joint probability by calculating the expectation over words in a predefined lexicon. The DCMRM model may be advantageous because a predefined lexicon potentially has better behavior than a training image database. The DCMRM model also takes advantage of content-based techniques and image search techniques to define the word-to-image and word-to-word relations involved in image annotation. Both relations can be estimated by using image search techniques on the web data as well as available training data.

Type: Grant

Filed: December 13, 2007

Date of Patent: October 29, 2013

Assignee: Microsoft Corporation

Inventors: Mingjing Li, Jing Lui, Bin Wang, Zhiwei Li, Wei-Ying Ma
Automated rich presentation of a semantic topic

Patent number: 8572088

Abstract: Automated rich presentation of a semantic topic is described. In one aspect, respective portions of multimodal information corresponding to a semantic topic are evaluated to locate events associated with the semantic topic. The probability that a document belongs to an event is determined based on document inclusion of one or more of persons, times, locations, and keywords, and document distribution along a timeline associated with the event. For each event, one or more documents objectively determined to be substantially representative of the event are identified. One or more other types of media (e.g., video, images, etc.) related to the event are then extracted from the multimodal information. The representative documents and the other media are for presentation to a user in a storyboard.

Type: Grant

Filed: October 21, 2005

Date of Patent: October 29, 2013

Assignee: Microsoft Corporation

Inventors: Lie Lu, Wei-Ying Ma, Zhiwei Li
Building a person profile database

Patent number: 8559682

Abstract: Names of entities, such as people, in an image may be identified automatically. Visually similar images of entities are retrieved, including text proximate to the visually similar images. The collected text is mined for names of entities, and the detected names are analyzed. A name may be associated with the entity in the image, based on the analysis.

Type: Grant

Filed: November 9, 2010

Date of Patent: October 15, 2013

Assignee: Microsoft Corporation

Inventors: Lei Zhang, Xin-Jing Wang, Wei-Ying Ma
Peer-to-peer advertisement platform

Patent number: 8548853

Abstract: A peer-to-peer advertisement platform is provided to ubiquitously promote products or services supplied by advertisers across content-based applications executing on nodes in a peer-to-peer network. The peer-to-peer advertisement platform may include a registration component to register nodes in the peer-to-peer advertising platform, an advertisement submission component to receive advertisement data from the advertisers, and a distribution component to distribute the advertisement data to the nodes registered in the peer-to-peer advertisement platform. The peer-to-peer advertisement platform also includes a money sharing component to reward nodes based on a contribution level assigned to the node. Accordingly, the peer-to-peer advertisement platform stores the advertisement data locally at the plurality of nodes registered in the peer-to-peer advertising platform and shares a portion of the revenue generated from the advertisement data with the nodes registered in the peer-to-peer advertising platform.

Type: Grant

Filed: June 8, 2005

Date of Patent: October 1, 2013

Assignee: Microsoft Corporation

Inventors: Benyu Zhang, Fengping Zeng, Hua-Jun Zeng, Li Li, Tarek Najm, Wei-Ying Ma, Ying Li, Zheng Chen
Cost-Per-Action Model Based on Advertiser-Reported Actions

Publication number: 20130246167

Abstract: According to a cost-per-action advertising model, advertisers submit ads with cost-per-action bids. Ad auctions are conducted and winning ads are returned with contextually relevant search results. Each time a winning ad is selected by a user, resulting in the user being redirected to a website associated with the advertiser, a selected impression and a price is recorded for the winning ad. Periodically, an advertiser submits a report indicating a number of actions attributed to the ads that have occurred through the advertiser website. The advertiser is then charged a fee for each reported action based on the recorded prices for the winning ads and based on the number of selected impressions recorded for the winning ads.

Type: Application

Filed: March 15, 2012

Publication date: September 19, 2013

Applicant: Microsoft Corporation

Inventors: Tao Qin, Tie-Yan Liu, Wenkui Ding, Wei-Ying Ma, Hsiao-Wuen Hon
Interactive framework for name disambiguation

Patent number: 8538898

Abstract: A “Name Disambiguator” provides various techniques for implementing an interactive framework for resolving or disambiguating entity names (associated with objects such as publications) for entity searches where two or more same or similar names may refer to different entities. More specifically, the Name Disambiguator uses a combination of user input and automatic models to address the disambiguation problem. In various embodiments, the Name Disambiguator uses a two part process, including: 1) a global SVM trained from large sets of documents or objects in a simulated interactive mode, and 2) further personalization of local SVM models (associated with individual names or groups of names such as, for example, a group of coauthors) derived from the global SVM model. The result of this process is that large sets of documents or objects are rapidly and accurately condensed or clustered into ordered sets by that are organized by entity names.

Type: Grant

Filed: May 28, 2011

Date of Patent: September 17, 2013

Assignee: Microsoft Corporation

Inventors: Zhengdong Lu, Zaiqing Nie, Gang Luo, Yong Cao, Ji-Rong Wen, Wei-Ying Ma

prev 1 2 3 4 5 6 … next