Patents by Inventor Rong Wen

Rong Wen has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Scalable model-based product matching

Patent number: 7979459

Abstract: Aspects of the subject matter described herein relate to matching product information to products. In aspects, a product matching component receives product information. The product matching component normalizes the product information and obtains keywords from the product information. By querying a database of recognized products, the keywords are used to obtain a list of products that potentially match the product information. A confidence level is assigned to each of the potential matches in the list. A match may be returned for the highest matched product or for a selectable number of products whose confidence level(s) exceed a selectable threshold.

Type: Grant

Filed: June 15, 2007

Date of Patent: July 12, 2011

Assignee: Microsoft Corporation

Inventors: Kai Wu, Daniel Takacs, Tong Yao, Jiyu Zhang, Hua Yang, Ji-Rong Wen, Jonathan R M Hart, Eric Anthony Reel
Assessing mobile readiness of a page using a trained scorer

Patent number: 7974957

Abstract: A method and system for ranking pages of a search result based on the mobile readiness of the pages is provided. A mobile-readiness system receives an indication of pages that are to be ranked. The mobile-readiness system evaluates the mobile readiness for each of the pages. Mobile readiness indicates suitability of the page for a mobile device. The mobile readiness system then ranks the pages based on the generated mobile readiness and some other criterion such as a relevance score or an importance score. The mobile-readiness system may train a classifier to classify pages based on their mobile readiness.

Type: Grant

Filed: April 5, 2007

Date of Patent: July 5, 2011

Assignee: Microsoft Corporation

Inventors: Xing Xie, Jihwan Song, Ji-Rong Wen
Data-Centric Search Engine Architecture

Publication number: 20110137886

Abstract: Described is a data-centric web search engine technology/architecture, in which document metadata, including offline-extracted metadata, is used as part of a search indexing and ranking pipeline. A web data management component receives crawled documents and extracts document metadata from the documents. An indexing component uses the document metadata to build an index for the documents. A serving component uses the index and the document metadata to serve content, e.g., search results. Also described is the use of query metadata extracted from queries of a query log for use in the pipeline.

Type: Application

Filed: December 8, 2009

Publication date: June 9, 2011

Applicant: Microsoft Corporation

Inventors: Ji-Rong Wen, Guomao Xin, Yunxiao Ma, Yu Chen, Qing Yu, Yi Liu, Zhicheng Dou, Shuming Shi
SCORING RELEVANCE OF A DOCUMENT BASED ON IMAGE TEXT

Publication number: 20110087660

Abstract: A method and system for determining relevance of a document having text and images to a text string is provided. A scoring system identifies image text associated with an image of the document. The scoring system calculates an image score indicating relevance of the image text to the text string. The image score may be used in many applications, such as searching, summary generation, and document classification, image search, and image classification.

Type: Application

Filed: December 17, 2010

Publication date: April 14, 2011

Applicant: Microsoft Corporation

Inventors: Qing Yu, Shuming Shi, Zhiwei Li, Ji-Rong Wen, Wei-Ying Ma
WEB-SCALE ENTITY SUMMARIZATION

Publication number: 20110078162

Abstract: Described is a summarizing a web entity (e.g., a person, place, product or so forth) based upon the entity's appearance in web documents (e.g., on the order of hundreds of millions or billions of webpages). Webpages are separated into blocks, which are then processed according to various features to filter the number of blocks to further process, and rank the most relevant blocks with respect to the entity that remain. A redundancy removal mechanism removes redundant blocks, leaving a set of remaining blocks that are used to provide a summary of information that is relevant to the entity.

Type: Application

Filed: September 30, 2009

Publication date: March 31, 2011

Applicant: Microsoft Corporation

Inventors: Zaiqing Nie, Ji-Rong Wen, Liu Yang
EXPERIMENTAL WEB SEARCH SYSTEM

Publication number: 20110078131

Abstract: Described is the running of search-related experiments on a full (or partial) offline snapshot copy of the search engine documents of an actual production system. A snapshot experimentation subsystem runs experimental code related to web searches on the offline data, including to run experimental index building code to build an experimental index (e.g., to test a new document feature), and/or to run experimental search-related code, such as to rank search results according to experimental ranking code, to implement an experimental search strategy, and/or to generate experimental captions.

Type: Application

Filed: September 30, 2009

Publication date: March 31, 2011

Applicant: Microsoft Corporation

Inventors: Ji-Rong Wen, Yu Chen, Guomao Xin, Yunxiao Ma, Yi Liu, Zhicheng Dou, Qing Yu, Shuming Shi
FLEXIBLE INDEXING AND RANKING FOR SEARCH

Publication number: 20110078132

Abstract: Described is a flexible framework for index building and document retrieval in a search environment that allows different search scenario applications to reuse index building and document retrieval code for non-scenario-specific functionality. Interfaces to various functionality of an index builder and retrieval engine are defined. An application calls the interfaces to specify custom code to perform a search scenario when needed, or use default code when non-scenario-specific functionality may be used.

Type: Application

Filed: September 30, 2009

Publication date: March 31, 2011

Applicant: Microsoft Corporation

Inventors: Guomao Xin, Shuming Shi, Yunxiao Ma, Ji-Rong Wen
WEBPAGE ENTITY EXTRACTION THROUGH JOINT UNDERSTANDING OF PAGE STRUCTURES AND SENTENCES

Publication number: 20110078554

Abstract: Described is a technology for understanding entities of a webpage, e.g., to label the entities on the webpage. An iterative and bidirectional framework processes a webpage, including a text understanding component (e.g., extended Semi-CRF model) that provides text segmentation features to a structure understanding component (e.g., extended HCRF model). The structure understanding component uses the text segmentation features and visual layout features of the webpage to identify a structure (e.g., labeled block). The text understanding component in turn uses the labeled block to further understand the text. The process continues iteratively until a similarity criterion is met, at which time the entities may be labeled. Also described is the use of multiple mentions of a set of text in the webpage to help in labeling an entity.

Type: Application

Filed: September 30, 2009

Publication date: March 31, 2011

Applicant: Microsoft Corporation

Inventors: Zaiqing Nie, Yong Cao, Ji-Rong Wen, Chunyu Yang
Community mining based on core objects and affiliated objects

Patent number: 7885960

Abstract: In community mining based on core objects and affiliated objects, a set of core objects for a community of objects are identified from a plurality of objects. The community is expanded, based on the set of core objects, to include a set of affiliated objects. According to one aspect, a model of a community of objects is obtained by grouping a first collection of a plurality of objects into a center portion, and grouping a second collection of the plurality of objects into one or more concentric portions around the center portion. The groupings of the first and second collections of the objects are identified as the community of objects.

Type: Grant

Filed: July 22, 2003

Date of Patent: February 8, 2011

Assignee: Microsoft Corporation

Inventors: Ji-Rong Wen, Wen-Jun Zhou, Wei-Ying Ma, Hong-Jiang Zhang
Scoring relevance of a document based on image text

Patent number: 7877384

Abstract: A method and system for determining relevance of a document having text and images to a text string is provided. A scoring system identifies image text associated with an image of the document. The scoring system calculates an image score indicating relevance of the image text to the text string. The image score may be used in many applications, such as searching, summary generation, and document classification, image search, and image classification.

Type: Grant

Filed: March 1, 2007

Date of Patent: January 25, 2011

Assignee: Microsoft Corporation

Inventors: Qing Yu, Shuming Shi, Zhiwei Li, Ji-Rong Wen, Wei-Ying Ma
Automatic detection of online commercial intention

Patent number: 7831685

Abstract: Features extracted from network browser pages and/or network search queries are leveraged to facilitate in detecting a user's browsing and/or searching intent. Machine learning classifiers constructed from these features automatically detect a user's online commercial intention (OCI). A user's intention can be commercial or non-commercial, with commercial intentions being informational or transactional. In one instance, an OCI ranking mechanism is employed with a search engine to facilitate in providing search results that are ranked according to a user's intention. This also provides a means to match purchasing advertisements with potential customers who are more than likely ready to make a purchase (transactional stage). Additionally, informational advertisements can be matched to users who are researching a potential purchase (informational stage).

Type: Grant

Filed: December 14, 2005

Date of Patent: November 9, 2010

Assignee: Microsoft Corporation

Inventors: Honghua Dai, Lee Wang, Ying Li, Zaiqing Nie, Ji-Rong Wen, Lingzhi Zhao
HIERARCHICAL CONDITIONAL RANDOM FIELDS FOR WEB EXTRACTION

Publication number: 20100281009

Abstract: A method and system for labeling object information of an information page is provided. A labeling system identifies an object record of an information page based on the labeling of object elements within an object record and labels object elements based on the identification of an object record that contains the object elements. To identify the records and label the elements, the labeling system generates a hierarchical representation of blocks of an information page. The labeling system identifies records and elements within the records by propagating probability-related information of record labels and element labels through the hierarchy of the blocks. The labeling system generates a feature vector for each block to represent the block and calculates a probability of a label for a block being correct based on a score derived from the feature vectors associated with related blocks. The labeling system searches for the labeling of records and elements that has the highest probability of being correct.

Type: Application

Filed: May 7, 2010

Publication date: November 4, 2010

Applicant: Microsoft Corporation

Inventors: Ji-Rong Wen, Wei-Ying Ma, Zaiqing Nie, Jun Zhu
Mechanism for enabling a program to be executed while the execution of an operating system is suspended

Patent number: 7827558

Abstract: A mechanism is provided for enabling an auxiliary program to be executed in a computer system, after an operating system has been loaded and executed, without rebooting the computer system. This may be achieved by suspending execution of the operating system, executing the auxiliary program while execution of the operating system is suspended, and then resuming execution of the operating system. In suspending execution of the operating system, state information defining a current state of the computer system is saved. In resuming execution of the operating system, the saved state information is used to restore the computer system to the current state. No state information or data should be lost as a result of suspending and resuming execution of the operating system. From the viewpoint of the operating system and the applications (if any) executing within the environment provided by the operating system, execution of the auxiliary program is transparent.

Type: Grant

Filed: June 30, 2004

Date of Patent: November 2, 2010

Assignee: Devicevm, Inc.

Inventor: Rong-Wen Chang
PSEUDO-ANCHOR TEXT EXTRACTION

Publication number: 20100145956

Abstract: A search method uses pseudo-anchor text associated with search objects to improve search performance. The pseudo-anchor text may be extracted in combination with an identifier of the search objects (such as a pseudo-URL) from a digital corpus such as a collection of documents. Pseudo-anchor texts for each object are preferably extracted from candidate anchor blocks using a machine learning based approach. The pseudo-anchor texts are made available for searching and used to help rank the objects in a search result to improve search performance. The method may be used in vertical search of objects such as published articles, products and images that lack explicit URLs and anchor text information.

Type: Application

Filed: January 29, 2010

Publication date: June 10, 2010

Applicant: Microsoft Corporation

Inventors: Shuming Shi, Zaiqing Nie, Ji-Rong Wen, Mingjie Zhu, Fei Xing
Mechanism for enabling a program to be executed while the execution of an operating system is suspended

Publication number: 20100138641

Abstract: A mechanism is provided for enabling an auxiliary program to be executed in a computer system, after an operating system has been loaded and executed, without rebooting the computer system. This may be achieved by suspending execution of the operating system, executing the auxiliary program while execution of the operating system is suspended, and then resuming execution of the operating system. In suspending execution of the operating system, state information defining a current state of the computer system is saved. In resuming execution of the operating system, the saved state information is used to restore the computer system to the current state. No state information or data should be lost as a result of suspending and resuming execution of the operating system. From the viewpoint of the operating system and the applications (if any) executing within the environment provided by the operating system, execution of the auxiliary program is transparent.

Type: Application

Filed: June 30, 2004

Publication date: June 3, 2010

Inventor: Rong-Wen Chang
Hierarchical conditional random fields for web extraction

Patent number: 7720830

Abstract: A method and system for labeling object information of an information page is provided. A labeling system identifies an object record of an information page based on the labeling of object elements within an object record and labels object elements based on the identification of an object record that contains the object elements. To identify the records and label the elements, the labeling system generates a hierarchical representation of blocks of an information page. The labeling system identifies records and elements within the records by propagating probability-related information of record labels and element labels through the hierarchy of the blocks. The labeling system generates a feature vector for each block to represent the block and calculates a probability of a label for a block being correct based on a score derived from the feature vectors associated with related blocks. The labeling system searches for the labeling of records and elements that has the highest probability of being correct.

Type: Grant

Filed: July 31, 2006

Date of Patent: May 18, 2010

Assignee: Microsoft Corporation

Inventors: Ji-Rong Wen, Wei-Ying Ma, Zaiqing Nie, Jun Zhu
Content object indexing using domain knowledge

Patent number: 7698294

Abstract: A content object indexing process including creating a content object knowledge index, calculating a description vector of a target content object, and indexing the target content object by searching for the description vector in the content object knowledge database. It may be difficult to search for an exact content object such as a music file or academic researcher as a conventional search index may not include related hierarchical information. A content object indexing process may add hierarchical information taken from a content object knowledge index and incorporate the hierarchical information to the index entry for a specific content object. An application of such a content object indexing process may be a world wide web search engine.

Type: Grant

Filed: January 11, 2006

Date of Patent: April 13, 2010

Assignee: Microsoft Corporation

Inventors: Wei-Ying Ma, Lie Lu, Ji-Rong Wen, Zhiwei Li, Zaiqing Nie, Hsiao-Wuen Hon
Semi-structured data storage schema selection

Patent number: 7668847

Abstract: In one aspect, this disclosure relates to a method and associated apparatus that allows a user to obtain a semi-structured data input and a workload input. An improved semi-structured data storage schema is selected for a relational schema in response to the semi-structured data input and the workload input. The semi-structured data is segmented based on the selected improved semi-structured data storage schema. In one aspect, the semi-structured data is XML data.

Type: Grant

Filed: November 4, 2005

Date of Patent: February 23, 2010

Assignee: Microsoft Corporation

Inventors: Ji-Rong Wen, Shihui Zheng, Hongjun Lu
Joint optimization of wrapper generation and template detection

Patent number: 7660804

Abstract: A method and system for generating wrappers for hierarchically organized documents by jointly optimizing template detection and wrapper generation is provided. A wrapper generation system generates a wrapper for documents with similar templates by identifying a cluster of document trees and generating a wrapper tree for the cluster. A wrapper tree defines the wrapper for documents that match the template of the cluster. The wrapper generation system clusters document trees by generating a wrapper tree for the cluster based on an initial document tree. The wrapper generation system then repeatedly determines whether any other document tree matches or nearly matches the wrapper tree for the cluster and, if so, adds the document tree to the cluster and adjusts the wrapper tree as appropriate so that all the document trees, including the newly added one, match the wrapper tree.

Type: Grant

Filed: August 16, 2006

Date of Patent: February 9, 2010

Assignee: Microsoft Corporation

Inventors: Ji-Rong Wen, Min Wan, Ruihua Song, Wei-Ying Ma, Shuyi Zeng
Pseudo-anchor text extraction for vertical search

Patent number: 7657507

Abstract: A search method uses pseudo-anchor text associated with search objects to improve search performance. The pseudo-anchor text may be extracted in combination with an identifier of the search objects (such as a pseudo-URL) from a digital corpus such as a collection of documents. Pseudo-anchor texts for each object are preferably extracted from candidate anchor blocks using a machine learning based approach. The pseudo-anchor texts are made available for searching and used to help ranking the objects in a search result to improve search performance. Method may be used in vertical search of objects such as published articles, products and images that lack explicit URL and anchor text information.

Type: Grant

Filed: March 2, 2007

Date of Patent: February 2, 2010

Assignee: Microsoft Corporation

Inventors: Shuming Shi, Zaiqing Nie, Ji-Rong Wen, Mingjie Zhu, Fei Xing

prev 1 2 3 4 5 6 7 8 9 next