Patents by Inventor Zaiqing Nie

Zaiqing Nie has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 7921106
    Abstract: Search results are ranked by applying sub-relevancies within a search result list. This permits search result lists to be further refined into more manageable relevant groupings from a user's perspective. The sub-relevancies or ‘group-by’ parameters are derived from search result attributes. Attribute values from the attributes are employed in a ranking scheme to group the search results based on attribute value relevancy. The grouped search results can then be displayed to users via a search result page. In one instance users can select which attribute value is used to group the search result list. Ranking processes are based on object ranking algorithms that consider each attribute value as an object type. Some instances provide for search result list group condensing based on relevancy of the attribute values as well. A top-k instance can be employed to limit the search results to control the amount of time required for processing groupings.
    Type: Grant
    Filed: August 3, 2006
    Date of Patent: April 5, 2011
    Assignee: Microsoft Corporation
    Inventors: Hua Chen, Zaiqing Nie, Yue Liu, Qiang Wu, Wei-Ying Ma
  • Publication number: 20110078162
    Abstract: Described is a summarizing a web entity (e.g., a person, place, product or so forth) based upon the entity's appearance in web documents (e.g., on the order of hundreds of millions or billions of webpages). Webpages are separated into blocks, which are then processed according to various features to filter the number of blocks to further process, and rank the most relevant blocks with respect to the entity that remain. A redundancy removal mechanism removes redundant blocks, leaving a set of remaining blocks that are used to provide a summary of information that is relevant to the entity.
    Type: Application
    Filed: September 30, 2009
    Publication date: March 31, 2011
    Applicant: Microsoft Corporation
    Inventors: Zaiqing Nie, Ji-Rong Wen, Liu Yang
  • Publication number: 20110078554
    Abstract: Described is a technology for understanding entities of a webpage, e.g., to label the entities on the webpage. An iterative and bidirectional framework processes a webpage, including a text understanding component (e.g., extended Semi-CRF model) that provides text segmentation features to a structure understanding component (e.g., extended HCRF model). The structure understanding component uses the text segmentation features and visual layout features of the webpage to identify a structure (e.g., labeled block). The text understanding component in turn uses the labeled block to further understand the text. The process continues iteratively until a similarity criterion is met, at which time the entities may be labeled. Also described is the use of multiple mentions of a set of text in the webpage to help in labeling an entity.
    Type: Application
    Filed: September 30, 2009
    Publication date: March 31, 2011
    Applicant: Microsoft Corporation
    Inventors: Zaiqing Nie, Yong Cao, Ji-Rong Wen, Chunyu Yang
  • Patent number: 7831685
    Abstract: Features extracted from network browser pages and/or network search queries are leveraged to facilitate in detecting a user's browsing and/or searching intent. Machine learning classifiers constructed from these features automatically detect a user's online commercial intention (OCI). A user's intention can be commercial or non-commercial, with commercial intentions being informational or transactional. In one instance, an OCI ranking mechanism is employed with a search engine to facilitate in providing search results that are ranked according to a user's intention. This also provides a means to match purchasing advertisements with potential customers who are more than likely ready to make a purchase (transactional stage). Additionally, informational advertisements can be matched to users who are researching a potential purchase (informational stage).
    Type: Grant
    Filed: December 14, 2005
    Date of Patent: November 9, 2010
    Assignee: Microsoft Corporation
    Inventors: Honghua Dai, Lee Wang, Ying Li, Zaiqing Nie, Ji-Rong Wen, Lingzhi Zhao
  • Publication number: 20100281009
    Abstract: A method and system for labeling object information of an information page is provided. A labeling system identifies an object record of an information page based on the labeling of object elements within an object record and labels object elements based on the identification of an object record that contains the object elements. To identify the records and label the elements, the labeling system generates a hierarchical representation of blocks of an information page. The labeling system identifies records and elements within the records by propagating probability-related information of record labels and element labels through the hierarchy of the blocks. The labeling system generates a feature vector for each block to represent the block and calculates a probability of a label for a block being correct based on a score derived from the feature vectors associated with related blocks. The labeling system searches for the labeling of records and elements that has the highest probability of being correct.
    Type: Application
    Filed: May 7, 2010
    Publication date: November 4, 2010
    Applicant: Microsoft Corporation
    Inventors: Ji-Rong Wen, Wei-Ying Ma, Zaiqing Nie, Jun Zhu
  • Publication number: 20100145956
    Abstract: A search method uses pseudo-anchor text associated with search objects to improve search performance. The pseudo-anchor text may be extracted in combination with an identifier of the search objects (such as a pseudo-URL) from a digital corpus such as a collection of documents. Pseudo-anchor texts for each object are preferably extracted from candidate anchor blocks using a machine learning based approach. The pseudo-anchor texts are made available for searching and used to help rank the objects in a search result to improve search performance. The method may be used in vertical search of objects such as published articles, products and images that lack explicit URLs and anchor text information.
    Type: Application
    Filed: January 29, 2010
    Publication date: June 10, 2010
    Applicant: Microsoft Corporation
    Inventors: Shuming Shi, Zaiqing Nie, Ji-Rong Wen, Mingjie Zhu, Fei Xing
  • Patent number: 7720830
    Abstract: A method and system for labeling object information of an information page is provided. A labeling system identifies an object record of an information page based on the labeling of object elements within an object record and labels object elements based on the identification of an object record that contains the object elements. To identify the records and label the elements, the labeling system generates a hierarchical representation of blocks of an information page. The labeling system identifies records and elements within the records by propagating probability-related information of record labels and element labels through the hierarchy of the blocks. The labeling system generates a feature vector for each block to represent the block and calculates a probability of a label for a block being correct based on a score derived from the feature vectors associated with related blocks. The labeling system searches for the labeling of records and elements that has the highest probability of being correct.
    Type: Grant
    Filed: July 31, 2006
    Date of Patent: May 18, 2010
    Assignee: Microsoft Corporation
    Inventors: Ji-Rong Wen, Wei-Ying Ma, Zaiqing Nie, Jun Zhu
  • Patent number: 7698294
    Abstract: A content object indexing process including creating a content object knowledge index, calculating a description vector of a target content object, and indexing the target content object by searching for the description vector in the content object knowledge database. It may be difficult to search for an exact content object such as a music file or academic researcher as a conventional search index may not include related hierarchical information. A content object indexing process may add hierarchical information taken from a content object knowledge index and incorporate the hierarchical information to the index entry for a specific content object. An application of such a content object indexing process may be a world wide web search engine.
    Type: Grant
    Filed: January 11, 2006
    Date of Patent: April 13, 2010
    Assignee: Microsoft Corporation
    Inventors: Wei-Ying Ma, Lie Lu, Ji-Rong Wen, Zhiwei Li, Zaiqing Nie, Hsiao-Wuen Hon
  • Patent number: 7657507
    Abstract: A search method uses pseudo-anchor text associated with search objects to improve search performance. The pseudo-anchor text may be extracted in combination with an identifier of the search objects (such as a pseudo-URL) from a digital corpus such as a collection of documents. Pseudo-anchor texts for each object are preferably extracted from candidate anchor blocks using a machine learning based approach. The pseudo-anchor texts are made available for searching and used to help ranking the objects in a search result to improve search performance. Method may be used in vertical search of objects such as published articles, products and images that lack explicit URL and anchor text information.
    Type: Grant
    Filed: March 2, 2007
    Date of Patent: February 2, 2010
    Assignee: Microsoft Corporation
    Inventors: Shuming Shi, Zaiqing Nie, Ji-Rong Wen, Mingjie Zhu, Fei Xing
  • Patent number: 7577650
    Abstract: A method and system for ranking objects of different object types based on their popularity is provided. A ranking system calculates the popularity of objects based on relationships between the objects. A relationship indicates how one object is related to another object. Thus, objects of one object type may have one or more relationships with objects of another object type. One goal of the ranking system is to rank the objects of the different object types based on their popularity. The objects and their relationships can be represented using a graph with nodes representing objects and links representing relationships between objects. The ranking system assigns a popularity propagation factor to each relationship to represent its contribution to the popularity of objects of that type.
    Type: Grant
    Filed: April 13, 2005
    Date of Patent: August 18, 2009
    Assignee: Microsoft Corporation
    Inventors: Ji-Rong Wen, Wei-Ying Ma, Zaiqing Nie
  • Patent number: 7529748
    Abstract: A mechanism to classify source documents into one of two categories, either likely to contain desired information or unlikely to contain desired information. Generally some form of rules based classification in conjunction with deeper analysis using advanced techniques on difficult cases is utilized. The rules based classification is generally good for eliminating cases from further consideration and for identifying documents of interest based on generally discernable relationships between data or based on the presence or absence of data. The deeper analysis is used to uncover more complex relationships between data that may identify documents of interest. Portions of the process may use the entire document while other portions of the process may use only a portion of the document.
    Type: Grant
    Filed: March 15, 2006
    Date of Patent: May 5, 2009
    Inventors: Ji-Rong Wen, Yan-Feng Sun, Wei-Ying Ma, Zaiqing Nie, Renkuan Jiang
  • Patent number: 7529761
    Abstract: A labeling system uses a two-dimensional conditional random fields technique to label the object elements. The labeling system represents transition features and state features that depend on object elements that are adjacent in two dimensions. The labeling system represents the grid as a graph of vertices and edges with a vertex representing an object element and an edge representing a relationship between the object elements. The labeling system represents each diagonal of the graph as a sequence of states. The labeling system selects a labeling for the vertices of the diagonals that has the highest probability based on transition probabilities between vertices of adjacent diagonals and on the state probabilities of a position within a diagonal.
    Type: Grant
    Filed: December 14, 2005
    Date of Patent: May 5, 2009
    Assignee: Microsoft Corporation
    Inventors: Ji-Rong Wen, Wei-Ying Ma, Zaiqing Nie, Jun Zhu
  • Publication number: 20080215563
    Abstract: A search method uses pseudo-anchor text associated with search objects to improve search performance. The pseudo-anchor text may be extracted in combination with an identifier of the search objects (such as a pseudo-URL) from a digital corpus such as a collection of documents. Pseudo-anchor texts for each object are preferably extracted from candidate anchor blocks using a machine learning based approach. The pseudo-anchor texts are made available for searching and used to help ranking the objects in a search result to improve search performance. Method may be used in vertical search of objects such as published articles, products and images that lack explicit URL and anchor text information.
    Type: Application
    Filed: March 2, 2007
    Publication date: September 4, 2008
    Applicant: MICROSOFT CORPORATION
    Inventors: Shuming Shi, Zaiqing Nie, Ji-Rong Wen, Mingjie Zhu, Fei Xing
  • Patent number: 7383254
    Abstract: A method and system for identifying object information of an information page is provided. An information extraction system identifies the object blocks of an information page. The extraction system classifies the object blocks into object types. Each object type has associated attributes that define a schema for the information of the object type. The extraction system identifies object elements within an object block that may represent an attribute value for the object. After the object elements are identified, the extraction system attempts to identify which object elements correspond to which attributes of the object type in a process referred to as “labeling.” The extraction system uses an algorithm to determine the confidence that a certain object element corresponds to a certain attribute. The extraction system then selects the set of labels with the highest confidence as being the labels for the object elements.
    Type: Grant
    Filed: April 13, 2005
    Date of Patent: June 3, 2008
    Assignee: Microsoft Corporation
    Inventors: Ji-Rong Wen, Wei-Ying Ma, Zaiqing Nie
  • Publication number: 20080033915
    Abstract: Search results are ranked by applying sub-relevancies within a search result list. This permits search result lists to be further refined into more manageable relevant groupings from a user's perspective. The sub-relevancies or ‘group-by’ parameters are derived from search result attributes. Attribute values from the attributes are employed in a ranking scheme to group the search results based on attribute value relevancy. The grouped search results can then be displayed to users via a search result page. In one instance users can select which attribute value is used to group the search result list. Ranking processes are based on object ranking algorithms that consider each attribute value as an object type. Some instances provide for search result list group condensing based on relevancy of the attribute values as well. A top-k instance can be employed to limit the search results to control the amount of time required for processing groupings.
    Type: Application
    Filed: August 3, 2006
    Publication date: February 7, 2008
    Applicant: MICROSOFT CORPORATION
    Inventors: Hua Chen, Zaiqing Nie, Yue Liu, Qiang Wu, Wei-Ying Ma
  • Publication number: 20080027969
    Abstract: A method and system for labeling object information of an information page is provided. A labeling system identifies an object record of an information page based on the labeling of object elements within an object record and labels object elements based on the identification of an object record that contains the object elements. To identify the records and label the elements, the labeling system generates a hierarchical representation of blocks of an information page. The labeling system identifies records and elements within the records by propagating probability-related information of record labels and element labels through the hierarchy of the blocks. The labeling system generates a feature vector for each block to represent the block and calculates a probability of a label for a block being correct based on a score derived from the feature vectors associated with related blocks. The labeling system searches for the labeling of records and elements that has the highest probability of being correct.
    Type: Application
    Filed: July 31, 2006
    Publication date: January 31, 2008
    Applicant: Microsoft Corporation
    Inventors: Ji-Rong Wen, Wei-Ying Ma, Zaiqing Nie, Jun Zhu
  • Publication number: 20080027910
    Abstract: A method and system is provided for determining relevance of an object to a term based on a language model. The relevance system provides records extracted from web pages that relate to the object. To determine the relevance of the object to a term, the relevance system first determines, for each record of the object, a probability of generating that term using a language model of the record of that object. The relevance system then calculates the relevance of the object to the term by combining the probabilities. The relevance system may also weight the probabilities based on the accuracy or reliability of the extracted information for each data source.
    Type: Application
    Filed: July 25, 2006
    Publication date: January 31, 2008
    Applicant: Microsoft Corporation
    Inventors: Ji-Rong Wen, Shuming Shi, Wei-Ying Ma, Yunxiao Ma, Zaiqing Nie
  • Publication number: 20070162408
    Abstract: A content object indexing process including creating a content object knowledge index, calculating a description vector of a target content object, and indexing the target content object by searching for the description vector in the content object knowledge database. It may be difficult to search for an exact content object such as a music file or academic researcher as a conventional search index may not include related hierarchical information. A content object indexing process may add hierarchical information taken from a content object knowledge index and incorporate the hierarchical information to the index entry for a specific content object. An application of such a content object indexing process may be a world wide web search engine.
    Type: Application
    Filed: January 11, 2006
    Publication date: July 12, 2007
    Applicant: Microsoft Corporation
    Inventors: Wei-Ying Ma, Lie Lu, Ji-Rong Wen, Zhiwei Li, Zaiqing Nie, Hsiao-Wuen Hon
  • Publication number: 20070150486
    Abstract: A labeling system uses a two-dimensional conditional random fields technique to label the object elements. The labeling system represents transition features and state features that depend on object elements that are adjacent in two dimensions. The labeling system represents the grid as a graph of vertices and edges with a vertex representing an object element and an edge representing a relationship between the object elements. The labeling system represents each diagonal of the graph as a sequence of states. The labeling system selects a labeling for the vertices of the diagonals that has the highest probability based on transition probabilities between vertices of adjacent diagonals and on the state probabilities of a position within a diagonal.
    Type: Application
    Filed: December 14, 2005
    Publication date: June 28, 2007
    Applicant: Microsoft Corporation
    Inventors: Ji-Rong Wen, Wei-Ying Ma, Zaiqing Nie, Jun Zhu
  • Publication number: 20070136457
    Abstract: Features extracted from network browser pages and/or network search queries are leveraged to facilitate in detecting a user's browsing and/or searching intent. Machine learning classifiers constructed from these features automatically detect a user's online commercial intention (OCI). A user's intention can be commercial or non-commercial, with commercial intentions being informational or transactional. In one instance, an OCI ranking mechanism is employed with a search engine to facilitate in providing search results that are ranked according to a user's intention. This also provides a means to match purchasing advertisements with potential customers who are more than likely ready to make a purchase (transactional stage). Additionally, informational advertisements can be matched to users who are researching a potential purchase (informational stage).
    Type: Application
    Filed: December 14, 2005
    Publication date: June 14, 2007
    Applicant: Microsoft Corporation
    Inventors: Honghua Dai, Lee Wang, Ying Li, Zaiqing Nie, Ji-Rong Wen, Lingzhi Zhao