Patents by Inventor Jiangming Yang

Jiangming Yang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 8977632
    Abstract: A location extraction component analyzes a set of travelogues to identify all of the locations mentioned therein. A co-occurrence extraction component computes co-occurrence values for the identified locations. When the identity of a specified location is received, suggested locations for the specified location are identified through the use of the co-occurrence values. A map is displayed that encompasses an area including the specified location and the suggested locations. The map might include indicators for the specified location and for each of the suggested locations. Attributes of the indicators, such as their size or color, can be modified based upon the co-occurrence value associated with the corresponding suggested location.
    Type: Grant
    Filed: September 29, 2009
    Date of Patent: March 10, 2015
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Rong Xiao, Jiangming Yang, Lei Zhang, Xingrong Chen
  • Patent number: 8700600
    Abstract: A method and system for identifying informative links of a web site for use in crawling the web site is provided. A forum crawler analyzes sample web pages of a web forum to identify informative links and then crawls the web forum by following links determined to be informative and not following other links. The forum crawler system determines whether links are informative based on whether they are part of the overall structure of the web site or are used to select sequential information that has been split onto multiple web pages.
    Type: Grant
    Filed: January 17, 2012
    Date of Patent: April 15, 2014
    Assignee: Microsoft Corporation
    Inventors: Lei Zhang, Wei-Ying Ma, Wei Lai, Jiangming Yang, Rui Cai
  • Patent number: 8676807
    Abstract: Concepts and technologies are described herein for identifying location names within document text. Through an implementation of the concepts and technologies presented herein, functionality can be provided for identifying location names within articles, websites, travelogues, or other such documents. For instance, documents containing the names of cities, regions, countries, landmarks, or other locations may be associated with those locations. The location names may be unambiguously identified even when the location names may also have common word meanings that are not location associated or when the location name may be associated with more than one location.
    Type: Grant
    Filed: April 22, 2010
    Date of Patent: March 18, 2014
    Assignee: Microsoft Corporation
    Inventors: Rong Xiao, Jiangming Yang, Lei Zhang, Xingrong Chen
  • Patent number: 8572076
    Abstract: Concepts and technologies are described herein for mining location contexts within document text. Through an implementation of the concepts and technologies presented herein, functionality can be provided for location context mining within articles, websites, travelogues, or other such documents. A location context is a concept associated with a specific location. For example, the contexts “beach” and “hula” are associated with Hawaii. Similarly, “glacier” and “polar bear” are contexts associated with Alaska. Location context mining can automatically discover locations and location contexts by mining information from a set of documents. User interfaces to support queries of the mined information are also presented herein.
    Type: Grant
    Filed: April 22, 2010
    Date of Patent: October 29, 2013
    Assignee: Microsoft Corporation
    Inventors: Rong Xiao, Jiangming Yang, Lei Zhang, Xingrong Chen
  • Patent number: 8281246
    Abstract: A map user interface control provides functionality for displaying a map in conjunction with the display of a Web page. The map control operates in combination with a location extraction component that analyzes the contents of the Web page to identify locations mentioned therein. Once the location extraction component has identified the locations mentioned in the Web page, a map is generated that encompasses the locations identified in the Web page. Once the map has been generated, the map control displays the map in conjunction with the display of the Web page. The map might include visual indicators corresponding to the locations mentioned in the Web page. The map might also include visual indicators corresponding to other locations near the locations identified in the Web page that have been identified using co-occurrence values generated through an analysis of a set of travelogues.
    Type: Grant
    Filed: September 29, 2009
    Date of Patent: October 2, 2012
    Assignee: Microsoft Corporation
    Inventors: Rong Xiao, Jiangming Yang, Lei Zhang, Xingrong Chen
  • Patent number: 8275546
    Abstract: A location extraction component analyzes a set of travelogues to identify locations mentioned therein. A co-occurrence extraction component computes co-occurrence values for the identified locations. When a request to generate a travel route from a starting location to an ending location is received, suggested locations on or near the travel route are identified through the use of the co-occurrence values. A suggested travel route is then generated that passes through the starting location, the ending location, and the suggested locations. A map may be displayed showing the starting location, the ending location, the suggested locations, and the suggested travel route.
    Type: Grant
    Filed: September 29, 2009
    Date of Patent: September 25, 2012
    Assignee: Microsoft Corporation
    Inventors: Rong Xiao, Jiangming Yang, Lei Zhang
  • Publication number: 20120117052
    Abstract: A method and system for identifying informative links of a web site for use in crawling the web site is provided. A forum crawler analyzes sample web pages of a web forum to identify informative links and then crawls the web forum by following links determined to be informative and not following other links. The forum crawler system determines whether links are informative based on whether they are part of the overall structure of the web site or are used to select sequential information that has been split onto multiple web pages.
    Type: Application
    Filed: January 17, 2012
    Publication date: May 10, 2012
    Applicant: Microsoft Corporation
    Inventors: Lei Zhang, Wei-Ying Ma, Wei Lai, Jiangming Yang, Rui Cai
  • Patent number: 8099408
    Abstract: A method and system for identifying informative links of a web site for use in crawling the web site is provided. A forum crawler analyzes sample web pages of a web forum to identify informative links and then crawls the web forum by following links determined to be informative and not following other links. The forum crawler system determines whether links are informative based on whether they are part of the overall structure of the web site or are used to select sequential information that has been split onto multiple web pages.
    Type: Grant
    Filed: June 27, 2008
    Date of Patent: January 17, 2012
    Assignee: Microsoft Corporation
    Inventors: Lei Zhang, Wei-Ying Ma, Wei Lai, Jiangming Yang, Rui Cai
  • Patent number: 8051083
    Abstract: Described is a technology by which forum web pages are processed into clusters for classification purposes, including by determining repetitive regions between pages and associating pages that have similar repetitive regions into a common cluster. Patterns corresponding to the regions are determined, and a feature set based at least in part on those patterns (e.g., pattern frequency) is extracted from the page. The feature set of a page is compared against the feature set of another page to determine similarity therewith, e.g., via a feature space distance computation that is evaluated against a threshold distance.
    Type: Grant
    Filed: April 16, 2008
    Date of Patent: November 1, 2011
    Assignee: Microsoft Corporation
    Inventors: Wei Lai, Rui Cai, Jiangming Yang, Lei Zhang, Wei-Ying Ma
  • Publication number: 20110264664
    Abstract: Concepts and technologies are described herein for identifying location names within document text. Through an implementation of the concepts and technologies presented herein, functionality can be provided for identifying location names within articles, websites, travelogues, or other such documents. For instance, documents containing the names of cities, regions, countries, landmarks, or other locations may be associated with those locations. The location names may be unambiguously identified even when the location names may also have common word meanings that are not location associated or when the location name may be associated with more than one location.
    Type: Application
    Filed: April 22, 2010
    Publication date: October 27, 2011
    Applicant: Microsoft Corporation
    Inventors: Rong Xiao, Jiangming Yang, Lei Zhang, Xingrong Chen
  • Publication number: 20110264655
    Abstract: Concepts and technologies are described herein for mining location contexts within document text. Through an implementation of the concepts and technologies presented herein, functionality can be provided for location context mining within articles, websites, travelogues, or other such documents. A location context is a concept associated with a specific location. For example, the contexts “beach” and “hula” are associated with Hawaii. Similarly, “glacier” and “polar bear” are contexts associated with Alaska. Location context mining can automatically discover locations and location contexts by mining information from a set of documents. User interfaces to support queries of the mined information are also presented herein.
    Type: Application
    Filed: April 22, 2010
    Publication date: October 27, 2011
    Applicant: Microsoft Corporation
    Inventors: Rong Xiao, Jiangming Yang, Lei Zhang, Xingrong Chen
  • Patent number: 7962487
    Abstract: Techniques described herein allow for suggesting creation of tools for improving search engine performance. Specifically, these tools focus on producing more relevant search engine results via a URL-based query clustering method. These tools first extract tokens from Uniform Resource Locators associated with search queries. With these tokens, these tools form query clusters of common tokens. The resulting clusters can be used to help understand the similarities in user search queries via URL-based cluster queries to produce more relevant search results.
    Type: Grant
    Filed: December 29, 2008
    Date of Patent: June 14, 2011
    Assignee: Microsoft Corporation
    Inventors: Qi Liu, Ruihua Song, Jiangming Yang
  • Publication number: 20110078139
    Abstract: A location extraction component analyzes a set of travelogues to identify all of the locations mentioned therein. A co-occurrence extraction component computes co-occurrence values for the identified locations. When the identity of a specified location is received, suggested locations for the specified location are identified through the use of the co-occurrence values. A map is displayed that encompasses an area including the specified location and the suggested locations. The map might include indicators for the specified location and for each of the suggested locations. Attributes of the indicators, such as their size or color, can be modified based upon the co-occurrence value associated with the corresponding suggested location.
    Type: Application
    Filed: September 29, 2009
    Publication date: March 31, 2011
    Applicant: Microsoft Corporation
    Inventors: Rong Xiao, Jiangming Yang, Lei Zhang, Xingrong Chen
  • Publication number: 20110077848
    Abstract: A location extraction component analyzes a set of travelogues to identify locations mentioned therein. A co-occurrence extraction component computes co-occurrence values for the identified locations. When a request to generate a travel route from a starting location to an ending location is received, suggested locations on or near the travel route are identified through the use of the co-occurrence values. A suggested travel route is then generated that passes through the starting location, the ending location, and the suggested locations. A map may be displayed showing the starting location, the ending location, the suggested locations, and the suggested travel route.
    Type: Application
    Filed: September 29, 2009
    Publication date: March 31, 2011
    Applicant: Microsoft Corporation
    Inventors: Rong Xiao, Jiangming Yang, Lei Zhang
  • Publication number: 20110078575
    Abstract: A map user interface control provides functionality for displaying a map in conjunction with the display of a Web page. The map control operates in combination with a location extraction component that analyzes the contents of the Web page to identify locations mentioned therein. Once the location extraction component has identified the locations mentioned in the Web page, a map is generated that encompasses the locations identified in the Web page. Once the map has been generated, the map control displays the map in conjunction with the display of the Web page. The map might include visual indicators corresponding to the locations mentioned in the Web page. The map might also include visual indicators corresponding to other locations near the locations identified in the Web page that have been identified using co-occurrence values generated through an analysis of a set of travelogues.
    Type: Application
    Filed: September 29, 2009
    Publication date: March 31, 2011
    Applicant: Microsoft Corporation
    Inventors: Rong Xiao, Jiangming Yang, Lei Zhang, Xingrong Chen
  • Publication number: 20100211533
    Abstract: The web forum data extraction technique is designed for the structured data extraction of data on web forums using both page-level information and site-level knowledge. To do this, the technique finds the kinds of page objects a forum site has, which object a page belongs to, and how different page objects are connected with each other. This information can be obtained by re-constructing the sitemap of the target forum which is based on a Data Object Model of the target forum. The web forum data extraction technique collects three kinds of evidence for data extraction: 1) inner-page features which cover both semantic and layout information on an individual page; 2) inter-vertex features which describe linkage-related observations; and 3) inner-vertex features which characterize interrelationships among pages in one vertex. The technique employs Markov Logic Networks to combine the types of evidence statistically for inference and thereby can extract the desired structures.
    Type: Application
    Filed: February 18, 2009
    Publication date: August 19, 2010
    Applicant: Microsoft Corporation
    Inventors: Jiangming Yang, Rui Cai, Lei Zhang, Wei-Ying Ma
  • Publication number: 20100205168
    Abstract: The incremental web forum crawling technique described herein is a web forum crawling technique that employs a thread-wise strategy that takes into account thread-level statistics, for example, the number of replies and the frequency of replies, to estimate the activity trend of each thread. To extract such statistical information, the technique employs a simple yet very robust approach to extract the timestamp of each post in a discussion thread. It also employs a regression model to predict the time of the next post for each thread.
    Type: Application
    Filed: February 10, 2009
    Publication date: August 12, 2010
    Applicant: Microsoft Corporation
    Inventors: Jiangming Yang, Rui Cai, Lei Zhang, Wei-Ying Ma
  • Publication number: 20100169300
    Abstract: Techniques described herein allow for suggesting creation of tools for improving search engine performance. Specifically, these tools focus on producing more relevant search engine results via a URL-based query clustering method. These tools first extract tokens from Uniform Resource Locators associated with search queries. With these tokens, these tools form query clusters of common tokens. The resulting clusters can be used to help understand the similarities in user search queries via URL-based cluster queries to produce more relevant search results.
    Type: Application
    Filed: December 29, 2008
    Publication date: July 1, 2010
    Applicant: Microsoft Corporation
    Inventors: Qi Liu, Ruihua Song, Jiangming Yang
  • Publication number: 20090327237
    Abstract: A method and system for identifying informative links of a web site for use in crawling the web site is provided. A forum crawler analyzes sample web pages of a web forum to identify informative links and then crawls the web forum by following links determined to be informative and not following other links. The forum crawler system determines whether links are informative based on whether they are part of the overall structure of the web site or are used to select sequential information that has been split onto multiple web pages.
    Type: Application
    Filed: June 27, 2008
    Publication date: December 31, 2009
    Applicant: Microsoft Corporation
    Inventors: Lei Zhang, Wei-Ying Ma, Wei Lai, Jiangming Yang, Rui Cai
  • Publication number: 20090265363
    Abstract: Described is a technology by which forum web pages are processed into clusters for classification purposes, including by determining repetitive regions between pages and associating pages that have similar repetitive regions into a common cluster. Patterns corresponding to the regions are determined, and a feature set based at least in part on those patterns (e.g., pattern frequency) is extracted from the page. The feature set of a page is compared against the feature set of another page to determine similarity therewith, e.g., via a feature space distance computation that is evaluated against a threshold distance.
    Type: Application
    Filed: April 16, 2008
    Publication date: October 22, 2009
    Applicant: Microsoft Corporation
    Inventors: Wei Lai, Rui Cai, Jiangming Yang, Lei Zhang, Wei-Ying Ma
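
The last abstract above (forum page clustering) describes comparing pattern-frequency feature sets with a feature space distance that is evaluated against a threshold. The following is a minimal sketch of that idea, not the method claimed in the patent: the Euclidean distance, the greedy assignment to the first matching cluster, the pattern labels such as "post" and "thread-row", and the threshold value are all assumptions chosen for illustration.

```python
from collections import Counter
from math import sqrt


def pattern_features(patterns):
    """Relative frequency of each repetitive-region pattern on a page."""
    counts = Counter(patterns)
    total = sum(counts.values())
    return {p: c / total for p, c in counts.items()} if total else {}


def distance(a, b):
    """Euclidean distance between two sparse feature sets (an assumed metric)."""
    keys = set(a) | set(b)
    return sqrt(sum((a.get(k, 0.0) - b.get(k, 0.0)) ** 2 for k in keys))


def cluster_pages(pages, threshold):
    """Greedily place each page in the first cluster whose representative
    (the first page assigned to it) lies within the threshold distance."""
    clusters = []  # list of (representative_features, [page_ids])
    for page_id, patterns in pages.items():
        feats = pattern_features(patterns)
        for rep, members in clusters:
            if distance(feats, rep) <= threshold:
                members.append(page_id)
                break
        else:
            # No existing cluster is close enough, so start a new one.
            clusters.append((feats, [page_id]))
    return [members for _, members in clusters]


# Hypothetical pages, each reduced to the patterns detected in its regions.
pages = {
    "thread-1": ["post", "post", "post", "nav"],
    "thread-2": ["post", "post", "nav"],
    "board-1": ["thread-row", "thread-row", "thread-row", "nav"],
}
clusters = cluster_pages(pages, threshold=0.3)
print(clusters)
```

With these sample inputs the two thread pages have similar pattern frequencies and land in one cluster, while the board page starts its own. A real system would need a pattern detector for the repetitive regions, which is outside the scope of this sketch.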