Patents by Inventor Chin-Yew Lin

Chin-Yew Lin has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 8473499
    Abstract: Techniques for unsupervised management of a question and answer (QA) forum include labeling of answers for quality purposes, and identification of experts. In a QA thread, a ranking of answers may include an initial labeling of the longest answer in each thread as the best answer. Such a labeling provides an initial point of reference. Then, in an iterative manner answerers are ranked using the labeling. The ranking of answerers allows selection of experts and poor or inexpert answerers. A label update is performed using the experts (and perhaps inexpert answerers) as input. The label update may be used to train a model, which may describe quality of answers in one or more QA threads and an indication of expert and inexpert answerers. The iterative process may be ended upon convergence or upon a maximum number of iterations.
    Type: Grant
    Filed: October 17, 2011
    Date of Patent: June 25, 2013
    Assignee: Microsoft Corporation
    Inventors: Young-In Song, Liu Jing, Chin-Yew Lin, Tetsuya Sakai
  • Publication number: 20130097178
    Abstract: Techniques for unsupervised management of a question and answer (QA) forum include labeling of answers for quality purposes, and identification of experts. In a QA thread, a ranking of answers may include an initial labeling of the longest answer in each thread as the best answer. Such a labeling provides an initial point of reference. Then, in an iterative manner answerers are ranked using the labeling. The ranking of answerers allows selection of experts and poor or inexpert answerers. A label update is performed using the experts (and perhaps inexpert answerers) as input. The label update may be used to train a model, which may describe quality of answers in one or more QA threads and an indication of expert and inexpert answerers. The iterative process may be ended upon convergence or upon a maximum number of iterations.
    Type: Application
    Filed: October 17, 2011
    Publication date: April 18, 2013
    Applicant: Microsoft Corporation
    Inventors: Young-In Song, Liu Jing, Chin-Yew Lin, Tetsuya Sakai
  • Publication number: 20120124086
    Abstract: Described herein are techniques for extracting data records containing user-generated content from documents. The documents may be processed into document trees in which sub-trees represent the data records of the document. Domain constraints may be used to locate structured portions of the document tree. For example, anchor trees may be located as being sets of sibling sub-trees with similar tag paths that contain the domain constraints. The anchor trees may then be used to determine a record boundary (e.g., the start offset and length) of the data records. Finally, the data records may be extracted based on the anchor trees and the record boundaries.
    Type: Application
    Filed: January 23, 2012
    Publication date: May 17, 2012
    Applicant: MICROSOFT CORPORATION
    Inventors: Xinying Song, Zhiyuan Chen, Yunbo Cao, Chin-Yew Lin
  • Publication number: 20120124077
    Abstract: Embodiments for a Mining Data Records based on Anchor Trees (MiBAT) process are disclosed. In accordance with at least one embodiment, the MiBAT process extracts data records containing user-generated content from web documents. The web document is processed into a Document Object Model (DOM) tree in which sub-trees of the DOM tree represent the data records of the web document. Domain constraints are used to locate structured portions of the DOM tree. Anchor trees are then located as being sets of sibling sub-trees which contain the domain constraints. The anchor trees are then used to determine a record boundary (i.e., the start offset and length) of the data records. Finally, the data records are extracted based on the anchor trees and the record boundaries.
    Type: Application
    Filed: November 12, 2010
    Publication date: May 17, 2012
    Applicant: MICROSOFT CORPORATION
    Inventors: Xinying Song, Yunbo Cao, Chin-Yew Lin
  • Patent number: 8112269
    Abstract: A question search system provides a collection of questions having words for use in evaluating the utility of the questions based on a language model. The question search system calculates n-gram probabilities for words within the questions of the collection. The n-gram probability of a word for a sequence of n-1 words indicates the probability of that word being next after that sequence in the collection of questions. The n-gram probabilities for the words of the collection represent the language model of the collection. The question search system calculates a language model utility score for each question within a collection that indicates the likelihood that a question is repeatedly asked by users. The question search system derives the language model utility score for a question from the n-gram probabilities of the words within that question.
    Type: Grant
    Filed: August 25, 2008
    Date of Patent: February 7, 2012
    Assignee: Microsoft Corporation
    Inventors: Yunbo Cao, Chin-Yew Lin
  • Publication number: 20110302157
    Abstract: One or more techniques and systems are disclosed for generating comparative patterns for use in identifying comparators. A set of comparator pairs is extracted from a first comparative pattern in a pattern database that comprises one or more comparative patterns. Questions are retrieved from a question collection using respective comparator pairs to generate comparative questions. Potential comparative patterns are generated from a combination of the comparator pairs and comparative questions, and the potential comparative patterns are evaluated by determining their reliability, in order to generate second comparative patterns for the pattern database.
    Type: Application
    Filed: June 8, 2010
    Publication date: December 8, 2011
    Applicant: Microsoft Corporation
    Inventors: Shasha Li, Chin-Yew Lin, Youngin Song
  • Patent number: 8027973
    Abstract: A method and system for determining the relevance of questions to a queried question based on topics and focuses of the questions is provided. A question search system provides a collection of questions with topics and focuses. Upon receiving a queried question, the question search system identifies a queried topic and queried focus of the queried question. The question search system generates a score indicating the relevance of a question of the collection to the queried question based on a language model of the topic of the question and a language model of the focus of the question.
    Type: Grant
    Filed: August 4, 2008
    Date of Patent: September 27, 2011
    Assignee: Microsoft Corporation
    Inventors: Yunbo Cao, Chin-Yew Lin
  • Patent number: 8024332
    Abstract: A method and system for presenting questions that are relevant to a queried question based on clusters of topics and clusters of focuses of the questions is provided. A question search system provides a collection of questions. Each question of the collection has an associated topic and focus. Upon receiving a queried question, the question search system identifies questions of the collection that may be relevant to the queried question and generates a score or ranking indicating relevance of the identified questions. The question search system clusters the identified questions into topic clusters of questions with similar topics. The question search system may also cluster the questions within each topic cluster into focus clusters of questions with similar focuses.
    Type: Grant
    Filed: August 4, 2008
    Date of Patent: September 20, 2011
    Assignee: Microsoft Corporation
    Inventors: Yunbo Cao, Chin-Yew Lin
  • Patent number: 7966316
    Abstract: In a question answering system, the system identifies a type of question input by a user. The system then generates answer summaries that summarize answers to the input question in a format that is determined based on the type of question asked by the user. The answer summaries are output, in the corresponding format, in answer to the input question.
    Type: Grant
    Filed: April 15, 2008
    Date of Patent: June 21, 2011
    Assignee: Microsoft Corporation
    Inventors: Yunbo Cao, Chin-Yew Lin
  • Publication number: 20100235343
    Abstract: Exemplary methods, computer-readable media, and systems are presented for learning to recommend questions and other user-generated submissions to community sites based on user ratings. The size of available training data is enlarged by taking into consideration questions without user ratings, which in turn benefits the learned model. Questions or other user-generated submissions are obtained by crawling Internet-accessible Web sites including community sites. Questions and other submissions, even when not tagged, voted or indicated as “popular” or “interesting” by users, are quantitatively identified as “interesting.”
    Type: Application
    Filed: September 29, 2009
    Publication date: September 16, 2010
    Applicant: Microsoft Corporation
    Inventors: Yunbo Cao, Chin-Yew Lin, Young-In Song
  • Publication number: 20100235311
    Abstract: Exemplary methods, computer-readable media, and systems are presented for leveraging question-answering knowledge from community sites by complementing product search services with a search of questions, answers, reviews and other Internet accessible content including user-generated content. Product or service information is obtained by crawling Internet-accessible Web sites including community sites. An integrated index of such information is generated. A user is able to browse questions by product or service feature, by topic, by identified comparative questions, and by question ranking (for example, interestingness or popularity).
    Type: Application
    Filed: March 13, 2009
    Publication date: September 16, 2010
    Applicant: Microsoft Corporation
    Inventors: Yunbo Cao, Chin-Yew Lin, Bo Wang
  • Patent number: 7725442
    Abstract: A probability distribution for a reference summary of a document is determined. The probability distribution for the reference summary is then used to generate a score for a machine-generated summary of the document.
    Type: Grant
    Filed: February 6, 2007
    Date of Patent: May 25, 2010
    Assignee: Microsoft Corporation
    Inventors: Chin-Yew Lin, Jianfeng Gao, Guihong Cao, Jian-Yun Nie
  • Publication number: 20100076978
    Abstract: In this paper, we propose a new approach to extracting question-context-answer triples from online discussion forums. More specifically, we propose a general framework based on Conditional Random Fields (CRFs) for context and answer detection, and also extend the basic framework to utilize contexts for answer detection and to better accommodate the features of forums.
    Type: Application
    Filed: September 9, 2008
    Publication date: March 25, 2010
    Applicant: Microsoft Corporation
    Inventors: Gao Cong, Chin-Yew Lin, Shilin Ding
  • Publication number: 20100063797
    Abstract: The present invention provides a new approach to extracting question-answer pairs from online forums. The system develops a classification-based technique to discover questions in forums using sequential patterns automatically extracted from both questions and non-question sentences in forums as features. Once the questions are discovered, the system discovers the answers. The invention includes a graph-based method that is complementary to supervised methods for knowledge extraction, as well as techniques for question answering.
    Type: Application
    Filed: September 9, 2008
    Publication date: March 11, 2010
    Applicant: Microsoft Corporation
    Inventors: Gao Cong, Chin-Yew Lin
  • Publication number: 20100049498
    Abstract: A question search system provides a collection of questions having words for use in evaluating the utility of the questions based on a language model. The question search system calculates n-gram probabilities for words within the questions of the collection. The n-gram probability of a word for a sequence of n-1 words indicates the probability of that word being next after that sequence in the collection of questions. The n-gram probabilities for the words of the collection represent the language model of the collection. The question search system calculates a language model utility score for each question within a collection that indicates the likelihood that a question is repeatedly asked by users. The question search system derives the language model utility score for a question from the n-gram probabilities of the words within that question.
    Type: Application
    Filed: August 25, 2008
    Publication date: February 25, 2010
    Applicant: Microsoft Corporation
    Inventors: Yunbo Cao, Chin-Yew Lin
  • Publication number: 20100030770
    Abstract: A method and system for determining the relevance of questions to a queried question based on topics and focuses of the questions is provided. A question search system provides a collection of questions with topics and focuses. Upon receiving a queried question, the question search system identifies a queried topic and queried focus of the queried question. The question search system generates a score indicating the relevance of a question of the collection to the queried question based on a language model of the topic of the question and a language model of the focus of the question.
    Type: Application
    Filed: August 4, 2008
    Publication date: February 4, 2010
    Applicant: Microsoft Corporation
    Inventors: Yunbo Cao, Chin-Yew Lin
  • Publication number: 20100030769
    Abstract: A method and system for presenting questions that are relevant to a queried question based on clusters of topics and clusters of focuses of the questions is provided. A question search system provides a collection of questions. Each question of the collection has an associated topic and focus. Upon receiving a queried question, the question search system identifies questions of the collection that may be relevant to the queried question and generates a score or ranking indicating relevance of the identified questions. The question search system clusters the identified questions into topic clusters of questions with similar topics. The question search system may also cluster the questions within each topic cluster into focus clusters of questions with similar focuses.
    Type: Application
    Filed: August 4, 2008
    Publication date: February 4, 2010
    Applicant: Microsoft Corporation
    Inventors: Yunbo Cao, Chin-Yew Lin
  • Publication number: 20090259642
    Abstract: In a question answering system, the system identifies a type of question input by a user. The system then generates answer summaries that summarize answers to the input question in a format that is determined based on the type of question asked by the user. The answer summaries are output, in the corresponding format, in answer to the input question.
    Type: Application
    Filed: April 15, 2008
    Publication date: October 15, 2009
    Applicant: MICROSOFT CORPORATION
    Inventors: Yunbo Cao, Chin-Yew Lin
  • Publication number: 20090253112
    Abstract: The present system graphs topic terms in stored cQA questions and also converts a submitted question into a graph of topic terms. Topic terms that correspond to a question topic are delineated from topic terms that correspond to question focus. New questions are recommended to the user based on a comparison between the topics of the new questions and the topic of the submitted question as well as the focus of the new questions and the focus of the submitted question.
    Type: Application
    Filed: April 7, 2008
    Publication date: October 8, 2009
    Applicant: MICROSOFT CORPORATION
    Inventors: Yunbo Cao, Chin-Yew Lin
  • Publication number: 20090083096
    Abstract: A method for handling product reviews can detect a first quality product review from a second quality product review. The first and second quality product reviews can be associated with a product. The first quality product review can be filtered. An opinion segment in the second quality product review can be identified and the polarity of the opinion segment can be determined. An opinion set can be generated with the opinion segment for a product feature. A score (or weight) of the segments in the opinion set can be aggregated for the product feature.
    Type: Application
    Filed: September 20, 2007
    Publication date: March 26, 2009
    Applicant: Microsoft Corporation
    Inventors: Yunbo Cao, Chin-Yew Lin, Ming Zhou
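
The language model utility score described in the abstract of patent 8112269 (and its application, 20100049498) can be illustrated with a small sketch. This is not the patented method: the abstract does not say which n, which smoothing, or which normalization is used, so the bigram order (n = 2), add-one smoothing, whitespace tokenization, and length-normalized log probability below are all assumptions, and the function names are hypothetical.

```python
import math
from collections import Counter


def train_bigram_model(questions):
    """Collect bigram statistics from a collection of questions.

    Assumption: lowercased whitespace tokens and a "<s>" start marker.
    """
    contexts = Counter()   # how often each word appears as a preceding word
    bigrams = Counter()    # how often each (previous, word) pair appears
    vocab = set()          # every word that can be predicted
    for question in questions:
        tokens = ["<s>"] + question.lower().split()
        vocab.update(tokens[1:])
        contexts.update(tokens[:-1])
        bigrams.update(zip(tokens, tokens[1:]))
    return contexts, bigrams, len(vocab)


def bigram_prob(model, prev, word):
    """P(word | prev) with add-one smoothing (an assumed choice)."""
    contexts, bigrams, vocab_size = model
    return (bigrams[(prev, word)] + 1) / (contexts[prev] + vocab_size)


def utility_score(model, question):
    """Average log bigram probability of the question's words.

    Higher (closer to zero) means the question looks more like the
    questions that recur in the collection.
    """
    tokens = ["<s>"] + question.lower().split()
    pairs = list(zip(tokens, tokens[1:]))
    if not pairs:
        return float("-inf")
    log_prob = sum(math.log(bigram_prob(model, p, w)) for p, w in pairs)
    return log_prob / len(pairs)


collection = [
    "how do i reset my password",
    "how do i reset my router",
    "how do i change my password",
    "what is the airspeed of a swallow",
]
model = train_bigram_model(collection)
common = utility_score(model, "how do i reset my password")
rare = utility_score(model, "why zebra stripes vanish")
```

In this toy collection, `common` comes out higher than `rare`, because the first question reuses word sequences that appear repeatedly in the collection. Dividing by the number of bigrams keeps long questions from being penalized purely for their length; a different normalization would be equally consistent with the abstract.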