Patents by Inventor Jianfeng Gao

Jianfeng Gao has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20150032767
    Abstract: Various technologies described herein pertain to use of path-constrained random walks for query expansion and/or query document matching. Clickthrough data from search logs is represented as a labeled and directed graph. Path-constrained random walks are executed over the graph based upon an input query. The graph includes a first set of nodes that represent queries included in the clickthrough data from search logs, a second set of nodes that represent documents included in the clickthrough data from the search logs, a third set of nodes that represent words from the queries and the documents, and edges between nodes that represent relationships between queries, documents, and words. The path-constrained random walks include traversals over edges of the graph between nodes. Further, a score for a relationship between a target node and a source node representative of the input query is computed based at least in part upon the path-constrained random walks.
    Type: Application
    Filed: July 26, 2013
    Publication date: January 29, 2015
    Applicant: Microsoft Corporation
    Inventors: Jianfeng Gao, Gu Xu, Jinxi Xu
  • Publication number: 20140365201
    Abstract: Various technologies described herein pertain to training and utilizing a general, statistical framework for modeling translation via Markov random fields (MRFs). An MRF-based translation model can be employed in a statistical machine translation (SMT) system. The MRF-based translation model allows for arbitrary features extracted from a phrase pair to be incorporated as evidence. The parameters of the model are estimated using a large-scale discriminative training approach based on stochastic gradient ascent and an N-best list based expected Bilingual Evaluation Understudy (BLEU) as an objective function.
    Type: Application
    Filed: February 18, 2014
    Publication date: December 11, 2014
    Applicant: Microsoft Corporation
    Inventors: Jianfeng Gao, Xiaodong He
  • Patent number: 8909573
    Abstract: An alteration candidate for a query can be scored. The scoring may include computing one or more query-dependent feature scores and/or one or more intra-candidate dependent feature scores. The computation of the query-dependent feature score(s) can be based on dependencies to multiple query terms from each of one or more alteration terms (i.e., for each of the one or more alteration terms, there can be dependencies to multiple query terms that form at least a portion of the basis for the query-dependent feature score(s)). The computation of the intra-candidate dependent feature score(s) can be based on dependencies between different terms in the alteration candidate. A candidate score can be computed using the query dependent feature score(s) and/or the intra-candidate dependent feature score(s). Additionally, the candidate score can be used in determining whether to select the candidate to expand the query. If selected, the candidate can be used to expand the query.
    Type: Grant
    Filed: July 29, 2013
    Date of Patent: December 9, 2014
    Assignee: Microsoft Corporation
    Inventors: Shasha Xie, Xiaodong He, Jianfeng Gao
  • Publication number: 20140336149
    Abstract: A docetaxel inclusion complex having improved water-solubility (up to 5 mg/ml) and stability (stability constant Ka=2056M?1-13051M?1), comprises docetaxel and hydroxypropyl-beta-cyclodextrin and/or sulfobutyl-beta-cyclodextrin in a ratio of 1:10-150. The method includes steps as follows: docetaxel dissolved in ethanol is added into water solution of cyclodextrin via stirring, until docetaxel is completely dissolved; said solution is filtered in 0.2-04 ?m microporous membrane then ethanol is removed through reduced pressure to obtain the inclusion complex in a liquid form; or ethanol, followed by water is removed through reduced pressure, then dried to obtain the inclusion complex in a solid form.
    Type: Application
    Filed: May 21, 2014
    Publication date: November 13, 2014
    Inventors: Yong Ren, Jianfeng Gao, Shuqin Yu, Ling Wu
  • Patent number: 8838433
    Abstract: An architecture is discussed that provides the capability to subselect the most relevant data from an out-domain corpus to use either in isolation or in combination conjunction with in-domain data. The architecture is a domain adaptation for machine translation that selects the most relevant sentences from a larger general-domain corpus of parallel translated sentences. The methods for selecting the data include monolingual cross-entropy measure, monolingual cross-entropy difference, bilingual cross entropy, and bilingual cross-entropy difference. A translation model is trained on both the in-domain data and an out-domain subset, and the models can be interpolated together to boost performance on in-domain translation tasks.
    Type: Grant
    Filed: February 8, 2011
    Date of Patent: September 16, 2014
    Assignee: Microsoft Corporation
    Inventors: Amittai Axelrod, Jianfeng Gao, Xiaodong He
  • Publication number: 20140222724
    Abstract: A log-linear model may be trained using a modified version of an original limited-memory Broyden-Fletcher-Goldfarb-Shanno (L-BFGS) algorithm. The modified version may be based on modifying the original L-BFGS algorithm using a single map-reduce implementation. In another aspect, a sparse log-linear model may be accessed. The sparse log-linear model may be trained with L1-regularization, based on data indicating past user ad selection behaviors. A probability of a user selection of an ad may be determined based on the sparse log-linear model.
    Type: Application
    Filed: February 2, 2013
    Publication date: August 7, 2014
    Applicant: MICROSOFT CORPORATION
    Inventors: Jianfeng Gao, Xuedong Huang, Zhenghao Wang, Yunhong Zhou
  • Patent number: 8765716
    Abstract: A docetaxel inclusion complex having improved water-solubility (up to 15 mg/ml) and stability (stability constant Ka=2056 M?1-13051 M?1), comprises docetaxel and hydroxypropyl-beta-cyclodextrin and/or sulfobutyl-beta-cyclodextrin in a ratio of 1:10-150. The method includes steps as follows: docetaxel dissolved in ethanol is added into water solution of cyclodextrin via stirring, until docetaxel is completely dissolved; said solution is filtered in 0.2-04 ?m microporous membrane then ethanol is removed through reduced pressure to obtain the inclusion complex in a liquid form; or ethanol, followed by water is removed through reduced pressure, then dried to obtain the inclusion complex in a solid form.
    Type: Grant
    Filed: July 1, 2013
    Date of Patent: July 1, 2014
    Assignee: Meridian Laboratories, Inc.
    Inventors: Yong Ren, Jianfeng Gao, Shuqin Yu, Ling Wu
  • Publication number: 20140149429
    Abstract: A computer-implemented method and system for Web search ranking are provided herein. The method includes generating a number of training samples from clickthrough data, wherein the training samples include positive query-document pairs and negative query-document pairs. The method also includes discriminatively training a translation model based on the training samples and ranking a number of documents for a Web search based on the translation model.
    Type: Application
    Filed: November 29, 2012
    Publication date: May 29, 2014
    Applicant: MICROSOFT CORPORATION
    Inventors: Jianfeng Gao, Zhonghua Qu, Gu Xu
  • Patent number: 8738356
    Abstract: The universal text input technique described herein addresses the difficulties of typing text in various languages and scripts, and offers a unified solution, which combines character conversion, next word prediction, spelling correction and automatic script switching to make it extremely simple to type any language from any device. The technique provides a rich and seamless input experience in any language through a universal IME (input method editor). It allows a user to type in any script for any language using a regular qwerty keyboard via phonetic input and at the same time allows for auto-completion and spelling correction of words and phrases while typing. The technique also provides a modeless input that automatically turns on and off an input mode that changes between different types of script.
    Type: Grant
    Filed: May 18, 2011
    Date of Patent: May 27, 2014
    Assignee: Microsoft Corp.
    Inventors: Hisami Suzuki, Vikram Dendi, Christopher Brian Quirk, Pallavi Choudhury, Jianfeng Gao, Achraf Chalabi
  • Patent number: 8732151
    Abstract: Systems, methods, and computer media for identifying query rewriting replacement terms are provided. A list of related string pairs each comprising a first string and second string is received. The first string of each related string pair is a user search query extracted from user click log data. For one or more of the related string pairs, the string pair is provided as inputs to a statistical machine translation model. The model identifies one or more pairs of corresponding terms, each pair of corresponding terms including a first term from the first string and a second term from the second string. The model also calculates a probability of relatedness for each of the one or more pairs of corresponding terms. Term pairs whose calculated probability of relatedness exceeds a threshold are characterized as query term replacements and incorporated, along with the probability of relatedness, into a query rewriting candidate database.
    Type: Grant
    Filed: April 1, 2011
    Date of Patent: May 20, 2014
    Assignee: Microsoft Corporation
    Inventors: Alnur Ali, Jianfeng Gao, Xiaodong He, Bodo von Billerbeck, Sanaz Ahari
  • Publication number: 20140061306
    Abstract: A wireless scanner is described that performs a pairing operation with a wireless scanner base before commencing scanning operations in a wireless scanner network. Radio frequency identification (RFID) is used to achieve the pairing operation of the wireless scanner with the wireless scanner base by using an RFID tag associated with the wireless scanner base. The RFID tag in the wireless scanner base may contain pairing information such as a network address of the wireless scanner base for use in automatically establishing a wireless communication session with the wireless scanner base in accordance with another wireless protocol.
    Type: Application
    Filed: August 22, 2013
    Publication date: March 6, 2014
    Applicant: Hand Held Products, Inc.
    Inventors: Jerry Wu, Jianfeng Gao, Hong Jian Jin
  • Patent number: 8645289
    Abstract: A “Cross-Lingual Unified Relevance Model” provides a feedback model that improves a machine-learned ranker for a language with few training resources, using feedback from a more complete ranker for a language that has more training resources. The model focuses on linguistically non-local queries, such as “world cup” (English language/U.S. market) and “copa mundial” (Spanish language/Mexican market), that have similar user intent in different languages and markets or regions, thus allowing the low-resource ranker to receive direct relevance feedback from the high-resource ranker. Among other things, the Cross-Lingual Unified Relevance Model differs from conventional relevancy-based techniques by incorporating both query- and document-level features.
    Type: Grant
    Filed: December 16, 2010
    Date of Patent: February 4, 2014
    Assignee: Microsoft Corporation
    Inventors: Paul Nathan Bennett, Jianfeng Gao, Jagadeesh Jagarlamudi, Kristen Patricia Parton
  • Publication number: 20130311504
    Abstract: An alteration candidate for a query can be scored. The scoring may include computing one or more query-dependent feature scores and/or one or more intra-candidate dependent feature scores. The computation of the query-dependent feature score(s) can be based on dependencies to multiple query terms from each of one or more alteration terms (i.e., for each of the one or more alteration terms, there can be dependencies to multiple query terms that form at least a portion of the basis for the query-dependent feature score(s)). The computation of the intra-candidate dependent feature score(s) can be based on dependencies between different terms in the alteration candidate. A candidate score can be computed using the query dependent feature score(s) and/or the intra-candidate dependent feature score(s). Additionally, the candidate score can be used in determining whether to select the candidate to expand the query. If selected, the candidate can be used to expand the query.
    Type: Application
    Filed: July 29, 2013
    Publication date: November 21, 2013
    Applicant: Microsoft Corporation
    Inventors: Shasha Xie, Xiaodong He, Jianfeng Gao
  • Publication number: 20130296268
    Abstract: A docetaxel inclusion complex having improved water-solubility (up to 15 mg/ml) and stability (stability constant Ka=2056 M?1-13051 M?1), comprises docetaxel and hydroxypropyl-beta-cyclodextrin and/or sulfobutyl-beta-cyclodextrin in a ratio of 1:10-150. The method includes steps as follows: docetaxel dissolved in ethanol is added into water solution of cyclodextrin via stirring, until docetaxel is completely dissolved; said solution is filtered in 0.2-04 ?m microporous membrane then ethanol is removed through reduced pressure to obtain the inclusion complex in a liquid form; or ethanol, followed by water is removed through reduced pressure, then dried to obtain the inclusion complex in a solid form.
    Type: Application
    Filed: July 1, 2013
    Publication date: November 7, 2013
    Inventors: Yong Ren, Jianfeng Gao, Shuqin Yu
  • Patent number: 8521672
    Abstract: An alteration candidate for a query can be scored. The scoring may include computing one or more query-dependent feature scores and/or one or more intra-candidate dependent feature scores. The computation of the query-dependent feature score(s) can be based on dependencies to multiple query terms from each of one or more alteration terms (i.e., for each of the one or more alteration terms, there can be dependencies to multiple query terms that form at least a portion of the basis for the query-dependent feature score(s)). The computation of the intra-candidate dependent feature score(s) can be based on dependencies between different terms in the alteration candidate. A candidate score can be computed using the query dependent feature score(s) and/or the intra-candidate dependent feature score(s). Additionally, the candidate score can be used in determining whether to select the candidate to expand the query. If selected, the candidate can be used to expand the query.
    Type: Grant
    Filed: November 22, 2010
    Date of Patent: August 27, 2013
    Assignee: Microsoft Corporation
    Inventors: Shasha Xie, Xiaodong He, Jianfeng Gao
  • Patent number: 8515950
    Abstract: Log-based rankers and document-based rankers may be combined for searching. In an example embodiment, there is a method for combining rankers to perform a search operation. A count of query instances in log data is ascertained based on a query. A search for the query is performed to produce a set of search results. The set of search results is ranked by relevance score with a document-based ranker and a log-based ranker using a weighting factor that is adapted responsive to the count of the query instances in the log data.
    Type: Grant
    Filed: October 1, 2008
    Date of Patent: August 20, 2013
    Assignee: Microsoft Corporation
    Inventors: Jianfeng Gao, Kuansan Wang
  • Patent number: 8481511
    Abstract: A docetaxel inclusion complex having improved water-solubility (up to 15 mg/ml) and stability (stability constant Ka=2056M?1-13051M?1), comprises docetaxel and hydroxypropyl-beta-cyclodextrin and/or sulfobutyl-beta-cyclodextrin in a ratio of 1:10-150. The method includes steps as follows: docetaxel dissolved in ethanol is added into water solution of cyclodextrin via stirring, until docetaxel is completely dissolved; said solution is filtered in 0.2-04 ?m microporous membrane then ethanol is removed through reduced pressure to obtain the inclusion complex in a liquid form; or ethanol, followed by water is removed through reduced pressure, then dried to obtain the inclusion complex in a solid form.
    Type: Grant
    Filed: October 13, 2006
    Date of Patent: July 9, 2013
    Assignee: Hainan Hdeton Science and Technology Co., Ltd.
    Inventors: Yong Ren, Jianfeng Gao, Shuqin Yu, Ling Wu
  • Patent number: 8473486
    Abstract: A supervised technique uses relevance judgments to train a dependency parser such that it approximately optimizes Normalized Discounted Cumulative Gain (NDCG) in information retrieval. A weighted tree edit distance between the parse tree for a query and the parse tree for a document is added to a ranking function, where the edit distance weights are parameters from the parser. Using parser parameters in the ranking function enables approximate optimization of the parser's parameters for NDCG by adding some constraints to the objective function.
    Type: Grant
    Filed: December 8, 2010
    Date of Patent: June 25, 2013
    Assignee: Microsoft Corporation
    Inventors: Xiaodong He, Jianfeng Gao, Jennifer Gillenwater
  • Publication number: 20130159320
    Abstract: There is provided a computer-implemented method and system for ranking documents. The method includes identifying a number of query-document pairs based on clickthrough data for a number of documents. The method also includes building a latent semantic model based on the query-document pairs and ranking the documents for a search based on the latent semantic model.
    Type: Application
    Filed: December 19, 2011
    Publication date: June 20, 2013
    Applicant: MICROSOFT CORPORATION
    Inventors: Jianfeng Gao, Kristina Toutanova, Wen-tau Yih
  • Publication number: 20130124492
    Abstract: Statistical Machine Translation (SMT) based search query spelling correction techniques are described herein. In one or more implementations, search data regarding searches performed by clients may be logged. The logged data includes query correction pairs that may be used to ascertain error patterns indicating how misspelled substrings may be translated to corrected substrings. The error patterns may be used to determine suggestions for an input query and to develop query correction models used to translate the input query to a corrected query. In one or more implementations, probabilistic features from multiple query correction models are combined to score different correction candidates. One or more top scoring correction candidates may then be exposed as suggestions for selection by a user and/or provided to a search engine to conduct a corresponding search using the corrected query version(s).
    Type: Application
    Filed: November 15, 2011
    Publication date: May 16, 2013
    Applicant: MICROSOFT CORPORATION
    Inventors: Jianfeng Gao, Mei-Yuh Hwang, Xuedong D. Huang, Christopher Brian Quirk, Zhenghao Wang