Patents by Inventor Tapas Kanungo
Tapas Kanungo has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 9524335
Abstract: Systems, methods, and computer-readable storage media are provided for conflating entities using a persistent entity index. Information (including attributes) pertaining to a plurality of entities is received. The received information is either matched with one or more existing entities in the persistent entity index or, if no match is found, selected for addition to the persistent entity index. The persistent entity index includes entity-attribute pairs associated therewith. Attributes associated with matching entities for which information is received are aggregated and/or reconciled with the entity-attribute pairs associated with existing entities included in the persistent entity index. The persistent entity index may be incrementally updated at predetermined time intervals to insure the accuracy and freshness of the information associated therewith.
Type: Grant
Filed: June 18, 2013
Date of Patent: December 20, 2016
Assignee: Microsoft Technology Licensing, LLC
Inventors: Tapas Kanungo, Ashok Ponnuswami, Richard Jungiang Qian
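The match-or-add flow described in this abstract can be illustrated with a minimal sketch. The abstract does not say how entities are matched or how conflicting attributes are reconciled, so the name-based key (`normalize`), the record layout, and the "keep every observed value" policy below are all assumptions made purely for illustration.

```python
def normalize(name):
    """Hypothetical matching key: lower-cased, whitespace-collapsed name."""
    return " ".join(name.lower().split())


def conflate(index, incoming):
    """Merge incoming entity records into a persistent index.

    index:    dict mapping entity key -> dict of attribute -> set of values
    incoming: iterable of {"name": str, "attributes": dict} records
    Matching entities have their attributes aggregated; unmatched
    entities are added as new index entries.
    """
    for entity in incoming:
        key = normalize(entity["name"])
        attributes = index.setdefault(key, {})  # adds the entity if no match
        for attr, value in entity["attributes"].items():
            attributes.setdefault(attr, set()).add(value)
    return index


index = {}
conflate(index, [{"name": "Acme Corp", "attributes": {"city": "Seattle"}}])
conflate(index, [
    {"name": "acme  corp", "attributes": {"phone": "555-0100"}},
    {"name": "Globex", "attributes": {"city": "Springfield"}},
])
```

Calling `conflate` again with a later batch corresponds to the incremental update mentioned in the abstract; a real system would also need a reconciliation rule for conflicting values.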
-
Patent number: 9495349
Abstract: A system and method for processing a document to generate a set of related documents. A system is provided that includes a textual analytics system that analyzes unstructured data contained in a source document and extracts a set of structured information about the source document; and a compare system that identifies a set of related documents by comparing the set of structured information with metadata indexed from a set of publications.
Type: Grant
Filed: November 17, 2005
Date of Patent: November 15, 2016
Assignee: International Business Machines Corporation
Inventors: Robert L. Angell, Stephen K. Boyer, James W. Cooper, Richard A. Hennessy, Tapas Kanungo, Jeffrey T. Kreulen, David C. Martin, James J. Rhodes, W. Scott Spangler, Herschel J. R. Weintraub
-
Patent number: 8984398
Abstract: The subject matter disclosed herein relates to generating an abstract of an electronic document based at least in part on a utility metric associated with a group phrases as well as on a ranked order of sentences.
Type: Grant
Filed: August 28, 2008
Date of Patent: March 17, 2015
Assignee: Yahoo! Inc.
Inventors: Tapas Kanungo, Jan Pedersen, Tamas Sarlos
-
Publication number: 20140372441
Abstract: Systems, methods, and computer-readable storage media are provided for conflating entities using a persistent entity index. Information (including attributes) pertaining to a plurality of entities is received. The received information is either matched with one or more existing entities in the persistent entity index or, if no match is found, selected for addition to the persistent entity index. The persistent entity index includes entity-attribute pairs associated therewith. Attributes associated with matching entities for which information is received are aggregated and/or reconciled with the entity-attribute pairs associated with existing entities included in the persistent entity index. The persistent entity index may be incrementally updated at predetermined time intervals to insure the accuracy and freshness of the information associated therewith.
Type: Application
Filed: June 18, 2013
Publication date: December 18, 2014
Inventors: TAPAS KANUNGO, ASHOK PONNUSWAMI, RICHARD JUNGIANG QIAN
-
Publication number: 20140067816
Abstract: In an effort to enhance computer user engagement with a search results page, systems and methods are presented which are configured to identify an entity as being the subject matter of a user's search query. If the entity is a known entity, i.e., entity information is stored in an entity store for the identified entity, a subset of entity attributes are identified and a representative entity attribute question is obtained for each of the attributes in the subset of entity attributes. The representative entity attribute questions are identified according to the probability that they are formed linguistically correct. The representative entity attribute questions are included in a search results page that is generated in response to the user's search query.
Type: Application
Filed: August 29, 2012
Publication date: March 6, 2014
Applicant: MICROSOFT CORPORATION
Inventors: Tapas Kanungo, Ashok Ponnuswami
-
Patent number: 8554700
Abstract: This patent application pertains to answer model comparison. One implementation can determine a first frequency at which an individual answer category appears in an individual slot on a query results page when utilizing a first model. The method can ascertain a second frequency at which the individual answer category appears in the individual slot on the query results page when utilizing a second model. The method can calibrate the second model so that the second frequency approaches the first frequency.
Type: Grant
Filed: December 3, 2010
Date of Patent: October 8, 2013
Assignee: Microsoft Corporation
Inventors: Tapas Kanungo, Kuansan Wang, Ran Gilad-Bachrach, Kieran McDonald, Kumaresh Pattabiraman, Christopher Meyers, Ashok Ponnuswami, Luke Simon
-
Patent number: 8515684
Abstract: A vectorization process is employed in which chemical identifier strings are converted into respective vectors. These vectors may then be searched to identify molecules that are identical or similar to each other. The dimensions of the vector space can be defined by sequences of symbols that make up the chemical identifier strings. The International Chemical Identifier (InChI) string defined by the International Union of Pure and Applied Chemistry (IUPAC) is particularly well suited for these methods.
Type: Grant
Filed: December 21, 2011
Date of Patent: August 20, 2013
Assignee: International Business Machines Corporation
Inventors: Stephen K. Boyer, Gregory Breyta, Tapas Kanungo, Jeffrey Thomas Kreulen, James J. Rhodes
-
Patent number: 8370337
Abstract: Methods and computer-storage media having computer-executable instructions embodied thereon that facilitate generating a machine-learned model for ranking search results using click-based data are provided. Data is referenced from user queries, which may include search results generated by general search engines and vertical search engines. A training set is generated from the search results and click-based judgments are associated with the search results in the training set. Based on click-based judgments, identifiable features are determined from the search results in a training set. Based on determining identifiable features in a training set, a rule set is generated for ranking subsequent search results.
Type: Grant
Filed: April 19, 2010
Date of Patent: February 5, 2013
Assignee: Microsoft Corporation
Inventors: Tapas Kanungo, Kumaresh Pattabiraman, Nitin Agrawal, Kieran Richard McDonald, Christopher Avery Meyers, Nipoon Malhotra
-
Publication number: 20120143794
Abstract: This patent application pertains to answer model comparison. One implementation can determine a first frequency at which an individual answer category appears in an individual slot on a query results page when utilizing a first model. The method can ascertain a second frequency at which the individual answer category appears in the individual slot on the query results page when utilizing a second model. The method can calibrate the second model so that the second frequency approaches the first frequency.
Type: Application
Filed: December 3, 2010
Publication date: June 7, 2012
Applicant: Microsoft Corporation
Inventors: Tapas Kanungo, Kuansan Wang, Ran Gilad-Bachrach, Kieran McDonald, Kumaresh Pattabiraman, Christopher Meyers, Ashok Ponnuswami, Luke Simon
-
Publication number: 20120109972
Abstract: A vectorization process is employed in which chemical identifier strings are converted into respective vectors. These vectors may then be searched to identify molecules that are identical or similar to each other. The dimensions of the vector space can be defined by sequences of symbols that make up the chemical identifier strings. The International Chemical Identifier (InChI) string defined by the International Union of Pure and Applied Chemistry (IUPAC) is particularly well suited for these methods.
Type: Application
Filed: December 21, 2011
Publication date: May 3, 2012
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Stephen Kane Boyer, GREGORY BREYTA, TAPAS KANUNGO, JEFFREY THOMAS KREULEN, JAMES J. RHODES
-
Patent number: 8140267
Abstract: A vectorization process is employed in which chemical identifier strings are converted into respective vectors. These vectors may then be searched to identify molecules that are identical or similar to each other. The dimensions of the vector space can be defined by sequences of symbols that make up the chemical identifier strings. The International Chemical Identifier (InChI) string defined by the International Union of Pure and Applied Chemistry (IUPAC) is particularly well suited for these methods.
Type: Grant
Filed: June 30, 2006
Date of Patent: March 20, 2012
Assignee: International Business Machines Corporation
Inventors: Stephen Kane Boyer, Gregory Breyta, Tapas Kanungo, Jeffrey Thomas Kreulen, James J. Rhodes
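As a rough illustration of the vectorization idea in this abstract, the sketch below turns an identifier string into a sparse vector whose dimensions are short symbol sequences, then ranks candidates by cosine similarity. The choice of character bigrams, cosine similarity, and the helper names `vectorize`, `cosine_similarity`, and `most_similar` are assumptions for illustration only; the abstract does not specify the sequence length or the similarity measure. The InChI strings are ordinary illustrative examples, not data from the patent.

```python
import math
from collections import Counter


def vectorize(identifier, n=2):
    """Map a string to a sparse vector of counts of its length-n symbol sequences."""
    return Counter(identifier[i:i + n] for i in range(len(identifier) - n + 1))


def cosine_similarity(u, v):
    """Cosine of the angle between two sparse count vectors (0.0 if either is empty)."""
    dot = sum(count * v[key] for key, count in u.items())
    norm_u = math.sqrt(sum(c * c for c in u.values()))
    norm_v = math.sqrt(sum(c * c for c in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def most_similar(query, candidates):
    """Return the candidate names ordered from most to least similar to the query."""
    q = vectorize(query)
    return sorted(candidates,
                  key=lambda name: cosine_similarity(q, vectorize(candidates[name])),
                  reverse=True)


# Illustrative identifiers (assumed examples, not taken from the patent).
MOLECULES = {
    "ethanol": "InChI=1S/C2H6O/c1-2-3/h3H,2H2,1H3",
    "methanol": "InChI=1S/CH4O/c1-2/h2H,1H3",
    "benzene": "InChI=1S/C6H6/c1-2-4-6-5-3-1/h1-6H",
}

ranking = most_similar(MOLECULES["ethanol"], MOLECULES)
```

An identical string always scores 1.0 against itself, which corresponds to the "identical" case in the abstract; lower scores give a rough notion of "similar".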
-
Publication number: 20110258149
Abstract: Methods and computer-storage media having computer-executable instructions embodied thereon that facilitate generating a machine-learned model for ranking search results using click-based data are provided. Data is referenced from user queries, which may include search results generated by general search engines and vertical search engines. A training set is generated from the search results and click-based judgments are associated with the search results in the training set. Based on click-based judgments, identifiable features are determined from the search results in a training set. Based on determining identifiable features in a training set, a rule set is generated for ranking subsequent search results.
Type: Application
Filed: April 19, 2010
Publication date: October 20, 2011
Applicant: MICROSOFT CORPORATION
Inventors: TAPAS KANUNGO, KUMARESH PATTABIRAMAN, NITIN AGRAWAL, KIERAN RICHARD McDONALD, CHRISTOPHER AVERY MEYERS, NIPOON MALHOTRA
-
Patent number: 8005845
Abstract: Disclosed are apparatus and methods for ranking lines of text. In one embodiment, an intent of a query is ascertained. A relevance of each one of a plurality of lines of text of a document is determined based upon the intent of the query, content of the query, and content of each of the plurality of lines of text. The plurality of lines of text may then be ranked according to the determined relevance of each of the plurality of lines of text.
Type: Grant
Filed: May 20, 2008
Date of Patent: August 23, 2011
Assignee: Yahoo! Inc.
Inventors: Tapas Kanungo, Donald Metzler
-
Publication number: 20100306224
Abstract: Determine a plurality of first dwell durations for a plurality of first web pages, each first dwell duration indicating a time period a user has spent with a first web page. Access a plurality of first quality ratings for the first web pages, each first quality rating indicating a quality of a first web page as a part of a search result generated for a first search query. Access a predefined quality rating threshold. Correlate the first dwell durations and the first quality ratings. And, determine a dwell duration threshold, such that a second user spending a second dwell duration greater than or equal to the dwell duration threshold with a second web page indicates that the second user is satisfied with the second web page identified in a second search result generated by a search engine in response to a second search query requested by the second user.
Type: Application
Filed: June 2, 2009
Publication date: December 2, 2010
Applicant: Yahoo! Inc.
Inventors: David Ciemiewicz, Tapas Kanungo, Arun Lakshminarayanan, Maria Stone
-
Publication number: 20100153365
Abstract: Disclosed herein are systems and methods for identifying phrases using break points. Break points can be identified using stop words identified in content. Identified phrases can be used to generate a summary of the content.
Type: Application
Filed: December 15, 2008
Publication date: June 17, 2010
Inventors: Hadar Shemtov, Tapas Kanungo, Rajhans Samdani, Donald Metzler
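A minimal sketch of phrase identification with stop-word break points might look as follows. The stop-word list, the decision to treat punctuation as a break point as well, and the function name `split_into_phrases` are assumptions for illustration; the abstract does not specify any of them.

```python
import re

# Hypothetical stop-word list; the abstract does not specify one.
STOP_WORDS = {"a", "an", "the", "of", "in", "on", "and", "or", "to",
              "is", "are", "for", "with", "from"}


def split_into_phrases(text, stop_words=STOP_WORDS):
    """Split text into candidate phrases, breaking at stop words and punctuation."""
    phrases = []
    current = []
    # Tokens are either runs of letters/digits or single punctuation marks.
    for token in re.findall(r"[A-Za-z0-9]+|[^A-Za-z0-9\s]", text):
        if token.isalnum() and token.lower() not in stop_words:
            current.append(token)          # still inside a phrase
        elif current:
            phrases.append(" ".join(current))  # break point closes the phrase
            current = []
    if current:
        phrases.append(" ".join(current))
    return phrases


phrases = split_into_phrases(
    "The ranking of search results is learned from click data.")
# phrases == ["ranking", "search results", "learned", "click data"]
```

The resulting phrases could then be scored and selected for a summary, a step the abstract mentions but does not detail.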
-
Patent number: 7720772
Abstract: A document (or multiple documents) is analyzed to identify entities of interest within that document. This is accomplished by constructing n-gram or bi-gram models that correspond to different kinds of text entities, such as chemistry-related words and generic English words. The models can be constructed from training text selected to reflect a particular kind of text entity. The document is tokenized, and the tokens are run against the models to determine, for each token, which kind of text entity is most likely to be associated with that token. The entities of interest in the document can then be annotated accordingly.
Type: Grant
Filed: December 15, 2008
Date of Patent: May 18, 2010
Assignee: International Business Machines Corporation
Inventors: Tapas Kanungo, James Rhodes
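The tokenize-and-score procedure in this abstract can be sketched with character bigram models. Everything specific below is an assumption for illustration: the character-level granularity, add-one smoothing, whitespace tokenization, the class name `CharBigramModel`, and the tiny training word lists, none of which come from the patent.

```python
import math
from collections import Counter


class CharBigramModel:
    """Character bigram model with add-one smoothing, trained on a word list."""

    def __init__(self, training_words):
        self.bigrams = Counter()
        self.contexts = Counter()
        self.alphabet = set()
        for word in training_words:
            padded = "^" + word.lower() + "$"   # mark word start and end
            self.alphabet.update(padded)
            for a, b in zip(padded, padded[1:]):
                self.bigrams[(a, b)] += 1
                self.contexts[a] += 1

    def log_prob(self, word, vocab_size):
        """Smoothed log-probability of the word's character sequence."""
        padded = "^" + word.lower() + "$"
        return sum(
            math.log((self.bigrams[(a, b)] + 1) / (self.contexts[a] + vocab_size))
            for a, b in zip(padded, padded[1:]))


def classify_tokens(text, models):
    """Label each whitespace token with the model under which it is most likely."""
    vocab = set().union(*(m.alphabet for m in models.values()))
    vocab_size = len(vocab) + 1  # one extra slot for unseen characters
    return [(token,
             max(models, key=lambda name: models[name].log_prob(token, vocab_size)))
            for token in text.split()]


# Tiny hypothetical training sets, for illustration only.
models = {
    "chemical": CharBigramModel(["methanol", "ethanol", "propanol", "butanol"]),
    "english": CharBigramModel(["the", "and", "with", "was", "mixed"]),
}
labels = classify_tokens("ethanol with pentanol", models)
```

With these toy training sets, "pentanol" is labeled chemical even though it never appears in the training data, because its character sequences resemble the chemical examples. Realistic use would need far larger training text per entity kind.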
-
Publication number: 20100057710
Abstract: The subject matter disclosed herein relates to generating an abstract of an electronic document based at least in part on a utility metric associated with a group phrases as well as on a ranked order of sentences.
Type: Application
Filed: August 28, 2008
Publication date: March 4, 2010
Applicant: Yahoo! Inc
Inventors: Tapas Kanungo, Jan Pedersen, Tamas Sarlos
-
Patent number: 7640488
Abstract: A focused random walk system produces samples of on-topic pages from a collection of hyper-linked pages such as Web pages. The focused random walk system utilizes a focused random walk to produce a focused sample, which is a random sample of Web pages focused on a topic. The focused random walk system uniformly samples pages iteratively, where each iteration follows a random link from a union of the in-links and out-links of a page. The system then classifies this randomly selected link to determine whether the page is on-topic. The random walk sampling process could comprise a hard-focus method that selects only on-topic pages at each step of the focused random walk, or a soft-focus method that allows limited divergence to off-topic pages.
Type: Grant
Filed: December 4, 2004
Date of Patent: December 29, 2009
Assignee: International Business Machines Corporation
Inventors: Ziv Bar-Yossef, Tapas Kanungo, Robert Krauthgamer
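The hard-focus and soft-focus variants described in this abstract can be sketched on a small in-memory link graph. This is a simplified illustration under stated assumptions: the graph is a plain dictionary, the classifier is a caller-supplied predicate, "limited divergence" is modeled as a cap on consecutive off-topic steps (`max_off_topic`), and no correction toward uniform sampling is attempted. The function name and parameters are hypothetical.

```python
import random


def focused_random_walk(out_links, is_on_topic, start, steps,
                        max_off_topic=0, seed=None):
    """Walk the link graph and collect the on-topic pages visited.

    max_off_topic=0 gives a hard-focus walk (never steps onto an off-topic
    page); a positive value gives a soft-focus walk that tolerates that many
    consecutive off-topic steps.
    """
    rng = random.Random(seed)
    # Derive in-links so each step can choose from in-links and out-links.
    in_links = {}
    for page, targets in out_links.items():
        for target in targets:
            in_links.setdefault(target, set()).add(page)

    current, off_topic_run, sample = start, 0, []
    for _ in range(steps):
        neighbors = sorted(set(out_links.get(current, ()))
                           | in_links.get(current, set()))
        if not neighbors:
            break
        candidate = rng.choice(neighbors)   # follow a random link
        if is_on_topic(candidate):          # classify the selected page
            current, off_topic_run = candidate, 0
            sample.append(current)
        elif off_topic_run < max_off_topic:
            current = candidate             # soft focus: limited divergence
            off_topic_run += 1
        # otherwise stay on the current page and try another link next step
    return sample


# Hypothetical toy graph; page names are illustrative only.
GRAPH = {
    "chem/a": ["chem/b", "news/x"],
    "chem/b": ["chem/a", "chem/c"],
    "chem/c": ["chem/a"],
    "news/x": ["news/y"],
    "news/y": [],
}
on_topic = lambda page: page.startswith("chem/")
hard_sample = focused_random_walk(GRAPH, on_topic, "chem/a", 50, seed=1)
soft_sample = focused_random_walk(GRAPH, on_topic, "chem/a", 50,
                                  max_off_topic=1, seed=1)
```

In both modes only on-topic pages are recorded in the sample; the soft-focus walk differs in that it may pass through off-topic pages on its way to other on-topic regions.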
-
Publication number: 20090292683
Abstract: Disclosed are apparatus and methods for ranking lines of text. In one embodiment, an intent of a query is ascertained. A relevance of each one of a plurality of lines of text of a document is determined based upon the intent of the query, content of the query, and content of each of the plurality of lines of text. The plurality of lines of text may then be ranked according to the determined relevance of each of the plurality of lines of text.
Type: Application
Filed: May 20, 2008
Publication date: November 26, 2009
Applicant: Yahoo! Inc.
Inventors: Tapas Kanungo, Donald Metzler
-
Publication number: 20090187516
Abstract: Methods and systems are provided herein for establishing and/or using an evaluation model that is adapted to determine a model judgment value based, at least in part, on measured summary feature values associated with a search result summary. The evaluation model may be established through a learning process based, at least in part, on human judgment values associated with a set of search result summaries.
Type: Application
Filed: January 18, 2008
Publication date: July 23, 2009
Inventors: Tapas Kanungo, David M. Orr