Patents by Inventor Tapas Kanungo

Tapas Kanungo has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 9524335
    Abstract: Systems, methods, and computer-readable storage media are provided for conflating entities using a persistent entity index. Information (including attributes) pertaining to a plurality of entities is received. The received information is either matched with one or more existing entities in the persistent entity index or, if no match is found, selected for addition to the persistent entity index. The persistent entity index includes entity-attribute pairs associated therewith. Attributes associated with matching entities for which information is received are aggregated and/or reconciled with the entity-attribute pairs associated with existing entities included in the persistent entity index. The persistent entity index may be incrementally updated at predetermined time intervals to insure the accuracy and freshness of the information associated therewith.
    Type: Grant
    Filed: June 18, 2013
    Date of Patent: December 20, 2016
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Tapas Kanungo, Ashok Ponnuswami, Richard Jungiang Qian
  • Patent number: 9495349
    Abstract: A system and method for processing a document to generate a set of related documents. A system is provided that includes a textual analytics system that analyzes unstructured data contained in a source document and extracts a set of structured information about the source document; and a compare system that identifies a set of related documents by comparing the set of structured information with metadata indexed from a set of publications.
    Type: Grant
    Filed: November 17, 2005
    Date of Patent: November 15, 2016
    Assignee: International Business Machines Corporation
    Inventors: Robert L. Angell, Stephen K. Boyer, James W. Cooper, Richard A. Hennessy, Tapas Kanungo, Jeffrey T. Kreulen, David C. Martin, James J. Rhodes, W. Scott Spangler, Herschel J. R. Weintraub
  • Patent number: 8984398
    Abstract: The subject matter disclosed herein relates to generating an abstract of an electronic document based at least in part on a utility metric associated with a group phrases as well as on a ranked order of sentences.
    Type: Grant
    Filed: August 28, 2008
    Date of Patent: March 17, 2015
    Assignee: Yahoo! Inc.
    Inventors: Tapas Kanungo, Jan Pedersen, Tamas Sarlos
  • Publication number: 20140372441
    Abstract: Systems, methods, and computer-readable storage media are provided for conflating entities using a persistent entity index. Information (including attributes) pertaining to a plurality of entities is received. The received information is either matched with one or more existing entities in the persistent entity index or, if no match is found, selected for addition to the persistent entity index. The persistent entity index includes entity-attribute pairs associated therewith. Attributes associated with matching entities for which information is received are aggregated and/or reconciled with the entity-attribute pairs associated with existing entities included in the persistent entity index. The persistent entity index may be incrementally updated at predetermined time intervals to insure the accuracy and freshness of the information associated therewith.
    Type: Application
    Filed: June 18, 2013
    Publication date: December 18, 2014
    Inventors: TAPAS KANUNGO, ASHOK PONNUSWAMI, RICHARD JUNGIANG QIAN
  • Publication number: 20140067816
    Abstract: In an effort to enhance computer user engagement with a search results page, systems and methods are presented which are configured to identify an entity as being the subject matter of a user's search query. If the entity is a known entity, i.e., entity information is stored in an entity store for the identified entity, a subset of entity attributes are identified and a representative entity attribute question is obtained for each of the attributes in the subset of entity attributes. The representative entity attribute questions are identified according to the probability that they are formed linguistically correct. The representative entity attribute questions are included in a search results page that is generated in response to the user's search query.
    Type: Application
    Filed: August 29, 2012
    Publication date: March 6, 2014
    Applicant: MICROSOFT CORPORATION
    Inventors: Tapas Kanungo, Ashok Ponnuswami
  • Patent number: 8554700
    Abstract: This patent application pertains to answer model comparison. One implementation can determine a first frequency at which an individual answer category appears in an individual slot on a query results page when utilizing a first model. The method can ascertain a second frequency at which the individual answer category appears in the individual slot on the query results page when utilizing a second model. The method can calibrate the second model so that the second frequency approaches the first frequency.
    Type: Grant
    Filed: December 3, 2010
    Date of Patent: October 8, 2013
    Assignee: Microsoft Corporation
    Inventors: Tapas Kanungo, Kuansan Wang, Ran Gilad-Bachrach, Kieran McDonald, Kumaresh Pattabiraman, Christopher Meyers, Ashok Ponnuswami, Luke Simon
  • Patent number: 8515684
    Abstract: A vectorization process is employed in which chemical identifier strings are converted into respective vectors. These vectors may then be searched to identify molecules that are identical or similar to each other. The dimensions of the vector space can be defined by sequences of symbols that make up the chemical identifier strings. The International Chemical Identifier (InChI) string defined by the International Union of Pure and Applied Chemistry (IUPAC) is particularly well suited for these methods.
    Type: Grant
    Filed: December 21, 2011
    Date of Patent: August 20, 2013
    Assignee: International Business Machines Corporation
    Inventors: Stephen K. Boyer, Gregory Breyta, Tapas Kanungo, Jeffrey Thomas Kreulen, James J. Rhodes
  • Patent number: 8370337
    Abstract: Methods and computer-storage media having computer-executable instructions embodied thereon that facilitate generating a machine-learned model for ranking search results using click-based data are provided. Data is referenced from user queries, which may include search results generated by general search engines and vertical search engines. A training set is generated from the search results and click-based judgments are associated with the search results in the training set. Based on click-based judgments, identifiable features are determined from the search results in a training set. Based on determining identifiable features in a training set, a rule set is generated for ranking subsequent search results.
    Type: Grant
    Filed: April 19, 2010
    Date of Patent: February 5, 2013
    Assignee: Microsoft Corporation
    Inventors: Tapas Kanungo, Kumaresh Pattabiraman, Nitin Agrawal, Kieran Richard McDonald, Christopher Avery Meyers, Nipoon Malhotra
  • Publication number: 20120143794
    Abstract: This patent application pertains to answer model comparison. One implementation can determine a first frequency at which an individual answer category appears in an individual slot on a query results page when utilizing a first model. The method can ascertain a second frequency at which the individual answer category appears in the individual slot on the query results page when utilizing a second model. The method can calibrate the second model so that the second frequency approaches the first frequency.
    Type: Application
    Filed: December 3, 2010
    Publication date: June 7, 2012
    Applicant: Microsoft Corporation
    Inventors: Tapas Kanungo, Kuansan Wang, Ran Gilad-Bachrach, Kieran McDonald, Kumaresh Pattabiraman, Christopher Meyers, Ashok Ponnuswami, Luke Simon
  • Publication number: 20120109972
    Abstract: A vectorization process is employed in which chemical identifier strings are converted into respective vectors. These vectors may then be searched to identify molecules that are identical or similar to each other. The dimensions of the vector space can be defined by sequences of symbols that make up the chemical identifier strings. The International Chemical Identifier (InChI) string defined by the International Union of Pure and Applied Chemistry (IUPAC) is particularly well suited for these methods.
    Type: Application
    Filed: December 21, 2011
    Publication date: May 3, 2012
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Stephen Kane Boyer, GREGORY BREYTA, TAPAS KANUNGO, JEFFREY THOMAS KREULEN, JAMES J. RHODES
  • Patent number: 8140267
    Abstract: A vectorization process is employed in which chemical identifier strings are converted into respective vectors. These vectors may then be searched to identify molecules that are identical or similar to each other. The dimensions of the vector space can be defined by sequences of symbols that make up the chemical identifier strings. The International Chemical Identifier (InChI) string defined by the International Union of Pure and Applied Chemistry (IUPAC) is particularly well suited for these methods.
    Type: Grant
    Filed: June 30, 2006
    Date of Patent: March 20, 2012
    Assignee: International Business Machines Corporation
    Inventors: Stephen Kane Boyer, Gregory Breyta, Tapas Kanungo, Jeffrey Thomas Kreulen, James J. Rhodes
  • Publication number: 20110258149
    Abstract: Methods and computer-storage media having computer-executable instructions embodied thereon that facilitate generating a machine-learned model for ranking search results using click-based data are provided. Data is referenced from user queries, which may include search results generated by general search engines and vertical search engines. A training set is generated from the search results and click-based judgments are associated with the search results in the training set. Based on click-based judgments, identifiable features are determined from the search results in a training set. Based on determining identifiable features in a training set, a rule set is generated for ranking subsequent search results.
    Type: Application
    Filed: April 19, 2010
    Publication date: October 20, 2011
    Applicant: MICROSOFT CORPORATION
    Inventors: TAPAS KANUNGO, KUMARESH PATTABIRAMAN, NITIN AGRAWAL, KIERAN RICHARD McDONALD, CHRISTOPHER AVERY MEYERS, NIPOON MALHOTRA
  • Patent number: 8005845
    Abstract: Disclosed are apparatus and methods for ranking lines of text. In one embodiment, an intent of a query is ascertained. A relevance of each one of a plurality of lines of text of a document is determined based upon the intent of the query, content of the query, and content of each of the plurality of lines of text. The plurality of lines of text may then be ranked according to the determined relevance of each of the plurality of lines of text.
    Type: Grant
    Filed: May 20, 2008
    Date of Patent: August 23, 2011
    Assignee: Yahoo! Inc.
    Inventors: Tapas Kanungo, Donald Metzler
  • Publication number: 20100306224
    Abstract: Determine a plurality of first dwell durations for a plurality of first web pages, each first dwell duration indicating a time period a user has spent with a first web page. Access a plurality of first quality ratings for the first web pages, each first quality rating indicating a quality of a first web page as a part of a search result generated for a first search query. Access a predefined quality rating threshold. Correlate the first dwell durations and the first quality ratings. And, determine a dwell duration threshold, such that a second user spending a second dwell duration greater than or equal to the dwell duration threshold with a second web page indicates that the second user is satisfied with the second web page identified in a second search result generated by a search engine in response to a second search query requested by the second user.
    Type: Application
    Filed: June 2, 2009
    Publication date: December 2, 2010
    Applicant: Yahoo! Inc.
    Inventors: David Ciemiewicz, Tapas Kanungo, Arun Lakshminarayanan, Maria Stone
  • Publication number: 20100153365
    Abstract: Disclosed herein are systems and methods for identifying phrases using break points. Break points can be identified using stop words identified in content. Identified phrases can be used to generate a summary of the content.
    Type: Application
    Filed: December 15, 2008
    Publication date: June 17, 2010
    Inventors: Hadar Shemtov, Tapas Kanungo, Rajhans Samdani, Donald Metzler
  • Patent number: 7720772
    Abstract: A document (or multiple documents) is analyzed to identify entities of interest within that document. This is accomplished by constructing n-gram or bi-gram models that correspond to different kinds of text entities, such as chemistry-related words and generic English words. The models can be constructed from training text selected to reflect a particular kind of text entity. The document is tokenized, and the tokens are run against the models to determine, for each token, which kind of text entity is most likely to be associated with that token. The entities of interest in the document can then be annotated accordingly.
    Type: Grant
    Filed: December 15, 2008
    Date of Patent: May 18, 2010
    Assignee: International Business Machines Corporation
    Inventors: Tapas Kanungo, James Rhodes
  • Publication number: 20100057710
    Abstract: The subject matter disclosed herein relates to generating an abstract of an electronic document based at least in part on a utility metric associated with a group phrases as well as on a ranked order of sentences.
    Type: Application
    Filed: August 28, 2008
    Publication date: March 4, 2010
    Applicant: Yahoo! Inc
    Inventors: Tapas Kanungo, Jan Pedersen, Tamas Sarlos
  • Patent number: 7640488
    Abstract: A focused random walk system produces samples of on-topic pages from a collection of hyper-linked pages such as Web pages. The focused random walk system utilizes a focused random walk to produce a focused sample, which is a random sample of Web pages focused on a topic. The focused random walk system uniformly samples pages iteratively, where each iteration follows a random link from a union of the in-links and out-links of a page. The system then classifies this randomly selected link to determine whether the page is on-topic. The random walk sampling process could comprise a hard-focus method that selects only on-topic pages at each step of the focused random walk, or a soft-focus method that allows limited divergence to off-topic pages.
    Type: Grant
    Filed: December 4, 2004
    Date of Patent: December 29, 2009
    Assignee: International Business Machines Corporation
    Inventors: Ziv Bar-Yossef, Tapas Kanungo, Robert Krauthgamer
  • Publication number: 20090292683
    Abstract: Disclosed are apparatus and methods for ranking lines of text. In one embodiment, an intent of a query is ascertained. A relevance of each one of a plurality of lines of text of a document is determined based upon the intent of the query, content of the query, and content of each of the plurality of lines of text. The plurality of lines of text may then be ranked according to the determined relevance of each of the plurality of lines of text.
    Type: Application
    Filed: May 20, 2008
    Publication date: November 26, 2009
    Applicant: Yahoo! Inc.
    Inventors: Tapas Kanungo, Donald Metzler
  • Publication number: 20090187516
    Abstract: Methods and systems are provided herein for establishing and/or using an evaluation model that is adapted to determine a model judgment value based, at least in part, on measured summary feature values associated with a search result summary. The evaluation model may be established through a learning process based, at least in part, on human judgment values associated with a set of search result summaries.
    Type: Application
    Filed: January 18, 2008
    Publication date: July 23, 2009
    Inventors: Tapas Kanungo, David M. Orr