Patents by Inventor Surajit Chaudhuri

Surajit Chaudhuri has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Associating metadata with images in a personal image collection

Patent number: 9405771

Abstract: Various technologies pertaining to assigning metadata to images in a personal image collection of a user based upon images and associated metadata assigned thereto that are accessible to the user by way of a social network application are described. An account of the user in a social network application is accessed to retrieve images and metadata that is accessible to the user. A face recognition algorithm is trained based upon the retrieved images and metadata, and the trained face recognition algorithm is executed over the personal image collection of the user, where the personal image collection of the user is external to the social network application.

Type: Grant

Filed: March 14, 2013

Date of Patent: August 2, 2016

Assignee: Microsoft Technology Licensing, LLC

Inventors: Shobana Balakrishnan, Surajit Chaudhuri
Integrated fuzzy joins in database management systems

Patent number: 9317544

Abstract: A fuzzy joins system that is integrated in a database system generates fuzzy joins between records from two datasets. The fuzzy joins system includes a tokenizer to generate tokens for data records and a transformer to find transforms for the tokens. The fuzzy joins system invokes a signature generator, running within a runtime layer of the database system, to generate signatures for data records based on the tokens and their transforms. Subsequently, an equi-join operation joins the records from the two datasets with at least one equal signature. A similarity calculator, running within a runtime layer of the database system, computes a similarity measure using the token information of the joined records. If the similarity measure for any two records is above a threshold, the fuzzy joins system generates a fuzzy join between such two records.

Type: Grant

Filed: October 5, 2011

Date of Patent: April 19, 2016

Assignee: Microsoft Corporation

Inventors: Kris Ganjam, Vivek Ravindranath Narasayya, Raghav Kaushik, Arvind Arasu, Surajit Chaudhuri
Performance service level agreements in multi-tenant database systems

Patent number: 9311376

Abstract: Various technologies described herein pertain to evaluating service provider compliance with terms of a performance service level agreement (SLA) for a tenant in a multi-tenant database system. The terms of the performance SLA can set a performance criterion as though a level of a resource of hardware of the multi-tenant database system is dedicated to the tenant. An actual performance metric of the resource can be tracked for a workload of the tenant. Further, a baseline performance metric of the resource can be determined for the workload of the tenant. The baseline performance metric can be based on a simulation as though the level of the resource as set in the performance SLA is dedicated to the workload of the tenant. Moreover, the actual performance metric can be compared with the baseline performance metric to evaluate compliance with the performance SLA.

Type: Grant

Filed: May 2, 2012

Date of Patent: April 12, 2016

Assignee: Microsoft Technology Licensing, LLC

Inventors: Vivek Ravindranath Narasayya, Feng Li, Surajit Chaudhuri
Tagging entities with descriptive phrases

Patent number: 9298825

Abstract: A plurality of description phrases associated with a first domain may be determined, based on an analysis of a first plurality of documents to determine co-occurrences of the description phrases with one or more name labels associated with the first domain. An entity associated with the first domain may be obtained. An analysis of a second plurality of documents may be initiated to identify co-occurrences of mentions of the obtained entity and one or more of the plurality of description phrases, and contexts associated with each of the co-occurrences of the mentions and description phrases, in each one of the second plurality of documents. A description tag association between the obtained entity and one of the description phrases may be determined, based on an analysis of the identified contexts.

Type: Grant

Filed: November 17, 2011

Date of Patent: March 29, 2016

Assignee: Microsoft Technology Licensing, LLC

Inventors: Kaushik Chakrabarti, Surajit Chaudhuri, Tao Cheng
RANKING TABLES FOR KEYWORD SEARCH

Publication number: 20160012052

Abstract: The present invention extends to methods, systems, and computer program products for ranking tables for keyword search. Aspects of the invention include generating lists of candidate tables for inclusion in a search query response, computing table hit matrices, retrieving content from fields of candidate tables having keyword hits, generating ranking features of tables, and computing ranking scores for tables. Aspects of the invention can be used to match keywords against column names, to match keywords against values in subject and non-subject columns, and to match keywords against table descriptions like page titles, table captions, cell values, nearest headings and surrounding text. Which keywords are matched against which fields can depend on the table and/to the query (referred to as “late binding”).

Type: Application

Filed: July 8, 2014

Publication date: January 14, 2016

Inventors: Kanstantsyn Zoryn, Zhimin Chen, Kaushik Chakrabarti, James P. Finnigan, Vivek R. Narasayya, Surajit Chaudhuri, Kris Ganjam
ANNOTATING STRUCTURED DATA FOR SEARCH

Publication number: 20160012091

Abstract: The present invention extends to methods, systems, and computer program products for annotating structured data for search. Aspects of the invention include associating structured data, such as, for example, tables, with additional content to improve indexing of the structured data for search and/or provide improved search results for structured data. Web pages can include tables as well as other content. The other content in a web page, such as, for example, content outside the <table> and </table> tags of a web table, can be useful in supporting searches for web tables. Content in one web page can also be useful in supporting searches for a table in another web page.

Type: Application

Filed: July 8, 2014

Publication date: January 14, 2016

Inventors: Kanstantsyn Zoryn, Zhimin Chen, Kaushik Chakrabarti, James P. Finnigan, Vivek R. Narasayya, Surajit Chaudhuri, Kris Ganjam
COMPUTING FEATURES OF STRUCTURED DATA

Publication number: 20160012051

Abstract: The present invention extends to methods, systems, and computer program products for computing features of structured data. Aspects of the invention include computing features of table components (e.g., of rows, columns, cells, etc.). Computed features can be used for ranking the table components. When aggregated, features for different components of a table can be used for ranking the table (e.g., a web table).

Type: Application

Filed: July 8, 2014

Publication date: January 14, 2016

Inventors: Kanstantsyn Zoryn, Zhimin Chen, Kaushik Chakrabarti, James P. Finnigan, Vivek R. Narasayya, Surajit Chaudhuri, Kris Ganjam
UNDERSTANDING TABLES FOR SEARCH

Publication number: 20150379057

Abstract: The present invention extends to methods, systems, and computer program products for understanding tables for search. Aspects of the invention include identifying a subject column for a table, detecting a column header using other tables, and detecting a column header using a knowledge base. Implementations can be utilized in a structured data search system (SDSS) that indexes structured information, such as, tables in a relational database or html tables extracted from web pages. The SDSS allows users to search over the structured information (tables) using different mechanisms including keyword search and data finding data.

Type: Application

Filed: October 2, 2014

Publication date: December 31, 2015

Inventors: Zhongyuan Wang, Kanstantsyn Zoryn, Zhimin Chen, Kaushik Chakrabarti, James P. Finnigan, Vivek R. Narasayya, Surajit Chaudhuri, Kris Ganjam
FINDING PATTERNS IN A KNOWLEDGE BASE TO COMPOSE TABLE ANSWERS

Publication number: 20150310073

Abstract: In general, the knowledge base table composer embodiments described herein provide table answers to keyword queries against one or more knowledge bases. Highly relevant patterns in a knowledge base are found for user-given keyword queries. These patterns are used to compose table answers. To this end, a knowledge base is modeled as a directed graph called a knowledge graph, where nodes represent entities in the knowledge base and edges represent the relationships among them. Each node/edge is labeled with a type and text. A pattern that is an aggregation of subtrees which contain all keywords in the texts and have the same structure and types on node/edges is sought. Patterns that are relevant to a query for a class can be found using a set of scoring functions. Furthermore, path-based indexes and various query-processing procedures can be employed to speed up processing.

Type: Application

Filed: April 29, 2014

Publication date: October 29, 2015

Applicant: MICROSOFT CORPORATION

Inventors: Kaushik Chakrabarti, Surajit Chaudhuri, Bolin Ding, Mohan Yang
Entity augmentation service from latent relational data

Patent number: 9171081

Abstract: The subject disclosure is directed towards providing data for augmenting an entity-attribute-related task. Pre-processing is preformed on entity-attribute tables extracted from the web, e.g., to provide indexes that are accessible to find data that completes augmentation tasks. The indexes are based on both direct mappings and indirect mappings between tables. Example augmentation tasks include queries for augmented data based on an attribute name or examples, or finding synonyms for augmentation. An online query is efficiently processed by accessing the indexes to return augmented data related to the task.

Type: Grant

Filed: March 6, 2012

Date of Patent: October 27, 2015

Assignee: Microsoft Technology Licensing, LLC

Inventors: Kris K. Ganjam, Kaushik Chakrabarti, Mohamed A. Yakout, Surajit Chaudhuri
Finding Data in Connected Corpuses Using Examples

Publication number: 20150193533

Abstract: In one embodiment, datasets are stored in a catalog. The datasets are enriched by establishing relationships among the domains in different datasets. A user searches for relevant datasets by providing examples of the domains of interest. The system identifies datasets corresponding to the user-provided examples. The system them identifies connected subsets of the datasets that are directly linked or indirectly linked through other domains. The user provides known relationship examples to filter the connected subsets and to identify the connected subsets that are most relevant to the user's query. The selected connected subsets may be further analyzed by business intelligence/analytics to create pivot tables or to process the data.

Type: Application

Filed: March 16, 2015

Publication date: July 9, 2015

Applicant: Microsoft Technology Licensing, LLC

Inventors: John C. Platt, Surajit Chaudhuri, Lev Novik, Henricus Johannes Maria Meijer, Efim Hudis, Kunal Mukerjee, Christopher Alan Hays
PROGRESSIVE SPATIAL SEARCHING USING AUGMENTED STRUCTURES

Publication number: 20150088904

Abstract: A location associated with a user of a computing device and a prefix portion of an input string may be received as one or more successive characters of the input string are provided by the user via the computing device. A list of suggested items may be obtained based on a function of respective recommendation indicators and proximities of the items to the location in response to receiving the prefix portion, and based on partially traversing a character string search structure having a plurality of non-terminal nodes augmented with bound indicators associated with spatial regions. The list of suggested items and descriptive information associated with each suggested item may be returned to the user, in response to receiving the prefix portion, for rendering an image illustrating indicators associated with the list in a manner relative to the location, as the user provides each successive character of the input string.

Type: Application

Filed: November 30, 2014

Publication date: March 26, 2015

Inventors: Kaushik Chakrabarti, Surajit Chaudhuri, Senjuti Basu Roy
Finding data in connected corpuses using examples

Patent number: 8983954

Abstract: In one embodiment, datasets are stored in a catalog. The datasets are enriched by establishing relationships among the domains in different datasets. A user searches for relevant datasets by providing examples of the domains of interest. The system identifies datasets corresponding to the user-provided examples. The system them identifies connected subsets of the datasets that are directly linked or indirectly linked through other domains. The user provides known relationship examples to filter the connected subsets and to identify the connected subsets that are most relevant to the user's query. The selected connected subsets may be further analyzed by business intelligence/analytics to create pivot tables or to process the data.

Type: Grant

Filed: April 10, 2012

Date of Patent: March 17, 2015

Assignee: Microsoft Technology Licensing, LLC

Inventors: John C. Platt, Surajit Chaudhuri, Lev Novik, Henricus Johannes Maria Meijer, Efim Hudis, Kunal Mukerjee, Christopher Alan Hays
RETRIEVAL OF ATTRIBUTE VALUES BASED UPON IDENTIFIED ENTITIES

Publication number: 20150019540

Abstract: Various technologies that facilitate performance of a data finding data (DFD) search are described herein. A user specifies entities, for example, by entering the entities into a query field, selecting the entities from a computer-executable application, or the like. The user further specifies an attribute of the entities that is of interest. A query is constructed based upon the entities and the attribute, and a search for tables is performed based upon the entities and the attribute. Values of the attribute for the selected entities are identified in a table, and the values of the attribute are returned.

Type: Application

Filed: May 21, 2014

Publication date: January 15, 2015

Applicant: Microsoft Corporation

Inventors: Kris Ganjam, Zhimin Chen, Kaushik Chakrabarti, Surajit Chaudhuri, Vivek Narasayya, James Finnigan, Kanstantsyn Zoryn
Progressive spatial searching using augmented structures

Patent number: 8930391

Abstract: A location associated with a user of a computing device and a prefix portion of an input string may be received as one or more successive characters of the input string are provided by the user via the computing device. A list of suggested items may be obtained based on a function of respective recommendation indicators and proximities of the items to the location in response to receiving the prefix portion, and based on partially traversing a character string search structure having a plurality of non-terminal nodes augmented with bound indicators associated with spatial regions. The list of suggested items and descriptive information associated with each suggested item may be returned to the user, in response to receiving the prefix portion, for rendering an image illustrating indicators associated with the list in a manner relative to the location, as the user provides each successive character of the input string.

Type: Grant

Filed: December 29, 2010

Date of Patent: January 6, 2015

Assignee: Microsoft Corporation

Inventors: Kaushik Chakrabarti, Surajit Chaudhuri, Senjuti Basu Roy
SCALABLE LOOKUP-DRIVEN ENTITY EXTRACTION FROM INDEXED DOCUMENT COLLECTIONS

Publication number: 20140351274

Abstract: A set of documents is filtered for entity extraction. A list of entity strings is received. A set of token sets that covers the entity strings in the list is determined. An inverted index generated on a first set of documents is queried using the set of token sets to determine a set of document identifiers for a subset of the documents in the first set. A second set of documents identified by the set of document identifiers is retrieved from the first set of documents. The second set of documents is filtered to include one or more documents of the second set that each includes a match with at least one entity string of the list of entity strings. Entity recognition may be performed on the filtered second set of documents.

Type: Application

Filed: June 3, 2014

Publication date: November 27, 2014

Applicant: Microsoft Corporation

Inventors: Sanjay Agrawal, Kaushik Chakrabarti, Surajit Chaudhuri, Venkatesh Ganti
Search guided by location and context

Patent number: 8874592

Abstract: The subject disclosure pertains to web searches and more particularly toward influencing resultant content to increase relevancy. The resultant content can be influenced by reconfiguring a query and/or filtering results based on user location and/or context information (e.g., user characteristics/profile, prior interaction/usage temporal, current events, and third party state/context . . . ). Furthermore, the disclosure provides for query execution on at least a subset of designated web content, for example as specified by a user. Still further yet, a localized marketing system is disclosed that provides discount offers to users that match merchant criteria including proximity. A system for actively probing populations of users with different parameters and monitoring responses can be employed to collect data for identifying the best discounts and deadlines to offer to users to achieve desired results.

Type: Grant

Filed: June 28, 2006

Date of Patent: October 28, 2014

Assignee: Microsoft Corporation

Inventors: Gary W. Flake, William H. Gates, III, Eric J. Horvitz, Joshua T. Goodman, Surajit Chaudhuri, Trenholme J. Griffin, Oliver Hurst-Hiller, Kenneth A. Moss
ASSOCIATING METADATA WITH IMAGES IN A PERSONAL IMAGE COLLECTION

Publication number: 20140270407

Abstract: Various technologies pertaining to assigning metadata to images in a personal image collection of a user based upon images and associated metadata assigned thereto that are accessible to the user by way of a social network application are described. An account of the user in a social network application is accessed to retrieve images and metadata that is accessible to the user. A face recognition algorithm is trained based upon the retrieved images and metadata, and the trained face recognition algorithm is executed over the personal image collection of the user, where the personal image collection of the user is external to the social network application.

Type: Application

Filed: March 14, 2013

Publication date: September 18, 2014

Applicant: MICROSOFT CORPORATION

Inventors: Shobana Balakrishnan, Surajit Chaudhuri
Isolating Resources and Performance in a Database Management System

Publication number: 20140207740

Abstract: Techniques for tenant performance isolation in a multiple-tenant database management system are described. These techniques may include providing a reservation of server resources. The server resources reservation may include a reservation of a central processing unit (CPU), a reservation of Input/Output throughput, and/or a reservation of buffer pool memory or working memory. The techniques may also include a metering mechanism that determines whether the resource reservation is satisfied. The metering mechanism may be independent of an actual resource allocation mechanism associated with the server resource reservation.

Type: Application

Filed: January 23, 2013

Publication date: July 24, 2014

Applicant: MICROSOFT CORPORATION

Inventors: Vivek R. Narasayya, Sudipto Das, Manoj A. Syamala, Hyunjung Park, Surajit Chaudhuri, Badrish Chandramouli, Feng Li
Scalable lookup-driven entity extraction from indexed document collections

Patent number: 8782061

Abstract: A set of documents is filtered for entity extraction. A list of entity strings is received. A set of token sets that covers the entity strings in the list is determined. An inverted index generated on a first set of documents is queried using the set of token sets to determine a set of document identifiers for a subset of the documents in the first set. A second set of documents identified by the set of document identifiers is retrieved from the first set of documents. The second set of documents is filtered to include one or more documents of the second set that each includes a match with at least one entity string of the list of entity strings. Entity recognition may be performed on the filtered second set of documents.

Type: Grant

Filed: June 24, 2008

Date of Patent: July 15, 2014

Assignee: Microsoft Corporation

Inventors: Sanjay Agrawal, Kaushik Chakrabarti, Surajit Chaudhuri, Venkatesh Ganti

prev … 2 3 4 5 6 7 8 9 10 … next