Patents by Inventor Kaushik Shriraghav

Kaushik Shriraghav has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Systems and methods for providing distributed tree traversal using hardware-based processing

Patent number: 10572442

Abstract: A database management system (DBMS) run a host CPU and a hardware coprocessor accelerate traversal of a tree-type data structure by allocating reusable memory in cache to store portions of the tree-type data structure as the tree-type data structure is being requested by the host CPU. The hardware coprocessor manages the cached tree-type data structure in a manner that is transparent to the host CPU. A driver located at the host CPU or at a separate computing device can provide an interface between the host CPU and the hardware coprocessor, thus reducing communications between the host CPU and the hardware coprocessor.

Type: Grant

Filed: November 26, 2014

Date of Patent: February 25, 2020

Assignee: Microsoft Technology Licensing, LLC

Inventors: Kenneth H. Eguro, Zsolt Istvan, Arvind Arasu, Ravishankar Ramamurthy, Kaushik Shriraghav
Customized trusted computer for secure data processing and storage

Patent number: 9953184

Abstract: The techniques discussed herein facilitate the transmission, storage, and manipulation of data in an encrypted database management system (EDBMS). An untrusted machine is connected to a data store having encrypted records, a client machine that sends encrypted queries, and a trusted machine that receives and decrypts the encrypted records and encrypted queries. The trusted machine processes the query using semantically secure query operators to produce a query result. The trusted machine ensures the size of the query result conforms to an upper bound on the number or records in the query result and returns the query result.

Type: Grant

Filed: April 17, 2015

Date of Patent: April 24, 2018

Assignee: Microsoft Technology Licensing, LLC

Inventors: Arvind Arasu, Kenneth Hiroshi Eguro, Ravishankar Ramamurthy, Kaushik Shriraghav
Customized Trusted Computer For Secure Data Processing and Storage

Publication number: 20160306995

Abstract: The techniques discussed herein facilitate the transmission, storage, and manipulation of data in an encrypted database management system (EDBMS). An untrusted machine is connected to a data store having encrypted records, a client machine that sends encrypted queries, and a trusted machine that receives and decrypts the encrypted records and encrypted queries. The trusted machine processes the query using semantically secure query operators to produce a query result. The trusted machine ensures the size of the query result conforms to an upper bound on the number or records in the query result and returns the query result.

Type: Application

Filed: April 17, 2015

Publication date: October 20, 2016

Inventors: Arvind Arasu, Kenneth Hiroshi Eguro, Ravishankar Ramamurthy, Kaushik Shriraghav
Systems and Methods for Providing Distributed Tree Traversal Using Hardware-Based Processing

Publication number: 20160147779

Abstract: A database management system (DBMS) run a host CPU and a hardware coprocessor accelerate traversal of a tree-type data structure by allocating reusable memory in cache to store portions of the tree-type data structure as the tree-type data structure is being requested by the host CPU. The hardware coprocessor manages the cached tree-type data structure in a manner that is transparent to the host CPU. A driver located at the host CPU or at a separate computing device can provide an interface between the host CPU and the hardware coprocessor, thus reducing communications between the host CPU and the hardware coprocessor.

Type: Application

Filed: November 26, 2014

Publication date: May 26, 2016

Inventors: Kenneth H. Eguro, Zsolt Istvan, Arvind Arasu, Ravishankar Ramamurthy, Kaushik Shriraghav
Efficient indexing of error tolerant set containment

Patent number: 8606771

Abstract: The claimed subject matter provides a method and a system for the efficient indexing of error tolerant set containment. An exemplary method comprises obtaining a frequency threshold and a query set. All tokens or token sets within the query set are determined, and then all minimal infrequent tokens or all minimal infrequent tokens sets of data records are found and used to build an index. The minimal infrequent tokens or minimal infrequent tokensets are processed in a fixed order, and then a collection of signatures for each minimal infrequent token or token set is determined.

Type: Grant

Filed: December 21, 2010

Date of Patent: December 10, 2013

Assignee: Microsoft Corporation

Inventors: Arvind Arasu, Parag Agrawal, Kaushik Shriraghav
SYNTHETIC DATA GENERATION

Publication number: 20120330880

Abstract: The claimed subject matter provides a method for data generation. The method includes identifying a generative probability distribution based on one or more cardinality constraints for populating a database table. The method also includes selecting one or more values for a corresponding one or more attributes in the database table based on the generative probability distribution and the cardinality constraints. Additionally, the method includes generating a tuple for the database table. The tuple comprises the one or more values.

Type: Application

Filed: June 23, 2011

Publication date: December 27, 2012

Applicant: Microsoft Corporation

Inventors: Arvind Arasu, Kaushik Shriraghav, Jian Li
EFFICIENT INDEXING OF ERROR TOLERANT SET CONTAINMENT

Publication number: 20120158696

Abstract: The claimed subject matter provides a method and a system for the efficient indexing of error tolerant set containment. An exemplary method comprises obtaining a frequency threshold and a query set. All tokens or token sets within the query set are determined, and then all minimal infrequent tokens or all minimal infrequent tokens sets of data records are found and used to build an index. The minimal infrequent tokens or minimal infrequent tokensets are processed in a fixed order, and then a collection of signatures for each minimal infrequent token or token set is determined.

Type: Application

Filed: December 21, 2010

Publication date: June 21, 2012

Applicant: Microsoft Corporation

Inventors: Arvind Arasu, Parag Agrawal, Kaushik Shriraghav
Efficient exact set similarity joins

Patent number: 7865505

Abstract: A machine implemented system and method that efficiently facilitates and effectuates exact similarity joins between collections of sets. The system and method obtains a collection of sets and a threshold value from an interface, and based at least in part on an identifiable similarity, such as an overlap or intersection, between the collection of sets the analysis component generates and outputs a candidate pair that at least equals or exceeds the threshold value.

Type: Grant

Filed: January 30, 2007

Date of Patent: January 4, 2011

Assignee: Microsoft Corporation

Inventors: Arvind Arasu, Venkatesh Ganti, Kaushik Shriraghav
Robust cardinality and cost estimation for skyline operator

Patent number: 7707207

Abstract: The claimed subject matter relates to incorporating a skyline operator within a relational database engine, and more particularly to a database engine that utilizes novel techniques to determine the lowest cost of generating the skyline produced by the skyline operator. The database engine receives queries and associated preferences and, based on a cardinality estimate and a cost estimate, an appropriate skyline generating technique is utilized to produce a skyline representative of the received queries and its associated preferences.

Type: Grant

Filed: February 17, 2006

Date of Patent: April 27, 2010

Assignee: Microsoft Corporation

Inventors: Kaushik Shriraghav, Surajit Chaudhuri, Nilesh N. Dalvi
Designing record matching queries utilizing examples

Patent number: 7634464

Abstract: The subject disclosure pertains to a powerful and flexible framework for record matching. The framework facilitates design of a record matching query or package composed of a set of well-defined primitive operators (e.g., relational, data cleaning . . . ), which can ultimately be executed to match records. To assist design of such packages, a learning technique based on examples is provided. More specifically, a set of matching and non-matching record pairs can be input and employed to facilitate automatic package generation. A generated package can subsequently be transformed manually and/or automatically into a semantically equivalent form optimized for execution.

Type: Grant

Filed: June 14, 2006

Date of Patent: December 15, 2009

Assignee: Microsoft Corporation

Inventors: Bee-Chung Chen, Venkatesh Ganti, Kaushik Shriraghav
Systems and methods for estimating functional relationships in a database

Patent number: 7562067

Abstract: A system that facilitates estimating functional relationships associated with one or more columns in a database comprises a sampling component that receives a random sample of records within the database. An estimate generator component calculates an estimate of strength of functional relationships based at least in part upon the received samples. For example, the estimate generator component can calculate an estimate of strength of a column as a key column based at least in part upon the received samples.

Type: Grant

Filed: May 6, 2005

Date of Patent: July 14, 2009

Assignee: Microsoft Corporation

Inventors: Surajit Chaudhuri, Venkatesh Ganti, Kaushik Shriraghav
Minimal difference query and view matching

Patent number: 7558780

Abstract: The subject disclosure pertains to efficient computation of the difference between queries by exploiting commonality between them. A minimal difference query (MDQ) is generated that roughly corresponds to removal of as many joins as possible while still accurately representing the query difference. The minimal difference can be employed to further substantially the scope of view matching where a query is not wholly subsumed by a view. Additionally, the minimal difference query can be employed as an analytical tool in various contexts.

Type: Grant

Filed: November 30, 2006

Date of Patent: July 7, 2009

Assignee: Microsoft Corporation

Inventors: Kaushik Shriraghav, Venkatesh Ganti, Xin Dong
Techniques for estimating progress of database queries

Patent number: 7454407

Abstract: Techniques for estimating the progress of database queries are described herein. In a first implementation, a respective lower-bound parameter is associated with each node in an operator tree that representing a given database query, and the progress of the database query at a given point is estimated based upon the lower-bound parameters. In a second implementation, the progress of the query is estimated by associating respective lower-bound and upper-bound parameters with each node in the operator tree. The progress of the query at the given point is then estimated based on the lower-bound and upper-bound parameters.

Type: Grant

Filed: June 10, 2005

Date of Patent: November 18, 2008

Assignee: Microsoft Corporation

Inventors: Surajit Chaudhuri, Ravishankar Ramamurthy, Kaushik Shriraghav
Primitive operator for similarity joins in data cleaning

Patent number: 7406479

Abstract: A set similarity join system and method are provided. The system can be employed to facilitate data cleaning based on similarities through the identification of “close” tuples (e.g., records and/or rows). “Closeness” can be is evaluated using a similarity function(s) chosen to suit the domain and/or application. Thus, the system facilitates generic domain-independent data cleansing. The system can be employed with a foundational primitive, the set similarity join (SSJoin) operator, which can be used as a building block to implement a broad variety of notions of similarity (e.g., edit similarity, Jaccard similarity, generalized edit similarity, hamming distance, soundex, etc.) as well as similarity based on co-occurrences. The SSJoin operator can exploit the observation that set overlap can be used effectively to support a variety of similarity functions. The SSJoin operator compares values based on “sets” associated with (or explicitly constructed for) each one of them.

Type: Grant

Filed: February 10, 2006

Date of Patent: July 29, 2008

Assignee: Microsoft Corporation

Inventors: Kaushik Shriraghav, Surajit Chaudhuri, Venkatesh Ganti
DESIGNING RECORD MATCHING QUERIES UTILIZING EXAMPLES

Publication number: 20070294221

Abstract: The subject disclosure pertains to a powerful and flexible framework for record matching. The framework facilitates design of a record matching query or package composed of a set of well-defined primitive operators (e.g., relational, data cleaning . . . ), which can ultimately be executed to match records. To assist design of such packages, a learning technique based on examples is provided. More specifically, a set of matching and non-matching record pairs can be input and employed to facilitate automatic package generation. A generated package can subsequently be transformed manually and/or automatically into a semantically equivalent form optimized for execution.

Type: Application

Filed: June 14, 2006

Publication date: December 20, 2007

Applicant: MICROSOFT CORPORATION

Inventors: Bee-Chung Chen, Venkatesh Ganti, Kaushik Shriraghav
Robust cardinality and cost estimation for skyline operator

Publication number: 20070198439

Abstract: The claimed subject matter relates to incorporating a skyline operator within a relational database engine, and more particularly to a database engine that utilizes novel techniques to determine the lowest cost of generating the skyline produced by the skyline operator. The database engine receives queries and associated preferences and based on a cardinality estimate and a cost estimate an appropriate skyline generating technique is utilized to produce a skyline representative of the received queries and its associated preferences.

Type: Application

Filed: February 17, 2006

Publication date: August 23, 2007

Applicant: Microsoft Corporation

Inventors: Kaushik Shriraghav, Surajit Chaudhuri, Nilesh Dalvi
MINIMAL DIFFERENCE QUERY AND VIEW MATCHING

Publication number: 20070198469

Abstract: The subject disclosure pertains to efficient computation of the difference between queries by exploiting commonality between them. A minimal difference query (MDQ) is generated that roughly corresponds to removal of as many joins as possible while still accurately representing the query difference. The minimal difference can be employed to further substantially the scope of view matching where a query is not wholly subsumed by a view. Additionally, the minimal difference query can be employed as an analytical tool in various contexts.

Type: Application

Filed: November 30, 2006

Publication date: August 23, 2007

Applicant: MICROSOFT CORPORATION

Inventors: Kaushik Shriraghav, Venkatesh Ganti, Xin Dong
MINIMAL DIFFERENCE QUERY AND VIEW MATCHING

Publication number: 20070192297

Abstract: The subject disclosure pertains to efficient computation of the difference between queries by exploiting commonality between them. A minimal difference query (MDQ) is generated that roughly corresponds to removal of as many joins as possible while still accurately representing the query difference. The minimal difference can be employed to further substantially the scope of view matching where a query is not wholly subsumed by a view. Additionally, the minimal difference query can be employed as an analytical tool in various contexts.

Type: Application

Filed: November 9, 2006

Publication date: August 16, 2007

Applicant: MICROSOFT CORPORATION

Inventors: Kaushik Shriraghav, Venkatesh Ganti, Xin Dong
MINIMAL DIFFERENCE QUERY AND VIEW MATCHING

Publication number: 20070192282

Abstract: The subject disclosure pertains to efficient computation of the difference between queries by exploiting commonality between them. A minimal difference query (MDQ) is generated that roughly corresponds to removal of as many joins as possible while still accurately representing the query difference. The minimal difference can be employed to further substantially the scope of view matching where a query is not wholly subsumed by a view. Additionally, the minimal difference query can be employed as an analytical tool in various contexts.

Type: Application

Filed: February 13, 2006

Publication date: August 16, 2007

Applicant: Microsoft Corporation

Inventors: Kaushik Shriraghav, Venkatesh Ganti, Xin Dong
Primitive operator for similarity joins in data cleaning

Publication number: 20070192342

Abstract: A set similarity join system and method are provided. The system can be employed to facilitate data cleaning based on similarities through the identification of “close” tuples (e.g., records and/or rows). “Closeness” can be is evaluated using a similarity function(s) chosen to suit the domain and/or application. Thus, the system facilitates generic domain-independent data cleansing. The system can be employed with a foundational primitive, the set similarity join (SSJoin) operator, which can be used as a building block to implement a broad variety of notions of similarity (e.g., edit similarity, Jaccard similarity, generalized edit similarity, hamming distance, soundex, etc.) as well as similarity based on co-occurrences. The SSJoin operator can exploit the observation that set overlap can be used effectively to support a variety of similarity functions. The SSJoin operator compares values based on “sets” associated with (or explicitly constructed for) each one of them.

Type: Application

Filed: February 10, 2006

Publication date: August 16, 2007

Applicant: Microsoft Corporation

Inventors: Kaushik Shriraghav, Surajit Chaudhuri, Venkatesh Ganti

1 2 next