Patents by Inventor Atreyee DEY
Atreyee DEY has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11228568Abstract: Systems, methods, and computer-readable media are disclosed for systems and methods for anonymization of user data for privacy across distributed computing systems. Example methods may include determining, by a first computer system, a request for content to present at a user device, wherein the request for content is associated with a user account, determining a first search query associated with the user account, and determining a first keyword associated with the first search query. Some methods may include generating a first hash value for the first keyword, sending the first hash value to a second computer system for identification of first content for presentation at the user device, and causing the second computer system to send the first content to the user device for presentation, wherein the first computer system does not receive the first content.Type: GrantFiled: November 30, 2018Date of Patent: January 18, 2022Assignee: AMAZON TECHNOLOGIES, INC.Inventors: Atreyee Dey, Debasish Das, Gaurav Bhatnagar
-
Patent number: 10324947Abstract: A data analysis server maintains database operation history data and context data for database operations performed on tables by a set of training users. The data analysis server builds predictive models for using the maintained data to recommend database operations and operands to a set of guided users. The data analysis server trains the predictive models by determining and weighting features derived from context data that are predictive of performing database operations to tables with similar context data. Using the predictive model, the data analysis server generates recommended database operations and operands based on context data received from a data analysis application of a guided user and sends the recommendations to the data analysis application for presentation to the guided user.Type: GrantFiled: April 26, 2016Date of Patent: June 18, 2019Assignee: Informatica LLCInventors: Atreyee Dey, Sanjay Kaluskar, Udayakumar Dhansingh
-
Patent number: 10120930Abstract: Entity mappings that produce matching entities for a first data asset having attributes and a second data asset having attributes are generated by: generating entity mappings that produce matching entities for a first data asset having attributes with attribute values and a second data asset having attributes with attribute values by: matching the attribute values of the attributes of the first data asset with the attribute values of the attributes of the second data asset, using the matching attribute values to generate matching attribute pairs, and using the matching attribute pairs to identify entity mappings; computing an entity mapping score for each of the entity mappings based on a combination of factors; ranking the entity mappings based on each entity mapping score; and using some of the ranked entity mappings to determine whether a same real-world entity is described by the first data asset and the second data asset.Type: GrantFiled: September 16, 2016Date of Patent: November 6, 2018Assignee: International Business Machines CorporationInventors: Prasad M. Deshpande, Atreyee Dey, Rajeev Gupta, Sanjeev K. Gupta, Salil Joshi, Sriram K. Padmanabhan
-
Patent number: 10025846Abstract: Entity mappings that produce matching entities for a first data asset having attributes and a second data asset having attributes are generated by: generating entity mappings that produce matching entities for a first data asset having attributes with attribute values and a second data asset having attributes with attribute values by: matching the attribute values of the attributes of the first data asset with the attribute values of the attributes of the second data asset, using the matching attribute values to generate matching attribute pairs, and using the matching attribute pairs to identify entity mappings; computing an entity mapping score for each of the entity mappings based on a combination of factors; ranking the entity mappings based on each entity mapping score; and using some of the ranked entity mappings to determine whether a same real-world entity is described by the first data asset and the second data asset.Type: GrantFiled: September 14, 2015Date of Patent: July 17, 2018Assignee: International Business Machines CorporationInventors: Prasad M. Deshpande, Atreyee Dey, Rajeev Gupta, Sanjeev K. Gupta, Salil Joshi, Sriram K. Padmanabhan
-
Patent number: 9886711Abstract: Embodiments of the present invention disclose a method, computer program product, and system for identifying matching products relative to a reference product. A reference product is identified from a received product query and a query is generated based on the reference product. A generated query comprises of an ontology, at least one word appearing in a title of the reference product, and a set of key words appearing in social media data associated with the reference product. A database is searched using the generated query to find matching product sets and the results are returned and filtered. Results are filtered by calculating a relationship score between the reference product and one or more matching products in the set of matching products, and/or by filtering a subset of the set of matching products based on a customer profile. The filtered subset of results are communicated to a recipient.Type: GrantFiled: September 29, 2014Date of Patent: February 6, 2018Assignee: International Business Machines CorporationInventors: Prasad M. Deshpande, Atreyee Dey, Salil Joshi, Songhua Xing
-
Publication number: 20170308595Abstract: A data analysis server maintains database operation history data and context data for database operations performed on tables by a set of training users. The data analysis server builds predictive models for using the maintained data to recommend database operations and operands to a set of guided users. The data analysis server trains the predictive models by determining and weighting features derived from context data that are predictive of performing database operations to tables with similar context data. Using the predictive model, the data analysis server generates recommended database operations and operands based on context data received from a data analysis application of a guided user and sends the recommendations to the data analysis application for presentation to the guided user.Type: ApplicationFiled: April 26, 2016Publication date: October 26, 2017Inventors: Atreyee Dey, Sanjay Kaluskar, Udayakumar Dhansingh
-
Patent number: 9785657Abstract: Generation of synthetic database data includes annotated query subplans for a multiple table query workload that includes a desired cardinality for nodes (v) in the subplans. The subplans may be merged and represented by a direct acyclic graph (DAG). The maximum entropy joint probability distribution for each attribute (x) for each node (v) is determined as: p ? ( x ) = exp [ ( ? v ? ? w v ? f v ? ( x ) Z ] ) for each node v, where wv is a weight of node v, fv is a conjunct of predicates in a subplan rooted at node v, and Z is a normalization factor. This distribution is determined such that the desired cardinality, and selectivities for each node v determined from the desired cardinality, are satisfied. The data for a plurality of tables are generated by sampling the maximum entropy joint probability distribution for a domain of attributes (x) of a plurality of tables. Data may be efficiently generated for multiple table queries and for DAGs.Type: GrantFiled: September 13, 2014Date of Patent: October 10, 2017Assignee: International Business Machines CorporationInventors: Atreyee Dey, Prasan Roy
-
Patent number: 9740749Abstract: Methods and arrangements for identifying related data in different data sets to assist in searching the data sets. A first data asset and a second data asset are accessed. Common entities are identified between the first and second data assets. A score is determined for the relationship between the first and second data assets, based on the identified common entities. One or more relationship contexts are determined for the relationship between the first and second data assets, and the relationship score and one or more relationship contexts are used to join at least a portion of each of the first and second data assets as a basis for subsequent searching. Other variants and embodiments are broadly contemplated herein.Type: GrantFiled: August 19, 2014Date of Patent: August 22, 2017Assignee: International Business Machines CorporationInventors: Prasad Manikarao Deshpande, Atreyee Dey, Rajeev Gupta, Sriram K. Padmanabhan
-
Publication number: 20170075898Abstract: Entity mappings that produce matching entities for a first data asset having attributes and a second data asset having attributes are generated by: generating entity mappings that produce matching entities for a first data asset having attributes with attribute values and a second data asset having attributes with attribute values by: matching the attribute values of the attributes of the first data asset with the attribute values of the attributes of the second data asset, using the matching attribute values to generate matching attribute pairs, and using the matching attribute pairs to identify entity mappings; computing an entity mapping score for each of the entity mappings based on a combination of factors; ranking the entity mappings based on each entity mapping score; and using some of the ranked entity mappings to determine whether a same real-world entity is described by the first data asset and the second data asset.Type: ApplicationFiled: September 16, 2016Publication date: March 16, 2017Inventors: Prasad M. Deshpande, Atreyee Dey, Rajeev Gupta, Sanjeev K. Gupta, Salil Joshi, Sriram K. Padmanabhan
-
Publication number: 20170075984Abstract: Entity mappings that produce matching entities for a first data asset having attributes and a second data asset having attributes are generated by: generating entity mappings that produce matching entities for a first data asset having attributes with attribute values and a second data asset having attributes with attribute values by: matching the attribute values of the attributes of the first data asset with the attribute values of the attributes of the second data asset, using the matching attribute values to generate matching attribute pairs, and using the matching attribute pairs to identify entity mappings; computing an entity mapping score for each of the entity mappings based on a combination of factors; ranking the entity mappings based on each entity mapping score; and using some of the ranked entity mappings to determine whether a same real-world entity is described by the first data asset and the second data asset.Type: ApplicationFiled: September 14, 2015Publication date: March 16, 2017Inventors: Prasad M. Deshpande, Atreyee Dey, Rajeev Gupta, Sanjeev K. Gupta, Salil Joshi, Sriram K. Padmanabhan
-
Publication number: 20160092960Abstract: Embodiments of the present invention disclose a method, computer program product, and system for identifying matching products relative to a reference product. A reference product is identified from a received product query and a query is generated based on the reference product. A generated query comprises of an ontology, at least one word appearing in a title of the reference product, and a set of key words appearing in social media data associated with the reference product. A database is searched using the generated query to find matching product sets and the results are returned and filtered. Results are filtered by calculating a relationship score between the reference product and one or more matching products in the set of matching products, and/or by filtering a subset of the set of matching products based on a customer profile. The filtered subset of results are communicated to a recipient.Type: ApplicationFiled: September 29, 2014Publication date: March 31, 2016Inventors: Prasad M. Deshpande, Atreyee Dey, Salil Joshi, Songhua Xing
-
Publication number: 20160055158Abstract: Methods and arrangements for identifying related data in different data sets to assist in searching the data sets. A first data asset and a second data asset are accessed. Common entities are identified between the first and second data assets. A score is determined for the relationship between the first and second data assets, based on the identified common entities. One or more relationship contexts are determined for the relationship between the first and second data assets, and the relationship score and one or more relationship contexts are used to join at least a portion of each of the first and second data assets as a basis for subsequent searching. Other variants and embodiments are broadly contemplated herein.Type: ApplicationFiled: August 19, 2014Publication date: February 25, 2016Inventors: Prasad Manikarao Deshpande, Atreyee Dey, Rajeev Gupta, Sriram K. Padmanabhan
-
Patent number: 9244950Abstract: Generation of synthetic database data includes annotated query subplans for a multiple table query workload that includes a desired cardinality for nodes (v) in the subplans. The subplans may be merged and represented by a direct acyclic graph (DAG). The maximum entropy joint probability distribution for each attribute (x) for each node (v) is determined as: p ? ( x ) = exp ( ? v ? ? w v ? f v ? ( x ) Z ) for each node ?, where wv is a weight of node v, fv is a conjunct of predicates in a subplan rooted at node v, and Z is a normalization factor. This distribution is determined such that the desired cardinality, and selectivities for each node v determined from the desired cardinality, are satisfied. The data for a plurality of tables are generated by sampling the maximum entropy joint probability distribution for a domain of attributes (x) of a plurality of tables. Data may be efficiently generated for multiple table queries and for DAGs.Type: GrantFiled: July 3, 2013Date of Patent: January 26, 2016Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Atreyee Dey, Prasan Roy
-
Publication number: 20150012522Abstract: Generation of synthetic database data includes annotated query subplans for a multiple table query workload that includes a desired cardinality for nodes (v) in the subplans. The subplans may be merged and represented by a direct acyclic graph (DAG). The maximum entropy joint probability distribution for each attribute (x) for each node (v) is determined as: p ? ( x ) = exp ( ? v ? ? w v ? f v ? ( x ) Z ) for each node v, where wv is a weight of node v, fv is a conjunct of predicates in a subplan rooted at node v, and Z is a normalization factor. This distribution is determined such that the desired cardinality, and selectivities for each node v determined from the desired cardinality, are satisfied. The data for a plurality of tables are generated by sampling the maximum entropy joint probability distribution for a domain of attributes (x) of a plurality of tables. Data may be efficiently generated for multiple table queries and for DAGs.Type: ApplicationFiled: July 3, 2013Publication date: January 8, 2015Inventors: Atreyee DEY, Prasan ROY
-
Publication number: 20150012523Abstract: Generation of synthetic database data includes annotated query subplans for a multiple table query workload that includes a desired cardinality for nodes (v) in the subplans. The subplans may be merged and represented by a direct acyclic graph (DAG). The maximum entropy joint probability distribution for each attribute (x) for each node (v) is determined as: p ? ( x ) = exp [ ( ? v ? ? w v ? f v ? ( x ) Z ] ) for each node v, where wv is a weight of node v, fv is a conjunct of predicates in a subplan rooted at node v, and Z is a normalization factor. This distribution is determined such that the desired cardinality, and selectivities for each node v determined from the desired cardinality, are satisfied. The data for a plurality of tables are generated by sampling the maximum entropy joint probability distribution for a domain of attributes (x) of a plurality of tables. Data may be efficiently generated for multiple table queries and for DAGs.Type: ApplicationFiled: September 13, 2014Publication date: January 8, 2015Inventors: Atreyee DEY, Prasan ROY