Patents by Inventor Snigdha Chaturvedi

Snigdha Chaturvedi has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

System and method for artificial intelligence story generation allowing content introduction

Patent number: 11520971

Abstract: Techniques for artificial intelligence assisted story generation includes training a neural network with first training data that indicates text for one or more portions of a training story and second training data that indicates text for a subset of text for an immediately following portion and third training data that indicates full text for the same portion. First data is retrieved that indicates text for a first one or more portions of a different new story. Second data is also received that indicates text for a cued subset of a next portion of the new story. Third data is generated that indicates full text for the next portion of the new story based on the first data and the second data and the neural network. The third data is concatenated to the first data to produce output data that is stored.

Type: Grant

Filed: March 30, 2020

Date of Patent: December 6, 2022

Assignee: THE REGENTS OF THE UNIVERSITY OF CALIFORNIA

Inventors: Snigdha Chaturvedi, Faeze Brahman, Alexandru Petrusca
SYSTEM AND METHOD FOR ARTIFICIAL INTELLIGENCE STORY GENERATION ALLOWING CONTENT INTRODUCTION

Publication number: 20200311341

Abstract: Techniques for artificial intelligence assisted story generation includes training a neural network with first training data that indicates text for one or more portions of a training story and second training data that indicates text for a subset of text for an immediately following portion and third training data that indicates full text for the same portion. First data is retrieved that indicates text for a first one or more portions of a different new story. Second data is also received that indicates text for a cued subset of a next portion of the new story. Third data is generated that indicates full text for the next portion of the new story based on the first data and the second data and the neural network. The third data is concatenated to the first data to produce output data that is stored.

Type: Application

Filed: March 30, 2020

Publication date: October 1, 2020

Inventors: Snigdha Chaturvedi, Faeze Brahman, Alexandru Petrusca
Automatically mining patterns for rule based data standardization systems

Patent number: 10163063

Abstract: Computer program products and systems are provided for mining for sub-patterns within a text data set. The embodiments facilitate finding a set of N frequently occurring sub-patterns within the data set, extracting the N sub-patterns from the data set, and clustering the extracted sub-patterns into K groups, where each extracted sub-pattern is placed within the same group with other extracted sub-patterns based upon a distance value D that determines a degree of similarity between the sub-pattern and every other sub-pattern within the same group.

Type: Grant

Filed: March 7, 2012

Date of Patent: December 25, 2018

Assignee: International Business Machines Corporation

Inventors: Snigdha Chaturvedi, Tanveer A Faruquie, Hima P. Karanam, Marvin Mendelssohn, Mukesh K. Mohania, L. Venkata Subramaniam
Automatically mining patterns for rule based data standardization systems

Patent number: 10095780

Abstract: Computer program products and systems are provided for mining for sub-patterns within a text data set. The embodiments facilitate finding a set of N frequently occurring sub-patterns within the data set, extracting the N sub-patterns from the data set, and clustering the extracted sub-patterns into K groups, where each extracted sub-pattern is placed within the same group with other extracted sub-patterns based upon a distance value D that determines a degree of similarity between the sub-pattern and every other sub-pattern within the same group.

Type: Grant

Filed: February 7, 2017

Date of Patent: October 9, 2018

Assignee: International Business Machines Corporation

Inventors: Snigdha Chaturvedi, Tanveer A. Faruquie, Hima P. Karanam, Marvin Mendelssohn, Mukesh K. Mohania, L. Venkata Subramaniam
AUTOMATICALLY MINING PATTERNS FOR RULE BASED DATA STANDARDIZATION SYSTEMS

Publication number: 20170147688

Abstract: Computer program products and systems are provided for mining for sub-patterns within a text data set. The embodiments facilitate finding a set of N frequently occurring sub-patterns within the data set, extracting the N sub-patterns from the data set, and clustering the extracted sub-patterns into K groups, where each extracted sub-pattern is placed within the same group with other extracted sub-patterns based upon a distance value D that determines a degree of similarity between the sub-pattern and every other sub-pattern within the same group.

Type: Application

Filed: February 7, 2017

Publication date: May 25, 2017

Inventors: Snigdha Chaturvedi, Tanveer A. Faruquie, Hima P. Karanam, Marvin Mendelssohn, Mukesh K. Mohania, L. Venkata Subramaniam
Cleansing a database system to improve data quality

Patent number: 9104709

Abstract: According to one embodiment of the present invention, a system controls cleansing of data within a database system, and comprises a computer system including at least one processor. The system receives a data set from the database system, and one or more features of the data set are selected for determining values for one or more characteristics of the selected features. The determined values are applied to a data quality estimation model to determine data quality estimates for the data set. Problematic data within the data set are identified based on the data quality estimates, where the cleansing is adjusted to accommodate the identified problematic data. Embodiments of the present invention further include a method and computer program product for controlling cleansing of data within a database system in substantially the same manner described above.

Type: Grant

Filed: March 16, 2012

Date of Patent: August 11, 2015

Assignee: International Business Machines Corporation

Inventors: Snigdha Chaturvedi, Tanveer A Faruquie, Hima P Karanam, Mukesh K Mohania, L Venkata Subramaniam
Automatically mining patterns for rule based data standardization systems

Patent number: 8996524

Abstract: Methods, computer program products and systems are provided for mining for sub-patterns within a text data set. The embodiments facilitate finding a set of N frequently occurring sub-patterns within the data set, extracting the N sub-patterns from the data set, and clustering the extracted sub-patterns into K groups, where each extracted sub-pattern is placed within the same group with other extracted sub-patterns based upon a distance value D that determines a degree of similarity between the sub-pattern and every other sub-pattern within the same group.

Type: Grant

Filed: March 8, 2012

Date of Patent: March 31, 2015

Assignee: International Business Machines Corporation

Inventors: Snigdha Chaturvedi, Tanveer A Faruquie, Hima P. Karanam, Marvin Mendelssohn, Mukesh K. Mohania, L. Venkata Subramaniam
Efficient development of a rule-based system using crowd-sourcing

Patent number: 8949204

Abstract: Described herein are methods, systems, apparatuses and products for efficient development of a rule-based system. An aspect provides a method including accessing data records; converting said data records to an intermediate form; utilizing intermediate forms to compute similarity scores for said data records; and selecting as an example to be provided for rule making at least one record of said data records having a maximum dissimilarity score indicative of dissimilarity to already considered examples.

Type: Grant

Filed: August 29, 2012

Date of Patent: February 3, 2015

Assignee: International Business Machines Corporation

Inventors: Snigdha Chaturvedi, Tanveer Afzal Faruquie, L. Venkata Subramaniam
Systems and methods for efficient development of a rule-based system using crowd-sourcing

Patent number: 8635197

Abstract: Described herein are methods, systems, apparatuses and products for efficient development of a rule-based system. An aspect provides a method including accessing data records; converting said data records to an intermediate form; utilizing intermediate forms to compute similarity scores for said data records; and selecting as an example to be provided for rule making at least one record of said data records having a maximum dissimilarity score indicative of dissimilarity to already considered examples.

Type: Grant

Filed: February 28, 2011

Date of Patent: January 21, 2014

Assignee: International Business Machines Corporation

Inventors: Snigdha Chaturvedi, Tanveer Afzal Faruquie, L. Venkata Subramaniam
Automatic selection of blocking column for de-duplication

Patent number: 8560506

Abstract: A method of blocking column selection can include determining a first parameter for each column set of a plurality of column sets, wherein the first parameter indicates distribution of blocks in the column set, and determining a second parameter for each column set. The second parameter can indicate block size for the column set. For each column set, a measure of blockability that is dependent upon at least the first parameter and the second parameter can be calculated using a processor. The plurality of column sets can be ranked according to the measures of blockability.

Type: Grant

Filed: April 16, 2012

Date of Patent: October 15, 2013

Assignee: International Business Machines Corporation

Inventors: Snigdha Chaturvedi, Tanveer A. Faruquie, Hima P. Karanam, Marvin Mendelssohn, Mukesh K. Mohania, L. Venkata Subramaniam
Automatic selection of blocking column for de-duplication

Patent number: 8560505

Abstract: Blocking column selection can include determining a first parameter for each column set of a plurality of column sets, wherein the first parameter indicates distribution of blocks in the column set, and determining a second parameter for each column set. The second parameter can indicate block size for the column set. For each column set, a measure of blockability that is dependent upon at least the first parameter and the second parameter can be calculated using a processor. The plurality of column sets can be ranked according to the measures of blockability.

Type: Grant

Filed: December 7, 2011

Date of Patent: October 15, 2013

Assignee: International Business Machines Corporation

Inventors: Snigdha Chaturvedi, Tanveer A. Faruquie, Hima P. Karanam, Marvin Mendelssohn, Mukesh K. Mohania, L. Venkata Subramaniam
Automatically Mining Patterns For Rule Based Data Standardization Systems

Publication number: 20130238610

Abstract: Computer program products and systems are provided for mining for sub-patterns within a text data set. The embodiments facilitate finding a set of N frequently occurring sub-patterns within the data set, extracting the N sub-patterns from the data set, and clustering the extracted sub-patterns into K groups, where each extracted sub-pattern is placed within the same group with other extracted sub-patterns based upon a distance value D that determines a degree of similarity between the sub-pattern and every other sub-pattern within the same group.

Type: Application

Filed: March 7, 2012

Publication date: September 12, 2013

Applicant: International Business Machines Corporation

Inventors: Snigdha Chaturvedi, Tanveer A. Faruquie, Hima P. Karanam, Marvin Mendelssohn, Mukesh K. Mohania, L. Venkata Subramaniam
Automatically Mining Patterns for Rule Based Data Standardization Systems

Publication number: 20130238611

Abstract: Methods, computer program products and systems are provided for mining for sub-patterns within a text data set. The embodiments facilitate finding a set of N frequently occurring sub-patterns within the data set, extracting the N sub-patterns from the data set, and clustering the extracted sub-patterns into K groups, where each extracted sub-pattern is placed within the same group with other extracted sub-patterns based upon a distance value D that determines a degree of similarity between the sub-pattern and every other sub-pattern within the same group.

Type: Application

Filed: March 8, 2012

Publication date: September 12, 2013

Applicant: International Business Machines Corporation

Inventors: Snigdha Chaturvedi, Tanveer A. Faruquie, Hima P. Karanam, Marvin Mendelssohn, Mukesh K. Mohania, L. Venkata Subramaniam
AUTOMATIC SELECTION OF BLOCKING COLUMN FOR DE-DUPLICATION

Publication number: 20130151490

Abstract: A method of blocking column selection can include determining a first parameter for each column set of a plurality of column sets, wherein the first parameter indicates distribution of blocks in the column set, and determining a second parameter for each column set. The second parameter can indicate block size for the column set. For each column set, a measure of blockability that is dependent upon at least the first parameter and the second parameter can be calculated using a processor. The plurality of column sets can be ranked according to the measures of blockability.

Type: Application

Filed: April 16, 2012

Publication date: June 13, 2013

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: SNIGDHA CHATURVEDI, TANVEER A. FARUQUIE, HIMA P. KARANAM, MARVIN MENDELSSOHN, MUKESH K. MOHANIA, L. VENKATA SUBRAMANIAM
AUTOMATIC SELECTION OF BLOCKING COLUMN FOR DE-DUPLICATION

Publication number: 20130151487

Abstract: Blocking column selection can include determining a first parameter for each column set of a plurality of column sets, wherein the first parameter indicates distribution of blocks in the column set, and determining a second parameter for each column set. The second parameter can indicate block size for the column set. For each column set, a measure of blockability that is dependent upon at least the first parameter and the second parameter can be calculated using a processor. The plurality of column sets can be ranked according to the measures of blockability.

Type: Application

Filed: December 7, 2011

Publication date: June 13, 2013

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: SNIGDHA CHATURVEDI, TANVEER A. FARUQUIE, HIMA P. KARANAM, MARVIN MENDELSSOHN, MUKESH K. MOHANIA, L. VENKATA SUBRAMANIAM
EFFICIENT DEVELOPMENT OF A RULE-BASED SYSTEM USING CROWD-SOURCING

Publication number: 20120323866

Abstract: Described herein are methods, systems, apparatuses and products for efficient development of a rule-based system. An aspect provides a method including accessing data records; converting said data records to an intermediate form; utilizing intermediate forms to compute similarity scores for said data records; and selecting as an example to be provided for rule making at least one record of said data records having a maximum dissimilarity score indicative of dissimilarity to already considered examples.

Type: Application

Filed: August 29, 2012

Publication date: December 20, 2012

Applicant: INTERNATIONAL MACHINES CORPORATION

Inventors: Snigdha Chaturvedi, Tanveer Afzal Faruquie, L. Venkata Subramaniam
SYSTEMS AND METHODS FOR EFFICIENT DEVELOPMENT OF A RULE-BASED SYSTEM USING CROWD-SOURCING

Publication number: 20120221508

Abstract: Described herein are methods, systems, apparatuses and products for efficient development of a rule-based system. An aspect provides a method including accessing data records; converting said data records to an intermediate form; utilizing intermediate forms to compute similarity scores for said data records; and selecting as an example to be provided for rule making at least one record of said data records having a maximum dissimilarity score indicative of dissimilarity to already considered examples.

Type: Application

Filed: February 28, 2011

Publication date: August 30, 2012

Applicant: INTERNATIONAL MACHINES CORPORATION

Inventors: Snigdha Chaturvedi, Tanveer Afzal Faruquie, L. Venkata Subramaniam
Cleansing a Database System to Improve Data Quality

Publication number: 20120179658

Abstract: According to one embodiment of the present invention, a system controls cleansing of data within a database system, and comprises a computer system including at least one processor. The system receives a data set from the database system, and one or more features of the data set are selected for determining values for one or more characteristics of the selected features. The determined values are applied to a data quality estimation model to determine data quality estimates for the data set. Problematic data within the data set are identified based on the data quality estimates, where the cleansing is adjusted to accommodate the identified problematic data. Embodiments of the present invention further include a method and computer program product for controlling cleansing of data within a database system in substantially the same manner described above.

Type: Application

Filed: March 16, 2012

Publication date: July 12, 2012

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Snigdha Chaturvedi, Tanveer A. Faruquie, Hima P. Karanam, Mukesh K. Mohania, L. Venkata Subramaniam
Cleansing a Database System to Improve Data Quality

Publication number: 20120150825

Abstract: According to one embodiment of the present invention, a system controls cleansing of data within a database system, and comprises a computer system including at least one processor. The system receives a data set from the database system, and one or more features of the data set are selected for determining values for one or more characteristics of the selected features. The determined values are applied to a data quality estimation model to determine data quality estimates for the data set. Problematic data within the data set are identified based on the data quality estimates, where the cleansing is adjusted to accommodate the identified problematic data. Embodiments of the present invention further include a method and computer program product for controlling cleansing of data within a database system in substantially the same manner described above.

Type: Application

Filed: December 13, 2010

Publication date: June 14, 2012

Applicant: International Business Machines Corporation

Inventors: Snigdha Chaturvedi, Tanveer A. Faruquie, Hima P. Karanam, Mukesh K. Mohania, L. Venkata Subramaniam