Patents by Inventor Shilpi Ahuja

Shilpi Ahuja has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Inferring location attributes from data entries

Patent number: 11704345

Abstract: A system and method are provided for inferring location attributes from data entries. The method comprises for data entries in a structured data set format, a computer system selecting a sample of rows. The computer system then identifies columns containing geospatial and temporal information based on the column headings. The computer system next identifies location information within the structured data set. The computer system determines implied location information based on the identified location information. The computer system derives location values based on the identified and implied location information using consolidation rules, resulting in a final set of location attributes for the data entries. The computer system then associates the final set of location attributes with the data entries.

Type: Grant

Filed: January 4, 2019

Date of Patent: July 18, 2023

Assignee: International Business Machines Corporation

Inventors: Shilpi Ahuja, Thomas Kemp, Charles D. Wolfson
Method to determine columns that contain location data in a data set

Patent number: 10956456

Abstract: A method of identifying location data in a data set comprises generating a data sample from the data set, training a plurality of models with the data sample to identify the location data in the data set, and applying the data set to the trained models to determine the location data within the data set. The plurality of models includes one or more first models to identify primary attributes of the location data indicating a geographical area and one or more second models to identify secondary attributes of the location data used to determine corresponding primary attributes.

Type: Grant

Filed: November 29, 2016

Date of Patent: March 23, 2021

Assignee: International Business Machines Corporation

Inventors: Shilpi Ahuja, Rafael J. Z. Bastidas, Rashmi Gangadharaiah, Mary A. Roth
Method to determine columns that contain location data in a data set

Patent number: 10909473

Abstract: A method of identifying location data in a data set comprises generating a data sample from the data set, training a plurality of models with the data sample to identify the location data in the data set, and applying the data set to the trained models to determine the location data within the data set. The plurality of models includes one or more first models to identify primary attributes of the location data indicating a geographical area and one or more second models to identify secondary attributes of the location data used to determine corresponding primary attributes.

Type: Grant

Filed: January 9, 2018

Date of Patent: February 2, 2021

Assignee: International Business Machines Corporation

Inventors: Shilpi Ahuja, Rafael J. Z. Bastidas, Rashmi Gangadharaiah, Mary A. Roth
INFERRING LOCATION ATTRIBUTES FROM DATA ENTRIES

Publication number: 20200218741

Abstract: A system and method are provided for inferring location attributes from data entries. The method comprises for data entries in a structured data set format, a computer system selecting a sample of rows. The computer system then identifies columns containing geospatial and temporal information based on the column headings. The computer system next identifies location information within the structured data set. The computer system determines implied location information based on the identified location information. The computer system derives location values based on the identified and implied location information using consolidation rules, resulting in a final set of location attributes for the data entries. The computer system then associates the final set of location attributes with the data entries.

Type: Application

Filed: January 4, 2019

Publication date: July 9, 2020

Inventors: Shilpi Ahuja, Thomas Kemp, Charles D. Wolfson
Merging synonymous entities from multiple structured sources into a dataset

Patent number: 10671577

Abstract: Merging synonymous entities from multiple structured sources into a dataset includes receiving a first set of paired terms from a first authoritative source for a domain and a second set of paired terms from a second authoritative source for the domain. The first set of paired terms is compared to the second set of paired terms with a similarity assessment based on a clustering statistical algorithm to identify paired terms from the first set of paired terms that share a synonymous term with one or more paired terms from the second set of paired terms. The paired terms associated with the synonymous term are merged and a dataset is generated that associates a normalized version of the synonymous term with any terms included in the merged paired terms.

Type: Grant

Filed: September 23, 2016

Date of Patent: June 2, 2020

Assignee: International Business Machines Corporation

Inventors: Shilpi Ahuja, Sheng Hua Bao, Rashmi Gangadharaiah
Weather-based natural language text processing

Patent number: 10572526

Abstract: Relationship extraction between descriptors in one or more lists of weather condition descriptors, and adverse event descriptors within unstructured data sources using natural language processing. Medical condition descriptor may be a descriptor that may be used to further extract relationships between weather condition descriptors and adverse event descriptors. A data object is generated, according to a data model, based on the extracted relationships between the descriptors. A set of candidate unstructured documents containing the extracted relationship between the descriptors is retrieved and filtered by selecting unstructured documents that include a precautionary measure descriptor. The filtered precautionary measure descriptors are presented to a user in a summarized message to a user device.

Type: Grant

Filed: June 28, 2019

Date of Patent: February 25, 2020

Assignee: International Business Machines Corporation

Inventors: Shilpi Ahuja, Sheng Hua Bao, Rashmi Gangadharaiah
Weather-based natural language text processing

Patent number: 10558695

Abstract: Relationship extraction between descriptors in one or more lists of weather condition descriptors, and adverse event descriptors within unstructured data sources using natural language processing. Medical condition descriptor may be a descriptor that may be used to further extract relationships between weather condition descriptors and adverse event descriptors. A data object is generated, according to a data model, based on the extracted relationships between the descriptors. A set of candidate unstructured documents containing the extracted relationship between the descriptors is retrieved and filtered by selecting unstructured documents that include a precautionary measure descriptor. The filtered precautionary measure descriptors are presented to a user in a summarized message to a user device.

Type: Grant

Filed: May 30, 2017

Date of Patent: February 11, 2020

Assignee: International Business Machines Corporation

Inventors: Shilpi Ahuja, Sheng Hua Bao, Rashmi Gangadharaiah
WEATHER-BASED NATURAL LANGUAGE TEXT PROCESSING

Publication number: 20190317957

Abstract: Relationship extraction between descriptors in one or more lists of weather condition descriptors, and adverse event descriptors within unstructured data sources using natural language processing. Medical condition descriptor may be a descriptor that may be used to further extract relationships between weather condition descriptors and adverse event descriptors. A data object is generated, according to a data model, based on the extracted relationships between the descriptors. A set of candidate unstructured documents containing the extracted relationship between the descriptors is retrieved and filtered by selecting unstructured documents that include a precautionary measure descriptor. The filtered precautionary measure descriptors are presented to a user in a summarized message to a user device.

Type: Application

Filed: June 28, 2019

Publication date: October 17, 2019

Inventors: Shilpi Ahuja, Sheng Hua Bao, Rashmi Gangadharaiah
Automatic detection and cleansing of erroneous concepts in an aggregated knowledge base

Patent number: 10331659

Abstract: A mechanism is provided for automatically detecting and cleansing erroneous concepts in an aggregated knowledge base. A graph data structure representing the concept present in a portion of the natural language content is generated. The graph data structure is analyzed to determine whether or not the graph data structure comprises one or more concept conflicts in association with a set of nodes in the graph data structure, the one or more concept conflicts are associated with the set of nodes if two or more nodes represent separate and distinct concepts. Responsive to determining that there are one or more concept conflicts due to there being two or more nodes representing separate and distinct concepts, the two or more nodes are split into separate distinct concepts within the knowledge base.

Type: Grant

Filed: September 6, 2016

Date of Patent: June 25, 2019

Assignee: International Business Machines Corporation

Inventors: Shilpi Ahuja, Sheng Hua Bao, Rashmi Gangadharaiah
WEATHER-BASED NATURAL LANGUAGE TEXT PROCESSING

Publication number: 20180349326

Abstract: Relationship extraction between descriptors in one or more lists of weather condition descriptors, and adverse event descriptors within unstructured data sources using natural language processing. Medical condition descriptor may be a descriptor that may be used to further extract relationships between weather condition descriptors and adverse event descriptors. A data object is generated, according to a data model, based on the extracted relationships between the descriptors. A set of candidate unstructured documents containing the extracted relationship between the descriptors is retrieved and filtered by selecting unstructured documents that include a precautionary measure descriptor. The filtered precautionary measure descriptors are presented to a user in a summarized message to a user device.

Type: Application

Filed: May 30, 2017

Publication date: December 6, 2018

Inventors: Shilpi Ahuja, Sheng Hua Bao, Rashmi Gangadharaiah
METHOD TO DETERMINE COLUMNS THAT CONTAIN LOCATION DATA IN A DATA SET

Publication number: 20180150769

Abstract: A method of identifying location data in a data set comprises generating a data sample from the data set, training a plurality of models with the data sample to identify the location data in the data set, and applying the data set to the trained models to determine the location data within the data set. The plurality of models includes one or more first models to identify primary attributes of the location data indicating a geographical area and one or more second models to identify secondary attributes of the location data used to determine corresponding primary attributes.

Type: Application

Filed: January 9, 2018

Publication date: May 31, 2018

Inventors: Shilpi Ahuja, Rafael J.Z. Bastidas, Rashmi Gangadharaiah, Mary A. Roth
METHOD TO DETERMINE COLUMNS THAT CONTAIN LOCATION DATA IN A DATA SET

Publication number: 20180150765

Abstract: A method of identifying location data in a data set comprises generating a data sample from the data set, training a plurality of models with the data sample to identify the location data in the data set, and applying the data set to the trained models to determine the location data within the data set. The plurality of models includes one or more first models to identify primary attributes of the location data indicating a geographical area and one or more second models to identify secondary attributes of the location data used to determine corresponding primary attributes.

Type: Application

Filed: November 29, 2016

Publication date: May 31, 2018

Inventors: Shilpi Ahuja, Rafael J.Z. Bastidas, Rashmi Gangadharaiah, Mary A. Roth
MERGING SYNONYMOUS ENTITIES FROM MULTIPLE STRUCTURED SOURCES INTO A DATASET

Publication number: 20180089300

Abstract: Merging synonymous entities from multiple structured sources into a dataset includes receiving a first set of paired terms from a first authoritative source for a domain and a second set of paired terms from a second authoritative source for the domain. The first set of paired terms is compared to the second set of paired terms with a similarity assessment based on a clustering statistical algorithm to identify paired terms from the first set of paired terms that share a synonymous term with one or more paired terms from the second set of paired terms. The paired terms associated with the synonymous term are merged and a dataset is generated that associates a normalized version of the synonymous term with any terms included in the merged paired terms.

Type: Application

Filed: September 23, 2016

Publication date: March 29, 2018

Inventors: Shilpi Ahuja, Sheng Hua Bao, Rashmi Gangadharaiah
Automatic Detection and Cleansing of Erroneous Concepts in an Aggregated Knowledge Base

Publication number: 20180067981

Abstract: A mechanism is provided for automatically detecting and cleansing erroneous concepts in an aggregated knowledge base. A graph data structure representing the concept present in a portion of the natural language content is generated. The graph data structure is analyzed to determine whether or not the graph data structure comprises one or more concept conflicts in association with a set of nodes in the graph data structure, the one or more concept conflicts are associated with the set of nodes if two or more nodes represent separate and distinct concepts. Responsive to determining that there are one or more concept conflicts due to there being two or more nodes representing separate and distinct concepts, the two or more nodes are split into separate distinct concepts within the knowledge base.

Type: Application

Filed: September 6, 2016

Publication date: March 8, 2018

Inventors: Shilpi Ahuja, Sheng Hua Bao, Rashmi Gangadharaiah