Patents by Inventor Shilpi Ahuja
Shilpi Ahuja has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11704345
Abstract: A system and method are provided for inferring location attributes from data entries. The method comprises for data entries in a structured data set format, a computer system selecting a sample of rows. The computer system then identifies columns containing geospatial and temporal information based on the column headings. The computer system next identifies location information within the structured data set. The computer system determines implied location information based on the identified location information. The computer system derives location values based on the identified and implied location information using consolidation rules, resulting in a final set of location attributes for the data entries. The computer system then associates the final set of location attributes with the data entries.
Type: Grant
Filed: January 4, 2019
Date of Patent: July 18, 2023
Assignee: International Business Machines Corporation
Inventors: Shilpi Ahuja, Thomas Kemp, Charles D. Wolfson
-
Patent number: 10956456
Abstract: A method of identifying location data in a data set comprises generating a data sample from the data set, training a plurality of models with the data sample to identify the location data in the data set, and applying the data set to the trained models to determine the location data within the data set. The plurality of models includes one or more first models to identify primary attributes of the location data indicating a geographical area and one or more second models to identify secondary attributes of the location data used to determine corresponding primary attributes.
Type: Grant
Filed: November 29, 2016
Date of Patent: March 23, 2021
Assignee: International Business Machines Corporation
Inventors: Shilpi Ahuja, Rafael J. Z. Bastidas, Rashmi Gangadharaiah, Mary A. Roth
-
Patent number: 10909473
Abstract: A method of identifying location data in a data set comprises generating a data sample from the data set, training a plurality of models with the data sample to identify the location data in the data set, and applying the data set to the trained models to determine the location data within the data set. The plurality of models includes one or more first models to identify primary attributes of the location data indicating a geographical area and one or more second models to identify secondary attributes of the location data used to determine corresponding primary attributes.
Type: Grant
Filed: January 9, 2018
Date of Patent: February 2, 2021
Assignee: International Business Machines Corporation
Inventors: Shilpi Ahuja, Rafael J. Z. Bastidas, Rashmi Gangadharaiah, Mary A. Roth
-
Publication number: 20200218741
Abstract: A system and method are provided for inferring location attributes from data entries. The method comprises for data entries in a structured data set format, a computer system selecting a sample of rows. The computer system then identifies columns containing geospatial and temporal information based on the column headings. The computer system next identifies location information within the structured data set. The computer system determines implied location information based on the identified location information. The computer system derives location values based on the identified and implied location information using consolidation rules, resulting in a final set of location attributes for the data entries. The computer system then associates the final set of location attributes with the data entries.
Type: Application
Filed: January 4, 2019
Publication date: July 9, 2020
Inventors: Shilpi Ahuja, Thomas Kemp, Charles D. Wolfson
-
Patent number: 10671577
Abstract: Merging synonymous entities from multiple structured sources into a dataset includes receiving a first set of paired terms from a first authoritative source for a domain and a second set of paired terms from a second authoritative source for the domain. The first set of paired terms is compared to the second set of paired terms with a similarity assessment based on a clustering statistical algorithm to identify paired terms from the first set of paired terms that share a synonymous term with one or more paired terms from the second set of paired terms. The paired terms associated with the synonymous term are merged and a dataset is generated that associates a normalized version of the synonymous term with any terms included in the merged paired terms.
Type: Grant
Filed: September 23, 2016
Date of Patent: June 2, 2020
Assignee: International Business Machines Corporation
Inventors: Shilpi Ahuja, Sheng Hua Bao, Rashmi Gangadharaiah
-
Patent number: 10572526
Abstract: Relationship extraction between descriptors in one or more lists of weather condition descriptors, and adverse event descriptors within unstructured data sources using natural language processing. Medical condition descriptor may be a descriptor that may be used to further extract relationships between weather condition descriptors and adverse event descriptors. A data object is generated, according to a data model, based on the extracted relationships between the descriptors. A set of candidate unstructured documents containing the extracted relationship between the descriptors is retrieved and filtered by selecting unstructured documents that include a precautionary measure descriptor. The filtered precautionary measure descriptors are presented to a user in a summarized message to a user device.
Type: Grant
Filed: June 28, 2019
Date of Patent: February 25, 2020
Assignee: International Business Machines Corporation
Inventors: Shilpi Ahuja, Sheng Hua Bao, Rashmi Gangadharaiah
-
Patent number: 10558695
Abstract: Relationship extraction between descriptors in one or more lists of weather condition descriptors, and adverse event descriptors within unstructured data sources using natural language processing. Medical condition descriptor may be a descriptor that may be used to further extract relationships between weather condition descriptors and adverse event descriptors. A data object is generated, according to a data model, based on the extracted relationships between the descriptors. A set of candidate unstructured documents containing the extracted relationship between the descriptors is retrieved and filtered by selecting unstructured documents that include a precautionary measure descriptor. The filtered precautionary measure descriptors are presented to a user in a summarized message to a user device.
Type: Grant
Filed: May 30, 2017
Date of Patent: February 11, 2020
Assignee: International Business Machines Corporation
Inventors: Shilpi Ahuja, Sheng Hua Bao, Rashmi Gangadharaiah
-
Publication number: 20190317957
Abstract: Relationship extraction between descriptors in one or more lists of weather condition descriptors, and adverse event descriptors within unstructured data sources using natural language processing. Medical condition descriptor may be a descriptor that may be used to further extract relationships between weather condition descriptors and adverse event descriptors. A data object is generated, according to a data model, based on the extracted relationships between the descriptors. A set of candidate unstructured documents containing the extracted relationship between the descriptors is retrieved and filtered by selecting unstructured documents that include a precautionary measure descriptor. The filtered precautionary measure descriptors are presented to a user in a summarized message to a user device.
Type: Application
Filed: June 28, 2019
Publication date: October 17, 2019
Inventors: Shilpi Ahuja, Sheng Hua Bao, Rashmi Gangadharaiah
-
Patent number: 10331659
Abstract: A mechanism is provided for automatically detecting and cleansing erroneous concepts in an aggregated knowledge base. A graph data structure representing the concept present in a portion of the natural language content is generated. The graph data structure is analyzed to determine whether or not the graph data structure comprises one or more concept conflicts in association with a set of nodes in the graph data structure, the one or more concept conflicts are associated with the set of nodes if two or more nodes represent separate and distinct concepts. Responsive to determining that there are one or more concept conflicts due to there being two or more nodes representing separate and distinct concepts, the two or more nodes are split into separate distinct concepts within the knowledge base.
Type: Grant
Filed: September 6, 2016
Date of Patent: June 25, 2019
Assignee: International Business Machines Corporation
Inventors: Shilpi Ahuja, Sheng Hua Bao, Rashmi Gangadharaiah
-
Publication number: 20180349326
Abstract: Relationship extraction between descriptors in one or more lists of weather condition descriptors, and adverse event descriptors within unstructured data sources using natural language processing. Medical condition descriptor may be a descriptor that may be used to further extract relationships between weather condition descriptors and adverse event descriptors. A data object is generated, according to a data model, based on the extracted relationships between the descriptors. A set of candidate unstructured documents containing the extracted relationship between the descriptors is retrieved and filtered by selecting unstructured documents that include a precautionary measure descriptor. The filtered precautionary measure descriptors are presented to a user in a summarized message to a user device.
Type: Application
Filed: May 30, 2017
Publication date: December 6, 2018
Inventors: Shilpi Ahuja, Sheng Hua Bao, Rashmi Gangadharaiah
-
Publication number: 20180150769
Abstract: A method of identifying location data in a data set comprises generating a data sample from the data set, training a plurality of models with the data sample to identify the location data in the data set, and applying the data set to the trained models to determine the location data within the data set. The plurality of models includes one or more first models to identify primary attributes of the location data indicating a geographical area and one or more second models to identify secondary attributes of the location data used to determine corresponding primary attributes.
Type: Application
Filed: January 9, 2018
Publication date: May 31, 2018
Inventors: Shilpi Ahuja, Rafael J.Z. Bastidas, Rashmi Gangadharaiah, Mary A. Roth
-
Publication number: 20180150765
Abstract: A method of identifying location data in a data set comprises generating a data sample from the data set, training a plurality of models with the data sample to identify the location data in the data set, and applying the data set to the trained models to determine the location data within the data set. The plurality of models includes one or more first models to identify primary attributes of the location data indicating a geographical area and one or more second models to identify secondary attributes of the location data used to determine corresponding primary attributes.
Type: Application
Filed: November 29, 2016
Publication date: May 31, 2018
Inventors: Shilpi Ahuja, Rafael J.Z. Bastidas, Rashmi Gangadharaiah, Mary A. Roth
-
Publication number: 20180089300
Abstract: Merging synonymous entities from multiple structured sources into a dataset includes receiving a first set of paired terms from a first authoritative source for a domain and a second set of paired terms from a second authoritative source for the domain. The first set of paired terms is compared to the second set of paired terms with a similarity assessment based on a clustering statistical algorithm to identify paired terms from the first set of paired terms that share a synonymous term with one or more paired terms from the second set of paired terms. The paired terms associated with the synonymous term are merged and a dataset is generated that associates a normalized version of the synonymous term with any terms included in the merged paired terms.
Type: Application
Filed: September 23, 2016
Publication date: March 29, 2018
Inventors: Shilpi Ahuja, Sheng Hua Bao, Rashmi Gangadharaiah
-
Publication number: 20180067981
Abstract: A mechanism is provided for automatically detecting and cleansing erroneous concepts in an aggregated knowledge base. A graph data structure representing the concept present in a portion of the natural language content is generated. The graph data structure is analyzed to determine whether or not the graph data structure comprises one or more concept conflicts in association with a set of nodes in the graph data structure, the one or more concept conflicts are associated with the set of nodes if two or more nodes represent separate and distinct concepts. Responsive to determining that there are one or more concept conflicts due to there being two or more nodes representing separate and distinct concepts, the two or more nodes are split into separate distinct concepts within the knowledge base.
Type: Application
Filed: September 6, 2016
Publication date: March 8, 2018
Inventors: Shilpi Ahuja, Sheng Hua Bao, Rashmi Gangadharaiah