Patents by Inventor Shilpi Ahuja

Shilpi Ahuja has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11704345
    Abstract: A system and method are provided for inferring location attributes from data entries. The method comprises for data entries in a structured data set format, a computer system selecting a sample of rows. The computer system then identifies columns containing geospatial and temporal information based on the column headings. The computer system next identifies location information within the structured data set. The computer system determines implied location information based on the identified location information. The computer system derives location values based on the identified and implied location information using consolidation rules, resulting in a final set of location attributes for the data entries. The computer system then associates the final set of location attributes with the data entries.
    Type: Grant
    Filed: January 4, 2019
    Date of Patent: July 18, 2023
    Assignee: International Business Machines Corporation
    Inventors: Shilpi Ahuja, Thomas Kemp, Charles D. Wolfson
  • Patent number: 10956456
    Abstract: A method of identifying location data in a data set comprises generating a data sample from the data set, training a plurality of models with the data sample to identify the location data in the data set, and applying the data set to the trained models to determine the location data within the data set. The plurality of models includes one or more first models to identify primary attributes of the location data indicating a geographical area and one or more second models to identify secondary attributes of the location data used to determine corresponding primary attributes.
    Type: Grant
    Filed: November 29, 2016
    Date of Patent: March 23, 2021
    Assignee: International Business Machines Corporation
    Inventors: Shilpi Ahuja, Rafael J. Z. Bastidas, Rashmi Gangadharaiah, Mary A. Roth
  • Patent number: 10909473
    Abstract: A method of identifying location data in a data set comprises generating a data sample from the data set, training a plurality of models with the data sample to identify the location data in the data set, and applying the data set to the trained models to determine the location data within the data set. The plurality of models includes one or more first models to identify primary attributes of the location data indicating a geographical area and one or more second models to identify secondary attributes of the location data used to determine corresponding primary attributes.
    Type: Grant
    Filed: January 9, 2018
    Date of Patent: February 2, 2021
    Assignee: International Business Machines Corporation
    Inventors: Shilpi Ahuja, Rafael J. Z. Bastidas, Rashmi Gangadharaiah, Mary A. Roth
  • Publication number: 20200218741
    Abstract: A system and method are provided for inferring location attributes from data entries. The method comprises for data entries in a structured data set format, a computer system selecting a sample of rows. The computer system then identifies columns containing geospatial and temporal information based on the column headings. The computer system next identifies location information within the structured data set. The computer system determines implied location information based on the identified location information. The computer system derives location values based on the identified and implied location information using consolidation rules, resulting in a final set of location attributes for the data entries. The computer system then associates the final set of location attributes with the data entries.
    Type: Application
    Filed: January 4, 2019
    Publication date: July 9, 2020
    Inventors: Shilpi Ahuja, Thomas Kemp, Charles D. Wolfson
  • Patent number: 10671577
    Abstract: Merging synonymous entities from multiple structured sources into a dataset includes receiving a first set of paired terms from a first authoritative source for a domain and a second set of paired terms from a second authoritative source for the domain. The first set of paired terms is compared to the second set of paired terms with a similarity assessment based on a clustering statistical algorithm to identify paired terms from the first set of paired terms that share a synonymous term with one or more paired terms from the second set of paired terms. The paired terms associated with the synonymous term are merged and a dataset is generated that associates a normalized version of the synonymous term with any terms included in the merged paired terms.
    Type: Grant
    Filed: September 23, 2016
    Date of Patent: June 2, 2020
    Assignee: International Business Machines Corporation
    Inventors: Shilpi Ahuja, Sheng Hua Bao, Rashmi Gangadharaiah
  • Patent number: 10572526
    Abstract: Relationship extraction between descriptors in one or more lists of weather condition descriptors, and adverse event descriptors within unstructured data sources using natural language processing. Medical condition descriptor may be a descriptor that may be used to further extract relationships between weather condition descriptors and adverse event descriptors. A data object is generated, according to a data model, based on the extracted relationships between the descriptors. A set of candidate unstructured documents containing the extracted relationship between the descriptors is retrieved and filtered by selecting unstructured documents that include a precautionary measure descriptor. The filtered precautionary measure descriptors are presented to a user in a summarized message to a user device.
    Type: Grant
    Filed: June 28, 2019
    Date of Patent: February 25, 2020
    Assignee: International Business Machines Corporation
    Inventors: Shilpi Ahuja, Sheng Hua Bao, Rashmi Gangadharaiah
  • Patent number: 10558695
    Abstract: Relationship extraction between descriptors in one or more lists of weather condition descriptors, and adverse event descriptors within unstructured data sources using natural language processing. Medical condition descriptor may be a descriptor that may be used to further extract relationships between weather condition descriptors and adverse event descriptors. A data object is generated, according to a data model, based on the extracted relationships between the descriptors. A set of candidate unstructured documents containing the extracted relationship between the descriptors is retrieved and filtered by selecting unstructured documents that include a precautionary measure descriptor. The filtered precautionary measure descriptors are presented to a user in a summarized message to a user device.
    Type: Grant
    Filed: May 30, 2017
    Date of Patent: February 11, 2020
    Assignee: International Business Machines Corporation
    Inventors: Shilpi Ahuja, Sheng Hua Bao, Rashmi Gangadharaiah
  • Publication number: 20190317957
    Abstract: Relationship extraction between descriptors in one or more lists of weather condition descriptors, and adverse event descriptors within unstructured data sources using natural language processing. Medical condition descriptor may be a descriptor that may be used to further extract relationships between weather condition descriptors and adverse event descriptors. A data object is generated, according to a data model, based on the extracted relationships between the descriptors. A set of candidate unstructured documents containing the extracted relationship between the descriptors is retrieved and filtered by selecting unstructured documents that include a precautionary measure descriptor. The filtered precautionary measure descriptors are presented to a user in a summarized message to a user device.
    Type: Application
    Filed: June 28, 2019
    Publication date: October 17, 2019
    Inventors: Shilpi Ahuja, Sheng Hua Bao, Rashmi Gangadharaiah
  • Patent number: 10331659
    Abstract: A mechanism is provided for automatically detecting and cleansing erroneous concepts in an aggregated knowledge base. A graph data structure representing the concept present in a portion of the natural language content is generated. The graph data structure is analyzed to determine whether or not the graph data structure comprises one or more concept conflicts in association with a set of nodes in the graph data structure, the one or more concept conflicts are associated with the set of nodes if two or more nodes represent separate and distinct concepts. Responsive to determining that there are one or more concept conflicts due to there being two or more nodes representing separate and distinct concepts, the two or more nodes are split into separate distinct concepts within the knowledge base.
    Type: Grant
    Filed: September 6, 2016
    Date of Patent: June 25, 2019
    Assignee: International Business Machines Corporation
    Inventors: Shilpi Ahuja, Sheng Hua Bao, Rashmi Gangadharaiah
  • Publication number: 20180349326
    Abstract: Relationship extraction between descriptors in one or more lists of weather condition descriptors, and adverse event descriptors within unstructured data sources using natural language processing. Medical condition descriptor may be a descriptor that may be used to further extract relationships between weather condition descriptors and adverse event descriptors. A data object is generated, according to a data model, based on the extracted relationships between the descriptors. A set of candidate unstructured documents containing the extracted relationship between the descriptors is retrieved and filtered by selecting unstructured documents that include a precautionary measure descriptor. The filtered precautionary measure descriptors are presented to a user in a summarized message to a user device.
    Type: Application
    Filed: May 30, 2017
    Publication date: December 6, 2018
    Inventors: Shilpi Ahuja, Sheng Hua Bao, Rashmi Gangadharaiah
  • Publication number: 20180150769
    Abstract: A method of identifying location data in a data set comprises generating a data sample from the data set, training a plurality of models with the data sample to identify the location data in the data set, and applying the data set to the trained models to determine the location data within the data set. The plurality of models includes one or more first models to identify primary attributes of the location data indicating a geographical area and one or more second models to identify secondary attributes of the location data used to determine corresponding primary attributes.
    Type: Application
    Filed: January 9, 2018
    Publication date: May 31, 2018
    Inventors: Shilpi Ahuja, Rafael J.Z. Bastidas, Rashmi Gangadharaiah, Mary A. Roth
  • Publication number: 20180150765
    Abstract: A method of identifying location data in a data set comprises generating a data sample from the data set, training a plurality of models with the data sample to identify the location data in the data set, and applying the data set to the trained models to determine the location data within the data set. The plurality of models includes one or more first models to identify primary attributes of the location data indicating a geographical area and one or more second models to identify secondary attributes of the location data used to determine corresponding primary attributes.
    Type: Application
    Filed: November 29, 2016
    Publication date: May 31, 2018
    Inventors: Shilpi Ahuja, Rafael J.Z. Bastidas, Rashmi Gangadharaiah, Mary A. Roth
  • Publication number: 20180089300
    Abstract: Merging synonymous entities from multiple structured sources into a dataset includes receiving a first set of paired terms from a first authoritative source for a domain and a second set of paired terms from a second authoritative source for the domain. The first set of paired terms is compared to the second set of paired terms with a similarity assessment based on a clustering statistical algorithm to identify paired terms from the first set of paired terms that share a synonymous term with one or more paired terms from the second set of paired terms. The paired terms associated with the synonymous term are merged and a dataset is generated that associates a normalized version of the synonymous term with any terms included in the merged paired terms.
    Type: Application
    Filed: September 23, 2016
    Publication date: March 29, 2018
    Inventors: Shilpi Ahuja, Sheng Hua Bao, Rashmi Gangadharaiah
  • Publication number: 20180067981
    Abstract: A mechanism is provided for automatically detecting and cleansing erroneous concepts in an aggregated knowledge base. A graph data structure representing the concept present in a portion of the natural language content is generated. The graph data structure is analyzed to determine whether or not the graph data structure comprises one or more concept conflicts in association with a set of nodes in the graph data structure, the one or more concept conflicts are associated with the set of nodes if two or more nodes represent separate and distinct concepts. Responsive to determining that there are one or more concept conflicts due to there being two or more nodes representing separate and distinct concepts, the two or more nodes are split into separate distinct concepts within the knowledge base.
    Type: Application
    Filed: September 6, 2016
    Publication date: March 8, 2018
    Inventors: Shilpi Ahuja, Sheng Hua Bao, Rashmi Gangadharaiah