Patents by Inventor CRAIG A. STATCHUK

CRAIG A. STATCHUK has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11243919
    Abstract: A mechanism is provided for preparing a high-quality data repository. Data and related metadata from a set of data sources are ingested thereby forming a set of unprepared data. The set of unprepared data is transformed based on a set of functions into a set of transformed data. A set of semantic text descriptions that detail the transformation of the set of unprepared data to the set of transformed data is generated using a first set of semantic associations, a second set of semantic associations, and a set of semantic transformation associations. The set of transformed data is tested against one or more governance policies that tracks data lineage to ultimately show that prepared data is in compliance. Responsive to the set of transformed data adhering to the one or more governance policies, a high-quality data repository is automatically built using the transformed data.
    Type: Grant
    Filed: October 16, 2015
    Date of Patent: February 8, 2022
    Assignee: International Business Machines Corporation
    Inventors: Neil E. Bartlett, Craig A. Statchuk
  • Patent number: 11226995
    Abstract: Techniques are described for creating geospatial elements based on geographical data. In one example, a method for creating geospatial elements based on geographical data includes collecting one or more geographical indicators comprised in a set of operational data, the geographical indicators indicating one or more geographical areas. The method further includes collecting, from a set of geographical data, geospatial data associated with the one or more geographical areas indicated by the one or more geographical indicators. The method further includes generating one or more aggregated geospatial elements based on the one or more geographical indicators and the geospatial data.
    Type: Grant
    Filed: July 11, 2013
    Date of Patent: January 18, 2022
    Assignee: International Business Machines Corporation
    Inventors: Steven R. McDougall, Craig A. Statchuk
  • Patent number: 11194772
    Abstract: A mechanism is provided for preparing a high-quality data repository. Data and related metadata from a set of data sources are ingested thereby forming a set of unprepared data. The set of unprepared data is transformed based on a set of functions into a set of transformed data. A set of semantic text descriptions that detail the transformation of the set of unprepared data to the set of transformed data is generated using a first set of semantic associations, a second set of semantic associations, and a set of semantic transformation associations. The set of transformed data is tested against one or more governance policies that tracks data lineage to ultimately show that prepared data is in compliance. Responsive to the set of transformed data adhering to the one or more governance policies, a high-quality data repository is automatically built using the transformed data.
    Type: Grant
    Filed: January 31, 2017
    Date of Patent: December 7, 2021
    Assignee: International Business Machines Corporation
    Inventors: Neil E. Bartlett, Craig A. Statchuk
  • Patent number: 10936678
    Abstract: A mechanism is provided for advanced search-term disambiguation. Responsive to detecting a search term being entered into an autocomplete search field of a search engine, a determination is made of a set of terms from a storage device upon which a search is to be performed. For each term in the set of terms, a determination is made of a semantic distance and independence (Sdi) score to each term candidate in a set of term candidates. The results of the semantic distance and independence (Sdi) scores are ranked and then pruned down to a predetermined number of autocomplete results. A subset of term candidates associated with the predetermined number of autocomplete results from the set of term candidates is then presented to a user as autocomplete suggestions to the search.
    Type: Grant
    Filed: November 29, 2018
    Date of Patent: March 2, 2021
    Assignee: International Business Machines Corporation
    Inventors: Alexandru Pulbere, Craig A. Statchuk, Changying Sun
  • Patent number: 10747824
    Abstract: A method, system and computer program product for building a data query engine. Initial taxonomies that describe and categorize data are built by expert users (e.g., data scientists) employing machine learning algorithms. The data is also indexed and stored in an index. Queries are then received from non-expert users to query the data based on data categorization from built taxonomies and the indexing. After the queries are executed using the machine learning algorithms in an environment (e.g., Hadoop®), the results of the queries are rated for relevance, precision and accuracy. The machine learning algorithms are also rated based on the number of successful queries. Those machine learning algorithms with a rating above a threshold are identified to be utilized to scan new data to be stored in the index to provide a new environment that replaces the initial environment.
    Type: Grant
    Filed: December 6, 2016
    Date of Patent: August 18, 2020
    Assignee: International Business Machines Corporation
    Inventor: Craig A. Statchuk
  • Patent number: 10546026
    Abstract: A mechanism is provided for advanced search-term disambiguation. Responsive to detecting a search term being entered into an autocomplete search field of a search engine, a determination is made of a set of terms from a storage device upon which a search is to be performed. For each term in the set of terms, a determination is made of a semantic distance and independence (Sdi) score to each term candidate in a set of term candidates. The results of the semantic distance and independence (Sdi) scores are ranked and then pruned down to a predetermined number of autocomplete results. A subset of term candidates associated with the predetermined number of autocomplete results from the set of term candidates is then presented to a user as autocomplete suggestions to the search.
    Type: Grant
    Filed: March 31, 2017
    Date of Patent: January 28, 2020
    Assignee: International Business Machines Corporation
    Inventors: Alexandru Pulbere, Craig A. Statchuk, Changying Sun
  • Publication number: 20190171675
    Abstract: A mechanism is provided for advanced search-term disambiguation. Responsive to detecting a search term being entered into an autocomplete search field of a search engine, a determination is made of a set of terms from a storage device upon which a search is to be performed. For each term in the set of terms, a determination is made of a semantic distance and independence (Sdi) score to each term candidate in a set of term candidates. The results of the semantic distance and independence (Sdi) scores are ranked and then pruned down to a predetermined number of autocomplete results. A subset of term candidates associated with the predetermined number of autocomplete results from the set of term candidates is then presented to a user as autocomplete suggestions to the search.
    Type: Application
    Filed: November 29, 2018
    Publication date: June 6, 2019
    Inventors: Alexandru Pulbere, Craig A. Statchuk, Changying Sun
  • Publication number: 20180285457
    Abstract: A mechanism is provided for advanced search-term disambiguation. Responsive to detecting a search term being entered into an autocomplete search field of a search engine, a determination is made of a set of terms from a storage device upon which a search is to be performed. For each term in the set of terms, a determination is made of a semantic distance and independence (Sdi) score to each term candidate in a set of term candidates. The results of the semantic distance and independence (Sdi) scores are ranked and then pruned down to a predetermined number of autocomplete results. A subset of term candidates associated with the predetermined number of autocomplete results from the set of term candidates is then presented to a user as autocomplete suggestions to the search.
    Type: Application
    Filed: March 31, 2017
    Publication date: October 4, 2018
    Inventors: Alexandru Pulbere, Craig A. Statchuk, Changying Sun
  • Publication number: 20180157749
    Abstract: A method, system and computer program product for building a data query engine. Initial taxonomies that describe and categorize data are built by expert users (e.g., data scientists) employing machine learning algorithms. The data is also indexed and stored in an index. Queries are then received from non-expert users to query the data based on data categorization from built taxonomies and the indexing. After the queries are executed using the machine learning algorithms in an environment (e.g., Hadoop®), the results of the queries are rated for relevance, precision and accuracy. The machine learning algorithms are also rated based on the number of successful queries. Those machine learning algorithms with a rating above a threshold are identified to be utilized to scan new data to be stored in the index to provide a new environment that replaces the initial environment.
    Type: Application
    Filed: December 6, 2016
    Publication date: June 7, 2018
    Inventor: Craig A. Statchuk
  • Patent number: 9984107
    Abstract: A system for joining modeled data with new data sources. Modeled tabular data containing categorical and metric data is received. New data containing structured or non-structured data is received. The new data is indexed and data clusters and/or data facets are identified. Data clusters and data facets are also identified within the received modeled data. A textual query from a user is received and semantically indexed into keywords. Data clusters and data facets within the new data are matched to the keywords in the indexed query. Data clusters and data facets within the modeled data are matched to the keywords in the indexed query. A table is generated containing the modeled data and matching data clusters and data facets from the new data.
    Type: Grant
    Filed: December 18, 2014
    Date of Patent: May 29, 2018
    Assignee: International Business Machines Corporation
    Inventors: Marius Cosma, Yandu S. Oppacher, Craig A. Statchuk
  • Patent number: 9984108
    Abstract: A system for joining modeled data with new data sources. Modeled tabular data containing categorical and metric data is received. New data containing structured or non-structured data is received. The new data is indexed and data clusters and/or data facets are identified. Data clusters and data facets are also identified within the received modeled data. A textual query from a user is received and semantically indexed into keywords. Data clusters and data facets within the new data are matched to the keywords in the indexed query. Data clusters and data facets within the modeled data are matched to the keywords in the indexed query. A table is generated containing the modeled data and matching data clusters and data facets from the new data.
    Type: Grant
    Filed: June 11, 2015
    Date of Patent: May 29, 2018
    Assignee: International Business Machines Corporation
    Inventors: Marius Cosma, Yandu S. Oppacher, Craig A. Statchuk
  • Patent number: 9798864
    Abstract: A computer program product, a method, and an information handling system is provided to track and retrieve licensing agreements of features integrated into an application by detecting during development of an application installation of one or more features having a licensing agreement associated with using the one or more features; storing in a database a mapping between the one or more installed features to a corresponding licensing agreement for each of the one or more installed features; and providing support to retrieve, for each installed feature, the corresponding licensing agreement.
    Type: Grant
    Filed: March 6, 2015
    Date of Patent: October 24, 2017
    Assignee: International Business Machines Corporation
    Inventor: Craig A. Statchuk
  • Patent number: 9734162
    Abstract: Embodiments of the present invention disclose a method, computer program product, and system for optimizing spatial queries, the method comprising a computer receiving a spatial data set, and a target spatial shape. The computer determining a distribution data set which aligns with the geometry of the spatial data set. The computer performing a query to determine a list of full and partial shapes of the spatial data set located within the target spatial shape. The computer determining a ratio for a determined partial shape of the spatial data set using the distribution data set. The computer determining an apportioned value of a variable of the spatial data set within the determined partial shape. The computer summarizing into a result set the value of the variable within the determined list of full and partial shapes of the spatial data set.
    Type: Grant
    Filed: March 25, 2016
    Date of Patent: August 15, 2017
    Assignee: International Business Machines Corporation
    Inventor: Craig A. Statchuk
  • Patent number: 9679399
    Abstract: A method and system for generating a map using a computer is based on data and weighted factors to minimize corresponding projection distortions. The method and system includes determining visualization goals from analyzing a set of datasets for a map using the computer. A set of visualization characteristics are calculated for each dataset based on the visualization goals using the computer. The visualization characteristics are analyzed to weight factors for each of the datasets. Each of the weighted factors is adjusted based on the relevance of each of the datasets for visualization of the map. An aggregate vector of weighted factors is calculated based on all of the datasets, and the map for visualization is generated based on the aggregate vector of weighted factors.
    Type: Grant
    Filed: August 10, 2016
    Date of Patent: June 13, 2017
    Assignee: International Business Machines Corporation
    Inventors: Peter Bak, Gilad M. Saadoun, Harold J. Ship, Craig A. Statchuk, Avi Yaeli
  • Publication number: 20170140016
    Abstract: A mechanism is provided for preparing a high-quality data repository. Data and related metadata from a set of data sources are ingested thereby forming a set of unprepared data. The set of unprepared data is transformed based on a set of functions into a set of transformed data. A set of semantic text descriptions that detail the transformation of the set of unprepared data to the set of transformed data is generated using a first set of semantic associations, a second set of semantic associations, and a set of semantic transformation associations. The set of transformed data is tested against one or more governance policies that tracks data lineage to ultimately show that prepared data is in compliance. Responsive to the set of transformed data adhering to the one or more governance policies, a high-quality data repository is automatically built using the transformed data.
    Type: Application
    Filed: January 31, 2017
    Publication date: May 18, 2017
    Inventors: Neil E. Bartlett, Craig A. Statchuk
  • Publication number: 20170109418
    Abstract: A mechanism is provided for preparing a high-quality data repository. Data and related metadata from a set of data sources are ingested thereby forming a set of unprepared data. The set of unprepared data is transformed based on a set of functions into a set of transformed data. A set of semantic text descriptions that detail the transformation of the set of unprepared data to the set of transformed data is generated using a first set of semantic associations, a second set of semantic associations, and a set of semantic transformation associations. The set of transformed data is tested against one or more governance policies that tracks data lineage to ultimately show that prepared data is in compliance. Responsive to the set of transformed data adhering to the one or more governance policies, a high-quality data repository is automatically built using the transformed data.
    Type: Application
    Filed: October 16, 2015
    Publication date: April 20, 2017
    Inventors: Neil E. Bartlett, Craig A. Statchuk
  • Patent number: 9626796
    Abstract: A method and system for generating a map using a computer is based on data and weighted factors to minimize corresponding projection distortions. The method and system includes determining visualization goals from analyzing a set of datasets for a map using the computer. A set of visualization characteristics are calculated for each dataset based on the visualization goals using the computer. The visualization characteristics are analyzed to weight factors for each of the datasets. Each of the weighted factors is adjusted based on the relevance of each of the datasets for visualization of the map. An aggregate vector of weighted factors is calculated based on all of the datasets, and the map for visualization is generated based on the aggregate vector of weighted factors.
    Type: Grant
    Filed: October 28, 2015
    Date of Patent: April 18, 2017
    Assignee: International Business Machines Corporation
    Inventors: Peter Bak, Gilad M. Saadoun, Harold J. Ship, Craig A. Statchuk, Avi Yaeli
  • Patent number: 9536328
    Abstract: A method and system for generating a map using a computer is based on data and weighted factors to minimize corresponding projection distortions. The method and system includes determining visualization goals from analyzing a set of datasets for a map using the computer. A set of visualization characteristics are calculated for each dataset based on the visualization goals using the computer. The visualization characteristics are analyzed to weight factors for each of the datasets. Each of the weighted factors is adjusted based on the relevance of each of the datasets for visualization of the map. An aggregate vector of weighted factors is calculated based on all of the datasets, and the map for visualization is generated based on the aggregate vector of weighted factors.
    Type: Grant
    Filed: January 6, 2015
    Date of Patent: January 3, 2017
    Assignee: International Business Machines Corporation
    Inventors: Peter Bak, Gilad M. Saadoun, Harold J. Ship, Craig A. Statchuk, Avi Yaeli
  • Publication number: 20160350970
    Abstract: A method and system for generating a map using a computer is based on data and weighted factors to minimize corresponding projection distortions. The method and system includes determining visualization goals from analyzing a set of datasets for a map using the computer. A set of visualization characteristics are calculated for each dataset based on the visualization goals using the computer. The visualization characteristics are analyzed to weight factors for each of the datasets. Each of the weighted factors is adjusted based on the relevance of each of the datasets for visualization of the map. An aggregate vector of weighted factors is calculated based on all of the datasets, and the map for visualization is generated based on the aggregate vector of weighted factors.
    Type: Application
    Filed: August 10, 2016
    Publication date: December 1, 2016
    Inventors: Peter Bak, Gilad M. Saadoun, Harold J. Ship, Craig A. Statchuk, Avi Yaeli
  • Publication number: 20160259948
    Abstract: A computer program product, a method, and an information handling system is provided to monitor each user's compliance to a governance policy for each embedded component for each user attempting to execute applications in the system and responsive to detecting a non-compliance by a user for an embedded component, performing a remediation action for the embedded component.
    Type: Application
    Filed: March 6, 2015
    Publication date: September 8, 2016
    Inventor: Craig A. Statchuk