Patents by Inventor Joseph W. Dain

Joseph W. Dain has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240187391
    Abstract: Computer-implemented methods for management of data collection devices. Aspects include creating a cluster of data collection devices and a distributed meta-key manager for the cluster and providing an authentication key for each data collection device to access the distributed meta-key manager. Aspects also include collecting and storing data by one or more of the data collection devices and periodically perform a quorum check for each data collection device of the cluster. Aspects further include updating an operational mode of each data collection device based on the quorum check and offloading the stored data from a data collection device based on successful verification of the stored data and the operational mode of the data collection device.
    Type: Application
    Filed: December 5, 2022
    Publication date: June 6, 2024
    Inventors: Christopher J. Vollmar, Adriana Pellegrini Furnielis, Sarvesh S. Patel, Frank N. Lee, Abhishek Jain, Joseph W. Dain, Daniel DE SOUZA CASALI
  • Publication number: 20230409384
    Abstract: A method, computer system, and a computer program for managing computer jobs in a queue is provided. This comprises extracting metadata from a new job received for processing and upon determining when a similar enriched metadata exists in a database. A job score and storage footprint may then be determined for the new job from the extracted metadata. It is then determined whether the new job can be grouped for processing with any other jobs already placed on a queue. The new job is then added to the queue based on the new job's score and footprint, and whether it can be grouped with other jobs. The queue is then updated and sent to a scheduler for further processing.
    Type: Application
    Filed: June 16, 2022
    Publication date: December 21, 2023
    Inventors: Joseph W. Dain, Simon Lorenz, Piyush Chaudhary, Gero Friedrich Wolf Schmidt, Qais Noorshams, Gregory T. Kishi
  • Patent number: 11755592
    Abstract: Provided are techniques for data search with automated selection of one or more AI inference models and inference label indexing. Metadata for a data object that is stored in a data source is retrieved. The metadata is matched to filtering criteria. The filtering criteria is used to identify an Artificial Intelligence (AI) inference model. The data object is sent to an inference engine for the AI inference model. An inference result is received from the inference engine. The inference result is stored in a database to associate the inference result with the data object. Then, in response to receiving a search request with one or more terms, the one or more terms are matched to the inference result in the database. and the data object and the inference result are returned.
    Type: Grant
    Filed: August 25, 2021
    Date of Patent: September 12, 2023
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Joseph W. Dain, Frank N. Lee, Nilesh Prabhakar Bhosale, Christopher Vollmar
  • Publication number: 20230222116
    Abstract: A method and/or system of managing metadata are disclosed that include connecting a source data storage system (DSS) that stores both data and metadata to a metadata management platform (MMP); scanning metadata records onto the MMP from the DSS; storing metadata attributes for at least one of the group consisting of directories and buckets on the DSS in a look-up table on the MMP; and adding updated metadata attributes to the look-up table on the MMP for each subsequent scan of the DSS.
    Type: Application
    Filed: March 16, 2023
    Publication date: July 13, 2023
    Inventors: Raul E. Saba, Joseph W. Dain, Theodore Timothy Harris, JR., Gregory T. Kishi, Scott Brewer, Sara Megan Coronado
  • Patent number: 11693878
    Abstract: A dataset formatted in a first format is generated, wherein the dataset formatted in the first format includes at least a plurality of data elements that comprise data. A selection is made of machine learning framework, wherein the machine learning framework operates on datasets formatted in a second format that is different from the first format. The dataset is converted from the first format to the second format and links are generated from the converted dataset formatted in the second format to the plurality of data elements of the dataset formatted in the first format to avoid creating additional copies of the plurality of data elements.
    Type: Grant
    Filed: November 19, 2020
    Date of Patent: July 4, 2023
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Joseph W. Dain, Gregory T. Kishi, Deepavali M. Bhagwat, Lukas Rupprecht, Constantine Arnold
  • Patent number: 11675915
    Abstract: A computer-implemented method according to an aspect includes training a cognitive network, utilizing metadata associated with historic data threats, inputting metadata associated with a current data threat into the trained cognitive network, identifying, by the trained cognitive network, one or more stored instances of data determined to be vulnerable to the current data threat, and adjusting one or more security aspects of the one or more stored instances of data determined to be vulnerable to the current data threat.
    Type: Grant
    Filed: April 16, 2019
    Date of Patent: June 13, 2023
    Assignee: International Business Machines Corporation
    Inventors: Nilesh P. Bhosale, Joseph W. Dain, Gregory T. Kishi, Sandeep R. Patil
  • Publication number: 20230171273
    Abstract: Storage network with controller that includes using deep packet inspection to determine whether a computer file or computer object is sensitive and/or critical, and then, if it is sensitive or critical, applying an artificial intelligence algorithm to determine whether there is a potential threat that is represented by the file or object. If there is a potential threat, then a cyber-resiliency workflow is performed to eliminate or at least mitigate the potential threat.
    Type: Application
    Filed: December 1, 2021
    Publication date: June 1, 2023
    Inventors: Mandar Jayant Vaidya, Nilesh Prabhakar Bhosale, Joseph W. Dain, Thomas Keith Clark
  • Patent number: 11636096
    Abstract: A method and/or system of managing metadata are disclosed that include connecting a source data storage system (DSS) that stores both data and metadata to a metadata management platform (MMP); scanning metadata records onto the MMP from the DSS; storing metadata attributes for at least one of the group consisting of directories and buckets on the DSS in a look-up table on the MMP; and adding updated metadata attributes to the look-up table on the MMP for each subsequent scan of the DSS.
    Type: Grant
    Filed: April 30, 2020
    Date of Patent: April 25, 2023
    Assignee: International Business Machines Corporation
    Inventors: Raul E. Saba, Joseph W. Dain, Theodore Timothy Harris, Jr., Gregory T. Kishi, Scott Brewer, Sara Megan Coronado
  • Patent number: 11620189
    Abstract: A system, according to one embodiment, includes: a processor, as well as logic that is integrated with the processor, executable by the processor, or integrated with and executable by the processor. The logic is configured to: send, by the processor, point in time copies of data to an object storage system. A directive for manipulating the point in time copies of the data are also set to the object storage system by the processor. Moreover, the point in time copies of the data are manipulated by a storlet on the object storage system according to the directive.
    Type: Grant
    Filed: July 7, 2021
    Date of Patent: April 4, 2023
    Assignee: International Business Machines Corporation
    Inventors: Robert B. Basham, Joseph W. Dain, Matthew J. Fairhurst
  • Patent number: 11620262
    Abstract: A set of metafiles that stores at least metadata information and provenance information of an artificial intelligence (AI) process is generated, where the AI process is trained with a source data. The set of metafiles is accessed via an application programming interface (API) to the set of metafiles. In response to accessing the set of metafiles, the source data in the set of metafiles is transferred to a cache for processing by the AI process.
    Type: Grant
    Filed: November 30, 2020
    Date of Patent: April 4, 2023
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Frank N Lee, Joseph W. Dain
  • Publication number: 20230074640
    Abstract: Provided is a computer-implemented method, system, and computer program product for detecting and processing a duplicate scene. A processor may receive image data from a plurality of images of a scene into a storage system, the image data including inference labels and contextual information associated with each of the plurality of images. The processor may detect, based on the inference labels and contextual information, that one or more images of the plurality of images is a duplicate of the scene. The processor may execute, in response to detecting that the one or more images of the plurality of images is the duplicate of the scene, an action related to the plurality of images.
    Type: Application
    Filed: September 7, 2021
    Publication date: March 9, 2023
    Inventors: Christopher J. Vollmar, Joseph W. Dain
  • Publication number: 20230061011
    Abstract: Provided are techniques for data search with automated selection of one or more AI inference models and inference label indexing. Metadata for a data object that is stored in a data source is retrieved. The metadata is matched to filtering criteria. The filtering criteria is used to identify an Artificial Intelligence (AI) inference model. The data object is sent to an inference engine for the AI inference model. An inference result is received from the inference engine. The inference result is stored in a database to associate the inference result with the data object. Then, in response to receiving a search request with one or more terms, the one or more terms are matched to the inference result in the database. and the data object and the inference result are returned.
    Type: Application
    Filed: August 25, 2021
    Publication date: March 2, 2023
    Inventors: Joseph W. Dain, Frank N. Lee, Nilesh Prabhakar Bhosale, Christopher Vollmar
  • Patent number: 11429578
    Abstract: A method includes generating a table in a relational database, the table comprising an empty column reserved for a custom tag. The column has a default column name. The method also includes, in response to an instruction to create the custom tag, generating a column mapping that maps a name of the custom tag to the column name. The column mapping is stored outside the relational database and is used by a parser to translate between the name of the custom tag and the column name.
    Type: Grant
    Filed: November 19, 2020
    Date of Patent: August 30, 2022
    Assignee: International Business Machines Corporation
    Inventors: Raul E. Saba, Scott Brewer, Gregory T. Kishi, Sara Megan Coronado, Drew Olson, Joseph W. Dain, Theodore Timothy Harris, Jr.
  • Patent number: 11429674
    Abstract: Provided are a computer program product, method, and system for determining data to redact in an object. Event messages for objects are received. Each event message includes at least one facet, each facet for an instance of data in an object resulting from deep data inspection of the object that is used to determine whether the instance of data should be redacted. The event messages are added to a message queue. Information is added to a database for each of the event messages include the at least one facet for an object in the event message. The database is queried to determine an object having a facet for an instance of data that indicates the instance of data is to be redacted. The instance of data is redacted in the determined object having the facet that indicates the instance of data is to be redacted data.
    Type: Grant
    Filed: November 15, 2018
    Date of Patent: August 30, 2022
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Joseph W. Dain, Nilesh P. Bhosale, Gregory T. Kishi
  • Patent number: 11409900
    Abstract: Provided are a computer program product, method, and system for determining data to redact in an object. Event messages for objects are received. Each event message includes at least one facet, each facet for an instance of data in an object resulting from deep data inspection of the object that is used to determine whether the instance of data should be redacted. The event messages are added to a message queue. A determination is made of an event message in the message queue for an object having a facet for an instance of data that indicates the instance of data is to be redacted. Redaction is performed on the instance of data in the object in the determined event message having the facet indicating the instance of data is to be redacted data.
    Type: Grant
    Filed: November 15, 2018
    Date of Patent: August 9, 2022
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Joseph W. Dain, Nilesh P. Bhosale, Gregory T. Kishi
  • Publication number: 20220171738
    Abstract: A set of metafiles that stores at least metadata information and provenance information of an artificial intelligence (AI) process is generated, where the AI process is trained with a source data. The set of metafiles is accessed via an application programming interface (API) to the set of metafiles. In response to accessing the set of metafiles, the source data in the set of metafiles is transferred to a cache for processing by the AI process.
    Type: Application
    Filed: November 30, 2020
    Publication date: June 2, 2022
    Inventors: Frank N Lee, Joseph W. Dain
  • Publication number: 20220156111
    Abstract: A method includes scanning, by a device of a plurality of devices, a directory of a file system. The directory includes a plurality of subdirectories, and each device of the plurality of devices corresponds to a different queue of a plurality of queues. The method also includes inserting, by the device, a plurality of scanning tasks into a subset of the plurality of queues in a round robin fashion. The plurality of scanning tasks correspond to the plurality of subdirectories and responding to a scanning task of the plurality of scanning tasks includes scanning a subdirectory of the plurality of subdirectories.
    Type: Application
    Filed: November 19, 2020
    Publication date: May 19, 2022
    Inventors: Scott BREWER, Theodore Timothy HARRIS, JR., Sara Megan CORONADO, Raul E. SABA, Drew OLSON, Joseph W. DAIN, Gregory T. KISHI
  • Publication number: 20220156239
    Abstract: A method includes generating a table in a relational database, the table comprising an empty column reserved for a custom tag. The column has a default column name. The method also includes, in response to an instruction to create the custom tag, generating a column mapping that maps a name of the custom tag to the column name. The column mapping is stored outside the relational database and is used by a parser to translate between the name of the custom tag and the column name.
    Type: Application
    Filed: November 19, 2020
    Publication date: May 19, 2022
    Inventors: Raul E. SABA, Scott BREWER, Gregory T. KISHI, Sara Megan CORONADO, Drew OLSON, Joseph W. DAIN, Theodore Timothy HARRIS, JR.
  • Publication number: 20220156276
    Abstract: A dataset formatted in a first format is generated, wherein the dataset formatted in the first format includes at least a plurality of data elements that comprise data. A selection is made of machine learning framework, wherein the machine learning framework operates on datasets formatted in a second format that is different from the first format. The dataset is converted from the first format to the second format and links are generated from the converted dataset formatted in the second format to the plurality of data elements of the dataset formatted in the first format to avoid creating additional copies of the plurality of data elements.
    Type: Application
    Filed: November 19, 2020
    Publication date: May 19, 2022
    Inventors: Joseph W. DAIN, Gregory T. KISHI, Deepavali M. BHAGWAT, Lukas RUPPRECHT, Constantine ARNOLD
  • Publication number: 20220150065
    Abstract: Provided are a computer program product, system, and method for determining an encryption technique for a modified data object to backup. Deep data inspection is performed on an object using a natural language processing module to determine facets for the object. The facets provide facet values for instances of sensitive information in the object. The facet values for the object are processed to determine a cumulative facet value. A determination is made of an encryption technique comprising one of a plurality of different encryption techniques to use on the object based on the cumulative facet value. The determined encryption technique is used to encrypt the object to transfer to a backup storage.
    Type: Application
    Filed: November 11, 2020
    Publication date: May 12, 2022
    Inventors: Gregory T. KISHI, Joseph W. DAIN, Abhishek JAIN, Nilesh Prabhakar BHOSALE