Patents by Inventor Lior Aronovich

Lior Aronovich has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Utilizing global digests caching in similarity based data deduplication

Patent number: 10013202

Abstract: Input data is partitioned into data chunks and digest values are calculated for each of the data chunks. The positions of similar repository data are found in a repository of data for each of the data chunks. The input digests of the input data are matched with the repository digests contained in the global digests cache for locating data matches. The processor prefers to match the input digests of the input data with the repository digests contained in the global digests cache which are of the similar repository data, rather than repository digests which are of other repository data that was not determined as similar to the input data chunks. The positions of the similar repository data are used to locate and linearly load into the global digests cache, digests and digest block boundaries of the similar repository data.

Type: Grant

Filed: November 30, 2017

Date of Patent: July 3, 2018

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Shay H. Akirav, Lior Aronovich
Tuning global digests caching in a data deduplication system

Patent number: 10007610

Abstract: Input data is partitioned into data chunks and digest values are calculated for each of the data chunks. The positions of similar repository data are found in a repository of data for each of the data chunks. The input digests of the input data are matched with the repository digests contained in the global digests cache for locating data matches. A sample of the repository digests is loaded into a search mechanism within the global digests cache. The positions of the similar repository data are used to locate and linearly load into the global digests cache, digests and digest block boundaries of the similar repository data in a sequence corresponding to a placement order of calculated values of the digests of the similar repository data.

Type: Grant

Filed: November 30, 2017

Date of Patent: June 26, 2018

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Shay H. Akirav, Lior Aronovich
Global digests caching in a data deduplication system

Patent number: 10007672

Abstract: For utilizing a global digests cache in deduplication processing in a data deduplication system using a processor device in a computing environment, input data is partitioned into data chunks and digest values are calculated for each of the data chunks. The positions of similar repository data are found in a repository of data for each of the data chunks. The input digests of the input data are matched with the repository digests contained in the global digests cache for locating data matches. The positions of the similar repository data are used to locate and linearly load into the global digests cache, digests and digest block boundaries of the similar repository data in a sequence corresponding to a placement order of calculated values of the digests of the similar repository data.

Type: Grant

Filed: November 30, 2017

Date of Patent: June 26, 2018

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Shay H. Akirav, Lior Aronovich
Reducing resource consumption of a similarity index in data deduplication

Patent number: 9984123

Abstract: Embodiments for reducing resource consumption of a similarity index in data deduplication by a processor. In a similarity index of a deduplication system configured to process snapshots, only a latest generation of repository data is represented in the similarity index where a single latest representative value of an index entry of a snapshot is maintained in the similarity index. Implicit deletion is applied in the similarity index such that the similarly index entry is not removed or overwritten until a change with associated data of the similarity index entry is detected. A subset of bytes of the representative value is maintained in a similarity index entry thereby reducing an input/output (I/O) load on the similarity index of the deduplication system.

Type: Grant

Filed: November 25, 2015

Date of Patent: May 29, 2018

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventor: Lior Aronovich
Conversion of forms of user data segment IDs in a deduplication system

Patent number: 9965487

Abstract: Various embodiments for managing data in a data storage having data deduplication. For a back reference data structure incorporating reference information for at least one user data segment to a storage block, using a plurality of hash functions to convert between a plurality of form types of user data segment identification (ID's) representative of the at least one user data segment.

Type: Grant

Filed: June 18, 2015

Date of Patent: May 8, 2018

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventor: Lior Aronovich
Back referencing of deduplicated data

Patent number: 9965488

Abstract: Various embodiments for managing data in a data storage having data deduplication. A back reference data structure is configured for user data segments as a mechanism to identify an affected storage block to which information in the back reference data structure refers. The back reference data structure is initialized such that a resolution of the back reference data structure diminishes as a number of the user data segments referencing the affected storage block increases.

Type: Grant

Filed: June 18, 2015

Date of Patent: May 8, 2018

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Shay H. Akirav, Lior Aronovich, Yariv Bachar, Shira Ben-Dor, Rafael Buchbinder, Amir Kredi
FEEDBACK MECHANISM FOR CONTROLLING DISPATCHING WORK TASKS IN A MULTI-TIER STORAGE ENVIRONMENT

Publication number: 20180107517

Abstract: A method, a computer program product, and a computer system for controlling dispatching work tasks in a multi-tier storage environment. A computer system receives storage demands of work tasks. The computer system determines placement and migration policies for data in storage tiers in a storage system. The computer system prepares the storage tiers for meeting the storage demands of work tasks, based on the placement and migration policies. The computer system determines a state of preparation of the storage tiers for meeting the storage demands of work tasks. The computer system determines a list including work tasks that can proceed and work tasks that cannot proceed, based on the state of the preparation. The computer system modifies a schedule of the work tasks, based on the list.

Type: Application

Filed: October 14, 2016

Publication date: April 19, 2018

Inventors: LIOR ARONOVICH, SAMUEL M. Black
FEEDBACK MECHANISM FOR CONTROLLING DISPATCHING WORK TASKS IN A MULTI-TIER STORAGE ENVIRONMENT

Publication number: 20180107518

Abstract: A method for controlling dispatching work tasks in a multi-tier storage environment. A computer system receives storage demands of work tasks. The computer system determines placement and migration policies for data in storage tiers in a storage system. The computer system prepares the storage tiers for meeting the storage demands of work tasks, based on the placement and migration policies. The computer system determines a state of preparation of the storage tiers for meeting the storage demands of work tasks. The computer system determines a list including work tasks that can proceed and work tasks that cannot proceed, based on the state of the preparation. The computer system modifies a schedule of the work tasks, based on the list.

Type: Application

Filed: December 17, 2017

Publication date: April 19, 2018

Inventors: LIOR ARONOVICH, SAMUEL M. Black
CREATION OF SYNTHETIC BACKUPS WITHIN DEDUPLICATION STORAGE SYSTEM BY A BACKUP APPLICATION

Publication number: 20180095986

Abstract: Input backup data is deduplicated with data of a synthetic backup previously constructed by a deduplication storage system. A synthetic backup is constructed by processing metadata instructions provided by a backup application. Deduplication digests are calculated based on the data of the synthetic backup and the deduplication digests are stored in a digests index. When new backup data is processed, deduplication digests of the new data are calculated and searched in the digests index. Matching digests of previously constructed synthetic backups are located in the digests index. Each of the located matching digest references stored data are included in the synthetic backup, and the stored data is similar to the input backup data. Data matches are found in the input backup data and data in the synthetic backup.

Type: Application

Filed: November 17, 2017

Publication date: April 5, 2018

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Lior ARONOVICH, Michael Hirsch, Yair Toaff
RESTORING DISTRIBUTED SHARED MEMORY DATA CONSISTENCY WITHIN A RECOVERY PROCESS FROM A CLUSTER NODE FAILURE

Publication number: 20180095848

Abstract: A DSM component is organized as a matrix of page. The data structure of a set of data structures occupies a column in the matrix of pages. A recovery file is maintained in a persistent storage. The recovery file consists of entries and each one of the entries corresponds to a column in the matrix of pages by a location of each one of the entries. The set of data structures is stored in the DSM component and in the persistent storage. Incorporated into each one of the plurality of entries in the recovery file is an indication if an associated column in the matrix of pages is assigned with the data structure of the set of data structures; and additionally incorporated into each one of the plurality of entries in the recovery file are identifying key properties of the data structure of the set of data structures.

Type: Application

Filed: November 22, 2017

Publication date: April 5, 2018

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Lior ARONOVICH, Asaf LEVY, Liran LOYA
GLOBAL DIGESTS CACHING IN A DATA DEDUPLICATION SYSTEM

Publication number: 20180089220

Abstract: For utilizing a global digests cache in deduplication processing in a data deduplication system using a processor device in a computing environment, input data is partitioned into data chunks and digest values are calculated for each of the data chunks. The positions of similar repository data are found in a repository of data for each of the data chunks. The input digests of the input data are matched with the repository digests contained in the global digests cache for locating data matches. The positions of the similar repository data are used to locate and linearly load into the global digests cache, digests and digest block boundaries of the similar repository data in a sequence corresponding to a placement order of calculated values of the digests of the similar repository data.

Type: Application

Filed: November 30, 2017

Publication date: March 29, 2018

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Shay H. AKIRAV, Lior ARONOVICH
DATA STRUCTURES FOR DIGESTS MATCHING IN A DATA DEDUPLICATION SYSTEM

Publication number: 20180089219

Abstract: Data matches are calculated in a data deduplication system by matching input and repository digests using a digest based data matching process where the reference digests corresponding to a repository interval of data identified as similar to an input interval of data are loaded into two data structures. The dual data structures include a sequential buffer containing a plurality of digest entries in a sequence corresponding to a placement order of calculated values of the reference digests, the placement order of the calculated values of the reference digests correlative to an order in which input digest values were calculated such that the plurality of digests are stored in a linear form independent of a deduplicated form by which data the plurality of digests describe is stored.

Type: Application

Filed: November 10, 2017

Publication date: March 29, 2018

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventor: Lior ARONOVICH
UTILIZING GLOBAL DIGESTS CACHING IN SIMILARITY BASED DATA DEDUPLICATION

Publication number: 20180088855

Abstract: Input data is partitioned into data chunks and digest values are calculated for each of the data chunks. The positions of similar repository data are found in a repository of data for each of the data chunks. The input digests of the input data are matched with the repository digests contained in the global digests cache for locating data matches. The processor prefers to match the input digests of the input data with the repository digests contained in the global digests cache which are of the similar repository data, rather than repository digests which are of other repository data that was not determined as similar to the input data chunks. The positions of the similar repository data are used to locate and linearly load into the global digests cache, digests and digest block boundaries of the similar repository data.

Type: Application

Filed: November 30, 2017

Publication date: March 29, 2018

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Shay H. AKIRAV, Lior ARONOVICH
TUNING GLOBAL DIGESTS CACHING IN A DATA DEDUPLICATION SYSTEM

Publication number: 20180081812

Abstract: Input data is partitioned into data chunks and digest values are calculated for each of the data chunks. The positions of similar repository data are found in a repository of data for each of the data chunks. The input digests of the input data are matched with the repository digests contained in the global digests cache for locating data matches. A sample of the repository digests is loaded into a search mechanism within the global digests cache. The positions of the similar repository data are used to locate and linearly load into the global digests cache, digests and digest block boundaries of the similar repository data in a sequence corresponding to a placement order of calculated values of the digests of the similar repository data.

Type: Application

Filed: November 30, 2017

Publication date: March 22, 2018

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Shay H. AKIRAV, Lior ARONOVICH
DEDUPLICATING INPUT BACKUP DATA WITH DATA OF A SYNTHETIC BACKUP PREVIOUSLY CONSTRUCTED BY A DEDUPLICATION STORAGE SYSTEM

Publication number: 20180081898

Abstract: Input backup data is deduplicated with data of a synthetic backup previously constructed by a deduplication storage system. A synthetic backup is constructed by processing metadata instructions provided by a backup application. Deduplication digests are calculated based on the data of the synthetic backup and the deduplication digests are stored in a digests index. When new backup data is processed, deduplication digests of the new data are calculated and searched in the digests index. Matching digests of previously constructed synthetic backups are located in the digests index. Each of the located matching digest references stored data are included in the synthetic backup, and the stored data is similar to the input backup data. Data matches are found in the input backup data and data in the synthetic backup.

Type: Application

Filed: November 29, 2017

Publication date: March 22, 2018

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Lior ARONOVICH, Michael HIRSCH, Yair TOAFF
Producing alternative segmentations of data into blocks in a data deduplication system

Patent number: 9922042

Abstract: For producing secondary segmentations of data into blocks and corresponding digests for input data in a data deduplication system using a processor device in a computing environment, digests are calculated for an input data chunk using a primary segmentation into blocks. Secondary segmentations are produced for each of the data mismatches based on reference data, and used to calculate further data matches. The primary segmentation and the corresponding primary digests are stored for the input data chunk.

Type: Grant

Filed: July 15, 2013

Date of Patent: March 20, 2018

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventor: Lior Aronovich
Global digests caching in a data deduplication system

Patent number: 9892127

Abstract: For utilizing a global digests cache in deduplication processing in a data deduplication system using a processor device in a computing environment, input data is partitioned into data chunks and digest values are calculated for each of the data chunks. The positions of similar repository data are found in a repository of data for each of the data chunks. The repository digests of the similar repository data are located and loaded into the global digests cache. The global digests cache contains digests previously loaded by other deduplication processes. The input digests of the input data are matched with the repository digests contained in the global digests cache for locating data matches.

Type: Grant

Filed: July 15, 2013

Date of Patent: February 13, 2018

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Shay H. Akirav, Lior Aronovich
Tuning global digests caching in a data deduplication system

Patent number: 9892048

Abstract: Input data is partitioned into data chunks and digest values are calculated for each of the data chunks. The positions of similar repository data are found in a repository of data for each of the data chunks. The repository digests of the similar repository data are located and loaded into the global digests cache. The global digests cache contains digests previously loaded by other deduplication processes. The input digests of the input data are matched with the repository digests contained in the global digests cache for locating data matches. A sample of the repository digests is loaded into a search mechanism within the global digests cache.

Type: Grant

Filed: July 15, 2013

Date of Patent: February 13, 2018

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Shay H. Akirav, Lior Aronovich
Utilizing global digests caching in similarity based data deduplication

Patent number: 9891857

Abstract: Input data is partitioned into data chunks and digest values are calculated for each of the data chunks. The positions of similar repository data are found in a repository of data for each of the data chunks. The repository digests of the similar repository data are located and loaded into the global digests cache. The global digests cache contains digests previously loaded by other deduplication processes. The input digests of the input data are matched with the repository digests contained in the global digests cache for locating data matches. The processor prefers to match the input digests of the input data with the repository digests contained in the global digests cache which are of the similar repository data, rather than repository digests which are of other repository data that was not determined as similar to the input data chunks.

Type: Grant

Filed: July 15, 2013

Date of Patent: February 13, 2018

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Shay H. Akirav, Lior Aronovich
Deduplicating input backup data with data of a synthetic backup previously constructed by a deduplication storage system

Patent number: 9858286

Abstract: Input backup data is deduplicated with data of a synthetic backup previously constructed by a deduplication storage system. A synthetic backup is constructed by processing metadata instructions provided by a backup application. Deduplication digests are calculated based on the data of the synthetic backup and the deduplication digests are stored in a digests index. When new backup data is processed, deduplication digests of the new data are calculated and searched in the digests index. Matching digests of previously constructed synthetic backups are located in the digests index. Each of the located matching digest references stored data are included in the synthetic backup, and the stored data is similar to the input backup data. Data matches are found in the input backup data and data in the synthetic backup.

Type: Grant

Filed: March 13, 2013

Date of Patent: January 2, 2018

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Lior Aronovich, Michael Hirsch, Yair Toaff

prev … 3 4 5 6 7 8 9 10 11 … next