Patents by Inventor Michael Hirsch

Michael Hirsch has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

EFFICIENT CALCULATION OF SIMILARITY SEARCH VALUES AND DIGEST BLOCK BOUNDARIES FOR DATA DEDUPLICATION

Publication number: 20140279952

Abstract: For efficient calculation of both similarity search values and boundaries of digest blocks in data deduplication, input data is partitioned into chunks, and for each chunk a set of rolling hash values is calculated. A single linear scan of the rolling hash values is used to produce both similarity search values and boundaries of the digest blocks of the chunk.
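
The single-scan idea in this abstract can be illustrated with a minimal sketch. It is not the patented method: the polynomial rolling hash, the choice of the k largest hash values as similarity search values, and the bit-mask boundary test are all assumptions made for illustration, as are the names `rolling_hashes`, `scan_chunk`, `BASE`, and `MOD`.

```python
import heapq
from typing import List, Tuple

BASE = 257             # assumed hash base
MOD = (1 << 61) - 1    # assumed modulus (a Mersenne prime)


def rolling_hashes(data: bytes, window: int) -> List[int]:
    """Polynomial rolling hash of every `window`-byte substring of `data`."""
    if len(data) < window:
        return []
    top = pow(BASE, window - 1, MOD)  # weight of the byte leaving the window
    h = 0
    for b in data[:window]:
        h = (h * BASE + b) % MOD
    out = [h]
    for i in range(window, len(data)):
        # Drop the oldest byte, shift, and add the newest byte.
        h = ((h - data[i - window] * top) * BASE + data[i]) % MOD
        out.append(h)
    return out


def scan_chunk(data: bytes, window: int = 16, k: int = 4,
               boundary_mask: int = 0x3F) -> Tuple[List[int], List[int]]:
    """One linear pass over the rolling hashes yields both outputs."""
    top_k: List[int] = []        # min-heap holding the k largest hashes so far
    boundaries: List[int] = []   # end offsets of digest blocks
    for pos, h in enumerate(rolling_hashes(data, window)):
        # Similarity search values: keep the k largest hash values (assumed rule).
        if len(top_k) < k:
            heapq.heappush(top_k, h)
        else:
            heapq.heappushpop(top_k, h)
        # Digest block boundary: low bits of the same hash are zero (assumed rule).
        if h & boundary_mask == 0:
            boundaries.append(pos + window)
    return sorted(top_k, reverse=True), boundaries
```

Calling `scan_chunk(chunk)` returns a pair: the similarity values in descending order and the list of boundary offsets, both derived from the same pass over the hash values.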

Type: Application

Filed: March 15, 2013

Publication date: September 18, 2014

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Shay H. AKIRAV, Lior ARONOVICH, Shira BEN-DOR, Michael HIRSCH, Ofer LENEMAN
Incremental modification of an error detection code background of the invention

Patent number: 8839062

Abstract: Exemplary method, system, and computer program product embodiments for an incremental modification of an error detection code operation are provided. In one embodiment, by way of example only, for a data block that requires a first error detection code (EDC) value to be calculated and verified, and that is undergoing modification of at least one randomly positioned sub-block that becomes available and is modified in independent time intervals, a second EDC value is calculated for each of the randomly positioned sub-blocks. An incremental effect of the second EDC value is applied for calculating the first EDC value and for recalculating the first EDC value upon replacing at least one of the randomly positioned sub-blocks. The resource consumption is proportional to the size of the randomly positioned sub-blocks that are added or modified. Additional system and computer program product embodiments are disclosed and provide related advantages.
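
A small sketch can make the incremental update concrete. The abstract does not name a specific error detection code, so this example substitutes a simple polynomial checksum (the block read as a base-256 number, reduced modulo a prime); the functions `edc` and `update_edc` and the constants `P` and `B` are hypothetical. The point it shows is that replacing a sub-block only requires hashing the old and new sub-block contents, not the whole block.

```python
P = (1 << 61) - 1   # assumed prime modulus
B = 256             # one byte per base-256 digit


def edc(block: bytes) -> int:
    """Polynomial checksum: the block as a base-256 number, reduced mod P."""
    value = 0
    for byte in block:
        value = (value * B + byte) % P
    return value


def update_edc(old_edc: int, block_len: int, offset: int,
               old_sub: bytes, new_sub: bytes) -> int:
    """Recompute the block checksum after replacing one sub-block in place.

    Only the sub-block bytes are scanned; the modular power costs
    O(log block_len) multiplications.
    """
    if len(old_sub) != len(new_sub):
        raise ValueError("in-place replacement requires equal lengths")
    # Positional weight of the sub-block's last byte within the whole block.
    shift = pow(B, block_len - offset - len(new_sub), P)
    delta = (edc(new_sub) - edc(old_sub)) % P
    return (old_edc + delta * shift) % P
```

For a block of length n, replacing bytes `[offset, offset + L)` and calling `update_edc` should yield the same value as running `edc` over the modified block from scratch.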

Type: Grant

Filed: January 11, 2012

Date of Patent: September 16, 2014

Assignee: International Business Machines Corporation

Inventors: Lior Aronovich, Michael Hirsch, Shmuel T. Klein, Yair Toaff
Computation of a remainder by division using pseudo-remainders

Patent number: 8819098

Abstract: Methods, computer systems, and computer program products for calculating a remainder by division of a sequence of bytes interpreted as a first number by a second number are provided. A pseudo-remainder by division associated with a first subsequence of the sequence of bytes is calculated. A property of this pseudo-remainder is that the first subsequence of the sequence of bytes, interpreted as a third number, and the pseudo-remainder by division have the same remainder by division when divided by the second number. A second subsequence of the sequence of bytes interpreted as the first number is appended to the pseudo-remainder, interpreted as a sequence of bytes, so as to create a sequence of bytes interpreted as a fourth number. The first number and the fourth number have the same remainder by division when divided by the second number.
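
The congruence described in this abstract can be checked with a short sketch. It assumes big-endian byte order and, for simplicity, uses the fully reduced remainder as the pseudo-remainder at each step; the abstract only requires a value with the same remainder as the prefix, so the patented pseudo-remainder may differ. The function name and the `step` parameter are hypothetical.

```python
def remainder_by_pseudo_remainders(data: bytes, divisor: int, step: int = 4) -> int:
    """Remainder of `data` (read as a big-endian number) modulo `divisor`.

    The prefix processed so far is replaced by a short byte string with the
    same remainder, and the next `step` bytes are appended to it.
    """
    carry = b""  # pseudo-remainder of the prefix, kept as bytes
    for start in range(0, len(data), step):
        piece = carry + data[start:start + step]   # the "fourth number"
        r = int.from_bytes(piece, "big") % divisor
        carry = r.to_bytes((r.bit_length() + 7) // 8, "big")
    return int.from_bytes(carry, "big")
```

The result should equal `int.from_bytes(data, "big") % divisor`, because replacing a prefix by any number congruent to it leaves the remainder of the whole sequence unchanged once the same suffix is appended.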

Type: Grant

Filed: November 23, 2010

Date of Patent: August 26, 2014

Assignee: International Business Machines Corporation

Inventors: Michael Hirsch, Shmuel T. Klein, Yair Toaff
Data Recovery Scheme Based on Data Backup Status

Publication number: 20140201480

Abstract: Machines, systems and methods for increasing data resiliency in a computing system are provided. The method comprises distinguishing between first data and second data stored in one or more data storage mediums, wherein the first data is more vulnerable than the second data for the purpose of recovering lost data; and recovering the first data before recovering the second data. The method further comprises increasing redundancy protection for the first data to increase chances for data recovery by way of data reconstruction; and decreasing redundancy protection for the first data, after the first data has been backed up at least once.

Type: Application

Filed: January 14, 2013

Publication date: July 17, 2014

Applicant: International Business Machines Corporation

Inventors: Michael E. Factor, Itzhack Goldberg, Michael Hirsch, Ronen I. Kat, Neil Sondhi
PACKING DEDUPLICATED DATA IN A SELF-CONTAINED DEDUPLICATED REPOSITORY

Publication number: 20140195495

Abstract: Deduplicated data is packed in a self-contained deduplicated repository having unique data blocks with each being referenced by a globally unique identifier (GUID). The self-contained deduplicated repository has information regarding both deduplicated data files and the unique data blocks of each of the deduplicated data files and a master GUID list containing a location of each of the unique data blocks.

Type: Application

Filed: November 7, 2013

Publication date: July 10, 2014

Applicant: International Business Machines Corporation

Inventors: Shay H. AKIRAV, Michael HIRSCH, Ofer LENEMAN
CONTROLLING SEGMENT SIZE DISTRIBUTION IN HASH-BASED DEDUPLICATION

Publication number: 20140188828

Abstract: Segment sizes are controlled by setting the size of a segment boundary in a hash-based deduplication system. A subsequence of size K of a sequence of characters S is set. An increasing sequence of n probabilities and a corresponding sequence of n decreasingly restrictive logical tests are chosen to be applied on the sequence of characters S. Segment boundaries are set by using the sequence of the decreasingly restrictive logical tests by deciding to declare a segment boundary at a current position if one of the sequence of the decreasingly restrictive logical tests, with a corresponding probability of the sequence of n probabilities, returns a true value when applied on the sequence of characters S.

Type: Application

Filed: January 2, 2013

Publication date: July 3, 2014

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Michael HIRSCH, Shmuel T. KLEIN, Yair TOAFF
OPTIMIZING A PARTITION IN DATA DEDUPLICATION

Publication number: 20140188818

Abstract: For optimizing a partition of a data block into matching and non-matching segments in data deduplication using a processor device in a computing environment, an optimal calculation operation is applied in polynomial time to the matching segments for selecting a globally optimal subset of a set of matching segments according to overhead considerations for minimizing an overall size of a deduplicated file by determining a trade off between a time complexity and a space complexity.

Type: Application

Filed: January 2, 2013

Publication date: July 3, 2014

Applicant: International Business Machines Corporation

Inventors: Michael HIRSCH, Ariel J. ISH-SHALOM, Shmuel T. KLEIN
Systems and methods for searching of storage data with reduced bandwidth requirements

Patent number: 8725705

Abstract: Systems and methods enabling search of a repository for the location of data that is similar to input data, using a defined measure of similarity, in a time that is independent of the size of the repository and linear in a size of the input data, and a space that is proportional to a small fraction of the size of the repository. Additionally, remote operations are accomplished with significantly reduced system bandwidth by implementing remote differencing operations.

Type: Grant

Filed: July 29, 2005

Date of Patent: May 13, 2014

Assignee: International Business Machines Corporation

Inventors: Michael Hirsch, Haim Bitner, Lior Aronovich, Ron Asher, Eitan Bachmat, Shmuel T. Klein
METHOD AND SYSTEM FOR PROCESSING DATA

Publication number: 20140101114

Abstract: Methods, computer systems, and computer program products for processing data in a computing environment are provided. The computer environment for data deduplication storage receives a plurality of write operations for deduplication storage of the data. The data is buffered in a plurality of buffers with overflow temporarily stored to a memory hierarchy when the data received for deduplication storage is sequential or non-sequential. The data is accumulated and updated in the plurality of buffers per a data structure, the data structure serving as a fragment map between the plurality of buffers and a plurality of user file locations. The data is restructured in the plurality of buffers to form a complete sequence of a required sequence size. The data is provided as at least one stream to a stream-based deduplication algorithm for processing and storage.

Type: Application

Filed: October 9, 2012

Publication date: April 10, 2014

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Shay H. AKIRAV, Ron EDELSTEIN, Michael HIRSCH, Ariel J. ISH-SHALOM, Liran LOYA, Itai TZUR
EFFICIENT FILE RECLAMATION IN DEDUPLICATING VIRTUAL MEDIA

Publication number: 20140089269

Abstract: Expired files in the deduplicating virtual media are selectively erased using a backup application for notifying a backup repository of which expired files are no longer required. The space of the expired files is reclaimed for reuse. Virtual space of the expired files is reserved for allowing the backup application to seek past the reclaimed space to subsequent data in the deduplicating virtual media.

Type: Application

Filed: September 24, 2012

Publication date: March 27, 2014

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Shay H. AKIRAV, Michael HIRSCH
EFFICIENT FILE RECLAMATION IN DEDUPLICATING VIRTUAL MEDIA

Publication number: 20140089275

Abstract: Expired files in the deduplicating virtual media are selectively erased using a backup application for notifying a backup repository of which expired files are no longer required. The space of the expired files is reclaimed for reuse. Virtual space of the expired files is reserved for allowing the backup application to seek past the reclaimed space to subsequent data in the deduplicating virtual media.

Type: Application

Filed: October 29, 2013

Publication date: March 27, 2014

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Shay H. AKIRAV, Michael HIRSCH
Efficient construction of synthetic backups within deduplication storage system

Patent number: 8682873

Abstract: A deduplication storage system enables new input data to be deduplicated with data of synthetic backups already constructed, and for this purpose efficiently calculates deduplication digests for synthetic backups being constructed, based on already existing digests of data referenced by the synthetic backups. For each input data segment of the plurality of input data segments of a synthetic backup being constructed, a plurality of deduplication digests of stored data segments, referenced by the input data segment, is retrieved from an index. Each input data segment is partitioned into each of a plurality of fixed-sized data sub-segments. A calculation is performed producing a deduplication digest for a data sub-segment, where the calculation is based on the retrieved deduplication digests of the plurality of stored data sub-segments referenced by the input data sub-segment.

Type: Grant

Filed: December 1, 2010

Date of Patent: March 25, 2014

Assignee: International Business Machines Corporation

Inventors: Lior Aronovich, Michael Hirsch, Yair Toaff
Efficient construction of synthetic backups within deduplication storage system

Patent number: 8682854

Abstract: A deduplication storage system enables new input data to be deduplicated with data of synthetic backups already constructed, and for this purpose efficiently calculates deduplication digests for synthetic backups being constructed, based on already existing digests of data referenced by the synthetic backups. For each input data segment of the plurality of input data segments of a synthetic backup being constructed, a plurality of deduplication digests of stored data segments, referenced by the input data segment, is retrieved from an index. Each input data segment is partitioned into each of a plurality of fixed-sized data sub-segments. A calculation is performed producing a deduplication digest for a data sub-segment, where the calculation is based on the retrieved deduplication digests of the plurality of stored data sub-segments referenced by the input data sub-segment.

Type: Grant

Filed: June 4, 2012

Date of Patent: March 25, 2014

Assignee: International Business Machines Corporation

Inventors: Lior Aronovich, Michael Hirsch, Yair Toaff
SUB-BLOCK PARTITIONING FOR HASH-BASED DEDUPLICATION

Publication number: 20140012822

Abstract: Sub-block partitioning for hash-based deduplication is performed by defining a minimal size and maximum size of the sub-block. For each boundary start position of the sub-block, starting a search, after the minimal size of the sub-block, for a boundary position of a subsequent sub-block by using multiple search criteria to test hash values that are calculated during the search. If one of the multiple search criteria is satisfied by one of the hash values, declaring the position of the hash value as a boundary end position of the sub-block. If the maximum size of the sub-block is reached prior to satisfying one of the multiple search criteria, declaring a position of an alternative one of the hash values that is selected based upon another one of the multiple search criteria as the boundary end position of the sub-block.
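
The search described in this abstract can be sketched as follows. This is an illustration under stated assumptions, not the patented procedure: the two criteria are assumed to be bit-mask tests of differing strictness, each window hash is recomputed with `zlib.crc32` instead of being rolled (for brevity), and all parameter names and default values are hypothetical.

```python
import zlib
from typing import List


def partition(data: bytes, min_size: int = 32, max_size: int = 256,
              window: int = 16, primary_mask: int = 0x7F,
              fallback_mask: int = 0x1F) -> List[int]:
    """Return the end offsets of the sub-blocks of `data`."""
    ends: List[int] = []
    start = 0
    while start < len(data):
        limit = min(start + max_size, len(data))
        cut = limit          # used when no criterion fires
        fallback = None      # last position meeting the weaker criterion
        # The search begins only after the minimal sub-block size.
        for pos in range(start + min_size, limit + 1):
            h = zlib.crc32(data[max(0, pos - window):pos])
            if h & primary_mask == 0:    # stricter test: declare a boundary
                cut = pos
                break
            if h & fallback_mask == 0:   # weaker test: remember as alternative
                fallback = pos
        else:
            # Maximum size reached without a primary match: use the alternative.
            if fallback is not None and limit - start == max_size:
                cut = fallback
        ends.append(cut)
        start = cut
    return ends
```

Every sub-block except possibly the last is between `min_size` and `max_size` bytes long, and a forced cut at `max_size` happens only when neither criterion was met inside the search range.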

Type: Application

Filed: July 3, 2012

Publication date: January 9, 2014

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Lior ARONOVICH, Michael HIRSCH
PACKING DEDUPLICATED DATA INTO FINITE-SIZED CONTAINERS

Publication number: 20130339316

Abstract: Deduplicated data is packed into finite-sized containers. A similarity score is calculated between files of the deduplicated data that are compared for similarity. The similarity score is used for grouping the similarly compared files of the deduplicated data into subsets for destaging each of the subsets from a deduplication system to a finite-sized container.
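
One way to picture similarity-based grouping is the greedy sketch below. The abstract does not define the similarity score or the grouping strategy, so this example assumes Jaccard similarity over each file's set of block digests and a container capacity measured in unique blocks; `jaccard`, `pack`, and `capacity` are hypothetical names.

```python
from typing import Dict, List, Set


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Shared blocks divided by total distinct blocks."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def pack(files: Dict[str, Set[str]], capacity: int) -> List[List[str]]:
    """Greedily group similar files so each group's unique blocks fit a container.

    A file larger than `capacity` still receives a container of its own.
    """
    remaining = dict(files)
    containers: List[List[str]] = []
    while remaining:
        # Seed each container with the largest unplaced file (name breaks ties).
        seed = max(remaining, key=lambda n: (len(remaining[n]), n))
        blocks = set(remaining.pop(seed))
        group = [seed]
        while True:
            fits = [n for n in remaining
                    if len(blocks | remaining[n]) <= capacity]
            if not fits:
                break
            # Add the file most similar to what the container already holds.
            best = max(fits, key=lambda n: (jaccard(blocks, remaining[n]), n))
            blocks |= remaining.pop(best)
            group.append(best)
        containers.append(group)
    return containers
```

Files that share many blocks end up in the same container, so shared blocks are stored once per container rather than once per file.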

Type: Application

Filed: June 19, 2012

Publication date: December 19, 2013

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Michael HIRSCH, Thorsten KRAUSE
SCALABLE DEDUPLICATION SYSTEM WITH SMALL BLOCKS

Publication number: 20130290278

Abstract: Exemplary method, system, and computer program product embodiments for scalable data deduplication working with small data chunks in a computing environment are provided. In one embodiment, by way of example only, for each of the small data chunks, a signature is generated based on a combination of a representation of characters that appear in the small data chunk with a representation of frequencies of the small data chunk. The signature is used to help in selecting the data to be deduplicated. Additional system and computer program product embodiments are disclosed and provide related advantages.

Type: Application

Filed: June 27, 2013

Publication date: October 31, 2013

Inventors: Lior ARONOVICH, Ron ASHER, Michael HIRSCH, Shmuel T. KLEIN, Ehud MEIRI, Yair TOAFF
SCALABLE DEDUPLICATION SYSTEM WITH SMALL BLOCKS

Publication number: 20130290279

Abstract: Exemplary method, system, and computer program product embodiments for scalable data deduplication working with small data chunks in a computing environment are provided. In one embodiment, by way of example only, for each of the small data chunks, a signature is generated based on a combination of a representation of characters that appear in the small data chunk with a representation of frequencies of the small data chunk. The signature is used to help in selecting the data to be deduplicated. Additional system and computer program product embodiments are disclosed and provide related advantages.

Type: Application

Filed: June 27, 2013

Publication date: October 31, 2013

Inventors: Lior ARONOVICH, Ron ASHER, Michael HIRSCH, Shmuel T. KLEIN, Ehud MEIRI, Yair TOAFF
METHOD AND DEVICE FOR RECOVERING A DIGITAL IMAGE FROM A SEQUENCE OF OBSERVED DIGITAL IMAGES

Publication number: 20130242129

Abstract: A computer-implemented method for recovering a digital image (x) from a sequence of observed digital images (y1, . . . , yT), includes: obtaining an observed digital image (yt); estimating a point spread function (ft) based on the observed image (yt); estimating the recovered digital image (x), based on the estimated point spread function (ft) and the observed image (yt); and repeating the above steps. In order to correct optical aberrations of a lens, a point spread function of the lens may be used.

Type: Application

Filed: September 28, 2011

Publication date: September 19, 2013

Inventors: Stefan Harmeling, Michael Hirsch, Suvrit Sra, Bernhard Schölkopf, Christian J. Schuler
CREATION OF SYNTHETIC BACKUPS WITHIN DEDUPLICATION STORAGE SYSTEM BY A BACKUP APPLICATION

Publication number: 20130232117

Abstract: A deduplication storage system and a backup application create a synthetic backup. Metadata instructions are provided to the deduplication storage system. Each of the metadata instructions specifies the data segment of an originating backup and a designated location of the data segment in the synthetic backup. A set of metadata instructions is transformed into a transformed set of metadata instructions.

Type: Application

Filed: March 13, 2013

Publication date: September 5, 2013

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Lior ARONOVICH, Michael HIRSCH, Yair TOAFF
CREATION OF SYNTHETIC BACKUPS WITHIN DEDUPLICATION STORAGE SYSTEM

Publication number: 20130232119

Abstract: A deduplication storage system and a backup application create a synthetic backup. Metadata instructions are provided to the deduplication storage system. Each of the metadata instructions specifies the data segment of an originating backup and a designated location of the data segment in the synthetic backup. Each of the metadata instructions is processed by locating those data sub-segments in the deduplication storage system specified by the data segment in the metadata instruction, creating metadata references to each of the data sub-segments, and adding the metadata references to metadata of the synthetic backup being created.

Type: Application

Filed: March 13, 2013

Publication date: September 5, 2013

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Lior ARONOVICH, Michael HIRSCH, Yair TOAFF
