Patents by Inventor Shira Ben Dor
Shira Ben Dor has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 10664448
Abstract: Various embodiments for repository management in a data deduplication system, by a processor device, are provided. Metadata of an inode structure of an entire pre-allocated file system is captured, exported, and compressed from an existing deduplication appliance, the pre-allocated file system comprising a fully padded file system. The exported and compressed metadata of the pre-allocated file system is decompressed and imported into a data deduplication repository of a new deduplication appliance having an identical file system size as within the existing deduplication appliance, to initially configure or subsequently scale the inode structure of a file system of the data deduplication repository of the new deduplication appliance efficiently.
Type: Grant
Filed: October 25, 2017
Date of Patent: May 26, 2020
Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Oded Aviyam, Shira Ben-Dor, Joseph W. Dain, Gil E. Paz
-
Patent number: 9965488
Abstract: Various embodiments for managing data in a data storage having data deduplication. A back reference data structure is configured for user data segments as a mechanism to identify an affected storage block to which information in the back reference data structure refers. The back reference data structure is initialized such that a resolution of the back reference data structure diminishes as a number of the user data segments referencing the affected storage block increases.
Type: Grant
Filed: June 18, 2015
Date of Patent: May 8, 2018
Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Shay H. Akirav, Lior Aronovich, Yariv Bachar, Shira Ben-Dor, Rafael Buchbinder, Amir Kredi
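Illustrative sketch (not taken from the patent): the abstract does not say how the resolution of the back reference structure diminishes. One possible reading, shown here purely as an assumption, is a structure with a fixed number of slots that records exact segment IDs while references are few and falls back to coarser ID ranges once the slots overflow. The class name `BackReference`, the `max_slots` parameter, and the doubling rule are all hypothetical.

```python
class BackReference:
    """Hypothetical back reference for one storage block.

    Tracks which user data segments reference the block. Each slot covers
    `granularity` consecutive segment IDs; granularity doubles (resolution
    halves) whenever the number of slots would exceed `max_slots`.
    """

    def __init__(self, max_slots=4):
        if max_slots < 1:
            raise ValueError("max_slots must be at least 1")
        self.max_slots = max_slots
        self.granularity = 1   # 1 means exact segment IDs are recorded
        self.slots = set()     # each entry is segment_id // granularity

    def add(self, segment_id):
        """Record that `segment_id` references this storage block."""
        self.slots.add(segment_id // self.granularity)
        # Too many references for the current resolution: coarsen and merge.
        while len(self.slots) > self.max_slots:
            self.granularity *= 2
            self.slots = {slot // 2 for slot in self.slots}

    def may_reference(self, segment_id):
        """True if the segment may reference the block.

        At granularity 1 the answer is exact; at coarser granularity
        false positives are possible, false negatives are not.
        """
        return segment_id // self.granularity in self.slots
```

With few references the structure answers exactly; with many it trades precision for bounded size, which is one way to interpret "resolution diminishes as the number of referencing segments increases".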
-
Publication number: 20180046642
Abstract: Various embodiments for repository management in a data deduplication system, by a processor device, are provided. Metadata of an inode structure of an entire pre-allocated file system is captured, exported, and compressed from an existing deduplication appliance, the pre-allocated file system comprising a fully padded file system. The exported and compressed metadata of the pre-allocated file system is decompressed and imported into a data deduplication repository of a new deduplication appliance having an identical file system size as within the existing deduplication appliance, to initially configure or subsequently scale the inode structure of a file system of the data deduplication repository of the new deduplication appliance efficiently.
Type: Application
Filed: October 25, 2017
Publication date: February 15, 2018
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Oded AVIYAM, Shira BEN-DOR, Joseph W. DAIN, Gil E. PAZ
-
Patent number: 9836475
Abstract: Various embodiments for repository management in a data deduplication system, by a processor device, are provided. Metadata of a pre-allocated file system is captured and exported. The exported metadata is then imported into a data deduplication repository for configuring the data deduplication repository with minimum overhead.
Type: Grant
Filed: November 16, 2015
Date of Patent: December 5, 2017
Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Oded Aviyam, Shira Ben-Dor, Joseph W. Dain, Gil E. Paz
-
Publication number: 20170139949
Abstract: Various embodiments for repository management in a data deduplication system, by a processor device, are provided. Metadata of a pre-allocated file system is captured and exported. The exported metadata is then imported into a data deduplication repository for configuring the data deduplication repository with minimum overhead.
Type: Application
Filed: November 16, 2015
Publication date: May 18, 2017
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Oded AVIYAM, Shira BEN-DOR, Joseph W. DAIN, Gil E. PAZ
-
Efficient calculation of similarity search values and digest block boundaries for data deduplication
Patent number: 9600515
Abstract: For efficient calculation of both similarity search values and boundaries of digest blocks in data deduplication, input data is partitioned into chunks, and for each chunk a set of rolling hash values is calculated. A single linear scan of the rolling hash values is used to produce both similarity search values and boundaries of the digest blocks of the chunk. The rolling hash values are used to contribute to the calculation of the similarity search values and to the calculation of the boundaries of the digest blocks.
Type: Grant
Filed: December 16, 2015
Date of Patent: March 21, 2017
Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Shay H. Akirav, Lior Aronovich, Shira Ben-Dor, Michael Hirsch, Ofer Leneman
-
Patent number: 9547662
Abstract: For digest retrieval based on similarity search in deduplication processing in a data deduplication system using a processor device in a computing environment, input data is partitioned into fixed sized data chunks. Similarity elements and digest block boundaries and digest values are calculated for each of the fixed sized data chunks. Matching similarity elements are searched for in a search structure containing the similarity elements for each of the fixed sized data chunks in a repository of data. Positions of similar data are located in the repository. The positions of the similar data are used to locate and load into the memory stored digest values and corresponding stored digest block boundaries of the similar data in the repository. The digest values and the corresponding digest block boundaries of the input data are matched with the stored digest values and the corresponding stored digest block boundaries to find data matches.
Type: Grant
Filed: March 15, 2013
Date of Patent: January 17, 2017
Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Shay H. Akirav, Lior Aronovich, Shira Ben-Dor, Michael Hirsch, Ofer Leneman
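Illustrative sketch (not taken from the patent): the flow in this abstract can be approximated in a few lines of Python. Everything concrete below is an assumption made for illustration: the polynomial rolling hash, the tiny `CHUNK_SIZE` and `WINDOW`, the use of the maximum rolling hash as the single similarity element, the bit-mask boundary rule, SHA-1 as the digest, and the hypothetical `Repository` class with a dictionary as its search structure.

```python
import hashlib

CHUNK_SIZE = 64        # hypothetical fixed chunk size in bytes (toy value)
WINDOW = 8             # hypothetical rolling-hash window in bytes
BOUNDARY_MASK = 0x07   # assumed rule: block ends where the low 3 hash bits are 0
BASE, MOD = 257, (1 << 31) - 1


def rolling_hashes(data):
    """Yield (end_offset, hash) for every WINDOW-byte window of `data`."""
    if len(data) < WINDOW:
        return
    h = 0
    for b in data[:WINDOW]:
        h = (h * BASE + b) % MOD
    yield WINDOW, h
    top = pow(BASE, WINDOW - 1, MOD)
    for i in range(WINDOW, len(data)):
        # Remove the outgoing byte, shift, and add the incoming byte.
        h = ((h - data[i - WINDOW] * top) * BASE + data[i]) % MOD
        yield i + 1, h


def analyze_chunk(chunk):
    """One pass over the rolling hashes gives a similarity element and boundaries."""
    similarity, boundaries = 0, []
    for end, h in rolling_hashes(chunk):
        similarity = max(similarity, h)      # assumed similarity element
        if h & BOUNDARY_MASK == 0:
            boundaries.append(end)
    if not boundaries or boundaries[-1] != len(chunk):
        boundaries.append(len(chunk))        # close the final digest block
    digests, start = [], 0
    for end in boundaries:
        digests.append(hashlib.sha1(chunk[start:end]).hexdigest())
        start = end
    return similarity, boundaries, digests


class Repository:
    """Hypothetical repository: similarity search structure plus stored digests."""

    def __init__(self):
        self.search = {}   # similarity element -> position of a stored chunk
        self.stored = {}   # position -> (boundaries, digests)

    def store(self, position, chunk):
        similarity, boundaries, digests = analyze_chunk(chunk)
        self.search[similarity] = position
        self.stored[position] = (boundaries, digests)

    def find_matches(self, chunk):
        """Return (start, end) ranges of `chunk` whose digest blocks are already stored."""
        similarity, boundaries, digests = analyze_chunk(chunk)
        position = self.search.get(similarity)
        if position is None:
            return []
        stored_bounds, stored_digests = self.stored[position]
        known, start = set(), 0
        for end, digest in zip(stored_bounds, stored_digests):
            known.add((end - start, digest))   # match on block length and digest
            start = end
        matches, start = [], 0
        for end, digest in zip(boundaries, digests):
            if (end - start, digest) in known:
                matches.append((start, end))
            start = end
        return matches
```

A caller would `store` each incoming chunk under its repository position and call `find_matches` on new chunks; only the digests of the one similar stored chunk are loaded and compared, rather than every digest in the repository.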
-
Publication number: 20160371308
Abstract: Various embodiments for managing data in a data storage having data deduplication. A back reference data structure is configured for user data segments as a mechanism to identify an affected storage block to which information in the back reference data structure refers. The back reference data structure is initialized such that a resolution of the back reference data structure diminishes as a number of the user data segments referencing the affected storage block increases.
Type: Application
Filed: June 18, 2015
Publication date: December 22, 2016
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Shay H. AKIRAV, Lior ARONOVICH, Yariv BACHAR, Shira BEN-DOR, Rafael BUCHBINDER, Amir KREDI
-
EFFICIENT CALCULATION OF SIMILARITY SEARCH VALUES AND DIGEST BLOCK BOUNDARIES FOR DATA DEDUPLICATION
Publication number: 20160103868
Abstract: For efficient calculation of both similarity search values and boundaries of digest blocks in data deduplication, input data is partitioned into chunks, and for each chunk a set of rolling hash values is calculated. A single linear scan of the rolling hash values is used to produce both similarity search values and boundaries of the digest blocks of the chunk. The rolling hash values are used to contribute to the calculation of the similarity search values and to the calculation of the boundaries of the digest blocks.
Type: Application
Filed: December 16, 2015
Publication date: April 14, 2016
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Shay H. AKIRAV, Lior ARONOVICH, Shira BEN-DOR, Michael HIRSCH, Ofer LENEMAN
-
Efficient calculation of similarity search values and digest block boundaries for data deduplication
Patent number: 9244937
Abstract: For efficient calculation of both similarity search values and boundaries of digest blocks in data deduplication, input data is partitioned into chunks, and for each chunk a set of rolling hash values is calculated. A single linear scan of the rolling hash values is used to produce both similarity search values and boundaries of the digest blocks of the chunk.
Type: Grant
Filed: March 15, 2013
Date of Patent: January 26, 2016
Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Shay H. Akirav, Lior Aronovich, Shira Ben-Dor, Michael Hirsch, Ofer Leneman
-
Patent number: 8918605
Abstract: A deduplication storage capacity is estimated as a function of an expected deduplication ratio, the expected deduplication ratio being a combined average of a current deduplication ratio and a configured deduplication ratio, the current deduplication ratio depending on the data currently stored in the deduplication storage, and the configured deduplication ratio being an estimate made at a configuration stage of the deduplication computing storage environment.
Type: Grant
Filed: June 4, 2012
Date of Patent: December 23, 2014
Assignee: International Business Machines Corporation
Inventors: Lior Aronovich, Shira Ben-Dor, Aviv Caro, Elena Drobchenko, Samuel Krikler, Ofer Leneman, Asaf Levy, Liran Loya, Dan Melamed, Tzafrir Z. Taub
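Illustrative sketch (not taken from the patent): the abstract says only that the expected ratio is a "combined average" of the current and configured ratios. The weighting below, by the fraction of physical storage already in use, is an assumption chosen for illustration, as are the function names and the choice to express capacity as physical capacity multiplied by the expected ratio.

```python
def expected_dedup_ratio(current_ratio, configured_ratio, used_fraction):
    """Blend the observed and configured deduplication ratios.

    Assumed weighting: the observed (current) ratio counts in proportion to
    the fraction of physical storage already used, and the configured
    estimate counts for the remaining free fraction.
    """
    if not 0.0 <= used_fraction <= 1.0:
        raise ValueError("used_fraction must be between 0 and 1")
    return used_fraction * current_ratio + (1.0 - used_fraction) * configured_ratio


def estimated_nominal_capacity(physical_capacity_tb, current_ratio,
                               configured_ratio, used_fraction):
    """Estimate how much user data (in TB) the storage could hold in total."""
    ratio = expected_dedup_ratio(current_ratio, configured_ratio, used_fraction)
    return physical_capacity_tb * ratio
```

Under this weighting an empty repository relies entirely on the configured estimate, and the estimate converges on the observed ratio as the repository fills. For example, with 100 TB of physical storage half full, an observed ratio of 8 and a configured ratio of 10, the expected ratio is 9 and the estimated nominal capacity is 900 TB.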
-
Publication number: 20140279951
Abstract: For digest retrieval based on similarity search in deduplication processing in a data deduplication system using a processor device in a computing environment, input data is partitioned into fixed sized data chunks. Similarity elements and digest block boundaries and digest values are calculated for each of the fixed sized data chunks. Matching similarity elements are searched for in a search structure containing the similarity elements for each of the fixed sized data chunks in a repository of data. Positions of similar data are located in the repository. The positions of the similar data are used to locate and load into the memory stored digest values and corresponding stored digest block boundaries of the similar data in the repository. The digest values and the corresponding digest block boundaries of the input data are matched with the stored digest values and the corresponding stored digest block boundaries to find data matches.
Type: Application
Filed: March 15, 2013
Publication date: September 18, 2014
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Shay H. AKIRAV, Lior ARONOVICH, Shira BEN-DOR, Michael HIRSCH, Ofer LENEMAN
-
EFFICIENT CALCULATION OF SIMILARITY SEARCH VALUES AND DIGEST BLOCK BOUNDARIES FOR DATA DEDUPLICATION
Publication number: 20140279952
Abstract: For efficient calculation of both similarity search values and boundaries of digest blocks in data deduplication, input data is partitioned into chunks, and for each chunk a set of rolling hash values is calculated. A single linear scan of the rolling hash values is used to produce both similarity search values and boundaries of the digest blocks of the chunk.
Type: Application
Filed: March 15, 2013
Publication date: September 18, 2014
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Shay H. AKIRAV, Lior ARONOVICH, Shira BEN-DOR, Michael HIRSCH, Ofer LENEMAN
-
Patent number: 8533407
Abstract: A deduplication storage capacity is estimated as a function of an expected deduplication ratio, the expected deduplication ratio being a combined average of a current deduplication ratio and a configured deduplication ratio, the current deduplication ratio depending on the data currently stored in the deduplication storage, and the configured deduplication ratio being an estimate made at a configuration stage of the deduplication computing storage environment.
Type: Grant
Filed: December 1, 2010
Date of Patent: September 10, 2013
Assignee: International Business Machines Corporation
Inventors: Lior Aronovich, Shira Ben-Dor, Aviv Caro, Elena Drobchenko, Samuel Krikler, Ofer Leneman, Asaf Levy, Liran Loya, Dan Melamed, Tzafrir Z. Taub
-
Patent number: 8296532
Abstract: A data storage system including at least one storage controller having a first color policy and operative to store data onto a first data storage unit at a primary site as part of a current color of the primary site, at least one storage controller having a second color policy and operative to store data onto a second data storage unit at the primary site as part of the current color, and a color control node operative to provide each of the controllers with new color information while maintaining the integrity of dependent writes across color boundaries.
Type: Grant
Filed: April 25, 2005
Date of Patent: October 23, 2012
Assignee: International Business Machines Corporation
Inventors: Shira Ben-Dor, Harry Butterworth, Amir Kredi, Orit Nissan-Messing, Adam Wolman, Aviad Zlotnick
-
Publication number: 20120246438
Abstract: A deduplication storage capacity is estimated as a function of an expected deduplication ratio, the expected deduplication ratio being a combined average of a current deduplication ratio and a configured deduplication ratio, the current deduplication ratio depending on the data currently stored in the deduplication storage, and the configured deduplication ratio being an estimate made at a configuration stage of the deduplication computing storage environment.
Type: Application
Filed: June 4, 2012
Publication date: September 27, 2012
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Lior ARONOVICH, Shira BEN-DOR, Aviv CARO, Elena DROBCHENKO, Samuel KRIKLER, Ofer LENEMAN, Asaf LEVY, Liran LOYA, Dan MELAMED, Tzafrir Z. TAUB
-
Patent number: 8200916
Abstract: A color control node includes an interface for communicating with multiple storage controllers, wherein the storage controllers maintain a primary storage system at a primary site and a secondary storage system at a secondary site; and wherein the storage controllers maintain a current color and associate all writes with the current color without polling the color control node. The color control node also includes operational capability for issuing a polling command to instruct the storage controllers to poll the color control node for the current color prior to associating each write with a new color; receiving an acknowledgment of receipt of the polling command; changing the current color to a new color responsive to receiving the acknowledgment; issuing a storage command to the storage controllers indicating the new color; and instructing each storage controller to cease polling the color control node for the current color.
Type: Grant
Filed: January 28, 2009
Date of Patent: June 12, 2012
Assignee: International Business Machines Corporation
Inventors: Shira Ben Dor, Amir Kredi, Avied Zlotnick, Henry Butterworth
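Illustrative sketch (not taken from the patent): the sequence of steps listed in this abstract can be traced with a small single-threaded simulation. The class and method names (`StorageController`, `ColorControlNode`, `change_color`, and so on) are hypothetical, colors are modeled as integers, and real controllers would exchange these commands over a network and handle failures, none of which is modeled here.

```python
class StorageController:
    """Hypothetical controller that tags each write with a color."""

    def __init__(self, name, color=0):
        self.name = name
        self.color = color      # locally cached current color
        self.polling = False
        self.node = None

    def start_polling(self, node):
        """Handle the polling command; the return value is the acknowledgment."""
        self.node = node
        self.polling = True
        return True

    def set_color(self, color):
        """Handle the storage command that announces the new color."""
        self.color = color

    def stop_polling(self):
        self.polling = False

    def write(self, data):
        # Normally the cached color is used without contacting the node;
        # while polling, the node is asked for the color before each write.
        color = self.node.current_color if self.polling else self.color
        return (color, data)


class ColorControlNode:
    """Hypothetical node that coordinates a color change across controllers."""

    def __init__(self, controllers, color=0):
        self.controllers = list(controllers)
        self.current_color = color

    def change_color(self):
        # 1. Issue the polling command and collect acknowledgments.
        acks = [c.start_polling(self) for c in self.controllers]
        if not all(acks):
            raise RuntimeError("polling command was not acknowledged")
        # 2. Change the current color only after every acknowledgment.
        self.current_color += 1
        # 3. Issue the storage command indicating the new color.
        for c in self.controllers:
            c.set_color(self.current_color)
        # 4. Instruct each controller to cease polling.
        for c in self.controllers:
            c.stop_polling()
        return self.current_color
```

The ordering is the point of the sketch: because every controller is polling before the color changes, no write can be tagged with a stale cached color while other controllers have already moved to the new one.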
-
Publication number: 20120144149
Abstract: Various embodiments for capacity management in a deduplication computing storage environment by a processor device are provided. A deduplication storage capacity is estimated as a function of an expected deduplication ratio, the expected deduplication ratio being a combined average of a current deduplication ratio and a configured deduplication ratio, the current deduplication ratio depending on the data currently stored in the deduplication storage, and the configured deduplication ratio being an estimate made at a configuration stage of the deduplication computing storage environment.
Type: Application
Filed: December 1, 2010
Publication date: June 7, 2012
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Lior ARONOVICH, Shira BEN-DOR, Aviv CARO, Elena DROBCHENKO, Samuel KRIKLER, Ofer LENEMAN, Asaf LEVY, Liran LOYA, Dan MELAMED, Tzafrir Z. TAUB
-
Publication number: 20110138138
Abstract: A data storage system including at least one storage controller having a first color policy and operative to store data onto a first data storage unit at a primary site as part of a current color of the primary site, at least one storage controller having a second color policy and operative to store data onto a second data storage unit at the primary site as part of the current color, and a color control node operative to provide each of the controllers with new color information while maintaining the integrity of dependent writes across color boundaries.
Type: Application
Filed: April 25, 2005
Publication date: June 9, 2011
Applicant: International Business Machines Corporation
Inventors: Shira Ben-Dor, Harry Butterworth, Amir Kredi, Orit Nissan-Messing, Adam Wolman, Aviad Zlotnick
-
Publication number: 20090138666
Abstract: A color control node includes an interface for communicating with multiple storage controllers, wherein the storage controllers maintain a primary storage system at a primary site and a secondary storage system at a secondary site; and wherein the storage controllers maintain a current color and associate all writes with the current color without polling the color control node. The color control node also includes operational capability for issuing a polling command to instruct the storage controllers to poll the color control node for the current color prior to associating each write with a new color; receiving an acknowledgment of receipt of the polling command; changing the current color to a new color responsive to receiving the acknowledgment; issuing a storage command to the storage controllers indicating the new color; and instructing each storage controller to cease polling the color control node for the current color.
Type: Application
Filed: January 28, 2009
Publication date: May 28, 2009
Applicant: International Business Machines Corporation
Inventors: Shira Ben Dor, Amir Kredi, Avied Zlotnick, Henry Butterworth