Patents by Inventor Alyssa Proulx

Alyssa Proulx has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Adjustment of garbage collection parameters in a storage system

Patent number: 11816029

Abstract: A system, method, and machine-readable storage medium for performing garbage collection in a distributed storage system are provided. In some embodiments, an efficiency level of a garbage collection process is monitored. The garbage collection process may include removal of one or more data blocks of a set of data blocks that is referenced by a set of content identifiers. The set of slice services and the set of data blocks may reside in a cluster, and a set of probabilistic filters (e.g., Bloom filters) may indicate whether the set of data blocks is in-use. At least one parameter of a probabilistic filter of the set of probabilistic filters may be adjusted (e.g., increased or reduced) if the efficiency level is below the efficiency threshold. Garbage collection may be performed on the set of data blocks in accordance with the set of probabilistic filters.

Type: Grant

Filed: March 10, 2022

Date of Patent: November 14, 2023

Assignee: NetApp, Inc.

Inventors: Alyssa Proulx, Wei Sun
EFFICIENCY SETS FOR DETERMINATION OF UNIQUE DATA

Publication number: 20230176773

Abstract: A system, method, and machine-readable storage medium for determining an amount of unique data in a distributed storage system are provided. In some embodiments, a combined efficiency set for a first data set stored in the distributed storage system, such as at a volume, may be generated. The first data set may include a first subset of data and a second subset of data in the distributed storage system. Additionally, a set of efficiency sets for the first subset of data may be generated. A set difference based on the combined efficiency set and the set of efficiency sets may be computed. An amount of memory used for storing unique data of the second subset of data may be estimated based on the set difference. The unique data may be present in the second subset of data but absent from the first subset of data.

Type: Application

Filed: January 30, 2023

Publication date: June 8, 2023

Inventors: Alyssa Proulx, Mark David Olson
CREATION AND USE OF AN EFFICIENCY SET TO ESTIMATE AN AMOUNT OF DATA STORED IN A DATA SET OF A STORAGE SYSTEM HAVING ONE OR MORE CHARACTERISTICS

Publication number: 20230077764

Abstract: Systems and methods for sampling a set of block IDs to facilitate estimating an amount of data stored in a data set of a storage system having one or more characteristics are provided. According to an example, metadata (e.g., block headers and block IDs) may be maintained regarding multiple data blocks of the data set. When one or more metrics relating to the data set are desired, an efficiency set, representing a subset of the block IDs of the data set, may be created to facilitate efficient calculation of the metrics by sampling the block IDs of the data set. Finally, the metrics may be estimated based on the efficiency set by analyzing one or more of the metadata (e.g., block headers) and the data contained in the data blocks corresponding to the subset of the block IDs and extrapolating the metrics for the entirety of the data set.

Type: Application

Filed: November 22, 2022

Publication date: March 16, 2023

Applicant: NetApp, Inc.

Inventors: Charles Randall, Alyssa Proulx
Efficiency sets for determination of unique data

Patent number: 11567694

Abstract: A system, method, and machine-readable storage medium for determining an amount of unique data in a distributed storage system are provided. In some embodiments, a combined efficiency set for a first data set stored in the distributed storage system, such as at a volume, may be generated. The first data set may include a first subset of data and a second subset of data in the distributed storage system. Additionally, a set of efficiency sets for the first subset of data may be generated. A set difference based on the combined efficiency set and the set of efficiency sets may be computed. An amount of memory used for storing unique data of the second subset of data may be estimated based on the set difference. The unique data may be present in the second subset of data but absent from the first subset of data.

Type: Grant

Filed: December 1, 2021

Date of Patent: January 31, 2023

Assignee: NETAPP, INC.

Inventors: Alyssa Proulx, Mark David Olson
Creation and use of an efficiency set to estimate an amount of data stored in a data set of a storage system having one or more characteristics

Patent number: 11526275

Abstract: Systems and methods for sampling a set of block IDs to facilitate estimating an amount of data stored in a data set of a storage system having one or more characteristics are provided. According to an example, metadata (e.g., block headers and block IDs) may be maintained regarding multiple data blocks of the data set. When one or more metrics relating to the data set are desired, an efficiency set, representing a subset of the block IDs of the data set, may be created to facilitate efficient calculation of the metrics by statistically sampling the block IDs of the data set. Finally, the metrics may be estimated based on the efficiency set by analyzing one or more of the metadata (e.g., block headers) and the data contained in the data blocks corresponding to the subset of the block IDs and extrapolating the metrics for the entirety of the data set.

Type: Grant

Filed: October 23, 2020

Date of Patent: December 13, 2022

Assignee: NetApp, Inc.

Inventors: Charles Randall, Alyssa Proulx
ADJUSTMENT OF GARBAGE COLLECTION PARAMETERS IN A STORAGE SYSTEM

Publication number: 20220197789

Abstract: A system, method, and machine-readable storage medium for performing garbage collection in a distributed storage system are provided. In some embodiments, an efficiency level of a garbage collection process is monitored. The garbage collection process may include removal of one or more data blocks of a set of data blocks that is referenced by a set of content identifiers. The set of slice services and the set of data blocks may reside in a cluster, and a set of probabilistic filters (e.g., Bloom filters) may indicate whether the set of data blocks is in-use. At least one parameter of a probabilistic filter of the set of probabilistic filters may be adjusted (e.g., increased or reduced) if the efficiency level is below the efficiency threshold. Garbage collection may be performed on the set of data blocks in accordance with the set of probabilistic filters.

Type: Application

Filed: March 10, 2022

Publication date: June 23, 2022

Inventors: Alyssa Proulx, Wei Sun
CREATION AND USE OF AN EFFICIENCY SET TO ESTIMATE AN AMOUNT OF DATA STORED IN A DATA SET OF A STORAGE SYSTEM HAVING ONE OR MORE CHARACTERISTICS

Publication number: 20220129159

Abstract: Systems and methods for sampling a set of block IDs to facilitate estimating an amount of data stored in a data set of a storage system having one or more characteristics are provided. According to an example, metadata (e.g., block headers and block IDs) may be maintained regarding multiple data blocks of the data set. When one or more metrics relating to the data set are desired, an efficiency set, representing a subset of the block IDs of the data set, may be created to facilitate efficient calculation of the metrics by statistically sampling the block IDs of the data set. Finally, the metrics may be estimated based on the efficiency set by analyzing one or more of the metadata (e.g., block headers) and the data contained in the data blocks corresponding to the subset of the block IDs and extrapolating the metrics for the entirety of the data set.

Type: Application

Filed: October 23, 2020

Publication date: April 28, 2022

Applicant: NetApp, Inc.

Inventors: Charles Randall, Alyssa Proulx
Adjustment of garbage collection parameters in a storage system

Patent number: 11288186

Abstract: A system, method, and machine-readable storage medium for performing garbage collection in a distributed storage system are provided. In some embodiments, an efficiency level of a garbage collection process is monitored. The garbage collection process may include removal of one or more data blocks of a set of data blocks that is referenced by a set of content identifiers. The set of slice services and the set of data blocks may reside in a cluster, and a set of filters may indicate whether the set of data blocks is in-use. At least one parameter of a filter of the set of filters may be adjusted (e.g., increased or reduced) if the efficiency level is below the efficiency threshold. Garbage collection may be performed on the set of data blocks in accordance with the set of filters.

Type: Grant

Filed: April 23, 2020

Date of Patent: March 29, 2022

Assignee: NetApp, Inc.

Inventors: Alyssa Proulx, Wei Sun
EFFICIENCY SETS FOR DETERMINATION OF UNIQUE DATA

Publication number: 20220083262

Abstract: A system, method, and machine-readable storage medium for determining an amount of unique data in a distributed storage system are provided. In some embodiments, a combined efficiency set for a first data set stored in the distributed storage system, such as at a volume, may be generated. The first data set may include a first subset of data and a second subset of data in the distributed storage system. Additionally, a set of efficiency sets for the first subset of data may be generated. A set difference based on the combined efficiency set and the set of efficiency sets may be computed. An amount of memory used for storing unique data of the second subset of data may be estimated based on the set difference. The unique data may be present in the second subset of data but absent from the first subset of data.

Type: Application

Filed: December 1, 2021

Publication date: March 17, 2022

Inventors: Alyssa Proulx, Mark David Olson
Efficiency sets for determination of unique data

Patent number: 11194506

Abstract: A system, method, and machine-readable storage medium for determining an amount of unique data in a distributed storage system are provided. In some embodiments, a combined efficiency set for a first data set stored in the distributed storage system, such as at a volume, may be generated. The first data set may include a first subset of data and a second subset of data in the distributed storage system. Additionally, a set of efficiency sets for the first subset of data may be generated. A set difference based on the combined efficiency set and the set of efficiency sets may be computed. An amount of memory used for storing unique data of the second subset of data may be estimated based on the set difference. The unique data may be present in the second subset of data but absent from the first subset of data.

Type: Grant

Filed: July 28, 2020

Date of Patent: December 7, 2021

Assignee: NETAPP, INC.

Inventors: Alyssa Proulx, Mark David Olson
ADJUSTMENT OF GARBAGE COLLECTION PARAMETERS IN A STORAGE SYSTEM

Publication number: 20210334208

Abstract: A system, method, and machine-readable storage medium for performing garbage collection in a distributed storage system are provided. In some embodiments, an efficiency level of a garbage collection process is monitored. The garbage collection process may include removal of one or more data blocks of a set of data blocks that is referenced by a set of content identifiers. The set of slice services and the set of data blocks may reside in a cluster, and a set of filters may indicate whether the set of data blocks is in-use. At least one parameter of a filter of the set of filters may be adjusted (e.g., increased or reduced) if the efficiency level is below the efficiency threshold. Garbage collection may be performed on the set of data blocks in accordance with the set of filters.

Type: Application

Filed: April 23, 2020

Publication date: October 28, 2021

Inventors: Alyssa Proulx, Wei Sun
Efficiency sets in a distributed system

Patent number: 9377953

Abstract: Disclosed are systems, computer-readable mediums, and methods for efficiency sets in a distributed system. A first efficiency set is determined for a first volume of data. Determining the first efficiency set includes selecting block identifiers for data blocks of the first volume, where each block identifier is used to access a particular data block corresponding to the first volume. Determining the first efficiency set further includes applying a mask to the selected block identifiers to mask at least one bit of each selected block identifier. The first efficiency set is compared to a second efficiency set for a second data store, and based on the comparison, an amount of unique data blocks of the first volume is approximated.

Type: Grant

Filed: April 23, 2014

Date of Patent: June 28, 2016

Assignee: NETAPP, INC.

Inventors: Mattias Fornander, Alyssa Proulx, Jared Cantwell, Travis Gockel
Efficiency sets in a distributed system

Patent number: 9348514

Abstract: Disclosed are systems, computer-readable mediums, and methods for efficiency sets in a distributed system. A first efficiency set is determined for a first volume of data. Determining the first efficiency set includes selecting block identifiers for data blocks of the first volume, where each block identifier is used to access a particular data block corresponding to the first volume. Determining the first efficiency set further includes applying a mask to the selected block identifiers to mask at least one bit of each selected block identifier. The first efficiency set is compared to a second efficiency set for a second data store, and based on the comparison, an amount of unique data blocks of the first volume is approximated.

Type: Grant

Filed: April 13, 2015

Date of Patent: May 24, 2016

Assignee: NETAPP, INC.

Inventors: Mattias Fornander, Alyssa Proulx, Jared Cantwell, Travis Gockel
EFFICIENCY SETS IN A DISTRIBUTED SYSTEM

Publication number: 20150309746

Abstract: Disclosed are systems, computer-readable mediums, and methods for efficiency sets in a distributed system. A first efficiency set is determined for a first volume of data. Determining the first efficiency set includes selecting block identifiers for data blocks of the first volume, where each block identifier is used to access a particular data block corresponding to the first volume. Determining the first efficiency set further includes applying a mask to the selected block identifiers to mask at least one bit of each selected block identifier. The first efficiency set is compared to a second efficiency set for a second data store, and based on the comparison, an amount of unique data blocks of the first volume is approximated.

Type: Application

Filed: April 13, 2015

Publication date: October 29, 2015

Inventors: Mattias Fornander, Alyssa Proulx, Jared Cantwell, Travis Gockel
EFFICIENCY SETS IN A DISTRIBUTED SYSTEM

Publication number: 20150309733

Abstract: Disclosed are systems, computer-readable mediums, and methods for efficiency sets in a distributed system. A first efficiency set is determined for a first volume of data. Determining the first efficiency set includes selecting block identifiers for data blocks of the first volume, where each block identifier is used to access a particular data block corresponding to the first volume. Determining the first efficiency set further includes applying a mask to the selected block identifiers to mask at least one bit of each selected block identifier. The first efficiency set is compared to a second efficiency set for a second data store, and based on the comparison, an amount of unique data blocks of the first volume is approximated.

Type: Application

Filed: April 23, 2014

Publication date: October 29, 2015

Applicant: SolidFire, Inc.

Inventors: Mattias Fornander, Alyssa Proulx, Jared Cantwell, Travis Gockel