Patents by Inventor Tony T. WONG

Tony T. WONG has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20250053589
    Abstract: One example method includes, beginning at a parent node of a data structure, calculating a first W-index for the parent node and a left child node, calculating a second W-index for the parent node and a right child node, when the first W-index exceeds a threshold ?, marking the left child node as a candidate to be included in a sub-cluster separate from a sub-cluster that includes the parent node, when the second W-index exceeds the threshold ?, marking the right child node as a candidate to be included in a sub-cluster separate from the sub-cluster that includes the parent node, and recursively calculating respective W-indices for all other parent nodes in the data structure until all child nodes of the data structure have been assigned to a sub-cluster.
    Type: Application
    Filed: October 29, 2024
    Publication date: February 13, 2025
    Inventors: Smriti Thakkar, Tony T. Wong
  • Patent number: 12222815
    Abstract: One example method includes generating a fingerprint:tag dictionary that includes a group of fingerprints and a group of tags, and the fingerprint:tag dictionary identifies, for each fingerprint, the tag or tags which include that fingerprint, computing a similarity matrix based on the fingerprint:tag dictionary, and the similarity matrix identifies, for each pair of tags in the fingerprint:tag dictionary, a relative similarity of the tags in the pair to each other, running a clustering algorithm to identify groups of similar tags in the similarity matrix, and deduplicating, based on the groups of similar tags, respective data associated with the fingerprints.
    Type: Grant
    Filed: December 17, 2020
    Date of Patent: February 11, 2025
    Assignee: EMC IP Holding Company LLC
    Inventors: Smriti Thakkar, Tony T. Wong, Abhinav Duggal
  • Patent number: 12204576
    Abstract: One example method includes, beginning at a parent node of a data structure, calculating a first W-index for the parent node and a left child node, calculating a second W-index for the parent node and a right child node, when the first W-index exceeds a threshold ?, marking the left child node as a candidate to be included in a sub-cluster separate from a sub-cluster that includes the parent node, when the second W-index exceeds the threshold ?, marking the right child node as a candidate to be included in a sub-cluster separate from the sub-cluster that includes the parent node, and recursively calculating respective W-indices for all other parent nodes in the data structure until all child nodes of the data structure have been assigned to a sub-cluster.
    Type: Grant
    Filed: July 18, 2022
    Date of Patent: January 21, 2025
    Assignee: Dell Products L.P.
    Inventors: Smriti Thakkar, Tony T. Wong
  • Patent number: 12164478
    Abstract: One example method includes receiving at a worker node, from each deduplication filesystem (DDFS) node in a group of DDFS nodes, respective parts of a fingerprint-file object dictionary, wherein each of the parts is created using a common fingerprint mask, merging, by the worker node, the parts, scanning the merged parts to update an intersection count between file objects stored at the DDFS nodes, generating, based on the scanning, an intersection matrix, and transferring the intersection matrix to a master node.
    Type: Grant
    Filed: July 18, 2022
    Date of Patent: December 10, 2024
    Assignee: Dell Products L.P.
    Inventors: Smriti Thakkar, Tony T. Wong
  • Publication number: 20240020329
    Abstract: One example method includes, beginning at a parent node of a data structure, calculating a first W-index for the parent node and a left child node, calculating a second W-index for the parent node and a right child node, when the first W-index exceeds a threshold ?, marking the left child node as a candidate to be included in a sub-cluster separate from a sub-cluster that includes the parent node, when the second W-index exceeds the threshold ?, marking the right child node as a candidate to be included in a sub-cluster separate from the sub-cluster that includes the parent node, and recursively calculating respective W-indices for all other parent nodes in the data structure until all child nodes of the data structure have been assigned to a sub-cluster.
    Type: Application
    Filed: July 18, 2022
    Publication date: January 18, 2024
    Inventors: Smriti Thakkar, Tony T. Wong
  • Publication number: 20240020274
    Abstract: One example method includes receiving at a worker node, from each deduplication filesystem (DDFS) node in a group of DDFS nodes, respective parts of a fingerprint-file object dictionary, wherein each of the parts is created using a common fingerprint mask, merging, by the worker node, the parts, scanning the merged parts to update an intersection count between file objects stored at the DDFS nodes, generating, based on the scanning, an intersection matrix, and transferring the intersection matrix to a master node.
    Type: Application
    Filed: July 18, 2022
    Publication date: January 18, 2024
    Inventors: Smriti Thakkar, Tony T. Wong
  • Publication number: 20240020273
    Abstract: One example method includes finding an Ln segment in a namespace, and adding the Ln segment to a fingerprint hashtable, scanning a container set and locating Ln in the container set, looking for Ln in the fingerprint hashtable, when Ln is found in the fingerprint hashtable, deleting Ln from the fingerprint hashtable, otherwise, retaining Ln in the fingerprint hashtable, when Ln is found in in the fingerprint hashtable, adding all fingerprints, pointed to by Ln, to the fingerprint hashtable, and when all levels of a tree that includes Ln have been checked, identifying as missing, any fingerprints still remaining in the fingerprint hashtable.
    Type: Application
    Filed: July 18, 2022
    Publication date: January 18, 2024
    Inventors: Smriti Thakkar, Tony T. Wong, Pooja Mahadik
  • Patent number: 11675536
    Abstract: An intelligent method of scheduling garbage collection (GC) in a storage system. A GC scheduler obtains capacity utilization and ingest rate of the storage system and calculate therefrom a predicted capacity utilization. When the predicted capacity utilization reaches a threshold, the GC scheduler invokes GC, but otherwise skips GC until such time as predicted capacity utilization reaches the threshold. The ingest rage may be calculated by performing linear fit on past data ingest. The GC scheduler may calculate predicted capacity utilization periodically according to preset period. The GC scheduler may calculate the predicted capacity utilization to a future date beyond the next period. The future date may be at least as far as the next period plus total ingest time.
    Type: Grant
    Filed: October 13, 2020
    Date of Patent: June 13, 2023
    Assignee: EMC IP HOLDING COMPANY LLC
    Inventors: Tony T. Wong, Abhinav Duggal, Joseph Jobi
  • Publication number: 20220197755
    Abstract: One example method includes generating a fingerprint:tag dictionary that includes a group of fingerprints and a group of tags, and the fingerprint:tag dictionary identifies, for each fingerprint, the tag or tags which include that fingerprint, computing a similarity matrix based on the fingerprint:tag dictionary, and the similarity matrix identifies, for each pair of tags in the fingerprint:tag dictionary, a relative similarity of the tags in the pair to each other, running a clustering algorithm to identify groups of similar tags in the similarity matrix, and deduplicating, based on the groups of similar tags, respective data associated with the fingerprints.
    Type: Application
    Filed: December 17, 2020
    Publication date: June 23, 2022
    Inventors: Smriti Thakkar, Tony T. Wong, Abhinav Duggal
  • Publication number: 20220113902
    Abstract: An intelligent method of scheduling garbage collection (GC) in a storage system. A GC scheduler obtains capacity utilization and ingest rate of the storage system and calculate therefrom a predicted capacity utilization. When the predicted capacity utilization reaches a threshold, the GC scheduler invokes GC, but otherwise skips GC until such time as predicted capacity utilization reaches the threshold. The ingest rage may be calculated by performing linear fit on past data ingest. The GC scheduler may calculate predicted capacity utilization periodically according to preset period. The GC scheduler may calculate the predicted capacity utilization to a future date beyond the next period. The future date may be at least as far as the next period plus total ingest time.
    Type: Application
    Filed: October 13, 2020
    Publication date: April 14, 2022
    Inventors: Tony T. WONG, Abhinav DUGGAL, Joseph JOBI