Patents by Inventor Wangyuan Zhang
Wangyuan Zhang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11954024Abstract: A method includes receiving data objects, determining a predicted lifespan of each data object, and instantiating multiple shard files. Each shard file has an associated predicted lifespan range. The method also includes writing each data object into a corresponding shard file having the associated predicted lifespan range that includes the predicted lifespan of the respective data object and storing the shard files in a distributed system. The method also includes determining whether any stored shard files satisfy a compaction criteria based on a number of deleted data objects in each corresponding stored shard file. For each stored shard file satisfying the compaction criteria, the method also includes compacting the stored shard file by rewriting the remaining data objects of the stored shard file into a new shard file.Type: GrantFiled: January 24, 2022Date of Patent: April 9, 2024Assignee: Google LLCInventors: Wangyuan Zhang, Sandeep Singhal, Sangho Yoon, Guangda Lai, Arash Baratloo, Zhifan Zhang, Gael Hatchue Njouyep, Pramod Gaud
-
Patent number: 11588891Abstract: A system and method for storing data in a distributed network having a plurality of datacenters distributed over a plurality of geographic regions. The method may involve receiving data, including metadata, uploaded to a first datacenter of the distributed network, receiving access information about previous data that was previously stored in the plurality of datacenters of the distributed network, predicting one or more of the plurality of geographic regions from which the uploaded data will be accessed based on the metadata and the access information, and instructing the uploaded data to be transferred from the first datacenter to one or more second datacenters located at each of the one or more predicted geographic regions.Type: GrantFiled: November 4, 2019Date of Patent: February 21, 2023Assignee: Google LLCInventors: Wangyuan Zhang, Vivienne Zhang, Pramod Gaud, Sangho Yoon, Xudong Shi, Kaifeng Yao
-
Publication number: 20220147448Abstract: A method includes receiving data objects, determining a predicted lifespan of each data object, and instantiating multiple shard files. Each shard file has an associated predicted lifespan range. The method also includes writing each data object into a corresponding shard file having the associated predicted lifespan range that includes the predicted lifespan of the respective data object and storing the shard files in a distributed system. The method also includes determining whether any stored shard files satisfy a compaction criteria based on a number of deleted data objects in each corresponding stored shard file. For each stored shard file satisfying the compaction criteria, the method also includes compacting the stored shard file by rewriting the remaining data objects of the stored shard file into a new shard file.Type: ApplicationFiled: January 24, 2022Publication date: May 12, 2022Applicant: Google LLCInventors: Wangyuan Zhang, Sandeep Singhal, Sangho Yoon, Guangda Lai, Arash Baratloo, Zhifan Zhang, Gael Hatchue Njouyep, Pramod Gaud
-
Patent number: 11263128Abstract: A method includes receiving data objects, determining a predicted lifespan of each data object, and instantiating multiple shard files. Each shard file has an associated predicted lifespan range. The method also includes writing each data object into a corresponding shard file having the associated predicted lifespan range that includes the predicted lifespan of the respective data object and storing the shard files in a distributed system. The method also includes determining whether any stored shard files satisfy a compaction criteria based on a number of deleted data objects in each corresponding stored shard file. For each stored shard file satisfying the compaction criteria, the method also includes compacting the stored shard file by rewriting the remaining data objects of the stored shard file into a new shard file.Type: GrantFiled: October 27, 2017Date of Patent: March 1, 2022Assignee: Google LLCInventors: Wangyuan Zhang, Sandeep Singhal, Sangho Yoon, Guangda Lai, Arash Baratloo, Zhifan Zhang, Gael Hatchue Njouyep, Pramod Gaud
-
Publication number: 20210136150Abstract: A system and method for storing data in a distributed network having a plurality of datacenters distributed over a plurality of geographic regions. The method may involve receiving data, including metadata, uploaded to a first datacenter of the distributed network, receiving access information about previous data that was previously stored in the plurality of datacenters of the distributed network, predicting one or more of the plurality of geographic regions from which the uploaded data will be accessed based on the metadata and the access information, and instructing the uploaded data to be transferred from the first datacenter to one or more second datacenters located at each of the one or more predicted geographic regions.Type: ApplicationFiled: November 4, 2019Publication date: May 6, 2021Applicant: Google LLCInventors: Wangyuan Zhang, Vivienne Zhang, Pramod Gaud, Sangho Yoon, Xudong Shi, Kaifeng Yao
-
Publication number: 20200265068Abstract: A method includes identifying a first table including data. The first table has associated metadata, an associated replication state, an associated replication log file including replication logs logging mutations of the first table, and an associated replication configuration file including a first association that associates the first table with a replication family. The method includes inserting a second association in the replication configuration file that associates a second table having a non-loadable state with the replication family. The association of the second table with the replication family causes persistence of any replication logs in the replication log file that correspond to any mutations of the first table during the existence of the second table. The method further includes generating a third table from the first table, the metadata associated with the first table, and the associated replication state of the first table.Type: ApplicationFiled: April 25, 2020Publication date: August 20, 2020Applicant: Google LLCInventors: Wangyuan Zhang, Li Moore
-
Patent number: 10650024Abstract: A method includes identifying a first table including data. The first table has associated metadata, an associated replication state, an associated replication log file including replication logs logging mutations of the first table, and an associated replication configuration file including a first association that associates the first table with a replication family. The method includes inserting a second association in the replication configuration file that associates a second table having a non-loadable state with the replication family. The association of the second table with the replication family causes persistence of any replication logs in the replication log file that correspond to any mutations of the first table during the existence of the second table. The method further includes generating a third table from the first table, the metadata associated with the first table, and the associated replication state of the first table.Type: GrantFiled: July 30, 2015Date of Patent: May 12, 2020Assignee: Google LLCInventors: Wangyuan Zhang, Li Moore
-
Patent number: 10530855Abstract: Techniques for synchronization between data structures for original locks and mirror lock data structures are disclosed herein. The mirror lock data structures are being maintained during various scenarios including volume move and aggregate relocation, in order to preserve the non-disruptive persistent operation on storage initiated by clients. According to one embodiment, a storage node determines a plurality of data container locks to be synchronized to a partner node of the storage node and transfers metadata that indicates states of variables that represent the plurality of data container locks to the partner node in a batch. When a client initiates a data access operation that causes an attempt to modify a data container lock of the plurality of data container locks, the storage node sends a retry code to a client that prompts the client to retry the data access operation after a predetermined time period.Type: GrantFiled: February 26, 2016Date of Patent: January 7, 2020Assignee: NETAPP, INC.Inventors: Omprakaash C. Thoppai, William Zumach, Wangyuan Zhang, Vinay Sridhar, Robert Wyckoff Hyer, Jr.
-
Publication number: 20190129844Abstract: A method includes receiving data objects, determining a predicted lifespan of each data object, and instantiating multiple shard files. Each shard file has an associated predicted lifespan range. The method also includes writing each data object into a corresponding shard file having the associated predicted lifespan range that includes the predicted lifespan of the respective data object and storing the shard files in a distributed system. The method also includes determining whether any stored shard files satisfy a compaction criteria based on a number of deleted data objects in each corresponding stored shard file. For each stored shard file satisfying the compaction criteria, the method also includes compacting the stored shard file by rewriting the remaining data objects of the stored shard file into a new shard file.Type: ApplicationFiled: October 27, 2017Publication date: May 2, 2019Applicant: Google LLCInventors: Wangyuan Zhang, Sandeep Singhal, Sangho Yoon, Guangda Lai, Arash Baratloo, Zhifan Zhang, Gael Hatchue Njouyep, Pramod Gaud
-
Publication number: 20170032012Abstract: A method includes identifying a first table including data. The first table has associated metadata, an associated replication state, an associated replication log file including replication logs logging mutations of the first table, and an associated replication configuration file including a first association that associates the first table with a replication family. The method includes inserting a second association in the replication configuration file that associates a second table having a non-loadable state with the replication family. The association of the second table with the replication family causes persistence of any replication logs in the replication log file that correspond to any mutations of the first table during the existence of the second table. The method further includes generating a third table from the first table, the metadata associated with the first table, and the associated replication state of the first table.Type: ApplicationFiled: July 30, 2015Publication date: February 2, 2017Applicant: Google Inc.Inventors: Wangyuan Zhang, Li Moore
-
Publication number: 20160182630Abstract: Techniques for synchronization between data structures for original locks and mirror lock data structures are disclosed herein. The mirror lock data structures are being maintained during various scenarios including volume move and aggregate relocation, in order to preserve the non-disruptive persistent operation on storage initiated by clients. According to one embodiment, a storage node determines a plurality of data container locks to be synchronized to a partner node of the storage node and transfers metadata that indicates states of variables that represent the plurality of data container locks to the partner node in a batch. When a client initiates a data access operation that causes an attempt to modify a data container lock of the plurality of data container locks, the storage node sends a retry code to a client that prompts the client to retry the data access operation after a predetermined time period.Type: ApplicationFiled: February 26, 2016Publication date: June 23, 2016Inventors: Omprakaash C. Thoppai, William Zumach, Wangyuan Zhang, Vinay Sridhar, Robert Wyckoff Hyer
-
Patent number: 9280396Abstract: Techniques for synchronization between data structures for original locks and mirror lock data structures are disclosed herein. The mirror lock data structures are being maintained during various scenarios including volume move and aggregate relocation, in order to preserve the non-disruptive persistent operation on storage initiated by clients. According to one embodiment, a storage node determines a plurality of data container locks to be synchronized to a partner node of the storage node and transfers metadata that indicates states of variables that represent the plurality of data container locks to the partner node in a batch. When a client initiates a data access operation that causes an attempt to modify a data container lock of the plurality of data container locks, the storage node sends a retry code to a client that prompts the client to retry the data access operation after a predetermined time period.Type: GrantFiled: November 1, 2012Date of Patent: March 8, 2016Assignee: NetApp, Inc.Inventors: Omprakaash C. Thoppai, William Zumach, Wangyuan Zhang, Vinay Sridhar, Robert Wyckoff Hyer, Jr.