Patents by Inventor Ming Benjamin Zhu
Ming Benjamin Zhu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20130232125Abstract: Stream locality delta compression is disclosed. A previous stream indicated locale of data segments is selected. A first data segment is then determined to be similar to a data segment in the stream indicated locale.Type: ApplicationFiled: February 11, 2013Publication date: September 5, 2013Applicant: EMC CorporationInventors: Mark Huang, Philip Shilane, Grant Wallace, Nitin Garg, Edward K. Lee, Ming Benjamin Zhu, Kai Li
-
Patent number: 8527568Abstract: Determining a summary feature set is disclosed. A plurality of subsegments of a first segment are selected. For each subsegment, a plurality of values by applying a set of functions to each subsegment are computed. From all the values computed for all the subsegments, a first subset of values is selected.Type: GrantFiled: October 22, 2010Date of Patent: September 3, 2013Assignee: EMC CorporationInventors: Kai Li, Ming Benjamin Zhu
-
Patent number: 8447740Abstract: Stream locality delta compression is disclosed. A previous stream indicated locale of data segments is selected. A first data segment is then determined to be similar to a data segment in the stream indicated locale.Type: GrantFiled: November 14, 2008Date of Patent: May 21, 2013Assignee: EMC CorporationInventors: Mark Huang, Philip Shilane, Grant Wallace, Nitin Garg, Edward K. Lee, Ming Benjamin Zhu, Kai Li
-
Publication number: 20120317381Abstract: A system and method are disclosed for providing efficient data storage. A plurality of data segments is received in a data stream. The system preliminarily checks in a memory having a relatively low latency whether one of the plurality of data segments may have been stored previously in a data segment repository. The memory having the relatively low latency stores data segment information. In the event that the preliminary check determines that one of the plurality of data segments may have been stored in the data segment repository, a memory having a relatively higher latency is checked to determine whether the data segment has been stored previously in the data segment repository.Type: ApplicationFiled: August 23, 2012Publication date: December 13, 2012Applicant: EMC CorporationInventors: Ming Benjamin Zhu, Kai Li, R. Hugo Patterson
-
Patent number: 8312006Abstract: Storage of data segments is disclosed. For each segment, a similar segment to the segment is identified, wherein the similar segment is already managed by a cluster node. In the event the similar segment is identified, a reference to the similar segment and a delta between the similar segment and the segment are caused to be stored instead of the segment.Type: GrantFiled: April 19, 2011Date of Patent: November 13, 2012Assignee: EMC CorporationInventors: R. Hugo Patterson, Kai Li, Ming Benjamin Zhu, Sazzala Venkata Reddy, Umesh Maheshwari, Edward K. Lee
-
Patent number: 8275955Abstract: A system and method are disclosed for providing efficient data storage. A plurality of data segments is received in a data stream. The system determines whether a data segment has been stored previously in a low latency memory. In the event that the data segment is determined to have been stored previously, an identifier for the previously stored data segment is returned.Type: GrantFiled: June 21, 2010Date of Patent: September 25, 2012Assignee: EMC CorporationInventors: Ming Benjamin Zhu, R. Hugo Patterson, Kai Li
-
Patent number: 8145863Abstract: Storage using resemblance of data segments is disclosed. It is determined that a new segment resembles a second prior stored segment wherein the second prior stored segment is represented as a first stored delta and a first prior stored segment. A second delta between the new segment and the prior stored segment is determined. A representation of the new segment based at least in part on the second delta is stored.Type: GrantFiled: April 13, 2011Date of Patent: March 27, 2012Assignee: EMC CorporationInventor: Ming Benjamin Zhu
-
Publication number: 20110196869Abstract: Storage of data segments is disclosed. For each segment, a similar segment to the segment is identified, wherein the similar segment is already managed by a cluster node. In the event the similar segment is identified, a reference to the similar segment and a delta between the similar segment and the segment are caused to be stored instead of the segment.Type: ApplicationFiled: April 19, 2011Publication date: August 11, 2011Applicants: EMC CORPORATIONInventors: R. Hugo Patterson, Kai Li, Ming Benjamin Zhu, Sazzala Venkata Reddy, Umesh Maheshwari, Edward K. Lee
-
Publication number: 20110191560Abstract: Storage using resemblance of data segments is disclosed. It is determined that a new segment resembles a second prior stored segment wherein the second prior stored segment is represented as a first stored delta and a first prior stored segment. A second delta between the new segment and the prior stored segment is determined. A representation of the new segment based at least in part on the second delta is stored.Type: ApplicationFiled: April 13, 2011Publication date: August 4, 2011Applicant: EMC CORPORATIONInventor: Ming Benjamin Zhu
-
Patent number: 7962520Abstract: Cluster storage is disclosed. A data stream or a data block is received. The data stream or the data block is broken into segments. For each segment, a cluster node is selected, and in the event that a similar segment to the segment is identified that is already managed by the selected cluster node, a reference to the similar segment and a delta between the similar segment and the segment is caused to be stored on the selected cluster node.Type: GrantFiled: April 9, 2008Date of Patent: June 14, 2011Assignee: EMC CorporationInventors: R. Hugo Patterson, Kai Li, Ming Benjamin Zhu, Sazzala Venkata Reddy, Umesh Maheshwari, Edward K. Lee
-
Patent number: 7949824Abstract: Storage using resemblance of data segments is disclosed. It is determined that a new segment resembles a second prior stored segment wherein the second prior stored segment is represented as a first stored delta and a first prior stored segment. A second delta between the new segment and the prior stored segment is determined. A representation of the new segment based at least in part on the second delta is stored.Type: GrantFiled: April 11, 2006Date of Patent: May 24, 2011Assignee: EMC CorporationInventor: Ming Benjamin Zhu
-
Publication number: 20110040819Abstract: Determining a summary feature set is disclosed. A plurality of subsegments of a first segment are selected. For each subsegment, a plurality of values by applying a set of functions to each subsegment are computed. From all the values computed for all the subsegments, a first subset of values is selected.Type: ApplicationFiled: October 22, 2010Publication date: February 17, 2011Applicant: EMC CORPORATIONInventors: Kai Li, Ming Benjamin Zhu
-
Patent number: 7882064Abstract: File system replication includes determining whether one of a plurality of files included in an original file system has been updated since a previous replication, the file having a plurality of data segments, and in the event that the file has been updated, locating among the plurality of data segments a previously stored data segment that is newly referenced by the file, and that does not require replication.Type: GrantFiled: July 6, 2006Date of Patent: February 1, 2011Assignee: EMC CorporationInventors: Edward K. Lee, Ming Benjamin Zhu, Umesh Maheshwari, R. Hugo Patterson
-
Patent number: 7844652Abstract: Determining a summary feature set is disclosed. A plurality of subsegments of a first segment are selected. For each subsegment, a plurality of values by applying a set of functions to each subsegment are computed. From all the values computed for all the subsegments, a first subset of values is selected.Type: GrantFiled: April 11, 2006Date of Patent: November 30, 2010Assignee: EMC CorporationInventors: Kai Li, Ming Benjamin Zhu
-
Publication number: 20100257315Abstract: A system and method are disclosed for providing efficient data storage. A plurality of data segments is received in a data stream. The system determines whether a data segment has been stored previously in a low latency memory. In the event that the data segment is determined to have been stored previously, an identifier for the previously stored data segment is returned.Type: ApplicationFiled: June 21, 2010Publication date: October 7, 2010Applicant: EMC CORPORATIONInventors: Ming Benjamin Zhu, Kai Li, R. Hugo Patterson
-
Patent number: 7769967Abstract: A system and method are disclosed for providing efficient data storage. A plurality of data segments is received in a data stream. The system determines whether a data segment has been stored previously in a low latency memory. In the event that the data segment is determined to have been stored previously, an identifier for the previously stored data segment is returned.Type: GrantFiled: March 28, 2008Date of Patent: August 3, 2010Assignee: EMC CorporationInventors: Ming Benjamin Zhu, Kai Li, R. Hugo Patterson
-
Patent number: 7747581Abstract: A network file system-based data storage system that converts random I/O requests into a piecewise sequential data structure to facilitate variable length data segment redundancy identification and elimination. For one embodiment of the invention a stateless network file system is employed. For one such embodiment, that provides multiple-client access to stored data, multiple Writes are buffered and then broken into variable length data segments. Redundant segment elimination is then effected. One embodiment of the invention allows sharing of the variable length data segments among files.Type: GrantFiled: April 19, 2007Date of Patent: June 29, 2010Assignee: EMC CorporationInventors: Kai Li, R. Hugo Patterson, Ming Benjamin Zhu, Allan Bricker, Richard Johnsson, Sazzala Reddy, Jeffery Zabarsky
-
Publication number: 20100125553Abstract: Delta compression after identity deduplication is disclosed. A first data segment is determined to be identical to a first previous data segment. A second data segment, not determined to be identical to a second previous data segment, is then determined to be similar to a third previous data segment.Type: ApplicationFiled: November 14, 2008Publication date: May 20, 2010Inventors: Mark Huang, Edward K. Lee, Kai Li, Philip Shilane, Grant Wallace, Ming Benjamin Zhu
-
Patent number: 7689633Abstract: A network file system-based data storage system that converts random I/O requests into a piecewise sequential data structure to facilitate variable length data segment redundancy identification and elimination. For one embodiment of the invention a stateless network file system is employed. For one such embodiment, that provides multiple-client access to stored data, multiple Writes are buffered and then broken into variable length data segments. Redundant segment elimination is then effected. One embodiment of the invention allows sharing of the variable length data segments among files.Type: GrantFiled: September 15, 2004Date of Patent: March 30, 2010Assignee: Data Domain, Inc.Inventors: Kai Li, R. Hugo Patterson, Ming Benjamin Zhu, Allan Bricker, Richard Johnsson, Sazzala Reddy, Jeffery Zabarsky
-
Patent number: 7631144Abstract: A method for storing data is disclosed. The method comprises receiving a data stream comprising a plurality of data segments and preliminarily checking in a memory having a relatively low latency whether one of the plurality of data segments has been stored previously. The method further comprises in the event that the preliminary check does not conclusively determine whether the data segment has been stored previously, limiting checking in a memory having a relatively high latency to conclusively determine whether the data segment has been previously stored, and in the event that checking is limited or in the event that the check in the memory having relatively high latency conclusively determines the data segment has not been previously stored, storing the data segment.Type: GrantFiled: September 13, 2004Date of Patent: December 8, 2009Assignee: DataDomain, Inc.Inventors: Ming Benjamin Zhu, R. Hugo Patterson, Allan J. Bricker, Edward K. Lee