Patents by Inventor Ming Benjamin Zhu
Ming Benjamin Zhu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 9841898Abstract: Transmitting or storing subsegments is disclosed. A data stream or a data block is received and broken into a plurality of segments. For at least one segment, the segment is broken into a plurality of subsegments. A previously stored or transmitted segment similar to the at least one segment is identified. A fingerprint is computed for at least one subsegment. And, using the fingerprint for the at least one subsegment, determining whether the at least one subsegment is identical to a subsegment of the previously stored or transmitted segment without directly comparing the content of the at least one subsegment with the content of the subsegment of the previously stored or transmitted segment.Type: GrantFiled: May 16, 2014Date of Patent: December 12, 2017Assignee: EMC IP Holding Company LLCInventors: R. Hugo Patterson, Ming Benjamin Zhu
-
Patent number: 9823975Abstract: Determining a summary feature set is disclosed. A plurality of subsegments of a first segment are selected. For each subsegment, a plurality of values by applying a set of functions to each subsegment are computed. From all the values computed for all the subsegments, a first subset of values is selected.Type: GrantFiled: April 26, 2016Date of Patent: November 21, 2017Assignee: EMC IP Holding Company LLCInventors: Kai Li, Ming Benjamin Zhu
-
Patent number: 9690802Abstract: Stream locality delta compression is disclosed. A previous stream indicated locale of data segments is selected. A first data segment is then determined to be similar to a data segment in the stream indicated locale.Type: GrantFiled: May 27, 2015Date of Patent: June 27, 2017Assignee: EMC IP Holding Company LLCInventors: Mark Huang, Philip Shilane, Grant Wallace, Nitin Garg, Edward K. Lee, Ming Benjamin Zhu, Kai Li
-
Patent number: 9535624Abstract: A method of managing duplicate segments from a segmented file storage system is disclosed. The method comprises indexing a segment according to a key for the segments wherein the index includes an identification of a first data location where the segment is stored and identifying a duplicate segment having the same key that is stored in a second location. The method further comprises determining that the duplicate segment is an undesired duplicate segment and eliminating the undesired duplicate segment.Type: GrantFiled: August 4, 2005Date of Patent: January 3, 2017Assignee: EMC IP Holding Company LLCInventors: R. Hugo Patterson, Ming Benjamin Zhu, Edward K. Lee
-
Publication number: 20160314139Abstract: Determining a summary feature set is disclosed. A plurality of subsegments of a first segment are selected. For each subsegment, a plurality of values by applying a set of functions to each subsegment are computed. From all the values computed for all the subsegments, a first subset of values is selected.Type: ApplicationFiled: April 26, 2016Publication date: October 27, 2016Inventors: Kai Li, Ming Benjamin Zhu
-
Patent number: 9454318Abstract: A system and method are disclosed for providing efficient data storage. A plurality of data segments is received in a data stream. The system determines whether a data segment has been stored previously in a low latency memory. In the event that the data segment is determined to have been stored previously, an identifier for the previously stored data segment is returned.Type: GrantFiled: June 9, 2015Date of Patent: September 27, 2016Assignee: EMC CorporationInventors: Ming Benjamin Zhu, R. Hugo Patterson, Kai Li
-
Patent number: 9418133Abstract: Data replication with delta compression is disclosed. A primary system and a replica system are determined to both have an identical first data segment that is similar to a second data segment. The second data segment is encoded, wherein the encoding refers to the first data segment.Type: GrantFiled: August 22, 2014Date of Patent: August 16, 2016Assignee: EMC CorporationInventors: Mark Huang, Philip Shilane, Grant Wallace, Ming Benjamin Zhu
-
Patent number: 9348797Abstract: Determining a summary feature set is disclosed. A plurality of subsegments of a first segment are selected. For each subsegment, a plurality of values by applying a set of functions to each subsegment are computed. From all the values computed for all the subsegments, a first subset of values is selected.Type: GrantFiled: August 5, 2013Date of Patent: May 24, 2016Assignee: EMC CorporationInventors: Kai Li, Ming Benjamin Zhu
-
Publication number: 20160034212Abstract: A system and method are disclosed for providing efficient data storage. A plurality of data segments is received in a data stream. The system determines whether a data segment has been stored previously in a low latency memory. In the event that the data segment is determined to have been stored previously, an identifier for the previously stored data segment is returned.Type: ApplicationFiled: June 9, 2015Publication date: February 4, 2016Inventors: Ming Benjamin Zhu, R. Hugo Patterson, Kai Li
-
Publication number: 20150261779Abstract: Stream locality delta compression is disclosed. A previous stream indicated locale of data segments is selected. A first data segment is then determined to be similar to a data segment in the stream indicated locale.Type: ApplicationFiled: May 27, 2015Publication date: September 17, 2015Inventors: Mark Huang, Philip Shilane, Grant Wallace, Nitin Garg, Edward K. Lee, Ming Benjamin Zhu, Kai Li
-
Patent number: 9081728Abstract: A system and method are disclosed for providing efficient data storage. A plurality of data segments is received in a data stream. The system preliminarily checks in a memory having a relatively low latency whether one of the plurality of data segments may have been stored previously in a data segment repository. The memory having the relatively low latency stores data segment information. In the event that the preliminary check determines that one of the plurality of data segments may have been stored in the data segment repository, a memory having a relatively higher latency is checked to determine whether the data segment has been stored previously in the data segment repository.Type: GrantFiled: October 31, 2013Date of Patent: July 14, 2015Assignee: EMC CorporationInventors: Ming Benjamin Zhu, R. Hugo Patterson, Kai Li
-
Patent number: 9069785Abstract: Stream locality delta compression is disclosed. A previous stream indicated locale of data segments is selected. A first data segment is then determined to be similar to a data segment in the stream indicated locale.Type: GrantFiled: February 11, 2013Date of Patent: June 30, 2015Assignee: EMC CorporationInventors: Mark Huang, Philip Shilane, Grant Wallace, Nitin Garg, Edward K. Lee, Ming Benjamin Zhu, Kai Li
-
Publication number: 20150052103Abstract: Data replication with delta compression is disclosed. A primary system and a replica system are determined to both have an identical first data segment that is similar to a second data segment. The second data segment is encoded, wherein the encoding refers to the first data segment.Type: ApplicationFiled: August 22, 2014Publication date: February 19, 2015Inventors: Mark Huang, Philip Shilane, Grant Wallace, Ming Benjamin Zhu
-
Publication number: 20140337363Abstract: Transmitting or storing subsegments is disclosed. A data stream or a data block is received and broken into a plurality of segments. For at least one segment, the segment is broken into a plurality of subsegments. A previously stored or transmitted segment similar to the at least one segment is identified. A fingerprint is computed for at least one subsegment. And, using the fingerprint for the at least one subsegment, determining whether the at least one subsegment is identical to a subsegment of the previously stored or transmitted segment without directly comparing the content of the at least one subsegment with the content of the subsegment of the previously stored or transmitted segment.Type: ApplicationFiled: May 16, 2014Publication date: November 13, 2014Applicant: Data Domain, Inc.Inventors: R. Hugo Patterson, Ming Benjamin Zhu
-
Patent number: 8849772Abstract: Data replication with delta compression is disclosed. A primary system and a replica system are determined to both have an identical first data segment that is similar to a second data segment. The second data segment is encoded, wherein the encoding refers to the first data segment.Type: GrantFiled: November 14, 2008Date of Patent: September 30, 2014Assignee: EMC CorporationInventors: Mark Huang, Philip Shilane, Grant Wallace, Ming Benjamin Zhu
-
Patent number: 8768895Abstract: Transmitting or storing subsegments is disclosed. A data stream or a data block is received and broken into a plurality of segments. For at least one segment, the segment is broken into a plurality of subsegments. A previously stored or transmitted segment similar to the at least one segment is identified. A fingerprint is computed for at least one subsegment. And, using the fingerprint for the at least one subsegment, determining whether the at least one subsegment is identical to a subsegment of the previously stored or transmitted segment without directly comparing the content of the at least one subsegment with the content of the subsegment of the previously stored or transmitted segment.Type: GrantFiled: May 18, 2007Date of Patent: July 1, 2014Assignee: EMC CorporationInventors: R. Hugo Patterson, Ming Benjamin Zhu
-
Patent number: 8751462Abstract: Delta compression after identity deduplication is disclosed. A first data segment is determined to be identical to a first previous data segment. A second data segment, not determined to be identical to a second previous data segment, is then determined to be similar to a third previous data segment.Type: GrantFiled: November 14, 2008Date of Patent: June 10, 2014Assignee: EMC CorporationInventors: Mark Huang, Edward K. Lee, Kai Li, Philip Shilane, Grant Wallace, Ming Benjamin Zhu
-
Publication number: 20140129790Abstract: A system and method are disclosed for providing efficient data storage. A plurality of data segments is received in a data stream. The system preliminarily checks in a memory having a relatively low latency whether one of the plurality of data segments may have been stored previously in a data segment repository. The memory having the relatively low latency stores data segment information. In the event that the preliminary check determines that one of the plurality of data segments may have been stored in the data segment repository, a memory having a relatively higher latency is checked to determine whether the data segment has been stored previously in the data segment repository.Type: ApplicationFiled: October 31, 2013Publication date: May 8, 2014Applicant: EMC CorporationInventors: Ming Benjamin Zhu, R. Hugo Patterson, Kai Li
-
Publication number: 20140040341Abstract: Determining a summary feature set is disclosed. A plurality of subsegments of a first segment are selected. For each subsegment, a plurality of values by applying a set of functions to each subsegment are computed. From all the values computed for all the subsegments, a first subset of values is selected.Type: ApplicationFiled: August 5, 2013Publication date: February 6, 2014Inventors: Kai Li, Ming Benjamin Zhu
-
Patent number: 8612696Abstract: A system and method are disclosed for providing efficient data storage. A plurality of data segments is received in a data stream. The system preliminarily checks in a memory having a relatively low latency whether one of the plurality of data segments may have been stored previously in a data segment repository. The memory having the relatively low latency stores data segment information. In the event that the preliminary check determines that one of the plurality of data segments may have been stored in the data segment repository, a memory having a relatively higher latency is checked to determine whether the data segment has been stored previously in the data segment repository.Type: GrantFiled: August 23, 2012Date of Patent: December 17, 2013Assignee: EMC CorporationInventors: Ming Benjamin Zhu, Kai Li, R. Hugo Patterson