Patents by Inventor Ming Benjamin Zhu

Ming Benjamin Zhu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 9841898
    Abstract: Transmitting or storing subsegments is disclosed. A data stream or a data block is received and broken into a plurality of segments. For at least one segment, the segment is broken into a plurality of subsegments. A previously stored or transmitted segment similar to the at least one segment is identified. A fingerprint is computed for at least one subsegment. And, using the fingerprint for the at least one subsegment, determining whether the at least one subsegment is identical to a subsegment of the previously stored or transmitted segment without directly comparing the content of the at least one subsegment with the content of the subsegment of the previously stored or transmitted segment.
    Type: Grant
    Filed: May 16, 2014
    Date of Patent: December 12, 2017
    Assignee: EMC IP Holding Company LLC
    Inventors: R. Hugo Patterson, Ming Benjamin Zhu
  • Patent number: 9823975
    Abstract: Determining a summary feature set is disclosed. A plurality of subsegments of a first segment are selected. For each subsegment, a plurality of values by applying a set of functions to each subsegment are computed. From all the values computed for all the subsegments, a first subset of values is selected.
    Type: Grant
    Filed: April 26, 2016
    Date of Patent: November 21, 2017
    Assignee: EMC IP Holding Company LLC
    Inventors: Kai Li, Ming Benjamin Zhu
  • Patent number: 9690802
    Abstract: Stream locality delta compression is disclosed. A previous stream indicated locale of data segments is selected. A first data segment is then determined to be similar to a data segment in the stream indicated locale.
    Type: Grant
    Filed: May 27, 2015
    Date of Patent: June 27, 2017
    Assignee: EMC IP Holding Company LLC
    Inventors: Mark Huang, Philip Shilane, Grant Wallace, Nitin Garg, Edward K. Lee, Ming Benjamin Zhu, Kai Li
  • Patent number: 9535624
    Abstract: A method of managing duplicate segments from a segmented file storage system is disclosed. The method comprises indexing a segment according to a key for the segments wherein the index includes an identification of a first data location where the segment is stored and identifying a duplicate segment having the same key that is stored in a second location. The method further comprises determining that the duplicate segment is an undesired duplicate segment and eliminating the undesired duplicate segment.
    Type: Grant
    Filed: August 4, 2005
    Date of Patent: January 3, 2017
    Assignee: EMC IP Holding Company LLC
    Inventors: R. Hugo Patterson, Ming Benjamin Zhu, Edward K. Lee
  • Publication number: 20160314139
    Abstract: Determining a summary feature set is disclosed. A plurality of subsegments of a first segment are selected. For each subsegment, a plurality of values by applying a set of functions to each subsegment are computed. From all the values computed for all the subsegments, a first subset of values is selected.
    Type: Application
    Filed: April 26, 2016
    Publication date: October 27, 2016
    Inventors: Kai Li, Ming Benjamin Zhu
  • Patent number: 9454318
    Abstract: A system and method are disclosed for providing efficient data storage. A plurality of data segments is received in a data stream. The system determines whether a data segment has been stored previously in a low latency memory. In the event that the data segment is determined to have been stored previously, an identifier for the previously stored data segment is returned.
    Type: Grant
    Filed: June 9, 2015
    Date of Patent: September 27, 2016
    Assignee: EMC Corporation
    Inventors: Ming Benjamin Zhu, R. Hugo Patterson, Kai Li
  • Patent number: 9418133
    Abstract: Data replication with delta compression is disclosed. A primary system and a replica system are determined to both have an identical first data segment that is similar to a second data segment. The second data segment is encoded, wherein the encoding refers to the first data segment.
    Type: Grant
    Filed: August 22, 2014
    Date of Patent: August 16, 2016
    Assignee: EMC Corporation
    Inventors: Mark Huang, Philip Shilane, Grant Wallace, Ming Benjamin Zhu
  • Patent number: 9348797
    Abstract: Determining a summary feature set is disclosed. A plurality of subsegments of a first segment are selected. For each subsegment, a plurality of values by applying a set of functions to each subsegment are computed. From all the values computed for all the subsegments, a first subset of values is selected.
    Type: Grant
    Filed: August 5, 2013
    Date of Patent: May 24, 2016
    Assignee: EMC Corporation
    Inventors: Kai Li, Ming Benjamin Zhu
  • Publication number: 20160034212
    Abstract: A system and method are disclosed for providing efficient data storage. A plurality of data segments is received in a data stream. The system determines whether a data segment has been stored previously in a low latency memory. In the event that the data segment is determined to have been stored previously, an identifier for the previously stored data segment is returned.
    Type: Application
    Filed: June 9, 2015
    Publication date: February 4, 2016
    Inventors: Ming Benjamin Zhu, R. Hugo Patterson, Kai Li
  • Publication number: 20150261779
    Abstract: Stream locality delta compression is disclosed. A previous stream indicated locale of data segments is selected. A first data segment is then determined to be similar to a data segment in the stream indicated locale.
    Type: Application
    Filed: May 27, 2015
    Publication date: September 17, 2015
    Inventors: Mark Huang, Philip Shilane, Grant Wallace, Nitin Garg, Edward K. Lee, Ming Benjamin Zhu, Kai Li
  • Patent number: 9081728
    Abstract: A system and method are disclosed for providing efficient data storage. A plurality of data segments is received in a data stream. The system preliminarily checks in a memory having a relatively low latency whether one of the plurality of data segments may have been stored previously in a data segment repository. The memory having the relatively low latency stores data segment information. In the event that the preliminary check determines that one of the plurality of data segments may have been stored in the data segment repository, a memory having a relatively higher latency is checked to determine whether the data segment has been stored previously in the data segment repository.
    Type: Grant
    Filed: October 31, 2013
    Date of Patent: July 14, 2015
    Assignee: EMC Corporation
    Inventors: Ming Benjamin Zhu, R. Hugo Patterson, Kai Li
  • Patent number: 9069785
    Abstract: Stream locality delta compression is disclosed. A previous stream indicated locale of data segments is selected. A first data segment is then determined to be similar to a data segment in the stream indicated locale.
    Type: Grant
    Filed: February 11, 2013
    Date of Patent: June 30, 2015
    Assignee: EMC Corporation
    Inventors: Mark Huang, Philip Shilane, Grant Wallace, Nitin Garg, Edward K. Lee, Ming Benjamin Zhu, Kai Li
  • Publication number: 20150052103
    Abstract: Data replication with delta compression is disclosed. A primary system and a replica system are determined to both have an identical first data segment that is similar to a second data segment. The second data segment is encoded, wherein the encoding refers to the first data segment.
    Type: Application
    Filed: August 22, 2014
    Publication date: February 19, 2015
    Inventors: Mark Huang, Philip Shilane, Grant Wallace, Ming Benjamin Zhu
  • Publication number: 20140337363
    Abstract: Transmitting or storing subsegments is disclosed. A data stream or a data block is received and broken into a plurality of segments. For at least one segment, the segment is broken into a plurality of subsegments. A previously stored or transmitted segment similar to the at least one segment is identified. A fingerprint is computed for at least one subsegment. And, using the fingerprint for the at least one subsegment, determining whether the at least one subsegment is identical to a subsegment of the previously stored or transmitted segment without directly comparing the content of the at least one subsegment with the content of the subsegment of the previously stored or transmitted segment.
    Type: Application
    Filed: May 16, 2014
    Publication date: November 13, 2014
    Applicant: Data Domain, Inc.
    Inventors: R. Hugo Patterson, Ming Benjamin Zhu
  • Patent number: 8849772
    Abstract: Data replication with delta compression is disclosed. A primary system and a replica system are determined to both have an identical first data segment that is similar to a second data segment. The second data segment is encoded, wherein the encoding refers to the first data segment.
    Type: Grant
    Filed: November 14, 2008
    Date of Patent: September 30, 2014
    Assignee: EMC Corporation
    Inventors: Mark Huang, Philip Shilane, Grant Wallace, Ming Benjamin Zhu
  • Patent number: 8768895
    Abstract: Transmitting or storing subsegments is disclosed. A data stream or a data block is received and broken into a plurality of segments. For at least one segment, the segment is broken into a plurality of subsegments. A previously stored or transmitted segment similar to the at least one segment is identified. A fingerprint is computed for at least one subsegment. And, using the fingerprint for the at least one subsegment, determining whether the at least one subsegment is identical to a subsegment of the previously stored or transmitted segment without directly comparing the content of the at least one subsegment with the content of the subsegment of the previously stored or transmitted segment.
    Type: Grant
    Filed: May 18, 2007
    Date of Patent: July 1, 2014
    Assignee: EMC Corporation
    Inventors: R. Hugo Patterson, Ming Benjamin Zhu
  • Patent number: 8751462
    Abstract: Delta compression after identity deduplication is disclosed. A first data segment is determined to be identical to a first previous data segment. A second data segment, not determined to be identical to a second previous data segment, is then determined to be similar to a third previous data segment.
    Type: Grant
    Filed: November 14, 2008
    Date of Patent: June 10, 2014
    Assignee: EMC Corporation
    Inventors: Mark Huang, Edward K. Lee, Kai Li, Philip Shilane, Grant Wallace, Ming Benjamin Zhu
  • Publication number: 20140129790
    Abstract: A system and method are disclosed for providing efficient data storage. A plurality of data segments is received in a data stream. The system preliminarily checks in a memory having a relatively low latency whether one of the plurality of data segments may have been stored previously in a data segment repository. The memory having the relatively low latency stores data segment information. In the event that the preliminary check determines that one of the plurality of data segments may have been stored in the data segment repository, a memory having a relatively higher latency is checked to determine whether the data segment has been stored previously in the data segment repository.
    Type: Application
    Filed: October 31, 2013
    Publication date: May 8, 2014
    Applicant: EMC Corporation
    Inventors: Ming Benjamin Zhu, R. Hugo Patterson, Kai Li
  • Publication number: 20140040341
    Abstract: Determining a summary feature set is disclosed. A plurality of subsegments of a first segment are selected. For each subsegment, a plurality of values by applying a set of functions to each subsegment are computed. From all the values computed for all the subsegments, a first subset of values is selected.
    Type: Application
    Filed: August 5, 2013
    Publication date: February 6, 2014
    Inventors: Kai Li, Ming Benjamin Zhu
  • Patent number: 8612696
    Abstract: A system and method are disclosed for providing efficient data storage. A plurality of data segments is received in a data stream. The system preliminarily checks in a memory having a relatively low latency whether one of the plurality of data segments may have been stored previously in a data segment repository. The memory having the relatively low latency stores data segment information. In the event that the preliminary check determines that one of the plurality of data segments may have been stored in the data segment repository, a memory having a relatively higher latency is checked to determine whether the data segment has been stored previously in the data segment repository.
    Type: Grant
    Filed: August 23, 2012
    Date of Patent: December 17, 2013
    Assignee: EMC Corporation
    Inventors: Ming Benjamin Zhu, Kai Li, R. Hugo Patterson