Patents by Inventor Ming Benjamin Zhu

Ming Benjamin Zhu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20130232125
    Abstract: Stream locality delta compression is disclosed. A previous stream indicated locale of data segments is selected. A first data segment is then determined to be similar to a data segment in the stream indicated locale.
    Type: Application
    Filed: February 11, 2013
    Publication date: September 5, 2013
    Applicant: EMC Corporation
    Inventors: Mark Huang, Philip Shilane, Grant Wallace, Nitin Garg, Edward K. Lee, Ming Benjamin Zhu, Kai Li
  • Patent number: 8527568
    Abstract: Determining a summary feature set is disclosed. A plurality of subsegments of a first segment are selected. For each subsegment, a plurality of values by applying a set of functions to each subsegment are computed. From all the values computed for all the subsegments, a first subset of values is selected.
    Type: Grant
    Filed: October 22, 2010
    Date of Patent: September 3, 2013
    Assignee: EMC Corporation
    Inventors: Kai Li, Ming Benjamin Zhu
  • Patent number: 8447740
    Abstract: Stream locality delta compression is disclosed. A previous stream indicated locale of data segments is selected. A first data segment is then determined to be similar to a data segment in the stream indicated locale.
    Type: Grant
    Filed: November 14, 2008
    Date of Patent: May 21, 2013
    Assignee: EMC Corporation
    Inventors: Mark Huang, Philip Shilane, Grant Wallace, Nitin Garg, Edward K. Lee, Ming Benjamin Zhu, Kai Li
  • Publication number: 20120317381
    Abstract: A system and method are disclosed for providing efficient data storage. A plurality of data segments is received in a data stream. The system preliminarily checks in a memory having a relatively low latency whether one of the plurality of data segments may have been stored previously in a data segment repository. The memory having the relatively low latency stores data segment information. In the event that the preliminary check determines that one of the plurality of data segments may have been stored in the data segment repository, a memory having a relatively higher latency is checked to determine whether the data segment has been stored previously in the data segment repository.
    Type: Application
    Filed: August 23, 2012
    Publication date: December 13, 2012
    Applicant: EMC Corporation
    Inventors: Ming Benjamin Zhu, Kai Li, R. Hugo Patterson
  • Patent number: 8312006
    Abstract: Storage of data segments is disclosed. For each segment, a similar segment to the segment is identified, wherein the similar segment is already managed by a cluster node. In the event the similar segment is identified, a reference to the similar segment and a delta between the similar segment and the segment are caused to be stored instead of the segment.
    Type: Grant
    Filed: April 19, 2011
    Date of Patent: November 13, 2012
    Assignee: EMC Corporation
    Inventors: R. Hugo Patterson, Kai Li, Ming Benjamin Zhu, Sazzala Venkata Reddy, Umesh Maheshwari, Edward K. Lee
  • Patent number: 8275955
    Abstract: A system and method are disclosed for providing efficient data storage. A plurality of data segments is received in a data stream. The system determines whether a data segment has been stored previously in a low latency memory. In the event that the data segment is determined to have been stored previously, an identifier for the previously stored data segment is returned.
    Type: Grant
    Filed: June 21, 2010
    Date of Patent: September 25, 2012
    Assignee: EMC Corporation
    Inventors: Ming Benjamin Zhu, R. Hugo Patterson, Kai Li
  • Patent number: 8145863
    Abstract: Storage using resemblance of data segments is disclosed. It is determined that a new segment resembles a second prior stored segment wherein the second prior stored segment is represented as a first stored delta and a first prior stored segment. A second delta between the new segment and the prior stored segment is determined. A representation of the new segment based at least in part on the second delta is stored.
    Type: Grant
    Filed: April 13, 2011
    Date of Patent: March 27, 2012
    Assignee: EMC Corporation
    Inventor: Ming Benjamin Zhu
  • Publication number: 20110196869
    Abstract: Storage of data segments is disclosed. For each segment, a similar segment to the segment is identified, wherein the similar segment is already managed by a cluster node. In the event the similar segment is identified, a reference to the similar segment and a delta between the similar segment and the segment are caused to be stored instead of the segment.
    Type: Application
    Filed: April 19, 2011
    Publication date: August 11, 2011
    Applicants: EMC CORPORATION
    Inventors: R. Hugo Patterson, Kai Li, Ming Benjamin Zhu, Sazzala Venkata Reddy, Umesh Maheshwari, Edward K. Lee
  • Publication number: 20110191560
    Abstract: Storage using resemblance of data segments is disclosed. It is determined that a new segment resembles a second prior stored segment wherein the second prior stored segment is represented as a first stored delta and a first prior stored segment. A second delta between the new segment and the prior stored segment is determined. A representation of the new segment based at least in part on the second delta is stored.
    Type: Application
    Filed: April 13, 2011
    Publication date: August 4, 2011
    Applicant: EMC CORPORATION
    Inventor: Ming Benjamin Zhu
  • Patent number: 7962520
    Abstract: Cluster storage is disclosed. A data stream or a data block is received. The data stream or the data block is broken into segments. For each segment, a cluster node is selected, and in the event that a similar segment to the segment is identified that is already managed by the selected cluster node, a reference to the similar segment and a delta between the similar segment and the segment is caused to be stored on the selected cluster node.
    Type: Grant
    Filed: April 9, 2008
    Date of Patent: June 14, 2011
    Assignee: EMC Corporation
    Inventors: R. Hugo Patterson, Kai Li, Ming Benjamin Zhu, Sazzala Venkata Reddy, Umesh Maheshwari, Edward K. Lee
  • Patent number: 7949824
    Abstract: Storage using resemblance of data segments is disclosed. It is determined that a new segment resembles a second prior stored segment wherein the second prior stored segment is represented as a first stored delta and a first prior stored segment. A second delta between the new segment and the prior stored segment is determined. A representation of the new segment based at least in part on the second delta is stored.
    Type: Grant
    Filed: April 11, 2006
    Date of Patent: May 24, 2011
    Assignee: EMC Corporation
    Inventor: Ming Benjamin Zhu
  • Publication number: 20110040819
    Abstract: Determining a summary feature set is disclosed. A plurality of subsegments of a first segment are selected. For each subsegment, a plurality of values by applying a set of functions to each subsegment are computed. From all the values computed for all the subsegments, a first subset of values is selected.
    Type: Application
    Filed: October 22, 2010
    Publication date: February 17, 2011
    Applicant: EMC CORPORATION
    Inventors: Kai Li, Ming Benjamin Zhu
  • Patent number: 7882064
    Abstract: File system replication includes determining whether one of a plurality of files included in an original file system has been updated since a previous replication, the file having a plurality of data segments, and in the event that the file has been updated, locating among the plurality of data segments a previously stored data segment that is newly referenced by the file, and that does not require replication.
    Type: Grant
    Filed: July 6, 2006
    Date of Patent: February 1, 2011
    Assignee: EMC Corporation
    Inventors: Edward K. Lee, Ming Benjamin Zhu, Umesh Maheshwari, R. Hugo Patterson
  • Patent number: 7844652
    Abstract: Determining a summary feature set is disclosed. A plurality of subsegments of a first segment are selected. For each subsegment, a plurality of values by applying a set of functions to each subsegment are computed. From all the values computed for all the subsegments, a first subset of values is selected.
    Type: Grant
    Filed: April 11, 2006
    Date of Patent: November 30, 2010
    Assignee: EMC Corporation
    Inventors: Kai Li, Ming Benjamin Zhu
  • Publication number: 20100257315
    Abstract: A system and method are disclosed for providing efficient data storage. A plurality of data segments is received in a data stream. The system determines whether a data segment has been stored previously in a low latency memory. In the event that the data segment is determined to have been stored previously, an identifier for the previously stored data segment is returned.
    Type: Application
    Filed: June 21, 2010
    Publication date: October 7, 2010
    Applicant: EMC CORPORATION
    Inventors: Ming Benjamin Zhu, Kai Li, R. Hugo Patterson
  • Patent number: 7769967
    Abstract: A system and method are disclosed for providing efficient data storage. A plurality of data segments is received in a data stream. The system determines whether a data segment has been stored previously in a low latency memory. In the event that the data segment is determined to have been stored previously, an identifier for the previously stored data segment is returned.
    Type: Grant
    Filed: March 28, 2008
    Date of Patent: August 3, 2010
    Assignee: EMC Corporation
    Inventors: Ming Benjamin Zhu, Kai Li, R. Hugo Patterson
  • Patent number: 7747581
    Abstract: A network file system-based data storage system that converts random I/O requests into a piecewise sequential data structure to facilitate variable length data segment redundancy identification and elimination. For one embodiment of the invention a stateless network file system is employed. For one such embodiment, that provides multiple-client access to stored data, multiple Writes are buffered and then broken into variable length data segments. Redundant segment elimination is then effected. One embodiment of the invention allows sharing of the variable length data segments among files.
    Type: Grant
    Filed: April 19, 2007
    Date of Patent: June 29, 2010
    Assignee: EMC Corporation
    Inventors: Kai Li, R. Hugo Patterson, Ming Benjamin Zhu, Allan Bricker, Richard Johnsson, Sazzala Reddy, Jeffery Zabarsky
  • Publication number: 20100125553
    Abstract: Delta compression after identity deduplication is disclosed. A first data segment is determined to be identical to a first previous data segment. A second data segment, not determined to be identical to a second previous data segment, is then determined to be similar to a third previous data segment.
    Type: Application
    Filed: November 14, 2008
    Publication date: May 20, 2010
    Inventors: Mark Huang, Edward K. Lee, Kai Li, Philip Shilane, Grant Wallace, Ming Benjamin Zhu
  • Patent number: 7689633
    Abstract: A network file system-based data storage system that converts random I/O requests into a piecewise sequential data structure to facilitate variable length data segment redundancy identification and elimination. For one embodiment of the invention a stateless network file system is employed. For one such embodiment, that provides multiple-client access to stored data, multiple Writes are buffered and then broken into variable length data segments. Redundant segment elimination is then effected. One embodiment of the invention allows sharing of the variable length data segments among files.
    Type: Grant
    Filed: September 15, 2004
    Date of Patent: March 30, 2010
    Assignee: Data Domain, Inc.
    Inventors: Kai Li, R. Hugo Patterson, Ming Benjamin Zhu, Allan Bricker, Richard Johnsson, Sazzala Reddy, Jeffery Zabarsky
  • Patent number: 7631144
    Abstract: A method for storing data is disclosed. The method comprises receiving a data stream comprising a plurality of data segments and preliminarily checking in a memory having a relatively low latency whether one of the plurality of data segments has been stored previously. The method further comprises in the event that the preliminary check does not conclusively determine whether the data segment has been stored previously, limiting checking in a memory having a relatively high latency to conclusively determine whether the data segment has been previously stored, and in the event that checking is limited or in the event that the check in the memory having relatively high latency conclusively determines the data segment has not been previously stored, storing the data segment.
    Type: Grant
    Filed: September 13, 2004
    Date of Patent: December 8, 2009
    Assignee: DataDomain, Inc.
    Inventors: Ming Benjamin Zhu, R. Hugo Patterson, Allan J. Bricker, Edward K. Lee