Patents by Inventor Moses Charikar

Moses Charikar has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11074723
    Abstract: Lossless compression of fragmented image data is disclosed. In some embodiments, a stream of information comprising data elements having statistical characteristics is received. An encoded output is produced by an encoder comprising a data compressor that implements a variable length code that is adapted to the statistical characteristics of the data elements. The output and information from which the variable length code can be derived are stored.
    Type: Grant
    Filed: March 19, 2019
    Date of Patent: July 27, 2021
    Assignee: EMC IP Holding Company LLC
    Inventors: Moses Charikar, Deepa Ramakrishna
  • Patent number: 11049283
    Abstract: Lossless compression of fragmented image data is disclosed. In some embodiments, a stream of information is received, wherein the stream of information comprises a sequence of tuples and wherein each of the tuples comprises data elements corresponding to one of a plurality of input channels. A channel transformer is employed to rearrange the data elements into a plurality of output channels for an output stream wherein the output channels have higher compressibility than the input channels. The compressed output stream is stored.
    Type: Grant
    Filed: February 14, 2019
    Date of Patent: June 29, 2021
    Assignee: EMC IP Holding Company LLC
    Inventors: Moses Charikar, Deepa Ramakrishna
  • Publication number: 20190266257
    Abstract: A query that includes an entity is received. One or more entities from a plurality of entities that are similar to the entity included in the query are determined based on a sim hash associated with the entity included in the query and one or more corresponding sim hashes associated with the one or more entities. The sim hash associated with the entity included in the query and the corresponding sim hashes associated with the entity are based on a plurality of random hyperplanes. A content feed is updated based on the determined one or more entities.
    Type: Application
    Filed: March 15, 2018
    Publication date: August 29, 2019
    Inventors: Vishnu Priya Natchu, Moses Charikar
  • Publication number: 20190213758
    Abstract: Lossless compression of fragmented image data is disclosed. In some embodiments, a stream of information comprising data elements having statistical characteristics is received. An encoded output is produced by an encoder comprising a data compressor that implements a variable length code that is adapted to the statistical characteristics of the data elements. The output and information from which the variable length code can be derived are stored.
    Type: Application
    Filed: March 19, 2019
    Publication date: July 11, 2019
    Inventors: Moses Charikar, Deepa Ramakrishna
  • Publication number: 20190180478
    Abstract: Lossless compression of fragmented image data is disclosed. In some embodiments, a stream of information is received, wherein the stream of information comprises a sequence of tuples and wherein each of the tuples comprises data elements corresponding to one of a plurality of input channels. A channel transformer is employed to rearrange the data elements into a plurality of output channels for an output stream wherein the output channels have higher compressibility than the input channels. The compressed output stream is stored.
    Type: Application
    Filed: February 14, 2019
    Publication date: June 13, 2019
    Inventors: Moses Charikar, Deepa Ramakrishna
  • Patent number: 10282863
    Abstract: Lossless compression of fragmented image data is disclosed. In some embodiments, a stream of information comprising data elements having statistical characteristics is received. An encoded output is produced by an encoder comprising a data compressor that implements a variable length code that is adapted to the statistical characteristics of the data elements. The output and information from which the variable length code can be derived are stored.
    Type: Grant
    Filed: May 18, 2017
    Date of Patent: May 7, 2019
    Assignee: EMC IP Holding Company LLC
    Inventors: Moses Charikar, Deepa Ramakrishna
  • Patent number: 10249059
    Abstract: Lossless compression of fragmented image data is disclosed. In some embodiments, a stream of information is received, wherein the stream of information comprises a sequence of tuples and wherein each of the tuples comprises data elements corresponding to one of a plurality of input channels. A channel transformer is employed to rearrange the data elements into a plurality of output channels for an output stream wherein the output channels have higher compressibility than the input channels. The compressed output stream is stored.
    Type: Grant
    Filed: December 20, 2016
    Date of Patent: April 2, 2019
    Assignee: EMC IP Holding Company LLC
    Inventors: Moses Charikar, Deepa Ramakrishna
  • Patent number: 10114839
    Abstract: Format identification for fragmented data is disclosed. In some embodiments, an input stream of information that is divided into fragments is received. Fragment boundaries are determined and a data format for each fragment is found based on continuity properties including by: dividing the stream of information into windows, determining whether each window has a known or unknown format; and comparing portions of windows having an unknown format with neighboring windows to determine fragment boundaries. The stream of information is compressed using a compression technique selected based on the data format, and the compressed stream is stored.
    Type: Grant
    Filed: June 30, 2013
    Date of Patent: October 30, 2018
    Assignee: EMC IP Holding Company LLC
    Inventors: Moses Charikar, Deepa Ramakrishna
  • Publication number: 20170256074
    Abstract: Lossless compression of fragmented image data is disclosed. In some embodiments, a stream of information comprising data elements having statistical characteristics is received. An encoded output is produced by an encoder comprising a data compressor that implements a variable length code that is adapted to the statistical characteristics of the data elements. The output and information from which the variable length code can be derived are stored.
    Type: Application
    Filed: May 18, 2017
    Publication date: September 7, 2017
    Inventors: Moses Charikar, Deepa Ramakrishna
  • Patent number: 9684974
    Abstract: Lossless compression of fragmented image data is disclosed. In some embodiments, a stream of information comprising data elements having statistical characteristics is received. An encoded output is produced by an encoder comprising a data compressor that implements a variable length code that is adapted to the statistical characteristics of the data elements. The output and information from which the variable length code can be derived are stored.
    Type: Grant
    Filed: June 30, 2013
    Date of Patent: June 20, 2017
    Assignee: EMC IP Holding Company LLC
    Inventors: Moses Charikar, Deepa Ramakrishna
  • Publication number: 20170104999
    Abstract: Lossless compression of fragmented image data is disclosed. In some embodiments, a stream of information is received, wherein the stream of information comprises a sequence of tuples and wherein each of the tuples comprises data elements corresponding to one of a plurality of input channels. A channel transformer is employed to rearrange the data elements into a plurality of output channels for an output stream wherein the output channels have higher compressibility than the input channels. The compressed output stream is stored.
    Type: Application
    Filed: December 20, 2016
    Publication date: April 13, 2017
    Inventors: Moses Charikar, Deepa Ramakrishna
  • Patent number: 9558566
    Abstract: Lossless compression of fragmented image data is disclosed. In some embodiments, a stream of information is received, wherein the stream of information comprises a sequence of tuples and wherein each of the tuples comprises data elements corresponding to one of a plurality of input channels. A channel transformer is employed to rearrange the data elements into a plurality of output channels for an output stream wherein the output channels have higher compressibility than the input channels. The compressed output stream is stored.
    Type: Grant
    Filed: June 30, 2013
    Date of Patent: January 31, 2017
    Assignee: EMC IP Holding Company LLC
    Inventors: Moses Charikar, Deepa Ramakrishna
  • Patent number: 9495390
    Abstract: Format identification for fragmented data is disclosed. In some embodiments, an input stream of information that is divided into fragments is received. Fragments are determined and a data format for each fragment is found based on continuity properties including by: determining a physical to canonical layout transformation, determining a tuple size, and partitioning bit positions in tuples into disjoint groups wherein each group corresponds to a channel. The stream of information is compressed using a compression technique selected based on the data format, and the compressed stream is stored.
    Type: Grant
    Filed: June 30, 2013
    Date of Patent: November 15, 2016
    Assignee: EMC Corporation
    Inventors: Moses Charikar, Deepa Ramakrishna
  • Patent number: 9384218
    Abstract: Format identification for fragmented data is disclosed. In some embodiments, an input stream of information that includes a continuity property is received. A format identifier of at least a portion of the stream is determined, wherein the format identifier includes a data representation size, a group size, and an alignment that is consistent with the continuity property. The stream of information is compressed using a compression technique selected based on the format identifier to produce a compressed stream, and the compressed stream is stored.
    Type: Grant
    Filed: June 30, 2013
    Date of Patent: July 5, 2016
    Assignee: EMC Corporation
    Inventors: Moses Charikar, Deepa Ramakrishna
  • Publication number: 20140059091
    Abstract: Format identification for fragmented data is disclosed. In some embodiments, an input stream of information that is divided into fragments is received. Fragments are determined and a data format for each fragment is found based on continuity properties including by: determining a physical to canonical layout transformation, determining a tuple size, and partitioning bit positions in tuples into disjoint groups wherein each group corresponds to a channel. The stream of information is compressed using a compression technique selected based on the data format, and the compressed stream is stored.
    Type: Application
    Filed: June 30, 2013
    Publication date: February 27, 2014
    Inventors: Moses Charikar, Deepa Ramakrishna
  • Publication number: 20140056532
    Abstract: Lossless compression of fragmented image data is disclosed. In some embodiments, a stream of information is received, wherein the stream of information comprises a sequence of tuples and wherein each of the tuples comprises data elements corresponding to one of a plurality of input channels. A channel transformer is employed to rearrange the data elements into a plurality of output channels for an output stream wherein the output channels have higher compressibility than the input channels. The compressed output stream is stored.
    Type: Application
    Filed: June 30, 2013
    Publication date: February 27, 2014
    Inventors: Moses Charikar, Deepa Ramakrishna
  • Publication number: 20140059022
    Abstract: Format identification for fragmented data is disclosed. In some embodiments, an input stream of information that is divided into fragments is received. Fragment boundaries are determined and a data format for each fragment is found based on continuity properties including by: dividing the stream of information into windows, determining whether each window has a known or unknown format; and comparing portions of windows having an unknown format with neighboring windows to determine fragment boundaries. The stream of information is compressed using a compression technique selected based on the data format, and the compressed stream is stored.
    Type: Application
    Filed: June 30, 2013
    Publication date: February 27, 2014
    Inventors: Moses Charikar, Deepa Ramakrishna
  • Publication number: 20140056533
    Abstract: Lossless compression of fragmented image data is disclosed. In some embodiments, a stream of information comprising data elements having statistical characteristics is received. An encoded output is produced by an encoder comprising a data compressor that implements a variable length code that is adapted to the statistical characteristics of the data elements. The output and information from which the variable length code can be derived are stored.
    Type: Application
    Filed: June 30, 2013
    Publication date: February 27, 2014
    Inventors: Moses Charikar, Deepa Ramakrishna
  • Publication number: 20140059021
    Abstract: Format identification for fragmented data is disclosed. In some embodiments, an input stream of information that includes a continuity property is received. A format identifier of at least a portion of the stream is determined, wherein the format identifier includes a data representation size, a group size, and an alignment that is consistent with the continuity property. The stream of information is compressed using a compression technique selected based on the format identifier to produce a compressed stream, and the compressed stream is stored.
    Type: Application
    Filed: June 30, 2013
    Publication date: February 27, 2014
    Inventors: Moses Charikar, Deepa Ramakrishna
  • Patent number: 7966327
    Abstract: A content-addressable and -searchable storage system for managing and exploring massive amounts of feature-rich data such as images, audio or scientific data, is shown. A segmentation and feature extraction unit segments data corresponding to an object into a plurality of data segments and generates a feature vector for each data segment. A sketch construction component converts the feature vector into a compact bit-vector corresponding to the object. The system also has a similarity index having plurality of compact bit-vectors corresponding to a plurality of objects and an index insertion component for inserting a compact bit-vector corresponding to an object into the similarity index. The system may further have an indexing unit for identifying a candidate set of objects from said similarity index based upon a compact bit-vector corresponding to a query object.
    Type: Grant
    Filed: September 7, 2005
    Date of Patent: June 21, 2011
    Assignee: The Trustees of Princeton University
    Inventors: Kai Li, Qin Lv, Moses Charikar
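
Illustrative note: the abstract of publication 20190266257 above refers to sim hashes based on a plurality of random hyperplanes. The following is a minimal sketch of that general idea, not the method claimed in the application. It assumes each entity is already represented as a numeric feature vector, and every name in it (make_hyperplanes, sim_hash, hamming_distance, most_similar) is hypothetical. Each hash bit records which side of one random hyperplane the vector falls on, so vectors pointing in similar directions tend to agree on most bits.

```python
import random


def make_hyperplanes(num_bits, dim, seed=0):
    """Draw num_bits random hyperplane normals with Gaussian components."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_bits)]


def sim_hash(vector, hyperplanes):
    """Pack one bit per hyperplane: 1 if the dot product is non-negative."""
    bits = 0
    for plane in hyperplanes:
        dot = sum(p * v for p, v in zip(plane, vector))
        bits = (bits << 1) | (1 if dot >= 0 else 0)
    return bits


def hamming_distance(a, b):
    """Count the bit positions where two hashes differ."""
    return bin(a ^ b).count("1")


def most_similar(query_hash, entity_hashes, k=1):
    """Return the k entity names whose hashes are closest to the query hash."""
    ranked = sorted(
        entity_hashes,
        key=lambda name: hamming_distance(query_hash, entity_hashes[name]),
    )
    return ranked[:k]
```

In this sketch the same hyperplanes must be used for the query and for every stored entity, since hashes built from different hyperplanes are not comparable. A smaller Hamming distance between two hashes suggests a smaller angle between the underlying vectors.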