Patents by Inventor Moses Charikar
Moses Charikar has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11074723
Abstract: Lossless compression of fragmented image data is disclosed. In some embodiments, a stream of information comprising data elements having statistical characteristics is received. An encoded output is produced by an encoder comprising a data compressor that implements a variable length code that is adapted to the statistical characteristics of the data elements. The output and information from which the variable length code can be derived are stored.
Type: Grant
Filed: March 19, 2019
Date of Patent: July 27, 2021
Assignee: EMC IP Holding Company LLC
Inventors: Moses Charikar, Deepa Ramakrishna
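The abstract above describes a variable length code adapted to the statistics of the data, stored together with the information needed to re-derive it. As a rough illustration only, the following sketch uses a Huffman code as the variable length code and a symbol frequency table as the stored side information. Both choices, and all function names, are assumptions made for this example and are not taken from the patent.

```python
import heapq
from collections import Counter


def build_code(freqs):
    """Derive a Huffman code {symbol: bitstring} from a frequency table.

    Deterministic for a given table, so the decoder can rebuild the
    same code from the stored frequencies.
    """
    if not freqs:
        return {}
    if len(freqs) == 1:
        return {next(iter(freqs)): "0"}
    # Heap entries: (weight, unique tiebreak, {symbol: partial code}).
    heap = [(w, i, {s: ""}) for i, (s, w) in enumerate(sorted(freqs.items()))]
    heapq.heapify(heap)
    tiebreak = len(heap)
    while len(heap) > 1:
        w1, _, c1 = heapq.heappop(heap)
        w2, _, c2 = heapq.heappop(heap)
        merged = {s: "0" + c for s, c in c1.items()}
        merged.update({s: "1" + c for s, c in c2.items()})
        heapq.heappush(heap, (w1 + w2, tiebreak, merged))
        tiebreak += 1
    return heap[0][2]


def encode(data):
    """Return (bitstring, frequency table); the table is the side information."""
    freqs = dict(Counter(data))
    code = build_code(freqs)
    return "".join(code[s] for s in data), freqs


def decode(bits, freqs):
    """Rebuild the code from the stored table and decode the bitstring."""
    inverse = {c: s for s, c in build_code(freqs).items()}
    out, current = [], ""
    for bit in bits:
        current += bit
        if current in inverse:
            out.append(inverse[current])
            current = ""
    return bytes(out)
```

Frequent symbols receive short codewords, so skewed data needs fewer bits than a fixed 8 bits per byte, and `decode(*encode(data))` recovers the input exactly.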
-
Patent number: 11049283
Abstract: Lossless compression of fragmented image data is disclosed. In some embodiments, a stream of information is received, wherein the stream of information comprises a sequence of tuples and wherein each of the tuples comprises data elements corresponding to one of a plurality of input channels. A channel transformer is employed to rearrange the data elements into a plurality of output channels for an output stream wherein the output channels have higher compressibility than the input channels. The compressed output stream is stored.
Type: Grant
Filed: February 14, 2019
Date of Patent: June 29, 2021
Assignee: EMC IP Holding Company LLC
Inventors: Moses Charikar, Deepa Ramakrishna
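To make the idea of rearranging tuple elements into channels concrete, here is a minimal sketch that de-interleaves a byte stream of fixed-size tuples (for example R, G, B bytes) into one plane per channel and compresses each plane separately. Treating the channel transformer as a plain de-interleave, using `zlib` as the compressor, and all function names are assumptions for illustration. The patent's actual transformation may differ.

```python
import zlib


def split_channels(stream: bytes, n_channels: int) -> list[bytes]:
    """De-interleave a stream of n_channels-byte tuples into per-channel planes."""
    if n_channels <= 0 or len(stream) % n_channels:
        raise ValueError("stream length must be a multiple of n_channels")
    return [stream[c::n_channels] for c in range(n_channels)]


def merge_channels(channels: list[bytes]) -> bytes:
    """Inverse of split_channels: re-interleave planes into tuples."""
    n = len(channels)
    out = bytearray(n * len(channels[0]))
    for c, plane in enumerate(channels):
        out[c::n] = plane
    return bytes(out)


def compress_planar(stream: bytes, n_channels: int) -> list[bytes]:
    """Compress each channel plane independently (lossless)."""
    return [zlib.compress(plane) for plane in split_channels(stream, n_channels)]


def decompress_planar(blobs: list[bytes]) -> bytes:
    """Decompress the planes and restore the original tuple order."""
    return merge_channels([zlib.decompress(blob) for blob in blobs])
```

When values within a channel vary smoothly but differ sharply between channels, grouping each channel's values together tends to give a general-purpose compressor longer repeated runs to exploit. Whether it helps on a particular input has to be measured.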
-
Publication number: 20190266257
Abstract: A query that includes an entity is received. One or more entities from a plurality of entities that are similar to the entity included in the query are determined based on a sim hash associated with the entity included in the query and one or more corresponding sim hashes associated with the one or more entities. The sim hash associated with the entity included in the query and the corresponding sim hashes associated with the entity are based on a plurality of random hyperplanes. A content feed is updated based on the determined one or more entities.
Type: Application
Filed: March 15, 2018
Publication date: August 29, 2019
Inventors: Vishnu Priya Natchu, Moses Charikar
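A sim hash based on random hyperplanes can be sketched as follows: each hyperplane contributes one bit, set according to which side of the hyperplane the entity's feature vector falls on, and similar entities are those whose hashes differ in few bits. Representing entities as numeric feature vectors, and the names `make_hyperplanes`, `sim_hash`, and `most_similar`, are assumptions for this example, not details from the application.

```python
import random


def make_hyperplanes(dim, n_bits, seed=0):
    """Draw n_bits random hyperplane normals with Gaussian components."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_bits)]


def sim_hash(vec, planes):
    """One bit per hyperplane: 1 if vec lies on its non-negative side."""
    h = 0
    for plane in planes:
        dot = sum(v * p for v, p in zip(vec, plane))
        h = (h << 1) | (1 if dot >= 0 else 0)
    return h


def hamming(a, b):
    """Number of differing bits between two hashes."""
    return bin(a ^ b).count("1")


def most_similar(query_vec, entities, planes, k=1):
    """Rank entity names by Hamming distance between their hash and the query's."""
    q = sim_hash(query_vec, planes)
    ranked = sorted(
        entities,
        key=lambda name: hamming(q, sim_hash(entities[name], planes)),
    )
    return ranked[:k]
```

The probability that two vectors disagree on a given bit grows with the angle between them, so the Hamming distance between hashes serves as a cheap estimate of angular similarity. In practice the entity hashes would be computed once and stored rather than recomputed per query.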
-
Publication number: 20190213758
Abstract: Lossless compression of fragmented image data is disclosed. In some embodiments, a stream of information comprising data elements having statistical characteristics is received. An encoded output is produced by an encoder comprising a data compressor that implements a variable length code that is adapted to the statistical characteristics of the data elements. The output and information from which the variable length code can be derived are stored.
Type: Application
Filed: March 19, 2019
Publication date: July 11, 2019
Inventors: Moses Charikar, Deepa Ramakrishna
-
Publication number: 20190180478
Abstract: Lossless compression of fragmented image data is disclosed. In some embodiments, a stream of information is received, wherein the stream of information comprises a sequence of tuples and wherein each of the tuples comprises data elements corresponding to one of a plurality of input channels. A channel transformer is employed to rearrange the data elements into a plurality of output channels for an output stream wherein the output channels have higher compressibility than the input channels. The compressed output stream is stored.
Type: Application
Filed: February 14, 2019
Publication date: June 13, 2019
Inventors: Moses Charikar, Deepa Ramakrishna
-
Patent number: 10282863
Abstract: Lossless compression of fragmented image data is disclosed. In some embodiments, a stream of information comprising data elements having statistical characteristics is received. An encoded output is produced by an encoder comprising a data compressor that implements a variable length code that is adapted to the statistical characteristics of the data elements. The output and information from which the variable length code can be derived are stored.
Type: Grant
Filed: May 18, 2017
Date of Patent: May 7, 2019
Assignee: EMC IP Holding Company LLC
Inventors: Moses Charikar, Deepa Ramakrishna
-
Patent number: 10249059
Abstract: Lossless compression of fragmented image data is disclosed. In some embodiments, a stream of information is received, wherein the stream of information comprises a sequence of tuples and wherein each of the tuples comprises data elements corresponding to one of a plurality of input channels. A channel transformer is employed to rearrange the data elements into a plurality of output channels for an output stream wherein the output channels have higher compressibility than the input channels. The compressed output stream is stored.
Type: Grant
Filed: December 20, 2016
Date of Patent: April 2, 2019
Assignee: EMC IP Holding Company LLC
Inventors: Moses Charikar, Deepa Ramakrishna
-
Patent number: 10114839
Abstract: Format identification for fragmented data is disclosed. In some embodiments, an input stream of information that is divided into fragments is received. Fragment boundaries are determined and a data format for each fragment is found based on continuity properties including by: dividing the stream of information into windows, determining whether each window has a known or unknown format; and comparing portions of windows having an unknown format with neighboring windows to determine fragment boundaries. The stream of information is compressed using a compression technique selected based on the data format, and the compressed stream is stored.
Type: Grant
Filed: June 30, 2013
Date of Patent: October 30, 2018
Assignee: EMC IP Holding Company LLC
Inventors: Moses Charikar, Deepa Ramakrishna
-
Publication number: 20170256074
Abstract: Lossless compression of fragmented image data is disclosed. In some embodiments, a stream of information comprising data elements having statistical characteristics is received. An encoded output is produced by an encoder comprising a data compressor that implements a variable length code that is adapted to the statistical characteristics of the data elements. The output and information from which the variable length code can be derived are stored.
Type: Application
Filed: May 18, 2017
Publication date: September 7, 2017
Inventors: Moses Charikar, Deepa Ramakrishna
-
Patent number: 9684974
Abstract: Lossless compression of fragmented image data is disclosed. In some embodiments, a stream of information comprising data elements having statistical characteristics is received. An encoded output is produced by an encoder comprising a data compressor that implements a variable length code that is adapted to the statistical characteristics of the data elements. The output and information from which the variable length code can be derived are stored.
Type: Grant
Filed: June 30, 2013
Date of Patent: June 20, 2017
Assignee: EMC IP Holding Company LLC
Inventors: Moses Charikar, Deepa Ramakrishna
-
Publication number: 20170104999
Abstract: Lossless compression of fragmented image data is disclosed. In some embodiments, a stream of information is received, wherein the stream of information comprises a sequence of tuples and wherein each of the tuples comprises data elements corresponding to one of a plurality of input channels. A channel transformer is employed to rearrange the data elements into a plurality of output channels for an output stream wherein the output channels have higher compressibility than the input channels. The compressed output stream is stored.
Type: Application
Filed: December 20, 2016
Publication date: April 13, 2017
Inventors: Moses Charikar, Deepa Ramakrishna
-
Patent number: 9558566
Abstract: Lossless compression of fragmented image data is disclosed. In some embodiments, a stream of information is received, wherein the stream of information comprises a sequence of tuples and wherein each of the tuples comprises data elements corresponding to one of a plurality of input channels. A channel transformer is employed to rearrange the data elements into a plurality of output channels for an output stream wherein the output channels have higher compressibility than the input channels. The compressed output stream is stored.
Type: Grant
Filed: June 30, 2013
Date of Patent: January 31, 2017
Assignee: EMC IP Holding Company LLC
Inventors: Moses Charikar, Deepa Ramakrishna
-
Patent number: 9495390
Abstract: Format identification for fragmented data is disclosed. In some embodiments, an input stream of information that is divided into fragments is received. Fragments are determined and a data format for each fragment is found based on continuity properties including by: determining a physical to canonical layout transformation, determining a tuple size, and partitioning bit positions in tuples into disjoint groups wherein each group corresponds to a channel. The stream of information is compressed using a compression technique selected based on the data format, and the compressed stream is stored.
Type: Grant
Filed: June 30, 2013
Date of Patent: November 15, 2016
Assignee: EMC Corporation
Inventors: Moses Charikar, Deepa Ramakrishna
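One plausible way to read "determining a tuple size" from a continuity property is to look for the byte stride at which values change least, since elements of the same channel in neighboring tuples tend to be close in smoothly varying data. The sketch below implements only that heuristic. The scoring function, the `max_stride` parameter, and the function names are assumptions for illustration and are not the method claimed in the patent.

```python
def stride_score(data: bytes, stride: int) -> float:
    """Mean absolute difference between bytes that are `stride` positions apart."""
    pairs = len(data) - stride
    if pairs <= 0:
        return float("inf")
    total = sum(abs(data[i] - data[i + stride]) for i in range(pairs))
    return total / pairs


def guess_tuple_size(data: bytes, max_stride: int = 8) -> int:
    """Pick the stride with the smoothest byte-to-byte behavior.

    min() returns the first minimum, so ties resolve to the smaller
    stride, which avoids reporting a multiple of the true tuple size
    on perfectly periodic input.
    """
    return min(range(1, max_stride + 1), key=lambda s: stride_score(data, s))
```

For interleaved three-channel data whose channels each change slowly, stride 3 compares like with like and scores far lower than strides 1 or 2, which compare values from different channels.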
-
Patent number: 9384218
Abstract: Format identification for fragmented data is disclosed. In some embodiments, an input stream of information that includes a continuity property is received. A format identifier of at least a portion of the stream is determined, wherein the format identifier includes a data representation size, a group size, and an alignment that is consistent with the continuity property. The stream of information is compressed using a compression technique selected based on the format identifier to produce a compressed stream, and the compressed stream is stored.
Type: Grant
Filed: June 30, 2013
Date of Patent: July 5, 2016
Assignee: EMC Corporation
Inventors: Moses Charikar, Deepa Ramakrishna
-
Publication number: 20140059091
Abstract: Format identification for fragmented data is disclosed. In some embodiments, an input stream of information that is divided into fragments is received. Fragments are determined and a data format for each fragment is found based on continuity properties including by: determining a physical to canonical layout transformation, determining a tuple size, and partitioning bit positions in tuples into disjoint groups wherein each group corresponds to a channel. The stream of information is compressed using a compression technique selected based on the data format, and the compressed stream is stored.
Type: Application
Filed: June 30, 2013
Publication date: February 27, 2014
Inventors: Moses Charikar, Deepa Ramakrishna
-
Publication number: 20140056532
Abstract: Lossless compression of fragmented image data is disclosed. In some embodiments, a stream of information is received, wherein the stream of information comprises a sequence of tuples and wherein each of the tuples comprises data elements corresponding to one of a plurality of input channels. A channel transformer is employed to rearrange the data elements into a plurality of output channels for an output stream wherein the output channels have higher compressibility than the input channels. The compressed output stream is stored.
Type: Application
Filed: June 30, 2013
Publication date: February 27, 2014
Inventors: Moses Charikar, Deepa Ramakrishna
-
Publication number: 20140059022
Abstract: Format identification for fragmented data is disclosed. In some embodiments, an input stream of information that is divided into fragments is received. Fragment boundaries are determined and a data format for each fragment is found based on continuity properties including by: dividing the stream of information into windows, determining whether each window has a known or unknown format; and comparing portions of windows having an unknown format with neighboring windows to determine fragment boundaries. The stream of information is compressed using a compression technique selected based on the data format, and the compressed stream is stored.
Type: Application
Filed: June 30, 2013
Publication date: February 27, 2014
Inventors: Moses Charikar, Deepa Ramakrishna
-
Publication number: 20140056533
Abstract: Lossless compression of fragmented image data is disclosed. In some embodiments, a stream of information comprising data elements having statistical characteristics is received. An encoded output is produced by an encoder comprising a data compressor that implements a variable length code that is adapted to the statistical characteristics of the data elements. The output and information from which the variable length code can be derived are stored.
Type: Application
Filed: June 30, 2013
Publication date: February 27, 2014
Inventors: Moses Charikar, Deepa Ramakrishna
-
Publication number: 20140059021
Abstract: Format identification for fragmented data is disclosed. In some embodiments, an input stream of information that includes a continuity property is received. A format identifier of at least a portion of the stream is determined, wherein the format identifier includes a data representation size, a group size, and an alignment that is consistent with the continuity property. The stream of information is compressed using a compression technique selected based on the format identifier to produce a compressed stream, and the compressed stream is stored.
Type: Application
Filed: June 30, 2013
Publication date: February 27, 2014
Inventors: Moses Charikar, Deepa Ramakrishna
-
Patent number: 7966327
Abstract: A content-addressable and -searchable storage system for managing and exploring massive amounts of feature-rich data such as images, audio or scientific data, is shown. A segmentation and feature extraction unit segments data corresponding to an object into a plurality of data segments and generates a feature vector for each data segment. A sketch construction component converts the feature vector into a compact bit-vector corresponding to the object. The system also has a similarity index having a plurality of compact bit-vectors corresponding to a plurality of objects and an index insertion component for inserting a compact bit-vector corresponding to an object into the similarity index. The system may further have an indexing unit for identifying a candidate set of objects from said similarity index based upon a compact bit-vector corresponding to a query object.
Type: Grant
Filed: September 7, 2005
Date of Patent: June 21, 2011
Assignee: The Trustees of Princeton University
Inventors: Kai Li, Qin Lv, Moses Charikar