Patents by Inventor Graham Cormode

Graham Cormode has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 8463906
    Abstract: A method and system of monitoring computer network activity including determining a first phase frequency estimate, associated with a first frequency vector, determined in response to receiving first bits from a first plurality of remote computer network devices. The first bits received from the first plurality of remote devices in response to satisfying a first activity threshold. Also, determining a second phase frequency estimate associated with a second frequency vector and determined in response to receiving second bits from a second plurality of remote devices. The second bits received from the second plurality of remote devices in response to a second activity threshold being satisfied. The second phase frequency estimate determined in response to the first phase frequency estimate exceeding a global threshold. Further, providing a frequency moment Fp in response to the second phase frequency estimate exceeding a refined threshold.
    Type: Grant
    Filed: October 5, 2012
    Date of Patent: June 11, 2013
    Assignee: AT&T Intellectual Property I, LP
    Inventors: Graham Cormode, Ke Yi
  • Patent number: 8458326
    Abstract: The present disclosure is directed to systems, methods, and computer-readable storage media for sampling from distributed data streams. Data elements are received at site servers configured to collect and report data to a coordinator device. The site servers assign a binary string to each of the data elements. Each bit of the binary strings can be independently set to a 0 or a 1 with a probability of one half. The binary string is used to sample from the received data elements, and the data elements and/or the sampled data elements can be transmitted to a coordinator device. The coordinator device can examine one or more bits of the binary string to draw samples of the received data elements in accordance with desired probabilities.
    Type: Grant
    Filed: June 30, 2010
    Date of Patent: June 4, 2013
    Assignee: AT&T Intellecutal Property I, L.P.
    Inventors: Graham Cormode, Ke Yi, Qin Zhang
  • Patent number: 8438650
    Abstract: The present disclosure is directed to systems, methods, and computer-readable storage media for anonymizing data over multiple temporal releases. Data is received, and nodes and connections in the data are identified. The data also is analyzed to identify predicted connections. The nodes, the connections, and the predicted connections are analyzed to determine how to group the nodes in the data. The data is published, and the grouping of the nodes is extended to subsequent temporal releases of the data, the nodes of which are grouped in accordance with the grouping used with the data.
    Type: Grant
    Filed: July 6, 2010
    Date of Patent: May 7, 2013
    Assignee: AT&T Intellectual Property I, L.P.
    Inventors: Graham Cormode, Smriti Bhagat, Balachander Krishnamurthy, Divesh Srivastava
  • Patent number: 8391164
    Abstract: Aggregates are calculated from a data stream in which data is sent in a sequence of tuples, in which each tuple comprises an item identifier and a timestamp indicating when the tuple was transmitted. The tuples may arrive out-of-order, that is, the sequence in which the tuples arrive are not necessarily in the sequence of their corresponding timestamps. In calculating aggregates, more recent data may be given more weight by multiplying each tuple by a decay function which is a function of the timestamp associated with the tuple and the current time. The tuples are recorded in a quantile-digest data structure. Aggregates are calculated from the data stored in the quantile-digest data structure.
    Type: Grant
    Filed: January 2, 2008
    Date of Patent: March 5, 2013
    Assignees: AT&T Intellectual Property I, L.P., Iowa State University Research Foundation, Inc.
    Inventors: Graham Cormode, Philip Korn, Srikanta Tirthapura
  • Patent number: 8386534
    Abstract: Disclosed are method and apparatus for identifying members of a social network who have a high likelihood of providing a useful response to a query. A query engine examines the personal pages of a set of members and automatically gleans semantic information relevant to the query. From the automatically-gleaned semantic information, a score indicative of the likelihood that the member may provide a useful response is calculated.
    Type: Grant
    Filed: October 30, 2008
    Date of Patent: February 26, 2013
    Assignee: AT&T Intellectual Property I, L.P.
    Inventors: Graham Cormode, Balachander Krishnamurthy
  • Patent number: 8386412
    Abstract: Example methods and apparatus to construct histogram and wavelet synopses for probabilistic data are disclosed. A disclosed example method involves receiving probabilistic data associated with probability measures and generating a plurality of histograms based on the probabilistic data. Each histogram is generated based on items represented by the probabilistic data. In addition, each histogram is generated using a different quantity of buckets containing different ones of the items. An error measure associated with each of the plurality of histograms is determined and one of the plurality of histograms is selected based on its associated error measure. The method also involves displaying parameter information associated with the one of the plurality of histograms to represent the data.
    Type: Grant
    Filed: December 12, 2008
    Date of Patent: February 26, 2013
    Assignee: AT&T Intellectual Property I, L.P.
    Inventors: Graham Cormode, Minos Garofalakis
  • Publication number: 20120296898
    Abstract: The present disclosure is directed to systems, methods, and computer-readable storage media for publishing data. A data summary summarizing the data can be generated and published according to several publishing schemes. In some embodiments, non-zero entries are selected and modified and zero entries are sampled according to one or more distribution functions. The sampled and modified values are added to a data summary, or a sample of the sampled and modified values are added to the data summary. The data summary is published, released, used, or otherwise output. In other embodiments, priority values are assigned to each value associated with the data, and a number of entries with the highest values are selected and added to the data summary.
    Type: Application
    Filed: May 19, 2011
    Publication date: November 22, 2012
    Inventors: Graham Cormode, Cecilia Procopiuc, Divesh Srivastava, Thanh Tran
  • Patent number: 8307080
    Abstract: A method and system of monitoring computer network activity including determining a first phase frequency estimate, associated with a first frequency vector, determined in response to receiving first bits from a first plurality of remote computer network devices. The first bits received from the first plurality of remote devices in response to satisfying a first activity threshold. Also, determining a second phase frequency estimate associated with a second frequency vector and determined in response to receiving second bits from a second plurality of remote devices. The second bits received from the second plurality of remote devices in response to a second activity threshold being satisfied. The second phase frequency estimate determined in response to the first phase frequency estimate exceeding a global threshold. Further, providing a frequency moment Fp in response to the second phase frequency estimate exceeding a refined threshold.
    Type: Grant
    Filed: November 23, 2011
    Date of Patent: November 6, 2012
    Assignee: AT&T Intellectual Property I, L.P.
    Inventors: Graham Cormode, Ke Yi
  • Publication number: 20120159500
    Abstract: A method for validating outsourced processing of a priority queue includes configuring a verifier for independent, single-pass processing of priority queue operations that include insertion operations and extraction operations and priorities associated with each operation. The verifier may be configured to validate N operations using a memory space having a size that is proportional to the square root of N using an algorithm to buffer the operations as a series of R epochs. Extractions associated with each individual epoch may be monitored using arrays Y and Z. Insertions for the epoch k may monitored using arrays X and Z. The processing of the priority queue operations may be verified based on the equality or inequality of the arrays X, Y, and Z. Hashed values for the arrays may be used to test their equality to conserve storage requirements.
    Type: Application
    Filed: December 17, 2010
    Publication date: June 21, 2012
    Applicant: AT&T INTELLECTUAL PROPERTY I, L.P.
    Inventors: Graham Cormode, Amit Chakrabarti, Ranganath Kondapally, Andrew Iain Shaw McGregor
  • Publication number: 20120143830
    Abstract: A method for validating outsourced processing of a data stream arriving at a streaming data warehouse of a data service provider includes a proof protocol. A verifier acting on behalf of a data owner of the data stream may interact with a prover acting on behalf of the data service provider. The verifier may calculate a first root hash value of a binary tree during single-pass processing of the original data stream with limited computational effort. A second root hash value may be calculated using the proof protocol between the verifier and the prover. The prover is requested to provide certain queried values before receiving random numbers used to generate subsequent responses dependent on the provided values. The proof protocol may be used to validate the data processing performed by the data service provider.
    Type: Application
    Filed: December 2, 2010
    Publication date: June 7, 2012
    Applicant: AT&T INTELLECTUAL PROPERTY I, L.P.
    Inventors: Graham Cormode, Ke Yi
  • Patent number: 8160837
    Abstract: Methods and apparatus to determine statistical dominance point descriptors for multidimensional data are disclosed. An example method disclosed herein comprises determining a first joint dominance value for a first data point in a multidimensional data set, data points in the multidimensional data set comprising multidimensional values, each dimension corresponding to a different measurement of a physical event, the first joint dominance value corresponding to a number of data points in the multidimensional data set dominated by the first data point in every dimension, determining a first skewness value for the first data point, the first skewness value corresponding to a size of a first dimension of the first data point relative to a combined size of all dimensions of the first data point, and combining the first joint dominance and first skewness values to determine a first statistical dominance point descriptor associated with the first data point.
    Type: Grant
    Filed: December 12, 2008
    Date of Patent: April 17, 2012
    Assignee: AT&T Intellectual Property I, L.P.
    Inventors: Graham Cormode, Philip Korn, Divesh Srivastava
  • Patent number: 8145669
    Abstract: Methods and apparatus for representing probabilistic data using a probabilistic histogram are disclosed. An example method comprises partitioning a plurality of ordered data items into a plurality of buckets, each of the data items capable of having a data value from a plurality of possible data values with a probability characterized by a respective individual probability distribution function (PDF), each bucket associated with a respective subset of the ordered data items bounded by a respective beginning data item and a respective ending data item, and determining a first representative PDF for a first bucket associated with a first subset of the ordered data items by partitioning the plurality of possible data values into a first plurality of representative data ranges and respective representative probabilities based on an error between the first representative PDF and a first plurality of individual PDFs characterizing the first subset of the ordered data items.
    Type: Grant
    Filed: December 11, 2009
    Date of Patent: March 27, 2012
    Assignee: AT&T Intellectual Property I, L.P.
    Inventors: Graham Cormode, Antonios Deligiannakis, Minos Garofalakis, Andrew Iain Shaw McGregor
  • Publication number: 20120066383
    Abstract: A method and system of monitoring computer network activity including determining a first phase frequency estimate, associated with a first frequency vector, determined in response to receiving first bits from a first plurality of remote computer network devices. The first bits received from the first plurality of remote devices in response to satisfying a first activity threshold. Also, determining a second phase frequency estimate associated with a second frequency vector and determined in response to receiving second bits from a second plurality of remote devices. The second bits received from the second plurality of remote devices in response to a second activity threshold being satisfied. The second phase frequency estimate determined in response to the first phase frequency estimate exceeding a global threshold. Further, providing a frequency moment Fp in response to the second phase frequency estimate exceeding a refined threshold.
    Type: Application
    Filed: November 23, 2011
    Publication date: March 15, 2012
    Applicant: AT&T INTELLECTUAL PROPERTY I, LP
    Inventors: Graham Cormode, Ke Yi
  • Patent number: 8112802
    Abstract: Embodiments disclosed herein are directed to verifying query results of an untrusted server. A data owner outsources a data stream to the untrusted server, which is configured to respond to a query from a client with the query result, which is returned to the client. The data owner can maintain a vector associated with query results returned by the server and can generate a verification synopsis using the vector and a seed. The verification synopsis includes a polynomial, where coefficients of the polynomial are determined based on the seed. The data owner outputs the verification synopsis and the seed to a client for verification of the query results.
    Type: Grant
    Filed: November 21, 2008
    Date of Patent: February 7, 2012
    Assignee: AT&T Intellectual Property I, LP
    Inventors: Marios Hadjieleftheriou, George Kollios, Feifei Li, Divesh Srivastava, Ke Yi, Graham Cormode
  • Patent number: 8103708
    Abstract: Described is a system and method for receiving a data stream of multi-dimensional items, collecting a sample of the data stream having a predetermined number of items and dividing the sample into a plurality of subsamples, each subsample corresponding to a single dimension of each of the predetermined number of items. A query is then executed on a particular item in at least two of the subsamples to generate data for the corresponding subsample. This data is combined into a single value.
    Type: Grant
    Filed: December 15, 2009
    Date of Patent: January 24, 2012
    Assignee: AT & T Intellectual Property II, LP
    Inventors: Graham Cormode, Philip Russell Korn, Shanmugavelayutham Muthukrishnan, Divesh Srivastava
  • Publication number: 20120011591
    Abstract: The present disclosure is directed to systems, methods, and computer-readable storage media for anonymizing data over multiple temporal releases. Data is received, and nodes and connections in the data are identified. The data also is analyzed to identify predicted connections. The nodes, the connections, and the predicted connections are analyzed to determine how to group the nodes in the data. The data is published, and the grouping of the nodes is extended to subsequent temporal releases of the data, the nodes of which are grouped in accordance with the grouping used with the data.
    Type: Application
    Filed: July 6, 2010
    Publication date: January 12, 2012
    Inventors: Graham Cormode, Smriti Bhagat, Balachander Krishnamurthy, Divesh Srivastava
  • Publication number: 20120005329
    Abstract: The present disclosure is directed to systems, methods, and computer-readable storage media for sampling from distributed data streams. Data elements are received at site servers configured to collect and report data to a coordinator device. The site servers assign a binary string to each of the data elements. Each bit of the binary strings can be independently set to a 0 or a 1 with a probability of one half. The binary string is used to sample from the received data elements, and the data elements and/or the sampled data elements can be transmitted to a coordinator device. The coordinator device can examine one or more bits of the binary string to draw samples of the received data elements in accordance with desired probabilities.
    Type: Application
    Filed: June 30, 2010
    Publication date: January 5, 2012
    Inventors: Graham Cormode, Ke Yi, Qin Zhang
  • Publication number: 20110321169
    Abstract: The present disclosure is directed to systems, methods, and computer-readable storage media for generating data and data sets that are resistant to minimality attacks. Data sets having a number of tuples are received, and the tuples are ordered according to an aspect of the tuples. The tuples can be split into groups of tuples, and each of the groups may be analyzed to determine if the group complies with a privacy requirement. Groups that satisfy the privacy requirement may be output as new data sets that are resistant to minimality attacks.
    Type: Application
    Filed: June 29, 2010
    Publication date: December 29, 2011
    Inventors: Graham Cormode, Tiancheng Li, Divesh Srivastava
  • Patent number: 8078710
    Abstract: This invention discloses continuous functional monitoring of distributed network activity using algorithms based on frequency moment calculations given by Fp=?imip. The frequency moment calculations are used to raise an alarm when a value exceeds a certain threshold. Frequency moments for p=0, 1, and 2 are described.
    Type: Grant
    Filed: December 21, 2007
    Date of Patent: December 13, 2011
    Assignee: AT&T Intellectual Property I, LP
    Inventors: Graham Cormode, Ke Yi
  • Publication number: 20110145223
    Abstract: Methods and apparatus for representing probabilistic data using a probabilistic histogram are disclosed. An example method comprises partitioning a plurality of ordered data items into a plurality of buckets, each of the data items capable of having a data value from a plurality of possible data values with a probability characterized by a respective individual probability distribution function (PDF), each bucket associated with a respective subset of the ordered data items bounded by a respective beginning data item and a respective ending data item, and determining a first representative PDF for a first bucket associated with a first subset of the ordered data items by partitioning the plurality of possible data values into a first plurality of representative data ranges and respective representative probabilities based on an error between the first representative PDF and a first plurality of individual PDFs characterizing the first subset of the ordered data items.
    Type: Application
    Filed: December 11, 2009
    Publication date: June 16, 2011
    Inventors: Graham Cormode, Antonios Deligiannakis, Minos Garofalakis, Andrew Iain Shaw McGregor