Patents by Inventor Minos Garofalakis

Minos Garofalakis has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 8775362
    Abstract: A disclosed example method involves generating a plurality of wavelet coefficient quantities. Each wavelet coefficient quantity is generated based on items represented by probabilistic data. Each wavelet coefficient quantity represents different ones of the items by multiplying corresponding wavelet vectors. The example method also involves determining an error measure associated with each of the plurality of wavelet coefficient quantities, and selecting at least one of the plurality of wavelet coefficient quantities based on its associated error measure. The method also involves displaying parameter information associated with the one of the plurality of wavelet coefficient quantities to represent the probabilistic data.
    Type: Grant
    Filed: February 25, 2013
    Date of Patent: July 8, 2014
    Assignee: AT&T Intellectual Property I, L.P.
    Inventors: Graham Cormode, Minos Garofalakis
  • Patent number: 8386412
    Abstract: Example methods and apparatus to construct histogram and wavelet synopses for probabilistic data are disclosed. A disclosed example method involves receiving probabilistic data associated with probability measures and generating a plurality of histograms based on the probabilistic data. Each histogram is generated based on items represented by the probabilistic data. In addition, each histogram is generated using a different quantity of buckets containing different ones of the items. An error measure associated with each of the plurality of histograms is determined and one of the plurality of histograms is selected based on its associated error measure. The method also involves displaying parameter information associated with the one of the plurality of histograms to represent the data.
    Type: Grant
    Filed: December 12, 2008
    Date of Patent: February 26, 2013
    Assignee: AT&T Intellectual Property I, L.P.
    Inventors: Graham Cormode, Minos Garofalakis
  • Patent number: 8145669
    Abstract: Methods and apparatus for representing probabilistic data using a probabilistic histogram are disclosed. An example method comprises partitioning a plurality of ordered data items into a plurality of buckets, each of the data items capable of having a data value from a plurality of possible data values with a probability characterized by a respective individual probability distribution function (PDF), each bucket associated with a respective subset of the ordered data items bounded by a respective beginning data item and a respective ending data item, and determining a first representative PDF for a first bucket associated with a first subset of the ordered data items by partitioning the plurality of possible data values into a first plurality of representative data ranges and respective representative probabilities based on an error between the first representative PDF and a first plurality of individual PDFs characterizing the first subset of the ordered data items.
    Type: Grant
    Filed: December 11, 2009
    Date of Patent: March 27, 2012
    Assignee: AT&T Intellectual Property I, L.P.
    Inventors: Graham Cormode, Antonios Deligiannakis, Minos Garofalakis, Andrew Iain Shaw McGregor
  • Publication number: 20110145223
    Abstract: Methods and apparatus for representing probabilistic data using a probabilistic histogram are disclosed. An example method comprises partitioning a plurality of ordered data items into a plurality of buckets, each of the data items capable of having a data value from a plurality of possible data values with a probability characterized by a respective individual probability distribution function (PDF), each bucket associated with a respective subset of the ordered data items bounded by a respective beginning data item and a respective ending data item, and determining a first representative PDF for a first bucket associated with a first subset of the ordered data items by partitioning the plurality of possible data values into a first plurality of representative data ranges and respective representative probabilities based on an error between the first representative PDF and a first plurality of individual PDFs characterizing the first subset of the ordered data items.
    Type: Application
    Filed: December 11, 2009
    Publication date: June 16, 2011
    Inventors: Graham Cormode, Antonios Deligiannakis, Minos Garofalakis, Andrew Iain Shaw McGregor
  • Patent number: 7890494
    Abstract: The subject matter disclosed herein relates to processing information regarding events. In one particular example, a stabbing query may be formulated in response to an event. One or more sets are associated with and/or mapped to nodes of a tree.
    Type: Grant
    Filed: October 31, 2007
    Date of Patent: February 15, 2011
    Assignee: Yahoo! Inc.
    Inventors: Erik N. Vee, Minos Garofalakis, Jayavel Shanmugasundaram, Ashwin Kumar V Machanavajjhala
  • Patent number: 7797260
    Abstract: Subject matter disclosed herein relates to document classification and/or automated document classifier tuning. In an example embodiment, a document received from a user computing platform in an online database stored in a memory of a server computing platform may be classified based, at least in part, on a training set. Also for an example embodiment, the training set may be modified based, at least in part, on statistics gathered from user browsing behavior.
    Type: Grant
    Filed: February 11, 2008
    Date of Patent: September 14, 2010
    Assignee: Yahoo! Inc.
    Inventors: Minos Garofalakis, Jayavel Shanmugasundaram, Erik Nathan Vee, Rajagopal Baskaran, Jagadish P. Samantarai, Chyr-Chong (Joseph) Ting, Yuan Zhuge
  • Publication number: 20100185515
    Abstract: A method and system for allocating inventory in an Internet environment is provided. A method employed by the system may include generating an inventory pool that represents a number of impressions deliverable to all users, then determining, from multiple past orders for booking impressions, a hierarchy of parameters utilized to target users and a number of impressions deliverable to users characterized by the parameters. The inventory pool may then be partitioned into multiple inventory pools according to the hierarchy, where each inventory pool represents a number of impressions deliverable to users characterized by parameters associated with the inventory pool. The hierarchy of pools may then be stored to a database.
    Type: Application
    Filed: January 16, 2009
    Publication date: July 22, 2010
    Applicant: Yahoo! Inc.
    Inventors: Ramana Yerneni, Erik Vee, Minos Garofalakis, Srinivasan Rajagopal, Jayavel Shanmugasundaram, Mark Sordo, Donald Swanson, Sergei Vassilvitskii
  • Publication number: 20100153328
    Abstract: Example methods and apparatus to construct histogram and wavelet synopses for probabilistic data are disclosed. A disclosed example method involves receiving probabilistic data associated with probability measures and generating a plurality of histograms based on the probabilistic data. Each histogram is generated based on items represented by the probabilistic data. In addition, each histogram is generated using a different quantity of buckets containing different ones of the items. An error measure associated with each of the plurality of histograms is determined and one of the plurality of histograms is selected based on its associated error measure. The method also involves displaying parameter information associated with the one of the plurality of histograms to represent the data.
    Type: Application
    Filed: December 12, 2008
    Publication date: June 17, 2010
    Inventors: Graham Cormode, Minos Garofalakis
  • Publication number: 20100082428
    Abstract: Example embodiments described herein may relate to estimating inventory for a display advertising system utilized, for example, in Web-based advertising.
    Type: Application
    Filed: September 30, 2008
    Publication date: April 1, 2010
    Applicant: Yahoo! Inc.
    Inventors: Sergei Vassilvitskii, Jayavel Shanmugasundaram, Srinivasan Rajagopal, Minos Garofalakis, Erik N. Vee, Ramana Yerneni
  • Patent number: 7669241
    Abstract: A distinct-count estimate is obtained in a guaranteed small footprint using a two level hash, distinct count sketch. A first hash fills the first-level hash buckets with an exponentially decreasing number of data-elements. These are then uniformly hashed to an array of second-level-hash tables, and have an associated total-element counter and bit-location counters. These counters are used to identify singletons and so provide a distinct-sample and a distinct-count. An estimate of the total distinct-count is obtained by dividing by the distinct-count by the probability of mapping a data-element to that bucket. An estimate of the total distinct-source frequencies of destination address can be found in a similar fashion. By further associating the distinct-count sketch with a list of singletons, a total singleton count and a heap containing the destination addresses ordered by their distinct-source frequencies, a tracking distinct-count sketch may be formed that has considerably improved query time.
    Type: Grant
    Filed: September 30, 2004
    Date of Patent: February 23, 2010
    Assignee: Alcatel-Lucent USA Inc.
    Inventors: Sumit Ganguly, Minos Garofalakis, Rajeev Rastogi, Krishan Sabnani
  • Publication number: 20100036865
    Abstract: A method of constructing a score-optimal R-tree to support top-k stabbing queries over a set of scored intervals generates a constraint graph from the set, and determines over each node in the constraint graph that has no other nodes pointing to it the node with the smallest left endpoint; for each of these nodes, the associated interval is added to the tree and the node is removed from the constraint graph.
    Type: Application
    Filed: August 7, 2008
    Publication date: February 11, 2010
    Applicant: Yahoo! Inc.
    Inventors: Jayavel SHANMUGASUNDARAM, Minos Garofalakis, Erik Vee, Ashwin Kumar Machanavajjhala
  • Patent number: 7596544
    Abstract: A method of estimating set-expression cardinalities over data streams with guaranteed small maintenance time per data-element update. The method only examines each data element once and uses a limited amount of memory. The time-efficient stream synopsis extends 2-level hash-sketches by randomly, but uniformly, pre-hashing data-elements prior to logarithmically hashing them to a first-level hash-table. This generates a set of independent 2-level hash-sketches. The set-union cardinality can be estimated by determining the smallest hash-bucket index j at which only a predetermined fraction of the b hash-buckets has a non-empty union |A?B|. Once a set-union cardinality is estimated, general set-expression cardinalities may be estimated by counting witness elements for the set-expression, i.e., those first-level hash-buckets that are both a singleton for the set-expression and a set-union singleton.
    Type: Grant
    Filed: December 29, 2004
    Date of Patent: September 29, 2009
    Assignee: Alcatel-Lucent USA Inc.
    Inventors: Sumit Ganguly, Minos Garofalakis, Rajeev Rastogi
  • Publication number: 20090204703
    Abstract: Subject matter disclosed herein relates to document classification and/or automated document classifier tuning.
    Type: Application
    Filed: February 11, 2008
    Publication date: August 13, 2009
    Inventors: Minos Garofalakis, Jayavel Shanmugasundaram, Erik Nathan Vee, Rajagopal Baskaran, Jagadish P. Samantarai, Chyr-Chong Joseph Ting, Yuan Zhuge
  • Publication number: 20090125502
    Abstract: A method of generating a diversified vertical search results listing, including listing attribute values related to search criteria and their frequency of occurrence to create a plurality of listings; creating a plurality of interval bands based on the plurality of listings; generating a random diversity score for each listing over a substantially uniform distribution within each of the plurality of bands; and sorting a set of search results for diversified listing in response to a user searching for the search criteria according to the diversity score of each listing.
    Type: Application
    Filed: November 13, 2007
    Publication date: May 14, 2009
    Applicant: Yahoo! Inc.
    Inventors: Chyr-Chong Ting, Minos Garofalakis, Erik Nathan Vee, Jayavel Shanmugasudaram, Rajagopal Baskaran, Yuan Zhuge, Jagadish Prasad Samantarai
  • Publication number: 20090112846
    Abstract: The subject matter disclosed herein relates to processing information regarding events. In one particular example, a stabbing query may be formulated in response to an event. One or more sets are associated with and/or mapped to nodes of a tree.
    Type: Application
    Filed: October 31, 2007
    Publication date: April 30, 2009
    Inventors: Erik N. Vee, Minos Garofalakis, Jayavel Shanmugasundaram, Ashwin Kumar V. Machanavajjhala
  • Patent number: 7483907
    Abstract: A method of estimating an aggregate of a join over data-streams in real-time using skimmed sketches, that only examines each data element once and has a worst case space requirement of O(n2/J), where J is the size of the join and n is the number of data elements. The skimmed sketch is an atomic sketch, formed as the inner product of the data-stream frequency vector and a random binary variable, from which the frequency values that exceed a predetermined threshold have been skimmed off and placed in a dense frequency vector. The join size is estimated as the sum of the sub-joins of skimmed sketches and dense frequency vectors. The atomic sketches may be arranged in a hash structure so that processing a data element only requires updating a single sketch per hash table. This keeps the per-element overhead logarithmic in the domain and stream sizes.
    Type: Grant
    Filed: December 29, 2004
    Date of Patent: January 27, 2009
    Assignee: Alcatel-Lucent USA Inc.
    Inventors: Sumit Ganguly, Minos Garofalakis, Rajeev Rastogi
  • Patent number: 7328220
    Abstract: A method of efficiently providing estimated answers to workloads of aggregate, multi-join SQL-like queries over a number of input data-streams. The method only examines each data elements once and uses a limited amount of computer memory. The method uses join graphs and atomic sketches that are essentially pseudo-random summaries formed using random binary variables. The estimated answer is the product of all the atomic sketches for all the vertices in the query join graph. A query workload is processed efficiently by identifying and sharing atomic sketches common to distinct queries, while ensuring that the join graphs remain well formed. The method may automatically minimize either the average query error or the maximum query error over the workload.
    Type: Grant
    Filed: December 29, 2004
    Date of Patent: February 5, 2008
    Assignee: Lucent Technologies Inc.
    Inventors: Alin Dobra, Johannes Gehrke, Rajeev Rastogi, Minos Garofalakis
  • Publication number: 20070240061
    Abstract: A method of distributed approximate query tracking relies on tracking general-purpose randomized sketch summaries of local streams at remote sites along with concise prediction models of local site behavior in order to produce highly communication-efficient and space/time-efficient solutions. A powerful approximate query tracking framework readily incorporates several complex analysis queries, including distributed join and multi-join aggregates and approximate wavelet representations, thus giving the first known low-overhead tracking solution for such queries in the distributed-streams model.
    Type: Application
    Filed: March 29, 2006
    Publication date: October 11, 2007
    Inventors: Graham Cormode, Minos Garofalakis
  • Publication number: 20070237410
    Abstract: The first fast solution to the problem of tracking wavelet representations of one-dimensional and multi-dimensional data streams based on a stream synopsis, the Group-Count Sketch (GCS) is provided. By imposing a hierarchical structure of groups over the data and applying the GCS, our algorithms can quickly recover the most important wavelet coefficients with guaranteed accuracy. A tradeoff between query time and update time is established, by varying the hierarchical structure of groups, allowing the right balance to be found for specific data streams. Experimental analysis confirmed this tradeoff, and showed that all the methods significantly outperformed previously known methods in terms of both update time and query time, while maintaining a high level of accuracy.
    Type: Application
    Filed: March 24, 2006
    Publication date: October 11, 2007
    Inventors: Graham Cormode, Minos Garofalakis
  • Publication number: 20070136285
    Abstract: The invention comprises a method and apparatus for determining a rank of a query value. Specifically, the method comprises receiving a rank query request, determining, for each of the at least one remote monitor, a predicted lower-bound rank value and upper-bound rank value, wherein the predicted lower-bound rank value and upper-bound rank value are determined according to at least one respective prediction model used by each of the at least one remote monitor to compute the at least one local quantile summary, computing a predicted average rank value for each of the at least one remote monitor using the at least one predicted lower-bound rank value and the at least one predicted upper-bound rank value associated with the respective at least one remote monitor, and computing the rank of the query value using the at least one predicted average rank value associated with the respective at least one remote monitor.
    Type: Application
    Filed: December 13, 2005
    Publication date: June 14, 2007
    Inventors: Graham Cormode, Minos Garofalakis, Shanmugavelayutham Muthukrishnan, Rajeev Rastogi