Patents by Inventor Marcus Fontoura

Marcus Fontoura has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 9483568
    Abstract: A hybrid-sharded index includes document-sharded posting lists and term-sharded posting lists. Implementations include systems and methods using a distributed hybrid-sharded index. For example, a method may include receiving, at a root node, a query having a first term and a second term and determining that the first term is term-sharded. The method may also include retrieving a term-sharded posting list for the first term from a first leaf node that stores the term-sharded posting list and determining, at the root node, a second leaf node that stores a document-sharded posting list for the second term. The method may include sending the second term and a sub-set of documents from the term-sharded posting list to the second leaf node, the sub-set being documents assigned to the second leaf node; and generating a search result using a response received from the second leaf node.
    Type: Grant
    Filed: December 16, 2013
    Date of Patent: November 1, 2016
    Assignee: Google Inc.
    Inventors: Marcus Fontoura, Daniel N. Meredith, Douglas Lee Taylor Rohde, Mahesh S. Palekar, Asim Shankar, Denis Murray Baylor, Zigmars Rasscevskis, Andras Csomai
  • Patent number: 9405846
    Abstract: Various methods and apparatuses are provided which may be implemented using one or more computing devices within a networked computing environment to employ publish-subscribe techniques to associate subscriber encoded data files with a set of publisher encoded data files.
    Type: Grant
    Filed: November 15, 2011
    Date of Patent: August 2, 2016
    Assignee: Yahoo! Inc.
    Inventors: Alexander Shraer, Maxim Gurevich, Vanja Josifovski, Marcus Fontoura
  • Publication number: 20160014236
    Abstract: Systems and methods include a distributed system that uses a declarative language to separate application logic from the network technology used to communicate. For example, in a system with a plurality of processing nodes, each processing node may include at least one processor, a networking layer for communicating with other processing nodes, and a memory storing instructions that, when executed by the at least one processor, cause the processing node to perform operations. The operations may include receiving a processing plan using the networking layer and analyzing the processing plan to determine a service to perform and a number of inputs anticipated for the service. The operations may also include performing the determined service. The processing plan defines a processing tree comprising a subset of the plurality of processing nodes, the processing tree having unidirectional communication from the root to a final level of the tree.
    Type: Application
    Filed: September 24, 2015
    Publication date: January 14, 2016
    Inventors: Michael Roger Marty, Nicholas Andrew Kidd, Marcus Fontoura, Charles Acknin
  • Patent number: 9185156
    Abstract: Systems and methods include a distributed system that uses a declarative language to separate application logic from the network technology used to communicate. For example, in a system with a plurality of processing nodes, each processing node may include at least one processor, a networking layer for communicating with other processing nodes, and a memory storing instructions that, when executed by the at least one processor, cause the processing node to perform operations. The operations may include receiving a processing plan using the networking layer and analyzing the processing plan to determine a service to perform and a number of inputs anticipated for the service. The operations may also include performing the determined service. The processing plan defines a processing tree comprising a subset of the plurality of processing nodes, the processing tree having unidirectional communication from the root to a final level of the tree.
    Type: Grant
    Filed: February 14, 2013
    Date of Patent: November 10, 2015
    Assignee: Google Inc.
    Inventors: Michael Roger Marty, Nicholas Andrew Kidd, Marcus Fontoura, Charles Acknin
  • Publication number: 20150317357
    Abstract: Systems and techniques are disclosed for generating entries for a searchable index based on rules generated by one or more machine-learned models. The index entries can include one or more tokens correlated with an outcome and an outcome probability. A subset of tokens can be identified based on the characteristics of an event. The index may be searched for outcomes and their respective probabilities that correspond to tokens that are similar to or match the subset of tokens based on the event.
    Type: Application
    Filed: May 2, 2014
    Publication date: November 5, 2015
    Applicant: Google Inc.
    Inventors: Jeremiah Harmsen, Tushar Deepak Chandra, Marcus Fontoura
  • Patent number: 9129300
    Abstract: A system and a method are provided for using external sources (e.g., landing pages) for sponsored search ad selection. In one example, the system identifies one or more regions of an external source. The one or more regions are relevant to a query. The external source includes a source that includes relevant data that is usable for augmenting an ad selection process. The system extracts one or more features from the one or more regions. The system determines which of the one or more features are relevant for item indexing. The system then augments an item selection process by using the one or more features that are relevant for item indexing.
    Type: Grant
    Filed: April 21, 2010
    Date of Patent: September 8, 2015
    Assignee: Yahoo! Inc.
    Inventors: Marcus Fontoura, Vanja Josifovski, Evgeniy Gabrilovich, Bo Pang, Yejin Choi, Mauricio Riguette Mediano
  • Patent number: 8818971
    Abstract: Systems and methods for deleting non-key values from an index distributed over a plurality of computing devices maintain a non-key master list that includes the non-key values that are stored on each of the plurality of computing devices and receive a list of non-key values to delete. The systems and methods further intersect the list of non-key values to delete with the non-key master list, creating a first delete list for a first one of the plurality of computing devices that includes non-key values to be deleted that are stored on the first computing device. The systems and methods further transmit the first delete list to the first computing device and update the non-key master list based on the list of non-key values to delete.
    Type: Grant
    Filed: December 21, 2012
    Date of Patent: August 26, 2014
    Assignee: Google Inc.
    Inventors: Marcus Fontoura, Jan Hendrik Pieper, Krishna Tatavarthi, Bjoern Carlin, Hsiang-ling Lin
  • Patent number: 8447764
    Abstract: A method is carried out by storing information describing configurations of discussion threads formed of respective series of EMTs that are exchanged among at least two individuals. The discussion threads have a root EMT, zero or more reply EMTs, and a last offspring EMT. The method is further carried out by compacting the EMT discussion threads, and indexing the compacted EMT discussion threads.
    Type: Grant
    Filed: June 12, 2012
    Date of Patent: May 21, 2013
    Assignee: International Business Machines Corporation
    Inventors: Andrei Z. Broder, Nadav Eiron, Marcus Fontoura, Michael Herscovici, Ronny Lempel, John McPherson, Jr., Eugene Shekita
  • Publication number: 20130124509
    Abstract: Various methods and apparatuses are provided which may be implemented using one or more computing devices within a networked computing environment to employ publish-subscribe techniques to associate subscriber encoded data files with a set of publisher encoded data files.
    Type: Application
    Filed: November 15, 2011
    Publication date: May 16, 2013
    Applicant: YAHOO! Inc., a Delaware Corporation
    Inventors: Alexander Shraer, Maxim Gurevich, Vanja Josifovski, Marcus Fontoura
  • Patent number: 8380682
    Abstract: A method includes describing the thread configurations of a volume of well-ordered electronic message transmissions (EMT) and utilizing the thread configuration data to conduct selective searches of the EMT volume. An apparatus includes a thread processor and a query manager. The thread processor analyzes the EMT threads and records the thread configuration data. The query manager utilizes the thread configuration data to conduct selective searches of the EMT volume.
    Type: Grant
    Filed: June 14, 2009
    Date of Patent: February 19, 2013
    Assignee: International Business Machines Corporation
    Inventors: Andrei Z. Broder, Nadav Eiron, Marcus Fontoura, Michael Herscovici, Ronny Lempel, John McPherson, Jr., Eugene Shekita
  • Patent number: 8380570
    Abstract: Methods and systems are provided for click through rate prediction and advertisement selection in online advertising. Methods are provided in which output information from a feature-based machine learning model is utilized. The output information includes predicted click through rate information. The output information is used to form a matrix. The matrix is modeled using a latent variable model. Machine learning techniques can be used in determining values for unfilled cells of one or more model matrices. The latent variable model can be used in determining predicted click through rate information, and in advertisement selection in connection with serving opportunities.
    Type: Grant
    Filed: October 27, 2009
    Date of Patent: February 19, 2013
    Assignee: Yahoo! Inc.
    Inventors: Deepak K. Agarwal, Joaquin Arturo Delgado Rodriguez, Marcus Fontoura
  • Publication number: 20120259834
    Abstract: A method is carried out by storing information describing configurations of discussion threads formed of respective series of EMTs that are exchanged among at least two individuals. The discussion threads have a root EMT, zero or more reply EMTs, and a last offspring EMT. The method is further carried out by compacting the EMT discussion threads, and indexing the compacted EMT discussion threads.
    Type: Application
    Filed: June 12, 2012
    Publication date: October 11, 2012
    Applicant: International Business Machines Corporation
    Inventors: Andrei Z. Broder, Nadav Eiron, Marcus Fontoura, Michael Herscovici, Ronny Lempel, John McPherson, Jr., Eugene Shekita
  • Patent number: 8229933
    Abstract: An item of inventory is described as a Boolean expression, which is converted into a multi-level, alternating AND/OR impression tree representation with leaf nodes representing conjuncts. Processing the conjuncts of the tree through a contract index results in retrieving a set of candidate contracts that match at least some but not necessarily all impression tree leaf node predicates. Next, an AND/OR contract tree representation is constructed with each contract tree leaf node having a label representing a projection onto a discrete set of ordered symbols. Contracts with projections that cover the entire range of the discrete set of ordered symbols are deemed to satisfy the item of inventory. Implementation of the contract index includes retrieval techniques to support multi-valued predicates as well as confidence threshold functions using a multi-level tree representation of multi-valued predicates.
    Type: Grant
    Filed: February 26, 2010
    Date of Patent: July 24, 2012
    Assignee: Yahoo! Inc.
    Inventors: Marcus Fontoura, Suhas Sadanandan, Jayavel Shanmugasundaram, Sergei Vassilvitskii, Erik Vee, Srihari Venkatesan, Jason Zien
  • Publication number: 20110264640
    Abstract: A system and a method are provided for using external sources (e.g., landing pages) for sponsored search ad selection. In one example, the system identifies one or more regions of an external source. The one or more regions are relevant to a query. The external source includes a source that includes relevant data that is usable for augmenting an ad selection process. The system extracts one or more features from the one or more regions. The system determines which of the one or more features are relevant for item indexing. The system then augments an item selection process by using the one or more features that are relevant for item indexing.
    Type: Application
    Filed: April 21, 2010
    Publication date: October 27, 2011
    Inventors: Marcus Fontoura, Vanja Josifovski, Evgeniy Gabrilovich, Bo Pang, Yejin Choi, Mauricio Riguette Mediano
  • Patent number: 8032532
    Abstract: A method and system for querying multifaceted information. An inverted index is constructed to include unique indexed tokens associated with posting lists of one or more documents. An indexed token is either a facet token included in a document as an annotation or a path prefix of the facet token. The annotation indicates a path within a tree structure representing a facet that includes the document. The tree structure includes nodes representing categories of documents. A query is received that includes constraints on documents. The constraints are associated with indexed tokens and corresponding posting lists. An execution of the query includes identifying the corresponding posting lists by utilizing the constraints and the inverted index and intersecting the posting lists to obtain a query result.
    Type: Grant
    Filed: May 21, 2008
    Date of Patent: October 4, 2011
    Assignee: International Business Machines Corporation
    Inventors: Andrei Z. Broder, Nadav Eiron, Felipe Marcus Fontoura, Ronny Lempel, Ning Li, John Ai McPherson, Jr., Andreas Neumann, Shila Ofek-Koifman, Runping Qi, Eugene J. Shekita
  • Publication number: 20110225038
    Abstract: An improved system and method for efficiently evaluating complex Boolean expressions is provided. Leaf nodes of Boolean expression trees for objects represented by Boolean expressions of attribute-value pairs may be assigned a positional identifier that indicates the position of a node in the Boolean expression tree. The positional identifiers of each object may be indexed by attribute-value pairs of the leaf nodes of the Boolean expression trees in an inverted index. Given an input set of attribute-value pairs, a list of positional identifiers for leaf nodes of virtual Boolean expression trees may be found in the index matching the attribute-value pairs of the input set. The list of positional identifiers of leaf nodes may be sorted in order by positional identifier for each contract. An expression evaluator may then verify whether a virtual Boolean expression tree for each contract is satisfied by the list of positional identifiers.
    Type: Application
    Filed: March 15, 2010
    Publication date: September 15, 2011
    Applicant: Yahoo! Inc.
    Inventors: Marcus Fontoura, Jayavel Shanmugasundaram, Sergei Vassilvitskii, Erik Vee, Jason Zien
  • Publication number: 20110213659
    Abstract: A method for automatic matching of contracts to inventory using a fixed-length complex predicate representation. An item of inventory is described as a Boolean expression, which is converted into a multi-level, alternating AND/OR impression tree representation with leaf nodes representing conjuncts. Processing the conjuncts of the tree through a contract index results in retrieving a set of candidate contracts that match at least some but not necessarily all impression tree leaf node predicates. Next, an AND/OR contract tree representation is constructed with each contract tree leaf node having a label representing a projection onto a discrete set of ordered symbols. Contracts with projections that cover the entire range of the discrete set of ordered symbols are deemed to satisfy the item of inventory.
    Type: Application
    Filed: February 26, 2010
    Publication date: September 1, 2011
    Inventors: Marcus Fontoura, Suhas Sadanandan, Jayavel Shanmugasundaram, Sergei Vassilvitskii, Erik Vee, Srihari Venkatesan, Jason Zien
  • Publication number: 20110213660
    Abstract: A method for automatic matching of contracts to inventory using a fixed-length complex predicate representation. An item of inventory is described as a Boolean expression, which is converted into a multi-level, alternating AND/OR impression tree representation with leaf nodes representing conjuncts. Processing the conjuncts of the tree through a contract index results in retrieving a set of candidate contracts that match at least some but not necessarily all impression tree leaf node predicates. Next, an AND/OR contract tree representation is constructed with each contract tree leaf node having a label representing a projection onto a discrete set of ordered symbols. Contracts with projections that cover the entire range of the discrete set of ordered symbols are deemed to satisfy the item of inventory.
    Type: Application
    Filed: February 26, 2010
    Publication date: September 1, 2011
    Inventors: Marcus Fontoura, Suhas Sadanandan, Jayavel Shanmugasundaram, Sergei Vassilvitskii, Erik Vee, Srihari Venkatesan, Jason Zien
  • Publication number: 20110213767
    Abstract: A method for automatic matching of contracts to inventory using a fixed-length complex predicate representation. An item of inventory is described as a Boolean expression, which is converted into a multi-level, alternating AND/OR impression tree representation with leaf nodes representing conjuncts. Processing the conjuncts of the tree through a contract index results in retrieving a set of candidate contracts that match at least some but not necessarily all impression tree leaf node predicates. Next, an AND/OR contract tree representation is constructed with each contract tree leaf node having a label representing a projection onto a discrete set of ordered symbols. Contracts with projections that cover the entire range of the discrete set of ordered symbols are deemed to satisfy the item of inventory.
    Type: Application
    Filed: February 26, 2010
    Publication date: September 1, 2011
    Inventors: Marcus Fontoura, Suhas Sadanandan, Jayavel Shanmugasundaram, Sergei Vassilvitskii, Erik Vee, Srihari Venkatesan, Jason Zien
  • Publication number: 20110208559
    Abstract: Automatic management of networked publisher-subscriber relationships in an advertising server network. The method comprises steps for constructing a directed graph representation comprising at least one publisher node (e.g. an Internet property), at least one subscriber node (e.g. an Internet advertiser), at least one intermediary node (e.g. an Internet advertising agent), and at least one edge (e.g. an advertising target predicate) wherein any one of the edges is directly associated with at least one target predicate. The directed graph representation is used in conjunction with an inverted index for retrieving a valid node list comprising only nodes having at least one target predicate that matches at least one event predicate. The event predicate (as well as any target predicate) is any arbitrarily complex Boolean expression, and is used in producing a result node list comprising only nodes that concurrently match the event predicate with an advertising target predicate and are reachable.
    Type: Application
    Filed: February 24, 2010
    Publication date: August 25, 2011
    Inventors: Marcus Fontoura, Sergei Vassilvitskii, Jayavel Shanmugasundaram, Andrei Broder, Shirshanka Das, Bhaskar Ghosh, Vanja Josifovski
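
Illustrative sketch: the abstract for patent 8032532 above describes an inverted index whose tokens are facet tokens and their path prefixes, with queries answered by intersecting posting lists. The Python below is one way such an index might look; it is not taken from the patent. The "/" path separator, the sample documents, and the names path_prefixes, build_index, and run_query are all assumptions made for illustration.

```python
from collections import defaultdict


def path_prefixes(facet_path):
    """Yield every prefix of a facet path: 'a/b/c' -> 'a', 'a/b', 'a/b/c'."""
    parts = facet_path.split("/")  # "/" as separator is an assumption
    for i in range(1, len(parts) + 1):
        yield "/".join(parts[:i])


def build_index(annotations):
    """Map each facet token and each of its path prefixes to a sorted posting list."""
    postings = defaultdict(set)
    for doc_id, facet_paths in annotations.items():
        for path in facet_paths:
            for token in path_prefixes(path):
                postings[token].add(doc_id)
    return {token: sorted(ids) for token, ids in postings.items()}


def run_query(index, constraints):
    """Intersect the posting lists of all constraint tokens."""
    if not constraints:
        return []
    result = None
    for token in constraints:
        ids = set(index.get(token, []))
        result = ids if result is None else result & ids
    return sorted(result)


# Hypothetical documents annotated with facet paths (made-up sample data).
ANNOTATIONS = {
    1: ["geo/us/ca", "topic/search"],
    2: ["geo/us/ny", "topic/search"],
    3: ["geo/br", "topic/ads"],
}
INDEX = build_index(ANNOTATIONS)

print(run_query(INDEX, ["geo/us", "topic/search"]))
```

Because every path prefix is indexed as its own token, a constraint on a category such as "geo/us" matches documents annotated anywhere beneath it without walking the facet tree at query time; the cost is a larger index.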