Patents by Inventor Ayman O. Farahat

Ayman O. Farahat has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 8650187
    Abstract: Techniques for training and using linked event detection systems and transforming source-identified stopwords are provided. A training corpus of source-identified stories and a reference language are determined. Optionally, stopwords for source-identified stories are transformed based on statistical analysis of parallel verified and un-verified transformations. Reference language and non-reference language terms are selectively included in source-pair term frequency-inverse story frequency models. Optionally, incremental source-identified term frequency-inverse story frequency models are determined. Selected terms are weighted and similarity metrics determined. Associated source-pair statistics, computed in part from a training corpus, are combined with the values of each similarity metric in the set of similarity metrics to form a similarity vector. Similarity vectors and verified link label information are used to determine a predictive model.
    Type: Grant
    Filed: July 25, 2003
    Date of Patent: February 11, 2014
    Assignee: Palo Alto Research Center Incorporated
    Inventors: Francine R. Chen, Ayman O. Farahat, Thorsten H. Brants
  • Patent number: 7945854
    Abstract: Techniques are presented for determining a corpus of content portions, each content portion associated with at least one element. A first set of feature values is determined for each content portion. Clusters of content portions are then determined based on the first set of feature values. The feature values are optionally associated with topics. Structural links between the elements are determined based on a second set of feature values. A layout of the elements is then determined based on the clusters and the structural links. Optionally, the N most dominant topics are determined and also used to inform the layout of the elements in a display.
    Type: Grant
    Filed: October 30, 2006
    Date of Patent: May 17, 2011
    Assignee: Palo Alto Research Center Incorporated
    Inventors: Yevgeniy Medynskiy, Ayman O. Farahat, Nicolas B. Ducheneaut
  • Patent number: 7580926
    Abstract: Embodiments of a data representation system for describing specific data sets, such as documents, web pages, or search engine queries, based on data tokens, such as words or n-grams, contained in a collection of documents are described. Such a system can be used in any type of information retrieval application, such as a document, web page, or online advertisement serving process, based on an information request, such as a query executed through an Internet search engine. For example, when a search is performed at a search engine, a content provider uses the system to represent the search query and compares the query representation against representations of a set of content in order to identify, retrieve and aggregate the content from the set most relevant to the search query, in the form of a web page or other data unit for display or access through the web browser.
    Type: Grant
    Filed: December 1, 2006
    Date of Patent: August 25, 2009
    Assignee: Adchemy, Inc.
    Inventors: Shyam Kapur, Ayman O. Farahat, Richard E. Chatwin
  • Patent number: 7577654
    Abstract: Techniques for new event detection are provided. For a new story and a corpus of stories, story-pairs based on the new story and each corpus story are determined. Adjustments to the importance of terms are determined based on story characteristics associated with each story. Story characteristics are based on direct or indirect characteristics. Direct story characteristics include authorship, language associated with a story and the like. Indirect story characteristics may include derived characteristics such as an ROI category characteristic, a same ROI characteristic, a same event-same source characteristic, an average story similarity characteristic or any other known or later developed characteristic associated with a story. Adjustments to the inter-story similarity metrics are then determined based on story characteristics and/or a weighting function.
    Type: Grant
    Filed: July 25, 2003
    Date of Patent: August 18, 2009
    Assignee: Palo Alto Research Center Incorporated
    Inventors: Thorsten H. Brants, Francine R. Chen, Ayman O. Farahat
  • Publication number: 20080104002
    Abstract: Techniques are presented for determining a corpus of content portions, each content portion associated with at least one element. A first set of feature values is determined for each content portion. Clusters of content portions are then determined based on the first set of feature values. The feature values are optionally associated with topics. Structural links between the elements are determined based on a second set of feature values. A layout of the elements is then determined based on the clusters and the structural links. Optionally, the N most dominant topics are determined and also used to inform the layout of the elements in a display.
    Type: Application
    Filed: October 30, 2006
    Publication date: May 1, 2008
    Inventors: Yevgeniy Medynskiy, Ayman O. Farahat, Nicolas B. Ducheneaut
  • Patent number: 7188117
    Abstract: Systems and methods for determining the authoritativeness of a document based on textual, non-topical cues. The authoritativeness of a document is determined by evaluating a set of document content features contained within each document to determine a set of document content feature values, processing the set of document content feature values through a trained document textual authority model, and determining a textual authoritativeness value and/or textual authority class for each document evaluated using the predictive models included in the trained document textual authority model. Estimates of a document's textual authoritativeness value and/or textual authority class can be used to re-rank documents previously retrieved by a search, to expand and improve document query searches, to provide a more complete and robust determination of a document's authoritativeness, and to improve the aggregation of rank-ordered lists with numerically-ordered lists.
    Type: Grant
    Filed: September 3, 2002
    Date of Patent: March 6, 2007
    Assignee: Xerox Corporation
    Inventors: Ayman O. Farahat, Francine R. Chen, Charles R. Mathis, Geoffrey D. Nunberg
  • Patent number: 7167871
    Abstract: Systems and methods for determining the authoritativeness of a document based on textual, non-topical cues. The authoritativeness of a document is determined by evaluating a set of document content features contained within each document to determine a set of document content feature values, processing the set of document content feature values through a trained document textual authority model, and determining a textual authoritativeness value and/or textual authority class for each document evaluated using the predictive models included in the trained document textual authority model. Estimates of a document's textual authoritativeness value and/or textual authority class can be used to re-rank documents previously retrieved by a search, to expand and improve document query searches, to provide a more complete and robust determination of a document's authoritativeness, and to improve the aggregation of rank-ordered lists with numerically-ordered lists.
    Type: Grant
    Filed: September 3, 2002
    Date of Patent: January 23, 2007
    Assignee: Xerox Corporation
    Inventors: Ayman O. Farahat, Francine R. Chen, Charles R. Mathis, Geoffrey D. Nunberg
  • Publication number: 20030225750
    Abstract: Systems and methods for determining the authoritativeness of a document based on textual, non-topical cues. The authoritativeness of a document is determined by evaluating a set of document content features contained within each document to determine a set of document content feature values, processing the set of document content feature values through a trained document textual authority model, and determining a textual authoritativeness value and/or textual authority class for each document evaluated using the predictive models included in the trained document textual authority model. Estimates of a document's textual authoritativeness value and/or textual authority class can be used to re-rank documents previously retrieved by a search, to expand and improve document query searches, to provide a more complete and robust determination of a document's authoritativeness, and to improve the aggregation of rank-ordered lists with numerically-ordered lists.
    Type: Application
    Filed: September 3, 2002
    Publication date: December 4, 2003
    Applicant: XEROX CORPORATION
    Inventors: Ayman O. Farahat, Francine R. Chen, Charles R. Mathis, Geoffrey D. Nunberg
  • Publication number: 20030226100
    Abstract: Systems and methods for determining the authoritativeness of a document based on textual, non-topical cues. The authoritativeness of a document is determined by evaluating a set of document content features contained within each document to determine a set of document content feature values, processing the set of document content feature values through a trained document textual authority model, and determining a textual authoritativeness value and/or textual authority class for each document evaluated using the predictive models included in the trained document textual authority model. Estimates of a document's textual authoritativeness value and/or textual authority class can be used to re-rank documents previously retrieved by a search, to expand and improve document query searches, to provide a more complete and robust determination of a document's authoritativeness, and to improve the aggregation of rank-ordered lists with numerically-ordered lists.
    Type: Application
    Filed: September 3, 2002
    Publication date: December 4, 2003
    Applicant: XEROX CORPORATION
    Inventors: Ayman O. Farahat, Francine R. Chen, Charles R. Mathis, Geoffrey D. Nunberg
  • Publication number: 20030221166
    Abstract: Systems and methods for determining the authoritativeness of a document based on textual, non-topical cues. The authoritativeness of a document is determined by evaluating a set of document content features contained within each document to determine a set of document content feature values, processing the set of document content feature values through a trained document textual authority model, and determining a textual authoritativeness value and/or textual authority class for each document evaluated using the predictive models included in the trained document textual authority model. Estimates of a document's textual authoritativeness value and/or textual authority class can be used to re-rank documents previously retrieved by a search, to expand and improve document query searches, to provide a more complete and robust determination of a document's authoritativeness, and to improve the aggregation of rank-ordered lists with numerically-ordered lists.
    Type: Application
    Filed: September 3, 2002
    Publication date: November 27, 2003
    Applicant: XEROX CORPORATION
    Inventors: Ayman O. Farahat, Francine R. Chen, Charles R. Mathis, Geoffrey D. Nunberg
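
Several of the abstracts above (patent numbers 8650187 and 7577654) refer to term frequency-inverse story frequency weighting and inter-story similarity metrics. As a rough illustration only, the sketch below shows a generic version of that idea: terms are weighted by their count times the log of the inverse story frequency, and two stories are compared by cosine similarity. The function names, the whitespace tokenizer, the log weighting, and the sample stories are all assumptions made for this example; it is not the patented method and omits the source-pair statistics, stopword transformation, and trained predictive model the abstracts describe.

```python
import math
from collections import Counter


def tf_isf_vectors(stories):
    """Return one {term: weight} dict per story.

    Weight = term count in the story * log(N / number of stories containing the term).
    This is a generic TF-IDF-style scheme, assumed here for illustration.
    """
    tokenized = [story.lower().split() for story in stories]
    n = len(tokenized)
    # Story frequency: in how many stories does each term appear at least once?
    story_freq = Counter(term for tokens in tokenized for term in set(tokens))
    return [
        {term: count * math.log(n / story_freq[term])
         for term, count in Counter(tokens).items()}
        for tokens in tokenized
    ]


def cosine(u, v):
    """Cosine similarity between two sparse term-weight vectors."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical sample stories, invented for this example.
stories = [
    "earthquake strikes coastal city",
    "coastal city hit by earthquake",
    "stock markets rally on earnings",
]
vectors = tf_isf_vectors(stories)
related = cosine(vectors[0], vectors[1])    # stories sharing several terms
unrelated = cosine(vectors[0], vectors[2])  # stories sharing no terms
```

In this toy corpus the first two stories share three terms and score above zero, while the first and third share none and score exactly zero. A linked event detection system of the kind described would presumably feed several such metrics into a learned model rather than thresholding a single score.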