Patents by Inventor Ayman O. Farahat

Ayman O. Farahat has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Systems and methods for linked event detection

Patent number: 8650187

Abstract: Techniques for training and using linked event detection systems and transforming source-identified stopwords are provided. A training corpus of source-identified stories and a reference language are determined. Optionally, stopwords for source-identified stories are transformed based on statistical analysis of parallel verified and un-verified transformations. Reference language and non-reference language terms are selectively included in source-pair term frequency-inverse story frequency models. Optionally, incremental source-identified term frequency-inverse story frequency models are determined. Selected terms are weighted and similarity metrics determined. Associated source-pair statistics, computed in part from a training corpus, are combined with the values of each similarity metric in the set of similarity metrics to form a similarity vector. Similarity vectors and verified link label information are used to determine a predictive model.

Type: Grant

Filed: July 25, 2003

Date of Patent: February 11, 2014

Assignee: Palo Alto Research Center Incorporated

Inventors: Francine R. Chen, Ayman O. Farahat, Thorsten H. Brants
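
The abstract above combines per-term weights, similarity metrics, and source-pair statistics. As a rough illustration only, and not the patented method, the Python sketch below weights terms by term frequency times a smoothed inverse story frequency, scores a story pair with cosine similarity, and appends a source-pair statistic to form a similarity vector. The function names, the smoothing formula, and the `source_pair_mean` parameter are assumptions made for this example.

```python
import math
from collections import Counter


def tfidf_vectors(stories):
    """Return one {term: weight} dict per story; each story is a list of terms."""
    n = len(stories)
    # Story frequency: the number of stories in which each term occurs.
    story_freq = Counter(term for story in stories for term in set(story))
    vectors = []
    for story in stories:
        tf = Counter(story)
        # Smoothed inverse story frequency (an assumed formula); always positive.
        vectors.append({t: c * math.log(1 + n / story_freq[t]) for t, c in tf.items()})
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse weight dicts."""
    dot = sum(w * v[t] for t, w in u.items() if t in v)
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def similarity_vector(u, v, source_pair_mean):
    """Pair a similarity metric with a source-pair statistic (hypothetical layout)."""
    return [cosine(u, v), source_pair_mean]
```

A predictive model of the kind the abstract mentions would then be fit on such vectors together with verified link labels; that training step is not shown here.
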

Systems and methods for the combination and display of social and textual content

Patent number: 7945854

Abstract: Techniques are presented for determining a corpus of content portions, each content portion associated with at least one element. A first set of feature values is determined for each content portion. Clusters of content portions are then determined based on the first set of feature values. The feature values are optionally associated with topics. Structural links between the elements are determined based on a second set of feature values. A layout of the elements is then determined based on the clusters and the structural links. Optionally, the N most dominant topics are determined and also used to inform the layout of the elements in a display.

Type: Grant

Filed: October 30, 2006

Date of Patent: May 17, 2011

Assignee: Palo Alto Research Center Incorporated

Inventors: Yevgeniy Medynskiy, Ayman O. Farahat, Nicolas B. Ducheneaut

Method and apparatus for representing text using search engine, document collection, and hierarchal taxonomy

Patent number: 7580926

Abstract: Embodiments of a data representation system for describing specific data sets, such as documents, web pages, or search engine queries, based on data tokens, such as words or n-grams, contained in a collection of documents are described. Such a system can be used in any type of information retrieval application, such as a document, web page, or online advertisement serving process, based on an information request, such as a query executed through an Internet search engine. For example, when a search is performed at a search engine, a content provider uses the system to represent the search query and compares the query representation against representations of a set of content in order to identify, retrieve and aggregate the content from the set most relevant to the search query, in the form of a web page or other data unit for display or access through the web browser.

Type: Grant

Filed: December 1, 2006

Date of Patent: August 25, 2009

Assignee: Adchemy, Inc.

Inventors: Shyam Kapur, Ayman O. Farahat, Richard E. Chatwin

Systems and methods for new event detection

Patent number: 7577654

Abstract: Techniques for new event detection are provided. For a new story and a corpus of stories, story-pairs based on the new story and each corpus story are determined. Adjustments to the importance of terms are determined based on story characteristics associated with each story. Story characteristics are based on direct or indirect characteristics. Direct story characteristics include authorship, language associated with a story and the like. Indirect story characteristics may include derived characteristics such as an ROI category characteristic, a same ROI characteristic, a same event-same source characteristic, an average story similarity characteristic or any other known or later developed characteristic associated with a story. Adjustments to the inter-story similarity metrics are then determined based on story characteristics and/or a weighting function.

Type: Grant

Filed: July 25, 2003

Date of Patent: August 18, 2009

Assignee: Palo Alto Research Center Incorporated

Inventors: Thorsten H. Brants, Francine R. Chen, Ayman O. Farahat
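
The abstract above describes pairing a new story with each corpus story and adjusting inter-story similarity according to story characteristics. The Python sketch below is one simplified reading of that idea, not the patented procedure: it scores each story pair with cosine similarity over raw term counts, subtracts a fixed amount when both stories share a source, and flags the new story as a new event when no adjusted score reaches a threshold. The dictionary layout, the `threshold` value, and the `same_source_penalty` adjustment are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def cosine(terms_a, terms_b):
    """Cosine similarity between two term lists, using raw term counts."""
    ca, cb = Counter(terms_a), Counter(terms_b)
    dot = sum(ca[t] * cb[t] for t in ca if t in cb)
    norm_a = math.sqrt(sum(c * c for c in ca.values()))
    norm_b = math.sqrt(sum(c * c for c in cb.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def is_new_event(new_story, corpus, threshold=0.3, same_source_penalty=0.1):
    """Return (is_new, best_score) for a story dict with 'source' and 'terms'."""
    best = 0.0  # With an empty corpus the story is treated as new.
    for story in corpus:
        score = cosine(new_story["terms"], story["terms"])
        # Hypothetical adjustment: discount pairs that come from the same source.
        if story["source"] == new_story["source"]:
            score -= same_source_penalty
        best = max(best, score)
    return best < threshold, best
```

In practice both the threshold and any characteristic-based adjustment would be tuned on labeled data rather than fixed by hand as they are here.
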

Systems and methods for the combination and display of social and textual content

Publication number: 20080104002

Abstract: Techniques are presented for determining a corpus of content portions, each content portion associated with at least one element. A first set of feature values is determined for each content portion. Clusters of content portions are then determined based on the first set of feature values. The feature values are optionally associated with topics. Structural links between the elements are determined based on a second set of feature values. A layout of the elements is then determined based on the clusters and the structural links. Optionally, the N most dominant topics are determined and also used to inform the layout of the elements in a display.

Type: Application

Filed: October 30, 2006

Publication date: May 1, 2008

Inventors: Yevgeniy Medynskiy, Ayman O. Farahat, Nicolas B. Ducheneaut

Systems and methods for authoritativeness grading, estimation and sorting of documents in large heterogeneous document collections

Patent number: 7188117

Abstract: Systems and methods for determining the authoritativeness of a document based on textual, non-topical cues. The authoritativeness of a document is determined by evaluating a set of document content features contained within each document to determine a set of document content feature values, processing the set of document content feature values through a trained document textual authority model, and determining a textual authoritativeness value and/or textual authority class for each document evaluated using the predictive models included in the trained document textual authority model. Estimates of a document's textual authoritativeness value and/or textual authority class can be used to re-rank documents previously retrieved by a search, to expand and improve document query searches, to provide a more complete and robust determination of a document's authoritativeness, and to improve the aggregation of rank-ordered lists with numerically-ordered lists.

Type: Grant

Filed: September 3, 2002

Date of Patent: March 6, 2007

Assignee: Xerox Corporation

Inventors: Ayman O. Farahat, Francine R. Chen, Charles R. Mathis, Geoffrey D. Nunberg
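
The abstract above describes extracting document content feature values, passing them through a trained model, and using the resulting authoritativeness value to re-rank retrieved documents. The Python sketch below illustrates that pipeline in miniature. It is not the patented model: the three textual cues, the hand-set `WEIGHTS` (standing in for a trained model), and all function names are assumptions invented for this example.

```python
import re


def content_features(text):
    """Compute a few simple, non-topical textual cues (hypothetical choices)."""
    words = re.findall(r"[A-Za-z']+", text)
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    n_words = max(len(words), 1)
    return {
        "avg_sentence_len": len(words) / max(len(sentences), 1),
        "exclamation_rate": text.count("!") / n_words,
        "long_word_rate": sum(len(w) >= 8 for w in words) / n_words,
    }


# Placeholder weights; a real system would learn these from labeled documents.
WEIGHTS = {"avg_sentence_len": 0.05, "exclamation_rate": -5.0, "long_word_rate": 2.0}


def authority_score(text, weights=WEIGHTS):
    """Linear score over the feature values (stand-in for a trained model)."""
    feats = content_features(text)
    return sum(weights[name] * value for name, value in feats.items())


def rerank(documents, weights=WEIGHTS):
    """Re-rank previously retrieved documents by descending authority score."""
    return sorted(documents, key=lambda d: authority_score(d, weights), reverse=True)
```

Calling `rerank` on a list of retrieved document texts returns the same texts ordered from highest to lowest score under these placeholder weights.
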

Systems and methods for authoritativeness grading, estimation and sorting of documents in large heterogeneous document collections

Patent number: 7167871

Abstract: Systems and methods for determining the authoritativeness of a document based on textual, non-topical cues. The authoritativeness of a document is determined by evaluating a set of document content features contained within each document to determine a set of document content feature values, processing the set of document content feature values through a trained document textual authority model, and determining a textual authoritativeness value and/or textual authority class for each document evaluated using the predictive models included in the trained document textual authority model. Estimates of a document's textual authoritativeness value and/or textual authority class can be used to re-rank documents previously retrieved by a search, to expand and improve document query searches, to provide a more complete and robust determination of a document's authoritativeness, and to improve the aggregation of rank-ordered lists with numerically-ordered lists.

Type: Grant

Filed: September 3, 2002

Date of Patent: January 23, 2007

Assignee: Xerox Corporation

Inventors: Ayman O. Farahat, Francine R. Chen, Charles R. Mathis, Geoffrey D. Nunberg

Systems and methods for authoritativeness grading, estimation and sorting of documents in large heterogeneous document collections

Publication number: 20030225750

Abstract: Systems and methods for determining the authoritativeness of a document based on textual, non-topical cues. The authoritativeness of a document is determined by evaluating a set of document content features contained within each document to determine a set of document content feature values, processing the set of document content feature values through a trained document textual authority model, and determining a textual authoritativeness value and/or textual authority class for each document evaluated using the predictive models included in the trained document textual authority model. Estimates of a document's textual authoritativeness value and/or textual authority class can be used to re-rank documents previously retrieved by a search, to expand and improve document query searches, to provide a more complete and robust determination of a document's authoritativeness, and to improve the aggregation of rank-ordered lists with numerically-ordered lists.

Type: Application

Filed: September 3, 2002

Publication date: December 4, 2003

Applicant: Xerox Corporation

Inventors: Ayman O. Farahat, Francine R. Chen, Charles R. Mathis, Geoffrey D. Nunberg

Systems and methods for authoritativeness grading, estimation and sorting of documents in large heterogeneous document collections

Publication number: 20030226100

Abstract: Systems and methods for determining the authoritativeness of a document based on textual, non-topical cues. The authoritativeness of a document is determined by evaluating a set of document content features contained within each document to determine a set of document content feature values, processing the set of document content feature values through a trained document textual authority model, and determining a textual authoritativeness value and/or textual authority class for each document evaluated using the predictive models included in the trained document textual authority model. Estimates of a document's textual authoritativeness value and/or textual authority class can be used to re-rank documents previously retrieved by a search, to expand and improve document query searches, to provide a more complete and robust determination of a document's authoritativeness, and to improve the aggregation of rank-ordered lists with numerically-ordered lists.

Type: Application

Filed: September 3, 2002

Publication date: December 4, 2003

Applicant: Xerox Corporation

Inventors: Ayman O. Farahat, Francine R. Chen, Charles R. Mathis, Geoffrey D. Nunberg

Systems and methods for authoritativeness grading, estimation and sorting of documents in large heterogeneous document collections

Publication number: 20030221166

Abstract: Systems and methods for determining the authoritativeness of a document based on textual, non-topical cues. The authoritativeness of a document is determined by evaluating a set of document content features contained within each document to determine a set of document content feature values, processing the set of document content feature values through a trained document textual authority model, and determining a textual authoritativeness value and/or textual authority class for each document evaluated using the predictive models included in the trained document textual authority model. Estimates of a document's textual authoritativeness value and/or textual authority class can be used to re-rank documents previously retrieved by a search, to expand and improve document query searches, to provide a more complete and robust determination of a document's authoritativeness, and to improve the aggregation of rank-ordered lists with numerically-ordered lists.

Type: Application

Filed: September 3, 2002

Publication date: November 27, 2003

Applicant: Xerox Corporation

Inventors: Ayman O. Farahat, Francine R. Chen, Charles R. Mathis, Geoffrey D. Nunberg