Patents by Inventor André Elisseeff

André Elisseeff has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Selection of features predictive of biological conditions using protein mass spectrographic data

Patent number: 7676442

Abstract: Support vector machines are used to classify data contained within a structured dataset such as a plurality of signals generated by a spectral analyzer. The signals are pre-processed to ensure alignment of peaks across the spectra. Similarity measures are constructed to provide a basis for comparison of pairs of samples of the signal. A support vector machine is trained to discriminate between different classes of the samples. to identify the most predictive features within the spectra. In a preferred embodiment feature selection is performed to reduce the number of features that must be considered.

Type: Grant

Filed: October 30, 2007

Date of Patent: March 9, 2010

Assignee: Health Discovery Corporation

Inventors: Asa Ben-Hur, André Elisseeff, Olivier Chapelle, Jason Aaron Edward Weston
Methods for feature selection in a learning machine

Patent number: 7624074

Abstract: In a pre-processing step prior to training a learning machine, pre-processing includes reducing the quantity of features to be processed using feature selection methods selected from the group consisting of recursive feature elimination (RFE), minimizing the number of non-zero parameters of the system (l0-norm minimization), evaluation of cost function to identify a subset of features that are compatible with constraints imposed by the learning set, unbalanced correlation score and transductive feature selection. The features remaining after feature selection are then used to train a learning machine for purposes of pattern classification, regression, clustering and/or novelty detection.

Type: Grant

Filed: October 30, 2007

Date of Patent: November 24, 2009

Assignee: Health Discovery Corporation

Inventors: Jason Aaron Edward Weston, Andre′ Elisseeff, Bernard Schoelkopf, Fernando Pérez-Cruz
Kernels and kernel methods for spectral data

Patent number: 7617163

Abstract: Support vector machines are used to classify data contained within a structured dataset such as a plurality of signals generated by a spectral analyzer. The signals are pre-processed to ensure alignment of peaks across the spectra. Similarity measures are constructed to provide a basis for comparison of pairs of samples of the signal. A support vector machine is trained to discriminate between different classes of the samples. to identify the most predictive features within the spectra. In a preferred embodiment feature selection is performed to reduce the number of features that must be considered.

Type: Grant

Filed: October 9, 2002

Date of Patent: November 10, 2009

Assignee: Health Discovery Corporation

Inventors: Asa Ben-Hur, André Elisseeff, Olivier Chapelle, Jason Aaron Edward Weston
SYSTEM AND METHOD FOR ANALYZING ELECTRONIC DATA RECORDS

Publication number: 20090083231

Abstract: A system and method for analyzing electronic data records including an annotation unit being operable to receive a set of electronic data records and to compute concept vectors for the set of electronic data records, wherein the coordinates of the concept vectors represent scores of the concepts in the respective electronic data record and wherein the concepts are part of an ontology, a similarity network unit being operable to compute a similarity network by means of the concept vectors and by at least one relationship between the concepts of the ontology, the similarity network representing similarities between the electronic data records, wherein the vertices of the similarity network represent the electronic data records and the edges of the similarity network represent similarity values indicating a degree of similarity between the vertices and steps for executing the system.

Type: Application

Filed: September 18, 2008

Publication date: March 26, 2009

Inventors: Frey Aagaard Eberholst, Andre Elisseeff, Peter Lundkvist, Ulf H. Nielsen, Erich M. Ruetsche
Pre-processed feature ranking for a support vector machine

Patent number: 7475048

Abstract: A computer-implemented method is provided for ranking features within a large dataset containing a large number of features according to each feature's ability to separate data into classes. For each feature, a support vector machine separates the dataset into two classes and determines the margins between extremal points in the two classes. The margins for all of the features are compared and the features are ranked based upon the size of the margin, with the highest ranked features corresponding to the largest margins. A subset of features for classifying the dataset is selected from a group of the highest ranked features. In one embodiment, the method is used to identify the best genes for disease prediction and diagnosis using gene expression data from micro-arrays.

Type: Grant

Filed: November 7, 2002

Date of Patent: January 6, 2009

Assignee: Health Discovery Corporation

Inventors: Jason Weston, André Elisseeff, Bernhard Schölkopf, Fernando Perez-Cruz, Isabelle Guyon
KERNELS AND METHODS FOR SELECTING KERNELS FOR USE IN LEARNING MACHINES

Publication number: 20080301070

Abstract: Learning machines, such as support vector machines, are used to analyze datasets to recognize patterns within the dataset using kernels that are selected according to the nature of the data to be analyzed. Where the datasets possesses structural characteristics, locational kernels can be utilized to provide measures of similarity among data points within the dataset. The locational kernels are then combined to generate a decision function, or kernel, that can be used to analyze the dataset. Where an invariance transformation or noise is present, tangent vectors are defined to identify relationships between the invariance or noise and the data points. A covariance matrix is formed using the tangent vectors, then used in generation of the kernel.

Type: Application

Filed: October 30, 2007

Publication date: December 4, 2008

Inventors: Peter L. Bartlett, Andre Elisseeff, Bernhard Schoelkopf, Olivier Chapelle
METHODS FOR FEATURE SELECTION IN A LEARNING MACHINE

Publication number: 20080215513

Abstract: In a pre-processing step prior to training a learning machine, pre-processing includes reducing the quantity of features to be processed using feature selection methods selected from the group consisting of recursive feature elimination (RFE), minimizing the number of non-zero parameters of the system (l0-norm minimization), evaluation of cost function to identify a subset of features that are compatible with constraints imposed by the learning set, unbalanced correlation score and transductive feature selection. The features remaining after feature selection are then used to train a learning machine for purposes of pattern classification, regression, clustering and/or novelty detection.

Type: Application

Filed: October 30, 2007

Publication date: September 4, 2008

Inventors: Jason Aaron Edward Weston, Andre' Elisseeff, Bernard Schoelkopf, Fernando Perez-Cruz
MODEL SELECTION FOR CLUSTER DATA ANALYSIS

Publication number: 20080140592

Abstract: A model selection method is provided for choosing the number of clusters, or more generally the parameters of a clustering algorithm. The algorithm is based on comparing the similarity between pairs of clustering runs on sub-samples or other perturbations of the data. High pairwise similarities show that the clustering represents a stable pattern in the data. The method is applicable to any clustering algorithm, and can also detect lack of structure. We show results on artificial and real data using a hierarchical clustering algorithm.

Type: Application

Filed: October 30, 2007

Publication date: June 12, 2008

Inventors: Asa Ben-Hur, Andre Elisseeff, Isabelle Guyon
KERNELS AND KERNEL METHODS FOR SPECTRAL DATA

Publication number: 20080097940

Abstract: Support vector machines are used to classify data contained within a structured dataset such as a plurality of signals generated by a spectral analyzer. The signals are pre-processed to ensure alignment of peaks across the spectra. Similarity measures are constructed to provide a basis for comparison of pairs of samples of the signal. A support vector machine is trained to discriminate between different classes of the samples. to identify the most predictive features within the spectra. In a preferred embodiment feature selection is performed to reduce the number of features that must be considered.

Type: Application

Filed: October 30, 2007

Publication date: April 24, 2008

Inventors: Asa Ben-Hur, Andre Elisseeff, Olivier Chapelle, Jason Weston
Kernels and methods for selecting kernels for use in learning machines

Patent number: 7353215

Abstract: Learning machines, such as support vector machines, are used to analyze datasets to recognize patterns within the dataset using kernels that are selected according to the nature of the data to be analyzed. Where the datasets possesses structural characteristics, locational kernels can be utilized to provide measures of similarity among data points within the dataset. The locational kernels are then combined to generate a decision function, or kernel, that can be used to analyze the dataset. Where invariance transformations or noise is present, tangent vectors are defined to identify relationships between the invariance or noise and the data points. A covariance matrix is formed using the tangent vectors, then used in generation of the kernel for recognizing patterns in the dataset.

Type: Grant

Filed: May 7, 2002

Date of Patent: April 1, 2008

Assignee: Health Discovery Corporation

Inventors: Peter L. Bartlett, André Elisseeff, Bernhard Schoelkopf
Methods for feature selection in a learning machine

Patent number: 7318051

Abstract: In a pre-processing step prior to training a learning machine, pre-processing includes reducing the quantity of features to be processed using feature selection methods selected from the group consisting of recursive feature elimination (RFE), minimizing the number of non-zero parameters of the system (lo-norm minimization), evaluation of cost function to identify a subset of features that are compatible with constraints imposed by the learning set, unbalanced correlation score and transductive feature selection. The features remaining after feature selection are then used to train a learning machine for purposes of pattern classification, regression, clustering and/or novelty detection. (FIG.

Type: Grant

Filed: May 20, 2002

Date of Patent: January 8, 2008

Assignee: Health Discovery Corporation

Inventors: Jason Aaron Edward Weston, André Elisseeff, Bernhard Schoelkopf, Fernando Pérez-Cruz
Methods for feature selection in a learning machine

Publication number: 20050216426

Abstract: In a pre-processing step prior to training a learning machine, pre-processing includes reducing the quantity of features to be processed using feature selection methods selected from the group consisting of recursive feature elimination (RFE), minimizing the number of non-zero parameters of the system (lo-norm minimization), evaluation of cost function to identify a subset of features that are compatible with constraints imposed by the learning set, unbalanced correlation score and transductive feature selection. The features remaining after feature selection are then used to train a learning machine for purposes of pattern classification, regression, clustering and/or novelty detection.

Type: Application

Filed: May 20, 2002

Publication date: September 29, 2005

Inventors: Jason Aaron Weston, Andre Elisseeff, Bernhard Schoelkopf, Fernando Perez-Cruz
Pre-processed feature ranking for a support vector machine

Publication number: 20050131847

Abstract: Features are preprocessed (204) to minimize classification error in a Support Vector Machines (200) used to identify patterns in large databases. Pre-processing (204) is performed to constrain features used to train (210) the SVM learning machine. Live data (226) is collected and processed (232) with SVM.

Type: Application

Filed: November 7, 2002

Publication date: June 16, 2005

Inventors: Jason Weston, Andre Elisseeff, Bernhard Scholkopf, Fernando Perez-Cruz, Isabelle Guyon
Model selection for cluster data analysis

Publication number: 20050071140

Abstract: A model selection method is provided for choosing the number of clusters, or more generally the parameters of a clustering algorithm. The algorithm is based on comparing the similarity between pairs of clustering runs on sub-samples or other perturbations of the data. High pairwise similarities show that the clustering represents a stable pattern in the data. The method is applicable to any clustering algorithm, and can also detect lack of structure. We show results on artificial and real data using a hierarchical clustering algorithm.

Type: Application

Filed: May 17, 2002

Publication date: March 31, 2005

Inventors: Asa Ben-Hur, Andre Elisseeff, Isabelle Guyon
Kernels and methods for selecting kernels for use in learning machines

Publication number: 20050071300

Abstract: Kernels (206) for use in learning machines, such as support vector machines, and methods are provided for selection and construction of such kernels are controlled by the nature of the data to be analyzed (203). In particular, data which may possess characteristics such as structure, for example DNA sequences, documents; graphs, signals, such as ECG signals and microarray expression profiles; spectra; images; spatio-temporal data; and relational data, and which may possess invariances or noise components that can interfere with the ability to accurately extract the desired information. Where structured datasets are analyzed, locational kernels are defined to provide measures of similarity among data points (210). The locational kernels are then combined to generate the decision function, or kernel. Where invariance transformations or noise is present, tangent vectors are defined to identify relationships between the invariance or noise and the data points (222).

Type: Application

Filed: May 7, 2002

Publication date: March 31, 2005

Inventors: Peter Bartlett, Andre Elisseeff, Bernard Schoelkopf

prev 1 2