Abstract: Systems and methods for enhancing knowledge discovery from data using multiple learning machines in general and multiple support vector machines in particular. Training data for a learning machine is pre-processed in order to add meaning thereto. Multiple support vector machines, each comprising distinct kernels, are trained with the pre-processed training data and are tested with test data that is pre-processed in the same manner. The test outputs from multiple support vector machines are compared in order to determine which of the test outputs if any represents a optimal solution. Selection of one or more kernels may be adjusted and one or more support vector machines may be retrained and retested. Optimal solutions based on distinct input data sets may be combined to form a new input data set to be input into one or more additional support vector machine.
Type:
Grant
Filed:
August 7, 2000
Date of Patent:
April 19, 2005
Assignee:
Biowulf Technologies, LLC
Inventors:
Stephen Barnhill, Isabelle Guyon, Jason Weston
Abstract: A learning machine is used to extract useful information from vast quantities of biological data. The method includes pre-processing of training data and test data to add dimensionality or to identify missing or erroneous data points. The training data is used to train the learning machine after which the success of the training is tested using the test data. The test output is pre-processed to determine whether the knowledge discovered from the pre-processed test data set is desirable. After the training has been confirmed, live biological data can be pre-processed then input into the trained learning machine for extraction of useful information. In the preferred embodiment, the learning machine is one or more support vector machines.
Type:
Grant
Filed:
August 7, 2000
Date of Patent:
September 7, 2004
Assignee:
BIOwulf Technologies LLC
Inventors:
Stephen Barnhill, Isabelle Guyon, Jason Weston
Abstract: A system and method for enhancing knowledge discovery from data using multiple learning machines in general and multiple support vector machines in particular. Training data for a learning machine is pre-processed in order to add meaning thereto. Pre-processing data involves transforming the data points and/or expanding the data points. By adding meaning to the data, the learning machine is provided with a greater amount of information for processing. With regard to support vector machines in particular, the greater the amount of information that is processed, the better generalizations about the derived data. Multiple support vector machines, each comprising distinct kernels, are trained with the pre-processed training data and are tested with test data that is pre-processed in the same manner. The test outputs from multiple support vector machines are compared in order to determine which of the test outputs if any represents a optimal solution.