Patents by Inventor Warren Sarle

Warren Sarle has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 9501522
    Abstract: This disclosure describes a method, system and computer-program product for parallelized feature selection. The method, system and computer-program product may be used to access a first set of features, wherein the first set of features includes multiple features, wherein the features are characterized by a variance measure, and wherein accessing the first set of features includes using a computing system to access the features, determine components of a covariance matrix, the components of the covariance matrix indicating a covariance with respect to pairs of features in the first set, and select multiple features from the first set, wherein selecting is based on the determined components of the covariance matrix and an amount of the variance measure attributable to the selected multiple features, and wherein selecting the multiple features includes executing a greedy search performed using parallelized computation.
    Type: Grant
    Filed: August 19, 2013
    Date of Patent: November 22, 2016
    Assignee: SAS Institute Inc.
    Inventors: Zheng Zhao, James Cox, David Duling, Warren Sarle
  • Patent number: 9424337
    Abstract: A method of determining a number of clusters for a dataset is provided. Centroid locations for a defined number of clusters are determined using a clustering algorithm. Boundaries for each of the defined clusters are defined. A reference distribution that includes a plurality of data points is created. The plurality of data points are within the defined boundary of at least one cluster of the defined clusters. Second centroid locations for the defined number of clusters are determined using the clustering algorithm and the reference distribution. A gap statistic for the defined number of clusters based on a comparison between a first residual sum of squares and a second residual sum of squares is computed. The processing is repeated for a next number of clusters to create. An estimated best number of clusters for the received data is determined by comparing the gap statistic computed for each iteration of the number of clusters.
    Type: Grant
    Filed: March 4, 2014
    Date of Patent: August 23, 2016
    Assignee: SAS Institute Inc.
    Inventors: Patrick Hall, Ilknur Kaynar Kabul, Warren Sarle, Jorge Silva
  • Publication number: 20150019554
    Abstract: A method of determining a number of clusters for a dataset is provided. Centroid locations for a defined number of clusters are determined using a clustering algorithm. Boundaries for each of the defined clusters are defined. A reference distribution that includes a plurality of data points is created. The plurality of data points are within the defined boundary of at least one cluster of the defined clusters. Second centroid locations for the defined number of clusters are determined using the clustering algorithm and the reference distribution. A gap statistic for the defined number of clusters based on a comparison between a first residual sum of squares and a second residual sum of squares is computed. The processing is repeated for a next number of clusters to create. An estimated best number of clusters for the received data is determined by comparing the gap statistic computed for each iteration of the number of clusters.
    Type: Application
    Filed: March 4, 2014
    Publication date: January 15, 2015
    Applicant: SAS Institute Inc.
    Inventors: Patrick Hall, Ilknur Kaynar Kabul, Warren Sarle, Jorge Silva
  • Publication number: 20140059073
    Abstract: This disclosure describes a method, system and computer-program product for parallelized feature selection. The method, system and computer-program product may be used to access a first set of features, wherein the first set of features includes multiple features, wherein the features are characterized by a variance measure, and wherein accessing the first set of features includes using a computing system to access the features, determine components of a covariance matrix, the components of the covariance matrix indicating a covariance with respect to pairs of features in the first set, and select multiple features from the first set, wherein selecting is based on the determined components of the covariance matrix and an amount of the variance measure attributable to the selected multiple features, and wherein selecting the multiple features includes executing a greedy search performed using parallelized computation.
    Type: Application
    Filed: August 19, 2013
    Publication date: February 27, 2014
    Applicant: SAS Institute Inc.
    Inventors: Zheng Zhao, James Cox, David Duling, Warren Sarle