Patents by Inventor Guixian Lin

Guixian Lin has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 10956835
    Abstract: A computing device compresses a gradient boosting tree predictive model. A gradient boosting tree predictive model is trained using a plurality of observation vectors. Each observation vector includes an explanatory variable value of an explanatory variable and a response variable value for a response variable. The gradient boosting tree predictive type model is trained to predict the response variable value of each observation vector based on a respective explanatory variable value of each observation vector. The trained gradient boosting tree predictive model is compressed using a compression model with a predefined penalty constant value and with a predefined array of coefficients to reduce a number of trees of the trained gradient boosting tree predictive model. The compression model minimizes a sparsity norm loss function. The compressed, trained gradient boosting tree predictive model is output for predicting a new response variable value from a new observation vector.
    Type: Grant
    Filed: March 11, 2019
    Date of Patent: March 23, 2021
    Assignee: SAS Institute Inc.
    Inventors: Rui Shi, Guixian Lin, Xiangqian Hu, Yan Xu
  • Publication number: 20200027028
    Abstract: A computing device compresses a gradient boosting tree predictive model. A gradient boosting tree predictive model is trained using a plurality of observation vectors. Each observation vector includes an explanatory variable value of an explanatory variable and a response variable value for a response variable. The gradient boosting tree predictive type model is trained to predict the response variable value of each observation vector based on a respective explanatory variable value of each observation vector. The trained gradient boosting tree predictive model is compressed using a compression model with a predefined penalty constant value and with a predefined array of coefficients to reduce a number of trees of the trained gradient boosting tree predictive model. The compression model minimizes a sparsity norm loss function. The compressed, trained gradient boosting tree predictive model is output for predicting a new response variable value from a new observation vector.
    Type: Application
    Filed: March 11, 2019
    Publication date: January 23, 2020
    Inventors: Rui Shi, Guixian Lin, Xiangqian Hu, Yan Xu
  • Patent number: 9703852
    Abstract: In accordance with the teachings described herein, systems and methods are provided for estimating or determining quantiles for data stored in a distributed system. In one embodiment, an instruction is received to estimate or determine a specified quantile for a variate in a set of data stored at a plurality of nodes in the distributed system. A plurality of data bins for the variate are defined that are each associated with a different range of data values in the set of data. Lower and upper quantile bounds for each of the plurality of data bins are determined based on the total number of data values that fall within each of the plurality of data bins. The specified quantile is estimated or determined based on an identified one of the plurality of data bins that includes the specified quantile based on the lower and upper quantile bounds.
    Type: Grant
    Filed: July 15, 2016
    Date of Patent: July 11, 2017
    Assignee: SAS INSTITUTE INC.
    Inventors: Guy Blanc, Georges H. Guirguis, Xiangqian Hu, Guixian Lin, Scott Pope
  • Publication number: 20160350396
    Abstract: In accordance with the teachings described herein, systems and methods are provided for estimating or determining quantiles for data stored in a distributed system. In one embodiment, an instruction is received to estimate or determine a specified quantile for a variate in a set of data stored at a plurality of nodes in the distributed system. A plurality of data bins for the variate are defined that are each associated with a different range of data values in the set of data. Lower and upper quantile bounds for each of the plurality of data bins are determined based on the total number of data values that fall within each of the plurality of data bins. The specified quantile is estimated or determined based on an identified one of the plurality of data bins that includes the specified quantile based on the lower and upper quantile bounds.
    Type: Application
    Filed: July 15, 2016
    Publication date: December 1, 2016
    Applicant: SAS Institute Inc.
    Inventors: Guy Blanc, Georges H. Guirguis, Xiangqian Hu, Guixian Lin, Scott Pope