Patents Assigned to SAS Institute Inc.
  • Publication number: 20190286440
    Abstract: In some examples, computing devices can partition timestamped data into groups. The computing devices can then distribute the timestamped data based on the groups. The computing devices can also obtain copies of a script configured to process the timestamped data, such that each computing device receives a copy of the script. The computing devices can determine one or more code segments associated with the groups based on content of the script. The one or more code segments can be in one or more programming languages that are different than a programming language of the script. The computing devices can then run the copies of the script to process the timestamped data within the groups. This may involve interacting with one or more job servers configured to run the one or more code segments associated with the groups.
    Type: Application
    Filed: May 22, 2019
    Publication date: September 19, 2019
    Applicant: SAS Institute Inc.
    Inventors: Michael James Leonard, Thiago Santos Quirino, Edward Tilden Blair, Jennifer Leigh Sloan Beeman, David Bruce Elsheimer, Javier Delgado
  • Patent number: 10417528
    Abstract: An assessment dataset is selected from an input dataset using a first stratified sampling process based on a value of an event assessment variable. A remainder of the input dataset is allocated to a training/validation dataset that is partitioned into an oversampled training/validation dataset using an oversampling process based on a predefined value of the event assessment variable. A validation sample is selected from the oversampled training/validation dataset using a second stratified sampling process based on the value of the event assessment variable. A training sample is selected from the oversampled training/validation dataset using the second stratified sampling process based on the value of the event assessment variable. The validation sample and the training sample are mutually exclusive. A predictive type model is trained using the selected training sample. A plurality of predictive type models are trained, validated, and scored using the samples to select a best predictive model.
    Type: Grant
    Filed: August 9, 2018
    Date of Patent: September 17, 2019
    Assignee: SAS INSTITUTE INC.
    Inventors: Yongjin Ma, Xinmin Wu, Xiaomei Liu
  • Patent number: 10418811
    Abstract: An apparatus includes a processor to: assign each value of each set of values of an initial supply meter data and of an initial load meter data to one of multiple buckets based on weather conditions and/or time and date; for each bucket, generate upper and lower bounds of power provision and power consumption values, and use the upper and lower bounds to identify outlier values assigned to the bucket; for each set of values within the initial supply meter data and within the initial load meter data, generate a naive model from the non-outlier values, and use interpolation and the naive model to fill in gaps, thereby generating cleansed supply meter data and cleansed load meter data; and store the cleansed supply meter data and cleansed load meter data together as merged meter data for use in making predictions.
    Type: Grant
    Filed: September 12, 2018
    Date of Patent: September 17, 2019
    Assignee: SAS INSTITUTE INC.
    Inventors: Mark Joseph Konya, Bradley Lawson, Jr., Jennifer Short Whaley, Sen-Hao Lai, Tom Anderson, Emily Jean Forney, Glenn D. Good, Tae Yoon Lee
  • Patent number: 10409863
    Abstract: An apparatus includes a processor to: receive a request for a federated area (FA) package including a FA storing a job flow definition; retrieve an instance log of a past performance that includes identifiers of the job flow definition, a data object employed as an input to a past performance, and a task routine executed in the past performance; employ an identifier to identify an FA storing a data object employed as an input; employ an identifier to identify an FA storing an executed task routine; generate the FA package to include the FAs storing the job flow definition, the instance log, the data object and the task routine, and include a copy of each in its respective federated area within the FA package; include an integrity value for each FA in the FA package; and transmit the FA package to the requesting device.
    Type: Grant
    Filed: December 29, 2018
    Date of Patent: September 10, 2019
    Assignee: SAS INSTITUTE INC.
    Inventors: Henry Gabriel Victor Bequet, Eric Jian Yang, Kais Arfaoui, Ronald Earl Stogner, Partha Dutta
  • Patent number: 10402741
    Abstract: A computing device computes a weight matrix to predict a value for a characteristic in a scoring dataset. For each of a plurality of related tasks, an augmented observation matrix, a plug-in autocovariance matrix, and a plug-in covariance vector are computed. A weight matrix used to predict the characteristic for each of a plurality of variables and each of a plurality of related tasks is computed. (a) and (b) are repeated with the computed updated weight matrix as the computed weight matrix until a convergence criterion is satisfied: (a) a gradient descent matrix is computed using the computed plug-in autocovariance matrix, the computed plug-in covariance vector, the computed weight matrix, and a predefined relationship matrix, wherein the predefined relationship matrix defines a relationship between the plurality of related tasks, and (b) an updated weight matrix is computed using the computed gradient descent matrix.
    Type: Grant
    Filed: December 6, 2017
    Date of Patent: September 3, 2019
    Assignee: SAS INSTITUTE INC.
    Inventors: Xin Jiang Hunt, Saba Emrani, Jorge Manuel Gomes da Silva, Ilknur Kaynar Kabul
  • Patent number: 10402372
    Abstract: An apparatus includes a processor component to receive a node device identifier defining an ordering among multiple node devices and among multiple blocks of data distributed among the multiple node devices, and transmit a size of a first subset of the multiple blocks stored within the node device to a control device. In response to receiving instructions to receive a second subset from another node device, perform operations including: receive and store the second subset; group the blocks of data of the first and second subsets into multiple segments in an order that corresponds to the ordering among the multiple blocks, wherein each segment is sized to fit minimum and maximum sizes for transmission to storage device(s); transmit the multiple segments to the storage device(s); and relay multiple segment identifiers from the storage device(s) to the control device in an order corresponding to the ordering among the multiple segments.
    Type: Grant
    Filed: December 27, 2018
    Date of Patent: September 3, 2019
    Assignee: SAS INSTITUTE INC.
    Inventors: Brian Payton Bowman, Jeff Ira Cleveland, III
  • Patent number: 10394890
    Abstract: An apparatus includes a processor to: receive a request to generate a DAG of a job flow of multiple tasks of an analysis based on data table(s) and formulae of a spreadsheet data structure; correlate each indication of data required as input or output to at least a subpart of a data table; identify data dependencies and determine an order of performance among the multiple tasks based on the formulae; generate, within the specified federated area, a job flow definition that specifies the order of performance of the multiple tasks; for each task of the multiple tasks, generate, within the specified federated area, a corresponding macro data structure of multiple macro data structures; and generate the requested visualization based on the job flow definition and the multiple macro data structures.
    Type: Grant
    Filed: December 20, 2018
    Date of Patent: August 27, 2019
    Assignee: SAS INSTITUTE INC.
    Inventors: Minna Jin, Huina Chen, Juan Du, Henry Gabriel Victor Bequet
  • Patent number: 10387214
    Abstract: Data processing can be managed in a distributed computing environment (DCE). In one example, the DCE can receive a sequence of computing operations to be consecutively executed in the DCE. For each computing operation in the sequence, the DCE can receive input data for the computing operation, partition the input data into subsets, and determine whether the computing operation is linear or non-linear. The DCE can then apply different processing techniques to the subsets depending on whether the computing operation is linear or non-linear.
    Type: Grant
    Filed: March 19, 2019
    Date of Patent: August 20, 2019
    Assignee: SAS INSTITUTE INC.
    Inventors: Stacey Michelle Christian, Steven Daniel Miles, Katherine Fullington Taylor
  • Patent number: 10386271
    Abstract: The computing device receives information representing a design of an experiment. The design of the experiment comprises a plurality of test cases. Each element of a test case of the design is a test condition for testing one of factors for the experiment. The computing device generates a graphical representation based on the information. The graphical representation comprises a plurality of axes for respective ones of factors used in a fractal sequence. Each of the plurality of axes has two endpoints comprising a first endpoint that corresponds to a first level of a respective factor, and a second endpoint that corresponds to a second level of the respective factor. The computing device plots, on the graphical representation, data corresponding to one or more test cases of the plurality of test cases. The computing device displays an interactive graphical user interface comprising the graphical representation with the plotted data.
    Type: Grant
    Filed: January 4, 2019
    Date of Patent: August 20, 2019
    Assignee: SAS Institute Inc.
    Inventors: Caleb Bridges King, Joseph Albert Morgan, Ryan Adam Lekivetz
  • Publication number: 20190250569
    Abstract: Machines can be controlled using advanced control systems that implement an automated version of singular spectrum analysis (SSA). For example, a control system can perform SSA on a time series having one or more time-dependent variables by: generating a trajectory matrix from the time series, performing singular value decomposition on the trajectory matrix to determine elementary matrices; and categorizing the elementary matrices into groups. The elementary matrices can be automatically categorized into the groups by: generating one or more w-correlation matrices based on spectral components associated with the time series, determining w-correlation values based on the one or more w-correlation matrices; categorizing the w-correlation values into a predefined number of w-correlation sets, and forming the groups based on the predefined number of w-correlation sets. The control system can then generate a predictive forecast using the groups and control operation of a machine using the predictive forecast.
    Type: Application
    Filed: April 18, 2019
    Publication date: August 15, 2019
    Applicant: SAS Institute Inc.
    Inventors: Michael James Leonard, David Bruce Elsheimer, Yuelei Sui
  • Patent number: 10380185
    Abstract: An apparatus includes a processor to: receive a request to provide, within a specified federated area, a set of objects that enable a performance of a job flow to perform multiple tasks of an analysis based on data table(s) and formulae of a spreadsheet data structure, wherein the set of objects includes at least one task routine to perform a task of the multiple tasks; correlate each indication of data required as input or output to at least a subpart of a data table; identify data dependencies and determine an order of performance among the multiple tasks based on the formulae; generate, within the specified federated area, a job flow definition that specifies the order of performance of the multiple tasks; and for each task routine of the at least one task routine, generate, within the specified federated area, a corresponding macro data structure.
    Type: Grant
    Filed: December 18, 2018
    Date of Patent: August 13, 2019
    Assignee: SAS INSTITUTE INC.
    Inventors: Minna Jin, Huina Chen, Juan Du, Henry Gabriel Victor Bequet
  • Patent number: 10380214
    Abstract: An apparatus includes a processor to: receive multiple normalized metadata portions based on metadata portions originating from vendor devices storing data sets of a distributed online library system; compare the multiple pieces of information between pairs of normalized metadata portions to identify at least one pair of identical portions of data; analyze the pieces of information of normalized metadata portions corresponding to an identified pair of identical portions of data to determine if there is a dependency relationship between each portion of data of the pair and another identical portion of data stored within another device; and in response to there being such a pair of dependency relationships, generate a visualization that includes a combination of graphical elements depicting the pair of dependency relationships, and transmit the visualization to the client device to enable a visual presentation of the visualization.
    Type: Grant
    Filed: December 29, 2018
    Date of Patent: August 13, 2019
    Assignee: SAS INSTITUTE INC.
    Inventors: Nancy Anne Rausch, Christopher L. Replogle, III, Elizabeth Jane McIntosh
  • Publication number: 20190243865
    Abstract: An apparatus includes a processor to: receive multiple normalized metadata portions based on metadata portions originating from vendor devices storing data sets of a distributed online library system; compare the multiple pieces of information between pairs of normalized metadata portions to identify at least one pair of identical portions of data; analyze the pieces of information of normalized metadata portions corresponding to an identified pair of identical portions of data to determine if there is a dependency relationship between each portion of data of the pair and another identical portion of data stored within another device; and in response to there being such a pair of dependency relationships, generate a visualization that includes a combination of graphical elements depicting the pair of dependency relationships, and transmit the visualization to the client device to enable a visual presentation of the visualization.
    Type: Application
    Filed: December 29, 2018
    Publication date: August 8, 2019
    Applicant: SAS Institute Inc.
    Inventors: Nancy Anne RAUSCH, Christopher L. REPLOGLE, III, Elizabeth Jane MCINTOSH
  • Patent number: 10372734
    Abstract: The operation of a machine can be controlled by performing reconciliation using a cluster of nodes. In one example, a node can receive parent timestamped data from a parent dataset and child timestamped data from child datasets that are children of the parent dataset in a hierarchical relationship. The parent timestamped data and the child timestamped data can relate to an operational characteristic of the machine. The node can generate computer processing-threads. Each computer processing-thread can solve one or more respective reconciliation problems between a parent data point that has a particular timestamp in the parent timestamped data and child data points that also have the particular timestamp in the child timestamp data to generate a reconciled dataset. An operational setting of the machine can then be adjusted based on the reconciled dataset.
    Type: Grant
    Filed: December 8, 2017
    Date of Patent: August 6, 2019
    Assignee: SAS INSTITUTE INC.
    Inventors: Michele Angelo Trovero, Byron Davis Biggs, Jennifer Leigh Sloan Beeman, Michael James Leonard
  • Patent number: 10366117
    Abstract: Systems and methods are provided for generating a set of classifiers. A term is identified within a document and a pre-defined threshold distance is determined. A plurality of additional terms in the document are identified, the additional terms being located within the pre-defined threshold distance of the time. A distance between the term and an additional term of the plurality of additional terms is calculated. A corresponding weight for the calculated distance is determined using a proximity weighting scheme. A score for the additional term is calculated using the calculated distance and the corresponding weight. A colocation matrix is generated and a classifier determined using the colocation matrix.
    Type: Grant
    Filed: July 13, 2015
    Date of Patent: July 30, 2019
    Assignee: SAS INSTITUTE INC.
    Inventors: Bruce Monroe Mills, John Courtney Haws, John Clare Brocklebank, Thomas Robert Lehman
  • Patent number: 10360500
    Abstract: A computing system provides distributed training of a neural network model. Explore phase options, exploit phase options, a subset of a training dataset, and a validation dataset are distributed to a plurality of computing devices. (a) Execution of the model by the computing devices is requested using the subset stored at each computing device. (b) A first result of the execution is received from a computing device. (c) Next configuration data for the neural network model is selected based on the first result and distributed to the computing device. (a) to (c) is repeated until an exploration phase is complete. (d) Execution of the neural network model is requested. (e) A second result is received. (f) Next configuration data is computed based on the second result and distributed to the computing device. (d) to (f) is repeated until an exploitation phase is complete. The next configuration data defines the model.
    Type: Grant
    Filed: April 5, 2018
    Date of Patent: July 23, 2019
    Assignee: SAS Institute Inc.
    Inventors: Mustafa Onur Kabul, Lawrence E. Lewis
  • Patent number: 10360069
    Abstract: An apparatus includes a processor to: perform a testing job flow at least partly within a testing federated area to test a neural network defined by configuration data specifying hyperparameters and trained parameters thereof; and perform a transfer flow to transfer an object indicative of results of the testing from the testing federated area to another federated area, wherein: in response to the degree of accuracy falling below a predetermined minimum threshold, the processor is caused to transfer a specification of the degree of accuracy or a portion of inaccurate output to a training federated area in which the neural network was at least partly trained; and in response to the degree of accuracy exceeding a predetermined maximum threshold, the processor is caused to transfer a copy of the neural network configuration data to a usage federated area in which the neural network is to be made available for use.
    Type: Grant
    Filed: July 19, 2018
    Date of Patent: July 23, 2019
    Assignee: SAS INSTITUTE INC.
    Inventors: Henry Gabriel Victor Bequet, Huina Chen, Juan Du
  • Patent number: 10360517
    Abstract: A computing device automatically selects hyperparameter values based on objective criteria to train a predictive model. Each session of a plurality of sessions executes training and scoring of a model type using an input dataset in parallel with other sessions of the plurality of sessions. Unique hyperparameter configurations are determined using a search method and assigned to each session. For each session of the plurality of sessions, training of a model of the model type is requested using a training dataset and the assigned hyperparameter configuration, scoring of the trained model using a validation dataset and the assigned hyperparameter configuration is requested to compute an objective function value, and the received objective function value and the assigned hyperparameter configuration are stored. A best hyperparameter configuration is identified based on an extreme value of the stored objective function values.
    Type: Grant
    Filed: November 27, 2017
    Date of Patent: July 23, 2019
    Assignee: SAS INSTITUTE INC.
    Inventors: Patrick Nathan Koch, Brett Alan Wujek, Oleg Borisovich Golovidov, Steven Joseph Gardner, Joshua David Griffin, Scott Russell Pope, Yan Xu
  • Patent number: 10354204
    Abstract: A computing device automatically classifies an observation vector. A label set defines permissible values for a target variable. Supervised data includes a labeled subset that has one of the permissible values. A converged classification matrix is computed based on the supervised data and an unlabeled subset using a prior class distribution matrix that includes a row for each observation vector. Each column is associated with a single permissible value of the label set. A cell value in each column is a likelihood that each associated permissible value of the label set occurs based on prior class distribution information. The value of the target variable is selected using the converged classification matrix. A weighted classification label distribution matrix is computed from the converged classification matrix. The value of the target variable for each observation vector of the plurality of observation vectors is output to a labeled dataset.
    Type: Grant
    Filed: October 17, 2018
    Date of Patent: July 16, 2019
    Assignee: SAS Institute Inc.
    Inventors: Xu Chen, Saratendu Sethi
  • Patent number: 10346211
    Abstract: An apparatus includes a processor to: assign a portion of currently available instruction-based processing resources to a first non-neuromorphic performance of an analytical function; in response to availability of sufficient remaining processing resources for a first neuromorphic performance of the analytical function with the same input values, assign a portion of the remaining processing resources to the first neuromorphic performance; analyze the output values generated by the first neuromorphic and non-neuromorphic performances to determine a degree of accuracy of the neural network in performing the analytical function; in response to at least the degree of accuracy exceeding a predetermined threshold, assign a portion of currently available processing resources to a second neuromorphic performance of the analytical function; and in response to availability of sufficient remaining processing resources for a second non-neuromorphic performance of the analytical function, assign a portion of the remaining
    Type: Grant
    Filed: July 19, 2018
    Date of Patent: July 9, 2019
    Assignee: SAS INSTITUTE INC.
    Inventors: Henry Gabriel Victor Bequet, Huina Chen, III, Juan Du