Patents Assigned to SAS Institute
-
Patent number: 11100428Abstract: A computing device predicts occurrence of an event or classifies an object using distributed unlabeled data. A Laplacian matrix is computed using a kernel function. A predefined number of eigenvectors is selected from a decomposed Laplacian matrix to define a decomposition matrix. A gradient value is computed as a function of the defined decomposition matrix, a plurality of sparse coefficients, and a label matrix, a value of each coefficient of the plurality of sparse coefficients is updated based on the computed gradient value, and the computations are repeated until a convergence parameter value indicates the plurality of sparse coefficients have converged. A classification matrix is defined using the plurality of sparse coefficients to determine the target variable value for each observation vector of the plurality of unclassified observation vectors. The target variable value for each observation vector of the plurality of unclassified observation vectors is output.Type: GrantFiled: December 9, 2019Date of Patent: August 24, 2021Assignee: SAS Institute Inc.Inventor: Xu Chen
-
Patent number: 11100395Abstract: An analytic system provides direct functional principal component analysis. (A) A next group variable value is selected from values of a group variable. (B) Explanatory variable values of observations having the selected next group variable value are sorted in ascending order. (C) The response variable value associated with each sorted explanatory variable value is stored in a next row of a data matrix. (D) (A) through (C) are repeated. (E) An eigenfunction index is incremented. (F) An FPCA is performed using the data matrix to define an eigenfunction for the eigenfunction index. (G) (E) and (F) are repeated. (H) FPCA results from the performed FPCA are presented within a window of a display. The FPCA results include an eigenvalue and an eigenfunction associated with the eigenvalue for each functional principal component identified from the performed FPCA in (F).Type: GrantFiled: January 26, 2021Date of Patent: August 24, 2021Assignee: SAS Institute Inc.Inventors: Ryan Jeremy Parker, Clayton Adam Barker, Christopher Michael Gotwalt
-
Publication number: 20210255843Abstract: An apparatus includes a processor to: based on data dependencies specified in a job flow definition, identify first and second tasks of the corresponding job flow to be performed sequentially, wherein the first task outputs a data object used as an input to the second; store, within a task queue, at least one message conveying at least an identifier of the first task, and an indication that the data object is to be exchanged through a shared memory space; within a task container, in response to storage of the at least one message within the task queue, sequentially execute first and second task routines to sequentially perform the first and second tasks, respectively, and instantiate the shared memory space to be accessible to the first and second task routines during their executions; and upon completion of the job flow, transmit an indication of completion to another device via a network.Type: ApplicationFiled: May 5, 2021Publication date: August 19, 2021Applicant: SAS Institute Inc.Inventors: Henry Gabriel Victor Bequet, Ronald Earl Stogner, Eric Jian Yang, Qing Gong, Partha Dutta, Kais Arfaoui
-
Patent number: 11093833Abstract: Tuned hyperparameter values are determined for training a machine learning model. When a selected hyperparameter configuration does not satisfy a linear constraint, if a projection of the selected hyperparameter configuration is included in a first cache that stores previously computed projections is determined. When the projection is included in the first cache, the projection is extracted from the first cache using the selected hyperparameter configuration, and the selected hyperparameter configuration is replaced with the extracted projection in the plurality of hyperparameter configurations. When the projection is not included in the first cache, a projection computation for the selected hyperparameter configuration is assigned to a session. A computed projection is received from the session for the selected hyperparameter configuration.Type: GrantFiled: October 27, 2020Date of Patent: August 17, 2021Assignee: SAS Institute Inc.Inventors: Steven Joseph Gardner, Joshua David Griffin, Yan Xu, Patrick Nathan Koch, Brett Alan Wujek, Oleg Borisovich Golovidov
-
Patent number: 11093864Abstract: A computing system computes a variable relevance using a trained tree model. (A) A next child node is selected. (B) A number of observations associated with the next child node is computed. (C) A population ratio value is computed. (D) A next leaf node is selected. (E) First observations are identified. (F) A first impurity value is computed for the first observations. (G) Second observations are identified when the first observations are associated with the descending child nodes. (H) A second impurity value is computed for the second observations. (I) A gain contribution is computed. (J) A node gain value is updated. (K) (D) through (J) are repeated. (L) A variable gain value is updated for a variable associated with the split test. (M) (A) through (L) are repeated. (N) A set of relevant variables is selected based on the variable gain value.Type: GrantFiled: November 10, 2020Date of Patent: August 17, 2021Assignee: SAS Institute Inc.Inventor: Brandon Michael Reese
-
Patent number: 11087033Abstract: A computing system generates a subset of design cases of candidate design cases. The system indexes, in the subset, data elements. The system generates a design of an experiment by, for each respective data element, determining a status indicating whether the respective data element corresponds to an uncontrolled factor or a controlled factor. When the status indicates the uncontrolled factor, the system determines if substituting a respective set of specified options of a respective candidate design case comprising the respective data element with a different set of specified options of the candidate design cases improves a criterion measure according to a design criterion. When the status indicates the controlled factor, the system determines if changing an assigned option of the respective data element improves the criterion measure. The system updates the criterion measure with an updated criterion measure according to a change of the subset based on generating the design.Type: GrantFiled: January 20, 2021Date of Patent: August 10, 2021Assignee: SAS Institute Inc.Inventors: Ryan Adam Lekivetz, Bradley Allen Jones, Joseph Albert Morgan, Caleb Bridges King
-
Patent number: 11087215Abstract: A computing device classifies unclassified observations. A first batch of noise observations is generated. (A) A first batch of unclassified observations is selected. (B) A first batch of classified observations is selected. (C) A discriminator neural network model trained to classify unclassified observations and noise observations is updated with observations that include the first batch of unclassified observations, the first batch of classified observations, and the first batch of noise observations. (D) A discriminator loss value is computed that includes an adversarial loss term computed using a predefined transition matrix. (E) A second batch of unclassified observations is selected. (F) A second batch of noise observations is generated. (G) A generator neural network model trained to generate a fake observation vector for the second batch of noise observations is updated with the second batch of unclassified observations and the second batch of noise observations. (H) (A) to (G) is repeated.Type: GrantFiled: April 7, 2021Date of Patent: August 10, 2021Assignee: SAS Institute Inc.Inventor: Xu Chen
-
Patent number: 11080602Abstract: A computing system trains a reinforcement learning model comprising multiple different attention model components. The reinforcement learning model trains on training data of a first environment (e.g., a first traffic intersection). The reinforcement learning model trains by training a state attention computer model on the training data that weighs each of respective inputs of a respective state. The reinforcement learning model trains by training an action attention computer model that determines a probability of switching from a first action to a second action of the first set of the multiple candidate actions (e.g., changing traffic colors of traffic lights). Alternatively, or additionally, a computing system generates an indication of a selected outcome according to the reinforcement learning model and sends a selection output to the second environment (e.g., a second traffic intersection with more lanes than the first traffic intersection) to implement the selected action in the second environment.Type: GrantFiled: February 17, 2021Date of Patent: August 3, 2021Assignee: SAS Institute Inc.Inventors: Afshin Oroojlooyjadid, Mohammadreza Nazari, Davood Hajinezhad, Jorge Manuel Gomes da Silva
-
Patent number: 11074483Abstract: A computing system receives a request to validate, according to a validation specification, a response of a system of operation. The validation specification relates to determining deviation from a specified result in response to testing one or more candidate inputs. The initial design space defines design points for the system of operation according to the respective candidate inputs for each factor. The computing system, responsive to the request to validate the response of the system of operation, generates data representing a covering array for design points defined by the initial design space or a subset design space. The computing system generates, based on the data, a test suite for testing the system of operation. The computing system obtains the response of the system of operation and generates an output indicating the deviation from the specified result.Type: GrantFiled: October 19, 2020Date of Patent: July 27, 2021Assignee: SAS Institute Inc.Inventors: Joseph Albert Morgan, Ryan Adam Lekivetz, Caleb Bridges King, Bradley Allen Jones
-
Patent number: 11074412Abstract: A system trains a classification model. Text windows are defined from tokens based on a window size. A network model including a transformer network is trained with the text windows to define classification information. A first accuracy value is computed. (A) The window size is reduced using a predefined reduction factor value. (B) Second text windows are defined based on the reduced window size. (C) Retrain the network model with the second text windows to define classification information. (D) A second accuracy value is computed. (E) An accuracy reduction value is computed from the second accuracy value relative to the first accuracy value. When the computed accuracy reduction value is ?an accuracy reduction tolerance value, repeat (A)-(E) until the accuracy reduction value is <the accuracy reduction tolerance value. Otherwise, increase the window size, define final text windows based on the increased window size, and retrain the network model.Type: GrantFiled: March 16, 2021Date of Patent: July 27, 2021Assignee: SAS Institute Inc.Inventors: Samuel Paul Leeman-Munk, James Allen Cox, David Blake Styles, Richard Welland Crowell
-
Publication number: 20210224051Abstract: An apparatus includes a processor to: use an identifier of a requesting device or operator thereof to identify federated area(s) to which access is authorized; based on data dependencies among a set of tasks of a job flow, derive an order of performance specifying the first task to be performed; store, within a task queue, a task routine execution request message including an identifier associated with the first task, and federated area identifier(s) of the identified federated area(s); within a resolver container, in response to storage of the task routine execution request message, use the identifier associated with the first task and identifier(s) of the federated area(s) to identify one in which a first task routine is stored; within a task container, execute the first task routine to perform the first task; and upon completion of the job flow, transmit an indication of completion to the requesting device.Type: ApplicationFiled: April 7, 2021Publication date: July 22, 2021Applicant: SAS Institute Inc.Inventors: Henry Gabriel Victor Bequet, Ronald Earl Stogner, Eric Jian Yang, Qing Gong, Partha Dutta, Kais Arfaoui
-
Patent number: 11062219Abstract: A computer solves a nonlinear optimization problem. An optimality check is performed for a current solution to an objective function that is a nonlinear equation with constraint functions on decision variables. When the performed optimality check indicates that the current solution is not an optimal solution, a barrier parameter value is updated, and a Lagrange multiplier value is updated for each constraint function based on a result of a complementarity slackness test. The current solution to the objective function is updated using a search direction vector determined by solving a primal-dual linear system that includes a dual variable for each constraint function and a step length value determined for each decision variable and for each dual variable. The operations are repeated until the optimality check indicates that the current solution is the optimal solution or a predefined number of iterations has been performed.Type: GrantFiled: November 30, 2020Date of Patent: July 13, 2021Assignee: SAS Institute Inc.Inventors: Joshua David Griffin, Riadh Omheni, Yan Xu
-
Patent number: 11055620Abstract: A computing system trains a clustering model. (A) Beta distribution parameter values are computed for each cluster using a mass parameter value and a responsibility parameter vector of each observation vector. (B) Parameter values are computed for a normal-Wishart distribution for each observation vector included in a batch of a plurality of observation vectors. (C) Each responsibility parameter vector defined for each observation vector of the batch is updated using the beta distribution parameter values, the parameter values for the normal-Wishart distribution, and a respective observation vector of the selected batch of plurality of observation vectors. (D) A convergence parameter value is computed. (E) (A) to (D) are repeated until the convergence parameter value indicates the responsibility parameter vector defined for each observation vector is converged. A cluster membership is determined for each observation vector using the responsibility parameter vector.Type: GrantFiled: November 17, 2020Date of Patent: July 6, 2021Assignee: SAS Institute Inc.Inventors: Yingjian Wang, Raymond Eugene Wright
-
Patent number: 11055861Abstract: A computing system receives historical data. The historical data comprises physical actions taken in an experiment in a physical environment. The experiment comprises user-defined stages. The historical data comprises a recorded outcome, according to user-defined performance indicator(s) related to the user-defined stages, for each physical action taken in the experiment. The system generates, by a discrete event simulator, a computing representation of a simulated environment of the physical environment. The simulated environment comprises processing stages. The system obtains simulation data. The simulation data comprises simulated actions taken by the discrete event simulator. The simulation data comprises a predicted outcome, according to user-defined performance indicator(s) related to the processing stages, for each simulated action taken by the discrete event simulator. The system validates accuracy of the discrete event simulator at predicting the recorded outcome in the experiment.Type: GrantFiled: October 1, 2020Date of Patent: July 6, 2021Assignee: SAS Institute Inc.Inventors: Mohammadreza Nazari, Afshin Oroojlooyjadid, Alexander Richard Phelps, Davood Hajinezhad, Bahar Biller, Jonathan Lee Walker, Hamza Mustafa Ghadyali, Kedar Shriram Prabhudesai, Xunlei Wu, Xingqi Du, Jorge Manuel Gomes da Silva, Varunraj Valsaraj, Jinxin Yi
-
Publication number: 20210200937Abstract: Embodiments are generally directed to techniques for extracting contextually structured data from document images, such as by automatically identifying document layout, document data, and/or document metadata in a document image, for instance. Many embodiments are particularly directed to generating and utilizing a document template database for automatically extracting document image contents into a contextually structured format. For example, the document template database may include a plurality of templates for identifying/explaining key data elements in various document image formats that can be used to extract contextually structured data from incoming document images with a matching document image format. Several embodiments are particularly directed to automatically identifying and associating document metadata with corresponding document data in a document image, such as for generating a machine-facilitated annotation of the document image.Type: ApplicationFiled: November 5, 2020Publication date: July 1, 2021Applicant: SAS Institute Inc.Inventors: David James Wheaton, William Robert Nadolski, Heather Michelle GoodyKoontz
-
Patent number: 11048884Abstract: A computing system receives a collection comprising multiple sets of ordered terms, including a first set. The system generates a dataset indicating an association between each pair of terms within a same set of the collection by generating co-occurrence score(s) for the first set. The system generates computed probabilities based on the co-occurrence score(s) for the first set. The computed probabilities indicate a likelihood that one term in a given pair of terms of the collection appears in a given set of the collection given that another term in the given pair of terms of the collection occurs. The system smoothes the computed probabilities by adding one or more random observations. The system generates one or more association indications for the first set based on the smoothed computed probabilities. The system outputs an indication of the dataset. Additionally, or alternatively, based on association measure(s), the system generates a virtual term.Type: GrantFiled: October 1, 2020Date of Patent: June 29, 2021Assignee: SAS Institute Inc.Inventors: James Allen Cox, Russell Albright, Saratendu Sethi
-
Patent number: 11042549Abstract: A computing system receives a program. The program is in a first computer language and specifies computer operations on stored data. The computing system is configured to partition the stored data into sets of partitioned data for performing parallel execution on each of the sets of partitioned data. The computing system determines whether the program comprises a thread program component. The computing system, responsive to determining that the program comprises a thread program component, generates computer-generated computer instructions. The computer-generated computer instructions are in a second computer language. The computer-generated computer instructions are dependent on whether the thread program component specifies information for partitioning and grouping the stored data; whether the program comprises a data program component; or whether the data program component specifies information for partitioning and grouping the output data of the thread program component.Type: GrantFiled: September 11, 2019Date of Patent: June 22, 2021Assignee: SAS Institute Inc.Inventor: David Abu Ghazaleh
-
Publication number: 20210158171Abstract: An apparatus includes processor(s) to: receive a request for a data catalog; in response to the request specifying a structural feature, analyze metadata of multiple data sets for an indication of including it, and to retrieve an indicated degree of certainty of detecting it for data sets including it; in response to the request specifying a contextual aspect, analyze context data of the multiple data sets for an indication of being subject to it, and to retrieve an indicated degree of certainty concerning it for data sets subject to it; selectively include each data set in the data catalog based on the request specifying a structural feature and/or a contextual aspect, and whether each data set meets what is specified; for each data set in the data catalog, generate a score indicative of the likelihood of meeting what is specified; and transmit the data catalog to the requesting device.Type: ApplicationFiled: February 2, 2021Publication date: May 27, 2021Applicant: SAS Institute Inc.Inventors: Nancy Anne Rausch, Roger Jay Barney, John P. Trawinski
-
Publication number: 20210157595Abstract: An apparatus includes a processor core to: receive a request to execute application code including a trigger instruction and an instruction block that reads a row of data values from a data structure and outputs a data value from a function using the row as input, wherein the data structure is divided into multiple portions and the trigger instruction indicates that multiple instances of the instruction block are to be executed concurrently; and in response to the request, and to identification of the instruction block and trigger instruction: generate multiple instances of a support block that causes independent repetitive execution of each instance of the instruction block until all rows of the corresponding portion of the data structure are used as input; assign instances of the instruction and support blocks to multiple processor cores; and provide each instance of the instruction block with the corresponding portion of the data structure.Type: ApplicationFiled: November 27, 2020Publication date: May 27, 2021Applicant: SAS Institute Inc.Inventors: Jack Joseph Rouse, Robert William Pratt, Jack Carl Erickson, Manoj Keshavmurthi Chari
-
Patent number: 11017308Abstract: The computing device receives a first user input request to modify a structural equation model (SEM) in a graphical user interface. The modification of the SEM includes modifying one or more SEM path diagram elements. The computing device detects whether a first SEM path diagram element is modified responsive to the received first user input request. Based on the detection, the computing device determines whether the modification violates a first set of SEM rules, a second set of SEM rules, or one or more launch conditions prior to initiating execution of the SEM. Based on determining a violation of the SEM rules or the launch conditions or that there was not a violation, the computing device displays a graphical indicator for indicating a fatal error for the SEM modification, a warning error for the SEM modification, or a valid SEM modification.Type: GrantFiled: October 13, 2020Date of Patent: May 25, 2021Assignee: SAS Institute Inc.Inventors: Laura Castro-Schilo, James Robert Koepfler, Christopher Michael Gotwalt