Patents Assigned to SAS Institute Inc.
  • Patent number: 11416712
    Abstract: A computing device generates synthetic tabular data.
    Type: Grant
    Filed: December 23, 2021
    Date of Patent: August 16, 2022
    Assignee: SAS Institute, Inc.
    Inventors: Amirhassan Fallah Dizche, Ye Liu, Xin Jiang Hunt, Jorge Manuel Gomes da Silva
  • Publication number: 20220253335
    Abstract: An apparatus includes a processor to: receive a request to perform a job flow; within a performance container, based on the data dependencies among a set of tasks of the job flow, derive an order of performance of the set of tasks that includes a subset able to be performed in parallel, and derive a quantity of task containers to enable the parallel performance of the subset; based on the derived quantity of task containers, derive a quantity of virtual machines (VMs) to enable the parallel performance of the subset; provide, to a VM allocation routine, an indication of a need for provision of the quantity of VMs; and store, within a task queue, multiple task routine execution request messages to enable parallel execution of task routines within the quantity of task containers to cause the parallel performance of the subset.
    Type: Application
    Filed: April 29, 2022
    Publication date: August 11, 2022
    Applicant: SAS Institute Inc.
    Inventors: Henry Gabriel Victor Bequet, Ronald Earl Stogner, Eric Jian Yang, Chaowang "Ricky" Zhang
  • Patent number: 11409966
    Abstract: An apparatus to: analyze a data set to identify a candidate topic not in a set of topics; determine whether the prominence of the candidate topic within the data set meets a threshold; in response to meeting the threshold, retrieve a rate of increase in frequency of the candidate topic in online searches; in response to meeting a threshold rate of increase, retrieve the keyword most frequently used in online searches for the candidate topic, use the keyword to retrieve a supplemental data set, and analyze input data extracted from the supplemental data set to determine whether the candidate topic can change the accuracy of a forecast model; and in response to determining that the candidate topic can change the accuracy, add the candidate topic to the set of topics and replace the forecast model with a forecast model trained for the set of topics augmented with the candidate topic.
    Type: Grant
    Filed: December 17, 2021
    Date of Patent: August 9, 2022
    Assignee: SAS INSTITUTE INC.
    Inventors: Anand Arun Phand, Sudeshna Guhaneogi, Narender Ceechamangalam Veeraraghavan, Ravinder Singh Chauhan, Shikha Bhat, Kaustubh Yashvant Khandwe, Shalini Sinha, Vineet Roy, Alina Olegovna Asadullina, Vitaly Igorevich Plekhanov, Elizaveta Alekseevna Lavrenova, Dmitry Sergeevich Bodunov, Assol Raufjonovna Kubaeva, Stephen Joseph Ondrik, Steffen-Horst Schl├╝ter, Joseph Michael Martino, John Zhiqiang Zhao, Pravinkumar Bhalerao, Valentina Larina
  • Patent number: 11404053
    Abstract: An apparatus includes processor(s) to: generate a set of candidate n-grams based on probability distributions from an acoustic model for candidate graphemes of a next word most likely spoken following at least one preceding word spoken within speech audio; provide the set of candidate n-grams to multiple devices; provide, to each node device, an indication of which candidate n-grams are to be searched for within the n-gram corpus by each node device to enable searches for multiple candidate n-grams to be performed, independently and at least partially in parallel, across the node devices; receive, from each node device, an indication of a probability of occurrence of at least one candidate n-gram within the speech audio; based on the received probabilities of occurrence, identify the next word most likely spoken within the speech audio; and add the next word most likely spoken to a transcript of the speech audio.
    Type: Grant
    Filed: July 8, 2021
    Date of Patent: August 2, 2022
    Assignee: SAS INSTITUTE INC.
    Inventors: Xiaozhuo Cheng, Xu Yang, Xiaolong Li, Biljana Belamaric Wilsey, Haipeng Liu, Jared Peterson
  • Patent number: 11403527
    Abstract: A computing device trains a neural network machine learning model. A forward propagation of a first neural network is executed. A backward propagation of the first neural network is executed from a last layer to a last convolution layer to compute a gradient vector. A discriminative localization map is computed for each observation vector with the computed gradient vector using a discriminative localization map function. An activation threshold value is selected for each observation vector from at least two different values based on a prediction error of the first neural network. A biased feature map is computed for each observation vector based on the activation threshold value selected for each observation vector. A masked observation vector is computed for each observation vector using the biased feature map. A forward and a backward propagation of a second neural network is executed a predefined number of iterations using the masked observation vector.
    Type: Grant
    Filed: October 13, 2021
    Date of Patent: August 2, 2022
    Assignee: SAS Institute Inc.
    Inventors: Xinmin Wu, Yingjian Wang, Xiangqian Hu
  • Patent number: 11379685
    Abstract: A computing device classifies unclassified observations. A first batch of unclassified observation vectors and a first batch of classified observation vectors are selected. A prior regularization error value and a decoder reconstruction error value are computed. A first batch of noise observation vectors is generated. An evidence lower bound (ELBO) value is computed. A gradient of an encoder neural network model is computed, and the ELBO value is updated. A decoder neural network model and an encoder neural network model are updated. The decoder neural network model is trained. The target variable value is determined for each observation vector of the unclassified observation vectors based on an output of the trained decoder neural network model. The target variable value is output.
    Type: Grant
    Filed: July 28, 2021
    Date of Patent: July 5, 2022
    Assignee: SAS Institute Inc.
    Inventor: Xu Chen
  • Patent number: 11379743
    Abstract: A computing device determines a recommendation. (A) A first parameter matrix is updated using a first direction matrix and a first step-size parameter value that is greater than one. The first parameter matrix includes a row dimension equal to a number of users of a plurality of users included in a ratings matrix and the ratings matrix includes a missing matrix value. (B) A second parameter matrix is updated using a second direction matrix and a second step-size parameter value that is greater than one. The second parameter matrix includes a column dimension equal to a number of items of a plurality of items included in the ratings matrix. (C) An objective function value is updated based on the first parameter matrix and the second parameter matrix. (D) (A) through (C) are repeated until the first parameter matrix and the second parameter matrix satisfy a convergence test.
    Type: Grant
    Filed: July 28, 2021
    Date of Patent: July 5, 2022
    Assignee: SAS Institute Inc.
    Inventors: Xuejun Liao, Patrick Nathan Koch, Shunping Huang, Yan Xu
  • Patent number: 11373655
    Abstract: An apparatus includes processor(s) to: perform preprocessing operations of a segmentation technique including divide speech data set into data chunks representing chunks of speech audio, use an acoustic model with each data chunk to identify pauses in the speech audio, and analyze a length of time of each identified pause to identify a candidate set of likely sentence pauses in the speech audio; and perform speech-to-text operations including divide the speech data set into data segments that each representing segments of the speech audio based on the candidate set of likely sentence pauses, use the acoustic model with each data segment to identify likely speech sounds in the speech audio, analyze the identified likely speech sounds to identify candidate sets of words likely spoken in the speech audio, and generate a transcript of the speech data set based at least on the candidate sets of words likely spoken.
    Type: Grant
    Filed: October 12, 2021
    Date of Patent: June 28, 2022
    Assignee: SAS INSTITUTE INC.
    Inventors: Xiaolong Li, Xiaozhuo Cheng, Xu Yang
  • Patent number: 11373121
    Abstract: The computing device transforms lab data and field data into a first format suitable for execution with a supervised machine learning model to determine an input variable importance for a first set of input variables in predicting a field outcome. Based on the determination, the computing device generates one or more logical rules of decision metrics, selects the one or more input variables that yields a higher input variable importance, and generates one or more pass-fail indicators. The computing device combines the one or more pass-fail indicators and generates one or more prediction factor rules. The computing device transforms the field data and the one or more prediction factor rules into a second format suitable for execution with a model to determine a treatment effect for the one or more prediction factor rules. The computing device selects the prediction factor rule that maximizes the treatment effect.
    Type: Grant
    Filed: September 23, 2021
    Date of Patent: June 28, 2022
    Assignee: SAS Institute Inc.
    Inventors: John Wesley Gottula, Bryan Matthew Mutell, Michael Lee Henderson, II
  • Publication number: 20220197697
    Abstract: An apparatus includes a processor to: derive an order of performance of a set of tasks of a job flow; based on the order of performance, store, within a task queue, a first task routine execution request message to cause a first task to be performed; within a first task container, and in response to storage of the first task routine execution request message, execute instructions of a first task routine of a set of task routines, store a mid-flow data set output of the first task within a federated area, and store a first task completion message within the task queue after completion of storage of the mid-flow data set; and in response to the storage of the first task completion message, and based on the order of performance, store, within the task queue, a second task routine execution request message to cause a second task to be performed.
    Type: Application
    Filed: February 28, 2022
    Publication date: June 23, 2022
    Applicant: SAS Institute Inc.
    Inventors: Henry Gabriel Victor Bequet, Chaowang "RICKY" Zhang
  • Patent number: 11366699
    Abstract: Some examples describes herein relate to handling bulk requests for resources. In one example, a system can determine a bulk request parameter-value associated with a bulk request. The system can then predict a baseline benefit value, which can be a benefit value when the bulk request parameter-value is used as a lower boundary for a unit parameter-value. The system can also determine a lower boundary constraint on the unit parameter-value independently of the bulk request parameter-value. The system can then execute an iterative process using the baseline benefit value and the lower boundary constraint. Based on a result of the iterative process, the system can determine whether and how much the bulk request parameter-value should be adjusted. The system may adjust the bulk request parameter-value accordingly or output a recommendation to do so.
    Type: Grant
    Filed: February 11, 2022
    Date of Patent: June 21, 2022
    Assignee: SAS INSTITUTE INC.
    Inventors: Altan Gulcu, Xiaodong Yao
  • Patent number: 11361255
    Abstract: Graphical interactive model selection is provided. A response variable vector for each value of a group variable and an explanatory variable vector are defined. A wavelet function is fit to the explanatory variable vector paired with the response variable vector defined for each value of the group variable. Each fit wavelet function defines coefficients for each value of the group variable. A curve is presented for each value of the group variable and is defined by the plurality of coefficients of an associated fit wavelet function. An indicator is received of a request to perform functional analysis using the coefficients for each value of the of the group variable based on a predefined factor variable. A model is trained using the coefficients for each value of the group variable and a factor variable value associated with each observation vector of each plurality of observation vectors as a model effect.
    Type: Grant
    Filed: July 28, 2021
    Date of Patent: June 14, 2022
    Assignee: SAS Institute Inc.
    Inventors: Ryan Jeremy Parker, Clayton Adam Barker, Jeremy Ryan Ash, Christopher Michael Gotwalt
  • Patent number: 11354583
    Abstract: Logical rules can be automatically generated for use with event detection systems according to some aspects of the present disclosure. For example, a computing device can extract a group of logical rules from trained decision trees and apply a test data set to the group of logical rules to determine count values corresponding to the logical rules. The computing device can then determine performance metric values based on the count values, select a subset of logical rules from among the group of logical rules based on the performance metric values, and provide at least one logical rule in the subset for use with an event detection system. The event detection system can be configured to detect an event in relation to a target data set that was not used to train the decision trees.
    Type: Grant
    Filed: April 8, 2021
    Date of Patent: June 7, 2022
    Assignee: SAS INSTITUTE INC.
    Inventors: Hamoon Azizsoltani, Prathaban Mookiah, Weichen Wang, Thomas J. O'Connell
  • Patent number: 11354566
    Abstract: A treatment model that is a first neural network is trained to optimize a treatment loss function based on a treatment variable t using a plurality of observation vectors by regressing t on x(1),z. The trained treatment model is executed to compute an estimated treatment variable value {circumflex over (t)}i for each observation vector. An outcome model that is a second neural network is trained to optimize an outcome loss function by regressing y on x(2) and an estimated treatment variable t. The trained outcome model is executed to compute an estimated first unknown function value {circumflex over (?)}(xi(2)) and an estimated second unknown function value {circumflex over (?)}(xi(2)) for each observation vector. An influence function value is computed for a parameter of interest using {circumflex over (?)}(xi(2)) and {circumflex over (?)}(xi(2)). A value is computed for the predefined parameter of interest using the computed influence function value.
    Type: Grant
    Filed: October 21, 2021
    Date of Patent: June 7, 2022
    Assignee: SAS Institute Inc.
    Inventors: Xilong Chen, Douglas Allan Cairns, Jan Chvosta, David Bruce Elsheimer, Yang Zhao, Ming-Chun Chang, Gunce Eryuruk Walton, Michael Thomas Lamm
  • Patent number: 11346751
    Abstract: A computing system receives a request for computer-generated likelihood(s) for candidate evaluations of an industrial product set according to a measurement system analysis (MSA). The MSA comprises tests for evaluating, according to a measurement standard, the industrial product set. The request indicates a metric set representing metric(s) each quantifying an estimate of contribution to variation in evaluating the industrial product set according to the MSA. The system generates a design comprising a respective input set for each test of the MSA. The respective input set comprises setting(s) for conducting a test of the MSA. The design is designed to isolate candidate sources for contributing to the variation in evaluating the industrial product set according to the MSA. The system (e.g., prior to the MSA) outputs, based on the metric set and the design, the computer-generated likelihood(s) for the candidate evaluations of the industrial product set according to the MSA.
    Type: Grant
    Filed: July 29, 2021
    Date of Patent: May 31, 2022
    Assignee: SAS Institute Inc.
    Inventors: Caleb Bridges King, Joseph Albert Morgan, Ryan Adam Lekivetz, Bradley Allen Jones
  • Patent number: 11347686
    Abstract: An apparatus includes a processor to: instantiate data buffers of a queue, reading threads, and provision threads; within each reading thread, use an identifier provided in a data buffer of the queue to retrieve the corresponding data set part and part metadata from storage device(s), and store both within the data buffer; operate the queue as a (FIFO) buffer; within each provision thread, retrieve a row group from among multiple row groups and corresponding metadata from within the data buffer, use information in the metadata to decompress at least one column, and provide the data values of the row group to the requesting device or an application routine; and in response to each instance of storage of a data set part within a data buffer of the queue, analyze the availability of storage space and/or of processing resources to determine whether to dynamically adjust the quantity of reading threads.
    Type: Grant
    Filed: September 30, 2020
    Date of Patent: May 31, 2022
    Assignee: SAS INSTITUTE INC.
    Inventors: Brian Payton Bowman, Gordon Lyle Keener
  • Publication number: 20220167469
    Abstract: An apparatus includes a processor to: within a first container, and prior to its uninstantiation, execute a first instance of a routine to cause the processor to monitor for and detect a job performance request in a job queue; and within a second container, execute a second instance of the routine to cause the processor to search the job queue for a job performance request, and in response to a combination of the uninstantiation of the first container, the storage of the job performance request in the job queue and there being no indication of completion of the job flow in the job queue, perform a combination of store an indication of the job flow performance commencing in the job queue, derive an order of performance of the set of tasks of the job flow and store a first task execution request in a task queue.
    Type: Application
    Filed: December 28, 2021
    Publication date: May 26, 2022
    Applicant: SAS Institute Inc.
    Inventors: Henry Gabriel Victor Bequet, Ronald Earl Stogner, Eric Jian Yang, Qing Gong
  • Patent number: 11341414
    Abstract: An apparatus includes processor(s) to: receive a request for a data catalog; in response to the request specifying a structural feature, analyze metadata of multiple data sets for an indication of including it, and to retrieve an indicated degree of certainty of detecting it for data sets including it; in response to the request specifying a contextual aspect, analyze context data of the multiple data sets for an indication of being subject to it, and to retrieve an indicated degree of certainty concerning it for data sets subject to it; selectively include each data set in the data catalog based on the request specifying a structural feature and/or a contextual aspect, and whether each data set meets what is specified; for each data set in the data catalog, generate a score indicative of the likelihood of meeting what is specified; and transmit the data catalog to the requesting device.
    Type: Grant
    Filed: February 2, 2021
    Date of Patent: May 24, 2022
    Assignee: SAS INSTITUTE INC.
    Inventors: Nancy Anne Rausch, Roger Jay Barney, John P. Trawinski
  • Patent number: 11334809
    Abstract: A computing system receives as candidate predictors, for a model set, a list of terms for computer identification in dataset(s). The system receives initial user selections in the graphical user interface (GUI) of a term set, a response variable, and target value(s) for the response variable. The term set comprises candidate predictors from the list. The response variable is for a response to input to an initial model of the model set. The system generates the initial model that estimates a relationship between the target value(s) and the term set. The system displays in the GUI a performance representation of the initial model for user comparison of models and an indication of a contribution, to the initial model, of terms of a subset of the term set. The system receives a subsequent user selection, in the GUI, to change an aspect of the initial model.
    Type: Grant
    Filed: June 8, 2021
    Date of Patent: May 17, 2022
    Assignee: SAS Institute Inc.
    Inventors: Ernest C. Pasour, Clayton A. Barker, Paris Faison
  • Patent number: 11335350
    Abstract: An apparatus includes processor(s) to: perform pre-processing operations including derive an audio noise level of speech audio of a speech data set, derive a first relative weighting for first and second segmentation techniques for identifying likely sentence pauses in the speech audio based on the audio noise level, and select likely sentence pauses for a converged set of likely sentence pauses from likely sentence pauses identified by the first and/or second segmentation techniques based on the first relative weighting; and perform speech-to-text processing operations including divide the speech data set into data segments representing speech segments of the speech audio based on the converged set of likely sentence pauses, and derive a second relative weighting based on the audio noise level for selecting words indicated by an acoustic model or by a language model as being most likely spoken in the speech audio for inclusion in a transcript.
    Type: Grant
    Filed: October 12, 2021
    Date of Patent: May 17, 2022
    Assignee: SAS INSTITUTE INC.
    Inventors: Xiaolong Li, Xiaozhuo Cheng, Xu Yang