Patents Assigned to SAS Institute
-
Patent number: 12165031Abstract: A treatment model trained to compute an estimated treatment variable value for each observation vector of a plurality of observation vectors is executed. Each observation vector includes covariate variable values, a treatment variable value, and an outcome variable value. An outcome model trained to compute an estimated outcome value for each observation vector using the treatment variable value for each observation vector is executed. A standard error value associated with the outcome model is computed using a first variance value computed using the treatment variable value of the plurality of observation vectors, using a second variance value computed using the treatment variable value and the estimated treatment variable value of the plurality of observation vectors, and using a third variance value computed using the estimated outcome value of the plurality of observation vectors. The standard error value is output.Type: GrantFiled: December 5, 2023Date of Patent: December 10, 2024Assignee: SAS Institute Inc.Inventors: Sylvie Tchumtchoua Kabisa, Xilong Chen, Gunce Eryuruk Walton, David Bruce Elsheimer, Ming-Chun Chang
-
Publication number: 20240370697Abstract: In one example, a system can receive an input from a user indicating a target variable to be forecasted over a future time window. The system can then determine independent variables that influence the target variable and generate a set of candidate variables, including combinations of the independent variables. The system can then execute a random forest classifier to identify a subset of candidate variables having a threshold level of influence on the target variable. The system can then construct a machine-learning model configured to receive the identified subset of candidate variables as inputs and generate a forecast of the target variable. After constructing the machine-learning model, the system can train the machine-learning model using historical data and then execute the machine-learning model to generate the forecast.Type: ApplicationFiled: January 11, 2024Publication date: November 7, 2024Applicant: SAS Institute Inc.Inventors: Richa Chauhan, Harish Yadav, Hemil Shah, Kanchan Kamat, Arnulfo D. de Castro, Tae Yoon Lee
-
Publication number: 20240347064Abstract: A system, method, and computer-program product includes receiving speech audio of a multi-turn conversation, generating, via a speech-to-text process, a transcript of the speech audio, wherein the transcript of the speech audio textually segments speech spoken during the multi-turn conversation into a plurality of utterances, generating a speaker diarization prompt that includes contextual information about a plurality of speakers participating in the multi-turn conversation, inputting, to a large language model, the speaker diarization prompt and the transcript of the speech audio, and obtaining, from the large language model, an output comprising an enhanced transcript of the speech audio, wherein the enhanced transcript of the speech audio textually segments the speech spoken during the multi-turn conversation into a plurality of refined utterances and associates a speaker identification value with each of the plurality of refined utterances.Type: ApplicationFiled: April 12, 2024Publication date: October 17, 2024Applicant: SAS Institute Inc.Inventors: Xiaolong Li, Xiaozhuo Cheng, Xu Yang
-
Publication number: 20240346382Abstract: A method, system, and computer-program product includes identifying a set of heterogeneous sensors, configuring a plurality of model training compositions for each of the set of heterogeneous sensors, computing, for each of the plurality of model training compositions, a first efficacy metric value based on predictive outputs of the at least two machine learning models, identifying, for each sensor of the set of heterogeneous sensors, a champion model training composition of the subject sensor, the champion model training composition having a highest efficacy metric value, and electing, from a plurality of champion model training compositions corresponding to the champion model training compositions identified for each sensor of the set of heterogeneous sensors, an overall champion model training composition corresponding to a champion sensor of the set of heterogeneous sensors based on an assessment of second efficacy metric values of the plurality of champion model training compositions.Type: ApplicationFiled: April 17, 2024Publication date: October 17, 2024Applicant: SAS Institute Inc.Inventor: John Wesley Gottula
-
Publication number: 20240346324Abstract: A method, system, and computer-program product includes identifying a set of heterogeneous sensors, configuring a plurality of model training compositions for each of the set of heterogeneous sensors, computing, for each of the plurality of model training compositions, a first efficacy metric value based on predictive outputs of the at least two machine learning models, identifying, for each sensor of the set of heterogeneous sensors, a champion model training composition of the subject sensor, the champion model training composition having a highest efficacy metric value, and electing, from a plurality of champion model training compositions corresponding to the champion model training compositions identified for each sensor of the set of heterogeneous sensors, an overall champion model training composition corresponding to a champion sensor of the set of heterogeneous sensors based on an assessment of second efficacy metric values of the plurality of champion model training compositions.Type: ApplicationFiled: April 17, 2024Publication date: October 17, 2024Applicant: SAS Institute Inc.Inventor: John Wesley Gottula
-
Patent number: 12093826Abstract: A computing device trains a fair prediction model while defining an optimal event cutoff value. (A) A prediction model is trained with observation vectors. (B) The prediction model is executed to define a predicted target variable value and a probability associated with an accuracy of the predicted target variable value. (C) A conditional moments matrix is computed based on fairness constraints, the predicted target variable value, and the sensitive attribute variable value of each observation vector. The predicted target variable value has a predefined target event value only when the probability is greater than a predefined event cutoff value. (D) (A) through (C) are repeated. (E) An updated value is computed for the predefined event cutoff value. (F) (A) through (E) are repeated. An optimal event cutoff value is defined from the predefined event cutoff values used when repeating (A) through (E). The optimal value and prediction model are output.Type: GrantFiled: February 19, 2024Date of Patent: September 17, 2024Assignee: SAS Institute Inc.Inventors: Xinmin Wu, Ricky Dee Tharrington, Jr., Ralph Walter Abbey, Xin Jiang Hunt
-
Patent number: 12056207Abstract: A computing device learns a best topological order vector of a plurality of variables. A target variable and zero or more input variables are defined. (A) A machine learning model is trained with observation vectors using the target variable and the zero or more input variables. (B) The machine learning model is executed to compute an equation loss value. (C) The equation loss value is stored with the identifier. (D) The identifier is incremented. (E) (A) through (D) are repeated a plurality of times. (F) A topological order vector is defined. (G) A loss value is computed from a subset of the stored equation loss values based on the topological order vector. (F) through (G) are repeated for each unique permutation of the topological order vector. A best topological order vector is determined based on a comparison between the loss value computed for each topological order vector in (G).Type: GrantFiled: December 13, 2023Date of Patent: August 6, 2024Assignee: SAS Institute Inc.Inventors: Xilong Chen, Tao Huang, Jan Chvosta
-
Patent number: 12051087Abstract: The computing device receives data for a plurality of events that includes a timestamp associated with a digital traffic campaign in an event processing system. Based on the timestamp of the data for each event, the computing device executes operations comprising: applying filtering using digital signal processing to the event count for the combined data for each of the one or more intervals, executing a model to compute one or more backward difference approximations for the one or more candidate systems time constants from the evaluated exponential curve, and selecting a system time constant that predicts a first time instant wherein the data for the plurality of events approaches a point on a horizontal asymptote for the evaluated exponential curve. The computing device determines an epoch for the selected system time constant and outputs the determined epoch for the selected system time constant in the graphical user interface.Type: GrantFiled: November 20, 2023Date of Patent: July 30, 2024Assignee: SAS Institute Inc.Inventors: Craig Geoffrey Statham, Sauryha Lynne Gay
-
Publication number: 20240193917Abstract: A system, method, and computer-program product includes detecting, via a localization machine learning model, a target object within target image data of a scene, classifying, via an object classification machine learning model, the target object to a probable object class of a plurality of distinct object classes, routing, via the one or more processors, the target image data of the scene to a target object-condition machine learning classification model of a plurality of distinct object-condition machine learning classification models based on a mapping between the plurality of distinct object classes and the plurality of distinct object-condition machine learning classification models, classifying, via the target object-condition machine learning classification model, the target object to a probable object-condition class of a plurality of distinct object-condition classes, and displaying, via a graphical user interface, a representation of the target object in association with the probable object-conditioType: ApplicationFiled: December 4, 2023Publication date: June 13, 2024Applicant: SAS Institute Inc.Inventors: Robert Winston Blanchard, Neela Niranjani Vengateshwaran
-
Publication number: 20240169390Abstract: The computing device receives data for a plurality of events that includes a timestamp associated with a digital traffic campaign in an event processing system. Based on the timestamp of the data for each event, the computing device executes operations comprising: applying filtering using digital signal processing to the event count for the combined data for each of the one or more intervals, executing a model to compute one or more backward difference approximations for the one or more candidate systems time constants from the evaluated exponential curve, and selecting a system time constant that predicts a first time instant wherein the data for the plurality of events approaches a point on a horizontal asymptote for the evaluated exponential curve. The computing device determines an epoch for the selected system time constant and outputs the determined epoch for the selected system time constant in the graphical user interface.Type: ApplicationFiled: November 20, 2023Publication date: May 23, 2024Applicant: SAS Institute Inc.Inventors: Craig Geoffrey Statham, Sauryha Lynne Gay
-
Publication number: 20240126732Abstract: One example described herein involves a system that can receive a set of data records and execute an automated entity resolution (AER) process configured to assign the set of data records to a set of entities. For each entity in the set of entities, the system can generate a respective consistency score for the entity, generate a respective confidence score for the entity based on the respective consistency score for the entity, and determine a respective visual indicator based on the respective confidence score for the entity. The respective visual indicator can indicate a risk of record misassignment to a user. The system can then generate a graphical user interface that includes the respective visual indicator for each of the entities.Type: ApplicationFiled: April 13, 2023Publication date: April 18, 2024Applicant: SAS Institute Inc.Inventor: Nicholas Ablitt
-
Publication number: 20240126731Abstract: One example described herein involves a system that can receive a set of data records and execute an automated entity resolution (AER) process configured to assign the set of data records to a set of entities. For each entity in the set of entities, the system can generate a respective consistency score for the entity, generate a respective confidence score for the entity based on the respective consistency score for the entity, and determine a respective visual indicator based on the respective confidence score for the entity. The respective visual indicator can indicate a risk of record misassignment to a user. The system can then generate a graphical user interface that includes the respective visual indicator for each of the entities.Type: ApplicationFiled: June 22, 2023Publication date: April 18, 2024Applicant: SAS Institute Inc.Inventor: Nicholas Akbar Ablitt
-
Patent number: 11922311Abstract: A computing device trains a fair prediction model. A prediction model is trained and executed with observation vectors. A weight value is computed for each observation vector based on whether the predicted target variable value of a respective observation vector of the plurality of observation vectors has a predefined target event value. An observation vector is relabeled based on the computed weight value. The prediction model is retrained with each observation vector weighted by a respective computed weight value and with the target variable value of any observation vector that was relabeled. The retrained prediction model is executed. A conditional moments matrix is computed. A constraint violation matrix is computed. Computing the weight value through computing the constraint violation matrix is repeated until a stop criterion indicates retraining of the prediction model is complete. The retrained prediction model is output.Type: GrantFiled: June 12, 2023Date of Patent: March 5, 2024Assignee: SAS Institute Inc.Inventors: Xinmin Wu, Ricky Dee Tharrington, Jr., Ralph Walter Abbey
-
Patent number: 11914548Abstract: A computing device determines a node traversal order for computing a computational parameter value for each node of a data model of a system that includes a plurality of disconnected graphs. The data model represents a flow of a computational parameter value through the nodes from a source module to an end module. A flow list defines an order for selecting and iteratively processing each node to compute the computational parameter value in a single iteration through the flow list. Each node from the flow list is selected to compute a driver quantity for each node. Each node is selected from the flow list in a reverse order to compute a driver rate and the computational parameter value for each node. The driver quantity or the computational parameter value is output for each node to predict a performance of the system.Type: GrantFiled: June 8, 2023Date of Patent: February 27, 2024Assignee: SAS Institute Inc.Inventor: Shyam Kashinath Khatkale
-
Patent number: 11887012Abstract: A computing device identifies an anomaly among a plurality of observation vectors. An observation vector is projected using a predefined orthogonal complement matrix. The predefined orthogonal complement matrix is determined from a decomposition of a low-rank matrix. The low-rank matrix is computed using a robust principal component analysis algorithm. The projected observation vector is multiplied by a predefined demixing matrix to define a demixed observation vector. The predefined demixing matrix is computed using an independent component analysis algorithm and the predefined orthogonal complement matrix. A detection statistic value is computed from the defined, demixed observation vector. When the computed detection statistic value is greater than or equal to a predefined anomaly threshold value, an indicator is output that the observation vector is an anomaly.Type: GrantFiled: July 19, 2023Date of Patent: January 30, 2024Assignee: SAS Institute Inc.Inventors: Sudipta Kolay, Steven Guanxing Xu, Kai Shen, Zohreh Asgharzadeh Talebi
-
Patent number: 11886329Abstract: A computing device selects new test configurations for testing software. (A) First test configurations are generated using a random seed value. (B) Software under test is executed with the first test configurations to generate a test result for each. (C) Second test configurations are generated from the first test configurations and the test results generated for each. (D) The software under test is executed with the second test configurations to generate the test result for each. (E) When a restart is triggered based on a distance metric value computed between the second test configurations, a next random seed value is selected as the random seed value and (A) through (E) are repeated. (F) When the restart is not triggered, (C) through (F) are repeated until a stop criterion is satisfied. (G) When the stop criterion is satisfied, the test result is output for each test configuration.Type: GrantFiled: June 15, 2022Date of Patent: January 30, 2024Assignee: SAS Institute Inc.Inventors: Steven Joseph Gardner, Connie Stout Dunbar, David Bruce Elsheimer, Gregory Scott Dunbar, Joshua David Griffin, Yan Gao
-
Patent number: 11875189Abstract: An apparatus includes at least one node device to host a computing cluster, and at least one processor to generate a UI providing guidance through a set of configuration settings for the computing cluster, wherein, for each configuration setting that is received as an input during configuration, the at least one processor is caused to: perform a check of the set of configuration settings to determine whether the received configuration setting creates a conflict among the set of configuration settings; and in response to a determination that the received configuration setting creates a conflict among the set of configuration settings, perform operations including generate an indication of the conflict for presentation by the UI, and receive a change to a configuration setting as an input from the input device.Type: GrantFiled: March 17, 2023Date of Patent: January 16, 2024Assignee: SAS Institute Inc.Inventors: Richard K. Wellum, Joseph Daniel Henry, Holden Ernest O'Neal, John W. Waller
-
Patent number: 11862171Abstract: An apparatus includes a processor to: receive, from a requesting device, a request to perform speech-to-text conversion of a speech data set; within a first thread of a thread pool, perform a first pause detection technique to identify a first set of likely sentence pauses; within a second thread of the thread pool, perform a second pause detection technique to identify a second set of likely sentence pauses; perform a speaker diarization technique to identify a set of likely speaker changes; divide the speech data set into data segments representing speech segments based on a combination of at least the first set of likely sentence pauses, the second set of likely sentence pauses, and the set of likely speaker changes; use at least an acoustic model with each data segment to identify likely speech sounds; and generate a transcript based, at least in part, on the identified likely speech sounds.Type: GrantFiled: November 23, 2022Date of Patent: January 2, 2024Assignee: SAS Institute Inc.Inventors: Xiaolong Li, Xiaozhuo Cheng, Samuel Norris Henderson, Xu Yang
-
Patent number: 11842379Abstract: The computing device obtains a training data set related to a plurality of historic user inputs associated with preferences of one or more services or items from an entity. For each of the one or more services or items, the computing device executes operations to train a plurality of models using the training data set to generate a plurality of recommended models, apply a validation data set to generate a plurality of predictions from the plurality of recommended models, obtain a weight of each metric of a plurality of metrics from the entity, obtain user inputs associated with user preferences, and determine a relevancy score for each metric. The computing device selects a recommended model based on the relevancy score of the selected metric or a combination of selected metrics, generates one or more recommendations for the users, and outputs the one or more generated recommendations to the users.Type: GrantFiled: February 15, 2023Date of Patent: December 12, 2023Assignee: SAS Institute Inc.Inventors: Jonathan Lee Walker, Hardi Desai, Xuejun Liao, Varunraj Valsaraj
-
Publication number: 20230394109Abstract: Anomalies in a target object can be detected and diagnosed using improved Mahalanobis-Taguchi system (MTS) techniques. For example, an anomaly detection and diagnosis (ADD) system can receive a set of measurements associated with attributes of a target object. A Mahalanobis distance (MD) can be determined using a generalized inverse matrix. An abnormal condition can be detected when the MD is greater than a predetermined threshold value. The ADD system can determine an importance score for each measurement of a corresponding attribute. The attribute whose measurement has the highest importance score can be determined to be responsible for the abnormal condition.Type: ApplicationFiled: May 17, 2023Publication date: December 7, 2023Applicant: SAS Institute Inc.Inventors: Kevin L. SCOTT, Deovrat Vijay Kakde, Arin Chaudhuri, Sergiy Peredriy