Patents Assigned to SAS Institute Inc.
  • Patent number: 12293213
    Abstract: A system and method include creating a project package for an Event Stream Processing (ESP) project, generating a first manifest file from the project package, creating a first container pod on a cluster based on the first manifest file, executing a container file generator software and a build kit software on the first container pod, executing an ESP server on the container file generator software, executing the ESP project on the ESP server such that data is not streaming to the ESP server, identifying a list of required software components needed to execute the ESP project, creating a container file having a subset of software components based on the list of required software components, generating a ESP project container image for the ESP server based on the container file, and deploying the ESP project using the ESP project container image to analyze data streamed to the ESP project.
    Type: Grant
    Filed: December 20, 2024
    Date of Patent: May 6, 2025
    Assignee: SAS Institute Inc.
    Inventors: Frédéric Combaneyre, Joydeep Bhattacharya
  • Publication number: 20250139088
    Abstract: A computer-implemented system, computer-implemented method, and computer-program product includes receiving a natural language query from a user for executing an analytical task; generating an analytical large language model (LLM) prompt based on the natural language query and, in response to generating the analytical LLM prompt, orchestrating an LLM-directed workflow for handling the natural language query by: automatically prompting, using the analytical LLM prompt, an analytical task-oriented LLM to generate a structured query for querying a data catalog application; querying the data catalog application using the structured query generated by the analytical task-oriented LLM; obtaining query results from the data catalog application, where the query results include metadata associated with at least one element accessible to the data catalog application; prompting the analytical task-oriented LLM to identify a given analytical task associated with a given analytical agent; and automatically executing, by t
    Type: Application
    Filed: October 2, 2024
    Publication date: May 1, 2025
    Applicant: SAS Institute Inc.
    Inventor: David Hermann Peter Weik
  • Patent number: 12287783
    Abstract: A system and method include breaking symmetry in a query graph by converting the query graph into a transformed query graph by generating a symmetry breaking expression that includes detecting one or more orbits in the transformed query graph, selecting an orbit from the one or more orbits having more than one node, generating an automorphism breaking sub-expression for the selected orbit, assigning a node of the selected orbit a unique node attribute, recalculating the one or more orbits in the transformed query graph, repeating the process until each node is in its own orbit, and combining each of the automorphism breaking sub-expressions to obtain the symmetry breaking expression. Using the symmetry breaking expression, the system and method include finding one or more subgraphs of a main graph that match the symmetry breaking expression of the query graph.
    Type: Grant
    Filed: August 19, 2024
    Date of Patent: April 29, 2025
    Assignee: SAS Institute Inc.
    Inventors: Brandon Michael Reese, Steven Harenberg
  • Publication number: 20250117632
    Abstract: A system, method, and computer-program product includes obtaining a decisioning dataset comprising a plurality of favorable decisioning records and at least one unfavorable decisioning record; detecting, via a machine learning algorithm, a favorable decisioning record of the plurality of favorable decisioning records that has a vector value closest to a vector value of the unfavorable decisioning record; executing a counterfactual assessment between the favorable decisioning record and the unfavorable decisioning record; generating an explainability artifact based on one or more bias intensity metrics to explain a bias in a machine learning-based decisioning model; and in response to generating the explainability artifact, displaying the explainability artifact in a user interface.
    Type: Application
    Filed: July 5, 2024
    Publication date: April 10, 2025
    Applicant: SAS Institute Inc.
    Inventors: Luiz Henrique Outi Kauffmann, Aline Riquetti Campos Emídio
  • Publication number: 20250117664
    Abstract: A system, method, and computer-program product includes obtaining a decisioning dataset comprising a plurality of favorable decisioning records and at least one unfavorable decisioning record; detecting, via a machine learning algorithm, a favorable decisioning record of the plurality of favorable decisioning records that has a vector value closest to a vector value of the unfavorable decisioning record; executing a counterfactual assessment between the favorable decisioning record and the unfavorable decisioning record; generating an explainability artifact based on one or more bias intensity metrics to explain a bias in a machine learning-based decisioning model; and in response to generating the explainability artifact, displaying the explainability artifact in a user interface.
    Type: Application
    Filed: July 5, 2024
    Publication date: April 10, 2025
    Applicant: SAS Institute Inc.
    Inventors: Luiz Henrique Outi Kauffmann, Aline Riquetti Campos Emídio
  • Publication number: 20250117192
    Abstract: In one example, a computer system can generate a graphical user interface (GUI) for forecasting software including a drag-and-drop canvas with a set of rearrangeable nodes defining a forecasting pipeline. The computer system can detect a user interaction for attaching an external-language execution node to the pipeline, which can be used to insert custom code defined using an external programming language. The computer system can receive the custom code. The computer system can receive a user input to initiate execution of the pipeline. The computer system can generate wrapped custom code by augmenting the custom code with additional program code including shared variables. The computer system can provide the wrapped custom code to a set of execution threads configured to execute the wrapped custom code as part of the pipeline to generate one or more forecasts. The computer system can output the forecasts in the GUI.
    Type: Application
    Filed: July 2, 2024
    Publication date: April 10, 2025
    Applicant: SAS Institute Inc.
    Inventors: Iman Vasheghani Farahani, Mahesh V. Joshi, Phillip M. Helmkamp, Rajib Nath, Vilochan Suresh Muley, Javier Delgado, Michele Angelo Trovero
  • Publication number: 20250103578
    Abstract: In one example, a system can receive information about a tabular data structure in a memory including a set of data and a first memory allocation. The system can determine a type of the tabular data structure, the type selected from among two types including a native type and a non-native type. The system can, in response to the type being the native type, identify a first proxy data table usable as a proxy for the tabular data structure that shares the first memory allocation. The system can receive a first indication to access the set of data from application code. The system can issue one or more first read commands to the first proxy data table to cause the set of data to be read from the tabular data structure.
    Type: Application
    Filed: October 10, 2024
    Publication date: March 27, 2025
    Applicant: SAS Institute Inc.
    Inventors: Yongqiao Xiao, Mary Elizabeth Carter, Arash Dehghan Banadaki, Avery Winston Acierno, Patrick Nathan Koch
  • Publication number: 20250103579
    Abstract: In one example, a system can receive, from application code including an analysis operation performed on a set of data, an indication to access the set of data included in a tabular data structure using an application programming interface (API), in which the tabular data structure is associated with a memory allocation and a type. The system can determine that the type of the tabular data structure is the native type, the native type characterizing data structures that are accessed using a first programming language and a second programming language. The system can identify a proxy data table that shares the memory allocation, the proxy data table accessed using the API based on the second programming language. The system can issue one or more read commands to the proxy data table to cause the set of data to be read from the tabular data structure.
    Type: Application
    Filed: October 10, 2024
    Publication date: March 27, 2025
    Applicant: SAS Institute Inc.
    Inventors: Yongqiao Xiao, Mary Elizabeth Carter, Arash Dehghan Banadaki, Avery Winston Acierno, Patrick Nathan Koch
  • Publication number: 20250068658
    Abstract: Embodiments described herein relate to the efficient generation of synthetic datasets that represent many-to-many relationships. In particular, certain embodiments implement a particular factorization for many-to-many generative models, which leads to a scalable generation framework by combining random graph theory and representation learning. Further embodiments we extend the framework to establish the notion of differential privacy within the synthetically generated data. The embodiments described herein are therefore able to generate synthetic datasets efficiently while preserving information within and across many-to-many datasets with improved accuracy.
    Type: Application
    Filed: November 8, 2024
    Publication date: February 27, 2025
    Applicant: SAS Institute Inc.
    Inventors: Kai Xu, Georgi Valentinov Ganev, Emile Isak Joubert, Rees Stephen Davison, Olivier Rene Maurice Van Acker, Luke Anthony William Robinson, Sofiane Mahiou
  • Publication number: 20250068358
    Abstract: A system, method, and computer-program product includes implementing a cross-process queue within a single computer that is configured to transfer a data block between an operating system process executing a write operation and an operating system process executing a read operation, initializing in-memory cell indices within the cross-process queue that include a write operation index tracking index values of one or more cells within the cross-process queue that are available to write and a read operation index tracking index values of one or more cells within the cross-process queue that are available to read, and implementing a cell synchronization data structure tracking states of a plurality of cells of the index of cells of the cross-process queue.
    Type: Application
    Filed: June 7, 2024
    Publication date: February 27, 2025
    Applicant: SAS Institute Inc.
    Inventors: Lawrence Edmund Lewis, Mohammadreza Nazari, Amirhassan Fallah Dizche
  • Publication number: 20250068490
    Abstract: A system, method, and computer-program product includes implementing a cross-process queue within a single computer that is configured to transfer a data block between an operating system process executing a write operation and an operating system process executing a read operation, initializing in-memory cell indices within the cross-process queue that include a write operation index tracking index values of one or more cells within the cross-process queue that are available to write and a read operation index tracking index values of one or more cells within the cross-process queue that are available to read, and implementing a cell synchronization data structure tracking states of a plurality of cells of the index of cells of the cross-process queue.
    Type: Application
    Filed: June 7, 2024
    Publication date: February 27, 2025
    Applicant: SAS Institute Inc.
    Inventors: Lawrence Edmund Lewis, Mohammadreza Nazari, Amirhassan Fallah Dizche
  • Publication number: 20250068357
    Abstract: A system, method, and computer-program product includes implementing a cross-process queue within a single computer that is configured to transfer a data block between an operating system process executing a write operation and an operating system process executing a read operation, initializing in-memory cell indices within the cross-process queue that include a write operation index tracking index values of one or more cells within the cross-process queue that are available to write and a read operation index tracking index values of one or more cells within the cross-process queue that are available to read, and implementing a cell synchronization data structure tracking states of a plurality of cells of the index of cells of the cross-process queue.
    Type: Application
    Filed: June 7, 2024
    Publication date: February 27, 2025
    Applicant: SAS Institute Inc.
    Inventors: Lawrence Edmund Lewis, Mohammadreza Nazari, Amirhassan Fallah Dizche
  • Publication number: 20250068927
    Abstract: A system, method, and computer-program product includes receiving an input comprising a plurality of pre-defined factor matrices and an implicit feedback dataset partitioned into a plurality of implicit feedback data subsets; distributing the input across a controller node and a plurality of worker nodes implemented in a distributed computing environment; and training a model using the controller node and the plurality of worker nodes, wherein training the model includes: initializing, by the controller node, a controller-specific user parameters matrix and a controller-specific item parameters matrix, broadcasting, by the controller node, the controller-specific user parameters matrix and the controller-specific item parameters matrix to each worker node of the plurality of worker nodes, and concurrently executing an aggregation model training algorithm at the controller node and a plurality of localized model training algorithms across the plurality of worker nodes until a training termination condition is
    Type: Application
    Filed: February 21, 2024
    Publication date: February 27, 2025
    Applicant: SAS Institute Inc.
    Inventors: Xuejun Liao, Patrick Nathan Koch
  • Publication number: 20250053615
    Abstract: A computing device learns a directed acyclic graph (DAG). (A) A target variable is defined from variables based on a topological order vector and a first index. (B) Input variables are defined from the variables based on the topological order vector and a second index. (C) A machine learning model is trained with observation vectors using the target variable and the input variables. (D) The machine learning model is executed to compute a loss value. (E) The second index is incremented. (F) (B) through (E) are repeated a first plurality of times. (G) The first index is incremented. (H) (A) through (G) are repeated a second plurality of times. A parent set is determined for each variable based on a comparison between the loss value computed each repetition of (D). The parent set is output for each variable to describe the DAG that defines a hierarchical relationship between the variables.
    Type: Application
    Filed: October 3, 2024
    Publication date: February 13, 2025
    Applicant: SAS Institute Inc.
    Inventors: Xilong Chen, Tao Huang, Jan Chvosta
  • Patent number: 12210954
    Abstract: A point estimate value for an individual is computed using a Bayesian neural network model (BNN) by training a first BNN model that computes a weight mean value, a weight standard deviation value, a bias mean value, and a bias standard deviation value for each neuron of a plurality of neurons using observations. A plurality of BNN models is instantiated using the first BNN model. Instantiating each BNN model of the plurality of BNN models includes computing, for each neuron, a weight value using the weight mean value, the weight standard deviation value, and a weight random draw and a bias value using the bias mean value, the bias standard deviation value, and a bias random draw. Each instantiated BNN model is executed with the observations to compute a statistical parameter value for each observation vector of the observations. The point estimate value is computed from the statistical parameter value.
    Type: Grant
    Filed: December 6, 2023
    Date of Patent: January 28, 2025
    Assignee: SAS Institute Inc.
    Inventors: Sylvie Tchumtchoua Kabisa, Xilong Chen, Gunce Eryuruk Walton, David Bruce Elsheimer, Ming-Chun Chang
  • Patent number: 12197481
    Abstract: A graphical user interface (GUI) and pipeline for processing text documents is provided herein. In one example, a system can receive unstructured text documents. The system can determine entity-issue descriptions corresponding to the unstructured text documents. The system can then generate a GUI indicating the entity-issue descriptions. The GUI can also indicate assignments of the unstructured text documents to categories of a predefined schema. The GUI can allow the user to adjust the assignments of the unstructured text documents to the categories. The GUI can also include a table of rows, where each row corresponds to one of the unstructured text documents. Each row can indicate an entity-issue description in the corresponding unstructured text document and the categories assigned to the unstructured text document. Each row can also include a graphical button that is selectable to allow the user to view the unstructured text document corresponding to the row.
    Type: Grant
    Filed: June 7, 2024
    Date of Patent: January 14, 2025
    Assignee: SAS Institute Inc.
    Inventors: Murali Krishna Pagolu, Corey Kyle Kozak
  • Patent number: 12189716
    Abstract: A system and method include receiving a first set of variables associated with a real-time request, extracting a predetermined subset of the first set of variables for generating a second set of variables, identifying historical request data, computing a set of parameters based on the first set of variables and the historical request data, generating a plurality of numeric sequences and a plurality of string sequences for the real-time request, converting each of the plurality of string sequences into an encoded string sequence to obtain a plurality of encoded string sequences, inputting the plurality of numeric sequences and the plurality of encoded string sequences into a trained deep machine learning model, and computing a score from the trained deep machine learning model, the score indicative of a likelihood that the real-time request belongs to an unauthorized classification.
    Type: Grant
    Filed: May 23, 2024
    Date of Patent: January 7, 2025
    Assignee: SAS Institute Inc.
    Inventors: Yi Liao, Artin Armagan, Phoemphun Oothongsap, Brian Christopher Hare, Adheesha Sanjaya Arangala, Jin-Whan Jung
  • Publication number: 20240427811
    Abstract: A graphical user interface (GUI) and pipeline for processing text documents is provided herein. In one example, a system can receive unstructured text documents. The system can determine entity-issue descriptions corresponding to the unstructured text documents. The system can then generate a GUI indicating the entity-issue descriptions. The GUI can also indicate assignments of the unstructured text documents to categories of a predefined schema. The GUI can allow the user to adjust the assignments of the unstructured text documents to the categories. The GUI can also include a table of rows, where each row corresponds to one of the unstructured text documents. Each row can indicate an entity-issue description in the corresponding unstructured text document and the categories assigned to the unstructured text document. Each row can also include a graphical button that is selectable to allow the user to view the unstructured text document corresponding to the row.
    Type: Application
    Filed: June 7, 2024
    Publication date: December 26, 2024
    Applicant: SAS Institute Inc.
    Inventors: Murali Krishna Pagolu, Corey Kyle Kozak
  • Publication number: 20240427812
    Abstract: A graphical user interface (GUI) and pipeline for processing text documents is provided herein. In one example, a system can receive unstructured text documents. The system can determine entity-issue descriptions corresponding to the unstructured text documents. The system can then generate a GUI indicating the entity-issue descriptions. The GUI can also indicate assignments of the unstructured text documents to categories of a predefined schema. The GUI can allow the user to adjust the assignments of the unstructured text documents to the categories. The GUI can also include a table of rows, where each row corresponds to one of the unstructured text documents. Each row can indicate an entity-issue description in the corresponding unstructured text document and the categories assigned to the unstructured text document. Each row can also include a graphical button that is selectable to allow the user to view the unstructured text document corresponding to the row.
    Type: Application
    Filed: June 7, 2024
    Publication date: December 26, 2024
    Applicant: SAS Institute Inc.
    Inventors: Murali Krishna Pagolu, Corey Kyle Kozak
  • Patent number: 12175374
    Abstract: A computing system trains a classification model using distributed training data. A first worker index and a second worker index are received from a controller device and together uniquely identify a segment of a lower triangular matrix. The first and second worker indices have values from one to a predefined block size value. In response to receipt of a first computation request from the controller device, a first kernel matrix block is computed at each computing device based on the first worker index and the second worker index. In response to receipt of a second computation request from the controller device, an objective function value is computed for each observation vector included in an accessed training data subset. The computed objective function value is sent to the controller device. Model parameters for a trained classification model are output.
    Type: Grant
    Filed: April 15, 2024
    Date of Patent: December 24, 2024
    Assignee: SAS Institute Inc.
    Inventors: Yingjian Wang, Xinmin Wu