Patents by Inventor Jan Chvosta

Jan Chvosta has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

LEARNING A DIRECTED ACYCLIC GRAPH USING A MACHINE LEARNING MODEL LOSS

Publication number: 20250053615

Abstract: A computing device learns a directed acyclic graph (DAG). (A) A target variable is defined from variables based on a topological order vector and a first index. (B) Input variables are defined from the variables based on the topological order vector and a second index. (C) A machine learning model is trained with observation vectors using the target variable and the input variables. (D) The machine learning model is executed to compute a loss value. (E) The second index is incremented. (F) (B) through (E) are repeated a first plurality of times. (G) The first index is incremented. (H) (A) through (G) are repeated a second plurality of times. A parent set is determined for each variable based on a comparison between the loss value computed each repetition of (D). The parent set is output for each variable to describe the DAG that defines a hierarchical relationship between the variables.

Type: Application

Filed: October 3, 2024

Publication date: February 13, 2025

Applicant: SAS Institute Inc.

Inventors: Xilong Chen, Tao Huang, Jan Chvosta
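
The nested loop in the abstract above (publication 20250053615) can be sketched in Python under assumptions the abstract does not confirm: the machine learning model is ordinary least squares, the loss is training mean squared error, the second index picks one predecessor to leave out of the inputs, and a variable counts as a parent when leaving it out raises the loss by more than a hypothetical threshold `tol`. The names `ols_mse` and `learn_parents` are illustrative only.

```python
import random


def ols_mse(y, columns):
    """Training MSE of a least-squares fit of y on `columns` plus an intercept."""
    n = len(y)
    cols = [[1.0] * n] + columns
    k = len(cols)
    # Build the normal equations A b = c.
    A = [[sum(u * v for u, v in zip(ci, cj)) for cj in cols] for ci in cols]
    c = [sum(u * v for u, v in zip(ci, y)) for ci in cols]
    # Solve by Gaussian elimination with partial pivoting.
    for p in range(k):
        piv = max(range(p, k), key=lambda r: abs(A[r][p]))
        A[p], A[piv] = A[piv], A[p]
        c[p], c[piv] = c[piv], c[p]
        for r in range(p + 1, k):
            f = A[r][p] / A[p][p]
            for q in range(p, k):
                A[r][q] -= f * A[p][q]
            c[r] -= f * c[p]
    b = [0.0] * k
    for p in reversed(range(k)):
        b[p] = (c[p] - sum(A[p][q] * b[q] for q in range(p + 1, k))) / A[p][p]
    fitted = [sum(b[j] * cols[j][r] for j in range(k)) for r in range(n)]
    return sum((yi - fi) ** 2 for yi, fi in zip(y, fitted)) / n


def learn_parents(data, order, tol=0.1):
    """data maps variable name to values; order lists parents before children."""
    parents = {}
    for target in order:                      # outer loop: first index
        preds = order[:order.index(target)]
        full = ols_mse(data[target], [data[v] for v in preds])
        parents[target] = []
        for cand in preds:                    # inner loop: second index
            # Assumption: the inputs are all predecessors except `cand`.
            reduced = ols_mse(data[target], [data[v] for v in preds if v != cand])
            if reduced - full > tol:          # loss comparison decides parenthood
                parents[target].append(cand)
    return parents


# Synthetic chain a -> b -> c, used only to exercise the sketch.
rng = random.Random(0)
a = [rng.gauss(0, 1) for _ in range(500)]
b = [2.0 * x + rng.gauss(0, 0.5) for x in a]
c = [-1.5 * x + rng.gauss(0, 0.5) for x in b]
parents = learn_parents({"a": a, "b": b, "c": c}, ["a", "b", "c"])
print(parents)
```

On this synthetic chain, `a` is not reported as a parent of `c`, because dropping it from the inputs barely changes the loss once `b` is present.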

LEARNING A DIRECTED ACYCLIC GRAPH USING A MACHINE LEARNING MODEL LOSS

Publication number: 20250045355

Abstract: A computing device learns a directed acyclic graph (DAG). (A) A target variable is defined from variables based on a topological order vector and a first index. (B) Input variables are defined from the variables based on the topological order vector and a second index. (C) A machine learning model is trained with observation vectors using the target variable and the input variables. (D) The machine learning model is executed to compute a loss value. (E) The second index is incremented. (F) (B) through (E) are repeated a first plurality of times. (G) The first index is incremented. (H) (A) through (G) are repeated a second plurality of times. A parent set is determined for each variable based on a comparison between the loss value computed each repetition of (D). The parent set is output for each variable to describe the DAG that defines a hierarchical relationship between the variables.

Type: Application

Filed: June 24, 2024

Publication date: February 6, 2025

Inventors: Xilong Chen, Tao Huang, Jan Chvosta

LEARNING A DIRECTED ACYCLIC GRAPH USING A TRAINED MACHINE LEARNING MODEL

Publication number: 20250045611

Abstract: A computing device learns a directed acyclic graph for a plurality of variables. (A) A target variable and zero or more input variables are defined based on a predefined topological order vector and a first index. (B) A machine learning model is trained with observation vectors using the target variable and the input variables. (C) The machine learning model is executed using the observation vectors with the target variable and the input variables to compute a residual vector. (D) The first index is incremented. (E) (A) through (D) are repeated a first plurality of times. A parent set is determined for each variable by comparing the residual vector computed each repetition of (C) to other residual vectors computed on other repetitions of (C). The parent set is output for each variable to describe a directed acyclic graph that defines a hierarchical relationship between the variables.

Type: Application

Filed: June 24, 2024

Publication date: February 6, 2025

Inventors: Xilong Chen, Tao Huang, Jan Chvosta

TOPOLOGICAL ORDER DETERMINATION USING MACHINE LEARNING

Publication number: 20250045263

Abstract: A computing device learns a best topological order vector for a plurality of variables. (A) A topological order vector is defined. (B) A target variable and zero or more input variables are defined based on the topological order vector. (C) A machine learning model is trained with observation vectors using values of the target variable and the zero or more input variables. (D) The machine learning model is executed with second observation vectors using the values of the target variable and the zero or more input variables to compute a loss value. (E) (A) through (D) are repeated a plurality of times. Each topological order vector defined in (A) is unique in comparison to other topological order vectors defined in (A). The best topological order vector is determined based on a comparison between the loss values computed for each topological order vector in (D).

Type: Application

Filed: December 13, 2023

Publication date: February 6, 2025

Inventors: Xilong Chen, Tao Huang, Jan Chvosta

Topological order determination using machine learning

Patent number: 12056207

Abstract: A computing device learns a best topological order vector of a plurality of variables. A target variable and zero or more input variables are defined. (A) A machine learning model is trained with observation vectors using the target variable and the zero or more input variables. (B) The machine learning model is executed to compute an equation loss value. (C) The equation loss value is stored with the identifier. (D) The identifier is incremented. (E) (A) through (D) are repeated a plurality of times. (F) A topological order vector is defined. (G) A loss value is computed from a subset of the stored equation loss values based on the topological order vector. (F) through (G) are repeated for each unique permutation of the topological order vector. A best topological order vector is determined based on a comparison between the loss value computed for each topological order vector in (G).

Type: Grant

Filed: December 13, 2023

Date of Patent: August 6, 2024

Assignee: SAS Institute Inc.

Inventors: Xilong Chen, Tao Huang, Jan Chvosta
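
The store-then-score idea in the abstract above (patent 12056207) can be illustrated with a small Python sketch. It rests on assumptions the abstract does not state: each "equation" is a linear regression of one target on a set of inputs, its loss is the training residual variance, the stored identifier is the hypothetical key `(target, frozenset(inputs))`, and the loss of an order is the sum of the stored losses of its equations.

```python
import itertools
import math
import random


def residual_variance(data, target, inputs):
    """Training MSE of a linear fit of `target` on `inputs` (with intercept),
    read off the Cholesky factor of the centered cross-product matrix."""
    names = list(inputs) + [target]
    n = len(data[target])
    centered = []
    for v in names:
        mean = sum(data[v]) / n
        centered.append([x - mean for x in data[v]])
    k = len(names)
    S = [[sum(p * q for p, q in zip(centered[i], centered[j])) for j in range(k)]
         for i in range(k)]
    L = [[0.0] * k for _ in range(k)]
    for i in range(k):
        for j in range(i + 1):
            s = S[i][j] - sum(L[i][t] * L[j][t] for t in range(j))
            L[i][j] = math.sqrt(s) if i == j else s / L[j][j]
    # The last squared diagonal entry is the residual sum of squares of `target`.
    return L[k - 1][k - 1] ** 2 / n


def best_order(data):
    variables = sorted(data)
    # Train every equation once and store its loss under an identifier.
    stored = {}
    for target in variables:
        others = [v for v in variables if v != target]
        for r in range(len(others) + 1):
            for inputs in itertools.combinations(others, r):
                stored[(target, frozenset(inputs))] = residual_variance(data, target, inputs)

    # Score each permutation from stored losses only; no retraining is needed.
    def order_loss(order):
        return sum(stored[(v, frozenset(order[:i]))] for i, v in enumerate(order))

    return min(itertools.permutations(variables), key=order_loss), stored


# Synthetic chain a -> b -> c with equal noise variances (an assumption that
# makes the summed loss informative about the order).
rng = random.Random(1)
a = [rng.gauss(0, 1) for _ in range(400)]
b = [2.0 * x + rng.gauss(0, 1) for x in a]
c = [-1.5 * x + rng.gauss(0, 1) for x in b]
order, stored = best_order({"a": a, "b": b, "c": c})
print(order, len(stored))
```

Caching the equation losses means three variables need 12 fits rather than one fit per position in each of the 6 permutations. The number of subsets still grows exponentially with the variable count, so this exhaustive form only suits small problems.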

Computer system for automatically analyzing a video of a physical activity using a model and providing corresponding feedback

Patent number: 11769350

Abstract: A computer system can automatically analyze a video of a physical activity and provide corresponding feedback. For example, the system can receive a video file including image frames showing an entity performing a physical activity that involves a sequence of movement phases. The system can generate coordinate sets by performing image analysis on the image frames. The system can provide the coordinate sets as input to a trained model, the trained model being configured to assign scores and movement phases to the image frames based on the coordinate sets. The system can then select a particular movement phase for which to provide feedback, based on the scores and movement phases assigned to the image frames. The system can generate the feedback for the entity about their performance of the particular movement phase, which may improve the entity's future performance of that particular movement phase.

Type: Grant

Filed: October 20, 2022

Date of Patent: September 26, 2023

Assignee: SAS Institute Inc.

Inventors: Ji Shen, Jared Langford Dean, Xilong Chen, Jan Chvosta

Directed acyclic graph machine learning system

Patent number: 11443198

Abstract: A computing device learns a directed acyclic graph (DAG). An SSCP matrix is computed from variable values defined for observation vectors. A topological order vector is initialized that defines a topological order for the variables. A loss value is computed using the topological order vector and the SSCP matrix. (A) A neighbor determination method is selected. (B) A next topological order vector is determined relative to the initialized topological order vector using the neighbor determination method. (C) A loss value is computed using the next topological order vector and the SSCP matrix. (D) (B) and (C) are repeated until each topological order vector is determined in (B) based on the neighbor determination method. A best topological vector is determined from each next topological order vector based on having a minimum value for the computed loss value. An adjacency matrix is computed using the best topological vector and the SSCP matrix.

Type: Grant

Filed: November 9, 2021

Date of Patent: September 13, 2022

Assignee: SAS Institute Inc.

Inventors: Xilong Chen, Tao Huang, Jan Chvosta
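
As a rough illustration of the search described in the abstract above (patent 11443198), the following Python sketch computes a sum of squares and cross products (SSCP) matrix once and then scores candidate orders from that matrix alone. Several parts are assumptions rather than details from the abstract: the SSCP matrix is mean-centered, the loss is the sum of residual variances obtained from a Cholesky factorization, and the neighbor determination method is "swap any two positions". The final adjacency-matrix step is not shown.

```python
import itertools
import math
import random


def centered_sscp(rows):
    """Mean-centered sum of squares and cross products matrix."""
    n, k = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(k)]
    return [[sum((r[i] - means[i]) * (r[j] - means[j]) for r in rows)
             for j in range(k)] for i in range(k)]


def order_loss(S, order, n):
    """Sum of residual variances when each variable is regressed on its
    predecessors in `order`, computed from S without touching the raw data."""
    k = len(order)
    P = [[S[i][j] for j in order] for i in order]   # permute rows and columns
    L = [[0.0] * k for _ in range(k)]
    for i in range(k):
        for j in range(i + 1):
            s = P[i][j] - sum(L[i][t] * L[j][t] for t in range(j))
            L[i][j] = math.sqrt(s) if i == j else s / L[j][j]
    return sum(L[i][i] ** 2 for i in range(k)) / n


def swap_neighbors(order):
    """Assumed neighbor method: every order reachable by one pairwise swap."""
    for i, j in itertools.combinations(range(len(order)), 2):
        nxt = list(order)
        nxt[i], nxt[j] = nxt[j], nxt[i]
        yield tuple(nxt)


def search(S, start, n):
    """Move to the best neighbor until no neighbor lowers the loss."""
    best, best_loss = tuple(start), order_loss(S, start, n)
    while True:
        cand = min(swap_neighbors(best), key=lambda o: order_loss(S, o, n))
        cand_loss = order_loss(S, cand, n)
        if cand_loss >= best_loss:
            return best, best_loss
        best, best_loss = cand, cand_loss


# Columns 0, 1, 2 form a synthetic chain 0 -> 1 -> 2 with equal noise variances.
rng = random.Random(1)
rows = []
for _ in range(400):
    a = rng.gauss(0, 1)
    b = 2.0 * a + rng.gauss(0, 1)
    c = -1.5 * b + rng.gauss(0, 1)
    rows.append((a, b, c))
S = centered_sscp(rows)
best, loss = search(S, (2, 1, 0), len(rows))
print(best, round(loss, 3))
```

Because every loss is derived from the precomputed matrix, the cost of scoring an order depends on the number of variables, not on the number of observations.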

Causal inference and policy optimization system based on deep learning models

Patent number: 11354566

Abstract: A treatment model that is a first neural network is trained to optimize a treatment loss function based on a treatment variable t using a plurality of observation vectors by regressing t on x^(1) and z. The trained treatment model is executed to compute an estimated treatment variable value t̂_i for each observation vector. An outcome model that is a second neural network is trained to optimize an outcome loss function by regressing y on x^(2) and an estimated treatment variable t. The trained outcome model is executed to compute an estimated first unknown function value and an estimated second unknown function value, each evaluated at x_i^(2), for each observation vector. An influence function value is computed for a parameter of interest using the two estimated function values. A value is computed for the predefined parameter of interest using the computed influence function value.

Type: Grant

Filed: October 21, 2021

Date of Patent: June 7, 2022

Assignee: SAS Institute Inc.

Inventors: Xilong Chen, Douglas Allan Cairns, Jan Chvosta, David Bruce Elsheimer, Yang Zhao, Ming-Chun Chang, Gunce Eryuruk Walton, Michael Thomas Lamm

Automatic spatial regression system

Patent number: 11328225

Abstract: A computing device selects a trained spatial regression model. A spatial weights matrix defined for observation vectors is selected, where each element of the spatial weights matrix indicates an amount of influence between respective pairs of observation vectors. Each observation vector is spatially referenced. A spatial regression model is selected from spatial regression models, initialized, and trained using the observation vectors and the spatial weights matrix to fit a response variable using regressor variables. Each observation vector includes a response value for the response variable and a regressor value for each regressor variable of the regressor variables. A fit criterion value is computed for the spatial regression model and the spatial regression model selection, initialization, and training are repeated until each spatial regression model is selected. A best spatial regression model is selected and output as the spatial regression model having an extremum value of the fit criterion value.

Type: Grant

Filed: November 11, 2021

Date of Patent: May 10, 2022

Assignee: SAS Institute Inc.

Inventors: Guohui Wu, Jan Chvosta, Wan Xu, Gunce Eryuruk Walton, Xilong Chen

Reducing consumption of computing resources in performing computerized sequence-mining on large data sets

Patent number: 11120032

Abstract: Computing resources consumed in performing computerized sequence-mining can be reduced by implementing some examples of the present disclosure. In one example, a system can determine weights for data entries in a data set and then select a group of data entries from the data set based on the weights. Next, the system can determine a group of k-length sequences present in the selected group of data entries by applying a shuffling algorithm. The system can then determine frequencies corresponding to the group of k-length sequences and select candidate sequences from among the group of k-length sequences based on the frequencies thereof. Next, the system can determine support values corresponding to the candidate sequences and then select output sequences from among the candidate sequences based on the support values thereof. The system may then transmit an output signal indicating the selected output sequences to an electronic device.

Type: Grant

Filed: March 23, 2021

Date of Patent: September 14, 2021

Assignee: SAS Institute Inc.

Inventors: Xilong Chen, Xunlei Wu, Jan Chvosta

Techniques for automated Bayesian posterior sampling using Markov Chain Monte Carlo and related schemes

Patent number: 11010451

Abstract: Techniques for automated Bayesian posterior sampling using Markov Chain Monte Carlo and related schemes are described. In an embodiment, one or more values in a stationarity phase for a system configured for Bayesian sampling may be initialized. Sampling may be performed in the stationarity phase based upon the one or more values to generate a plurality of samples. The plurality of samples may be evaluated based upon one or more stationarity criteria. The stationarity phase may be exited when the plurality of samples meets the one or more stationarity criteria. Other embodiments are described and claimed.

Type: Grant

Filed: March 13, 2014

Date of Patent: May 18, 2021

Assignee: SAS Institute Inc.

Inventors: Christian Macaro, Jan Chvosta, Mark Roland Little
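
The initialize, sample, evaluate, exit cycle in the abstract above (patent 11010451) might look like the following Python sketch. Everything specific here is an assumption for illustration: the sampler is random-walk Metropolis, the target is a standard normal density, and the stationarity criterion is a Geweke-style comparison of the first 10% and last 50% of a batch, with batch-means standard errors standing in for spectral estimates. The function names and the default values are hypothetical.

```python
import math
import random
import statistics


def metropolis(log_density, x0, n, step, rng):
    """Random-walk Metropolis sampler returning n successive states."""
    x, lp, out = x0, log_density(x0), []
    for _ in range(n):
        y = x + rng.gauss(0.0, step)
        lq = log_density(y)
        if rng.random() < math.exp(min(0.0, lq - lp)):
            x, lp = y, lq
        out.append(x)
    return out


def batch_mean_se(xs, n_batches):
    """Standard error of the mean that tolerates autocorrelated samples."""
    size = len(xs) // n_batches
    means = [statistics.fmean(xs[i * size:(i + 1) * size]) for i in range(n_batches)]
    return statistics.stdev(means) / math.sqrt(n_batches)


def is_stationary(xs, z_max=2.0):
    """Assumed criterion: early and late means agree within z_max standard errors."""
    head, tail = xs[: len(xs) // 10], xs[len(xs) // 2:]
    se = math.hypot(batch_mean_se(head, 10), batch_mean_se(tail, 20))
    z = (statistics.fmean(head) - statistics.fmean(tail)) / se
    return abs(z) < z_max


def stationarity_phase(log_density, x0=10.0, batch=2000, step=1.0,
                       max_rounds=20, seed=0):
    """Initialize values, sample, evaluate, and exit once the criterion is met."""
    rng = random.Random(seed)
    x, samples = x0, []
    for rounds in range(1, max_rounds + 1):
        samples = metropolis(log_density, x, batch, step, rng)
        if is_stationary(samples):
            return samples, rounds, True
        x = samples[-1]          # continue the chain from its last state
    return samples, max_rounds, False


samples, rounds, passed = stationarity_phase(lambda x: -0.5 * x * x)
print(rounds, passed, round(statistics.fmean(samples), 3))
```

A batch that fails the check is treated as burn-in and discarded, and only the batch that passes is returned. A production sampler would likely combine several criteria, since a single mean comparison can miss a short transient.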

Techniques for estimating compound probability distribution by simulating large empirical samples with scalable parallel and distributed processing

Patent number: 10325008

Abstract: Techniques for estimating a compound probability distribution are described herein. Embodiments may include receiving a compound model specification comprising a frequency model and a severity model, the compound model specification including a model error comprising a frequency model error and a severity model error, and determining a number of frequency models and severity models to generate based on the received number of models to generate. Embodiments include generating a plurality of frequency models through perturbation of the frequency model according to the frequency model error, and generating a plurality of severity models through perturbation of the severity model according to the severity model error.

Type: Grant

Filed: November 7, 2017

Date of Patent: June 18, 2019

Assignee: SAS Institute Inc.

Inventors: Mahesh V. Joshi, Richard Potter, Jan Chvosta, Mark Roland Little
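
The perturbation step in the abstract above (patent 10325008) can be pictured with a minimal Python sketch. The distributions are assumptions chosen for illustration, not details from the patent: the frequency model is Poisson, the severity model is lognormal, and each model error is a standard error used to draw perturbed parameters from a normal distribution. The dictionary keys and function names are hypothetical.

```python
import math
import random
import statistics


def poisson(lam, rng):
    """Knuth's multiplication method; adequate for small rates."""
    limit, k, p = math.exp(-lam), 0, 1.0
    while True:
        p *= rng.random()
        if p <= limit:
            return k
        k += 1


def simulate_aggregate(lam, mu, sigma, n_samples, rng):
    """Each draw is the sum of a Poisson number of lognormal severities."""
    return [sum(rng.lognormvariate(mu, sigma) for _ in range(poisson(lam, rng)))
            for _ in range(n_samples)]


def perturbed_compound_samples(freq, sev, n_models, n_samples, seed=0):
    """Generate one aggregate-loss sample per perturbed (frequency, severity) pair."""
    rng = random.Random(seed)
    results = []
    for _ in range(n_models):
        # Perturb each fitted parameter according to its assumed standard error.
        lam = max(rng.gauss(freq["rate"], freq["std_err"]), 1e-9)
        mu = rng.gauss(sev["mu"], sev["std_err"])
        results.append(simulate_aggregate(lam, mu, sev["sigma"], n_samples, rng))
    return results


freq = {"rate": 3.0, "std_err": 0.2}               # hypothetical fitted values
sev = {"mu": 1.0, "sigma": 0.5, "std_err": 0.05}   # hypothetical fitted values
results = perturbed_compound_samples(freq, sev, n_models=5, n_samples=1000)
means = [statistics.fmean(r) for r in results]
print([round(m, 2) for m in means])
```

The spread of the per-model means gives a rough sense of how much parameter uncertainty contributes to the aggregate estimate, separate from the sampling noise within each model.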

Approximate multivariate posterior probability distributions from simulated samples

Patent number: 10146741

Abstract: Various embodiments are directed to techniques for deriving a sample representation from a random sample. A computer-program product includes instructions to cause a first computing device to fit an empirical distribution function to a marginal probability distribution of a variable within a first sample portion of a random sample to derive a partial marginal probability distribution approximation, wherein the random sample is divided into multiple sample portions distributed among multiple computing devices; fit a first portion of a copula function to a multivariate probability distribution of the first sample portion, wherein the copula function is divided into multiple portions; and transmit an indication of a first likelihood contribution of the first sample portion to a coordinating device to cause a second computing device to fit a second portion of the copula function to a multivariate probability distribution of a second sample portion. Other embodiments are described and claimed.

Type: Grant

Filed: March 18, 2014

Date of Patent: December 4, 2018

Assignee: SAS Institute Inc.

Inventors: Christian Macaro, Jan Chvosta, Mark Roland Little

Techniques for producing statistically correct and efficient combinations of multiple simulated posterior samples

Patent number: 10095660

Abstract: Various embodiments are generally directed to techniques for producing statistically correct and efficient combinations of multiple simulated posterior samples from MCMC and related Bayesian sampling schemes. One or more chains from a Bayesian posterior distribution of values may be generated. It may be determined whether the one or more chains have reached stationarity through parallel processing on a plurality of processing nodes. Based upon the determination, each of the one or more chains that have reached stationarity through parallel processing on the plurality of processing nodes may be sorted. The one or more sorted chains may be resampled through parallel processing on the plurality of processing nodes. The one or more resampled chains may be combined. Other embodiments are described and claimed.

Type: Grant

Filed: March 13, 2014

Date of Patent: October 9, 2018

Assignee: SAS Institute Inc.

Inventors: Christian Macaro, Jan Chvosta, Mark Roland Little

Techniques for estimating compound probability distribution by simulating large empirical samples with scalable parallel and distributed processing

Patent number: 9928320

Abstract: Techniques for estimating a compound probability distribution are described herein. Embodiments may include receiving, at a master node of a distributed system, a compound model specification comprising frequency models, severity models, and one or more adjustment functions, wherein at least one model of the frequency models and the severity models depends on one or more regressors, and distributing the compound model specification to worker nodes of the distributed system, each of the worker nodes to at least generate a portion of samples for use in predicting compound distribution model estimates. Embodiments may also include predicting the compound distribution model estimates based on the sample portions of aggregate values and adjusted aggregate values.

Type: Grant

Filed: April 12, 2017

Date of Patent: March 27, 2018

Assignee: SAS Institute Inc.

Inventors: Mahesh V. Joshi, Richard Potter, Jan Chvosta, Mark Roland Little

TECHNIQUES FOR ESTIMATING COMPOUND PROBABILITY DISTRIBUTION BY SIMULATING LARGE EMPIRICAL SAMPLES WITH SCALABLE PARALLEL AND DISTRIBUTED PROCESSING

Publication number: 20180060470

Abstract: Techniques for estimating a compound probability distribution are described herein. Embodiments may include receiving a compound model specification comprising a frequency model and a severity model, the compound model specification including a model error comprising a frequency model error and a severity model error, and determining a number of frequency models and severity models to generate based on the received number of models to generate. Embodiments include generating a plurality of frequency models through perturbation of the frequency model according to the frequency model error, and generating a plurality of severity models through perturbation of the severity model according to the severity model error.

Type: Application

Filed: November 7, 2017

Publication date: March 1, 2018

Applicant: SAS Institute Inc.

Inventors: Mahesh V. Joshi, Richard Potter, Jan Chvosta, Mark Roland Little

TECHNIQUES FOR ESTIMATING COMPOUND PROBABILITY DISTRIBUTION BY SIMULATING LARGE EMPIRICAL SAMPLES WITH SCALABLE PARALLEL AND DISTRIBUTED PROCESSING

Publication number: 20170220713

Abstract: Techniques for estimating a compound probability distribution are described herein. Embodiments may include receiving, at a master node of a distributed system, a compound model specification comprising frequency models, severity models, and one or more adjustment functions, wherein at least one model of the frequency models and the severity models depends on one or more regressors, and distributing the compound model specification to worker nodes of the distributed system, each of the worker nodes to at least generate a portion of samples for use in predicting compound distribution model estimates. Embodiments may also include predicting the compound distribution model estimates based on the sample portions of aggregate values and adjusted aggregate values.

Type: Application

Filed: April 12, 2017

Publication date: August 3, 2017

Applicant: SAS Institute Inc.

Inventors: Mahesh V. Joshi, Richard Potter, Jan Chvosta, Mark Roland Little

Techniques for automated Bayesian posterior sampling using Markov Chain Monte Carlo and related schemes

Patent number: 9710428

Abstract: Techniques for automated Bayesian posterior sampling using Markov Chain Monte Carlo and related schemes are described. In an embodiment, one or more values in an accuracy phase for a system configured for Bayesian sampling may be initialized. Sampling may be performed in the accuracy phase based upon the one or more values to generate a plurality of samples. The plurality of samples may be evaluated based upon one or more accuracy criteria. The accuracy phase may be exited when the plurality of samples meets the one or more accuracy criteria. Other embodiments are described and claimed.

Type: Grant

Filed: March 13, 2014

Date of Patent: July 18, 2017

Assignee: SAS Institute Inc.

Inventors: Christian Macaro, Jan Chvosta, Mark Roland Little

Compact representation of multivariate posterior probability distribution from simulated samples

Patent number: 9672193

Abstract: Various embodiments are directed to techniques for selecting a subset of a set of simulated samples. A computer-program product includes instructions to cause a computing device to order a plurality of UPDFs by UPDF value, wherein the plurality of UPDFs is associated with a chain of draws of a set of simulated samples, wherein each draw comprises multiple parameters and the UPDF values map to parameter values of the parameters; select a subset of the plurality of UPDFs based on the subset of the plurality of UPDFs having UPDF values within a range corresponding to a range of parameter values to include in a subset of the set of simulated samples; and transmit an indication of a draw comprising parameters having parameter values to include in the subset of the set of simulated samples, wherein the indication identifies the draw by associated UPDF. Other embodiments are described and claimed.

Type: Grant

Filed: March 18, 2014

Date of Patent: June 6, 2017

Assignee: SAS Institute Inc.

Inventors: Christian Macaro, Jan Chvosta, Mark Roland Little

Techniques for estimating compound probability distribution by simulating large empirical samples with scalable parallel and distributed processing

Patent number: 9665669

Abstract: Techniques for estimating a compound probability distribution are described. An apparatus comprises a configuration component, a perturbation component, a sample generation controller, an aggregation component, a distribution fitting component, and a statistics generation component. The configuration component is operative to receive a compound model specification and a candidate distribution definition. The perturbation component is operative to generate a plurality of models from the compound model specification. The sample generation controller is operative to initiate the generation of a plurality of compound model samples from each of the plurality of models. The distribution fitting component generates parameter values for the candidate distribution definition based on the compound model samples. The statistics generation component generates approximated aggregate statistics.

Type: Grant

Filed: June 29, 2016

Date of Patent: May 30, 2017

Assignee: SAS Institute Inc.

Inventors: Mahesh V. Joshi, Richard Potter, Jan Chvosta, Mark Roland Little
