Patents by Inventor Jan Chvosta
Jan Chvosta has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20250053615Abstract: A computing device learns a directed acyclic graph (DAG). (A) A target variable is defined from variables based on a topological order vector and a first index. (B) Input variables are defined from the variables based on the topological order vector and a second index. (C) A machine learning model is trained with observation vectors using the target variable and the input variables. (D) The machine learning model is executed to compute a loss value. (E) The second index is incremented. (F) (B) through (E) are repeated a first plurality of times. (G) The first index is incremented. (H) (A) through (G) are repeated a second plurality of times. A parent set is determined for each variable based on a comparison between the loss value computed each repetition of (D). The parent set is output for each variable to describe the DAG that defines a hierarchical relationship between the variables.Type: ApplicationFiled: October 3, 2024Publication date: February 13, 2025Applicant: SAS Institute Inc.Inventors: Xilong Chen, Tao Huang, Jan Chvosta
-
Publication number: 20250045355Abstract: A computing device learns a directed acyclic graph (DAG). (A) A target variable is defined from variables based on a topological order vector and a first index. (B) Input variables are defined from the variables based on the topological order vector and a second index. (C) A machine learning model is trained with observation vectors using the target variable and the input variables. (D) The machine learning model is executed to compute a loss value. (E) The second index is incremented. (F) (B) through (E) are repeated a first plurality of times. (G) The first index is incremented. (H) (A) through (G) are repeated a second plurality of times. A parent set is determined for each variable based on a comparison between the loss value computed each repetition of (D). The parent set is output for each variable to describe the DAG that defines a hierarchical relationship between the variables.Type: ApplicationFiled: June 24, 2024Publication date: February 6, 2025Inventors: Xilong Chen, Tao Huang, Jan Chvosta
-
Publication number: 20250045611Abstract: A computing device learns a directed acyclic graph for a plurality of variables. (A) A target variable and zero or more input variables are defined based on a predefined topological order vector and a first index. (B) A machine learning model is trained with observation vectors using the target variable and the input variables. (C) The machine learning model is executed using the observation vectors with the target variable and the input variables to compute a residual vector. (D) The first index is incremented. (E) (A) through (D) are repeated a first plurality of times. A parent set is determined for each variable by comparing the residual vector computed each repetition of (C) to other residual vectors computed on other repetitions of (C). The parent set is output for each variable to describe a directed acyclic graph that defines a hierarchical relationship between the variables.Type: ApplicationFiled: June 24, 2024Publication date: February 6, 2025Inventors: Xilong Chen, Tao Huang, Jan Chvosta
-
Publication number: 20250045263Abstract: A computing device learns a best topological order vector for a plurality of variables. (A) A topological order vector is defined. (B) A target variable and zero or more input variables are defined based on the topological order vector. (C) A machine learning model is trained with observation vectors using values of the target variable and the zero or more input variables. (D) The machine learning model is executed with second observation vectors using the values of the target variable and the zero or more input variables to compute a loss value. (E) (A) through (D) are repeated a plurality of times. Each topological order vector defined in (A) is unique in comparison to other topological order vectors defined in (A). The best topological order vector is determined based on a comparison between the loss values computed for each topological order vector in (D).Type: ApplicationFiled: December 13, 2023Publication date: February 6, 2025Inventors: Xilong Chen, Tao Huang, Jan Chvosta
-
Patent number: 12056207Abstract: A computing device learns a best topological order vector of a plurality of variables. A target variable and zero or more input variables are defined. (A) A machine learning model is trained with observation vectors using the target variable and the zero or more input variables. (B) The machine learning model is executed to compute an equation loss value. (C) The equation loss value is stored with the identifier. (D) The identifier is incremented. (E) (A) through (D) are repeated a plurality of times. (F) A topological order vector is defined. (G) A loss value is computed from a subset of the stored equation loss values based on the topological order vector. (F) through (G) are repeated for each unique permutation of the topological order vector. A best topological order vector is determined based on a comparison between the loss value computed for each topological order vector in (G).Type: GrantFiled: December 13, 2023Date of Patent: August 6, 2024Assignee: SAS Institute Inc.Inventors: Xilong Chen, Tao Huang, Jan Chvosta
-
Patent number: 11769350Abstract: A computer system can automatically analyze a video of a physical activity and provide corresponding feedback. For example, the system can receive a video file including image frames showing an entity performing a physical activity that involves a sequence of movement phases. The system can generate coordinate sets by performing image analysis on the image frames. The system can provide the coordinate sets as input to a trained model, the trained model being configured to assign scores and movement phases to the image frames based on the coordinate sets. The system can then select a particular movement phase for which to provide feedback, based on the scores and movement phases assigned to the image frames. The system can generate the feedback for the entity about their performance of the particular movement phase, which may improve the entity's future performance of that particular movement phase.Type: GrantFiled: October 20, 2022Date of Patent: September 26, 2023Assignee: SAS Institute, Inc.Inventors: Ji Shen, Jared Langford Dean, Xilong Chen, Jan Chvosta
-
Patent number: 11443198Abstract: A computing device learns a directed acyclic graph (DAG). An SSCP matrix is computed from variable values defined for observation vectors. A topological order vector is initialized that defines a topological order for the variables. A loss value is computed using the topological order vector and the SSCP matrix. (A) A neighbor determination method is selected. (B) A next topological order vector is determined relative to the initialized topological order vector using the neighbor determination method. (C) A loss value is computed using the next topological order vector and the SSCP matrix. (D) (B) and (C) are repeated until each topological order vector is determined in (B) based on the neighbor determination method. A best topological vector is determined from each next topological order vector based on having a minimum value for the computed loss value. An adjacency matrix is computed using the best topological vector and the SSCP matrix.Type: GrantFiled: November 9, 2021Date of Patent: September 13, 2022Assignee: SAS Institute, Inc.Inventors: Xilong Chen, Tao Huang, Jan Chvosta
-
Patent number: 11354566Abstract: A treatment model that is a first neural network is trained to optimize a treatment loss function based on a treatment variable t using a plurality of observation vectors by regressing t on x(1),z. The trained treatment model is executed to compute an estimated treatment variable value {circumflex over (t)}i for each observation vector. An outcome model that is a second neural network is trained to optimize an outcome loss function by regressing y on x(2) and an estimated treatment variable t. The trained outcome model is executed to compute an estimated first unknown function value {circumflex over (?)}(xi(2)) and an estimated second unknown function value {circumflex over (?)}(xi(2)) for each observation vector. An influence function value is computed for a parameter of interest using {circumflex over (?)}(xi(2)) and {circumflex over (?)}(xi(2)). A value is computed for the predefined parameter of interest using the computed influence function value.Type: GrantFiled: October 21, 2021Date of Patent: June 7, 2022Assignee: SAS Institute Inc.Inventors: Xilong Chen, Douglas Allan Cairns, Jan Chvosta, David Bruce Elsheimer, Yang Zhao, Ming-Chun Chang, Gunce Eryuruk Walton, Michael Thomas Lamm
-
Patent number: 11328225Abstract: A computing device selects a trained spatial regression model. A spatial weights matrix defined for observation vectors is selected, where each element of the spatial weights matrix indicates an amount of influence between respective pairs of observation vectors. Each observation vector is spatially referenced. A spatial regression model is selected from spatial regression models, initialized, and trained using the observation vectors and the spatial weights matrix to fit a response variable using regressor variables. Each observation vector includes a response value for the response variable and a regressor value for each regressor variable of the regressor variables. A fit criterion value is computed for the spatial regression model and the spatial regression model selection, initialization, and training are repeated until each spatial regression model is selected. A best spatial regression model is selected and output as the spatial regression model having an extremum value of the fit criterion value.Type: GrantFiled: November 11, 2021Date of Patent: May 10, 2022Assignee: SAS Institute Inc.Inventors: Guohui Wu, Jan Chvosta, Wan Xu, Gunce Eryuruk Walton, Xilong Chen
-
Patent number: 11120032Abstract: Computing resources consumed in performing computerized sequence-mining can be reduced by implementing some examples of the present disclosure. In one example, a system can determine weights for data entries in a data set and then select a group of data entries from the data set based on the weights. Next, the system can determine a group of k-length sequences present in the selected group of data entries by applying a shuffling algorithm. The system can then determine frequencies corresponding to the group of k-length sequences and select candidate sequences from among the group of k-length sequences based on the frequencies thereof. Next, the system can determine support values corresponding to the candidate sequences and then select output sequences from among the candidate sequences based on the support values thereof. The system may then transmit an output signal indicating the selected output sequences an electronic device.Type: GrantFiled: March 23, 2021Date of Patent: September 14, 2021Assignee: SAS INSTITUTE INC.Inventors: Xilong Chen, Xunlei Wu, Jan Chvosta
-
Patent number: 11010451Abstract: Techniques for automated Bayesian posterior sampling using Markov Chain Monte Carlo and related schemes are described. In an embodiment, one or more values in a stationarity phase for a system configured for Bayesian sampling may be initialized. Sampling may be performed in the stationarity phase based upon the one or more values to generate a plurality of samples. The plurality of samples may be evaluated based upon one or more stationarity criteria. The stationarity phase may be exited when the plurality of samples meets the one or more stationarity criteria. Other embodiments are described and claimed.Type: GrantFiled: March 13, 2014Date of Patent: May 18, 2021Assignee: SAS INSTITUTE INC.Inventors: Christian Macaro, Jan Chvosta, Mark Roland Little
-
Patent number: 10325008Abstract: Techniques for estimated compound probability distribution are described herein. Embodiments may include receiving a compound model specification comprising a frequency model and a severity model, the compound model specification including a model error comprising a frequency model error and a severity model error, and determining a number of frequency models and severity models to generate based on the received number of models to generate. Embodiments include generating a plurality of frequency models through perturbation of the frequency model according to the frequency model error, and generating a plurality of severity models through perturbation of the severity model according to the severity model error.Type: GrantFiled: November 7, 2017Date of Patent: June 18, 2019Assignee: SAS INSTITUTE INC.Inventors: Mahesh V. Joshi, Richard Potter, Jan Chvosta, Mark Roland Little
-
Patent number: 10146741Abstract: Various embodiments are directed to techniques for deriving a sample representation from a random sample. A computer-program product includes instructions to cause a first computing device to fit an empirical distribution function to a marginal probability distribution of a variable within a first sample portion of a random sample to derive a partial marginal probability distribution approximation, wherein the random sample is divided into multiple sample portions distributed among multiple computing devices; fit a first portion of a copula function to a multivariate probability distribution of the first sample portion, wherein the copula function is divided into multiple portions; and transmit an indication of a first likelihood contribution of the first sample portion to a coordinating device to cause a second computing device to fit a second portion of the copula function to a multivariate probability distribution of a second sample portion. Other embodiments are described and claimed.Type: GrantFiled: March 18, 2014Date of Patent: December 4, 2018Assignee: SAS INSTITUTE INC.Inventors: Christian Macaro, Jan Chvosta, Mark Roland Little
-
Patent number: 10095660Abstract: Various embodiments are generally directed to techniques for producing statistically correct and efficient combinations of multiple simulated posterior samples from MCMC and related Bayesian sampling schemes are described. One or more chains from a Bayesian posterior distribution of values may be generated. It may be determine whether the one or more chains have reached stationarity through parallel processing on a plurality of processing nodes. Based upon the determination, each of the one or more chains that have reached stationarity through parallel processing on the plurality of processing nodes may be sorted. The one or more sorted chains may be resampled through parallel processing on the plurality of processing nodes. The one or more resampled chains may be combined. Other embodiments are described and claimed.Type: GrantFiled: March 13, 2014Date of Patent: October 9, 2018Assignee: SAS Institute Inc.Inventors: Christian Macaro, Jan Chvosta, Mark Roland Little
-
Patent number: 9928320Abstract: Techniques for estimated compound probability distribution are described herein. Embodiments may include receiving, at a master node of a distributed system, a compound model specification comprising frequency models, severity models, and one or more adjustment functions, wherein at least one model of the frequency models and the severity models depend on one or more regressor and distributing the compound model specification to worker nodes of the distributed system, each of the worker nodes to at least generate a portion of samples for use in predicting compound distribution model estimates. Embodiments may also include predicting the compound distribution model estimates based on the sample portions of aggregate values and adjusted aggregate values.Type: GrantFiled: April 12, 2017Date of Patent: March 27, 2018Assignee: SAS Institute Inc.Inventors: Mahesh V. Joshi, Richard Potter, Jan Chvosta, Mark Roland Little
-
Publication number: 20180060470Abstract: Techniques for estimated compound probability distribution are described herein. Embodiments may include receiving a compound model specification comprising a frequency model and a severity model, the compound model specification including a model error comprising a frequency model error and a severity model error, and determining a number of frequency models and severity models to generate based on the received number of models to generate. Embodiments include generating a plurality of frequency models through perturbation of the frequency model according to the frequency model error, and generating a plurality of severity models through perturbation of the severity model according to the severity model error.Type: ApplicationFiled: November 7, 2017Publication date: March 1, 2018Applicant: SAS Institute Inc.Inventors: Mahesh V. Joshi, Richard Potter, Jan Chvosta, Mark Roland Little
-
Publication number: 20170220713Abstract: Techniques for estimated compound probability distribution are described herein. Embodiments may include receiving, at a master node of a distributed system, a compound model specification comprising frequency models, severity models, and one or more adjustment functions, wherein at least one model of the frequency models and the severity models depend on one or more regressor and distributing the compound model specification to worker nodes of the distributed system, each of the worker nodes to at least generate a portion of samples for use in predicting compound distribution model estimates. Embodiments may also include predicting the compound distribution model estimates based on the sample portions of aggregate values and adjusted aggregate values.Type: ApplicationFiled: April 12, 2017Publication date: August 3, 2017Applicant: SAS INSTITUTE INC.Inventors: MAHESH V. JOSHI, RICHARD POTTER, JAN CHVOSTA, MARK ROLAND LITTLE
-
Patent number: 9710428Abstract: Techniques for automated Bayesian posterior sampling using Markov Chain Monte Carlo and related schemes are described. In an embodiment, one or more values in an accuracy phase for a system configured for Bayesian sampling may be initialized. Sampling may be performed in the accuracy phase based upon the one or more values to generate a plurality of samples. The plurality of samples may be evaluated based upon one or more accuracy criteria. The accuracy phase may be exited when the plurality of samples meets the one or more accuracy criteria. Other embodiments are described and claimed.Type: GrantFiled: March 13, 2014Date of Patent: July 18, 2017Assignee: SAS Institute Inc.Inventors: Christian Macaro, Jan Chvosta, Mark Roland Little
-
Patent number: 9672193Abstract: Various embodiments are directed to techniques for selecting a subset of a set of simulated samples. A computer-program product including instructions to cause a computing device to order a plurality of UPDFs by UPDF value, wherein the plurality of UPDFs is associated with a chain of draws of a set of simulated samples, wherein each draw comprises multiple parameters and the UPDF values map to parameter values of the parameters; select a subset of the plurality of UPDFs based on the subset of the plurality of UPDFs having UPDF values within a range corresponding to a range of parameter values to include in a subset of the set of simulated samples; and transmit an indication of a draw comprising parameters having parameter values to include in the subset of the set of simulated samples, wherein the indication identifies the draw by associated UPDF. Other embodiments are described and claimed.Type: GrantFiled: March 18, 2014Date of Patent: June 6, 2017Assignee: SAS Institute Inc.Inventors: Christian Macaro, Jan Chvosta, Mark Roland Little
-
Patent number: 9665669Abstract: Techniques for estimated compound probability distribution are described. An apparatus comprising a configuration component, perturbation component, sample generation controller, an aggregation component, a distribution fitting component, and statistics generation component. The configuration component operative to receive a compound model specification and candidate distribution definition. The perturbation component operative to generate a plurality of models from the compound model specification. The sample generation controller operative to initiate the generation of a plurality of compound model samples from each of the plurality of models. The distribution fitting component to generate parameter values for the candidate distribution definition based on the compound model samples. The statistics generation component to generate approximated aggregate statistics.Type: GrantFiled: June 29, 2016Date of Patent: May 30, 2017Assignee: SAS Institute Inc.Inventors: Mahesh V. Joshi, Richard Potter, Jan Chvosta, Mark Roland Little