Patents Assigned to SAS Institute
  • Patent number: 11227223
    Abstract: A computing system trains a classification model using distributed training data. In response to receipt of a first request, a training data subset is accessed and sent to each higher index worker computing device, the training data subset sent by each lower index worker computing device is received, and a first kernel matrix block and a second kernel matrix block are computed using a kernel function and the accessed or received training data subsets. (A) In response to receipt of a second request from the controller device, a first vector is computed using the first and second kernel matrix blocks, a latent function vector and an objective function value are computed, and the objective function value is sent to the controller device. (A) is repeated until the controller device determines training of a classification model is complete. Model parameters for the trained classification model are output.
    Type: Grant
    Filed: July 7, 2021
    Date of Patent: January 18, 2022
    Assignee: SAS Institute Inc.
    Inventor: Yingjian Wang
  • Patent number: 11216603
    Abstract: A computing system receives a request for a computer-generated design of an experiment. A design space is defined by candidate inputs for each factor of the experiment. The system receives a disallowed combination indication indicating a user-defined disallowed combination for the design space. The disallowed combination constrains a first set of values from a first set of candidate inputs from being assigned to the first factor if the second factor is assigned one of a second set of values from a second set of candidate inputs. The system determines additional constraint(s) on computer generation of the design. The system evaluates if a user-defined model can be generated according to the disallowed combination and the constraint(s). The system generates a computer suggested modification of the user-defined model such that the design can be generated according to the computer suggested modification.
    Type: Grant
    Filed: January 8, 2021
    Date of Patent: January 4, 2022
    Assignee: SAS Institute Inc.
    Inventors: Joseph Albert Morgan, Ryan Adam Lekivetz, Bradley Allen Jones, Caleb Bridges King
  • Patent number: 11200514
    Abstract: Unclassified observations are classified. Similarity values are computed for each unclassified observation and for each target variable value. A confidence value is computed for each unclassified observation using the similarity values. A high-confidence threshold value and a low-confidence threshold value are computed from the confidence values. For each observation, when the confidence value is greater than the high-confidence threshold value, the observation is added to a training dataset and, when the confidence value is greater than the low-confidence threshold value and less than the high-confidence threshold value, the observation is added to the training dataset based on a comparison between a random value drawn from a uniform distribution and an inclusion percentage value. A classification model is trained with the training dataset and classified observations. The trained classification model is executed with the unclassified observations to determine a label assignment.
    Type: Grant
    Filed: June 9, 2021
    Date of Patent: December 14, 2021
    Assignee: SAS Institute Inc.
    Inventors: Xu Chen, Xinmin Wu
  • Patent number: 11194940
    Abstract: A computing system determines a design space for designing a design system. The computing system receives a request to identify input(s) that will provide a response of the design system that advances a user-defined goal for the design system. The computing system, responsive to the request to identify input(s), generates a design for the design space that provides design cases for generating the response of the design system and obtains responses of the design system generated for multiple design cases of the design for the design system. The computing system selects at least one design case based on obtained responses of the design system for the multiple design cases, and based on the at least one design case, outputs an indication of suggested input(s) for the design system. The suggested input(s) advances the user-defined goal for the design system and is not disallowed according to disallowed combination(s).
    Type: Grant
    Filed: January 8, 2021
    Date of Patent: December 7, 2021
    Assignee: SAS Institute Inc.
    Inventors: Joseph Albert Morgan, Yeng Saanchi, Laura Carmen Lancaster, Christopher Michael Gotwalt, Caleb Bridges King, Ryan Adam Lekivetz
  • Patent number: 11195084
    Abstract: A computing device trains a neural network machine learning model. A forward propagation of a first neural network is executed. A backward propagation of the first neural network is executed from a last layer to a last convolution layer of a plurality of convolutional layers to compute a gradient vector for first weight values of the last convolution layer using observation vectors. A discriminative localization map is computed for each observation vector with the gradient vector using a discriminative localization map function. A forward and a backward propagation of a second neural network is executed to compute a second weight value for each neuron of the second neural network using the discriminative localization map computed for each observation vector. A predefined number of iterations of the forward and the backward propagation of the second neural network is repeated.
    Type: Grant
    Filed: March 11, 2021
    Date of Patent: December 7, 2021
    Assignee: SAS Institute Inc.
    Inventors: Xinmin Wu, Yingjian Wang, Xiangqian Hu
  • Publication number: 20210366099
    Abstract: Various embodiments are generally directed to techniques for image content extraction. Some embodiments include extracting contextually structured data from document images, such as by automatically identifying document layout, document data, document metadata, and/or correlations therebetween in a document image, for instance. Several embodiments include extracting contextually structured data from table images, such as gridded and non-gridded tables. For example, the contents of cells may be extracted from a table image along with structural context including the corresponding row and column information. Many embodiments are directed to generating and utilizing a document template database for automatically extracting document image contents into a contextually structured format. Several embodiments are directed to automatically identifying and associating document metadata with corresponding document data in a document image to generate a machine-facilitated annotation of the document image.
    Type: Application
    Filed: August 9, 2021
    Publication date: November 25, 2021
    Applicant: SAS Institute Inc.
    Inventors: Yi Liao, Charles Franklin Board, William Robert Nadolski, David James Wheaton, Heather Michelle Goodykoontz, Adheesha Sanjuaya Arangala, Karthik Nakkeeran
  • Patent number: 11176692
    Abstract: A computing system responsive to obtaining original image data, detects a set of data point(s), in the original image data, that indicates an object. The system determines, based on the set of data point(s), a set of pixels associated with the object in the original image data. The system generates an alternative visual identifier for the object that provides a unique identifier for the set of pixels absent in the original image data. The system generates, autonomously from intervention by any user of the computing system, pixel information to conceal feature(s) of the object. The system obtains modified image data comprising the alternative visual identifier. The modified image data further comprises the feature(s) of the object in the original image data visually concealed in the modified image data according to the pixel information. The system outputs an image representation of a trajectory of the object through the modified image data.
    Type: Grant
    Filed: October 1, 2020
    Date of Patent: November 16, 2021
    Assignee: SAS Institute Inc.
    Inventors: Hamza Mustafa Ghadyali, Kedar Shriram Prabhudesai, Jonathan Lee Walker, Xunlei Wu, Xingqi Du, Bahar Biller, Mohammadreza Nazari, Afshin Oroojlooyjadid, Alexander Richard Phelps, Davood Hajinezhad, Varunraj Valsaraj, Jorge Manuel Gomes da Silva, Jinxin Yi
  • Patent number: 11176691
    Abstract: A computing system obtains image data representing images. Each of the images is captured at different time points of a physical environment. The physical environment comprises a first object and a second object. The computing system executes a control system to augment the physical environment. The control system detects a group forming in the images. The control system tracks an aspect of a movement, of a given object, in the group. The control system simulates the physical environment and the movement, of the given object, in the group in a simulated environment. The control system evaluates simulated actions in the simulated environment for a predefined objective for the physical environment. The predefined objective is related to an interaction between objects in the group. The control system generates based on evaluated simulated actions and autonomously from involvement by any user of the control system, an indication to augment the physical environment.
    Type: Grant
    Filed: October 1, 2020
    Date of Patent: November 16, 2021
    Assignee: SAS Institute Inc.
    Inventors: Hamza Mustafa Ghadyali, Kedar Shriram Prabhudesai, Mohammadreza Nazari, Bahar Biller, Afshin Oroojlooyjadid, Alexander Richard Phelps, Jonathan Lee Walker, Xunlei Wu, Xingqi Du, Davood Hajinezhad, Varunraj Valsaraj, Jorge Manuel Gomes da Silva, Jinxin Yi
  • Patent number: 11151463
    Abstract: Data is classified using semi-supervised data. Sparse coefficients are computed using a decomposition of a Laplacian matrix. (B) Updated parameter values are computed for a dimensionality reduction method using the sparse coefficients, the Laplacian matrix, and a plurality of observation vectors. The updated parameter values include a robust estimator of a decomposition matrix determined from the decomposition of the Laplacian matrix. (B) is repeated until a convergence parameter value indicates the updated parameter values for the dimensionality reduction method have converged. A classification matrix is defined using the sparse coefficients and the robust estimator of the decomposition of the Laplacian matrix. The target variable value is determined for each observation vector based on the classification matrix.
    Type: Grant
    Filed: February 18, 2021
    Date of Patent: October 19, 2021
    Assignee: SAS Institute Inc.
    Inventors: Xu Chen, Jorge Manuel Gomes da Silva, Brett Alan Wujek
  • Patent number: 11151480
    Abstract: A visualization is presented while tuning a machine learning model. A model tuning process writes tuning data to a history table. The model tuning process is repeatedly training and scoring a model type with different sets of values of hyperparameters defined based on the model type. An objective function value is computed for each set of values of the hyperparameters. Data stored in the history table is accessed and used to identify the hyperparameters. (A) A page template is selected from page templates that describe graphical objects presented in the display. (B) The page template is updated with the accessed data. (C) The display is updated using the updated page template. (D) At the end of a refresh time period, new data stored in the history table by the model tuning process is accessed. (E) (B) through (D) are repeated with the accessed data replaced with the accessed new data.
    Type: Grant
    Filed: November 17, 2020
    Date of Patent: October 19, 2021
    Assignee: SAS Institute Inc.
    Inventors: Oleg Borisovich Golovidov, Brett Alan Wujek, Patrick Nathan Koch, Rajendra Prasad Singh
  • Publication number: 20210312277
    Abstract: Requests for computing resources and other resources can be predicted and managed. For example, a system can determine a baseline prediction indicating a number of requests for an object over a future time-period. The system can then execute a first model to generate a first set of values based on seasonality in the baseline prediction, a second model to generate a second set of values based on short-term trends in the baseline prediction, and a third model to generate a third set of values based on the baseline prediction. The system can select a most accurate model from among the three models and generate an output prediction by applying the set of values output by the most accurate model to the baseline prediction. Based on the output prediction, the system can cause an adjustment to be made to a provisioning process for the object.
    Type: Application
    Filed: November 3, 2020
    Publication date: October 7, 2021
    Applicant: SAS Institute Inc.
    Inventors: Kedar Shriram Prabhudesai, Varunraj Valsaraj, Jinxin Yi, Daniel Keongson Woo, Roger Lee Baldridge, JR.
  • Patent number: 11132364
    Abstract: A computing system determines a response to a query. A bin start value and a bin stop value is defined for each bin based on an input bin option. End nodes are split based on the bin start value and the bin stop value of each bin to define a second plurality of end nodes. Each start node of a plurality of start nodes that is connected to each end node of the second plurality of end nodes is identified based on the respective link attributes of a plurality of link attributes. Overlapping start nodes of the plurality of start nodes that overlap at an end node of the second plurality of end nodes are identified based on a predefined overlap query graph that defines a connectivity to identify between a start node and the end node. The identified overlapping start nodes are output as a response to the predefined overlap query graph.
    Type: Grant
    Filed: April 8, 2021
    Date of Patent: September 28, 2021
    Assignee: SAS Institute Inc.
    Inventors: Matthew Victor Galati, Brandon Michael Reese
  • Publication number: 20210295846
    Abstract: An apparatus includes processor(s) to: use an acoustic model to generate a first set of probabilities of speech sounds uttered within speech audio; derive at least a first candidate word most likely spoken in the speech audio using the first set; analyze the first set to derive a degree of uncertainty therefor; compare the degree of uncertainty to a threshold; in response to at least the degree of uncertainty being less than the threshold, select the first candidate word as a next word most likely spoken in the speech audio; in response to at least the degree of uncertainty being greater than the threshold, select, as the next word most likely spoken in the speech audio, a second candidate word indicated as being most likely spoken based on a second set of probabilities generated by a language model; and add the next word most likely spoken to a transcript.
    Type: Application
    Filed: March 18, 2021
    Publication date: September 23, 2021
    Applicant: SAS Institute Inc.
    Inventor: XU YANG
  • Publication number: 20210295845
    Abstract: An apparatus includes processor(s) to: divide a speech data set into multiple data chunks that each represent a chunk of speech audio; derive a threshold amplitude based on at least one peak amplitude of the speech audio; designate each data chunk with a peak amplitude below the threshold amplitude a pause data chunk; within a set of temporally consecutive data chunks of the multiple data chunks, identify a longest subset of temporally consecutive pause data chunks; within the set of temporally consecutive data chunks, designate the longest subset of temporally consecutive pause data chunks as a likely sentence pause of a candidate set of likely sentence pauses; based on at least the candidate set, divide the speech data set into multiple data segments that each represent a speech segment of the speech audio; and perform speech-to-text conversion, to identify a sentence spoken in each speech segment.
    Type: Application
    Filed: December 30, 2020
    Publication date: September 23, 2021
    Applicant: SAS Institute Inc.
    Inventors: XIAOZHUO CHENG, XU YANG, XIAOLONG LI
  • Publication number: 20210294568
    Abstract: Geospatial data can be converted into audio outputs. For example, a system can receive a dataset indicating geospatial locations of objects within a region. Based on the dataset, the system can generate a virtual map representing the region and including virtual points representing the objects. The virtual points can be spatially positioned at locations in the virtual map corresponding to the geospatial locations of the objects in the region. The system can receive a user input via a user input device for interacting with a particular virtual point among the virtual points in the virtual map. The system can determine one or more sound characteristics for a sound based on receiving the user input. The system can then transmit an audio signal to an audio device for causing the audio device to generate the sound having the one or more sound characteristics, which may assist with exploring the virtual map.
    Type: Application
    Filed: August 24, 2020
    Publication date: September 23, 2021
    Applicant: SAS Institute Inc.
    Inventors: Claude Edward Summers, II, Sean Patrick Mealin, Julianna Elizabeth Langston, Gregory David Kraus, Jonathan Tyler Williamson, Lisa Beth Morton Robinson, Jesse Daniel Sookne, Brice Joseph Smith
  • Patent number: 11125655
    Abstract: A computing device receives a request for a design of an experiment. The device generates a data representation of a matrix Y that defines a supersaturated design for the design of the experiment. The generating the data representation is by: generating a data representation of a matrix X according to an obtained design; computing an indication of correlation between effects of factors of a matrix Y; and generating, based on the indication of correlation, the data representation of the matrix Y that is the transposition of the matrix X or is the transposition of a matrix X*. The matrix X* is a first subset of the matrix X such that the transposition of the matrix X* represents a same number of factors as the transposition of the matrix X. The device outputs a setting for each test condition of the supersaturated design for the experiment.
    Type: Grant
    Filed: August 5, 2020
    Date of Patent: September 21, 2021
    Assignee: SAS Institute Inc.
    Inventors: Ryan Adam Lekivetz, Joseph Albert Morgan, Bradley Allen Jones, Caleb Bridges King
  • Patent number: 11120072
    Abstract: A computer transforms high-dimensional data into low-dimensional data. (A) A distance matrix is computed from observation vectors. (B) A kernel matrix is computed from the distance matrix using a bandwidth value. (C) The kernel matrix is decomposed using an eigen decomposition to define eigenvalues. (D) A predefined number of largest eigenvalues are selected from the eigenvalues. (E) The selected largest eigenvalues are summed. (F) A next bandwidth value is computed based on the summed eigenvalues. (A) through (F) are repeated with the next bandwidth value until a stop criterion is satisfied. Each observation vector of the observation vectors is transformed into a second space using a kernel principal component analysis with the next bandwidth value and the kernel matrix. The second space has a dimension defined by the predefined number of first eigenvalues. Each transformed observation vector is output.
    Type: Grant
    Filed: February 23, 2021
    Date of Patent: September 14, 2021
    Assignee: SAS Institute Inc.
    Inventors: Kai Shen, Haoyu Wang, Arin Chaudhuri
  • Patent number: 11109194
    Abstract: A computing system receives geolocation information indicating aggregated locations of mobile devices configured to move in a geographic area. The geolocation information comprises measured location(s) for a given mobile device of the mobile devices. The system generates a time series representing mobility network graphs over a first time period. The time series is generated by, for each subperiod in the time series, generating data representing estimated movement of member(s) of a population between locations within the geographic area. The estimated movement is estimated based on the geolocation information and a total population for the geographic area. The system generates metric(s) derived from the time series. The system determines contamination information indicating a respective contamination status for locations for each subperiod of the time series. The system generates a computer model to predict changes in the contamination information in a second time period subsequent to the first time period.
    Type: Grant
    Filed: February 12, 2021
    Date of Patent: August 31, 2021
    Assignee: SAS Institute Inc.
    Inventors: Carlos Andre Reis Pinheiro, Matthew Victor Galati, Natalia Summerville
  • Publication number: 20210263949
    Abstract: Computerized pipelines can transform input data into data structures compatible with models in some examples. In one such example, a system can obtain a first table that includes first data referencing a set of subjects. The system can then execute a sequence of processing operations on the first data in a particular order defined by a data-processing pipeline to modify an analysis table to include features associated with the set of subjects. Executing each respective processing operation in the sequence to generate the modified analysis table may involve: deriving a respective set of features from the first data by executing a respective feature-extraction operation on the first data; and adding the respective set of features to the analysis table. The system may then execute a predictive model on the modified analysis table for generating a predicted value based on the modified analysis table.
    Type: Application
    Filed: February 11, 2021
    Publication date: August 26, 2021
    Applicant: SAS Institute Inc.
    Inventors: James Allen Cox, Nancy Anne Rausch
  • Patent number: 11099899
    Abstract: A computing device receives, from a thread of a multi-thread application, a release message. Each of the threads indicates operation(s) on a memory associated with the application. The release message indicates that a data object used by the thread is released. The device indicates that a memory slot of a data pool is unlocked permitting storage of an indication of a location of the data object in the memory. Each memory slot of the data pool is individually lockable such that a locked memory slot of the data pool indicates storing a location in the locked memory slot will not be permitted even though storing the location in an unlocked memory slot of the data pool will be permitted. The device stores, in the memory slot of the data pool, an indication of a location of the data object. The data object comprises the location of the memory slot.
    Type: Grant
    Filed: April 2, 2020
    Date of Patent: August 24, 2021
    Assignee: SAS Institute Inc.
    Inventor: Charles S. Shorb