Patents Assigned to SAS Institute Inc.
  • Patent number: 11886329
    Abstract: A computing device selects new test configurations for testing software. (A) First test configurations are generated using a random seed value. (B) Software under test is executed with the first test configurations to generate a test result for each. (C) Second test configurations are generated from the first test configurations and the test results generated for each. (D) The software under test is executed with the second test configurations to generate the test result for each. (E) When a restart is triggered based on a distance metric value computed between the second test configurations, a next random seed value is selected as the random seed value and (A) through (E) are repeated. (F) When the restart is not triggered, (C) through (F) are repeated until a stop criterion is satisfied. (G) When the stop criterion is satisfied, the test result is output for each test configuration.
    Type: Grant
    Filed: June 15, 2022
    Date of Patent: January 30, 2024
    Assignee: SAS Institute Inc.
    Inventors: Steven Joseph Gardner, Connie Stout Dunbar, David Bruce Elsheimer, Gregory Scott Dunbar, Joshua David Griffin, Yan Gao
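Illustration for patent 11886329: steps (A) through (G) of the abstract can be read as a search loop with random restarts. The following is a minimal sketch under stated assumptions, not the patented method. The names `run_search` and `mean_pairwise_distance`, the mutation rule, the use of Euclidean distance, the restart threshold, the choice to treat lower test results as better, and the evaluation budget used as the stop criterion are all hypothetical.

```python
import itertools
import math
import random


def mean_pairwise_distance(configs):
    """Average Euclidean distance between all pairs of configurations."""
    pairs = list(itertools.combinations(configs, 2))
    if not pairs:
        return 0.0
    return sum(math.dist(a, b) for a, b in pairs) / len(pairs)


def run_search(software_under_test, seeds, n_configs=4, dim=2,
               restart_below=0.05, max_evals=40):
    """Hypothetical restart loop; every parameter here is an assumption."""
    results = []                      # (configuration, test result) pairs
    seed_iter = iter(seeds)
    rng = random.Random(next(seed_iter))
    restarts = 0

    def evaluate(configs):
        scored = [(c, software_under_test(c)) for c in configs]
        results.extend(scored)
        return scored

    def first_configs():
        return [tuple(rng.random() for _ in range(dim))
                for _ in range(n_configs)]

    # (A)/(B): first configurations from the seed, then execute them.
    scored = evaluate(first_configs())
    while len(results) < max_evals:   # stop criterion: evaluation budget
        # (C): derive new configurations from the best half (lowest result).
        scored.sort(key=lambda cr: cr[1])
        parents = [c for c, _ in scored[:max(1, n_configs // 2)]]
        children = [tuple(x + rng.gauss(0, 0.1) for x in rng.choice(parents))
                    for _ in range(n_configs)]
        scored = evaluate(children)   # (D)
        # (E): restart with the next seed when the configurations collapse.
        if mean_pairwise_distance(children) < restart_below:
            next_seed = next(seed_iter, None)
            if next_seed is None:
                break                 # no seeds left, so stop early
            rng = random.Random(next_seed)
            restarts += 1
            scored = evaluate(first_configs())
    return results, restarts          # (G): a result for every configuration
```

A call such as `run_search(lambda c: sum(x * x for x in c), seeds=[1, 2, 3])` returns every evaluated configuration with its result, plus the number of restarts that occurred.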
  • Patent number: 11887012
    Abstract: A computing device identifies an anomaly among a plurality of observation vectors. An observation vector is projected using a predefined orthogonal complement matrix. The predefined orthogonal complement matrix is determined from a decomposition of a low-rank matrix. The low-rank matrix is computed using a robust principal component analysis algorithm. The projected observation vector is multiplied by a predefined demixing matrix to define a demixed observation vector. The predefined demixing matrix is computed using an independent component analysis algorithm and the predefined orthogonal complement matrix. A detection statistic value is computed from the defined, demixed observation vector. When the computed detection statistic value is greater than or equal to a predefined anomaly threshold value, an indicator is output that the observation vector is an anomaly.
    Type: Grant
    Filed: July 19, 2023
    Date of Patent: January 30, 2024
    Assignee: SAS Institute Inc.
    Inventors: Sudipta Kolay, Steven Guanxing Xu, Kai Shen, Zohreh Asgharzadeh Talebi
  • Publication number: 20240028621
    Abstract: A computer-implemented system includes identifying a target hierarchical taxonomy comprising a plurality of distinct hierarchical taxonomy categories; extracting a plurality of distinct taxonomy tokens from the plurality of distinct hierarchical taxonomy categories; computing a taxonomy vector corpus based on the plurality of distinct taxonomy tokens; computing a plurality of distinct taxonomy clusters based on an input of the taxonomy vector corpus; constructing a hierarchical taxonomy classifier based on the plurality of distinct taxonomy clusters; converting a volume of unlabeled structured datasets to a plurality of distinct corpora of taxonomy-labeled structured datasets based on the hierarchical taxonomy classifier; and outputting at least one corpus of taxonomy-labeled structured datasets of the plurality of distinct corpora of taxonomy-labeled structured datasets based on an input of a data classification query.
    Type: Application
    Filed: July 13, 2023
    Publication date: January 25, 2024
    Applicant: SAS Institute Inc.
    Inventors: Nancy Anne Rausch, Ruth Oluwadamilola Akintunde, Brant Nathan Kay
  • Patent number: 11875238
    Abstract: A computing system obtains a first preconfigured feature set. The first preconfigured feature set defines: a first feature definition defining an input variable, and first computer instructions for locating first data. The first data is available for retrieval because it is stored, or set up to arrive, in the feature storage according to the first preconfigured feature set. The computing system receives a requested data set for the input variable. The computing system generates an availability status indicating whether the requested data set is available for retrieval according to the first preconfigured feature set. Based on the availability status, the computing system generates the requested data set by: retrieving historical data for the first preconfigured feature set; retrieving a data definition associated with the historical data; and generating the requested data based on the historical data and the data definition.
    Type: Grant
    Filed: June 23, 2022
    Date of Patent: January 16, 2024
    Assignee: SAS Institute Inc.
    Inventors: Piotr Kaczynski, Aneta Maksymiuk, Artur Lukasz Skalski, Wioletta Paulina Stobieniecka, Dwijendra Nath Dwivedi
  • Patent number: 11875189
    Abstract: An apparatus includes at least one node device to host a computing cluster, and at least one processor to generate a UI providing guidance through a set of configuration settings for the computing cluster, wherein, for each configuration setting that is received as an input during configuration, the at least one processor is caused to: perform a check of the set of configuration settings to determine whether the received configuration setting creates a conflict among the set of configuration settings; and in response to a determination that the received configuration setting creates a conflict among the set of configuration settings, perform operations including generate an indication of the conflict for presentation by the UI, and receive a change to a configuration setting as an input from the input device.
    Type: Grant
    Filed: March 17, 2023
    Date of Patent: January 16, 2024
    Assignee: SAS Institute Inc.
    Inventors: Richard K. Wellum, Joseph Daniel Henry, Holden Ernest O'Neal, John W. Waller
  • Patent number: 11860212
    Abstract: A computer monitors a status of grid devices using sensor measurements. Sensor data is clustered using a predefined grouping distance value to define one or more sensor event clusters. A plurality of monitored devices is clustered using a predefined clustering distance value to define one or more asset clusters. A location is associated with each monitored device of the plurality of monitored devices. A distance is computed between each sensor event cluster and each asset cluster. When the computed distance is less than or equal to a predefined asset/sensor distance value for a sensor event cluster and an asset cluster, an asset identifier of the asset cluster associated with the computed distance is added to an asset event list. For each asset cluster included in the asset event list, an asset location of an asset is shown on a map in a graphical user interface presented in a display.
    Type: Grant
    Filed: June 26, 2023
    Date of Patent: January 2, 2024
    Assignee: SAS Institute Inc.
    Inventors: Thomas Dale Anderson, Priyadarshini Sharma, Mark Joseph Konya, Yuwei Liao
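Illustration for patent 11860212: the abstract describes clustering sensor events and monitored devices by distance, then listing the asset clusters that lie close to a sensor event cluster. The sketch below is one possible reading, not the patented method. The greedy grouping rule, the use of cluster centroids, planar Euclidean distance, and all function and parameter names are assumptions.

```python
import math


def cluster_by_distance(points, max_gap):
    """Greedy grouping (an assumption): a point joins the first cluster that
    already holds a member within max_gap; otherwise it starts a new one."""
    clusters = []
    for p in points:
        for cluster in clusters:
            if any(math.dist(p, q) <= max_gap for q in cluster):
                cluster.append(p)
                break
        else:
            clusters.append([p])
    return clusters


def centroid(cluster):
    """Mean location of a cluster of (x, y) points."""
    xs, ys = zip(*cluster)
    return (sum(xs) / len(xs), sum(ys) / len(ys))


def asset_event_list(sensor_events, assets, grouping_distance,
                     clustering_distance, asset_sensor_distance):
    """Return indices of asset clusters near any sensor event cluster,
    together with the asset clusters themselves."""
    event_centers = [centroid(c) for c in
                     cluster_by_distance(sensor_events, grouping_distance)]
    asset_clusters = cluster_by_distance(assets, clustering_distance)
    flagged = []
    for idx, cluster in enumerate(asset_clusters):
        center = centroid(cluster)
        if any(math.dist(center, e) <= asset_sensor_distance
               for e in event_centers):
            flagged.append(idx)
    return flagged, asset_clusters
```

The flagged indices play the role of the asset event list; a user interface would then plot the locations of the assets in those clusters on a map.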
  • Patent number: 11862171
    Abstract: An apparatus includes a processor to: receive, from a requesting device, a request to perform speech-to-text conversion of a speech data set; within a first thread of a thread pool, perform a first pause detection technique to identify a first set of likely sentence pauses; within a second thread of the thread pool, perform a second pause detection technique to identify a second set of likely sentence pauses; perform a speaker diarization technique to identify a set of likely speaker changes; divide the speech data set into data segments representing speech segments based on a combination of at least the first set of likely sentence pauses, the second set of likely sentence pauses, and the set of likely speaker changes; use at least an acoustic model with each data segment to identify likely speech sounds; and generate a transcript based, at least in part, on the identified likely speech sounds.
    Type: Grant
    Filed: November 23, 2022
    Date of Patent: January 2, 2024
    Assignee: SAS Institute Inc.
    Inventors: Xiaolong Li, Xiaozhuo Cheng, Samuel Norris Henderson, Xu Yang
  • Patent number: 11846979
    Abstract: Anomalies in a target object can be detected and diagnosed using improved Mahalanobis-Taguchi system (MTS) techniques. For example, an anomaly detection and diagnosis (ADD) system can receive a set of measurements associated with attributes of a target object. A Mahalanobis distance (MD) can be determined using a generalized inverse matrix. An abnormal condition can be detected when the MD is greater than a predetermined threshold value. The ADD system can determine an importance score for each measurement of a corresponding attribute. The attribute whose measurement has the highest importance score can be determined to be responsible for the abnormal condition.
    Type: Grant
    Filed: May 17, 2023
    Date of Patent: December 19, 2023
    Assignee: SAS Institute Inc.
    Inventors: Kevin L. Scott, Deovrat Vijay Kakde, Arin Chaudhuri, Sergiy Peredriy
  • Patent number: 11842379
    Abstract: The computing device obtains a training data set related to a plurality of historic user inputs associated with preferences of one or more services or items from an entity. For each of the one or more services or items, the computing device executes operations to train a plurality of models using the training data set to generate a plurality of recommended models, apply a validation data set to generate a plurality of predictions from the plurality of recommended models, obtain a weight of each metric of a plurality of metrics from the entity, obtain user inputs associated with user preferences, and determine a relevancy score for each metric. The computing device selects a recommended model based on the relevancy score of the selected metric or a combination of selected metrics, generates one or more recommendations for the users, and outputs the one or more generated recommendations to the users.
    Type: Grant
    Filed: February 15, 2023
    Date of Patent: December 12, 2023
    Assignee: SAS Institute Inc.
    Inventors: Jonathan Lee Walker, Hardi Desai, Xuejun Liao, Varunraj Valsaraj
  • Publication number: 20230394109
    Abstract: Anomalies in a target object can be detected and diagnosed using improved Mahalanobis-Taguchi system (MTS) techniques. For example, an anomaly detection and diagnosis (ADD) system can receive a set of measurements associated with attributes of a target object. A Mahalanobis distance (MD) can be determined using a generalized inverse matrix. An abnormal condition can be detected when the MD is greater than a predetermined threshold value. The ADD system can determine an importance score for each measurement of a corresponding attribute. The attribute whose measurement has the highest importance score can be determined to be responsible for the abnormal condition.
    Type: Application
    Filed: May 17, 2023
    Publication date: December 7, 2023
    Applicant: SAS Institute Inc.
    Inventors: Kevin L. Scott, Deovrat Vijay Kakde, Arin Chaudhuri, Sergiy Peredriy
  • Patent number: 11836968
    Abstract: A system, method, and computer-program product includes detecting, via a localization machine learning model, a target object within a scene based on downsampled image data of the scene, identifying a likely position of the target object within original image data of the scene, extracting, from the original image data of the scene, a target sub-image containing the target object, classifying, via an object classification machine learning model, the target object to a probable object class of a plurality of distinct object classes, routing the target image resolution of the target sub-image to a target object-condition machine learning classification model of a plurality of distinct object-condition machine learning classification models, classifying, via the target object-condition machine learning classification model, the target object to a probable object-condition class, and displaying, via a graphical user interface, a representation of the target object in association with the probable object-condition
    Type: Grant
    Filed: August 24, 2023
    Date of Patent: December 5, 2023
    Assignee: SAS Institute Inc.
    Inventors: Robert Winston Blanchard, Neela Niranjani Vengateshwaran
  • Publication number: 20230386473
    Abstract: A system, method, and computer-program product includes constructing a transcript adaptation training data corpus that includes a plurality of transcript normalization training data samples, wherein each of the plurality of transcript normalization training data samples includes: a predicted audio transcript that includes at least one numerical expression, an adapted audio transcript that includes an alphabetic representation of the at least one numerical expression, and a transcript normalization identifier that, when applied to a model input comprising a target audio transcript, defines a text-to-text transformation objective causing a numeric-to-alphabetic expression machine learning model to predict an alphabetic-equivalent audio transcript that represents each numerical expression included in the target audio transcript in one or more alphabetic tokens; configuring the numeric-to-alphabetic expression machine learning model based on a training of a machine learning text-to-text transformer model using th
    Type: Application
    Filed: July 11, 2023
    Publication date: November 30, 2023
    Applicant: SAS Institute Inc.
    Inventors: Xiaolong Li, Xiaozhuo Cheng, Xu Yang
  • Publication number: 20230360652
    Abstract: A system, method, and computer-program product includes constructing a transcript correction training data corpus that includes a plurality of labeled audio transcription training data samples, wherein each of the plurality of labeled audio transcription training data samples includes: an incorrect audio transcription of a target piece of audio data; a correct audio transcription of the target piece of audio data; and a transcript correction identifier that, when applied to a model input that includes a likely incorrect audio transcript, defines a text-to-text transformation objective causing an audio transcript correction machine learning model to predict a corrected audio transcript based on the likely incorrect audio transcript; configuring the audio transcript correction machine learning model based on a training of a machine learning text-to-text transformer model using the transcript correction training data corpus; and executing the audio transcript correction machine learning model within a speech-to-
    Type: Application
    Filed: June 26, 2023
    Publication date: November 9, 2023
    Applicant: SAS Institute Inc.
    Inventors: Xiaolong Li, Xiaozhuo Cheng, Xu Yang
  • Patent number: 11810572
    Abstract: A system, method, and computer-program product includes distributing a plurality of audio data files of a speech data corpus to a plurality of computing nodes that each implement a plurality of audio processing threads, executing the plurality of audio processing threads associated with each of the plurality of computing nodes to detect a plurality of tentative speakers participating in each of the plurality of audio data files, generating, via a clustering algorithm, a plurality of clusters of embedding signatures based on a plurality of embedding signatures associated with the plurality of tentative speakers in each of the plurality of audio data files, and detecting a plurality of global speakers associated with the speech data corpus based on the plurality of clusters of embedding signatures.
    Type: Grant
    Filed: June 8, 2023
    Date of Patent: November 7, 2023
    Assignee: SAS Institute Inc.
    Inventors: Xiaozhuo Cheng, Xiaolong Li, Xu Yang
  • Patent number: 11809460
    Abstract: A computer-implemented system includes identifying a target hierarchical taxonomy comprising a plurality of distinct hierarchical taxonomy categories; extracting a plurality of distinct taxonomy tokens from the plurality of distinct hierarchical taxonomy categories; computing a taxonomy vector corpus based on the plurality of distinct taxonomy tokens; computing a plurality of distinct taxonomy clusters based on an input of the taxonomy vector corpus; constructing a hierarchical taxonomy classifier based on the plurality of distinct taxonomy clusters; converting a volume of unlabeled structured datasets to a plurality of distinct corpora of taxonomy-labeled structured datasets based on the hierarchical taxonomy classifier; and outputting at least one corpus of taxonomy-labeled structured datasets of the plurality of distinct corpora of taxonomy-labeled structured datasets based on an input of a data classification query.
    Type: Grant
    Filed: July 13, 2023
    Date of Patent: November 7, 2023
    Assignee: SAS Institute Inc.
    Inventors: Nancy Anne Rausch, Ruth Oluwadamilola Akintunde, Brant Nathan Kay
  • Patent number: 11809915
    Abstract: A parallel processing technique can be used to expedite reconciliation of a hierarchy of forecasts on a computer system. As one example, the computer system can receive forecasts that have a hierarchical relationship with respect to one another. The computer system can distribute the forecasts among a group of computing nodes by time point, so that all data points corresponding to the same time point in the forecasts are assigned to the same computing node. The computing nodes can receive the datasets corresponding to the time points, organize the data points in each of the datasets by forecast to generate ordered datasets, and assign the ordered datasets to processing threads. The processing threads (across the computing nodes) can then execute a reconciliation process in parallel to one another to generate reconciled values, which can be output by the computing nodes.
    Type: Grant
    Filed: August 2, 2023
    Date of Patent: November 7, 2023
    Assignee: SAS Institute Inc.
    Inventors: Matthew Wayne Simpson, Caiqin Wang, Nilesh Jakhotiya, Michele Angelo Trovero
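Illustration for patent 11809915: the abstract distributes hierarchical forecasts so that all data points for one time point land on the same computing node, ordered by forecast. The sketch below shows that partitioning step with plain dictionaries standing in for nodes. The round-robin assignment, the data layout, and the toy proportional reconciliation are assumptions made for illustration; the abstract does not specify them, and real worker threads are omitted.

```python
from collections import defaultdict


def partition_by_time_point(forecasts, n_nodes):
    """forecasts: {series_name: {time_point: value}}.
    Returns one dict per node mapping time_point -> {series_name: value}."""
    time_points = sorted({t for series in forecasts.values() for t in series})
    nodes = [defaultdict(dict) for _ in range(n_nodes)]
    for i, t in enumerate(time_points):
        node = nodes[i % n_nodes]          # round-robin assignment (assumption)
        for name in sorted(forecasts):     # order data points by forecast name
            if t in forecasts[name]:
                node[t][name] = forecasts[name][t]
    return [dict(n) for n in nodes]


def reconcile_top_down(values, parent, children):
    """Toy reconciliation (assumption): scale the child forecasts so that
    they sum to the parent forecast at one time point."""
    total = sum(values[c] for c in children)
    scale = values[parent] / total if total else 0.0
    return {parent: values[parent],
            **{c: values[c] * scale for c in children}}
```

Because each time point is self-contained after partitioning, `reconcile_top_down` can be applied to every time point independently, which is what makes parallel execution possible.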
  • Patent number: 11798263
    Abstract: A computing system detects a defective object. An image is received of a manufacturing line that includes objects in a process of being manufactured. Each pixel included in the image is classified as a background pixel class, a non-defective object class, or a defective object class using a trained neural network model. The pixels included in the image that were classified as the non-defective object class or the defective object class are grouped into polygons. Each polygon is defined by a contiguous group of pixels classified as the non-defective object class or the defective object class. Each polygon is classified in the non-defective object class or in the defective object class based on a number of pixels included in a respective polygon that are classified in the non-defective object class relative to a number of pixels included in the respective polygon that are classified in the defective object class.
    Type: Grant
    Filed: April 4, 2023
    Date of Patent: October 24, 2023
    Assignee: SAS Institute Inc.
    Inventors: Kedar Shriram Prabhudesai, Jonathan Lee Walker, Sanjeev Shyam Heda, Varunraj Valsaraj, Allen Joseph Langlois, Frederic Combaneyre, Hamza Mustafa Ghadyali, Nabaruna Karmakar
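Illustration for patent 11798263: the abstract groups contiguous object pixels into polygons and labels each polygon from the relative counts of defective and non-defective pixels. The sketch below assumes the per-pixel classes have already been produced by some model and are given as a grid of integers. The 4-connectivity rule, the class codes, and the `defect_ratio` threshold are assumptions; the abstract says only that the counts are compared.

```python
from collections import deque

BACKGROUND, GOOD, DEFECT = 0, 1, 2   # hypothetical class codes


def classify_polygons(label_grid, defect_ratio=0.5):
    """Group contiguous non-background pixels (4-connectivity) and label each
    group by comparing its defective and non-defective pixel counts."""
    rows, cols = len(label_grid), len(label_grid[0])
    seen = [[False] * cols for _ in range(rows)]
    polygons = []
    for r in range(rows):
        for c in range(cols):
            if seen[r][c] or label_grid[r][c] == BACKGROUND:
                continue
            counts = {GOOD: 0, DEFECT: 0}
            queue = deque([(r, c)])
            seen[r][c] = True
            while queue:                      # breadth-first flood fill
                y, x = queue.popleft()
                counts[label_grid[y][x]] += 1
                for ny, nx in ((y + 1, x), (y - 1, x), (y, x + 1), (y, x - 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and not seen[ny][nx]
                            and label_grid[ny][nx] != BACKGROUND):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            size = counts[GOOD] + counts[DEFECT]
            label = DEFECT if counts[DEFECT] / size > defect_ratio else GOOD
            polygons.append({"pixels": size, "label": label})
    return polygons
```

Voting over a whole polygon means a few misclassified pixels inside an otherwise sound object do not flag that object as defective.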
  • Patent number: 11790036
    Abstract: A computing device trains a fair machine learning model. A predicted target variable is defined using a trained prediction model. The prediction model is trained with weighted observation vectors. The predicted target variable is updated using the prediction model trained with weighted observation vectors. A true conditional moments matrix and a false conditional moments matrix are computed. The training and updating with weighted observation vectors are repeated until a number of iterations is performed. When a computed conditional moments matrix indicates to adjust a bound value, the bound value is updated based on an upper bound value or a lower bound value, and the repeated training and updating with weighted observation vectors is repeated with the bound value replaced with the updated bound value until the conditional moments matrix indicates no further adjustment of the bound value is needed. A fair prediction model is trained with the updated bound value.
    Type: Grant
    Filed: November 2, 2022
    Date of Patent: October 17, 2023
    Assignee: SAS Institute Inc.
    Inventors: Xinmin Wu, Xin Jiang Hunt, Ralph Walter Abbey
  • Publication number: 20230317083
    Abstract: A system, method, and computer-program product includes distributing a plurality of audio data files of a speech data corpus to a plurality of computing nodes that each implement a plurality of audio processing threads, executing the plurality of audio processing threads associated with each of the plurality of computing nodes to detect a plurality of tentative speakers participating in each of the plurality of audio data files, generating, via a clustering algorithm, a plurality of clusters of embedding signatures based on a plurality of embedding signatures associated with the plurality of tentative speakers in each of the plurality of audio data files, and detecting a plurality of global speakers associated with the speech data corpus based on the plurality of clusters of embedding signatures.
    Type: Application
    Filed: June 8, 2023
    Publication date: October 5, 2023
    Applicant: SAS Institute Inc.
    Inventors: Xiaozhuo Cheng, Xiaolong Li, Xu Yang
  • Patent number: 11776090
    Abstract: An apparatus includes a processor to: receive an indication of ability of a node device to provide a resource for executing application routines, at least one identifier of at least one image including an executable routine stored within a cache of the node device, and an indication of at least one revision level of the at least one image; analyze the ability to provide the resource; in response to being able to support execution of the application routine, identify a first image in a repository; compare identifiers to determine whether there is a second image including a matching executable routine; in response to a match, compare revision levels; and in response to the revision level of the most recent version of the first image being more recent, retrieve the most recent version of the first image from the repository, and store it within the node device.
    Type: Grant
    Filed: December 23, 2021
    Date of Patent: October 3, 2023
    Assignee: SAS Institute Inc.
    Inventor: Jody Bridges Steadman
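Illustration for patent 11776090: the abstract compares image identifiers and revision levels between a node's cache and a repository, and retrieves an image when the repository holds a more recent revision. The sketch below models both stores as dictionaries. The integer revision levels, the function name `images_to_pull`, and the choice to also retrieve images that are missing from the cache are assumptions not stated in the abstract.

```python
def images_to_pull(cached, repository, needed):
    """cached / repository: {image_id: revision_level}; needed: image ids.
    Returns (image_id, revision) pairs that should be retrieved."""
    pulls = []
    for image_id in needed:
        latest = repository.get(image_id)
        if latest is None:
            continue                  # not in the repository, nothing to fetch
        have = cached.get(image_id)
        # Fetch when the cache has no matching image (assumption) or when
        # the repository revision is more recent than the cached one.
        if have is None or latest > have:
            pulls.append((image_id, latest))
    return pulls
```

For example, with `cached = {"runtime": 3}` and `repository = {"runtime": 5, "tools": 1}`, a request for both images yields a pull of `runtime` at revision 5 and `tools` at revision 1.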