Patents Assigned to SAS Institute Inc.
  • Publication number: 20230360652
    Abstract: A system, method, and computer-program product includes constructing a transcript correction training data corpus that includes a plurality of labeled audio transcription training data samples, wherein each of the plurality of labeled audio transcription training data samples includes: an incorrect audio transcription of a target piece of audio data; a correct audio transcription of the target piece of audio data; and a transcript correction identifier that, when applied to a model input that includes a likely incorrect audio transcript, defines a text-to-text transformation objective causing an audio transcript correction machine learning model to predict a corrected audio transcript based on the likely incorrect audio transcript; configuring the audio transcript correction machine learning model based on a training of a machine learning text-to-text transformer model using the transcript correction training data corpus; and executing the audio transcript correction machine learning model within a speech-to-
    Type: Application
    Filed: June 26, 2023
    Publication date: November 9, 2023
    Applicant: SAS Institute Inc.
    Inventors: Xiaolong Li, Xiaozhuo Cheng, Xu Yang
  • Patent number: 11810572
    Abstract: A system, method, and computer-program product includes distributing a plurality of audio data files of a speech data corpus to a plurality of computing nodes that each implement a plurality of audio processing threads, executing the plurality of audio processing threads associated with each of the plurality of computing nodes to detect a plurality of tentative speakers participating in each of the plurality of audio data files, generating, via a clustering algorithm, a plurality of clusters of embedding signatures based on a plurality of embedding signatures associated with the plurality of tentative speakers in each of the plurality of audio data files, and detecting a plurality of global speakers associated with the speech data corpus based on the plurality of clusters of embedding signatures.
    Type: Grant
    Filed: June 8, 2023
    Date of Patent: November 7, 2023
    Assignee: SAS INSTITUTE INC.
    Inventors: Xiaozhuo Cheng, Xiaolong Li, Xu Yang
  • Patent number: 11809460
    Abstract: A computer-implemented system includes identifying a target hierarchical taxonomy comprising a plurality of distinct hierarchical taxonomy categories; extracting a plurality of distinct taxonomy tokens from the plurality of distinct hierarchical taxonomy categories; computing a taxonomy vector corpus based on the plurality of distinct taxonomy tokens; computing a plurality of distinct taxonomy clusters based on an input of the taxonomy vector corpus; constructing a hierarchical taxonomy classifier based on the plurality of distinct taxonomy clusters; converting a volume of unlabeled structured datasets to a plurality of distinct corpora of taxonomy-labeled structured datasets based on the hierarchical taxonomy classifier; and outputting at least one corpus of taxonomy-labeled structured datasets of the plurality of distinct corpora of taxonomy-labeled structured datasets based on an input of a data classification query.
    Type: Grant
    Filed: July 13, 2023
    Date of Patent: November 7, 2023
    Assignee: SAS Institute, Inc.
    Inventors: Nancy Anne Rausch, Ruth Oluwadamilola Akintunde, Brant Nathan Kay
  • Patent number: 11809915
    Abstract: A parallel processing technique can be used to expedite reconciliation of a hierarchy of forecasts on a computer system. As one example, the computer system can receive forecasts that have a hierarchical relationship with respect to one another. The computer system can distribute the forecasts among a group of computing nodes by time point, so that all data points corresponding to the same time point in the forecasts are assigned to the same computing node. The computing nodes can receive the datasets corresponding to the time points, organize the data points in each of the datasets by forecast to generate ordered datasets, and assign the ordered datasets to processing threads. The processing threads (across the computing nodes) can then execute a reconciliation process in parallel to one another to generate reconciled values, which can be output by the computing nodes.
    Type: Grant
    Filed: August 2, 2023
    Date of Patent: November 7, 2023
    Assignee: SAS Institute Inc.
    Inventors: Matthew Wayne Simpson, Caiqin Wang, Nilesh Jakhotiya, Michele Angelo Trovero
  • Patent number: 11798263
    Abstract: A computing system detects a defective object. An image is received of a manufacturing line that includes objects in a process of being manufactured. Each pixel included in the image is classified as a background pixel class, a non-defective object class, or a defective object class using a trained neural network model. The pixels included in the image that were classified as the non-defective object class or the defective object class are grouped into polygons. Each polygon is defined by a contiguous group of pixels classified as the non-defective object class or the defective object class. Each polygon is classified in the non-defective object class or in the defective object class based on a number of pixels included in a respective polygon that are classified in the non-defective object class relative to a number of pixels included in the respective polygon that are classified in the defective object class.
    Type: Grant
    Filed: April 4, 2023
    Date of Patent: October 24, 2023
    Assignee: SAS Institute Inc.
    Inventors: Kedar Shriram Prabhudesai, Jonathan Lee Walker, Sanjeev Shyam Heda, Varunraj Valsaraj, Allen Joseph Langlois, Frederic Combaneyre, Hamza Mustafa Ghadyali, Nabaruna Karmakar
  • Patent number: 11790036
    Abstract: A computing device trains a fair machine learning model. A predicted target variable is defined using a trained prediction model. The prediction model is trained with weighted observation vectors. The predicted target variable is updated using the prediction model trained with weighted observation vectors. A true conditional moments matrix and a false conditional moments matrix are computed. The training and updating with weighted observation vectors are repeated until a number of iterations is performed. When a computed conditional moments matrix indicates to adjust a bound value, the bound value is updated based on an upper bound value or a lower bound value, and the repeated training and updating with weighted observation vectors is repeated with the bound value replaced with the updated bound value until the conditional moments matrix indicates no further adjustment of the bound value is needed. A fair prediction model is trained with the updated bound value.
    Type: Grant
    Filed: November 2, 2022
    Date of Patent: October 17, 2023
    Assignee: SAS Institute Inc.
    Inventors: Xinmin Wu, Xin Jiang Hunt, Ralph Walter Abbey
  • Publication number: 20230317083
    Abstract: A system, method, and computer-program product includes distributing a plurality of audio data files of a speech data corpus to a plurality of computing nodes that each implement a plurality of audio processing threads, executing the plurality of audio processing threads associated with each of the plurality of computing nodes to detect a plurality of tentative speakers participating in each of the plurality of audio data files, generating, via a clustering algorithm, a plurality of clusters of embedding signatures based on a plurality of embedding signatures associated with the plurality of tentative speakers in each of the plurality of audio data files, and detecting a plurality of global speakers associated with the speech data corpus based on the plurality of clusters of embedding signatures.
    Type: Application
    Filed: June 8, 2023
    Publication date: October 5, 2023
    Applicant: SAS Institute Inc.
    Inventors: Xiaozhuo Cheng, Xiaolong Li, Xu Yang
  • Patent number: 11776090
    Abstract: An apparatus includes a processor to: receive an indication of ability of a node device to provide a resource for executing application routines, at least one identifier of at least one image including an executable routine stored within a cache of the node device, and an indication of at least one revision level of the at least one image; analyze the ability to provide the resource; in response to being able to support execution of the application routine, identify a first image in a repository; compare identifiers to determine whether there is a second image including a matching executable routine; in response to a match, compare revision levels; and in response to the revision level of the most recent version of the first image being more recent, retrieve the most recent version of the first image from the repository, and store it within the node device.
    Type: Grant
    Filed: December 23, 2021
    Date of Patent: October 3, 2023
    Assignee: SAS Institute Inc.
    Inventor: Jody Bridges Steadman
  • Patent number: 11775878
    Abstract: A computing device selects new test configurations for testing software. Software under test is executed with first test configurations to generate a test result for each test configuration. Each test configuration includes a value for each test parameter where each test parameter is an input to the software under test. A predictive model is trained using each test configuration of the first test configurations in association with the test result generated for each test configuration based on an objective function value. The predictive model is executed with second test configurations to predict the test result for each test configuration of the second test configurations. Test configurations are selected from the second test configurations based on the predicted test results to define third test configurations. The software under test is executed with the defined third test configurations to generate the test result for each test configuration of the third test configurations.
    Type: Grant
    Filed: November 10, 2021
    Date of Patent: October 3, 2023
    Assignee: SAS Institute Inc.
    Inventors: Yan Gao, Joshua David Griffin, Yu-Min Lin, Bengt Wisen Pederson, Ricky Dee Tharrington, Jr., Pei-Yi Tan, Raymond Eugene Wright
  • Patent number: 11776545
    Abstract: An apparatus includes a processor to: receive a request to perform speech-to-text conversion of a speech data set; perform pause detection to identify a set of likely sentence pauses and/or speaker diarization technique to identify a set of likely speaker changes; based the set of likely sentence pauses and/or the set of likely speaker changes, divide the speech data set into data segments representing speech segments; use an acoustic model with the data segments to derive sets of probabilities of speech sounds uttered; store the sets of probabilities in temporal order within a buffer queue; distribute the sets of probabilities from the buffer queue in temporal order among threads of a thread pool; and within each thread, and based on set(s) of probabilities, derive one candidate word and select either the candidate word or an alternate candidate word derived from a language model as the next word most likely spoken.
    Type: Grant
    Filed: November 28, 2022
    Date of Patent: October 3, 2023
    Assignee: SAS Institute Inc.
    Inventors: Xiaolong Li, Xiaozhuo Cheng, Samuel Norris Henderson, Xu Yang
  • Patent number: 11775341
    Abstract: An apparatus includes a processor to receive a request to provide a view of an object associated with a job flow, and in response to determining that the object is associated with a task type requiring access to a particular resource not accessible to a first interpretation routine: store, within a job queue, a job flow generation request message to cause generation of a job flow definition the defines another job flow for generating the requested view; within a task container in which a second interpretation routine that does have access to the particular resource is executed, generate the job flow definition; store, within a task queue, a job flow generation completion message that includes a copy of the job flow definition; use the job flow definition to perform the other job flow to generate the requested view; and transmit the requested view to the requesting device.
    Type: Grant
    Filed: April 29, 2022
    Date of Patent: October 3, 2023
    Assignee: SAS Institute Inc.
    Inventors: Henry Gabriel Victor Bequet, Ronald Earl Stogner, Eric Jian Yang, Chaowang “Ricky” Zhang
  • Patent number: 11769350
    Abstract: A computer system can automatically analyze a video of a physical activity and provide corresponding feedback. For example, the system can receive a video file including image frames showing an entity performing a physical activity that involves a sequence of movement phases. The system can generate coordinate sets by performing image analysis on the image frames. The system can provide the coordinate sets as input to a trained model, the trained model being configured to assign scores and movement phases to the image frames based on the coordinate sets. The system can then select a particular movement phase for which to provide feedback, based on the scores and movement phases assigned to the image frames. The system can generate the feedback for the entity about their performance of the particular movement phase, which may improve the entity's future performance of that particular movement phase.
    Type: Grant
    Filed: October 20, 2022
    Date of Patent: September 26, 2023
    Assignee: SAS Institute, Inc.
    Inventors: Ji Shen, Jared Langford Dean, Xilong Chen, Jan Chvosta
  • Publication number: 20230297409
    Abstract: An apparatus includes at least one node device to host a computing cluster, and at least one processor to generate a UI providing guidance through a set of configuration settings for the computing cluster, wherein, for each configuration setting that is received as an input during configuration, the at least one processor is caused to: perform a check of the set of configuration settings to determine whether the received configuration setting creates a conflict among the set of configuration settings; and in response to a determination that the received configuration setting creates a conflict among the set of configuration settings, perform operations including generate an indication of the conflict for presentation by the UI, and receive a change to a configuration setting as an input from the input device.
    Type: Application
    Filed: March 17, 2023
    Publication date: September 21, 2023
    Applicant: SAS Institute Inc.
    Inventors: Richard K. Wellum, Joseph Daniel Henry, Holden Ernest O'Neal, John W. Waller
  • Patent number: 11762705
    Abstract: An apparatus includes at least one node device to host a computing cluster, and at least one processor to: use at least one of a level of resource observed to be consumed by operation of the computing cluster or a level of performance observed to be provided by operation of the computing cluster as an input to a pre-existing cluster model to derive a predicted level; compare the predicted level to a corresponding observed level of resource consumed or performance provided; and in response to the predicted level not matching the observed level to within a pre-selected degree, derive a new cluster model from observations of the operation of the computing cluster, and generate a prompt to perform repeat the configuration of the computing cluster using the new cluster model in place of the pre-existing cluster model to generate a new set of configuration settings for the computing cluster.
    Type: Grant
    Filed: March 17, 2023
    Date of Patent: September 19, 2023
    Assignee: SAS Institute Inc.
    Inventors: Richard K. Wellum, Joseph Daniel Henry, Holden Ernest O'Neal, John W. Waller
  • Patent number: 11762689
    Abstract: An apparatus including a processor to: output a first request message onto a group sub-queue shared by multiple task containers to request execution of a first task routine; within a task container, respond to the first request message, by outputting a first task in-progress message onto an individual sub-queue not shared with other task containers to accede to executing the first task routine, followed by a task completion message; and respond to the task completion message by allowing the task completion message to remain on the individual sub-queue to keep the task container from executing another task routine from another request message on the group sub-queue, outputting a second request message onto the individual sub-queue to cause execution of a second task routine within the same task container to perform a second task, and responding to the second task in-progress message by de-queuing the task completion message.
    Type: Grant
    Filed: December 30, 2022
    Date of Patent: September 19, 2023
    Assignee: SAS Institute Inc.
    Inventors: Henry Gabriel Victor Bequet, Ronald Earl Stogner, Eric Jian Yang, Chaowang “Ricky” Zhang
  • Publication number: 20230289219
    Abstract: An apparatus includes a processor to receive a request to provide a view of an object associated with a job flow, and in response to determining that the object is associated with a task type requiring access to a particular resource not accessible to a first interpretation routine: store, within a job queue, a job flow generation request message to cause generation of a job flow definition the defines another job flow for generating the requested view; within a task container in which a second interpretation routine that does have access to the particular resource is executed, generate the job flow definition; store, within a task queue, a job flow generation completion message that includes a copy of the job flow definition; use the job flow definition to perform the other job flow to generate the requested view; and transmit the requested view to the requesting device.
    Type: Application
    Filed: April 29, 2022
    Publication date: September 14, 2023
    Applicant: SAS Institute Inc.
    Inventors: Henry Gabriel Victor Bequet, Ronald Earl Stogner, Eric Jian Yang, Chaowang "Ricky" Zhang
  • Publication number: 20230289218
    Abstract: An apparatus includes a processor to: receive a request to perform a job flow; within a performance container, based on the data dependencies among a set of tasks of the job flow, derive an order of performance of the set of tasks that includes a subset able to be performed in parallel, and derive a quantity of task containers to enable the parallel performance of the subset; based on the derived quantity of task containers, derive a quantity of virtual machines (VMs) to enable the parallel performance of the subset; provide, to a VM allocation routine, an indication of a need for provision of the quantity of VMs; and store, within a task queue, multiple task routine execution request messages to enable parallel execution of task routines within the quantity of task containers to cause the parallel performance of the subset.
    Type: Application
    Filed: April 29, 2022
    Publication date: September 14, 2023
    Applicant: SAS Institute Inc.
    Inventors: Henry Gabriel Victor Bequet, Ronald Earl Stogner, Eric Jian Yang, Chaowang "Ricky" Zhang
  • Patent number: 11757725
    Abstract: Groups of connected nodes in a network of nodes can be detected for evaluating and mitigating risks of the network of nodes. For example, a system can process one or more subnetworks of the network of nodes in parallel. For each subnetwork, the system can identify root nodes and their reachable nodes to create rooted groups of connected nodes. The system then can determine outdegrees of the remaining nodes in the network. The system can identify reachable nodes from a remaining node of the highest outdegree to create a nonrooted group of connected nodes. The system can estimate a risk value based on the number of rooted groups and nonrooted groups, the number of nodes in each rooted group and nonrooted group, and the attributes of the nodes in each group. The system can mitigate potential risks by reconfiguring the network of nodes.
    Type: Grant
    Filed: February 15, 2023
    Date of Patent: September 12, 2023
    Assignee: SAS Institute, Inc.
    Inventor: Himanshu Chandrakant Bhambhlani
  • Patent number: 11748159
    Abstract: An apparatus including a processor to: within a kill container, in response to a set of error messages indicative of errors in executing multiple instances of a task routine to perform a task of a job flow with multiple data object blocks of a data object, and in response to the quantity of error messages reaching a threshold, output a kill tasks request message that identifies the job flow; within a task container, in response to the kill tasks request message, cease execution of the task routine and output a task cancelation message that identifies the task and the job flow; and within a performance container, in response to he task cancelation message, output a job cancelation message to cause the transmission of an indication of cancelation of the job flow, via a network, and to a requesting device that requested the performance of the job flow.
    Type: Grant
    Filed: December 30, 2022
    Date of Patent: September 5, 2023
    Assignee: SAS INSTITUTE INC.
    Inventors: Henry Gabriel Victor Bequet, Ronald Earl Stogner, Eric Jian Yang, Chaowang “Ricky” Zhang
  • Patent number: 11748158
    Abstract: An apparatus includes a processor to: output a request message to cause a first task to be performed; within a task container, in response to the request message and a data object not being divided, divide the data object into a set of data object blocks based on at least the sizes of the data object and the atomic unit of organization of data therein, as well as the storage resources allocated to task containers, and output a task completion message indicating that the first task has been performed, and including a set of data block identifiers indicating the location of the set of data object blocks within at least one federated area; and in response to the task completion message, output a set of request messages to cause a second task to be performed by executing multiple instances of a task routine within multiple task containers.
    Type: Grant
    Filed: December 30, 2022
    Date of Patent: September 5, 2023
    Assignee: SAS Institute Inc.
    Inventors: Henry Gabriel Victor Bequet, Ronald Earl Stogner, Eric Jian Yang, Chaowang “Ricky” Zhang