Patents Assigned to SAS Institute Inc.
  • Publication number: 20220335947
    Abstract: An apparatus includes at least one processor to, in response to a request to perform speech-to-text conversion: perform a pause detection technique including analyzing speech audio to identify pauses, and analyzing lengths of the pauses to identify likely sentence pauses; perform a speaker diarization technique including dividing the speech audio into fragments, analyzing vocal characteristics of speech sounds of each fragment to identify a speaker of a set of speakers, and identifying instances of a change in speakers between each temporally consecutive pair of fragments to identify likely speaker changes; and perform speech-to-text operations including dividing the speech audio into segments based on at least the likely sentence pauses and likely speaker changes, using at least an acoustic model with each segment to identify likely speech sounds in the speech audio, and generating a transcript of the speech audio based at least on the likely speech sounds.
    Type: Application
    Filed: June 28, 2022
    Publication date: October 20, 2022
    Applicant: SAS Institute Inc.
    Inventors: Xiaolong Li, Samuel Norris Henderson, Xiaozhuo Cheng, Xu Yang
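As a rough illustration of the pause-analysis step in the abstract above, the sketch below flags pauses that are much longer than the typical pause as likely sentence pauses, then merges them with speaker-change times to get segment boundaries. The median-based cutoff, the `factor` parameter, and both function names are assumptions made for this example; the abstract does not say how pause lengths are analyzed.

```python
from statistics import median


def likely_sentence_pauses(pauses, factor=2.0):
    """Return the pauses whose duration is at least `factor` times the median.

    `pauses` is a list of (start_seconds, duration_seconds) tuples.
    The median-based cutoff is an assumption for illustration only.
    """
    if not pauses:
        return []
    cutoff = factor * median(duration for _, duration in pauses)
    return [(start, duration) for start, duration in pauses if duration >= cutoff]


def segment_boundaries(sentence_pauses, speaker_changes):
    """Merge pause start times and speaker-change times into sorted boundaries."""
    times = {start for start, _ in sentence_pauses} | set(speaker_changes)
    return sorted(times)


# Hypothetical pause data: (start time, duration), both in seconds.
pauses = [(1.2, 0.10), (2.9, 0.12), (4.5, 0.60), (6.1, 0.11), (9.8, 0.75)]
sentence_pauses = likely_sentence_pauses(pauses)
boundaries = segment_boundaries(sentence_pauses, speaker_changes=[7.3])
```

Here the median pause is 0.12 s, so only the 0.60 s and 0.75 s pauses pass the cutoff, and the speech would be split at 4.5 s, 7.3 s, and 9.8 s.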
  • Patent number: 11474863
    Abstract: An apparatus includes a processor to: derive an order of performance of a set of tasks of a job flow; based on the order of performance, store, within a task queue, a first task routine execution request message to cause a first task to be performed; within a first task container, and in response to storage of the first task routine execution request message, execute instructions of a first task routine of a set of task routines, store a mid-flow data set output of the first task within a federated area, and store a first task completion message within the task queue after completion of storage of the mid-flow data set; and in response to the storage of the first task completion message, and based on the order of performance, store, within the task queue, a second task routine execution request message to cause a second task to be performed.
    Type: Grant
    Filed: February 28, 2022
    Date of Patent: October 18, 2022
    Assignee: SAS INSTITUTE INC.
    Inventors: Henry Gabriel Victor Bequet, Chaowang “Ricky” Zhang
  • Publication number: 20220327660
    Abstract: An apparatus includes a processor to: receive an indication of ability of a node device to provide a resource for executing application routines, at least one identifier of at least one image including an executable routine stored within a cache of the node device, and an indication of at least one revision level of the at least one image; analyze the ability to provide the resource; in response to being able to support execution of the application routine, identify a first image in a repository; compare identifiers to determine whether there is a second image including a matching executable routine; in response to a match, compare revision levels; and in response to the revision level of the most recent version of the first image being more recent, retrieve the most recent version of the first image from the repository, and store it within the node device.
    Type: Application
    Filed: December 23, 2021
    Publication date: October 13, 2022
    Applicant: SAS Institute Inc.
    Inventor: Jody Bridges Steadman
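The identifier and revision comparison in the abstract above might look roughly like the following sketch. It assumes that images are keyed by a string identifier and that revision levels are integers where a higher value is more recent; the function name and the sample data are hypothetical.

```python
def images_to_retrieve(cache, repository):
    """Return identifiers of cached images that have a newer repository revision.

    `cache` and `repository` map an image identifier to a revision level.
    Integer revision levels (higher is more recent) are an assumption.
    """
    return sorted(
        image_id
        for image_id, cached_revision in cache.items()
        if image_id in repository and repository[image_id] > cached_revision
    )


# Hypothetical node cache and repository contents.
cache = {"scorer": 3, "etl": 5, "viz": 1}
repository = {"scorer": 4, "etl": 5, "report": 2}
stale = images_to_retrieve(cache, repository)
```

Only `scorer` is returned: `etl` is already current, and `viz` has no matching image in the repository.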
  • Patent number: 11460973
    Abstract: Node-link data can be converted into audio outputs. For example, a system can generate a graphical user interface (GUI) depicting a node-link diagram having nodes and links. The GUI can include a virtual reference point in the node-link diagram and a virtual control element that is rotatable around the virtual reference point by a user to contact one or more of the nodes in the node-link diagram. The system can receive user input for rotating the virtual control element around the virtual reference point, which can generate a contact between the virtual control element and a particular node of the node-link diagram. In response to detecting the contact, the system can determine a sound characteristic configured to indicate an attribute associated with the particular node. The system can then generate a sound having the sound characteristic, for example to assist the user in exploring the node-link diagram.
    Type: Grant
    Filed: April 11, 2022
    Date of Patent: October 4, 2022
    Assignee: SAS INSTITUTE INC.
    Inventors: Sean Patrick Mealin, Claude Edward Summers, II, Mitchel Stanley Soltys, II, Ralph Johnson Marshall, Jr., Jesse Daniel Sookne, Brice Joseph Smith, Gregory David Kraus, Eric Colin Bolender, Julianna Elizabeth Langston, Lisa Beth Morton Robinson
  • Patent number: 11455190
    Abstract: An apparatus includes a processor to: within a performance container, execute a performance routine to derive an order of performance of tasks of a job flow based on dependencies, begin performing the tasks, and store, within a job queue, a job performance status indication including task performance statuses; identify a set of sub flows within the job flow based on branches in the job flow; correlate each of the task performance statuses to a corresponding sub flow performance status; reduce the job performance status indication size by, for each sub flow in which all tasks have been completed, replace the corresponding task performance statuses with the corresponding sub flow performance status of completed, and for each sub flow with no task performed, replace the corresponding task performance statuses with the corresponding sub flow performance status of not executed; and transmit the job performance status indication to the requesting device.
    Type: Grant
    Filed: December 21, 2021
    Date of Patent: September 27, 2022
    Assignee: SAS INSTITUTE INC.
    Inventors: Henry Gabriel Victor Bequet, Ronald Earl Stogner, Eric Jian Yang, Qing Gong
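The status reduction described in the abstract above can be sketched as follows: a sub flow whose tasks are all completed, or all not executed, collapses to a single entry, while a partially performed sub flow keeps its per-task statuses. The status strings, the dictionary layout, and the function name are assumptions for this example.

```python
def reduce_status(task_status, sub_flows):
    """Collapse per-task statuses into per-sub-flow statuses where possible.

    `task_status` maps task name to "completed", "running" or "not executed".
    `sub_flows` maps sub flow name to the list of tasks it contains.
    """
    reduced = {}
    for name, tasks in sub_flows.items():
        states = {task_status[task] for task in tasks}
        if states == {"completed"}:
            reduced[name] = "completed"
        elif states == {"not executed"}:
            reduced[name] = "not executed"
        else:
            # Mixed sub flow: keep the individual task statuses.
            for task in tasks:
                reduced[task] = task_status[task]
    return reduced


# Hypothetical job flow with three sub flows.
task_status = {
    "load": "completed", "clean": "completed",
    "fit": "completed", "score": "running",
    "plot": "not executed", "export": "not executed",
}
sub_flows = {
    "prep": ["load", "clean"],
    "model": ["fit", "score"],
    "report": ["plot", "export"],
}
reduced = reduce_status(task_status, sub_flows)
```

Six task entries shrink to four: `prep` and `report` each become one entry, and only the in-progress `model` sub flow stays expanded.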
  • Patent number: 11443416
    Abstract: Various embodiments are generally directed to techniques for image content extraction. Some embodiments include extracting contextually structured data from document images, such as by automatically identifying document layout, document data, document metadata, and/or correlations therebetween in a document image, for instance. Several embodiments include extracting contextually structured data from table images, such as gridded and non-gridded tables. For example, the contents of cells may be extracted from a table image along with structural context including the corresponding row and column information. Many embodiments are directed to generating and utilizing a document template database for automatically extracting document image contents into a contextually structured format. Several embodiments are directed to automatically identifying and associating document metadata with corresponding document data in a document image to generate a machine-facilitated annotation of the document image.
    Type: Grant
    Filed: August 9, 2021
    Date of Patent: September 13, 2022
    Assignee: SAS INSTITUTE INC.
    Inventors: Yi Liao, Charles Franklin Board, William Robert Nadolski, David James Wheaton, Heather Michelle Goodykoontz, Adheesha Sanjuaya Arangala, Karthik Nakkeeran
  • Patent number: 11443198
    Abstract: A computing device learns a directed acyclic graph (DAG). An SSCP matrix is computed from variable values defined for observation vectors. A topological order vector is initialized that defines a topological order for the variables. A loss value is computed using the topological order vector and the SSCP matrix. (A) A neighbor determination method is selected. (B) A next topological order vector is determined relative to the initialized topological order vector using the neighbor determination method. (C) A loss value is computed using the next topological order vector and the SSCP matrix. (D) (B) and (C) are repeated until each topological order vector is determined in (B) based on the neighbor determination method. A best topological vector is determined from each next topological order vector based on having a minimum value for the computed loss value. An adjacency matrix is computed using the best topological vector and the SSCP matrix.
    Type: Grant
    Filed: November 9, 2021
    Date of Patent: September 13, 2022
    Assignee: SAS Institute, Inc.
    Inventors: Xilong Chen, Tao Huang, Jan Chvosta
  • Patent number: 11436438
    Abstract: (A) Conditional vectors are defined. (B) Latent observation vectors are generated using a predefined noise distribution function. (C) A forward propagation of a generator model is executed with the conditional vectors and the latent observation vectors as input to generate an output vector. (D) A forward propagation of a decoder model of a trained autoencoder model is executed with the generated output vector as input to generate a plurality of decoded vectors. (E) Transformed observation vectors are selected from transformed data based on the defined plurality of conditional vectors. (F) A forward propagation of a discriminator model is executed with the transformed observation vectors, the conditional vectors, and the decoded vectors as input to predict whether each transformed observation vector and each decoded vector is real or fake. (G) The discriminator and generator models are updated and (A) through (G) are repeated until training is complete.
    Type: Grant
    Filed: December 22, 2021
    Date of Patent: September 6, 2022
    Assignee: SAS Institute Inc.
    Inventors: Ruiwen Zhang, Weichen Wang, Jorge Manuel Gomes da Silva, Ye Liu, Hamoon Azizsoltani, Prathaban Mookiah
  • Patent number: 11435499
    Abstract: Tops of geological layers can be automatically identified using machine-learning techniques as described herein. In one example, a system can receive well log records associated with wellbores drilled through geological layers. The system can generate well clusters by applying a clustering process to the well log records. The system can then obtain a respective set of training data associated with a well cluster, train a machine-learning model based on the respective set of training data, select a target well-log record associated with a target wellbore of the well cluster, and provide the target well-log record as input to the trained machine-learning model. Based on an output from the trained machine-learning model, the system can determine the geological tops of the geological layers in a region surrounding the target wellbore. The system may then transmit an electronic signal indicating the geological tops of the geological layers associated with the target wellbore.
    Type: Grant
    Filed: September 22, 2021
    Date of Patent: September 6, 2022
    Assignee: SAS INSTITUTE INC.
    Inventors: Sergiy Peredriy, Keith Richard Holdaway
  • Patent number: 11436444
    Abstract: A computing device trains a fair machine learning model. A prediction model is trained to predict a target value. For a number of iterations, a weight vector is computed using the bound value based on fairness constraints defined for a fairness measure type; a weight value is assigned to each observation vector based on the target value and a sensitive attribute value; the prediction model is trained with each weighted observation vector to predict the target value; and a conditional moments vector is computed based on the fairness constraints and the target and sensitive attribute values. Conditional moments difference values are computed. When the conditional moments difference values indicate to adjust the bound value, the bound value is updated and the process is repeated with the bound value replaced with the updated bound value until the conditional moments difference values indicate no further adjustment of the bound value is needed.
    Type: Grant
    Filed: December 21, 2021
    Date of Patent: September 6, 2022
    Assignee: SAS Institute Inc.
    Inventors: Xinmin Wu, Xin Jiang Hunt
  • Patent number: 11423680
    Abstract: Text profiles can be leveraged to select and configure models according to some examples described herein. In one example, a system can analyze a reference textual dataset and a target textual dataset using text-mining techniques to generate a first text profile and a second text profile, respectively. The first text profile can contain first metrics characterizing the reference textual dataset and the second text profile can contain second metrics characterizing the target textual dataset. The system can determine a similarity value by comparing the first text profile to the second text profile. The system can also receive a user selection of a model that is to be applied to the target textual dataset. The system can then generate an insight relating to an anticipated accuracy of the model on the target textual dataset based on the similarity value. The system can output the insight to the user.
    Type: Grant
    Filed: December 30, 2021
    Date of Patent: August 23, 2022
    Assignee: SAS INSTITUTE INC.
    Inventors: Teresa S. Jade, Xiao Li, Chunqi Zuo, Paul Jeffrey Kovach
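The abstract above does not say how the two text profiles are compared. One simple possibility, shown here purely as an assumption, is cosine similarity over the metrics the profiles contain. The metric names and the function name are hypothetical.

```python
import math


def profile_similarity(reference, target):
    """Cosine similarity between two text profiles given as metric dictionaries.

    Cosine similarity is a stand-in chosen for this sketch; metrics missing
    from one profile are treated as 0.0.
    """
    keys = sorted(set(reference) | set(target))
    a = [reference.get(key, 0.0) for key in keys]
    b = [target.get(key, 0.0) for key in keys]
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


# Hypothetical profiles with made-up metric names and values.
reference_profile = {"avg_sentence_length": 18.0, "type_token_ratio": 0.42}
target_profile = {"avg_sentence_length": 11.0, "type_token_ratio": 0.61}
similarity = profile_similarity(reference_profile, target_profile)
```

In practice the metrics would likely need to be scaled to comparable ranges first, since a metric with large raw values would otherwise dominate the result.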
  • Publication number: 20220261281
    Abstract: An apparatus includes a processor to receive a request to provide a view of an object associated with a job flow, and in response to determining that the object is associated with a task type requiring access to a particular resource not accessible to a first interpretation routine: store, within a job queue, a job flow generation request message to cause generation of a job flow definition that defines another job flow for generating the requested view; within a task container in which a second interpretation routine that does have access to the particular resource is executed, generate the job flow definition; store, within a task queue, a job flow generation completion message that includes a copy of the job flow definition; use the job flow definition to perform the other job flow to generate the requested view; and transmit the requested view to the requesting device.
    Type: Application
    Filed: April 29, 2022
    Publication date: August 18, 2022
    Applicant: SAS Institute Inc.
    Inventors: Henry Gabriel Victor Bequet, Ronald Earl Stogner, Eric Jian Yang, Chaowang "Ricky" Zhang
  • Patent number: 11416712
    Abstract: A computing device generates synthetic tabular data.
    Type: Grant
    Filed: December 23, 2021
    Date of Patent: August 16, 2022
    Assignee: SAS Institute, Inc.
    Inventors: Amirhassan Fallah Dizche, Ye Liu, Xin Jiang Hunt, Jorge Manuel Gomes da Silva
  • Publication number: 20220253335
    Abstract: An apparatus includes a processor to: receive a request to perform a job flow; within a performance container, based on the data dependencies among a set of tasks of the job flow, derive an order of performance of the set of tasks that includes a subset able to be performed in parallel, and derive a quantity of task containers to enable the parallel performance of the subset; based on the derived quantity of task containers, derive a quantity of virtual machines (VMs) to enable the parallel performance of the subset; provide, to a VM allocation routine, an indication of a need for provision of the quantity of VMs; and store, within a task queue, multiple task routine execution request messages to enable parallel execution of task routines within the quantity of task containers to cause the parallel performance of the subset.
    Type: Application
    Filed: April 29, 2022
    Publication date: August 11, 2022
    Applicant: SAS Institute Inc.
    Inventors: Henry Gabriel Victor Bequet, Ronald Earl Stogner, Eric Jian Yang, Chaowang "Ricky" Zhang
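One way to picture the derivation in the abstract above is a level-by-level topological sort: tasks whose data dependencies are all satisfied form a wave that can run in parallel, the widest wave gives the number of task containers, and dividing by a per-VM capacity gives the number of VMs. This is a minimal sketch under those assumptions. The function names, the `containers_per_vm` parameter, and the sample job flow are hypothetical, and every task is assumed to appear as a key in `dependencies`.

```python
def parallel_order(dependencies):
    """Group tasks into waves; tasks within one wave can run in parallel.

    `dependencies` maps each task to the set of tasks whose output it needs.
    """
    remaining = {task: set(deps) for task, deps in dependencies.items()}
    waves = []
    while remaining:
        ready = sorted(task for task, deps in remaining.items() if not deps)
        if not ready:
            raise ValueError("cyclic or unknown dependency among tasks")
        waves.append(ready)
        for task in ready:
            del remaining[task]
        for deps in remaining.values():
            deps.difference_update(ready)
    return waves


def vms_needed(containers, containers_per_vm):
    """Ceiling division: VMs required to host the given number of containers."""
    return -(-containers // containers_per_vm)


# Hypothetical job flow: two model fits can run side by side.
dependencies = {
    "load": set(),
    "clean": {"load"},
    "fit_a": {"clean"},
    "fit_b": {"clean"},
    "compare": {"fit_a", "fit_b"},
}
waves = parallel_order(dependencies)
containers = max(len(wave) for wave in waves)
vms = vms_needed(containers, containers_per_vm=2)
```

For this job flow the widest wave holds `fit_a` and `fit_b`, so two task containers suffice, and with two containers per VM a single VM is enough.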
  • Patent number: 11409966
    Abstract: An apparatus to: analyze a data set to identify a candidate topic not in a set of topics; determine whether the prominence of the candidate topic within the data set meets a threshold; in response to meeting the threshold, retrieve a rate of increase in frequency of the candidate topic in online searches; in response to meeting a threshold rate of increase, retrieve the keyword most frequently used in online searches for the candidate topic, use the keyword to retrieve a supplemental data set, and analyze input data extracted from the supplemental data set to determine whether the candidate topic can change the accuracy of a forecast model; and in response to determining that the candidate topic can change the accuracy, add the candidate topic to the set of topics and replace the forecast model with a forecast model trained for the set of topics augmented with the candidate topic.
    Type: Grant
    Filed: December 17, 2021
    Date of Patent: August 9, 2022
    Assignee: SAS INSTITUTE INC.
    Inventors: Anand Arun Phand, Sudeshna Guhaneogi, Narender Ceechamangalam Veeraraghavan, Ravinder Singh Chauhan, Shikha Bhat, Kaustubh Yashvant Khandwe, Shalini Sinha, Vineet Roy, Alina Olegovna Asadullina, Vitaly Igorevich Plekhanov, Elizaveta Alekseevna Lavrenova, Dmitry Sergeevich Bodunov, Assol Raufjonovna Kubaeva, Stephen Joseph Ondrik, Steffen-Horst Schlüter, Joseph Michael Martino, John Zhiqiang Zhao, Pravinkumar Bhalerao, Valentina Larina
  • Patent number: 11404053
    Abstract: An apparatus includes processor(s) to: generate a set of candidate n-grams based on probability distributions from an acoustic model for candidate graphemes of a next word most likely spoken following at least one preceding word spoken within speech audio; provide the set of candidate n-grams to multiple devices; provide, to each node device, an indication of which candidate n-grams are to be searched for within the n-gram corpus by each node device to enable searches for multiple candidate n-grams to be performed, independently and at least partially in parallel, across the node devices; receive, from each node device, an indication of a probability of occurrence of at least one candidate n-gram within the speech audio; based on the received probabilities of occurrence, identify the next word most likely spoken within the speech audio; and add the next word most likely spoken to a transcript of the speech audio.
    Type: Grant
    Filed: July 8, 2021
    Date of Patent: August 2, 2022
    Assignee: SAS INSTITUTE INC.
    Inventors: Xiaozhuo Cheng, Xu Yang, Xiaolong Li, Biljana Belamaric Wilsey, Haipeng Liu, Jared Peterson
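The distribute-then-combine pattern in the abstract above can be sketched as follows: candidate n-grams are split across node devices, each node reports a probability for the candidates it searched, and the coordinator picks the most probable one. The round-robin split, the tuple representation of n-grams, and all names and probabilities here are assumptions; the actual corpus search on each node is not modeled.

```python
def assign_candidates(candidates, node_count):
    """Split candidate n-grams across nodes round-robin (an assumed policy)."""
    return [candidates[i::node_count] for i in range(node_count)]


def most_likely_ngram(reports):
    """Merge per-node {n-gram: probability} reports and return the best n-gram."""
    merged = {}
    for report in reports:
        merged.update(report)
    return max(merged, key=merged.get)


# Hypothetical candidates for the word following "the".
candidates = [("the", "cat"), ("the", "cap"), ("the", "cab")]
assignments = assign_candidates(candidates, node_count=2)

# Made-up probabilities standing in for each node's corpus search result.
reports = [
    {("the", "cat"): 0.6, ("the", "cab"): 0.1},
    {("the", "cap"): 0.3},
]
best = most_likely_ngram(reports)
next_word = best[-1]
```

With these made-up numbers the coordinator would append "cat" to the transcript.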
  • Patent number: 11403527
    Abstract: A computing device trains a neural network machine learning model. A forward propagation of a first neural network is executed. A backward propagation of the first neural network is executed from a last layer to a last convolution layer to compute a gradient vector. A discriminative localization map is computed for each observation vector with the computed gradient vector using a discriminative localization map function. An activation threshold value is selected for each observation vector from at least two different values based on a prediction error of the first neural network. A biased feature map is computed for each observation vector based on the activation threshold value selected for each observation vector. A masked observation vector is computed for each observation vector using the biased feature map. A forward and a backward propagation of a second neural network is executed a predefined number of iterations using the masked observation vector.
    Type: Grant
    Filed: October 13, 2021
    Date of Patent: August 2, 2022
    Assignee: SAS Institute Inc.
    Inventors: Xinmin Wu, Yingjian Wang, Xiangqian Hu
  • Patent number: 11379685
    Abstract: A computing device classifies unclassified observations. A first batch of unclassified observation vectors and a first batch of classified observation vectors are selected. A prior regularization error value and a decoder reconstruction error value are computed. A first batch of noise observation vectors is generated. An evidence lower bound (ELBO) value is computed. A gradient of an encoder neural network model is computed, and the ELBO value is updated. A decoder neural network model and an encoder neural network model are updated. The decoder neural network model is trained. The target variable value is determined for each observation vector of the unclassified observation vectors based on an output of the trained decoder neural network model. The target variable value is output.
    Type: Grant
    Filed: July 28, 2021
    Date of Patent: July 5, 2022
    Assignee: SAS Institute Inc.
    Inventor: Xu Chen
  • Patent number: 11379743
    Abstract: A computing device determines a recommendation. (A) A first parameter matrix is updated using a first direction matrix and a first step-size parameter value that is greater than one. The first parameter matrix includes a row dimension equal to a number of users of a plurality of users included in a ratings matrix and the ratings matrix includes a missing matrix value. (B) A second parameter matrix is updated using a second direction matrix and a second step-size parameter value that is greater than one. The second parameter matrix includes a column dimension equal to a number of items of a plurality of items included in the ratings matrix. (C) An objective function value is updated based on the first parameter matrix and the second parameter matrix. (D) (A) through (C) are repeated until the first parameter matrix and the second parameter matrix satisfy a convergence test.
    Type: Grant
    Filed: July 28, 2021
    Date of Patent: July 5, 2022
    Assignee: SAS Institute Inc.
    Inventors: Xuejun Liao, Patrick Nathan Koch, Shunping Huang, Yan Xu
  • Patent number: 11373121
    Abstract: The computing device transforms lab data and field data into a first format suitable for execution with a supervised machine learning model to determine an input variable importance for a first set of input variables in predicting a field outcome. Based on the determination, the computing device generates one or more logical rules of decision metrics, selects the one or more input variables that yields a higher input variable importance, and generates one or more pass-fail indicators. The computing device combines the one or more pass-fail indicators and generates one or more prediction factor rules. The computing device transforms the field data and the one or more prediction factor rules into a second format suitable for execution with a model to determine a treatment effect for the one or more prediction factor rules. The computing device selects the prediction factor rule that maximizes the treatment effect.
    Type: Grant
    Filed: September 23, 2021
    Date of Patent: June 28, 2022
    Assignee: SAS Institute Inc.
    Inventors: John Wesley Gottula, Bryan Matthew Mutell, Michael Lee Henderson, II