Patents by Inventor Stefan Larson

Stefan Larson has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Systems and methods for subscriber-based adaptation of production-implemented machine learning models of a service provider using a training application

Patent number: 12567000

Abstract: A system and method for accelerating an adaptation of one or more machine learning models of a data handling and governance service includes implementing a data handling and governance platform digitally accessible by a target subscriber of the data handling and governance service, wherein the data handling and governance platform interfaces with a plurality of distinct subscriber-agnostic digital content machine learning classification models of the data handling and governance service; identifying a target subscriber-agnostic digital content machine learning classification model; adapting the target subscriber-agnostic digital content machine learning classification model to a subscriber-specific digital content machine learning classification model based on a training of the target subscriber-agnostic digital content machine learning classification model with at least one training corpus comprising subscriber-specific training data samples; and implementing the subscriber-specific digital content machine l

Type: Grant

Filed: November 21, 2022

Date of Patent: March 3, 2026

Assignee: DryvIQ, Inc.

Inventors: Steve Woodward, Shaun Becker, Stefan Larson
Systems and methods for machine learning classification-based automated remediations and handling of data items

Patent number: 12361071

Abstract: Systems and methods for automated digital filing includes classifying digital files and curating a number of corpora of digital files based on the classifications associated with each of the digital files. In response to the classification label associated with each distinct corpus of digital files, selectively executing a set of computer instructions associated with the classification label that, when executed, modifies a digital state of a target corpus of digital files.

Type: Grant

Filed: July 15, 2022

Date of Patent: July 15, 2025

Assignee: DryvIQ, Inc.

Inventors: Steve Woodward, Shaun Becker, Stefan Larson
Systems and methods for machine learning-based classification of digital computer files using file metadata

Patent number: 12287761

Abstract: A system and method for accelerated content classification and routing of digital files in a data handling and data governance service includes identifying a digital computer file; sequentially routing the digital computer file to one or more machine learning-based content classification models of a plurality of distinct machine learning-based content classification models based on a service-defined model instantiation and execution sequence, wherein: the service-defined model instantiation and execution sequence defines a model instantiation and execution order for the plurality of distinct machine learning-based content classification models that enables a fast content classification of the digital computer file while minimizing a computation time or runtime of the one or more machine learning-based content classification models; computing, via a machine learning-based filename classification model, a content classification inference based on extracted filename feature data of the digital computer file; and

Type: Grant

Filed: February 6, 2023

Date of Patent: April 29, 2025

Assignee: DryvIQ, Inc.

Inventors: Steve Woodward, Alexis Johnson, Stefan Larson, Shaun Becker
Systems and methods for intelligent digital item discovery and machine learning-informed handling of digital items and digital item governance

Patent number: 12259847

Abstract: Systems and methods of computing classifications for and migrating digital content that includes accessing a digital content corpus within a source data storage system; in response to accessing the digital content corpus, for each distinct item of digital content of the plurality of distinct items of digital content: computing, via one or more digital content machine learning classification models, a content classification inference; identifying automated digital content handling tasks of a plurality of distinct digital content handling tasks based on the content classification inference; executing the automated content handling tasks identified for each distinct item of digital content, wherein executing the automated content handling tasks includes: designating a storage location within a target data storage system based on the in-migration content classification inference; and migrating a respective item of digital content from the source data storage system to the designated storage location within the ta

Type: Grant

Filed: November 11, 2022

Date of Patent: March 25, 2025

Assignee: DryvIQ, Inc.

Inventors: Steve Woodward, Shaun Becker, Stefan Larson
SYSTEMS AND METHODS FOR MACHINE LEARNING-BASED CLASSIFICATION OF DIGITAL COMPUTER FILES USING FILE METADATA

Publication number: 20230251999

Abstract: A system and method for accelerated content classification and routing of digital files in a data handling and data governance service includes identifying a digital computer file; sequentially routing the digital computer file to one or more machine learning-based content classification models of a plurality of distinct machine learning-based content classification models based on a service-defined model instantiation and execution sequence, wherein: the service-defined model instantiation and execution sequence defines a model instantiation and execution order for the plurality of distinct machine learning-based content classification models that enables a fast content classification of the digital computer file while minimizing a computation time or runtime of the one or more machine learning-based content classification models; computing, via a machine learning-based filename classification model, a content classification inference based on extracted filename feature data of the digital computer file; and

Type: Application

Filed: February 6, 2023

Publication date: August 10, 2023

Inventors: Steve Woodward, Alexis Johnson, Stefan Larson, Shaun Becker
SYSTEMS AND METHODS FOR AUTOMATICALLY SOURCING CORPORA OF TRAINING AND TESTING DATA SAMPLES FOR TRAINING AND TESTING A MACHINE LEARNING MODEL

Publication number: 20230122684

Abstract: A system and method of curating machine learning training data for improving a predictive accuracy of a machine learning model includes sourcing training data samples based on seeding instructions; returning a corpus of unlabeled training data samples based on a search of data repositories; assigning a distinct classification labels to each of the training data samples of the corpus; computing efficacy metrics for an in-scope corpus of labeled training data samples derived from a subset of training data samples of the corpus that have been assigned one of the plurality of distinct classification labels, wherein the efficacy metrics identify whether the in-scope corpus of labeled training data samples is suitable for training a target machine learning model; and routing the in-scope corpus of labeled training data samples based on the efficacy metrics.

Type: Application

Filed: October 19, 2022

Publication date: April 20, 2023

Inventors: Stefan Larson, Steve Woodward, Shuan Becker
SYSTEMS AND METHODS FOR INTELLIGENT DIGITAL ITEM DISCOVERY AND MACHINE LEARNING-INFORMED HANDLING OF DIGITAL ITEMS AND DIGITAL ITEM GOVERNANCE

Publication number: 20230072498

Abstract: Systems and methods of computing classifications for and migrating digital content that includes accessing a digital content corpus within a source data storage system; in response to accessing the digital content corpus, for each distinct item of digital content of the plurality of distinct items of digital content: computing, via one or more digital content machine learning classification models, a content classification inference; identifying automated digital content handling tasks of a plurality of distinct digital content handling tasks based on the content classification inference; executing the automated content handling tasks identified for each distinct item of digital content, wherein executing the automated content handling tasks includes: designating a storage location within a target data storage system based on the in-migration content classification inference; and migrating a respective item of digital content from the source data storage system to the designated storage location within the ta

Type: Application

Filed: November 11, 2022

Publication date: March 9, 2023

Inventors: Steve Woodward, Shaun Becker, Stefan Larson
SYSTEMS AND METHODS FOR MACHINE LEARNING CLASSIFICATION-BASED AUTOMATED REMEDIATIONS AND HANDLING OF DATA ITEMS

Publication number: 20230017384

Abstract: A machine learning-informed method executing automated workflows for digital file handling includes computing, by file classification machine learning models, a machine learning classification inference for each of a plurality of distinct digital files of a corpus of digital files; curating a plurality of distinct sub-corpora of digital files based on the machine learning classification inference associated with each of the plurality of distinct digital files, wherein the at least one machine learning classification inference comprises a digital file type classification inference of a plurality of distinct digital file type classification inferences; and selectiyely executing an automated digital file handling workflow for at least one sub-corpus based on the digital file type classification inference, wherein the automated digital file handling workflow includes a sequence of computer-executable tasks that, when executed, operates to modify one or more of a residency, permissions, and file metadata associate

Type: Application

Filed: July 15, 2022

Publication date: January 19, 2023

Inventors: Steve Woodward, Shaun Becker, Stefan Larson
SYSTEMS AND METHODS FOR INTELLIGENT DIGITAL ITEM DISCOVERY AND MACHINE LEARNING-INFORMED HANDLING OF DIGITAL ITEMS AND DIGITAL ITEM GOVERNANCE

Publication number: 20220413734

Abstract: Systems and methods of computing classifications for and migrating digital content that includes accessing a digital content corpus within a source data storage system; in response to accessing the digital content corpus, for each distinct item of digital content of the plurality of distinct items of digital content: computing, via one or more digital content machine learning classification models, a content classification inference; identifying automated digital content handling tasks of a plurality of distinct digital content handling tasks based on the content classification inference; executing the automated content handling tasks identified for each distinct item of digital content, wherein executing the automated content handling tasks includes: designating a storage location within a target data storage system based on the in-migration content classification inference; and migrating a respective item of digital content from the source data storage system to the designated storage location within the ta

Type: Application

Filed: June 24, 2022

Publication date: December 29, 2022

Inventors: Steve Woodward, Shaun Becker, Stefan Larson
Systems and methods for intelligent digital item discovery and machine learning-informed handling of digital items and digital item governance

Patent number: 11531640

Abstract: Systems and methods of computing classifications for and migrating digital content that includes accessing a digital content corpus within a source data storage system; in response to accessing the digital content corpus, for each distinct item of digital content of the plurality of distinct items of digital content: computing, via one or more digital content machine learning classification models, a content classification inference; identifying automated digital content handling tasks of a plurality of distinct digital content handling tasks based on the content classification inference; executing the automated content handling tasks identified for each distinct item of digital content, wherein executing the automated content handling tasks includes: designating a storage location within a target data storage system based on the in-migration content classification inference; and migrating a respective item of digital content from the source data storage system to the designated storage location within the ta

Type: Grant

Filed: June 24, 2022

Date of Patent: December 20, 2022

Assignee: DryvIQ, Inc.

Inventors: Steve Woodward, Shaun Becker, Stefan Larson
Systems and methods implementing data query language and utterance corpus implements for handling slot-filling and dialogue intent classification data in a machine learning task-oriented dialogue system

Patent number: 11183175

Abstract: A system and method of implementing an intuitive search interface for tactically searching one or more annotated utterance corpora in a machine learning-based dialogue system includes identifying an utterance corpus query for searching one or more annotated utterance corpora of a machine learning-based dialogue system; interpreting the utterance corpus query by translating the utterance corpus query into one or more search expressions recognizable to an utterance sample retrieval program searchably interfacing with the one or more annotated utterance corpora of the machine learning-based dialogue system; retrieving one or more annotated utterance samples from the one or more annotated utterance corpora based on the interpretation of the utterance corpus query; and returning the one or more annotated utterance samples to an intuitive utterance corpus search interface.

Type: Grant

Filed: February 17, 2021

Date of Patent: November 23, 2021

Assignee: Clinc, Inc.

Inventors: Stefan Larson, Kevin Leach, Michael A. Laurenzano
SYSTEMS AND METHODS IMPLEMENTING DATA QUERY LANGUAGE AND UTTERANCE CORPUS IMPLEMENTS FOR HANDLING SLOT-FILLING AND DIALOGUE INTENT CLASSIFICATION DATA IN A MACHINE LEARNING TASK-ORIENTED DIALOGUE SYSTEM

Publication number: 20210264902

Abstract: A system and method of implementing an intuitive search interface for tactically searching one or more annotated utterance corpora in a machine learning-based dialogue system includes identifying an utterance corpus query for searching one or more annotated utterance corpora of a machine learning-based dialogue system; interpreting the utterance corpus query by translating the utterance corpus query into one or more search expressions recognizable to an utterance sample retrieval program searchably interfacing with the one or more annotated utterance corpora of the machine learning-based dialogue system; retrieving one or more annotated utterance samples from the one or more annotated utterance corpora based on the interpretation of the utterance corpus query; and returning the one or more annotated utterance samples to an intuitive utterance corpus search interface.

Type: Application

Filed: February 17, 2021

Publication date: August 26, 2021

Inventors: Stefan Larson, Kevin Leach, Michael A. Laurenzano
Systems and methods for mixed setting training for slot filling machine learning tasks in a machine learning task-oriented dialogue system

Patent number: 11043208

Abstract: Systems and methods for intelligently training a subject machine learning model includes identifying new observations comprising a plurality of distinct samples unseen by a target model during a prior training; creating an incremental training corpus based on randomly sampling a collection of training data samples that includes a plurality of new observations and a plurality of historical training data samples used in the prior training of the target model; implementing a first training mode that includes an incremental training of the target model using samples from the incremental training corpus as model training input; computing performance metrics of the target model based on the incremental training; evaluating the performance metrics of the target model against training mode thresholds; and selectively choosing based on the evaluation one of maintaining the first training mode and automatically switching to a second training mode that includes a full retraining of the target model.

Type: Grant

Filed: February 19, 2021

Date of Patent: June 22, 2021

Assignee: Clinc, Inc.

Inventors: Daniel C. Michelin, Jonathan K. Kummerfeld, Kevin Leach, Stefan Larson, Joseph J. Peper, Yunqi Zhang
SYSTEMS AND METHODS FOR AUTOMATICALLY DETECTING AND REPAIRING SLOT ERRORS IN MACHINE LEARNING TRAINING DATA FOR A MACHINE LEARNING-BASED DIALOGUE SYSTEM

Publication number: 20210166138

Abstract: Systems and methods for automatically detecting annotation discrepancies in annotated training data samples and repairing the annotated training data samples for a machine learning-based automated dialogue system include evaluating a corpus of a plurality of distinct training data samples; identifying one or more of a slot span defect and a slot label defect of a target annotated slot span of a target training data sample of the corpus based on the evaluation; and automatically correcting one or more annotations of the target annotated slot span based on the identified one or more of the slot span defect and the slot label defect.

Type: Application

Filed: January 15, 2021

Publication date: June 3, 2021

Inventors: Stefan Larson, Anish Mahendran, Parker Hill, Jonathan K. Kummerfeld, Michael A. Laurenzano, Lingjia Tang, Jason Mars
Systems and methods for automatically detecting and repairing slot errors in machine learning training data for a machine learning-based dialogue system

Patent number: 10929761

Abstract: Systems and methods for automatically detecting annotation discrepancies in annotated training data samples and repairing the annotated training data samples for a machine learning-based automated dialogue system include evaluating a corpus of a plurality of distinct training data samples; identifying one or more of a slot span defect and a slot label defect of a target annotated slot span of a target training data sample of the corpus based on the evaluation; and automatically correcting one or more annotations of the target annotated slot span based on the identified one or more of the slot span defect and the slot label defect.

Type: Grant

Filed: June 8, 2020

Date of Patent: February 23, 2021

Assignee: Clinic, Inc.

Inventors: Stefan Larson, Anish Mahendran, Parker Hill, Jonathan K. Kummerfeld, Michael A. Laurenzano, Lingjia Tang, Jason Mars
SYSTEMS AND METHODS FOR CONSTRUCTING AN ARTIFICIALLY DIVERSE CORPUS OF TRAINING DATA SAMPLES FOR TRAINING A CONTEXTUALLY-BIASED MODEL FOR A MACHINE LEARNING-BASED DIALOGUE SYSTEM

Publication number: 20210004539

Abstract: Systems and methods for constructing an artificially diverse corpus of training data includes evaluating a corpus of utterance-based training data samples, identifying a slot replacement candidate; deriving distinct skeleton utterances that include the slot replacement candidate, wherein deriving the distinct skeleton utterances includes replacing slots of each of the plurality of distinct utterance training samples with one of a special token and proper slot classification labels; selecting a subset of the distinct skeleton utterances; converting each of the distinct skeleton utterances of the subset back to distinct utterance training samples while still maintaining the special token at a position of the slot replacement candidate; altering a percentage of the distinct utterance training samples with a distinct randomly-generated slot token value at the position of the slot replacement candidate; and constructing the artificially diverse corpus of training samples based on a collection of the percentage of

Type: Application

Filed: September 1, 2020

Publication date: January 7, 2021

Inventors: Andrew Lee, Stefan Larson, Christopher Clarke, Kevin Leach, Jonathan K. Kummerfeld, Parker Hill, Johann Hauswald, Michael A. Laurenzano, Lingjia Tang, Jason Mars
SYSTEMS AND METHODS FOR AUTOMATICALLY DETECTING AND REPAIRING SLOT ERRORS IN MACHINE LEARNING TRAINING DATA FOR A MACHINE LEARNING-BASED DIALOGUE SYSTEM

Publication number: 20200401914

Abstract: Systems and methods for automatically detecting annotation discrepancies in annotated training data samples and repairing the annotated training data samples for a machine learning-based automated dialogue system include evaluating a corpus of a plurality of distinct training data samples; identifying one or more of a slot span defect and a slot label defect of a target annotated slot span of a target training data sample of the corpus based on the evaluation; and automatically correcting one or more annotations of the target annotated slot span based on the identified one or more of the slot span defect and the slot label defect.

Type: Application

Filed: June 8, 2020

Publication date: December 24, 2020

Inventors: Stefan Larson, Anish Mahendran, Parker Hill, Jonathan K. Kummerfeld, Michael A. Laurenzano, Lingjia Tang, Jason Mars
Systems and methods for constructing an artificially diverse corpus of training data samples for training a contextually-biased model for a machine learning-based dialogue system

Patent number: 10796104

Abstract: Systems and methods for constructing an artificially diverse corpus of training data includes evaluating a corpus of utterance-based training data samples, identifying a slot replacement candidate; deriving distinct skeleton utterances that include the slot replacement candidate, wherein deriving the distinct skeleton utterances includes replacing slots of each of the plurality of distinct utterance training samples with one of a special token and proper slot classification labels; selecting a subset of the distinct skeleton utterances; converting each of the distinct skeleton utterances of the subset back to distinct utterance training samples while still maintaining the special token at a position of the slot replacement candidate; altering a percentage of the distinct utterance training samples with a distinct randomly-generated slot token value at the position of the slot replacement candidate; and constructing the artificially diverse corpus of training samples based on a collection of the percentage of

Type: Grant

Filed: June 22, 2020

Date of Patent: October 6, 2020

Assignee: Clinc, Inc.

Inventors: Andrew Lee, Stefan Larson, Christopher Clarke, Kevin Leach, Jonathan K. Kummerfeld, Parker Hill, Johann Hauswald, Michael A. Laurenzano, Lingjia Tang, Jason Mars
SYSTEMS AND METHODS FOR AUTOMATICALLY CONFIGURING TRAINING DATA FOR TRAINING MACHINE LEARNING MODELS OF A MACHINE LEARNING-BASED DIALOGUE SYSTEM

Publication number: 20200258007

Abstract: A system and method for improving a machine learning-based dialogue system includes: sourcing a corpus of raw machine learning training data from sources of training data based on a plurality of seed training samples, wherein the corpus of raw machine learning training data comprises a plurality of distinct instances of training data; generating a vector representation for each distinct instance of training data; identifying statistical characteristics of the corpus of raw machine learning training data based on a mapping of the vector representation for each distinct instance of training data; identifying anomalous instances of the plurality of distinct instances of training data of the corpus of raw machine learning training data based on the identified statistical characteristics of the corpus; and curating the corpus of raw machine learning training data based on each of the instances of training data identified as anomalous instances.

Type: Application

Filed: April 30, 2020

Publication date: August 13, 2020

Inventors: Stefan Larson, Anish Mahendran, Andrew Lee, Jonathan K. Kummerfeld, Parker Hill, Michael A. Laurenzano, Johann Hauswald, Lingjia Tang, Jason Mars
SYSTEMS AND METHODS FOR AUTOMATICALLY CONFIGURING TRAINING DATA FOR TRAINING MACHINE LEARNING MODELS OF A MACHINE LEARNING-BASED DIALOGUE SYSTEM INCLUDING SEEDING TRAINING SAMPLES OR CURATING A CORPUS OF TRAINING DATA BASED ON INSTANCES OF TRAINING DATA IDENTIFIEDAS ANOMALOUS

Publication number: 20200193331

Abstract: A system and method for improving a machine learning-based dialogue system includes: sourcing a corpus of raw machine learning training data from sources of training data based on a plurality of seed training samples, wherein the corpus of raw machine learning training data comprises a plurality of distinct instances of training data; generating a vector representation for each distinct instance of training data; identifying statistical characteristics of the corpus of raw machine learning training data based on a mapping of the vector representation for each distinct instance of training data; identifying anomalous instances of the plurality of distinct instances of training data of the corpus of raw machine learning training data based on the identified statistical characteristics of the corpus; and curating the corpus of raw machine learning training data based on each of the instances of training data identified as anomalous instances.

Type: Application

Filed: November 20, 2019

Publication date: June 18, 2020

Inventors: Stefan Larson, Anish Mahendran, Andrew Lee, Jonathan K. Kummerfeld, Parker Hill, Michael A. Laurenzano, Johann Hauswald, Lingjia Tang, Jason Mars

1 2 next