Patents by Inventor David Alan Johnston

David Alan Johnston has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 12271353
    Abstract: In general, embodiments of the present invention provide systems and computer readable media for implementing a single data integration platform that supports multiple data access interfaces to a single corpus of stored dynamic data collected from multiple data sources. In embodiments, the data integration platform includes a record tables layer that stores a group of data records and supports a CRUD interface for accessing the data records; a resolution mapping layer that stores a set of entities generated by a many-to-one mapping of data records to entities using entity resolution; and an entities layer that stores resolved entities which may be accessed via either a search interface based on search criteria or a hybrid search interface that supports “get via record id” queries.
    Type: Grant
    Filed: June 29, 2022
    Date of Patent: April 8, 2025
    Assignee: Bytedance Inc.
    Inventors: David Alan Johnston, Andrew James, Pradhee Tandon, Sivaramakrishnan Natarajan
  • Publication number: 20240362656
    Abstract: Systems, apparatus, and methods for determining unique contacts from a collection or pool of merchant data are discussed herein. Some embodiments may provide for an apparatus including circuitry configured to determine programmatic match results indicating whether different instances of merchant data match (e.g., describe the same contact). The circuitry may further determine probabilities of precision or recall errors with the programmatic match results. Programmatic match results having a high probability of error may be annotated by a user to generate user match results. The user match results may be used to generate a more reliable contacts database including unique contacts, as well as to train and/or update the match scoring algorithm. As such, the accuracy of machine-implemented binary classification is improved.
    Type: Application
    Filed: May 1, 2024
    Publication date: October 31, 2024
    Inventors: DAVID ALAN JOHNSTON, MATTHEW DELAND, SHAWN JEFFREY, TAYLOR RAACK
  • Patent number: 12002061
    Abstract: Systems, apparatus, and methods for determining unique contacts from a collection or pool of merchant data are discussed herein. Some embodiments may provide for an apparatus including circuitry configured to determine programmatic match results indicating whether different instances of merchant data match (e.g., describe the same contact). The circuitry may further determine probabilities of precision or recall errors with the programmatic match results. Programmatic match results having a high probability of error may be annotated by a user to generate user match results. The user match results may be used to generate a more reliable contacts database including unique contacts, as well as to train and/or update the match scoring algorithm. As such, the accuracy of machine-implemented binary classification is improved.
    Type: Grant
    Filed: November 9, 2021
    Date of Patent: June 4, 2024
    Assignee: ByteDance Inc.
    Inventors: David Alan Johnston, Matthew Deland, Shawn Jeffrey, Taylor Raack
  • Publication number: 20220405255
    Abstract: In general, embodiments of the present invention provide systems and computer readable media for implementing a single data integration platform that supports multiple data access interfaces to a single corpus of stored dynamic data collected from multiple data sources. In embodiments, the data integration platform includes a record tables layer that stores a group of data records and supports a CRUD interface for accessing the data records; a resolution mapping layer that stores a set of entities generated by a many-to-one mapping of data records to entities using entity resolution; and an entities layer that stores resolved entities which may be accessed via either a search interface based on search criteria or a hybrid search interface that supports “get via record id” queries.
    Type: Application
    Filed: June 29, 2022
    Publication date: December 22, 2022
    Inventors: David Alan JOHNSTON, Andrew JAMES, Pradhee TANDON, Sivaramakrishnan NATARAJAN
  • Publication number: 20220318826
    Abstract: Systems, apparatus, and methods for determining unique contacts from a collection or pool of merchant data are discussed herein. Some embodiments may provide for an apparatus including circuitry configured to: access first merchant data associated with a first merchant; access second merchant data associated with a second merchant; determine a match score based the first merchant data and the second merchant data indicating a likelihood of the first merchant being the same as the second merchant; determine a match score threshold; determine whether the match score exceeds the match score threshold; and in response determining the match score fails to exceed the match score threshold, determine the first merchant as being different from the second merchant. Some embodiments may provide for techniques for machine learning with merchant data training sets to determine match scores.
    Type: Application
    Filed: January 20, 2022
    Publication date: October 6, 2022
    Inventors: David Alan Johnston, Matthew DeLand
  • Patent number: 11392564
    Abstract: In general, embodiments of the present invention provide systems and computer readable media for implementing a single data integration platform that supports multiple data access interfaces to a single corpus of stored dynamic data collected from multiple data sources. In embodiments, the data integration platform includes a record tables layer that stores a group of data records and supports a CRUD interface for accessing the data records; a resolution mapping layer that stores a set of entities generated by a many-to-one mapping of data records to entities using entity resolution; and an entities layer that stores resolved entities which may be accessed via either a search interface based on search criteria or a hybrid search interface that supports “get via record id” queries.
    Type: Grant
    Filed: June 26, 2020
    Date of Patent: July 19, 2022
    Assignee: GROUPON, INC.
    Inventors: David Alan Johnston, Andrew James, Pradhee Tandon, Sivaramakrishnan Natarajan
  • Publication number: 20220207540
    Abstract: Systems, apparatus, and methods for determining unique contacts from a collection or pool of merchant data are discussed herein. Some embodiments may provide for an apparatus including circuitry configured to determine programmatic match results indicating whether different instances of merchant data match (e.g., describe the same contact). The circuitry may further determine probabilities of precision or recall errors with the programmatic match results. Programmatic match results having a high probability of error may be annotated by a user to generate user match results. The user match results may be used to generate a more reliable contacts database including unique contacts, as well as to train and/or update the match scoring algorithm. As such, the accuracy of machine-implemented binary classification is improved.
    Type: Application
    Filed: November 9, 2021
    Publication date: June 30, 2022
    Inventors: DAVID ALAN JOHNSTON, MATTHEW DELAND, SHAWN JEFFREY, TAYLOR RAACK
  • Publication number: 20220180250
    Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for an adaptive oracle-trained learning framework for automatically building and maintaining models that are developed using machine learning algorithms. In embodiments, the framework leverages at least one oracle (e.g., a crowd) for automatic generation of high-quality training data to use in deriving a model. Once a model is trained, the framework monitors the performance of the model and, in embodiments, leverages active learning and the oracle to generate feedback about the changing data for modifying training data sets while maintaining data quality to enable incremental adaptation of the model.
    Type: Application
    Filed: November 17, 2021
    Publication date: June 9, 2022
    Inventors: Shawn Ryan Jeffery, David Alan Johnston
  • Publication number: 20220092041
    Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for automatic cleaning of entity resolution (ER) data persistently stored in a data repository.
    Type: Application
    Filed: August 30, 2021
    Publication date: March 24, 2022
    Inventors: Taylor RAACK, David Alan JOHNSTON
  • Patent number: 11263646
    Abstract: Systems, apparatus, and methods for determining unique contacts from a collection or pool of merchant data are discussed herein. Some embodiments may provide for an apparatus including circuitry configured to: access first merchant data associated with a first merchant; access second merchant data associated with a second merchant; determine a match score based the first merchant data and the second merchant data indicating a likelihood of the first merchant being the same as the second merchant; determine a match score threshold; determine whether the match score exceeds the match score threshold; and in response determining the match score fails to exceed the match score threshold, determine the first merchant as being different from the second merchant. Some embodiments may provide for techniques for machine learning with merchant data training sets to determine match scores.
    Type: Grant
    Filed: March 31, 2014
    Date of Patent: March 1, 2022
    Assignee: GROUPON, INC.
    Inventors: David Alan Johnston, Matthew DeLand
  • Patent number: 11232464
    Abstract: Systems, apparatus, and methods for determining unique contacts from a collection or pool of merchant data are discussed herein. Some embodiments may provide for an apparatus including circuitry configured to determine programmatic match results indicating whether different instances of merchant data match (e.g., describe the same contact). The circuitry may further determine probabilities of precision or recall errors with the programmatic match results. Programmatic match results having a high probability of error may be annotated by a user to generate user match results. The user match results may be used to generate a more reliable contacts database including unique contacts, as well as to train and/or update the match scoring algorithm. As such, the accuracy of machine-implemented binary classification is improved.
    Type: Grant
    Filed: July 23, 2019
    Date of Patent: January 25, 2022
    Assignee: GROUPON, INC.
    Inventors: David Alan Johnston, Matthew Deland, Shawn Ryan Jeffrey, Taylor Raack
  • Patent number: 11210604
    Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for an adaptive oracle-trained learning framework for automatically building and maintaining models that are developed using machine learning algorithms. In embodiments, the framework leverages at least one oracle (e.g., a crowd) for automatic generation of high-quality training data to use in deriving a model. Once a model is trained, the framework monitors the performance of the model and, in embodiments, leverages active learning and the oracle to generate feedback about the changing data for modifying training data sets while maintaining data quality to enable incremental adaptation of the model.
    Type: Grant
    Filed: December 19, 2014
    Date of Patent: December 28, 2021
    Assignee: Groupon, Inc.
    Inventors: Shawn Ryan Jeffery, David Alan Johnston
  • Patent number: 11132343
    Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for automatic cleaning of entity resolution (ER) data persistently stored in a data repository.
    Type: Grant
    Filed: March 18, 2016
    Date of Patent: September 28, 2021
    Assignee: Groupon, Inc.
    Inventors: Taylor Raack, David Alan Johnston
  • Publication number: 20200401566
    Abstract: In general, embodiments of the present invention provide systems and computer readable media for implementing a single data integration platform that supports multiple data access interfaces to a single corpus of stored dynamic data collected from multiple data sources. In embodiments, the data integration platform includes a record tables layer that stores a group of data records and supports a CRUD interface for accessing the data records; a resolution mapping layer that stores a set of entities generated by a many-to-one mapping of data records to entities using entity resolution; and an entities layer that stores resolved entities which may be accessed via either a search interface based on search criteria or a hybrid search interface that supports “get via record id” queries.
    Type: Application
    Filed: June 26, 2020
    Publication date: December 24, 2020
    Inventors: David Alan JOHNSTON, Andrew JAMES, Pradhee TANDON, Sivaramakrishnan NATARAJAN
  • Publication number: 20200302337
    Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for an adaptive oracle-trained learning framework for automatically building and maintaining models that are developed using machine learning algorithms. In embodiments, the framework leverages at least one oracle (e.g., a crowd) for automatic generation of high-quality training data to use in deriving a model. Once a model is trained, the framework monitors the performance of the model and, in embodiments, leverages active learning and the oracle to generate feedback about the changing data for modifying training data sets while maintaining data quality to enable incremental adaptation of the model.
    Type: Application
    Filed: March 4, 2020
    Publication date: September 24, 2020
    Inventors: SHAWN RYAN JEFFERY, Nick PENDAR, Mark Thomas DALY, Matthew DELAND, David Alan JOHNSTON
  • Publication number: 20200293951
    Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media configured to receive configuration data describing a desired data set distribution, and, in response to receiving new data instances, use the configuration data and the new data instances to dynamically optimize the distribution of data already stored in a data reservoir that has been discretized into bins representing the desired data distribution.
    Type: Application
    Filed: March 27, 2020
    Publication date: September 17, 2020
    Inventors: David Alan Johnston, Jonathan Esterhazy, Gaston L'Huillier, Hernan Enrique Arroyo Garcia
  • Patent number: 10733157
    Abstract: In general, embodiments of the present invention provide systems and computer readable media for implementing a single data integration platform that supports multiple data access interfaces to a single corpus of stored dynamic data collected from multiple data sources. In embodiments, the data integration platform includes a record tables layer that stores a group of data records and supports a CRUD interface for accessing the data records; a resolution mapping layer that stores a set of entities generated by a many-to-one mapping of data records to entities using entity resolution; and an entities layer that stores resolved entities which may be accessed via either a search interface based on search criteria or a hybrid search interface that supports “get via record id” queries.
    Type: Grant
    Filed: June 24, 2016
    Date of Patent: August 4, 2020
    Assignee: Groupon, Inc.
    Inventors: David Alan Johnston, Andrew James, Pradhee Tandon, Sivaramakrishnan Natarajan
  • Patent number: 10657457
    Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for an adaptive oracle-trained learning framework for automatically building and maintaining models that are developed using machine learning algorithms. In embodiments, the framework leverages at least one oracle (e.g., a crowd) for automatic generation of high-quality training data to use in deriving a model. Once a model is trained, the framework monitors the performance of the model and, in embodiments, leverages active learning and the oracle to generate feedback about the changing data for modifying training data sets while maintaining data quality to enable incremental adaptation of the model.
    Type: Grant
    Filed: December 19, 2014
    Date of Patent: May 19, 2020
    Assignee: GROUPON, INC.
    Inventors: Shawn Ryan Jeffery, Nick Pendar, Mark Thomas Daly, Matthew DeLand, David Alan Johnston
  • Patent number: 10650326
    Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media configured to receive configuration data describing a desired data set distribution, and, in response to receiving new data instances, use the configuration data and the new data instances to dynamically optimize the distribution of data already stored in a data reservoir that has been discretized into bins representing the desired data distribution.
    Type: Grant
    Filed: August 3, 2015
    Date of Patent: May 12, 2020
    Assignee: GROUPON, INC.
    Inventors: David Alan Johnston, Jonathan Esterhazy, Gaston L'Huillier, Hernan Enrique Arroyo Garcia
  • Patent number: 10614373
    Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for an adaptive oracle-trained learning framework for automatically building and maintaining models that are developed using machine learning algorithms. In embodiments, the framework leverages at least one oracle (e.g., a crowd) for automatic generation of high-quality training data to use in deriving a model. Once a model is trained, the framework monitors the performance of the model and, in embodiments, leverages active learning and the oracle to generate feedback about the changing data for modifying training data sets while maintaining data quality to enable incremental adaptation of the model.
    Type: Grant
    Filed: December 19, 2014
    Date of Patent: April 7, 2020
    Assignee: GROUPON, INC.
    Inventors: Shawn Ryan Jeffery, David Alan Johnston, Jonathan Esterhazy, Gaston L'Huillier, Hernan Enrique Arroyo Garcia