Patents by Inventor David Alan Johnston
David Alan Johnston has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12271353
Abstract: In general, embodiments of the present invention provide systems and computer readable media for implementing a single data integration platform that supports multiple data access interfaces to a single corpus of stored dynamic data collected from multiple data sources. In embodiments, the data integration platform includes a record tables layer that stores a group of data records and supports a CRUD interface for accessing the data records; a resolution mapping layer that stores a set of entities generated by a many-to-one mapping of data records to entities using entity resolution; and an entities layer that stores resolved entities which may be accessed via either a search interface based on search criteria or a hybrid search interface that supports “get via record id” queries.
Type: Grant
Filed: June 29, 2022
Date of Patent: April 8, 2025
Assignee: Bytedance Inc.
Inventors: David Alan Johnston, Andrew James, Pradhee Tandon, Sivaramakrishnan Natarajan
-
Publication number: 20240362656
Abstract: Systems, apparatus, and methods for determining unique contacts from a collection or pool of merchant data are discussed herein. Some embodiments may provide for an apparatus including circuitry configured to determine programmatic match results indicating whether different instances of merchant data match (e.g., describe the same contact). The circuitry may further determine probabilities of precision or recall errors with the programmatic match results. Programmatic match results having a high probability of error may be annotated by a user to generate user match results. The user match results may be used to generate a more reliable contacts database including unique contacts, as well as to train and/or update the match scoring algorithm. As such, the accuracy of machine-implemented binary classification is improved.
Type: Application
Filed: May 1, 2024
Publication date: October 31, 2024
Inventors: DAVID ALAN JOHNSTON, MATTHEW DELAND, SHAWN JEFFREY, TAYLOR RAACK
-
Patent number: 12002061
Abstract: Systems, apparatus, and methods for determining unique contacts from a collection or pool of merchant data are discussed herein. Some embodiments may provide for an apparatus including circuitry configured to determine programmatic match results indicating whether different instances of merchant data match (e.g., describe the same contact). The circuitry may further determine probabilities of precision or recall errors with the programmatic match results. Programmatic match results having a high probability of error may be annotated by a user to generate user match results. The user match results may be used to generate a more reliable contacts database including unique contacts, as well as to train and/or update the match scoring algorithm. As such, the accuracy of machine-implemented binary classification is improved.
Type: Grant
Filed: November 9, 2021
Date of Patent: June 4, 2024
Assignee: ByteDance Inc.
Inventors: David Alan Johnston, Matthew Deland, Shawn Jeffrey, Taylor Raack
-
Publication number: 20220405255
Abstract: In general, embodiments of the present invention provide systems and computer readable media for implementing a single data integration platform that supports multiple data access interfaces to a single corpus of stored dynamic data collected from multiple data sources. In embodiments, the data integration platform includes a record tables layer that stores a group of data records and supports a CRUD interface for accessing the data records; a resolution mapping layer that stores a set of entities generated by a many-to-one mapping of data records to entities using entity resolution; and an entities layer that stores resolved entities which may be accessed via either a search interface based on search criteria or a hybrid search interface that supports “get via record id” queries.
Type: Application
Filed: June 29, 2022
Publication date: December 22, 2022
Inventors: David Alan JOHNSTON, Andrew JAMES, Pradhee TANDON, Sivaramakrishnan NATARAJAN
-
Publication number: 20220318826
Abstract: Systems, apparatus, and methods for determining unique contacts from a collection or pool of merchant data are discussed herein. Some embodiments may provide for an apparatus including circuitry configured to: access first merchant data associated with a first merchant; access second merchant data associated with a second merchant; determine a match score based on the first merchant data and the second merchant data indicating a likelihood of the first merchant being the same as the second merchant; determine a match score threshold; determine whether the match score exceeds the match score threshold; and in response to determining the match score fails to exceed the match score threshold, determine the first merchant as being different from the second merchant. Some embodiments may provide for techniques for machine learning with merchant data training sets to determine match scores.
Type: Application
Filed: January 20, 2022
Publication date: October 6, 2022
Inventors: David Alan Johnston, Matthew DeLand
-
Patent number: 11392564
Abstract: In general, embodiments of the present invention provide systems and computer readable media for implementing a single data integration platform that supports multiple data access interfaces to a single corpus of stored dynamic data collected from multiple data sources. In embodiments, the data integration platform includes a record tables layer that stores a group of data records and supports a CRUD interface for accessing the data records; a resolution mapping layer that stores a set of entities generated by a many-to-one mapping of data records to entities using entity resolution; and an entities layer that stores resolved entities which may be accessed via either a search interface based on search criteria or a hybrid search interface that supports “get via record id” queries.
Type: Grant
Filed: June 26, 2020
Date of Patent: July 19, 2022
Assignee: GROUPON, INC.
Inventors: David Alan Johnston, Andrew James, Pradhee Tandon, Sivaramakrishnan Natarajan
-
Publication number: 20220207540
Abstract: Systems, apparatus, and methods for determining unique contacts from a collection or pool of merchant data are discussed herein. Some embodiments may provide for an apparatus including circuitry configured to determine programmatic match results indicating whether different instances of merchant data match (e.g., describe the same contact). The circuitry may further determine probabilities of precision or recall errors with the programmatic match results. Programmatic match results having a high probability of error may be annotated by a user to generate user match results. The user match results may be used to generate a more reliable contacts database including unique contacts, as well as to train and/or update the match scoring algorithm. As such, the accuracy of machine-implemented binary classification is improved.
Type: Application
Filed: November 9, 2021
Publication date: June 30, 2022
Inventors: DAVID ALAN JOHNSTON, MATTHEW DELAND, SHAWN JEFFREY, TAYLOR RAACK
-
Publication number: 20220180250
Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for an adaptive oracle-trained learning framework for automatically building and maintaining models that are developed using machine learning algorithms. In embodiments, the framework leverages at least one oracle (e.g., a crowd) for automatic generation of high-quality training data to use in deriving a model. Once a model is trained, the framework monitors the performance of the model and, in embodiments, leverages active learning and the oracle to generate feedback about the changing data for modifying training data sets while maintaining data quality to enable incremental adaptation of the model.
Type: Application
Filed: November 17, 2021
Publication date: June 9, 2022
Inventors: Shawn Ryan Jeffery, David Alan Johnston
-
Publication number: 20220092041
Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for automatic cleaning of entity resolution (ER) data persistently stored in a data repository.
Type: Application
Filed: August 30, 2021
Publication date: March 24, 2022
Inventors: Taylor RAACK, David Alan JOHNSTON
-
Patent number: 11263646
Abstract: Systems, apparatus, and methods for determining unique contacts from a collection or pool of merchant data are discussed herein. Some embodiments may provide for an apparatus including circuitry configured to: access first merchant data associated with a first merchant; access second merchant data associated with a second merchant; determine a match score based on the first merchant data and the second merchant data indicating a likelihood of the first merchant being the same as the second merchant; determine a match score threshold; determine whether the match score exceeds the match score threshold; and in response to determining the match score fails to exceed the match score threshold, determine the first merchant as being different from the second merchant. Some embodiments may provide for techniques for machine learning with merchant data training sets to determine match scores.
Type: Grant
Filed: March 31, 2014
Date of Patent: March 1, 2022
Assignee: GROUPON, INC.
Inventors: David Alan Johnston, Matthew DeLand
-
Patent number: 11232464
Abstract: Systems, apparatus, and methods for determining unique contacts from a collection or pool of merchant data are discussed herein. Some embodiments may provide for an apparatus including circuitry configured to determine programmatic match results indicating whether different instances of merchant data match (e.g., describe the same contact). The circuitry may further determine probabilities of precision or recall errors with the programmatic match results. Programmatic match results having a high probability of error may be annotated by a user to generate user match results. The user match results may be used to generate a more reliable contacts database including unique contacts, as well as to train and/or update the match scoring algorithm. As such, the accuracy of machine-implemented binary classification is improved.
Type: Grant
Filed: July 23, 2019
Date of Patent: January 25, 2022
Assignee: GROUPON, INC.
Inventors: David Alan Johnston, Matthew Deland, Shawn Ryan Jeffrey, Taylor Raack
-
Patent number: 11210604
Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for an adaptive oracle-trained learning framework for automatically building and maintaining models that are developed using machine learning algorithms. In embodiments, the framework leverages at least one oracle (e.g., a crowd) for automatic generation of high-quality training data to use in deriving a model. Once a model is trained, the framework monitors the performance of the model and, in embodiments, leverages active learning and the oracle to generate feedback about the changing data for modifying training data sets while maintaining data quality to enable incremental adaptation of the model.
Type: Grant
Filed: December 19, 2014
Date of Patent: December 28, 2021
Assignee: Groupon, Inc.
Inventors: Shawn Ryan Jeffery, David Alan Johnston
-
Patent number: 11132343
Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for automatic cleaning of entity resolution (ER) data persistently stored in a data repository.
Type: Grant
Filed: March 18, 2016
Date of Patent: September 28, 2021
Assignee: Groupon, Inc.
Inventors: Taylor Raack, David Alan Johnston
-
Publication number: 20200401566
Abstract: In general, embodiments of the present invention provide systems and computer readable media for implementing a single data integration platform that supports multiple data access interfaces to a single corpus of stored dynamic data collected from multiple data sources. In embodiments, the data integration platform includes a record tables layer that stores a group of data records and supports a CRUD interface for accessing the data records; a resolution mapping layer that stores a set of entities generated by a many-to-one mapping of data records to entities using entity resolution; and an entities layer that stores resolved entities which may be accessed via either a search interface based on search criteria or a hybrid search interface that supports “get via record id” queries.
Type: Application
Filed: June 26, 2020
Publication date: December 24, 2020
Inventors: David Alan JOHNSTON, Andrew JAMES, Pradhee TANDON, Sivaramakrishnan NATARAJAN
-
Publication number: 20200302337
Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for an adaptive oracle-trained learning framework for automatically building and maintaining models that are developed using machine learning algorithms. In embodiments, the framework leverages at least one oracle (e.g., a crowd) for automatic generation of high-quality training data to use in deriving a model. Once a model is trained, the framework monitors the performance of the model and, in embodiments, leverages active learning and the oracle to generate feedback about the changing data for modifying training data sets while maintaining data quality to enable incremental adaptation of the model.
Type: Application
Filed: March 4, 2020
Publication date: September 24, 2020
Inventors: SHAWN RYAN JEFFERY, Nick PENDAR, Mark Thomas DALY, Matthew DELAND, David Alan JOHNSTON
-
Publication number: 20200293951
Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media configured to receive configuration data describing a desired data set distribution, and, in response to receiving new data instances, use the configuration data and the new data instances to dynamically optimize the distribution of data already stored in a data reservoir that has been discretized into bins representing the desired data distribution.
Type: Application
Filed: March 27, 2020
Publication date: September 17, 2020
Inventors: David Alan Johnston, Jonathan Esterhazy, Gaston L'Huillier, Hernan Enrique Arroyo Garcia
-
Patent number: 10733157
Abstract: In general, embodiments of the present invention provide systems and computer readable media for implementing a single data integration platform that supports multiple data access interfaces to a single corpus of stored dynamic data collected from multiple data sources. In embodiments, the data integration platform includes a record tables layer that stores a group of data records and supports a CRUD interface for accessing the data records; a resolution mapping layer that stores a set of entities generated by a many-to-one mapping of data records to entities using entity resolution; and an entities layer that stores resolved entities which may be accessed via either a search interface based on search criteria or a hybrid search interface that supports “get via record id” queries.
Type: Grant
Filed: June 24, 2016
Date of Patent: August 4, 2020
Assignee: Groupon, Inc.
Inventors: David Alan Johnston, Andrew James, Pradhee Tandon, Sivaramakrishnan Natarajan
-
Patent number: 10657457
Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for an adaptive oracle-trained learning framework for automatically building and maintaining models that are developed using machine learning algorithms. In embodiments, the framework leverages at least one oracle (e.g., a crowd) for automatic generation of high-quality training data to use in deriving a model. Once a model is trained, the framework monitors the performance of the model and, in embodiments, leverages active learning and the oracle to generate feedback about the changing data for modifying training data sets while maintaining data quality to enable incremental adaptation of the model.
Type: Grant
Filed: December 19, 2014
Date of Patent: May 19, 2020
Assignee: GROUPON, INC.
Inventors: Shawn Ryan Jeffery, Nick Pendar, Mark Thomas Daly, Matthew DeLand, David Alan Johnston
-
Patent number: 10650326
Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media configured to receive configuration data describing a desired data set distribution, and, in response to receiving new data instances, use the configuration data and the new data instances to dynamically optimize the distribution of data already stored in a data reservoir that has been discretized into bins representing the desired data distribution.
Type: Grant
Filed: August 3, 2015
Date of Patent: May 12, 2020
Assignee: GROUPON, INC.
Inventors: David Alan Johnston, Jonathan Esterhazy, Gaston L'Huillier, Hernan Enrique Arroyo Garcia
-
Patent number: 10614373
Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for an adaptive oracle-trained learning framework for automatically building and maintaining models that are developed using machine learning algorithms. In embodiments, the framework leverages at least one oracle (e.g., a crowd) for automatic generation of high-quality training data to use in deriving a model. Once a model is trained, the framework monitors the performance of the model and, in embodiments, leverages active learning and the oracle to generate feedback about the changing data for modifying training data sets while maintaining data quality to enable incremental adaptation of the model.
Type: Grant
Filed: December 19, 2014
Date of Patent: April 7, 2020
Assignee: GROUPON, INC.
Inventors: Shawn Ryan Jeffery, David Alan Johnston, Jonathan Esterhazy, Gaston L'Huillier, Hernan Enrique Arroyo Garcia