Patents by Inventor Nick Pendar

Nick Pendar has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Method, apparatus, and computer program product for classification and tagging of textual data

Patent number: 10853401

Abstract: Provided herein are systems, methods and computer readable media for classification and tagging of textual data. An example method may include accessing a corpus comprising a plurality of documents, each document having one or more labels indicative of services offered by a merchant, generating a query based on extracted features and the documents, generating a precision score for at least a portion of the generated query and selecting a subset of the generated queries based on an assigned precision score satisfying a precision score threshold, the selected subset of the generated queries configured to provide an indication of one or more labels to be applied to machine readable text. A second example method, utilized for tagging machine readable text with unknown labels, may include assigning a label to textual portions of the machine readable text based on results of the application of the queries.

Type: Grant

Filed: July 15, 2019

Date of Patent: December 1, 2020

Assignee: GROUPON, INC.

Inventor: Nick Pendar
AUTOMATIC SELECTION OF HIGH QUALITY TRAINING DATA USING AN ADAPTIVE ORACLE-TRAINED LEARNING FRAMEWORK

Publication number: 20200302337

Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for an adaptive oracle-trained learning framework for automatically building and maintaining models that are developed using machine learning algorithms. In embodiments, the framework leverages at least one oracle (e.g., a crowd) for automatic generation of high-quality training data to use in deriving a model. Once a model is trained, the framework monitors the performance of the model and, in embodiments, leverages active learning and the oracle to generate feedback about the changing data for modifying training data sets while maintaining data quality to enable incremental adaptation of the model.

Type: Application

Filed: March 4, 2020

Publication date: September 24, 2020

Inventors: SHAWN RYAN JEFFERY, Nick PENDAR, Mark Thomas DALY, Matthew DELAND, David Alan JOHNSTON
Multi-term query subsumption for document classification

Patent number: 10726055

Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for generating an optimal classifying query set for categorizing and/or labeling textual data based on a query subsumption calculus to determine, given two queries, whether one of the queries subsumes another. In one aspect, a method includes generating a group of determining queries based on analyzing text within a document; receiving a group of classifying queries; and, for each determining query within the group of determining queries, determining whether at least one of the classifying queries is subsumed by the determining query; and updating the group of classifying queries in an instance in which the classifying query is subsumed by the determining query.

Type: Grant

Filed: April 7, 2017

Date of Patent: July 28, 2020

Assignee: GROUPON, INC.

Inventor: Nick Pendar
DISCOVERY OF NEW BUSINESS OPENINGS USING WEB CONTENT ANALYSIS

Publication number: 20200160358

Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for identifying a new business based on programmatically analyzing content received from online sources and, as a result, discovering one or more references to the business. In embodiments, the system stores historical data representing previously identified new businesses and then uses attributes of those businesses in search queries to receive related content. Additionally or alternatively, the system stores data representing online sources that historically provided content containing references to new businesses and then continues to access those sources for additional content. In embodiments, the system performs content analysis on structured and/or unstructured content.

Type: Application

Filed: October 22, 2019

Publication date: May 21, 2020

Inventors: Shawn Ryan Jeffery, Nick Pendar, Richard Clark Barber
Automatic selection of high quality training data using an adaptive oracle-trained learning framework

Patent number: 10657457

Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for an adaptive oracle-trained learning framework for automatically building and maintaining models that are developed using machine learning algorithms. In embodiments, the framework leverages at least one oracle (e.g., a crowd) for automatic generation of high-quality training data to use in deriving a model. Once a model is trained, the framework monitors the performance of the model and, in embodiments, leverages active learning and the oracle to generate feedback about the changing data for modifying training data sets while maintaining data quality to enable incremental adaptation of the model.

Type: Grant

Filed: December 19, 2014

Date of Patent: May 19, 2020

Assignee: GROUPON, INC.

Inventors: Shawn Ryan Jeffery, Nick Pendar, Mark Thomas Daly, Matthew DeLand, David Alan Johnston
METHOD, APPARATUS, AND COMPUTER PROGRAM PRODUCT FOR CLASSIFICATION AND TAGGING OF TEXTUAL DATA

Publication number: 20200050617

Abstract: Provided herein are systems, methods and computer readable media for classification and tagging of textual data. An example method may include accessing a corpus comprising a plurality of documents, each document having one or more labels indicative of services offered by a merchant, generating a query based on extracted features and the documents, generating a precision score for at least a portion of the generated query and selecting a subset of the generated queries based on an assigned precision score satisfying a precision score threshold, the selected subset of the generated queries configured to provide an indication of one or more labels to be applied to machine readable text. A second example method, utilized for tagging machine readable text with unknown labels, may include assigning a label to textual portions of the machine readable text based on results of the application of the queries.

Type: Application

Filed: July 15, 2019

Publication date: February 13, 2020

Inventor: Nick Pendar
Automated Dynamic Data Quality Assessment

Publication number: 20200027018

Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for automated dynamic data quality assessment. One aspect of the subject matter described in this specification includes the actions of receiving a data quality job including a new data sample; and, if the new data sample is determined to be added to a reservoir of data samples, sending a quality verification request to an oracle; receiving a new data sample quality estimate from the oracle; and adding the new data sample and estimate to the reservoir. A second aspect of the subject matter includes the actions of receiving, from a predictive model, a judgment associated with a new data sample; analyzing the new data sample based in part on the judgment to determine whether to send a new data sample quality verification request to an oracle; and, if a new data sample quality estimate is received from the oracle, determining whether to add the new data sample and the judgment to the reservoir.

Type: Application

Filed: June 6, 2019

Publication date: January 23, 2020

Inventors: Mark Thomas Daly, Shawn Ryan Jeffery, Matthew DeLand, Nick Pendar, Andrew James, David Alan Johnston
Discovery of new business openings using web content analysis

Patent number: 10489800

Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for identifying a new business based on programmatically analyzing content received from online sources and, as a result, discovering one or more references to the business. In embodiments, the system stores historical data representing previously identified new businesses and then uses attributes of those businesses in search queries to receive related content. Additionally or alternatively, the system stores data representing online sources that historically provided content containing references to new businesses and then continues to access those sources for additional content. In embodiments, the system performs content analysis on structured and/or unstructured content.

Type: Grant

Filed: August 25, 2017

Date of Patent: November 26, 2019

Assignee: GROUPON, INC.

Inventors: Shawn Ryan Jeffery, Nick Pendar, Richard Clark Barber
Anomaly detection and automated analysis using weighted directed graphs

Patent number: 10394631

Abstract: A method includes receiving a data set. The data set includes a plurality of data subsets wherein each data subset is associated with one transaction in a fully or partially masked network. The method further includes processing each data subset according to a plurality of rules to generate a plurality of activation values and an output for the each data subset. The plurality of activation values and the output for the each data subset form an activation pattern for the each data subset. The method also includes generating a predictive model based on the activation patterns. The method further includes identifying a subset of transactions as outliers based on the predictive model.

Type: Grant

Filed: September 18, 2017

Date of Patent: August 27, 2019

Assignee: Callidus Software, Inc.

Inventors: Nick Pendar, Terison Gregory
Automated Adaptive Data Analysis Using Dynamic Data Quality Assessment

Publication number: 20190258954

Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for automated dynamic data quality assessment. One aspect of the subject matter described in this specification includes the actions of receiving a data quality job including a new data sample; and, if the new data sample is determined to be added to a reservoir of data samples, sending a quality verification request to an oracle; receiving a new data sample quality estimate from the oracle; and adding the new data sample and estimate to the reservoir. A second aspect of the subject matter includes the actions of receiving, from a predictive model, a judgment associated with a new data sample; analyzing the new data sample based in part on the judgment to determine whether to send a new data sample quality verification request to an oracle; and, if a new data sample quality estimate is received from the oracle, determining whether to add the new data sample and the judgment to the reservoir.

Type: Application

Filed: February 19, 2019

Publication date: August 22, 2019

Inventors: Mark Thomas Daly, Shawn Ryan Jeffery, Matthew DeLand, Nick Pendar, Andrew James, David Johnston
Method, apparatus, and computer program product for classification and tagging of textual data

Patent number: 10387470

Abstract: Provided herein are systems, methods and computer readable media for classification and tagging of textual data. An example method may include accessing a corpus comprising a plurality of documents, each document having one or more labels indicative of services offered by a merchant, generating a query based on extracted features and the documents, generating a precision score for at least a portion of the generated query and selecting a subset of the generated queries based on an assigned precision score satisfying a precision score threshold, the selected subset of the generated queries configured to provide an indication of one or more labels to be applied to machine readable text. A second example method, utilized for tagging machine readable text with unknown labels, may include assigning a label to textual portions of the machine readable text based on results of the application of the queries.

Type: Grant

Filed: February 23, 2016

Date of Patent: August 20, 2019

Assignee: GROUPON, INC.

Inventor: Nick Pendar
Automated dynamic data quality assessment

Patent number: 10360516

Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for automated dynamic data quality assessment. One aspect of the subject matter described in this specification includes the actions of receiving a data quality job including a new data sample; and, if the new data sample is determined to be added to a reservoir of data samples, sending a quality verification request to an oracle; receiving a new data sample quality estimate from the oracle; and adding the new data sample and estimate to the reservoir. A second aspect of the subject matter includes the actions of receiving, from a predictive model, a judgment associated with a new data sample; analyzing the new data sample based in part on the judgment to determine whether to send a new data sample quality verification request to an oracle; and, if a new data sample quality estimate is received from the oracle, determining whether to add the new data sample and the judgment to the reservoir.

Type: Grant

Filed: June 12, 2017

Date of Patent: July 23, 2019

Assignee: GROUPON, INC.

Inventors: Mark Thomas Daly, Shawn Ryan Jeffery, Matthew DeLand, Nick Pendar, Andrew James, David Johnston
Automated adaptive data analysis using dynamic data quality assessment

Patent number: 10262277

Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for automated dynamic data quality assessment. One aspect of the subject matter described in this specification includes the actions of receiving a data quality job including a new data sample; and, if the new data sample is determined to be added to a reservoir of data samples, sending a quality verification request to an oracle; receiving a new data sample quality estimate from the oracle; and adding the new data sample and estimate to the reservoir. A second aspect of the subject matter includes the actions of receiving, from a predictive model, a judgment associated with a new data sample; analyzing the new data sample based in part on the judgment to determine whether to send a new data sample quality verification request to an oracle; and, if a new data sample quality estimate is received from the oracle, determining whether to add the new data sample and the judgment to the reservoir.

Type: Grant

Filed: February 8, 2017

Date of Patent: April 16, 2019

Assignee: GROUPON, INC.

Inventors: Mark Thomas Daly, Shawn Ryan Jeffery, Matthew DeLand, Nick Pendar, Andrew James, David Johnston
ANOMALY DETECTION AND AUTOMATED ANALYSIS USING WEIGHTED DIRECTED GRAPHS

Publication number: 20190087248

Abstract: A method includes receiving a data set. The data set includes a plurality of data subsets wherein each data subset is associated with one transaction in a fully or partially masked network. The method further includes processing each data subset according to a plurality of rules to generate a plurality of activation values and an output for the each data subset. The plurality of activation values and the output for the each data subset form an activation pattern for the each data subset. The method also includes generating a predictive model based on the activation patterns. The method further includes identifying a subset of transactions as outliers based on the predictive model.

Type: Application

Filed: September 18, 2017

Publication date: March 21, 2019

Inventors: Nick Pendar, Terison Gregory
ANOMALY DETECTION AND AUTOMATED ANALYSIS IN SYSTEMS BASED ON FULLY MASKED WEIGHTED DIRECTED

Publication number: 20190087737

Abstract: A method includes processing data sets according to a plurality of rules to generate an activation pattern for each data set. Each activation pattern includes an activation value for each rule of the plurality of rules. The method also includes normalizing the activation value for each rule and determining a standard deviation of the activation value for each rule. The method further includes identifying a first subset of rules of the plurality of rules. Each rule of the first subset of rules has activation value with the standard deviation greater than a standard deviation threshold. The method also includes identifying, using an unsupervised machine learning algorithm, outlier activation patterns and analyzing the outlier activation patterns based on a second subset of rules of the plurality of rules. The second subset of rules is a subset of the first subset of rules.

Type: Application

Filed: September 18, 2017

Publication date: March 21, 2019

Inventors: Nick Pendar, Terison Gregory
Automated Dynamic Data Quality Assessment

Publication number: 20180150767

Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for automated dynamic data quality assessment. One aspect of the subject matter described in this specification includes the actions of receiving a data quality job including a new data sample; and, if the new data sample is determined to be added to a reservoir of data samples, sending a quality verification request to an oracle; receiving a new data sample quality estimate from the oracle; and adding the new data sample and estimate to the reservoir. A second aspect of the subject matter includes the actions of receiving, from a predictive model, a judgment associated with a new data sample; analyzing the new data sample based in part on the judgment to determine whether to send a new data sample quality verification request to an oracle; and, if a new data sample quality estimate is received from the oracle, determining whether to add the new data sample and the judgment to the reservoir.

Type: Application

Filed: June 12, 2017

Publication date: May 31, 2018

Inventors: Mark Thomas Daly, Shawn Ryan Jeffery, Matthew DeLand, Nick Pendar, Andrew James, David Johnston
DISCOVERY OF NEW BUSINESS OPENINGS USING WEB CONTENT ANALYSIS

Publication number: 20180137524

Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for identifying a new business based on programmatically analyzing content received from online sources and, as a result, discovering one or more references to the business. In embodiments, the system stores historical data representing previously identified new businesses and then uses attributes of those businesses in search queries to receive related content. Additionally or alternatively, the system stores data representing online sources that historically provided content containing references to new businesses and then continues to access those sources for additional content. In embodiments, the system performs content analysis on structured and/or unstructured content.

Type: Application

Filed: August 25, 2017

Publication date: May 17, 2018

Inventors: Shawn Ryan Jeffery, Nick Pendar, Richard Clark Barber
Multi-Term Query Subsumption For Document Classification

Publication number: 20180032532

Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for generating an optimal classifying query set for categorizing and/or labeling textual data based on a query subsumption calculus to determine, given two queries, whether one of the queries subsumes another. In one aspect, a method includes generating a group of determining queries based on analyzing text within a document; receiving a group of classifying queries; and, for each determining query within the group of determining queries, determining whether at least one of the classifying queries is subsumed by the determining query; and updating the group of classifying queries in an instance in which the classifying query is subsumed by the determining query.

Type: Application

Filed: April 7, 2017

Publication date: February 1, 2018

Inventor: Nick Pendar
Automated Adaptive Data Analysis Using Dynamic Data Quality Assessment

Publication number: 20170372228

Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for automated dynamic data quality assessment. One aspect of the subject matter described in this specification includes the actions of receiving a data quality job including a new data sample; and, if the new data sample is determined to be added to a reservoir of data samples, sending a quality verification request to an oracle; receiving a new data sample quality estimate from the oracle; and adding the new data sample and estimate to the reservoir. A second aspect of the subject matter includes the actions of receiving, from a predictive model, a judgment associated with a new data sample; analyzing the new data sample based in part on the judgment to determine whether to send a new data sample quality verification request to an oracle; and, if a new data sample quality estimate is received from the oracle, determining whether to add the new data sample and the judgment to the reservoir.

Type: Application

Filed: February 8, 2017

Publication date: December 28, 2017

Inventors: Mark Thomas Daly, Shawn Ryan Jeffery, Matthew DeLand, Nick Pendar, Andrew James, David Johnston
Discovery of new business openings using web content analysis

Patent number: 9773252

Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for identifying a new business based on programmatically analyzing content received from online sources and, as a result, discovering one or more references to the business. In embodiments, the system stores historical data representing previously identified new businesses and then uses attributes of those businesses in search queries to receive related content. Additionally or alternatively, the system stores data representing online sources that historically provided content containing references to new businesses and then continues to access those sources for additional content. In embodiments, the system performs content analysis on structured and/or unstructured content.

Type: Grant

Filed: July 24, 2015

Date of Patent: September 26, 2017

Assignee: Groupon, Inc.

Inventors: Shawn Ryan Jeffery, Nick Pendar, Richard Clark Barber

prev 1 2 3 next