Patents by Inventor Andrew Borthwick

Andrew Borthwick has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11941016
    Abstract: Specified performance attributes may be used to configure machine learning transformations for ETL jobs. Performance attributes for a machine learning pipeline that applies a model to as part of a transformation for an ETL job may be used to configure a parameter in a stage of the machine learning pipeline. The configured stage may then be used when training the model. The trained machine learning pipeline may then be applied as part of a transformation operation included in an ETL job performed by the ETL system.
    Type: Grant
    Filed: March 4, 2022
    Date of Patent: March 26, 2024
    Assignee: Amazon Technologies, Inc.
    Inventors: Timothy Jones, Andrew Borthwick, Sergei Dobroshinsky, Shehzad Qureshi, Stephen Michael Ash, Pedrito Uriah Maynard-Zhang, Chethan Kommaranahalli Rudramuni, Abhishek Sharma, Juliana Saussy, Adam Lawrence Joseph Heinermann, Alaykumar Navinchandra Desai, Mehul A. Shah, Mehul Y. Shah, Anurag Windlass Gupta, Prajakta Datta Damle
  • Patent number: 11514054
    Abstract: Supervised partitioning is used to perform record matching. A request to identify matches between records is received. A graph representation that indicates similarities between the records is partitioned and an evaluation of the partitioning is performed according to a supervised machine learning technique to generate a confidence value in the partitioning. An indication of equivalent records according to the partitioning and the confidence value of the partitioning may be provided.
    Type: Grant
    Filed: September 27, 2018
    Date of Patent: November 29, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Andrew Borthwick, Robert Anthony Barton, Jr., Stephen Michael Ash, Russell Reas
  • Publication number: 20220261413
    Abstract: Specified performance attributes may be used to configure machine learning transformations for ETL jobs. Performance attributes for a machine learning pipeline that applies a model to as part of a transformation for an ETL job may be used to configure a parameter in a stage of the machine learning pipeline. The configured stage may then be used when training the model. The trained machine learning pipeline may then be applied as part of a transformation operation included in an ETL job performed by the ETL system.
    Type: Application
    Filed: March 4, 2022
    Publication date: August 18, 2022
    Applicant: Amazon Technologies, Inc.
    Inventors: Timothy Jones, Andrew Borthwick, Sergei Dobroshinsky, Shehzad Qureshi, Stephen Michael Ash, Pedrito Uriah Maynard-Zhang, Chethan Kommaranahalli Rudramuni, Abhishek Sharma, Juliana Saussy, Adam Lawrence Joseph Heinermann, Alaykumar Navinchandra Desai, Mehul A. Shah, Mehul Y. Shah, Anurag Windlass Gupta, Prajakta Datta Damle
  • Patent number: 11314730
    Abstract: Techniques for memory-efficient streaming count estimation for multisets are described. A method for memory-efficient streaming count estimation for multisets may include obtaining data from a plurality of data sources, and estimating a count for one or more attributes of the data using a telescoping count-min sketch (CMS) data structure, the telescoping CMS including at least a first table and a second table, wherein count values for the data are stored in a plurality of cells of the first table and when a cell of the first table is saturated, the count values for that cell are stored in a corresponding cell of the second table determined based at least on the cell of the first table.
    Type: Grant
    Filed: March 24, 2020
    Date of Patent: April 26, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Andrew Borthwick, Stephen Michael Ash
  • Patent number: 11269911
    Abstract: Specified performance attributes may be used to configure machine learning transformations for ETL jobs. Performance attributes for a machine learning pipeline that applies a model to as part of a transformation for an ETL job may be used to configure a parameter in a stage of the machine learning pipeline. The configured stage may then be used when training the model. The trained machine learning pipeline may then be applied as part of a transformation operation included in an ETL job performed by the ETL system.
    Type: Grant
    Filed: November 23, 2018
    Date of Patent: March 8, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Timothy Jones, Andrew Borthwick, Sergei Dobroshinsky, Shehzad Qureshi, Stephen Michael Ash, Pedrito Uriah Maynard-Zhang, Chethan Kommaranahalli Rudramuni, Abhishek Sharma, Juliana Saussy, Adam Lawrence Joseph Heinermann, Alaykumar Navinchandra Desai, Mehul A. Shah, Mehul Y. Shah, Anurag Windlass Gupta, Prajakta Datta Damle
  • Patent number: 11216701
    Abstract: Techniques for generating record embeddings from structured records are described. A record embeddings generating engine processes structured records to build a token vocabulary. Token embeddings are created for each token in the vocabulary. The token embeddings are trained using a loss function that relates the token embeddings to the record-attribute-data structure of the structured records. A record embedding is assembled from the trained token embeddings.
    Type: Grant
    Filed: April 17, 2018
    Date of Patent: January 4, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Yen Ling Adelene Sim, Andrew Borthwick
  • Patent number: 11113254
    Abstract: Techniques for scaling record linkage via elimination of highly overlapped blocks are described. A method for scaling record linkage via elimination of highly overlapped blocks includes identifying a first plurality of blocks based at least on a plurality of records stored in a storage service of a provider network, identifying a plurality of sets of matching blocks from the first plurality of blocks, deleting the plurality of sets of matching blocks except for a first block from each set from the plurality of sets of matching blocks, and iteratively performing dynamic blocking based at least on the first block to generate subsequent pluralities of blocks until the subsequent pluralities of blocks are below a threshold size.
    Type: Grant
    Filed: September 30, 2019
    Date of Patent: September 7, 2021
    Assignee: Amazon Technologies, Inc.
    Inventors: Andrew Borthwick, Stephen Michael Ash
  • Patent number: 11086940
    Abstract: Techniques for Scalable parallel elimination of approximately subsumed sets are described. A method for Scalable parallel elimination of approximately subsumed sets includes identifying a first plurality of blocks based at least on a plurality of records stored in a storage service of a provider network, determining a plurality of subsumption relationships between blocks from the first plurality of blocks, retaining a first subset of the first plurality of blocks and demoting a second subset of the first plurality of blocks based at least on the plurality of subsumption relationships, and iteratively performing dynamic blocking based at least on the first subset of the plurality of matching blocks and the second subset of the plurality of matching blocks to generate a subsequent pluralities of blocks.
    Type: Grant
    Filed: September 30, 2019
    Date of Patent: August 10, 2021
    Assignee: Amazon Technologies, Inc.
    Inventors: Andrew Borthwick, Stephen Michael Ash
  • Patent number: 10628396
    Abstract: Batch, or “offline”, blocking takes a set of records and generates sets (or blocks, hence the name blocking) of potentially matching records for the entire set. The blocks of potential matches are then passed to a matching process to evaluate which records match. Applications include but are not limited to individual matching such as student identification, householding, business matching, supply chain matching, financial matching, news or text matching, and other applications.
    Type: Grant
    Filed: January 19, 2011
    Date of Patent: April 21, 2020
    Assignee: OPEN INVENTION NETWORK LLC
    Inventors: Andrew Borthwick, Arthur Goldberg, Put Cheung, Adam Winkel
  • Patent number: 10599614
    Abstract: Block size reduction iterations are performed on a plurality of blocks of records until a block size criterion is met. An iteration comprises identifying, from a first collection of blocks, using one or more pivot operations, a set of combinations of oversized blocks such that at least one record belongs to all blocks of a combination. A new block comprising records that are members of each block of a first combination of the set is included in a second collection of blocks to be examined in a subsequent iteration. On at least one block created in an iteration, analysis operations are performed.
    Type: Grant
    Filed: January 2, 2018
    Date of Patent: March 24, 2020
    Assignee: Amazon Technologies, Inc.
    Inventors: Andrew Borthwick, Tianyi Lu, Shehzad Qureshi, Timothy Jones
  • Patent number: 10476224
    Abstract: An optical apparatus comprises an optical resonator defining an optical resonator path for an optical resonator beam and an optical beam expander optically coupled to the optical resonator, the optical beam expander defining an optical beam expander path for an optical beam expander beam. The optical resonator path and the optical beam expander path may be configured such that the optical resonator beam and the optical beam expander beam at least partially intersect. Such an optical apparatus may be configured for use as a laser target designator or a laser range finder.
    Type: Grant
    Filed: June 10, 2016
    Date of Patent: November 12, 2019
    Assignee: THALES HOLDINGS UK PLC
    Inventors: Andrew Borthwick, Stephen Lee
  • Publication number: 20160365698
    Abstract: An optical apparatus comprises an optical resonator defining an optical resonator path for an optical resonator beam and an optical beam expander optically coupled to the optical resonator, the optical beam expander defining an optical beam expander path for an optical beam expander beam. The optical resonator path and the optical beam expander path may be configured such that the optical resonator beam and the optical beam expander beam at least partially intersect. Such an optical apparatus may be configured for use as a laser target designator or a laser range finder.
    Type: Application
    Filed: June 10, 2016
    Publication date: December 15, 2016
    Inventors: Andrew BORTHWICK, Stephen LEE
  • Patent number: 8949158
    Abstract: Record Linkage (RL) is the task of identifying two or more records referring to the same entity (e.g., a person, a company, etc.). RL models can be based on Cost Sensitive Alternating Decision Trees (ADTree), an algorithm that uniquely combines boosting and decision trees algorithms to create shorter and easier-to-interpret linking rules. These models can be naturally trained to operate at industrial precision/recall operating points, and the shorter output rules are so clear that it can effectively explain its decisions to non-technical users via score aggregation or visualization. The models significantly outperform other baselines on the desired industrial operating points, and the improved understanding of the model's decisions led to faster debugging and feature development cycles.
    Type: Grant
    Filed: October 25, 2011
    Date of Patent: February 3, 2015
    Assignee: Intelius Inc.
    Inventors: Andrew Borthwick, Sheng Chen
  • Patent number: 8645399
    Abstract: Dynamic blocking determines which pairs of records in a data set should be examined as potential duplicates. Records are grouped together into blocks by shared properties that are indicators of duplication. Blocks that are too large to be efficiently processed are further subdivided by other properties chosen in a data-driven way. We demonstrate the viability of this algorithm for large data sets. We have scaled this system up to work on billions of records on an 80 node Hadoop cluster.
    Type: Grant
    Filed: January 12, 2012
    Date of Patent: February 4, 2014
    Assignee: Intelius Inc.
    Inventors: William P. McNeill, Andrew Borthwick
  • Patent number: 8633608
    Abstract: An apparatus and method are disclosed for extracting energy from fluid motion. A deformable body adapted to be deformed by fluid motion is arranged to drive at least one energy medium to be delivered to an energy output. The energy medium is controllable in order to control at least one dynamic response of the deformable body.
    Type: Grant
    Filed: February 27, 2009
    Date of Patent: January 21, 2014
    Assignee: AWS Ocean Energy Limited
    Inventors: Simon Grey, Andrew Borthwick
  • Publication number: 20130173560
    Abstract: Dynamic blocking determines which pairs of records in a data set should be examined as potential duplicates. Records are grouped together into blocks by shared properties that are indicators of duplication. Blocks that are too large to be efficiently processed are further subdivided by other properties chosen in a data-driven way. We demonstrate the viability of this algorithm for large data sets. We have scaled this system up to work on billions of records on an 80 node Hadoop cluster.
    Type: Application
    Filed: January 12, 2012
    Publication date: July 4, 2013
    Applicant: INTELIUS INC.
    Inventors: William P. McNeill, Andrew Borthwick
  • Publication number: 20120278263
    Abstract: Record Linkage (RL) is the task of identifying two or more records referring to the same entity (e.g., a person, a company, etc.). RL models can be based on Cost Sensitive Alternating Decision Trees (ADTree), an algorithm that uniquely combines boosting and decision trees algorithms to create shorter and easier-to-interpret linking rules. These models can be naturally trained to operate at industrial precision/recall operating points, and the shorter output rules are so clear that it can effectively explain its decisions to non-technical users via score aggregation or visualization. The models significantly outperform other baselines on the desired industrial operating points, and the improved understanding of the model's decisions led to faster debugging and feature development cycles.
    Type: Application
    Filed: October 25, 2011
    Publication date: November 1, 2012
    Applicant: INTELIUS INC.
    Inventors: Andrew Borthwick, Sheng Chen
  • Patent number: 7899796
    Abstract: Batch, or “offline”, blocking takes a set of records and generates sets (or blocks, hence the name blocking) of potentially matching records for the entire set. The blocks of potential matches are then passed to a matching process to evaluate which records match. Applications include but are not limited to individual matching such as student identification, householding, business matching, supply chain matching, financial matching, news or text matching, and other applications.
    Type: Grant
    Filed: November 23, 2005
    Date of Patent: March 1, 2011
    Inventors: Andrew Borthwick, Arthur Goldberg, Put Cheung, Adam Winkel
  • Publication number: 20110006532
    Abstract: An apparatus and method for extracting energy from fluid motion comprising a deformable body adapted to be deformed by fluid motion, wherein the deformation of the deformable body is arranged to drive at least one energy medium to be delivered to an energy output; and wherein the energy medium is controllable in order to control at least one dynamic response of the deformable body.
    Type: Application
    Filed: February 27, 2009
    Publication date: January 13, 2011
    Inventors: Simon Grey, Andrew Borthwick