Patents by Inventor Stephen Michael Ash

Stephen Michael Ash has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11941016
    Abstract: Specified performance attributes may be used to configure machine learning transformations for ETL jobs. Performance attributes for a machine learning pipeline that applies a model to as part of a transformation for an ETL job may be used to configure a parameter in a stage of the machine learning pipeline. The configured stage may then be used when training the model. The trained machine learning pipeline may then be applied as part of a transformation operation included in an ETL job performed by the ETL system.
    Type: Grant
    Filed: March 4, 2022
    Date of Patent: March 26, 2024
    Assignee: Amazon Technologies, Inc.
    Inventors: Timothy Jones, Andrew Borthwick, Sergei Dobroshinsky, Shehzad Qureshi, Stephen Michael Ash, Pedrito Uriah Maynard-Zhang, Chethan Kommaranahalli Rudramuni, Abhishek Sharma, Juliana Saussy, Adam Lawrence Joseph Heinermann, Alaykumar Navinchandra Desai, Mehul A. Shah, Mehul Y. Shah, Anurag Windlass Gupta, Prajakta Datta Damle
  • Publication number: 20230325384
    Abstract: Interactive assistances for executing natural language queries to data sets may be performed. A natural language query may be received. Candidate entity linkages may be determined between an entity recognized in the natural language query and columns in data sets. The candidate linkages may be ranked according to confidence scores which may be evaluated to detect ambiguity for an entity linkage. Candidate entity linkages may be provided to a user via an interface to select an entity linkage to use as part of completing the natural language query.
    Type: Application
    Filed: March 10, 2023
    Publication date: October 12, 2023
    Applicant: Amazon Technologies, Inc.
    Inventors: Ramesh M Nallapati, Zhiguo Wang, Bing Xiang, Patrick Ng, Yung Haw Wang, Mukul Karnik, Nanyan Li, Sharanabasappa Parashuram Revadigar, Timothy Jones, Stephen Michael Ash, Sudipta Sengupta, Gregory David Adams, Deepak Shantha Murthy, Douglas Scott Cerny, Stephanie Weeks, Hanbo Li
  • Patent number: 11726997
    Abstract: Multiple stage filtering may be implemented for natural language query processing pipelines. Natural language queries may be received at a natural language query processing system and processed through a query language processing pipeline. The query language processing pipeline may filter candidate linkages for a natural language query before performing further filtering of the candidate linkages in the natural language query processing pipeline as part of generating an intermediate representation used to execute the natural language query.
    Type: Grant
    Filed: November 14, 2022
    Date of Patent: August 15, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Jun Wang, Zhiguo Wang, Sharanabasappa Parashuram Revadigar, Ramesh M Nallapati, Bing Xiang, Stephen Michael Ash, Timothy Jones, Sudipta Sengupta, Rishav Chakravarti, Patrick Ng, Jiarong Jiang, Hanbo Li, Donald Harold Rivers Weidner
  • Publication number: 20230078177
    Abstract: Multiple stage filtering may be implemented for natural language query processing pipelines. Natural language queries may be received at a natural language query processing system and processed through a query language processing pipeline. The query language processing pipeline may filter candidate linkages for a natural language query before performing further filtering of the candidate linkages in the natural language query processing pipeline as part of generating an intermediate representation used to execute the natural language query.
    Type: Application
    Filed: November 14, 2022
    Publication date: March 16, 2023
    Applicant: Amazon Technologies, Inc.
    Inventors: Jun Wang, Zhiguo Wang, Sharanabasappa Parashuram Revadigar, Ramesh M Nallapati, Bing Xiang, Stephen Michael Ash, Timothy Jones, Sudipta Sengupta, Rishav Chakravarti, Patrick Ng, Jiarong Jiang, Hanbo Li, Donald Harold Rivers Weidner
  • Patent number: 11604794
    Abstract: Interactive assistances for executing natural language queries to data sets may be performed. A natural language query may be received. Candidate entity linkages may be determined between an entity recognized in the natural language query and columns in data sets. The candidate linkages may be ranked according to confidence scores which may be evaluated to detect ambiguity for an entity linkage. Candidate entity linkages may be provided to a user via an interface to select an entity linkage to use as part of completing the natural language query.
    Type: Grant
    Filed: March 31, 2021
    Date of Patent: March 14, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Ramesh M Nallapati, Zhiguo Wang, Bing Xiang, Patrick Ng, Yung Haw Wang, Mukul Karnik, Nanyan Li, Sharanabasappa Parashuram Revadigar, Timothy Jones, Stephen Michael Ash, Sudipta Sengupta, Gregory David Adams, Deepak Shantha Murthy, Douglas Scott Cerny, Stephanie Weeks, Hanbo Li
  • Patent number: 11514054
    Abstract: Supervised partitioning is used to perform record matching. A request to identify matches between records is received. A graph representation that indicates similarities between the records is partitioned and an evaluation of the partitioning is performed according to a supervised machine learning technique to generate a confidence value in the partitioning. An indication of equivalent records according to the partitioning and the confidence value of the partitioning may be provided.
    Type: Grant
    Filed: September 27, 2018
    Date of Patent: November 29, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Andrew Borthwick, Robert Anthony Barton, Jr., Stephen Michael Ash, Russell Reas
  • Patent number: 11500865
    Abstract: Multiple stage filtering may be implemented for natural language query processing pipelines. Natural language queries may be received at a natural language query processing system and processed through a query language processing pipeline. The query language processing pipeline may filter candidate linkages for a natural language query before performing further filtering of the candidate linkages in the natural language query processing pipeline as part of generating an intermediate representation used to execute the natural language query.
    Type: Grant
    Filed: March 31, 2021
    Date of Patent: November 15, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Jun Wang, Zhiguo Wang, Sharanabasappa Parashuram Revadigar, Ramesh M Nallapati, Bing Xiang, Stephen Michael Ash, Timothy Jones, Sudipta Sengupta, Rishav Chakravarti, Patrick Ng, Jiarong Jiang, Hanbo Li, Donald Harold Rivers Weidner
  • Publication number: 20220261413
    Abstract: Specified performance attributes may be used to configure machine learning transformations for ETL jobs. Performance attributes for a machine learning pipeline that applies a model to as part of a transformation for an ETL job may be used to configure a parameter in a stage of the machine learning pipeline. The configured stage may then be used when training the model. The trained machine learning pipeline may then be applied as part of a transformation operation included in an ETL job performed by the ETL system.
    Type: Application
    Filed: March 4, 2022
    Publication date: August 18, 2022
    Applicant: Amazon Technologies, Inc.
    Inventors: Timothy Jones, Andrew Borthwick, Sergei Dobroshinsky, Shehzad Qureshi, Stephen Michael Ash, Pedrito Uriah Maynard-Zhang, Chethan Kommaranahalli Rudramuni, Abhishek Sharma, Juliana Saussy, Adam Lawrence Joseph Heinermann, Alaykumar Navinchandra Desai, Mehul A. Shah, Mehul Y. Shah, Anurag Windlass Gupta, Prajakta Datta Damle
  • Patent number: 11314730
    Abstract: Techniques for memory-efficient streaming count estimation for multisets are described. A method for memory-efficient streaming count estimation for multisets may include obtaining data from a plurality of data sources, and estimating a count for one or more attributes of the data using a telescoping count-min sketch (CMS) data structure, the telescoping CMS including at least a first table and a second table, wherein count values for the data are stored in a plurality of cells of the first table and when a cell of the first table is saturated, the count values for that cell are stored in a corresponding cell of the second table determined based at least on the cell of the first table.
    Type: Grant
    Filed: March 24, 2020
    Date of Patent: April 26, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Andrew Borthwick, Stephen Michael Ash
  • Patent number: 11269911
    Abstract: Specified performance attributes may be used to configure machine learning transformations for ETL jobs. Performance attributes for a machine learning pipeline that applies a model to as part of a transformation for an ETL job may be used to configure a parameter in a stage of the machine learning pipeline. The configured stage may then be used when training the model. The trained machine learning pipeline may then be applied as part of a transformation operation included in an ETL job performed by the ETL system.
    Type: Grant
    Filed: November 23, 2018
    Date of Patent: March 8, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Timothy Jones, Andrew Borthwick, Sergei Dobroshinsky, Shehzad Qureshi, Stephen Michael Ash, Pedrito Uriah Maynard-Zhang, Chethan Kommaranahalli Rudramuni, Abhishek Sharma, Juliana Saussy, Adam Lawrence Joseph Heinermann, Alaykumar Navinchandra Desai, Mehul A. Shah, Mehul Y. Shah, Anurag Windlass Gupta, Prajakta Datta Damle
  • Patent number: 11120064
    Abstract: A data records service is configured to receive original data records and, in parallel, generate a transliterated version of the original data record into a phonetic based language. Individual fields of data records can be transliterated by identifying a primary language, generating language specific tokens for individual text portions, and transliterating the token. The records processing service can then execute matching models on both original data records and transliterated data records to detect matching data records.
    Type: Grant
    Filed: November 20, 2018
    Date of Patent: September 14, 2021
    Assignee: Amazon Technologies, Inc.
    Inventor: Stephen Michael Ash
  • Patent number: 11113254
    Abstract: Techniques for scaling record linkage via elimination of highly overlapped blocks are described. A method for scaling record linkage via elimination of highly overlapped blocks includes identifying a first plurality of blocks based at least on a plurality of records stored in a storage service of a provider network, identifying a plurality of sets of matching blocks from the first plurality of blocks, deleting the plurality of sets of matching blocks except for a first block from each set from the plurality of sets of matching blocks, and iteratively performing dynamic blocking based at least on the first block to generate subsequent pluralities of blocks until the subsequent pluralities of blocks are below a threshold size.
    Type: Grant
    Filed: September 30, 2019
    Date of Patent: September 7, 2021
    Assignee: Amazon Technologies, Inc.
    Inventors: Andrew Borthwick, Stephen Michael Ash
  • Patent number: 11086940
    Abstract: Techniques for Scalable parallel elimination of approximately subsumed sets are described. A method for Scalable parallel elimination of approximately subsumed sets includes identifying a first plurality of blocks based at least on a plurality of records stored in a storage service of a provider network, determining a plurality of subsumption relationships between blocks from the first plurality of blocks, retaining a first subset of the first plurality of blocks and demoting a second subset of the first plurality of blocks based at least on the plurality of subsumption relationships, and iteratively performing dynamic blocking based at least on the first subset of the plurality of matching blocks and the second subset of the plurality of matching blocks to generate a subsequent pluralities of blocks.
    Type: Grant
    Filed: September 30, 2019
    Date of Patent: August 10, 2021
    Assignee: Amazon Technologies, Inc.
    Inventors: Andrew Borthwick, Stephen Michael Ash
  • Publication number: 20200159857
    Abstract: A data records service is configured to receive original data records and, in parallel, generate a transliterated version of the original data record into a phonetic based language. Individual fields of data records can be transliterated by identifying a primary language, generating language specific tokens for individual text portions, and transliterating the token. The records processing service can then execute matching models on both original data records and transliterated data records to detect matching data records.
    Type: Application
    Filed: November 20, 2018
    Publication date: May 21, 2020
    Inventor: Stephen Michael Ash