Patents by Inventor Yahor Pushkin

Yahor Pushkin has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11861039
    Abstract: Various embodiments of a hierarchical system or method of identifying sensitive content in data is described. In some embodiments, sensitive data classifiers local to a data storage system can analyze a plurality of data items and classify at least some data items as potentially containing sensitive data. The sensitive data classifiers can provide the classified data items to a separate sensitive data discovery component. The sensitive data discovery component can, in some embodiments, obtain the classified data items, perform a sensitive data location analysis on the classified data items to identify a location of sensitive data within some of the classified data items, and generate location information for the sensitive data within the data items containing sensitive data. The sensitive data discovery component can provide to a destination this information, in some embodiments, where the destination might redact, tokenize, highlight, or perform other actions on the located sensitive data.
    Type: Grant
    Filed: September 28, 2020
    Date of Patent: January 2, 2024
    Assignee: Amazon Technologies, Inc.
    Inventors: Yahor Pushkin, Sravan Babu Bodapati, Sunil Mallya Kasaragod, Sameer Karnik, Abhinav Goyal, Yaser Al-Onaizan, Ravindra Manjunatha, Kalpit Dixit, Alok Kumar Parmesh, Syed Kashif Hussain Shah
  • Patent number: 11847406
    Abstract: Techniques for performing natural language processing (NLP) on semi-structured data are described. An exemplary method includes receiving a semi-structured document to perform NLP on using a trained NLP model; converting the semi-structured document into a secondary format, wherein the secondary format includes spatial information for tokens of the semi-structured document; flattening the converted, secondary formatted semi-structured document into a Unicode Transformation Format text file; performing NLP on the Unicode Transformation Format text file using the trained NLP model; and providing a result of the NLP to a requester.
    Type: Grant
    Filed: March 30, 2021
    Date of Patent: December 19, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Sunil Mallya Kasaragod, Yahor Pushkin, Saman Zarandioon, Graham Vintcent Horwood, Miguel Ballesteros Martinez, Yogarshi Paritosh Vyas, Yinxiao Zhang, Diego Marcheggiani, Yaser Al-Onaizan, Xuan Zhu, Liutong Zhou, Yusheng Xie, Aruni Roy Chowdhury, Bo Pang
  • Patent number: 11755536
    Abstract: A data lineage system tracks performance of data flows through different transformations independent of the systems that perform the transformations. A data flow model is maintained as a graph in the data lineage system that is updated by data processors to include performance history of different transformations in the data flow. Subsequent analyses of the data flow model, such as tracing particular data, can be supported using the recorded performance information in the graph of the data flow model.
    Type: Grant
    Filed: January 10, 2020
    Date of Patent: September 12, 2023
    Assignee: Amazon Technologies, Inc.
    Inventor: Yahor Pushkin
  • Patent number: 11741168
    Abstract: Techniques for multi-label document classification are described. Clustering is used to cluster labels in a set. A machine learning model including a multi-label classifier for each cluster is created, the multi-label classifier for a given cluster to classify a document with one or more of the labels in the cluster.
    Type: Grant
    Filed: September 30, 2019
    Date of Patent: August 29, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Sravan Babu Bodapati, Rishita Rajal Anubhai, Yahor Pushkin
  • Patent number: 11734937
    Abstract: Techniques for creating a text classifier machine learning (ML) model are described. According to some embodiments, a language processing service finetunes a language ML model on unlabeled documents of a user, and then trains that finetuned language ML model on labeled documents of the user to be a text classifier that is customized for that user’s domain, e.g., the user’s documents. Additionally, the finetuned language ML model may be trained on labeled documents of the user, for prediction objectives for unlabeled data, before being trained as the text classifier.
    Type: Grant
    Filed: January 2, 2020
    Date of Patent: August 22, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Yahor Pushkin, Sravan Babu Bodapati, Rishita Rajal Anubhai, Dimitrios Soulios, Yaser Al-Onaizan
  • Publication number: 20220100963
    Abstract: Methods, systems, and computer-readable media for event extraction from documents with co-reference are disclosed. An event extraction service identifies one or more trigger groups in a document comprising text. An individual one of the trigger groups comprises one or more textual references to an occurrence of an event. The one or more trigger groups are associated with one or more semantic roles for entities. The event extraction service identifies one or more entity groups in the document. An individual one of the entity groups comprises one or more textual references to a real-world object. The event extraction service assigns one or more of the entity groups to one or more of the semantic roles. The event extraction service generates an output indicating the one or more trigger groups and one or more entity groups assigned to the semantic roles.
    Type: Application
    Filed: September 30, 2020
    Publication date: March 31, 2022
    Applicant: Amazon Technologies, Inc.
    Inventors: Rishita Rajal Anubhai, Yahor Pushkin, Graham Vintcent Horwood, Yinxiao Zhang, Ravindra Manjunatha, Jie Ma, Alessandra Brusadin, Jonathan Steuck, Shuai Wang, Sameer Karnik, Miguel Ballesteros Martinez, Sunil Mallya Kasaragod, Yaser Al-Onaizan
  • Publication number: 20220100772
    Abstract: Methods, systems, and computer-readable media for context-sensitive linking of entities to private databases are disclosed. An entity linking service stores a plurality of representations of entities. Individual ones of the entities correspond to individual ones of a plurality of records in one or more private databases. The entity linking service determines a mention of an entity in a document. The entity linking service selects, from the plurality of records in the one or more private databases, a record corresponding to the entity. The record is selected based at least in part on the plurality of representations of the entities and based at least in part on a context of the mention of the entity in the document. The entity linking service generates output comprising a reference to the selected record in the one or more private databases.
    Type: Application
    Filed: September 30, 2020
    Publication date: March 31, 2022
    Applicant: Amazon Technologies, Inc.
    Inventors: Srikanth Doss Kadarundalagi Raghura, Yogarshi Paritosh Vyas, Miguel Ballesteros Martinez, Yahor Pushkin, Sunil Mallya Kasaragod, Yaser Al-Onaizan, Sameer Karnik, Abhinav Goyal, Graham Vintcent Horwood, Kapil Singh Badesara
  • Publication number: 20220100967
    Abstract: Methods, systems, and computer-readable media for lifecycle management for customized natural language processing are disclosed. A natural language processing (NLP) customization service determines a task definition associated with an NLP model based (at least in part) on user input. The task definition comprises an indication of one or more tasks to be implemented using the NLP model and one or more requirements associated with use of the NLP model. The service determines the NLP model based (at least in part) on the task definition. The service trains the NLP model. The NLP model is used to perform inference for a plurality of input documents. The inference outputs a plurality of predictions based (at least in part) on the input documents. Inference data is collected based (at least in part) on the inference. The service generates a retrained NLP model based (at least in part) on the inference data.
    Type: Application
    Filed: September 30, 2020
    Publication date: March 31, 2022
    Applicant: Amazon Technologies, Inc.
    Inventors: Yahor Pushkin, Rishita Rajal Anubhai, Sameer Karnik, Sunil Mallya Kasaragod, Abhinav Goyal, Yaser Al-Onaizan, Ashish Singh, Ashish Khare
  • Patent number: 10380664
    Abstract: A mobile application uses computer-readable instructions for exchanging, viewing or providing location sharing information in a context of a public group, a private group or both. The location sharing information may be made available to aid or enhance commerce-related activities performed by a merchant, a consumer or both. In another embodiment, a method for authenticating a private group permits an authenticating user to restrict the private group and selectively allow subsequent participants restricted access to the private group.
    Type: Grant
    Filed: May 15, 2014
    Date of Patent: August 13, 2019
    Inventors: Bryan Gardner Trussel, James Stanton, Steve Miller, Craig Link, Yahor Pushkin
  • Publication number: 20160155170
    Abstract: A mobile application uses computer-readable instructions for exchanging, viewing or providing location sharing information in a context of a public group, a private group or both. The location sharing information may be made available to aid or enhance commerce-related activities performed by a merchant, a consumer or both. In another embodiment, a method for authenticating a private group permits an authenticating user to restrict the private group and selectively allow subsequent participants restricted access to the private group.
    Type: Application
    Filed: May 15, 2014
    Publication date: June 2, 2016
    Inventors: Bryan Gardner Trussel, James Stanton, Steve Miller, Craig Link, Yahor Pushkin
  • Publication number: 20150262275
    Abstract: A mobile application uses computer-readable instructions for exchanging, viewing or providing location sharing information in a context of a public group, a private group or both. The location sharing information may be made available to aid or enhance commerce-related activities performed by a merchant, a consumer or both. In another embodiment, a method for authenticating a private group permits an authenticating user to restrict the private group and selectively allow subsequent participants restricted access to the private group.
    Type: Application
    Filed: May 15, 2014
    Publication date: September 17, 2015
    Inventors: Bryan Gardner Trussel, James Stanton, Steve Miller, Craig Link, Yahor Pushkin