Patents by Inventor Archiman Dutta

Archiman Dutta has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11675766
    Abstract: A hierarchical representation of an input data set comprising similarity scores for respective entity pairs is generated iteratively. In a particular iteration, clusters are obtained from a subset of the iteration's input entity pairs which satisfy a similarity criterion, and then spanning trees are generated for at least some of the clusters. An indication of at least a representative pair of one or more of the clusters is added to the hierarchical representation in the iteration. The hierarchical representation is used to respond to clustering requests.
    Type: Grant
    Filed: March 3, 2020
    Date of Patent: June 13, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Xianshun Chen, Kai Liu, Nikhil Anand Navali, Archiman Dutta
  • Patent number: 11625555
    Abstract: Respective labels are generated automatically for a plurality of record pairs, with a label for a given pair indicating a relationship detected between the records of the pair. One or more machine learning models are trained using the labeled record pairs. The trained versions of the models are stored.
    Type: Grant
    Filed: March 12, 2020
    Date of Patent: April 11, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Dmitry Vladimir Zhiyanov, Lichao Wang, Archiman Dutta
  • Patent number: 11514321
    Abstract: Entity record pairs are extracted from a selected cluster of entity records. Attribute value pairs are obtained from the entity record pairs. Labels are assigned to the attribute value pairs based at least in part on entity-level similarity scores of the entity records from which the attribute value pairs were obtained. A machine learning model is trained, using a data set which includes at least some attribute value pairs to which the labels are assigned, to generate attribute similarity scores for pairs of attribute values.
    Type: Grant
    Filed: June 12, 2020
    Date of Patent: November 29, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Xianshun Chen, Zahed Patel, Kai Liu, Nikhil Anand Navali, Archiman Dutta
  • Patent number: 11423072
    Abstract: Respective text feature sets and non-text feature sets are generated corresponding to individual pairs of a plurality of record pairs. At least one text feature is based on whether a text token exists in both records of a pair. Perceptual hash values are used for non-text feature sets. A machine learning model is trained, using the text and non-text feature sets, to generate relationship scores for record pairs. The model includes a text sub-model and a non-text sub-model.
    Type: Grant
    Filed: July 31, 2020
    Date of Patent: August 23, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Xianshun Chen, Lichao Wang, Archiman Dutta
  • Patent number: 10909144
    Abstract: Methods, systems, and computer-readable media for taxonomy generation with automated analysis and auditing are disclosed. A primary classification is determined for a hierarchical taxonomy of items in a marketplace. The primary classification is selected from a plurality of terms describing items in the marketplace, and the primary classification is selected based at least in part on automated analysis of the terms. A plurality of secondary classifications are determined for the hierarchical taxonomy. The secondary classifications are selected from the terms describing the items in the marketplace, and the secondary classifications are selected based at least in part on automated analysis of the terms. The hierarchical taxonomy is modified based at least in part on feedback from a plurality of users. The feedback comprises one or more terms entered by one or more of the users to filter a set of items.
    Type: Grant
    Filed: March 6, 2015
    Date of Patent: February 2, 2021
    Assignee: Amazon Technologies, Inc.
    Inventors: Archiman Dutta, Shoubhik Bhattacharya, Deepak Kumar Nayak, Avik Sinha
  • Patent number: 10783167
    Abstract: Described are techniques for modifying or creating classification data used to automatically classify items in an online marketplace or catalog, based on user interaction data. For one or more classification labels that may be applied to an item, user interaction data indicative of a count of instances that the label was accessed, a length of time during which the label was accessed, counts of instances that parent and child labels were accessed, and counts of instances that the label was accessed via a search query may be determined. Based on the user interaction data, an importance score for the label may be determined. Labels having an importance score greater than or equal to a threshold value may be included in classification data and used for subsequent classification of items. Labels having an importance score less than a threshold may be excluded from the classification data.
    Type: Grant
    Filed: August 23, 2016
    Date of Patent: September 22, 2020
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Archiman Dutta, Meghana Shivanand Rajamane
  • Patent number: 10726060
    Abstract: A technology for determining accuracy estimates for classifications used in an electronic catalog. In one example, classifications for product groupings included in an electronic catalog may be updated as a result of the classifications inaccurately representing products included in the product groupings. The electronic catalog of products may be grouped into a plurality of product groupings using classifications. Classifications of product groupings that inaccurately represent products included in the product grouping may be updated with suggested classifications. Update metrics for updates made to the grouping classifications may be collected and the update metrics may be used to calculate an accuracy estimate for the classifications used in the electronic catalog.
    Type: Grant
    Filed: June 24, 2015
    Date of Patent: July 28, 2020
    Assignee: Amazon Technologies, Inc.
    Inventors: Archiman Dutta, Shoubhik Bhattacharya, Subhadeep Chakraborty, Deepak Kumar Nayak, Nathan Rose, Avik Sinha
  • Patent number: 10565385
    Abstract: Online service providers may operate a rendering service for generating and providing substitute web content information for rendering substitute web content instead of authentic web content. The rendering service may obtain web content information for the authentic web content in response to receiving a request for web content. The rendering service may use the web content information to generate the substitute web content information. The substitute web content information is useable by the computing device to generate substitute web content that includes one or more visual elements resembling resource objects of the authentic web content. The visual elements are rendered, as a result of processing by the computing device, as image content instead of interactive objects.
    Type: Grant
    Filed: August 28, 2017
    Date of Patent: February 18, 2020
    Assignee: Amazon Technologies, Inc.
    Inventors: Lohith Ravi, Archiman Dutta
  • Patent number: 10339470
    Abstract: Techniques are provided herein for utilizing a classification engine to improve a classification model. For example, a classification engine may derive a statistical model based at least in part on a synthetic data set. A misclassification may be determined based at least in part on an output of the statistical model. An audit question may be provided to an individual, the audit question being determined based at least in part on the determined misclassification. Response data related to the audit question may be received. The statistical model may be validated based at least in part on the response data.
    Type: Grant
    Filed: December 11, 2015
    Date of Patent: July 2, 2019
    Assignee: Amazon Technologies, Inc.
    Inventors: Archiman Dutta, Rahul Gupta, Subhadeep Chakraborty, Dhinesh Kumar Dhanasekaran, Deepak Kumar Nayak, Avik Sinha
  • Patent number: 10217080
    Abstract: Methods, systems, and computer-readable media for item classification using customer-visible attributes are disclosed. A plurality of terms are determined that describe a plurality of items in a marketplace. Individual ones of the items are classified in a hierarchical taxonomy comprising a plurality of classifications, and individual ones of the terms correspond to individual ones of the classifications. A description of a new item is received. The description of the new item comprises a plurality of customer-visible terms. One or more of the plurality of classifications in the hierarchical taxonomy are selected for the new item. The one or more classifications are selected for the new item based at least in part on automated matching of individual ones of the customer-visible terms to individual ones of the terms that correspond to individual ones of the classifications.
    Type: Grant
    Filed: March 26, 2015
    Date of Patent: February 26, 2019
    Assignee: Amazon Technologies, Inc.
    Inventor: Archiman Dutta
  • Patent number: 9830344
    Abstract: Disclosed are various embodiments for assessing the quality of a node that comprises a collection of items containing textual data. The homogeneity of the node can be related to its quality. Highly ranked descriptive terms used in the node are identified and quality score is calculated that provides a measure of the quality of the node. Additionally, a node can be examined for outliers to improve node quality.
    Type: Grant
    Filed: February 12, 2015
    Date of Patent: November 28, 2017
    Assignee: AMAZON TECHONOLIGIES, INC.
    Inventor: Archiman Dutta
  • Publication number: 20150161187
    Abstract: Disclosed are various embodiments for assessing the quality of a node that comprises a collection of items containing textual data. The homogeneity of the node can be related to its quality. Highly ranked descriptive terms used in the node are identified and quality score is calculated that provides a measure of the quality of the node. Additionally, a node can be examined for outliers to improve node quality.
    Type: Application
    Filed: February 12, 2015
    Publication date: June 11, 2015
    Inventor: Archiman Dutta
  • Patent number: 8977622
    Abstract: Disclosed are various embodiments for assessing the quality of a node that comprises a collection of items containing textual data. The homogeneity of the node can be related to its quality. Highly ranked descriptive terms used in the node are identified and quality score is calculated that provides a measure of the quality of the node. Additionally, a node can be examined for outliers to improve node quality.
    Type: Grant
    Filed: September 17, 2012
    Date of Patent: March 10, 2015
    Assignee: Amazon Technologies, Inc.
    Inventor: Archiman Dutta