Patents by Inventor Rahul Bhotika

Rahul Bhotika has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20250013621
    Abstract: An example system for parsing and transforming input data that includes processing circuitry and memory, the memory configured to store the input data. The processing circuitry is configured to determine a first delimiter in the input data. The processing circuitry is configured to determine a plurality of second delimiter hypotheses and parse the input data according to the first delimiter and the plurality of second delimiter hypotheses to generate a plurality of tables that are each associated with a respective one of the plurality of second delimiter hypotheses. The processing circuitry is configured to determine a respective consistency score for each of the plurality of tables and select a table from among the plurality of tables based on the respective consistency score associated with the table. The processing circuitry is configured to format the input data based on the selected table to generate formatted data and output the formatted data.
    Type: Application
    Filed: July 6, 2023
    Publication date: January 9, 2025
    Inventors: Sanjay Kumar Singh, Subhasis Jethy, Udit Saini, Ranju Das, Vasant Manohar, Rahul Bhotika, Carlos Morato
  • Publication number: 20240411732
    Abstract: Various embodiments of the present disclosure provide machine learning techniques for transforming disparate, third-party datasets to canonical representations. The techniques include generating, using a machine learning prediction model, a canonical representation for an input dataset. The machine learning prediction model is previously trained using permutative input embeddings for a training dataset based on canonical data entity features, such that each permutative input embedding corresponds to a different sequence of the canonical data entity features. The permutative input embeddings are leveraged to generate a latent representation for the training dataset. The latent representation is combined with a canonical data map to generate an alignment vector, which is refined to generate an output vector for the input dataset. The machine learning prediction model is trained using a model loss generated based on a comparison of the output vector with a corresponding labeled vector.
    Type: Application
    Filed: June 8, 2023
    Publication date: December 12, 2024
    Inventors: Sanjay Kumar SINGH, Subhasis JETHY, Udit SAINI, Carlos W. MORATO, Rahul BHOTIKA, Ranju DAS, Vasant MANOHAR
  • Publication number: 20240394526
    Abstract: Various embodiments of the present disclosure provide methods, apparatus, systems, computing devices, computing entities, and/or the like for disambiguating data fields mapped to a plurality of data tables according to a common data model by generating disambiguation embeddings based on a matrix representation of the common data model and one or more logical data type weights, generating a plurality of input embedding vectors for one or more prediction inputs based on the disambiguation embeddings, generating a plurality of prediction vectors based on the plurality of input embedding vectors, and assigning one or more select data fields to respective one or more candidate data tables based on the plurality of prediction vectors.
    Type: Application
    Filed: May 23, 2023
    Publication date: November 28, 2024
    Inventors: Sanjay Kumar Singh, Subhasis Jethy, Udit Saini, Ranju Das, Vasant Manohar, Rahul Bhotika, Carlos Morato
  • Patent number: 12147878
    Abstract: Techniques for feedback-based training may include selecting a scoring machine learning model based at least in part on a test metric, and applying the model on an unlabeled dataset to generate, per dataset item of the unlabeled dataset, a prediction and an importance ranking score for the prediction. Techniques for feedback-based training may further include selecting, based on the importance ranking scores, a result of the application of the model on the unlabeled dataset, providing the result and requesting feedback on the result via a graphical user interface, receiving the feedback via the graphical user interface, adding data from the unlabeled dataset into a training dataset when the feedback indicates a verified result, and retraining the model using the training dataset with the data added from the unlabeled dataset to generate a retrained model.
    Type: Grant
    Filed: November 27, 2020
    Date of Patent: November 19, 2024
    Assignee: Amazon Technologies, Inc.
    Inventors: Barath Balasubramanian, Rahul Bhotika, Niels Brouwers, Ranju Das, Prakash Krishnan, Shaun Ryan James McDowell, Anushri Mainthia, Rakesh Madhavan Nambiar, Anant Patel, Avinash Aghoram Ravichandran, Joaquin Zepeda Salvatierra, Gurumurthy Swaminathan
  • Patent number: 11983243
    Abstract: Techniques for anomaly detection are described. An exemplary method includes receiving one or more requests to train an anomaly detection machine learning model using feedback-based training, the request to indicate one or more of a type of analysis to perform, a model selection indication, and a configuration for a training dataset; training the anomaly detection machine learning model according to the one or more requests using the training data; performing feedback-based training on the trained anomaly detection machine learning model; and using the retrained anomaly detection machine learning model.
    Type: Grant
    Filed: November 27, 2020
    Date of Patent: May 14, 2024
    Assignee: Amazon Technologies, Inc.
    Inventors: Barath Balasubramanian, Rahul Bhotika, Niels Brouwers, Ranju Das, Prakash Krishnan, Shaun Ryan James Mcdowell, Anushri Mainthia, Rakesh Madhavan Nambiar, Anant Patel, Avinash Aghoram Ravichandran, Joaquin Zepeda Salvatierra, Gurumurthy Swaminathan
  • Patent number: 11557069
    Abstract: A system and method for estimating vascular flow using CT imaging include a computer readable storage medium having stored thereon a computer program comprising instructions, which, when executed by a computer, cause the computer to acquire a first set of data comprising anatomical information of an imaging subject, the anatomical information comprises information of at least one vessel. The instructions further cause the computer to process the anatomical information to generate an image volume comprising the at least one vessel, generate hemodynamic information based on the image volume, and acquire a second set of data of the imaging subject. The computer is also caused to generate an image comprising the hemodynamic information in combination with a visualization based on the second set of data.
    Type: Grant
    Filed: December 17, 2018
    Date of Patent: January 17, 2023
    Assignee: GE Precision Healthcare LLC
    Inventors: Robert F. Senzig, Ravikanth Avancha, Bijan Dorri, Sandeep Dutta, Steven J. Gray, Jiang Hsieh, John Irvin Jackson, Giridhar Jothiprasad, Paul Edgar Licato, Darin Robert Okerlund, Toshihiro Rifu, Saad Ahmed Sirohey, Basel Taha, Peter Michael Edic, Jerome Knoplioch, Rahul Bhotika
  • Patent number: 11481683
    Abstract: Techniques for creating machine learning models for direct homography regression for image rectification are described. In certain embodiments, a training service trains an algorithm on a source view of a training image and a homography matrix of the training image into a machine learning model that generates a normalized homography matrix for an input of the source view. The normalized homography matrix may then be utilized to generate a target view of an image input into the machine learning model. The target view of the image may be used in a document processing pipeline for document images captured using cameras.
    Type: Grant
    Filed: May 29, 2020
    Date of Patent: October 25, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Kunwar Yashraj Singh, Joaquin Zepeda Salvatierra, Erhan Bas, Vijay Mahadevan, Jonathan Wu, Rahul Bhotika
  • Patent number: 11429813
    Abstract: This disclosure describes automatically selecting and training one or more models for image recognition based upon training and testing (validation) data provided by a user. A service provider network includes a recognition service that may use models to process images and videos to recognize objects in the images and videos, features on the objects in the images and videos, and/or locate objects in the images and videos. The service provider network also includes a model selection and training service that may select one or more modeling techniques based on the objectives of the user and/or the amount of data provided by the user. Based on the selected modeling technique, the model selection and training service selects and trains one or more models for use by the recognition service to process images and videos using the training data. The trained model may be tested and validated using the testing data.
    Type: Grant
    Filed: November 27, 2019
    Date of Patent: August 30, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Avinash Aghoram Ravichandran, Rahul Bhotika, Stefano Soatto, Pietro Perona, Hao Yang
  • Patent number: 11423076
    Abstract: Various approaches discussed herein enable browsing groups of visually similar items to an item of interest, wherein the item of interest may be identified in a query image, for example. One or more visual attributes associated with the item of interest are identified, and the visually similar items matching at least one of the visual attributes are grouped together, wherein the group is ranked according to the visually similar items' overall visual similarity to the item of interest, for example by using a visual similarity score and/or metric.
    Type: Grant
    Filed: April 8, 2019
    Date of Patent: August 23, 2022
    Assignee: A9.COM, INC.
    Inventors: Rahul Bhotika, Lixin Duan, Oleg Rybakov, Jian Dong
  • Patent number: 11423265
    Abstract: Methods, systems, and computer-readable media for content moderation using object detection and image classification are disclosed. A content moderation system performs object detection on an input image using one or more object detectors. The object detection finds one or more elements in the input image. The content moderation system performs classification based at least in part on the input image using one or more image classifiers. The classification determines one or more values indicative of one or more content types in the input image. The content moderation system determines one or more scores for one or more content labels corresponding to the one or more content types. At least one of the scores represents a finding of one or more of the content types in the input image. The content moderation system generates output indicating the finding of the one or more of the content types.
    Type: Grant
    Filed: June 30, 2020
    Date of Patent: August 23, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Hao Chen, Hao Wu, Hao Li, Michael Quang Thai Lam, Xinyu Li, Kaustav Kundu, Meng Wang, Joseph P Tighe, Rahul Bhotika
  • Publication number: 20220172100
    Abstract: Techniques for feedback-based training are described.
    Type: Application
    Filed: November 27, 2020
    Publication date: June 2, 2022
    Inventors: Barath BALASUBRAMANIAN, Rahul BHOTIKA, Niels BROUWERS, Ranju DAS, Prakash KRISHNAN, Shaun Ryan James MCDOWELL, Anushri MAINTHIA, Rakesh Madhavan NAMBIAR, Anant PATEL, Avinash AGHORAM RAVICHANDRAN, Joaquin ZEPEDA SALVATIERRA, Gurumurthy SWAMINATHAN
  • Publication number: 20220171995
    Abstract: Techniques for anomaly detection are described. An exemplary method includes receiving one or more requests to train an anomaly detection machine learning model using feedback-based training, the request to indicate one or more of a type of analysis to perform, a model selection indication, and a configuration for a training dataset; training the anomaly detection machine learning model according to the one or more requests using the training data; performing feedback-based training on the trained anomaly detection machine learning model; and using the retrained anomaly detection machine learning model.
    Type: Application
    Filed: November 27, 2020
    Publication date: June 2, 2022
    Inventors: Barath BALASUBRAMANIAN, Rahul BHOTIKA, Niels BROUWERS, Ranju DAS, Prakash KRISHNAN, Shaun Ryan James MCDOWELL, Anushri MAINTHIA, Rakesh Madhavan NAMBIAR, Anant PATEL, Avinash AGHORAM RAVICHANDRAN, Joaquin ZEPEDA SALVATIERRA, Gurumurthy SWAMINATHAN
  • Patent number: 11341605
    Abstract: Techniques for document rectification via homography recovery using machine learning are described. An image rectification system can intelligently make use of multiple pipelines for rectifying document images based on the detected type of device that generated the images. The image rectification system can provide high-quality rectifications without requiring human cooperation, multiple views of the document in multiple images, and/or without being constrained to only be able to process images from one source context.
    Type: Grant
    Filed: September 30, 2019
    Date of Patent: May 24, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Kunwar Yashraj Singh, Amit Adam, Shahar Tsiper, Gal Sabina Star, Roee Litman, Hadar Averbuch Elor, Vijay Mahadevan, Rahul Bhotika, Shai Mazor, Mohammed El Hamalawi
  • Patent number: 11257006
    Abstract: Techniques for auto-generation of annotated real-world training data are described. An electronic document is analyzed to determine text represented in the document and corresponding locations of the text. A representation of the electronic document is modified to include markers and printed. The printed document is photographed in real-world environments, and the markers within the digital photographs are analyzed to allow for the depiction of the document within the photographs to be rectified. The text and location data are used to annotate the rectified images.
    Type: Grant
    Filed: November 20, 2018
    Date of Patent: February 22, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Oron Anschel, Amit Adam, Shahar Tsiper, Hadar Averbuch Elor, Shai Mazor, Rahul Bhotika, Stefano Soatto
  • Patent number: 10970530
    Abstract: Techniques for grammar-based automated generation of annotated synthetic form training data for machine learning are described. A training data generation engine utilizes a defined grammar to construct a layout for a form, select key-value units to place within the layout, and select attribute variants for the key-value units. The form is rendered and stored at a storage location, where it can be provided along with other similarly-generated forms to be used as training data for a machine learning model.
    Type: Grant
    Filed: November 13, 2018
    Date of Patent: April 6, 2021
    Assignee: Amazon Technologies, Inc.
    Inventors: Amit Adam, Oron Anschel, Or Perel, Gal Sabina Star, Omri Ben-Eliezer, Hadar Averbuch Elor, Shai Mazor, Wendy Tse, Andrea Olgiati, Rahul Bhotika, Stefano Soatto
  • Patent number: 10963754
    Abstract: Techniques for training an embedding using a limited training set are described. In some examples, the embedding is trained by generating a plurality of vectors from a random sample of the limited set of training data classes using a layer of the particular machine learning classification model, randomly selecting samples from the plurality of vectors into a set of samples, computing at least one distance for each sampled class from a center parameter for the class using the set of samples, generating a discrete probability distribution over the classes for a query point based on distances to a center parameter for each of the classes in the embedding space, calculating a loss value for the modified prototypical network, the calculation of the loss value being for a fixed geometry of the embedding space and including a measure of the difference between distributions, and back propagating.
    Type: Grant
    Filed: September 27, 2018
    Date of Patent: March 30, 2021
    Assignee: Amazon Technologies, Inc.
    Inventors: Avinash Aghoram Ravichandran, Paulo Ricardo dos Santos Mendonca, Rahul Bhotika, Stefano Soatto
  • Patent number: 10949661
    Abstract: Techniques for layout-agnostic complex document processing are described. A document processing service can analyze documents that do not adhere to defined layout rules in an automated manner to determine the content and meaning of a variety of types of segments within the documents. The service may chunk a document into multiple chunks, and operate upon the chunks in parallel by identifying segments within each chunk, classifying the segments into segment types, and processing the segments using special-purpose analysis engines adapted for the analysis of particular segment types to generate results that can be aggregated into an overall output for the entire document that captures the meaning and context of the document text.
    Type: Grant
    Filed: November 21, 2018
    Date of Patent: March 16, 2021
    Assignee: Amazon Technologies, Inc.
    Inventors: Rahul Bhotika, Shai Mazor, Amit Adam, Wendy Tse, Andrea Olgiati, Bhavesh Doshi, Gururaj Kosuru, Patrick Ian Wilson, Umar Farooq, Anand Dhandhania
  • Patent number: 10878234
    Abstract: Techniques for automated form understanding via layout-agnostic identification of keys and corresponding values are described. An embedding generator creates embeddings of pixels from an image including a representation of a form. The generated embeddings are similar for pixels within a same key-value unit, and far apart for pixels not in a same key-value unit. A weighted bipartite graph is constructed including a first set of nodes corresponding to keys of the form and a second set of nodes corresponding to values of the form. Weights for the edges are determined based on an analysis of distances between ones of the embeddings. The graph is partitioned according to a scheme to identify pairings between the first set of nodes and the second set of nodes that produces a minimum overall edge weight. The pairings indicate keys and values that are associated within the form.
    Type: Grant
    Filed: November 20, 2018
    Date of Patent: December 29, 2020
    Assignee: Amazon Technologies, Inc.
    Inventors: Amit Adam, Oron Anschel, Hadar Averbuch Elor, Shai Mazor, Gal Sabina Star, Or Perel, Wendy Tse, Andrea Olgiati, Rahul Bhotika, Stefano Soatto
  • Patent number: 10878270
    Abstract: Techniques for keypoint-based multi-label word segmentation and localization are described. A machine learning model identifies bounding regions of text within an image, and then generates multiple channel matrices representing predicted keypoints of the text within the bounding regions. The keypoints can be used to rectify the corresponding graphical content from the image including the text to improve the ability to perform optical character recognition and identify the text. Line and word segmentation and localization can be performed together.
    Type: Grant
    Filed: June 26, 2018
    Date of Patent: December 29, 2020
    Assignee: Amazon Technologies, Inc.
    Inventors: Song Cao, Hao Wu, Jonathan Wu, Meng Wang, Rahul Bhotika
  • Patent number: 10872236
    Abstract: Techniques for layout-agnostic clustering-based classification of document keys and values are described. A key-value differentiation unit generates feature vectors corresponding to text elements of a form represented within an electronic image using a machine learning (ML) model. The ML model was trained utilizing a loss function that separates keys from values. The feature vectors are clustered into at least two clusters, and a cluster is determined to include either keys of the form or values of the form via identifying neighbors between feature vectors of the cluster(s) with labeled feature vectors.
    Type: Grant
    Filed: September 28, 2018
    Date of Patent: December 22, 2020
    Assignee: Amazon Technologies, Inc.
    Inventors: Hadar Averbuch Elor, Oron Anschel, Or Perel, Amit Adam, Shai Mazor, Rahul Bhotika, Stefano Soatto