Patents by Inventor MILAN AGGARWAL

MILAN AGGARWAL has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 12190061
    Abstract: Systems and methods for topic modeling are described. The systems and methods include encoding words of a document using an embedding matrix to obtain word embeddings for the document. The words of the document comprise a subset of words in a vocabulary, and the embedding matrix is trained as part of a topic attention network based on a plurality of topics. The systems and methods further include encoding a topic-word distribution matrix using the embedding matrix to obtain a topic embedding matrix. The topic-word distribution matrix represents relationships between the plurality of topics and the words of the vocabulary. The systems and methods further include computing a topic context matrix based on the topic embedding matrix and the word embeddings and identifying a topic for the document based on the topic context matrix.
    Type: Grant
    Filed: December 17, 2021
    Date of Patent: January 7, 2025
    Assignee: ADOBE INC.
    Inventors: Shashank Shailabh, Madhur Panwar, Milan Aggarwal, Pinkesh Badjatiya, Simra Shahid, Nikaash Puri, S Sejal Naidu, Sharat Chandra Racha, Balaji Krishnamurthy, Ganesh Karbhari Palwe
  • Publication number: 20250005048
    Abstract: Embodiments are disclosed for one-shot document snippet search. A method of one-shot document snippet search may include obtaining a query snippet and a target document. A multi-modal snippet detection model combines first multi-modal features from the query snippet and second multi-modal features from the target document to create a feature volume. The multi-modal snippet detection model identifies one or more matching snippets from the target document based on the feature volume.
    Type: Application
    Filed: June 30, 2023
    Publication date: January 2, 2025
    Applicant: Adobe Inc.
    Inventors: Abhinav JAVA, Surgan JANDIAL, Shripad DESHMUKH, Milan AGGARWAL, Mausoom SARKAR, Balaji KRISHNAMURTHY, Arneh JAIN
  • Publication number: 20240386315
    Abstract: Methods and systems are provided for a transformer model for journey simulation and prediction. In embodiments described herein, training data is obtained from stored journeys. The training data for each journey indicates customer interactions with each event in the sequence of events of the journey. A machine learning model is trained using the training data to simulate customer interaction with an input journey. The trained machine learning model then generates a simulation of customer interaction with an input journey and the results of the simulation are displayed.
    Type: Application
    Filed: May 16, 2023
    Publication date: November 21, 2024
    Inventors: Thomas BOUCHER, Tanay ANAND, Stephane LECERCLE, Saurabh GARG, Pranjal PRASOON, Nikaash PURI, Mukul LAMBA, Milan AGGARWAL, Jayakumar SUBRAMANIAN, Francoise CORVAISIER, David MENDEZ ACUNA, Camel AISSANI, Balaji KRISHNAMURTHY
  • Publication number: 20240362941
    Abstract: A corrective noise system receives an electronic version of a fillable form generated by a segmentation network and receives a correction to a segmentation error in the electronic version of the fillable form. The corrective noise system is trained to generate noise that represents the correction and superimpose the noise on the fillable form. The corrective noise system is further trained to identify regions in a corpus of forms that are semantically similar to a region that was subject to the correction. The generated noise is propagated to the semantically similar regions in the corpus of forms and the noisy corpus of forms is provided as input to the segmentation network. The noise causes the segmentation network to accurately identify fillable regions in the corpus of forms and output a segmented version of the corpus of forms having improved fidelity without retraining or otherwise modifying the segmentation network.
    Type: Application
    Filed: April 27, 2023
    Publication date: October 31, 2024
    Applicant: Adobe Inc.
    Inventors: Silky Singh, Surgan Jandial, Shripad Vilasrao Deshmukh, Milan Aggarwal, Mausoom Sarkar, Balaji Krishnamurthy, Arneh Jain, Abhinav Java
  • Publication number: 20240355020
    Abstract: In implementations of systems for digital content analysis, a computing device implements an analysis system to extract a first content component and a second content component from digital content to be analyzed based on content metrics. The analysis system generates first embeddings using a first machine learning model and second embedding using a second machine learning model. The first embeddings and the second embeddings are combined as concatenated embeddings. The analysis system generates an indication of a content metric for display in a user interface using a third machine learning model based on the concatenated embeddings.
    Type: Application
    Filed: April 21, 2023
    Publication date: October 24, 2024
    Applicant: Adobe Inc.
    Inventors: Yaman Kumar, Somesh Singh, Seoyoung Park, Pranjal Prasoon, Nithyakala Sainath, Nisarg Shailesh Joshi, Nikitha Srikanth, Nikaash Puri, Milan Aggarwal, Jayakumar Subramanian, Ganesh Palwe, Balaji Krishnamurthy, Matthew William Rozen, Mihir Naware, Hyman Chung
  • Patent number: 12124497
    Abstract: Form structure similarity detection techniques are described. A content processing system, for instance, receives a query snippet that depicts a query form structure. The content processing system generates a query layout string that includes semantic indicators to represent the query form structure and generates candidate layout strings that represent form structures from a target document. The content processing system calculates similarity scores between the query layout string and the candidate layout strings. Based on the similarity scores, the content processing system generates a target snippet for display that depicts a form structure that is structurally similar to the query form structure. The content processing system is further operable to generate a training dataset that includes image pairs of snippets depicting form structures that are structurally similar.
    Type: Grant
    Filed: March 27, 2023
    Date of Patent: October 22, 2024
    Assignee: Adobe Inc.
    Inventors: Abhinav Java, Surgan Jandial, Shripad Vilasrao Deshmukh, Milan Aggarwal, Mausoom Sarkar, Balaji Krishnamurthy, Arneh Jain
  • Publication number: 20240330351
    Abstract: Form structure similarity detection techniques are described. A content processing system, for instance, receives a query snippet that depicts a query form structure. The content processing system generates a query layout string that includes semantic indicators to represent the query form structure and generates candidate layout strings that represent form structures from a target document. The content processing system calculates similarity scores between the query layout string and the candidate layout strings. Based on the similarity scores, the content processing system generates a target snippet for display that depicts a form structure that is structurally similar to the query form structure. The content processing system is further operable to generate a training dataset that includes image pairs of snippets depicting form structures that are structurally similar.
    Type: Application
    Filed: March 27, 2023
    Publication date: October 3, 2024
    Applicant: Adobe Inc.
    Inventors: Abhinav Java, Surgan Jandial, Shripad Vilasrao Deshmukh, Milan Aggarwal, Mausoom Sarkar, Balaji Krishnamurthy, Arneh Jain
  • Patent number: 12086728
    Abstract: Techniques described herein extract form structures from a static form to facilitate making that static form reflowable. A method described herein includes accessing low-level form elements extracted from a static form. The method includes determining, using a first set of prediction models, second-level form elements based on the low-level form elements. Each second-level form element includes a respective one or more low-level form elements. The method further includes determining, using a second set of prediction models, high-level form elements based on the second-level form elements and the low-level form elements. Each high-level form element includes a respective one or more second-level form elements or low-level form elements. The method further includes generating a reflowable form based on the static form by, for each high-level form element, linking together the respective one or more second-level form elements or low-level form elements.
    Type: Grant
    Filed: April 18, 2023
    Date of Patent: September 10, 2024
    Assignee: Adobe Inc.
    Inventors: Milan Aggarwal, Mausoom Sarkar, Balaji Krishnamurthy
  • Patent number: 11997056
    Abstract: The technology described herein receives a natural-language sequence of words comprising multiple entities. The technology then identifies a plurality of entities in the natural-language sequence. The technology generates a masked natural-language sequence by masking a first entity in the natural-language sequence. The technology retrieves, from a knowledge base, information related to a second entity in the plurality of entities. The technology then trains a natural-language model to respond to a query. The training uses a first representation of the masked natural-language sequence, a second representation of the information, and the first entity.
    Type: Grant
    Filed: August 29, 2022
    Date of Patent: May 28, 2024
    Assignee: ADOBE INC.
    Inventors: Sumit Bhatia, Jivat Neet Kaur, Rachit Bansal, Milan Aggarwal, Balaji Krishnamurthy
  • Patent number: 11983946
    Abstract: In implementations of refining element associations for form structure extraction, a computing device implements a structure system to receive estimate data describing estimated associations of elements included in a form and a digital image depicting the form. An image patch is extracted from the digital image, and the image patch depicts a pair of elements of the elements included in the form. The structure system encodes an indication of whether the pair of elements have an association of the estimated associations. An indication is generated that the pair of elements have a particular association based at least partially on the encoded indication, bounding boxes of the pair of elements, and text depicted in the image patch.
    Type: Grant
    Filed: November 2, 2021
    Date of Patent: May 14, 2024
    Assignee: Adobe Inc.
    Inventors: Shripad Deshmukh, Milan Aggarwal, Mausoom Sarkar, Hiresh Gupta
  • Publication number: 20240153258
    Abstract: Various embodiments classify one or more portions of an image based on deriving an “intrinsic” modality. Such intrinsic modality acts as a substitute to a “text” modality in a multi-modal network. A text modality in image processing is typically a natural language text that describes one or more portions of an image. However, explicit natural language text may not be available across one or more domains for training a multi-modal network. Accordingly, various embodiments described herein generate an intrinsic modality, which is also a description of one or more portions of an image, except that such description is not an explicit natural language description, but rather a machine learning model representation. Some embodiments additionally leverage a visual modality obtained from a vision-only model or branch, which may learn domain characteristics that are not present in the multi-modal network.
    Type: Application
    Filed: October 28, 2022
    Publication date: May 9, 2024
    Inventors: Puneet MANGLA, Milan AGGARWAL, Balaji KRISHNAMURTHY
  • Patent number: 11971884
    Abstract: An interactive search session is implemented using an artificial intelligence model. For example, when the artificial intelligence model receives a search query from a user, the model selects an action from a plurality of actions based on the search query. The selected action queries the user for more contextual cues about the search query (e.g., may enquire about use of the search results, may request to refine the search query, or otherwise engage the user in conversation to better understand the intent of the search). The interactive search session may be in the form, for example, of a chat session between the user and the system, and the chat session may be displayed along with the search results (e.g., in a separate section of display). The interactive search session may enable the system to better understand the user's search needs, and accordingly may help provide more focused search results.
    Type: Grant
    Filed: March 28, 2022
    Date of Patent: April 30, 2024
    Assignee: Adobe Inc.
    Inventors: Milan Aggarwal, Balaji Krishnamurthy
  • Publication number: 20240119122
    Abstract: Systems and methods for data augmentation are provided. One aspect of the systems and methods include receiving an image that is misclassified by a classification network; computing an augmentation image based on the image using an augmentation network; and generating an augmented image by combining the image and the augmentation image, wherein the augmented image is correctly classified by the classification network.
    Type: Application
    Filed: October 11, 2022
    Publication date: April 11, 2024
    Inventors: Shripad Vilasrao Deshmukh, Surgan Jandial, Abhinav Java, Milan Aggarwal, Mausoom Sarkar, Arneh Jain, Balaji Krishnamurthy
  • Patent number: 11948358
    Abstract: Systems and methods for video processing are described. Embodiments of the present disclosure generate a plurality of image feature vectors corresponding to a plurality of frames of a video; generate a plurality of low-level event representation vectors based on the plurality of image feature vectors, wherein a number of the low-level event representation vectors is less than a number of the image feature vectors; generate a plurality of high-level event representation vectors based on the plurality of low-level event representation vectors, wherein a number of the high-level event representation vectors is less than the number of the low-level event representation vectors; and identify a plurality of high-level events occurring in the video based on the plurality of high-level event representation vectors.
    Type: Grant
    Filed: November 16, 2021
    Date of Patent: April 2, 2024
    Assignee: ADOBE INC.
    Inventors: Sumegh Roychowdhury, Sumedh A. Sontakke, Mausoom Sarkar, Nikaash Puri, Pinkesh Badjatiya, Milan Aggarwal
  • Publication number: 20240086457
    Abstract: A content analysis system provides content understanding for a content item using an attention aware multi-modal model. Given a content item, feature extractors extract features from content components of the content item in which the content components comprise multiple modalities. A cross-modal attention encoder of the attention aware multi-modal model generates an embedding of the content item using features extracted from the content components. A decoder of the attention aware multi-modal model generates an action-reason statement using the embedding of the content item from the cross-modal attention encoder.
    Type: Application
    Filed: September 14, 2022
    Publication date: March 14, 2024
    Inventors: Yaman KUMAR, Vaibhav AHLAWAT, Ruiyi ZHANG, Milan AGGARWAL, Ganesh Karbhari PALWE, Balaji KRISHNAMURTHY, Varun KHURANA
  • Publication number: 20240073159
    Abstract: The technology described herein receives a natural-language sequence of words comprising multiple entities. The technology then identifies a plurality of entities in the natural-language sequence. The technology generates a masked natural-language sequence by masking a first entity in the natural-language sequence. The technology retrieves, from a knowledge base, information related to a second entity in the plurality of entities. The technology then trains a natural-language model to respond to a query. The training uses a first representation of the masked natural-language sequence, a second representation of the information, and the first entity.
    Type: Application
    Filed: August 29, 2022
    Publication date: February 29, 2024
    Inventors: Sumit BHATIA, Jivat Neet KAUR, Rachit BANSAL, Milan AGGARWAL, Balaji KRISHNAMURTHY
  • Publication number: 20230267345
    Abstract: Techniques described herein extract form structures from a static form to facilitate making that static form reflowable. A method described herein includes accessing low-level form elements extracted from a static form. The method includes determining, using a first set of prediction models, second-level form elements based on the low-level form elements. Each second-level form element includes a respective one or more low-level form elements. The method further includes determining, using a second set of prediction models, high-level form elements based on the second-level form elements and the low-level form elements. Each high-level form element includes a respective one or more second-level form elements or low-level form elements. The method further includes generating a reflowable form based on the static form by, for each high-level form element, linking together the respective one or more second-level form elements or low-level form elements.
    Type: Application
    Filed: April 18, 2023
    Publication date: August 24, 2023
    Inventors: Milan Aggarwal, Mausoom Sarkar, Balaji Krishnamurthy
  • Publication number: 20230169271
    Abstract: Systems and methods for topic modeling are described. The systems and methods include encoding words of a document using an embedding matrix to obtain word embeddings for the document. The words of the document comprise a subset of words in a vocabulary, and the embedding matrix is trained as part of a topic attention network based on a plurality of topics. The systems and methods further include encoding a topic-word distribution matrix using the embedding matrix to obtain a topic embedding matrix. The topic-word distribution matrix represents relationships between the plurality of topics and the words of the vocabulary. The systems and methods further include computing a topic context matrix based on the topic embedding matrix and the word embeddings and identifying a topic for the document based on the topic context matrix.
    Type: Application
    Filed: December 17, 2021
    Publication date: June 1, 2023
    Inventors: Shashank Shailabh, Madhur Panwar, Milan Aggarwal, Pinkesh Badjatiya, Simra Shahid, Nikaash Puri, S Sejal Naidu, Sharat Chandra Racha, Balaji Krishnamurthy, Ganesh Karbhari Palwe
  • Patent number: 11657306
    Abstract: Techniques described herein extract form structures from a static form to facilitate making that static form reflowable. A method described herein includes accessing low-level form elements extracted from a static form. The method includes determining, using a first set of prediction models, second-level form elements based on the low-level form elements. Each second-level form element includes a respective one or more low-level form elements. The method further includes determining, using a second set of prediction models, high-level form elements based on the second-level form elements and the low-level form elements. Each high-level form element includes a respective one or more second-level form elements or low-level form elements. The method further includes generating a reflowable form based on the static form by, for each high-level form element, linking together the respective one or more second-level form elements or low-level form elements.
    Type: Grant
    Filed: June 17, 2020
    Date of Patent: May 23, 2023
    Assignee: Adobe Inc.
    Inventors: Milan Aggarwal, Mausoom Sarkar, Balaji Krishnamurthy
  • Publication number: 20230153534
    Abstract: Methods and systems are provided for facilitating generation and utilization of a commonsense contextualizing machine learning (ML) model, in accordance with embodiments described herein. In embodiments, a commonsense contextual ML model is trained by fine-tuning a pre-trained language model using a set of training path-sentence pairs. Each training path-sentence pair includes a commonsense path, identified via a commonsense knowledge graph, and a natural language sentence identified as contextually related to the commonsense path. The trained commonsense contextualizing ML model can then be used to generate a commonsense inference path for a text input. Such a commonsense inference path can include a sequence of entities and relations that provide commonsense context to the text input. Thereafter, the commonsense inference path can be provided to a natural language processing system for use in performing a natural language processing task.
    Type: Application
    Filed: November 15, 2021
    Publication date: May 18, 2023
    Inventors: Rachit Bansal, Milan Aggarwal, Sumit Bhatia, Jivat Neet Kaur, Balaji Krishnamurthy