Patents by Inventor MILAN AGGARWAL
MILAN AGGARWAL has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12190061Abstract: Systems and methods for topic modeling are described. The systems and methods include encoding words of a document using an embedding matrix to obtain word embeddings for the document. The words of the document comprise a subset of words in a vocabulary, and the embedding matrix is trained as part of a topic attention network based on a plurality of topics. The systems and methods further include encoding a topic-word distribution matrix using the embedding matrix to obtain a topic embedding matrix. The topic-word distribution matrix represents relationships between the plurality of topics and the words of the vocabulary. The systems and methods further include computing a topic context matrix based on the topic embedding matrix and the word embeddings and identifying a topic for the document based on the topic context matrix.Type: GrantFiled: December 17, 2021Date of Patent: January 7, 2025Assignee: ADOBE INC.Inventors: Shashank Shailabh, Madhur Panwar, Milan Aggarwal, Pinkesh Badjatiya, Simra Shahid, Nikaash Puri, S Sejal Naidu, Sharat Chandra Racha, Balaji Krishnamurthy, Ganesh Karbhari Palwe
-
Publication number: 20250005048Abstract: Embodiments are disclosed for one-shot document snippet search. A method of one-shot document snippet search may include obtaining a query snippet and a target document. A multi-modal snippet detection model combines first multi-modal features from the query snippet and second multi-modal features from the target document to create a feature volume. The multi-modal snippet detection model identifies one or more matching snippets from the target document based on the feature volume.Type: ApplicationFiled: June 30, 2023Publication date: January 2, 2025Applicant: Adobe Inc.Inventors: Abhinav JAVA, Surgan JANDIAL, Shripad DESHMUKH, Milan AGGARWAL, Mausoom SARKAR, Balaji KRISHNAMURTHY, Arneh JAIN
-
Publication number: 20240386315Abstract: Methods and systems are provided for a transformer model for journey simulation and prediction. In embodiments described herein, training data is obtained from stored journeys. The training data for each journey indicates customer interactions with each event in the sequence of events of the journey. A machine learning model is trained using the training data to simulate customer interaction with an input journey. The trained machine learning model then generates a simulation of customer interaction with an input journey and the results of the simulation are displayed.Type: ApplicationFiled: May 16, 2023Publication date: November 21, 2024Inventors: Thomas BOUCHER, Tanay ANAND, Stephane LECERCLE, Saurabh GARG, Pranjal PRASOON, Nikaash PURI, Mukul LAMBA, Milan AGGARWAL, Jayakumar SUBRAMANIAN, Francoise CORVAISIER, David MENDEZ ACUNA, Camel AISSANI, Balaji KRISHNAMURTHY
-
Publication number: 20240362941Abstract: A corrective noise system receives an electronic version of a fillable form generated by a segmentation network and receives a correction to a segmentation error in the electronic version of the fillable form. The corrective noise system is trained to generate noise that represents the correction and superimpose the noise on the fillable form. The corrective noise system is further trained to identify regions in a corpus of forms that are semantically similar to a region that was subject to the correction. The generated noise is propagated to the semantically similar regions in the corpus of forms and the noisy corpus of forms is provided as input to the segmentation network. The noise causes the segmentation network to accurately identify fillable regions in the corpus of forms and output a segmented version of the corpus of forms having improved fidelity without retraining or otherwise modifying the segmentation network.Type: ApplicationFiled: April 27, 2023Publication date: October 31, 2024Applicant: Adobe Inc.Inventors: Silky Singh, Surgan Jandial, Shripad Vilasrao Deshmukh, Milan Aggarwal, Mausoom Sarkar, Balaji Krishnamurthy, Arneh Jain, Abhinav Java
-
Publication number: 20240355020Abstract: In implementations of systems for digital content analysis, a computing device implements an analysis system to extract a first content component and a second content component from digital content to be analyzed based on content metrics. The analysis system generates first embeddings using a first machine learning model and second embedding using a second machine learning model. The first embeddings and the second embeddings are combined as concatenated embeddings. The analysis system generates an indication of a content metric for display in a user interface using a third machine learning model based on the concatenated embeddings.Type: ApplicationFiled: April 21, 2023Publication date: October 24, 2024Applicant: Adobe Inc.Inventors: Yaman Kumar, Somesh Singh, Seoyoung Park, Pranjal Prasoon, Nithyakala Sainath, Nisarg Shailesh Joshi, Nikitha Srikanth, Nikaash Puri, Milan Aggarwal, Jayakumar Subramanian, Ganesh Palwe, Balaji Krishnamurthy, Matthew William Rozen, Mihir Naware, Hyman Chung
-
Patent number: 12124497Abstract: Form structure similarity detection techniques are described. A content processing system, for instance, receives a query snippet that depicts a query form structure. The content processing system generates a query layout string that includes semantic indicators to represent the query form structure and generates candidate layout strings that represent form structures from a target document. The content processing system calculates similarity scores between the query layout string and the candidate layout strings. Based on the similarity scores, the content processing system generates a target snippet for display that depicts a form structure that is structurally similar to the query form structure. The content processing system is further operable to generate a training dataset that includes image pairs of snippets depicting form structures that are structurally similar.Type: GrantFiled: March 27, 2023Date of Patent: October 22, 2024Assignee: Adobe Inc.Inventors: Abhinav Java, Surgan Jandial, Shripad Vilasrao Deshmukh, Milan Aggarwal, Mausoom Sarkar, Balaji Krishnamurthy, Arneh Jain
-
Publication number: 20240330351Abstract: Form structure similarity detection techniques are described. A content processing system, for instance, receives a query snippet that depicts a query form structure. The content processing system generates a query layout string that includes semantic indicators to represent the query form structure and generates candidate layout strings that represent form structures from a target document. The content processing system calculates similarity scores between the query layout string and the candidate layout strings. Based on the similarity scores, the content processing system generates a target snippet for display that depicts a form structure that is structurally similar to the query form structure. The content processing system is further operable to generate a training dataset that includes image pairs of snippets depicting form structures that are structurally similar.Type: ApplicationFiled: March 27, 2023Publication date: October 3, 2024Applicant: Adobe Inc.Inventors: Abhinav Java, Surgan Jandial, Shripad Vilasrao Deshmukh, Milan Aggarwal, Mausoom Sarkar, Balaji Krishnamurthy, Arneh Jain
-
Patent number: 12086728Abstract: Techniques described herein extract form structures from a static form to facilitate making that static form reflowable. A method described herein includes accessing low-level form elements extracted from a static form. The method includes determining, using a first set of prediction models, second-level form elements based on the low-level form elements. Each second-level form element includes a respective one or more low-level form elements. The method further includes determining, using a second set of prediction models, high-level form elements based on the second-level form elements and the low-level form elements. Each high-level form element includes a respective one or more second-level form elements or low-level form elements. The method further includes generating a reflowable form based on the static form by, for each high-level form element, linking together the respective one or more second-level form elements or low-level form elements.Type: GrantFiled: April 18, 2023Date of Patent: September 10, 2024Assignee: Adobe Inc.Inventors: Milan Aggarwal, Mausoom Sarkar, Balaji Krishnamurthy
-
Patent number: 11997056Abstract: The technology described herein receives a natural-language sequence of words comprising multiple entities. The technology then identifies a plurality of entities in the natural-language sequence. The technology generates a masked natural-language sequence by masking a first entity in the natural-language sequence. The technology retrieves, from a knowledge base, information related to a second entity in the plurality of entities. The technology then trains a natural-language model to respond to a query. The training uses a first representation of the masked natural-language sequence, a second representation of the information, and the first entity.Type: GrantFiled: August 29, 2022Date of Patent: May 28, 2024Assignee: ADOBE INC.Inventors: Sumit Bhatia, Jivat Neet Kaur, Rachit Bansal, Milan Aggarwal, Balaji Krishnamurthy
-
Patent number: 11983946Abstract: In implementations of refining element associations for form structure extraction, a computing device implements a structure system to receive estimate data describing estimated associations of elements included in a form and a digital image depicting the form. An image patch is extracted from the digital image, and the image patch depicts a pair of elements of the elements included in the form. The structure system encodes an indication of whether the pair of elements have an association of the estimated associations. An indication is generated that the pair of elements have a particular association based at least partially on the encoded indication, bounding boxes of the pair of elements, and text depicted in the image patch.Type: GrantFiled: November 2, 2021Date of Patent: May 14, 2024Assignee: Adobe Inc.Inventors: Shripad Deshmukh, Milan Aggarwal, Mausoom Sarkar, Hiresh Gupta
-
Publication number: 20240153258Abstract: Various embodiments classify one or more portions of an image based on deriving an “intrinsic” modality. Such intrinsic modality acts as a substitute to a “text” modality in a multi-modal network. A text modality in image processing is typically a natural language text that describes one or more portions of an image. However, explicit natural language text may not be available across one or more domains for training a multi-modal network. Accordingly, various embodiments described herein generate an intrinsic modality, which is also a description of one or more portions of an image, except that such description is not an explicit natural language description, but rather a machine learning model representation. Some embodiments additionally leverage a visual modality obtained from a vision-only model or branch, which may learn domain characteristics that are not present in the multi-modal network.Type: ApplicationFiled: October 28, 2022Publication date: May 9, 2024Inventors: Puneet MANGLA, Milan AGGARWAL, Balaji KRISHNAMURTHY
-
Patent number: 11971884Abstract: An interactive search session is implemented using an artificial intelligence model. For example, when the artificial intelligence model receives a search query from a user, the model selects an action from a plurality of actions based on the search query. The selected action queries the user for more contextual cues about the search query (e.g., may enquire about use of the search results, may request to refine the search query, or otherwise engage the user in conversation to better understand the intent of the search). The interactive search session may be in the form, for example, of a chat session between the user and the system, and the chat session may be displayed along with the search results (e.g., in a separate section of display). The interactive search session may enable the system to better understand the user's search needs, and accordingly may help provide more focused search results.Type: GrantFiled: March 28, 2022Date of Patent: April 30, 2024Assignee: Adobe Inc.Inventors: Milan Aggarwal, Balaji Krishnamurthy
-
Publication number: 20240119122Abstract: Systems and methods for data augmentation are provided. One aspect of the systems and methods include receiving an image that is misclassified by a classification network; computing an augmentation image based on the image using an augmentation network; and generating an augmented image by combining the image and the augmentation image, wherein the augmented image is correctly classified by the classification network.Type: ApplicationFiled: October 11, 2022Publication date: April 11, 2024Inventors: Shripad Vilasrao Deshmukh, Surgan Jandial, Abhinav Java, Milan Aggarwal, Mausoom Sarkar, Arneh Jain, Balaji Krishnamurthy
-
Patent number: 11948358Abstract: Systems and methods for video processing are described. Embodiments of the present disclosure generate a plurality of image feature vectors corresponding to a plurality of frames of a video; generate a plurality of low-level event representation vectors based on the plurality of image feature vectors, wherein a number of the low-level event representation vectors is less than a number of the image feature vectors; generate a plurality of high-level event representation vectors based on the plurality of low-level event representation vectors, wherein a number of the high-level event representation vectors is less than the number of the low-level event representation vectors; and identify a plurality of high-level events occurring in the video based on the plurality of high-level event representation vectors.Type: GrantFiled: November 16, 2021Date of Patent: April 2, 2024Assignee: ADOBE INC.Inventors: Sumegh Roychowdhury, Sumedh A. Sontakke, Mausoom Sarkar, Nikaash Puri, Pinkesh Badjatiya, Milan Aggarwal
-
Publication number: 20240086457Abstract: A content analysis system provides content understanding for a content item using an attention aware multi-modal model. Given a content item, feature extractors extract features from content components of the content item in which the content components comprise multiple modalities. A cross-modal attention encoder of the attention aware multi-modal model generates an embedding of the content item using features extracted from the content components. A decoder of the attention aware multi-modal model generates an action-reason statement using the embedding of the content item from the cross-modal attention encoder.Type: ApplicationFiled: September 14, 2022Publication date: March 14, 2024Inventors: Yaman KUMAR, Vaibhav AHLAWAT, Ruiyi ZHANG, Milan AGGARWAL, Ganesh Karbhari PALWE, Balaji KRISHNAMURTHY, Varun KHURANA
-
Publication number: 20240073159Abstract: The technology described herein receives a natural-language sequence of words comprising multiple entities. The technology then identifies a plurality of entities in the natural-language sequence. The technology generates a masked natural-language sequence by masking a first entity in the natural-language sequence. The technology retrieves, from a knowledge base, information related to a second entity in the plurality of entities. The technology then trains a natural-language model to respond to a query. The training uses a first representation of the masked natural-language sequence, a second representation of the information, and the first entity.Type: ApplicationFiled: August 29, 2022Publication date: February 29, 2024Inventors: Sumit BHATIA, Jivat Neet KAUR, Rachit BANSAL, Milan AGGARWAL, Balaji KRISHNAMURTHY
-
Publication number: 20230267345Abstract: Techniques described herein extract form structures from a static form to facilitate making that static form reflowable. A method described herein includes accessing low-level form elements extracted from a static form. The method includes determining, using a first set of prediction models, second-level form elements based on the low-level form elements. Each second-level form element includes a respective one or more low-level form elements. The method further includes determining, using a second set of prediction models, high-level form elements based on the second-level form elements and the low-level form elements. Each high-level form element includes a respective one or more second-level form elements or low-level form elements. The method further includes generating a reflowable form based on the static form by, for each high-level form element, linking together the respective one or more second-level form elements or low-level form elements.Type: ApplicationFiled: April 18, 2023Publication date: August 24, 2023Inventors: Milan Aggarwal, Mausoom Sarkar, Balaji Krishnamurthy
-
Publication number: 20230169271Abstract: Systems and methods for topic modeling are described. The systems and methods include encoding words of a document using an embedding matrix to obtain word embeddings for the document. The words of the document comprise a subset of words in a vocabulary, and the embedding matrix is trained as part of a topic attention network based on a plurality of topics. The systems and methods further include encoding a topic-word distribution matrix using the embedding matrix to obtain a topic embedding matrix. The topic-word distribution matrix represents relationships between the plurality of topics and the words of the vocabulary. The systems and methods further include computing a topic context matrix based on the topic embedding matrix and the word embeddings and identifying a topic for the document based on the topic context matrix.Type: ApplicationFiled: December 17, 2021Publication date: June 1, 2023Inventors: Shashank Shailabh, Madhur Panwar, Milan Aggarwal, Pinkesh Badjatiya, Simra Shahid, Nikaash Puri, S Sejal Naidu, Sharat Chandra Racha, Balaji Krishnamurthy, Ganesh Karbhari Palwe
-
Patent number: 11657306Abstract: Techniques described herein extract form structures from a static form to facilitate making that static form reflowable. A method described herein includes accessing low-level form elements extracted from a static form. The method includes determining, using a first set of prediction models, second-level form elements based on the low-level form elements. Each second-level form element includes a respective one or more low-level form elements. The method further includes determining, using a second set of prediction models, high-level form elements based on the second-level form elements and the low-level form elements. Each high-level form element includes a respective one or more second-level form elements or low-level form elements. The method further includes generating a reflowable form based on the static form by, for each high-level form element, linking together the respective one or more second-level form elements or low-level form elements.Type: GrantFiled: June 17, 2020Date of Patent: May 23, 2023Assignee: Adobe Inc.Inventors: Milan Aggarwal, Mausoom Sarkar, Balaji Krishnamurthy
-
Publication number: 20230153534Abstract: Methods and systems are provided for facilitating generation and utilization of a commonsense contextualizing machine learning (ML) model, in accordance with embodiments described herein. In embodiments, a commonsense contextual ML model is trained by fine-tuning a pre-trained language model using a set of training path-sentence pairs. Each training path-sentence pair includes a commonsense path, identified via a commonsense knowledge graph, and a natural language sentence identified as contextually related to the commonsense path. The trained commonsense contextualizing ML model can then be used to generate a commonsense inference path for a text input. Such a commonsense inference path can include a sequence of entities and relations that provide commonsense context to the text input. Thereafter, the commonsense inference path can be provided to a natural language processing system for use in performing a natural language processing task.Type: ApplicationFiled: November 15, 2021Publication date: May 18, 2023Inventors: Rachit Bansal, Milan Aggarwal, Sumit Bhatia, Jivat Neet Kaur, Balaji Krishnamurthy