Patents by Inventor Vlad Ion Morariu

Vlad Ion Morariu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20250095631
    Abstract: Position-based text-to-speech model and training techniques are described. A digital document, for instance, is received by an audio synthesis service. A text-to-speech model is utilized by the audio synthesis service to generate digital audio from text included in the digital document. The text-to-speech model, for instance, is configured to generate a text encoding and a document positional encoding from an initial text sequence of the digital document. The document positional encoding is based on a location of the text encoding within the digital document. Digital audio is then generated by the text-to-speech model that includes a spectrogram having a reordered text sequence, which is different from the initial text sequence, by decoding the text encoding and the document positional encoding.
    Type: Application
    Filed: December 4, 2023
    Publication date: March 20, 2025
    Applicant: Adobe Inc.
    Inventors: Puneet Mathur, Franck Dernoncourt, Quan Hung Tran, Jiuxiang Gu, Ani Nenkova, Vlad Ion Morariu, Rajiv Bhawanji Jain, Dinesh Manocha
  • Publication number: 20250086860
    Abstract: Knowledge edit techniques for text-to-image models and other generative machine learning models are described. In an example, a location is identified within a text-to-image model by a model edit system. The location is configured to influence generation of a visual attribute by a text-to-image model as part of a digital image. An edited text-to-image model is formed by editing the text-to-image model based on the location. The edit causes a change to the visual attribute in generating a subsequent digital image by the edited text-to-image model. The subsequent digital image is generated as having the change to the visual attribute by the edited text-to-image model.
    Type: Application
    Filed: January 29, 2024
    Publication date: March 13, 2025
    Applicant: Adobe Inc.
    Inventors: Varun Manjunatha, Vlad Ion Morariu, Samyadeep Basu, Nanxuan Zhao
  • Patent number: 12147499
    Abstract: Certain embodiments involve using a machine-learning tool to generate metadata identifying segments and topics for text within a document. For instance, in some embodiments, a text processing system obtains input text and applies a segmentation-and-labeling model to the input text. The segmentation-and-labeling model is trained to generate a predicted segment for the input text using a segmentation network. The segmentation-and-labeling model is also trained to generate a topic for the predicted segment using a pooling network of the model to the predicted segment. The output of the model is usable for generating metadata identifying the predicted segment and the associated topic.
    Type: Grant
    Filed: September 5, 2023
    Date of Patent: November 19, 2024
    Assignee: Adobe Inc.
    Inventors: Rajiv Jain, Varun Manjunatha, Joseph Barrow, Vlad Ion Morariu, Franck Dernoncourt, Sasha Spala, Nicholas Miller
  • Patent number: 12038962
    Abstract: Systems and methods for natural language processing are described. One or more embodiments of the present disclosure identify a claim from a document, wherein the claim corresponds to a topic, create a graph comprising a plurality of nodes having a plurality of node types and a plurality of edges having a plurality of edge types, wherein one of the nodes represents the claim, and wherein each of the edges represents a relationship between a corresponding pair of the nodes, encode the claim based on the graph using a graph convolutional network (GCN) to obtain an encoded claim, classify the claim by decoding the encoded claim to obtain a stance label that indicates a stance of the claim towards the topic, and transmit information indicating a viewpoint of the document towards the topic based on the stance label.
    Type: Grant
    Filed: July 23, 2021
    Date of Patent: July 16, 2024
    Assignee: ADOBE INC.
    Inventors: Joseph Barrow, Rajiv Bhawanji Jain, Nedim Lipka, Vlad Ion Morariu, Franck Dernoncourt, Varun Manjunatha
  • Publication number: 20240232525
    Abstract: Systems and methods for document classification are described. Embodiments of the present disclosure generate classification data for a plurality of samples using a neural network trained to identify a plurality of known classes; select a set of samples for annotation from the plurality of samples using an open-set metric based on the classification data, wherein the annotation includes an unknown class; and train the neural network to identify the unknown class based on the annotation of the set of samples.
    Type: Application
    Filed: October 24, 2022
    Publication date: July 11, 2024
    Inventors: Rajiv Bhawanji Jain, Michelle Yuan, Vlad Ion Morariu, Ani Nenkova Nenkova, Smitha Bangalore Naresh, Nikolaos Barmpalios, Ruchi Deshpande, Ruiyi Zhang, Jiuxiang Gu, Varun Manjunatha, Nedim Lipka, Andrew Marc Greene
  • Patent number: 11995394
    Abstract: Systems and methods for document editing are provided. One aspect of the systems and methods includes obtaining a document and a natural language edit request. Another aspect of the systems and methods includes generating a structured edit command using a machine learning model based on the document and the natural language edit request. Yet another aspect of the systems and methods includes generating a modified document based on the document and the structured edit command, where the modified document includes a revision of the document that incorporates the natural language edit request.
    Type: Grant
    Filed: February 7, 2023
    Date of Patent: May 28, 2024
    Assignee: ADOBE INC.
    Inventors: Vlad Ion Morariu, Puneet Mathur, Rajiv Bhawanji Jain, Jiuxiang Gu, Franck Dernoncourt
  • Patent number: 11978272
    Abstract: Adapting a machine learning model to process data that differs from training data used to configure the model for a specified objective is described. A domain adaptation system trains the model to process new domain data that differs from a training data domain by using the model to generate a feature representation for the new domain data, which describes different content types included in the new domain data. The domain adaptation system then generates a probability distribution for each discrete region of the new domain data, which describes a likelihood of the region including different content described by the feature representation. The probability distribution is compared to ground truth information for the new domain data to determine a loss function, which is used to refine model parameters. After determining that model outputs achieve a threshold similarity to the ground truth information, the model is output as a domain-agnostic model.
    Type: Grant
    Filed: August 9, 2022
    Date of Patent: May 7, 2024
    Assignee: Adobe Inc.
    Inventors: Kai Li, Christopher Alan Tensmeyer, Curtis Michael Wigington, Handong Zhao, Nikolaos Barmpalios, Tong Sun, Varun Manjunatha, Vlad Ion Morariu
  • Publication number: 20240135096
    Abstract: Systems and methods for document classification are described. Embodiments of the present disclosure generate classification data for a plurality of samples using a neural network trained to identify a plurality of known classes; select a set of samples for annotation from the plurality of samples using an open-set metric based on the classification data, wherein the annotation includes an unknown class; and train the neural network to identify the unknown class based on the annotation of the set of samples.
    Type: Application
    Filed: October 23, 2022
    Publication date: April 25, 2024
    Inventors: Rajiv Bhawanji Jain, Michelle Yuan, Vlad Ion Morariu, Ani Nenkova Nenkova, Smitha Bangalore Naresh, Nikolaos Barmpalios, Ruchi Deshpande, Ruiyi Zhang, Jiuxiang Gu, Varun Manjunatha, Nedim Lipka, Andrew Marc Greene
  • Patent number: 11922110
    Abstract: Systems and techniques for generating responsive documents are described. Digital content is organized into a structure that defines how content is presented when a document is displayed by a computing device. To generate the responsive document, relationships are defined among different digital content objects, such as groups of content objects to be presented together and content objects that are to be presented as alternatives of one another. Responsive patterns are assigned to grouped content objects, where each responsive pattern defines different layout configurations for displaying grouped content objects based on computing device display characteristics. In some implementations, multiple responsive patterns are assigned to a single content group and individual responsive patterns are associated with activation ranges for display characteristics that activate the responsive pattern.
    Type: Grant
    Filed: November 24, 2021
    Date of Patent: March 5, 2024
    Assignees: Adobe Inc., University of Maryland, College Park
    Inventors: Vlad Ion Morariu, Yuexi Chen, Christopher Alan Tensmeyer, Zhicheng Liu, Lars Niklas Emanuel Elmqvist
  • Patent number: 11880648
    Abstract: Embodiments provide systems, methods, and computer storage media for extracting semantic labels for field widgets of form fields in unfilled forms. In some embodiments, a processing device accesses a representation of a fillable widget of a form field of an unfilled form. The processing device generates an encoded input representing text and layout of a sequence of tokens in a neighborhood of the fillable widget. The processing device uses a machine learning model to extract a semantic label representing a field type of the fillable widget in view of the encoded input. The processing device causes execution of an action using the semantic label.
    Type: Grant
    Filed: November 22, 2021
    Date of Patent: January 23, 2024
    Assignee: Adobe Inc.
    Inventors: Aparna Garimella, Sumit Shekhar, Bhanu Prakash Reddy Guda, Vinay Aggarwal, Vlad Ion Morariu, Ashutosh Mehra
  • Publication number: 20230409672
    Abstract: Certain embodiments involve using a machine-learning tool to generate metadata identifying segments and topics for text within a document. For instance, in some embodiments, a text processing system obtains input text and applies a segmentation-and-labeling model to the input text. The segmentation-and-labeling model is trained to generate a predicted segment for the input text using a segmentation network. The segmentation-and-labeling model is also trained to generate a topic for the predicted segment using a pooling network of the model to the predicted segment. The output of the model is usable for generating metadata identifying the predicted segment and the associated topic.
    Type: Application
    Filed: September 5, 2023
    Publication date: December 21, 2023
    Inventors: Rajiv Jain, Varun Manjunatha, Joseph Barrow, Vlad Ion Morariu, Franck Dernoncourt, Sasha Spala, Nicholas Miller
  • Publication number: 20230376687
    Abstract: Embodiments are provided for facilitating multimodal extraction across multiple granularities. In one implementation, a set of features of a document for a plurality of granularities of the document is obtained. Via a machine learning model, the set of features of the document are modified to generate a set of modified features using a set of self-attention values to determine relationships within a first type of feature and a set of cross-attention values to determine relationships between the first type of feature and a second type of feature. Thereafter, the set of modified features are provided to a second machine learning model to perform a classification task.
    Type: Application
    Filed: May 17, 2022
    Publication date: November 23, 2023
    Inventors: Vlad Ion Morariu, Tong Sun, Nikolaos Barmpalios, Zilong Wang, Jiuxiang Gu, Ani Nenkova Nenkova, Christopher Tensmeyer
  • Publication number: 20230368003
    Abstract: The technology described herein is directed to an adaptive sparse attention pattern that is learned during fine-tuning and deployed in a machine-learning model. In aspects, a row or a column in an attention matrix with an importance score for a task that is above a threshold importance score is identified. The important row or the column is included in an adaptive attention pattern used with a machine-learning model having a self-attention operation. In response to an input, a task-specific inference is generated for the input using the machine-learning model with the adaptive attention pattern.
    Type: Application
    Filed: May 10, 2022
    Publication date: November 16, 2023
    Inventors: Jiuxiang Gu, Zihan Wang, Jason Wen Yong Kuen, Handong Zhao, Vlad Ion Morariu, Ruiyi Zhang, Ani Nenkova Nenkova, Tong Sun
  • Patent number: 11783008
    Abstract: Certain embodiments involve using a machine-learning tool to generate metadata identifying segments and topics for text within a document. For instance, in some embodiments, a text processing system obtains input text and applies a segmentation-and-labeling model to the input text. The segmentation-and-labeling model is trained to generate a predicted segment for the input text using a segmentation network. The segmentation-and-labeling model is also trained to generate a topic for the predicted segment using a pooling network of the model to the predicted segment. The output of the model is usable for generating metadata identifying the predicted segment and the associated topic.
    Type: Grant
    Filed: November 6, 2020
    Date of Patent: October 10, 2023
    Assignee: Adobe Inc.
    Inventors: Rajiv Jain, Varun Manjunatha, Joseph Barrow, Vlad Ion Morariu, Franck Dernoncourt, Sasha Spala, Nicholas Miller
  • Publication number: 20230230406
    Abstract: Methods and systems are provided for facilitating identification of fillable regions and/or data associated therewith. In embodiments, a candidate fillable region indicating a region in a form that is a candidate for being fillable is obtained. Textual context indicating text from the form and spatial context indicating positions of the text within the form are also obtained. Fillable region data associated with the candidate fillable region is generated, via a machine learning model, using the candidate fillable region, the textual context, and the spatial context. Thereafter, a fillable form is generated using the fillable region data, the fillable form having one or more fillable regions for accepting input.
    Type: Application
    Filed: January 18, 2022
    Publication date: July 20, 2023
    Inventors: Ashutosh Mehra, Christopher Alan Tensmeyer, Vlad Ion Morariu, Jiuxiang Gu
  • Publication number: 20230154221
    Abstract: The technology described includes methods for pretraining a document encoder model based on multimodal self cross-attention. One method includes receiving image data that encodes a set of pretraining documents. A set of sentences is extracted from the image data. A bounding box for each sentence is generated. For each sentence, a set of predicted features is generated by using an encoder machine-learning model. The encoder model performs cross-attention between a set of masked-textual features for the sentence and a set of masked-visual features for the sentence. The set of masked-textual features is based on a masking function and the sentence. The set of masked-visual features is based on the masking function and the corresponding bounding box. A document-encoder model is pretrained based on the set of predicted features for each sentence and pretraining tasks. The pretraining tasks includes masked sentence modeling, visual contrastive learning, or visual-language alignment.
    Type: Application
    Filed: November 16, 2021
    Publication date: May 18, 2023
    Inventors: Jiuxiang Gu, Ani Nenkova Nenkova, Nikolaos Barmpalios, Vlad Ion Morariu, Tong Sun, Rajiv Bhawanji Jain, Jason wen yong Kuen, Handong Zhao
  • Publication number: 20230085687
    Abstract: Various disclosed embodiments can resolve output inaccuracies produced by many machine learning models. Embodiments use content order as input to machine learning model systems so that they can process documents according to the position or rank of instances in a document or image. In this way, the model is less likely to misclassify or incorrectly detect instances or the ordering between predicted instances. The content order in various embodiments can be used as an additional signal to classify or make predictions.
    Type: Application
    Filed: November 21, 2022
    Publication date: March 23, 2023
    Inventors: Ashutosh MEHRA, Vlad Ion MORARIU, Kajal GUPTA, Jayant Vaibhav SRIVASTAVA, Curtis Michael WIGINGTON, Tushar TIWARI
  • Publication number: 20230033114
    Abstract: Systems and methods for natural language processing are described. One or more embodiments of the present disclosure identify a claim from a document, wherein the claim corresponds to a topic, create a graph comprising a plurality of nodes having a plurality of node types and a plurality of edges having a plurality of edge types, wherein one of the nodes represents the claim, and wherein each of the edges represents a relationship between a corresponding pair of the nodes, encode the claim based on the graph using a graph convolutional network (GCN) to obtain an encoded claim, classify the claim by decoding the encoded claim to obtain a stance label that indicates a stance of the claim towards the topic, and transmit information indicating a viewpoint of the document towards the topic based on the stance label.
    Type: Application
    Filed: July 23, 2021
    Publication date: February 2, 2023
    Inventors: Joseph Barrow, Rajiv Bhawanji Jain, Nedim Lipka, Vlad Ion Morariu, Franck Dernoncourt, Varun Manjunatha
  • Patent number: 11544503
    Abstract: A domain alignment technique for cross-domain object detection tasks is introduced. During a preliminary pretraining phase, an object detection model is pretrained to detect objects in images associated with a source domain using a source dataset of images associated with the source domain. After completing the pretraining phase, a domain adaptation phase is performed using the source dataset and a target dataset to adapt the pretrained object detection model to detect objects in images associated with the target domain. The domain adaptation phase may involve the use of various domain alignment modules that, for example, perform multi-scale pixel/path alignment based on input feature maps or perform instance-level alignment based on input region proposals.
    Type: Grant
    Filed: May 27, 2020
    Date of Patent: January 3, 2023
    Assignee: Adobe Inc.
    Inventors: Christopher Tensmeyer, Vlad Ion Morariu, Varun Manjunatha, Tong Sun, Nikolaos Barmpalios, Kai Li, Handong Zhao, Curtis Wigington
  • Publication number: 20220391768
    Abstract: Adapting a machine learning model to process data that differs from training data used to configure the model for a specified objective is described. A domain adaptation system trains the model to process new domain data that differs from a training data domain by using the model to generate a feature representation for the new domain data, which describes different content types included in the new domain data. The domain adaptation system then generates a probability distribution for each discrete region of the new domain data, which describes a likelihood of the region including different content described by the feature representation. The probability distribution is compared to ground truth information for the new domain data to determine a loss function, which is used to refine model parameters. After determining that model outputs achieve a threshold similarity to the ground truth information, the model is output as a domain-agnostic model.
    Type: Application
    Filed: August 9, 2022
    Publication date: December 8, 2022
    Applicant: Adobe Inc.
    Inventors: Kai Li, Christopher Alan Tensmeyer, Curtis Michael Wigington, Handong Zhao, Nikolaos Barmpalios, Tong Sun, Varun Manjunatha, Vlad Ion Morariu