Patents by Inventor FRANCK DERNONCOURT

FRANCK DERNONCOURT has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Generating modified digital images utilizing a dispersed multimodal selection model

Patent number: 11594077

Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for generating modified digital images based on verbal and/or gesture input by utilizing a natural language processing neural network and one or more computer vision neural networks. The disclosed systems can receive verbal input together with gesture input. The disclosed systems can further utilize a natural language processing neural network to generate a verbal command based on verbal input. The disclosed systems can select a particular computer vision neural network based on the verbal input and/or the gesture input. The disclosed systems can apply the selected computer vision neural network to identify pixels within a digital image that correspond to an object indicated by the verbal input and/or gesture input. Utilizing the identified pixels, the disclosed systems can generate a modified digital image by performing one or more editing actions indicated by the verbal input and/or gesture input.

Type: Grant

Filed: September 18, 2020

Date of Patent: February 28, 2023

Assignee: Adobe Inc.

Inventors: Trung Bui, Zhe Lin, Walter Chang, Nham Le, Franck Dernoncourt
PRESERVING USER-ENTITY DIFFERENTIAL PRIVACY IN NATURAL LANGUAGE MODELING

Publication number: 20230059367

Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that generate a natural language model that provides user-entity differential privacy. For example, in one or more embodiments, the disclosed systems sample sensitive data points from a natural language dataset. Using the sampled sensitive data points, the disclosed systems determine gradient values corresponding to the natural language model. Further, the disclosed systems generate noise for the natural language model. The disclosed systems generate parameters for the natural language model using the gradient values and the noise, facilitating simultaneous protection of the users and sensitive entities associated with the natural language dataset. In some implementations, the disclosed systems generate the natural language model through an iterative process (e.g., by iteratively modifying the parameters).

Type: Application

Filed: August 9, 2021

Publication date: February 23, 2023

Inventors: Thi Kim Phung Lai, Tong Sun, Rajiv Jain, Nikolaos Barmpalios, Jiuxiang Gu, Franck Dernoncourt
MACHINE LEARNING MODELS FOR DETECTING TOPIC DIVERGENT DIGITAL VIDEOS

Publication number: 20230046248

Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for accurately and flexibly generating topic divergence classifications for digital videos based on words from the digital videos and further based on a digital text corpus representing a target topic. Particularly, the disclosed systems utilize a topic-specific knowledge encoder neural network to generate a topic divergence classification for a digital video to indicate whether or not the digital video diverges from a target topic. In some embodiments, the disclosed systems determine topic divergence classifications contemporaneously in real time for livestream digital videos or for stored digital videos (e.g., digital video tutorials).

Type: Application

Filed: August 2, 2021

Publication date: February 16, 2023

Inventors: Amir Pouran Ben Veyseh, Franck Dernoncourt
SYNTOPICAL READING FOR COLLECTION UNDERSTANDING

Publication number: 20230033114

Abstract: Systems and methods for natural language processing are described. One or more embodiments of the present disclosure identify a claim from a document, wherein the claim corresponds to a topic, create a graph comprising a plurality of nodes having a plurality of node types and a plurality of edges having a plurality of edge types, wherein one of the nodes represents the claim, and wherein each of the edges represents a relationship between a corresponding pair of the nodes, encode the claim based on the graph using a graph convolutional network (GCN) to obtain an encoded claim, classify the claim by decoding the encoded claim to obtain a stance label that indicates a stance of the claim towards the topic, and transmit information indicating a viewpoint of the document towards the topic based on the stance label.

Type: Application

Filed: July 23, 2021

Publication date: February 2, 2023

Inventors: Joseph Barrow, Rajiv Bhawanji Jain, Nedim Lipka, Vlad Ion Morariu, Franck Dernoncourt, Varun Manjunatha
Model-based semantic text searching

Patent number: 11567981

Abstract: Techniques and systems are described for performing semantic text searches. A semantic text-searching solution uses a machine learning system (such as a deep learning system) to determine associations between the semantic meanings of words. These associations are not limited by the spelling, syntax, grammar, or even definition of words. Instead, the associations can be based on the context in which characters, words, and/or phrases are used in relation to one another. In response to detecting a request to locate text within an electronic document associated with a keyword, the semantic text-searching solution can return strings within the document that have matching and/or related semantic meanings or contexts, in addition to exact matches (e.g., string matches) within the document. The semantic text-searching solution can then output an indication of the matching strings.

Type: Grant

Filed: April 15, 2020

Date of Patent: January 31, 2023

Assignee: Adobe Inc.

Inventors: Trung Bui, Yu Gong, Tushar Dublish, Sasha Spala, Sachin Soni, Nicholas Miller, Joon Kim, Franck Dernoncourt, Carl Dockhorn, Ajinkya Kale
Performing global image editing using editing operations determined from natural language requests

Patent number: 11570318

Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that utilize a neural network having a long short-term memory encoder-decoder architecture to progressively modify a digital image in accordance with a natural language request. For example, in one or more embodiments, the disclosed systems utilize a language-to-operation decoding cell of a language-to-operation neural network to sequentially determine one or more image-modification operations to perform to modify a digital image in accordance with a natural language request. In some cases, the decoding cell determines an image-modification operation to perform partly based on the previously used image-modification operations. The disclosed systems further utilize the decoding cell to determine one or more operation parameters for each selected image-modification operation. The disclosed systems utilize the image-modification operation(s) and operation parameter(s) to modify the digital image (e.g.

Type: Grant

Filed: July 13, 2021

Date of Patent: January 31, 2023

Assignee: Adobe Inc.

Inventors: Ning Xu, Jing Shi, Franck Dernoncourt, Trung Bui
Utilizing logical-form dialogue generation for multi-turn construction of paired natural language queries and query-language representations

Patent number: 11561969

Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media for generating pairs of natural language queries and corresponding query-language representations. For example, the disclosed systems can generate a contextual representation of a prior-generated dialogue sequence to compare with logical-form rules. In some implementations, the logical-form rules comprise trigger conditions and corresponding logical-form actions for constructing a logical-form representation of a subsequent dialogue sequence. Based on the comparison to logical-form rules indicating satisfaction of one or more trigger conditions, the disclosed systems can perform logical-form actions to generate a logical-form representation of a subsequent dialogue sequence.

Type: Grant

Filed: March 30, 2020

Date of Patent: January 24, 2023

Assignee: Adobe Inc.

Inventors: Doo Soon Kim, Anthony M Colas, Franck Dernoncourt, Moumita Sinha, Trung Bui
TRANSFER LEARNING AND PREDICTION CONSISTENCY FOR DETECTING OFFENSIVE SPANS OF TEXT

Publication number: 20230016729

Abstract: Systems and methods for natural language processing are described. One or more embodiments of the present disclosure receive a span of text comprising an offensive span and a non-offensive span, generate a contextualized word embedding for each of a plurality of words of the span of text, generate a refined vector representation for each of the plurality of words based on the corresponding contextualized word embedding using a refinement network trained for offensive text recognition, generate label information for each of the plurality of words based on the corresponding refined vector representation, wherein the label information indicates whether each of the plurality of words includes offensive text, and transmit an indication of a location of the offensive span based on the label information.

Type: Application

Filed: July 2, 2021

Publication date: January 19, 2023

Inventors: AMIR POURAN BEN VEYSEH, Franck Dernoncourt
Generating hyper-parameters for machine learning models using modified Bayesian optimization based on accuracy and training efficiency

Patent number: 11556826

Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for selecting hyper-parameter sets by utilizing a modified Bayesian optimization approach based on a combination of accuracy and training efficiency metrics of a machine learning model. For example, the disclosed systems can fit accuracy regression and efficiency regression models to observed metrics associated with hyper-parameter sets of a machine learning model. The disclosed systems can also implement a trade-off acquisition function that implements an accuracy-training efficiency balance metric to explore the hyper-parameter feature space and select hyper-parameters for training the machine learning model considering a balance between accuracy and training efficiency.

Type: Grant

Filed: March 20, 2020

Date of Patent: January 17, 2023

Assignee: Adobe Inc.

Inventors: Trung Bui, Lidan Wang, Franck Dernoncourt
Machine learning based abbreviation expansion

Patent number: 11544457

Abstract: Techniques are described herein for determining a long-form of an abbreviation using a machine learning based approach that takes into consideration both sequential context and structural context, where the long-form corresponds to a meaning of the abbreviation as used in a sequence of words that form a sentence. In some embodiments, word representations are generated for different words in the sequence of words, and a combined representation is generated for the abbreviation based on a word representation corresponding to the abbreviation, a sequential context representation, and a structural context representation. The sequential context representation can be generated based on word representations for words positioned near the abbreviation. The structural context representation can be generated based on word representations for words that are syntactically related to the abbreviation.

Type: Grant

Filed: March 25, 2020

Date of Patent: January 3, 2023

Assignee: Adobe Inc.

Inventors: Amir Pouran Ben Veyseh, Franck Dernoncourt
Interpretable label-attentive encoder-decoder parser

Patent number: 11544456

Abstract: Systems and methods for parsing natural language sentences using an artificial neural network (ANN) are described. Embodiments of the described systems and methods may generate a plurality of word representation matrices for an input sentence, wherein each of the word representation matrices is based on an input matrix of word vectors, a query vector, a matrix of key vectors, and a matrix of value vectors, and wherein a number of the word representation matrices is based on a number of syntactic categories, compress each of the plurality of word representation matrices to produce a plurality of compressed word representation matrices, concatenate the plurality of compressed word representation matrices to produce an output matrix of word vectors, and identify at least one word from the input sentence corresponding to a syntactic category based on the output matrix of word vectors.

Type: Grant

Filed: March 5, 2020

Date of Patent: January 3, 2023

Assignee: ADOBE INC.

Inventors: Khalil Mrini, Walter Chang, Trung Bui, Quan Tran, Franck Dernoncourt
TOPICAL VECTOR-QUANTIZED VARIATIONAL AUTOENCODERS FOR EXTRACTIVE SUMMARIZATION OF VIDEO TRANSCRIPTS

Publication number: 20220414338

Abstract: System and methods for a text summarization system are described. In one example, a text summarization system receives an input utterance and determines whether the utterance should be included in a summary of the text. The text summarization system includes an embedding network, a convolution network, an encoding component, and a summary component. The embedding network generates a semantic embedding of an utterance. The convolution network generates a plurality of feature vectors based on the semantic embedding. The encoding component identifies a plurality of latent codes respectively corresponding to the plurality of feature vectors. The summary component identifies a prominent code among the latent codes and to select the utterance as a summary utterance based on the prominent code.

Type: Application

Filed: June 29, 2021

Publication date: December 29, 2022

Inventors: SANGWOO CHO, Franck Dernoncourt, Timothy Jeewun Ganter, Trung Huu Bui, Nedim Lipka, Varun Manjunatha, Walter Chang, Hailin Jin, Jonathan Brandt
Customizable speech recognition system

Patent number: 11538463

Abstract: Methods and systems are provided for generating a customized speech recognition neural network system comprised of an adapted automatic speech recognition neural network and an adapted language model neural network. The automatic speech recognition neural network is first trained in a generic domain and then adapted to a target domain. The language model neural network is first trained in a generic domain and then adapted to a target domain. Such a customized speech recognition neural network system can be used to understand input vocal commands.

Type: Grant

Filed: April 12, 2019

Date of Patent: December 27, 2022

Assignee: Adobe Inc.

Inventors: Trung Huu Bui, Subhadeep Dey, Franck Dernoncourt
PERFORMING GLOBAL IMAGE EDITING USING EDITING OPERATIONS DETERMINED FROM NATURAL LANGUAGE REQUESTS

Publication number: 20220399017

Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that utilize a neural network having a long short-term memory encoder-decoder architecture to progressively modify a digital image in accordance with a natural language request. For example, in one or more embodiments, the disclosed systems utilize a language-to-operation decoding cell of a language-to-operation neural network to sequentially determine one or more image-modification operations to perform to modify a digital image in accordance with a natural language request. In some cases, the decoding cell determines an image-modification operation to perform partly based on the previously used image-modification operations. The disclosed systems further utilize the decoding cell to determine one or more operation parameters for each selected image-modification operation. The disclosed systems utilize the image-modification operation(s) and operation parameter(s) to modify the digital image (e.g.

Type: Application

Filed: July 13, 2021

Publication date: December 15, 2022

Inventors: Ning Xu, Jing Shi, Franck Dernoncourt, Trung Bui
DECOMPOSITIONAL LEARNING FOR COLOR ATTRIBUTE PREDICTION

Publication number: 20220383031

Abstract: The present disclosure describes a model for large scale color prediction of objects identified in images. Embodiments of the present disclosure include an object detection network, an attention network, and a color classification network. The object detection network generates object features for an object in an image and may include a convolutional neural network (CNN), region proposal network, or a ResNet. The attention network generates an attention vector for the object based on the object features, wherein the attention network takes a query vector based on the object features, and a plurality of key vector and a plurality of value vectors corresponding to a plurality of colors as input. The color classification network generates a color attribute vector based on the attention vector, wherein the color attribute vector indicates a probability of the object including each of the plurality of colors.

Type: Application

Filed: May 28, 2021

Publication date: December 1, 2022

Inventors: Qiuyu Chen, Quan Hung Tran, Kushal Kafle, Trung Huu Bui, Franck Dernoncourt, Walter Chang
SEMANTIC REASONING FOR TABULAR QUESTION ANSWERING

Publication number: 20220374426

Abstract: Systems and methods for natural language processing are described. One or more embodiments of the present disclosure receive a query related to information in a table, compute an operation selector by combining the query with an operation embedding representing a plurality of table operations, compute a column selector by combining the query with a weighted operation embedding, compute a row selector based on the operation selector and the column selector, compute a probability value for a cell in the table based on the row selector and the column selector, where the probability value represents a probability that the cell provides an answer to the query, and transmit contents of the cell based on the probability value.

Type: Application

Filed: May 11, 2021

Publication date: November 24, 2022

Inventors: Dung Thai, Doo Soon Kim, Franck Dernoncourt, Trung Bui
DEPENDENCY PATH REASONING FOR MEASUREMENT EXTRACTION

Publication number: 20220358291

Abstract: The present disclosure provides systems and methods for relationship extraction. Embodiments of the present disclosure provide a relationship extraction network trained to identify relationships among entities in an input text. The relationship extraction network is used to generate a dependency path between entities in an input phrase. The dependency path includes a set of words that connect the entities, and is used to predict a relationship between the entities. In some cases, the dependency path is related to a syntax tree, but it may include additional words, and omit some words from a path extracted based on a syntax tree.

Type: Application

Filed: April 22, 2021

Publication date: November 10, 2022

Inventors: Amir Pouran Ben Veyseh, Franck Dernoncourt
CONTEXT-AWARE FONT RECOMMENDATION FROM TEXT

Publication number: 20220358280

Abstract: Embodiments are disclosed for recommending fonts based on text inputs are described. In some embodiments, a method of recommending fonts includes receiving a selection of text, providing a representation of the selection of text to a font recommendation model, generating, by the font recommendation model, a prediction score for each of a plurality of fonts based on the representation of the selection of text, and returning at least one recommended font based on the prediction score for each of the plurality of fonts.

Type: Application

Filed: September 29, 2021

Publication date: November 10, 2022

Inventors: Amirreza SHIRANI, Franck DERNONCOURT, Jose Ignacio ECHEVARRIA VALLESPI, Paul ASENTE, Nedim LIPKA, Thamar I. SOLORIO MARTINEZ
Slot filling with contextual information

Patent number: 11494647

Abstract: A system, method and non-transitory computer readable medium for editing images with verbal commands are described. Embodiments of the system, method and non-transitory computer readable medium may include an artificial neural network (ANN) comprising a word embedding component configured to convert text input into a set of word vectors, a feature encoder configured to create a combined feature vector for the text input based on the word vectors, a scoring layer configured to compute labeling scores based on the combined feature vectors, wherein the feature encoder, the scoring layer, or both are trained using multi-task learning with a loss function including a first loss value and an additional loss value based on mutual information, context-based prediction, or sentence-based prediction, and a command component configured to identify a set of image editing word labels based on the labeling scores.

Type: Grant

Filed: December 6, 2019

Date of Patent: November 8, 2022

Assignee: ADOBE INC.

Inventors: Amir Pouran Ben Veyseh, Franck Dernoncourt
INDUCING RICH INTERACTION STRUCTURES BETWEEN WORDS FOR DOCUMENT-LEVEL EVENT ARGUMENT EXTRACTION

Publication number: 20220318505

Abstract: Systems and methods for natural language processing are described. One or more embodiments of the present disclosure receive a document comprising a plurality of words organized into a plurality of sentences, the words comprising an event trigger word and an argument candidate word, generate word representation vectors for the words, generate a plurality of document structures including a semantic structure for the document based on the word representation vectors, a syntax structure representing dependency relationships between the words, and a discourse structure representing discourse information of the document based on the plurality of sentences, generate a relationship representation vector based on the document structures, and predict a relationship between the event trigger word and the argument candidate word based on the relationship representation vector.

Type: Application

Filed: April 6, 2021

Publication date: October 6, 2022

Inventors: Amir Pouran Ben Veyseh, Franck Dernoncourt, Quan Tran, Varun Manjunatha, Lidan Wang, Rajiv Jain, Doo Soon Kim, Walter Chang

prev 1 2 3 4 5 6 next