Patents by Inventor John Collomosse
John Collomosse has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12647289Abstract: Systems and methods for managing rights for creative work are provided. One aspect of the systems and methods includes receiving, at a rights contract on a distributed virtual machine operated based on a public blockchain, input data including an ownership token identifier for an ownership token, where the ownership token indicates ownership of a creative work. Another aspect of the systems and methods includes obtaining, at the rights contract, an indication of usage rights for the creative work corresponding to the ownership token. Yet another aspect of the systems and methods includes minting, via the rights contract, a rights token corresponding to the ownership token, where the rights token includes a reference to the indication of the usage rights for the creative work.Type: GrantFiled: February 22, 2023Date of Patent: June 2, 2026Assignee: ADOBE INC.Inventors: John Collomosse, Andrew S. Parsons
-
Publication number: 20260148430Abstract: A method, apparatus, non-transitory computer readable medium, and system for generating synthetic image includes obtaining a prompt indicating an image element. In some cases, a base generation model generates a first score function based on the prompt and an auxiliary image generation model generates a second score function based on the prompt. Additionally, the first score function and the second score function are combined to obtain a combined score function. In some cases, the combined score function includes positive guidance from the first score function and negative guidance from the second score function. A synthetic image that depicts the image element is generated based on the combined score function.Type: ApplicationFiled: November 25, 2024Publication date: May 28, 2026Inventors: Sina Alemohammad, Shruti Agarwal, John Collomosse
-
Publication number: 20260134215Abstract: The present disclosure relates to systems, non-transitory computer-readable media, and methods for augmenting the functionality of large language models using a hybrid causal-bidirectional attention method. In particular, the disclosed systems generate, from a plurality of tokens interpretable by a large language model, a set of context tokens comprising tokens with bidirectional attention and a set of span tokens comprising tokens with causal attention and bidirectional attention. Additionally, the disclosed systems modify parameters of the large language model at a first training stage by utilizing a first loss function that incorporates the set of context tokens and a second loss function that incorporates the set of span tokens. Further, the disclosed systems modify the parameters of the large language model at a second training stage by utilizing the first loss function, the second loss function, and a third loss function that incorporates the set of context tokens.Type: ApplicationFiled: November 13, 2024Publication date: May 14, 2026Inventors: Savya Khosla, Simon Jenni, Kushal Kafle, John Collomosse, Jing Shi, Handong Zhao
-
Patent number: 12586350Abstract: Embodiments are disclosed for training a system to generate audio and video representations using self-supervised learning. The method may include receiving a video signal including an audio component and a video component. A first machine learning model is trained to determine a representation of the audio component using a contrastive learning task and a temporal learning task. A second machine learning model to determine a representation of the video component using the contrastive learning task and the temporal learning task. By training the machine learning models using both contrastive learning tasks and temporal learning tasks, the machine learning models learn short term features, long term features, and semantic features of input data.Type: GrantFiled: January 31, 2023Date of Patent: March 24, 2026Assignee: Adobe Inc.Inventors: Simon Jenni, John Collomosse
-
Patent number: 12579831Abstract: Methods, systems, and non-transitory computer readable storage media are disclosed for generating difference captions indicating detected differences in digital image pairs. The disclosed system generates a first feature map of a first digital image and a second feature map of a second digital image. The disclosed system converts, utilizing a linear projection neural network, the first feature map to a first modified feature map in a feature space corresponding to a large language machine-learning model. The disclosed system also converts, utilizing the linear projection neural network layer, the second feature map to a second modified feature map in the feature space corresponding to the large language machine-learning model.Type: GrantFiled: October 18, 2023Date of Patent: March 17, 2026Assignees: Adobe Inc., University of SurreyInventors: Yifei Fan, John Collomosse, Jing Shi, Alexander Black
-
Publication number: 20260038167Abstract: Methods, systems, and non-transitory computer readable storage media are disclosed for generating segmentations of a raster image via a half-edge mesh structure with scanline operations. The disclosed system determines, during scanline operations on a raster image, a plurality of sets of adjacent pixels having a common color value in the raster image. The disclosed system determines, during the scanline operations on the raster image, a plurality of half-edges at edges of pixels along a boundary of a set of adjacent pixels of the plurality of sets of adjacent pixels with next half-edge directions indicating directions of subsequent half-edges along the boundary of the set of adjacent pixels. The disclosed system generates one or more oriented polyline boundary loops representing the boundary of the set of adjacent pixels from the plurality of half-edges and the next half-edge directions of the set of adjacent pixels.Type: ApplicationFiled: July 30, 2024Publication date: February 5, 2026Inventors: Jing Shi, Hang Hua, Scott Cohen, John Collomosse, Kushal Kafle, Simon Jenni
-
Publication number: 20260024304Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that leverages a series of versions of a digital image to generate a caption prediction. Furthermore, the disclosed systems receive an image difference captioning request that includes a series of versions of a digital image with a series of manipulations applied to the series of versions. Moreover, the disclosed systems access one or more edit descriptions for one or more of the series of manipulations. Further, the disclosed systems generate text inputs from the series of versions of the digital image and the one or more edit description. From the text inputs and using a large language model, the disclosed systems generate a caption prediction that indicates a difference between a first version and a last version of the series of versions of the digital image.Type: ApplicationFiled: July 22, 2024Publication date: January 22, 2026Inventors: Jing Shi, Alexander Black, John Collomosse, Yifei Fan
-
Patent number: 12505565Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media that utilize deep learning to identify regions of an image that have been editorially modified. For example, the image comparison system includes a deep image comparator model that compares a pair of images and localizes regions that have been editorially manipulated relative to an original or trusted image. More specifically, the deep image comparator model generates and surfaces visual indications of the location of such editorial changes on the modified image. The deep image comparator model is robust and ignores discrepancies due to benign image transformations that commonly occur during electronic image distribution. The image comparison system optionally includes an image retrieval model utilizes a visual search embedding that is robust to minor manipulations or benign modifications of images. The image retrieval model utilizes a visual search embedding for an image to robustly identify near duplicate images.Type: GrantFiled: May 27, 2022Date of Patent: December 23, 2025Assignees: Adobe Inc., University of SurreyInventors: John Collomosse, Alexander Black, Van Tu Bui, Hailin Jin, Viswanathan Swaminathan
-
Publication number: 20250307974Abstract: A method, apparatus, non-transitory computer readable medium, apparatus, and system for image processing include obtaining an input prompt describing an image element, generating, using an image generation model, an output image depicting the image element and including a watermark, and identifying the training image as a source of the output image based on the watermark. The image generation model is trained using a training image including the image element and the watermark.Type: ApplicationFiled: March 27, 2024Publication date: October 2, 2025Inventors: Shruti Agarwal, John Collomosse, Vishal Asnani
-
Patent number: 12417245Abstract: Embodiments are disclosed for performing content authentication. A method of content authentication may include dividing a query video into a plurality of chunks. A feature vector may be generated, using a fingerprinting model, for each chunk from the plurality of chunks. Similar video chunks are identified from a trusted chunk database based on the feature vectors using a multi-chunk search policy. One or more original videos corresponding to the query video are then returned.Type: GrantFiled: September 22, 2023Date of Patent: September 16, 2025Assignee: Adobe Inc.Inventors: Ritwik Sinha, Viswanathan Swaminathan, Simon Jenni, Md Mehrab Tanjim, John Collomosse
-
Patent number: 12361013Abstract: Embodiments of the present invention provide systems, methods, and computer storage media for guided visual search. A visual search query can be represented as a sketch sequence that includes ordering information of the constituent strokes in the sketch. The visual search query can be encoded into a structural search encoding in a common search space by a structural neural network. Indexed visual search results can be identified in the common search space and clustered in an auxiliary semantic space. Sketch suggestions can be identified from a plurality of indexed sketches in the common search space. A sketch suggestion can be identified for each semantic cluster of visual search results and presented with the cluster to guide a user towards relevant content through an iterative search process. Selecting a sketch suggestion as a target sketch can automatically transform the visual search query to the target sketch via adversarial images.Type: GrantFiled: June 17, 2021Date of Patent: July 15, 2025Assignee: Adobe Inc.Inventors: Hailin Jin, John Collomosse
-
Publication number: 20250131753Abstract: Methods, systems, and non-transitory computer readable storage media are disclosed for generating difference captions indicating detected differences in digital image pairs. The disclosed system generates a first feature map of a first digital image and a second feature map of a second digital image. The disclosed system converts, utilizing a linear projection neural network, the first feature map to a first modified feature map in a feature space corresponding to a large language machine-learning model. The disclosed system also converts, utilizing the linear projection neural network layer, the second feature map to a second modified feature map in the feature space corresponding to the large language machine-learning model.Type: ApplicationFiled: October 18, 2023Publication date: April 24, 2025Inventors: Yifei Fan, John Collomosse, Jing Shi, Alexander Black
-
Publication number: 20250103649Abstract: Embodiments are disclosed for performing content authentication. A method of content authentication may include dividing a query video into a plurality of chunks. A feature vector may be generated, using a fingerprinting model, for each chunk from the plurality of chunks. Similar video chunks are identified from a trusted chunk database based on the feature vectors using a multi-chunk search policy. One or more original videos corresponding to the query video are then returned.Type: ApplicationFiled: September 22, 2023Publication date: March 27, 2025Applicant: Adobe Inc.Inventors: Ritwik SINHA, Viswanathan SWAMINATHAN, Simon JENNI, Md Mehrab TANJIM, John COLLOMOSSE
-
Patent number: 12183056Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media that utilize a deep visual fingerprinting model with parameters learned from robust contrastive learning to identify matching digital images and image provenance information. For example, the disclosed systems utilize an efficient learning procedure that leverages training on bounded adversarial examples to more accurately identify digital images (including adversarial images) with a small computational overhead. To illustrate, the disclosed systems utilize a first objective function that iteratively identifies augmentations to increase contrastive loss. Moreover, the disclosed systems utilize a second objective function that iteratively learns parameters of a deep visual fingerprinting model to reduce the contrastive loss.Type: GrantFiled: January 11, 2022Date of Patent: December 31, 2024Assignee: Adobe Inc.Inventors: Maksym Andriushchenko, John Collomosse, Xiaoyang Li, Geoffrey Oxholm
-
Publication number: 20240430515Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media that utilize deep learning to map query videos to known videos so as to identify a provenance of the query video or identify editorial manipulations of the query video relative to a known video. For example, the video comparison system includes a deep video comparator model that generates and compares visual and audio descriptors utilizing codewords and an inverse index. The deep video comparator model is robust and ignores discrepancies due to benign transformations that commonly occur during electronic video distribution.Type: ApplicationFiled: September 2, 2024Publication date: December 26, 2024Inventors: Alexander Black, Van Tu Bui, John Collomosse, Simon Jenni, Viswanathan Swaminathan
-
Patent number: 12081827Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media that utilize deep learning to map query videos to known videos so as to identify a provenance of the query video or identify editorial manipulations of the query video relative to a known video. For example, the video comparison system includes a deep video comparator model that generates and compares visual and audio descriptors utilizing codewords and an inverse index. The deep video comparator model is robust and ignores discrepancies due to benign transformations that commonly occur during electronic video distribution.Type: GrantFiled: August 26, 2022Date of Patent: September 3, 2024Assignees: Adobe Inc., University of SurreyInventors: Alexander Black, Van Tu Bui, John Collomosse, Simon Jenni, Viswanathan Swaminathan
-
Publication number: 20240281504Abstract: Systems and methods for managing rights for creative work are provided. One aspect of the systems and methods includes receiving, at a rights contract on a distributed virtual machine operated based on a public blockchain, input data including an ownership token identifier for an ownership token, where the ownership token indicates ownership of a creative work. Another aspect of the systems and methods includes obtaining, at the rights contract, an indication of usage rights for the creative work corresponding to the ownership token. Yet another aspect of the systems and methods includes minting, via the rights contract, a rights token corresponding to the ownership token, where the rights token includes a reference to the indication of the usage rights for the creative work.Type: ApplicationFiled: February 22, 2023Publication date: August 22, 2024Inventors: John Collomosse, Andrew S. Parsons
-
Publication number: 20240273355Abstract: Embodiments are disclosed for identifying matching content using neural content fingerprints. The method may include receiving a request to identify content matching a query content item, wherein the query content item is a time varying content item, generating, by an embedding network, a neural fingerprint for the query content item, identifying one or more candidate content items based on the neural fingerprint of the query content item, determining, by a ranking network, one or more similarity scores corresponding to the one or more candidate content items, and identifying one or more matching content items based on the one or more similarity scores.Type: ApplicationFiled: February 13, 2023Publication date: August 15, 2024Applicant: Adobe Inc.Inventors: Nicholas J. BRYAN, Simon JENNI, John COLLOMOSSE, Christian James STEINMETZ
-
Publication number: 20240257496Abstract: Embodiments are disclosed for training a system to generate audio and video representations using self-supervised learning. The method may include receiving a video signal including an audio component and a video component. A first machine learning model is trained to determine a representation of the audio component using a contrastive learning task and a temporal learning task. A second machine learning model to determine a representation of the video component using the contrastive learning task and the temporal learning task. By training the machine learning models using both contrastive learning tasks and temporal learning tasks, the machine learning models learn short term features, long term features, and semantic features of input data.Type: ApplicationFiled: January 31, 2023Publication date: August 1, 2024Applicant: Adobe Inc.Inventors: Simon JENNI, John COLLOMOSSE
-
Patent number: 11966849Abstract: Techniques and systems are provided for configuring neural networks to perform certain image manipulation operations. For instance, in response to obtaining an image for manipulation, an image manipulation system determines the fitness scores for a set of neural networks resulting from the processing of a noise map. Based on these fitness scores, the image manipulation system selects a subset of the set of neural networks for cross-breeding into a new generation of neural networks. The image manipulation system evaluates the performance of this new generation of neural networks and continues cross-breeding this neural networks until a fitness threshold is satisfied. From the final generation of neural networks, the image manipulation system selects a neural network that provides a desired output and uses the neural network to generate the manipulated image.Type: GrantFiled: February 20, 2020Date of Patent: April 23, 2024Assignee: Adobe Inc.Inventors: John Collomosse, Hailin Jin