Patents by Inventor Michael Kraley

Michael Kraley has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Privacy preserving document analysis

Patent number: 12267305

Abstract: Systems and techniques for privacy preserving document analysis are described that derive insights pertaining to a digital document without communication of the content of the digital document. To do so, the privacy preserving document analysis techniques described herein capture visual or contextual features of the digital document and creates a stamp representation that represents these features without included the content of the digital document. The stamp representation is projected into a stamp embedding space based on a stamp encoding model generated through machine learning techniques capturing feature patterns and interaction in the stamp representations. The stamp encoding model exploits these feature interactions to define similarity of source documents based on location within the stamp embedding space. Accordingly, the techniques described herein can determine a similarity of documents without having access to the documents themselves.

Type: Grant

Filed: May 15, 2023

Date of Patent: April 1, 2025

Assignee: Adobe Inc.

Inventors: Nikolaos Barmpalios, Ruchi Rajiv Deshpande, Randy Lee Swineford, Nargol Rezvani, Andrew Marc Greene, Shawn Alan Gaither, Michael Kraley
Privacy Preserving Document Analysis

Publication number: 20230336532

Abstract: Systems and techniques for privacy preserving document analysis are described that derive insights pertaining to a digital document without communication of the content of the digital document. To do so, the privacy preserving document analysis techniques described herein capture visual or contextual features of the digital document and creates a stamp representation that represents these features without included the content of the digital document. The stamp representation is projected into a stamp embedding space based on a stamp encoding model generated through machine learning techniques capturing feature patterns and interaction in the stamp representations. The stamp encoding model exploits these feature interactions to define similarity of source documents based on location within the stamp embedding space. Accordingly, the techniques described herein can determine a similarity of documents without having access to the documents themselves.

Type: Application

Filed: May 15, 2023

Publication date: October 19, 2023

Applicant: Adobe Inc.

Inventors: Nikolaos Barmpalios, Ruchi Rajiv Deshpande, Randy Lee Swineford, Nargol Rezvani, Andrew Marc Greene, Shawn Alan Gaither, Michael Kraley
Document structure extraction using machine learning

Patent number: 11769072

Abstract: The structure of an untagged document can be derived using a predictive model that is trained in a supervised learning framework based on a corpus of tagged training documents. Analyzing the training documents results in a plurality of document part feature vectors, each of which correlates a category defining a document part (for example, “title” or “body paragraph”) with one or more feature-value pairs (for example, “font=Arial” or “alignment=centered”). Any suitable machine learning algorithm can be used to train the predictive model based on the document part feature vectors extracted from the training documents. Once the predictive model has been trained, it can receive feature-value pairs corresponding to a portion of an untagged document and make predictions with respect to the how that document part should be categorized. The predictive model can therefore generate tag metadata that defines a structure of the untagged document in an automated fashion.

Type: Grant

Filed: August 8, 2016

Date of Patent: September 26, 2023

Assignee: Adobe Inc.

Inventor: Michael Kraley
Privacy preserving document analysis

Patent number: 11689507

Abstract: Systems and techniques for privacy preserving document analysis are described that derive insights pertaining to a digital document without communication of the content of the digital document. To do so, the privacy preserving document analysis techniques described herein capture visual or contextual features of the digital document and creates a stamp representation that represents these features without included the content of the digital document. The stamp representation is projected into a stamp embedding space based on a stamp encoding model generated through machine learning techniques capturing feature patterns and interaction in the stamp representations. The stamp encoding model exploits these feature interactions to define similarity of source documents based on location within the stamp embedding space. Accordingly, the techniques described herein can determine a similarity of documents without having access to the documents themselves.

Type: Grant

Filed: November 26, 2019

Date of Patent: June 27, 2023

Assignee: Adobe Inc.

Inventors: Nikolaos Barmpalios, Ruchi Rajiv Deshpande, Randy Lee Swineford, Nargol Rezvani, Andrew Marc Greene, Shawn Alan Gaither, Michael Kraley
Automatically generating labeled synthetic documents

Patent number: 11238312

Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for generating diverse and realistic synthetic documents using deep learning. In particular, the disclosed systems can utilize a trained neural network to generate realistic image layouts comprising page elements that comply with layout parameters. The disclosed systems can also generate synthetic content corresponding to the page elements within the image layouts. The disclosed systems insert the synthetic content into the corresponding page elements of documents based on the image layouts to generate synthetic documents.

Type: Grant

Filed: November 21, 2019

Date of Patent: February 1, 2022

Assignee: Adobe Inc.

Inventors: Verena Kaynig-Fittkau, Sruthi Madapoosi Ravi, Richard Cohn, Nikolaos Barmpalios, Michael Kraley, Kanchana Sethu
AUTOMATICALLY GENERATING LABELED SYNTHETIC DOCUMENTS

Publication number: 20210158093

Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for generating diverse and realistic synthetic documents using deep learning. In particular, the disclosed systems can utilize a trained neural network to generate realistic image layouts comprising page elements that comply with layout parameters. The disclosed systems can also generate synthetic content corresponding to the page elements within the image layouts. The disclosed systems insert the synthetic content into the corresponding page elements of documents based on the image layouts to generate synthetic documents.

Type: Application

Filed: November 21, 2019

Publication date: May 27, 2021

Inventors: Verena Kaynig-Fittkau, Sruthi Madapoosi Ravi, Richard Cohn, Nikolaos Barmpalios, Michael Kraley, Kanchana Sethu
Privacy Preserving Document Analysis

Publication number: 20210160221

Abstract: Systems and techniques for privacy preserving document analysis are described that derive insights pertaining to a digital document without communication of the content of the digital document. To do so, the privacy preserving document analysis techniques described herein capture visual or contextual features of the digital document and creates a stamp representation that represents these features without included the content of the digital document. The stamp representation is projected into a stamp embedding space based on a stamp encoding model generated through machine learning techniques capturing feature patterns and interaction in the stamp representations. The stamp encoding model exploits these feature interactions to define similarity of source documents based on location within the stamp embedding space. Accordingly, the techniques described herein can determine a similarity of documents without having access to the documents themselves.

Type: Application

Filed: November 26, 2019

Publication date: May 27, 2021

Applicant: Adobe Inc.

Inventors: Nikolaos Barmpalios, Ruchi Rajiv Deshpande, Randy Lee Swineford, Nargol Rezvani, Andrew Marc Greene, Shawn Alan Gaither, Michael Kraley
Identification of reading order text segments with a probabilistic language model

Patent number: 10372821

Abstract: Certain embodiments identify a correct structured reading-order sequence of text segments extracted from a file. A probabilistic language model is generated from a large text corpus to comprise observed word sequence patterns for a given language. The language model measures whether splicing together a first text segment with another continuation text segment results in a phrase that is more likely than a phrase resulting from splicing together the first text segment with other continuation text segments. Sets of text segments, which include a first set with a first text segment and a first continuation text segment as well as a second set with the first text segment and a second continuation text segment, are provided to the probabilistic model. A score indicative of a likelihood of the set providing a correct structured reading-order sequence is obtained for each set of text segments.

Type: Grant

Filed: March 17, 2017

Date of Patent: August 6, 2019

Assignee: Adobe Inc.

Inventors: Walter Chang, Trung Bui, Pranjal Daga, Michael Kraley, Hung Bui
IDENTIFICATION OF READING ORDER TEXT SEGMENTS WITH A PROBABILISTIC LANGUAGE MODEL

Publication number: 20180267956

Abstract: A computer implemented method and system identifies correct structured reading-order sequence of text segments that are extracted from a file structured in a portable document format. A probabilistic language model is generated from a large text corpus to comprise observed word sequence patterns for a given language. The language model measures whether splicing together a first text segment with another continuation text segment results in a phrase that is more likely than a phrase resulting from splicing together the first text segment with other continuation text segments. Sets of text segments are provided to the probabilistic model, where the sets of text segments comprise a first set including the first text segment and a first continuation text segment. A second set includes the first text segment and a second continuation text segment. A score is obtained for each set of text segments. The score is indicative of a likelihood of the set providing a correct structured reading-order sequence.

Type: Application

Filed: March 17, 2017

Publication date: September 20, 2018

Applicant: Adobe Systems Incorporated

Inventors: Walter Chang, Trung Bui, Pranjal Daga, Michael Kraley, Hung Bui
DOCUMENT STRUCTURE EXTRACTION USING MACHINE LEARNING

Publication number: 20180039907

Abstract: The structure of an untagged document can be derived using a predictive model that is trained in a supervised learning framework based on a corpus of tagged training documents. Analyzing the training documents results in a plurality of document part feature vectors, each of which correlates a category defining a document part (for example, “title” or “body paragraph”) with one or more feature-value pairs (for example, “font=Arial” or “alignment=centered”). Any suitable machine learning algorithm can be used to train the predictive model based on the document part feature vectors extracted from the training documents. Once the predictive model has been trained, it can receive feature-value pairs corresponding to a portion of an untagged document and make predictions with respect to the how that document part should be categorized. The predictive model can therefore generate tag metadata that defines a structure of the untagged document in an automated fashion.

Type: Application

Filed: August 8, 2016

Publication date: February 8, 2018

Applicant: Adobe Systems Incorporated

Inventor: Michael Kraley
Automatic document classification via content analysis at storage time

Patent number: 9547712

Abstract: Techniques are disclosed for efficiently and automatically classifying textual documents or files. In some embodiments, the classification process is integrated into or otherwise made part of the storage function, such that when the user initiates a save process for a given file, the file is processed through a classifier prior to (or contemporaneously with) completing the save function. In some such embodiments, textual content of the file is analyzed using natural language processing to identify a main or substantial concept discussed in the file, and one or more corresponding tags are then assigned to that file. Subsequently, the user can access that file based on the one or more tags, for instance, through a user interface that allows the user to select one or more content categories associated with the assigned tags. The files can be text-based, but may include other content as well, such as images, video, and audio.

Type: Grant

Filed: February 25, 2016

Date of Patent: January 17, 2017

Assignee: Adobe Systems Incorporated

Inventor: Michael Kraley
AUTOMATIC DOCUMENT CLASSIFICATION VIA CONTENT ANALYSIS AT STORAGE TIME

Publication number: 20160171084

Abstract: Techniques are disclosed for efficiently and automatically classifying textual documents or files. In some embodiments, the classification process is integrated into or otherwise made part of the storage function, such that when the user initiates a save process for a given file, the file is processed through a classifier prior to (or contemporaneously with) completing the save function. In some such embodiments, textual content of the file is analyzed using natural language processing to identify a main or substantial concept discussed in the file, and one or more corresponding tags are then assigned to that file. Subsequently, the user can access that file based on the one or more tags, for instance, through a user interface that allows the user to select one or more content categories associated with the assigned tags. The files can be text-based, but may include other content as well, such as images, video, and audio.

Type: Application

Filed: February 25, 2016

Publication date: June 16, 2016

Applicant: Adobe Systems Incorporated

Inventor: Michael Kraley
AUTOMATIC DOCUMENT CLASSIFICATION VIA CONTENT ANALYSIS AT STORAGE TIME

Publication number: 20160098483

Abstract: Techniques are disclosed for efficiently and automatically classifying textual documents or files. In some embodiments, the classification process is integrated into or otherwise made part of the storage function, such that when the user initiates a save process for a given file, the file is processed through a classifier prior to (or contemporaneously with) completing the save function. In some such embodiments, textual content of the file is analyzed using natural language processing to identify a main or substantial concept discussed in the file, and one or more corresponding tags are then assigned to that file. Subsequently, the user can access that file based on the one or more tags, for instance, through a user interface that allows the user to select one or more content categories associated with the assigned tags. The files can be text-based, but may include other content as well, such as images, video, and audio.

Type: Application

Filed: December 11, 2015

Publication date: April 7, 2016

Applicant: Adobe Systems Incorporated

Inventor: Michael Kraley
Automatic document classification via content analysis at storage time

Patent number: 9298813

Abstract: Techniques are disclosed for efficiently and automatically classifying textual documents or files. In some embodiments, the classification process is integrated into or otherwise made part of the storage function, such that when the user initiates a save process for a given file, the file is processed through a classifier prior to (or contemporaneously with) completing the save function. In some such embodiments, textual content of the file is analyzed using natural language processing to identify a main or substantial concept discussed in the file, and one or more corresponding tags are then assigned to that file. Subsequently, the user can access that file based on the one or more tags, for instance, through a user interface that allows the user to select one or more content categories associated with the assigned tags. The files can be text-based, but may include other content as well, such as images, video, and audio.

Type: Grant

Filed: December 11, 2015

Date of Patent: March 29, 2016

Assignee: Adobe Systems Incorporated

Inventor: Michael Kraley
Automatic document classification via content analysis at storage time

Patent number: 9239876

Abstract: Techniques are disclosed for efficiently and automatically classifying textual documents or files. In some embodiments, the classification process is integrated into or otherwise made part of the storage function, such that when the user initiates a save process for a given file, the file is processed through a classifier prior to (or contemporaneously with) completing the save function. In some such embodiments, textual content of the file is analyzed using natural language processing to identify a main or substantial concept discussed in the file, and one or more corresponding tags are then assigned to that file. Subsequently, the user can access that file based on the one or more tags, for instance, through a user interface that allows the user to select one or more content categories associated with the assigned tags. The files can be text-based, but may include other content as well, such as images, video, and audio.

Type: Grant

Filed: December 3, 2012

Date of Patent: January 19, 2016

Assignee: ADOBE SYSTEMS INCORPORATED

Inventor: Michael Kraley
System and method for editing an item list in electronic content

Patent number: 8849869

Abstract: In various embodiments, a computerized method includes creating a first item of a list having at least two items that form a portion of electronic content. The computerized method can include creating a second item of the list within the electronic content, as well as converting the second item to a separate paragraph below the first item, wherein the separate paragraph is part of the first item. The computerized method may include creating another item of the list below the separate paragraph, wherein a continuity is maintained between the first item and the another item of the list. The computerized method includes storing the electronic content in a machine-readable medium.

Type: Grant

Filed: March 11, 2013

Date of Patent: September 30, 2014

Assignee: Adobe Systems Incorporated

Inventor: Michael Kraley
AUTOMATIC DOCUMENT CLASSIFICATION VIA CONTENT ANALYSIS AT STORAGE TIME

Publication number: 20140156665

Abstract: Techniques are disclosed for efficiently and automatically classifying textual documents or files. In some embodiments, the classification process is integrated into or otherwise made part of the storage function, such that when the user initiates a save process for a given file, the file is processed through a classifier prior to (or contemporaneously with) completing the save function. In some such embodiments, textual content of the file is analyzed using natural language processing to identify a main or substantial concept discussed in the file, and one or more corresponding tags are then assigned to that file. Subsequently, the user can access that file based on the one or more tags, for instance, through a user interface that allows the user to select one or more content categories associated with the assigned tags. The files can be text-based, but may include other content as well, such as images, video, and audio.

Type: Application

Filed: December 3, 2012

Publication date: June 5, 2014

Applicant: ADOBE SYSTEMS INCORPORATED

Inventor: Michael Kraley
SYSTEM AND METHOD FOR EDITING AN ITEM LIST IN ELECTRONIC CONTENT

Publication number: 20130198622

Abstract: In various embodiments, a computerized method includes creating a first item of a list having at least two items that form a portion of electronic content. The computerized method can include creating a second item of the list within the electronic content, as well as converting the second item to a separate paragraph below the first item, wherein the separate paragraph is part of the first item. The computerized method may include creating another item of the list below the separate paragraph, wherein a continuity is maintained between the first item and the another item of the list. The computerized method includes storing the electronic content in a machine-readable medium.

Type: Application

Filed: March 11, 2013

Publication date: August 1, 2013

Applicant: Adobe System Incorporated

Inventor: Michael Kraley
System and method for editing an item list in electronic content

Patent number: 8396900

Abstract: In various embodiments, a computerized method includes creating a first item of a list having at least two items that form a portion of electronic content. The computerized method can include creating a second item of the list within the electronic content, as well as converting the second item to a separate paragraph below the first item, wherein the separate paragraph is part of the first item. The computerized method may include creating another item of the list below the separate paragraph, wherein a continuity is maintained between the first item and the another item of the list. The computerized method includes storing the electronic content in a machine-readable medium.

Type: Grant

Filed: April 6, 2011

Date of Patent: March 12, 2013

Assignee: Adobe Systems Incorporated

Inventor: Michael Kraley