Patents by Inventor SHAWN ALAN GAITHER
SHAWN ALAN GAITHER has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20230336532Abstract: Systems and techniques for privacy preserving document analysis are described that derive insights pertaining to a digital document without communication of the content of the digital document. To do so, the privacy preserving document analysis techniques described herein capture visual or contextual features of the digital document and creates a stamp representation that represents these features without included the content of the digital document. The stamp representation is projected into a stamp embedding space based on a stamp encoding model generated through machine learning techniques capturing feature patterns and interaction in the stamp representations. The stamp encoding model exploits these feature interactions to define similarity of source documents based on location within the stamp embedding space. Accordingly, the techniques described herein can determine a similarity of documents without having access to the documents themselves.Type: ApplicationFiled: May 15, 2023Publication date: October 19, 2023Applicant: Adobe Inc.Inventors: Nikolaos Barmpalios, Ruchi Rajiv Deshpande, Randy Lee Swineford, Nargol Rezvani, Andrew Marc Greene, Shawn Alan Gaither, Michael Kraley
-
Patent number: 11769111Abstract: The present invention is directed towards providing automated workflows for the identification of a reading order from text segments extracted from a document. Ordering the text segments is based on trained natural language models. In some embodiments, the workflows are enabled to perform a method for identifying a sequence associated with a portable document. The methods includes iteratively generating a probabilistic language model, receiving the portable document, and selectively extracting features (such as but not limited to text segments) from the document. The method may generate pairs of features (or feature pair from the extracted features). The method may further generate a score for each of the pairs based on the probabilistic language model and determine an order to features based on the scores. The method may provide the extracted features in the determined order.Type: GrantFiled: June 18, 2020Date of Patent: September 26, 2023Assignee: Adobe Inc.Inventors: Trung Huu Bui, Hung Hai Bui, Shawn Alan Gaither, Walter Wei-Tuh Chang, Michael Frank Kraley, Pranjal Daga
-
Patent number: 11689507Abstract: Systems and techniques for privacy preserving document analysis are described that derive insights pertaining to a digital document without communication of the content of the digital document. To do so, the privacy preserving document analysis techniques described herein capture visual or contextual features of the digital document and creates a stamp representation that represents these features without included the content of the digital document. The stamp representation is projected into a stamp embedding space based on a stamp encoding model generated through machine learning techniques capturing feature patterns and interaction in the stamp representations. The stamp encoding model exploits these feature interactions to define similarity of source documents based on location within the stamp embedding space. Accordingly, the techniques described herein can determine a similarity of documents without having access to the documents themselves.Type: GrantFiled: November 26, 2019Date of Patent: June 27, 2023Assignee: Adobe Inc.Inventors: Nikolaos Barmpalios, Ruchi Rajiv Deshpande, Randy Lee Swineford, Nargol Rezvani, Andrew Marc Greene, Shawn Alan Gaither, Michael Kraley
-
Patent number: 11386685Abstract: Techniques are provided for identifying structural elements of a document. One Methodology includes generating a first channel of rasterized content by rasterizing a full page of the document and generating one or more additional channels of rasterized content from the page of the document by rasterizing one or more corresponding content types from the page of the document. Each of the one or more additional channels includes a specific type of content that is different from each of the other one or more additional channels. The methodology further includes inputting the first channel of rasterized content and the one or more additional channels of rasterized content into a machine learning (ML) model. The methodology continues with determining location and classification for each of a plurality of structural elements on the page of the document using the ML model.Type: GrantFiled: October 17, 2019Date of Patent: July 12, 2022Assignee: Adobe Inc.Inventors: Verena Sabine Kaynig-Fittkau, Smitha Bangalore Naresh, Shawn Alan Gaither, Richard Cohn, Paul John Asente, Eylon Stroh, Emily Seminerio
-
Publication number: 20220172501Abstract: Techniques are disclosed for identifying asides within a document, and detecting a display order of contents based of the identified asides. In a document, an “aside” represents a content region of the document that is distinct from the main content regions, and may be visually distinguishable from the main content region. In an example, a document is received, where the document lacks identification of asides. The document is analyzed to identify asides within the document. A display order of contents within the document is then determined, based on the identified asides. For example, in the display order, the asides are ordered between two segments of the main content and/or at a beginning or an end of the main content, but may not be ordered to be embedded in between a segment of the main content. The document is displayed in accordance with the display order.Type: ApplicationFiled: February 17, 2022Publication date: June 2, 2022Applicant: Adobe Inc.Inventors: Sanjeev Tagra, Shawn Alan Gaither, Shagun Kush, Samarth Gupta, Sachin Soni, Nikolaos Barmpalios, Abhishek Jain, Naqushab Neyazee
-
Patent number: 11256913Abstract: Techniques are disclosed for identifying asides within a document, and detecting a display order of contents based of the identified asides. In a document, an “aside” represents a content region of the document that is distinct from the main content regions, and may be visually distinguishable from the main content region. In an example, a document is received, where the document lacks identification of asides. The document is analyzed to identify asides within the document. A display order of contents within the document is then determined, based on the identified asides. For example, in the display order, the asides are ordered between two segments of the main content and/or at a beginning or an end of the main content, but may not be ordered to be embedded in between a segment of the main content. The document is displayed in accordance with the display order.Type: GrantFiled: October 10, 2019Date of Patent: February 22, 2022Assignee: Adobe Inc.Inventors: Sanjeev Tagra, Shawn Alan Gaither, Shagun Kush, Samarth Gupta, Sachin Soni, Nikolaos Barmpalios, Abhishek Jain, Naqushab Neyazee
-
Publication number: 20210160221Abstract: Systems and techniques for privacy preserving document analysis are described that derive insights pertaining to a digital document without communication of the content of the digital document. To do so, the privacy preserving document analysis techniques described herein capture visual or contextual features of the digital document and creates a stamp representation that represents these features without included the content of the digital document. The stamp representation is projected into a stamp embedding space based on a stamp encoding model generated through machine learning techniques capturing feature patterns and interaction in the stamp representations. The stamp encoding model exploits these feature interactions to define similarity of source documents based on location within the stamp embedding space. Accordingly, the techniques described herein can determine a similarity of documents without having access to the documents themselves.Type: ApplicationFiled: November 26, 2019Publication date: May 27, 2021Applicant: Adobe Inc.Inventors: Nikolaos Barmpalios, Ruchi Rajiv Deshpande, Randy Lee Swineford, Nargol Rezvani, Andrew Marc Greene, Shawn Alan Gaither, Michael Kraley
-
Publication number: 20210117666Abstract: Techniques are provided for identifying structural elements of a document. One Methodology includes generating a first channel of rasterized content by rasterizing a full page of the document and generating one or more additional channels of rasterized content from the page of the document by rasterizing one or more corresponding content types from the page of the document. Each of the one or more additional channels includes a specific type of content that is different from each of the other one or more additional channels. The methodology further includes inputting the first channel of rasterized content and the one or more additional channels of rasterized content into a machine learning (ML) model. The methodology continues with determining location and classification for each of a plurality of structural elements on the page of the document using the ML model.Type: ApplicationFiled: October 17, 2019Publication date: April 22, 2021Applicant: Adobe Inc.Inventors: Verena Sabine Kaynig-Fittkau, Smitha Bangalore Naresh, Shawn Alan Gaither, Richard Cohn, Paul John Asente, Eylon Stroh, Emily Seminerio
-
Publication number: 20210110151Abstract: Techniques are disclosed for identifying asides within a document, and detecting a display order of contents based of the identified asides. In a document, an “aside” represents a content region of the document that is distinct from the main content regions, and may be visually distinguishable from the main content region. In an example, a document is received, where the document lacks identification of asides. The document is analyzed to identify asides within the document. A display order of contents within the document is then determined, based on the identified asides. For example, in the display order, the asides are ordered between two segments of the main content and/or at a beginning or an end of the main content, but may not be ordered to be embedded in between a segment of the main content. The document is displayed in accordance with the display order.Type: ApplicationFiled: October 10, 2019Publication date: April 15, 2021Applicant: Adobe Inc.Inventors: Sanjeev Tagra, Shawn Alan Gaither, Shagun Kush, Samarth Gupta, Sachin Soni, Nikolaos Barmpalios, Abhishek Jain, Naqushab Neyazee
-
Publication number: 20210110153Abstract: Techniques described herein implement heading identification and classification for a digital document in a digital medium environment. A document analysis system is leveraged to extract structural features from a digital document, identify heading candidates from among the structural features, validate the headings candidates, and classify validated headings into different headings types. The classified headings are then utilized to generate a sectioned version of the digital document (“sectioned document”) that is divided into different sections based on the headings. Further, a document directory is generated that includes the headings and that enables navigation to different sections of the sectioned document.Type: ApplicationFiled: October 9, 2019Publication date: April 15, 2021Applicant: Adobe Inc.Inventors: Mohit Gupta, Uttam Dwivedi, Shawn Alan Gaither, Jayant Vaibhav Srivastava, Ashutosh Mehra
-
Patent number: 10956731Abstract: Techniques described herein implement heading identification and classification for a digital document in a digital medium environment. A document analysis system is leveraged to extract structural features from a digital document, identify heading candidates from among the structural features, validate the headings candidates, and classify validated headings into different headings types. The classified headings are then utilized to generate a sectioned version of the digital document (“sectioned document”) that is divided into different sections based on the headings. Further, a document directory is generated that includes the headings and that enables navigation to different sections of the sectioned document.Type: GrantFiled: October 9, 2019Date of Patent: March 23, 2021Assignee: Adobe Inc.Inventors: Mohit Gupta, Uttam Dwivedi, Shawn Alan Gaither, Jayant Vaibhav Srivastava, Ashutosh Mehra
-
Publication number: 20200320329Abstract: The present invention is directed towards providing automated workflows for the identification of a reading order from text segments extracted from a document. Ordering the text segments is based on trained natural language models. In some embodiments, the workflows are enabled to perform a method for identifying a sequence associated with a portable document. The methods includes iteratively generating a probabilistic language model, receiving the portable document, and selectively extracting features (such as but not limited to text segments) from the document. The method may generate pairs of features (or feature pair from the extracted features). The method may further generate a score for each of the pairs based on the probabilistic language model and determine an order to features based on the scores. The method may provide the extracted features in the determined order.Type: ApplicationFiled: June 18, 2020Publication date: October 8, 2020Inventors: Trung Huu Bui, Hung Hai Bui, Shawn Alan Gaither, Walter Wei-Tuh Chang, Michael Frank Kraley, Pranjal Daga
-
Patent number: 10713519Abstract: The present invention is directed towards providing automated workflows for the identification of a reading order from text segments extracted from a document. Ordering the text segments is based on trained natural language models. In some embodiments, the workflows are enabled to perform a method for identifying a sequence associated with a portable document. The methods includes iteratively generating a probabilistic language model, receiving the portable document, and selectively extracting features (such as but not limited to text segments) from the document. The method may generate pairs of features (or feature pair from the extracted features). The method may further generate a score for each of the pairs based on the probabilistic language model and determine an order to features based on the scores. The method may provide the extracted features in the determined order.Type: GrantFiled: June 22, 2017Date of Patent: July 14, 2020Assignee: ADOBE INC.Inventors: Trung Huu Bui, Hung Hai Bui, Shawn Alan Gaither, Walter Wei-Tuh Chang, Michael Frank Kraley, Pranjal Daga
-
Publication number: 20180373952Abstract: The present invention is directed towards providing automated workflows for the identification of a reading order from text segments extracted from a document. Ordering the text segments is based on trained natural language models. In some embodiments, the workflows are enabled to perform a method for identifying a sequence associated with a portable document. The methods includes iteratively generating a probabilistic language model, receiving the portable document, and selectively extracting features (such as but not limited to text segments) from the document. The method may generate pairs of features (or feature pair from the extracted features). The method may further generate a score for each of the pairs based on the probabilistic language model and determine an order to features based on the scores. The method may provide the extracted features in the determined order.Type: ApplicationFiled: June 22, 2017Publication date: December 27, 2018Inventors: Trung Huu Bui, Hung Hai Bui, Shawn Alan Gaither, Walter Wei-Tuh Chang, Michael Frank Kraley, Pranjal Daga
-
Patent number: 10133813Abstract: Embodiments of the present invention provide systems, methods, and computer storage media directed at predicting values for an electronic form. In embodiments, the method can include forming synonym groupings of form field labels for a number of users. The synonym groupings can be based on an analysis of the similarity of form field values that are associated with form field labels. In embodiments a predictive model may be generated from these synonym groupings. The predictive model can correlate the synonym groupings of one user with synonym groupings of one or more additional users to enable a determination of one or more predicted form field values for the one user based on a queried form field label even though the one user may have never submitted an electronic form with the queried form field label. Other embodiments may be described and/or claimed.Type: GrantFiled: August 12, 2015Date of Patent: November 20, 2018Assignee: Adobe Systems IncorporatedInventors: Shawn Alan Gaither, Eylon Stroh, Priyank Mathur, Randy Swineford
-
Publication number: 20170046622Abstract: Embodiments of the present invention provide systems, methods, and computer storage media directed at predicting values for an electronic form. In embodiments, the method can include forming synonym groupings of form field labels for a number of users. The synonym groupings can be based on an analysis of the similarity of form field values that are associated with form field labels. In embodiments a predictive model may be generated from these synonym groupings. The predictive model can correlate the synonym groupings of one user with synonym groupings of one or more additional users to enable a determination of one or more predicted form field values for the one user based on a queried form field label even though the one user may have never submitted an electronic form with the queried form field label. Other embodiments may be described and/or claimed.Type: ApplicationFiled: August 12, 2015Publication date: February 16, 2017Inventors: SHAWN ALAN GAITHER, EYLON STROH, PRIYANK MATHUR, RANDY SWINEFORD