Publication number: 20210081602
Abstract: Machine learning, artificial intelligence, and other computer-implemented methods are used to identify various semantically important chunks in documents, automatically label them with appropriate datatypes and semantic roles, and use this enhanced information to assist authors and to support downstream processes. Chunk locations, datatypes, and semantic roles can often be automatically determined from what is here called “context”, to wit, the combination of their formatting, structure, and content; those of adjacent or nearby content; overall patterns of occurrence in a document, and similarities of all these things across documents (mainly but not exclusively among documents in the same document set).
Type: Application
Filed: August 5, 2020
Publication date: March 18, 2021
Inventors: Andrew Begun, Steven DeRose, Taqi Jaffri, Luis Marti Orosa, Michael Palmer, Jean Paoli, Christina Pavlopoulou, Elena Pricoiu, Swagatika Sarangi, Marcin Sawicki, Manar Shehadeh, Michael Taron, Bhaven Toprani, Zubin Rustom Wadia, David Watson, Eric White, Joshua Yongshin Fan, Kush Gupta, Andrew Minh Hoang, Zhanlin Liu, Jerome George Paliakkara, Zhaofeng Wu, Yue Zhang, Xiaoquan Zhou
Publication number: 20210081601
Abstract: Machine learning, artificial intelligence, and other computer-implemented methods are used to identify various semantically important chunks in documents, automatically label them with appropriate datatypes and semantic roles, and use this enhanced information to assist authors and to support downstream processes. Chunk locations, datatypes, and semantic roles can often be automatically determined from what is here called “context”, to wit, the combination of their formatting, structure, and content; those of adjacent or nearby content; overall patterns of occurrence in a document, and similarities of all these things across documents (mainly but not exclusively among documents in the same document set).
Type: Application
Filed: August 5, 2020
Publication date: March 18, 2021
Inventors: Andrew Begun, Steven DeRose, Taqi Jaffri, Luis Marti Orosa, Michael Palmer, Jean Paoli, Christina Pavlopoulou, Elena Pricoiu, Swagatika Sarangi, Marcin Sawicki, Manar Shehadeh, Michael Taron, Bhaven Toprani, Zubin Rustom Wadia, David Watson, Eric White, Joshua Yongshin Fan, Kush Gupta, Andrew Minh Hoang, Zhanlin Liu, Jerome George Paliakkara, Zhaofeng Wu, Yue Zhang, Xiaoquan Zhou