Patents by Inventor Douglas Ronald Burdick
Douglas Ronald Burdick has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11688193Abstract: A computer system, product, and method are provided to utilize machine learning to facilitate document processing. A document collection is introduced to an artificial neural network (ANN), which subjects the document collection to table region identification within discretized contiguous areas. The documents are assigned to one or more clusters responsive to the leveraged ANN. Documents are selectively evaluated from the clusters, and one or more label corrections are applied to the ANN. The ANN generates an updated document collection incorporating the applied one or more label corrections.Type: GrantFiled: November 13, 2020Date of Patent: June 27, 2023Assignee: International Business Machines CorporationInventors: Xin Ru Wang, Douglas Ronald Burdick, Yunyao Li
-
Patent number: 11681734Abstract: Organizing and/or aligning fragments of text that are included in a set of physical and/or digital documents so that the arrangement of the text fragments is in a readily understandable and meaningful format for a given reader. This organization and/or alignment uses a relation model of the various text fragments to correlate a meaning between and amongst the various text fragments to ultimately determine the final alignment and/or arrangement of those text fragments.Type: GrantFiled: December 9, 2020Date of Patent: June 20, 2023Assignee: International Business Machines CorporationInventors: Robert G. Farrell, Mustafa Canim, Douglas Ronald Burdick
-
Patent number: 11551146Abstract: One embodiment provides a method, including: receiving two documents, one of the two documents having at least one table that includes the same information as a corresponding table in the other of the two documents, wherein (i) one of the two documents comprises the at least one table in an unstructured table representation and (ii) the other of the two documents comprises the at least one table in a structured table representation; identifying text elements within the at least one table in the unstructured table representation; matching the identified text elements with table elements within the at least one table in the structured table representation; and annotating the at least one table in the structured table representation based upon the matches between the table elements and text elements.Type: GrantFiled: April 14, 2020Date of Patent: January 10, 2023Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Xin Ru Wang, Douglas Ronald Burdick, Ioannis Katsis
-
Publication number: 20220179896Abstract: Organizing and/or aligning fragments of text that are included in a set of physical and/or digital documents so that the arrangement of the text fragments is in a readily understandable and meaningful format for a given reader. This organization and/or alignment uses a relation model of the various text fragments to correlate a meaning between and amongst the various text fragments to ultimately determine the final alignment and/or arrangement of those text fragments.Type: ApplicationFiled: December 9, 2020Publication date: June 9, 2022Inventors: Robert G. Farrell, MUSTAFA CANIM, Douglas Ronald Burdick
-
Publication number: 20220156487Abstract: A computer system, product, and method are provided to utilize machine learning to facilitate document processing. A document collection is introduced to an artificial neural network (ANN), which subjects the document collection to table region identification within discretized contiguous areas. The documents are assigned to one or more clusters responsive to the leveraged ANN. Documents are selectively evaluated from the clusters, and one or more label corrections are applied to the ANN. The ANN generates an updated document collection incorporating the applied one or more label corrections.Type: ApplicationFiled: November 13, 2020Publication date: May 19, 2022Applicant: International Business Machines CorporationInventors: Xin Ru Wang, Douglas Ronald Burdick, Yunyao Li
-
Patent number: 11200413Abstract: Methods, systems, and computer program products for table recognition in PDF documents are provided herein. A computer-implemented method includes discretizing one or more contiguous areas of a PDF document; identifying one or more white-space separator lines within the one or more discretized contiguous areas of the PDF document; detecting one or more candidate table regions within the one or more discretized contiguous areas of the PDF document by clustering the one or more white-space separator lines into one or more grids; and outputting at least one of the candidate table regions as a finalized table in accordance with scores assigned to each of the one or more candidate table regions based on (i) border information and (ii) cell structure information.Type: GrantFiled: July 31, 2018Date of Patent: December 14, 2021Assignee: International Business Machines CorporationInventors: Douglas Ronald Burdick, Wei Cheng, Alexandre Evfimievski, Marina Danilevsky Hailpern, Rajasekar Krishnamurthy, Shajith Ikbal Mohamed, Prithviraj Sen, Shivakumar Vaithyanathan
-
Publication number: 20210319356Abstract: One embodiment provides a method, including: receiving two documents, one of the two documents having at least one table that includes the same information as a corresponding table in the other of the two documents, wherein (i) one of the two documents comprises the at least one table in an unstructured table representation and (ii) the other of the two documents comprises the at least one table in a structured table representation; identifying text elements within the at least one table in the unstructured table representation; matching the identified text elements with table elements within the at least one table in the structured table representation; and annotating the at least one table in the structured table representation based upon the matches between the table elements and text elements.Type: ApplicationFiled: April 14, 2020Publication date: October 14, 2021Inventors: Xin Ru Wang, Douglas Ronald Burdick, Ioannis Katsis
-
Publication number: 20200042785Abstract: Methods, systems, and computer program products for table recognition in PDF documents are provided herein. A computer-implemented method includes discretizing one or more contiguous areas of a PDF document; identifying one or more white-space separator lines within the one or more discretized contiguous areas of the PDF document; detecting one or more candidate table regions within the one or more discretized contiguous areas of the PDF document by clustering the one or more white-space separator lines into one or more grids; and outputting at least one of the candidate table regions as a finalized table in accordance with scores assigned to each of the one or more candidate table regions based on (i) border information and (ii) cell structure information.Type: ApplicationFiled: July 31, 2018Publication date: February 6, 2020Inventors: Douglas Ronald Burdick, Wei Cheng, Alexandre Evfimievski, Marina Danilevsky Hailpern, Rajasekar Krishnamurthy, Shajith Ikbal Mohamed, Prithviraj Sen, Shivakumar Vaithyanathan
-
Patent number: 8612368Abstract: Systems and methods for processing Machine Learning (ML) algorithms in a MapReduce environment are described. In one embodiment of a method, the method includes receiving a ML algorithm to be executed in the MapReduce environment. The method further includes parsing the ML algorithm into a plurality of statement blocks in a sequence, wherein each statement block comprises a plurality of basic operations (hops). The method also includes automatically determining an execution plan for each statement block, wherein at least one of the execution plans comprises one or more low-level operations (lops). The method further includes implementing the execution plans in the sequence of the plurality of the statement blocks.Type: GrantFiled: March 1, 2011Date of Patent: December 17, 2013Assignee: International Business Machines CorporationInventors: Douglas Ronald Burdick, Amol Ghoting, Rajasekar Krishnamurthy, Edwin Peter Dawson Pednault, Berthold Reinwald, Vikas Sindhwani, Shirish Tatikonda, Yuanyuan Tian, Shivakumar Vaithyanathan
-
Publication number: 20120226639Abstract: Systems and methods for processing Machine Learning (ML) algorithms in a MapReduce environment are described. In one embodiment of a method, the method includes receiving a ML algorithm to be executed in the MapReduce environment. The method further includes parsing the ML algorithm into a plurality of statement blocks in a sequence, wherein each statement block comprises a plurality of basic operations (hops). The method also includes automatically determining an execution plan for each statement block, wherein at least one of the execution plans comprises one or more low-level operations (lops). The method further includes implementing the execution plans in the sequence of the plurality of the statement blocks.Type: ApplicationFiled: March 1, 2011Publication date: September 6, 2012Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Douglas Ronald Burdick, Amol Ghoting, Rajasekar Krishnamurthy, Edwin Peter Dawson Pednault, Berthold Reinwald, Vikas Sindhwani, Shirish Tatikonda, Yuanyuan Tian, Shivakumar Vaithyanathan