Patents by Inventor Marco Spinaci

Marco Spinaci has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Character encoding and decoding for optical character recognition

Patent number: 11816182

Abstract: The present disclosure provides techniques for encoding and decoding characters for optical character recognition. The techniques involve determining sets of numbers for encoding a character set where each number in a particular set of numbers for encoding a particular character is mapped to a graphical unit (e.g., radical) of the particular character. A mapping between each set of numbers in the possible encodings and the character set may be determined based the closest character already encoded. A machine learning model may be trained to perform optical character recognition using training data labeled using the set of encodings and the mappings.

Type: Grant

Filed: June 7, 2021

Date of Patent: November 14, 2023

Assignee: SAP SE

Inventors: Marco Spinaci, Marek Polewczyk
MACHINE LEARNING ENABLED DOCUMENT DESKEWING

Publication number: 20230222632

Abstract: A method may include determining, based at least on an image of a document, a plurality of text bounding boxes enclosing lines of text present in the document. A machine learning model may be trained to determine, based at least on the coordinates defining the text bounding boxes, the coordinates of a document bounding box enclosing the text bounding boxes. The document bounding box may encapsulate the visual aberrations that are present in the image of the document. As such, one or more transformations may be determined based on the coordinates of the document bounding box. The image of the document may be deskewed by applying the transformations. One or more downstream tasks may be performed based on the deskewed image of the document. Related methods and articles of manufacture are also disclosed.

Type: Application

Filed: January 7, 2022

Publication date: July 13, 2023

Inventors: Marek Polewczyk, Marco Spinaci
CHARACTER ENCODING AND DECODING FOR OPTICAL CHARACTER RECOGNITION

Publication number: 20220391637

Abstract: The present disclosure provides techniques for encoding and decoding characters for optical character recognition. The techniques involve determining sets of numbers for encoding a character set where each number in a particular set of numbers for encoding a particular character is mapped to a graphical unit (e.g., radical) of the particular character. A mapping between each set of numbers in the possible encodings and the character set may be determined based the closest character already encoded. A machine learning model may be trained to perform optical character recognition using training data labeled using the set of encodings and the mappings.

Type: Application

Filed: June 7, 2021

Publication date: December 8, 2022

Inventors: Marco Spinaci, Marek Polewczyk
Rotation and scaling for optical character recognition using end-to-end deep learning

Patent number: 11302108

Abstract: Disclosed herein are system, method, and computer program product embodiments for optical character recognition (OCR) pre-processing using machine learning. In an embodiment, a neural network may be trained to identify a standardized document rotation and scale expected by an OCR service performing character recognition. The neural network may then analyze a received document image to identify a corresponding rotation and scale of the document image relative to the expected standardized values. In response to this identification, the document image may be modified in the inverse to standardize the rotation and scale of the document image to match the format expected by the OCR service. In some embodiments, a neural network may perform the standardization as well as the character recognition using a shared computation graph.

Type: Grant

Filed: September 10, 2019

Date of Patent: April 12, 2022

Assignee: SAP SE

Inventors: Johannes Hoehne, Marco Spinaci, Anoop Raveendra Katti
Visually-aware encodings for characters

Patent number: 11275969

Abstract: In some embodiments, a method inputs a set of images into a network and trains the network based on a classification of the set of images to one or more characters in a set of characters. The method obtains a set of encodings for the one or more characters based on a layer of the network that restricts the output of the layer to a number of values. Then, the method stores the set of encodings for the one or more characters, wherein an encoding in the set of encodings is retrievable when a corresponding character is determined.

Type: Grant

Filed: December 5, 2019

Date of Patent: March 15, 2022

Assignee: SAP SE

Inventors: Johannes Hoehne, Marco Spinaci
VISUALLY-AWARE ENCODINGS FOR CHARACTERS

Publication number: 20210174141

Abstract: In some embodiments, a method inputs a set of images into a network and trains the network based on a classification of the set of images to one or more characters in a set of characters. The method obtains a set of encodings for the one or more characters based on a layer of the network that restricts the output of the layer to a number of values. Then, the method stores the set of encodings for the one or more characters, wherein an encoding in the set of encodings is retrievable when a corresponding character is determined.

Type: Application

Filed: December 5, 2019

Publication date: June 10, 2021

Inventors: Johannes Hoehne, Marco Spinaci
ROTATION AND SCALING FOR OPTICAL CHARACTER RECOGNITION USING END-TO-END DEEP LEARNING

Publication number: 20210073566

Abstract: Disclosed herein are system, method, and computer program product embodiments for optical character recognition (OCR) pre-processing using machine learning. In an embodiment, a neural network may be trained to identify a standardized document rotation and scale expected by an OCR service performing character recognition. The neural network may then analyze a received document image to identify a corresponding rotation and scale of the document image relative to the expected standardized values. In response to this identification, the document image may be modified in the inverse to standardize the rotation and scale of the document image to match the format expected by the OCR service. In some embodiments, a neural network may perform the standardization as well as the character recognition using a shared computation graph.

Type: Application

Filed: September 10, 2019

Publication date: March 11, 2021

Inventors: Johannes Hoehne, Marco Spinaci, Anoop Raveendra Katti
Object detection and candidate filtering system

Patent number: 10915786

Abstract: Disclosed herein are system, method, and computer program product embodiments for providing object detection and filtering operations. An embodiment operates by receiving an image comprising a plurality of pixels and pixel information for each pixel. The pixel information indicates a bounding box corresponding to an object within the image associated with a respective pixel and a confidence score associated with the bounding box for the respective pixel. Pixels that do not correspond to a center of at least one of the bounding boxes are iteratively removed from the plurality of pixels until a subset of pixels each of which correspond to a center of at least one of the bounding boxes remains. Based on the subset, a final bounding box associated with each object of the image is determined based on an overlapping of the bounding boxes of the subset of pixels and the corresponding confidence scores.

Type: Grant

Filed: February 28, 2019

Date of Patent: February 9, 2021

Assignee: SAP SE

Inventors: Johannes Hoehne, Anoop Raveendra Katti, Christian Reisswig, Marco Spinaci
Recognizing typewritten and handwritten characters using end-to-end deep learning

Patent number: 10846553

Abstract: Disclosed herein are system, method, and computer program product embodiments for optical character recognition using end-to-end deep learning. In an embodiment, an optical character recognition system may train a neural network to identify characters of pixel images, assign index values to the characters, and recognize different formatting of the characters, such as distinguishing between handwritten and typewritten characters. The neural network may also be trained to identify, groups of characters and to generate bounding boxes to group these characters. The optical character recognition system may then analyze documents to identify character information based on the pixel data and produce segmentation masks, such as a type grid segmentation mask, and one or more bounding box masks. The optical character recognition system may supply these masks as an output or may combine the masks to generate a version of the received document having optically recognized characters.

Type: Grant

Filed: March 20, 2019

Date of Patent: November 24, 2020

Assignee: SAP SE

Inventors: Johannes Hoehne, Christian Reisswig, Anoop Raveendra Katti, Marco Spinaci
RECOGNIZING TYPEWRITTEN AND HANDWRITTEN CHARACTERS USING END-TO-END DEEP LEARNING

Publication number: 20200302208

Abstract: Disclosed herein are system, method, and computer program product embodiments for optical character recognition using end-to-end deep learning. In an embodiment, an optical character recognition system may train a neural network to identify characters of pixel images, assign index values to the characters, and recognize different formatting of the characters, such as distinguishing between handwritten and typewritten characters. The neural network may also be trained to identify, groups of characters and to generate bounding boxes to group these characters. The optical character recognition system may then analyze documents to identify character information based on the pixel data and produce segmentation masks, such as a type grid segmentation mask, and one or more bounding box masks. The optical character recognition system may supply these masks as an output or may combine the masks to generate a version of the received document having optically recognized characters.

Type: Application

Filed: March 20, 2019

Publication date: September 24, 2020

Inventors: Johannes HOEHNE, Christian REISSWIG, Anoop Raveendra KATTI, Marco SPINACI
Object Detection and Candidate Filtering System

Publication number: 20200279128

Abstract: Disclosed herein are system, method, and computer program product embodiments for providing object detection and filtering operations. An embodiment operates by receiving an image comprising a plurality of pixels and pixel information for each pixel. The pixel information indicates a bounding box corresponding to an object within the image associated with a respective pixel and a confidence score associated with the bounding box for the respective pixel. Pixels that do not correspond to a center of at least one of the bounding boxes are iteratively removed from the plurality of pixels until a subset of pixels each of which correspond to a center of at least one of the bounding boxes remains. Based on the subset, a final bounding box associated with each object of the image is determined based on an overlapping of the bounding boxes of the subset of pixels and the corresponding confidence scores.

Type: Application

Filed: February 28, 2019

Publication date: September 3, 2020

Inventors: Johannes Hoehne, Anoop Raveendra Katti, Christian Reisswig, Marco Spinaci