Patents by Inventor Mircea Cimpoi

Mircea Cimpoi has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Page layout determination of an image undergoing optical character recognition

Patent number: 9785849

Abstract: A method and system is provided for identifying a page layout of an image that includes textual regions. The textual regions are to undergo optical character recognition (OCR). The system includes an input component that receives an input image that includes words around which bounding boxes have been formed and a text identifying component that groups the words into a plurality of text regions. A reading line component groups words within each of the text regions into reading lines. A text region sorting component that sorts the text regions in accordance with their reading order.

Type: Grant

Filed: November 13, 2013

Date of Patent: October 10, 2017

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Mircea Cimpoi, Sasa Galic, Milan Vugdelija
PAGE LAYOUT DETERMINATION OF AN IMAGE UNDERGOING OPTICAL CHARACTER RECOGNITION

Publication number: 20140072224

Abstract: A method and system is provided for identifying a page layout of an image that includes textual regions. The textual regions are to undergo optical character recognition (OCR). The system includes an input component that receives an input image that includes words around which bounding boxes have been formed and a text identifying component that groups the words into a plurality of text regions. A reading line component groups words within each of the text regions into reading lines. A text region sorting component that sorts the text regions in accordance with their reading order.

Type: Application

Filed: November 13, 2013

Publication date: March 13, 2014

Applicant: Microsoft Corporation

Inventors: Mircea Cimpoi, Sasa Galic, Milan Vugdelija
Page layout determination of an image undergoing optical character recognition

Patent number: 8594422

Abstract: A method and system is provided for identifying a page layout of an image that includes textual regions. The textual regions are to undergo optical character recognition (OCR). The system includes an input component that receives an input image that includes words around which bounding boxes have been formed and a text identifying component that groups the words into a plurality of text regions. A reading line component groups words within each of the text regions into reading lines. A text region sorting component that sorts the text regions in accordance with their reading order.

Type: Grant

Filed: March 11, 2010

Date of Patent: November 26, 2013

Assignee: Microsoft Corporation

Inventors: Mircea Cimpoi, Sasa Galic, Milan Vugdelija
Word recognition of text undergoing an OCR process

Patent number: 8401293

Abstract: A method for identifying words in a textual image undergoing optical character recognition includes receiving a bitmap of an input image which includes textual lines that have been segmented by a plurality of chop lines. The chop lines are each associated with a confidence level reflecting a degree to which the respective chop line properly segments the textual line into individual characters. One or more words are identified in one of the textual lines based at least in part on the textual lines and a first subset of the plurality of chop lines which have a chop line confidence level above a first threshold value. If the first word is not associated with a sufficiently high word confidence level, at least a second word in the textual line is identified based at least in part on a second subset of the plurality of chop lines which have a confidence level above a second threshold value lower than the first threshold value.

Type: Grant

Filed: May 3, 2010

Date of Patent: March 19, 2013

Assignee: Microsoft Corporation

Inventors: Aleksandar Antonijevic, Ivan Mitic, Mircea Cimpoi, Djordje Nijemcevic
WORD RECOGNITION OF TEXT UNDERGOING AN OCR PROCESS

Publication number: 20110268360

Abstract: A method for identifying words in a textual image undergoing optical character recognition includes receiving a bitmap of an input image which includes textual lines that have been segmented by a plurality of chop lines. The chop lines are each associated with a confidence level reflecting a degree to which the respective chop line properly segments the textual line into individual characters. One or more words are identified in one of the textual lines based at least in part on the textual lines and a first subset of the plurality of chop lines which have a chop line confidence level above a first threshold value. If the first word is not associated with a sufficiently high word confidence level, at least a second word in the textual line is identified based at least in part on a second subset of the plurality of chop lines which have a confidence level above a second threshold value lower than the first threshold value.

Type: Application

Filed: May 3, 2010

Publication date: November 3, 2011

Applicant: MICROSOFT CORPORATION

Inventors: Aleksandar Antonijevic, Ivan Mitic, Mircea Cimpoi, Djordje Nijemcevic
PAGE LAYOUT DETERMINATION OF AN IMAGE UNDERGOING OPTICAL CHARACTER RECOGNITION

Publication number: 20110222771

Abstract: A method and system is provided for identifying a page layout of an image that includes textual regions. The textual regions are to undergo optical character recognition (OCR). The system includes an input component that receives an input image that includes words around which bounding boxes have been formed and a text identifying component that groups the words into a plurality of text regions. A reading line component groups words within each of the text regions into reading lines. A text region sorting component that sorts the text regions in accordance with their reading order.

Type: Application

Filed: March 11, 2010

Publication date: September 15, 2011

Applicant: MICROSOFT CORPORATION

Inventors: Mircea Cimpoi, Sasa Galic, Milan Vugdelija

Page layout determination of an image undergoing optical character recognition

PAGE LAYOUT DETERMINATION OF AN IMAGE UNDERGOING OPTICAL CHARACTER RECOGNITION

Page layout determination of an image undergoing optical character recognition

Word recognition of text undergoing an OCR process

WORD RECOGNITION OF TEXT UNDERGOING AN OCR PROCESS

PAGE LAYOUT DETERMINATION OF AN IMAGE UNDERGOING OPTICAL CHARACTER RECOGNITION