Patents by Inventor Milan Vugdelija

Milan Vugdelija has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Page layout determination of an image undergoing optical character recognition

Patent number: 9785849

Abstract: A method and system is provided for identifying a page layout of an image that includes textual regions. The textual regions are to undergo optical character recognition (OCR). The system includes an input component that receives an input image that includes words around which bounding boxes have been formed and a text identifying component that groups the words into a plurality of text regions. A reading line component groups words within each of the text regions into reading lines. A text region sorting component that sorts the text regions in accordance with their reading order.

Type: Grant

Filed: November 13, 2013

Date of Patent: October 10, 2017

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Mircea Cimpoi, Sasa Galic, Milan Vugdelija
Pupil detection

Patent number: 9773170

Abstract: Embodiments that relate to determining an estimated pupil region of an eye are disclosed. In one embodiment a method includes receiving an image of an eye, with the image comprising a plurality of pixels. A rough pupil region is generated using at least a subset of the plurality of pixels. A plurality of pupil boundary point candidates are extracted from the rough pupil region, with each of the candidates weighted based on color values of at least two neighbor pixels. A parametric curve may be fitted to the weighted pupil boundary point candidates to determine the estimated pupil region of the eye of the user.

Type: Grant

Filed: May 18, 2016

Date of Patent: September 26, 2017

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Milan Vugdelija, Djordje Nijemcevic, David Zachris Nister, Christopher Maurice Mei, Chandra Sekhar Gatla
TEXT OVERLAY TECHNIQUES IN REALTIME TRANSLATION

Publication number: 20170255614

Abstract: The technology described herein employs techniques for overlaying (superimposing) translated text on top of (over) scanned text in realtime. The technology recognizes text in an image and identifies a first language in which the text is written. The technology translates the text into a second language and generates an overlay in the second language. The overlay is then inserted into the display over the text in the image.

Type: Application

Filed: May 18, 2017

Publication date: September 7, 2017

Inventors: MAGDALENA VUKOSAVLJEVIC, RUCHITA BHARGAVA, MATTHEW S. ASHMAN, IVAN STOJILJKOVIC, PAVLE JOSIPOVIC, DJORDJE NIJEMCEVIC, MILAN VUGDELIJA
Text overlay techniques in realtime translation

Patent number: 9690782

Abstract: The technology described herein employs techniques for overlaying (superimposing) translated text on top of (over) scanned text in realtime. The technology recognizes text in an image and identifies a first language in which the text is written. The technology translates the text into a second language and generates an overlay in the second language. The overlay is then inserted into the display over the text in the image.

Type: Grant

Filed: May 2, 2015

Date of Patent: June 27, 2017

Assignee: Microsoft Technology Licensing, LLC

Inventors: Magdalena Vukosavljevic, Ruchita Bhargava, Matthew S. Ashman, Ivan Stojiljkovic, Pavle Josipović, Djordje Nijemcevic, Milan Vugdelija
PUPIL DETECTION

Publication number: 20160259961

Abstract: Embodiments that relate to determining an estimated pupil region of an eye are disclosed. In one embodiment a method includes receiving an image of an eye, with the image comprising a plurality of pixels. A rough pupil region is generated using at least a subset of the plurality of pixels. A plurality of pupil boundary point candidates are extracted from the rough pupil region, with each of the candidates weighted based on color values of at least two neighbor pixels. A parametric curve may be fitted to the weighted pupil boundary point candidates to determine the estimated pupil region of the eye of the user.

Type: Application

Filed: May 18, 2016

Publication date: September 8, 2016

Applicant: Microsoft Technology Licensing, LLC

Inventors: Milan Vugdelija, Djordje Nijemcevic, David Zachris Nister, Christopher Maurice Mei, Chandra Sekhar Gatla
Pupil detection

Patent number: 9355315

Abstract: Embodiments that relate to determining an estimated pupil region of an eye are disclosed. In one embodiment a method includes receiving an image of an eye, with the image comprising a plurality of pixels. A rough pupil region is generated using at least a subset of the plurality of pixels. A plurality of pupil boundary point candidates are extracted from the rough pupil region, with each of the candidates weighted based on color values of at least two neighbor pixels. A parametric curve may be fitted to the weighted pupil boundary point candidates to determine the estimated pupil region of the eye of the user.

Type: Grant

Filed: July 24, 2014

Date of Patent: May 31, 2016

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Milan Vugdelija, Djordje Nijemcevic, David Zachris Nister, Christopher Maurice Mei, Chandra Sekhar Gatla
PUPIL DETECTION

Publication number: 20160026847

Abstract: Embodiments that relate to determining an estimated pupil region of an eye are disclosed. In one embodiment a method includes receiving an image of an eye, with the image comprising a plurality of pixels. A rough pupil region is generated using at least a subset of the plurality of pixels. A plurality of pupil boundary point candidates are extracted from the rough pupil region, with each of the candidates weighted based on color values of at least two neighbor pixels. A parametric curve may be fitted to the weighted pupil boundary point candidates to determine the estimated pupil region of the eye of the user.

Type: Application

Filed: July 24, 2014

Publication date: January 28, 2016

Inventors: Milan Vugdelija, Djordje Nijemcevic, David Zachris Nister, Christopher Maurice Mei, Chandra Sekhar Gatla
TEXT OVERLAY TECHNIQUES IN REALTIME TRANSLATION

Publication number: 20150234812

Abstract: Architecture that employs techniques for overlaying (superimposing) translated text on top of (over) scanned text in realtime translation to provide clear visual correlation between original text and translated text. Algorithms are provided that overlay text in cases of translated scanned text of language written in first direction to a language written in same direction, translate scanned text from a first language written in a first direction to a second language written in the opposite direction, and translated scanned text from a language written in a first direction to language written in a different direction.

Type: Application

Filed: May 2, 2015

Publication date: August 20, 2015

Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Magdalena Vukosavljevic, Ruchita Bhargava, Matthew S. Ashman, Ivan Stojiljkovic, Pavle Josipovic, Djordje Nijemcevic, Milan Vugdelija
Simultaneous tracking and text recognition in video frames

Patent number: 9064174

Abstract: Architecture that enables optical character recognition (OCR) of text in video frames at the rate at which the frames are received. Additionally, conflation is performed on multiple text recognition results in the frame sequence. The architecture comprises an OCR text recognition engine and a tracker system; the tracker system establishes a common coordinate system in which OCR results from different frames may be compared and/or combined. From a set of sequential video frames, a keyframe is chosen from which the reference coordinate system is established. An estimated transformation from keyframe coordinates to subsequent video frames is computed using the tracker system. When text recognition is completed for any subsequent frame, the result coordinates can be related to the keyframe using the inverse transformation from the processed frame to the reference keyframe. The results can be rendered for viewing as the results are obtained.

Type: Grant

Filed: October 18, 2012

Date of Patent: June 23, 2015

Assignee: Microsoft Technology Licensing, LLC

Inventors: David Nister, Frederik Schaffalitzky, Michael Grabner, Matthew S. Ashman, Milan Vugdelija, Ivan Stojiljkovic
Text overlay techniques in realtime translation

Patent number: 9037450

Abstract: Architecture that employs techniques for overlaying (superimposing) translated text on top of (over) scanned text in realtime translation to provide clear visual correlation between original text and translated text. Algorithms are provided that overlay text in cases of translated scanned text of language written in first direction to a language written in same direction, translate scanned text from a first language written in a first direction to a second language written in the opposite direction, and translated scanned text from a language written in a first direction to language written in a different direction.

Type: Grant

Filed: December 14, 2012

Date of Patent: May 19, 2015

Assignee: Microsoft Technology Licensing, LLC

Inventors: Magdalena Vukosavljevic, Ruchita Bhargava, Matthew S. Ashman, Ivan Stojiljkovic, Pavle Josipović, Djordje Nijemcevic, Milan Vugdelija
Automatic classification and color enhancement of a markable surface

Patent number: 8805068

Abstract: Architecture that automatically computes if a quadrangular object captured in a given image is or is not a markable board (e.g., a whiteboard, green board, chalkboard, etc.). The markable board has a surface on which marks can be applied using chalk, ink, dry ink, or any other suitable marking instrument or tool for the given surface. The imaged quadrangular object can be defined as having a background image and a foreground image. The background image is representative of a markable board with no applied surface marks and the foreground image comprises all discernible marks applied to the board surface, but does not include the background image. A set of performance-friendly features is received and processed by a machine-learning classifier to compute if the given quadrangular object is a markable board. Thereafter, if the given image is determined to be a markable board, image enhancement is performed.

Type: Grant

Filed: December 13, 2012

Date of Patent: August 12, 2014

Assignee: Microsoft Corporation

Inventors: Milan Vugdelija, Magdalena Vukosavljevic, Djordje Nijemcevic
AUTOMATIC CLASSIFICATION AND COLOR ENHANCEMENT OF A MARKABLE SURFACE

Publication number: 20140169668

Abstract: Architecture that automatically computes if a quadrangular object captured in a given image is or is not a markable board (e.g., a whiteboard, green board, chalkboard, etc.). The markable board has a surface on which marks can be applied using chalk, ink, dry ink, or any other suitable marking instrument or tool for the given surface. The imaged quadrangular object can be defined as having a background image and a foreground image. The background image is representative of a markable board with no applied surface marks and the foreground image comprises all discernible marks applied to the board surface, but does not include the background image. A set of performance-friendly features is received and processed by a machine-learning classifier to compute if the given quadrangular object is a markable board. Thereafter, if the given image is determined to be a markable board, image enhancement is performed.

Type: Application

Filed: December 13, 2012

Publication date: June 19, 2014

Applicant: MICROSOFT CORPORATION

Inventors: Milan Vugdelija, Magdalena Vukosavljevic, Djordje Nijemcevic
TEXT OVERLAY TECHNIQUES IN REALTIME TRANSLATION

Publication number: 20140172408

Abstract: Architecture that employs techniques for overlaying (superimposing) translated text on top of (over) scanned text in realtime translation to provide clear visual correlation between original text and translated text. Algorithms are provided that overlay text in cases of translated scanned text of language written in first direction to a language written in same direction, translate scanned text from a first language written in a first direction to a second language written in the opposite direction, and translated scanned text from a language written in a first direction to language written in a different direction.

Type: Application

Filed: December 14, 2012

Publication date: June 19, 2014

Applicant: MICROSOFT CORPORATION

Inventors: Magdalena Vukosavljevic, Ruchita Bhargava, Matthew S. Ashman, Ivan Stojiljkovic, Pavle Josipovic, Djordje Nijemcevic, Milan Vugdelija
SIMULTANEOUS TRACKING AND TEXT RECOGNITION IN VIDEO FRAMES

Publication number: 20140112527

Abstract: Architecture that enables optical character recognition (OCR) of text in video frames at the rate at which the frames are received. Additionally, conflation is performed on multiple text recognition results in the frame sequence. The architecture comprises an OCR text recognition engine and a tracker system; the tracker system establishes a common coordinate system in which OCR results from different frames may be compared and/or combined. From a set of sequential video frames, a keyframe is chosen from which the reference coordinate system is established. An estimated transformation from keyframe coordinates to subsequent video frames is computed using the tracker system. When text recognition is completed for any subsequent frame, the result coordinates can be related to the keyframe using the inverse transformation from the processed frame to the reference keyframe. The results can be rendered for viewing as the results are obtained.

Type: Application

Filed: October 18, 2012

Publication date: April 24, 2014

Applicant: Microsoft Corporation

Inventors: David Nister, Frederik Schaffalitzky, Michael Grabner, Matthew S. Ashman, Milan Vugdelija, Ivan Stojiljkovic
Camera-based scanning

Patent number: 8704896

Abstract: Embodiments of camera-based scanning are described. In various embodiments, one or more quadrangular objects are automatically selected from a captured image for scanning. The automatic selection is determined to be successful based on the selected quadrangular objects having an associated score that exceeds a predefined threshold. One or more scanned documents are created from portions of the captured image corresponding to the selected quadrangular objects, and the created scanned documents include corrections for perspective distortions of the selected quadrangular objects in the captured image.

Type: Grant

Filed: November 14, 2012

Date of Patent: April 22, 2014

Assignee: Microsoft Corporation

Inventors: Djordje Nijemcevic, Magdalena Vukosavljevic, Milan Vugdelija, Ana Mitrovic, Georg F. Petschnigg, Bodin Dresevic
PAGE LAYOUT DETERMINATION OF AN IMAGE UNDERGOING OPTICAL CHARACTER RECOGNITION

Publication number: 20140072224

Abstract: A method and system is provided for identifying a page layout of an image that includes textual regions. The textual regions are to undergo optical character recognition (OCR). The system includes an input component that receives an input image that includes words around which bounding boxes have been formed and a text identifying component that groups the words into a plurality of text regions. A reading line component groups words within each of the text regions into reading lines. A text region sorting component that sorts the text regions in accordance with their reading order.

Type: Application

Filed: November 13, 2013

Publication date: March 13, 2014

Applicant: Microsoft Corporation

Inventors: Mircea Cimpoi, Sasa Galic, Milan Vugdelija
Page layout determination of an image undergoing optical character recognition

Patent number: 8594422

Abstract: A method and system is provided for identifying a page layout of an image that includes textual regions. The textual regions are to undergo optical character recognition (OCR). The system includes an input component that receives an input image that includes words around which bounding boxes have been formed and a text identifying component that groups the words into a plurality of text regions. A reading line component groups words within each of the text regions into reading lines. A text region sorting component that sorts the text regions in accordance with their reading order.

Type: Grant

Filed: March 11, 2010

Date of Patent: November 26, 2013

Assignee: Microsoft Corporation

Inventors: Mircea Cimpoi, Sasa Galic, Milan Vugdelija
Camera-based scanning

Patent number: 8345106

Abstract: Embodiments of camera-based scanning are described. In various embodiments, scanned documents can be created using images captured by a camera associated with a device. An image captured by the camera is processed to identify portions within the image that correspond to rectangular objects such as paper, business cards, whiteboards, screens, and so forth. One or more of these portions can be selected for scanning automatically based on a scoring scheme and/or semi-automatically with the aid of input from a user. One or more scanned documents are created from the selected portions by un-warping the selected portions to remove effects of perspective (e.g., rectify the portions to rectangles) and applying various image enhancements to improve appearance.

Type: Grant

Filed: October 13, 2009

Date of Patent: January 1, 2013

Assignee: Microsoft Corporation

Inventors: Djordje Nijemcevic, Magdalena Vukosavljevic, Milan Vugdelija, Ana Mitrovic, Georg F. Petschnigg, Bodin Dresevic
Resolution adjustment of an image that includes text undergoing an OCR process

Patent number: 8311331

Abstract: An optical character recognition process characterizes text lines in a textual image by their base-line, mean-line and x-height. The base-line for at least one text line in the image is determined by finding a parametric curve that maximizes a first fitness function that depends on the values of pixels through which the parametric curve passes and pixels below the parametric curve. The base-line corresponds to the parametric curve for which the first fitness function is maximized. The first fitness function is designed so that it increases with increasing lightless or brightness of pixels immediately below the parametric curve while also increasing with decreasing lightness of pixels through which the parametric curve passes. The mean-line is determined by incrementally shifting the base-line upward by predetermined amounts (e.g., a single pixel) until a second fitness function for the shifted base-line is maximized. The second fitness function is essentially the inverse of the first fitness function.

Type: Grant

Filed: March 9, 2010

Date of Patent: November 13, 2012

Assignee: Microsoft Corporation

Inventors: Djordje Nijemcevic, Milan Vugdelija, Bodin Dresevic
USER CORRECTION OF ERRORS ARISING IN A TEXTUAL DOCUMENT UNDERGOING OPTICAL CHARACTER RECOGNITION (OCR) PROCESS

Publication number: 20110280481

Abstract: An electronic model of the image document is created by undergoing an OCR process. The electronic model includes elements (e.g., words, text lines, paragraphs, images) of the image document that have been determined by each of a plurality of sequentially executed stages in the OCR process. The electronic model serves as input information which is supplied to each of the stages by a previous stage that processed the image document. A graphical user interface is presented to the user so that the user can provide user input data correcting a mischaracterized item appearing in the document. Based on the user input data, the processing stage which produced the initial error that gave rise to the mischaracterized item corrects the initial error. Stages of the OCR process subsequent to this stage then correct any consequential errors arising in their respective stages as a result of the initial error.

Type: Application

Filed: May 17, 2010

Publication date: November 17, 2011

Applicant: MICROSOFT CORPORATION

Inventors: Bogdan Radakovic, Milan Vugdelija, Nikola Todic, Aleksandar Uzelac, Bodin Dresevic

1 2 next