Patents by Inventor Milan Vugdelija
Milan Vugdelija has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 9785849Abstract: A method and system is provided for identifying a page layout of an image that includes textual regions. The textual regions are to undergo optical character recognition (OCR). The system includes an input component that receives an input image that includes words around which bounding boxes have been formed and a text identifying component that groups the words into a plurality of text regions. A reading line component groups words within each of the text regions into reading lines. A text region sorting component that sorts the text regions in accordance with their reading order.Type: GrantFiled: November 13, 2013Date of Patent: October 10, 2017Assignee: MICROSOFT TECHNOLOGY LICENSING, LLCInventors: Mircea Cimpoi, Sasa Galic, Milan Vugdelija
-
Patent number: 9773170Abstract: Embodiments that relate to determining an estimated pupil region of an eye are disclosed. In one embodiment a method includes receiving an image of an eye, with the image comprising a plurality of pixels. A rough pupil region is generated using at least a subset of the plurality of pixels. A plurality of pupil boundary point candidates are extracted from the rough pupil region, with each of the candidates weighted based on color values of at least two neighbor pixels. A parametric curve may be fitted to the weighted pupil boundary point candidates to determine the estimated pupil region of the eye of the user.Type: GrantFiled: May 18, 2016Date of Patent: September 26, 2017Assignee: MICROSOFT TECHNOLOGY LICENSING, LLCInventors: Milan Vugdelija, Djordje Nijemcevic, David Zachris Nister, Christopher Maurice Mei, Chandra Sekhar Gatla
-
Publication number: 20170255614Abstract: The technology described herein employs techniques for overlaying (superimposing) translated text on top of (over) scanned text in realtime. The technology recognizes text in an image and identifies a first language in which the text is written. The technology translates the text into a second language and generates an overlay in the second language. The overlay is then inserted into the display over the text in the image.Type: ApplicationFiled: May 18, 2017Publication date: September 7, 2017Inventors: MAGDALENA VUKOSAVLJEVIC, RUCHITA BHARGAVA, MATTHEW S. ASHMAN, IVAN STOJILJKOVIC, PAVLE JOSIPOVIC, DJORDJE NIJEMCEVIC, MILAN VUGDELIJA
-
Patent number: 9690782Abstract: The technology described herein employs techniques for overlaying (superimposing) translated text on top of (over) scanned text in realtime. The technology recognizes text in an image and identifies a first language in which the text is written. The technology translates the text into a second language and generates an overlay in the second language. The overlay is then inserted into the display over the text in the image.Type: GrantFiled: May 2, 2015Date of Patent: June 27, 2017Assignee: Microsoft Technology Licensing, LLCInventors: Magdalena Vukosavljevic, Ruchita Bhargava, Matthew S. Ashman, Ivan Stojiljkovic, Pavle Josipović, Djordje Nijemcevic, Milan Vugdelija
-
Publication number: 20160259961Abstract: Embodiments that relate to determining an estimated pupil region of an eye are disclosed. In one embodiment a method includes receiving an image of an eye, with the image comprising a plurality of pixels. A rough pupil region is generated using at least a subset of the plurality of pixels. A plurality of pupil boundary point candidates are extracted from the rough pupil region, with each of the candidates weighted based on color values of at least two neighbor pixels. A parametric curve may be fitted to the weighted pupil boundary point candidates to determine the estimated pupil region of the eye of the user.Type: ApplicationFiled: May 18, 2016Publication date: September 8, 2016Applicant: Microsoft Technology Licensing, LLCInventors: Milan Vugdelija, Djordje Nijemcevic, David Zachris Nister, Christopher Maurice Mei, Chandra Sekhar Gatla
-
Patent number: 9355315Abstract: Embodiments that relate to determining an estimated pupil region of an eye are disclosed. In one embodiment a method includes receiving an image of an eye, with the image comprising a plurality of pixels. A rough pupil region is generated using at least a subset of the plurality of pixels. A plurality of pupil boundary point candidates are extracted from the rough pupil region, with each of the candidates weighted based on color values of at least two neighbor pixels. A parametric curve may be fitted to the weighted pupil boundary point candidates to determine the estimated pupil region of the eye of the user.Type: GrantFiled: July 24, 2014Date of Patent: May 31, 2016Assignee: MICROSOFT TECHNOLOGY LICENSING, LLCInventors: Milan Vugdelija, Djordje Nijemcevic, David Zachris Nister, Christopher Maurice Mei, Chandra Sekhar Gatla
-
Publication number: 20160026847Abstract: Embodiments that relate to determining an estimated pupil region of an eye are disclosed. In one embodiment a method includes receiving an image of an eye, with the image comprising a plurality of pixels. A rough pupil region is generated using at least a subset of the plurality of pixels. A plurality of pupil boundary point candidates are extracted from the rough pupil region, with each of the candidates weighted based on color values of at least two neighbor pixels. A parametric curve may be fitted to the weighted pupil boundary point candidates to determine the estimated pupil region of the eye of the user.Type: ApplicationFiled: July 24, 2014Publication date: January 28, 2016Inventors: Milan Vugdelija, Djordje Nijemcevic, David Zachris Nister, Christopher Maurice Mei, Chandra Sekhar Gatla
-
Publication number: 20150234812Abstract: Architecture that employs techniques for overlaying (superimposing) translated text on top of (over) scanned text in realtime translation to provide clear visual correlation between original text and translated text. Algorithms are provided that overlay text in cases of translated scanned text of language written in first direction to a language written in same direction, translate scanned text from a first language written in a first direction to a second language written in the opposite direction, and translated scanned text from a language written in a first direction to language written in a different direction.Type: ApplicationFiled: May 2, 2015Publication date: August 20, 2015Applicant: MICROSOFT TECHNOLOGY LICENSING, LLCInventors: Magdalena Vukosavljevic, Ruchita Bhargava, Matthew S. Ashman, Ivan Stojiljkovic, Pavle Josipovic, Djordje Nijemcevic, Milan Vugdelija
-
Patent number: 9064174Abstract: Architecture that enables optical character recognition (OCR) of text in video frames at the rate at which the frames are received. Additionally, conflation is performed on multiple text recognition results in the frame sequence. The architecture comprises an OCR text recognition engine and a tracker system; the tracker system establishes a common coordinate system in which OCR results from different frames may be compared and/or combined. From a set of sequential video frames, a keyframe is chosen from which the reference coordinate system is established. An estimated transformation from keyframe coordinates to subsequent video frames is computed using the tracker system. When text recognition is completed for any subsequent frame, the result coordinates can be related to the keyframe using the inverse transformation from the processed frame to the reference keyframe. The results can be rendered for viewing as the results are obtained.Type: GrantFiled: October 18, 2012Date of Patent: June 23, 2015Assignee: Microsoft Technology Licensing, LLCInventors: David Nister, Frederik Schaffalitzky, Michael Grabner, Matthew S. Ashman, Milan Vugdelija, Ivan Stojiljkovic
-
Patent number: 9037450Abstract: Architecture that employs techniques for overlaying (superimposing) translated text on top of (over) scanned text in realtime translation to provide clear visual correlation between original text and translated text. Algorithms are provided that overlay text in cases of translated scanned text of language written in first direction to a language written in same direction, translate scanned text from a first language written in a first direction to a second language written in the opposite direction, and translated scanned text from a language written in a first direction to language written in a different direction.Type: GrantFiled: December 14, 2012Date of Patent: May 19, 2015Assignee: Microsoft Technology Licensing, LLCInventors: Magdalena Vukosavljevic, Ruchita Bhargava, Matthew S. Ashman, Ivan Stojiljkovic, Pavle Josipović, Djordje Nijemcevic, Milan Vugdelija
-
Patent number: 8805068Abstract: Architecture that automatically computes if a quadrangular object captured in a given image is or is not a markable board (e.g., a whiteboard, green board, chalkboard, etc.). The markable board has a surface on which marks can be applied using chalk, ink, dry ink, or any other suitable marking instrument or tool for the given surface. The imaged quadrangular object can be defined as having a background image and a foreground image. The background image is representative of a markable board with no applied surface marks and the foreground image comprises all discernible marks applied to the board surface, but does not include the background image. A set of performance-friendly features is received and processed by a machine-learning classifier to compute if the given quadrangular object is a markable board. Thereafter, if the given image is determined to be a markable board, image enhancement is performed.Type: GrantFiled: December 13, 2012Date of Patent: August 12, 2014Assignee: Microsoft CorporationInventors: Milan Vugdelija, Magdalena Vukosavljevic, Djordje Nijemcevic
-
Publication number: 20140169668Abstract: Architecture that automatically computes if a quadrangular object captured in a given image is or is not a markable board (e.g., a whiteboard, green board, chalkboard, etc.). The markable board has a surface on which marks can be applied using chalk, ink, dry ink, or any other suitable marking instrument or tool for the given surface. The imaged quadrangular object can be defined as having a background image and a foreground image. The background image is representative of a markable board with no applied surface marks and the foreground image comprises all discernible marks applied to the board surface, but does not include the background image. A set of performance-friendly features is received and processed by a machine-learning classifier to compute if the given quadrangular object is a markable board. Thereafter, if the given image is determined to be a markable board, image enhancement is performed.Type: ApplicationFiled: December 13, 2012Publication date: June 19, 2014Applicant: MICROSOFT CORPORATIONInventors: Milan Vugdelija, Magdalena Vukosavljevic, Djordje Nijemcevic
-
Publication number: 20140172408Abstract: Architecture that employs techniques for overlaying (superimposing) translated text on top of (over) scanned text in realtime translation to provide clear visual correlation between original text and translated text. Algorithms are provided that overlay text in cases of translated scanned text of language written in first direction to a language written in same direction, translate scanned text from a first language written in a first direction to a second language written in the opposite direction, and translated scanned text from a language written in a first direction to language written in a different direction.Type: ApplicationFiled: December 14, 2012Publication date: June 19, 2014Applicant: MICROSOFT CORPORATIONInventors: Magdalena Vukosavljevic, Ruchita Bhargava, Matthew S. Ashman, Ivan Stojiljkovic, Pavle Josipovic, Djordje Nijemcevic, Milan Vugdelija
-
Publication number: 20140112527Abstract: Architecture that enables optical character recognition (OCR) of text in video frames at the rate at which the frames are received. Additionally, conflation is performed on multiple text recognition results in the frame sequence. The architecture comprises an OCR text recognition engine and a tracker system; the tracker system establishes a common coordinate system in which OCR results from different frames may be compared and/or combined. From a set of sequential video frames, a keyframe is chosen from which the reference coordinate system is established. An estimated transformation from keyframe coordinates to subsequent video frames is computed using the tracker system. When text recognition is completed for any subsequent frame, the result coordinates can be related to the keyframe using the inverse transformation from the processed frame to the reference keyframe. The results can be rendered for viewing as the results are obtained.Type: ApplicationFiled: October 18, 2012Publication date: April 24, 2014Applicant: Microsoft CorporationInventors: David Nister, Frederik Schaffalitzky, Michael Grabner, Matthew S. Ashman, Milan Vugdelija, Ivan Stojiljkovic
-
Patent number: 8704896Abstract: Embodiments of camera-based scanning are described. In various embodiments, one or more quadrangular objects are automatically selected from a captured image for scanning. The automatic selection is determined to be successful based on the selected quadrangular objects having an associated score that exceeds a predefined threshold. One or more scanned documents are created from portions of the captured image corresponding to the selected quadrangular objects, and the created scanned documents include corrections for perspective distortions of the selected quadrangular objects in the captured image.Type: GrantFiled: November 14, 2012Date of Patent: April 22, 2014Assignee: Microsoft CorporationInventors: Djordje Nijemcevic, Magdalena Vukosavljevic, Milan Vugdelija, Ana Mitrovic, Georg F. Petschnigg, Bodin Dresevic
-
Publication number: 20140072224Abstract: A method and system is provided for identifying a page layout of an image that includes textual regions. The textual regions are to undergo optical character recognition (OCR). The system includes an input component that receives an input image that includes words around which bounding boxes have been formed and a text identifying component that groups the words into a plurality of text regions. A reading line component groups words within each of the text regions into reading lines. A text region sorting component that sorts the text regions in accordance with their reading order.Type: ApplicationFiled: November 13, 2013Publication date: March 13, 2014Applicant: Microsoft CorporationInventors: Mircea Cimpoi, Sasa Galic, Milan Vugdelija
-
Patent number: 8594422Abstract: A method and system is provided for identifying a page layout of an image that includes textual regions. The textual regions are to undergo optical character recognition (OCR). The system includes an input component that receives an input image that includes words around which bounding boxes have been formed and a text identifying component that groups the words into a plurality of text regions. A reading line component groups words within each of the text regions into reading lines. A text region sorting component that sorts the text regions in accordance with their reading order.Type: GrantFiled: March 11, 2010Date of Patent: November 26, 2013Assignee: Microsoft CorporationInventors: Mircea Cimpoi, Sasa Galic, Milan Vugdelija
-
Patent number: 8345106Abstract: Embodiments of camera-based scanning are described. In various embodiments, scanned documents can be created using images captured by a camera associated with a device. An image captured by the camera is processed to identify portions within the image that correspond to rectangular objects such as paper, business cards, whiteboards, screens, and so forth. One or more of these portions can be selected for scanning automatically based on a scoring scheme and/or semi-automatically with the aid of input from a user. One or more scanned documents are created from the selected portions by un-warping the selected portions to remove effects of perspective (e.g., rectify the portions to rectangles) and applying various image enhancements to improve appearance.Type: GrantFiled: October 13, 2009Date of Patent: January 1, 2013Assignee: Microsoft CorporationInventors: Djordje Nijemcevic, Magdalena Vukosavljevic, Milan Vugdelija, Ana Mitrovic, Georg F. Petschnigg, Bodin Dresevic
-
Patent number: 8311331Abstract: An optical character recognition process characterizes text lines in a textual image by their base-line, mean-line and x-height. The base-line for at least one text line in the image is determined by finding a parametric curve that maximizes a first fitness function that depends on the values of pixels through which the parametric curve passes and pixels below the parametric curve. The base-line corresponds to the parametric curve for which the first fitness function is maximized. The first fitness function is designed so that it increases with increasing lightless or brightness of pixels immediately below the parametric curve while also increasing with decreasing lightness of pixels through which the parametric curve passes. The mean-line is determined by incrementally shifting the base-line upward by predetermined amounts (e.g., a single pixel) until a second fitness function for the shifted base-line is maximized. The second fitness function is essentially the inverse of the first fitness function.Type: GrantFiled: March 9, 2010Date of Patent: November 13, 2012Assignee: Microsoft CorporationInventors: Djordje Nijemcevic, Milan Vugdelija, Bodin Dresevic
-
Publication number: 20110280481Abstract: An electronic model of the image document is created by undergoing an OCR process. The electronic model includes elements (e.g., words, text lines, paragraphs, images) of the image document that have been determined by each of a plurality of sequentially executed stages in the OCR process. The electronic model serves as input information which is supplied to each of the stages by a previous stage that processed the image document. A graphical user interface is presented to the user so that the user can provide user input data correcting a mischaracterized item appearing in the document. Based on the user input data, the processing stage which produced the initial error that gave rise to the mischaracterized item corrects the initial error. Stages of the OCR process subsequent to this stage then correct any consequential errors arising in their respective stages as a result of the initial error.Type: ApplicationFiled: May 17, 2010Publication date: November 17, 2011Applicant: MICROSOFT CORPORATIONInventors: Bogdan Radakovic, Milan Vugdelija, Nikola Todic, Aleksandar Uzelac, Bodin Dresevic