Patents by Inventor Milan Vugdelija

Milan Vugdelija has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 9785849
    Abstract: A method and system is provided for identifying a page layout of an image that includes textual regions. The textual regions are to undergo optical character recognition (OCR). The system includes an input component that receives an input image that includes words around which bounding boxes have been formed and a text identifying component that groups the words into a plurality of text regions. A reading line component groups words within each of the text regions into reading lines. A text region sorting component that sorts the text regions in accordance with their reading order.
    Type: Grant
    Filed: November 13, 2013
    Date of Patent: October 10, 2017
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Mircea Cimpoi, Sasa Galic, Milan Vugdelija
  • Patent number: 9773170
    Abstract: Embodiments that relate to determining an estimated pupil region of an eye are disclosed. In one embodiment a method includes receiving an image of an eye, with the image comprising a plurality of pixels. A rough pupil region is generated using at least a subset of the plurality of pixels. A plurality of pupil boundary point candidates are extracted from the rough pupil region, with each of the candidates weighted based on color values of at least two neighbor pixels. A parametric curve may be fitted to the weighted pupil boundary point candidates to determine the estimated pupil region of the eye of the user.
    Type: Grant
    Filed: May 18, 2016
    Date of Patent: September 26, 2017
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Milan Vugdelija, Djordje Nijemcevic, David Zachris Nister, Christopher Maurice Mei, Chandra Sekhar Gatla
  • Publication number: 20170255614
    Abstract: The technology described herein employs techniques for overlaying (superimposing) translated text on top of (over) scanned text in realtime. The technology recognizes text in an image and identifies a first language in which the text is written. The technology translates the text into a second language and generates an overlay in the second language. The overlay is then inserted into the display over the text in the image.
    Type: Application
    Filed: May 18, 2017
    Publication date: September 7, 2017
    Inventors: MAGDALENA VUKOSAVLJEVIC, RUCHITA BHARGAVA, MATTHEW S. ASHMAN, IVAN STOJILJKOVIC, PAVLE JOSIPOVIC, DJORDJE NIJEMCEVIC, MILAN VUGDELIJA
  • Patent number: 9690782
    Abstract: The technology described herein employs techniques for overlaying (superimposing) translated text on top of (over) scanned text in realtime. The technology recognizes text in an image and identifies a first language in which the text is written. The technology translates the text into a second language and generates an overlay in the second language. The overlay is then inserted into the display over the text in the image.
    Type: Grant
    Filed: May 2, 2015
    Date of Patent: June 27, 2017
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Magdalena Vukosavljevic, Ruchita Bhargava, Matthew S. Ashman, Ivan Stojiljkovic, Pavle Josipović, Djordje Nijemcevic, Milan Vugdelija
  • Publication number: 20160259961
    Abstract: Embodiments that relate to determining an estimated pupil region of an eye are disclosed. In one embodiment a method includes receiving an image of an eye, with the image comprising a plurality of pixels. A rough pupil region is generated using at least a subset of the plurality of pixels. A plurality of pupil boundary point candidates are extracted from the rough pupil region, with each of the candidates weighted based on color values of at least two neighbor pixels. A parametric curve may be fitted to the weighted pupil boundary point candidates to determine the estimated pupil region of the eye of the user.
    Type: Application
    Filed: May 18, 2016
    Publication date: September 8, 2016
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Milan Vugdelija, Djordje Nijemcevic, David Zachris Nister, Christopher Maurice Mei, Chandra Sekhar Gatla
  • Patent number: 9355315
    Abstract: Embodiments that relate to determining an estimated pupil region of an eye are disclosed. In one embodiment a method includes receiving an image of an eye, with the image comprising a plurality of pixels. A rough pupil region is generated using at least a subset of the plurality of pixels. A plurality of pupil boundary point candidates are extracted from the rough pupil region, with each of the candidates weighted based on color values of at least two neighbor pixels. A parametric curve may be fitted to the weighted pupil boundary point candidates to determine the estimated pupil region of the eye of the user.
    Type: Grant
    Filed: July 24, 2014
    Date of Patent: May 31, 2016
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Milan Vugdelija, Djordje Nijemcevic, David Zachris Nister, Christopher Maurice Mei, Chandra Sekhar Gatla
  • Publication number: 20160026847
    Abstract: Embodiments that relate to determining an estimated pupil region of an eye are disclosed. In one embodiment a method includes receiving an image of an eye, with the image comprising a plurality of pixels. A rough pupil region is generated using at least a subset of the plurality of pixels. A plurality of pupil boundary point candidates are extracted from the rough pupil region, with each of the candidates weighted based on color values of at least two neighbor pixels. A parametric curve may be fitted to the weighted pupil boundary point candidates to determine the estimated pupil region of the eye of the user.
    Type: Application
    Filed: July 24, 2014
    Publication date: January 28, 2016
    Inventors: Milan Vugdelija, Djordje Nijemcevic, David Zachris Nister, Christopher Maurice Mei, Chandra Sekhar Gatla
  • Publication number: 20150234812
    Abstract: Architecture that employs techniques for overlaying (superimposing) translated text on top of (over) scanned text in realtime translation to provide clear visual correlation between original text and translated text. Algorithms are provided that overlay text in cases of translated scanned text of language written in first direction to a language written in same direction, translate scanned text from a first language written in a first direction to a second language written in the opposite direction, and translated scanned text from a language written in a first direction to language written in a different direction.
    Type: Application
    Filed: May 2, 2015
    Publication date: August 20, 2015
    Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Magdalena Vukosavljevic, Ruchita Bhargava, Matthew S. Ashman, Ivan Stojiljkovic, Pavle Josipovic, Djordje Nijemcevic, Milan Vugdelija
  • Patent number: 9064174
    Abstract: Architecture that enables optical character recognition (OCR) of text in video frames at the rate at which the frames are received. Additionally, conflation is performed on multiple text recognition results in the frame sequence. The architecture comprises an OCR text recognition engine and a tracker system; the tracker system establishes a common coordinate system in which OCR results from different frames may be compared and/or combined. From a set of sequential video frames, a keyframe is chosen from which the reference coordinate system is established. An estimated transformation from keyframe coordinates to subsequent video frames is computed using the tracker system. When text recognition is completed for any subsequent frame, the result coordinates can be related to the keyframe using the inverse transformation from the processed frame to the reference keyframe. The results can be rendered for viewing as the results are obtained.
    Type: Grant
    Filed: October 18, 2012
    Date of Patent: June 23, 2015
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: David Nister, Frederik Schaffalitzky, Michael Grabner, Matthew S. Ashman, Milan Vugdelija, Ivan Stojiljkovic
  • Patent number: 9037450
    Abstract: Architecture that employs techniques for overlaying (superimposing) translated text on top of (over) scanned text in realtime translation to provide clear visual correlation between original text and translated text. Algorithms are provided that overlay text in cases of translated scanned text of language written in first direction to a language written in same direction, translate scanned text from a first language written in a first direction to a second language written in the opposite direction, and translated scanned text from a language written in a first direction to language written in a different direction.
    Type: Grant
    Filed: December 14, 2012
    Date of Patent: May 19, 2015
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Magdalena Vukosavljevic, Ruchita Bhargava, Matthew S. Ashman, Ivan Stojiljkovic, Pavle Josipović, Djordje Nijemcevic, Milan Vugdelija
  • Patent number: 8805068
    Abstract: Architecture that automatically computes if a quadrangular object captured in a given image is or is not a markable board (e.g., a whiteboard, green board, chalkboard, etc.). The markable board has a surface on which marks can be applied using chalk, ink, dry ink, or any other suitable marking instrument or tool for the given surface. The imaged quadrangular object can be defined as having a background image and a foreground image. The background image is representative of a markable board with no applied surface marks and the foreground image comprises all discernible marks applied to the board surface, but does not include the background image. A set of performance-friendly features is received and processed by a machine-learning classifier to compute if the given quadrangular object is a markable board. Thereafter, if the given image is determined to be a markable board, image enhancement is performed.
    Type: Grant
    Filed: December 13, 2012
    Date of Patent: August 12, 2014
    Assignee: Microsoft Corporation
    Inventors: Milan Vugdelija, Magdalena Vukosavljevic, Djordje Nijemcevic
  • Publication number: 20140169668
    Abstract: Architecture that automatically computes if a quadrangular object captured in a given image is or is not a markable board (e.g., a whiteboard, green board, chalkboard, etc.). The markable board has a surface on which marks can be applied using chalk, ink, dry ink, or any other suitable marking instrument or tool for the given surface. The imaged quadrangular object can be defined as having a background image and a foreground image. The background image is representative of a markable board with no applied surface marks and the foreground image comprises all discernible marks applied to the board surface, but does not include the background image. A set of performance-friendly features is received and processed by a machine-learning classifier to compute if the given quadrangular object is a markable board. Thereafter, if the given image is determined to be a markable board, image enhancement is performed.
    Type: Application
    Filed: December 13, 2012
    Publication date: June 19, 2014
    Applicant: MICROSOFT CORPORATION
    Inventors: Milan Vugdelija, Magdalena Vukosavljevic, Djordje Nijemcevic
  • Publication number: 20140172408
    Abstract: Architecture that employs techniques for overlaying (superimposing) translated text on top of (over) scanned text in realtime translation to provide clear visual correlation between original text and translated text. Algorithms are provided that overlay text in cases of translated scanned text of language written in first direction to a language written in same direction, translate scanned text from a first language written in a first direction to a second language written in the opposite direction, and translated scanned text from a language written in a first direction to language written in a different direction.
    Type: Application
    Filed: December 14, 2012
    Publication date: June 19, 2014
    Applicant: MICROSOFT CORPORATION
    Inventors: Magdalena Vukosavljevic, Ruchita Bhargava, Matthew S. Ashman, Ivan Stojiljkovic, Pavle Josipovic, Djordje Nijemcevic, Milan Vugdelija
  • Publication number: 20140112527
    Abstract: Architecture that enables optical character recognition (OCR) of text in video frames at the rate at which the frames are received. Additionally, conflation is performed on multiple text recognition results in the frame sequence. The architecture comprises an OCR text recognition engine and a tracker system; the tracker system establishes a common coordinate system in which OCR results from different frames may be compared and/or combined. From a set of sequential video frames, a keyframe is chosen from which the reference coordinate system is established. An estimated transformation from keyframe coordinates to subsequent video frames is computed using the tracker system. When text recognition is completed for any subsequent frame, the result coordinates can be related to the keyframe using the inverse transformation from the processed frame to the reference keyframe. The results can be rendered for viewing as the results are obtained.
    Type: Application
    Filed: October 18, 2012
    Publication date: April 24, 2014
    Applicant: Microsoft Corporation
    Inventors: David Nister, Frederik Schaffalitzky, Michael Grabner, Matthew S. Ashman, Milan Vugdelija, Ivan Stojiljkovic
  • Patent number: 8704896
    Abstract: Embodiments of camera-based scanning are described. In various embodiments, one or more quadrangular objects are automatically selected from a captured image for scanning. The automatic selection is determined to be successful based on the selected quadrangular objects having an associated score that exceeds a predefined threshold. One or more scanned documents are created from portions of the captured image corresponding to the selected quadrangular objects, and the created scanned documents include corrections for perspective distortions of the selected quadrangular objects in the captured image.
    Type: Grant
    Filed: November 14, 2012
    Date of Patent: April 22, 2014
    Assignee: Microsoft Corporation
    Inventors: Djordje Nijemcevic, Magdalena Vukosavljevic, Milan Vugdelija, Ana Mitrovic, Georg F. Petschnigg, Bodin Dresevic
  • Publication number: 20140072224
    Abstract: A method and system is provided for identifying a page layout of an image that includes textual regions. The textual regions are to undergo optical character recognition (OCR). The system includes an input component that receives an input image that includes words around which bounding boxes have been formed and a text identifying component that groups the words into a plurality of text regions. A reading line component groups words within each of the text regions into reading lines. A text region sorting component that sorts the text regions in accordance with their reading order.
    Type: Application
    Filed: November 13, 2013
    Publication date: March 13, 2014
    Applicant: Microsoft Corporation
    Inventors: Mircea Cimpoi, Sasa Galic, Milan Vugdelija
  • Patent number: 8594422
    Abstract: A method and system is provided for identifying a page layout of an image that includes textual regions. The textual regions are to undergo optical character recognition (OCR). The system includes an input component that receives an input image that includes words around which bounding boxes have been formed and a text identifying component that groups the words into a plurality of text regions. A reading line component groups words within each of the text regions into reading lines. A text region sorting component that sorts the text regions in accordance with their reading order.
    Type: Grant
    Filed: March 11, 2010
    Date of Patent: November 26, 2013
    Assignee: Microsoft Corporation
    Inventors: Mircea Cimpoi, Sasa Galic, Milan Vugdelija
  • Patent number: 8345106
    Abstract: Embodiments of camera-based scanning are described. In various embodiments, scanned documents can be created using images captured by a camera associated with a device. An image captured by the camera is processed to identify portions within the image that correspond to rectangular objects such as paper, business cards, whiteboards, screens, and so forth. One or more of these portions can be selected for scanning automatically based on a scoring scheme and/or semi-automatically with the aid of input from a user. One or more scanned documents are created from the selected portions by un-warping the selected portions to remove effects of perspective (e.g., rectify the portions to rectangles) and applying various image enhancements to improve appearance.
    Type: Grant
    Filed: October 13, 2009
    Date of Patent: January 1, 2013
    Assignee: Microsoft Corporation
    Inventors: Djordje Nijemcevic, Magdalena Vukosavljevic, Milan Vugdelija, Ana Mitrovic, Georg F. Petschnigg, Bodin Dresevic
  • Patent number: 8311331
    Abstract: An optical character recognition process characterizes text lines in a textual image by their base-line, mean-line and x-height. The base-line for at least one text line in the image is determined by finding a parametric curve that maximizes a first fitness function that depends on the values of pixels through which the parametric curve passes and pixels below the parametric curve. The base-line corresponds to the parametric curve for which the first fitness function is maximized. The first fitness function is designed so that it increases with increasing lightless or brightness of pixels immediately below the parametric curve while also increasing with decreasing lightness of pixels through which the parametric curve passes. The mean-line is determined by incrementally shifting the base-line upward by predetermined amounts (e.g., a single pixel) until a second fitness function for the shifted base-line is maximized. The second fitness function is essentially the inverse of the first fitness function.
    Type: Grant
    Filed: March 9, 2010
    Date of Patent: November 13, 2012
    Assignee: Microsoft Corporation
    Inventors: Djordje Nijemcevic, Milan Vugdelija, Bodin Dresevic
  • Publication number: 20110280481
    Abstract: An electronic model of the image document is created by undergoing an OCR process. The electronic model includes elements (e.g., words, text lines, paragraphs, images) of the image document that have been determined by each of a plurality of sequentially executed stages in the OCR process. The electronic model serves as input information which is supplied to each of the stages by a previous stage that processed the image document. A graphical user interface is presented to the user so that the user can provide user input data correcting a mischaracterized item appearing in the document. Based on the user input data, the processing stage which produced the initial error that gave rise to the mischaracterized item corrects the initial error. Stages of the OCR process subsequent to this stage then correct any consequential errors arising in their respective stages as a result of the initial error.
    Type: Application
    Filed: May 17, 2010
    Publication date: November 17, 2011
    Applicant: MICROSOFT CORPORATION
    Inventors: Bogdan Radakovic, Milan Vugdelija, Nikola Todic, Aleksandar Uzelac, Bodin Dresevic