Patents by Inventor Richard Szeliski

Richard Szeliski has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20100238164
    Abstract: An “Oblique Image Stitcher” provides a technique for constructing a photorealistic oblique view from a set of input images representing a series of partially overlapping views of a scene. The Oblique Image Stitcher first projects each input image onto a geometric proxy of the scene and renders the images from a desired viewpoint. Once the images have been projected onto the geometric proxy, the rendered images are evaluated to identify optimum seams along which the various images are to be blended. Once the optimum seams are selected, the images are remapped relative to those seams by leaving the mapping unchanged at the seams and interpolating a smooth mapping between the seams. The remapped images are then composited to construct the final mosaiced oblique view of the scene. The result is a mosaic image constructed by warping the input images in a photorealistic manner which agrees at seams between images.
    Type: Application
    Filed: June 6, 2010
    Publication date: September 23, 2010
    Applicant: MICROSOFT CORPORATION
    Inventors: Drew Steedly, Richard Szeliski, Matthew Uyttendaele, Michael Cohen
  • Patent number: 7778491
    Abstract: An “Oblique Image Stitcher” provides a technique for constructing a photorealistic oblique view from a set of input images representing a series of partially overlapping views of a scene. The Oblique Image Stitcher first projects each input image onto a geometric proxy of the scene and renders the images from a desired viewpoint. Once the images have been projected onto the geometric proxy, the rendered images are evaluated to identify optimum seams along which the various images are to be blended. Once the optimum seams are selected, the images are remapped relative to those seams by leaving the mapping unchanged at the seams and interpolating a smooth mapping between the seams. The remapped images are then composited to construct the final mosaiced oblique view of the scene. The result is a mosaic image constructed by warping the input images in a photorealistic manner which agrees at seams between images.
    Type: Grant
    Filed: April 10, 2006
    Date of Patent: August 17, 2010
    Assignee: Microsoft Corporation
    Inventors: Drew Steedly, Richard Szeliski, Matthew Uyttendaele, Michael Cohen
  • Patent number: 7702016
    Abstract: A system and process for compressing and decompressing multiple video streams depicting substantially the same dynamic scene from different viewpoints that from a grid of viewpoints. Each frame in each contemporaneous set of video frames of the multiple streams is represented by at least a two layers—a main layer and a boundary layer. Compression of the main layers involves first designating one or more of these layers in each set of contemporaneous frames as keyframes. For each set of contemporaneous frames in time sequence order, the main layer of each keyframe is compressed using an inter-frame compression technique. In addition, the main layer of each non-keyframe within the frame set under consideration is compressed using a spatial prediction compression technique. Finally, the boundary layers of each frame in the current frame set are each compressed using an intra-frame compression technique. Decompression is generally the reverse of the compression process.
    Type: Grant
    Filed: March 31, 2005
    Date of Patent: April 20, 2010
    Assignee: Microsoft Corporation
    Inventors: Simon Winder, Matthew T. Uyttendaele, Charles Lawrence Zitnick, III, Richard Szeliski, Sing Bing Kang
  • Publication number: 20090324126
    Abstract: Described are techniques for image deconvolution to deblur an image given a blur kernel. Localized color statistics derived from the image to be deblurred serve as a prior constraint during deconvolution. A pixel's color is formulated as a linear combination of the two most prevalent colors within a neighborhood of the pixel. This may be repeated for many or all pixels in an image. The linear combinations of the pixels serve as a two-color prior for deconvolving the blurred image. The two-color prior is responsive to the content of the image and it may decouple edge sharpness from edge strength.
    Type: Application
    Filed: June 26, 2008
    Publication date: December 31, 2009
    Applicant: MICROSOFT CORPORATION
    Inventors: Charles Lawrence Zitnick, Neel Joshi, Richard Szeliski
  • Publication number: 20090316995
    Abstract: A two-dimensional blur kernel is computed for a digital image by first estimating a sharp image from the digital image. The sharp image is derived from the digital image by sharpening at least portions of the digital image. The two-dimensional blur function is computed by minimizing an optimization algorithm that estimates the blur function.
    Type: Application
    Filed: June 23, 2008
    Publication date: December 24, 2009
    Applicant: MICROSOFT CORPORATION
    Inventors: Richard Szeliski, Neel Joshi, Charles Lawrence Zitnick
  • Publication number: 20090310888
    Abstract: Multi-pass image resampling technique embodiments are presented that employ a series of one-dimensional filtering, resampling, and shearing stages to achieve good efficiency while maintaining high visual fidelity. In one embodiment, high-quality (multi-tap) image filtering is used inside each one-dimensional resampling stage. Because each stage only uses one-dimensional filtering, the overall computation efficiency is very good and amenable to graphics processing unit (GPU) implementation using pixel shaders. This embodiment also upsamples the image before shearing steps in a direction orthogonal to the shearing to prevent aliasing, and then downsamples the image to its final size with high-quality low-pass filtering. This ensures that none of the stages causes excessive blurring or aliasing.
    Type: Application
    Filed: June 13, 2008
    Publication date: December 17, 2009
    Applicant: Microsoft Corporation
    Inventors: Richard Szeliski, Matthew T. Uyttendaele, Simon Winder
  • Publication number: 20090263045
    Abstract: Multi-spline image blending technique embodiments are presented which generally employ a separate low-resolution offset field for every image region being blended, rather than a single (piecewise smooth) offset field for all the regions to produce a visually consistent blended image. Each of the individual offset fields is smoothly varying, and so is represented using a low-dimensional spline. A resulting linear system can be rapidly solved because it involves many fewer variables than the number of pixels being blended.
    Type: Application
    Filed: April 17, 2008
    Publication date: October 22, 2009
    Applicant: Microsoft Corporation
    Inventors: Richard Szeliski, Matthew T. Uyttendaele
  • Publication number: 20090244062
    Abstract: A collection of photos and a three-dimensional reconstruction of the photos are used to construct and texture a mesh model. In one embodiment, a first digital image of a first view of a real world scene is analyzed to identify lines in the first view. Among the lines, parallel lines are identified. A three-dimensional vanishing direction in a three-dimensional space is determined based on the parallel lines and an orientation of the digital image in the three-dimensional space. A plane is automatically generated by fitting the plane to the vanishing direction. A rendering of a three-dimensional model with the plane is displayed. Three-dimensional points corresponding to features common to the photos may be used to constrain the plane. The photos may be projected onto the model to provide visual feedback when editing the plane. Furthermore, the photos may be used to texture the model.
    Type: Application
    Filed: March 31, 2008
    Publication date: October 1, 2009
    Applicant: MICROSOFT
    Inventors: Drew Steedly, Richard Szeliski, Sudipta Sinha, Maneesh Agrawala
  • Patent number: 7583288
    Abstract: A system and process for generating a panoramic video. Essentially, the panoramic video is created by first acquiring multiple videos of the scene being depicted. Preferably, these videos collectively depict a full 360 degree view of the surrounding scene and are captured using a multiple camera rig. The acquisition phase also includes a calibration procedure that provides information about the camera rig used to capture the videos that is used in the next phase for creating the panoramic video. This next phase, which is referred to as the authoring phase, involves mosaicing or stitching individual frames of the videos, which were captured at approximately the same moment in time, to form each frame of the panoramic video. A series of texture maps are then constructed for each frame of the panoramic video. Each texture map coincides with a portion of a prescribed environment model of the scene.
    Type: Grant
    Filed: June 15, 2004
    Date of Patent: September 1, 2009
    Assignee: Microsoft Corporation
    Inventors: Matthew Uyttendaele, Richard Szeliski
  • Patent number: 7574017
    Abstract: The present invention is embodied in a system and method for statistically comparing a first set of digital data to at least a second set of digital data and matching the first set of digital data to appropriately corresponding portions of the second set of digital data. The first or the second set of digital data can be transformed during statistical analysis to enhance statistical analysis of the digital data.
    Type: Grant
    Filed: December 7, 2005
    Date of Patent: August 11, 2009
    Assignee: Microsoft Corporation
    Inventors: Richard Szeliski, Nicholas Wilt
  • Patent number: 7561620
    Abstract: A system and process for compressing and decompressing multiple video streams depicting substantially the same dynamic scene from different viewpoints. Each frame in each contemporaneous set of video frames of the multiple streams is represented by at least a two layers—a main layer and a boundary layer. Compression of the main layers involves first designating one or more of these layers in each set of contemporaneous frames as keyframes. For each set of contemporaneous frames in time sequence order, the main layer of each keyframe is compressed using an inter-frame compression technique. In addition, the main layer of each non-keyframe within the frame set under consideration is compressed using a spatial prediction compression technique. Finally, the boundary layers of each frame in the current frame set are each compressed using an intra-frame compression technique. Decompression is generally the reverse of the compression process.
    Type: Grant
    Filed: August 3, 2004
    Date of Patent: July 14, 2009
    Assignee: Microsoft Corporation
    Inventors: Simon Winder, Matthew Uyttendaele, Charles Zitnick, III, Richard Szeliski, Sing Bing Kang
  • Publication number: 20090157503
    Abstract: The claimed subject matter relates to an architecture that can facilitate advertising models in connection with pyramidal volumes of advertising space. In particular, a pixel at one plane of view of an image can be associated with four pixels at a lower plane of view and so on. Advertising rights with respect to the pixel can be offered for sale, which can include all, a subset, or a different set of advertising rights with respect to other pixels in the pyramidal volume. The architecture can construct the data for the image dynamically based upon contextual input and the advertising rights as well as image format can be constructed based upon notions of zoning.
    Type: Application
    Filed: December 18, 2007
    Publication date: June 18, 2009
    Applicant: MICROSOFT CORPORATION
    Inventors: Karim Farouki, Blaise Aguera y Arcas, Brett Brewer, Steven Drucker, Gary W. Flake, Stephen L. Lawler, Richard Szeliski, Michael Fredrick Cohen
  • Patent number: 7499586
    Abstract: The subject application provides an architecture that facilitates producing a single image that can visualize a scene too large to depict from any single perspective view. These images can be stitched together on a 2-dimensional picture surface to form a multi-perspective image of the entire extent of the scene depicted by the input images. Where the respective images overlap, an objective function can be employed to determine where to make a transition between one image and the next. The objective function can employ a data cost and a seam cost to make this determination.
    Type: Grant
    Filed: December 30, 2005
    Date of Patent: March 3, 2009
    Assignee: Microsoft Corporation
    Inventors: Aseem O. Agarwala, Maneesh Agrawala, Michael F. Cohen, David H. Salesin, Richard Szeliski
  • Patent number: 7460730
    Abstract: A “Keyframe Stitcher” provides an efficient technique for building mosaic panoramic images by registering or aligning video frames to construct a mosaic panoramic representation. Matching of image pairs is performed by extracting feature points from every image frame and matching those points between image pairs. Further, the Keyframe Stitcher preserves accuracy of image stitching when matching image pairs by utilizing ordering information inherent in the video. The cost of searching for matches between image frames is reduced by identifying “keyframes” based on computed image-to-image overlap. Keyframes are then matched to all other keyframes, but intermediate image frames are only matched to temporally neighboring keyframes and neighboring intermediate frames to construct a “match structure.” Image orientations are then estimated from this match structure and used to construct the mosaic.
    Type: Grant
    Filed: August 4, 2005
    Date of Patent: December 2, 2008
    Assignee: Microsoft Corporation
    Inventors: Chris Pal, Drew Steedly, Richard Szeliski
  • Patent number: 7457477
    Abstract: A system and method for improving digital flash photographs. The present invention is a technique that significantly improves low-light imaging by giving the end-user all the advantages of flash photography without producing the jarring look. The invention uses an image pair—one taken with flash the other without—to remove noise from the ambient image, sharpen the ambient image using detail from the flash image, correct for color, and remove red-eye.
    Type: Grant
    Filed: July 6, 2004
    Date of Patent: November 25, 2008
    Assignee: Microsoft Corporation
    Inventors: Georg F. Petschnigg, Richard Szeliski, Michael F. Cohen, Hugues Hoppe, Maneesh Agrawala
  • Patent number: 7424218
    Abstract: A “Panoramic Viewfinder” provides an intuitive interactive viewfinder display which operates on a digital camera display screen. This interactive viewfinder provides real-time assistance in capturing images for constructing panoramic image mosaics. The Panoramic Viewfinder “brushes” a panorama from images captured in any order, while providing visual feedback to the user for ensuring that desired scene elements will appear in the final panorama. This visual feedback presents real-time stitched previews of the panorama while capturing images.
    Type: Grant
    Filed: July 28, 2005
    Date of Patent: September 9, 2008
    Assignee: Microsoft Corporation
    Inventors: Patrick Baudisch, Chris Pal, Eric Rudolph, Drew Steedly, Richard Szeliski, Desney Tan, Matthew Uyttendaele
  • Patent number: 7382931
    Abstract: A system and process for generating High Dynamic Range (HDR) video is presented which involves first capturing a video image sequence while varying the exposure so as to alternate between frames having a shorter and longer exposure. The exposure for each frame is set prior to it being captured as a function of the pixel brightness distribution in preceding frames. Next, for each frame of the video, the corresponding pixels between the frame under consideration and both preceding and subsequent frames are identified. For each corresponding pixel set, at least one pixel is identified as representing a trustworthy pixel. The pixel color information associated with the trustworthy pixels is then employed to compute a radiance value for each pixel set to form a radiance map. A tone mapping procedure can then be performed to convert the radiance map into an 8-bit representation of the HDR frame.
    Type: Grant
    Filed: January 14, 2005
    Date of Patent: June 3, 2008
    Assignee: Microsoft Corporation
    Inventors: Sing Bing Kang, Matthew Uyttendaele, Simon Winder, Richard Szeliski
  • Patent number: 7382897
    Abstract: A system and process for identifying corresponding points among multiple images of a scene is presented. This involves a multi-view matching framework based on a new class of invariant features. Features are located at Harris corners in scale-space and oriented using a blurred local gradient. This defines a similarity invariant frame in which to sample a feature descriptor. The descriptor actually formed is a bias/gain normalized patch of intensity values. Matching is achieved using a fast nearest neighbor procedure that uses indexing on low frequency Haar wavelet coefficients. A simple 6 parameter model for patch matching is employed, and the noise statistics are analyzed for correct and incorrect matches. This leads to a simple match verification procedure based on a per feature outlier distance.
    Type: Grant
    Filed: April 27, 2004
    Date of Patent: June 3, 2008
    Assignee: Microsoft Corporation
    Inventors: Matthew Brown, Richard Szeliski
  • Patent number: 7379583
    Abstract: A system and process for computing a 3D reconstruction of a scene from multiple images thereof, which is based on a color segmentation-based approach, is presented. First, each image is independently segmented. Second, an initial disparity space distribution (DSD) is computed for each segment, using the assumption that all pixels within a segment have the same disparity. Next, each segment's DSD is refined using neighboring segments and its projection into other images. The assumption that each segment has a single disparity is then relaxed during a disparity smoothing stage. The result is a disparity map for each image, which in turn can be used to compute a per pixel depth map if the reconstruction application calls for it.
    Type: Grant
    Filed: March 31, 2005
    Date of Patent: May 27, 2008
    Assignee: Microsoft Corporation
    Inventors: Charles Zitnick, III, Sing Bing Kang, Matthew Uyttendaele, Simon Winder, Richard Szeliski
  • Publication number: 20080025633
    Abstract: A “Finite Element Preconditioner” provides locally adapted hierarchical basis functions for preconditioning large data optimization problems. For example, a few of the many typical graphics applications that make use of iterative optimization solutions include tone mapping, gradient domain blending, colorization, and scattered data interpolation. Preconditioning image data for such optimization problems allows those image optimization problems to be solved using less computational overhead and therefore to produce better quality image outputs with less computational overhead. The Finite Element Preconditioner evaluates data, such as image data, to adapt hierarchical basis functions to inhomogeneous problems for preconditioning large optimization problems.
    Type: Application
    Filed: July 25, 2006
    Publication date: January 31, 2008
    Applicant: MICROSOFT CORPORATION
    Inventor: Richard Szeliski