Patents by Inventor Richard Szeliski
Richard Szeliski has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20060133688Abstract: A system and process for generating High Dynamic Range (HDR) video is presented which involves first capturing a video image sequence while varying the exposure so as to alternate between frames having a shorter and longer exposure. The exposure for each frame is set prior to it being captured as a function of the pixel brightness distribution in preceding frames. Next, for each frame of the video, the corresponding pixels between the frame under consideration and both preceding and subsequent frames are identified. For each corresponding pixel set, at least one pixel is identified as representing a trustworthy pixel. The pixel color information associated with the trustworthy pixels is then employed to compute a radiance value for each pixel set to form a radiance map. A tone mapping procedure can then be performed to convert the radiance map into an 8-bit representation of the HDR frame.Type: ApplicationFiled: January 23, 2006Publication date: June 22, 2006Applicant: Microsoft CorporationInventors: Sing Kang, Matthew Uyttendaele, Simon Winder, Richard Szeliski
-
Publication number: 20060114253Abstract: A system and process for generating a two-layer, 3D representation of a digital or digitized image from the image and a pixel disparity map of the image is presented. The two layer representation includes a main layer having pixels exhibiting background colors and background disparities associated with correspondingly located pixels of depth discontinuity areas in the image, as well as pixels exhibiting colors and disparities associated with correspondingly located pixels of the image not found in these depth discontinuity areas. The other layer is a boundary layer made up of pixels exhibiting foreground colors, foreground disparities and alpha values associated with the correspondingly located pixels of the depth discontinuity areas. The depth discontinuity areas correspond to prescribed sized areas surrounding depth discontinuities found in the image using a disparity map thereof.Type: ApplicationFiled: January 17, 2006Publication date: June 1, 2006Applicant: Microsoft CorporationInventors: Charles Zitnick, Richard Szeliski, Sing Kang, Matthew Uyttendaele, Simon Winder
-
Publication number: 20060088189Abstract: The present invention is embodied in a system and method for statistically comparing a first set of digital data to at least a second set of digital data and matching the first set of digital data to appropriately corresponding portions of the second set of digital data. The first or the second set of digital data can be transformed during statistical analysis to enhance statistical analysis of the digital data.Type: ApplicationFiled: December 7, 2005Publication date: April 27, 2006Applicant: Microsoft CorporationInventors: Richard Szeliski, Nicholas Wilt
-
Publication number: 20060083405Abstract: The present invention is embodied in a system and method for statistically comparing a first set of digital data to at least a second set of digital data and matching the first set of digital data to appropriately corresponding portions of the second set of digital data. The first or the second set of digital data can be transformed during statistical analysis to enhance statistical analysis of the digital data.Type: ApplicationFiled: December 7, 2005Publication date: April 20, 2006Applicant: Microsoft CorporationInventors: Richard Szeliski, Nicholas Wilt
-
Publication number: 20060072851Abstract: A system and method for deghosting mosaics provides a novel multiperspective plane sweep approach for generating an image mosaic from a sequence of still images, video images, scanned photographic images, computer generated images, etc. This multiperspective plane sweep approach uses virtual camera positions to compute depth maps for columns of overlapping pixels in adjacent images. Object distortions and ghosting caused by image parallax when generating the image mosaics are then minimized by blending pixel colors, or grey values, for each computed depth to create a common composite area for each of the overlapping images. Further, the multiperspective plane sweep approach described herein is both computationally efficient, and applicable to both the case of limited overlap between the images used for creating the image mosaics, and to the case of extensive or increased image overlap.Type: ApplicationFiled: November 22, 2005Publication date: April 6, 2006Applicant: Microsoft CorporationInventors: Sing Bing Kang, Richard Szeliski, Matthew Uyttendaele
-
Publication number: 20060072852Abstract: A system and method for deghosting mosaics provides a novel multiperspective plane sweep approach for generating an image mosaic from a sequence of still images, video images, scanned photographic images, computer generated images, etc. This multiperspective plane sweep approach uses virtual camera positions to compute depth maps for columns of overlapping pixels in adjacent images. Object distortions and ghosting caused by image parallax when generating the image mosaics are then minimized by blending pixel colors, or grey values, for each computed depth to create a common composite area for each of the overlapping images. Further, the multiperspective plane sweep approach described herein is both computationally efficient, and applicable to both the case of limited overlap between the images used for creating the image mosaics, and to the case of extensive or increased image overlap.Type: ApplicationFiled: November 22, 2005Publication date: April 6, 2006Applicant: Microsoft CorporationInventors: Sing Bing Kang, Richard Szeliski, Mattew Uyttendaele
-
Publication number: 20060062434Abstract: The present invention is embodied in a system and method for extracting structure from multiple images of a scene by representing the scene as a group of image layers, including reflection and transparency layers. In general, the present invention performs layer extraction from multiple images containing reflections and transparencies. The present invention includes an optimal approach for recovering layer images and their associated motions from an arbitrary number of composite images. The present invention includes image formation equations, the constrained least squares technique used to recover the component images, a novel method to estimate upper and lower bounds on the solution using min- and max-composites, and a motion refinement method.Type: ApplicationFiled: October 1, 2005Publication date: March 23, 2006Applicant: Mircosoft CorporationInventors: Richard Szeliski, Shmuel Avidan, Padmanabhan Anandan
-
Patent number: 7015926Abstract: A system and process for generating a two-layer, 3D representation of a digital or digitized image from the image and a pixel disparity map of the image is presented. The two layer representation includes a main layer having pixels exhibiting background colors and background disparities associated with correspondingly located pixels of depth discontinuity areas in the image, as well as pixels exhibiting colors and disparities associated with correspondingly located pixels of the image not found in these depth discontinuity areas. The other layer is a boundary layer made up of pixels exhibiting foreground colors, foreground disparities and alpha values associated with the correspondingly located pixels of the depth discontinuity areas. The depth discontinuity areas correspond to prescribed sized areas surrounding depth discontinuities found in the image using a disparity map thereof.Type: GrantFiled: June 28, 2004Date of Patent: March 21, 2006Assignee: Microsoft CorporationInventors: Charles Lawrence Zitnick, III, Richard Szeliski, Sing Bing Kang, Matthew T. Uyttendaele, Simon Winder
-
Publication number: 20060056682Abstract: The present invention is embodied in a system and method for extracting structure from multiple images of a scene by representing the scene as a group of image layers, including reflection and transparency layers. In general, the present invention performs layer extraction from multiple images containing reflections and transparencies. The present invention includes an optimal approach for recovering layer images and their associated motions from an arbitrary number of composite images. The present invention includes image formation equations, the constrained least squares technique used to recover the component images, a novel method to estimate upper and lower bounds on the solution using min- and max-composites, and a motion refinement method.Type: ApplicationFiled: October 1, 2005Publication date: March 16, 2006Applicant: Microsoft CorporationInventors: Richard Szeliski, Shmuel Avidan, Padmanabhan Anandan
-
Patent number: 7010174Abstract: A system and process for generating High Dynamic Range (HDR) video is presented which involves first capturing a video image sequence while varying the exposure so as to alternate between frames having a shorter and longer exposure. The exposure for each frame is set prior to it being captured as a function of the pixel brightness distribution in preceding frames. Next, for each frame of the video, the corresponding pixels between the frame under consideration and both preceding and subsequent frames are identified. For each corresponding pixel set, at least one pixel is identified as representing a trustworthy pixel. The pixel color information associated with the trustworthy pixels is then employed to compute a radiance value for each pixel set to form a radiance map. A tone mapping procedure can then be performed to convert the radiance map into an 8-bit representation of the HDR frame.Type: GrantFiled: October 15, 2004Date of Patent: March 7, 2006Assignee: Microsoft CorporationInventors: Sing Bing Kang, Matthew T. Uyttendaele, Simon Winder, Richard Szeliski
-
Patent number: 6999095Abstract: A system and process for reconstructing optimal texture maps from multiple views of a scene is described. In essence, this reconstruction is based on the optimal synthesis of textures from multiple sources. This is generally accomplished using basic image processing theory to derive the correct weights for blending the multiple views. Namely, the steps of reconstructing, warping, prefiltering, and resampling are followed in order to warp reference textures to a desired location, and to compute spatially-variant weights for optimal blending. These weights take into consideration the anisotropy in the texture projection and changes in sampling frequency due to foreshortening. The weights are combined and the computation of the optimal texture is treated as a restoration problem, which involves solving a linear system of equations.Type: GrantFiled: November 5, 2004Date of Patent: February 14, 2006Assignee: Microsoft CorporationInventors: Lifeng Wang, Sing Bing Kang, Richard Szeliski, Heung-Yeung Shum, Baining Guo
-
Publication number: 20060031915Abstract: A system and process for compressing and decompressing multiple video streams depicting substantially the same dynamic scene from different viewpoints that from a grid of viewpoints. Each frame in each contemporaneous set of video frames of the multiple streams is represented by at least a two layers—a main layer and a boundary layer. Compression of the main layers involves first designating one or more of these layers in each set of contemporaneous frames as keyframes. For each set of contemporaneous frames in time sequence order, the main layer of each keyframe is compressed using an inter-frame compression technique. In addition, the main layer of each non-keyframe within the frame set under consideration is compressed using a spatial prediction compression technique. Finally, the boundary layers of each frame in the current frame set are each compressed using an intra-frame compression technique. Decompression is generally the reverse of the compression process.Type: ApplicationFiled: March 31, 2005Publication date: February 9, 2006Applicant: Microsoft CorporationInventors: Simon Winder, Matthew Uyttendaele, Charles Zitnick, Richard Szeliski, Sing Kang
-
Publication number: 20060029134Abstract: A system and process for compressing and decompressing multiple video streams depicting substantially the same dynamic scene from different viewpoints. Each frame in each contemporaneous set of video frames of the multiple streams is represented by at least a two layers—a main layer and a boundary layer. Compression of the main layers involves first designating one or more of these layers in each set of contemporaneous frames as keyframes. For each set of contemporaneous frames in time sequence order, the main layer of each keyframe is compressed using an inter-frame compression technique. In addition, the main layer of each non-keyframe within the frame set under consideration is compressed using a spatial prediction compression technique. Finally, the boundary layers of each frame in the current frame set are each compressed using an intra-frame compression technique. Decompression is generally the reverse of the compression process.Type: ApplicationFiled: August 3, 2004Publication date: February 9, 2006Applicant: Microsoft CorporationInventors: Simon Winder, Matthew Uyttendaele, Charles Zitnick, Richard Szeliski, Sing Kang
-
Publication number: 20060028473Abstract: A system and process for rendering and displaying an interactive viewpoint video is presented in which a user can watch a dynamic scene while manipulating (freezing, slowing down, or reversing) time and changing the viewpoint at will. The ability to interactively control viewpoint while watching a video is an exciting new application for image-based rendering. Because any intermediate view can be synthesized at any time, with the potential for space-time manipulation, this type of video has been dubbed interactive viewpoint video.Type: ApplicationFiled: August 3, 2004Publication date: February 9, 2006Applicant: Microsoft CorporationInventors: Matthew Uyttendaele, Simon Winder, Charles Zitnick, Richard Szeliski, Sing Kang
-
Publication number: 20060028489Abstract: A system and process for rendering and displaying an interactive viewpoint video is presented in which a user can watch a dynamic scene while manipulating (freezing, slowing down, or reversing) time and changing the viewpoint at will. The ability to interactively control viewpoint while watching a video is an exciting new application for image-based rendering. Because any intermediate view can be synthesized at any time, with the potential for space-time manipulation, this type of video has been dubbed interactive viewpoint video.Type: ApplicationFiled: March 31, 2005Publication date: February 9, 2006Applicant: Microsoft CorporationInventors: Matthew Uyttendaele, Simon Winder, Charles Zitnick, Richard Szeliski, Sing Kang
-
Publication number: 20060031917Abstract: A process for compressing and decompressing non-keyframes in sequential sets of contemporaneous video frames making up multiple video streams where the video frames in a set depict substantially the same scene from different viewpoints. Each set of contemporaneous video frames has a plurality frames designated as keyframes with the remaining being non-keyframes. In one embodiment, the non-keyframes are compressed using a multi-directional spatial prediction technique. In another embodiment, the non-keyframes of each set of contemporaneous video frames are compressed using a combined chaining and spatial prediction compression technique. The spatial prediction compression technique employed can be a single direction technique where just one reference frame, and so one chain, is used to predict each non-keyframe, or it can be a multi-directional technique where two or more reference frames, and so chains, are used to predict each non-keyframe.Type: ApplicationFiled: July 15, 2005Publication date: February 9, 2006Applicant: Microsoft CorporationInventors: Simon Winder, Matthew Uyttendaele, Charles Zitnick, Richard Szeliski, Sing Kang
-
Publication number: 20060008171Abstract: A system and method for improving digital flash photographs. The present invention is a technique that significantly improves low-light imaging by giving the end-user all the advantages of flash photography without producing the jarring look. The invention uses an image pair—one taken with flash the other without—to remove noise from the ambient image, sharpen the ambient image using detail from the flash image, correct for color, and remove red-eye.Type: ApplicationFiled: July 6, 2004Publication date: January 12, 2006Applicant: Microsoft CorporationInventors: Georg Petschnigg, Richard Szeliski, Michael Cohen, Hugues Hoppe, Maneesh Agrawala
-
Patent number: 6985156Abstract: A system and process for reconstructing optimal texture maps from multiple views of a scene is described. In essence, this reconstruction is based on the optimal synthesis of textures from multiple sources. This is generally accomplished using basic image processing theory to derive the correct weights for blending the multiple views. Namely, the steps of reconstructing, warping, prefiltering, and resampling are followed in order to warp reference textures to a desired location, and to compute spatially-variant weights for optimal blending. These weights take into consideration the anisotropy in the texture projection and changes in sampling frequency due to foreshortening. The weights are combined and the computation of the optimal texture is treated as a restoration problem, which involves solving a linear system of equations.Type: GrantFiled: June 23, 2004Date of Patent: January 10, 2006Assignee: Microsoft CorporationInventors: Lifeng Wang, Sing Bing Kang, Richard Szeliski, Heung-Yeung Shum, Baining Guo
-
Publication number: 20050285872Abstract: A system and process for reconstructing optimal texture maps from multiple views of a scene is described. In essence, this reconstruction is based on the optimal synthesis of textures from multiple sources. This is generally accomplished using basic image processing theory to derive the correct weights for blending the multiple views. Namely, the steps of reconstructing, warping, prefiltering, and resampling are followed in order to warp reference textures to a desired location, and to compute spatially-variant weights for optimal blending. These weights take into consideration the anisotropy in the texture projection and changes in sampling frequency due to foreshortening. The weights are combined and the computation of the optimal texture is treated as a restoration problem, which involves solving a linear system of equations.Type: ApplicationFiled: July 28, 2005Publication date: December 29, 2005Applicant: Microsoft CorporationInventors: Lifeng Wang, Sing Bing Kang, Richard Szeliski, Heung-Yeung Shum, Baining Guo
-
Publication number: 20050286757Abstract: A system and process for computing a 3D reconstruction of a scene from multiple images thereof, which is based on a color segmentation-based approach, is presented. First, each image is independently segmented. Second, an initial disparity space distribution (DSD) is computed for each segment, using the assumption that all pixels within a segment have the same disparity. Next, each segment's DSD is refined using neighboring segments and its projection into other images. The assumption that each segment has a single disparity is then relaxed during a disparity smoothing stage. The result is a disparity map for each image, which in turn can be used to compute a per pixel depth map if the reconstruction application calls for it.Type: ApplicationFiled: June 28, 2004Publication date: December 29, 2005Applicant: Microsoft CorporationInventors: Charles Zitnick, Sing Kang, Matthew Uyttendaele, Simon Winder, Richard Szeliski