Patents by Inventor Richard Szeliski

Richard Szeliski has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

System and process for generating high dynamic range video

Publication number: 20060133688

Abstract: A system and process for generating High Dynamic Range (HDR) video is presented which involves first capturing a video image sequence while varying the exposure so as to alternate between frames having a shorter and longer exposure. The exposure for each frame is set prior to it being captured as a function of the pixel brightness distribution in preceding frames. Next, for each frame of the video, the corresponding pixels between the frame under consideration and both preceding and subsequent frames are identified. For each corresponding pixel set, at least one pixel is identified as representing a trustworthy pixel. The pixel color information associated with the trustworthy pixels is then employed to compute a radiance value for each pixel set to form a radiance map. A tone mapping procedure can then be performed to convert the radiance map into an 8-bit representation of the HDR frame.

Type: Application

Filed: January 23, 2006

Publication date: June 22, 2006

Applicant: Microsoft Corporation

Inventors: Sing Kang, Matthew Uyttendaele, Simon Winder, Richard Szeliski
System and process for generating a two-layer, 3D representation of a scene

Publication number: 20060114253

Abstract: A system and process for generating a two-layer, 3D representation of a digital or digitized image from the image and a pixel disparity map of the image is presented. The two layer representation includes a main layer having pixels exhibiting background colors and background disparities associated with correspondingly located pixels of depth discontinuity areas in the image, as well as pixels exhibiting colors and disparities associated with correspondingly located pixels of the image not found in these depth discontinuity areas. The other layer is a boundary layer made up of pixels exhibiting foreground colors, foreground disparities and alpha values associated with the correspondingly located pixels of the depth discontinuity areas. The depth discontinuity areas correspond to prescribed sized areas surrounding depth discontinuities found in the image using a disparity map thereof.

Type: Application

Filed: January 17, 2006

Publication date: June 1, 2006

Applicant: Microsoft Corporation

Inventors: Charles Zitnick, Richard Szeliski, Sing Kang, Matthew Uyttendaele, Simon Winder
Statistically comparing and matching plural sets of digital data

Publication number: 20060088189

Abstract: The present invention is embodied in a system and method for statistically comparing a first set of digital data to at least a second set of digital data and matching the first set of digital data to appropriately corresponding portions of the second set of digital data. The first or the second set of digital data can be transformed during statistical analysis to enhance statistical analysis of the digital data.

Type: Application

Filed: December 7, 2005

Publication date: April 27, 2006

Applicant: Microsoft Corporation

Inventors: Richard Szeliski, Nicholas Wilt
Statistically comparing and matching plural sets of digital data

Publication number: 20060083405

Abstract: The present invention is embodied in a system and method for statistically comparing a first set of digital data to at least a second set of digital data and matching the first set of digital data to appropriately corresponding portions of the second set of digital data. The first or the second set of digital data can be transformed during statistical analysis to enhance statistical analysis of the digital data.

Type: Application

Filed: December 7, 2005

Publication date: April 20, 2006

Applicant: Microsoft Corporation

Inventors: Richard Szeliski, Nicholas Wilt
Deghosting mosaics using multiperspective plane sweep

Publication number: 20060072851

Abstract: A system and method for deghosting mosaics provides a novel multiperspective plane sweep approach for generating an image mosaic from a sequence of still images, video images, scanned photographic images, computer generated images, etc. This multiperspective plane sweep approach uses virtual camera positions to compute depth maps for columns of overlapping pixels in adjacent images. Object distortions and ghosting caused by image parallax when generating the image mosaics are then minimized by blending pixel colors, or grey values, for each computed depth to create a common composite area for each of the overlapping images. Further, the multiperspective plane sweep approach described herein is both computationally efficient, and applicable to both the case of limited overlap between the images used for creating the image mosaics, and to the case of extensive or increased image overlap.

Type: Application

Filed: November 22, 2005

Publication date: April 6, 2006

Applicant: Microsoft Corporation

Inventors: Sing Bing Kang, Richard Szeliski, Matthew Uyttendaele
Deghosting mosaics using multiperspective plane sweep

Publication number: 20060072852

Abstract: A system and method for deghosting mosaics provides a novel multiperspective plane sweep approach for generating an image mosaic from a sequence of still images, video images, scanned photographic images, computer generated images, etc. This multiperspective plane sweep approach uses virtual camera positions to compute depth maps for columns of overlapping pixels in adjacent images. Object distortions and ghosting caused by image parallax when generating the image mosaics are then minimized by blending pixel colors, or grey values, for each computed depth to create a common composite area for each of the overlapping images. Further, the multiperspective plane sweep approach described herein is both computationally efficient, and applicable to both the case of limited overlap between the images used for creating the image mosaics, and to the case of extensive or increased image overlap.

Type: Application

Filed: November 22, 2005

Publication date: April 6, 2006

Applicant: Microsoft Corporation

Inventors: Sing Bing Kang, Richard Szeliski, Mattew Uyttendaele
System and method for extracting reflection and transparency layers from multiple images

Publication number: 20060062434

Abstract: The present invention is embodied in a system and method for extracting structure from multiple images of a scene by representing the scene as a group of image layers, including reflection and transparency layers. In general, the present invention performs layer extraction from multiple images containing reflections and transparencies. The present invention includes an optimal approach for recovering layer images and their associated motions from an arbitrary number of composite images. The present invention includes image formation equations, the constrained least squares technique used to recover the component images, a novel method to estimate upper and lower bounds on the solution using min- and max-composites, and a motion refinement method.

Type: Application

Filed: October 1, 2005

Publication date: March 23, 2006

Applicant: Mircosoft Corporation

Inventors: Richard Szeliski, Shmuel Avidan, Padmanabhan Anandan
System and process for generating a two-layer, 3D representation of a scene

Patent number: 7015926

Abstract: A system and process for generating a two-layer, 3D representation of a digital or digitized image from the image and a pixel disparity map of the image is presented. The two layer representation includes a main layer having pixels exhibiting background colors and background disparities associated with correspondingly located pixels of depth discontinuity areas in the image, as well as pixels exhibiting colors and disparities associated with correspondingly located pixels of the image not found in these depth discontinuity areas. The other layer is a boundary layer made up of pixels exhibiting foreground colors, foreground disparities and alpha values associated with the correspondingly located pixels of the depth discontinuity areas. The depth discontinuity areas correspond to prescribed sized areas surrounding depth discontinuities found in the image using a disparity map thereof.

Type: Grant

Filed: June 28, 2004

Date of Patent: March 21, 2006

Assignee: Microsoft Corporation

Inventors: Charles Lawrence Zitnick, III, Richard Szeliski, Sing Bing Kang, Matthew T. Uyttendaele, Simon Winder
System and method for extracting reflection and transparency layers from multiple images

Publication number: 20060056682

Abstract: The present invention is embodied in a system and method for extracting structure from multiple images of a scene by representing the scene as a group of image layers, including reflection and transparency layers. In general, the present invention performs layer extraction from multiple images containing reflections and transparencies. The present invention includes an optimal approach for recovering layer images and their associated motions from an arbitrary number of composite images. The present invention includes image formation equations, the constrained least squares technique used to recover the component images, a novel method to estimate upper and lower bounds on the solution using min- and max-composites, and a motion refinement method.

Type: Application

Filed: October 1, 2005

Publication date: March 16, 2006

Applicant: Microsoft Corporation

Inventors: Richard Szeliski, Shmuel Avidan, Padmanabhan Anandan
System and process for generating high dynamic range video

Patent number: 7010174

Abstract: A system and process for generating High Dynamic Range (HDR) video is presented which involves first capturing a video image sequence while varying the exposure so as to alternate between frames having a shorter and longer exposure. The exposure for each frame is set prior to it being captured as a function of the pixel brightness distribution in preceding frames. Next, for each frame of the video, the corresponding pixels between the frame under consideration and both preceding and subsequent frames are identified. For each corresponding pixel set, at least one pixel is identified as representing a trustworthy pixel. The pixel color information associated with the trustworthy pixels is then employed to compute a radiance value for each pixel set to form a radiance map. A tone mapping procedure can then be performed to convert the radiance map into an 8-bit representation of the HDR frame.

Type: Grant

Filed: October 15, 2004

Date of Patent: March 7, 2006

Assignee: Microsoft Corporation

Inventors: Sing Bing Kang, Matthew T. Uyttendaele, Simon Winder, Richard Szeliski
System and process for optimal texture map reconstruction from multiple views

Patent number: 6999095

Abstract: A system and process for reconstructing optimal texture maps from multiple views of a scene is described. In essence, this reconstruction is based on the optimal synthesis of textures from multiple sources. This is generally accomplished using basic image processing theory to derive the correct weights for blending the multiple views. Namely, the steps of reconstructing, warping, prefiltering, and resampling are followed in order to warp reference textures to a desired location, and to compute spatially-variant weights for optimal blending. These weights take into consideration the anisotropy in the texture projection and changes in sampling frequency due to foreshortening. The weights are combined and the computation of the optimal texture is treated as a restoration problem, which involves solving a linear system of equations.

Type: Grant

Filed: November 5, 2004

Date of Patent: February 14, 2006

Assignee: Microsoft Corporation

Inventors: Lifeng Wang, Sing Bing Kang, Richard Szeliski, Heung-Yeung Shum, Baining Guo
System and process for compressing and decompressing multiple, layered, video streams of a scene captured from different viewpoints forming a grid using spatial and temporal encoding

Publication number: 20060031915

Abstract: A system and process for compressing and decompressing multiple video streams depicting substantially the same dynamic scene from different viewpoints that from a grid of viewpoints. Each frame in each contemporaneous set of video frames of the multiple streams is represented by at least a two layers—a main layer and a boundary layer. Compression of the main layers involves first designating one or more of these layers in each set of contemporaneous frames as keyframes. For each set of contemporaneous frames in time sequence order, the main layer of each keyframe is compressed using an inter-frame compression technique. In addition, the main layer of each non-keyframe within the frame set under consideration is compressed using a spatial prediction compression technique. Finally, the boundary layers of each frame in the current frame set are each compressed using an intra-frame compression technique. Decompression is generally the reverse of the compression process.

Type: Application

Filed: March 31, 2005

Publication date: February 9, 2006

Applicant: Microsoft Corporation

Inventors: Simon Winder, Matthew Uyttendaele, Charles Zitnick, Richard Szeliski, Sing Kang
System and process for compressing and decompressing multiple, layered, video streams employing spatial and temporal encoding

Publication number: 20060029134

Abstract: A system and process for compressing and decompressing multiple video streams depicting substantially the same dynamic scene from different viewpoints. Each frame in each contemporaneous set of video frames of the multiple streams is represented by at least a two layers—a main layer and a boundary layer. Compression of the main layers involves first designating one or more of these layers in each set of contemporaneous frames as keyframes. For each set of contemporaneous frames in time sequence order, the main layer of each keyframe is compressed using an inter-frame compression technique. In addition, the main layer of each non-keyframe within the frame set under consideration is compressed using a spatial prediction compression technique. Finally, the boundary layers of each frame in the current frame set are each compressed using an intra-frame compression technique. Decompression is generally the reverse of the compression process.

Type: Application

Filed: August 3, 2004

Publication date: February 9, 2006

Applicant: Microsoft Corporation

Inventors: Simon Winder, Matthew Uyttendaele, Charles Zitnick, Richard Szeliski, Sing Kang
Real-time rendering system and process for interactive viewpoint video

Publication number: 20060028473

Abstract: A system and process for rendering and displaying an interactive viewpoint video is presented in which a user can watch a dynamic scene while manipulating (freezing, slowing down, or reversing) time and changing the viewpoint at will. The ability to interactively control viewpoint while watching a video is an exciting new application for image-based rendering. Because any intermediate view can be synthesized at any time, with the potential for space-time manipulation, this type of video has been dubbed interactive viewpoint video.

Type: Application

Filed: August 3, 2004

Publication date: February 9, 2006

Applicant: Microsoft Corporation

Inventors: Matthew Uyttendaele, Simon Winder, Charles Zitnick, Richard Szeliski, Sing Kang
Real-time rendering system and process for interactive viewpoint video that was generated using overlapping images of a scene captured from viewpoints forming a grid

Publication number: 20060028489

Abstract: A system and process for rendering and displaying an interactive viewpoint video is presented in which a user can watch a dynamic scene while manipulating (freezing, slowing down, or reversing) time and changing the viewpoint at will. The ability to interactively control viewpoint while watching a video is an exciting new application for image-based rendering. Because any intermediate view can be synthesized at any time, with the potential for space-time manipulation, this type of video has been dubbed interactive viewpoint video.

Type: Application

Filed: March 31, 2005

Publication date: February 9, 2006

Applicant: Microsoft Corporation

Inventors: Matthew Uyttendaele, Simon Winder, Charles Zitnick, Richard Szeliski, Sing Kang
Compressing and decompressing multiple, layered, video streams employing multi-directional spatial encoding

Publication number: 20060031917

Abstract: A process for compressing and decompressing non-keyframes in sequential sets of contemporaneous video frames making up multiple video streams where the video frames in a set depict substantially the same scene from different viewpoints. Each set of contemporaneous video frames has a plurality frames designated as keyframes with the remaining being non-keyframes. In one embodiment, the non-keyframes are compressed using a multi-directional spatial prediction technique. In another embodiment, the non-keyframes of each set of contemporaneous video frames are compressed using a combined chaining and spatial prediction compression technique. The spatial prediction compression technique employed can be a single direction technique where just one reference frame, and so one chain, is used to predict each non-keyframe, or it can be a multi-directional technique where two or more reference frames, and so chains, are used to predict each non-keyframe.

Type: Application

Filed: July 15, 2005

Publication date: February 9, 2006

Applicant: Microsoft Corporation

Inventors: Simon Winder, Matthew Uyttendaele, Charles Zitnick, Richard Szeliski, Sing Kang
Digital photography with flash/no flash extension

Publication number: 20060008171

Abstract: A system and method for improving digital flash photographs. The present invention is a technique that significantly improves low-light imaging by giving the end-user all the advantages of flash photography without producing the jarring look. The invention uses an image pair—one taken with flash the other without—to remove noise from the ambient image, sharpen the ambient image using detail from the flash image, correct for color, and remove red-eye.

Type: Application

Filed: July 6, 2004

Publication date: January 12, 2006

Applicant: Microsoft Corporation

Inventors: Georg Petschnigg, Richard Szeliski, Michael Cohen, Hugues Hoppe, Maneesh Agrawala
System and process for optimal texture map reconstruction from multiple views

Patent number: 6985156

Abstract: A system and process for reconstructing optimal texture maps from multiple views of a scene is described. In essence, this reconstruction is based on the optimal synthesis of textures from multiple sources. This is generally accomplished using basic image processing theory to derive the correct weights for blending the multiple views. Namely, the steps of reconstructing, warping, prefiltering, and resampling are followed in order to warp reference textures to a desired location, and to compute spatially-variant weights for optimal blending. These weights take into consideration the anisotropy in the texture projection and changes in sampling frequency due to foreshortening. The weights are combined and the computation of the optimal texture is treated as a restoration problem, which involves solving a linear system of equations.

Type: Grant

Filed: June 23, 2004

Date of Patent: January 10, 2006

Assignee: Microsoft Corporation

Inventors: Lifeng Wang, Sing Bing Kang, Richard Szeliski, Heung-Yeung Shum, Baining Guo
System and process for optimal texture map reconstruction from multiple views

Publication number: 20050285872

Abstract: A system and process for reconstructing optimal texture maps from multiple views of a scene is described. In essence, this reconstruction is based on the optimal synthesis of textures from multiple sources. This is generally accomplished using basic image processing theory to derive the correct weights for blending the multiple views. Namely, the steps of reconstructing, warping, prefiltering, and resampling are followed in order to warp reference textures to a desired location, and to compute spatially-variant weights for optimal blending. These weights take into consideration the anisotropy in the texture projection and changes in sampling frequency due to foreshortening. The weights are combined and the computation of the optimal texture is treated as a restoration problem, which involves solving a linear system of equations.

Type: Application

Filed: July 28, 2005

Publication date: December 29, 2005

Applicant: Microsoft Corporation

Inventors: Lifeng Wang, Sing Bing Kang, Richard Szeliski, Heung-Yeung Shum, Baining Guo
Color segmentation-based stereo 3D reconstruction system and process

Publication number: 20050286757

Abstract: A system and process for computing a 3D reconstruction of a scene from multiple images thereof, which is based on a color segmentation-based approach, is presented. First, each image is independently segmented. Second, an initial disparity space distribution (DSD) is computed for each segment, using the assumption that all pixels within a segment have the same disparity. Next, each segment's DSD is refined using neighboring segments and its projection into other images. The assumption that each segment has a single disparity is then relaxed during a disparity smoothing stage. The result is a disparity map for each image, which in turn can be used to compute a per pixel depth map if the reconstruction application calls for it.

Type: Application

Filed: June 28, 2004

Publication date: December 29, 2005

Applicant: Microsoft Corporation

Inventors: Charles Zitnick, Sing Kang, Matthew Uyttendaele, Simon Winder, Richard Szeliski

prev 1 2 3 4 5 6 7 next