Patents by Inventor Charles Lawrence Zitnick
Charles Lawrence Zitnick has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 8768067Abstract: Systems, methods and computer-readable storage media are disclosed for accelerating bitmap remoting by extracting patterns from source bitmaps. A server takes a source image, and performs an edge-detection operation on it. From this edge-detected image, connected segments of the image are determined by executing multiple iterations of a small operation upon the image in parallel—for instance, by assigning each non-white pixel a unique value, then assigning each pixel the minimum value among itself and its neighbors until no pixel is assigned a new value in an iteration. Executing these operations in parallel greatly reduces the time required to identify the connected segments. When the segments are identified, they may be cached by the client so that they do not need to be re-sent to the client when re-encountered by the server.Type: GrantFiled: April 15, 2010Date of Patent: July 1, 2014Assignee: Microsoft CorporationInventors: Nadim Y. Abdo, Voicu Anton Albu, Charles Lawrence Zitnick, III, Max Alan McMullen
-
Patent number: 8761520Abstract: Systems, methods and computer-readable storage media are disclosed for accelerating bitmap remoting by extracting non-grid tiles from source bitmaps. A server takes a source image, identifies possibly repetitive features, and tiles the image. For each tile that contains part of a possibly repetitive feature, the server replaces that part with the dominant color of the tile. The system then sends to a client a combination of new tiles and features, and indications to tiles and features that the client has previously received and stored, along with an indication of how to recreate the image based on the tiles and features.Type: GrantFiled: December 11, 2009Date of Patent: June 24, 2014Assignee: Microsoft CorporationInventors: Nadim Y. Abdo, Voicu Anton Albu, Charles Lawrence Zitnick, III
-
Patent number: 8538164Abstract: Described herein are various technologies for generating descriptors for image patches. An image patch can be received, and gradients of pixels in the image patch can be determined. The gradients are normalized based upon an average magnitude of the gradients in a local spatial region with respect to a given pixel under consideration. A four-dimensional histogram is defined that takes into consideration pixel orientation, and normalized gradients are selectively assigned to bins of the histogram. The bins are binarized as a function of a number of gradients assigned thereto, and the binarized bins can be utilized as a descriptor for the image patch.Type: GrantFiled: October 25, 2010Date of Patent: September 17, 2013Assignee: Microsoft CorporationInventor: Charles Lawrence Zitnick, III
-
Patent number: 8538077Abstract: Technology is described for detecting an interest point in an image using edges. An example method can include the operation of computing locally normalized edge magnitudes and edge orientations for the image using a processor to form a normalized gradient image. The normalized gradient image can be divided into a plurality of image orientation maps having edge orientations. Orientation dependent filtering can be applied to the image orientation maps to form response images. A further operation can be summing the response images to obtain an aggregated filter response image. Maxima can be identified in spatial position and scale in the aggregated filter response image. Maxima in the aggregated filter response image can be defined as interest points.Type: GrantFiled: May 3, 2011Date of Patent: September 17, 2013Assignee: Microsoft CorporationInventor: Charles Lawrence Zitnick, III
-
Publication number: 20130156329Abstract: The claimed subject matter provides for systems and/or methods for identification of instances of an object of interest in 2D images by creating a database of 3D curve models of each desired instance and comparing an image of an object of interest against such 3D curve models of instances. The present application describes identifying and verifying the make and model of a car from a possibly single image—after the models have been populated with training data of test images of many makes and models of cars. In one embodiment, an identification system may be constructed by generating a 3D curve model by back-projecting edge points onto a visual hull reconstruction from silhouettes of an instance. The system and methods employ chamfer distance and orientation distance provides reasonable verification performance, as well as an appearance model for the taillights of the car to increase the robustness of the system.Type: ApplicationFiled: December 16, 2011Publication date: June 20, 2013Applicant: MICROSOFT CORPORATIONInventors: Richard Stephan Szeliski, Edward Hsiao, Sudipta Narayan Sinha, Krishnan Ramnath, Charles Lawrence Zitnick, III, Simon John Baker
-
Publication number: 20130100256Abstract: Methods and systems for generating a depth map are provided. The method includes projecting an infrared (IR) dot pattern onto a scene. The method also includes capturing stereo images from each of two or more synchronized IR cameras, detecting a number of dots within the stereo images, computing a number of feature descriptors for the dots in the stereo images, and computing a disparity map between the stereo images. The method further includes generating a depth map for the scene using the disparity map.Type: ApplicationFiled: October 21, 2011Publication date: April 25, 2013Applicant: Microsoft CorporationInventors: Adam G. Kirk, Yaron Eshet, Kestutis Patiejunas, Sing Bing Kang, Charles Lawrence Zitnick, III, David Eraker, Simon Winder
-
Publication number: 20130095920Abstract: Methods and systems for generating free viewpoint video using an active infrared (IR) stereo module are provided. The method includes computing a depth map for a scene using an active IR stereo module. The depth map may be computed by projecting an IR dot pattern onto the scene, capturing stereo images from each of two or more synchronized IR cameras, detecting dots within the stereo images, computing feature descriptors corresponding to the dots in the stereo images, computing a disparity map between the stereo images, and generating the depth map using the disparity map. The method also includes generating a point cloud for the scene using the depth map, generating a mesh of the point cloud, and generating a projective texture map for the scene from the mesh of the point cloud. The method further includes generating the video for the scene using the projective texture map.Type: ApplicationFiled: October 13, 2011Publication date: April 18, 2013Applicant: Microsoft CorporationInventors: Kestutis Patiejunas, Kanchan Mitra, Patrick Sweeney, Yaron Eshet, Adam G. Kirk, Sing Bing Kang, Charles Lawrence Zitnick, III, David Eraker, David Harnett, Amit Mital, Simon Winder
-
Patent number: 8417039Abstract: Techniques are disclosed for acceleration techniques for improved image remoting. A rolling 2D hash of a first image sent to a client is computed. When the server has a second image to send to the client, it calculates a rolling 2D hash of the new image. It also calculates “pivot points” for the images based on the rolling 2D hashes. Based on the pivot points, it determines possible matching hash windows between the two images that correspond to window moves or scrolls. Where a match is confirmed, it determines whether a “larger” a larger matching rectangle exists between the two images. It then instructs the client to display the matching rectangle that exists in the first image that the client has in the appropriate location in the second image, thereby saving the bandwidth requirements to re-transmit it to the client.Type: GrantFiled: June 3, 2010Date of Patent: April 9, 2013Assignee: Microsoft CorporationInventors: Voicu Anton Albu, Nadim Y. Abdo, Charles Lawrence Zitnick, III
-
Publication number: 20120321175Abstract: A mobile device having the capability of performing real-time location recognition with assistance from a server is provided. The approximate geophysical location of the mobile device is uploaded to the server. Based on the mobile device's approximate geophysical location, the server responds by sending the mobile device a message comprising a classifier and a set of feature descriptors. This can occur before an image is captured for visual querying. The classifier and feature descriptors are computed during an offline training stage using techniques to minimize computation at query time. The classifier and feature descriptors are used to perform visual recognition in real-time by performing the classification on the mobile device itself.Type: ApplicationFiled: June 17, 2011Publication date: December 20, 2012Applicant: Microsoft CorporationInventors: Varsha Hedau, Sudipta Sinha, Charles Lawrence Zitnick, Richard Szeliski
-
Patent number: 8330802Abstract: The stereo movie editing technique described herein combines knowledge of both multi-view stereo algorithms and human depth perception. The technique creates a digital editor, specifically for stereographic cinema. The technique employs an interface that allows intuitive manipulation of the different parameters in a stereo movie setup, such as camera locations and screen position. Using the technique it is possible to reduce or enhance well-known stereo movie effects such as cardboarding and miniaturization. The technique also provides new editing techniques such as directing the user's attention and easier transitions between scenes.Type: GrantFiled: December 9, 2008Date of Patent: December 11, 2012Assignee: Microsoft Corp.Inventors: Sanjeev J. Koppal, Sing Bing Kang, Charles Lawrence Zitnick, III, Michael F. Cohen, Bryan Kent Ressler
-
Publication number: 20120295231Abstract: Architecture that guides the freeform drawing of objects by a user to enable the user to produce improved drawings without significant training. As the user draws, the architecture dynamically updates a relevant shadow image proximate (e.g., underlying) the user's strokes. The strokes overlay an evolving shadow image, which shadow image is suggestive of object contours that guide the user during the drawing process. Relevant edge images selected from a large database are automatically blended to construct the shadow image. As the user draws, the strokes are dynamically analyzed using an encoding of overlapping windows for fast matching with the database of images. A top ranked set of matching database edge images are aligned to the drawing, a set of spatially varying weights blend the edge images into the shadow image, and a scoring technique is employed to select the optimum shadow image for display.Type: ApplicationFiled: May 19, 2011Publication date: November 22, 2012Applicant: MICROSOFT CORPORATIONInventors: Charles Lawrence Zitnick, III, Yong Jae Lee, Michael Cohen
-
Publication number: 20120281872Abstract: Technology is described for detecting an interest point in an image using edges. An example method can include the operation of computing locally normalized edge magnitudes and edge orientations for the image using a processor to form a normalized gradient image. The normalized gradient image can be divided into a plurality of image orientation maps having edge orientations. Orientation dependent filtering can be applied to the image orientation maps to form response images. A further operation can be summing the response images to obtain an aggregated filter response image. Maxima can be identified in spatial position and scale in the aggregated filter response image. Maxima in the aggregated filter response image can be defined as interest points.Type: ApplicationFiled: May 3, 2011Publication date: November 8, 2012Applicant: Microsoft CorporationInventor: Charles Lawrence Zitnick, III
-
Publication number: 20120257116Abstract: Video signatures are generated when a small change in video information between consecutive frames N?1 and N is followed by a large change in video information between consecutive frames N and N+1. Information from frames N and/or N+1 is used to form a video signature.Type: ApplicationFiled: April 5, 2011Publication date: October 11, 2012Applicant: MICROSOFT CORPORATIONInventors: Seth Hendrickson, George Randell Dong, Saswata Mandal, Charles Lawrence Zitnick, III
-
Patent number: 8184913Abstract: Described is a technology in which video shots are clustered based upon the location at which the shots were captured. A global energy function is optimized, including a first term that computes clusters so as to be reasonably dense and well connected, to match the possible shots that are captured at a location, e.g., based on similarity scores between pairs of shots. A second term is a temporal prior that encourages subsequent shots to be placed in the same cluster. The shots may be represented as nodes of a minimum spanning tree having edges with weights that are based on the similarity score between the shots represented by their respective nodes. Agglomerative clustering is performed by selecting pairs of available clusters, merging the pairs and keeping the pair with the lowest cost. Clusters are iteratively merged until a stopping criterion or criteria is met (e.g., only a single cluster remains).Type: GrantFiled: April 1, 2009Date of Patent: May 22, 2012Assignee: Microsoft CorporationInventors: Simon J. Baker, Charles Lawrence Zitnick, III, Gerhard Florian Schroff
-
Publication number: 20120114037Abstract: A process for compressing and decompressing non-keyframes in sequential sets of contemporaneous video frames making up multiple video streams where the video frames in a set depict substantially the same scene from different viewpoints. Each set of contemporaneous video frames has a plurality frames designated as keyframes with the remaining being non-keyframes. In one embodiment, the non-keyframes are compressed using a multi-directional spatial prediction technique. In another embodiment, the non-keyframes of each set of contemporaneous video frames are compressed using a combined chaining and spatial prediction compression technique. The spatial prediction compression technique employed can be a single direction technique where just one reference frame, and so one chain, is used to predict each non-keyframe, or it can be a multi-directional technique where two or more reference frames, and so chains, are used to predict each non-keyframe.Type: ApplicationFiled: January 11, 2012Publication date: May 10, 2012Applicant: MICROSOFT CORPORATIONInventors: Simon Winder, Matthew T. Uyttendaele, Charles Lawrence Zitnick, III, Richard Szeliski, Sing Bing Kang
-
Publication number: 20120099796Abstract: Described herein are various technologies for generating descriptors for image patches. An image patch can be received, and gradients of pixels in the image patch can be determined. The gradients are normalized based upon an average magnitude of the gradients in a local spatial region with respect to a given pixel under consideration. A four-dimensional histogram is defined that takes into consideration pixel orientation, and normalized gradients are selectively assigned to bins of the histogram. The bins are binarized as a function of a number of gradients assigned thereto, and the binarized bins can be utilized as a descriptor for the image patch.Type: ApplicationFiled: October 25, 2010Publication date: April 26, 2012Applicant: Microsoft CorporationInventor: Charles Lawrence Zitnick, III
-
Patent number: 8139886Abstract: A two-dimensional blur kernel is computed for a digital image by first estimating a sharp image from the digital image. The sharp image is derived from the digital image by sharpening at least portions of the digital image. The two-dimensional blur function is computed by minimizing an optimization algorithm that estimates the blur function.Type: GrantFiled: June 23, 2008Date of Patent: March 20, 2012Assignee: Microsoft CorporationInventors: Richard Szeliski, Neel Joshi, Charles Lawrence Zitnick
-
Publication number: 20110299785Abstract: Techniques are disclosed for acceleration techniques for improved image remoting. A rolling 2D hash of a first image sent to a client is computed. When the server has a second image to send to the client, it calculates a rolling 2D hash of the new image. It also calculates “pivot points” for the images based on the rolling 2D hashes. Based on the pivot points, it determines possible matching hash windows between the two images that correspond to window moves or scrolls. Where a match is confirmed, it determines whether a “larger” a larger matching rectangle exists between the two images. It then instructs the client to display the matching rectangle that exists in the first image that the client has in the appropriate location in the second image, thereby saving the bandwidth requirements to re-transmit it to the client.Type: ApplicationFiled: June 3, 2010Publication date: December 8, 2011Applicant: Microsoft CorporationInventors: Voicu Anton Albu, Nadim Y. Abdo, Charles Lawrence Zitnick, III
-
Publication number: 20110255791Abstract: Systems, methods and computer-readable storage media are disclosed for accelerating bitmap remoting by extracting patterns from source bitmaps. A server takes a source image, and performs an edge-detection operation on it. From this edge-detected image, connected segments of the image are determined by executing multiple iterations of a small operation upon the image in parallel—for instance, by assigning each non-white pixel a unique value, then assigning each pixel the minimum value among itself and its neighbors until no pixel is assigned a new value in an iteration. Executing these operations in parallel greatly reduces the time required to identify the connected segments. When the segments are identified, they may be cached by the client so that they do not need to be re-sent to the client when re-encountered by the server.Type: ApplicationFiled: April 15, 2010Publication date: October 20, 2011Applicant: Microsoft CorporationInventors: Nadim Y. Abdo, Voicu Anton Albu, Charles Lawrence Zitnick, III, Max Alan McMullen
-
Publication number: 20110142334Abstract: Systems, methods and computer-readable storage media are disclosed for accelerating bitmap remoting by extracting non-grid tiles from source bitmaps. A server takes a source image, identifies possibly repetitive features, and tiles the image. For each tile that contains part of a possibly repetitive feature, the server replaces that part with the dominant color of the tile. The system then sends to a client a combination of new tiles and features, and indications to tiles and features that the client has previously received and stored, along with an indication of how to recreate the image based on the tiles and features.Type: ApplicationFiled: December 11, 2009Publication date: June 16, 2011Applicant: Microsoft CorporationInventors: Nadim Y. Abdo, Voicu Anton Albu, Charles Lawrence Zitnick, III