Systems and methods for synthesizing higher resolution images using a set of images containing a baseline image
Systems and methods for implementing array cameras configured to perform super-resolution processing to generate higher resolution super-resolved images using a plurality of captured images and lens stack arrays that can be utilized in array cameras are disclosed. An imaging device in accordance with one embodiment of the invention includes at least one imager array, and each imager in the array comprises a plurality of light sensing elements and a lens stack including at least one lens surface, where the lens stack is configured to form an image on the light sensing elements, control circuitry configured to capture images formed on the light sensing elements of each of the imagers, and a super-resolution processing module configured to generate at least one higher resolution super-resolved image using a plurality of the captured images.
Latest Pelican Imaging Corporation Patents:
- Extended Color Processing on Pelican Array Cameras
- Systems and Methods for Synthesizing Images from Image Data Captured by an Array Camera Using Restricted Depth of Field Depth Maps in which Depth Estimation Precision Varies
- Thin Form Factor Computational Array Cameras and Modular Array Cameras
- Array Camera Configurations Incorporating Constituent Array Cameras and Constituent Cameras
- Array Cameras Incorporating Independently Aligned Lens Stacks
This application is a continuation of U.S. Non-Provisional patent application Ser. No. 12/952,134, entitled Capturing and Processing of Images Using Monolithic Camera Array with Heterogeneous Imagers, filed Nov. 22, 2010, which claims priority to U.S. Provisional Patent Application 61/281,662 filed Nov. 20, 2009 and U.S. Provisional Patent Application 61/263,339 filed Nov. 20, 2009, the disclosure of which is hereby incorporated by reference in its entirety. U.S. Non-Provisional patent application Ser. No. 12/952,134 also claims priority as a Continuation-In-Part application to U.S. patent application Ser. No. 12/935,504, which is a national stage application of Patent Cooperation Treaty Application PCT/US2009/044687 filed May 20, 2009, and claims priority to U.S. Provisional Patent Application 61/054,694, filed May 20, 2008, the disclosure of which is hereby incorporated by reference in its entirety.
FIELD OF THE INVENTIONThe present invention is related to an image sensor including a plurality of heterogeneous imagers, more specifically to an image sensor with a plurality of wafer-level imagers having custom filters, sensors and optics of varying configurations.
BACKGROUNDImage sensors are used in cameras and other imaging devices to capture images. In a typical imaging device, light enters through an opening (aperture) at one end of the imaging device and is directed to an image sensor by an optical element such as a lens. In most imaging devices, one or more layers of optical elements are placed between the aperture and the image sensor to focus light onto the image sensor. The image sensor consists of pixels that generate signals upon receiving light via the optical element. Commonly used image sensors include CCD (charge-coupled device) image sensors and CMOS (complementary metal-oxide-semiconductor) sensors.
Filters are often employed in the image sensor to selectively transmit lights of certain wavelengths onto pixels. A Bayer filter mosaic is often formed on the image sensor. The Bayer filter is a color filter array that arranges one of the RGB color filters on each of the color pixels. The Bayer filter pattern includes 50% green filters, 25% red filters and 25% blue filters. Since each pixel generates a signal representing strength of a color component in the light and not the full range of colors, demosaicing is performed to interpolate a set of red, green and blue values for each image pixel.
The image sensors are subject to various performance constraints. The performance constraints for the image sensors include, among others, dynamic range, signal to noise (SNR) ratio and low light sensitivity. The dynamic range is defined as the ratio of the maximum possible signal that can be captured by a pixel to the total noise signal. Typically, the well capacity of an image sensor limits the maximum possible signal that can be captured by the image sensor. The maximum possible signal in turn is dependent on the strength of the incident illumination and the duration of exposure (e.g., integration time, and shutter width). The dynamic range can be expressed as a dimensionless quantity in decibels (dB) as:
Typically, the noise level in the captured image influences the floor of the dynamic range. Thus, for an 8 bit image, the best case would be 48 dB assuming the RMS noise level is 1 bit. In reality, however, the RMS noise levels are higher than 1 bit, and this further reduces the dynamic range.
The signal to noise ratio (SNR) of a captured image is, to a great extent, a measure of image quality. In general, as more light is captured by the pixel, the higher the SNR. The SNR of a captured image is usually related to the light gathering capability of the pixel.
Generally, Bayer filter sensors have low light sensitivity. At low light levels, each pixel's light gathering capability is constrained by the low signal levels incident upon each pixel. In addition, the color filters over the pixel further constrain the signal reaching the pixel. IR (Infrared) filters also reduce the photo-response from near-IR signals, which can carry valuable information.
These performance constraints of image sensors are greatly magnified in cameras designed for mobile systems due to the nature of design constraints. Pixels for mobile cameras are typically much smaller than the pixels of digital still cameras (DSC). Due to limits in light gathering ability, reduced SNR, limits in the dynamic range, and reduced sensitivity to low light scenes, the cameras in mobile cameras show poor performance.
SUMMARYCamera arrays, imaging devices including a camera array and/or a method for capturing an image that employ a plurality of imagers where each imager includes a plurality of sensor elements, and lens stack arrays that can be utilized in camera arrays in accordance with embodiments of the invention are disclosed. The plurality of imagers can include at least a first imager and a second imager, where the first imager and the second imager may have the same imaging characteristics or different imaging characteristics.
In one embodiment, the first imaging characteristics and the second imager have different imaging characteristics. The imaging characteristics may include, among others, the size of the imager, the type of pixels included in the imager, the shape of the imager, filters associated with the imager, exposure time of the imager, aperture size associated with the imager, the configuration of the optical element associated with the imager, gain of the imager, the resolution of the imager, and operational timing of the imager.
In one embodiment, the first imager includes a filter for transmitting a light spectrum. The second imager also includes the same type of filter for transmitting the same light spectrum as the first imager but captures an image that is sub-pixel phase shifted from an image captured by the first imager. The images from the first imager and the second imager are combined using a super-resolution process to obtain images of higher resolution.
In one embodiment, the first imager includes a first filter for transmitting a first light spectrum and the second imager includes a second filter for transmitting a second light spectrum. The images from the first and second imagers are then processed to obtain a higher quality image.
In one embodiment, lens elements are provided to direct and focus light onto the imagers. The lens elements form lens stacks that create optical channels, and each lens stack focuses light onto one imager. Because each lens element is associated with one imager, each lens element may be designed and configured for a narrow light spectrum. Further, the thickness of the lens element may be reduced, decreasing the overall thickness of the camera array. In such an embodiment, the lens elements may be made using any suitable fabrication technique, such as, for example using wafer level optics (WLO) technology, injection molding, and/or glass molding.
In one embodiment, the plurality of imagers include at least one near-IR imager dedicated to receiving near-IR (Infrared) spectrum. An image generated from the near-IR imager may be fused with images generated from other imagers with color filters to reduce noise and increase the quality of the images. In another such embodiment, imagers covering other spectrum ranges including far-IR and UV spectra may be included as well.
In one embodiment, the plurality of imagers may be associated with lens elements that provide a zooming capability. In one such embodiment, different imagers may be associated with lenses of different focal lengths to have different fields-of-views and provide different levels of zooming capability. Different fields-of-view can also be achieved using imagers having different sensor sizes/formats by means of different pixel sizes or different numbers of pixels/light sensing elements. A mechanism may be provided to provide smooth transition from one zoom level to another zoom level.
In one or more embodiments, the plurality of imagers is coordinated and operated to obtain at least one of a high dynamic range image, a panoramic image, a hyper-spectral image, distance to an object and a high frame rate video.
An imaging device in accordance with one embodiment of the invention includes at least one imager array, and each imager in the array comprises a plurality of light sensing elements and a lens stack including at least one lens surface, where the lens stack is configured to form an image on the light sensing elements, control circuitry configured to capture images formed on the light sensing elements of each of the imagers, and a super-resolution processing module configured to generate at least one higher resolution super-resolved image using a plurality of the captured images.
In a further embodiment of the invention, a spectral filter is provided within each imager and is configured to pass a specific spectral band of light, and spectral filters that pass different spectral bands are provided within at least two of the imagers.
In another embodiment of the invention, the spectral filter is selected from the group consisting of an organic color filter, an absorptive material, a dielectric coating, an interference filter, a multilayer coating, and combinations thereof.
In a still further embodiment of the invention, at least one of the imagers further comprises a polarizing filter.
In still another embodiment of the invention, the construction of the lens stack within an imager differs based upon the specific spectral band of light passed by the spectral filter within the imager such that chromatic aberrations are reduced.
In a yet further embodiment of the invention, the prescription of at least one surface of the lens stack within an imager is a function of the specific spectral band of light passed by the spectral filter within the imager.
In yet another embodiment of the invention, the at least one surface of the lens stack is configured so that the back focal length of each lens stack in the imager array are the same independent of the spectral band.
In a further embodiment again of the invention, a combination of high and low Abbe number materials is used in the construction of each of the lens stacks to reduce chromatic aberrations.
In another embodiment again of the invention, each lens stack includes at least one aperture stop positioned in front of the light sensing element, and the spectral filter is positioned in proximity to the aperture stop.
In a further additional embodiment of the invention, the at least one aperture stop is formed by a light blocking material selected from the group consisting of metal materials, oxide materials, black particle filled photoresists and combinations thereof.
In another additional embodiment of the invention, at least one lens surface in the optical stack of an imager differs based upon the specific spectral band of light passed by the spectral filter within the imager, and each lens surface is selected from the group consisting of diffractive, Fresnel, refractive and combinations thereof.
In a still yet further embodiment of the invention, the radii of curvature of the lens surfaces differ based upon the specific spectral band of light passed by the spectral filter within the imager.
In still yet another embodiment of the invention, the lens stacks of each of the imagers have the same back focal length.
In a still further embodiment again of the invention, the light sensing elements are disposed to detect an image in a backside imaging mode.
In still another embodiment again of the invention, a separate negative lens element is disposed in close proximity to the light sensing elements of an imager and in optical alignment with the lens stack of the imager so that the field curvature of the lens stack is corrected.
In a still further additional embodiment of the invention, a separate negative lens element is disposed in close proximity to the light sensing elements of an imager and in optical alignment with the lens stack of the imager so that the field curvature of the lens stack is corrected.
In still another additional embodiment of the invention, at least two of the imagers in the imager array have different fields of view so that different scene sizes can be captured by the imagers.
In a yet further embodiment again of the invention, the lens stacks of the at least two imagers have different focal lengths.
In yet another embodiment again of the invention, the light sensing elements in each of at least two imagers have different sizes.
In a yet further additional embodiment of the invention, the at least two imagers include different numbers of light sensing elements.
Yet another additional embodiment of the invention also includes a mechanical zoom mechanism within the lens stack of an imager for smoothly transitioning between different fields of view.
In a further additional embodiment of the invention again, at least one of the imagers is designed to image light in the near-infrared spectrum.
In another additional embodiment of the invention again, at least one imager is configured to take a measurement of the visible light spectrum, and wherein the control circuitry is configured to use said measurement to control the imaging characteristics of the other imagers in said imaging device.
In a still yet further additional embodiment of the invention, the imaging device includes at least two camera arrays.
In still yet another additional embodiment of the invention, the at least two camera arrays operate in conjunction during imaging to provide stereo images.
In a still yet further embodiment of the invention again, the at least two camera arrays are spatially separated one from the other in at least one plane.
In still yet another embodiment of the invention again, each of the camera arrays include a different combination of imagers each combination of imagers having different imaging characteristics.
In a yet further additional embodiment of the invention again, at least one camera array comprises an N×M array of imagers, where at least one of N and M is greater than 2, and each of said imagers comprising at least two light sensing elements.
In yet another additional embodiment of the invention again, the control circuitry is configured to control the imaging characteristics of each imager.
In another further embodiment of the invention, the controllable imaging characteristics include exposure time, gain, and black level offset.
In still another further embodiment of the invention, the control circuitry is configured to trigger each of said imagers in accordance with a desired sequence.
In yet another further embodiment of the invention, the control circuitry operates the imagers in a staggered sequence.
In another further additional embodiment of the invention, an initial master setting for the imaging characteristics of each imager is set by the control circuitry, predefined deviations from the initial master settings are also set by the control circuitry, and the deviations include functions selected from the group consisting of ISO settings, high dynamic range, gain settings, integration time settings, exposure settings, digital processing settings and combinations thereof.
In another further embodiment of the invention again, the camera arrays are fabricated using techniques consisting of wafer level optics techniques, injection molding, glass molding and combinations thereof.
In still yet another further embodiment of the invention, each of the plurality of imagers are optically separated from each other.
In still another further additional embodiment of the invention, each of the imagers is optically separated from the other imagers by at least two opaque surfaces located in front of the light sensing elements and having openings arranged in axial alignment with the lens stack of the imager.
In still another further embodiment of the invention again, the imagers are optically separated from each other by opaque walls disposed at the boundaries between the imagers.
In yet another further additional embodiment of the invention, the opaque walls are cavities between the imagers filled with light blocking material.
In yet another further embodiment of the invention again, the lens stack includes spacers, and the spacers are constructed from an opaque material.
In another further additional embodiment of the invention again, the lens stack includes spacers, and the spacers are coated with an opaque material.
In still yet another further additional embodiment of the invention, the control circuitry is configured to perform normalization of a captured low resolution image using a one dimensional space filling curve to describe a normalization plane.
In still yet another further embodiment of the invention again, the one dimensional space filling curve is expressed as the coefficients of a polynomial.
In yet another further additional embodiment of the invention again, the control circuitry is configured to capture multiple low resolution images using different imagers having different exposure times.
In still yet another further additional embodiment of the invention again, the light sensing elements of at least one of the imagers used to capture an image are high dynamic range light sensing elements.
In an additional embodiment of the invention, at least one of the imagers used to capture a low resolution image is configured to capture a near-IR low resolution image.
In an additional embodiment of the invention again, the super-resolution module is configured to generate a depth map for the super-resolution image.
In a number of embodiments of the invention, the super-resolution module is configured to select at least one distance as a focal plane and to apply blurring to pixels with depths that are not proximate a focal plane.
In a further embodiment of the invention, the super-resolution processing module is configured to perform temperature normalization with respect to the captured images.
In a still further embodiment of the invention, the point spread functions of the lens stacks vary with temperature, and the super-resolution processing module is configured to perform temperature normalization by selecting a point spread functions from temperature calibration data corresponding to the point spread functions of the lens stacks during the capture of the images.
In still another embodiment of the invention, the super-resolution processing module is configured to computationally correct color differences between the captured images.
In a yet further embodiment of the invention, the super-resolution processing module is configured to computationally correct geometric distortion differences between the captured images.
The features and advantages described in the specification are not all inclusive and, in particular, many additional features and advantages will be apparent to one of ordinary skill in the art in view of the drawings, specification, and claims. Moreover, it should be noted that the language used in the specification has been principally selected for readability and instructional purposes, and may not have been selected to delineate or circumscribe the inventive subject matter.
Embodiments of the invention are now described with reference to the figures where like reference numbers indicate identical or functionally similar elements. Also in the figures, the left most digits of each reference number corresponds to the figure in which the reference number is first used.
Many embodiments relate to using a distributed approach to capturing images using a plurality of imagers of different imaging characteristics. Each imager may be configured in such a manner that each imager captures an image that is shifted by a sub-pixel amount with respect to the image captured by other imagers having similar imaging characteristics. Each imager may also include separate optics with different filters and operate with different operating parameters (e.g., exposure time). Distinct images generated by the imagers are processed to obtain an enhanced image. In many embodiments, the separate optics incorporated into each imager are implemented using a lens stack array. The lens stack array can include one or more optical elements fabricated using wafer level optics (WLO) technology.
A sensor element or pixel refers to an individual light sensing element in an imager. The light sensing element can be, but is not limited to, traditional CIS (CMOS Image Sensor), CCD (charge-coupled device), high dynamic range pixel, multispectral pixel and various alternatives thereof.
A sensor refers to a two dimensional array of pixels used to capture an image formed on the sensor by the optics of the imager. The sensor elements of each sensor have similar physical properties and receive light through the same optical component. Further, the sensor elements in the each sensor may be associated with the same color filter.
A camera array refers to a collection of imagers designed to function as a unitary component. The camera array may be fabricated on a single chip for mounting or installing in various devices.
An array of camera arrays refers to an aggregation of two or more camera arrays. Two or more camera arrays may operate in conjunction to provide extended functionality over a single camera array, such as, for example, stereo resolution.
Image characteristics of an imager refer to any characteristics or parameters of the imager associated with capturing of images. The imaging characteristics may include, among others, the size of the imager, the type of pixels included in the imager, the shape of the imager, filters associated with the imager, the exposure time of the imager, aperture size associated with the imager, the configuration of the optical element associated with the imager (such as the number of elements, the shapes, profiles and sizes of the lens surfaces, including the radii of curvature, aspheric coefficients, focal lengths and FOVs of the objectives, color correction, F/#s, etc.), the gain of the imager, the resolution of the imager, and operational timing of the imager.
Structure of Camera Array
The camera array may include two or more types of heterogeneous imagers, each imager including two or more sensor elements or pixels. Each one of the imagers may have different imaging characteristics. Alternatively, there may be two or more different types of imagers where the same type of imager shares the same imaging characteristics.
In one embodiment, each imager 1A through NM has its own filter and/or optical element (e.g., lens). Specifically, each of the imagers 1A through NM or a group of imagers may be associated with spectral color filters to receive certain wavelengths of light. Example filters include a traditional filter used in the Bayer pattern (R, G, B or their complements C, M, Y), an IR-cut filter, a near-IR filter, a polarizing filter, and a custom filter to suit the needs of hyper-spectral imaging. Some imagers may have no filter to allow reception of both the entire visible spectra and near-IR, which increases the imager's signal-to-noise ratio. The number of distinct filters may be as large as the number of imagers in the camera array. Further, each of the imagers 1A through NM or a group of imagers may receive light through lenses having different optical characteristics (e.g., focal lengths) or apertures of different sizes.
In one embodiment, the camera array includes other related circuitry. The other circuitry may include, among others, circuitry to control imaging parameters and sensors to sense physical parameters. The control circuitry may control imaging parameters such as exposure times, gain, and black level offset. The sensor may include dark pixels to estimate dark current at the operating temperature. The dark current may be measured for on-the-fly compensation for any thermal creep that the substrate may suffer from. Alternatively, compensation of thermal effects associated with the optics, e.g., because of changes in refractive index of the lens material, may be accomplished by calibrating the PSF for different temperatures.
In one embodiment, the circuit for controlling imaging parameters may trigger each imager independently or in a synchronized manner. The start of the exposure periods for the various imagers in the camera array (analogous to opening a shutter) may be staggered in an overlapping manner so that the scenes are sampled sequentially while having several imagers being exposed to light at the same time. In a conventional video camera sampling a scene at N exposures per second, the exposure time per sample is limited to 1/N seconds. With a plurality of imagers, there is no such limit to the exposure time per sample because multiple imagers may be operated to capture images in a staggered manner.
Each imager can be operated independently. Entire or most operations associated with each individual imager may be individualized. In one embodiment, a master setting is programmed and deviation (i.e., offset or gain) from such master setting is configured for each imager. The deviations may reflect functions such as high dynamic range, gain settings, integration time settings, digital processing settings or combinations thereof. These deviations can be specified at a low level (e.g., deviation in the gain) or at a higher level (e.g., difference in the ISO number, which is then automatically translated to deltas for gain, integration time, or otherwise as specified by context/master control registers) for the particular camera array. By setting the master values and deviations from the master values, higher levels of control abstraction can be achieved to facilitate a simpler programming model for many operations. In one embodiment, the parameters for the imagers are arbitrarily fixed for a target application. In another embodiment, the parameters are configured to allow a high degree of flexibility and programmability.
In one embodiment, the camera array is designed as a drop-in replacement for existing camera image sensors used in cell phones and other mobile devices. For this purpose, the camera array may be designed to be physically compatible with conventional image sensors of approximately the same resolution although the achieved resolution of the camera array may exceed conventional image sensors in many photographic situations. Taking advantage of the increased performance, the camera array in accordance with embodiments of the invention may include fewer pixels to obtain equal or better quality images compared to conventional image sensors. Alternatively, the size of the pixels in the imager may be reduced compared to pixels in conventional image sensors while achieving comparable results.
In order to match the raw pixel count of a conventional image sensor without increasing silicon area, the logic overhead for the individual imagers is preferably constrained in the silicon area. In one embodiment, much of the pixel control logic is a single collection of functions common to all or most of the imagers with a smaller set of functions applicable to each imager. In this embodiment, the conventional external interface for the imager may be used because the data output does not increase significantly for the imagers.
In one embodiment, the camera array including the imagers replaces a conventional image sensor of M megapixels. The camera array includes N×N imagers, each sensor including pixels of
Each imager in the camera array also has the same aspect ratio as the conventional image sensor being replaced. Table 1 lists example configurations of camera arrays according to the present invention replacing conventional image sensor.
The Super-Resolution Factors in Table 1 are estimates and the Effective Resolution values may differ based on the actual Super-Resolution factors achieved by processing.
The number of imagers in the camera array may be determined based on, among other factors, (i) resolution, (ii) parallax, (iii) sensitivity, and (iv) dynamic range. A first factor for the size of imager is the resolution. From a resolution point of view, the preferred number of the imagers ranges from 2×2 to 6×6 because an array size of larger than 6×6 is likely to destroy frequency information that cannot be recreated by the super-resolution process. For example, 8 Megapixel resolution with 2×2 imager will require each imager to have 2 Megapixels. Similarly, 8 Megapixel resolution with a 5×5 array will require each imager to have 0.32 Megapixels. In many embodiments, the number of imagers in the array is determined based upon the requirements of a specific application.
A second factor that may constrain the number of imagers is the issue of parallax and occlusion. With respect to an object captured in an image, the portion of the background scene that is occluded from the view of the imager can be referred to as the “occlusion set.” When two imagers capture the object from two different locations, the occlusion set of each imager is different. Hence, there may be scene pixels captured by one imager but not the other. To resolve this issue of occlusion, it is desirable to include a certain minimal set of imagers for a given type of imager and to distribute the imagers symmetrically around the central axis of the camera array.
A third factor that may put a lower bound on the number of imagers is the issue of sensitivity in low light conditions. To improve low light sensitivity, imagers for detecting near-IR spectrum may be needed. The number of imagers in the camera array may need to be increased to accommodate such near-IR imagers.
A fourth factor in determining the size of the imager is dynamic range. To provide dynamic range in the camera array, it is advantageous to provide several imagers of the same filter type (chroma or luma). Each imager of the same filter type may then be operated with different exposures simultaneously. The images captured with different exposures may be processed to generate a high dynamic range image.
Based on these factors, the preferred number of imagers is 2×2 to 6×6. 4×4 and 5×5 configurations are more preferable than 2×2 and 3×3 configurations because the former are likely to provide sufficient number of imagers to resolve occlusion issues, increase sensitivity and increase the dynamic range. In addition, rectangular arrays can also be preferred. At the same time, the computational load required to recover resolution from these array sizes will be modest in comparison to that required in the 6×6 array. Arrays larger than 6×6 may, however, be used to provide additional features such as optical zooming and multispectral imaging. Although only square imagers are described here, as will be discussed in greater detail later, such imagers may have different x- and y-dimensions.
Another consideration is the number of imagers dedicated to luma sampling. By ensuring that the imagers in the array dedicated to near-IR sampling do not reduce the achieved resolution, the information from the near-IR images is added to the resolution captured by the luma imagers. For this purpose, at least 50% of the imagers may be used for sampling the luma and/or near-IR spectra. In one embodiment with 4×4 imagers, 4 imagers samples luma, 4 imagers samples near-IR, and the remaining 8 imagers sample two chroma (Red and Blue). In another embodiment with 5×5 imagers, 9 imagers sample luma, 8 imagers sample near-IR, and the remaining 8 imagers sample two chroma (Red and Blue). Further, the imagers with these filters may be arranged symmetrically within the camera array to address occlusion due to parallax. In a further embodiment with 5×5 imager, 17 imagers sample luma, 4 images sample Red, and 4 imagers sample Blue.
In one embodiment, the imagers in the camera array are spatially separated from each other by a predetermined distance. By increasing the spatial separation, the parallax between the images captured by the imagers may be increased. The increased parallax is advantageous where more accurate distance information is important. Separation between two imagers may also be increased to approximate the separation of a pair of human eyes. By approximating the separation of human eyes, a realistic stereoscopic 3D image may be provided to present the resulting image on an appropriate 3D display device.
In one embodiment, multiple camera arrays are provided at different locations on a device to overcome space constraints. One camera array may be designed to fit within a restricted space while another camera array may be placed in another restricted space of the device. For example, if a total of 20 imagers are required but the available space allows only a camera array of 1×10 imagers to be provided on either side of a device, two camera arrays each including 10 imagers may be placed on available space at both sides of the device. Each camera array may be fabricated on a substrate and be secured to a motherboard or other parts of a device. In addition, such imagers do not have to be homogenous in size, and may have different x- and y-dimensions. The images collected from multiple camera arrays may be processed to generate images of desired resolution and performance.
A design for a single imager may be applied to different camera arrays each including other types of imagers. Other variables in the camera array such as spatial distances, color filters and combination with the same or other sensors may be modified to produce a camera array with differing imaging characteristics. In this way, a diverse mix of camera arrays may be produced while maintaining the benefits from economies of scale.
Wafer Level Optics Integration
In one embodiment, the camera array employs wafer level optics (WLO) technology. Although in many embodiments, similar optical channels can be constructed using any of a variety of techniques including but not limited to injection molding, glass molding, and/or combinations of these techniques with other techniques including WLO techniques. WLO itself is a technology that encompasses a number of processes, including, for example, molding optics (such as arrays of lens modules and arrays of those lens arrays) on glass wafers, stacking of those wafers (including wafers having lenses replicated on either side of the substrate) with appropriate spacers, at either a wafer or die-level, followed by packaging of the optics directly with the imager into a monolithic integrated module.
The WLO procedure may involve, among other procedures, using a diamond-turned mold to create each polymer lens element on a glass substrate. More specifically, the process chain in WLO generally includes producing a diamond turned lens master (both on an individual and array level), then producing a negative mold for replication of that master (also called a stamp or tool), and then finally forming a polymer replica on a glass substrate, which has been structured with appropriate supporting optical elements, such as, for example, apertures, light blocking materials, filters, etc.
In the embodiment of
Optical Crosstalk Suppression
As discussed above, the camera array assembly 250 is composed of multiple imagers, each of which, as shown in
The optical crosstalk suppressing vertical opaque walls may be made using any suitable technique that provides for the introduction of an opaque surface or material between the optical channels 284 of the camera array assembly 286. In one embodiment, the vertical opaque walls are formed by fully or partially introducing grooves into the lens stack 288 of the camera array assembly 286. It is preferable not to cut the grooves fully through the lens stack to preserve the mechanical integrity of the camera array assembly. Such grooves may be introduced by any suitable technique, such as, for example, by dicing into the front or backside of the lens array stack 286 using a wafer dicer (disk/blade), or by laser cutting, or water-jet cutting. Once the grooves are formed, they are filled with a light blocking material. Alternatively, the inner side walls of the grooves may be coated with a light blocking material and the remainder of the groove filled with another material with low shrinkage properties. As discussed above, a light blocking material is any opaque material, such as, for example, a metal material, a metal oxide, dark silicon, or a black particle filled photoresist like a black matrix polymer.
In another embodiment, shown schematically in
In a further embodiment shown schematically in
Lens Properties
Specifically, one of the issues raised in camera design is how to correct for field curvature. An image projected through a lens is not planar, but has an inherently curved surface. One way to correct this field curvature is to position a thick negative lens element 312 close to or directly on the imager surface 314. The negative lens element planarizes the various angled beams of light 316 from the image, thereby addressing the field curvature problem. Such field flattened images provide superior image performance, allow for the manufacture of array cameras with relaxed TTL requirements, and deliver very homogeneous MTF. However, one problem with this approach is that this field flattening approach intrinsically requires a high CRA. This makes the technique unsuitable for most cameras; however, camera arrays in accordance with embodiments of the invention allow for the use of backside imaging (BSI). Positioning the image sensor behind the substrate relaxes the CRA angle requirement, thereby allowing for the use of the negative lens element field flattening approach shown in
Another advantage of the array camera relates to chromatic aberrations. Specifically, in a conventional polychromatic lens, the lens has to be corrected for chromatic aberrations, because the focal length through the lens is different for different wavelengths of light. As a result, it is necessary to compromise the performance of the lens for some of the color wavelengths to get acceptable overall color performance. By making each optical channel narrow spectral band, color aberration is reduced and/or prevented, and each lens may be optimized to a specific color wavelength. For example, an imager receiving visible or near-IR spectrum may have a lens element specifically optimized for this spectral band of light. For imagers detecting other light spectrum, the lens element may be constructed with different properties, such as radii of curvature, so that a constant focal length across all wavelengths of light is achieved so that, in turn, the focal plane is the same for different spectral bands of light. The matching of the focal plane across different wavelengths of light increases the sharpness of image captured at the imager and reduces longitudinal chromatic aberration. Because each lens element may be designed to direct a narrow band of light, the concomitant lack of color aberration means that the lens elements can be subject to less rigorous design constraints, yet produce better or equivalent performance compared to a conventional lens element covering a wide light spectrum. In particular, there is no need to undertake costly aberration balancing correction. What is more, simple lenses generally have better MTF and lower F# (higher sensitivity). It should be noted that although the lenses used in these array cameras have much smaller color aberrations when compared to conventional polychromatic lenses, each lens is still designed to focus a certain wavelength-bandwidth. Accordingly, in one embodiment each of these “monochromatic” lenses can be optimally color corrected by using combinations of high and low Abbe number materials (different optical dispersions).
Light of different wavelengths having different focal lengths (longitudinal color aberration) is not the only type of aberration that occurs in polychromatic optical systems. The refractive index of a lens is dependent on the wavelength of light passing through the lens. As a result, a lens will impart different magnification to colors of different wavelengths. For example, the red wavelength band might have a slightly smaller magnification than green, and green may in turn have a slightly smaller magnification than blue. If the images obtained from these different wavelengths of light are then overlaid without correction, the image will lose resolution because the different colors will not overlap correctly. Based on the properties of the material, the differential lateral distortions of the color magnification can be determined and then corrected. Correction can be accomplished by restricting the profiles of the lenses so that each color has the same magnification, but this reduces the possible degrees of freedom available for lens manufacture, and reduces the ability to optimize MTF. Accordingly, lateral distortion can be permitted optically, and then corrected after imaging computationally. The electronic correction of the lateral color of the lens can actually provide improvements to system performance above and beyond simply correcting for the original distortion, because such correction directly improves the resolution of the system in terms of polychromatic MTF. In particular, lateral color aberrations in a lens can be seen as a color dependent distortion of the lens. By mapping all differently distorted single color images of an object back to the same rectangle, perfect overlap can be achieved in the full color image resulting in the polychromatic MTF being the same as the monochromatic one (not only due to the individual color channel color-blur correction, but also as a result of the exact superposition of the different colors).
Yet another advantage to using many lenses, each optimized for use with a narrow band of light, is that the there is no restriction on the type of lens that may be used. In particular, the array camera allows for the use of diffractive, refractive, Fresnel lenses, or combinations of these types of lenses. Diffractive lenses are attractive because they allow for the creation of complex wavefronts with an essentially flat optical element, and they are also relatively simple to manufacture. In conventional cameras it is not possible to use diffractive lenses because having a single imager means that the lens must be able to efficiently transmit a wide spectrum of light, and while diffractive lenses are very efficient at transmitting narrow wavelength bands of light, there is a steep drop-off in performance for wavelengths of light outside of this optimized range. Because each array of the current camera may be focused on a narrow wavelength of light, the narrow optimized wavelength band of these diffractive lenses is not a limiting factor.
Other advantages of smaller lens elements include, among others, reduced cost, reduced amount of materials, and the reduction in the manufacturing steps. By providing n2 lenses that are 1/n the size in x and y dimension (and thus 1/n thickness), the wafer size for producing the lens element may also be reduced. This reduces the cost and the amount of materials considerably. Further, the number of lens substrates is reduced, which results in a reduced number of manufacturing steps and reduced attendant yield costs. The placement accuracy required to register the lens array to the imagers is typically no more stringent than in the case of a conventional imager because the pixel size for the camera array according to the present invention may be substantially the same as a conventional image sensor. In addition, monochromatic aberrations scale with lens diameter. Because array cameras are able to use smaller lenses, any aberrations that exist are smaller so it is possible to use lenses with simpler profiles. This results in a system that is simultaneously better and less costly to fabricate. Smaller sized lenses also have a lower volume, which results in lower sag or shrinkage during manufacture. Shrinkage is bad for replication because it deforms the desired lens profile, and results in the need for the fabricator to precompensate for the predicted level of sag so that the final lens shape will be correct. This precompensation is difficult to control. With lower sag/shrinkage it is not necessary to have these tight fabrication controls, again lowering the overall cost of the manufacture of the lenses.
In one embodiment, the WLO fabrication process includes: (i) incorporating lens element stops by plating the lens element stops onto the substrate before lens molding, and (ii) etching holes in the substrate and performing two-sided molding of lenses through the substrate. The etching of holes in the substrate is advantageous because index mismatch is not caused between plastic and substrate. In this way, light absorbing substrate that forms natural stops for all lens elements (similar to painting lens edges black) may be used.
In one embodiment, filters are part of the imager. In another embodiment, filters are part of a WLO subsystem. In an embodiment including a filter, it is preferred to dispose the filter (whether CFA, IR and/or VIS) into or close to the aperture stop surface and not at the imager sensor surface, because when positioned at a distance from the imager sensor small defects in those filter layers are averaged out over all entrance pupil positions, and are therefore less visible.
Imaging System and Processing Pipeline
The controller 440 is hardware, software, firmware or a combination thereof for controlling various operation parameters of the camera array 410. The controller 440 receives inputs 446 from a user or other external components and sends operation signals 442 to control the camera array 410. The controller 440 may also send information 444 to the image processing pipeline module 420 to assist processing of the images 412.
The image processing pipeline module 420 is hardware, firmware, software or a combination for processing the images received from the camera array 410. The image processing pipeline module 420 processes multiple images 412, for example, as described below in detail with reference to
The address and phase offset calibration module 554 is a storage device for storing calibration data produced during camera array characterization in the manufacturing process or a subsequent recalibration process. In several embodiments, the calibration data can indicate mapping between the addresses of physical pixels 572 in the imagers and the logical addresses 546, 548 of an image. In other embodiments, a variety of calibration data appropriate to a specific application can be utilized in the address and phase offset calibration module.
The address conversion module 530 performs normalization based on the calibration data stored in the address and phase offset calibration module 554.
Specifically, the address conversion module 530 converts “physical” addresses of the individual pixels in the image to “logical” addresses 548 of the individual pixels in the imagers or vice versa. In order for super-resolution processing to produce an image of enhanced resolution, the phase difference between corresponding pixels in the individual imagers needs to be resolved. The super-resolution process may assume that for each pixel in the resulting image the set of input pixels from each of the imager is consistently mapped and that the phase offset of the image captured by each imager is already known with respect to the position of the pixel in the resulting image. Alternatively, the pixel offsets can be estimated prior to the superresolution process. The address conversion module 530 resolves such phase differences by converting the physical addresses in the images 412 into logical addresses 548 of the resulting image for subsequent processing.
The images 412 captured by the imagers 540 are provided to the upstream pipeline processing module 510. The upstream pipe processing module 510 may perform one or more of normalization of the color planes, Black Level calculation and adjustments, fixed noise compensation, optical PSF (point spread function) deconvolution, noise reduction, lateral color correction and crosstalk reduction.
In one embodiment, the upstream pipeline processing module also performs temperature normalization. Temperature normalization corrects for changes in the refractive index of the optical components through which the imagers receive light that result from changes in the temperature of the camera during use. In several embodiments, the temperature normalization process involves determining the temperature of the camera array by measuring the dark current of one or an average of a number of the camera array's imagers. Using this measurement, a refractive index normalization is performed by picking the correct point spread function from temperature calibration data. Different point spread functions may be obtained during a temperature dependent refractive index characterization of the camera during manufacture, and stored in the imaging system for use in the temperature normalization process.
After the image is processed by the upstream pipeline processing module 510, an image pixel correlation module 514 performs calculation to account for parallax that becomes more apparent as objects being captured approach the camera array. Specifically, the image pixel correlation module 514 aligns portions of images captured by different imagers to compensate for the parallax. In one embodiment, the image pixel correlation module 514 compares the difference between the average values of neighboring pixels with a threshold and flags the potential presence of parallax when the difference exceeds the threshold. The threshold may change dynamically as a function of the operating conditions of the camera array. Further, the neighborhood calculations may also be adaptive and reflect the particular operating conditions of the selected imagers.
The image is then processed by the parallax confirmation and measurement module 518 to detect and meter the parallax. In one embodiment, parallax detection is accomplished by a running pixel correlation monitor. This operation takes place in logical pixel space across the imagers with similar integration time conditions. When the scene is at practical infinity, the data from the imagers is highly correlated and subject only to noise-based variations. When an object is close enough to the camera, however, a parallax effect is introduced that changes the correlation between the imagers. Due to the spatial layout of the imagers, the nature of the parallax-induced change is consistent across all imagers. Within the limits of the measurement accuracy, the correlation difference between any pair of imagers dictates the difference between any other pair of imagers and the differences across the other imagers. This redundancy of information enables highly accurate parallax confirmation and measurement by performing the same or similar calculations on other pairs of imagers. If parallax is present in the other pairs, the parallax should occur at roughly the same physical location of the scene taking into account the positions of the imagers. The measurement of the parallax may be accomplished at the same time by keeping track of the various pair-wise measurements and calculating an “actual” parallax difference as a least squares (or similar statistic) fit to the sample data. Other methods for detecting the parallax may include detecting and tracking vertical and horizontal high-frequency image elements from frame-to-frame.
The parallax compensation module 522 processes images including objects close enough to the camera array to induce parallax differences larger than the accuracy of the phase offset information required by super resolution process. The parallax compensation module 522 uses the scan-line based parallax information generated in the parallax detection and measurement module 518 to further adjust mapping between physical pixel addresses and logical pixel addresses before the super-resolution process. There are two cases that occur during this processing. In a more common case, addressing and offsetting adjustment are required when the input pixels have shifted positions relative to the image-wise-corresponding pixels in other imagers. In this case, no further processing with respect to parallax is required before performing super-resolution. In a less common case, a pixel or group of pixels are shifted in such a way that exposes the occlusion set. In this case, the parallax compensation process generates tagged pixel data indicating that the pixels of the occlusion set should not be considered in the super-resolution process.
After the parallax change has been accurately determined for a particular imager, the parallax information 524 is sent to the address conversion module 530. The address conversion module 530 uses the parallax information 524 along with the calibration data 558 from the address and phase offset calibration module 554 to determine the appropriate X and Y offsets to be applied to logical pixel address calculations. The address conversion module 530 also determines the associated sub-pixel offset for a particular imager pixel with respect to pixels in the resulting image 428 produced by the super-resolution process. The address conversion module 530 takes into account the parallax information 524 and provides logical addresses 546 accounting for the parallax.
After performing the parallax compensation, the image is processed by the super-resolution module 526 to obtain a high resolution synthesized image 422 from low resolution images, as described below in detail. The synthesized image 422 may then be fed to the downstream color processing module 564 to perform one or more of the following operations: focus recover, white balance, color correction, gamma correction, RGB to YUV correction, edge-aware sharpening, contrast enhancement and compression.
The image processing pipeline module 420 may include components for additional processing of the image. For example, the image processing pipeline module 420 may include a correction module for correcting abnormalities in images caused by a single pixel defect or a cluster of pixel defects. The correction module may be embodied on the same chip as the camera array, as a component separate from the camera array or as a part of the super-resolution module 526.
Super-Resolution Processing
In one embodiment, the super-resolution module 526 generates a higher resolution synthesized image by processing low resolution images captured by the imagers 540. The overall image quality of the synthesized image is higher than images captured from any one of the imagers individually. In other words, the individual imagers operate synergistically, each contributing to higher quality images using their ability to capture a narrow part of the spectrum without sub-sampling. The image formation associated with the super-resolution techniques may be expressed as follows:
yk=Wk·x+nk,∀k=1 . . . p equation (2)
where Wk represents the contribution of the HR scene (x) (via blurring, motion, and sub-sampling) to each of the LR images (yk) captured on each of the k imagers and nk is the noise contribution.
Imager Configurations
The embodiment of
The use of polychromatic imagers and near-IR imagers is advantageous because these sensors may capture high quality images in low lighting conditions. The images captured by the polychromatic imager or the near-IR imager are used to denoise the images obtained from regular color imagers. However, as discussed above, these polychromatic lenses require that an associated color correction technique be used to address color aberrations inherent in a single lens trying to capture all wavelengths of light and deliver it to the same focal plane. Any conventional color correction technique may be utilized with the proposed array cameras.
Imager Layout
The premise of increasing resolution by aggregating multiple low resolution images relies upon the different low resolution images representing slightly different viewpoints of the same scene. If the LR images are all shifted by integer units of a pixel, then each image contains essentially the same information. Therefore, there is no new information in the LR images that can be used to create a HR image. In camera arrays according to embodiments of the invention, the layout of the imagers in the array may be preset and controlled so that each imager in a row or a column captures an image that is shifted a fixed sub-pixel distance relative to the images captured by its neighboring imagers. Ideally, the images captured by each imager are spatially offset from the other imagers in such a way as to provide uniform sampling of the scene or the light field and the uniformity of sampling is such that the LR images captured by each of the imagers yields non-redundant information about the sampled scene (light field). Such non-redundant information about the scene can be utilized by subsequent signal processing processes to synthesize a single HR image.
A sub-pixel shift between the images captured by two imagers is not, however, sufficient to ensure uniformity of sampling. The uniformity of sampling or sampling diversity of two imagers is a function of object distance. The sampled space by pixels of a pair of imagers is illustrated in
Referring back to the camera array structures illustrated in
The main lens associated with each imager maps the points in the object space to points in the image space such at that the mapping is bijective (onto-to-one and onto). Each microlens samples a finite extent of the sensor irradiance light field. The sensor irradiance light field is continuous and is the result of a bijective mapping from the object space. Thus, the microlens sampling of a finite extent of the sensor irradiance light field is also a sampling of a corresponding finite extent of the scene radiance light field in object space.
Moving the microlens by a small amount δ laterally along the plain of the imager pixels changes the sampled object space at a certain distance zk by a correspondingly appropriate factor δ. With an n×n (n>2) array camera, we can choose a baseline microlens shift can be determined by the main lens profile (for example, the chief ray angle) for a baseline imager. For each of the other imagers that sample the same wavelength as the baseline imager, the microlenses of each of the pixels in the imager are shifted by a sub-pixel amount to sample a different part of the scene radiance light field. Thus for a set of imagers arranged in an n×n grid, the sub-pixel shift for a imager that images the same wavelengths as the baseline imager (1,1) at a grid location (i,j) (1≦i,j≦n) is governed by (δx, δy) where,
Many camera arrays in accordance with embodiments of the invention include significantly more Green imagers than Red and Blue imagers. For example, the array camera illustrated in
The sub-pixel shifts discussed above are determined relative to a baseline imager located at the corner of the grid, many embodiments of the invention utilize radial sub-pixel shifts from a baseline imager located at the center of the sensor array. In several embodiments, the radial sub-pixel shifts are chosen so the sub-pixel shifts are evenly distributed to enable the greatest sampling diversity.
The constraints on microlens sub-pixel shifts defined above achieve the highest increases in diversity and can enable the greatest increases in resolution through superresolution processing. Sub-pixel shifts that do not satisfy the constraints, but still provide an increase in sampling diversity can also be used to enable some increase in resolution through superresolution processing. Therefore, embodiments of the invention are not limited to microlens shifts that result in the greatest increases in diversity and in many instances utilize a variety of different microlens shift configurations that provide at least some increase in sampling diversity and that are satisfactory for the requirements of a specific application.
Symmetry of Imager Placement in Camera Array
An issue of separating the spectral sensing elements into different imagers is parallax caused by the physical separation of the imagers. By ensuring that the imagers are symmetrically placed, at least two imagers can capture the pixels around the edge of a foreground object. In this way, the pixels around the edge of a foreground object may be aggregated to increase resolution as well as avoiding any occlusions. In the absence of a symmetrical distribution, a pixel around the edge of a foreground object that is visible to a first imager, for example a Red imager, may be occluded to a second imager that captures different wavelengths, for example a blue imager. Accordingly, color information for the pixel cannot be accurately reconstructed. By symmetrically distributing the sensors, the likelihood that a foreground object will occlude pixels is significantly reduced.
Pixel occlusion caused by an asymmetric distribution of Red and Blue imagers in a simple array is illustrated in
An array that includes a symmetric distribution of Red and Blue imagers in accordance with an embodiment of the invention is illustrated in
The symmetrical arrangement of the simple embodiment illustrated in
The effects of parallax on the sampling of color can also be reduced by using parallax information in polychromatic imagers to improve the accuracy of the sampling of color from the color filtered imagers.
Use of Near-IR Imagers to Obtain Improved High Resolution Images
In one embodiment, near-IR imagers are used to determine relative luminance differences compared to a visible spectra imager. Objects have differing material reflectivity results in differences in the images captured by the visible spectra and the near-IR spectra. At low lighting conditions, the near-IR imager exhibits a higher signal to noise ratios. Therefore, the signals from the near-IR sensor may be used to enhance the luminance image. The transferring of details from the near-IR image to the luminance image may be performed before aggregating spectral images from different imagers through the super-resolution process. In this way, edge information about the scene may be improved to construct edge-preserving images that can be used effectively in the super-resolution process. The advantage of using near-IR imagers is apparent from equation (2) where any improvement in the estimate for the noise (i.e., n) leads to a better estimate of the original HR scene (x).
Generation of High Resolution Images
Then it is determined 728 if the lighting condition is better than a preset parameter. If the lighting condition is better than the parameter, the process proceeds to normalize 730 a super-resolved near-IR image with respect to a super-resolved luma image. A focus recovery is then performed 742. In one embodiment, the focus recovery is performed 742 using PSF (point spread function) deblurring per each color channel. Then the super-resolution is processed 746 based on near-IR images and the luma images. A synthesized image is then constructed 750.
If it is determined 728 that the lighting condition is not better than the preset parameter, the super-resolved near-IR images and luma images are aligned 734. Then the super-resolved luma images are denoised 738 using the near-IR super-resolved images. Then the process proceeds to performing focus recovery 742 and repeats the same process as when the lighting condition is better than the preset parameter. Then the process terminates.
Normalization of Color Planes
The relative response of each of the Red, Green, Blue imagers across the imaging planes varies. The variance can be the result of many factors including the optical alignment of the lens and asymmetrical sensor light path geometry. For a given lens and imager, the variance can be compensated for by calibration and normalization. Without compensation, the variance can give rise to artifacts such as color shading.
A process for normalizing a imager with respect to a baseline imager, which is typically a Green imager located in the center of the camera array, in accordance with an embodiment of the invention is discussed below with reference to the normalization of a Red imager with respect to a baseline Green imager. A similar process can be used to normalize Blue imagers with respect to a baseline Green imager. In many embodiments, the process is applied to normalize each Red and Blue imager in a camera array.
A normalization surface can be calibrated by first capturing a scene with flat reflectance, and calculating a color ratio surface to serve as the basis for normalization. An ideal normalization surface is uniform and can be described as:
Color Ratio G/R=G(i,j)/R(i,j)=K=Gcenter/Rcenter
where (i,j) describe the pixel position, K is a constant, and Gcenter, and Rcenter, describe the pixel value at the center position.
The output pixel values of the calibration scene contain the ideal pixel values plus noise plus black level offset, and can be described as follows:
SR(i,j)=R(i,j)+Noise R(i,j)+black offset
SG(i,j)=G(i,j)+Noise G(i,j)+black offset
where SR, and SG are the output pixel values from each imager.
A process for calibrating the sensor in accordance with an embodiment of the invention is illustrated in
Norm R=G(i,j)/(R(i,j)×(Gcenter/Rcenter))
where Gcenter, and Rcenter, are the pixel values at the center position.
Following the calculation of the normalization plane, an averaging filter can be applied (768) and the values of the Norm R plane are stored (770).
The cost of carrying all of the normalization data for each of the sensors in a sensor array can be quite high. Therefore, many embodiments scan the Norm R plane using a space filling curve to form a one dimensional array. The resultant one dimensional array can be modeled in a variety of different ways including being modeled as a polynomial with suitable order. In several embodiments, the polynomials of the fitted polynomial are stored (810) as parameters that are used during calibration to reconstruct the two dimensional normalization plane. The construction of a space filling curve in accordance with several embodiments of the invention is discussed further below.
In several embodiments, a space filling curve is used to form a one dimensional array describing a normalization plane. A space filling curve, which is constructed using a spiral scan, is illustrated in
The data value along each side exhibits a fixed geometric relationship. The optical path to the focal point of the lens is shorter for the cells near the center line. The base sensitivity can be thought of as a one dimensional center cut of the calibration surface and approximated by a low order polynomial. The sensitivity polynomial can be either stored as a machine constant (i.e., common to all devices with the same design), or stored along with the scan polynomial to provide additional flexibility. Accordingly, many embodiments of the invention adjust the pixel value based upon the distance factor as follows. For each side scan, one of the coordinates will be a constant, i.e., constant ‘y’ for horizontal scan and constant ‘x’ for vertical scan. For each pixel in the side scan, the sensitivity factor is adjusted towards the constant ‘x’ or ‘y’ distance.
By way of example, for a horizontal scan the base value can be found by evaluating the sensitivity polynomial based on the distance ‘y’ from the center. In many embodiments, a suitable polynomial is a fourth order polynomial. Although other polynomials and/or other functions can be utilized in accordance with the requirements of a specific application. For each pixel in the scan path, the distance from the surface origin is used to find the corresponding sensitivity from the polynomial in the same manner. The pixel value is multiplied by an adjustment factor and then stored in the scanned data array. This adjustment factor is calculated by dividing the base value with the current sensitivity value. For the vertical scan a similar method can be applied. Although the example uses a polynomial based sensitivity adjustment, other sensitivity functions and/or adjustments can be utilized depending upon the requirements of a specific application in accordance with various embodiments of the invention.
Once calibration data has been obtained for an imager, the calibration data can be used in the normalization of pixel information captured by the imager. The process typically involves retrieving the stored calibration data, removing the black offset from the captured image and multiplying the resultant values with the normalization plane. When the normalization plane is expressed as a polynomial in the manner outlined above, the polynomial is used to generate a one-dimensional array and an inverse scan of the one-dimensional array is used to form the two dimensional normalization plane. Where a sensitivity adjustment was applied during calibration, an adjustment factor is calculated that is the reciprocal of the adjustment factor applied during the calibration scan and the adjustment factor is applied to the values in the one dimensional array during the inverse scan. When other space filling curves, representations of the resulting one dimensional data array, and/or sensitivity adjustments are performed during the calibration process, the normalization process is adjusted accordingly.
As can be readily appreciated, calibration and normalization processes in accordance with embodiments of the invention can be applied to each of the Red and Blue imagers in the camera array. In many embodiments, a Green imager located in the center of the camera array is used when performing the calibration. In other embodiments, a different Green imager and/or multiple Green imagers can be utilized in the calibration of the Red and Blue imagers in the camera array.
Image Fusion of Color Images with Near-IR Images
The spectral response of CMOS imagers is typically very good in the near-IR regions covering 650 nm to 800 nm and reasonably good between 800 nm and 1000 nm. Although near-IR images having no chroma information, information in this spectral region is useful in low lighting conditions because the near-IR images are relatively free of noise. Hence, the near-IR images may be used to denoise color images under the low lighting conditions.
In one embodiment, an image from a near-IR imager is fused with another image from a visible light imager. Before proceeding with the fusion, a registration is performed between the near-IR image and the visible light image to resolve differences in viewpoints. The registration process may be performed in an offline, one-time, processing step. After the registration is performed, the luminance information on the near-IR image is interpolated to grid points that correspond to each grid point on the visible light image.
After the pixel correspondence between the near-IR image and the visible light image is established, a denoising and detail transfer process may be performed. The denoising process allows transfer of signal information from the near-IR image to the visible light image to improve the overall SNR of the fusion image. The detail transfer ensures that edges in the near-IR image and the visible light image are preserved and accentuated to improve the overall visibility of objects in the fused image.
In one embodiment, a near-IR flash may serve as a near-IR light source during capturing of an image by the near-IR imagers. Using the near-IR flash is advantageous, among other reasons, because (i) the harsh lighting on objects of interest may be prevented, (ii) ambient color of the object may be preserved, and (iii) red-eye effect may be prevented.
In one embodiment, a visible light filter that allows only near-IR rays to pass through is used to further optimize the optics for near-IR imaging. The visible light filter improves the near-IR optics transfer function because the light filter results in sharper details in the near-IR image. The details may then be transferred to the visible light images using a dual bilateral filter as described, for example, in Eric P. Bennett et al., “Multispectral Video Fusion,” Computer Graphics (ACM SIGGRAPH Proceedings) (Jul. 25, 2006), which is incorporated by reference herein in its entirety.
Dynamic Range Determination by Differing Exposures at Imagers
An auto-exposure (AE) algorithm is important to obtaining an appropriate exposure for the scene to be captured. The design of the AE algorithm affects the dynamic range of captured images. The AE algorithm determines an exposure value that allows the acquired image to fall in the linear region of the camera array's sensitivity range. The linear region is preferred because a good signal-to-noise ratio is obtained in this region. If the exposure is too low, the picture becomes under-saturated while if the exposure is too high the picture becomes over-saturated. In conventional cameras, an iterative process is taken to reduce the difference between measured picture brightness and previously defined brightness below a threshold. This iterative process requires a large amount of time for convergence, and sometimes results in an unacceptable shutter delay.
In one embodiment, the picture brightness of images captured by a plurality of imagers is independently measured. Specifically, a plurality of imagers are set to capturing images with different exposures to reduce the time for computing the adequate exposure. For example, in a camera array with 5×5 imagers where 8 luma imagers and 9 near-IR imagers are provided, each of the imagers may be set with different exposures. The near-IR imagers are used to capture low-light aspects of the scene and the luma imagers are used to capture the high illumination aspects of the scene. This results in a total of 17 possible exposures. If exposure for each imager is offset from an adjacent imager by a factor of 2, for example, a maximum dynamic range of 217 or 102 dB can be captured. This maximum dynamic range is considerably higher than the typical 48 dB attainable in a conventional camera with 8 bit image outputs.
At each time instant, the responses (under-exposed, over-exposed or optimal) from each of the multiple imagers are analyzed based on how many exposures are needed at the subsequent time instant. The ability to query multiple exposures simultaneously in the range of possible exposures accelerates the search compared to the case where only one exposure is tested at once. By reducing the processing time for determining the adequate exposure, shutter delays and shot-to-shot lags may be reduced.
In one embodiment, the HDR image is synthesized from multiple exposures by combining the images after linearizing the imager response for each exposure. The images from the imagers may be registered before combining to account for the difference in the viewpoints of the imagers.
In one embodiment, at least one imager includes HDR pixels to generate HDR images. HDR pixels are specialized pixels that capture high dynamic range scenes. Although HDR pixels show superior performances compared to other pixels, HDR pixels show poor performance at low lighting conditions in comparison with near-IR imagers. To improve performance at low lighting conditions, signals from the near-IR imagers may be used in conjunction with the signal from the HDR imager to attain better quality images across different lighting conditions.
In one embodiment, an HDR image is obtained by processing images captured by multiple imagers by processing, as disclosed, for example, in Paul Debevec et al., “Recovering High Dynamic Range Radiance Maps from Photographs,” Computer Graphics (ACM SIGGRAPH Proceedings), (Aug. 16, 1997), which is incorporated by reference herein in its entirety. The ability to capture multiple exposures simultaneously using the imager is advantageous because artifacts caused by motion of objects in the scene can be mitigated or eliminated.
Hyperspectral Imaging by Multiple Imagers
In one embodiment, a multi-spectral image is rendered by multiple imagers to facilitate the segmentation or recognition of objects in a scene. Because the spectral reflectance coefficients vary smoothly in most real world objects, the spectral reflectance coefficients may be estimated by capturing the scene in multiple spectral dimensions using imagers with different color filters and analyzing the captured images using Principal Components Analysis (PCA).
In one embodiment, half of the imagers in the camera array are devoted to sampling in the basic spectral dimensions (R, G, and B) and the other half of the imagers are devoted to sampling in a shifted basic spectral dimensions (R′, G′, and B′). The shifted basic spectral dimensions are shifted from the basic spectral dimensions by a certain wavelength (e.g., 10 nm).
In one embodiment, pixel correspondence and non-linear interpolation is performed to account for the sub-pixel shifted views of the scene. Then the spectral reflectance coefficients of the scene are synthesized using a set of orthogonal spectral basis functions as disclosed, for example, in J. P. S. Parkkinen, J. Hallikainen and T. Jaaskelainen, “Characteristic Spectra of Munsell Colors,” J. Opt. Soc. Am., A 6:318 (August 1989), which is incorporated by reference herein in its entirety. The basis functions are eigenvectors derived by PCA of a correlation matrix and the correlation matrix is derived from a database storing spectral reflectance coefficients measured by, for example, Munsell color chips (a total of 1257) representing the spectral distribution of a wide range of real world materials to reconstruct the spectrum at each point in the scene.
At first glance, capturing different spectral images of the scene through different imagers in the camera array appears to trade resolution for higher dimensional spectral sampling. However, some of the lost resolution may be recovered. The multiple imagers sample the scene over different spectral dimensions where each sampling grid of each imager is offset by a sub-pixel shift from the others. In one embodiment, no two sampling grid of the imager overlap. That is, the superposition of all the sampling grids from all the imagers forms a dense, possibly non-uniform, montage of points. Scattered data interpolation methods may be used to determine the spectral density at each sample point in this non-uniform montage for each spectral image, as described, for example, in Shiaofen Fang et al., “Volume Morphing Methods for Landmark Based 3D Image Deformation” by SPIE vol. 2710, proc. 1996 SPIE Intl Symposium on Medical Imaging, page 404-415, Newport Beach, Calif. (February 1996), which is incorporated by reference herein in its entirety. In this way, a certain amount of resolution lost in the process of sampling the scene using different spectral filters may be recovered.
As described above, image segmentation and object recognition are facilitated by determining the spectral reflectance coefficients of the object. The situation often arises in security applications wherein a network of cameras is used to track an object as it moves from the operational zone of one camera to another. Each zone may have its own unique lighting conditions (fluorescent, incandescent, D65, etc.) that may cause the object to have a different appearance in each image captured by different cameras. If these cameras capture the images in a hyper-spectral mode, all images may be converted to the same illuminant to enhance object recognition performance.
In one embodiment, camera arrays with multiple imagers are used for providing medical diagnostic images. Full spectral digitized images of diagnostic samples contribute to accurate diagnosis because doctors and medical personnel can place higher confidence in the resulting diagnosis. The imagers in the camera arrays may be provided with color filters to provide full spectral data. Such camera array may be installed on cell phones to capture and transmit diagnostic information to remote locations as described, for example, in Andres W. Martinez et al., “Simple Telemedicine for Developing Regions: Camera Phones and Paper-Based Microfluidic Devices for Real-Time, Off-Site Diagnosis,” Analytical Chemistry (American Chemical Society) (Apr. 11, 2008), which is incorporated by reference herein in its entirety. Further, the camera arrays including multiple imagers may provide images with a large depth of field to enhance the reliability of image capture of wounds, rashes, and other symptoms.
In one embodiment, a small imager (including, for example, 20-500 pixels) with a narrow spectral bandpass filters is used to produce a signature of the ambient and local light sources in a scene. By using the small imager, the exposure and white balance characteristics may be determined more accurately at a faster speed. The spectral bandpass filters may be ordinary color filters or diffractive elements of a bandpass width adequate to allow the number of camera arrays to cover the visible spectrum of about 400 nm. These imagers may run at a much higher frame rate and obtain data (which may or may not be used for its pictorial content) for processing into information to control the exposure and white balance of other larger imagers in the same camera array. The small imagers may also be interspersed within the camera array.
Optical Zoom Implemented Using Multiple Imagers
In one embodiment, a subset of imagers in the camera array includes telephoto lenses. The subset of imagers may have other imaging characteristics that are the same as imagers with non-telephoto lenses. Images from this subset of imagers are combined and super-resolution processed to form a super-resolution telephoto image. In another embodiment, the camera array includes two or more subsets of imagers equipped with lenses of more than two magnifications to provide differing zoom magnifications.
Embodiments of the camera arrays may achieve its final resolution by aggregating images through super-resolution. Taking an example of providing 5×5 imagers with a 3× optical zoom feature, if 17 imagers are used to sample the luma (G) and 8 imagers are used to sample the chroma (R and B), 17 luma imagers allow a resolution that is four times higher than what is achieved by any single imager in the set of 17 imagers. If the number of the imagers is increased from 5×5 to 6×6, an addition of 11 extra imagers becomes available. In comparison with the 8 Megapixel conventional image sensor fitted with a 3× zoom lens, a resolution that is 60% of the conventional image sensor is achieved when 8 of the additional 11 imagers are dedicated to sampling luma (G) and the remaining 3 imagers are dedicated to chroma (R and B) and near-IR sampling at 3× zoom. This considerably reduces the chroma sampling (or near-IR sampling) to luma sampling ratio. The reduced chroma to luma sampling ratio is somewhat offset by using the super-resolved luma image at 3× zoom as a recognition prior on the chroma (and near-IR) image to resample the chroma image at a higher resolution.
With 6×6 imagers, a resolution equivalent to the resolution of conventional image sensor is achieved at 1× zoom. At 3× zoom, a resolution equivalent to about 60% of conventional image sensor outfitted with a 3× zoom lens is obtained by the same imagers. Also, there is a decrease in luma resolution at 3× zoom compared with conventional image sensors with resolution at 3× zoom. The decreased luma resolution, however, is offset by the fact that the optics of conventional image sensor has reduced efficiency at 3× zoom due to crosstalk and optical aberrations.
The zoom operation achieved by multiple imagers has the following advantages. First, the quality of the achieved zoom is considerably higher than what is achieved in the conventional image sensor due to the fact that the lens elements may be tailored for each change in focal length. In conventional image sensors, optical aberrations and field curvature must be corrected across the whole operating range of the lens, which is considerably harder in a zoom lens with moving elements than in a fixed lens element where only aberrations for a fixed focal length need to be corrected. Additionally, the fixed lens in the imagers has a fixed chief ray angle for a given height, which is not the case with conventional image sensor with a moving zoom lens. Second, the imagers allow simulation of zoom lenses without significantly increasing the optical track height. The reduced height allows implementation of thin modules even for camera arrays with zooming capability.
The overhead required to support a certain level of optical zoom in camera arrays according to some embodiments is tabulated in Table 2.
In one embodiment, the pixels in the images are mapped onto an output image with a size and resolution corresponding to the amount of zoom desired in order to provide a smooth zoom capability from the widest-angle view to the greatest-magnification view. Assuming that the higher magnification lenses have the same center of view as the lower magnification lenses, the image information available is such that a center area of the image has a higher resolution available than the outer area. In the case of three or more distinct magnifications, nested regions of different resolution may be provided with resolution increasing toward the center.
An image with the most telephoto effect has a resolution determined by the super-resolution ability of the imagers equipped with the telephoto lenses. An image with the widest field of view can be formatted in at least one of two following ways. First, the wide field image may be formatted as an image with a uniform resolution where the resolution is determined by the super-resolution capability of the set of imagers having the wider-angle lenses. Second, the wide field image is formatted as a higher resolution image where the resolution of the central part of the image is determined by the super-resolution capability of the set of imagers equipped with telephoto lenses. In the lower resolution regions, information from the reduced number of pixels per image area is interpolated smoothly across the larger number of “digital” pixels. In such an image, the pixel information may be processed and interpolated so that the transition from higher to lower resolution regions occurs smoothly.
In one embodiment, zooming is achieved by inducing a barrel-like distortion into some, or all, of the array lens so that a disproportionate number of the pixels are dedicated to the central part of each image. In this embodiment, every image has to be processed to remove the barrel distortion. To generate a wide-angle image, pixels closer to the center are sub-sampled relative to outer pixels are super-sampled. As zooming is performed, the pixels at the periphery of the imagers are progressively discarded and the sampling of the pixels nearer the center of the imager is increased.
In one embodiment, mipmap filters are built to allow images to be rendered at a zoom scale that is between the specific zoom range of the optical elements (e.g., 1× and 3× zoom scales of the camera array). Mipmaps are a precalculated optimized set of images that accompany a baseline image. A set of images associated with the 3× zoom luma image can be created from a baseline scale at 3× down to 1×. Each image in this set is a version of the baseline 3× zoom image but at a reduced level of detail. Rendering an image at a desired zoom level is achieved using the mipmap by (i) taking the image at 1× zoom, and computing the coverage of the scene for the desired zoom level (i.e., what pixels in the baseline image needs to be rendered at the requested scale to produce the output image), (ii) for each pixel in the coverage set, determine if the pixel is in the image covered by the 3× zoom luma image, (iii) if the pixel is available in the 3× zoom luma image, then choose the two closest mipmap images and interpolate (using smoothing filter) the corresponding pixels from the two mipmap images to produce the output image, and (iv) if the pixel is unavailable in the 3× zoom luma image, then choose the pixel from the baseline 1× luma image and scale up to the desired scale to produce the output pixel. By using mipmaps, smooth optical zoom may be simulated at any point between two given discrete levels (i.e., 1× zoom and 3× zoom).
In one embodiment, zooming is achieved by realizing different Fields Of View (FOV)s by electronically switching between different optical channels having different sensor sizes, but fixed Effective Focal Lengths (EFL)s. In one such embodiment, shown schematically in
In another embodiment, as shown in
In another further embodiment, as shown in
Capturing Video Images
In one embodiment, the camera array generates high frame image sequences. The imagers in the camera array can operate independently to capture images. Compared to conventional image sensors, the camera array may capture images at the frame rate up to N time (where N is the number of imagers). Further, the frame period for each imager may overlap to improve operations under low-light conditions. To increase the resolution, a subset of imagers may operate in a synchronized manner to produce images of higher resolution. In this case, the maximum frame rate is reduced by the number of imagers operating in a synchronized manner. The high-speed video frame rates can enables slow-motion video playback at a normal video rate.
In one example, two luma imagers (green imagers or near-IR imagers), two blue imagers and two green imagers are used to obtain high-definition 1080p images. Using permutations of four luma imagers (two green imagers and two near-IR imagers or three green imagers and one near-IR imager) together with one blue imager and one red imager, the chroma imagers can be upsampled to achieve 120 frames/sec for 1080p video. For higher frame rate imaging devices, the number of frame rates can be scaled up linearly. For Standard-Definition (480p) operation, a frame rate of 240 frames/sec may be achieved using the same camera array.
Conventional imaging devices with a high-resolution image sensor (e.g., 8 Megapixels) use binning or skipping to capture lower resolution images (e.g., 1080p30, 720p30 and 480p30). In binning, rows and columns in the captured images are interpolated in the charge, voltage or pixel domains in order to achieve the target video resolutions while reducing the noise. In skipping, rows and columns are skipped in order to reduce the power consumption of the sensor. Both of these techniques result in reduced image quality.
In one embodiment, the imagers in the camera arrays are selectively activated to capture a video image. For example, 9 imagers (including one near-IR imager) may be used to obtain 1080p (1920×1080 pixels) images while 6 imagers (including one near-IR imager) may be used to obtain 720p (1280×720 pixels) images or 4 imagers (including one near-IR imager) may be used to obtain 480p (720×480 pixels) images. Because there is an accurate one-to-one pixel correspondence between the imager and the target video images, the resolution achieved is higher than traditional approaches. Further, since only a subset of the imagers is activated to capture the images, significant power savings can also be achieved. For example, 60% reduction in power consumption is achieved in 1080p and 80% of power consumption is achieved in 480p.
Using the near-IR imager to capture video images is advantageous because the information from the near-IR imager may be used to denoise each video image. In this way, the camera arrays of embodiments exhibit excellent low-light sensitivity and can operate in extremely low-light conditions. In one embodiment, super-resolution processing is performed on images from multiple imagers to obtain higher resolution video imagers. The noise-reduction characteristics of the super-resolution process along with fusion of images from the near-IR imager results in a very low-noise images.
In one embodiment, high-dynamic-range (HDR) video capture is enabled by activating more imagers. For example, in a 5×5 camera array operating in 1080p video capture mode, there are only 9 cameras active. A subset of the 16 cameras may be overexposed and underexposed by a stop in sets of two or four to achieve a video output with a very high dynamic range.
Other Applications for Multiple Imagers
In one embodiment, the multiple imagers are used for estimating distance to an object in a scene. Since information regarding the distance to each point in an image is available in the camera array along with the extent in x and y coordinates of an image element, the size of an image element may be determined. Further, the absolute size and shape of physical items may be measured without other reference information. For example, a picture of a foot can be taken and the resulting information may be used to accurately estimate the size of an appropriate shoe.
In one embodiment, reduction in depth of field is simulated in images captured by the camera array using distance information. The camera arrays according to the present invention produce images with greatly increased depth of field. The long depth of field, however, may not be desirable in some applications. In such case, a particular distance or several distances may be selected as the “in best focus” distance(s) for the image and based on the distance (z) information from parallax information, the image can be blurred pixel-by-pixel using, for example, a simple Gaussian blur. In one embodiment, the depth map obtained from the camera array is utilized to enable a tone mapping algorithm to perform the mapping using the depth information to guide the level, thereby emphasizing or exaggerating the 3D effect.
In one embodiment, apertures of different sizes are provided to obtain aperture diversity. The aperture size has a direct relationship with the depth of field. In miniature cameras, however, the aperture is generally made as large as possible to allow as much light to reach the camera array. Different imagers may receive light through apertures of different sizes. For imagers to produce a large depth of field, the aperture may be reduced whereas other imagers may have large apertures to maximize the light received. By fusing the images from sensor images of different aperture sizes, images with large depth of field may be obtained without sacrificing the quality of the image.
In one embodiment, the camera array according to the present invention refocuses based on images captured from offsets in viewpoints. Unlike a conventional plenoptic camera, the images obtained from the camera array of the present invention do not suffer from the extreme loss of resolution. The camera array according to the present invention, however, produces sparse data points for refocusing compared to the plenoptic camera. In order to overcome the sparse data points, interpolation may be performed to refocus data from the spare data points.
In one embodiment, each imager in the camera array has a different centroid. That is, the optics of each imager are designed and arranged so that the fields of view for each imager slightly overlap but for the most part constitute distinct tiles of a larger field of view. The images from each of the tiles are panoramically stitched together to render a single high-resolution image.
In one embodiment, camera arrays may be formed on separate substrates and mounted on the same motherboard with spatial separation. The lens elements on each imager may be arranged so that the corner of the field of view slightly encompasses a line perpendicular to the substrate. Thus, if four imagers are mounted on the motherboard with each imager rotated 90 degrees with respect to another imager, the fields of view will be four slightly overlapping tiles. This allows a single design of WLO lens array and imager chip to be used to capture different tiles of a panoramic image.
In one embodiment, one or more sets of imagers are arranged to capture images that are stitched to produce panoramic images with overlapping fields of view while another imager or sets of imagers have a field of view that encompasses the tiled image generated. This embodiment provides different effective resolution for imagers with different characteristics. For example, it may be desirable to have more luminance resolution than chrominance resolution. Hence, several sets of imagers may detect luminance with their fields of view panoramically stitched. Fewer imagers may be used to detect chrominance with the field of view encompassing the stitched field of view of the luminance imagers.
In one embodiment, the camera array with multiple imagers is mounted on a flexible motherboard such that the motherboard can be manually bent to change the aspect ratio of the image. For example, a set of imagers can be mounted in a horizontal line on a flexible motherboard so that in the quiescent state of the motherboard, the fields of view of all of the imagers are approximately the same. If there are four imagers, an image with double the resolution of each individual imager is obtained so that details in the subject image that are half the dimension of details that can be resolved by an individual imager. If the motherboard is bent so that it forms part of a vertical cylinder, the imagers point outward. With a partial bend, the width of the subject image is doubled while the detail that can be resolved is reduced because each point in the subject image is in the field of view of two rather than four imagers. At the maximum bend, the subject image is four times wider while the detail that can be resolved in the subject is further reduced.
Offline Reconstruction and Processing
The images processed by the imaging system 400 may be previewed before or concurrently with saving of the image data on a storage medium such as a flash device or a hard disk. In one embodiment, the images or video data includes rich light field data sets and other useful image information that were originally captured by the camera array. Other traditional file formats could also be used. The stored images or video may be played back or transmitted to other devices over various wired or wireless communication methods.
In one embodiment, tools are provided for users by a remote server. The remote server may function both as a repository and an offline processing engine for the images or video. Additionally, applets mashed as part of popular photo-sharing communities such as Flikr, Picasaweb, Facebook etc. may allow images to be manipulated interactively, either individually or collaboratively. Further, software plug-ins into image editing programs may be provided to process images generated by the imaging device 400 on computing devices such as desktops and laptops.
Various modules described herein may comprise a general-purpose computer selectively activated or reconfigured by a computer program stored in the computer. Such a computer program may be stored in a computer readable storage medium, such as, but is not limited to, any type of disk including floppy disks, optical disks, CD-ROMs, magnetic-optical disks, read-only memories (ROMs), random access memories (RAMs), EPROMs, EEPROMs, magnetic or optical cards, application specific integrated circuits (ASICs), or any type of media suitable for storing electronic instructions, and each coupled to a computer system bus. Furthermore, the computers referred to in the specification may include a single processor or may be architectures employing multiple processor designs for increased computing capability.
While particular embodiments and applications of the present invention have been illustrated and described herein, it is to be understood that the invention is not limited to the precise construction and components disclosed herein and that various modifications, changes, and variations may be made in the arrangement, operation, and details of the methods and apparatuses of the present invention without departing from the spirit and scope of the invention as it is defined in the appended claims.
Claims
1. A method of synthesizing a higher resolution image from a plurality of images captured from different viewpoints, comprising:
- normalizing a set of images using calibration data stored in a storage device using an address conversion module, where: the set of images comprises a plurality of images that are: captured from different viewpoints and include sampling diversity; include different occlusions sets; wherein the occlusion set of a first image is the portion of a scene visible in a second image that is occluded in the first image; and the set of images includes: images captured using the same imaging characteristics; images captured using different imaging characteristics; and a first image that captures pixels around an edge of a foreground object that is visible in a baseline image and is in the occlusion set of a second image captured using the same imaging characteristics as the first imager;
- detecting and metering parallax using a parallax confirmation and measurement module, where detecting and metering parallax comprises ignoring pixels in the images that are in an exposed occlusion set;
- aligning portions of images captured by different imagers to compensate for parallax using an image pixel correlation module based upon the detected and metered parallax and the stored calibration data; and
- obtaining a higher resolution image having a resolution that is higher than the resolutions of the images in the set of images using a super-resolution module, where color information around the edge of the foreground object that is visible in the baseline image and in the occlusion set of the second image is reconstructed in the higher resolution image using the pixels captured by the first image.
2. The method of claim 1, wherein:
- the calibration data indicates mappings between addresses of physical pixels in imagers and logical addresses within an image; and
- the address conversion module resolves phase differences between the set of images by converting addresses of physical pixels in imagers to logical addresses.
3. The method of claim 1, further comprising performing normalization of the color planes of the set of images using an upstream pipeline processing module.
4. The method of claim 1, further comprising performing a process to adjust the set of images using an upstream pipeline processing module, where the process is selected from the group consisting of:
- black level calculation and adjustments;
- fixed noise compensation;
- optical point spread function deconvolution;
- noise reduction;
- lateral color correction; and
- crosstalk reduction.
5. The method of claim 1, further comprising performing temperature normalization using an upstream pipeline processing module based upon a temperature measurement.
6. The method of claim 5, wherein performing temperature normalization comprises performing a refractive index normalization using a stored point spread function selected based upon the temperature measurement.
7. The method of claim 1, wherein detecting and metering parallax comprises determining the parallax that yields the highest correlation between pixels from images captured using same imaging characteristics accounting for the positions of the imagers that captured the images.
8. The method of claim 7, wherein determining the parallax that yields the highest correlation between pixels from images captured using similar integration time conditions comprises performing pair-wise measurements to determine pixel correlation for different parallax-induced shifts.
9. The method of claim 8, wherein determining the parallax that yields the highest correlation between pixels from images captured using similar integration time conditions further comprises keeping track of various pair-wise measurements and calculating a parallax difference that yields the highest correlation as the best least squares fit of the pair-wise measurements.
10. The method of claim 1, wherein aligning portions of images captured by different imagers to compensate for parallax using an image pixel correlation module further comprises determining appropriate X and Y offsets to be applied to logical pixel address calculations using an address conversion module based upon the detected and metered parallax and the stored calibration data.
11. The method of claim 1, wherein obtaining a higher resolution image having a resolution that is higher than the resolutions of the images in the set of images comprises fusing the aligned image portions.
12. The method of claim 11, wherein image fusion is performed by interpolating luminance information from the aligned image portions to each grid point on the higher resolution image.
13. The method of claim 11, wherein obtaining a higher resolution image having a resolution that is higher than the resolutions of the images comprises performing super-resolution processing on the fused image portions to synthesize a super-resolution image.
14. The method of claim 1, wherein obtaining a higher resolution image having a resolution that is higher than the resolutions of the images in the set of images comprises performing a super-resolution processing on the aligned image portions to synthesize a super-resolution image.
15. The method of claim 1, further comprising generating a depth map using the parallax confirmation and measurement module.
16. The method of claim 15, further comprising selecting at least one distance as a focal plane and applying blurring to pixels in the higher resolution image with depths in the depth map that are not proximate a focal plane using the super-resolution processing module.
17. The method of claim 11, wherein:
- images in the set of images are captured with different exposure times; and
- normalizing a set of images further comprises linearizing the camera response in the images based upon exposure time.
18. The method of claim 1, further comprising:
- capturing the set of images using an array of imagers;
- wherein each imager includes a spectral filter selected from the group consisting of a Bayer filter, one or more Blue filters, one or more Green filters, one or more Red filters, one or more shifted spectral filters, one or more near-IR filters, and one or more hyper-spectral filters.
19. The method of claim 1, wherein the address conversion module, the parallax confirmation and measurement module, the image pixel correlation module, and the super-resolution module are implemented using a general-purpose computer selectively reconfigured by a computer program stored in the computer.
20. A method of synthesizing a higher resolution image from a plurality of images captured from different viewpoints, comprising:
- normalizing a set of images using calibration data stored in a storage device using an address conversion module, where the set of images comprises a plurality of images that are: captured from different viewpoints and include sampling diversity; include different occlusions sets; wherein the occlusion set of a first image is the portion of a scene visible in a second image that is occluded in the first image; and the set of images includes: images captured using the same imaging characteristics; images captured using different imaging characteristics;
- a first image that captures pixels around an edge of a foreground object that is visible in a baseline image and is in the occlusion set of a second image captured using the same imaging characteristics as the first imager;
- detecting and metering parallax using a parallax confirmation and measurement module, where detecting and metering parallax comprises ignoring pixels in the images that are in an exposed occlusion set;
- aligning portions of images captured by different imagers to compensate for parallax using an image pixel correlation module based upon the detected and metered parallax and the stored calibration data;
- obtaining a higher resolution image having a resolution that is higher than the resolutions of the images in the set of images using a super-resolution module, where color information around the edge of the foreground object that is visible in the baseline image and in the occlusion set of the second image is reconstructed in the higher resolution image using the pixels captured by the first image; and
- generating a depth map for the higher resolution image using the parallax confirmation and measurement module;
- wherein the higher resolution image is obtained by: fusing the aligned image portions; and performing super-resolution processing on the fused image portions to synthesize a super-resolution image.
4124798 | November 7, 1978 | Thompson |
4198646 | April 15, 1980 | Alexander et al. |
4323925 | April 6, 1982 | Abell et al. |
4460449 | July 17, 1984 | Montalbano |
4467365 | August 21, 1984 | Murayama et al. |
5005083 | April 2, 1991 | Grage |
5070414 | December 3, 1991 | Tsutsumi |
5144448 | September 1, 1992 | Hornbaker |
5629524 | May 13, 1997 | Stettner et al. |
5808350 | September 15, 1998 | Jack et al. |
5832312 | November 3, 1998 | Rieger et al. |
5880691 | March 9, 1999 | Fossum et al. |
5933190 | August 3, 1999 | Dierickx et al. |
5973844 | October 26, 1999 | Burger |
6002743 | December 14, 1999 | Telymonde |
6034690 | March 7, 2000 | Gallery et al. |
6069351 | May 30, 2000 | Mack |
6069365 | May 30, 2000 | Chow et al. |
6097394 | August 1, 2000 | Levoy et al. |
6124974 | September 26, 2000 | Burger |
6137535 | October 24, 2000 | Meyers |
6141048 | October 31, 2000 | Meyers |
6358862 | March 19, 2002 | Ireland et al. |
6603513 | August 5, 2003 | Berezin |
6611289 | August 26, 2003 | Yu |
6627896 | September 30, 2003 | Hashimoto et al. |
6635941 | October 21, 2003 | Suda |
6671399 | December 30, 2003 | Berestov |
6750904 | June 15, 2004 | Lambert |
6765617 | July 20, 2004 | Tangen et al. |
6771833 | August 3, 2004 | Edgar |
6774941 | August 10, 2004 | Boisvert et al. |
6879735 | April 12, 2005 | Portniaguine et al. |
6903770 | June 7, 2005 | Kobayashi et al. |
6909121 | June 21, 2005 | Nishikawa |
6958862 | October 25, 2005 | Joseph |
7085409 | August 1, 2006 | Sawhney et al. |
7199348 | April 3, 2007 | Olsen et al. |
7262799 | August 28, 2007 | Suda |
7292735 | November 6, 2007 | Blake et al. |
7295697 | November 13, 2007 | Satoh |
7369165 | May 6, 2008 | Bosco et al. |
7391572 | June 24, 2008 | Jacobowitz et al. |
7606484 | October 20, 2009 | Richards et al. |
7633511 | December 15, 2009 | Shum et al. |
7657090 | February 2, 2010 | Omatsu et al. |
7675080 | March 9, 2010 | Boettiger |
7675681 | March 9, 2010 | Tomikawa et al. |
7706634 | April 27, 2010 | Schmitt et al. |
7723662 | May 25, 2010 | Levoy et al. |
7782364 | August 24, 2010 | Smith |
7986018 | July 26, 2011 | Rennie |
7990447 | August 2, 2011 | Honda et al. |
8013904 | September 6, 2011 | Tan et al. |
8027531 | September 27, 2011 | Wilburn et al. |
8044994 | October 25, 2011 | Vetro et al. |
8077245 | December 13, 2011 | Adamo et al. |
8106949 | January 31, 2012 | Tan et al. |
8126279 | February 28, 2012 | Marcellin et al. |
8131097 | March 6, 2012 | Lelescu et al. |
8180145 | May 15, 2012 | Wu et al. |
8189089 | May 29, 2012 | Georgiev |
8213711 | July 3, 2012 | Tam |
8231814 | July 31, 2012 | Duparre |
8242426 | August 14, 2012 | Ward et al. |
8294099 | October 23, 2012 | Blackwell, Jr. |
8305456 | November 6, 2012 | McMahon |
8315476 | November 20, 2012 | Georgiev et al. |
8345144 | January 1, 2013 | Georgiev et al. |
8360574 | January 29, 2013 | Ishak et al. |
8406562 | March 26, 2013 | Bassi et al. |
8446492 | May 21, 2013 | Nakano et al. |
8514491 | August 20, 2013 | Duparre |
8541730 | September 24, 2013 | Inuiya |
8542933 | September 24, 2013 | Venkataraman et al. |
8619082 | December 31, 2013 | Ciurea et al. |
8692893 | April 8, 2014 | McMahon |
8780113 | July 15, 2014 | Ciurea et al. |
8804255 | August 12, 2014 | Duparre |
8831367 | September 9, 2014 | Venkataraman et al. |
20010005225 | June 28, 2001 | Clark et al. |
20010038387 | November 8, 2001 | Tomooka et al. |
20020012056 | January 31, 2002 | Trevino |
20020027608 | March 7, 2002 | Johnson |
20020063807 | May 30, 2002 | Margulis |
20020087403 | July 4, 2002 | Meyers et al. |
20020089596 | July 11, 2002 | Suda |
20020094027 | July 18, 2002 | Sato et al. |
20020101528 | August 1, 2002 | Lee et al. |
20020113867 | August 22, 2002 | Takigawa et al. |
20020113888 | August 22, 2002 | Sonoda et al. |
20020163054 | November 7, 2002 | Suda |
20020167537 | November 14, 2002 | Trajkovic |
20020177054 | November 28, 2002 | Saitoh et al. |
20030086079 | May 8, 2003 | Barth et al. |
20030124763 | July 3, 2003 | Fan et al. |
20030140347 | July 24, 2003 | Varsa |
20030179418 | September 25, 2003 | Wengender et al. |
20030211405 | November 13, 2003 | Venkataraman |
20040008271 | January 15, 2004 | Hagimori et al. |
20040012689 | January 22, 2004 | Tinnerino et al. |
20040047274 | March 11, 2004 | Amanai |
20040050104 | March 18, 2004 | Ghosh et al. |
20040056966 | March 25, 2004 | Schechner et al. |
20040100570 | May 27, 2004 | Shizukuishi |
20040114807 | June 17, 2004 | Lelescu et al. |
20040165090 | August 26, 2004 | Ning |
20040169617 | September 2, 2004 | Yelton et al. |
20040170340 | September 2, 2004 | Tipping et al. |
20040174439 | September 9, 2004 | Upton |
20040207836 | October 21, 2004 | Chhibber et al. |
20040213449 | October 28, 2004 | Safaee-Rad et al. |
20040234873 | November 25, 2004 | Venkataraman |
20040251509 | December 16, 2004 | Choi |
20050006477 | January 13, 2005 | Patel |
20050012035 | January 20, 2005 | Miller |
20050036778 | February 17, 2005 | DeMonte |
20050048690 | March 3, 2005 | Yamamoto |
20050068436 | March 31, 2005 | Fraenkel et al. |
20050132098 | June 16, 2005 | Sonoda et al. |
20050134712 | June 23, 2005 | Gruhlke et al. |
20050151759 | July 14, 2005 | Gonzalez-Banos et al. |
20050205785 | September 22, 2005 | Hornback et al. |
20050219363 | October 6, 2005 | Kohler |
20050225654 | October 13, 2005 | Feldman et al. |
20050286612 | December 29, 2005 | Takanashi |
20060002635 | January 5, 2006 | Nestares et al. |
20060023197 | February 2, 2006 | Joel |
20060023314 | February 2, 2006 | Boettiger et al. |
20060033005 | February 16, 2006 | Jerdev et al. |
20060038891 | February 23, 2006 | Okutomi et al. |
20060049930 | March 9, 2006 | Zruya et al. |
20060054780 | March 16, 2006 | Garrood et al. |
20060054782 | March 16, 2006 | Olsen et al. |
20060072029 | April 6, 2006 | Miyatake et al. |
20060087747 | April 27, 2006 | Ohzawa et al. |
20060098888 | May 11, 2006 | Morishita |
20060125936 | June 15, 2006 | Gruhike et al. |
20060138322 | June 29, 2006 | Costello et al. |
20060159369 | July 20, 2006 | Young |
20060176566 | August 10, 2006 | Boettiger et al. |
20060187338 | August 24, 2006 | May et al. |
20060210186 | September 21, 2006 | Berkner |
20060243889 | November 2, 2006 | Farnworth et al. |
20060251410 | November 9, 2006 | Trutna |
20060274174 | December 7, 2006 | Tewinkle |
20060278948 | December 14, 2006 | Yamaguchi et al. |
20060279648 | December 14, 2006 | Senba et al. |
20070002159 | January 4, 2007 | Olsen et al. |
20070024614 | February 1, 2007 | Tam |
20070040828 | February 22, 2007 | Zalevsky et al. |
20070040922 | February 22, 2007 | McKee et al. |
20070041391 | February 22, 2007 | Lin et al. |
20070052825 | March 8, 2007 | Cho |
20070083114 | April 12, 2007 | Yang et al. |
20070085917 | April 19, 2007 | Kobayashi |
20070102622 | May 10, 2007 | Olsen et al. |
20070126898 | June 7, 2007 | Feldman |
20070127831 | June 7, 2007 | Venkataraman |
20070139333 | June 21, 2007 | Sato et al. |
20070146511 | June 28, 2007 | Kinoshita et al. |
20070159541 | July 12, 2007 | Sparks et al. |
20070160310 | July 12, 2007 | Tanida et al. |
20070171290 | July 26, 2007 | Kroger |
20070211164 | September 13, 2007 | Olsen et al. |
20070216765 | September 20, 2007 | Wong et al. |
20070228256 | October 4, 2007 | Mentzer |
20070257184 | November 8, 2007 | Olsen et al. |
20070258006 | November 8, 2007 | Olsen et al. |
20070258706 | November 8, 2007 | Raskar et al. |
20070268374 | November 22, 2007 | Robinson |
20070296835 | December 27, 2007 | Olsen et al. |
20080019611 | January 24, 2008 | Larkin |
20080025649 | January 31, 2008 | Liu et al. |
20080030597 | February 7, 2008 | Olsen et al. |
20080043095 | February 21, 2008 | Vetro et al. |
20080043096 | February 21, 2008 | Vetro et al. |
20080080028 | April 3, 2008 | Bakin et al. |
20080084486 | April 10, 2008 | Enge et al. |
20080088793 | April 17, 2008 | Sverdrup et al. |
20080118241 | May 22, 2008 | Tekolste et al. |
20080131019 | June 5, 2008 | Ng |
20080131107 | June 5, 2008 | Ueno |
20080151097 | June 26, 2008 | Chen et al. |
20080152215 | June 26, 2008 | Horie et al. |
20080152296 | June 26, 2008 | Oh et al. |
20080193026 | August 14, 2008 | Horie et al. |
20080218610 | September 11, 2008 | Chapman et al. |
20080219654 | September 11, 2008 | Border et al. |
20080247638 | October 9, 2008 | Tanida et al. |
20080247653 | October 9, 2008 | Moussavi et al. |
20080278591 | November 13, 2008 | Barna et al. |
20090050946 | February 26, 2009 | Duparre et al. |
20090052743 | February 26, 2009 | Techmer |
20090060281 | March 5, 2009 | Tanida et al. |
20090086074 | April 2, 2009 | Li et al. |
20090091806 | April 9, 2009 | Inuiya |
20090102956 | April 23, 2009 | Georgiev |
20090109306 | April 30, 2009 | Shan et al. |
20090128833 | May 21, 2009 | Yahav |
20090167922 | July 2, 2009 | Perlman et al. |
20090179142 | July 16, 2009 | Duparre et al. |
20090180021 | July 16, 2009 | Kikuchi et al. |
20090200622 | August 13, 2009 | Tai et al. |
20090201371 | August 13, 2009 | Matsuda et al. |
20090207235 | August 20, 2009 | Francini et al. |
20090225203 | September 10, 2009 | Tanida et al. |
20090237520 | September 24, 2009 | Kaneko et al. |
20090268192 | October 29, 2009 | Koenck et al. |
20090268983 | October 29, 2009 | Stone |
20090274387 | November 5, 2009 | Jin |
20090284651 | November 19, 2009 | Srinivasan |
20090297056 | December 3, 2009 | Lelescu et al. |
20090302205 | December 10, 2009 | Olsen et al. |
20090323195 | December 31, 2009 | Hembree et al. |
20090323206 | December 31, 2009 | Oliver et al. |
20090324118 | December 31, 2009 | Maslov et al. |
20100002126 | January 7, 2010 | Wenstrand et al. |
20100002313 | January 7, 2010 | Duparre et al. |
20100002314 | January 7, 2010 | Duparre |
20100053342 | March 4, 2010 | Hwang et al. |
20100053600 | March 4, 2010 | Tanida et al. |
20100060746 | March 11, 2010 | Olsen et al. |
20100086227 | April 8, 2010 | Sun et al. |
20100097491 | April 22, 2010 | Farina et al. |
20100103259 | April 29, 2010 | Tanida et al. |
20100103308 | April 29, 2010 | Butterfield et al. |
20100118127 | May 13, 2010 | Nam et al. |
20100141802 | June 10, 2010 | Knight et al. |
20100142839 | June 10, 2010 | Lakus-Becker |
20100157073 | June 24, 2010 | Kondo et al. |
20100177411 | July 15, 2010 | Hegde et al. |
20100195716 | August 5, 2010 | Klein Gunnewiek et al. |
20100201834 | August 12, 2010 | Maruyama et al. |
20100208100 | August 19, 2010 | Olsen et al. |
20100220212 | September 2, 2010 | Perlman et al. |
20100231285 | September 16, 2010 | Boomer et al. |
20100244165 | September 30, 2010 | Lake et al. |
20100265385 | October 21, 2010 | Knight et al. |
20100281070 | November 4, 2010 | Chan et al. |
20100302423 | December 2, 2010 | Adams, Jr. et al. |
20110001037 | January 6, 2011 | Tewinkle |
20110032370 | February 10, 2011 | Ludwig |
20110043661 | February 24, 2011 | Podoleanu |
20110043665 | February 24, 2011 | Ogasahara |
20110043668 | February 24, 2011 | McKinnon et al. |
20110069189 | March 24, 2011 | Venkataraman et al. |
20110080487 | April 7, 2011 | Venkataraman et al. |
20110108708 | May 12, 2011 | Olsen et al. |
20110122308 | May 26, 2011 | Duparre |
20110153248 | June 23, 2011 | Gu et al. |
20110211824 | September 1, 2011 | Georgiev et al. |
20110221658 | September 15, 2011 | Haddick et al. |
20110241234 | October 6, 2011 | Duparre |
20110242342 | October 6, 2011 | Goma et al. |
20110242355 | October 6, 2011 | Goma et al. |
20110242356 | October 6, 2011 | Aleksic et al. |
20110255592 | October 20, 2011 | Sung et al. |
20110267348 | November 3, 2011 | Lin et al. |
20110273531 | November 10, 2011 | Ito et al. |
20110274366 | November 10, 2011 | Tardif |
20110279721 | November 17, 2011 | McMahon |
20110285866 | November 24, 2011 | Bhrugumalla et al. |
20110300929 | December 8, 2011 | Tardif et al. |
20110310980 | December 22, 2011 | Mathew |
20110317766 | December 29, 2011 | Lim et al. |
20120012748 | January 19, 2012 | Pain et al. |
20120026342 | February 2, 2012 | Yu et al. |
20120039525 | February 16, 2012 | Tian et al. |
20120044372 | February 23, 2012 | Côté et al. |
20120069235 | March 22, 2012 | Imai |
20120113413 | May 10, 2012 | Miahczylowicz-Wolski et al. |
20120147205 | June 14, 2012 | Lelescu et al. |
20120198677 | August 9, 2012 | Duparre |
20120200734 | August 9, 2012 | Tang |
20120249550 | October 4, 2012 | Akeley et al. |
20120262607 | October 18, 2012 | Shimura et al. |
20120287291 | November 15, 2012 | McMahon et al. |
20120293695 | November 22, 2012 | Tanaka |
20120314033 | December 13, 2012 | Lee et al. |
20120327222 | December 27, 2012 | Ng et al. |
20130003184 | January 3, 2013 | Duparre |
20130010073 | January 10, 2013 | Do |
20130022111 | January 24, 2013 | Chen et al. |
20130027580 | January 31, 2013 | Olsen et al. |
20130033579 | February 7, 2013 | Wajs |
20130050504 | February 28, 2013 | Safaee-Rad et al. |
20130050526 | February 28, 2013 | Keelan |
20130057710 | March 7, 2013 | McMahon |
20130070060 | March 21, 2013 | Chatterjee |
20130077880 | March 28, 2013 | Venkataraman et al. |
20130077882 | March 28, 2013 | Venkataraman et al. |
20130088637 | April 11, 2013 | Duparre |
20130120605 | May 16, 2013 | Georgiev et al. |
20130128068 | May 23, 2013 | Georgiev et al. |
20130128069 | May 23, 2013 | Georgiev et al. |
20130128087 | May 23, 2013 | Georgiev et al. |
20130147979 | June 13, 2013 | McMahon et al. |
20130215108 | August 22, 2013 | McMahon et al. |
20130229540 | September 5, 2013 | Farina et al. |
20130259317 | October 3, 2013 | Gaddy |
20130265459 | October 10, 2013 | Duparre et al. |
20140132810 | May 15, 2014 | McMahon |
20140232822 | August 21, 2014 | Venkataraman et al. |
20140240528 | August 28, 2014 | Venkataraman et al. |
20140240529 | August 28, 2014 | Venkataraman et al. |
840502 | May 1998 | EP |
2336816 | June 2011 | EP |
2006033493 | February 2006 | JP |
2007520107 | July 2007 | JP |
2011109484 | June 2011 | JP |
2013526801 | June 2013 | JP |
2014521117 | August 2014 | JP |
2007083579 | July 2007 | WO |
2008108271 | September 2008 | WO |
2009151903 | December 2009 | WO |
2011063347 | May 2011 | WO |
2011116203 | September 2011 | WO |
2011/063347 | October 2011 | WO |
2011143501 | November 2011 | WO |
2012057619 | May 2012 | WO |
2012057620 | May 2012 | WO |
2012057621 | May 2012 | WO |
2012057622 | May 2012 | WO |
2012057623 | May 2012 | WO |
2012057620 | June 2012 | WO |
2012074361 | June 2012 | WO |
2012078126 | June 2012 | WO |
2012082904 | June 2012 | WO |
2012/155119 | November 2012 | WO |
2013003276 | January 2013 | WO |
2013043751 | March 2013 | WO |
2013043761 | March 2013 | WO |
2013049699 | April 2013 | WO |
2013055960 | April 2013 | WO |
2013119706 | August 2013 | WO |
2013126578 | August 2013 | WO |
2014052974 | April 2014 | WO |
2014032020 | May 2014 | WO |
2014078443 | May 2014 | WO |
2014130849 | August 2014 | WO |
2014133974 | September 2014 | WO |
2014138695 | September 2014 | WO |
2014138697 | September 2014 | WO |
- International Search Report and Written Opinion for International Application No. PCT/US13/46002, Search Completed Nov. 13, 2013, Mailed Nov. 29, 2013, 7 pgs.
- International Search Report and Written Opinion for International Application No. PCT/US13/48772, Search Completed Oct. 21, 2013, Mailed Nov. 8, 2013, 6 pgs.
- International Search Report and Written Opinion for International Application No. PCT/US13/56065, Search Completed Nov. 25, 2013, Mailed Nov. 26, 2013, 8 pgs.
- International Search Report and Written Opinion for International Application No. PCT/US13/59991, Search Completed Feb. 6, 2014, Mailed Feb. 26, 2014, 8 pgs.
- International Search Report and Written Opinion for International Application No. PCT/US2013/024987, Search Completed Mar. 27, 2013, Mailed Apr. 15, 2013, 14 pgs.
- International Search Report and Written Opinion for International Application No. PCT/US2013/056502, Search Completed Feb. 18, 2014, Mailed Mar. 19, 2014, 7 pgs.
- International Search Report and Written Opinion for International Application No. PCT/US2013/069932, International Filing Date Nov. 13, 2013, Search Completed Mar. 14, 2014, Mailed Apr. 14, 2014, 12 pgs.
- International Search Report and Written Opinion for International Application PCT/US13/62720, report completed Mar. 25, 2014, Mailed Apr. 21, 2014, 9 Pgs.
- International Search Report and Written Opinion for International Application PCT/US14/024903 report completed Jun. 12, 2014, Mailed, Jun. 27, 2014, 13 pgs.
- International Search Report and Written Opinion for International Application PCT/US14/24407, report completed Jun. 11, 2014, Mailed Jul. 8, 2014, 9 Pgs.
- International Search Report and Written Opinion for International Application PCT/US14/25100, report completed Jul. 7, 2014, Mailed Aug 7, 2014 5 Pgs.
- International Search Report and Written Opinion for International Application PCT/US14/25904 report completed Jun. 10, 2014, Mailed Jul. 10, 2014, 6 Pgs.
- International Search Report and Written Opinion for International Application PCT/US2014/022123, report completed Jun. 9, 2014, Mailed Jun. 25, 2014, 5 pgs.
- International Search Report and Written Opinion for International Application PCT/US2014/024947, Report Completed Jul. 8, 2014, Mailed Aug 5, 2014, 8 Pgs.
- International Search Report and Written Opinion for International Application PCT/US2014/028447, report completed Jun. 30, 2014, Mailed Jul. 21, 2014, 8 Pgs.
- International Search Report and Written Opinion for International Application PCT/US2014/23762, Report Completed May 30, 2014, Mailed Jul. 3, 2014, 6 Pgs.
- International Search Report and Written Opinion of the International Searching Authority for International Application No. PCT/US2009/044687, date completed Jan. 5, 2010, date mailed Jan. 13, 2010, 9 pgs.
- IPRP for International Application No. PCT/US2012/059813, International Filing Date Oct. 11, 2012, Search Completed Apr. 15, 2014, 7 pgs.
- International Search Report and Written Opinion for International Application PCT/US11/36349, mailed Aug. 22, 2011, 12 pgs.
- International Search Report and Written Opinion for International Application No. PCT/US2011/64921, Report Completed Feb. 25, 2011, mailed Mar. 6, 2012, 17 pgs.
- International Search Report and Written Opinion for International Application No. PCT/US2013/027146, completed Apr. 2, 2013, 12 pgs.
- International Search Report and Written Opinion for International Application PCT/US14/17766, completed May 28, 2014, Mailed Jun. 18, 2014, 9 pgs.
- International Search Report and Written Opinion for International Application PCT/US14/18084, completed May 23, 2014, Mailed Jun. 10, 2014, 12 pgs.
- International Search Report and Written Opinion for International Application PCT/US14/22118, report completed Jun. 9, 2014, Mailed, Jun. 25, 2014, 5 pgs.
- International Search Report and Written Opinion for International Application PCT/US2010/057661, completed Mar. 9, 2011, 14 pgs.
- International Search Report and Written Opinion for International Application PCT/US2012/044014, completed Oct. 12, 2012, 15 pgs.
- International Search Report and Written Opinion for International Application PCT/US2012/056151, completed Nov. 14, 2012, 10 pgs.
- International Search Report and Written Opinion for International Application PCT/US2012/059813, completed Dec. 17, 2012, 8 pgs.
- International Search Report and Written Opinion for International Application PCT/US2012/37670, Mailed Jul. 18, 2012, Search Completed Jul. 5, 2012, 9 pgs.
- International Search Report and Written Opinion for International Application PCT/US2012/58093, completed Nov. 15, 2012, 12 pgs.
- Office Action for U.S. Appl. No. 12/952,106, dated Aug. 16, 2012, 12 pgs.
- Baker et al., “Limits on Super-Resolution and How to Break Them”, IEEE Transactions on Pattern Analysis and Machine Intelligence, Sep. 2002, vol. 24, No. 9, pp. 1167-1183.
- Bertero et al., “Super-resolution in computational imaging”, Micron, 2003, vol. 34, Issues 6-7, 17 pgs.
- Bishop et al., “Full-Resolution Depth Map Estimation from an Aliased Plenoptic Light Field”, ACCV 2010, Part II, LNCS 6493, pp. 186-200.
- Bishop et al., “The Light Field Camera: Extended Depth of Field, Aliasing, and Superresolution”, IEEE Transactions on Pattern Analysis and Machine Intelligence, May 2012, vol. 34, No. 5, pp. 972-986.
- Borman, “Topics in Multiframe Superresolution Restoration”, Thesis of Sean Borman, Apr. 2004, 282 pgs.
- Borman et al, “Image Sequence Processing”, Source unknown, Oct. 14, 2002, 81 pgs.
- Borman et al., “Block-Matching Sub-Pixel Motion Estimation from Noisy, Under-Sampled Frames—An Empirical Performance Evaluation”, Proc SPIE, Dec. 1998, 3653, 10 pgs.
- Borman et al., “Image Resampling and Constraint Formulation for Multi-Frame Super-Resolution Restoration”, Proc. SPIE, Jun. 2003, 5016, 12 pgs.
- Borman et al., “Linear models for multi-frame super-resolution restoration under non-affine registration and spatially varying PSF”, Proc. SPIE, May 2004, vol. 5299, 12 pgs.
- Borman et al., “Nonlinear Prediction Methods for Estimation of Clique Weighting Parameters in NonGaussian Image Models”, Proc. SPIE, 1998. 3459, 9 pgs.
- Borman et al., “Simultaneous Multi-Frame MAP Super-Resolution Video Enhancement Using Spatio-Temporal Priors”, Image Processing, 1999, ICIP 99 Proceedings, vol. 3, pp. 469-473.
- Borman et al., “Super-Resolution from Image Sequences—A Review”, Circuits & Systems, 1998, pp. 374-378.
- Bruckner et al., “Artificial compound eye applying hyperacuity”, Optics Express, Dec. 11, 2006, vol. 14, No. 25, pp. 12076-12084.
- Bruckner et al., “Driving microoptical imaging systems towards miniature camera applications”, Proc. SPIE, Micro-Optics, 2010, 11 pgs.
- Bruckner et al., “Thin wafer-level camera lenses inspired by insect compound eyes”, Optics Express, Nov. 22, 2010, vol. 18, No. 24, pp. 24379-24394.
- International Preliminary Report on Patentability for International Application PCT/US2013/027146,Report Completed Apr. 2, 2013, Mailed Sep. 4, 2014, 10 Pages.
- International Search Report and Written Opinion for International Application PCT/US2014/030692, report completed Jul. 28, 2014, Mailed Aug. 27, 2014, 7 Pages.
- Joshi et al., “Synthetic Aperture Tracking: Tracking Through Occlusions”, I CCV IEEE 11th International Conference on Computer Vision; Publication [online]. Oct. 2007 [retrieved Jul. 28, 2014]. Retrieved from the Internet: <URL: http:|/ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=4409032&isnumber=4408819>; pp. 1-8.
- Liu et al., “Virtual View Reconstruction Using Temporal Information”, 2012 IEEE International Conference on Multimedia and Expo, 2012, pp. 115-120.
- Lo et al., “Stereoscopic 3D Copy & Paste”, ACM Transactions on Graphics, vol. 29, No. 6, Article 147, Dec. 2010, pp. 147:1-147:10.
- Muehlebach, “Camera Auto Exposure Control for VSLAM Applications”, Studies on Mechatronics, Swiss Federal Institute of Technology Zurich, Autumn Term 2010 course, 67 pgs.
- Nayar, “Computational Cameras: Redefining the Image”, IEEE Computer Society, Aug. 2006, pp. 30-38.
- Ng, “Digital Light Field Photography”, Thesis, Jul. 2006, 203 pgs.
- Ng et al., “Super-Resolution Image Restoration from Blurred Low-Resolution Images”, Journal of Mathematical Imaging and Vision, 2005, vol. 23, pp. 367-378.
- Nitta et al., “Image reconstruction for thin observation module by bound optics by using the iterative backprojection method”, Applied Optics, May 1, 2006, vol. 45, No. 13, pp. 2893-2900.
- Nomura et al., “Scene Collages and Flexible Camera Arrays”, Proceedings of Eurographics Symposium on Rendering, 2007, 12 pgs.
- Park et al., “Super-Resolution Image Reconstruction”, IEEE Signal Processing Magazine, May 2003, pp. 21-36.
- Pham et al., “Robust Super-Resolution without Regularization”, Journal of Physics: Conference Series 124, 2008, pp. 1-19.
- Protter et al., “Generalizing the Nonlocal-Means to Super-Resolution Reconstruction”, IEEE Transactions on Image Processing, Jan. 2009, vol. 18, No. 1, pp. 36-51.
- Radtke et al., “Laser lithographic fabrication and characterization of a spherical artificial compound eye”, Optics Express, Mar. 19, 2007, vol. 15, No. 6, pp. 3067-3077.
- Rander et al., “Virtualized Reality: Constructing Time-Varying Virtual Worlds From Real World Events”, Proc. of IEEE Visualization '97, Phoenix, Arizona, Oct. 19-24, 1997, pp. 277-283, 552.
- Rhemann et al, “Fast Cost-Volume Filtering for Visual Correspondence and Beyond”, IEEE Trans. Pattern Anal. Mach. Intell, 2013, vol. 35, No. 2, pp. 504-511.
- Robertson et al., “Dynamic Range Improvement Through Multiple Exposures”, In Proc. of the Int. Conf. on Image Processing, 1999, 5 pgs.
- Robertson et al., “Estimation-theoretic approach to dynamic range enhancement using multiple exposures”, Journal of Electronic Imaging, Apr. 2003, vol. 12, No. 2, pp. 219-228.
- Roy et al., “Non-Uniform Hierarchical Pyramid Stereo for Large Images”, Computer and Robot Vision, 2007, pp. 208-215.
- Shum et al., “Pop-Up Light Field: An Interactive Image-Based Modeling and Rendering System,” Apr. 2004, ACM Transactions on Graphics, vol. 23, No. 2, pp. 143-162. Retrieved from http://131.107.65.14/en-us/um/people/jiansun/papers/PopupLightField—TOG.pdf on Feb. 5.
- Stollberg et al., “The Gabor superlens as an alternative wafer-level camera approach inspired by superposition compound eyes of nocturnal insects”, Optics Express, Aug. 31, 2009, vol. 17, No. 18, pp. 15747-15759.
- Takeda et al., “Super-resolution Without Explicit Subpixel Motion Estimation”, IEEE Transaction on Image Processing, Sep. 2009, vol. 18, No. 9, pp. 1958-1975.
- Tanida et al., “Color imaging with an integrated compound imaging system”, Optics Express, Sep. 8, 2003, vol. 11, No. 18, pp. 2109-2117.
- Tanida et al., “Thin observation module by bound optics (TOMBO): concept and experimental verification”, Applied Optics, Apr. 10, 2001, vol. 40, No. 11, pp. 1806-1813.
- Vaish et al., “Reconstructing Occluded Surfaces Using Synthetic Apertures: Stereo, Focus and Robust Measures”, Proceeding, CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition—vol. 2, pp. 2331-2338.
- Vaish et al., “Synthetic Aperture Focusing Using a Shear-Warp Factorization of the Viewing Transform”, IEEE Workshop on A3DISS, CVPR, 2005, 8 pgs.
- Vaish et al., “Using Plane + Parallax for Calibrating Dense Camera Arrays”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2004, 8 pgs.
- Veilleux, “CCD Gain Lab: The Theory”, University of Maryland, College Park—Observational Astronomy (ASTR 310), Oct. 19, 2006, pp. 1-5 (online], [retrieved on May 13, 2014]. Retrieved from the Internet <URL:.http://www.astro.umd.edu/˜veilleux/ASTR310/fall06/ccd—theory.pdf, 5 pgs.
- Vuong et al., “A New Auto Exposure and Auto White-Balance Algorithm to Detect High Dynamic Range Conditions Using CMOS Technology”, Proceedings of the World Congress on Engineering and Computer Science 2008, WCECS 2008, Oct. 22-24, 2008.
- Wetzstein et al., “Computational Plenoptic Imaging”, Computer Graphics Forum, 2011, vol. 30, No. 8, pp. 2397-2426.
- Wheeler et al., “Super-Resolution Image Synthesis Using Projections Onto Convex Sets in the Frequency Domain”, Proc. SPIE, 2005, 5674, 12 pgs.
- Wilburn, “High Performance Imaging Using Arrays of Inexpensive Cameras”, Thesis of Bennett Wilburn, Dec. 2004, 128 pgs.
- Wilburn et al., “High Performance Imaging Using Large Camera Arrays”, ACM Transactions on Graphics, Jul. 2005, vol. 24, No. 3, pp. 765-776.
- Wilburn et al., “High-Speed Videography Using a Dense Camera Array”, Proceeding, CVPR'04 Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 294-301.
- Wilburn et al., “The Light Field Video Camera”, Proceedings of Media Processors 2002, SPIE Electronic Imaging, 2002, 8 pgs.
- Wippermann et al., “Design and fabrication of a chirped array of refractive ellipsoidal micro-lenses for an apposition eye camera objective”, Proceedings of SPIE, Optical Design and Engineering II, Oct. 15, 2005, 59622C-1-59622C-11.
- Yang et al., “A Real-Time Distributed Light Field Camera”, Eurographics Workshop on Rendering (2002), pp. 1-10.
- Zhang et al., “A Self-Reconfigurable Camera Array”, Eurographics Symposium on Rendering, 2004, 12 pgs.
- Zomet et al., “Robust Super-Resolution”, IEEE, 2001, pp. 1-6.
- Capel, “Image Mosaicing and Super-resolution”, [online], Retrieved on Nov. 10, 2012. Retrieved from the Internet at URL:<http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.226.2643&rep=rep1 &type=pdf>, Title pg., abstract, table of contents, pp. 1-263 (269 total pages).
- Chan et al., “Extending the Depth of Field in a Compound-Eye Imaging System with Super-Resolution Reconstruction”, Proceedings—International Conference on Pattern Recognition, 2006, vol. 3, pp. 623-626.
- Chan et al., “Investigation of Computational Compound-Eye Imaging System with Super-Resolution Reconstruction”, IEEE, ISASSP 2006, pp. 1177-1180.
- Chan et al., “Super-resolution reconstruction in a computational compound-eye imaging system”, Multidim Syst Sign Process, 2007, vol. 18, pp. 83-101.
- Chen et al., “Interactive deformation of light fileds”, In Proceedings of SIGGRAPH I3D 2005, pp. 139-146.
- Drouin et al., “Fast Multiple-Baseline Stereo with Occlusion”, Proceedings of the Fifth International Conference on 3-D Digital Imaging and Modeling, 2005, 8 pgs.
- Drouin et al., “Geo-Consistency for Wide Multi-Camera Stereo”, Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2005, 8 pgs.
- Drouin et al., “Improving Border Localization of Multi-Baseline Stereo Using Border-Cut”, International Journal of Computer Vision, Jul. 2009, vol. 83, Issue 3, 8 pgs.
- Duparre et al., “Artificial apposition compound eye fabricated by micro-optics technology”, Applied Optics, Aug. 1, 2004, vol. 43, No. 22, pp. 4303-4310.
- Duparre et al., “Artificial compound eye zoom camera”, Bioinspiration & Biomimetics, 2008, vol. 3, pp. 1-6.
- Duparre et al., “Artificial compound eyes—different concepts and their application to ultra flat image acquisition sensors”, MOEMS and Miniaturized Systems IV, Proc. SPIE 5346, Jan. 2004, pp. 89-100.
- Duparre et al., “Chirped arrays of refractive ellipsoidal microlenses for aberration correction under oblique incidence”, Optics Express, Dec. 26, 2005, vol. 13, No. 26, pp. 10539-10551.
- Duparre et al., “Micro-optical artificial compound eyes”, Bioinspiration & Biomimetics, 2006, vol. 1, pp. R1-R16.
- Duparre et al., “Microoptical Artificial Compound Eyes—Two Different Concepts for Compact Imaging Systems”, 11th Microoptics Conference, Oct. 30-Nov. 2, 2005, 2 pgs.
- Duparre et al., “Microoptical telescope compound eye”, Optics Express, Feb. 7, 2005, vol. 13, No. 3, pp. 889-903.
- Duparre et al., “Micro-optically fabricated artificial apposition compound eye”, Electronic Imaging—Science and Technology, Prod. SPIE 5301, Jan. 2004, pp. 25-33.
- Duparre et al., “Novel Optics/Micro-Optics for Miniature Imaging Systems”, Proc. of SPIE, 2006, vol. 6196, pp. 619607-1-619607-15.
- Duparre et al., “Theoretical analysis of an artificial superposition compound eye for application in ultra flat digital image acquisition devices”, Optical Systems Design, Proc. SPIE 5249, Sep. 2003, pp. 408-418.
- Duparre et al., “Thin compound-eye camera”, Applied Optics, May 20, 3005, vol. 44, No. 15, pp. 2949-2956.
- Duparre et al., “Ultra-Thin Camera Based on Artificial Apposistion Compound Eyes”, 10th Microoptics Conference, Sep. 1-3, 2004, 2 pgs.
- Fanaswala, “Regularized Super-Resolution of Multi-View Images”, Retrieved on Nov. 10, 2012. Retrieved from the Internet at URL:<http://www.site.uottawa.ca/-edubois/theses/Fanaswala—thesis.pdf>, 163 pgs.
- Farrell et al., “Resolution and Light Sensitivity Tradeoff with Pixel Size”, Proceedings of the SPIE Electronic Imaging 2006 Conference, 2006, vol. 6069, 8 pgs.
- Farsiu et al., “Advances and Challenges in Super-Resolution”, International Journal of Imaging Systems and Technology, 2004, vol. 14, pp. 47-57.
- Farsiu et al., “Fast and Robust Multiframe Super Resolution”, IEEE Transactions on Image Processing, Oct. 2004, vol. 13, No. 10, pp. 1327-1344.
- Farsiu et al., “Multiframe Demosaicing and Super-Resolution of Color Images”, IEEE Transactions on Image Processing, Jan. 2006, vol. 15, No. 1, pp. 141-159.
- Feris et al., “Multi-Flash Stereopsis: Depth Edge Preserving Stereo with Small Baseline Illumination”, IEEE Trans on PAMI, 2006, 31 pgs.
- Fife et al., “A 3D Multi-Aperture Image Sensor Architecture”, Custom Integrated Circuits Conference, 2006, CICC '06, IEEE, pp. 281-284.
- Fife et al., “A 3MPixel Multi-Aperture Image Sensor with 0.7Mu Pixels in 0.11Mu CMOS”, ISSCC 2008, Session 2, Image Sensors & Technology, 2008, pp. 48-50.
- Goldman et al., “Video Object Annotation, Navigation, and Composition”, In Proceedings of UIST 2008, pp. 3-12.
- Gortler et al., “The Lumigraph”, In Proceedings of SIGGRAPH 1996, pp. 43-54.
- Hacohen et al., “Non-Rigid Dense Correspondence with Applications for Image Enhancement”, ACM Transactions on Graphics, 30, 4, 2011, pp. 70:1-70:10.
- Hamilton, “JPEG File Interchange Format, Version 1.02”, Sep. 1, 1992, 9 pgs.
- Hardie, “A Fast Image Super-Algorithm Using an Adaptive Wiener Filter”, IEEE Transactions on Image Processing, Dec. 2007, vol. 16, No. 12, pp. 2953-2964.
- Hasinoff et al., “Search-and-Replace Editing for Personal Photo Collections”, Computational Photography (ICCP) 2010, pp. 1-8.
- Horisaki et al., “Irregular Lens Arrangement Design to Improve Imaging Performance of Compound-Eye Imaging Systems”, Applied Physics Express, 2010, vol. 3, pp. 022501-1-022501-3.
- Horisaki et al., “Superposition Imaging for Three-Dimensionally Space-Invariant Point Spread Functions”, Applied Physics Express, 2011, vol. 4, pp. 112501-1-112501-3.
- Horn et al., “LightShop: Interactive Light Field Manipulation and Rendering”, In Proceedings of I3D 2007, pp. 121-128.
- Isaksen et al., “Dynamically Reparameterized Light Fields”, In Proceedings of SIGGRAPH 2000, pp. 297-306.
- Jarabo et al., “Efficient Propagation of Light Field Edits”, In Proceedings of SIACG 2011, pp. 75-80.
- Kang et al., “Handling Occlusions inn Dense Multi-View Stereo”, Computer Vision and Pattern Recognition, 2001, vol. 1, pp. I-103-I-110.
- Kitamura et al., “Reconstruction of a high-resolution image on a compound-eye image-capturing system”, Applied Optics, Mar. 10, 2004, vol. 43, No. 8, pp. 1719-1727.
- Krishnamurthy et al., “Compression and Transmission of Depth Maps for Image-Based Rendering”, Image Processing, 2001, pp. 828-831.
- Lensvector, “How LensVector Autofocus Works”, printed Nov. 2, 2012 from http://www.lensvector.com/overview.html, 1 pg.
- Levoy, “Light Fields and Computational Imaging”, IEEE Computer Society, Aug. 2006, pp. 46-55.
- Li et al., “A Hybrid Camera for Motion Deblurring and Depth Map Super-Resolution,” Jun. 23-28, 2008, IEEE Conference on Computer Vision and Pattern Recognition, 8 pgs. Retrieved from www.eecis.udel.edu/˜jye/lab—research/08/deblur-feng.pdf on Feb. 5, 2014.
Type: Grant
Filed: Aug 13, 2014
Date of Patent: Feb 17, 2015
Patent Publication Number: 20140354853
Assignee: Pelican Imaging Corporation (Santa Clara, CA)
Inventors: Kartik Venkataraman (San Jose, CA), Amandeep S. Jabbi (San Francisco, CA), Robert H. Mullis (Santa Cruz, CA), Jacques Duparre (Jena), Shane Ching-Feng Hu (Fremont, CA)
Primary Examiner: Hung Lam
Application Number: 14/459,283
International Classification: H04N 5/225 (20060101); H04N 5/262 (20060101); H04N 5/357 (20110101); H04N 5/265 (20060101); H04N 9/083 (20060101);