Estimation of panoramic camera orientation relative to a vehicle coordinate frame
A system and method are presented for estimating the orientation of a panoramic camera mounted on a vehicle relative to the vehicle coordinate frame. An initial pose estimate of the vehicle is determined based on global positioning system data, inertial measurement unit data, and wheel odometry data of the vehicle. Image data from images captured by the camera is processed to obtain one or more tracks, each track including a sequence of matched feature points stemming from a same three-dimensional location. A correction parameter determined from the initial pose estimate and tracks can then be used to correct the orientations of the images captured by the camera. The correction parameter can be optimized by deriving a correction parameter for each of a multitude of distinct subsequences of one or more runs. Statistical analysis can be performed on the determined correction parameters to produce robust estimates.
Latest Google Patents:
The present application is a continuation of U.S. patent application Ser. No. 12/708,302, filed Feb. 18, 2010, which claims the benefit of the filing date of U.S. Provisional Patent Application No. 61/177,614, filed May 12, 2009, and U.S. Provisional Patent Application No. 61/154,217, filed Feb. 20, 2009, both entitled “Estimation of Panoramic Camera Orientation Relative to a Vehicle Coordinate Frame,” the entire disclosures of which are hereby incorporated herein by reference.
BACKGROUND1. Field of the Invention
The present invention relates to camera-obtained imagery captured from a moving vehicle.
2. Related Art
A camera, such as a panoramic camera, can be mounted on a vehicle, such as a car, truck, van, or any kind of vehicle, and used to capture images as the vehicle moves. A panoramic camera is a camera, typically a system of one or multiple cameras, that is configured or arranged to capture a panoramic image (i.e., an image or view of an area in many directions, possibly every direction). Examples of a panoramic camera can include a single camera, a polycamera, a camera rosette, a rotating camera, etc. The captured images may be used for online navigation and viewing tools such as Google Inc.'s STREET VIEW tool, for example. Vehicles that use panoramic camera systems in this manner may also include other systems and devices for related data collection. For example, a data collection vehicle may include a Global Positioning System (GPS) and/or an Inertial Measurement Unit (IMU) sensor in addition to the camera system. It may also record the amount of rotation of the vehicle's wheels. These systems include sensors that can collect data, which can help estimate the location of the vehicle. Given the precise location of the vehicle, the captured images can be associated with and shown at those locations.
There is nontrivial variation in the way a panoramic camera system and GPS and IMU sensors are placed on, or within, a data collection vehicle. For example, there is little consistency in the placement of a camera rack on top of the vehicle roof. In addition, there is variation in how and where the GPS and IMU sensors are placed within the vehicle. Furthermore, cameras and camera racks are often replaced, or their configuration and/or positioning may be changed by human operators. In many applications, in order to correctly render a panoramic view, one needs to know how the ground plane and world coordinates relate to the image panorama that was captured by the panoramic camera. If this information is not known or inaccurate, objects (e.g., buildings) and their surroundings may appear incorrectly, e.g., tilted to one side. Furthermore, directional arrows that may be used in a viewing tool may point in a wrong direction. Thus, knowing the camera orientation relative to GPS and/or IMU sensors in a data collection vehicle can be important.
BRIEF SUMMARYEmbodiments of the invention relate to estimation of camera orientation relative to a vehicle coordinate frame. In one embodiment, a method for estimating orientation of a panoramic camera mounted on a vehicle may include determining an initial pose estimate of the vehicle based on global positioning system data, inertial measurement unit data, and wheel odometry data of the vehicle. The method may also include obtaining images from one or more runs of image data captured by the camera, the images each having an orientation. The method may further include processing image data from the images to obtain one or more tracks, where each track includes a sequence of matched feature points stemming from a same three-dimensional location. The method may also include determining, from the initial pose estimate and tracks, a correction parameter to correct the orientations of the images captured by the camera.
In another embodiment, a system for estimating orientation of a panoramic camera mounted on a vehicle is provided. The system may include a pose estimate module that generates an initial pose estimate of the vehicle based on global positioning system data, inertial measurement unit data, and wheel odometry data of the vehicle. The system may also include an image processing module that processes image data from one or more runs of image data captured by the camera to obtain one or more tracks, where each track includes a sequence of matched feature points stemming from a same three-dimensional location. The system may further include an optimizer module, in communication with the pose estimate module and the image processing module, that determines, from the initial pose estimate and tracks, a correction parameter to correct the orientations of the images. In an embodiment, the pose estimate module may be in communication with one or more vehicle databases having vehicle information such as global positioning system data, inertial measurement unit data, and wheel odometry data of the vehicle. In an embodiment, the image processing module may be in communication with one or more image databases having images and corresponding image data from the one or more runs of image data captured by the panoramic camera. In an alternative embodiment, the system can include the vehicle databases and/or the image databases.
In one embodiment, a computer program product includes a computer readable storage medium having control logic stored therein for causing a computer to estimate orientation of a panoramic camera mounted on a vehicle. The control logic may include a first computer readable program code that enables the computer to determine an initial pose estimate of the vehicle, the initial pose estimate based on global positioning system data, inertial measurement unit data, and wheel odometry data of the vehicle. The control logic may also include a second computer readable program code that enables the computer to obtain images from one or more runs of image data captured by the camera, the images each having an orientation. The control logic may further include a third computer readable program code that enables the computer to process image data from the images to obtain one or more tracks, where each track includes a sequence of matched feature points stemming from a same three-dimensional location. The control logic may also include a fourth computer readable program code that enables the computer to determine, from the initial pose estimate and tracks, a correction parameter to correct the orientations of the images captured by the camera.
Further embodiments, features, and advantages, as well as the structure and operation of the various embodiments, are described in detail below with reference to the accompanying drawings.
The accompanying drawings, which are incorporated herein and form part of the specification, illustrate the present invention and, together with the description, further serve to explain the principles of the invention and to enable a person skilled in the relevant art(s) to make and use the invention.
The features and advantages of the present invention will become more apparent from the detailed description set forth below when taken in conjunction with the drawings, in which like reference characters identify corresponding elements throughout. In the drawings, like reference numbers generally indicate identical, functionally similar, and/or structurally similar elements. The drawing in which an element first appears is indicated by the leftmost digit(s) in the corresponding reference number.
DETAILED DESCRIPTION OF EMBODIMENTSWhile the present invention is described herein with reference to illustrative embodiments for particular applications, it should be understood that the invention is not limited thereto. Those skilled in the art with access to the teachings provided herein will recognize additional modifications, applications, and embodiments within the scope thereof and additional fields in which the invention would be of significant utility.
It is noted that references in the specification to “one embodiment,” “an embodiment,” “an example embodiment,” etc., indicate that the embodiment described may include a particular feature, structure, or characteristic, but every embodiment may not necessarily include the particular feature, structure, or characteristic. Moreover, such phrases are not necessarily referring to the same embodiment. Further, when a particular feature, structure, or characteristic is described in connection with an embodiment, it would be within the knowledge of one skilled in the art to incorporate such a feature, structure, or characteristic in connection with other embodiments whether or not explicitly described.
Terminology
The following provides definitions for certain terms as used in this document:
Panoramic Camera—A panoramic camera is a camera, or system of cameras, that is configured or arranged to capture an image or view of an area in one or many directions. Examples of a panoramic camera may include a single camera, a polycamera (a tightly-packed cluster of cameras providing a large field of view), a camera rosette (outward-facing equally-spaced cameras forming a circle that provide an all-around view), a rotating (or rotating line) camera (a camera that is rotated to capture images in multiple directions), etc.
GPS—Global Positioning System—The Global Positioning System (GPS) is a navigational system using satellite signals to determine the location of a radio receiver on or above the earth's surface.
IMU—Inertial Measurement Unit—An Inertial Measurement Unit is a position-tracking sensor that senses motion in terms of type, rate, and direction using a combination of accelerometers and gyroscopes.
World Coordinates—World coordinates are from the world coordinate system, which is a coordinate system that is fixed with respect to the Earth.
Pose—A pose may be defined as a three-dimensional position (e.g., in the x, y, z coordinate system) with an orientation (or rotation) that is usually referred to using rotation coordinates (e.g., roll (φ), pitch (θ), and yaw (Ψ)). Therefore, a pose may be expressed in at least six dimensions: x, y, z, φ, θ, and Ψ. The pose of the vehicle may be defined as a position and orientation of the vehicle relative to the world.
Feature Matching—Feature matching provides correspondence between feature points and images. Detected features from different camera images are matched using their appearance to find corresponding sets of features. Each set of matching features is assumed to be produced by the same entity, which has a certain three-dimensional position in the world. Matched feature points may be grouped into one or more tracks, each track including a sequence of matched feature points stemming from a single three-dimensional location. Feature detection and matching may be used for image alignment (e.g., stitching), three-dimensional reconstruction, motion tracking, etc.
Overview
Embodiments of the present invention are related to panoramic photography via a panoramic camera that is mounted on a vehicle. The embodiments are directed to optimizing orientation of images obtained via such a camera. Because the alignment of the camera may not be ideal, and the physical orientation, position, and/or location of the camera may not align with those of other related data collection sensors, such as a Global Positioning System (GPS) and/or an Inertial Measurement Unit (IMU) sensor, images obtained via the camera may be improved by applying the embodiments described herein. For example, embodiments may include the ability to automatically estimate the orientation of a camera mounted on a data collection vehicle relative to the data collection sensors.
The following description is broken down into a discussion of environment, orientation estimation, further optimization of orientation correction, system architecture, and methods of orientation estimation and optimization.
Environment
In addition to panoramic camera 104, other sensors (not shown) may be used for data collection, such as, for example, a Global Positioning System (GPS) sensor and/or an Inertial Measurement Unit (IMU) sensor. These other sensors, as well as other related equipment, may be located in the trunk 108 of vehicle 102, or anywhere else within, on, or coupled to vehicle 102. In example 100 of
Generally, any three-dimensional object can be considered as having a six-dimensional pose. A pose may be defined as a three-dimensional position (e.g., in the x, y, z coordinate system) with an orientation (or rotation) that is usually referred to using rotation coordinates (e.g., roll (φ), pitch (θ), and yaw (Ψ)), such as coordinate set 413 shown in
The pose of a moving vehicle may be defined in the above-described manner, with the coordinates constantly changing as the vehicle moves in a three-dimensional space and along uneven terrain (e.g., hilly and/or windy roads). Sensors placed in or on vehicle 102, such as GPS and/or IMU sensors, may assist in determining a pose of vehicle 102. Generally, GPS sensors use satellite data to determine location, speed, direction, and time. IMU sensors generally include a combination of accelerometers and gyroscopes, and may determine position by sensing acceleration and rotational attributes. When used as part of a navigational view capturing system, these sensors may provide information to determine how the ground plane and the world coordinates relate to an image, or image panorama, that is captured by one or more cameras of panoramic camera 104. This information is used to correctly render a panoramic view, for example. If this information is not known or inaccurate, objects (e.g., buildings, trees, etc.) and their surroundings may appear incorrectly or skewed, e.g., tilted to one side and/or displaced upward or downward relative to ground 101. Correction may then require orientation adjustment of the raw and/or rendered images. With a potential additional problem of mounted camera 104 not being aligned in a straight manner, this adjustment becomes even more important.
The above-discussed problems may be corrected by the embodiments discussed in the following description.
Orientation Estimation and Optimization
Embodiments as described below rely on the assumption that relatively accurate estimates of vehicle pose, and images for all cameras of a panoramic camera 104 mounted on the vehicle, are available for an uninterrupted data collection interval or run. With this data, accurate rotational alignment between the panoramic camera 104 and GPS/IMU sensors that were used to obtain the vehicle pose estimates may be determined.
An embodiment may include an orientation estimating system 517, as shown in
According to an embodiment, the orientation estimating system 517 may receive vehicle pose data 519, that may include GPS, IMU, and the vehicle's wheel odometry data, and image data 521, that may include image data obtained by a panoramic camera 104 mounted on the vehicle. In an embodiment, the orientation estimating system 517 may determine vehicle pose estimates based on vehicle pose data 519, or alternatively, vehicle pose estimates may be provided to orientation estimating system 517 as part of the vehicle pose data 519. In an embodiment, the orientation estimating system 517 may determine image track data based on image data 521, using feature matching for example, or alternatively, image track data may be provided to orientation estimating system 517 as part of the image data 521. The orientation estimating system 517 may then determine one or more correction parameters 523 that may be applied to image data 521, for example, to provide correctly oriented views, as discussed in the following paragraphs.
According to an embodiment, the determination of one or more correction parameters 523 may be accomplished by applying an orientation estimation algorithm to the vehicle pose estimates and image track data obtained via feature matching. The orientation estimation algorithm may be used to improve vehicle pose estimates, estimates of the three-dimensional locations of the entities used for the feature matching, and estimates of the camera orientation relative to the GPS/IMU sensors that were used to obtain the vehicle's pose estimates.
The feature matching in the captured images may be accomplished using known feature matching techniques. For various embodiments, a set of features (e.g., scale-invariant salient points on an image, where a lot of texture is present) are detected in images captured by a panoramic camera. The detected features from different camera images (e.g., captured at different times) are matched using their appearance to find corresponding sets of features. Each set of matching features is assumed to be produced by the same entity, which has a certain three-dimensional position in the world. Matched feature points may be grouped into one or more tracks, each track including a sequence of matched feature points stemming from a single three-dimensional location. An example of a portion of the feature matching process is shown in
In an embodiment, the key parameters involved with the orientation estimation algorithm are vehicle pose (P), three-dimensional locations of the entities used for feature matching (X), and camera orientation relative to the GPS/IMU sensors (R). These parameters are demonstrated in
In an embodiment, the orientation estimation algorithm is based on Equation 800, shown in
F(P,X,R)=ΣtΣiρ((TPi,R(Xt)−IXt))2+λΣi(Pi−PESTi)2
where
-
- P=P1, P2, . . . , PN and represents a set of vehicle poses;
- Pi represents a pose of the vehicle at time i;
- X=X1, X2, . . . , XM and represents three-dimensional locations of track points in a scene;
- Xt represents a three-dimensional location of a track t in the scene;
- R represents the rotation of the camera;
- ρ denotes a robustifier function (e.g., a Cauchy robustifier);
- T represents projection;
- IXt represents a fixed location in a given image, where a feature corresponding to track point Xt was detected;
- λ represents a weight used to trade off strength of a first and a second term in F; and
- PESTi represents an initial or a previous pose estimate of the vehicle.
In an embodiment, Equation 800 may be used to determine estimates of vehicle pose P, entity locations X, and camera orientation R. Multiple iterations of Equation 800 may provide improved estimates of P, X, and R. Estimated camera orientation R may be applied to image data of images captured by the camera 104 to correct their orientation so that they may be more accurately viewed. For example, the camera orientation R may be applied to the image data at a point when the image is stitched.
In Equation 800, TPi, R(Xt)−IXt represents reprojection error and Pi−PESTi represents pose error. Reprojection error is a geometric error that corresponds to the image distance between a projected point and a measured point. It is used to quantify how closely an estimate of a three-dimensional point recreates the point's true projection. In
The minimization of the objective in Equation 800 can be performed with any standard non-linear optimization technique, such as but not constrained to Levenberg-Marquardt, Conjugate Gradient, or gradient descent methods.
Assuming an accurate set of initial vehicle pose estimates determined using GPS, IMU, and wheel odometry data is used, the orientation estimation algorithm described above provides the rotation between the initial vehicle pose estimates (dependent on the GPS/IMU coordinate systems) and the poses that minimize reprojection error (dependent on the coordinate system of the camera).
In the embodiments described above, a camera rotation correction parameter is determined for a particular portion, or subsequence, of a run. The quality of the result, however, is dependent on the quality of the initial vehicle pose estimate that one may determine from wheel odometry data and the data from GPS and IMU sensors. If the initial vehicle pose estimate is inaccurate due to errors in the GPS, IMU, and/or wheel odometry inputs, the camera rotation correction parameter may also be inaccurate. The following section discusses ways to make the camera rotation correction parameter more robust in accordance with various embodiments.
Further Optimization of Orientation Correction
In the above-described embodiments, a camera rotation correction parameter is determined for a single subsequence of a given, or selected, run. According to one embodiment, the rotation correction parameter may be made more robust by analyzing multiple subsequences of a selected run. For example, in one embodiment, multiple rotation correction parameters may be determined, as described above, for a multitude of subsequences of a selected run, and statistical analysis, possibly with outlier removal, may be performed on the determined correction parameters to determine an optimized correction parameter. For example, in an embodiment, a median of the determined correction parameters may be determined and used as an optimized correction parameter.
Optionally, from a multitude of determined correction parameters, correction parameters that appear to be very different from the rest may be ignored or removed from the analysis. As an example, in one embodiment, correction parameters may be ignored for subsequences of the selected run in which the acceleration of the vehicle is above a predetermined value or outside of a given range. As a further example, in one embodiment, correction parameters may be ignored for subsequences of the selected run in which the vehicle is moving outside of a predetermined velocity range. In yet another example, in one embodiment, a cost function may be used, possibly within a dynamic programming algorithm that chooses subsequences of the selected run that abide with one or more given rules (e.g., having a vehicle acceleration that is within a given range, having a vehicle velocity that is within a given range, etc.).
In an embodiment, the above optimization may be accomplished using information gathered over multiple runs. In one embodiment, for example, multiple rotation correction parameters may be determined, as described above, for a multitude of subsequences of multiple runs. Statistical analysis, possibly with outlier removal, may be performed on the determined correction parameters to determine an optimized correction parameter, as previously described above.
In an embodiment, an optimized rotation correction parameter R may be determined by, for each run of a multitude of runs, determining a first median of the determined rotation correction parameters for each of the closest Z runs backward in time, and determining a second median of the determined rotation correction parameters for each of the closest Z runs forward in time. Either the first or second median may be chosen as the optimized rotation correction parameter based on which of the first or second median is closest to the rotation correction parameter determined for that run.
The analysis and determination of optimized rotation correction parameters can be done by orientation estimating system 517, described previously.
System Architecture
Systems 1117A and 1117B may include a pose estimate module 1174, an image processing module 1176, and an optimizer module 1178. In an embodiment, each of the pose estimate module 1174, image processing module 1176, and optimizer module 1178 may include one or more processors of one or more computing devices, such as computing device 541 shown in
With reference to
According to an embodiment, image processing module 1176 may receive or obtain images and related image data 521 obtained from a panoramic camera 104 mounted on the vehicle. Image processing module 1176 may conduct feature matching based on images and related image data 521 to determine image track data 1182 related to the three-dimensional locations of the entities used for feature matching.
According to an embodiment, optimizer module 1178 may determine, for example using Equation 800 defined above, estimates of vehicle pose (P), three-dimensional location of the entities used for feature matching (X), and a rotation correction parameter (R) based on pose estimates 1180 and image track data 1182. The P, X, and R estimates 1184 may be output for use by another system (not shown) or stored in a data store or database (not shown).
System 1117B is similar to system 1117A, except that system 1117B includes one or more databases for the vehicle data and one or more databases for the image information as part of the camera orientation estimating system. In an embodiment, vehicle database 1170 may include vehicle-related data 1162 for a particular vehicle, such as wheel odometry data, GPS-related data, and IMU-related data. Pose estimate module 1174 may use vehicle-related data 1162 to determine pose estimates 1180 for the vehicle.
In an embodiment, image database 1172 may include images and related image data 1164 obtained from a camera 104 mounted on the vehicle. Image processing module 1176 may conduct feature matching based on images and related image data 1164 that it receives from image database 1172 to determine image track data 1182 related to the three-dimensional locations of the entities used for feature matching.
As previously described, optimizer module 1178 of system 1117B may determine, for example using Equation 800 defined above, estimates of vehicle pose (P), three-dimensional location of the entities used for feature matching (X), and a rotation correction parameter (R) based on pose estimates 1180 and image track data 1182. In an embodiment, the P, X, and R estimates 1184 may be stored in a data store, such as image database 1172, for example, or another storage location (not shown). In another embodiment, the P, X, and R estimates 1184 may be output as shown in
Rotation correction parameter R may be applied to the images stored in image database 1172 for accurate viewing. In an embodiment, the application of rotation correction parameter R to a particular image may be done, for example, via a computer system, or processing module (such as an orientation correction module 1186 shown in
Methods
In an embodiment, an example of step 1414 is shown in step 1516 of the flowchart in
In an embodiment, an example of step 1620 is shown in the flowchart in
Exemplary Computer System
The various embodiments described herein may be implemented using hardware, software or a combination thereof and may be implemented in a computer system or other processing system. In an embodiment, the invention is directed toward a computer program product executing on a computer system capable of carrying out the functionality described herein. An example of a computer system 1800 is shown in
Computer system 1800 (optionally) includes a display interface 1802 (which can include input/output devices such as keyboards, mice, etc.) that forwards graphics, text, and other data from communication infrastructure 1806 (or from a frame buffer not shown) for display on display unit 1830.
Computer system 1800 also includes a main memory 1808, preferably random access memory (RAM), and may also include a secondary memory 1810. The secondary memory 1810 may include, for example, a hard disk drive 1812 and/or a removable storage drive 1814, representing a floppy disk drive, a magnetic tape drive, an optical disk drive, etc. The removable storage drive 1814 reads from and/or writes to a removable storage unit 1818 in a well-known manner. Removable storage unit 1818, represents a floppy disk, magnetic tape, optical disk, memory card, etc. which is read by and written to by removable storage drive 1814. As will be appreciated, the removable storage unit 1818 includes a computer readable storage medium having stored therein computer software and/or data.
In alternative embodiments, secondary memory 1810 may include other similar means for allowing computer programs or other instructions to be loaded into computer system 1800. Such means may include, for example, a removable storage unit 1822 and an interface 1820. Examples of such may include a program cartridge and cartridge interface (such as that found in video game devices), a removable memory chip (such as an EPROM, or PROM) and associated socket, and other removable storage units 1822 and interfaces 1820 which allow software and data to be transferred from the removable storage unit 1822 to computer system 1800.
Computer system 1800 may also include a communication interface 1824. Communication interface 1824 enables computer 1800 to communicate with external and/or remote devices. For example, communication interface 1824 allows software and data to be transferred between computer system 1800 and external devices. Communication interface 1824 also allows computer 1800 to communicate over communication networks, such as LANs, WANs, the Internet, etc. Communication interface 1824 may interface with remote sites or networks via wired or wireless connections. Examples of communications interface 1824 may include a modem, a network interface (such as an Ethernet card), a communications port, a PCMCIA slot and card, etc. Computer 1800 receives data and/or computer program products via communication network 1824. Software and data transferred via communications interface 1824 are in the form of signals 1828 which may be electronic, electromagnetic, optical or other signals capable of being received by communications interface 1824. These signals 1828 are provided to communications interface 1824 via a communications path (i.e., channel) 1826. This channel 1826 carries signals 1828 and may be implemented using wire or cable, fiber optics, a phone line, a cellular phone link, an RF link and other wired or wireless communications channels.
In this document, the terms “computer program medium” and “computer usable medium” and “computer readable medium” are used to generally refer to media such as removable storage drive 1814, and a hard disk installed in hard disk drive 1812. These computer program products are means for providing software to computer system 1800.
Computer programs (also called computer control logic) are stored in main memory 1808 and/or secondary memory 1810. Computer programs may also be received via communications interface 1824. Such computer programs, when executed, enable the computer system 1800 to perform the features of the present invention as discussed herein. In particular, the computer programs, when executed, enable the processor 1804 to perform the features of the present invention. Accordingly, such computer programs represent controllers of the computer system 1800.
In an embodiment implemented using software, the software may be stored in a computer program product and loaded into computer system 1800 using removable storage drive 1814, hard disk drive 1812 or communications interface 1824. The control logic (software), when executed by the processor 1804, causes the processor 1804 to perform the functions of the invention as described herein.
The invention can work with software, hardware, and operating system implementations other than those described herein. Any software, hardware, and operating system implementations suitable for performing the functions described herein can be used.
Conclusion
The present invention has been described above with the aid of functional building blocks illustrating the implementation of specified functions and relationships thereof. The boundaries of these functional building blocks have been arbitrarily defined herein for the convenience of the description. Alternate boundaries can be defined so long as the specified functions and relationships thereof are appropriately performed.
The foregoing description of the specific embodiments will so fully reveal the general nature of the invention that others can, by applying knowledge within the skill of the art, readily modify and/or adapt for various applications such specific embodiments, without undue experimentation and without departing from the general concept of the present invention. Therefore, such adaptations and modifications are intended to be within the meaning and range of equivalents of the disclosed embodiments, based on the teaching and guidance presented herein. It is to be understood that the phraseology or terminology herein is for the purpose of description and not of limitation, such that the terminology or phraseology of the present specification is to be interpreted by the skilled artisan in light of the teachings and guidance.
The breadth and scope of the present invention should not be limited by any of the above-described exemplary embodiments, but should be defined only in accordance with the following claims and their equivalents.
The Summary and Abstract sections may set forth one or more but not all exemplary embodiments of the present invention as contemplated by the inventor(s), and thus, are not intended to limit the present invention and the appended claims in any way.
Further, the purpose of the foregoing Abstract is to enable the U.S. Patent and Trademark Office, the public generally, and especially the scientists, engineers and practitioners in the art who are not familiar with patent or legal terms or phraseology, to determine quickly from a cursory inspection the nature and essence of the technical disclosure of the application. The Abstract is not intended to be limiting as to the scope of the present invention in any way.
Claims
1. A method for estimating orientation of cameras mounted on vehicles, comprising:
- determining, by one or more processors, an initial pose estimate of a vehicle having a camera mounted thereon, the initial pose estimate being a position and orientation of the vehicle relative to the world;
- processing, by the one or more processors, image data from images captured by the camera to obtain one or more tracks, each track including a sequence of matched feature points stemming from a single three-dimensional location;
- determining, by the one or more processors, from the initial pose estimate and the tracks, a correction parameter to correct orientations of the images captured by the camera for each of a plurality of subsequences of one or more runs, wherein determining the correction parameter, for each plurality of subsequences of one or more runs, includes evaluating a set of vehicle poses, three-dimensional locations from the one or more tracks, and an orientation of the camera relative to one or more sensors of the vehicle; and
- performing statistical analysis on the determined correction parameters for each plurality of subsequences to determine a median value of the determined correction parameters.
2. The method of claim 1, wherein determining the correction parameter is done such that a reprojection error is minimized.
3. The method of claim 1, wherein the correction parameter is a camera rotation correction parameter that is determined for a particular portion of an uninterrupted data collection interval.
4. The method of claim 1, wherein the set of vehicle poses is obtained from location data, inertial measurement data, and wheel odometry data of the vehicle.
5. The method of claim 1, further comprising correcting orientations of the images by applying the correction parameter to the image data.
6. The method of claim 5, wherein the determining the correction parameter includes determining:
- an optimized pose of the vehicle;
- a location of points of the tracks in three dimensions; and
- a camera to vehicle pose rotation.
7. The method of claim 1, the median value of the determined correction parameters is the optimized correction parameter.
8. The method of claim 7, wherein determining the median value omits determined correction parameters for subsequences of the plurality of subsequences of one or more runs in which acceleration of the vehicle is above a predetermined value or outside of a predetermined range.
9. The method of claim 7, wherein determining the median value omits determined correction parameters for subsequences of the plurality of subsequences of one or more runs in which the vehicle is moving outside of a predetermined velocity range.
10. The method of claim 1, wherein determining the correction parameter for each of the plurality of subsequences of the one or more runs includes performing a cost function to chooses subsequences of the selected run that conform to a predetermined rule.
11. The method of claim 1, wherein the determining the median value of the determined correction parameters includes:
- for each run of the one or more runs, determining a first median of the correction parameters for each of a closest set of runs backward in time and determining a second median of the correction parameters for each of a closest set of runs forward in time; and
- selecting either the first median or the second median as the optimized correction parameter based on which of the first or second median is closest to the correction parameter determined for that run.
12. A system, comprising:
- one or more processors configured to:
- determine an initial pose estimate of a vehicle having a camera mounted thereon, the initial pose estimate being a position and orientation of the vehicle relative to the world;
- process image data from images captured by the camera to obtain one or more tracks, each track including a sequence of matched feature points stemming from a single three-dimensional location;
- determine, from the initial pose estimate and the tracks, a correction parameter to correct orientations of the images captured by the camera for each of a plurality of subsequences of one or more runs, wherein determining the correction parameter, for each plurality of subsequences of one or more runs, includes evaluating a set of vehicle poses, three-dimensional locations from the one or more tracks, and an orientation of the camera relative to one or more sensors of the vehicle; and
- perform statistical analysis on the determined correction parameters for each plurality of subsequences to determine a median value of the determined correction parameters.
13. The system of claim 12, wherein the one or more processors are further configured to correct orientations of the images by applying the correction parameter to the image data.
14. The system of claim 13, wherein the one or more processors are configured to determine the correction parameter by determining:
- an optimized pose of the vehicle;
- a location of points of the tracks in three dimensions; and
- a camera to vehicle pose rotation.
15. The system of claim 13, wherein:
- the one or more processors are configured to determine the correction parameter for each plurality of subsequences of one or more runs; and
- the one or more processors are further configured to perform statistical analysis on the determined correction parameters for different subsequences to determine an optimized correction parameter.
16. The system of claim 13, wherein:
- the one or more processors are configured to determine the correction parameter for each plurality of subsequences of one or more runs; and
- the one or more processors are further configured to perform statistical analysis on the determined correction parameters across the one or more runs to determine an optimized correction parameter.
17. The system of claim 16, wherein performing the statistical analysis by the one or more processors includes:
- for each of the one or more runs, determining a first median of the correction parameters for each of a closest set of runs backward in time and determining a second median of the correction parameters for each of a closest set of runs forward in time; and
- selecting either the first median or the second median as the optimized correction parameter based on which of the first or second median is closest to the correction parameter determined for that run.
18. A non-transitory computer-readable storage medium on which computer readable instructions of a program are stored, the instructions, when executed by one or more processors, cause the one or more processors to perform a method for estimating orientation of cameras mounted on vehicles, the method comprising:
- determining an initial pose estimate of a vehicle having a camera mounted thereon, the initial pose estimate being a position and orientation of the vehicle relative to the world;
- processing image data from images captured by the camera to obtain one or more tracks, each track including a sequence of matched feature points stemming from a single three-dimensional location;
- determining from the initial pose estimate and the tracks, a correction parameter to correct orientations of the images captured by the camera for each of a plurality of subsequences of one or more runs, wherein determining the correction parameter, for each plurality of subsequences of one or more runs, includes evaluating a set of vehicle poses, three-dimensional locations from the one or more tracks, and an orientation of the camera relative to one or more sensors of the vehicle; and
- performing statistical analysis on the determined correction parameters for each plurality of subsequences to determine a median value of the determined correction parameters.
5166878 | November 24, 1992 | Poelstra |
5517419 | May 14, 1996 | Lanckton et al. |
6535114 | March 18, 2003 | Suzuki et al. |
6594600 | July 15, 2003 | Arnoul et al. |
6993450 | January 31, 2006 | Takemoto et al. |
8698875 | April 15, 2014 | Anguelov et al. |
20040230375 | November 18, 2004 | Matsumoto et al. |
20060012493 | January 19, 2006 | Karlsson et al. |
20070288141 | December 13, 2007 | Bergen et al. |
20080089556 | April 17, 2008 | Salgian et al. |
20080167814 | July 10, 2008 | Samarasekera et al. |
20090201361 | August 13, 2009 | Lyon et al. |
20090244646 | October 1, 2009 | Kondo et al. |
20090285450 | November 19, 2009 | Kaiser et al. |
20100118116 | May 13, 2010 | Tomasz et al. |
20100208057 | August 19, 2010 | Meier et al. |
9532483 | November 1995 | WO |
- Chen, T., and Shibasaki, R., “A Versatile AR Type 3D Mobile GIS Based on Image Navigation Technology” IEEE Xplore, 1999, Downloaded Nov. 10, 2008, 6 pgs.
- Khan et al., “Camera Calibration for a Robust Omni-directional Photogrammetry System,” Internet Citation, May 31, 2007, 8 pages.
- Newman, J., et al., “Augmented Reality in a Wide Area Sentient Environment”, Proceedings of the IEEE and ACM International Symposium on Augmented Reality (ISAR'01), IEEE Computer Society, Washington D.C., 2001, Downloaded from IEEE Xplore Jul. 13,2009, 10 pgs.
- Taylor, Camillo J., “VideoPlus: A Method for Capturing the Structure and Appearance of Immersive Environments”, IEEE Transactions on Visualization and Computer Graphics 2:171-182, IEEE Computer Society, Washington D.C., 2002, 13 pgs. cited byapplicant.
- The International Search Report cited in International Application No. PCT/US2010/024762, dated May 27, 2010, 7 pages.
- The Written Opinion of the International Search Authority cited in International Application No. PCT/US2010/024762, dated May 27, 2010, 8 pages.
- Wagner, D., et al., “Pose Tracking from Natural Features on Mobile Phones”, 7.sup.th IEEE International Symposium on Mixed and Augmented Reality Systems, IEEE Computer Society, Washington D.C., 2008, 10 pgs.
- Xu, W., et al., “Recording Real Worlds for Playback in a Virtual Exercise Environment”, Technical Report CU-CS 1013-06, University of Colorado: Department of Computer Science, Boulder, Colorado, 2006, 12 pgs.
- Zhu et al., “Precise Visual Navigation Using Multi-Stereo Vision and Landmark Matching,” Proc. of SPIE, vol. 6561, (2007), 12 pages.
Type: Grant
Filed: Mar 12, 2014
Date of Patent: Feb 23, 2016
Patent Publication Number: 20140192145
Assignee: Google Inc. (Mountain View, CA)
Inventors: Dragomir Anguelov (San Francisco, CA), Daniel Joseph Filip (San Jose, CA)
Primary Examiner: Thao X Le
Assistant Examiner: Long Le
Application Number: 14/206,577
International Classification: H04N 5/232 (20060101); H04N 5/262 (20060101); G06T 7/00 (20060101);