Correcting path errors for vehicle navigation
An apparatus and method are disclosed for performing loop closing on one or more paths to be optimized. The paths may include poses associated with imagery obtained by a vehicle. The apparatus may identify candidate intersections from the paths based on their proximity, and may further determine relative poses from the poses of the paths using structure-from-motion techniques. The apparatus may then apply a partitioning schema to the paths to be optimized to obtain individual partition cells. The partition cells may then be sent to one or more client devices for optimizing the paths included in the partition cells. When the apparatus receives a set of optimized paths from the client devices, the apparatus may re-partition the paths to be optimized to ensure that non-optimized portions of paths are optimized.
Latest Google Patents:
Computer vision techniques have been employed to map or analyze the trajectory of an object as it travels along a path. A loop closing process may be used to correct large scale, low-frequency errors in paths by applying constraints that link parts of the path together at path intersections, namely where the path loops back on itself. However, large quantities of images and other data make such techniques challenging to implement in large scale systems in a time-sensitive manner.
BRIEF SUMMARYThis disclosure provides for a system for performing loop closing. In one embodiment, the system includes a computer-readable memory that stores a plurality of paths to be optimized, wherein each path of the plurality of paths comprises one or more poses associated with a mobile apparatus. The system may also include a processor in communication with the computer-readable memory, where the processor may be configured to identify a plurality of candidate intersections from the one or more poses, determine a plurality of relative poses from the plurality of identified candidate intersections, and apply a partitioning schema to the plurality of paths to obtain one or more partition cells, wherein each one of the plurality of paths is assigned to at least one of the one or more partition cells.
The processor may also be configured to send the one or more partition cells to one or more client devices for optimizing the paths assigned to the one or more partition cells, wherein the client device uses at least one relative pose from the plurality of relative poses as a constraint in optimizing the paths assigned to the one or more partition cells, and receive optimized paths from the client device, the optimized paths being based on at least one relative pose from the plurality of relative poses assigned to the one or more partition cells sent to the client device.
In another embodiment of the apparatus, the processor may be further configured to identify a candidate intersection from the plurality of candidate intersections based on the proximity of a first pose from a first one of the plurality of paths to a second pose from a second one of the plurality of paths.
In a further embodiment of the apparatus, the processor may be further configured to determine a given one of the plurality of relative poses by applying a structure-from-motion technique to a first image associated with a first one of the plurality of paths and a second image associated with a second one of the plurality of paths.
In yet another embodiment of the apparatus, the processor may be further configured to determine a boundary stabilizer for a portion of a given one of the paths assigned to the at least one partition cell, the boundary stabilizer identifying the portion of the path that is not to be optimized by the client device.
In yet a further embodiment of the apparatus, the boundary stabilizer may identify a location where the portion of the given path crosses a boundary of the partition cell to which the path is assigned.
In another embodiment of the apparatus, the boundary stabilizer may include a pair of geolocation coordinates.
In a further embodiment of the apparatus, the partitioning schema may be based on computing resources available to the one or more client devices.
In yet another embodiment of the apparatus, the processor may be further configured to re-apply the partitioning schema to the plurality of paths to be optimized based on whether non-optimized portions of the paths assigned to the one or more partition cells remain after optimization by the one or more client devices.
In yet a further embodiment of the apparatus, the processor may be further configured to re-determine the boundaries of the one or more partition cells based on whether non-optimized portions of the paths assigned to the one or more partition cells remain after optimization by the one or more client devices.
In another embodiment of the apparatus, the processor may be further configured to iteratively apply the partitioning schema to the plurality of paths to obtain the one or more partition cells and to send the one or more partition cells to the one or more client devices for optimizing the paths assigned to the one or more partition cells until a predetermined termination condition is met.
This disclosure also provides a method for performing loop closing. In one embodiment, the method may include identifying a plurality of candidate intersections from one or more poses of a plurality of paths to be optimized, and determining, with a processor, a plurality of relative poses from the plurality of identified candidate intersections. The method may also include applying, with the processor, a partitioning schema to the plurality of paths to obtain one or more partition cells, wherein each of the plurality of paths is assigned to at least one of the one or more partition cells. The method may further include sending, with the processor, the one or more partition cells to a client device for optimizing the paths assigned to the one or more partition cells, and receiving optimized paths from the client device, at least one of the optimized paths being based on at least one relative pose from the plurality of poses.
In another embodiment of the method, identifying the plurality of candidate intersections may include identifying a candidate intersection from the plurality of candidate intersections based on the proximity of a first pose from a first path of the plurality of paths to a second pose from a second path of the plurality of paths.
In a further embodiment of the method, determining the plurality of relative poses may include determining a relative pose from the plurality of relative poses by applying a structure-from-motion technique to a first image associated with a first path of the plurality of paths and a second image associated with a second path of the plurality of paths.
In yet another embodiment of the method, the method includes determining, with the processor, a boundary stabilizer for a portion of a path of the paths assigned to the at least one partition cell, the boundary stabilizer identifying a portion of the path that is not to be optimized by the client device.
In yet a further embodiment of the method, the boundary stabilizer may identify a location where the portion of the path crosses a boundary of the partition cell to which the path is assigned.
In another embodiment of the method, the boundary stabilizer may include a pair of geolocation coordinates.
In a further embodiment of the method, the partitioning schema of the plurality of paths may be based on computing resources available to the client device.
In yet another embodiment of the method, the method may include re-applying, with the processor, the partitioning schema to the plurality of paths to be optimized based on whether non-optimized portions of the paths assigned to the one or more partition cells remain after optimization by the client device.
In yet a further embodiment of the method, the method may include re-determining, with the processor, the boundaries of the one or more partition cells based on whether non-optimized portions of the paths assigned to the one or more partition cells remain after optimization by the client device.
In another embodiment of the method, the method may include iteratively performing, with the processor, the applying of the partitioning schema to the plurality of paths to obtain the one or more partition cells and the sending of the one or more partition cells to a client device for optimizing the paths assigned to the one or more partition cells until a predetermined termination condition is met.
The accompanying drawings are not intended to be drawn to scale. In the drawings, each identical or nearly identical component that is illustrated in various figures is represented by a like numeral. For purposes of clarity, not every component may be labeled in every drawing. In the drawings:
This disclosure provides for systems and methods to perform path optimization, for instance using structure from motion (“SFM”) constraints. According to one aspect, large-scale path optimization and coordination of path optimization between path optimizing client devices and a path server are provided.
In an embodiment of the disclosed system, a path server may receive one or more paths to be optimized. The one or more paths may include one or more “poses” of a vehicle. Each pose includes attributes that define the three-dimensional position and orientation of a vehicle-mounted camera at a particular time. The pose of the camera-mounted vehicle may be determined using various techniques, such as using the Global Positioning System (“GPS”) with inertial constraints and odometry sensors. Imagery obtained by the vehicle may be associated with one or more of the poses of a vehicle path. In one embodiment, the imagery may include a street-level image of a geographic location. The street-level image may be a raw camera image, a camera point cloud image, a laser point cloud image, or any other type of street-level image.
In one embodiment, the server may perform “loop closing” on one or more of the paths to be optimized. Loop closing is the process of correcting large scale, low-frequency errors in paths by applying constraints that link parts of the path together at intersections, namely where the path loops back on itself. For loop closing, the path server may initially determine candidate intersections for the one or more paths to be optimized. A candidate intersection may be an intersection where one or more of the paths intersect. The path server may then determine a rigid transform that establishes a relationship (e.g., translation and/or rotation) between the one or more paths of the candidate intersections. The path server may also determine various constraints (e.g., gravity constraints, odometry constraints, geolocation constraints) for optimizing the one or more paths. The path server may use SFM techniques to determine the constraints and/or rigid transform for one or more of the paths.
At this point, the one or more paths may be ready for optimization. Accordingly, the path server may determine a partitioning schema by which to divide the one or more paths into partition cells for optimization. Based on the partitioning schema, the path server may communicate partition cells to one or more path optimizing client devices to perform the optimization. As discussed below, a partition cell may include various types of data, into the paths to be optimized, path constraints, and boundary stabilizers. When an optimization is complete, a path optimizing client device may send an optimized set of paths to the path server.
To ensure that the entire set of paths has been optimized, the path server may continue iterating over the set of paths in this fashion (i.e., sending paths to be optimized to one or more path optimizing client devices) until a predetermined condition is met, such as receiving a message from one or more of the path optimizing clients that there is no further optimization to be performed, that the path server has performed a predetermined number of iterations on the paths to be optimized, or any other predetermined condition.
The path server 104 and the path optimizing clients 106-112 may communicate via a network 114. The network 114 may be implemented as any combination of wired or wireless networks. For example, and without limitation, the network 114 may include a Wide Area Network (“WAN”), such as the Internet; a Local Area Network (“LAN”); a Personal Area Network (“PAN”), or a combination of WANs, LANs, and PANs. Moreover, the network 114 may include the use of one or more wired protocols, such as the Simple Object Access Protocol (“SOAP”); wireless protocols, such as 802.11a/b/g/n, Bluetooth, or WiMAX; transport protocols, such as TCP or UDP; an Internet layer protocol, such as IP; application-level protocols, such as HTTP, a combination of any of the aforementioned protocols, or any other type of network protocol now known or later developed.
In communicating with the path optimizing clients 106-112, the path server 104 may send to, and receive from, the path optimizing clients 106-112 various types and amounts of data 116-118. As introduced above and with further details in reference to
When optimization on a partition cell is complete, the path server 104 may receive optimized paths from a corresponding path optimizing client. However, receiving optimized paths for a given partition cell 112 may not necessarily mean that the optimization of all the paths is complete. As discussed with reference to
The path server 104 may include various instructions and data for determining constraints of the paths to be optimized and for performing grid partitioning on those paths.
The computer-readable memory 206 may store information accessible by the processor 204. For example, the computer-readable memory 206 may store instructions 210 and data 212. The instructions 210 and/or data 212 may be executed or otherwise used by the processor 204 to identify candidate intersections from the paths to be optimized and determine constraints and/or rigid transformations for the paths associated with the candidate intersections. The instructions 210 and/or data 212 may also be used to partition the paths to be optimized into partition cells for optimization by the path optimizing clients 106-112.
In general, the computer-readable memory 206 may be of any type of computer-readable memory operative to store information accessible by the processor 204, including a computer-readable medium, or other medium that stores data that may be read with the aid of an electronic device. Examples of the computer-readable memory 206 include, but are not limited to, a hard-drive; a memory card; non-volatile memory, such as non-volatile random-access memory (“NVRAM”), read-only memory (“ROM”), or other non-volatile memory; volatile memory, such as dynamic random-access memory (“DRAM”) or other random-access memory (“RAM”); as well as other write-capable and read-only memories. Systems and methods may include different combinations of the foregoing, whereby different portions of the instructions and data are stored on different types of media.
The processor 204 may be any conventional processor, including Reduced Instruction Set Computing (“RISC”) processors, Complex Instruction Set Computing (“CISC”) processors, Advanced RISC Machine (“ARM”) processors, or combinations of the foregoing. Alternatively, the processor may be a dedicated device such as an applicant-specific integrated circuit (“ASIC”).
The communication interface 208 may be any type of communication interface, or combination of communication interfaces, configured to receive the paths to be optimized, communicate with the path optimizing clients 106-112, and provide the paths that were optimized by the ath optimizing clients 106-112. Examples of communication interfaces include tactile interfaces, such as keyboards, mice and touchscreen displays; non-tactile interfaces, such as a speaker or microphone; data interfaces, such as an IEEE 1394 (“Firewire”) interface or Universal Serial Bus (“USB”) interface; wired or wireless networking interfaces, such as an Ethernet interface or an IEEE 802.11 interface, and other such communication interfaces or combination of communication interfaces.
Although
Accordingly, references to a processor, computer or memory will be understood to include references to a collection of processors, computers or memories that may or may not operate in parallel. Rather than using a single processor to perform the acts described herein, some of the components, such as the communication interface(s) 208, may each have their own processor that only performs calculations related to the component's specific function.
In addition, the instructions 210 may be any set of instructions that may be executed directly (such as machine code) or indirectly (such as scripts) by the processor 204. For example, the instructions 210 may be stored as computer code on the computer-readable medium. In that regard, the terms “instructions” and “programs” may be used interchangeably herein. The instructions 210 may be stored in object code format for direct processing by the processor 204, or in any other computer language including scripts or collections of independent source code modules that are interpreted on demand or compiled in advance. Functions, methods and routines of the instructions 210 are explained in more detail below.
In one embodiment, the instructions 210 may include one or more instructions for optimizing and partitioning paths including intersection identification instructions 214, feature matching instructions 216, partitioning schema instructions 218, and boundary identification instructions 220. Although
The path server 104 may also include the data 212 for optimizing and partitioning paths. The data 212 may be retrieved, stored, or modified by the processor 204 in accordance with the instructions 210. For instance, although the disclosed embodiments are not limited by any particular data structure, the data 212 may be stored in computer registers, in a relational database as a table having a plurality of different fields and records, XML documents, flat files, or in any computer-readable format. By further way of example only, image data may be stored as bitmaps comprised of grids of pixels that are stored in accordance with formats that are compressed or uncompressed, lossless (e.g., BMP) or lossy (e.g., JPEG), and bitmap or vector-based (e.g., SVG), as well as computer instructions for drawing graphics. The data 212 may comprise any information sufficient to identify the relevant information, such as numbers, descriptive text, proprietary codes, references to data stored in other areas of the same memory or different memories (including other network locations) or information that is used by a function to calculate the relevant data.
In one embodiment, the data 212 may include the one or more paths to be optimized 222. The paths to be optimized 222 may include one or more poses of the vehicle. Each of the poses may also include geolocation coordinates, such as a latitude and longitude coordinate, which identify a geographic location for a given pose. A pose may also be associated with one or more images that were obtained by the vehicle of the geographic location associated with the pose. As discussed previously, the images may include raw camera images, laser point cloud images, radar intensity images, and other such images. The images associated with a pose may also be street-level images of the geographic location associated with the pose.
In one embodiment, the path server 104 may be in communication with the vehicle, and may receive the paths to be optimized 306-312 from the vehicle. In another embodiment, the path server 104 may receive the paths to be optimized 306-312 from a human operator or from another system in communication with the path server 104. Combinations of the foregoing are also possible.
The path server 104 may execute the intersection identification instructions 214 with the processor 204 to identify candidate intersections from the paths to be optimized 306-312.
In one embodiment, poses of the one or more paths to be optimized 306-312 may be identified as candidate intersections based on one or more intersection characteristics, which may include the distance between poses, the latitudinal proximity of poses, the longitudinal proximity of poses, and other such characteristics. For example, the path server 104 may identify poses as candidate intersections when poses of one or more paths are relatively proximate to one another, such as being within twenty meters, ten meters, 100 meters, or other distance amounts. As another example, the path server 104 may identify poses as candidate intersections when the poses are within a threshold degree of latitude or longitude.
In one embodiment, the path server 104 may determine candidate intersections for any pose from any vehicle path. In other words, the path server 104 may determine candidate intersections from a first path and a second path regardless of whether the first path and the second path travel the same, or nearly the same, route. In this regard, the path server 104 may identify candidate intersections where the paths intersect or overlap.
In another embodiment, the path server 104 may determine candidate intersections for poses within paths that have travelled the same, or substantially the same, route. In this embodiment, the path server 104 may identify candidate intersections from two paths, e.g., path1 and path2, which are substantially similar, but where the vehicle traveled these paths at different times, e.g., path1 at time t1, where t1 is 12:00 P.M. on January 1st, and path2 at time t2, where t2 is 4:00 P.M. on March 3rd. Path1 may include poses (e.g., location, orientation, etc.) that are similar to the poses of path2.
Having determined candidate intersections from the one or more paths to be optimized 306-312, the path server 104 may then determine a relative pose that relates poses of candidate intersections. The relative pose may specify the relative three-dimensional translation and/or three-dimensional rotation between poses. Moreover, the relative pose for a given set of candidate intersections may be used as a constraint in the path optimization performed by the path optimizing clients 106-112.
In one embodiment, the path server 104 may execute the feature matching instructions 216 to determine the relative poses of the candidate intersections.
In one embodiment, the path server 104 may leverage the imagery associated with the poses of the candidate intersections to determine the relative poses. For example, where imagery is available at predetermined intervals along the corresponding paths of the poses of the candidate intersections, the path server 104 may determine the relative poses from the associated imagery. In this example, the path server 104 may determine the relative pose for a candidate intersection by performing feature matching on the poses of the candidate intersection, and then determining a three-dimensional relative pose that is consistent with the matched features.
A three-dimensional relative pose may be considered consistent when the translation and/or orientation of the relative pose aligns the matched features of the images associated with the poses within a threshold amount, which may be measured in distance (e.g. millimeters, centimeters, etc.) or degrees (e.g., one degree, two degrees, etc.). Examples of features that may be matched from the imagery may include a building or portion thereof (e.g., the corner of the building), a roadsign or portion thereof (e.g., lettering, edges, or other features of the roadsign), and other such features that are common between the imagery of the poses.
In addition, the determination of a relative pose may validate whether a pair of poses are valid for a candidate intersection. For example, where a relative pose cannot be determined for a given pair of poses from a candidate intersection, such as where the imagery associated with the poses do not contain matching features, the failure to determine the relative pose may indicate that the poses are not valid for the candidate intersection. A failure to determine the relative pose may further indicate that the imagery from the poses of the candidate intersection will not align (i.e., the imagery from the poses are too different or do not have matching features). Thus, while the path server 104 may initially determine that a pair of poses meet a predetermined threshold for a candidate intersection (e.g., the pair of poses are within a threshold distance), the imagery associated with the poses may indicate that the poses are not valid for optimization by the path optimizing clients 106-112.
The path server 104 may determine the relative poses using structure-from motion methods that vary according to the type of imagery associated with the poses of the candidate intersections. For example, where the imagery of the poses is obtained with a shutter camera, the path server 104 may determine the relative pose using methods such as the 5-Point Algorithm and the Random Sample Consensus (“RANSAC”) algorithm. In another example, where the imagery of the poses is obtained with a rosette of rolling shutter cameras, the path server 104 may identify one or more features from the imagery, such as the façade of a building, and track these features across the imagery of the poses of the candidate intersections. In tracking these features, the path server 104 may determine that the feature being tracked is consistent across the imagery in both appearance and geometry (e.g., given imagery of a building having four corners, a corner being tracked is the same corner, in both appearance and geometry, across the images associated with the poses of the candidate intersections).
Where the path server 104 is leveraging tracked features, the relative pose for a candidate intersection may be simplified to determining the rigid transform that represents the relative pose. In one embodiment, the path server 104 may use an SFM algorithm, such as the RANSAC algorithm, to determine the rigid transform representing the relative pose.
Accordingly, the feature matching instructions 216 may include instructions that apply different algorithms for determining the relative pose for candidate intersections depending on the imagery associated with the poses of the candidate intersections. Thus, regardless of type of images associated with the poses of the candidate intersections, the path server 104 may determine whether the poses of the candidate intersection are valid and, if so, a relative pose that relates the poses.
After relative poses are determined for valid candidate intersections, the path server 104 may determine a partitioning scheme by which to partition the paths to be optimized 222. As previously discussed, performing path optimization on a large scale, e.g., a city, county, state, country or planet-wide scale, may require significant computing resources (e.g., processor cycles, memory, network bandwidth, etc.). Accordingly, the instructions 210 of the path server 104 may include partitioning schema instructions 218 which, when executed by the processor 204, may cause the path server 104 to partition the paths to be optimized 222, e.g., paths 306-312, into partition cells 226 for optimization by the path optimizing clients 106-112. By partitioning the paths to be optimized into partition cells 226, the path server 104 reduces the computing resources needed to optimize the portions of one or more paths included within a given partition cell.
The partition schema may be based on one or more factors, which may include the number of path optimizing clients in communication with the path server 104, the size of the geographic area being partitioned, the number of paths to be optimized, the number of poses in the paths to be optimized, the computing power of the path optimizing clients, etc.
For example, the path optimizing server 104 may query, or be preprogrammed with, the computing capabilities (e.g., processor speed, available memory, network bandwidth, etc.) of each of the path optimizing clients 106-112. The server 104 may then partition the paths to be optimized 222 into partition cells, where the computing resources needed to optimize a set of paths of a given partition cell meet or do not exceed the computing resources for the path optimizing clients 106-112. In another embodiment, the path server 104 may be preprogrammed to partition the paths to be optimized into a predetermined number of partition cells.
In addition to applying a partition schema to the paths to be optimized 226, the path server 104 may also include the boundary identification instructions 220 for determining one or more boundary stabilizers, e.g., the boundary stabilizers 232 of
In this manner, the path server 104 may determine one or more boundary stabilizers 232 for the partition cells 226 to be optimized by the path optimizing clients 106-112.
The path server 104 may then assign one or more of the partition cells 226 to be optimized by the path optimizing clients 106-112. In one embodiment, the partition cells 226 may be optimized in a round-robin fashion, where each path optimizing client is sequentially assigned a partition cell to be optimized. In another embodiment, a group of partition cells may be assigned to a path optimizing client for optimization.
Although
Referring back to
The constraints for the optimization being performed by the path optimizing clients 106-112 may include the relative poses previously discussed (e.g., the rigid transforms that define the relative poses), sensor constraints, such as wheel odometry sensor readings or geolocation readings (e.g., GPS readings), natural/physical constraints, such as the direction of gravity, and other such constraints.
Accordingly, when the path server 104 sends the partition cells 226 to be optimized by an individual path optimizing clients 106-112, the path server 104 may send the paths 228, the constraints 230 for the optimization, and the boundary stabilizers 232.
The computer-readable memory 906 may store information accessible by the processor 904, such as instructions 910 and data 912 that may be executed or otherwise used by the processor 904 to determine whether the paths of a partition cell may be optimized and, if so, perform the optimization.
In general, the computer-readable memory 906 may be of any type of computer-readable memory operative to store information accessible by the processor 904, including a computer-readable medium, or other medium that stores data that may be read with the aid of an electronic device.
The communication interface 908 is configured to receive partition cells, e.g., partition cell 916, from the path server 104. The communication interface 908 may also be configured to send optimized paths of the partition cell 916 to the path server 104 or, when the path optimizing client 106 determines that there is no further optimization to perform, a message or other identifier indicating that no further optimization on the partition cell 916 was performed.
When the path optimizing client 106 receives the partition cell 916 to optimize, the processor 904 may execute one or more instructions 910 stored in the computer-readable memory 906, such as path optimization instructions 914. As previously discussed, the path optimizing client 106 may perform a non-linear optimization on the paths of the partition cell 916, where the constraints and boundary stabilizers of the partition cell 916 may preclude certain portions of the paths from being optimized. When the optimization is complete, the path optimizing client 106 may send the optimized paths to the path server 104.
Alternatively, the path optimizing client 106 may determine that no further optimization on the paths of the partition cell 916 is needed or required. In one embodiment, the path optimizing client 106 may send a message or other identifier to the path server 104 that no further optimization was performed, and/or that no further optimization on the paths of the partition cell 916 is needed or required. The path optimizing client 106 may also identify that certain portions of the paths of the partition cell 916, such as the paths identified by the boundary stabilizers, were excluded from the optimization process.
The path server 104 may maintain a record of which partition cells of the partitioning schema have been optimized. When the path server 104 determines that a predetermined number of partition cells, such as all of the partition cells or any other number of partition cells, have been optimized, the path server 104 may determine whether there are paths, or portions of paths, that have not yet been optimized.
In a first iteration of the optimization process, non-optimized portions of the paths may include those that were initially excluded by the boundary stabilizers. To ensure that these initially, non-optimized portions are, in fact, optimized, the path server 104 may re-determine a partitioning schema for the paths to be optimized. In one embodiment, re-determining the partitioning schema may include shifting the boundaries of the partition cells by a predetermined amount, such as a predetermined distance or a predetermined number of degrees in latitude and/or longitude, so that portions of paths that previously crossed or straddled a partition cell boundary are included within the partition cell.
The path server 104 may continuously perform this shifting and re-sending of partition cells until a predetermined termination condition is met, such as by receiving a message from one or more of the path optimizing clients 106-112 that no optimization on a received partition cell is possible. In another embodiment, the predetermined termination condition may include that a predetermined number of partition cells have been sent to the path optimizing clients 106-112 (e.g., all of the partition cells), and that each of the path optimizing clients 106-112 have communicated that no further optimization on a received partition cell is possible. In this manner, the iterative process of shifting and re-sending of partition cells ensures that no portion of a path to be optimized was not optimized because it crossed or straddled the boundary of a partition cell.
The path server 104 determines candidate intersections from poses of the received paths (Block 1106). In one embodiment, the path server 104 may determine the candidate intersections based on one or more intersection characteristics, which may include the distance between poses, the latitudinal proximity of poses, the longitudinal proximity of poses, and other such characteristics.
When candidate intersections have been identified, the path server 104 may then validate the candidate intersections by determining relative poses between poses of candidate intersections (Block 1108). In one embodiment, the relative poses may be determined using structure-from-motion methods. In addition, the determined relative poses may be one type of constraint used in the path optimization of the paths received by the path server 104. Other constraints may include the sensor readings (e.g., GPS coordinates, odometry readings, etc.) from the vehicle or other apparatus used to obtain the paths to be optimized.
After determining the relative poses for the candidate intersections, the path server 104 may then apply a partitioning schema to the paths to be optimized (Block 1110). In one embodiment, this results in one or more partition cells, where the partition cells may be defined by one or more pairs of latitude and longitude coordinates, a height parameter, a width parameter, and other such parameters. A partition cell may include one or more paths to be optimized by the path optimizing clients 106-112.
The path server 104 determines boundary stabilizers for the one or more partition cells obtained from the application of the partitioning schema. In one embodiment, the path server 104 may initially determine locations where the paths to be optimized cross or straddled partition cell boundaries (Block 1112). The path server 104 may perform this determination by comparing geolocation information of the poses of the paths to be optimized with the boundaries of a given partition cell. Where a path crosses or straddles a partition cell boundary, the path server 104 may establish a boundary stabilizer at the boundary location (Block 1114). In one embodiment, a boundary stabilizer is identified by a pair of latitude and longitude coordinates.
The path server 104 may then distribute the determined partition cells to the path optimizing clients 106-112 (Block 1116). In one embodiment, the distribution may be round-robin based, where the path server 104 distributes partition cells to available path optimizing clients 106-112. In another embodiment, partition cells may be assigned to a path optimizing client.
When a path optimizing client is finished with a partition cell, the path server 104 may receive the optimized paths of the partition cell (Block 1118). Alternatively, the path server 104 may receive a message from a path optimizing client that no optimization is needed on the given partition cell. The path server 104 may then determine whether there are portions of the paths of the partition cell that have yet to be optimized (Block 1120). The path server 104 may perform this determination based on the message it receives from the path optimizing clients 106-112 and/or by monitoring which paths of the partition cell are associated with boundary stabilizers.
Where the path server 104 determines that non-optimized portions of paths remain (Block 1122), the path server 104 may shift the boundaries of the partition cells and/or re-apply the partitioning schema to determine new partition cells (Block 1124). When the boundaries of the partition cells have been shifted, the path server 104 may then re-distribute the partition cells to the path optimizing clients 106-112. The path server 104 may iterate in this manner through the paths to be optimized until a predetermined number of paths have been optimized, such as all paths, a percentage of paths (e.g., 80% of all paths), an absolute number of paths (e.g., 15 out of 20 paths to be optimized), or some other number of paths.
When it is determined that the paths to be optimized have been optimized, the path server 104 may send a message or other identifier indicating that the path optimization is complete (Block 1126).
In this manner, the system may be configured to perform loop closing on a large-scale set of paths using structure-from-motion techniques. For example, as the path server 104 leverages a partitioning schema to distribute paths to be optimized to one or more path optimizing clients 106-112, the path server 104 reduces the amount of computing resources needed to optimize the paths. Moreover, as the path server 104 may be configured to account for the varying capabilities of the path optimizing clients 106-112, the path server 104 may distribute the paths to be optimized such that bottlenecks (i.e., an overextension in computing resources) in determining optimized paths are avoided. Furthermore, the disclosed system is scalable such that additional path optimizing clients may be added to the system when additional computing resources are needed. Thus, the disclosed system and methods present an advantage over current techniques for performing loop closing using structure-from-motion methods.
Although aspects of this disclosure have been described with reference to particular embodiments, it is to be understood that these embodiments are merely illustrative of the principles and applications of the present disclosure. It is therefore to be understood that numerous modifications may be made to the illustrative embodiments and that other arrangements may be devised without departing from the spirit and scope of this disclosure as defined by the appended claims. Furthermore, while certain operations and functions are shown in a specific order, they may be performed in a different order unless it is expressly stated otherwise.
Claims
1. A system for performing loop closing, the system comprising:
- a computer-readable memory that stores a plurality of paths to be improved, wherein each path of the plurality of paths comprises one or more poses having therewith associated an image and a geographic location of a mobile apparatus; and
- one or more processors in communication with the computer-readable memory, the one or more processors being configured to: identify a plurality of candidate intersections between poses of different paths of the plurality of paths based on the geographic locations of the one or more poses of each of the plurality of paths; determine, for each particular candidate intersection of the plurality of candidate intersections, a relative pose based on the images associated with the poses of the different paths of the particular candidate intersection; partitioning the plurality of paths to obtain one or more partition cells, each partition cell including the poses of at least one particular candidate intersection of the plurality of candidate intersections and the determined relative pose for the at least one particular candidate intersection, wherein each one of the plurality of paths is assigned to at least one of the one or more partition cells; send the one or more partition cells to one or more client devices for improving the paths assigned to the one or more partition cells; and receive, in response to sending the one or more partition cells, an improved path from the client device, the improved paths being based on at least one of the determined relative poses.
2. The apparatus of claim 1, wherein the processor is further configured to identify a given candidate intersection of the plurality of candidate intersections further based on the proximity of the geographic locations of the poses of the given candidate intersection.
3. The apparatus of claim 1, wherein the processor is further configured to determine a given determined relative poses for one particular candidate intersection of the plurality of candidate intersections by applying a structure-from-motion technique to the images of the poses of the one particular candidate intersection.
4. The apparatus of claim 1, wherein the processor is further configured to determine a boundary stabilizer for a portion of a given one of the paths assigned to the at least one partition cell, the boundary stabilizer identifying the portion of the path that is not to be improved by the client device.
5. The apparatus of claim 4, wherein the boundary stabilizer identifies a location where the portion of the given path crosses a boundary of the partition cell to which the given path is assigned.
6. The apparatus of claim 4, wherein the boundary stabilizer comprises a pair of geolocation coordinates.
7. The apparatus of claim 1, wherein the partitioning schema is based on computing resources available to the one or more client devices.
8. The apparatus of claim 1, wherein the processor is further configured to re-apply the partitioning schema to the plurality of paths to be improved based on whether non-improved portions of the paths assigned to the one or more partition cells remain after improvement by the one or more client devices.
9. The apparatus of claim 1, wherein the processor is further configured to re-determine the boundaries of the one or more partition cells based on whether non-improved portions of the paths assigned to the one or more partition cells remain after improvement by the one or more client devices.
10. The apparatus of claim 1, wherein the processor is further configured to iteratively apply the partitioning schema to the plurality of paths to obtain the one or more partition cells and to send the one or more partition cells to the one or more client devices for improving the paths assigned to the one or more partition cells until a predetermined termination condition is met.
11. A method for performing loop closing, the method comprising:
- identifying, by one or more processors, a plurality of candidate intersections between poses of different paths of a plurality of paths to be improved, each of the one or more poses having associated therewith an image and a geographic location, and identifying the plurality of candidate intersections is based on the geographic locations of the one or more poses of the plurality of paths;
- determining, by the one or more processors, for each particular candidate intersection of the plurality of candidate intersections, a relative pose based on the images associated with the poses of the different paths of the particular candidate intersections;
- partitioning, by the one or more processors, the plurality of paths to obtain one or more partition cells, each partition cell included in the poses of at least one particular candidate intersection of the plurality of candidate intersections and the determined relative pose for the at least one particular candidate intersection, wherein each of the plurality of paths is assigned to at least one of the one or more partition cells;
- sending, by the one or more processors, the one or more partition cells to a client device for improving the paths assigned to the one or more partition cells; and
- receiving, by the one or more processors, in response to sending the one or more partition cells, improved paths from the client device, at least one of the improved paths being based on at least one determined relative pose.
12. The method of claim 11, wherein:
- identifying the plurality of candidate intersections comprises identifying a given candidate intersection of the plurality of candidate intersections further based on the proximity of the geographic locations of the poses of the given candidate intersection.
13. The method of claim 11, wherein:
- determining a given relative pose for one particular candidate intersection of the plurality of candidate intersections includes applying a structure-from-motion technique to the images of the poses of the one particular candidate intersection.
14. The method of claim 11, further comprising:
- determining, with the processor, a boundary stabilizer for a portion of a path of the paths assigned to the at least one partition cell, the boundary stabilizer identifying a portion of the path that is not to be optimized by the client device.
15. The method of claim 14, wherein the boundary stabilizer identifies a location where the portion of the path crosses a boundary of the partition cell to which the given path is assigned.
16. The method of claim 14, wherein the boundary stabilizer comprises a pair of geolocation coordinates.
17. The method of claim 11, wherein the partitioning schema of the plurality of paths is based on computing resources available to the client device.
18. The method of claim 11, further comprising:
- re-applying, with the processor, the partitioning schema to the plurality of paths to be optimized based on whether non-optimized portions of the paths assigned to the one or more partition cells remain after optimization by the client device.
19. The method of claim 11, further comprising:
- re-determining, with the processor, the boundaries of the one or more partition cells based on whether non-optimized portions of the paths assigned to the one or more partition cells remain after optimization by the client device.
20. The method of claim 11, further comprising:
- iteratively performing, with the processor, the applying of the partitioning schema to the plurality of paths to obtain the one or more partition cells and the sending of the one or more partition cells to a client device for optimizing the paths assigned to the one or more partition cells until a predetermined termination condition is met.
6907336 | June 14, 2005 | Gray et al. |
8259994 | September 4, 2012 | Anguelov et al. |
20060013437 | January 19, 2006 | Nister et al. |
20090138151 | May 28, 2009 | Smid et al. |
20090157301 | June 18, 2009 | Tien et al. |
20090240481 | September 24, 2009 | Durrant-Whyte et al. |
20100088146 | April 8, 2010 | Zhong et al. |
20110026910 | February 3, 2011 | Liang et al. |
20120045091 | February 23, 2012 | Kaganovich |
20130116921 | May 9, 2013 | Kasargod et al. |
20130325334 | December 5, 2013 | Mian et al. |
20130332065 | December 12, 2013 | Hakim et al. |
Type: Grant
Filed: Dec 5, 2012
Date of Patent: Sep 2, 2014
Assignee: Google Inc. (Mountain View, CA)
Inventors: David R. Martin (Oakland, CA), Bryan M. Klingner (Austin, TX)
Primary Examiner: Mary Cheung
Assistant Examiner: Rodney Butler
Application Number: 13/705,376
International Classification: G01C 21/34 (20060101);