SHARING NEIGHBORING MAP DATA ACROSS DEVICES
A computing device and method are provided for transmitting a relevant subset of map data, called a neighborhood, to enable mutual spatial understanding by multiple display devices around a target virtual location to display a shared hologram in the same exact location in the physical environment at the same moment in time. The computing device may comprise a processor, a memory operatively coupled to the processor, and an anchor transfer program stored in the memory and executed by the processor.
Latest Microsoft Patents:
- MEMS-based Imaging Devices
- CLUSTER-WIDE ROOT SECRET KEY FOR DISTRIBUTED NODE CLUSTERS
- FULL MOTION VIDEO (FMV) ROUTING IN ONE-WAY TRANSFER SYSTEMS USING MODIFIED ELEMENTARY STREAMS
- CONTEXT-ENHANCED ADVANCED FEEDBACK FOR DRAFT MESSAGES
- UNIVERSAL SEARCH INDEXER FOR ENTERPRISE WEBSITES AND CLOUD ACCESSIBLE WEBSITES
This application is a continuation from U.S. patent application Ser. No. 17/806,126, filed Jun. 9, 2022, which is a continuation from U.S. patent application Ser. No. 16/600,340 filed Oct. 11, 2019, now granted as U.S. Pat. No. 11,360,731, which is a continuation from U.S. patent application Ser. No. 15/593,147 filed May 11, 2017, now granted as U.S. Pat. No. 10,466,953, which claims priority to U.S. Provisional Patent Application No. 62/479,200 filed Mar. 30, 2017, the entirety of each of which are hereby incorporated herein by reference.
BACKGROUND6-DoF tracking, also known as six degrees of freedom tracking, is a method by which a device (e.g. mixed-reality head-mounted device (HMD), robot, smartphone, etc.) uses sensors (e.g. cameras, inertial measurement units, etc.) to determine its position relative to its surrounding physical environment. For example, a mixed-reality HMD or smartphone can use this positional understanding to place holograms or digital content so as to appear to be world-locked to a position in the physical world, and a robot can use this positional understanding to navigate itself relative to its surroundings. Recently, scenarios have arisen in which it is useful to have two or more such devices operating with a common understanding of their positions relative to a physical environment, and thus relative to each other. As discussed in detail below, there are several general approaches to developing this common understanding of positions between such devices, each with significant challenges recognized by the inventors.
SUMMARYTo address these issues, a computing device and method are provided for transmitting a relevant subset of map data, called a neighborhood, to enable mutual spatial understanding by multiple display devices around a target virtual location to display a shared hologram in the same exact location in the physical environment at the same moment in time. The computing device may comprise a processor, a memory operatively coupled to the processor, and an anchor transfer program stored in the memory and executed by the processor.
The anchor transfer program may be configured to receive a request to transfer anchor data of a target virtual place-located anchor at a target virtual location from a first display device to a second display device. Subsequently, the program may be configured to retrieve and transmit to the first display device neighboring map data corresponding to the target virtual location. Subsequent to the first display device transferring the anchor data and the neighboring map data to the second display device, the second display device incorporates the neighboring map data into existing map data of the second display device, and the second display displays the one or more holograms to a second user at the target virtual place-located anchor at the target virtual location from the second user's vantage point based on the incorporated map data and the existing map data of the second display device. The neighboring map data is map data of a neighborhood around the target virtual location.
This Summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used to limit the scope of the claimed subject matter. Furthermore, the claimed subject matter is not limited to implementations that solve any or all disadvantages noted in any part of this disclosure.
As discussed briefly above, of increasing value is the ability for two or more devices that use 6-DoF tracking to operate with a common understanding of their positions relative to the physical environment, and thus relative to each other. With this common spatial understanding, devices can interact with each other and the physical environment, enabling scenarios such as the sharing of holograms positioned in the world, or robots working together. The inventors have recognized that current approaches to enable two or more devices to align their coordinates in three-dimensional space have associated drawbacks that pose barriers to their adoption.
For example, a first approach to tracking device position is known as outside-in tracking. Outside-in tracking uses sensors external to the device in the physical environment to track or help track device position. While this approach can determine position of two or more devices, it requires positioning sensors external to the device, which can be inconvenient or impractical in some situations.
A second approach to tracking device position is known as inside-out tracking. Inside-out tracking is a method by which a display device can determine its position without the help of external sensors in the environment. While this approach has the advantage of not requiring external sensors to track position, the position data is determined within each device. Therefore, a challenge remains to coordinate alignment of the devices relative to each other.
One conventional method of attaining 6-DoF coordinate alignment for two or more inside-out tracking devices is to use identified posters or markers as well-understood mutual points of reference in the physical environment, so that each device establishes its position in relation to these. However, one limitation associated with this poster method is that the physical environment needs to be changed with each additional poster, which adds to cost and setup time, and decreases spontaneity. Moreover, if the posters are moved or changed, any common spatial understanding between the devices must change with respect to the posters, thereby reducing reliability. This method also limits the physical scope of the shared coordinates, as the devices need to be within operating range of the posters in the physical environment.
Another conventional method is for devices to independently generate maps that are transmitted to a central server and stitched together to form a common map, which is then retransmitted back to each respective device.
However, this approach has the drawback that it can result in slow transmission times, as large amounts of data transfer would be required over the network to transmit the map from each device to the server, and then again to transmit the common (stitched) map from the server to each of the devices. Moreover, the devices that load the common map may experience unwanted loss of local map data, since previously existing map data within each destination device may be overwritten, removed, or deleted in the course of sharing the common map with all the destination devices, and since the common map may not retain with 100% fidelity the map data from each contributing device.
Display 20 is configured to be at least partially see-through, and includes right and left display regions 20A, 20B which are configured to display different images to each eye of the user. By controlling the images displayed on these right and left display regions 20A, 20B, a hologram 50 may be displayed in a manner so as to appear to the eyes of the user to be positioned at a distance from the user within the physical environment 9. As used herein, a hologram is an image formed by displaying left and right images on respective left and right near-eye displays that appears due to stereoscopic effects to be positioned at a distance from the user. Typically, holograms are anchored to the map of the physical environment by virtual anchors 56, which are placed within the map according to their coordinates. These anchors are world-locked, and the holograms are configured to be displayed in a location that is computed relative to the anchor. The anchors may be placed in any location, but are often placed in positions at locations where features exist that are recognizable via machine vision techniques. Typically, the holograms are positioned within a predetermined distance from the anchors, such as within 3 meters in one particular example.
In the configuration illustrated in
In addition to visible light cameras 18, a depth camera 21 may be provided that uses an active non-visible light illuminator 23 and non-visible light sensor 22 to emit light in a phased or gated manner and estimate depth using time-of-flight techniques, or to emit light in structured patterns and estimate depth using structured light techniques.
Computing device 10 also typically includes a six degree of freedom inertial motion unit 19 that includes accelerometers, gyroscopes, and possibly magnometers configured to measure the position of the computing device in six degrees of freedom, namely x, y, z, pitch, roll and yaw.
Data captured by the visible light cameras 18, the depth camera 21, and the inertial motion unit 19 can be used to perform simultaneous location and mapping (SLAM) within the physical environment 9, to thereby produce a map of the physical environment including a mesh of reconstructed surfaces, and to locate the computing device 10 within the map of the physical environment 9. The location of the computing device 10 is computed in six degrees of freedom, which is important to displaying world-locked holograms 50 on the at least partially see through display 20. Without an accurate identification of the position and orientation of the computing device 10, holograms 50 that are displayed on the display 20 may appear to slightly move or vibrate relative to the physical environment, when they should remain in place, in a world-locked position. This data is also useful in relocating the computing device 10 when it is turned on, a process which involves ascertaining its position within the map of the physical environment, and loading in appropriate data from non-volatile memory to volatile memory to display holograms 50 located within the physical environment.
The IMU 19 measures the position and orientation of the computing device 10 in six degrees of freedom, and also measures the accelerations and rotational velocities. These values can be recorded as a pose graph to aid in tracking the display device 10. Accordingly, even when there are few visual cues to enable visual tracking, in poorly lighted areas or texture-less environments for example, accelerometers and gyroscopes can still enable spatial tracking by the display device 10 in the absence of visual tracking. Other components in the display device 10 may include and are not limited to speakers, microphones, gravity sensors, Wi-Fi sensors, temperature sensors, touch sensors, biometric sensors, other image sensors, eye-gaze detection systems, energy-storage components (e.g. battery), a communication facility, etc.
Feature descriptors 11A that describe features such as edges, corners, and other patterns that are detectable through image processing techniques are prestored in a feature library 11 in non-volatile storage device 16. Particular observations of features matching feature descriptors 11A (i.e., feature instances that are detected in a region of interest of an image) may or may not have anchors 56 collocated with them (generally most will not, but some may). The location of each anchors is typically determined by users of the system, but may be programmatically determined as well. In real time, images 18A and depth images 21A are respectively captured by cameras 18 and depth camera 21, and processed by a feature matching engine 13 executed by processor 12 to detect whether features matching the prestored feature descriptors 11A are present in the captured images 18A, 21A by looking for regions in the captured images that match the feature descriptors 11A. For each detected feature, the location (e.g., coordinate area) and type of the feature are stored as observation data 17 associated with each frame. It will be appreciated that dozens or hundreds of such features matching the features descriptors 11A may be recognized in an image, and the collection of these observations 17 of features may be referred to informally as a pointcloud of detected features in the image. Further, for at least selected detected features in the image, a patch 15 from the image is taken surrounding the selected detected features and stored in memory for later recall. This patch 15 is typically a two-dimensional array of pixels or voxels from the region of the captured image, and can be used in future localization steps when the computing device 10 captures images of the selected detected features from another angle, by performing perspective correction on the patch to determine whether (and where) the selected detected features in the perspective corrected patch are present in the subsequent image. The features matching feature descriptors 11A, observations 17, and patches 15 for each frame are collectively referred to as feature matching data 13A. The feature matching data 13A typically does not include the depth image 21A or RGB image data 18A. The feature matching data 13A may be stored in non-volatile or volatile memory for certain of the frames, referred to as keyframes, as discussed below. Together, the pose graph 80, feature matching data 13A, surface reconstruction data 82, and keyframes 84 linked by pose graph 80 may collectively be referred to as map data 86. As the computing device 10 moves throughout the physical environment 9, it maps the environment and stores its aggregated knowledge of the environment as map data 86. As will be discussed below, sharing of a portion of this aggregated map data with another device, either directly or through intermediary devices such as a server, can enable other devices to more quickly and accurately localize themselves within the physical environment, saving time and processing power for the other devices.
The processor 12 may use simultaneous localization and mapping (SLAM) techniques, discussed above, based on sensor suite inputs include the image data 18A, depth image data 21A, odeometry data 19A, and GPS data 25A to generate pose graph 80, feature matching data 13A, and surface reconstruction data 82. The pose graph 80 is a directed graph with nodes that are a series of updated poses 33 detected over time. A pose is typically a unit vector with an origin at a predetermined location (x, y, and z) and extending in a predetermined orientation (pitch, yaw, and roll) in the physical space, and is calculated as described in relation to
The processor 12 may generate and store in memory key frame data which includes a plurality of key frames 84. Each key frame 84 includes one pose of the pose graph 80, and thus the key frames 84 are linked by the pose graph 80. Each key frame 84 further includes the feature matching data 13A, which includes one or more (and typically multiple) observations 17, features matching feature descriptors 11A, and associated patch 15 for that frame. The key frame data may further include metadata, which may for example include GPS data 25A, odeometry data 19A, hardware data (e.g., camera lens type), ambient temperature, etc. applicable for the frame. The key frames 84 may be generated at a periodic interval within the series of successive frames, such as every other frame, or every 10th frame, etc. Alternatively, key frames 84 may be generated at a predetermined spatial interval as the computing device 10 moves through the physical environment 9, such as every 1 or 2 meters.
The server computing device 200 may include an anchor program 214 and an anchor transfer program 216 that may be stored in mass storage 218 of the computing device. The anchor program 214 and anchor transfer program 216 may be loaded into memory 220 and executed by a processor 260 of the server computing device 200 to perform one or more of the methods and processes for generating a virtual place-located anchor 56 and transferring anchor data 54 and neighboring map data 58 from a first display device 30 to a second display device 34, as described in more detail below. The server computing device 200 may be configured with a wireless transceiver 230 that wirelessly communicates with the first display device 30 and the second display device 34 to receive instructions 52 and transfer requests 48 from the display devices and transmits map data to the display devices. It will be appreciated that neighboring map data 58 is map data of the neighborhood around the virtual location where the virtual place-located anchor 56 is situated. The type of map data applied in the present disclosure is not particularly limited, and will be understood to be any set of data that correlates points in the three-dimensional coordinate space in the physical environment to information that help orient and locate the display device in the three-dimensional space. One possible embodiment of this map data is described in more detail below with respect to
The server computing device 200 may be communicatively coupled to one or more other devices via a wired connection or a wireless connection to a network. In some examples, the network may take the form of a local area network (LAN), wide area network (WAN), wired network, wireless network, personal area network, or a combination thereof, and may include the Internet. In the example of
The processors of the first display device 30 and the second display device 34 execute a common anchor transfer program 38. The first display device 30 stores into local memory a local map data 32, while the second display device 34 stores into local memory a local map data 36. The local map data may include the recorded rotational and translational motions of the display devices tracked by the visual sensors and/or inertial measurement sensors in the display devices. The anchor transfer program 38 is executed in one of two modes: an export anchor mode and an import anchor mode. In
With reference to
As described in more detail below, the first display device 30 and second display device 34 also may include program logic of an anchor transfer program 38 configured to send and receive map data of the dining room 306. In this example, a hologram 50 is projected on a table 308 using a target anchor 56 that is on a picture 310. Another neighboring anchor 64 for another hologram is located in a clock 312 that is in the vicinity of the picture 310. The first user 302 and the second user 304 are roaming about the room 306 as they operate the first display device 30 and the second display device 34, respectively, to view the hologram 50 from various angles in the room 306 from their respective vantage points. As the users roam about the room 306, the sensors 18 within the first display device 30 and the second display device 34 capture visual and/or inertial tracking data and thereby track the rotational and translational motion of the display devices through the sensor devices 18, which observe the three-dimensional rotation and translation of the sensor device 18 to be recorded as poses 62A-G and keyframes 60A-G, which are subsequently stored as local map data 32 and local map data 36 in the first display device 30 and the second display device 34, respectively. To help each display device orient itself within the room, the display devices may be configured to observe corners in their environments, such as the four corners of each of the two depicted windows, and may use the position of these detected corners of the windows to correct their estimated position, using the predictive corrective algorithm of
In the example of
Alternatively, the first display device 30 may programmatically generate an instruction 52 for a virtual place-located anchor 56 at a world-locked virtual location. For example, the first display device 30 may use sensor data to programmatically identify the picture 310 in the dining room 306 as a virtual location to generate a virtual place-located anchor. In response to identifying the picture 310, the first display device 30 may programmatically transmit an instruction 52 to computing device 200 to generate a virtual place-located anchor 56 at a world-locked virtual location corresponding to a corner of the picture 310.
The first user 302 may desire to share the hologram 50 with a second user 304 who is using a second display device 34. In such a case, the first user 302 sends a transfer request 48 to the computing device 200 to transfer neighboring map data 58 corresponding to a target anchor 56 to a second display device 34. In certain embodiments, an original transfer request 47 from the second display device 34 to the first display device 30 may cause the first display device 30 to send the transfer request 48. The computing device 200 receives the request 48 to transfer neighboring map data 58 corresponding to the target anchor 56. The computing device 200 retrieves, serializes, and sends the neighboring map data 58 to the first display device 30 as serialized neighboring map data 66. The serialization process encodes the neighboring map data 58 into a proprietary format that only the computing device 200 can deserialize into a format that is readable to the display devices executing the anchor transfer program 38. The serialized neighboring map data 66 may be packaged in the form of a binary file or data stream. The computing device 200 may apply program logic of the anchor transfer program 216 to define a neighborhood of the anchor 56 to appropriately determine the neighboring map data 58 that corresponds to the anchor 56. This neighborhood may include poses 62A-G, keyframes 60A-D, a neighboring anchor 64 in the vicinity of the target anchor 56. The neighboring map data 58 is further illustrated and described in
The first display device 30 receives the serialized neighboring map data 66, then sends the anchor data 54 and serialized neighboring map data 66 to the second display device 34. The second display device 34 receives the anchor data 54 and serialized neighboring map data 66, then sends the serialized neighboring map data 66 to the computing device 200. The computing device 200 receives the serialized neighboring map data 66, then deserializes the neighboring map data 58 and sends the deserialized neighboring map data 68 back to the second display device 34. The second display device 34 receives the deserialized neighboring map data 68, and then stitches the deserialized neighboring map data 68 into the local map data 36 stored in the second display device 34 to create an integrated map data. The second display device 34 uses the integrated map data to render the hologram 50 image on the display 35 of the second display device 34. The stitching process may involve combining or merging two sets of map data into one set of map data by applying a common global coordinate system corresponding to the three-dimensional coordinate space of the physical environment.
Turning to
The robots 30 and 34 may generate a virtual model of the warehouse 406 using a three-dimensional coordinate space overlaid upon the real world warehouse 406. In this example, included in this virtual model is a hologram 50 that is anchored by a target anchor 56 that is on a rack 410. Another neighboring anchor 64 for another hologram is located on a pillar 412. With respect to the robots 30 and 34, the hologram 50 may not necessarily be embodied as images to be displayed by the robots, but may instead be embodied as virtual objects and/or virtual spaces that multiple robots can render and interact with during operation. For example, the hologram 50 may be embodied as a predetermined destination space to which a robot may be instructed to move, a predetermined work space where a robot may interact with objects, or alternatively as a predetermined prohibited space that a robot may be instructed to avoid.
When the first robot 30 shares the hologram 50 and neighboring map data with the second robot 34, the second robot 34 receives the deserialized neighboring map data and stitches the deserialized neighboring map data into the local map data 36 stored in the second robot 34 to create an integrated map data. The second robot 34 uses the integrated map data to incorporate the hologram 50 into the virtual model of the warehouse 406 that is stored in the second robot 34, so that the hologram 50 and neighboring map data of the target anchor 56 are shared in the virtual models of both robots 30 and 34. Accordingly, the autonomous robots 30 and 34 can align movements with one another and to work together as autonomous systems in aligned coordinate space based on shared map data, among which is the neighboring map data of the target anchor 56. These aligned movements may help avoid collisions with one another and with obstacles within the warehouse 406, as well as aid in performing operations that require the coordination and cooperation of multiple robots, especially in the fields of manufacturing and logistics. For example, the hologram 50 may represent a predetermined space for the first robot 30 to drop off an object, which is later picked up by the second robot 34. In another example, the hologram 50 may represent a predetermined space that the first robot 30 has already cleaned, and the first robot 30 may share the hologram 50 and neighboring map data to the second robot 34 to instruct the second robot 34 to avoid cleaning the predetermined space that the first robot 30 has already cleaned.
Turning to
Keyframes 60A-Q contain sets of information that can be used to improve the ability of the display device to ascertain its location, and thus help render holograms in stable locations. As discussed above, examples of data included in keyframes 60A-Q include metadata, observations and patches, and/or image feature descriptors. Metadata may include the extrinsic data of the camera, the time when keyframe was taken, gravity data, temperature data, magnetic data, calibration data, global positioning data, etc. Observations and patches may provide information regarding detected feature points in a captured image, such as corners and high contrast color changes that help correct the estimation of the position and orientation of the display device, and accordingly help better align and position the display of a holographic image via display 20 in three-dimensional space. Image feature descriptors may be feature points, sometimes efficiently represented in a small data set, in some examples as small as 32 bytes, that are used the feature matching engine 13 described above to quickly recognize features in the real time captured images 18A and depth images 21A, to accurately estimate the position of the display device, and thus accurately render the hologram on the map of the physical environment. In yet other examples, keyframes 60A-Q may include Wi-Fi fingerprints representing properties of Wi-Fi beacons emitted by surrounding Wi-Fi access points, such as MAC addresses, SSIDs, and measured signal strengths.
Pose graphs 80A, 80B interlinking the keyframes may be a plurality of continuously connected poses communicating how much rotation and translation in three-dimensional space the display device undergoes between keyframes over time. Multiple anchors may be interlinked with each other via pose 62A-T. It will be appreciated that, the geometric relationship between a display device and a hologram for a given keyframe may be computed by first computing the distance between the current pose of the device and the anchor associated with the hologram, and then computing the distance between the anchor and the hologram itself.
When there is a predetermined virtual place-based anchor 56 that is known to the first display device 30 but unknown to the second display device 34, and the first display device 30 wishes to share this virtual place-based anchor 56 with the second display device 34, the server computing device 200 merely packages neighboring map data 58 surrounding the target virtual place-based anchor 56, packaging merely a subset of the anchors 56 and 64, poses 62A-T, and keyframes 60A-Q in the entire room which corresponds to a neighborhood. In the example depicted in
Referring to
With reference to
With reference to
It will be appreciated that methods 600 and 700 are provided by way of example and is not meant to be limiting. Therefore, it is to be understood that methods 600 and 700 may include additional and/or alternative steps relative to those illustrated in
In summary, when establishing a shared coordinate is desired (e.g. to position a shared hologram in the three-dimensional physical environment that is properly viewed by two users at the same time from the different vantage points of the two users in the same location), a location of interest (e.g. virtual place-based anchor) is specified by the first display device, and neighboring map data around the virtual place-based anchor is constructed with most relevant map information. This neighborhood of relevant data is then packaged (exported) and shared with a second display device, and added (imported) to the local map of the second display device. Then as 6-DoF tracking continues to run on the second display device to leave behind a trajectory of linear map data that is stored as local map data, as the newly shared neighboring map data is observed by the sensors of the first display device in the real world, the shared neighboring map data is stitched into the existing local map of the second device, creating an integrated map with the newly shared location (virtual place-based anchor) relative to preexisting locations (anchors).
In some embodiments, the methods and processes described herein may be tied to a computing system of one or more computing devices. In particular, such methods and processes may be implemented as a computer-application program or service, an application-programming interface (API), a library, and/or other computer-program product.
Computing system 900 includes a logic processor 902 volatile memory 904, and a non-volatile storage device 906. Computing system 900 may optionally include a display subsystem 908, input subsystem 910, communication subsystem 912, and/or other components not shown in
Logic processor 902 includes one or more physical devices configured to execute instructions. For example, the logic processor may be configured to execute instructions that are part of one or more applications, programs, routines, libraries, objects, components, data structures, or other logical constructs. Such instructions may be implemented to perform a task, implement a data type, transform the state of one or more components, achieve a technical effect, or otherwise arrive at a desired result.
The logic processor may include one or more physical processors (hardware) configured to execute software instructions. Additionally or alternatively, the logic processor may include one or more hardware logic circuits or firmware devices configured to execute hardware-implemented logic or firmware instructions. Processors of the logic processor 902 may be single-core or multi-core, and the instructions executed thereon may be configured for sequential, parallel, and/or distributed processing. Individual components of the logic processor optionally may be distributed among two or more separate devices, which may be remotely located and/or configured for coordinated processing. Aspects of the logic processor may be virtualized and executed by remotely accessible, networked computing devices configured in a cloud-computing configuration. In such a case, these virtualized aspects are run on different physical logic processors of various different machines, it will be understood.
Non-volatile storage device 906 includes one or more physical devices configured to hold instructions executable by the logic processors to implement the methods and processes described herein. When such methods and processes are implemented, the state of non-volatile storage device 906 may be transformed—e.g., to hold different data.
Non-volatile storage device 906 may include physical devices that are removable and/or built-in. Non-volatile storage device 906 may include optical memory (e.g., CD, DVD, HD-DVD, Blu-Ray Disc, etc.), semiconductor memory (e.g., ROM, EPROM, EEPROM, FLASH memory, etc.), and/or magnetic memory (e.g., hard-disk drive, floppy-disk drive, tape drive, MRAM, etc.), or other mass storage device technology. Non-volatile storage device 906 may include nonvolatile, dynamic, static, read/write, read-only, sequential-access, location-addressable, file-addressable, and/or content-addressable devices. It will be appreciated that non-volatile storage device 906 is configured to hold instructions even when power is cut to the non-volatile storage device 906.
Volatile memory 904 may include physical devices that include random access memory. Volatile memory 904 is typically utilized by logic processor 902 to temporarily store information during processing of software instructions. It will be appreciated that volatile memory 904 typically does not continue to store instructions when power is cut to the volatile memory 904.
Aspects of logic processor 902, volatile memory 904, and non-volatile storage device 906 may be integrated together into one or more hardware-logic components. Such hardware-logic components may include field-programmable gate arrays (FP GAs), program- and application-specific integrated circuits (PASIC/ASICs), program- and application-specific standard products (PSSP/ASSPs), system-on-a-chip (SOC), and complex programmable logic devices (CPLDs), for example.
The terms “module,” “program,” and “engine” may be used to describe an aspect of computing system 900 typically implemented in software by a processor to perform a particular function using portions of volatile memory, which function involves transformative processing that specially configures the processor to perform the function. Thus, a module, program, or engine may be instantiated via logic processor 902 executing instructions held by non-volatile storage device 906, using portions of volatile memory 904. It will be understood that different modules, programs, and/or engines may be instantiated from the same application, service, code block, object, library, routine, API, function, etc. Likewise, the same module, program, and/or engine may be instantiated by different applications, services, code blocks, objects, routines, APIs, functions, etc. The terms “module,” “program,” and “engine” may encompass individual or groups of executable files, data files, libraries, drivers, scripts, database records, etc.
When included, display subsystem 908 may be used to present a visual representation of data held by non-volatile storage device 906. The visual representation may take the form of a graphical user interface (GUI). As the herein described methods and processes change the data held by the non-volatile storage device, and thus transform the state of the non-volatile storage device, the state of display subsystem 908 may likewise be transformed to visually represent changes in the underlying data. Display subsystem 908 may include one or more display devices utilizing virtually any type of technology. Such display devices may be combined with logic processor 902, volatile memory 904, and/or non-volatile storage device 906 in a shared enclosure, or such display devices may be peripheral display devices.
When included, input subsystem 910 may comprise or interface with one or more user-input devices such as a keyboard, mouse, touch screen, or game controller. In some embodiments, the input subsystem may comprise or interface with selected natural user input (NUI) componentry. Such componentry may be integrated or peripheral, and the transduction and/or processing of input actions may be handled on- or off-board. Example NUI componentry may include a microphone for speech and/or voice recognition; an infrared, color, stereoscopic, and/or depth camera for machine vision and/or gesture recognition; a head tracker, eye tracker, accelerometer, and/or gyroscope for motion detection and/or intent recognition; as well as electric-field sensing componentry for assessing brain activity; and/or any other suitable sensor.
When included, communication subsystem 912 may be configured to communicatively couple various computing devices described herein with each other, and with other devices. Communication subsystem 912 may include wired and/or wireless communication devices compatible with one or more different communication protocols. As non-limiting examples, the communication subsystem may be configured for communication via a wireless telephone network, or a wired or wireless local- or wide-area network, such as Bluetooth and HDMI over Wi-Fi connection. In some embodiments, the communication subsystem may allow computing system 900 to send and/or receive messages to and/or from other devices via a network such as the Internet.
The following paragraphs provide additional support for the claims of the subject application. One aspect provides a server computing device comprising a processor, a non-volatile storage device operatively coupled to the processor, and an anchor transfer program stored in the non-volatile storage device and executed by the processor of the computing device, the anchor transfer program being configured to receive a request to transfer anchor data of a target virtual place-located anchor at a target virtual location from a first display device to a second display device; retrieve and transmit to the second display device neighboring map data corresponding to the target virtual location, the neighboring map data being map data of a neighborhood in a vicinity around the target virtual location; cause the second display device to incorporate the neighboring map data into existing map data of the second display device; and cause the second display to display the one or more holograms at the target virtual place-located anchor at the target virtual location from the second user's vantage point based on the incorporated map data and the existing map data of the second display device. In this aspect, additionally or alternatively, the computing device may further comprise an anchor program stored in the memory and executed by the processor of the computing device, the anchor transfer program being configured to receive an instruction from the first display device to generate the target virtual place-located anchor at the target virtual location; generate the target anchor at the world-locked target virtual location; and send the target anchor to the first display device. In this aspect, additionally or alternatively, the anchor transfer program may be configured to transmit to the first display device neighboring map data in a serialized format; and the second display device may receive the neighboring map data in a deserialized format. In this aspect, additionally or alternatively, the target virtual location may be world-locked to a position that is fixed in a three-dimensional coordinate space overlaid upon a real world three-dimensional environment. In this aspect, additionally or alternatively, the target virtual location may be world-locked to a position relative to an object in a real world three-dimensional environment. In this aspect, additionally or alternatively, the neighboring map data may comprise keyframes and at least a portion of a pose-graph describing rotational and translational motion of the display devices through a real world three-dimensional environment. In this aspect, additionally or alternatively, the computing device may further comprise visual sensors and/or inertial measurement sensors, the visual sensors and/or inertial measurement sensors tracking the rotational and translational motion of the display devices for the keyframes and pose-graphs. In this aspect, additionally or alternatively, the keyframes may comprise at least one of a fingerprint of a Wi-Fi beacon, gravity data, temperature data, global positioning data, and calibration data.
Another aspect provides a method comprising receiving a request to transfer anchor data of a target virtual place-located anchor at a target virtual location from a first display device to a second display device; retrieving and transmitting map data to the first display device neighboring map data corresponding to the target virtual location; and subsequent to the first display device transferring anchor data and the neighboring map data to the second display device, causing the second display device to incorporate the neighboring map data into existing map data of the second display device, and causing the second display to display the one or more holograms to a second user at the target virtual place-located anchor at the target virtual location from the second user's vantage point based on the incorporated map data and the existing map data of the second display device, the neighboring map data being map data of a neighborhood around the target virtual location. In this aspect, additionally or alternatively, the method may further comprise receiving an instruction from the first display device to generate the target virtual place-located anchor at the target virtual location; and generating the target anchor at the world-locked target virtual location and sending the target anchor to the first display device. In this aspect, additionally or alternatively, the neighboring map data may be transmitted to the first display device in a serialized format, and the second display device may receive the neighboring map data in a deserialized format. In this aspect, additionally or alternatively, the target virtual location may be world-locked to a position that is fixed in a three-dimensional coordinate space overlaid upon a real world three-dimensional environment. In this aspect, additionally or alternatively, the target virtual location may be world-locked to a position relative to an object in a real world three-dimensional environment. In this aspect, additionally or alternatively, the neighboring map data may comprise keyframes and pose-graphs recording rotational and translational motion of the display devices through a real world three-dimensional environment. In this aspect, additionally or alternatively, visual sensors and/or inertial measurement sensors may track the rotational and translational motion of the display devices for the keyframes and pose-graphs. In this aspect, additionally or alternatively, the keyframes may comprise at least one of a fingerprint of a Wi-Fi beacon, gravity data, temperature data, global positioning data, and calibration data.
Another aspect provides a first computing device operated by a first user, networked with a second computing device and a third computing device, the first computing device comprising a processor; a memory operatively coupled to the processor and storing local map data of the first computing device; a first display operatively coupled to the memory and the processor; and an anchor transfer program stored in the memory and executed by the processor to be configured to receive first anchor data causing the first display to display one or more holograms to the first user at a first virtual place-located anchor at a first target virtual location from the first user's vantage point, and further configured to execute an export anchor mode; in the export anchor mode, the anchor transfer program being configured to receive first neighboring map data of a neighborhood around the first target virtual location; and send the first anchor data and the first neighboring map data to a second computing device, operated by a second user, thereby causing the second display to display the one or more holograms at the first virtual place-located anchor at the first target virtual location from the second user's vantage point based on the first neighboring map data and the local map data of the second computing device. In this aspect, additionally or alternatively, the anchor transfer program may further be configured to execute an import anchor mode; in the import anchor mode, the anchor transfer program being configured to receive, from a third computing device, second neighboring map data of a neighborhood around a second target virtual location; integrate the second neighboring map data into the local map data of the first computing device to create an integrated map data, and cause the first display to display the one or more holograms at the second virtual place-located anchor at the second target virtual location from the first user's vantage point based on the integrated map data.
It will be understood that the configurations and/or approaches described herein are exemplary in nature, and that these specific embodiments or examples are not to be considered in a limiting sense, because numerous variations are possible. The specific routines or methods described herein may represent one or more of any number of processing strategies. As such, various acts illustrated and/or described may be performed in the sequence illustrated and/or described, in other sequences, in parallel, or omitted. Likewise, the order of the above-described processes may be changed.
The subject matter of the present disclosure includes all novel and non-obvious combinations and sub-combinations of the various processes, systems and configurations, and other features, functions, acts, and/or properties disclosed herein, as well as any and all equivalents thereof.
Claims
1. A server computing device, comprising:
- a processor;
- a non-volatile storage device operatively coupled to the processor; and
- an anchor transfer program stored in the non-volatile storage device and executed by the processor of the server computing device, wherein
- the anchor transfer program is configured to: receive a transfer request from a first autonomous vehicle for neighboring map data of a neighborhood around a second virtual place-located anchor at a second target virtual location including a second pose graph created based on sensor measurements of a second autonomous vehicle; responsive to the transfer request, retrieve and transmit the neighboring map data to the first autonomous vehicle storing local map data including at least a first pose graph created based on sensor measurements of the first autonomous vehicle, the first pose graph including a first virtual place-located anchor defining a first target virtual location; and cause the first autonomous vehicle to incorporate, via a stitching process, the neighboring map data into the local map data of the first autonomous vehicle to create integrated map data comprising keyframes, each keyframe including feature matching data, the stitching process including stitching together the second pose graph of the neighboring map data and the first pose graph of the local map data by connecting the first pose graph and the second pose graph based on a spatial relationship determined by the anchor transfer program of at least one point of the first pose graph and at least one point of the second pose graph, wherein the spatial relationship comprises the first pose graph, the second pose graph, and a third pose graph connecting the first virtual place-located anchor defining the first target virtual location to the second virtual place-located anchor defining the second target virtual location.
2. The server computing device of claim 1, wherein
- the anchor transfer program is further configured to: determine a pose of the first autonomous vehicle with predictive corrective localization based on the feature matching data in the keyframes of the integrated map data; and cause the first autonomous vehicle to align movements in aligned coordinate space with the second autonomous vehicle based on the integrated map data and the determined pose of the first autonomous vehicle.
3. The server computing device of claim 1, wherein
- a virtual model is anchored by the first virtual place-located anchor and the second virtual place-located anchor, the virtual model including predetermined spaces in which the movements of the first and second autonomous vehicles are instructed, and prohibited spaces in which the movements of the first and second autonomous vehicles are prohibited.
4. The server computing device of claim 1, wherein
- the anchor transfer program is configured to transmit to the first autonomous vehicle the neighboring map data in a serialized format; and
- the second autonomous vehicle receives the neighboring map data in a deserialized format.
5. The server computing device of claim 1, wherein
- the first target virtual location and the second target virtual location are world-locked to positions that are fixed in a three-dimensional coordinate space overlaid upon a real world three-dimensional environment.
6. The server computing device of claim 1, wherein
- the first target virtual location and the second target virtual location are world-locked to positions relative to objects in a real world three-dimensional environment.
7. The server computing device of claim 1, wherein the neighboring map data comprises keyframes and at least a portion of a pose-graph describing rotational motion and translational motion of the first autonomous vehicle and the second autonomous vehicle through a real world three-dimensional environment.
8. The server computing device of claim 7, wherein
- the sensor measurements of the first autonomous vehicle and the second autonomous vehicle are received from visual sensors and/or inertial measurement sensors of the first autonomous vehicle and the second autonomous vehicle, respectively.
9. The server computing device of claim 8, wherein the keyframes comprise at least one of a fingerprint of a Wi-Fi beacon, gravity data, temperature data, global positioning data, or calibration data.
10. The server computing device of claim 1, wherein the anchor transfer program is further configured to:
- receive an instruction from the first autonomous vehicle to generate the first virtual place-located anchor at the first target virtual location;
- generate the first virtual place-located anchor at the first target virtual location; and
- send the first virtual place-located anchor to the first autonomous vehicle.
11. The server computing device of claim 1, wherein the anchor transfer program is further configured to:
- receive an instruction from the second autonomous vehicle to generate the second virtual place-located anchor at the second target virtual location;
- generate the second virtual place-located anchor at the second target virtual location; and
- send the second virtual place-located anchor to the second autonomous vehicle.
12. A method comprising:
- receiving a transfer request from a first autonomous vehicle for neighboring map data of a neighborhood around a second virtual place-located anchor at a second target virtual location including a second pose graph created based on sensor measurements of a second autonomous vehicle;
- responsive to the transfer request, retrieving and transmitting the neighboring map data to the first autonomous vehicle storing local map data including at least a first pose graph created based on sensor measurements of the first autonomous vehicle, the first pose graph including a first virtual place-located anchor defining a first target virtual location;
- causing the first autonomous vehicle to incorporate, via a stitching process, the neighboring map data into the local map data of the first autonomous vehicle to create integrated map data comprising keyframes, each keyframe including feature matching data, the stitching process including stitching together the second pose graph of the neighboring map data and the first pose graph of the local map data by connecting the first pose graph and the second pose graph based on a spatial relationship determined of at least one point of the first pose graph and at least one point of the second pose graph, wherein the spatial relationship comprises the first pose graph, the second pose graph, and a third pose graph connecting the first virtual place-located anchor defining the first target virtual location to the second virtual place-located anchor defining the second target virtual location;
- determining a pose of the first autonomous vehicle with predictive corrective localization based on the feature matching data in the keyframes of the integrated map data; and
- causing the first autonomous vehicle to align movements in aligned coordinate space with the second autonomous vehicle based on the integrated map data and the determined pose of the first autonomous vehicle, wherein
- a virtual model is anchored by the first virtual place-located anchor and the second virtual place-located anchor, the virtual model including predetermined spaces in which the movements of the first and second autonomous vehicles are instructed, and prohibited spaces in which the movements of the first and second autonomous vehicles are prohibited.
13. The method of claim 12, wherein
- the neighboring map data is transmitted to the first autonomous vehicle in a serialized format; and
- the second autonomous vehicle receives the neighboring map data in a deserialized format.
14. The method of claim 12, wherein
- the first target virtual location and the second target virtual location are world-locked to positions that are fixed in a three-dimensional coordinate space overlaid upon a real world three-dimensional environment.
15. The method of claim 12, wherein
- the first target virtual location and the second target virtual location are world-locked to positions relative to objects in a real world three-dimensional environment.
16. The method of claim 12, wherein the neighboring map data comprises keyframes and at least a portion of a pose-graph describing rotational motion and translational motion of the first autonomous vehicle and the second autonomous vehicle through a real world three-dimensional environment.
17. The method of claim 16, wherein
- the sensor measurements of the first autonomous vehicle and the second autonomous vehicle are received from visual sensors and/or inertial measurement sensors of the first autonomous vehicle and the second autonomous vehicle, respectively.
18. The method of claim 17, wherein the keyframes comprise at least one of a fingerprint of a Wi-Fi beacon, gravity data, temperature data, global positioning data, or calibration data.
19. The method of claim 12, further comprising:
- receiving an instruction from the first autonomous vehicle to generate the first virtual place-located anchor at the first target virtual location;
- generating the first virtual place-located anchor at the first target virtual location; and
- sending the first virtual place-located anchor to the first autonomous vehicle.
20. A server computing device, comprising:
- a processor;
- a non-volatile storage device operatively coupled to the processor; and
- an anchor transfer program stored in the non-volatile storage device and executed by the processor of the server computing device, wherein
- the anchor transfer program is configured to: receive a transfer request from a first autonomous vehicle for neighboring map data of a neighborhood around a second virtual place-located anchor at a second target virtual location including a second pose graph created based on sensor measurements of a second autonomous vehicle; responsive to the transfer request, retrieve and transmit the neighboring map data to the first autonomous vehicle storing local map data including at least a first pose graph created based on sensor measurements of the first autonomous vehicle, the first pose graph including a first virtual place-located anchor defining a first target virtual location; cause the first autonomous vehicle to incorporate, via a stitching process, the neighboring map data into the local map data of the first autonomous vehicle to create integrated map data comprising keyframes, each keyframe including feature matching data, the stitching process including stitching together the second pose graph of the neighboring map data and the first pose graph of the local map data by connecting the first pose graph and the second pose graph based on a spatial relationship determined by the anchor transfer program of at least one point of the first pose graph and at least one point of the second pose graph, wherein the spatial relationship comprises the first pose graph, the second pose graph, and a third pose graph connecting the first virtual place-located anchor defining the first target virtual location to the second virtual place-located anchor defining the second target virtual location; determine a pose of the first autonomous vehicle with predictive corrective localization based on the feature matching data in the keyframes of the integrated map data; and cause the first autonomous vehicle to align movements in aligned coordinate space with the second autonomous vehicle based on the integrated map data and the determined pose of the first autonomous vehicle, wherein
- a virtual model is anchored by the first virtual place-located anchor and the second virtual place-located anchor, the virtual model including predetermined spaces in which the movements of the first and second autonomous vehicles are instructed, and prohibited spaces in which the movements of the first and second autonomous vehicles are prohibited.
Type: Application
Filed: Nov 29, 2023
Publication Date: Mar 21, 2024
Applicant: Microsoft Technology Licensing, LLC (Redmond, WA)
Inventors: Ethan EADE (Pittsburgh, PA), Jeroen VANTURENNOUT (Snohomish, WA), Jonathan LYONS (Kirkland, WA), David FIELDS (Kirkland, WA), Gavin Dean LAZAROW (Redmond, WA), Tushar Cyril BHATNAGAR (Seattle, WA)
Application Number: 18/522,989