TEMPORAL ELEMENT RESTORATION IN AUGMENTED REALITY SYSTEMS

Info

Publication number: 20190114811
Type: Application
Filed: Aug 24, 2018
Publication Date: Apr 18, 2019
Applicant: ELWHA LLC (Bellevue, WA)
Inventors: Gene Fein (Westlake, CA), Royce A. Levien (Lexington, MA), Richard T. Lord (Federal Way, WA), Robert W. Lord (Seattle, WA), Mark A. Malamud (Seattle, WA), John D. Rinaldo, JR. (Bellevue, WA), Clarence T. Tegreene (Mercer Island, WA)
Application Number: 16/112,338

Abstract

Methods, apparatuses, computer program products, devices and systems are described that carry out accepting a request associated with at least one of an item, an aspect, or an element that is not present in a field of view of a user's augmented reality device; presenting in a display of the augmented reality device at least one augmented reality representation related to the at least one item, aspect, or element in response to accepting a request associated with at least one item, aspect, or element that is not present in a field of view of an augmented reality device; and processing the request and any related interaction of the user via the at least one augmented reality representation.

Description

Description

If an Application Data Sheet (ADS) has been filed on the filing date of this application, it is incorporated by reference herein. Any applications claimed on the ADS for priority under 35 U.S.C. §§ 119, 120, 121, or 365(c), and any and all parent, grandparent, great-grandparent, etc. applications of such applications, are also incorporated by reference, including any priority claims made in those applications and any material incorporated by reference, to the extent such subject matter is not inconsistent herewith.

CROSS-REFERENCE TO RELATED APPLICATIONS

The present application is related to and/or claims the benefit of the earliest available effective filing date(s) from the following listed application(s) (the “Priority Applications”), if any, listed below (e.g., claims earliest available priority dates for other than provisional patent applications or claims benefits under 35 USC § 119(e) for provisional patent applications, for any and all parent, grandparent, greatgrandparent, etc. applications of the Priority Application(s)). In addition, the present application is related to the “Related Applications,” if any, listed below.

Priority Applications

NONE

Related Applications

U.S. patent application Ser. No. 13/646,147, entitled FORMATTING OF ONE OR MORE PERSISTENT AUGMENTATIONS IN AN AUGMENTED VIEW IN RESPONSE TO MULTIPLE INPUT FACTORS, naming Gene Fein, Royce A. Levien, Richard T. Lord, Robert W. Lord, Mark A. Malamud, John D. Rinaldo, Jr., and Clarence T. Tegreene as inventors, filed 5 Oct. 2012 with attorney docket no. 0412-003-001-000000, is related to the present application.

U.S. patent application Ser. No. 13/648,012, entitled FORMATTING OF ONE OR MORE PERSISTENT AUGMENTATIONS IN AN AUGMENTED VIEW IN RESPONSE TO MULTIPLE INPUT FACTORS, naming Gene Fein, Royce A. Levien, Richard T. Lord, Robert W. Lord, Mark A. Malamud, John D. Rinaldo, Jr., and Clarence T. Tegreene as inventors, filed 9 Oct. 2012 with attorney docket no. 0412-003-001-000001, is related to the present application.

U.S. patent application Ser. No. 13/672,575, entitled PRESENTING AN AUGMENTED VIEW IN RESPONSE TO ACQUISITION OF DATA INFERRING USER ACTIVITY, naming Gene Fein, Royce A. Levien, Richard T. Lord, Robert W. Lord, Mark A. Malamud, John D. Rinaldo, Jr., and Clarence T. Tegreene as inventors, filed 9 Nov. 2012 with attorney docket no. 0412-003-002-000000, is related to the present application.

U.S. patent application Ser. No. 13/673,070, entitled PRESENTING AN AUGMENTED VIEW IN RESPONSE TO ACQUISITION OF DATA INFERRING USER ACTIVITY, naming Gene Fein, Royce A. Levien, Richard T. Lord, Robert W. Lord, Mark A. Malamud, John D. Rinaldo, Jr., and Clarence T. Tegreene as inventors, filed 9 Nov. 2012 with attorney docket no. 0412-003-002-000001, is related to the present application.

U.S. patent application Ser. No. 13/689,372, entitled SYSTEMS AND METHODS FOR SHARING AUGMENTATION DATA, naming Gene Fein, Royce A. Levien, Richard T. Lord, Robert W. Lord, Mark A. Malamud, John D. Rinaldo, Jr., and Clarence T. Tegreene as inventors, filed 29 Nov. 2012 with attorney docket no. 0412-003-003-000000, is related to the present application.

U.S. patent application Ser. No. 13/690,003, entitled SYSTEMS AND METHODS FOR SHARING AUGMENTATION DATA, naming Gene Fein, Royce A. Levien, Richard T. Lord, Robert W. Lord, Mark A. Malamud, John D. Rinaldo, Jr., and Clarence T. Tegreene as inventors, filed 30 Nov. 2012 with attorney docket no. 0412-003-003-000001, is related to the present application.

U.S. patent application Ser. No. 13/709,465, entitled SYSTEMS AND METHODS FOR OBTAINING AND USING AUGMENTATION DATA AND FOR SHARING USAGE DATA, naming Gene Fein, Royce A. Levien, Richard T. Lord, Robert W. Lord, Mark A. Malamud, John D. Rinaldo, Jr., and Clarence T. Tegreene as inventors, filed 10 Dec. 2012 with attorney docket no. 0412-003-004-000000, is related to the present application.

U.S. patent application Ser. No. 13/711,095, entitled SYSTEMS AND METHODS FOR OBTAINING AND USING AUGMENTATION DATA AND FOR SHARING USAGE DATA, naming Gene Fein, Royce A. Levien, Richard T. Lord, Robert W. Lord, Mark A. Malamud, John D. Rinaldo, Jr., and Clarence T. Tegreene as inventors, filed 11 Dec. 2012 with attorney docket no. 0412-003-004-000001, is related to the present application.

U.S. patent application Ser. No. 13/721,340, entitled CORRELATING USER REACTION WITH AT LEAST AN ASPECT ASSOCIATED WITH AN AUGMENTATION OF AN AUGMENTED VIEW, naming Gene Fein, Royce A. Levien, Richard T. Lord, Robert W. Lord, Mark A. Malamud, John D. Rinaldo, Jr., and Clarence T. Tegreene as inventors, filed 20 Dec. 2012 with attorney docket no. 0412-003-005-000000, is related to the present application.

U.S. patent application Ser. No. 13/723,610, entitled CORRELATING USER REACTION WITH AT LEAST AN ASPECT ASSOCIATED WITH AN AUGMENTATION OF AN AUGMENTED VIEW, naming Gene Fein, Royce A. Levien, Richard T. Lord, Robert W. Lord, Mark A. Malamud, John D. Rinaldo, Jr., and Clarence T. Tegreene as inventors, filed 21 Dec. 2012 with attorney docket no. 0412-003-005-000001, is related to the present application.

U.S. patent application Ser. No. 13/729,278, entitled CORRELATING USER REACTIONS WITH AUGMENTATIONS DISPLAYED THROUGH AUGMENTED VIEWS, naming Gene Fein, Royce A. Levien, Richard T. Lord, Robert W. Lord, Mark A. Malamud, John D. Rinaldo, Jr., and Clarence T. Tegreene as inventors, filed 28 Dec. 2012 with attorney docket no. 0412-003-006-000000, is related to the present application.

U.S. patent application Ser. No. 13/731,233, entitled CORRELATING USER REACTIONS WITH AUGMENTATIONS DISPLAYED THROUGH AUGMENTED VIEWS, naming Gene Fein, Royce A. Levien, Richard T. Lord, Robert W. Lord, Mark A. Malamud, John D. Rinaldo, Jr., and Clarence T. Tegreene as inventors, filed 31 Dec. 2012 with attorney docket no. 0412-003-006-000001, is related to the present application.

U.S. patent application Ser. No. 13/768,048, entitled DISPLAYING IN RESPONSE TO DETECTING ONE OR MORE USER BEHAVIORS ONE OR MORE SECOND AUGMENTATIONS THAT ARE BASED ON ONE OR MORE REGISTERED FIRST AUGMENTATIONS, naming Gene Fein, Royce A. Levien, Richard T. Lord, Robert W. Lord, Mark A. Malamud, John D. Rinaldo, Jr., and Clarence T. Tegreene as inventors, filed 15 Feb. 2013 with attorney docket no. 0412-003-007-000000, is related to the present application.

U.S. patent application Ser. No. 13/770,053, entitled DISPLAYING IN RESPONSE TO DETECTING ONE OR MORE USER BEHAVIORS ONE OR MORE SECOND AUGMENTATIONS THAT ARE BASED ON ONE OR MORE REGISTERED FIRST AUGMENTATIONS, naming Gene Fein, Royce A. Levien, Richard T. Lord, Robert W. Lord, Mark A. Malamud, John D. Rinaldo, Jr., and Clarence T. Tegreene as inventors, filed 19 Feb. 2013 with attorney docket no. 0412-003-007-000001, is related to the present application.

The United States Patent Office (USPTO) has published a notice to the effect that the USPTO's computer programs require that patent applicants reference both a serial number and indicate whether an application is a continuation, continuation-in-part, or divisional of a parent application. Stephen G. Kunin, Benefit of Prior-Filed Application, USPTO Official 5 Gazette Mar. 18, 2003. The USPTO further has provided forms for the Application Data Sheet which allow automatic loading of bibliographic data but which require identification of each application as a continuation, continuation-in-part, or divisional of a parent application. The present Applicant Entity (hereinafter “Applicant”) has provided above a specific reference to the application(s)from which priority is being claimed as recited by statute.

Applicant understands that the statute is unambiguous in its specific reference language and does not require either a serial number or any characterization, such as “continuation” or “continuation-in-part,” for claiming priority to U.S. patent applications. Notwithstanding the foregoing, Applicant understands that the USPTO's computer programs have certain data entry requirements, and hence Applicant has provided designation(s) of a relationship between the present application and its parent application(s) as set forth above and in any ADS filed in this application, but expressly points out that such designation(s) are not to be construed in any way as any type of commentary and/or admission as to whether or not the present application contains any new matter in addition to the matter of its parent application(s).

If the listings of applications provided above are inconsistent with the listings provided via an ADS, it is the intent of the Applicant to claim priority to each application that appears in the Priority Applications section of the ADS and to each application that appears in the Priority Applications section of this application.

All subject matter of the Priority Applications and the Related Applications and of any and all parent, grandparent, great-grandparent, etc. applications of the Priority Applications and the Related Applications, including any priority claims, is incorporated herein by reference to the extent such subject matter is not inconsistent herewith.

TECHNICAL FIELD

This description relates to data capture, data handling, and data display techniques.

SUMMARY

An embodiment provides a system. In one implementation, the system includes but is not limited to circuitry for accepting a request associated with at least one of an item, an aspect, or an element that is not present in a field of view of a user's augmented reality device; circuitry for presenting in a display of the augmented reality device at least one augmented reality representation related to the at least one item, aspect, or element in response to at least one ouput of the circuitry for accepting a request associated with at least one item, aspect, or element that is not present in a field of view of an augmented reality device; and circuitry for processing the request and any related interaction of the user via the at least one augmented reality representation. In addition to the foregoing, other system aspects are described in the claims, drawings, and text forming a part of the present disclosure.

In one or more various aspects, related systems include but are not limited to circuitry and/or programming for effecting the herein-referenced method aspects; the circuitry and/or programming can be virtually any combination of hardware, software, and/or firmware configured to effect the herein-referenced method aspects depending upon the design choices of the system designer.

In one or more various aspects, related systems include but are not limited to computing means and/or programming for effecting the herein-referenced method aspects; the computing means and/or programming may be virtually any combination of hardware, software, and/or firmware configured to effect the herein-referenced method aspects depending upon the design choices of the system designer.

An embodiment provides a computer-implemented method. In one implementation, the method includes but is not limited to accepting a request associated with at least one of an item, an aspect, or an element that is not present in a field of view of a user's augmented reality device; presenting in a display of the augmented reality device at least one augmented reality representation related to the at least one item, aspect, or element in response to accepting a request associated with at least one item, aspect, or element that is not present in a field of view of an augmented reality device; and processing the request and any related interaction of the user via the at least one augmented reality representation. In addition to the foregoing, other method aspects are described in the claims, drawings, and text forming a part of the present disclosure.

An embodiment provides an article of manufacture including a computer program product. In one implementation, the article of manufacture includes but is not limited to a signal-bearing medium configured by one or more instructions related to accepting a request associated with at least one of an item, an aspect, or an element that is not present in a field of view of a user's augmented reality device; presenting in a display of the augmented reality device at least one augmented reality representation related to the at least one item, aspect, or element in response to accepting a request associated with at least one item, aspect, or element that is not present in a field of view of an augmented reality device; and processing the request and any related interaction of the user via the at least one augmented reality representation. In addition to the foregoing, other computer program product aspects are described in the claims, drawings, and text forming a part of the present disclosure.

An embodiment provides a system. In one implementation, the system includes but is not limited to a computing device and instructions. The instructions when executed on the computing device cause the computing device to accept a request associated with at least one of an item, an aspect, or an element that is not present in a field of view of a user's augmented reality device; present in a display of the augmented reality device at least one augmented reality representation related to the at least one item, aspect, or element in response to accepting a request associated with at least one item, aspect, or element that is not present in a field of view of an augmented reality device; and process the request and any related interaction of the user via the at least one augmented reality representation. In addition to the foregoing, other system aspects are described in the claims, drawings, and text forming a part of the present disclosure.

In addition to the foregoing, various other method and/or system and/or program product aspects are set forth and described in the teachings such as text (e.g., claims and/or detailed description) and/or drawings of the present disclosure.

The foregoing is a summary and thus may contain simplifications, generalizations, inclusions, and/or omissions of detail; consequently, those skilled in the art will appreciate that the summary is illustrative only and is NOT intended to be in any way limiting. Other aspects, features, and advantages of the devices and/or processes and/or other subject matter described herein will become apparent in the teachings set forth herein.

BRIEF DESCRIPTION OF THE FIGURES

With reference now to FIG. 1, shown is a number of examples of augmented reality devices.

FIG. 2 illustrates an augmented reality device and a real world field of view of its camera.

FIG. 3 illustrates one embodiment wherein a user interacts with a system to select, drag, and drop an augmented reality representation of a book.

FIGS. 4a, 4b, 4c, 4d, 4e, and 4f illustrate an example of a system for selecting, dragging, and dropping in augmented reality systems in which embodiments may be implemented, perhaps in a device and/or through a network, which may serve as a context for introducing one or more processes and/or devices described herein.

With reference now to FIG. 5, shown is an example of an operational flow representing example operations related to selecting, dragging, and dropping in augmented reality systems, which may serve as a context for introducing one or more processes and/or devices described herein.

FIG. 6 illustrates an alternative embodiment of the example operational flow of FIG. 5.

FIG. 7 illustrates an alternative embodiment of the example operational flow of FIG. 5.

FIG. 8 illustrates an alternative embodiment of the example operational flow of FIG. 5.

FIG. 9 illustrates an alternative embodiment of the example operational flow of FIG. 5.

FIG. 10 illustrates an alternative embodiment of the example operational flow of FIG. 5.

FIG. 11 illustrates an alternative embodiment of the example operational flow of FIG. 5.

FIG. 12 illustrates an alternative embodiment of the example operational flow of FIG. 5.

With reference now to FIG. 13, shown is an example of an operational flow representing example operations related to selecting, dragging, and dropping in augmented reality systems, which may serve as a context for introducing one or more processes and/or devices described herein.

With reference now to FIG. 14, shown is an example of an operational flow representing example operations related to selecting, dragging, and dropping in augmented reality systems, which may serve as a context for introducing one or more processes and/or devices described herein.

FIG. 15 illustrates an example of a system for dynamically preserving scene elements in augmented reality systems in which embodiments may be implemented, perhaps in a device and/or through a network, which may serve as a context for introducing one or more processes and/or devices described herein.

FIGS. 16-18 depict a scenario in which there is no ability to dynamically preserve scene elements in an augmented reality system. Shown is an example of a user attempting and failing to select a displayed moving person.

FIGS. 19-23 depict a scenario in which there is an ability to dynamically preserve scene elements in an augmented reality system. Shown is an example of a user attempting and succeeding to select and interact with a displayed (initially) moving person.

With reference now to FIG. 24, shown is an example of an operational flow representing example operations related to dynamically preserving scene elements in augmented reality systems, which may serve as a context for introducing one or more processes and/or devices described herein.

FIG. 25 illustrates an alternative embodiment of the example operational flow of FIG. 24.

FIG. 26 illustrates an alternative embodiment of the example operational flow of FIG. 24.

FIG. 27 illustrates an alternative embodiment of the example operational flow of FIG. 24.

FIG. 28 illustrates an alternative embodiment of the example operational flow of FIG. 24.

With reference now to FIG. 29, shown is an example of an operational flow representing example operations related to dynamically preserving scene elements in augmented reality systems, which may serve as a context for introducing one or more processes and/or devices described herein.

FIG. 30 illustrates an alternative embodiment of the example operational flow of FIG. 29.

FIG. 31 illustrates an example of a system for temporal element restoration in augmented reality systems in which embodiments may be implemented, perhaps in a device and/or through a network, which may serve as a context for introducing one or more processes and/or devices described herein.

FIGS. 32-40 depict stages in a scenario showing an example of temporal element restoration in an augmented reality system. Shown are stages of a user reserving a taxi cab seen through an augmented reality device, and later confirming the reservation through interaction with an augmented reality representation of the taxi cab superimposed on a scene in which the taxi cab is not actually present.

With reference now to FIG. 41, shown is an example of an operational flow representing example operations related to temporal element restoration in an augmented reality system, which may serve as a context for introducing one or more processes and/or devices described herein.

FIG. 42 illustrates an alternative embodiment of the example operational flow of FIG. 41.

FIG. 43 illustrates an alternative embodiment of the example operational flow of FIG. 41.

FIG. 44 illustrates an alternative embodiment of the example operational flow of FIG. 41.

FIG. 45 illustrates an alternative embodiment of the example operational flow of FIG. 41.

FIG. 46 illustrates an alternative embodiment of the example operational flow of FIG. 41.

FIG. 47 illustrates an alternative embodiment of the example operational flow of FIG. 41.

The use of the same symbols in different drawings typically indicates similar or identical items unless context dictates otherwise.

DETAILED DESCRIPTION

In a world where people interact through augmented reality devices, such dedicated augmented reality devices (e.g., Google Glass eyeglasses), smartphones, digital cameras, camcorders, and tablets, the augmented reality display or interface provides a window on the real world, onto which is layered one or more computer generated objects, digital images, or functions. Stucturally and semantically, an augmented reality user interface is fundamentally responsive to a physical state in physical proximity to the user's device. Aspects of physical reality are typically represented on the screen; however even if they are not represented on the screen they typically still affect what's happening on the screen to some extent. This may be contrasted with virtual reality, in which a user's senses are typically fed a completely computer generated theme or environment, as an artificial sensorium.

Cross-Reality Drag and Drop

As a courtesy to the reader, and with reference to the accompanying figures herein, in general “100 series” reference numerals will typically refer to items first introduced/described by FIG. 1, “200 series” reference numerals will typically refer to items first introduced/described by FIG. 2, “300 series” reference numerals will typically refer to items first introduced/described by FIG. 3, etc.

In the following detailed description, reference is made to the accompanying drawings, which form a part hereof. In the drawings, similar symbols typically identify similar components, unless context dictates otherwise. The illustrative embodiments described in the detailed description, drawings, and claims are not meant to be limiting. Other embodiments may be utilized, and other changes may be made, without departing from the spirit or scope of the subject matter presented here.

By way of background, the traditional computer screen “desktop” area includes a drag-and-drop functionality that allows for powerful manipulation of graphical objects and environments. This typically involves 1) a source, 2) an object, and 3) a destination. These three things may determine the semantics of operations in the drag-and-drop process.

In an augmented reality situation, as described herein, a user may perform a drag-and-drop operation from the real world into an augmented reality (“AR”) view or display, and vice versa. For example, if a user is wearing AR glasses in a bookstore, there may be an AR shopping cart displayed on the glasses, viewable to the user. The user may then find a real book on a book shelf at the book store, point to the real book, peel off or otherwise pull an augmented reality representation of the book into the AR shopping cart. When the user arrives at a checkout stand or register to buy the book, the user may grab the AR book from the shopping cart, drag it onto the real cash register at which point payment may be initiated by the bookstore, and the transaction completed. The user may also select options for delivery of a physical book to herself or as a gift to someone else, and/or delivery of an electronic book to a device.

In another example, a user sitting at home in their living room may view her AR device on which is displayed an augmented reality representation of stacks of DVD's, which are functional links to Netflix videos, for example. The user may reach out and grab an augmented reality representation of a video, Star Wars, for example, and drag it onto the television in the living room, thereby signaling to Netflix to begin play of a streaming video of Star Wars on the (internet-ready) television, with concomitant notation in the user's Netflix account as to what was watched by the user when, and on which device. In some scenarios this may involve an associated debit to a credit account or bank account.

In another example, a user in the lobby of a movie theater may see a movie poster for the latest installment in the Star Wars saga, due out in theaters next year. The user may grab an an augmented reality representation of the movie poster to an augmented reality wishlist on her augmented reality display, thereby updating her Netflix queue, for example, to schedule notification when the movie is being shown at the theater, and/or when it is available on Netflix for viewing.

In each of these examples, a camera or other detector will recognize and mark the source of an action, in other words the start of the “drag.” This is the object to be dragged. The camera or other detector will then monitor the “drag” or the motion away from the source object, and finally the camera or other detector will recognize or mark the destination, or “drop.” This is the place of significance where the augmented reality representation is going. Each end of the action may be marked explicitly by the user, for example by voice, touch (of the object or of the AR device), gesture, or other signal.

Unlike traditional drag-and-drop on a computer desktop environment, not only is there a recognition step, but the user is pointing to something in a scene in which there are a limited number of targets available, which serves to constrain the recognition problem. In one embodiment, a constraint may be that a movie player app, like hulu or Netflix is running on the AR device or on another device like a television in proximity to the user. In another example, if an e-reader such as a kindle device is open during a book-shopping experience, that may be used as a constraint to tell the system to look for books in the environment during the recognition step.

Recognition of the intended object(s) will typically happen via image data from a camera viewing a scene through an AR device. Context that the user is in may be taken into account. For example, the AR device may recognize types of stores or collections of items such as books or DVD's; or even diverse collections of objects such as items at a grocery store.

Voice may be used to signal the correct recognition for “grabbing” an object prior to dragging it. Other ways of marking the start of the drag may also be used such as touching the object, tapping the object, touching a sensitive part of the AR device itself such as a button or touchscreen, and/or making a gesture that was pre-programmed in to the AR device to tell the system that a selection had been made for dragging.

In one embodiment, speech alone may be used to drag-and-drop an augmented reality representation.

In another embodiment, eye tracking may be used to identify, recognize, and select what the user is looking at, track the arc of movement, dragging, or transfer, and identify, recognize, and select the destination for the drop.

As used herein, “augmentation”, “virtual” or “augmented reality representation” may refer to things that are added to a display of a real scene, such as computer generated images, text, or photographs.

In one embodiment, a system might include a hand-held augmented reality device, with at least one sensor (such as a camera), at least one graphical display for user output, and at least one touch screen (or other similar device) for user input. As directed by the user, the augmented reality device may activate and display an augmented reality scene that includes real interface objects (such as those imaged by the augmented reality device's camera) and at least one augmented reality representation of an object.

In one embodiment, a real interface object in an augmented reality display is detected and selected (e.g., by a first gesture, voice command, touch, or some other predetermined method) and then moved (e.g., the augmented reality device tracks movement using a second gesture, voice command, or some other predetermined method) within the augmented reality interface as an augmented reality (or virtual) representation of that object, either leaving the first real interface object unmodified, or removing the first real interface object from the scene. In response to selection and movement of the real interface object in the augmented reality interface, at least one destination for dropping the augmented reality representation of the object is presented in the augmented reality interface, perhaps in proximity to the real interface object. Destinations on the display for a drop may include a thumbnail photo, an icon, or some other symbol that in some cases will convey functionality upon the augmented reality representation of the object when dropped. A destination icon or symbol represents a target upon which the representation of the real interface object may be dropped (e.g., by a third gesture, voice command, or some other predetermined method).

For example, imagine a user is viewing an augmented reality scene in a retail store. She will see the real objects in the store (such as books, microwave ovens, and housewares) as well as virtual objects in the augmented reality display (such as product annotations and a shopping cart that follows her wherever she goes). If she wishes to purchase one of the books she sees on a shelf, within the augmented reality interface she may “pick up” a representation of all twelve volumes of the real Oxford English Dictionary with a hand gesture, drag, and then drop the augmented reality representation of them into her virtual shopping cart for check-out, at which point she may decide, for example, to purchase either a real or an electronic copy of the book, or both.

In another embodiment, a virtual interface object in an augmented reality display is selected (by a first gesture, voice command, touch, or some other pre-determined method) and then moved (by a second gesture, voice command, or some other predetermined method) within the augmented reality interface. In response to selection and movement of the virtual interface object in the augmented reality interface, at least one real interface object may be presented in the augmented reality interface in proximity of the virtual interface object. Each real interface object in the augmented reality interface represents a target upon which the virtual interface object may be dropped (by a third gesture, voice command, or some other predetermined method).

For example, imagine you are viewing an augmented reality scene of your family rec room. You see all the real objects in the room (such as a television, a table, a couch, bookshelves, et cetera) overlaid with augmentations (such as the list of digital movies you own, perhaps represented by a stack of virtual DVDs on the table by the TV). You wish to watch one of the digital James Bond movies you own, so within the augmented reality interface you pick up the virtual Goldfinger DVD, drag, and drop it on the real television screen. The movie then begins to play on the real television (or it might be overlaid as an augmentation to the real television so only the user sees it, or both).

In another example, a user is given a photograph by a friend, and would like to post it on her social network page, such as her Facebook page. She may select the photograph with a hand gesture or voice, drag the resulting augmented reality representation of the photograph to a Fb icon in the corner of her augmented reality display, and drop it there to register her Facebook page as a destination for a digital copy of photograph to go. This works similarly for images to be added to Pinterest, notes to be added to a personal electronic diary, and other personal data repositories.

As a courtesy to the reader, and with reference to the accompanying figures herein, in general “100 series” reference numerals will typically refer to items first introduced/described by FIG. 1, “200 series” reference numerals will typically refer to items first introduced/described by FIG. 2, “300 series” reference numerals will typically refer to items first introduced/described by FIG. 3, etc.

In the following detailed description, reference is made to the accompanying drawings, which form a part hereof. In the drawings, similar symbols typically identify similar components, unless context dictates otherwise. The illustrative embodiments described in the detailed description, drawings, and claims are not meant to be limiting. Other embodiments may be utilized, and other changes may be made, without departing from the spirit or scope of the subject matter presented here.

FIG. 1 shows several devices that may be used for augmented reality interactions with a user. These include tablet device 100 having tablet camera screen 102, smartphone 104 having smartphone camera screen 106, digital camera 108, augmented reality eyeglasses 110 (showing an augmentation in the form of compass heading, e.g., “SW,” and ambient temperature, e.g., “65° F.”), and video camera 112. Other form factors may be fabricated having the functionality described herein.

FIG. 2 shows an augmented reality device (smartphone) 204 having an augmented reality display 208, which depicts an image of the real world field of view of the augmented reality device (smartphone camera field of view) 200, including the augmentated reality representation 206, e.g., “SW 65° F.”

FIG. 3 illustrates an example augmented reality system 322 in which embodiments may be implemented. The system 322 may work through an augmented reality device 302 for use by user 300. Augmented reality system 322 may be implemented on augmented reality device 302, or it may be implemented remotely, in whole or in part, for example as a cloud service through network 304 to communicate with augmented reality device 302. The augmented reality system 322 may contain, for example, environmental context evaluation module 306, augmented reality device context evaluation module 308, object selection module 310, image processing module 312, image database 314, digital image generation module 316, user movement tracking module 318, destination selection module 319, and drop registration module 320. Augmented reality system 322 running on or through augmented reality device 302 may communicate over a network 304, wirelessly or by hardwired connection. By way of network 304, which may include a cloud computing component, augmented reality system 322 may communicate with network payment system 324, including credit account 326, google wallet 328, and/or PayPal 330. Augmented reality system 322 may also communicate via network 304 with a retailer 332, such as Target 334. Augmented reality system 322 may also communicate via network 304 with online data service 336 such as Facebook 338, iTunes 340, and/or Google Play app store 342.

In this way, a user may interact with digital representations of her environment in order to, inter alia, complete transactions, collect items of interest, e.g., digital media including digital images of real objects, or manipulate things such as movies and games for viewing or play, respectively.

As referenced herein, the augmented reality system 322 may be used to perform various data querying and/or recall techniques with respect to real world objects and/or augmented reality representations of real world objects. For example, where real world object image data is organized, keyed to, and/or otherwise accessible using one or more image databases, augmented reality system 322 may employ various Boolean, statistical, and/or semi-boolean searching techniques to select a correct real world object image among a set of images in a real world scene, e.g., by object selection module 310, and also to provide an augmented reality representation of the object, either by finding one in, e.g., image database 314, or by generating one, e.g., via digital image generation module 316.

Many examples of databases and database structures may be used in connection with augmented reality system 322. Such examples include hierarchical models (in which data is organized in a tree and/or parent-child node structure), network models (based on set theory, and in which multi-parent structures per child node are supported), or object/relational models (combining the relational model with the object-oriented model).

Still other examples include various types of eXtensible Mark-up Language (XML) databases. For example, a database may be included that holds data in some format other than XML, but that is associated with an XML interface for accessing the database using XML. As another example, a database may store XML data directly. Additionally, or alternatively, virtually any semi-structured database may be used, so that context may be provided to/associated with stored data elements (either encoded with the data elements, or encoded externally to the data elements), so that data storage and/or access may be facilitated.

Such databases, and/or other memory storage techniques, may be written and/or implemented using various programming or coding languages. For example, object-oriented database management systems may be written in programming languages such as, for example, C++ or Java. Relational and/or object/relational models may make use of database languages, such as, for example, the structured query language (SQL), which may be used, for example, for interactive queries for disambiguating information and/or for gathering and/or compiling data from the relational database(s).

For example, SQL or SQL-like operations over one or more real world object image data may be performed, or Boolean operations using real world object image data 301 may be performed. For example, weighted Boolean operations may be performed in which different weights or priorities are assigned to one or more real world object images depending on context of the scene or context of the device 302, including other programs running on the device 302, perhaps relative to one another. For example, a number-weighted, exclusive-OR operation may be performed to request specific weightings of categories of objects depending on recognized cues such as geodata indicating location at a bookstore, e.g.

FIG. 4 illustrates an example of a user interaction with the instant augmented reality system. FIG. 4a depicts an augmented reality device (a smartphone showing on its display a bookshelf containing books in the camera's field of view.

FIG. 4b depicts a user's hand pointing to a book on one of the shelves; this gesture is detected by e.g., augmented reality system 322 and/or image processing module 312, which may capture text printed on the spine of the book in the vicinity of, or touched by, the user's index finger. Further, augmented reality device context evaluation module 308 may detect that the device is running a program having virtual shopping cart functionality associated with a specific bookstore (as indicated by the shopping cart image in the lower left corner of FIGS. 4b-4f), and, if there were other non-book items in the scene, the system may use the bookstore-related virtual shopping cart as a filter such that only books in the scene are considered for selection. In some embodiments, a menu, e.g., a drop-down menu of book titles for example may be presented for the user to select from.

Upon selection of a book, augmented reality system 322 and/or digital image generation module 316 may find in image database 314 and display, or create and display an augmented reality representation 417 of the selected book.

FIG. 4c depicts that a single book on the bookshelf is highlighted, corresponding to the one pointed to by the user's index finger.

FIG. 4d depicts a more detailed augmented reality representation 417 of the selected book in association with the user's hand as the hand moves toward the shopping cart icon on the display. This is the moving or dragging operation that will, upon the book's arrival at the shopping cart, tell the system that the information about the book should be recorded in the user's shopping cart account, perhaps on the bookstore's webpage. This is registration of the drop. For example, destination selection module 319 and/or drop registration module 320 may register the displayed augmented reality representation of the book at the shopping cart icon in the display of the augmented reality device in response to detecting that the user moved his pointing hand to the icon, as tracked by, e.g., user movement tracking module 318.

Optionally, the augmented reality display may provide an indication of registration of the drop, as shown in FIG. 4f, in which the shopping cart icon has been modified to include a 1 on it, indicating that there is one item in the cart.

The converse operations may also be carried out by augmented reality system 322, from AR to reality. This includes detecting an augmented reality representation 417 on a display, moving the displayed augmented reality representation 417 on the display of the augmented reality device according to at least one detected second action of the user (e.g., dragging it onto a real world item), and registering the displayed augmented reality representation at a location in a real world field of view of the augmented reality device in response to e.g., a dragging gesture ending at a credit card processing device for paying for a book, ending at a television for playing a movie, or ending at a car for transferring an audiobook to the car from a smartphone, for example.

And of course a reality-to-AR and back again process can be performed by the system, as shown in FIG. 14. One example of this is the complete process of detecting/selecting a real item indicated by a user, dragging an augmented reality representation 417 of it to a place on an AR device, then detecting/selecting it again for movement to a different real world object. One example of this is the complete process of choosing a book from a bookshelf at a bookstore, placing it in a virtual shopping cart, and then retrieving it for payment at a credit card processing device.

FIGS. 5-14 illustrate operational flows representing example operations related to selecting, dragging, and dropping in augmented reality systems. In the following figures that include various examples of operational flows, discussion and explanation may be provided with respect to the above-described system environments of FIGS. 1-4, and/or with respect to other examples and contexts. However, it should be understood that the operational flows may be executed in a number of other environments and contexts and/or in modified versions of FIGS. 1-4. Also, although the various operational flows are presented in the sequence(s) illustrated, it should be understood that the various operations may be performed in other orders than those which are illustrated, or may be performed concurrently.

Dynamically Preserving Scene Elements in Augmented Reality Systems

In a situation in which a user is looking at a real world scene through AR eyeglasses, for example, the user may want to select some object or person within the scene to interact with via the AR glasses. For example, if a user sees David Bowie throught his glasses, she may want to select the image of David Bowie seen through the AR glasses to activate an option to buy some of David Bowie's music online, perhaps for wireless download to the AR glasses. User input may include eyetracking, voice, gesture, or touch on the AR device or on another device such as a smartphone linked to the AR glasses (e.g., via Bluetooth), among other input modalities.

The present application, in one embodiment, provides a system in which the elements of a scene presented on an AR device may be modified or transformed in ways that preserve elements or aspects of interest to the user (or to the system) so that the user (or the system) can complete operations on such elements in cases where they would otherwise become inaccessible or unusable. As discussed in more detail below and in the claims, other embodiments include a method or system capable of pausing, or otherwise modifying, the presentation of a scene or scene elements so that elements of interest to a user that would otherwise become inaccessible are available for as long as they are needed for an interaction.

Some method aspects of the present disclosure include (a) receiving a request associated with an item, aspect, or element presented in a scene; (b) detecting that a first presentation of the item, aspect, or element has left, or is about to leave, the field of view of the scene or otherwise become inaccessible or difficult to access in the context of a present activity; (c) preserving a presentation or proxy presentation associated with the item, aspect, or element by one or more of, but not limited to (i) slowing the update or frame rate or presentation rate of the scene or aspect of the scene; (ii) retaining/capturing or incorporating a presentation of the item, aspect, or element in the scene; (iii) generating a simulated presentation of the item, aspect, or element; or (iv) generating a proxy affordance for the item, aspect, or element.

Additionally, embodiments may include (d) restoring the first presentation in response to one or more of (i) the end of the inaccessibility of the item in the context of the first presentation; (ii) user input; or (iii) the end of the present activity.

In one example, the present disclosure provides a way to slow down or pause a scene while the user interacts with items that would soon leave or become obscured in a “live” scene, optionally followed by a process of catching up to the state of the live display.

Further aspects may include one or more of (in various combinations in different embodiments): (e) determining one or more scene presentation specifications (e.g., rules for generating the scene; “live” vs. delayed, field of view, illumination, focus, highlighting, zoom, etc.); (f) determining presentations of interest, corresponding to one or more items, aspects, or elements of a scene based on one or more of (i) user task; (ii) system task; (iii) context; (iv) user interest; (v) user preferences; (g) determining a difficulty of interaction regarding the presentations of interest, according to a first (current) scene presentation specification (e.g., recognizing that an item will go off the screen or move behind an obstacle or become small or hard to discern or touch if it follows its current trajectory or the update speed is maintained or the user continues to move his device or position in the current manner); (h) modifying aspects of of a first scene presentation specification, and/or replacing a first scene presentation specification with a second scene presentation specification, which modifications or replacements reduce the difficulty of interaction regarding the presentations of interest; (i) restoring (e.g with animation or other transitions) the first scene specification, and/or removing modifications to the first scene specification in response to or prediction of (i) a determination of the end of a user's interest in or interaction with a presentation of interest; (ii) a determination of a reduction in difficulty of interaction regarding a presentation of interest employing the first scene presentation; (iii) a user request; (iv) at least one of a context, task or setting change; or (v) notification or interruption.

The present disclosure in some embodiments thus provides a way to modify the presentation of items of interest in a scene that the rules for constructing that scene would otherwise make inaccessible or unusable, e.g., by modifying the rules for generating the scene or aspects of the scene; followed, optionally, by restoring those rules.

One embodiment includes methods for pausing, capturing, or generating presentations associated with elements relevant to a current task or interaction long enough for that task or interaction to be completed, in situations in which the present scene display would cause those presentations to become inaccessible or otherwise difficult to interact with.

For example, a user might start to reserve a taxi by interacting with the presentation of a taxi as it drives by, but be in the middle of that interaction as the cab speeds away or becomes obscured by a bus or building, or drives into the distance and becomes too small to easily interact with on the screen. The present methods and systems may “pause” or “slow down” a scene or parts of a scene long enough for a user to complete her interaction, and then optionally “catch up” (e.g., by something like fast forward) to “live action” once the interaction is complete. In another embodiment, the present methods and systems might enlarge the cab (if it has receded from view and become too small) or simulate the removal of an obscuring object, such as another vehicle, a building, or a sign.

The present systems and methods also allow for “catching up” modified or delayed scenes or presentations with live scene presentations, when the delay or modifications are no longer necessary or desired. These aspects are discussed above and may be used in scenarios in which the display of a scene or aspects of a scene have been modified with respect to an initial scene presentation specification. In particular, the present methods and systems may include a determination that certain presentations or aspects of a presentation are being managed, modified or manipulated, and s “release” of those modifications or manipulations in response to demands of task, context, or user input.

Additional aspects include tools for building applications and systems supporting the features above, including platform elements, APIs, and class frameworks that provide the relevant functionality, such as: “presentation needed” states; “presentation no longer needed” events; attributes characterizing the usability and accessibility of scene elements (e.g., too small to touch, too fast-moving to track) for users with physical or cognitive impairments, and associated events (e.g., “object has become too small”), etc.

FIG. 15 illustrates an example augmented reality system 1522 in which embodiments may be implemented. The system 1522 may work through an augmented reality device 1502 for use by user 1500. Augmented reality system 1522 may be implemented on augmented reality device 1502, or it may be implemented remotely, in whole or in part, for example as a cloud service through network 1504 to communicate with augmented reality device 1502. Augmented reality device 1502 will have a visual field of view 200 that will include real world object image data 1501 and real world object motion data 1503.

The augmented reality system 1522 may contain, for example, environmental context evaluation module 1506, augmented reality device context evaluation module 1508, request detection module 1510, object detection and tracking module 1511, object vector, speed, acceleration, and trajectory tracking module 1511, image presentation modification module 1513, video manipulation module 1514, image database 1515, digital image generation module 1516, augmented reality representation 1517, device field of view tracking module 1518, menu presentation module 1519, and/or presentation restoration module 1520. Augmented reality system 322 running on or through augmented reality device 1502 may communicate over a network 1504, wirelessly or by hardwired connection. By way of network 1504, which may include a cloud computing component, augmented reality system 1522 may communicate to effectuate transactions or other interactions with network payment system 1524, including credit account 1526, google wallet 1528, and/or PayPal 1530. Augmented reality system 1522 may also communicate to effectuate transactions or other interactions via network 1504 with a retailer 1532, such as Taxi company 1534 or an online retailer, such as Amazon.com 1535 or iTunes 1540. Augmented reality system 1522 may also communicate to effectuate transactions or other interactions via network 1504 with online data service 1536 such as Facebook 1538, iTunes 1540, and/or Google Play app store 1542.

In this way, a user may interact with digital representations of her environment in order to, inter alia, complete transactions, collect physical or digital items of interest, e.g., order physical goods or create and transfer digital media including digital images of real objects, or upload digital media to a social network such as Facebook or Pinterest.

As referenced herein, the augmented reality system 1522 may be used to perform various data querying and/or recall techniques with respect to real world objects and/or augmented reality representations of real world objects. For example, where real world object image data is organized, keyed to, and/or otherwise accessible using one or more image databases, augmented reality system 1522 may employ various Boolean, statistical, and/or semi-boolean searching techniques to select a correct real world object image among a set of images in a real world scene, e.g., by request detection module 1510, and also to provide an augmented reality representation 1517 of the object, either by finding one in, e.g., image database 1515, or by generating one, e.g., via digital image generation module 1516.

Many examples of databases and database structures may be used in connection with augmented reality system 1522. Such examples include hierarchical models (in which data is organized in a tree and/or parent-child node structure), network models (based on set theory, and in which multi-parent structures per child node are supported), or object/relational models (combining the relational model with the object-oriented model).

Still other examples include various types of eXtensible Mark-up Language (XML) databases. For example, a database may be included that holds data in some format other than XML, but that is associated with an XML interface for accessing the database using XML. As another example, a database may store XML data directly. Additionally, or alternatively, virtually any semi-structured database may be used, so that context may be provided to/associated with stored data elements (either encoded with the data elements, or encoded externally to the data elements), so that data storage and/or access may be facilitated.

Such databases, and/or other memory storage techniques, may be written and/or implemented using various programming or coding languages. For example, object-oriented database management systems may be written in programming languages such as, for example, C++ or Java. Relational and/or object/relational models may make use of database languages, such as, for example, the structured query language (SQL), which may be used, for example, for interactive queries for disambiguating information and/or for gathering and/or compiling data from the relational database(s).

For example, SQL or SQL-like operations over one or more real world object image data may be performed, or Boolean operations using real world object image data 1501 may be performed. For example, weighted Boolean operations may be performed in which different weights or priorities are assigned to one or more real world object images depending on context of the scene or context of the device 1502, including other programs running on the device 1502, perhaps relative to one another. For example, a number-weighted, exclusive-OR operation may be performed to request specific weightings of categories of objects depending on recognized cues such as known user preference, e.g.

In this way ambiguity in the complexity of selection of objects within a scene may be resolved, for example by ad hoc recognition of a category of items in view of the AR device that is known to be of interest to the user. Such a recognition event by the system will greatly reduce the universe of objects in the scene from which to associate with an ambiguous request, such as a gesture to a region of the AR device's field of view. In some embodiments, the system may make decisions as to the exact nature of the user request in stages, for example by highlighting successively smaller sets of objects and prompting the user to select from among them at each stage. This may involve nested boundaries, for example, if a “Beatles” boundary is presented (as discussed in an example below, other, non-Beatles objects in the scene may be removed (or the Beatles may be highlighted), then after selection of Ringo Starr the other three Beatles could be removed, leaving Ringo at the requested item to be interacted with, signaling, perhaps, presentation of various menu options such as purchase music or movies, upload image or video data to a social network, or retrieve web information about Ringo Star.

In this way the system can distinguish location boundaries, such as pixel coordinates on an AR display, from semantic boundaries such as “Beatles.”

FIGS. 16-18 illustrate an example of a user interaction with an augmented reality system that does not include the ability to dynamically preserve elements in a scene as disclosed herein. FIG. 16 depicts an augmented reality device (a smartphone) showing on its display the Beatles walking across Abbey Road. If the user wants to buy some music by Ringo Starr, her favorite Beatle, knowing that the AR application she is using supports music purchases for any item the AR application recognizes, she will have to be fast in tapping or otherwise requesting interaction with Ringo's image to buy music.

FIG. 17 depicts a scenario in which the user has missed Ringo as he moves across the display; he is too difficult for the user to select. (Or if the user does manage to select her target, it may leave the screen before a desired action is taken, and context will be lost.). FIG. 18 depicts the same scene a few moments later when all of the Beatles have gone out of the field of view of the AR device, and off the screen of the device.

FIGS. 19-23 depict the same scenario of viewing the Beatles crossing Abbey Road but this time with the presently disclosed technology for dynamically preserving elements of a scene implemented on or through the AR device.

FIG. 19 depicts again a user trying to tap on Ringo as his image travels across the screen of an AR device.

FIG. 20 depicts a successful “tap” on Ringo as the system recognizes his image as an item of interest to the user, perhaps by both the tap as an indicator and by virtue of a previously expressed interest in Ringo Starr, known to the system, for example stored in environmental context evaluation module 1506, which can evaluate recognizable objects from the environment and match them to objects in a stored database of images. The successful tap and recognition by the system that it represents a user “request” for interaction with, in this case the person selected, may coincide with a vector physics analysis of Ringo Starr's real world movement relative to the field of view of the AR device. Such an analysis may be carried out by, e.g., object detection and tracking module 1511 and/or object vector, speed, acceleration, and trajectory processing module 1512. This analysis may be carried out in two or three dimensions, and it may factor in time as well, for example time until the object of interest is no longer within the field of view of the AR device, and therefore not available for interaction on the AR device.

Here, the augmented reality system 1522 may contain one or more thresholds for calculated time periods, for example, beneath which it will pause the requested item on the display. For example, a threshold for interacting with an element of a scene of 5 seconds may be programmed into the image presentation modification module 1513; if subsequent to the request, object vector, speed, acceleration, and trajectory processing module 1512 calculates that at Ringo's current rate of speed and current bearing, Ringo's image will leave the AR display in 1.5 seconds, then that may trigger video manipulation module 1514 to freeze or slow the video of Ringo's crossing on the display to allow the user to perform the desired interaction (because it we below the 5 second threshold lower limit). Similar thresholds may operate as to the size of the object on the display, for example an object that becomes very small, e.g., less than one square centimeter, may be deemed no longer viable for interaction and accordingly enlarged for interaction, e.g., by digital image generation module 1516 to create a larger, augmented reality representation 1517 of the object, perhaps with associated menus or command buttons to denote and facilitate the available interactions.

As shown in FIG. 20, the AR system may highlight the selected object as a way of confirming with the user that the correct request has registered with the system, and that the item of interest has been “paused” or otherwise augmented (e.g. frozen, slowed down, de-obscured, enlarged, or otherwise made more suitable for an interaction on the AR device).

As shown in FIG. 21, even though time continues to flow around Ringo (his bandmates have moved offscreen, cars continue to drive on the road, et cetera), Ringo has been frozen in place. This guarantees that the user is able to act on this particular item of interest in the scene; it remains viable for interaction for a longer period of time than it otherwise would have if the scene was presented “live.”

As shown in FIG. 22, the system now has time to identify Ringo to the user and to tag him with several commands or menu options that are available, including, for example, “Buy Music,” “Find Pictures,” or “post image or video to FaceBook.”

As shown in FIG. 23, optionally in some embodiments, when the user is no longer interested in Ringo, she releases him, and we see Ringo “fast-forward”—in this case, he rushes off-screen to catch up with his bandmates—after which the entire AR-scene is shown “live” again.

FIGS. 24-30 illustrate operational flows representing example operations related to dynamically preserving elements in augmented reality systems. In these figures that include various examples of operational flows, discussion and explanation may be provided with respect to the above-described system environment of FIG. 15, and/or with respect to other examples and contexts. However, it should be understood that the operational flows may be executed in a number of other environments and contexts and/or in modified versions of FIGS. 15-23. Also, although the various operational flows are presented in the sequence(s) illustrated, it should be understood that the various operations may be performed in other orders than those which are illustrated, or may be performed concurrently.

In one embodiment, augmented reality system 1522 may include circuitry for accepting a user request associated with at least one of an item, an aspect, or an element of a field of view of an augmented reality device; circuitry for determining that a first presentation of the at least one item, aspect, or element has a limited period of viability for user interaction relative to the field of view of the augmented reality device; and circuitry for at least one of maintaining the first presentation or providing a substantially similar second presentation in response to at least one output of the circuitry for determining that a first presentation of the at least one item, aspect, or element has a limited period of viability for interaction relative to the field of view of the augmented reality device.

Temporal Element Restoration

The present disclosure provides a system in which elements of a scene presented on an AR device may be modified and/or transformed in ways that preserve elements or aspects of interest to the user (or to the system) so that the user (or the system) can complete operations on such elements in cases where they would otherwise be inaccessible or unavailable.

The present disclosure includes two related but distinct subsystems that may be deployed together in typical embodiments: (1) methods and systems for making elements of interest that are no longer visible in the “live” scene available in, or accessible from, that scene, or a modified scene; and optionally (2) methods and systems for removing the modified or delayed scenes or presentations, and restoring live scene presentations when the modifications are no longer necessary.

Some aspects of the present disclosure include: (a) receiving a request associated with an item, aspect, or element not presented in the current scene, including (i) notifications pertaining to the item, aspect, or element; (ii) content or actions related to the item, aspect, or element; (ii) system state (e.g., of a launched or restored application, tool, or process) referring to, employing, and/or containing the item, aspect, or element; (b) producing (in response to the request) a representation associated with the item, aspect, or element, including one or more of (i) a substitution of a related item, aspect, or element (currently) present in the scene; (ii) a generation of a (scene-appropriate) proxy representation of the item, aspect, or element; (iii) a modification of the scene to include a (suitable) proxy representation of the item, aspect, or element; (c) processing the request and any subsequent related actions through subsequent interactions with the produced representation.

An alternative embodiment provides for (d) a process for acquiring (in response to the request) a suitable representation, including suggestions or hints to the user for operating his device or otherwise taking actions that will bring a suitable representation (including but not limited to the original item, aspect, or element) into the present scene.

Further embodiments include (e) terminating, dismissing, removing, or releasing the representation in response to completion of the request or aspects of the request, or in response to direction by the user. Finally, typical embodiments will provide (f) distinctive (optionally displayed) indications and affordances associated with produced (b) or acquired (d) items.

Another embodiment may include receiving a request associated with an item or aspect previously presented in an AR scene that is no longer presented in the present scene (e.g., special highlighting, etc.) and by further constraining the previous and present scenes to a single application, system, session, or timeframe.

In an example embodiment, the user might preform some task—such as switching to a new application, or responding to an e-mail or notification, or opening a document—that requires or references an item that is no longer visible in or accessible from the current “live” AR scene on his device. In these cases, the system may provide indications, in the present scene, of actions the user can take to “acquire” the item (e.g., where he should point his device), or the system may present the item as if it were a part of the current scene, temporarily inserting or overlaying it for the duration of the relevant operations, or the system may modify the scene so that the item appears to actually be a part of the scene, or the system may substitute a related item present in the scene as a proxy for the “missing” item.

For example, the user might use an AR street-scene application to reserve a taxi by interacting with a cab as it drives by, receiving a request for confirmation and additional details only after the taxi has driven out of range of scenes he can capture through his augmented reality device. In response to the subsequent receipt of a confirmation or other information regarding the cab, the presently disclosed system might then use another taxi, or a simulated taxi to represent the missing cab and to serve as a target for the subsequent interaction related to the transaction initiated with the missing cab.

In a more extreme and somewhat whimsical example that illustrates the point, the user might have entered a toy store when the request to continue the transaction arises, in which case the proxy for the missing cab might be a toy taxi in the store, or a synthesized item rendered as a toy taxi on the shelf of the store.

Alternatively, if a taxi (perhaps even the originally-designated cab) is nearby or can easily be acquired by the user's device, the system might show an indication of the direction in which the user needs to point his device to reacquire the target cab.

In some embodiments, an object no longer present that relates to a request to the user for interaction may be presented on the augmented reality device in a different way than the original object appeared, though perhaps related. For example, in the taxi example, if a request to confirm a taxi reservation is received by a user inside a building, an augmented reality representation of the taxi cab may not be the cab itself, but rather a computer generated image of a cab driver with the cab company logo on his clothing. In this way the request may be presented to the user in a way that is appropriate to the context of the user and the device. The device may automatically present an appropriate representation based on detected context, for example via environmental context evaluation module 3106. Rulesets such as “no cars inside buildings, instead present a person” may be used to facilitate this function.

Optionally, one embodiment inludes systems and methods for removing modified or delayed scenes or presentations, and reverting to a live presentation of the field of view of the augmented reality device when the modifications are no longer necessary. These embodiments include situations in which the display of a scene or aspects of a scene have been modified with respect to an initial scene presentation specification. In particular, one embodiment includes a determination that certain presentations or aspects of a presentation are being modified, and a “release” of those modifications in response to demands of task, context, or user input.

In another example of context-specific presentation of a request, a user reading a newspaper through augmented reality glasses may be presented with a request for confirmation of the taxi cab reservation via a virtual post-it note containing the cab company's logo situated on the newspaper, as an augmented reality representation of the taxi cab that was earlier selected to make the reservation.

Additional aspects of the present disclosure include tools for building applications and systems supporting the features above, including platform elements, APIs, and class frameworks that provide the relevant functionality.

FIG. 31 illustrates an example augmented reality system 3122 in which embodiments may be implemented. The system 3122 may work through an augmented reality device 3102 for use by user 3100. Augmented reality system 3122 may be implemented on augmented reality device 3102, or it may be implemented remotely via a network 3104, in whole or in part, for example as a cloud service through network 3104 to communicate with augmented reality device 3102. Augmented reality device 3102 will have a visual field of view 200 that will include real world object image data 3101 and real world object motion data 3103.

The augmented reality system 3122 may contain, for example, environmental context evaluation module 3106, augmented reality device context evaluation module 3108, request detection module 3110, object detection and tracking module 3111, object vector, speed, acceleration, and trajectory tracking module 3112, image presentation modification module 3113, video manipulation module 3114, image database 3115, digital image generation module 3116, augmented reality representation 3117, device field of view tracking module 3118, menu presentation module 3119, presentation restoration module 3120, and/or request processing module 3121.

Augmented reality system 3122 running on or through augmented reality device 3102 may communicate over a network 3104 wirelessly or by hardwired connection. By way of network 3104, which may include a cloud computing component, augmented reality system 3122 may communicate to effectuate transactions or other interactions with network payment system 3124, including credit account 3126, google wallet 3128, and/or PayPal 3130. Augmented reality system 3122 may also communicate to effectuate transactions or other interactions via network 3104 with a retailer 3132, such as Taxi company 3134 or an online retailer, such as Amazon.com 3135 or iTunes 3140. Augmented reality system 3122 may also communicate to effectuate transactions or other interactions via network 3104 with online data service 3136 such as Facebook 3138, iTunes 3140, and/or Google Play app store 3142.

In this way, a user may interact with digital representations of her environment in order to, inter alia, track orders of physical goods or services, complete transactions, or conduct lengthy or discontinuous communications.

As referenced herein, the augmented reality system 3122 may be used to perform various data querying and/or recall techniques with respect to real world objects and/or augmented reality representations of real world objects. For example, where real world object image data is organized, keyed to, and/or otherwise accessible using one or more image databases, augmented reality system 3122 may employ various Boolean, statistical, and/or semi-boolean searching techniques to select a correct real world object image among a set of images in a real world scene, e.g., by request detection module 3110, and also to provide an augmented reality representation 3117 of the object, either by finding one in, e.g., image database 3115, or by generating one, e.g., via digital image generation module 3116.

Many examples of databases and database structures may be used in connection with augmented reality system 3122. Such examples include hierarchical models (in which data is organized in a tree and/or parent-child node structure), network models (based on set theory, and in which multi-parent structures per child node are supported), or object/relational models (combining the relational model with the object-oriented model).

Still other examples include various types of eXtensible Mark-up Language (XML) databases. For example, a database may be included that holds data in some format other than XML, but that is associated with an XML interface for accessing the database using XML. As another example, a database may store XML data directly. Additionally, or alternatively, virtually any semi-structured database may be used, so that context may be provided to/associated with stored data elements (either encoded with the data elements, or encoded externally to the data elements), so that data storage and/or access may be facilitated.

Such databases, and/or other memory storage techniques, may be written and/or implemented using various programming or coding languages. For example, object-oriented database management systems may be written in programming languages such as, for example, C++ or Java. Relational and/or object/relational models may make use of database languages, such as, for example, the structured query language (SQL), which may be used, for example, for interactive queries for disambiguating information and/or for gathering and/or compiling data from the relational database(s).

For example, SQL or SQL-like operations over one or more real world object image data may be performed, or Boolean operations using real world object image data 3101 may be performed. For example, weighted Boolean operations may be performed in which different weights or priorities are assigned to one or more real world object images depending on context of the scene or context of the device 3102, including other programs running on the device 3102, perhaps relative to one another. For example, a number-weighted, exclusive-OR operation may be performed to request specific weightings of categories of objects depending on recognized cues such as known user preference, specific rulesets in place to define relationships between objects or object types and supported communicatinons and/or transactions that can be initiated, mediated, and/or completed by the augmented reality system 3122 and/or augmented reality device 3102.

In this way, a call-and-response kind of interaction can be carried out wherein a call by a user (e.g., a request for a taxi cab reservation manifested as a tap (or, e.g., a detected dwell by an eyetracking device) on a cab in a field of view of an AR device) can be processed and a response brought back to the user at a later time on the AR device through a “request” to the system to engage the user via display of an augmented reality representation of the original taxi cab or an aspect of it, wherein the actual taxi cab is no longer in the field of view of the AR device.

In this way the system can track interactions that involve lengthy or discontinuous communications such as delivery tracking; order fulfillment; simple email, text, or voice messaging; or scheduling.

FIGS. 32-39 illustrate an example of a user interaction with an augmented reality device and system that implements temporal element restoration as disclosed herein. FIG. 32 depicts an augmented reality device (a smartphone) showing on its display a scene including a taxi cab. FIG. 33 depicts a user's finger tapping the image of the taxi cab on the display of the augmented reality device 3102. FIG. 34 depicts two command options that the system has placed on the AR display in response to the selection of the taxi cab: “Speedy Taxi Co.” and “Reserve Taxi” are displayed near the cab as graphical buttons on the display. FIG. 35 depicts the user's finger tapping the “Reserve Taxi” button to reserve the taxi.

FIG. 36 depicts a user observing a street through the AR device at a time when the taxi is no longer present in the scene. The taxi company now wants to confirm the reservation the user made earlier, but there are no taxis on the street to provide the user with context for the confirmation. FIG. 37 depicts a way in which the augmented reality system 3122 may provide the user with appropriate context (and to offer the user the appropriate affordances for interaction to complete the request for confirmation from the Taxi company), wherein the AR-enabled device and/or the AR system generates an augmented reality representation 3117 of the taxi or a “virtual taxi” to place within the AR scene. This provides a context for the user to grasp in addressing the request for confirmation of the reservation from the Taxi company.

A request may be detected by request detection module 3110, and digital image generation module may automatically present based on the detected request, e.g., either an image of the object relevant to the request, a modified image of the object, a computer generated version of the object, or a different object that is related to the object.

FIG. 38 depicts the user's finger tapping on the virtual taxi to select it, and FIG. 39 depicts command options that have been placed by the system near the virtual taxi as items that the user can activate, e.g., by tapping to confirm the reservation.

Optionally, as depicted in FIG. 40, once confirmation is made, the system may remove the “virtual taxi” from the AR scene, e.g., thereby restoring a “live” view of the scene.

FIGS. 41-47 illustrate operational flows representing example operations related to temporal element restoration in augmented reality systems. In these figures that include various examples of operational flows, discussion and explanation may be provided with respect to the above-described system environment of FIG. 31, and/or with respect to other examples and contexts. However, it should be understood that the operational flows may be executed in a number of other environments and contexts and/or in modified versions of FIGS. 31-40. Also, although the various operational flows are presented in the sequence(s) illustrated, it should be understood that the various operations may be performed in other orders than those which are illustrated, or may be performed concurrently.

Operational/Functional Language Herein Describes Machines/Machine Control/Machine-Controlled Processes Unless Context Dictates Otherwise

The claims, description, and drawings of this application may describe one or more of the instant technologies in operational/functional language, for example as a set of operations to be performed by a computer. Such operational/functional description in most instances would be understood by one skilled in the art as specifically configured hardware (e.g., because a general purpose computer in effect becomes a special purpose computer once it is programmed to perform particular functions pursuant to instructions from program software).

Importantly, although the operational/functional descriptions described herein are understandable by the human mind, they are not abstract ideas of the operations/functions divorced from computational implementation of those operations/functions. Rather, the operations/functions represent a specification for massively complex computational machines or other means. As discussed in detail below, the operational/functional language must be read in its proper technological context, i.e., as concrete specifications for physical implementations. The logical operations/functions described herein are a distillation of machine specifications or other physical mechanisms specified by the operations/functions such that the otherwise inscrutable machine specifications may be comprehensible to the human reader. The distillation also allows one of skill in the art to adapt the operational/functional description of the technology across many different specific vendors' hardware configurations or platforms, without being limited to specific vendors' hardware configurations or platforms.

Some of the present technical description (e.g., detailed description, drawings, claims, etc.) may be set forth in terms of logical operations/functions. As described in more detail herein, these logical operations/functions are not representations of abstract ideas, but rather are representative of static or sequenced specifications of various hardware elements. Differently stated, unless context dictates otherwise, the logical operations/functions will be understood by those of skill in the art to be representative of static or sequenced specifications of various hardware elements. This is true because tools available to one of skill in the art to implement technical disclosures set forth in operational/functional formats—tools in the form of a high level programming language (e.g., C, java, visual basic, etc.), or tools in the form of Very High speed Hardware Description Language (“VHDL,” which is a language that uses text to describe logic circuits)—are generators of static or sequenced specifications of various hardware configurations. This fact is sometimes obscured by the broad term “software,” but, as shown by the following explanation, those skilled in the art understand that what is termed “software” is a shorthand for a massively complex interchaining/specification of ordered matter elements. The term “ordered matter elements” may refer to physical components of computation, such as assemblies of electronic logic gates, molecular computing logic constituents, quantum computing mechanisms, etc. For example, a high level programming language is a programming language with strong abstraction, e.g., multiple levels of abstraction, from the details of the sequential organizations, states, inputs, outputs, etc., of the machines that a highlevel programming language actually specifies. See, e.g., Wikipedia, High-level programming language, http://en.wikipedia.org/wiki/Highlevel_programming_language (as of Jun. 5, 2012, 21:00 GMT). In order to facilitate human comprehension, in many instances, high level programming languages resemble or even share symbols with natural languages. See, e.g., Wikipedia, Natural language, http://en.wikipedia.org/wiki/Natural_language (as of Jun. 5, 2012, 21:00 GMT).

It has been argued that because high level programming languages use strong abstraction (e.g., that they may resemble or share symbols with natural languages), they are therefore a “purely mental construct” (e.g., that “software”—a computer program or computer programming—is somehow an ineffable mental construct, because at a high level of abstraction, it can be conceived and understood by a human reader). This argument has been used to characterize technical description in the form of functions/operations as somehow “abstract ideas.” In fact, in technological arts (e.g., the information and communication technologies) this is not true.

The fact that high-level programming languages use strong abstraction to facilitate human understanding should not be taken as an indication that what is expressed is an abstract idea. In fact, those skilled in the art understand that just the opposite is true. If a high-level programming language is the tool used to implement a technical disclosure in the form of functions/operations, those skilled in the art will recognize that, far from being abstract, imprecise, “fuzzy,” or “mental” in any significant semantic sense, such a tool is instead a near incomprehensibly precise sequential specification of specific computational machines—the parts of which are built up by activating/selecting such parts from typically more general computational machines over time (e.g., clocked time). This fact is sometimes obscured by the superficial similarities between high-level programming languages and natural languages. These superficial similarities also may cause a glossing over of the fact that high level programming language implementations ultimately perform valuable work by creating/controlling many different computational machines.

The many different computational machines that a high-level programming language specifies are almost unimaginably complex. At base, the hardware used in the computational machines typically consists of some type of ordered matter (e.g., traditional electronic devices (e.g., transistors), deoxyribonucleic acid (DNA), quantum devices, mechanical switches, optics, fluidics, pneumatics, optical devices (e.g., optical interference devices), molecules, etc.) that are arranged to form logic gates. Logic gates are typically physical devices that may be electrically, mechanically, chemically, or otherwise driven to change physical state in order to create a physical reality of logic, such as Boolean logic.

Logic gates may be arranged to form logic circuits, which are typically physical devices that may be electrically, mechanically, chemically, or otherwise driven to create a physical reality of certain logical functions. Types of logic circuits include such devices as multiplexers, registers, arithmetic logic units (ALUs), computer memory, etc., each type of which may be combined to form yet other types of physical devices, such as a central processing unit (CPU)—the best known of which is the microprocessor. A modern microprocessor will often contain more than one hundred million logic gates in its many logic circuits (and often more than a billion transistors). See, e.g., Wikipedia, Logic gates, http://en.wikipedia.org/wiki/Logic_gates (as of Jun. 5, 2012, 21:03 GMT).

The logic circuits forming the microprocessor are arranged to provide a microarchitecture that will carry out the instructions defined by that microprocessor's defined Instruction Set Architecture. The Instruction Set Architecture is the part of the microprocessor architecture related to programming, including the native data types, instructions, registers, addressing modes, memory architecture, interrupt and exception handling, and external Input/Output. See, e.g., Wikipedia, Computer architecture, http://en.wikipedia.org/wiki/Computer_architecture (as of Jun. 5, 2012, 21:03 GMT).

The Instruction Set Architecture includes a specification of the machine language that can be used by programmers to use/control the microprocessor. Since the machine language instructions are such that they may be executed directly by the microprocessor, typically they consist of strings of binary digits, or bits. For example, a typical machine language instruction might be many bits long (e.g., 32, 64, or 128 bit strings are currently common). A typical machine language instruction might take the form “11110000101011110000111100111111” (a 32 bit instruction). It is significant here that, although the machine language instructions are written as sequences of binary digits, in actuality those binary digits specify physical reality. For example, if certain semiconductors are used to make the operations of Boolean logic a physical reality, the apparently mathematical bits “1” and “0” in a machine language instruction actually constitute a shorthand that specifies the application of specific voltages to specific wires. For example, in some semiconductor technologies, the binary number “1” (e.g., logical “1”) in a machine language instruction specifies around +5 volts applied to a specific “wire” (e.g., metallic traces on a printed circuit board) and the binary number “0” (e.g., logical “0”) in a machine language instruction specifies around −5 volts applied to a specific “wire.” In addition to specifying voltages of the machines' configurations, such machine language instructions also select out and activate specific groupings of logic gates from the millions of logic gates of the more general machine. Thus, far from abstract mathematical expressions, machine language instruction programs, even though written as a string of zeros and ones, specify many, many constructed physical machines or physical machine states.

Machine language is typically incomprehensible by most humans (e.g., the above example was just ONE instruction, and some personal computers execute more than two billion instructions every second). See, e.g., Wikipedia, Instructions per second, http://en.wikipedia.org/wiki/Instructions_per_second (as of Jun. 5, 2012, 21:04 GMT). Thus, programs written in machine language—which may be tens of millions of machine language instructions long—are incomprehensible to most humans. In view of this, early assembly languages were developed that used mnemonic codes to refer to machine language instructions, rather than using the machine language instructions' numeric values directly (e.g., for performing a multiplication operation, programmers coded the abbreviation “mult,” which represents the binary number “011000” in MIPS machine code). While assembly languages were initially a great aid to humans controlling the microprocessors to perform work, in time the complexity of the work that needed to be done by the humans outstripped the ability of humans to control the microprocessors using merely assembly languages.

At this point, it was noted that the same tasks needed to be done over and over, and the machine language necessary to do those repetitive tasks was the same. In view of this, compilers were created. A compiler is a device that takes a statement that is more comprehensible to a human than either machine or assembly language, such as “add 2+2 and output the result,” and translates that human understandable statement into a complicated, tedious, and immense machine language code (e.g., millions of 32, 64, or 128 bit length strings). Compilers thus translate high-level programming language into machine language. This compiled machine language, as described above, is then used as the technical specification which sequentially constructs and causes the interoperation of many different computational machines such that useful, tangible, and concrete work is done. For example, as indicated above, such machine language—the compiled version of the higher-level language—functions as a technical specification which selects out hardware logic gates, specifies voltage levels, voltage transition timings, etc., such that the useful work is accomplished by the hardware.

Thus, a functional/operational technical description, when viewed by one of skill in the art, is far from an abstract idea. Rather, such a functional/operational technical description, when understood through the tools available in the art such as those just described, is instead understood to be a humanly understandable representation of a hardware specification, the complexity and specificity of which far exceeds the comprehension of most any one human. With this in mind, those skilled in the art will understand that any such operational/functional technical descriptions—in view of the disclosures herein and the knowledge of those skilled in the art—may be understood as operations made into physical reality by (a) one or more interchained physical machines, (b) interchained logic gates configured to create one or more physical machine(s) representative of sequential/combinatorial logic(s), (c) interchained ordered matter making up logic gates (e.g., interchained electronic devices (e.g., transistors), DNA, quantum devices, mechanical switches, optics, fluidics, pneumatics, molecules, etc.) that create physical reality of logic(s), or (d) virtually any combination of the foregoing. Indeed, any physical object which has a stable, measurable, and changeable state may be used to construct a machine based on the above technical description. Charles Babbage, for example, constructed the first mechanized computational apparatus out of wood with the mechanism powered by cranking a handle.

Thus, far from being understood as an abstract idea, those skilled in the art will recognize a functional/operational technical description as a humanlyunderstandable representation of one or more almost unimaginably complex and time sequenced hardware instantiations. The fact that functional/operational technical descriptions might lend themselves readily to high-level computing languages (or high-level block diagrams for that matter) that share some words, structures, phrases, etc. with natural language should not be taken as an indication that such functional/operational technical descriptions are abstract ideas, or mere expressions of abstract ideas. In fact, as outlined herein, in the technological arts this is simply not true. When viewed through the tools available to those of skill in the art, such functional/operational technical descriptions are seen as specifying hardware configurations of almost unimaginable complexity.

As outlined above, the reason for the use of functional/operational technical descriptions is at least twofold. First, the use of functional/operational technical descriptions allows near-infinitely complex machines and machine operations arising from interchained hardware elements to be described in a manner that the human mind can process (e.g., by mimicking natural language and logical narrative flow). Second, the use of functional/operational technical descriptions assists the person of skill in the art in understanding the described subject matter by providing a description that is more or less independent of any specific vendor's piece(s) of hardware.

The use of functional/operational technical descriptions assists the person of skill in the art in understanding the described subject matter since, as is evident from the above discussion, one could easily, although not quickly, transcribe the technical descriptions set forth in this document as trillions of ones and zeroes, billions of single lines of assembly-level machine code, millions of logic gates, thousands of gate arrays, or any number of intermediate levels of abstractions. However, if any such low-level technical descriptions were to replace the present technical description, a person of skill in the art could encounter undue difficulty in implementing the disclosure, because such a low-level technical description would likely add complexity without a corresponding benefit (e.g., by describing the subject matter utilizing the conventions of one or more vendor-specific pieces of hardware).

Thus, the use of functional/operational technical descriptions assists those of skill in the art by separating the technical descriptions from the conventions of any vendor-specific piece of hardware.

In view of the foregoing, the logical operations/functions set forth in the present technical description are representative of static or sequenced specifications of various ordered-matter elements, in order that such specifications may be comprehensible to the human mind and adaptable to create many various hardware configurations. The logical operations/functions disclosed herein should be treated as such, and should not be disparagingly characterized as abstract ideas merely because the specifications they represent are presented in a manner that one of skill in the art can readily understand and apply in a manner independent of a specific vendor's hardware implementation.

One skilled in the art will recognize that the herein described components (e.g., operations), devices, objects, and the discussion accompanying them are used as examples for the sake of conceptual clarity and that various configuration modifications are contemplated. Consequently, as used herein, the specific exemplars set forth and the accompanying discussion are intended to be representative of their more general classes. In general, use of any specific exemplar is intended to be representative of its class, and the non-inclusion of specific components (e.g., operations), devices, and objects should not be taken as limiting.

Although a user may be shown/described herein as a single illustrated figure, those skilled in the art will appreciate that any user may be representative of a human user, a robotic user (e.g., computational entity), and/or substantially any combination thereof (e.g., a user may be assisted by one or more robotic agents) unless context dictates otherwise. Those skilled in the art will appreciate that, in general, the same may be said of “sender” and/or other entity-oriented terms as such terms are used herein unless context dictates otherwise.

Those skilled in the art will appreciate that the foregoing specific exemplary processes and/or devices and/or technologies are representative of more general processes and/or devices and/or technologies taught elsewhere herein, such as in the claims filed herewith and/or elsewhere in the present application.

Those having skill in the art will recognize that the state of the art has progressed to the point where there is little distinction left between hardware and software implementations of aspects of systems; the use of hardware or software is generally (but not always, in that in certain contexts the choice between hardware and software can become significant) a design choice representing cost vs. efficiency tradeoffs. Those having skill in the art will appreciate that there are various vehicles by which processes and/or systems and/or other technologies described herein can be effected (e.g., hardware, software, and/or firmware), and that the preferred vehicle will vary with the context in which the processes and/or systems and/or other technologies are deployed. For example, if an implementer determines that speed and accuracy are paramount, the implementer may opt for a mainly hardware and/or firmware vehicle; alternatively, if flexibility is paramount, the implementer may opt for a mainly software implementation; or, yet again alternatively, the implementer may opt for some combination of hardware, software, and/or firmware. Hence, there are several possible vehicles by which the processes and/or devices and/or other technologies described herein may be effected, none of which is inherently superior to the other in that any vehicle to be utilized is a choice dependent upon the context in which the vehicle will be deployed and the specific concerns (e.g., speed, flexibility, or predictability) of the implementer, any of which may vary. Those skilled in the art will recognize that optical aspects of implementations will typically employ optically-oriented hardware, software, and or firmware.

In some implementations described herein, logic and similar implementations may include software or other control structures. Electronic circuitry, for example, may have one or more paths of electrical current constructed and arranged to implement various functions as described herein. In some implementations, one or more media may be configured to bear a device-detectable implementation when such media hold or transmit a device detectable instructions operable to perform as described herein. In some variants, for example, implementations may include an update or modification of existing software or firmware, or of gate arrays or programmable hardware, such as by performing a reception of or a transmission of one or more instructions in relation to one or more operations described herein. Alternatively or additionally, in some variants, an implementation may include special-purpose hardware, software, firmware components, and/or general-purpose components executing or otherwise invoking special-purpose components. Specifications or other implementations may be transmitted by one or more instances of tangible transmission media as described herein, optionally by packet transmission or otherwise by passing through distributed media at various times.

Alternatively or additionally, implementations may include executing a special-purpose instruction sequence or invoking circuitry for enabling, triggering, coordinating, requesting, or otherwise causing one or more occurrences of virtually any functional operations described herein. In some variants, operational or other logical descriptions herein may be expressed as source code and compiled or otherwise invoked as an executable instruction sequence. In some contexts, for example, implementations may be provided, in whole or in part, by source code, such as C++, or other code sequences. In other implementations, source or other code implementation, using commercially available and/or techniques in the art, may be compiled//implemented/translated/converted into a high-level descriptor language (e.g., initially implementing described technologies in C or C++ programming language and thereafter converting the programming language implementation into a logic-synthesizable language implementation, a hardware description language implementation, a hardware design simulation implementation, and/or other such similar mode(s) of expression). For example, some or all of a logical expression (e.g., computer programming language implementation) may be manifested as a Verilog-type hardware description (e.g., via Hardware Description Language (HDL) and/or Very High Speed Integrated Circuit Hardware Descriptor Language (VHDL)) or other circuitry model which may then be used to create a physical implementation having hardware (e.g., an Application Specific Integrated Circuit). Those skilled in the art will recognize how to obtain, configure, and optimize suitable transmission or computational elements, material supplies, actuators, or other structures in light of these teachings.

The foregoing detailed description has set forth various embodiments of the devices and/or processes via the use of block diagrams, flowcharts, and/or examples. Insofar as such block diagrams, flowcharts, and/or examples contain one or more functions and/or operations, it will be understood by those within the art that each function and/or operation within such block diagrams, flowcharts, or examples can be implemented, individually and/or collectively, by a wide range of hardware, software, firmware, or virtually any combination thereof. In one embodiment, several portions of the subject matter described herein may be implemented via Application Specific Integrated Circuits (ASICs), Field Programmable Gate Arrays (FPGAs), digital signal processors (DSPs), or other integrated formats. However, those skilled in the art will recognize that some aspects of the embodiments disclosed herein, in whole or in part, can be equivalently implemented in integrated circuits, as one or more computer programs running on one or more computers (e.g., as one or more programs running on one or more computer systems), as one or more programs running on one or more processors (e.g., as one or more programs running on one or more microprocessors), as firmware, or as virtually any combination thereof, and that designing the circuitry and/or writing the code for the software and or firmware would be well within the skill of one of skill in the art in light of this disclosure. In addition, those skilled in the art will appreciate that the mechanisms of the subject matter described herein are capable of being distributed as a program product in a variety of forms, and that an illustrative embodiment of the subject matter described herein applies regardless of the particular type of signal bearing medium used to actually carry out the distribution. Examples of a signal bearing medium include, but are not limited to, the following: a recordable type medium such as a floppy disk, a hard disk drive, a Compact Disc (CD), a Digital Video Disk (DVD), a digital tape, a computer memory, etc.; and a transmission type medium such as a digital and/or an analog communication medium (e.g., a fiber optic cable, a waveguide, a wired communications link, a wireless communication link (e.g., transmitter, receiver, transmission logic, reception logic, etc.), etc.).

In a general sense, those skilled in the art will recognize that the various aspects described herein which can be implemented, individually and/or collectively, by a wide range of hardware, software, firmware, and/or any combination thereof can be viewed as being composed of various types of “electrical circuitry.” Consequently, as used herein “electrical circuitry” includes, but is not limited to, electrical circuitry having at least one discrete electrical circuit, electrical circuitry having at least one integrated circuit, electrical circuitry having at least one application specific integrated circuit, electrical circuitry forming a general purpose computing device configured by a computer program (e.g., a general purpose computer configured by a computer program which at least partially carries out processes and/or devices described herein, or a microprocessor configured by a computer program which at least partially carries out processes and/or devices described herein), electrical circuitry forming a memory device (e.g., forms of memory (e.g., random access, flash, read only, etc.)), and/or electrical circuitry forming a communications device (e.g., a modem, communications switch, optical-electrical equipment, etc.). Those having skill in the art will recognize that the subject matter described herein may be implemented in an analog or digital fashion or some combination thereof.

Those skilled in the art will recognize that at least a portion of the devices and/or processes described herein can be integrated into a data processing system. Those having skill in the art will recognize that a data processing system generally includes one or more of a system unit housing, a video display device, memory such as volatile or non-volatile memory, processors such as microprocessors or digital signal processors, computational entities such as operating systems, drivers, graphical user interfaces, and applications programs, one or more interaction devices (e.g., a touch pad, a touch screen, an antenna, etc.), and/or control systems including feedback loops and control motors (e.g., feedback for sensing position and/or velocity; control motors for moving and/or adjusting components and/or quantities). A data processing system may be implemented utilizing suitable commercially available components, such as those typically found in data computing/communication and/or network computing/communication systems.

Those skilled in the art will recognize that it is common within the art to implement devices and/or processes and/or systems, and thereafter use engineering and/or other practices to integrate such implemented devices and/or processes and/or systems into more comprehensive devices and/or processes and/or systems. That is, at least a portion of the devices and/or processes and/or systems described herein can be integrated into other devices and/or processes and/or systems via a reasonable amount of experimentation. Those having skill in the art will recognize that examples of such other devices and/or processes and/or systems might include—as appropriate to context and application—all or part of devices and/or processes and/or systems of (a) an air conveyance (e.g., an airplane, rocket, helicopter, etc.), (b) a ground conveyance (e.g., a car, truck, locomotive, tank, armored personnel carrier, etc.), (c) a building (e.g., a home, warehouse, office, etc.), (d) an appliance (e.g., a refrigerator, a washing machine, a dryer, etc.), (e) a communications system (e.g., a networked system, a telephone system, a Voice over IP system, etc.), (f) a business entity (e.g., an Internet Service Provider (ISP) entity such as Comcast Cable, Century Link, Southwestern Bell, etc.), or (g) a wired/wireless services entity (e.g., Sprint, Verizon, AT&T, etc.), etc.

The claims, description, and drawings of this application may describe one or more of the instant technologies in operational/functional language, for example as a set of operations to be performed by a computer. Such operational/functional description in most instances would be understood by one skilled the art as specifically-configured hardware (e.g., because a general purpose computer in effect becomes a special purpose computer once it is programmed to perform particular functions pursuant to instructions from program software).

Importantly, although the operational/functional descriptions described herein are understandable by the human mind, they are not abstract ideas of the operations/functions divorced from computational implementation of those operations/functions. Rather, the operations/functions represent a specification for the massively complex computational machines or other means. As discussed in detail below, the operational/functional language must be read in its proper technological context, i.e., as concrete specifications for physical implementations.

The logical operations/functions described herein are a distillation of machine specifications or other physical mechanisms specified by the operations/functions such that the otherwise inscrutable machine specifications may be comprehensible to the human mind. The distillation also allows one of skill in the art to adapt the operational/functional description of the technology across many different specific vendors' hardware configurations or platforms, without being limited to specific vendors' hardware configurations or platforms.

Some of the present technical description (e.g., detailed description, drawings, claims, etc.) may be set forth in terms of logical operations/functions. As described in more detail in the following paragraphs, these logical operations/functions are not representations of abstract ideas, but rather representative of static or sequenced specifications of various hardware elements. Differently stated, unless context dictates otherwise, the logical operations/functions will be understood by those of skill in the art to be representative of static or sequenced specifications of various hardware elements. This is true because tools available to one of skill in the art to implement technical disclosures set forth in operational/functional formats—tools in the form of a high-level programming language (e.g., C, java, visual basic), etc.), or tools in the form of Very high speed Hardware Description Language (“VHDL,” which is a language that uses text to describe logic circuits)—are generators of static or sequenced specifications of various hardware configurations. This fact is sometimes obscured by the broad term “software,” but, as shown by the following explanation, those skilled in the art understand that what is termed “software” is a shorthand for a massively complex interchaining/specification of ordered-matter elements. The term “ordered-matter elements” may refer to physical components of computation, such as assemblies of electronic logic gates, molecular computing logic constituents, quantum computing mechanisms, etc.

For example, a high-level programming language is a programming language with strong abstraction, e.g., multiple levels of abstraction, from the details of the sequential organizations, states, inputs, outputs, etc., of the machines that a high-level programming language actually specifies. See, e.g., Wikipedia, High-level programming language, http://en.wikipedia.org/wiki/High-level_programming_language (as of Jun. 5, 2012, 21:00 GMT) (URL included merely to provide written description). In order to facilitate human comprehension, in many instances, high-level programming languages resemble or even share symbols with natural languages. See, e.g., Wikipedia, Natural language, http://en.wikipedia.org/wiki/Natural_language (as of Jun. 5, 2012, 21:00 GMT) (URL included merely to provide written description).

It has been argued that because high-level programming languages use strong abstraction (e.g., that they may resemble or share symbols with natural languages), they are therefore a “purely mental construct.” (e.g., that “software”—a computer program or computer programming—is somehow an ineffable mental construct, because at a high level of abstraction, it can be conceived and understood in the human mind). This argument has been used to characterize technical description in the form of functions/operations as somehow “abstract ideas.” In fact, in technological arts (e.g., the information and communication technologies) this is not true.

The fact that high-level programming languages use strong abstraction to facilitate human understanding should not be taken as an indication that what is expressed is an abstract idea. In fact, those skilled in the art understand that just the opposite is true. If a high-level programming language is the tool used to implement a technical disclosure in the form of functions/operations, those skilled in the art will recognize that, far from being abstract, imprecise, “fuzzy,” or “mental” in any significant semantic sense, such a tool is instead a near incomprehensibly precise sequential specification of specific computational machines—the parts of which are built up by activating/selecting such parts from typically more general computational machines over time (e.g., clocked time). This fact is sometimes obscured by the superficial similarities between high-level programming languages and natural languages. These superficial similarities also may cause a glossing over of the fact that high-level programming language implementations ultimately perform valuable work by creating/controlling many different computational machines.

The many different computational machines that a high-level programming language specifies are almost unimaginably complex. At base, the hardware used in the computational machines typically consists of some type of ordered matter (e.g., traditional electronic devices (e.g., transistors), deoxyribonucleic acid (DNA), quantum devices, mechanical switches, optics, fluidics, pneumatics, optical devices (e.g., optical interference devices), molecules, etc.) that are arranged to form logic gates. Logic gates are typically physical devices that may be electrically, mechanically, chemically, or otherwise driven to change physical state in order to create a physical reality of Boolean logic.

Logic gates may be arranged to form logic circuits, which are typically physical devices that may be electrically, mechanically, chemically, or otherwise driven to create a physical reality of certain logical functions. Types of logic circuits include such devices as multiplexers, registers, arithmetic logic units (ALUs), computer memory, etc., each type of which may be combined to form yet other types of physical devices, such as a central processing unit (CPU)—the best known of which is the microprocessor. A modern microprocessor will often contain more than one hundred million logic gates in its many logic circuits (and often more than a billion transistors). See, e.g., Wikipedia, Logic gates, http://en.wikipedia.org/wiki/Logic_gates (as of Jun. 5, 2012, 21:03 GMT) (URL included merely to provide written description).

The logic circuits forming the microprocessor are arranged to provide a microarchitecture that will carry out the instructions defined by that microprocessor's defined Instruction Set Architecture. The Instruction Set Architecture is the part of the microprocessor architecture related to programming, including the native data types, instructions, registers, addressing modes, memory architecture, interrupt and exception handling, and external Input/Output. See, e.g., Wikipedia, Computer architecture, http://en.wikipedia.org/wiki/Computer_architecture (as of Jun. 5, 2012, 21:03 GMT) (URL included merely to provide written description).

The Instruction Set Architecture includes a specification of the machine language that can be used by programmers to use/control the microprocessor. Since the machine language instructions are such that they may be executed directly by the microprocessor, typically they consist of strings of binary digits, or bits. For example, a typical machine language instruction might be many bits long (e.g., 32, 64, or 128 bit strings are currently common). A typical machine language instruction might take the form “11110000101011110000111100111111” (a 32 bit instruction).

It is significant here that, although the machine language instructions are written as sequences of binary digits, in actuality those binary digits specify physical reality. For example, if certain semiconductors are used to make the operations of Boolean logic a physical reality, the apparently mathematical bits “1” and “0” in a machine language instruction actually constitute a shorthand that specifies the application of specific voltages to specific wires. For example, in some semiconductor technologies, the binary number “1” (e.g., logical “1”) in a machine language instruction specifies around +5 volts applied to a specific “wire” (e.g., metallic traces on a printed circuit board) and the binary number “0” (e.g., logical “0”) in a machine language instruction specifies around −5 volts applied to a specific “wire.” In addition to specifying voltages of the machines' configuration, such machine language instructions also select out and activate specific groupings of logic gates from the millions of logic gates of the more general machine. Thus, far from abstract mathematical expressions, machine language instruction programs, even though written as a string of zeros and ones, specify many, many constructed physical machines or physical machine states.

Machine language is typically incomprehensible by most humans (e.g., the above example was just ONE instruction, and some personal computers execute more than two billion instructions every second). See, e.g., Wikipedia, Instructions per second, http://en.wikipedia.org/wiki/Instructions_per_second (as of Jun. 5, 2012, 21:04 GMT) (URL included merely to provide written description).

Thus, programs written in machine language—which may be tens of millions of machine language instructions long—are incomprehensible. In view of this, early assembly languages were developed that used mnemonic codes to refer to machine language instructions, rather than using the machine language instructions' numeric values directly (e.g., for performing a multiplication operation, programmers coded the abbreviation “mult,” which represents the binary number “011000” in MIPS machine code). While assembly languages were initially a great aid to humans controlling the microprocessors to perform work, in time the complexity of the work that needed to be done by the humans outstripped the ability of humans to control the microprocessors using merely assembly languages.

At this point, it was noted that the same tasks needed to be done over and over, and the machine language necessary to do those repetitive tasks was the same. In view of this, compilers were created. A compiler is a device that takes a statement that is more comprehensible to a human than either machine or assembly language, such as “add 2+2 and output the result,” and translates that human understandable statement into a complicated, tedious, and immense machine language code (e.g., millions of 32, 64, or 128 bit length strings). Compilers thus translate high-level programming language into machine language.

This compiled machine language, as described above, is then used as the technical specification which sequentially constructs and causes the interoperation of many different computational machines such that humanly useful, tangible, and concrete work is done. For example, as indicated above, such machine language—the compiled version of the higher-level language—functions as a technical specification which selects out hardware logic gates, specifies voltage levels, voltage transition timings, etc., such that the humanly useful work is accomplished by the hardware.

Thus, a functional/operational technical description, when viewed by one of skill in the art, is far from an abstract idea. Rather, such a functional/operational technical description, when understood through the tools available in the art such as those just described, is instead understood to be a humanly understandable representation of a hardware specification, the complexity and specificity of which far exceeds the comprehension of most any one human. With this in mind, those skilled in the art will understand that any such operational/functional technical descriptions—in view of the disclosures herein and the knowledge of those skilled in the art—may be understood as operations made into physical reality by (a) one or more interchained physical machines, (b) interchained logic gates configured to create one or more physical machine(s) representative of sequential/combinatorial logic(s), (c) interchained ordered matter making up logic gates (e.g., interchained electronic devices (e.g., transistors), DNA, quantum devices, mechanical switches, optics, fluidics, pneumatics, molecules, etc.) that create physical reality representative of logic(s), or (d) virtually any combination of the foregoing. Indeed, any physical object which has a stable, measurable, and changeable state may be used to construct a machine based on the above technical description. Charles Babbage, for example, constructed the first computer out of wood and powered by cranking a handle.

Thus, far from being understood as an abstract idea, those skilled in the art will recognize a functional/operational technical description as a humanly-understandable representation of one or more almost unimaginably complex and time sequenced hardware instantiations. The fact that functional/operational technical descriptions might lend themselves readily to high-level computing languages (or high-level block diagrams for that matter) that share some words, structures, phrases, etc. with natural language simply cannot be taken as an indication that such functional/operational technical descriptions are abstract ideas, or mere expressions of abstract ideas. In fact, as outlined herein, in the technological arts this is simply not true. When viewed through the tools available to those of skill in the art, such functional/operational technical descriptions are seen as specifying hardware configurations of almost unimaginable complexity.

As outlined above, the reason for the use of functional/operational technical descriptions is at least twofold. First, the use of functional/operational technical descriptions allows near-infinitely complex machines and machine operations arising from interchained hardware elements to be described in a manner that the human mind can process (e.g., by mimicking natural language and logical narrative flow). Second, the use of functional/operational technical descriptions assists the person of skill in the art in understanding the described subject matter by providing a description that is more or less independent of any specific vendor's piece(s) of hardware.

The use of functional/operational technical descriptions assists the person of skill in the art in understanding the described subject matter since, as is evident from the above discussion, one could easily, although not quickly, transcribe the technical descriptions set forth in this document as trillions of ones and zeroes, billions of single lines of assembly-level machine code, millions of logic gates, thousands of gate arrays, or any number of intermediate levels of abstractions. However, if any such low-level technical descriptions were to replace the present technical description, a person of skill in the art could encounter undue difficulty in implementing the disclosure, because such a low-level technical description would likely add complexity without a corresponding benefit (e.g., by describing the subject matter utilizing the conventions of one or more vendor-specific pieces of hardware). Thus, the use of functional/operational technical descriptions assists those of skill in the art by separating the technical descriptions from the conventions of any vendor-specific piece of hardware.

In view of the foregoing, the logical operations/functions set forth in the present technical description are representative of static or sequenced specifications of various ordered-matter elements, in order that such specifications may be comprehensible to the human mind and adaptable to create many various hardware configurations. The logical operations/functions disclosed herein should be treated as such, and should not be disparagingly characterized as abstract ideas merely because the specifications they represent are presented in a manner that one of skill in the art can readily understand and apply in a manner independent of a specific vendor's hardware implementation.

In certain cases, use of a system or method may occur in a territory even if components are located outside the territory. For example, in a distributed computing context, use of a distributed computing system may occur in a territory even though parts of the system may be located outside of the territory (e.g., relay, server, processor, signal-bearing medium, transmitting computer, receiving computer, etc. located outside the territory).

A sale of a system or method may likewise occur in a territory even if components of the system or method are located and/or used outside the territory.

Further, implementation of at least part of a system for performing a method in one territory does not preclude use of the system in another territory.

All of the above U.S. patents, U.S. patent application publications, U.S. patent applications, foreign patents, foreign patent applications and non-patent publications referred to in this specification and/or listed in any Application Data Sheet are incorporated herein by reference, to the extent not inconsistent herewith.

The herein described subject matter sometimes illustrates different components contained within, or connected with, different other components. It is to be understood that such depicted architectures are merely exemplary, and that in fact many other architectures may be implemented which achieve the same functionality. In a conceptual sense, any arrangement of components to achieve the same functionality is effectively “associated” such that the desired functionality is achieved. Hence, any two components herein combined to achieve a particular functionality can be seen as “associated with” each other such that the desired functionality is achieved, irrespective of architectures or intermedial components. Likewise, any two components so associated can also be viewed as being “operably connected,” or “operably coupled,” to each other to achieve the desired functionality, and any two components capable of being so associated can also be viewed as being “operably couplable,” to each other to achieve the desired functionality. Specific examples of operably couplable include but are not limited to physically mateable and/or physically interacting components, and/or wirelessly interactable, and/or wirelessly interacting components, and/or logically interacting, and/or logically interactable components.

In some instances, one or more components may be referred to herein as “configured to,” “configured by,” “configurable to,” “operable/operative to,” “adapted/adaptable,” “able to,” “conformable/conformed to,” etc. Those skilled in the art will recognize that such terms (e.g. “configured to”) can generally encompass active-state components and/or inactive-state components and/or standby-state components, unless context requires otherwise.

With respect to the use of substantially any plural and/or singular terms herein, those having skill in the art can translate from the plural to the singular and/or from the singular to the plural as is appropriate to the context and/or application. The various singular/plural permutations are not expressly set forth herein for sake of clarity.

While particular aspects of the present subject matter described herein have been shown and described, it will be apparent to those skilled in the art that, based upon the teachings herein, changes and modifications may be made without departing from the subject matter described herein and its broader aspects and, therefore, the appended claims are to encompass within their scope all such changes and modifications as are within the true spirit and scope of the subject matter described herein. It will be understood by those within the art that, in general, terms used herein, and especially in the appended claims (e.g., bodies of the appended claims) are generally intended as “open” terms (e.g., the term “including” should be interpreted as “including but not limited to,” the term “having” should be interpreted as “having at least,” the term “includes” should be interpreted as “includes but is not limited to,” etc.). It will be further understood by those within the art that if a specific number of an introduced claim recitation is intended, such an intent will be explicitly recited in the claim, and in the absence of such recitation no such intent is present. For example, as an aid to understanding, the following appended claims may contain usage of the introductory phrases “at least one” and “one or more” to introduce claim recitations. However, the use of such phrases should not be construed to imply that the introduction of a claim recitation by the indefinite articles “a” or “an” limits any particular claim containing such introduced claim recitation to claims containing only one such recitation, even when the same claim includes the introductory phrases “one or more” or “at least one” and indefinite articles such as “a” or “an” (e.g., “a” and/or “an” should typically be interpreted to mean “at least one” or “one or more”); the same holds true for the use of definite articles used to introduce claim recitations. In addition, even if a specific number of an introduced claim recitation is explicitly recited, those skilled in the art will recognize that such recitation should typically be interpreted to mean at least the recited number (e.g., the bare recitation of “two recitations,” without other modifiers, typically means at least two recitations, or two or more recitations). Furthermore, in those instances where a convention analogous to “at least one of A, B, and C, etc.” is used, in general such a construction is intended in the sense one having skill in the art would understand the convention (e.g., “ a system having at least one of A, B, and C” would include but not be limited to systems that have A alone, B alone, C alone, A and B together, A and C together, B and C together, and/or A, B, and C together, etc.). In those instances where a convention analogous to “at least one of A, B, or C, etc.” is used, in general such a construction is intended in the sense one having skill in the art would understand the convention (e.g., “ a system having at least one of A, B, or C” would include but not be limited to systems that have A alone, B alone, C alone, A and B together, A and C together, B and C together, and/or A, B, and C together, etc.). It will be further understood by those within the art that typically a disjunctive word and/or phrase presenting two or more alternative terms, whether in the description, claims, or drawings, should be understood to contemplate the possibilities of including one of the terms, either of the terms, or both terms unless context dictates otherwise. For example, the phrase “A or B” will be typically understood to include the possibilities of “A” or “B” or “A and B.”

With respect to the appended claims, those skilled in the art will appreciate that recited operations therein may generally be performed in any order. Also, although various operational flows are presented in a sequence(s), it should be understood that the various operations may be performed in other orders than those which are illustrated, or may be performed concurrently. Examples of such alternate orderings may include overlapping, interleaved, interrupted, reordered, incremental, preparatory, supplemental, simultaneous, reverse, or other variant orderings, unless context dictates otherwise. Furthermore, terms like “responsive to,” “related to,” or other past-tense adjectives are generally not intended to exclude such variants, unless context dictates otherwise.

While various aspects and embodiments have been disclosed herein, other aspects and embodiments will be apparent to those skilled in the art. The various aspects and embodiments disclosed herein are for purposes of illustration and are not intended to be limiting, with the true scope and spirit being indicated by the following claims.

Claims

1-41. (canceled)

42. An augmented reality device comprising:

a display;

a camera;

a wireless communication interface;

circuitry configured for accepting input associated with a real-world object that is in a field of view of the camera;

circuitry configured for performing image processing with respect to image data of the real-world object to recognize the real-world object;

circuitry configured for accessing via the wireless communication interface information regarding the real-world object from one or more remote data sources; and

circuitry configured for outputting an augmented reality representation including the information regarding the real-world object at a position on the display corresponding to the real-world object in the field of view of the camera.

43. The augmented reality device of claim 42, wherein the augmented reality device comprises a smartphone, a tablet computer, a smartwatch, a digital camera, eyeglasses, or a digital video device.

44. The augmented reality device of claim 42, wherein the circuitry configured for accepting input associated with a real-world object that is in a field of view of the camera comprises:

circuitry configured for accepting at least one of the following types of input associated with a real-world object that is in a field of view of the camera: hand gesture, finger gesture, voice input, pointing, touch input, or eye movement input.

45. The augmented reality device of claim 42, wherein the circuitry configured for accepting input associated with a real-world object that is in a field of view of the camera comprises:

circuitry configured for accepting input associated with at least one of the following types of real-world objects: person, animal, product, sign, building, or vehicle.

46. The augmented reality device of claim 42, wherein the circuitry configured for accepting input associated with a real-world object that is in a field of view of the camera comprises:

circuitry configured for accepting input associated with a real-world object that is moving in a field of view of the camera.

47. The augmented reality device of claim 42, wherein the circuitry configured for accessing via the wireless communication interface information regarding the real-world object from one or more remote data sources comprises:

circuitry configured for accessing via the wireless communication interface at least some of the following types of information regarding the real-world object: picture, music, media, social media, or application.

48. The augmented reality device of claim 42, wherein the circuitry configured for accessing via the wireless communication interface information regarding the real-world object from one or more remote data sources comprises:

circuitry configured for accessing via the wireless communication interface information regarding the real-world object from one or more of the following remote data sources: payment, music, media, social media, or online.

49. The augmented reality device of claim 42, wherein the circuitry configured for outputting an augmented reality representation including the information regarding the real-world object at a position on the display corresponding to the real-world object in the field of view of the camera comprises:

circuitry configured for outputting a menu for accessing the information regarding the real-world object at a position on the display corresponding to the real-world object in the field of view of the camera.

50. The augmented reality device of claim 42, wherein the circuitry configured for outputting an augmented reality representation including the information regarding the real-world object at a position on the display corresponding to the real-world object in the field of view of the camera comprises:

circuitry configured for outputting a command option for using the information regarding the real-world object at a position on the display corresponding to the real-world object in the field of view of the camera.

51. The augmented reality device of claim 42, wherein the circuitry configured for outputting an augmented reality representation including the information regarding the real-world object at a position on the display corresponding to the real-world object in the field of view of the camera comprises:

circuitry configured for tagging the real-world object with an augmented reality representation.

52. The augmented reality device of claim 42, wherein the circuitry configured for outputting an augmented reality representation including the information regarding the real-world object at a position on the display corresponding to the real-world object in the field of view of the camera comprises:

circuitry configured for outputting a dialog interface including the information regarding the real-world object at a position on the display corresponding to the real-world object in the field of view of the camera.

53. The augmented reality device of claim 42, wherein the circuitry configured for outputting an augmented reality representation including the information regarding the real-world object at a position on the display corresponding to the real-world object in the field of view of the camera comprises:

circuitry configured for outputting an augmented reality representation including the information regarding the real-world object at a position on the display near the real-world object in the field of view of the camera.

54. The augmented reality device of claim 42, wherein the circuitry configured for outputting an augmented reality representation including the information regarding the real-world object at a position on the display corresponding to the real-world object in the field of view of the camera comprises:

circuitry configured for outputting one or more menu items including the information regarding the real-world object based at least partly on context or geographical location data.

55. The augmented reality device of claim 42, further comprising:

circuitry configured for determining a context or a geographical location.

56. The augmented reality device of claim 42, further comprising:

circuitry configured for detecting an action based at least partly on a context or a geographical location.

57. The augmented reality device of claim 42, further comprising:

circuitry configured for generating a digital representation of the real-world object to preserve intractability with the real-world object in the field of view of the camera, the digital representation including one or more of an image, proxy, or simulation.

58. The augmented reality device of claim 42, further comprising:

circuitry configured for highlighting one or more real-world objects within the field of view of the camera that are available for selection.

59. The augmented reality device of claim 42, further comprising:

circuitry configured for moving the augmented reality representation in response to an indication of completion.

60. A system comprising:

circuitry configured for accepting input associated with a real-world object that is in a field of view of a camera of an augmented reality device;

circuitry configured for performing image processing with respect to image data of the real-world object to recognize the real-world object;

circuitry configured for accessing information regarding the real-world object from one or more remote data sources; and

circuitry configured for outputting an augmented reality representation including the information regarding the real-world object at a position corresponding to the real-world object in the field of view of the camera.

61. A computer implemented process, the process comprising:

accepting input associated with a real-world object that is in a field of view of a camera of an augmented reality device;

performing image processing with respect to image data of the real-world object to recognize the real-world object;

accessing information regarding the real-world object from one or more remote data sources; and

outputting an augmented reality representation including the information regarding the real-world object at a position corresponding to the real-world object in the field of view of the camera.