DETECTING AND INTERPRETING REAL-WORLD AND SECURITY GESTURES ON TOUCH AND HOVER SENSITIVE DEVICES
“Real-world” gestures such as hand or finger movements/orientations that are generally recognized to mean certain things (e.g., an “OK” hand signal generally indicates an affirmative response) can be interpreted by a touch or hover sensitive device to more efficiently and accurately effect intended operations. These gestures can include, but are not limited to, “OK gestures,” “grasp everything gestures,” “stamp of approval gestures,” “circle select gestures,” “X to delete gestures,” “knock to inquire gestures,” “hitchhiker directional gestures,” and “shape gestures.” In addition, gestures can be used to provide identification and allow or deny access to applications, files, and the like.
This application is a continuation of U.S. patent application Ser. No. 14/881,031, filed Oct. 12, 2015 and published on Feb. 4, 2016 as U.S. Patent Publication No. 2016-0034177, which is a continuation of U.S. patent application Ser. No. 12/753,754, filed Apr. 2, 2010 and issued on Oct. 13, 2015 as U.S. Pat. No. 9,158,454, which is a divisional of U.S. patent application Ser. No. 11/818,500, filed Jun. 13, 2007 and issued on Jan. 25, 2011 as U.S. Pat. No. 7,877,707, which claims the benefit under 35 USC 119(e) of U.S. Patent Application No. 60/879,191, filed Jan. 6, 2007, the contents of which are incorporated herein by reference in their entirety for all purposes.
FIELD OF THE INVENTION

This relates to the detection and interpretation of gestures by touch and hover sensitive devices, and more particularly, to the detection and interpretation of real-world gestures and security-related gestures by touch and hover sensitive devices.
BACKGROUND OF THE INVENTION

There exist today many styles of input devices for performing operations in a computer system. The operations generally correspond to moving a cursor and making selections on a display screen. The operations can also include paging, scrolling, panning, zooming, etc. By way of example, the input devices can include buttons, switches, keyboards, mice, trackballs, touch pads, joy sticks, touch screens and the like. Each of these devices can have advantages and disadvantages that can be taken into account when designing a computer system.
Buttons and switches can be mechanical in nature and provide limited control with regards to the movement of the cursor and making selections. For example, they can be dedicated to moving the cursor in a specific direction (e.g., arrow keys) or to making specific selections (e.g., enter, delete, number, etc.).
In using a mouse instrument, the movement of the input pointer on a display can correspond to the relative movements of the mouse as the user moves the mouse along a surface. In using a trackball instrument, the movement of the input pointer on the display can correspond to the relative movements of a trackball as the user moves the ball within a housing. Mouse and trackball instruments can typically also include one or more buttons for making selections. A mouse instrument can also include scroll wheels that can allow a user to scroll the displayed content by rolling the wheel forward or backward.
With a touch pad instrument, such as the touch pad on a personal laptop computer, the movement of the input pointer on a display can correspond to the relative movements of the user's finger (or stylus) as the finger is moved along a surface of the touch pad. Touch screens, on the other hand, are a type of display screen that typically can include a touch-sensitive transparent panel (or “skin”) that can overlay the display screen. When using a touch screen, a user typically makes a selection on the display screen by pointing directly to objects (such as GUI objects) displayed on the screen (usually with a stylus or finger).
To provide additional functionality, hand gestures have been implemented with some of these input devices. By way of example, in touch pads, selections can be made when one or more taps can be detected on the surface of the touch pad. In some cases, any portion of the touch pad can be tapped, and in other cases a dedicated portion of the touch pad can be tapped. In addition to selections, scrolling can be initiated by using finger motion at the edge of the touch pad.
U.S. Pat. Nos. 5,612,719 and 5,590,219, assigned to Apple Computer, Inc. describe some other uses of gesturing. U.S. Pat. No. 5,612,719 discloses an onscreen button that can be responsive to at least two different button gestures made on the screen on or near the button. U.S. Pat. No. 5,590,219 discloses a method for recognizing an ellipse-type gesture input on a display screen of a computer system.
In recent times, more advanced gestures have been implemented. For example, scrolling can be initiated by placing four fingers on the touch pad so that the scrolling gesture is recognized, and thereafter moving these fingers on the touch pad to perform scrolling events. The methods for implementing these advanced gestures, however, can be limited and in many instances counterintuitive. In certain applications, it can be beneficial to enable a user to use “real-world” gestures such as hand movements and/or finger orientations that can be generally recognized to mean certain things, such as an “OK” signal, to more efficiently and accurately effect intended operations.
SUMMARY OF THE INVENTION

Embodiments of the present invention are directed to the detection of “real-world” gestures that can be generally recognized to mean certain things by a touch or hover sensitive device to more efficiently and accurately effect intended operations in computing systems. These gestures can include, but are not limited to, “OK gestures,” “grasp everything gestures,” “stamp of approval gestures,” “circle select gestures,” “X to delete gestures,” “knock to inquire gestures,” “hitchhiker directional gestures,” and “shape gestures.” In addition, gestures can be used to provide identification and allow or deny access to applications, files, and the like.
An “OK gesture” can be detected and interpreted to perform operations that can require an affirmative response, such as a text box UI element that asks the user whether a certain action is to be taken. A “grasp everything gesture” can be detected and interpreted to perform operations that can require a grouping action, such as the grouping of UI elements such as file icons or graphics symbols on a desktop or drawing page. A “stamp of approval gesture” can be detected and interpreted to perform operations that can require an affirmative response, such as a UI element including text that asks the user whether a certain action is to be taken. A “circle select gesture” can be detected and interpreted to perform operations that can require a grouping action, such as the grouping of UI elements such as file icons or graphics symbols on a desktop or drawing page. An “X to delete gesture” can be detected and interpreted to perform operations that can require a deleting action, such as the deleting of a UI element such as a file icon or graphics symbol on a desktop or drawing page. A “knock to inquire gesture” can be detected and interpreted to perform operations that can require an inquiry action, such as determining whether an application, file, person of interest or other entity is available to be opened, accessed, communicated with, and the like. A “hitchhiker gesture” can be detected and interpreted to perform operations that can require a directional input, such as scrolling, panning, windowing, translating in time, and the like. Shape gestures can be detected and interpreted to perform various operations, including creating a graphic image of that shape, or selecting or moving UI elements. An “identification gesture” formed by either or both hands on or over a touch or hover sensitive device can be detected and interpreted to perform operations that can require an identification action, such as determining whether the user is to get access, or continue to have access, to a restricted account, file or other item.
In the following description of preferred embodiments, reference is made to the accompanying drawings which form a part hereof, and in which it is shown by way of illustration specific embodiments in which the invention can be practiced. It is to be understood that other embodiments can be utilized and structural changes can be made without departing from the scope of the preferred embodiments of the present invention.
“Real-world” gestures such as hand or finger movements/orientations that are generally recognized to mean certain things (e.g., an “OK” hand signal generally indicates an affirmative response) can be interpreted by a touch or hover sensitive device to more efficiently and accurately effect intended operations. These gestures can include, but are not limited to, “OK gestures,” “grasp everything gestures,” “stamp of approval gestures,” “circle select gestures,” “X to delete gestures,” “knock to inquire gestures,” “hitchhiker directional gestures,” and “shape gestures.” In addition, gestures can be used to provide identification and allow or deny access to applications, files, and the like.
Although some embodiments of the invention can be described herein in terms of capacitive multi-touch sensor panels, embodiments of the invention can be generally applicable to other types of multi-touch sensors that can include resistive touch sensors, surface acoustic wave touch sensors, electromagnetic touch sensors, near field imaging touch sensors, optical touch sensors and the like. In addition, although the proximity sensors can be described herein as infrared (IR) proximity sensors, embodiments of the invention can be generally applicable to other types of proximity sensors having an output that can be AC-coupled to an analog channel.
The exemplary computer system 50 shown in
In most cases, the processor 56 together with an operating system can operate to execute computer code and produce and use data. Operating systems are generally well known and will not be described in greater detail. By way of example, the operating system can correspond to OS/2, DOS, Unix, Linux, Palm OS, and the like. The operating system can also be a special purpose operating system, such as can be used for limited purpose appliance-type computing devices. The operating system, other computer code and data can reside within a memory block 58 that can be operatively coupled to the processor 56. Memory block 58 can generally provide a place to store computer code and data that can be used by the computer system 50. By way of example, the memory block 58 can include Read-Only Memory (ROM), Random-Access Memory (RAM), hard disk drive and/or the like. The information could also reside on a removable storage medium and can be loaded or installed onto the computer system 50 when needed. Removable storage mediums can include, for example, CD-ROM, PC-CARD, memory card, floppy disk, magnetic tape, and a network component.
The computer system 50 can also include a display device 68 that can be operatively coupled to the processor 56. The display device 68 can be a liquid crystal display (LCD) (e.g., active matrix, passive matrix and the like). Alternatively, the display device 68 can be a monitor such as a monochrome display, color graphics adapter (CGA) display, enhanced graphics adapter (EGA) display, variable-graphics-array (VGA) display, super VGA display, cathode ray tube (CRT), and the like. The display device can also correspond to a plasma display or a display implemented with electronic inks.
The display device 68 can be configured to display a graphical user interface (GUI) 69 that can provide an easy to use interface between a user of the computer system and the operating system or application running thereon. Generally speaking, the GUI 69 can represent programs, files and operational options with graphical images, objects, or vector representations. The graphical images can include windows, fields, dialog boxes, menus, icons, buttons, cursors, scroll bars, etc. Such images can be arranged in predefined layouts, or can be created dynamically to serve the specific actions being taken by a user. During operation, the user can select and/or activate various graphical images in order to initiate functions and tasks associated therewith. By way of example, a user can select a button that opens, closes, minimizes, or maximizes a window, or an icon that launches a particular program. The GUI 69 can additionally or alternatively display information, such as non interactive text and graphics, for the user on the display device 68.
The computer system 50 can also include an input device 70 that can be operatively coupled to the processor 56. The input device 70 can be configured to transfer data from the outside world into the computer system 50. The input device 70 can, for example, be used to perform tracking and to make selections with respect to the GUI 69 on the display 68. The input device 70 can also be used to issue commands in the computer system 50. The input device 70 can include a touch-sensing device configured to receive input from a user's touch and to send this information to the processor 56. By way of example, the touch-sensing device can correspond to a touchpad or a touch screen. In many cases, the touch-sensing device can recognize touches, as well as the position and magnitude of touches on a touch sensitive surface. The touch sensing device can detect and report the touches to the processor 56 and the processor 56 can interpret the touches in accordance with its programming. For example, the processor 56 can initiate a task in accordance with a particular touch. A dedicated processor can be used to process touches locally and reduce demand for the main processor of the computer system.
The touch sensing device can be based on sensing technologies including but not limited to capacitive sensing, resistive sensing, surface acoustic wave sensing, pressure sensing, optical sensing, and/or the like. Furthermore, the touch sensing means can be based on single point sensing or multipoint sensing. Single point sensing can be capable of only distinguishing a single touch, while multipoint sensing can be capable of distinguishing multiple touches that occur at the same time.
As discussed above, the input device 70 can be a touch screen that is positioned over or in front of the display 68, integrated with the display device 68, or can be a separate component, such as a touch pad.
The computer system 50 can also include capabilities for coupling to one or more I/O devices 80. By way of example, the I/O devices 80 can correspond to keyboards, printers, scanners, cameras, microphones, speakers, and/or the like. The I/O devices 80 can be integrated with the computer system 50 or they can be separate components (e.g., peripheral devices). In some cases, the I/O devices 80 can be connected to the computer system 50 through wired connections (e.g., cables/ports). In other cases, the I/O devices 80 can be connected to the computer system 50 through wireless connections. By way of example, the data link can correspond to PS/2, USB, IR, Firewire, RF, Bluetooth or the like.
In accordance with one embodiment of the present invention, the computer system 50 can be designed to recognize gestures 85 applied to the input device 70 and to control aspects of the computer system 50 based on the gestures 85. In some cases, a gesture can be defined as a stylized interaction with an input device that can be mapped to one or more specific computing operations. The gestures 85 can be made through various hand and finger motions. Alternatively or additionally, the gestures can be made with a stylus. In all of these cases, the input device 70 can receive the gestures 85 and the processor 56 can execute instructions to carry out operations associated with the gestures 85. In addition, the memory block 58 can include a gesture operational program 88, which can be part of the operating system or a separate application. The gestural operation program 88 can generally include a set of instructions that can recognize the occurrence of gestures 85 and can inform one or more software agents of the gestures 85 and/or what action(s) to take in response to the gestures 85. Additional details regarding the various gestures that can be used as input commands are discussed further below.
In one embodiment, upon a user performing one or more gestures, the input device 70 can relay gesture information to the processor 56. Using instructions from memory 58, and more particularly, the gestural operational program 88, the processor 56 can interpret the gestures 85 and control different components of the computer system 50, such as memory 58, a display 68 and I/O devices 80, based on the gestures 85. The gestures 85 can be identified as commands for performing actions in applications stored in the memory 58, modifying image objects shown on the display 68, modifying data stored in memory 58, and/or for performing actions in I/O devices 80.
Again, although
Driver logic 19 can provide multiple multi-touch subsystem outputs 20 and can present a proprietary interface that drives a high voltage driver, which can include a decoder 21 and subsequent level shifter and driver stage 22, although level-shifting functions could be performed before decoder functions. Level shifter and driver 22 can provide level shifting from a low voltage level (e.g. CMOS levels) to a higher voltage level, providing a better signal-to-noise (S/N) ratio for noise reduction purposes. Decoder 21 can decode the drive interface signals to one or more of N outputs, where N is the maximum number of rows in the panel. Decoder 21 can be used to reduce the number of drive lines needed between the high voltage driver and multi-touch panel 24. Each multi-touch panel row input 23 can drive one or more rows in multi-touch panel 24. It should be noted that driver 22 and decoder 21 can also be integrated into a single ASIC, be integrated into driver logic 19, or in some instances be unnecessary.
The multi-touch panel 24 can include a capacitive sensing medium having a plurality of row traces or driving lines and a plurality of column traces or sensing lines, although other sensing media can also be used. The row and column traces can be formed from a substantially transparent conductive medium, such as Indium Tin Oxide (ITO) or Antimony Tin Oxide (ATO), although other transparent and non-transparent materials, such as copper, can also be used. In some embodiments, the row and column traces can be formed on opposite sides of a dielectric material, and can be perpendicular to each other, although in other embodiments other non-Cartesian orientations are possible. For example, in a polar coordinate system, the sensing lines can be concentric circles and the driving lines can be radially extending lines (or vice versa). It should be understood, therefore, that the terms “row” and “column,” “first dimension” and “second dimension,” or “first axis” and “second axis” as used herein are intended to encompass not only orthogonal grids, but the intersecting traces of other geometric configurations having first and second dimensions (e.g. the concentric and radial lines of a polar-coordinate arrangement). The rows and columns can be formed on a single side of a substrate, or can be formed on two separate substrates separated by a dielectric material. In some instances, an additional dielectric cover layer can be placed over the row or column traces to strengthen the structure and protect the entire assembly from damage.
At the “intersections” of the traces of the multi-touch panel 24, where the traces pass above and below (cross) each other (but do not make direct electrical contact with each other), the traces can essentially form two electrodes (although more than two traces could intersect as well). Each intersection of row and column traces can represent a capacitive sensing node and can be viewed as a picture element (pixel) 26, which can be particularly useful when multi-touch panel 24 is viewed as capturing an “image” of touch. (In other words, after multi-touch subsystem 27 has determined whether a touch event has been detected at each touch sensor in the multi-touch panel, the pattern of touch sensors in the multi-touch panel at which a touch event occurred can be viewed as an “image” of touch (e.g. a pattern of fingers touching the panel).) The capacitance between row and column electrodes can appear as a stray capacitance on all columns when the given row is held at DC and as a mutual capacitance Csig when the given row is stimulated with an AC signal. The presence of a finger or other object near or on the multi-touch panel can be detected by measuring changes to Csig. The columns of multi-touch panel 24 can drive one or more analog channels 17 (also referred to herein as event detection and demodulation circuits) in multi-touch subsystem 27. In some implementations, each column is coupled to one dedicated analog channel 17. However, in other implementations, the columns can be couplable via an analog switch to a fewer number of analog channels 17.
Computing system 10 can also include host processor 14 for receiving outputs from multi-touch panel processor 12 and performing actions based on the outputs that can include, but are not limited to, moving an object such as a cursor or pointer, scrolling or panning, adjusting control settings, opening a file or document, viewing a menu, making a selection, executing instructions, operating a peripheral device connected to the host device, etc. Host processor 14, which can be a personal computer CPU, can also perform additional functions that may not be related to multi-touch panel processing, and can be coupled to program storage 15 and display device 13 such as an LCD display for providing a user interface (UI) to a user of the device. It should also be noted that multi-touch panel 24 and display device 13 can be integrated into one single touch-screen display device.
Although capacitive touch sensor panels can detect objects hovering within the near-field of the panel, where they can appear as “weak” touches, in some embodiments proximity sensor panels can be co-located with touch sensor panels to provide hover detection capabilities outside the near-field hover detection capabilities of capacitive touch sensor panels. These proximity sensor panels can be arranged as an array of proximity sensors that can be scanned in a manner similar to the scanning of a touch sensor panel to generate an “image” of hover. Further details of multi-touch sensor detection, including proximity detection by a touch panel and proximity detection by a proximity sensor panel, are described in commonly assigned co-pending applications, including U.S. patent application Ser. No. 10/840,862, published on May 11, 2006 as U.S. Publication No. US2006/0097991, and U.S. patent application entitled “Proximity and Multi-Touch Sensor Detection and Demodulation,” filed on Jan. 3, 2007, the entirety of both of which are incorporated herein by reference.
Following block 102, multipoint processing method 100 can proceed to block 104 where the image can be converted into a collection or list of features. Each feature can represent a distinct input such as a touch. In most cases, each feature can include its own unique identifier (ID), x coordinate, y coordinate, Z magnitude, angle θ, area A, and the like. By way of example,
The conversion from data or images to features can be accomplished using methods described in U.S. Patent Publication No. US2006/0097991. As disclosed therein, the raw data can typically be received in a digitized form, and can include values for each node of the touch screen. The values can be between 0 and 256 where 0 equates to no touch pressure and 256 equates to full touch pressure. Thereafter, the raw data can be filtered to reduce noise. Once filtered, gradient data, which indicates the topology of each group of connected points, can be generated. Thereafter, the boundaries for touch regions can be calculated based on the gradient data (i.e., a determination is made as to which points are grouped together to form each touch region). By way of example, a watershed algorithm can be used. Once the boundaries are determined, the data for each of the touch regions can be calculated (e.g., X, Y, Z, θ, A).
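The steps just described (noise filtering, grouping of connected points, and per-region calculation) can be sketched as follows. This is a minimal illustration, assuming a NumPy array of raw node values and using simple connected-component labeling via scipy.ndimage.label in place of the watershed algorithm mentioned in the text; the threshold value and the field names are illustrative assumptions.

```python
import numpy as np
from scipy import ndimage

def extract_features(raw, noise_threshold=8):
    """Convert a raw touch 'image' into a list of per-region features.

    raw: 2D array of per-node touch values (0 = no touch, larger = harder touch).
    Returns one dict per touch region with X/Y centroid, total pressure Z and area A.
    Thresholding plus connected-component grouping stands in for the filtering,
    gradient and watershed steps described above.
    """
    filtered = np.where(raw >= noise_threshold, raw, 0)      # suppress noise
    labels, n_regions = ndimage.label(filtered > 0)          # group connected points
    features = []
    for region_id in range(1, n_regions + 1):
        ys, xs = np.nonzero(labels == region_id)
        weights = filtered[ys, xs].astype(float)
        features.append({
            "id": region_id,
            "x": float(np.average(xs, weights=weights)),     # pressure-weighted centroid
            "y": float(np.average(ys, weights=weights)),
            "z": float(weights.sum()),                       # total "pressure"
            "area": int(len(xs)),                            # number of nodes touched
        })
    return features

# Example: two separated touches on a 12x12 panel
panel = np.zeros((12, 12), dtype=int)
panel[2:4, 2:4] = 120     # first finger
panel[8:10, 7:10] = 200   # second finger
print(extract_features(panel))
```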
Following block 104, multipoint processing method 100 proceeds to block 106 where feature classification and groupings can be performed. During classification, the identity of each of the features can be determined. For example, the features can be classified as a particular finger, thumb, palm or other object. Once classified, the features can be grouped. The manner in which the groups are formed can be widely varied. In most cases, the features can be grouped based on some criteria (e.g., they carry a similar attribute). For example, the two features shown in
Following block 106, the multipoint processing method 100 proceeds to block 108 where key parameters for the feature groups can be calculated. The key parameters can include distance between features, x/y centroid of all features, feature rotation, total pressure of the group (e.g., pressure at centroid), and the like. As shown in
As should be appreciated, blocks 102-108 are repetitively performed during a user stroke thereby generating a plurality of sequentially configured signals. The initial and current parameters can be compared in later steps to perform actions in the system.
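The key parameters described for block 108 can be computed with a small helper such as the one below, operating on feature dictionaries like those produced by the earlier sketch. The exact parameter set (pressure-weighted centroid, spread, total pressure) and the field names are illustrative assumptions.

```python
import math

def group_parameters(features):
    """Key parameters for a group of features (e.g. the fingers of one hand).

    features: list of dicts with "x", "y" and "z" keys, such as those produced
    by the feature-extraction sketch above.
    """
    total_z = sum(f["z"] for f in features)
    cx = sum(f["x"] * f["z"] for f in features) / total_z     # pressure-weighted centroid
    cy = sum(f["y"] * f["z"] for f in features) / total_z
    spread = sum(math.hypot(f["x"] - cx, f["y"] - cy) for f in features) / len(features)
    return {"centroid": (cx, cy), "spread": spread, "total_pressure": total_z}

fingers = [{"x": 2.0, "y": 3.0, "z": 400.0}, {"x": 6.0, "y": 3.0, "z": 400.0}]
print(group_parameters(fingers))   # centroid (4.0, 3.0), spread 2.0, total pressure 800.0
```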
Following block 108, the process flow proceeds to block 110 where the group is or can be associated with a user interface (UI) element. UI elements can be buttons, boxes, lists, sliders, wheels, knobs, pictures, documents, icons, etc. Each UI element can represent a component or control of the user interface. The application behind the UI element(s) can have access to the parameter data calculated in block 108. In one implementation, the application can rank the relevance of the touch data to the UI element corresponding thereto. The ranking can be based on some predetermined criteria. The ranking can include producing a figure of merit and giving whichever UI element has the highest figure of merit sole access to the group. There can even be some degree of hysteresis as well (once one of the UI elements claims control of that group, the group sticks with the UI element until another UI element has a much higher ranking). By way of example, the ranking can include determining proximity of the centroid (or features) to the image object associated with the UI element.
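One hedged reading of this ranking is a per-element figure of merit based on proximity of the group centroid to the element, plus a hysteresis bonus for the element that already owns the group. The scoring function, bonus value, and data layout below are illustrative assumptions, not the patent's prescribed method.

```python
import math

def associate_group(group_centroid, ui_elements, current_owner=None, hysteresis_bonus=0.5):
    """Pick the UI element that should claim a feature group.

    ui_elements: dict mapping element name -> (x, y) center of the element on screen.
    The figure of merit is the inverse of the centroid-to-element distance; the
    current owner gets a bonus so the group "sticks" until another element
    scores substantially higher.
    """
    gx, gy = group_centroid
    best_name, best_score = None, float("-inf")
    for name, (ex, ey) in ui_elements.items():
        score = 1.0 / (1.0 + math.hypot(gx - ex, gy - ey))   # closer => higher merit
        if name == current_owner:
            score += hysteresis_bonus                        # stickiness
        if score > best_score:
            best_name, best_score = name, score
    return best_name

# The slider keeps control even though the centroid drifted slightly toward the knob.
print(associate_group((5.2, 3.0), {"slider": (5, 5), "knob": (5, 2)}, current_owner="slider"))
```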
Following block 110, the multipoint processing method 100 proceeds to blocks 112 and 114. The blocks 112 and 114 can be performed approximately at the same time. From the user perspective, in one embodiment, the blocks 112 and 114 appear to be performed concurrently. In block 112, one or more actions can be performed based on differences between initial and current parameter values, and can also be based on a UI element to which they are associated, if any. In block 114, user feedback pertaining to the one or more actions being performed can be provided. By way of example, user feedback can include display, audio, tactile feedback and/or the like.
The above methods and techniques can be used to implement any number of GUI interface objects and actions. For example, gestures can be created to detect and effect a user command to resize a window, scroll a display, rotate an object, zoom in or out of a displayed view, delete or insert text or other objects, etc.
A basic category of gestures can allow a user to input the common commands that can be inputted through the use of a conventional mouse or trackball instrument.
As shown in
If a two finger touch is detected 711, then if at least one of the touchdown points is associated with a selectable file object 713, then a selection 715 can be made of the object. If one or more taps by one of the fingers on the touch sensitive display are detected 717 while the touchdown point is maintained, then a right-click mouse action can be invoked.
In accordance with a preferred embodiment, if a detected touch or touches are not associated with any selectable file object or hyperlinks, then as shown in
Scrolling generally pertains to moving displayed data or images across a viewing area on a display screen so that a new set of data is brought into view in the viewing area. In most cases, once the viewing area is full, each new set of data appears at the edge of the viewing area and all other sets of data move over one position. That is, the new set of data appears for each set of data that moves out of the viewing area. In essence, these functions allow a user to view consecutive sets of data currently outside of the viewing area. In most cases, the user is able to accelerate their traversal through the data sets by moving his or her finger at greater speeds. Examples of scrolling through lists can be found in U.S. Patent Publication Nos. 2003/0076303A1, 2003/0076301A1, 2003/0095096A1, which are herein incorporated by reference.
If the touch down point(s) is/are within a scrollable area, then a scrolling action can be invoked 723 similar to the pressing down of a scroll wheel on a conventional mouse instrument. If the scrollable area is scrollable in only one direction (e.g., up and down), then the scrolling action invoked will be a unidirectional scroll. If the scrollable area is scrollable two dimensionally, then the scrolling action invoked will be omnidirectional.
In a unidirectional scrolling action where the scrolling is restricted to the vertical direction (i.e., the Y axis), only the vertical vector component of the tracked touch movement can be used as input for effecting vertical scrolling. Similarly, in a unidirectional scrolling action where the scrolling is restricted to the horizontal direction (i.e., the X axis), only the horizontal vector component of the tracked touch movement can be used as input for effecting horizontal scrolling. If the scrolling action is omnidirectional, then the scrolling action effected can track the movement of the tracked touch.
If the detected touch is a one finger touch, then the scrolling action is ready to be performed 724 at a normal, or 1X, speed. If and once the touched down finger begins to move on the touch screen, then a scroll action can be performed by tracking the movement of the touchdown point on the touch screen. If the detected touch is a two finger touch, then the scrolling action can be performed 725 at a double, or 2X speed. Additional fingers can be added to perform even faster scrolling action, where a detection of a four finger touch can be translated into “pg up” or “pg dn” commands within a multi-page document window.
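A minimal sketch of this finger-count mapping follows. The 1X/2X factors and the four-finger page commands echo the text, while the linear scaling for three fingers, the sign convention, and the return format are illustrative assumptions.

```python
def scroll_command(finger_count, dy):
    """Translate tracked vertical finger movement dy into a scroll command.

    One finger scrolls at normal (1X) speed and two fingers at 2X, as described
    above; three fingers are assumed here to scale linearly, and a four finger
    touch is translated into page-up / page-down commands.
    """
    if finger_count >= 4:
        return "pg up" if dy < 0 else "pg dn"      # sign convention is an assumption
    return {"action": "scroll", "amount": dy * max(1, finger_count)}

print(scroll_command(2, 12))   # {'action': 'scroll', 'amount': 24}
print(scroll_command(4, -3))   # 'pg up'
```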
In accordance with another embodiment, the displayed data can continue to move even when the finger is removed from the touch screen. The continuous motion can be based at least in part on the previous motion. For example, the scrolling can be continued in the same direction and speed. In some cases, the scrolling can slow down over time, i.e., the speed of the traversal through the media items can get slower and slower until the scrolling eventually stops, thereby leaving a static list. By way of example, each new media item brought into the viewing area can incrementally decrease the speed. Alternatively or additionally, the displayed data can stop moving when the finger is placed back on the touch screen. That is, the placement of the finger back on the touch screen can implement braking, which can stop or slow down the continuing motion.
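The continued motion and braking behave like simple momentum with friction. The sketch below decays the last tracked velocity each frame and stops immediately if a finger touches back down; the decay constant and stop threshold are illustrative assumptions.

```python
def momentum_offsets(initial_velocity, friction=0.95, finger_down_at=None, stop_speed=0.1):
    """Yield per-frame scroll offsets after the finger lifts off.

    The velocity decays by `friction` each frame, so the scrolling slows and
    eventually stops; placing a finger back down at frame `finger_down_at`
    brakes the motion instantly.
    """
    offsets, v, frame = [], initial_velocity, 0
    while abs(v) > stop_speed:
        if finger_down_at is not None and frame >= finger_down_at:
            break                     # finger re-touch acts as a brake
        offsets.append(v)
        v *= friction                 # friction slows the traversal over time
        frame += 1
    return offsets

print(len(momentum_offsets(12.0)))                      # coasts to a stop on its own
print(len(momentum_offsets(12.0, finger_down_at=5)))    # braked after 5 frames
```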
By way of examples to illustrate the above discussed gestural actions, as shown in
A drag-and-drop function can be invoked by touching, with at least one finger, the image associated with the object to be dropped and graphically dragging the object to the desired drop location by maintaining the touch, such as shown in
Certain mouse functionalities can require two touches to complete. For instance, as shown in
Other file selection functions that can normally require a combination of mouse and keyboard action can be performed using only touch action. For instance, in the Microsoft Windows environment, in order to select multiple files within file window 502, a user typically needs to hold down the shift button while dragging the mouse icon over the sequential files to be selected. Without holding down the shift button, the dragging of the mouse icon can be interpreted as a drag and drop action. As shown in
Referring to the scrolling actions described in
Finally, using a multi-touch display that is capable of proximity detection, such as the panels described in the aforementioned and incorporated by reference commonly assigned co-pending U.S. application Ser. No. 10/840,862 (published as U.S. Patent Publication No. US2006/0097991) and an application entitled “Proximity and Multi-Touch Sensor Detection and Demodulation,” Attorney Docket No. 10684-20011.00, filed on Jan. 3, 2007, gestures of a finger can also be used to invoke a hovering action that can be the equivalent of hovering a mouse icon over an image object.
By way of an example, referring to
Gestures can also be used to invoke and manipulate virtual control interfaces, such as volume knobs, switches, sliders, keyboards, and other virtual interfaces that can be created to facilitate human interaction with a computing system or a consumer electronic item. By way of an example, using a gesture to invoke a virtual control interface, and referring to
This association can also be based in part on the mode or current state of the computing device at the time of the input. For example, the same gesture can be interpreted alternatively as a volume knob gesture if a song is currently playing on the computing device, or as a rotate command if an object editing application is being executed. Other user feedback can be provided, including for example audible or tactile feedback.
Once the knob 170 is displayed as shown in
As shown in
As shown in
As shown in
As shown in
Also as shown in
As shown in
It should be noted that additional gestures can be performed simultaneously with the virtual control knob gesture. For example, more than one virtual control knob can be controlled at the same time using both hands, i.e., one hand for each virtual control knob. Alternatively or additionally, one or more slider bars can be controlled at the same time as the virtual control knob, i.e., one hand operates the virtual control knob, while at least one finger and maybe more than one finger of the opposite hand can operate at least one slider and maybe more than one slider bar, e.g., slider bar for each finger.
It should also be noted that although the described embodiment utilizes a virtual control knob, in another embodiment, the UI element can be a virtual scroll wheel. As an example, the virtual scroll wheel can mimic an actual scroll wheel such as those described in U.S. Patent Publication Nos. US2003/0076303A1, US2003/0076301A1, and US2003/0095096A1, all of which are herein incorporated by reference.
If the user input is classified as a gesture input, the touch-based method 200 proceeds to block 208 where one or more gesture control actions can be performed corresponding to the user input. The gesture control actions can be based at least in part on changes that occur with or between the at least two unique identifiers.
Following block 254, the touch-based method 250 proceeds to block 256 where a next image can be captured during the input stroke on the touch sensitive surface. Images can be captured sequentially during the stroke and thus there can be a plurality of images associated with the stroke.
Following block 256, touch-based method 250 proceeds to block 258 where a determination can be made as to whether the touch mode changed between capture of the initial image and capture of the next image. If the touch mode changed, the touch-based method 250 proceeds to block 260 where the next image can be set as the initial image and thereafter the touch mode can again be determined at block 254 based on the new initial image. If the touch mode stayed the same, the touch-based method 250 proceeds to block 262 where the initial and next images can be compared and one or more control signals can be generated based on the comparison.
Following block 304, the touch-based method 300 proceeds to block 306 where the image object can be modified based on and in unison with the gesture input. By modified, it is meant that the image object can change according to the particular gesture or gestures being performed. By in unison, it is meant that the changes can occur approximately while the gesture or gestures are being performed. In most cases, there is a one to one relationship between the gesture(s) and the changes occurring at the image object and they occur substantially simultaneously. In essence, the image object follows the motion of the fingers. For example, spreading of the fingers can simultaneously enlarge the object, closing of the fingers can simultaneously reduce the image object, rotating the fingers can simultaneously rotate the object, and translating the fingers can allow simultaneous panning or scrolling of the image object.
In one embodiment, block 306 can include determining which image object is associated with the gesture being performed, and thereafter locking the displayed object to the fingers disposed over it such that the image object changes in accordance with the gestural input. By locking or associating the fingers to the image object, the image object can continuously adjust itself in accordance to what the fingers are doing on the touch screen. Often the determination and locking occurs at set down, i.e., when the finger is positioned on the touch screen.
Following block 352, the zoom gesture method 350 proceeds to block 354 where the distance between at least the two fingers can be compared. The distance can be from finger to finger or from each finger to some other reference point as for example the centroid. If the distance between the two fingers increases (spread apart), a zoom-in signal can be generated as shown in block 356. If the distance between two fingers decreases (close together), a zoom-out signal can be generated as shown in block 358. In most cases, the set down of the fingers will associate or lock the fingers to a particular image object being displayed. For example, the touch sensitive surface can be a touch screen, and the image object can be displayed on the touch screen. This typically occurs when at least one of the fingers is positioned over the image object. As a result, when the fingers are moved apart, the zoom-in signal can be used to increase the size of the embedded features in the image object and when the fingers are pinched together, the zoom-out signal can be used to decrease the size of embedded features in the object. The zooming typically occurs within a predefined boundary such as the periphery of the display, the periphery of a window, the edge of the image object, and/or the like. The embedded features can be formed on a plurality of layers, each of which represents a different level of zoom.
In most cases, the amount of zooming varies according to the distance between the two objects. Furthermore, the zooming typically can occur substantially simultaneously with the motion of the objects. For instance, as the fingers spread apart or close together, the object zooms in or zooms out at the same time. Although this methodology is directed at zooming, it should be noted that it can also be used for enlarging or reducing. The zoom gesture method 350 can be particularly useful in graphical programs such as publishing, photo, and drawing programs. Moreover, zooming can be used to control a peripheral device such as a camera, i.e., when the fingers are spread apart, the camera zooms out, and when the fingers are closed the camera zooms in.
Following block 402, the pan method 400 proceeds to block 404 where the position of the two objects when the objects are moved together across the touch screen can be monitored. Following block 404, the pan method 400 proceeds to block 406 where a pan signal can be generated when the position of the two objects changes relative to an initial position. In most cases, the set down of the fingers can lock the fingers to a particular image object displayed on the touch screen. As a result, when the fingers are moved together across the touch screen, the pan signal can be used to translate the image in the direction of the fingers. In most cases, the amount of panning varies according to the distance the two objects move. Furthermore, the panning typically can occur substantially simultaneously with the motion of the objects. For instance, as the fingers move, the object moves with the fingers at the same time.
Following block 452, the rotate method 450 proceeds to block 454 where the angle of each of the fingers can be set. The angles can typically be determined relative to a reference point. Following block 454, rotate method 450 proceeds to block 456 where a rotate signal can be generated when the angle of at least one of the objects changes relative to the reference point. In most cases, the set down of the fingers can associate or lock the fingers to a particular image object displayed on the touch screen. Typically, when at least one of the fingers is positioned over the image on the image object, the image object can be associated with or locked to the fingers. As a result, when the fingers are rotated, the rotate signal can be used to rotate the object in the direction of finger rotation (e.g., clockwise, counterclockwise). In most cases, the amount of object rotation varies according to the amount of finger rotation, i.e., if the fingers move 5 degrees then so will the object. Furthermore, the rotation typically can occur substantially simultaneously with the motion of the fingers. For instance, as the fingers rotate, the object rotates with the fingers at the same time.
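The zoom, pan, and rotate methods above all compare the same pair of tracked fingers between the initial image and the current image. A compact sketch of that comparison is given below; deriving the three signals in a single helper is an illustrative simplification, not the structure of methods 350, 400 and 450 themselves.

```python
import math

def two_finger_signals(initial, current):
    """Derive zoom, pan and rotate signals from two tracked finger positions.

    initial / current: ((x1, y1), (x2, y2)) for the same two fingers in the
    initial image and the current image of the stroke.
    """
    (ax1, ay1), (ax2, ay2) = initial
    (bx1, by1), (bx2, by2) = current

    # Zoom: change in finger-to-finger distance (spread => zoom in, pinch => zoom out).
    zoom = math.hypot(bx2 - bx1, by2 - by1) - math.hypot(ax2 - ax1, ay2 - ay1)

    # Pan: displacement of the two-finger centroid relative to the initial position.
    pan = ((bx1 + bx2) / 2 - (ax1 + ax2) / 2, (by1 + by2) / 2 - (ay1 + ay2) / 2)

    # Rotate: change in the angle of the line joining the two fingers.
    rotate = math.degrees(math.atan2(by2 - by1, bx2 - bx1) - math.atan2(ay2 - ay1, ax2 - ax1))

    return {"zoom": zoom, "pan": pan, "rotate": rotate}

# Fingers spread apart and rotate slightly while drifting to the right.
print(two_finger_signals(((0, 0), (2, 0)), ((1, 0), (4, -1))))
```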
It should be noted that the methods described in
Further details on the detection, interpretation and use of gestures are described in Applicant's co-pending U.S. patent application Ser. No. 11/038,590 entitled “Mode-Based Graphical User Interfaces for Touch Sensitive Input Devices,” filed Jan. 18, 2005, and published as U.S. Patent Publication No. 2006/0026535 on Feb. 2, 2006, the contents of which are incorporated by reference herein. Further details on detecting touches on a surface and interpreting them as gestures are described in Applicant's co-pending U.S. patent application Ser. No. 11/428,522 entitled “Identifying Contacts on a Touch Surface,” filed Jul. 3, 2006, and published as U.S. Patent Publication No. 2006/0238522 on Oct. 26, 2006, the contents of which are incorporated by reference herein. Further details on “knuckle” and “pengrip” detection are described in U.S. Pat. No. 6,323,846 entitled “Method and Apparatus for Integrating Manual Input,” the contents of which are incorporated by reference herein.
“Real-world” gestures such as hand or finger movements/orientations that are generally recognized to mean certain things (e.g., an “OK” hand signal generally indicates an affirmative response) can be detected by the touch or hover sensitive device and interpreted by the multipoint processing method to more efficiently and accurately effect intended operations. Like other gestures described above, “real-world” gestures can be detected and interpreted by first obtaining one or more images of touch, classifying and grouping the features, calculating any key parameters of the groups, associating the groups with UI elements where applicable, and then performing the appropriate action.
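The chain just described (image of touch, features, classification and grouping, parameters, UI association, action) can be sketched as a small dispatcher. The function and parameter names below are hypothetical placeholders; the classifier predicates and handlers stand in for the gesture-specific logic described in the following sections.

```python
def process_features(features, classifiers, associate, handlers):
    """Run one frame of a real-world gesture pipeline on an extracted feature list.

    classifiers: list of (gesture_name, predicate); each predicate returns group
                 parameters when its gesture matches the features, else None.
    associate:   callable(params) -> UI element (e.g. the centroid-ranking helper above).
    handlers:    dict mapping gesture_name -> callable(params, ui_element).
    """
    for gesture_name, predicate in classifiers:
        params = predicate(features)                  # classify and group the features
        if params is None:
            continue                                  # not this gesture; try the next one
        target = associate(params)                    # associate the group with a UI element
        return gesture_name, handlers[gesture_name](params, target)   # perform the action
    return None                                       # no known gesture in this frame

# Toy usage with a hypothetical single-feature "OK" classifier and a confirm handler.
classifiers = [("ok_gesture", lambda feats: {"centroid": (5, 5)} if len(feats) == 1 else None)]
handlers = {"ok_gesture": lambda params, target: f"confirm {target}"}
print(process_features([{"id": 1}], classifiers, lambda params: "dialog", handlers))
```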
To detect an “OK gesture,” either a touch sensor panel with some hover detecting capability or a touch sensor panel co-located with a proximity sensor panel can be employed.
In
If a single feature is found, and the various characteristics indicative of an “OK gesture” are present (e.g., either images 1804 of
After the features have been classified and grouped, parameters for the group can be computed, such as an approximate center 1812 of the region of no detected touch or hover, and if this center is located coincident with the UI element (e.g. text box 1810 in
To detect a “grasp everything gesture,” either a touch sensor panel with some hover detecting capability or a touch sensor panel co-located with a proximity sensor panel can be employed.
If the images and corresponding features are tracked over time, the completion of a “grasp everything gesture” can be accurately detected. At that point, all UI elements 1902 within the calculated circumferential boundary 1906 can be associated with the “grasp everything gesture” and selected together, for example. Any UI elements outside this boundary would not be “grasped.” Note, however, that if after some time has passed, instead of the images of
After the “grasp everything gesture” has been detected and the corresponding UI elements have been grouped, the user can utilize various input mechanisms to take further actions with regard to the grasped UI elements. In an exemplary extension of the “grasp everything feature,” if images 1916 and 1918 are further tracked over time and rapidly get weaker and quickly disappear, indicating that the user may have quickly pulled the clenched hand away from the touch screen, this can be further interpreted as a “throw away” gesture to discard the grasped UI elements (i.e. a “grasp and delete everything gesture”).
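One hedged way to recognize completion of a “grasp everything gesture” from tracked images is to watch the spread of the fingertip features shrink from frame to frame, as sketched below; the shrink ratio and the sample fingertip coordinates are illustrative assumptions.

```python
import math

def grasp_completed(frames, shrink_ratio=0.4):
    """Decide whether a sequence of per-frame fingertip positions shows a grasp.

    frames: list of frames, each a list of (x, y) fingertip positions.
    The gesture is treated as complete when the average distance of the
    fingertips from their centroid has shrunk to `shrink_ratio` of its
    starting value, i.e. the fingers have drawn together into a clench.
    """
    def spread(points):
        cx = sum(x for x, _ in points) / len(points)
        cy = sum(y for _, y in points) / len(points)
        return sum(math.hypot(x - cx, y - cy) for x, y in points) / len(points)

    start, end = spread(frames[0]), spread(frames[-1])
    return end <= start * shrink_ratio

open_hand = [(0, 4), (2, 5), (4, 5), (6, 4), (3, 0)]      # five spread fingertips
clenched  = [(2.6, 3.2), (3.0, 3.4), (3.4, 3.4), (3.8, 3.2), (3.2, 2.4)]
print(grasp_completed([open_hand, clenched]))              # True: fingers drew together
```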
To detect a “stamp of approval gesture,” either a touch sensor panel with some hover detecting capability or a touch sensor panel co-located with a proximity sensor panel can be employed.
After the detected image 2006 has been converted to one or more features and classified and grouped as a “stamp of approval gesture,” parameters for the feature can be computed, such as an approximate center 2008 of the “stamp of approval gesture,” and if this center is located coincident with the UI element (e.g. text box 2002 in
To detect a “circle select everything gesture,” either a touch sensor panel with some hover detecting capability or a touch sensor panel co-located with a proximity sensor panel can be employed.
If a “circle select gesture” is detected, all UI elements 2102 within the circumferential boundary 2106 can be associated with the gesture and selected together, for example. Any UI elements outside this boundary would not be selected. However, if after some time has passed, the moving image of touch 2108 has not yet substantially returned to its starting point, this can indicate that the user has abandoned the “circle select gesture” or did not intend that gesture at all, and in that case no UI elements would be selected.
After the “circle select gesture” has been detected and the corresponding UI elements have been selected, the user can utilize various input mechanisms to take further actions with regard to the selected UI elements. For example, the selected UI elements can be copied, moved, or deleted as a group.
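Selecting the UI elements inside the traced boundary amounts to a point-in-polygon test against the sampled path of the moving touch. The ray-casting sketch below illustrates this; the element names, positions, and the traced path are illustrative assumptions.

```python
def point_in_path(point, path):
    """Ray-casting test: is `point` inside the closed polygon sampled by `path`?"""
    x, y = point
    inside = False
    for (x1, y1), (x2, y2) in zip(path, path[1:] + path[:1]):   # close the loop
        crosses = (y1 > y) != (y2 > y)
        if crosses and x < x1 + (y - y1) * (x2 - x1) / (y2 - y1):
            inside = not inside
    return inside

def circle_select(ui_elements, path):
    """Return the UI elements whose centers fall inside the traced boundary."""
    return [name for name, pos in ui_elements.items() if point_in_path(pos, path)]

# A rough circle traced around the icon at (5, 5) but not the one at (12, 2).
traced = [(2, 2), (8, 2), (9, 5), (8, 8), (2, 8), (1, 5)]
print(circle_select({"photo.jpg": (5, 5), "trash": (12, 2)}, traced))   # ['photo.jpg']
```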
To detect an “X to delete gesture,” either a touch sensor panel with some hover detecting capability or a touch sensor panel co-located with a proximity sensor panel can be employed.
After stroke 2206 is completed, the user can then lift the finger off the sensor panel and begin a second stroke 2208 shown in
If an “X to delete gesture” is detected, parameters can be computed such as the intersection between the first and second strokes 2206 and 2210. The UI element 2202 beneath the intersection between the first and second strokes 2206 and 2210 can then be selected for deletion. Those UI elements that did not appear under the intersection would not be deleted. Note that if, for example, after some time has passed, the second stroke 2210 is not detected, this can indicate that the user has abandoned the “X to delete gesture” or that it was never intended to be an “X to delete gesture,” and in that case no UI elements would be selected and deleted.
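Computing the deletion point is a line-segment intersection between the two strokes. The sketch below solves the two parametric equations and returns the crossing point so the UI element beneath it can be looked up; the stroke endpoints in the example are illustrative assumptions.

```python
def stroke_intersection(stroke_a, stroke_b):
    """Return the crossing point of two straight strokes, or None if they miss.

    Each stroke is ((x1, y1), (x2, y2)); the strokes are treated as line segments
    and the intersection is found by solving the two parametric equations.
    """
    (ax1, ay1), (ax2, ay2) = stroke_a
    (bx1, by1), (bx2, by2) = stroke_b
    denom = (ax2 - ax1) * (by2 - by1) - (ay2 - ay1) * (bx2 - bx1)
    if denom == 0:
        return None                                   # parallel strokes never cross
    t = ((bx1 - ax1) * (by2 - by1) - (by1 - ay1) * (bx2 - bx1)) / denom
    u = ((bx1 - ax1) * (ay2 - ay1) - (by1 - ay1) * (ax2 - ax1)) / denom
    if not (0 <= t <= 1 and 0 <= u <= 1):
        return None                                   # the segments do not overlap
    return (ax1 + t * (ax2 - ax1), ay1 + t * (ay2 - ay1))

# Two diagonal strokes drawn over an icon centered near (5, 5).
print(stroke_intersection(((2, 2), (8, 8)), ((2, 8), (8, 2))))   # -> (5.0, 5.0)
```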
To detect a “knock to inquire gesture,” either a touch sensor panel with some hover detecting capability or a touch sensor panel co-located with a proximity sensor panel can be employed.
If a “knock to inquire gesture” is detected, parameters can be computed such as the approximate center location 2308 of the knuckle on the middle finger. The UI element 2302 beneath center location 2308 can then be associated with the gesture and selected for inquiry. Those UI elements that did not appear under center location 2308 would not be selected. Note that if, for example, after a first group of images 2306 are detected, similar groups of images are not received in rapid succession, this can indicate that a “knock to inquire gesture” is not being performed.
After the “knock to inquire gesture” has been detected and the corresponding UI element has been selected, further actions can be taken with regard to the selected UI element. For example, if the selected UI element is a shared resource such as a limited access application, an inquiry as to whether the application is available to be checked out can be initiated.
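Distinguishing a knock from a resting hand comes down to timing: similar contact groups must arrive in rapid succession. A minimal sketch, with the interval bounds as illustrative assumptions:

```python
def is_knock(contact_times, min_gap=0.05, max_gap=0.4):
    """Return True if successive contact-group timestamps look like a double knock.

    contact_times: times (in seconds) at which a knuckle-shaped contact group was
    detected. Two detections separated by a short, bounded gap count as a knock;
    a single detection or a long pause does not.
    """
    if len(contact_times) < 2:
        return False
    gap = contact_times[1] - contact_times[0]
    return min_gap <= gap <= max_gap

print(is_knock([0.00, 0.22]))   # True: rapid succession
print(is_knock([0.00, 1.50]))   # False: too slow to be a knock
```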
To detect a “hitchhiker gesture,” either a touch sensor panel with some hover detecting capability or a touch sensor panel co-located with a proximity sensor panel can be employed.
To detect a shape gesture, either a touch sensor panel with some hover detecting capability or a touch sensor panel co-located with a proximity sensor panel can be employed.
In any of these examples, the images can be converted to features, which can then be classified, grouped and interpreted as a particular gesture. Each gesture described above can then cause a particular operation to be performed with regard to UI elements appearing with a certain relationship to the gesture (e.g., UI elements within the triangle formed by the “triangle shape gesture” can be selected, zoomed, and the like). In another example, if multiple touch images taken over time indicate that the “V shape gesture” is being shifted upwards, UI elements within the “V” of the “V shape gesture” could be grouped and translated (moved) upward in accordance with the motion of the “V shape gesture.” A similar concept could apply to the “inverted V shape gesture.” The “wing shape gesture” could, for example, trigger the desktop faxing or sending of a UI element under the gesture (e.g. a document) or trigger the e-mailing of a document over the Internet.
To detect an “identification gesture,” either a touch sensor panel with some hover detecting capability or a touch sensor panel co-located with a proximity sensor panel can be employed.
A time component can optionally be attached to an “identification gesture.” For example, the entire “identification gesture” can be required to be completed within a certain period of time (e.g. a few seconds), otherwise the gesture will be rejected.
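The optional time component can be modeled as a deadline over the sequence of sub-gestures, as in the sketch below; the step names and the two-second budget are illustrative assumptions.

```python
def identification_accepted(observed_steps, expected_steps, time_limit=2.0):
    """Accept an identification gesture only if the right steps arrive in order and in time.

    observed_steps: list of (step_name, timestamp_seconds) as the user performs the gesture.
    expected_steps: ordered list of step names making up the registered gesture.
    """
    names = [name for name, _ in observed_steps]
    if names != expected_steps:
        return False                                  # wrong or out-of-order sub-gestures
    elapsed = observed_steps[-1][1] - observed_steps[0][1]
    return elapsed <= time_limit                      # whole gesture within the time budget

registered = ["left_palm", "right_two_fingers"]
print(identification_accepted([("left_palm", 0.0), ("right_two_fingers", 1.2)], registered))  # True
print(identification_accepted([("left_palm", 0.0), ("right_two_fingers", 3.5)], registered))  # False
```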
In another embodiment illustrated in
In another embodiment illustrated in
To detect a “hand edge gesture,” either a touch sensor panel with some hover detecting capability or a touch sensor panel co-located with a proximity sensor panel can be employed.
If the images and corresponding features are tracked over time, the completion of various “hand edge gestures” can be accurately detected. At that point, one or more UI elements appearing on the touch screen at the time the gesture was detected can be associated with the “hand edge gesture.” After a particular “hand edge gesture” has been detected and the corresponding UI elements have been associated with the gesture, various actions can be taken with regard to the associated UI elements. For example, the detection of a hand edge gesture, without more, can be used to quickly “orient” an application or control along the elongated axis defined by the orientation of the touch image. A rotational sweep of the touch image from vertical to horizontal, can be used to “sweep away” the UI elements, or dismiss the normal desktop to bring up alternative desktops, switch users, lock the screen, and the like. A sweep from horizontal to vertical, a straightening of fingers to vertical, or a flicking action can be used to reverse the operation, e.g. resume normal desktop/workspace. Sweeping the touch image from left edge to right edge can be used to perform an erase operation (e.g. clear a drawing), switch to a workspace or virtual desktop on the right, or perform a continuous redo (or forward on a timeline). Sweeping the touch image from right edge to left edge can be used to switch to a workspace/virtual desktop on the left or perform a continuous undo (or backup on a timeline). For instance, in a drawing application, these gestures could undo/redo one pivot point at a time on drawn paths, polygons or bezier curves. Sweeping from top to bottom edge or vice versa can similarly clear a drawing, switch workspaces, users, or virtual desktops, or hide an application or UI element, while sweeping from bottom to top can be used to maximize the application or UI element. “Sawing” the gesture up and down can be used to vertically split the window or view.
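The sweep-to-action mapping described above lends itself to a dispatch table keyed by the start and end orientation (or edge) of the tracked hand-edge image. The table below simply echoes the examples in the text; the orientation labels, and the classification of the touch image into those labels, are illustrative assumptions.

```python
# Map (start, end) of the tracked hand-edge image to the operations described above.
HAND_EDGE_ACTIONS = {
    ("vertical", "horizontal"): "sweep away / show alternative desktop",
    ("horizontal", "vertical"): "resume normal desktop or workspace",
    ("left_edge", "right_edge"): "clear drawing / continuous redo / workspace right",
    ("right_edge", "left_edge"): "continuous undo / workspace left",
    ("top_edge", "bottom_edge"): "hide application or UI element",
    ("bottom_edge", "top_edge"): "maximize application or UI element",
}

def hand_edge_action(start_pose, end_pose):
    """Look up the operation for a completed hand edge gesture (None if unrecognized)."""
    return HAND_EDGE_ACTIONS.get((start_pose, end_pose))

print(hand_edge_action("vertical", "horizontal"))   # sweep away / show alternative desktop
```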
In any of these exemplary identification gestures, the images can be converted to features, which can then be classified, grouped and interpreted as an identification gesture. Each identification gesture can then be associated with a particular user and UI element for which access is sought, and depending on whether access rights have been satisfied, the user may or may not be granted access to the UI element.
Although the present invention has been fully described in connection with embodiments thereof with reference to the accompanying drawings, it is to be noted that various changes and modifications will become apparent to those skilled in the art. Such changes and modifications are to be understood as being included within the scope of the present invention as defined by the appended claims.
Claims
1. A method comprising:
- at an electronic device in communication with a touch screen: detecting, at the touch screen, one or more touch inputs; and in response to detecting the one or more touch inputs: in accordance with a determination that the one or more touch inputs satisfy user interface element-display criteria, including a criterion that is satisfied when the one or more touch inputs correspond to two hands concurrently touching the touch screen, displaying, on the touch screen, a user interface element, wherein the user interface element was not displayed on the touch screen prior to detecting the one or more touch inputs.
2. The method of claim 1, further comprising:
- in response to detecting the one or more touch inputs: in accordance with a determination that the one or more touch inputs do not correspond to two hands concurrently touching the touch screen, forgoing displaying, on the touch screen, the user interface element.
3. The method of claim 1, wherein displaying the user interface element on the touch screen comprises displaying the user interface element within a region defined by the two hands on the touch screen.
4. The method of claim 3, wherein the region defined by the two hands on the touch screen includes an area between the two hands on the touch screen.
5. The method of claim 3, wherein the two hands on the touch screen do not overlap with the defined region on the touch screen.
6. The method of claim 1, wherein the user interface element is a detail portal, a magnifier, a dialog or a photo.
7. The method of claim 1, wherein the user interface element-display criteria further include a criterion that is satisfied when the two hands have a predefined shape.
8. The method of claim 7, wherein the predefined shape corresponds to an upside-down U shape.
9. An electronic device comprising:
- one or more processors;
- a touch screen; and
- memory storing instructions, which when executed by the one or more processors, cause the one or more processors to perform a method comprising:
- detecting, at the touch screen, one or more touch inputs; and
- in response to detecting the one or more touch inputs: in accordance with a determination that the one or more touch inputs satisfy user interface element-display criteria, including a criterion that is satisfied when the one or more touch inputs correspond to two hands concurrently touching the touch screen, displaying, on the touch screen, a user interface element, wherein the user interface element was not displayed on the touch screen prior to detecting the one or more touch inputs.
10. A non-transitory computer-readable storage medium storing instructions, which when executed by one or more processors of an electronic device in communication with a touch screen, cause the one or more processors to perform a method comprising:
- detecting, at the touch screen, one or more touch inputs; and
- in response to detecting the one or more touch inputs: in accordance with a determination that the one or more touch inputs satisfy user interface element-display criteria, including a criterion that is satisfied when the one or more touch inputs correspond to two hands concurrently touching the touch screen, displaying, on the touch screen, a user interface element, wherein the user interface element was not displayed on the touch screen prior to detecting the one or more touch inputs.
Type: Application
Filed: Nov 6, 2017
Publication Date: Mar 1, 2018
Inventors: Wayne Carl WESTERMAN (Burlingame, CA), Myra Mary HAGGERTY (San Mateo, CA)
Application Number: 15/804,466