GESTURES FOR TOUCH SENSITIVE INPUT DEVICES
Methods and systems for processing touch inputs are disclosed. The invention in one respect includes reading data from a multipoint sensing device such as a multipoint touch screen where the data pertains to touch input with respect to the multipoint sensing device, and identifying at least one multipoint gesture based on the data from the multipoint sensing device.
Latest Apple Patents:
This application is a continuation of U.S. patent application Ser. No. 10/903,964 filed Jul. 30, 2004, which is hereby incorporated by reference in its entirety.
BACKGROUND OF THE INVENTION1. Field of the Invention
The present invention relates generally to gesturing associated with touch sensitive devices.
2. Description of the Related Art
There exist today many styles of input devices for performing operations in a computer system. The operations generally correspond to moving a cursor and making selections on a display screen. The operations may also include paging, scrolling, panning, zooming, etc. By way of example, the input devices may include buttons, switches, keyboards, mice, trackballs, touch pads, joy sticks, touch screens and the like. Each of these devices has advantages and disadvantages that are taken into account when designing the computer system.
Buttons and switches are generally mechanical in nature and provide limited control with regards to the movement of the cursor and making selections. For example, they are generally dedicated to moving the cursor in a specific direction (e.g., arrow keys) or to making specific selections (e.g., enter, delete, number, etc.).
In mice, the movement of the input pointer corresponds to the relative movements of the mouse as the user moves the mouse along a surface. In trackballs, the movement of the input pointer corresponds to the relative movements of a ball as the user moves the ball within a housing. Mice and trackballs also include one or more buttons for making selections. Mice may also include scroll wheels that allow a user to move through the GUI by simply rolling the wheel forward or backward.
With touch pads, the movement of the input pointer corresponds to the relative movements of the user's finger (or stylus) as the finger is moved along a surface of the touch pad. Touch screens, on the other hand, are a type of display screen that has a touch-sensitive transparent panel covering the screen. When using a touch screen, a user makes a selection on the display screen by pointing directly to GUI objects on the screen (usually with a stylus or finger).
In order to provide additionally functionality, gestures have been implemented with some of these input devices. By way of example, in touch pads, selections may be made when one or more taps are detected on the surface of the touch pad. In some cases, any portion of the touch pad may be tapped, and in other cases a dedicated portion of the touch pad may be tapped. In addition to selections, scrolling may be initiated by using finger motion at the edge of the touch pad.
U.S. Pat. Nos. 5,612,719 and 5,590,219, assigned to Apple Computer, Inc. describe some other uses of gesturing. U.S. Pat. No. 5,612,719 discloses an onscreen button that is responsive to at least two different button gestures made on the screen on or near the button. U.S. Pat. No. 5,590,219 discloses a method for recognizing an ellipse-type gesture input on a display screen of a computer system.
In recent times, more advanced gestures have been implemented. For example, scrolling may be initiated by placing four fingers on the touch pad so that the scrolling gesture is recognized and thereafter moving these fingers on the touch pad to perform scrolling events. The methods for implementing these advanced gestures, however, has several drawbacks. By way of example, once the gesture is set, it cannot be changed until the user resets the gesture state. In touch pads, for example, if four fingers equals scrolling, and the user puts a thumb down after the four fingers are recognized, any action associated with the new gesture including four fingers and the thumb will not be performed until the entire hand is lifted off the touch pad and put back down again (e.g., reset). Simply put, the user cannot change gesture states midstream. Along a similar vein, only one gesture may be performed at any given time. That is, multiple gestures cannot be performed simultaneously.
Based on the above, there is a need for improvements in the way gestures are performed on touch sensitive devices.
SUMMARY OF THE INVENTIONThe invention relates, in one embodiment, to a computer implemented method for processing touch inputs. The method includes reading data from a multipoint touch screen. The data pertains to touch input with respect to the touch screen. The method also includes identifying at least one multipoint gesture based on the data from the multipoint touch screen.
The invention relates, in another embodiment to a gestural method. The method includes detecting multiple touches at different points on a touch sensitive surface at the same time. The method also includes segregating the multiple touches into at least two separate gestural inputs occurring simultaneously. Each gestural input has a different function such as zooming, panning, rotating and the like.
The invention relates, in another embodiment to a gestural method. The method includes concurrently detecting a plurality of gestures that are concurrently performed with reference to a touch sensing device. The method also includes producing different commands for each of the gestures that have been detected.
The invention relates, in another embodiment to a gestural method. The method includes displaying a graphical image on a display screen. The method also includes detecting a plurality of touches at the same time on a touch sensitive device. The method further includes linking the detected multiple touches to the graphical image presented on the display screen.
The invention relates, in another embodiment to a method of invoking a user interface element on a display via a multipoint touch screen of a computing system. The method includes detecting and analyzing the simultaneous presence of two or more objects in contact with the multipoint touch screen. The method also includes selecting a user interface tool, from a plurality of available tools, to display on a display for interaction by a user of the computing system based at least in part the analyzing. The method further includes controlling the interface tool based at least in part on the further movement of the objects in relation to the multipoint touch screen.
The invention relates, in another embodiment, to a touch-based method. The method includes detecting a user input that occurs over a multipoint sensing device. The user input includes one or more inputs. Each input has a unique identifier. The method also includes, during the user input, classifying the user input as a tracking or selecting input when the user input includes one unique identifier or a gesture input when the user input includes at least two unique identifiers. The method further includes performing tracking or selecting during the user input when the user input is classified as a tracking or selecting input. The method additionally includes performing one or more control actions during the user input when the user input is classified as a gesturing input. The control actions being based at least in part on changes that occur between the at least two unique identifiers.
The invention relates, in another embodiment, to a touch-based method. The method includes outputting a GUI on a display. The method also includes detecting a user input on a touch sensitive device. The method further includes analyzing the user input for characteristics indicative of tracking, selecting or a gesturing. The method additionally includes categorizing the user input as a tracking, selecting or gesturing input. The method further includes performing tracking or selecting in the GUI when the user input is categorized as a tracking or selecting input. Moreover, the method includes performing control actions in the GUI when the user input is categorized as a gesturing input, the actions being based on the particular gesturing input.
The invention relates, in another embodiment, to a touch-based method. The method includes capturing an initial touch image. The method also includes determining the touch mode based on the touch image. The method further includes capturing the next touch image. The method further includes determining if the touch mode changed between the initial and next touch images. The method additionally includes, if the touch mode changed, setting the next touch image as the initial touch image and determining the touch mode based on the new initial touch image. Moreover, the method includes, if the touch mode stayed the same, comparing the touch images and performing a control function based on the comparison.
The invention relates, in another embodiment, to a computer implemented method for processing touch inputs. The method includes reading data from a touch screen. The data pertaining to touch input with respect to the touch screen, and the touch screen having a multipoint capability. The method also includes converting the data to a collection of features. The method further includes classifying the features and grouping the features into one or more feature groups. The method additionally includes calculating key parameters of the feature groups and associating the feature groups to user interface elements on a display.
The invention relates, in another embodiment, to a computer implemented method. The method includes outputting a graphical image. The method also includes receiving a multitouch gesture input over the graphical image. The method further includes changing the graphical image based on and in unison with multitouch gesture input.
The invention relates, in another embodiment, to a touch based method. The method includes receiving a gestural input over a first region. The method also includes generating a first command when the gestural input is received over the first region. The method further includes receiving the same gestural input over a second region. The method additionally includes generating a second command when the same gestural input is received over the second region. The second command being different than the first command.
The invention relates, in another embodiment, to a method for recognizing multiple gesture inputs. The method includes receiving a multitouch gestural stroke on a touch sensitive surface. The multitouch gestural stroke maintaining continuous contact on the touch sensitive surface. The method also includes recognizing a first gesture input during the multitouch gestural stroke. The method further includes recognizing a second gesture input during the multitouch gestural stroke.
The invention relates, in another embodiment, to a computer implemented method. The method includes detecting a plurality of touches on a touch sensing device. The method also includes forming one or more touch groups with the plurality of touches. The method further includes monitoring the movement of and within each of the touch groups. The method additionally includes generating control signals when the touches within the touch groups are moved or when the touch groups are moved in their entirety.
It should be noted that in each of the embodiments described above, the methods may be implemented using a touch based input device such as a touch screen or touch pad, more particularly a multipoint touch based input device, and even more particularly a multipoint touch screen. It should also be noted that the gestures, gesture modes, gestural inputs, etc. may correspond to any of those described below in the detailed description. For example, the gestures may be associated with zooming, panning, scrolling, rotating, enlarging, floating controls, zooming targets, paging, inertia, keyboarding, wheeling, and/or the like.
The invention will be readily understood by the following detailed description in conjunction with the accompanying drawings, wherein like reference numerals designate like structural elements, and in which:
The invention generally pertains to gestures and methods of implementing gestures with touch sensitive devices. Examples of touch sensitive devices include touch screens and touch pads. One aspect of the invention relates to recognizing at least two simultaneously occurring gestures. Another aspect of the invention relates to displaying a graphical image and linking different touches that occur to the graphical image. Another aspect of the invention relates to immediately recognizing gestures so that actions associated with the gestures can be implemented at the same time. Another aspect of the invention relates to changing a displayed image based on and in unison with a gestural input, i.e., the displayed image continuously changes with changes in the gestural input such that the displayed image continuously follows the gestural input. Another aspect of the invention relates to implementing an input mode based on the number of fingers (or other object) in contact with the input device. Another aspect of the invention relates to providing region sensitivity where gestures mean different things when implemented over different areas of the input device. Another aspect of the invention relates to changing an input while making continuous contact with the touch sensitive surface of the touch sensitive device.
These and other aspects of the invention are discussed below with reference to
The exemplary computer system 50 shown in
In most cases, the processor 56 together with an operating system operates to execute computer code and produce and use data. Operating systems are generally well known and will not be described in greater detail. By way of example, the operating system may correspond to OS/2, DOS, Unix, Linux, Palm OS, and the like. The operating system can also be a special purpose operating system, such as may be used for limited purpose appliance-type computing devices. The operating system, other computer code and data may reside within a memory block 58 that is operatively coupled to the processor 56. Memory block 58 generally provides a place to store computer code and data that are used by the computer system 50. By way of example, the memory block 58 may include Read-Only Memory (ROM), Random-Access Memory (RAM), hard disk drive and/or the like. The information could also reside on a removable storage medium and loaded or installed onto the computer system 50 when needed. Removable storage mediums include, for example, CD-ROM, PC-CARD, memory card, floppy disk, magnetic tape, and a network component.
The computer system 50 also includes a display device 68 that is operatively coupled to the processor 56. The display device 68 may be a liquid crystal display (LCD) (e.g., active matrix, passive matrix and the like). Alternatively, the display device 68 may be a monitor such as a monochrome display, color graphics adapter (CGA) display, enhanced graphics adapter (EGA) display, variable-graphics-array (VGA) display, super VGA display, cathode ray tube (CRT), and the like. The display device may also correspond to a plasma display or a display implemented with electronic inks.
The display device 68 is generally configured to display a graphical user interface (GUI) 69 that provides an easy to use interface between a user of the computer system and the operating system or application running thereon. Generally speaking, the GUI 69 represents, programs, files and operational options with graphical images. The graphical images may include windows, fields, dialog boxes, menus, icons, buttons, cursors, scroll bars, etc. Such images may be arranged in predefined layouts, or may be created dynamically to serve the specific actions being taken by a user. During operation, the user can select and activate various graphical images in order to initiate functions and tasks associated therewith. By way of example, a user may select a button that opens, closes, minimizes, or maximizes a window, or an icon that launches a particular program. The GUI 69 can additionally or alternatively display information, such as non interactive text and graphics, for the user on the display device 68.
The computer system 50 also includes an input device 70 that is operatively coupled to the processor 56. The input device 70 is configured to transfer data from the outside world into the computer system 50. The input device 70 may for example be used to perform tracking and to make selections with respect to the GUI 69 on the display 68. The input device 70 may also be used to issue commands in the computer system 50. The input device 70 may include a touch sensing device configured to receive input from a user's touch and to send this information to the processor 56. By way of example, the touch-sensing device may correspond to a touchpad or a touch screen. In many cases, the touch-sensing device recognizes touches, as well as the position and magnitude of touches on a touch sensitive surface. The touch sensing means reports the touches to the processor 56 and the processor 56 interprets the touches in accordance with its programming. For example, the processor 56 may initiate a task in accordance with a particular touch. A dedicated processor can be used to process touches locally and reduce demand for the main processor of the computer system. The touch sensing device may be based on sensing technologies including but not limited to capacitive sensing, resistive sensing, surface acoustic wave sensing, pressure sensing, optical sensing, and/or the like. Furthermore, the touch sensing means may be based on single point sensing or multipoint sensing. Single point sensing is capable of only distinguishing a single touch, while multipoint sensing is capable of distinguishing multiple touches that occur at the same time.
The input device 70 may be a touch screen that is positioned over or in front of the display 68. The touch screen 70 may be integrated with the display device 68 or it may be a separate component. The touch screen 70 has several advantages over other input technologies such as touchpads, mice, etc. For one, the touch screen 70 is positioned in front of the display 68 and therefore the user can manipulate the GUI 69 directly. For example, the user can simply place their finger over an object to be controlled. In touch pads, there is no one-to-one relationship such as this. With touchpads, the touchpad is placed away from the display typically in a different plane. For example, the display is typically located in a vertical plane and the touchpad is typically located in a horizontal plane. This makes its use less intuitive, and therefore more difficult when compared to touch screens. In addition to being a touch screen, the input device 70 can be a multipoint input device. Multipoint input devices have advantages over conventional singlepoint devices in that they can distinguish more than one object (finger). Singlepoint devices are simply incapable of distinguishing multiple objects. By way of example, a multipoint touch screen, which can be used herein, is shown and described in greater detail in copending and commonly assigned U.S. patent application Ser. No. 10/840,862, which is hereby incorporated herein by reference.
The computer system 50 also includes capabilities for coupling to one or more I/O devices 80. By way of example, the I/O devices 80 may correspond to keyboards, printers, scanners, cameras, speakers, and/or the like. The I/O devices 80 may be integrated with the computer system 50 or they may be separate components (e.g., peripheral devices). In some cases, the I/O devices 80 may be connected to the computer system 50 through wired connections (e.g., cables/ports). In other cases, the I/O devices 80 may be connected to the computer system 80 through wireless connections. By way of example, the data link may correspond to PS/2, USB, IR, RF, Bluetooth or the like.
In accordance with one embodiment of the present invention, the computer system 50 is designed to recognize gestures 85 applied to the input device 70 and to control aspects of the computer system 50 based on the gestures 85. In some cases, a gesture is defined as a stylized interaction with an input device that is mapped to one or more specific computing operations. The gestures 85 may be made through various hand, and more particularly finger motions. Alternatively or additionally, the gestures may be made with a stylus. In all of these cases, the input device 70 receives the gestures 85 and the processor 56 executes instructions to carry out operations associated with the gestures 85. In addition, the memory block 58 may include a gesture operational program 88, which may be part of the operating system or a separate application. The gestural operation program 88 generally includes a set of instructions that recognizes the occurrence of gestures 85 and informs one or more software agents of the gestures 85 and/or what action(s) to take in response to the gestures 85.
When a user performs one or more gestures, the input device 70 relays gesture information to the processor 56. Using instructions from memory 58, and more particularly, the gestural operational program 88, the processor 56 interprets the gestures 85 and controls different components of the computer system 50, such as memory 58, a display 68 and I/O devices 80, based on the gestures 85. The gestures 85 may be identified as commands for performing actions in applications stored in the memory 58, modifying GUI objects shown on the display 68, modifying data stored in memory 58, and/or for performing actions in I/O devices 80. By way of example, the commands may be associated with zooming, panning, scrolling, paging, rotating, sizing, and the like. As further examples, the commands may also be associated with launching a particular program, opening a file or document, viewing a menu, making a selection, executing instructions, logging onto the computer system, permitting authorized individuals access to restricted areas of the computer system, loading a user profile associated with a user's preferred arrangement of the computer desktop, and/or the like.
A wide range of different gestures can be utilized. By way of example, the gestures may be single point or multipoint gestures; static or dynamic gestures; continuous or segmented gestures; and/or the like. Single point gestures are those gestures that are performed with a single contact point, e.g., the gesture is performed with a single touch as for example from a single finger, a palm or a stylus. Multipoint gestures are those gestures that can be performed with multiple points, e.g., the gesture is performed with multiple touches as for example from multiple fingers, fingers and palms, a finger and a stylus, multiple styli and/or any combination thereof. Static gestures are those gestures that do not include motion, and dynamic gestures are those gestures that do include motion. Continuous gestures are those gestures that are performed in a single stroke, and segmented gestures are those gestures that are performed in a sequence of distinct steps or strokes.
In one embodiment, the computer system 50 is configured to register multiple gestures at the same time, i.e., multiple gestures can be performed simultaneously. By way of example, a zoom gesture may be performed at the same time as a rotate gesture, or a rotate gesture may be performed at the same time as a pan gesture. In one particular implementation, zoom, rotate and pan gestures can all occur simultaneously in order to perform zooming, rotating and panning at the same time.
In another embodiment, the system is configured to immediately recognize the gestures so that actions associated with the gestures can be implemented at the same time as the gesture, i.e., the gesture and action simultaneously occur side by side rather than being a two-step process. By way of example, during a scrolling gesture, the screen moves with the finger motion.
In another embodiment, an object presented on a display 68 continuously follows the gesture occurring on a touch screen. There is a one to one relationship between the gesture being performed and the objects shown on the display 68. For example, as the gesture is performed, modifications simultaneously occur to the objects located underneath the gesture. For example, during a zooming gesture, the fingers may spread apart or close together in order to cause the object shown on the display 68 to zoom in during the spreading and zoom out during the closing. During this operation, the computer system 50 recognizes the user input as a zoom gesture, determines what action should be taken, and outputs control data to the appropriate device, in this case the display 68.
In another embodiment, the computer system 50 provides region sensitivity where gestures mean different things when implemented over different areas of the input device 68. For example, a rotation gesture over a volume knob causes volume increase/decrease, whereas a rotation gesture over a photo causes rotation of the photo.
In another embodiment, the number of fingers in contact with the touch screen may indicate an input mode. For example, a single touch as for example by a single finger may indicate the desire to perform tracking, i.e., pointer or cursor movements, or selections, whereas multiple touches as for example by a group of fingers may indicate the desire to perform gesturing. The number of fingers for implementing gesturing may be widely varied. By way of example, two fingers may indicate a first gesture mode, three fingers may indicate a third gesture mode, etc. Alternatively, any number of fingers, i.e., more than one, may be used for the same gesture mode, which can include one or more gesture controls. The orientation of the fingers may similarly be used to denote the desired mode. The profile of the finger may be detected to permit different modal operations based on whether the user has used his thumb or index finger, for example.
In another embodiment, an input can be changed while making a continuous stroke on the input device without stopping the stroke (e.g., lifting off the touch sensitive surface). In one implementation, the user can switch from a tracking (or selection) mode to gesturing mode while a stroke is being made. For example, tracking or selections may be associated with a single finger and gesturing may be associated with multiple fingers; therefore, the user can toggle between tracking/selection and gesturing by picking up and placing down a second finger on the touch screen. In another implementation, the user can switch from one gesture mode to another gesture mode while a stroke is being made. For example, zooming may be associated with spreading a pair of fingers and rotating may be associated with rotating the pair of fingers; therefore, the user can toggle between zooming and rotating by alternating the movement of their fingers between spreading and rotating. In yet another implementation, the number of gesture inputs can be changed while a stroke is being made (e.g., added or subtracted). For example, during zooming where the fingers are spread apart, the user may further rotate their fingers to initiate both zooming and rotation. Furthermore during zooming and rotation, the user can stop spreading their fingers so that only rotation occurs. In other words, the gesture inputs can be continuously input, either simultaneously or consecutively.
In one particular embodiment, a single finger initiates tracking (or selection) and two or more fingers in close proximity to one another initiates scrolling or panning. Two fingers is generally preferred so as to provide easy toggling between one and two fingers, i.e., the user can switch between modes very easily by simply picking or placing an additional finger. This has the advantage of being more intuitive than other forms of mode toggling. During tracking, cursor movement is controlled by the user moving a single finger on the touch sensitive surface of a touch sensing device. The sensor arrangement of the touch sensing device interprets the finger motion and generates signals for producing corresponding movement of the cursor on the display. During scrolling, screen movement is controlled by the user moving dual fingers on the touch sensitive surface of the touch sensing device. When the combined fingers are moved in the vertical direction, the motion is interpreted as a vertical scroll event, and when the combined fingers are moved in the horizontal direction, the motion is interpreted as a horizontal scroll event. The same can be said for panning although panning can occur in all directions rather than just the horizontal and vertical directions.
The term “scrolling” as used herein generally pertains to moving displayed data or images (e.g., text or graphics) across a viewing area on a display screen so that a new set of data (e.g., line of text or graphics) is brought into view in the viewing area. In most cases, once the viewing area is full, each new set of data appears at the edge of the viewing area and all other sets of data move over one position. That is, the new set of data appears for each set of data that moves out of the viewing area. In essence, the scrolling function allows a user to view consecutive sets of data currently outside of the viewing area. The viewing area may be the entire viewing area of the display screen or it may only be a portion of the display screen (e.g., a window frame).
As mentioned above, scrolling may be implemented vertically (up or down) or horizontally (left or right). In the case of vertical scrolling, when a user scrolls down, each new set of data appears at the bottom of the viewing area and all other sets of data move up one position. If the viewing area is full, the top set of data moves out of the viewing area. Similarly, when a user scrolls up, each new set of data appears at the top of the viewing area and all other sets of data move down one position. If the viewing area is full, the bottom set of data moves out of the viewing area.
By way of example, the display screen, during operation, may display a list of media items (e.g., songs). A user is able to linearly scroll through the list of media items by moving his or her finger across a touch screen. As the finger moves across the touch screen, the displayed items from the list of media items are varied such that the user is able to effectively scroll through the list of media items. In most cases, the user is able to accelerate their traversal of the list of media items by moving his or her finger at greater speeds. Some embodiments, which may be related to the above example, are described in greater detail below. See for example
Following block 102, multipoint processing method 100 proceeds to block 104 where the image is converted into a collection or list of features. Each feature represents a distinct input such as a touch. In most cases, each feature includes its own unique identifier (ID), x coordinate, y coordinate, Z magnitude, angle θ, area A, and the like. By way of example,
The conversion from data or images to features may be accomplished using methods described in copending U.S. patent application Ser. No. 10/840,862 which is hereby incorporated herein by reference. As disclosed therein, the raw data is received. The raw data is typically in a digitized form, and includes values for each node of the touch screen. The values may be between 0 and 256 where 0 equates to no touch pressure and 256 equates to full touch pressure. Thereafter, the raw data is filtered to reduce noise. Once filtered, gradient data, which indicates the topology of each group of connected points, is generated. Thereafter, the boundaries for touch regions are calculated based on the gradient data, i.e., a determination is made as to which points are grouped together to form each touch region. By way of example, a watershed algorithm may be used. Once the boundaries are determined, the data for each of the touch regions are calculated (e.g., x, y, Z, θ, A).
Following block 104, multipoint processing method 100 proceeds to block 106 where feature classification and groupings are performed. During classification, the identity of each of the features is determined. For example, the features may be classified as a particular finger, thumb, palm or other object. Once classified, the features may be grouped. The manner in which the groups are formed can widely varied. In most cases, the features are grouped based on some criteria (e.g., they carry a similar attribute). For example, the two features shown in
Following block 106, the multipoint processing method 100 proceeds to block 108 where key parameters for the feature groups are calculated. The key parameters may include distance between features, x/y centroid of all features, feature rotation, total pressure of the group (e.g., pressure at centroid), and the like. As shown in
Following block 108, the process flow proceeds to block 110 where the group is or associated to a user interface (UI) element. UI elements are buttons boxes, lists, sliders, wheels, knobs, etc. Each UI element represents a component or control of the user interface. The application behind the UI element(s) has access to the parameter data calculated in block 108. In one implementation, the application ranks the relevance of the touch data to the UI element corresponding there to. The ranking may be based on some predetermine criteria. The ranking may include producing a figure of merit, and whichever UI element has the highest figure of merit, giving it sole access to the group. There may even be some degree of hysteresis as well (once one of the UI elements claims control of that group, the group sticks with the UI element until another UI element has a much higher ranking). By way of example, the ranking may include determining proximity of the centroid (or features) to the GUI object associated with the UI element.
Following block 110, the multipoint processing method 100 proceeds to blocks 112 and 114. The blocks 112 and 114 can be performed approximately at the same time. From the user perspective, in one embodiment, the blocks 112 and 114 appear to be performed concurrently. In block 112, one or more actions are performed based on differences between initial and current parameter values as well as the UI element to which they are associated. In block 114, user feedback pertaining to the one or more action being performed is provided. By way of example, user feedback may include display, audio, tactile feedback and/or the like.
The above methods and techniques can be used to implement any number of GUI interface objects and actions. For example, gestures can be created to detect and effect a user command to resize a window, scroll a display, rotate an object, zoom in or out of a displayed view, delete or insert text or other objects, etc. Gestures can also be used to invoke and manipulate virtual control interfaces, such as volume knobs, switches, sliders, handles, knobs, doors, and other widgets that may be created to facilitate human interaction with the computing system.
To cite an example using the above methodologies, and referring to
Once knob 170 is displayed as shown in
As shown in
As shown in
As shown in
As shown in
Also as shown in
As shown in
It should be noted that additional gestures can be performed simultaneously with the virtual control knob gesture. For example, more than one virtual control knob can be controlled at the same time using both hands, i.e., one hand for each virtual control knob. Alternatively or additionally, one or more slider bars can be controlled at the same time as the virtual control knob, i.e., one hand operates the virtual control knob, while at least one finger and maybe more than one finger of the opposite hand operates at least one slider and maybe more than one slider bar, e.g., slider bar for each finger.
It should also be noted that although the embodiment is described using a virtual control knob, in another embodiment, the UI element can be a virtual scroll wheel. As an example, the virtual scroll wheel can mimic an actual scroll wheel such as those described in U.S. Patent Publication Nos: 2003/0076303A1, 2003/0076301A1, 2003/0095096A1, which are all herein incorporated by reference. For example, when the user places their finger on the surface of the virtual scroll wheel and makes a swirling, rotational or tangential gesture motion, a scrolling action can be performed with respect to a list of items displayed in a window.
If the user input is classified as a gesture input, the touch-based method 200 proceeds to block 208 where one or more gesture control actions are performed corresponding the user input. The gesture control actions are based at least in part on changes that occur with or between the at least two unique identifiers.
Following block 304 the touch-based method 300 proceeds to block 306 where the GUI object is modified based on and in unison with the gesture input. By modified, it is meant that the GUI object changes according to the particular gesture or gestures being performed. By in unison, it is meant that the changes occur approximately while the gesture or gestures are being performed. In most cases, there is a one to one relationship between the gesture(s) and the changes occurring at the GUI object and they occur substantially simultaneously. In essence, the GUI object follows the motion of the fingers. For example, spreading of the fingers may simultaneously enlarge the object, closing of the fingers may simultaneously reduce the GUI object, rotating the fingers may simultaneously rotate the object, translating the fingers may allow simultaneous panning or scrolling of the GUI object.
In one embodiment, block 306 may include determining which GUI object is associated with the gesture being performed, and thereafter locking the displayed object to the fingers disposed over it such that the GUI object changes in accordance with the gestural input. By locking or associating the fingers to the GUI object, the GUI object can continuously adjust itself in accordance to what the fingers are doing on the touch screen. Often the determination and locking occurs at set down, i.e., when the finger is positioned on the touch screen.
Following block 352, the zoom gesture method 350 proceeds to block 354 where the distance between at least the two fingers is compared. The distance may be from finger to finger or from each finger to some other reference point as for example the centroid. If the distance between the two fingers increases (spread apart), a zoom-in signal is generated as shown in block 356. If the distance between two fingers decreases (close together), a zoom-out signal is generated as shown in block 358. In most cases, the set down of the fingers will associate or lock the fingers to a particular GUI object being displayed. For example, the touch sensitive surface can be a touch screen, and the GUI object can be displayed on the touch screen. This typically occurs when at least one of the fingers is positioned over the GUI object. As a result, when the fingers are moved apart, the zoom-in signal can be used to increase the size of the embedded features in the GUI object and when the fingers are pinched together, the zoom-out signal can be used to decrease the size of embedded features in the object. The zooming typically occurs within a predefined boundary such as the periphery of the display, the periphery of a window, the edge of the GUI object, and/or the like. The embedded features may be formed on a plurality of layers, each of which represents a different level of zoom. In most cases, the amount of zooming varies according to the distance between the two objects. Furthermore, the zooming typically can occur substantially simultaneously with the motion of the objects. For instance, as the fingers spread apart or closes together, the object zooms in or zooms out at the same time. Although this methodology is directed at zooming, it should be noted that it may also be used for enlarging or reducing. The zoom gesture method 350 may be particularly useful in graphical programs such as publishing, photo, and drawing programs. Moreover, zooming may be used to control a peripheral device such as a camera, i.e., when the finger is spread apart, the camera zooms out and when the fingers are closed the camera zooms in.
Following block 452, the rotate method 450 proceeds to block 454 where the angle of each of the finger is set. The angles are typically determined relative to a reference point. Following block 454, rotate method 450 proceeds to block 456 where a rotate signal is generated when the angle of at least one of the objects changes relative to the reference point. In most cases, the set down of the fingers will associate or lock the fingers to a particular GUI object displayed on the touch screen. Typically, when at least one of the fingers is positioned over the image on the GUI object, the GUI object will be associated with or locked to the fingers. As a result, when the fingers are rotated, the rotate signal can be used to rotate the object in the direction of finger rotation (e.g., clockwise, counterclockwise). In most cases, the amount of object rotation varies according to the amount of finger rotation, i.e., if the fingers move 5 degrees then so will the object. Furthermore, the rotation typically can occur substantially simultaneously with the motion of the fingers. For instance, as the fingers rotate, the object rotates with the fingers at the same time.
It should be noted that the methods described in
Following block 504, the GUI operational method 500 proceeds to block 506 where an image in the vicinity of the object is generated. The image is typically based on the recognized object. The image may include windows, fields, dialog boxes, menus, icons, buttons, cursors, scroll bars, etc. In some cases, the user can select and activate the image (or features embedded therein) in order to initiate functions and tasks. By way of example, the image may be a user interface element or a group of user interface elements (e.g., one or more buttons that open, close, minimize, or maximize a window). The image may also be one or more icons that launch a particular program or files that open when selected. The image may additionally correspond to non interactive text and graphics. In most cases, the image is displayed as long as the object is detected or it may be displayed for some preset amount of time, i.e., after a period of time it times out and is removed.
In one particular embodiment, the image includes one or more control options that can be selected by the user. The control options may include one or more control buttons for implementing various tasks. For example, the control option box may include music listening control buttons as for example, play, pause, seek and menu.
The GUI operational method 650 generally begins at block 652 where a graphical image is displayed on a GUI. Following block 652, the GUI operational method 650 proceeds to block 654 where a scrolling or panning stroke on a touch sensitive surface is detected. By way of example, the stroke may be a linear or rotational stroke. During a linear stroke, the direction of scrolling or panning typically follows the direction of the stroke. During a rotational stroke (see
The GUI operational method 650 may additionally include blocks A and B. In block A, an object such as a finger is detected on the touch sensitive surface when the image is moving without the assistance of the object (block 660). In block B, the motion of the image is stopped when the object is detected, i.e., the new touch serves as a braking means. Using the metaphor above, while the piece of paper is sliding across the desktop, the user presses their finger on the paper thereby stopping its motion.
Scrolling generally pertains to moving displayed data or images (e.g., media items 681) across a viewing area on a display screen so that a new set of data (e.g., media items 681) is brought into view in the viewing area. In most cases, once the viewing area is full, each new set of data appears at the edge of the viewing area and all other sets of data move over one position. That is, the new set of data appears for each set of data that moves out of the viewing area. In essence, these functions allow a user to view consecutive sets of data currently outside of the viewing area. In most cases, the user is able to accelerate their traversal through the data sets by moving his or her finger at greater speeds. Examples of scrolling through lists can be found in U.S. Patent Publication Nos.: 2003/0076303A1, 2003/0076301A1, 2003/0095096A1, which are herein incorporated by reference.
As shown in
In one embodiment, only a single control signal is generated when the first object is detected over the first key and when the second object is detected over the second key at the same time. By way of example, the first key may be a shift key and the second key may be a symbol key (e.g., letters, numbers). In this manner, the keyboard acts like a traditional keyboard, i.e., the user is allowed to select multiple keys at the same time in order to change the symbol, i.e., lower/upper case. The keys may also correspond to the control key, alt key, escape key, function key, and the like.
In another embodiment, a control signal is generated for each actuated key (key touch) that occurs at the same time. For example, groups of characters can be typed at the same time. In some cases, the application running behind the keyboard may be configured to determine the order of the characters based on some predetermined criteria. For example, although the characters may be jumbled, the application can determine that the correct order of characters based on spelling, usage, context, and the like.
Although only two keys are described, it should be noted that two keys is not a limitation and that more than two keys may be actuated simultaneously to produce one or more control signals. For example, control-alt-delete functionality may be implemented or larger groups of characters can be typed at the same time.
In some cases, the principals of inertia as described above can be applied to the virtual scroll wheel. In cases such as these, the virtual scroll wheel continues to rotate when the fingers (or one of the fingers) are lifted off of the virtual scroll wheel and slowly comes to a stop via virtual friction. Alternatively or additionally, the continuous rotation can be stopped by placing the fingers (or the removed finger) back on the scroll wheel thereby braking the rotation of the virtual scroll wheel.
It should be noted that although a surface scroll wheel is shown, the principals thereof can be applied to more conventional scroll wheels which are virtually based. For example, scroll wheels, whose axis is parallel to the display screen and which appear to protrude through the display screen as shown in
The various aspects, embodiments, implementations or features of the invention can be used separately or in any combination.
The invention is preferably implemented by hardware, software or a combination of hardware and software. The software can also be embodied as computer readable code on a computer readable medium. The computer readable medium is any data storage device that can store data which can thereafter be read by a computer system. Examples of the computer readable medium include read-only memory, random-access memory, CD-ROMs, DVDs, magnetic tape, optical data storage devices, and carrier waves. The computer readable medium can also be distributed over network-coupled computer systems so that the computer readable code is stored and executed in a distributed fashion.
While this invention has been described in terms of several preferred embodiments, there are alterations, permutations, and equivalents, which fall within the scope of this invention. For example, although the invention has been primarily directed at touchscreens, it should be noted that in some cases touch pads may also be used in place of touchscreens. Other types of touch sensing devices may also be utilized. It should also be noted that there are many alternative ways of implementing the methods and apparatuses of the present invention. It is therefore intended that the following appended claims be interpreted as including all such alterations, permutations, and equivalents as fall within the true spirit and scope of the present invention.
Claims
1. A touch-based method, comprising:
- (a) detecting a user input that occurs over a multipoint sensing screen or pad, the user input including one or more inputs, each input having a unique identifier;
- (b) during the user input, classifying the user input as a tracking or selecting input when the user input includes one unique identifier or a gesture input when the user input includes at least two unique identifiers;
- (c) performing tracking or selecting during the user input when the user input is classified as a tracking or selecting input;
- (d) performing one or more control actions during the user input when the user input is classified as a gesturing input.
2. The method as recited in claim 1 wherein scrolling or panning is performed when the at least two unique identifiers move together in substantially the same direction.
3. The method as recited in claim 1 wherein zooming is performed when the at least two unique identifiers linearly move away or towards one another.
4. The method as recited in claim 1, wherein rotation is performed when the at least two unique identifiers rotate relative to each other or relative to a known point.
5. The method as recited in claim 1 wherein multiple control actions are performed simultaneously during the same user input.
6. The method as recited in claim 1 wherein the user input is a continuous stroke, the stroke maintaining continuous contact on the multipoint sensing screen or pad.
7. A computer implemented method for processing touch inputs, said method comprising:
- (a) reading data from a touch screen, the data pertaining to touch input with respect to the touch screen, and the touch screen having a multipoint capability;
- (b) converting the data to a collection of features, wherein a feature includes a position and a magnitude for a sensing point;
- (c) classifying the features;
- (d) grouping the features into one or more feature groups;
- (e) calculating key parameters of the feature groups; and
- (f) associating the feature groups to user interface elements on a display.
8. The computer implemented method as recited in claim 7 wherein said method further comprises:
- recognizing when at least one of the feature groups indicates performance of a gesture relative to its associated user interface element.
9. The computer implemented method as recited in claim 7 wherein said method further comprises:
- providing user feedback when at least one of the feature groups indicates performance of a gesture relative to its associated user interface element, wherein the user feedback is in addition to changes made to the user interface element.
10. The computer implemented method as recited in claim 7 wherein said method further comprises:
- implementing an action when at least one of the feature groups indicates performance of a gesture relative to its associated user interface element.
11. The computer implemented method as recited in claim 10, wherein said method further comprises:
- providing user feedback in conjunction with the action, wherein the user feedback is in addition to the implemented action and changes made to the user interface element.
12. The computer implemented method as recited in claim 7 wherein the operation of associating comprises:
- (a) receiving group of features;
- (b) determining if there is a change in number of features;
- (c) if there is a change, calculating initial parameter values;
- (d) if there is no change, calculating current parameter values; and
- (e) reporting both the initial and current parameter values.
13. A computer implemented method for initiating floating controls via a touch screen, the method comprising:
- (a) detecting the presence of an object on the touch screen;
- (b) recognizing the object; and
- (c) generating a user interface element on the touch screen in the vicinity of the object based on the recognized object.
14. The method as recited in claim 13, wherein the user interface element includes one or more control options that can be selected by a user of the touch screen.
15. The method as recited in claim 14 further comprising generating a control signal when one of the control options is selected by the object.
16. The method as recited in claim 15 wherein the control signal modifies the functionality of the computer.
17. The method as recited in claim 15 wherein the control signal modifies a user interface element.
18. The method as recited in claim 13 wherein the object is a finger or a stylus.
Type: Application
Filed: Apr 15, 2008
Publication Date: Aug 28, 2008
Applicant: Apple Inc. (Cupertino, CA)
Inventors: Steve Hotelling (San Jose, CA), Joshua A. Strickon (San Jose, CA), Brian O. Huppi (San Francisco, CA), Imran Chaudhri (San Francisco, CA), Greg Christie (San Jose, CA), Bas Ording (San Francisco, CA), Duncan Robert Kerr (San Francisco, CA), Jonathan P. Ive (San Francisco, CA)
Application Number: 12/103,646