Language input interface on a device

- Apple

Methods, systems, and apparatus, including computer program products, for inputting text. A virtual keyboard is presented in a first region of a touch sensitive display of a device. An input representing a phonetic string is received on the virtual keyboard. The entered phonetic string is presented in a second region of the touch sensitive display. One or more candidates are identified based on the phonetic string. At least a subset of the candidates is presented. An input selecting one of the candidates is received. The entered phonetic string is replaced with the selected candidate.

Description
BACKGROUND

The subject matter of this specification is related generally to text input interfaces.

Traditional computer keyboards may be too large for portable devices, such as mobile phones, multimedia players, or personal digital assistants (PDAs). Some portable devices include a smaller version of the traditional computer keyboard or use a virtual keyboard to receive user input. A virtual keyboard can be in the form of a software application, or a feature of a software application, that simulates a computer keyboard. For example, in a portable device with a touch-sensitive display, a virtual keyboard can be used by a user to input text by selecting or tapping areas of the touch-sensitive display corresponding to keys of the virtual keyboard.

These smaller keyboards and virtual keyboards may have keys that correspond to more than one character. For example, some of the keys can, by default, correspond to a character in the English language, for example, the letter “a,” and may also correspond to other additional characters, such as another letter or the letter with an accent option, e.g., the character “ä,” or other characters with accent options. Because of the physical limitations (e.g., size) of the virtual keyboard, a user may find it difficult to type characters not readily available on the virtual keyboard.

Input methods for devices having multi-language environments can present unique challenges with respect to input and spelling correction which may need to be tailored to the selected language to ensure accuracy and an efficient workflow.

SUMMARY

In general, one aspect of the subject matter described in this specification can be embodied in methods that include the actions of presenting a virtual keyboard in a first region of a touch sensitive display of a device, receiving an input representing a phonetic string on the virtual keyboard, presenting the entered phonetic string in a second region of the touch sensitive display, identifying one or more candidates based on the phonetic string, presenting at least a subset of the candidates in the first region or the second region, receiving an input selecting one of the candidates, and replacing the entered phonetic string with the selected candidate. Other embodiments of this aspect include corresponding systems, apparatus, computer program products, and computer readable media.

Particular embodiments of the subject matter described in this specification can be implemented to realize one or more of the following advantages. Text in languages that require phonetic string-to-character conversion can be input more efficiently on a portable device. Error correction and word prediction techniques can be applied to entry of East Asian languages.

The details of one or more embodiments of the subject matter described in this specification are set forth in the accompanying drawings and the description below. Other features, aspects, and advantages of the subject matter will become apparent from the description, the drawings, and the claims.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram of an example mobile device.

FIG. 2 is a block diagram of an example implementation of the mobile device of FIG. 1.

FIGS. 3A-3F illustrate an example user interface for entering text.

FIG. 4 illustrates an example text input process.

Like reference numbers and designations in the various drawings indicate like elements.

DETAILED DESCRIPTION

Example Mobile Device

FIG. 1 is a block diagram of an example mobile device 100. The mobile device 100 can be, for example, a handheld computer, a personal digital assistant, a cellular telephone, a network appliance, a camera, a smart phone, an enhanced general packet radio service (EGPRS) mobile phone, a network base station, a media player, a navigation device, an email device, a game console, or a combination of any two or more of these data processing devices or other data processing devices.

Mobile Device Overview

In some implementations, the mobile device 100 includes a touch-sensitive display 102. The touch-sensitive display 102 can implement liquid crystal display (LCD) technology, light emitting polymer display (LPD) technology, or some other display technology. The touch sensitive display 102 can be sensitive to haptic and/or tactile contact with a user.

In some implementations, the touch-sensitive display 102 can comprise a multi-touch-sensitive display 102. A multi-touch-sensitive display 102 can, for example, process multiple simultaneous touch points, including processing data related to the pressure, degree, and/or position of each touch point. Such processing facilitates gestures and interactions with multiple fingers, chording, and other interactions. Other touch-sensitive display technologies can also be used, e.g., a display in which contact is made using a stylus or other pointing device. Some examples of multi-touch-sensitive display technology are described in U.S. Pat. Nos. 6,323,846, 6,570,557, 6,677,932, and 6,888,536, each of which is incorporated by reference herein in its entirety.

In some implementations, the mobile device 100 can display one or more graphical user interfaces on the touch-sensitive display 102 for providing the user access to various system objects and for conveying information to the user. In some implementations, the graphical user interface can include one or more display objects 104, 106. In the example shown, the display objects 104, 106, are graphic representations of system objects. Some examples of system objects include device functions, applications, windows, files, alerts, events, or other identifiable system objects.

Example Mobile Device Functionality

In some implementations, the mobile device 100 can implement multiple device functionalities, such as a telephony device, as indicated by a phone object 110; an e-mail device, as indicated by the e-mail object 112; a network data communication device, as indicated by the Web object 114; a Wi-Fi base station device (not shown); and a media processing device, as indicated by the media player object 116. In some implementations, particular display objects 104, e.g., the phone object 110, the e-mail object 112, the Web object 114, and the media player object 116, can be displayed in a menu bar 118. In some implementations, device functionalities can be accessed from a top-level graphical user interface, such as the graphical user interface illustrated in FIG. 1. Touching one of the objects 110, 112, 114, or 116 can, for example, invoke corresponding functionality.

In some implementations, the mobile device 100 can implement network distribution functionality. For example, the functionality can enable the user to take the mobile device 100 and provide access to its associated network while traveling. In particular, the mobile device 100 can extend Internet access (e.g., Wi-Fi) to other wireless devices in the vicinity. For example, mobile device 100 can be configured as a base station for one or more devices. As such, mobile device 100 can grant or deny network access to other wireless devices.

In some implementations, upon invocation of device functionality, the graphical user interface of the mobile device 100 changes, or is augmented or replaced with another user interface or user interface elements, to facilitate user access to particular functions associated with the corresponding device functionality. For example, in response to a user touching the phone object 110, the graphical user interface of the touch-sensitive display 102 may present display objects related to various phone functions; likewise, touching of the email object 112 may cause the graphical user interface to present display objects related to various e-mail functions; touching the Web object 114 may cause the graphical user interface to present display objects related to various Web-surfing functions; and touching the media player object 116 may cause the graphical user interface to present display objects related to various media processing functions.

In some implementations, the top-level graphical user interface environment or state of FIG. 1 can be restored by pressing a button 120 located near the bottom of the mobile device 100. In some implementations, each corresponding device functionality may have corresponding “home” display objects displayed on the touch-sensitive display 102, and the graphical user interface environment of FIG. 1 can be restored by pressing the “home” display object.

In some implementations, the top-level graphical user interface can include additional display objects 106, such as a short messaging service (SMS) object 130, a calendar object 132, a photos object 134, a camera object 136, a calculator object 138, a stocks object 140, a weather object 142, a maps object 144, a notes object 146, a clock object 148, an address book object 150, and a settings object 152. Touching the SMS display object 130 can, for example, invoke an SMS messaging environment and supporting functionality; likewise, each selection of a display object 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, and 152 can invoke a corresponding object environment and functionality.

Additional and/or different display objects can also be displayed in the graphical user interface of FIG. 1. For example, if the device 100 is functioning as a base station for other devices, one or more “connection” objects may appear in the graphical user interface to indicate the connection. In some implementations, the display objects 106 can be configured by a user, e.g., a user may specify which display objects 106 are displayed, and/or may download additional applications or other software that provides other functionalities and corresponding display objects.

In some implementations, the mobile device 100 can include one or more input/output (I/O) devices and/or sensor devices. For example, a speaker 160 and a microphone 162 can be included to facilitate voice-enabled functionalities, such as phone and voice mail functions. In some implementations, an up/down button 184 for volume control of the speaker 160 and the microphone 162 can be included. The mobile device 100 can also include an on/off button 182 for a ring indicator of incoming phone calls. In some implementations, a loud speaker 164 can be included to facilitate hands-free voice functionalities, such as speaker phone functions. An audio jack 166 can also be included for use of headphones and/or a microphone.

In some implementations, a proximity sensor 168 can be included to facilitate the detection of the user positioning the mobile device 100 proximate to the user's ear and, in response, to disengage the touch-sensitive display 102 to prevent accidental function invocations. In some implementations, the touch-sensitive display 102 can be turned off to conserve additional power when the mobile device 100 is proximate to the user's ear.

Other sensors can also be used. For example, in some implementations, an ambient light sensor 170 can be utilized to facilitate adjusting the brightness of the touch-sensitive display 102. In some implementations, an accelerometer 172 can be utilized to detect movement of the mobile device 100, as indicated by the directional arrow 174. Accordingly, display objects and/or media can be presented according to a detected orientation, e.g., portrait or landscape. In some implementations, the mobile device 100 may include circuitry and sensors for supporting a location determining capability, such as that provided by the global positioning system (GPS) or other positioning systems (e.g., systems using Wi-Fi access points, television signals, cellular grids, Uniform Resource Locators (URLs)). In some implementations, a positioning system (e.g., a GPS receiver) can be integrated into the mobile device 100 or provided as a separate device that can be coupled to the mobile device 100 through an interface (e.g., port device 190) to provide access to location-based services.

In some implementations, a port device 190, e.g., a Universal Serial Bus (USB) port, or a docking port, or some other wired port connection, can be included. The port device 190 can, for example, be utilized to establish a wired connection to other computing devices, such as other communication devices 100, network access devices, a personal computer, a printer, a display screen, or other processing devices capable of receiving and/or transmitting data. In some implementations, the port device 190 allows the mobile device 100 to synchronize with a host device using one or more protocols, such as, for example, TCP/IP, HTTP, UDP, or any other known protocol.

The mobile device 100 can also include a camera lens and sensor 180. In some implementations, the camera lens and sensor 180 can be located on the back surface of the mobile device 100. The camera can capture still images and/or video.

The mobile device 100 can also include one or more wireless communication subsystems, such as an 802.11b/g communication device 186, and/or a Bluetooth™ communication device 188. Other communication protocols can also be supported, including other 802.x communication protocols (e.g., WiMax, Wi-Fi, 3G), code division multiple access (CDMA), global system for mobile communications (GSM), Enhanced Data GSM Environment (EDGE), etc.

Example Mobile Device Architecture

FIG. 2 is a block diagram 200 of an example implementation of the mobile device 100 of FIG. 1. The mobile device 100 can include a memory interface 202, one or more data processors, image processors and/or central processing units 204, and a peripherals interface 206. The memory interface 202, the one or more processors 204 and/or the peripherals interface 206 can be separate components or can be integrated in one or more integrated circuits. The various components in the mobile device 100 can be coupled by one or more communication buses or signal lines.

Sensors, devices, and subsystems can be coupled to the peripherals interface 206 to facilitate multiple functionalities. For example, a motion sensor 210, a light sensor 212, and a proximity sensor 214 can be coupled to the peripherals interface 206 to facilitate the orientation, lighting, and proximity functions described with respect to FIG. 1. Other sensors 216 can also be connected to the peripherals interface 206, such as a positioning system (e.g., GPS receiver), a temperature sensor, a biometric sensor, or other sensing device, to facilitate related functionalities.

A camera subsystem 220 and an optical sensor 222, e.g., a charged coupled device (CCD) or a complementary metal-oxide semiconductor (CMOS) optical sensor, can be utilized to facilitate camera functions, such as recording photographs and video clips.

Communication functions can be facilitated through one or more wireless communication subsystems 224, which can include radio frequency receivers and transmitters and/or optical (e.g., infrared) receivers and transmitters. The specific design and implementation of the communication subsystem 224 can depend on the communication network(s) over which the mobile device 100 is intended to operate. For example, a mobile device 100 may include communication subsystems 224 designed to operate over a GSM network, a GPRS network, an EDGE network, a Wi-Fi or WiMax network, and a Bluetooth™ network. In particular, the wireless communication subsystems 224 may include hosting protocols such that the device 100 may be configured as a base station for other wireless devices.

An audio subsystem 226 can be coupled to a speaker 228 and a microphone 230 to facilitate voice-enabled functions, such as voice recognition, voice replication, digital recording, and telephony functions.

The I/O subsystem 240 can include a touch screen controller 242 and/or other input controller(s) 244. The touch-screen controller 242 can be coupled to a touch screen 246. The touch screen 246 and touch screen controller 242 can, for example, detect contact and movement or break thereof using any of a plurality of touch sensitivity technologies, including but not limited to capacitive, resistive, infrared, and surface acoustic wave technologies, as well as other proximity sensor arrays or other elements for determining one or more points of contact with the touch screen 246.

The other input controller(s) 244 can be coupled to other input/control devices 248, such as one or more buttons, rocker switches, thumb-wheel, infrared port, USB port, and/or a pointer device such as a stylus. The one or more buttons (not shown) can include an up/down button for volume control of the speaker 228 and/or the microphone 230.

In one implementation, a pressing of the button for a first duration may disengage a lock of the touch screen 246; and a pressing of the button for a second duration that is longer than the first duration may turn power to the mobile device 100 on or off. The user may be able to customize a functionality of one or more of the buttons. The touch screen 246 can, for example, also be used to implement virtual or soft buttons and/or a keyboard.

In some implementations, the mobile device 100 can present recorded audio and/or video files, such as MP3, AAC, and MPEG files. In some implementations, the mobile device 100 can include the functionality of an MP3 player, such as an iPod™. The mobile device 100 may, therefore, include a 30-pin connector that is compatible with the iPod™. Other input/output and control devices can also be used.

The memory interface 202 can be coupled to memory 250. The memory 250 can include high-speed random access memory and/or non-volatile memory, such as one or more magnetic disk storage devices, one or more optical storage devices, and/or flash memory (e.g., NAND, NOR). The memory 250 can store an operating system 252, such as Darwin, RTXC, LINUX, UNIX, OS X, WINDOWS, or an embedded operating system such as VxWorks. The operating system 252 may include instructions for handling basic system services and for performing hardware dependent tasks. In some implementations, the operating system 252 can be a kernel (e.g., UNIX kernel).

The memory 250 may also store communication instructions 254 to facilitate communicating with one or more additional devices, one or more computers and/or one or more servers. The memory 250 may include graphical user interface instructions 256 to facilitate graphic user interface processing; sensor processing instructions 258 to facilitate sensor-related processing and functions; phone instructions 260 to facilitate phone-related processes and functions; electronic messaging instructions 262 to facilitate electronic-messaging related processes and functions; web browsing instructions 264 to facilitate web browsing-related processes and functions; media processing instructions 266 to facilitate media processing-related processes and functions; GPS/Navigation instructions 268 to facilitate GPS and navigation-related processes and instructions; camera instructions 270 to facilitate camera-related processes and functions; and/or other software instructions 272 to facilitate other processes and functions, e.g., security processes and functions. The memory 250 may also store other software instructions (not shown), such as web video instructions to facilitate web video-related processes and functions; and/or web shopping instructions to facilitate web shopping-related processes and functions. In some implementations, the media processing instructions 266 are divided into audio processing instructions and video processing instructions to facilitate audio processing-related processes and functions and video processing-related processes and functions, respectively. An activation record and International Mobile Equipment Identity (IMEI) 274 or similar hardware identifier can also be stored in memory 250.

Language data 276 can also be stored in memory 250. Language data 276 can include, for example, word dictionaries (i.e., lists of possible words in a language) for one or more languages, dictionaries of characters and corresponding phonetics, one or more corpuses of characters and character compounds, and so on.
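
For illustration only, the following sketch (in Python, with hypothetical structure names and sample entries) suggests the kinds of data that language data 276 might contain; the specific format is an assumption, not part of the described implementations.

```python
# Hypothetical sketch of language data 276 for a Japanese input mode.

# Dictionary of characters and corresponding phonetics: reading -> characters.
READING_TO_CHARACTERS = {
    "せん": ["千", "先", "線", "戦"],   # characters sharing the reading "sen"
    "せ": ["瀬"],
}

# Word dictionary: list of possible words, each with a reading and an
# illustrative corpus-derived frequency that can later be used for ranking.
WORD_DICTIONARY = [
    {"word": "先生", "reading": "せんせい", "frequency": 9120},
    {"word": "千", "reading": "せん", "frequency": 4570},
    {"word": "線", "reading": "せん", "frequency": 3890},
]
```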

Each of the above identified instructions and applications can correspond to a set of instructions for performing one or more functions described above. These instructions need not be implemented as separate software programs, procedures, or modules. The memory 250 can include additional instructions or fewer instructions. Furthermore, various functions of the mobile device 100 may be implemented in hardware and/or in software, including in one or more signal processing and/or application specific integrated circuits.

Language Input Interface

FIGS. 3A-3F illustrate an example user interface for entering multi-lingual text on the mobile device 100. The mobile device 100 can display a text input area 302 and a virtual keyboard 304 on the touch-sensitive display 102. The text input area 302 can be any area where input text can be displayed, e.g., a note-taking application, an email application, and so on. In some implementations, the text input area 302 can be one or more text fields located in a document (e.g., a web page rendered in a web browser application). The virtual keyboard 304 includes one or more virtual keys 303 that each correspond to a letter in an alphabet (e.g., the Latin alphabet). The virtual keyboard 304 can include a keyboard toggle key 308 for toggling between letter keys and keys for numbers, punctuation, etc. (i.e., either letter keys or number/punctuation keys can be displayed in the virtual keyboard 304). A user can enter text by touching the touch-sensitive display 102 over the areas of the desired keys of the virtual keyboard 304 (i.e., by selecting or hitting the desired keys). Letters, numbers, etc. corresponding to the touched keys are displayed in the text input area 302 as unconverted current input 310-A. The user can hit the backspace key 306 to delete the last input character.

In some implementations, the mobile device 100 includes the capability to input text in a non-English language using a Latin alphabet virtual keyboard. For example, the mobile device 100 can include the capability for inputting Chinese and/or Japanese text, including Chinese or Japanese characters and symbols, using a Latin alphabet virtual keyboard (e.g., a virtual keyboard with letters arranged in a QWERTY layout). For example, the device 100 can include a Chinese or Japanese text entry mode that utilizes a Latin alphabet keyboard. A user can use the virtual keyboard to enter a phonetic string of letters representing sounds or syllables in the non-English language. For example, a user can use the virtual keyboard to type in a romanization of one or more characters or symbols in Chinese or Japanese.

For convenience, the implementations in this specification will be described in reference to entry of Japanese language text. It should be appreciated, however, that the described implementations can be applied to other non-English languages (e.g., Chinese). More generally, the described implementations can be applied to any text input interface that involves identification, presentation, and selection of candidates for inputs, regardless of language (e.g., Latin alphabet romanization to non-Latin-alphabet text, spelling and grammar correction, thesaurus features, etc.).

When a user inputs a first letter of a phonetic string, the letter is displayed in the text input area 302 as an unconverted current input 310-A, as shown in FIG. 3A. In some implementations, the input 310-A is displayed with underlining or some other formatting (e.g., bold text, italics, highlighting). The underlining/formatting indicates that the input is a provisional input subject to conversion pending additional input from the user, whether that additional input is additional letters or a selection of a candidate by the user. For example, in FIG. 3A, the user hits the "s" key, and the letter "s" is displayed in the text input area 302 as current input 310-A with underlining.

The virtual keyboard 304 can include a "confirm" key 314 that, when hit by the user, accepts the displayed input 310-A as is. The accepted input is displayed without the underlining. For example, in FIG. 3A, the user can hit the "confirm" key 314 to accept the input string "s" as is; "s" is displayed without underlining. In some implementations, hitting the "confirm" key 314 also adds a space after the accepted input. In some other implementations, the addition of the space after the accepted input depends on whether the accepted input is in a language where spaces separate words and/or whether the accepted input is the end of a sentence, to name a few example criteria. In some implementations, key 314 is a "space" key that, when pressed, accepts the current input as is, effectively serving as a "confirm" key.
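
As a rough illustration of the confirm-key behavior described above, the following sketch treats the space-adding decision as a per-language flag and a sentence-end flag; these criteria are taken from the examples above, but the exact rule is an assumption.

```python
def confirm_input(provisional_text: str, language_uses_spaces: bool,
                  at_sentence_end: bool) -> str:
    """Accept the provisional input as is (hypothetical sketch).

    A trailing space is appended only when the language separates words with
    spaces and/or the accepted input ends a sentence, mirroring the example
    criteria described above.
    """
    accepted = provisional_text  # shown without underlining from here on
    if language_uses_spaces or at_sentence_end:
        accepted += " "
    return accepted
```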

The virtual keyboard 304 can also include a “show candidates” key 312. By hitting the “show candidates” key 312, the user can bring up a tray of candidate characters, symbols, and combinations thereof (e.g., kanji, kana combinations) with which to replace the input 310-A. The tray of candidates is further described below.

Continuing from the example input 310-A shown in FIG. 3A, the user next hits the letter "e" on the keyboard, resulting in the string "se." The string "se" can be converted by the device 100 to the hiragana symbol "せ," of which the string "se" is the romanization, and the hiragana symbol "せ" is displayed as converted current input 310-B with underlining, as shown in FIG. 3B. The user can hit the "confirm" key 314 to accept the hiragana symbol "せ" as is; "せ" is then displayed without underlining. Alternatively, the user can hit the "show candidates" key 312 to bring up the tray of candidates related to the string "se" (e.g., characters whose phonetic readings begin with "se").

Continuing from the example input 310-B as shown in FIG. 3B, the user next hits the "n" key, resulting in the string "sen." The terminal letter "n" is converted to the hiragana symbol "ん," of which "n" is the romanization, and appended to the already-converted hiragana symbol "せ." The resulting hiragana string "せん" is displayed as converted current input 310-B with underlining, as shown in FIG. 3C.
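
The romanization-to-kana conversion illustrated in FIGS. 3A-3C can be sketched as a greedy longest-match lookup; the table below is a tiny sample, and the function name is illustrative rather than part of the described implementation.

```python
# Hypothetical sketch: convert a typed Latin phonetic string to hiragana,
# leaving any letters that do not yet form a complete syllable unconverted.

ROMAJI_TO_HIRAGANA = {   # tiny sample table; a real table is much larger
    "se": "せ", "n": "ん", "sa": "さ", "shi": "し", "su": "す", "so": "そ",
}

def convert_phonetic_string(romaji: str) -> str:
    out = []
    i = 0
    while i < len(romaji):
        for length in (3, 2, 1):                 # longest match first
            chunk = romaji[i:i + length]
            if chunk in ROMAJI_TO_HIRAGANA:
                out.append(ROMAJI_TO_HIRAGANA[chunk])
                i += length
                break
        else:
            out.append(romaji[i])                # leave unconverted, e.g. "s"
            i += 1
    return "".join(out)

# convert_phonetic_string("s")   -> "s"    (FIG. 3A, unconverted)
# convert_phonetic_string("se")  -> "せ"   (FIG. 3B)
# convert_phonetic_string("sen") -> "せん" (FIG. 3C)
```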

In some implementations, the device 100 can display one or more suggested candidates 318 inline for the input 310-B. The suggested candidates can include single characters, phonetic symbols (e.g., Japanese kana), and combinations of multiple characters and/or phonetic symbols. For example, in FIG. 3C, a kanji character is displayed as a suggested candidate for "せん" because "sen" is an onyomi reading of that character. In some implementations, the user can hit a suggested candidate (i.e., touch the touch-sensitive display 102 over the area of the desired suggested candidate) to select it, continue typing letter keys on the virtual keyboard 304 to add to the input 310-B, or hit the "show candidates" key 312 to bring up the candidates tray, among other actions. If the user selects a suggested candidate, the selected suggested candidate is displayed as accepted input 336, as shown in FIG. 3F. If the user continues typing on the letter keys of the virtual keyboard 304, the current input 310-B is extended and the possible candidates for the current input 310-B are narrowed down.

In some implementations, the one or more suggested candidates 318 that are presented to the user are determined by the device 100 to be the best match for the input 310-B based on one or more criteria (e.g., frequency in the language, exact match, etc.).
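
One way to realize the "best match" determination is sketched below, assuming exact-reading matches are preferred over prefix matches and that corpus frequency breaks ties; the criteria and weighting are illustrative assumptions.

```python
# Hypothetical ranking sketch for suggested candidates 318.

def rank_candidates(phonetic_input, entries):
    """entries: dicts with "word", "reading", and "frequency" keys."""
    def sort_key(entry):
        exact = entry["reading"] == phonetic_input
        # Exact matches first, then remaining prefix matches, then frequency.
        return (not exact, -entry["frequency"])
    matches = [e for e in entries if e["reading"].startswith(phonetic_input)]
    return sorted(matches, key=sort_key)
```

As a usage example with the hypothetical WORD_DICTIONARY sketched earlier, rank_candidates("せん", WORD_DICTIONARY) would list the exact-reading entries before "先生", ordered by frequency.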

In some implementations, more candidates can be displayed by the device 100 when the user hits an arrow graphical object 319 or the like on the touch-sensitive display 102. For example, when the user hits the arrow 319, a candidates tray 322 can be displayed. Alternatively, the inline suggested candidates list 318 can expand to show more candidates. The arrow 319 gives the user a hint that there are additional candidates available.

In some implementations, the user can hit the confirm key 314 once to select the first candidate of the suggested candidates 318, hit the confirm key 314 twice in quick succession to select the second candidate of the suggested candidates 318, and so on.
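
The multiple-hit confirm-key selection can be sketched as a simple counter keyed to a time threshold; the 0.5-second "quick succession" window below is an assumed value, not taken from this description.

```python
# Hypothetical sketch of counting confirm-key hits in quick succession.

QUICK_SUCCESSION_SECONDS = 0.5   # assumed threshold

class ConfirmKeyCounter:
    def __init__(self):
        self.count = 0
        self.last_hit_time = None

    def hit(self, now: float) -> int:
        """Return the 1-based index of the suggested candidate to select."""
        if (self.last_hit_time is None
                or now - self.last_hit_time > QUICK_SUCCESSION_SECONDS):
            self.count = 1       # first hit selects the first candidate
        else:
            self.count += 1      # subsequent quick hits move down the list
        self.last_hit_time = now
        return self.count
```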

If the user hits the “show candidates” key 312 or the arrow 319, a candidates tray 322 can be displayed, as shown in FIG. 3D. In some implementations, the candidates tray 322 is displayed in place of the virtual keyboard 304. In some other implementations, the candidates tray 322 is displayed over all or part of the text input area 302. In some implementations, the candidates tray 322 slides over the virtual keyboard 304 or the text input area 302, and the sliding is displayed as an animated effect. When the candidates tray 322 is removed from view, the candidates tray 322 can slide off the touch-sensitive display 102.

The candidates tray 322 can include one or more candidate keys 330, each of the candidate keys 330 corresponding to a candidate for conversion of the input 310-B. A candidate (whether for the candidate keys 330 or the suggested candidates 318) can be a character, a phonetic or syllabic symbol (e.g., a kana symbol), a romanization, a multi-character combination forming a word or phrase, a multi-symbol combination forming a word or phrase, a combination of characters and symbols forming a word or phrase, and so on. The candidates can include characters whose phonetic reading is or begins with the input 310-B, words that begin with the input 310-B, and so on. For example, in FIG. 3D, the candidates tray 322 includes candidate keys 330 that correspond to kanji characters that have "せん" as a reading. In some implementations, the candidates in the candidates tray are ordered based on various criteria for determining which candidate is best.

In some implementations, the candidates for the suggested candidates 318 and the candidates tray 322 are identified and ordered using predictive text and/or error correction techniques, examples of which include fuzzy matching, techniques for determining cursor position based on a finger contact, and so on. An example of a predictive text technique is disclosed in Masui, "An Efficient Text Input Method for Pen-based Computers," in Proceedings of the ACM Conference on Human Factors in Computing Systems (CHI '98), Addison-Wesley, April 1998, pp. 328-335, the disclosure of which is incorporated by reference herein in its entirety. An example of a technique for determining cursor position based on a finger contact is disclosed in U.S. patent application Ser. No. 11/850,015, titled "Methods for Determining a Cursor Position from a Finger Contact with a Touch Screen Display," filed Sep. 4, 2007, the disclosure of which is incorporated by reference herein in its entirety.
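
The cited predictive text and cursor-position techniques are not reproduced here; as a generic stand-in for fuzzy matching, the following sketch keeps candidates whose readings are within a similarity threshold of the typed string, using Python's standard-library SequenceMatcher. The threshold value is an assumption.

```python
# Generic fuzzy-matching sketch (not the specific techniques cited above).

from difflib import SequenceMatcher

def fuzzy_candidates(phonetic_input, entries, min_ratio=0.75):
    """Keep entries whose reading is close enough to the typed input."""
    scored = []
    for entry in entries:
        ratio = SequenceMatcher(None, phonetic_input, entry["reading"]).ratio()
        if ratio >= min_ratio:
            scored.append((ratio, entry))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [entry for _, entry in scored]
```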

In some implementations, if the candidates tray 322 is displayed over the virtual keyboard 304, the candidates tray 322 can also include a keyboard switch key 328 for switching back to the virtual keyboard 304. The candidates tray 322 can also include back candidates key 326 and/or next candidates key 324 for moving back and forth between sets of candidate keys 330 within the candidates tray 322. In some implementations, the candidates tray 322 also includes the confirm key 314.

The user can hit a candidate key 330 to replace the input 310-B with the candidate corresponding to the hit candidate key 330. For example, from FIG. 3D, if the user hits the key that corresponds to the candidate character (key 332), the input 310-B is replaced with that character, which is displayed as accepted input 336, as shown in FIG. 3E. In FIG. 3E, the candidates tray 322 reverts to the virtual keyboard 304. The virtual keyboard 304 can include a "space" key 334 and a "return" key 332 in place of the "confirm" key 314 and the "show candidates" key 312, respectively. From FIG. 3F, the user can enter a new phonetic string input.

In some implementations, the virtual keyboard 304 can include a key for switching between multiple input keyboards for various languages.

In some implementations, the candidates tray 322 includes a cancel key 331 for reverting back to the virtual keyboard 304 from the candidates tray 322 without selecting a candidate.

In some implementations, a candidate in the suggested candidates 318 or the candidates tray 322 is highlighted as the "currently selected" candidate. When the suggested candidates 318 or the candidates tray 322 is first displayed after an input of a phonetic string, the initially highlighted candidate can be the phonetic string itself (in the suggested candidates 318 or the candidates tray 322) or the "best" candidate. Key 312 can be a "next candidate" key, where a press of the key moves the highlighting to the next candidate. In some implementations, there can be a "previous candidate" key to move the highlighting backward to a previous candidate. The confirm key 314 can be used to accept the highlighted candidate.

In some other implementations, when the user inputs a phonetic string, no candidate is automatically selected or highlighted by default; the user can hit the confirm key 314 to accept the phonetic string as is. The user can hit the next candidate key (and optionally a previous candidate key) to move through the candidates and highlight one of them. As different candidates are highlighted, the current input 310-B changes to show the currently highlighted candidate, while still being displayed with underlining or other formatting to indicate that the current input 310-B is still provisional. Hitting a return key (e.g., return key 332) confirms the currently selected candidate or the phonetic string (i.e., whatever phonetic string or candidate is shown in the current input 310-B). Adding more phonetic symbols by typing on the virtual keyboard 304 also automatically accepts the currently selected candidate or the phonetic string (i.e., whatever phonetic string or candidate is shown in the current input 310-B).
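
The no-default-selection behavior described in this paragraph can be modeled as a small state object: the provisional input shows either the phonetic string or the currently highlighted candidate, and several actions confirm whatever is shown. Names below are illustrative.

```python
# Hypothetical sketch of candidate highlighting for the provisional input.

class ProvisionalInput:
    def __init__(self, phonetic_string, candidates):
        self.phonetic_string = phonetic_string
        self.candidates = list(candidates)
        self.highlight_index = None      # no candidate highlighted by default

    def current_text(self):
        """Text shown (underlined) in the text input area right now."""
        if self.highlight_index is None:
            return self.phonetic_string
        return self.candidates[self.highlight_index]

    def next_candidate(self):
        if not self.candidates:
            return
        if self.highlight_index is None:
            self.highlight_index = 0
        else:
            self.highlight_index = (self.highlight_index + 1) % len(self.candidates)

    def previous_candidate(self):
        if not self.candidates:
            return
        if self.highlight_index is None:
            self.highlight_index = len(self.candidates) - 1
        else:
            self.highlight_index = (self.highlight_index - 1) % len(self.candidates)

    def confirm(self):
        """Hitting return, or typing more letters, accepts whatever is shown."""
        return self.current_text()
```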

FIG. 4 illustrates an example text input process 400. For convenience, the process 400 will be described in reference to a device (e.g., device 100) that performs the process 400.

A virtual keyboard is displayed in a first region of a touch-sensitive display of a device (402). For example, the device displays virtual keyboard 304 on a portion of the touch-sensitive display 102.

An input is received entering a phonetic string on the virtual keyboard (404). A user can enter one or more letters using the virtual keyboard. The entered letters can constitute a phonetic string. The phonetic string can be a romanization of characters, words, and the like in a language that does not use the Latin alphabet, for example.

The input phonetic string is displayed in a second region of the display (406). The device 100 can display the phonetic string in a text input area on the touch-sensitive display 102. In some implementations, the device 100 converts the phonetic string to, for example, symbols corresponding to the phonetic string (e.g., Japanese kana, Chinese zhuyin, etc.).

One or more candidates matching the phonetic string are identified (408). For example, the device 100 can look up the phonetic string in a dictionary, character database, or the like, and find matching characters for the phonetic string. In some implementations, the device 100 can segment the phonetic string based on syllables or other criteria and find candidates for each of the segments.
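
The segmentation mentioned above might be sketched as a greedy longest-match walk over the phonetic string, looking up candidates for each segment; the function and table names are assumptions (READING_TO_CHARACTERS is the sample table sketched earlier).

```python
# Hypothetical sketch of step 408 with segmentation.

def segment_and_lookup(phonetic, reading_to_characters):
    """Return a list of (segment, candidates) pairs."""
    results = []
    i = 0
    while i < len(phonetic):
        # Take the longest dictionary reading that matches at this position.
        for end in range(len(phonetic), i, -1):
            segment = phonetic[i:end]
            if segment in reading_to_characters:
                results.append((segment, reading_to_characters[segment]))
                i = end
                break
        else:
            # No reading starts here; keep the symbol itself as a candidate.
            results.append((phonetic[i], [phonetic[i]]))
            i += 1
    return results
```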

At least a subset of the identified candidates is displayed in the first region of the touch-sensitive display (410). For example, the candidates can be displayed in a candidates tray 322 that is displayed in place of the virtual keyboard 304. In some implementations, if there are more candidates than can fit in the tray 322, the user can navigate to the overflow candidates by hitting the back candidates key 326 or the next candidates key 324.
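
A minimal paging sketch for the overflow behavior, assuming a tray that shows a fixed number of candidate keys per page (six is an assumed value):

```python
# Hypothetical sketch of paging through candidates with the back/next keys.

def candidates_page(candidates, page, keys_per_page=6):
    """Return the slice of candidates shown on the given 0-based page."""
    start = page * keys_per_page
    return candidates[start:start + keys_per_page]

# "next candidates" key 324: page += 1 while more candidates remain
# "back candidates" key 326: page -= 1 while page > 0
```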

Input is received selecting one of the candidates (412). For example, a user can hit one of the candidate keys 330 in a candidates tray 322 to select the corresponding candidate.

The displayed phonetic string is replaced with the selected candidate (414). In some implementations, the selected candidate is displayed on the touch-sensitive display in place of the input phonetic string.
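
Steps 404-414 can be tied together in a compact, self-contained sketch that models the display as a plain text value; the sample dictionary and function name are illustrative assumptions, not the described implementation.

```python
# Hypothetical end-to-end sketch of steps 404-414 of process 400.

SAMPLE_CANDIDATES = {"せん": ["千", "先", "線"]}

def run_text_input(phonetic_string: str, selected_index: int) -> str:
    text_buffer = phonetic_string                          # (404)-(406) provisional input
    candidates = SAMPLE_CANDIDATES.get(phonetic_string,
                                       [phonetic_string])  # (408) identify candidates
    shown = candidates[:6]                                  # (410) subset shown in the tray
    selected = shown[selected_index]                        # (412) user hits a candidate key
    text_buffer = selected                                  # (414) phonetic string replaced
    return text_buffer

# run_text_input("せん", 0) -> "千"
```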

In some implementations, the virtual keyboard 304 and the candidates tray 322 can be dynamically resized based on the orientation of the touch-sensitive display 102. For example, FIGS. 3A-3F show the virtual keyboard 304 or the candidates tray 322 in portrait orientation. If the device 100, and thus the touch-sensitive display 102, is rotated to landscape orientation, the device 100 can detect the rotation and resize the keyboard 304 and the candidates tray 322 to fit the landscape width of the touch-sensitive display 102.

In some implementations, the suggested candidates 318 are displayed in the same orientation as the text input, whose orientation can vary by language. For example, if the text is displayed from left to right, the suggested candidates 318 are displayed from left to right. If the text is displayed from right to left, the suggested candidates 318 are displayed from right to left. If the text is displayed from top to bottom, the suggested candidates 318 are displayed from top to bottom.

In some implementations, the phonetic string can be input by voice rather than typing on the virtual keyboard 304. For example, the device 100 can include a voice recognition module that receives and processes a user's voice input and generates a phonetic string based on the voice input. The device 100 can identify candidates for the phonetic string generated by the voice recognition module for selection by the user.

The disclosed and other embodiments and the functional operations described in this specification can be implemented in digital electronic circuitry, or in computer software, firmware, or hardware, including the structures disclosed in this specification and their structural equivalents, or in combinations of one or more of them. The disclosed and other embodiments can be implemented as one or more computer program products, i.e., one or more modules of computer program instructions encoded on a computer-readable medium for execution by, or to control the operation of, data processing apparatus. The computer-readable medium can be a machine-readable storage device, a machine-readable storage substrate, a memory device, a composition of matter effecting a machine-readable propagated signal, or a combination of one or more of them. The term “data processing apparatus” encompasses all apparatus, devices, and machines for processing data, including by way of example a programmable processor, a computer, or multiple processors or computers. The apparatus can include, in addition to hardware, code that creates an execution environment for the computer program in question, e.g., code that constitutes processor firmware, a protocol stack, a database management system, an operating system, or a combination of one or more of them. A propagated signal is an artificially generated signal, e.g., a machine-generated electrical, optical, or electromagnetic signal, that is generated to encode information for transmission to suitable receiver apparatus.

A computer program (also known as a program, software, software application, script, or code) can be written in any form of programming language, including compiled or interpreted languages, and it can be deployed in any form, including as a stand-alone program or as a module, component, subroutine, or other unit suitable for use in a computing environment. A computer program does not necessarily correspond to a file in a file system. A program can be stored in a portion of a file that holds other programs or data (e.g., one or more scripts stored in a markup language document), in a single file dedicated to the program in question, or in multiple coordinated files (e.g., files that store one or more modules, sub-programs, or portions of code). A computer program can be deployed to be executed on one computer or on multiple computers that are located at one site or distributed across multiple sites and interconnected by a communication network.

The processes and logic flows described in this specification can be performed by one or more programmable processors executing one or more computer programs to perform functions by operating on input data and generating output. The processes and logic flows can also be performed by, and apparatus can also be implemented as, special purpose logic circuitry, e.g., an FPGA (field programmable gate array) or an ASIC (application-specific integrated circuit).

Processors suitable for the execution of a computer program include, by way of example, both general and special purpose microprocessors, and any one or more processors of any kind of digital computer. Generally, a processor will receive instructions and data from a read-only memory or a random access memory or both. The essential elements of a computer are a processor for performing instructions and one or more memory devices for storing instructions and data. Generally, a computer will also include, or be operatively coupled to receive data from or transfer data to, or both, one or more mass storage devices for storing data, e.g., magnetic, magneto-optical disks, or optical disks. However, a computer need not have such devices. Computer-readable media suitable for storing computer program instructions and data include all forms of non-volatile memory, media and memory devices, including by way of example semiconductor memory devices, e.g., EPROM, EEPROM, and flash memory devices; magnetic disks, e.g., internal hard disks or removable disks; magneto-optical disks; and CD-ROM and DVD-ROM disks. The processor and the memory can be supplemented by, or incorporated in, special purpose logic circuitry.

To provide for interaction with a user, the disclosed embodiments can be implemented on a computer having a display device, e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor, for displaying information to the user and a keyboard and a pointing device, e.g., a mouse or a trackball, by which the user can provide input to the computer. Other kinds of devices can be used to provide for interaction with a user as well; for example, feedback provided to the user can be any form of sensory feedback, e.g., visual feedback, auditory feedback, or tactile feedback; and input from the user can be received in any form, including acoustic, speech, or tactile input.

The disclosed embodiments can be implemented in a computing system that includes a back-end component, e.g., as a data server, or that includes a middleware component, e.g., an application server, or that includes a front-end component, e.g., a client computer having a graphical user interface or a Web browser through which a user can interact with an implementation of what is disclosed here, or any combination of one or more such back-end, middleware, or front-end components. The components of the system can be interconnected by any form or medium of digital data communication, e.g., a communication network. Examples of communication networks include a local area network (“LAN”) and a wide area network (“WAN”), e.g., the Internet.

While this specification contains many specifics, these should not be construed as limitations on the scope of what is being claimed or of what may be claimed, but rather as descriptions of features specific to particular embodiments. Certain features that are described in this specification in the context of separate embodiments can also be implemented in combination in a single embodiment. Conversely, various features that are described in the context of a single embodiment can also be implemented in multiple embodiments separately or in any suitable subcombination. Moreover, although features may be described above as acting in certain combinations and even initially claimed as such, one or more features from a claimed combination can in some cases be excised from the combination, and the claimed combination may be directed to a subcombination or variation of a subcombination.

Similarly, while operations are depicted in the drawings in a particular order, this should not be understood as requiring that such operations be performed in the particular order shown or in sequential order, or that all illustrated operations be performed, to achieve desirable results. In certain circumstances, multitasking and parallel processing may be advantageous. Moreover, the separation of various system components in the embodiments described above should not be understood as requiring such separation in all embodiments, and it should be understood that the described program components and systems can generally be integrated together in a single software product or packaged into multiple software products.

Particular embodiments of the subject matter described in this specification have been described. Other embodiments are within the scope of the following claims. For example, the actions recited in the claims can be performed in a different order and still achieve desirable results. As one example, the processes depicted in the accompanying figures do not necessarily require the particular order shown, or sequential order, to achieve desirable results. In certain implementations, multitasking and parallel processing may be advantageous.

Claims

1. A method comprising:

presenting a virtual keyboard corresponding to a first alphabet in a first region of a touch-sensitive display of a device;
receiving an input on the virtual keyboard representing a phonetic string;
presenting the phonetic string in a second region of the touch-sensitive display;
identifying one or more candidate representations based on the phonetic string, wherein the candidate representations correspond to a second alphabet different from the first alphabet;
identifying one or more additional candidate representations based on the phonetic string in a third alphabet different from the first and second alphabets;
presenting a candidate tray including at least a subset of the candidate representations and a subset of the additional candidate representations;
receiving an input of a selected candidate representation of the candidate representations and the additional candidate representations; and
replacing the entered phonetic string with the selected candidate representation.

2. The method of claim 1, wherein:

the phonetic string comprises Chinese romanization; and
the candidates comprise Chinese characters.

3. The method of claim 1, wherein:

the phonetic string comprises Japanese romanization; and
the candidates comprise one or more of the group consisting of Japanese kanji characters and Japanese kana symbols.

4. The method of claim 1, wherein the virtual keyboard comprises keys corresponding to letters of the Latin alphabet.

5. The method of claim 1, wherein the candidates comprise multi-character words.

6. The method of claim 1, wherein identifying one or more candidates based on the phonetic string comprises identifying one or more candidates using text prediction from the phonetic string.

7. The method of claim 6, wherein presenting at least a subset of the candidates comprises presenting the subset of the candidates in an order determined based on the text prediction.

8. A portable device comprising:

a touch-sensitive display;
memory;
one or more processors; and
instructions stored in the memory and configured for execution by the one or more processors, the instructions comprising instructions to: present a virtual keyboard corresponding to a first alphabet in a first region of the touch-sensitive display of a device; receive an input on the virtual keyboard representing a phonetic string; present the phonetic string in a second region of the touch-sensitive display; identify one or more candidate representations based on the phonetic string, wherein the candidate representations correspond to a second alphabet different from the first alphabet; identify one or more additional candidate representations based on the phonetic string in a third alphabet different from the first and second alphabets; present a candidate tray including at least a subset of the candidate representations and a subset of the additional candidate representations; receive an input of a selected candidate representation of the candidate representations and the additional candidate representations; and replace the entered phonetic string with the selected candidate representation.

9. The device of claim 8, wherein:

the phonetic string comprises Chinese romanization; and
the candidates comprise Chinese characters.

10. The device of claim 8, wherein:

the phonetic string comprises Japanese romanization; and
the candidates comprise one or more of the group consisting of Japanese kanji characters and Japanese kana symbols.

11. The device of claim 8, wherein the virtual keyboard comprises keys corresponding to letters of the Latin alphabet.

12. The device of claim 8, wherein the candidates comprise multi-character words.

13. The device of claim 8, further comprising instructions to identify one or more candidates using text prediction from the phonetic string.

14. The device of claim 13, further comprising instructions to present the subset of the candidates in an order determined based on the text prediction.

15. A computer program product, encoded on a tangible program carrier, operable to cause a portable device to perform operations comprising:

presenting a virtual keyboard corresponding to a first alphabet in a first region of a touch-sensitive display of a device;
receiving an input on the virtual keyboard representing a phonetic string;
presenting the phonetic string in a second region of the touch-sensitive display;
identifying one or more candidate representations based on the phonetic string, wherein the candidate representations correspond to a second alphabet different from the first alphabet;
identifying one or more additional candidate representations based on the phonetic string in a third alphabet different from the first and second alphabets;
presenting a candidate tray including at least a subset of the candidate representations and a subset of the additional candidate representations;
receiving an input of a selected candidate representation of the candidate representations and the additional candidate representations; and
replacing the entered phonetic string with the selected candidate representation.

16. The program product of claim 15, wherein:

the phonetic string comprises Chinese romanization; and
the candidates comprise Chinese characters.

17. The program product of claim 15, wherein:

the phonetic string comprises Japanese romanization; and
the candidates comprise one or more of the group consisting of Japanese kanji characters and Japanese kana symbols.

18. The program product of claim 15, wherein the virtual keyboard comprises keys corresponding to letters of the Latin alphabet.

19. The program product of claim 15, wherein the candidates comprise multi-character words.

20. The program product of claim 15, wherein identifying one or more candidates based on the phonetic string comprises identifying one or more candidates using text prediction from the phonetic string.

21. The program product of claim 20, wherein presenting at least a subset of the candidates comprises presenting the subset of the candidates in an order determined based on the text prediction.

22. A method comprising:

presenting a virtual keyboard corresponding to a first alphabet in a first region of a touch-sensitive display of a device;
receiving an input on the virtual keyboard representing a phonetic string;
presenting the phonetic string in a second region of the touch-sensitive display;
identifying one or more candidate representations based on the phonetic string, wherein the candidate representations correspond to a second alphabet different from the first alphabet;
identifying one or more additional candidate representations based on the phonetic string in a third alphabet different from the first and second alphabets;
presenting at least a subset of the candidate representations and a subset of the additional candidate representations inline with the phonetic string in the second region, where the candidate representations are displayed according to an orientation of text in the second region;
receiving an input selecting one of the candidate representations and the additional candidate representations; and
replacing the entered phonetic string with the selected candidate representation.

23. The method of claim 1, further comprising identifying one or more additional candidate representations based on the phonetic string in a third alphabet different from the first and second alphabets, wherein the candidate tray includes at least a subset of the additional candidate representations.

24. The method of claim 1, wherein the candidate tray further includes the phonetic string.

25. The method of claim 1, wherein the second alphabet comprises logographic characters.

26. The method of claim 1, wherein the candidate tray is presented in place of the virtual keyboard.

27. The device of claim 8, wherein the candidate tray is presented in place of the virtual keyboard.

28. The program product of claim 15, wherein the candidate tray is presented in place of the virtual keyboard.

29. The method of claim 1, wherein presenting the candidate tray comprises presenting the candidate tray in the second region of the touch-sensitive display.

30. The method of claim 1, wherein replacing the entered phonetic string comprises replacing the entered phonetic string in the second region of the touch-sensitive display.

31. The method of claim 1, wherein presenting the candidate tray comprises presenting the candidate tray adjacent to the phonetic string.

32. The method of claim 1, wherein presenting the candidate tray comprises presenting the candidate tray adjacent to the phonetic string in the second region.

33. The method of claim 1, further comprising:

receiving an input on the virtual keyboard representing a second phonetic string;
presenting the second phonetic string in the second region of the touch-sensitive display; and
receiving an input confirming the second phonetic string.

34. The method of claim 1, further comprising:

presenting an option to expand the candidate tray to include a larger subset of the candidate representations.

35. The method of claim 1, further comprising:

providing a confirm key.

36. The method of claim 35, wherein the received input of a selected candidate representation comprises a number of hits to the confirm key in succession.

37. The method of claim 36, wherein each of the candidate representations presented in the candidate tray corresponds to a different number of hits of the confirm key.

38. The method of claim 1, further comprising:

providing an option of replacing the virtual keyboard in the first region with at least a subset of the candidate representations.

39. The method of claim 1, further comprising:

providing an option of presenting a different subset of the candidate representations in the candidate tray.

40. The method of claim 1, wherein presenting the phonetic string in the second region comprises:

displaying the phonetic string using a phonetic representation different from the first alphabet.

41. The method of claim 1, further comprising:

displaying a visual indicator that identifies the presented phonetic string as a provisional input.

42. The method of claim 1, wherein presenting the candidate tray comprises:

presenting the subset of candidate representations according to a text orientation associated with the second alphabet.

43. The method of claim 42, wherein the text orientation comprises a left to right orientation.

44. The method of claim 42, wherein the text orientation comprises a right to left orientation.

45. The method of claim 42, wherein the text orientation comprises a top to bottom orientation.
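
Claims 42-45 present the candidate subset according to a text orientation associated with the second alphabet. A minimal sketch of laying out candidate strings for the three recited orientations; the TextOrientation enum and the joining strategy are assumptions, not the patented rendering.

    // Sketch only: arrange candidates left to right, right to left, or top to bottom.
    enum TextOrientation { case leftToRight, rightToLeft, topToBottom }

    func layOut(_ candidates: [String], orientation: TextOrientation) -> String {
        switch orientation {
        case .leftToRight: return candidates.joined(separator: "  ")
        case .rightToLeft: return candidates.reversed().joined(separator: "  ")
        case .topToBottom: return candidates.joined(separator: "\n")
        }
    }

    print(layOut(["好", "号", "毫"], orientation: .topToBottom))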

46. The method of claim 1, further comprising:

presenting a next candidate key configured to allow a user to select a next candidate of the one or more candidate representations.

47. The method of claim 1, further comprising:

presenting a previous candidate key configured to allow a user to select a previous candidate of the one or more candidate representations.
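
Claims 46 and 47 add next and previous candidate keys. A small sketch of the cursor logic such keys could drive; the wrap-around behavior and the CandidateCursor name are assumptions.

    // Assumed sketch of next/previous candidate navigation.
    struct CandidateCursor {
        let candidates: [String]
        var index = 0

        var current: String? { candidates.isEmpty ? nil : candidates[index] }

        mutating func next() {                       // next candidate key
            guard !candidates.isEmpty else { return }
            index = (index + 1) % candidates.count
        }

        mutating func previous() {                   // previous candidate key
            guard !candidates.isEmpty else { return }
            index = (index + candidates.count - 1) % candidates.count
        }
    }

    var cursor = CandidateCursor(candidates: ["好", "号", "毫"])
    cursor.next()
    print(cursor.current ?? "")   // 号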

48. The device of claim 8, wherein the instructions to present the candidate tray include instructions to present the candidate tray in the second region of the touch-sensitive display.

49. The device of claim 8, wherein the instructions to replace the entered phonetic string include instructions to replace the entered phonetic string in the second region of the touch-sensitive display.

50. The device of claim 8, wherein the instructions to present the candidate tray include instructions to present the candidate tray adjacent to the phonetic string.

51. The device of claim 8, wherein the instructions to present the candidate tray comprise instructions to present the candidate tray adjacent to the phonetic string in the second region.

52. The device of claim 8, further comprising instructions to:

receive an input on the virtual keyboard representing a second phonetic string;
present the second phonetic string in the second region of the touch-sensitive display; and
receive an input confirming the second phonetic string.

53. The device of claim 8, further comprising instructions to:

present an option to expand the candidate tray to include a larger subset of the candidate representations.

54. The device of claim 8, further comprising instructions to:

provide a confirm key.

55. The device of claim 54, wherein the instructions to receive an input of a selected candidate representation include instructions to recognize a number of hits to the confirm key in succession.

56. The device of claim 55, further comprising instructions to:

associate each of the candidate representations presented in the candidate tray with a different number of hits of the confirm key.

57. The device of claim 8, further comprising instructions to:

provide an option of replacing the virtual keyboard in the first region with at least a subset of the candidate representations.

58. The device of claim 8, further comprising instructions to:

provide an option of presenting a different subset of the candidate representations in the candidate tray.

59. The device of claim 8, wherein the instructions to present the phonetic string in the second region comprise instructions to display the phonetic string using a phonetic representation different from the first alphabet.

60. The device of claim 8, further comprising instructions to:

display a visual indicator that identifies the presented phonetic string as a provisional input.

61. The device of claim 8, wherein the instructions to present the candidate tray comprise instructions to present the subset of candidate representations according to a text orientation associated with the second alphabet.

62. The device of claim 61, wherein the text orientation comprises a left to right orientation.

63. The device of claim 61, wherein the text orientation comprises a right to left orientation.

64. The device of claim 61, wherein the text orientation comprises a top to bottom orientation.

65. The device of claim 8, further comprising instructions to present a next candidate key configured to allow a user to select a next candidate of the one or more candidate representations.

66. The device of claim 8, further comprising instructions to present a previous candidate key configured to allow a user to select a previous candidate of the one or more candidate representations.

67. The program product of claim 15, wherein presenting the candidate tray comprises presenting the candidate tray in the second region of the touch-sensitive display.

68. The program product of claim 15, wherein replacing the entered phonetic string comprises replacing the entered phonetic string in the second region of the touch-sensitive display.

69. The program product of claim 15, wherein presenting the candidate tray comprises presenting the candidate tray adjacent to the phonetic string.

70. The program product of claim 15, wherein presenting the candidate tray comprises presenting the candidate tray adjacent to the phonetic string in the second region.

71. The program product of claim 15, wherein the operations further comprise:

receiving an input on the virtual keyboard representing a second phonetic string;
presenting the second phonetic string in the second region of the touch-sensitive display; and
receiving an input confirming the second phonetic string.

72. The program product of claim 15, wherein the operations further comprise:

presenting an option to expand the candidate tray to include a larger subset of the candidate representations.

73. The program product of claim 15, wherein the operations further comprise:

providing a confirm key.

74. The program product of claim 73, wherein the received input of a selected candidate representation comprises a number of hits to the confirm key in succession.

75. The program product of claim 74, wherein each of the candidate representations presented in the candidate tray corresponds to a different number of hits of the confirm key.

76. The program product of claim 15, wherein the operations further comprise:

providing an option of replacing the virtual keyboard in the first region with at least a subset of the candidate representations.

77. The program product of claim 15, wherein the operations further comprise:

providing an option of presenting a different subset of the candidate representations in the candidate tray.

78. The program product of claim 15, wherein presenting the phonetic string in the second region comprises:

displaying the phonetic string using a phonetic representation different from the first alphabet.

79. The program product of claim 15, further comprising:

displaying a visual indicator that identifies the presented phonetic string as a provisional input.

80. The program product of claim 15, wherein presenting the candidate tray comprises presenting the subset of candidate representations according to a text orientation associated with the second alphabet.

81. The program product of claim 80, wherein the text orientation comprises a left to right orientation.

82. The program product of claim 80, wherein the text orientation comprises a right to left orientation.

83. The program product of claim 80, wherein the text orientation comprises a top to bottom orientation.

84. The program product of claim 15, wherein the operations further comprise:

presenting a next candidate key configured to allow a user to select a next candidate of the one or more candidate representations.

85. The program product of claim 15, wherein the operations further comprise:

presenting a previous candidate key configured to allow a user to select a previous candidate of the one or more candidate representations.

86. The method of claim 22, wherein presenting the subset of candidate representations comprises presenting a candidate tray in the second region of the touch-sensitive display.

87. The method of claim 22, wherein replacing the entered phonetic string comprises replacing the entered phonetic string in the second region of the touch-sensitive display.

88. The method of claim 22, wherein presenting the subset of the candidate representations comprises presenting the candidate representations adjacent to the phonetic string.

89. The method of claim 22, wherein presenting the subset of candidate representations comprises presenting the candidate representations adjacent to the phonetic string in the second region.

90. The method of claim 22, further comprising:

receiving an input on the virtual keyboard representing a second phonetic string;
presenting the second phonetic string in the second region of the touch-sensitive display; and
receiving an input confirming the second phonetic string.

91. The method of claim 22, further comprising:

presenting an option to expand the subset of the candidate representations.

92. The method of claim 22, further comprising:

providing a confirm key.

93. The method of claim 92, wherein the received input of a selected candidate representation comprises a number of hits to the confirm key in succession.

94. The method of claim 93, wherein each of the candidate representations corresponds to a different number of hits of the confirm key.

95. The method of claim 22, further comprising:

providing an option of replacing the virtual keyboard in the first region with at least a subset of the candidate representations.

96. The method of claim 22, further comprising:

providing an option of presenting a different subset of the candidate representations.

97. The method of claim 22, wherein presenting the phonetic string in the second region comprises:

displaying the phonetic string using a phonetic representation different from the first alphabet.

98. The method of claim 22, further comprising:

displaying a visual indicator that identifies the presented phonetic string as a provisional input.

99. The method of claim 22, wherein the orientation of text comprises a left to right orientation.

100. The method of claim 22, wherein the orientation of text comprises a right to left orientation.

101. The method of claim 22, wherein the orientation of text comprises a top to bottom orientation.

102. The method of claim 22, further comprising:

presenting a next candidate key configured to allow a user to select a next candidate of the one or more candidate representations.

103. The method of claim 22, further comprising:

presenting a previous candidate key configured to allow a user to select a previous candidate of the one or more candidate representations.

104. A method comprising:

presenting a virtual keyboard corresponding to a first alphabet in a first region of a touch-sensitive display of a device;
receiving an input on the virtual keyboard representing a phonetic string;
presenting the phonetic string in a second region of the touch-sensitive display;
identifying one or more candidate representations based on the phonetic string, wherein the candidate representations correspond to a second alphabet different from the first alphabet;
identifying one or more additional candidate representations based on the phonetic string in a third alphabet different from the first and second alphabets;
presenting a candidate tray including at least a subset of the candidate representations and a subset of the additional candidate representations;
receiving an input of a selected candidate representation of the candidate representations and the additional candidate representations; and
removing the presented phonetic string and presenting the selected candidate representation.
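
Taken end to end, claim 104 amounts to showing the phonetic string provisionally, building a tray from candidates in two other writing systems, and, on selection, removing the phonetic string and presenting the chosen candidate. A compact sketch of that flow; the Composer type, the subset sizes, and the example strings are illustrative assumptions, not Apple's implementation.

    // Illustrative composition flow for claim 104.
    struct Composer {
        var committedText = ""       // text already confirmed in the second region
        var phoneticString = ""      // provisional phonetic string shown in the second region

        mutating func type(_ keys: String) { phoneticString += keys }

        // Builds the tray from candidate lists supplied by a separate lookup engine:
        // a subset of second-alphabet candidates plus a subset of third-alphabet ones.
        func tray(second: [String], third: [String]) -> [String] {
            Array(second.prefix(3)) + Array(third.prefix(2))
        }

        // Selecting a candidate removes the phonetic string and presents the candidate.
        mutating func select(_ candidate: String) {
            committedText += candidate
            phoneticString = ""
        }
    }

    var composer = Composer()
    composer.type("nihao")
    print(composer.tray(second: ["你好", "尼", "泥"], third: ["妳好"]))
    composer.select("你好")
    print(composer.committedText)   // 你好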

105. A non-transitory computer-readable storage medium storing one or more programs, the one or more programs comprising instructions, which when executed by an electronic device with a touch-sensitive display, cause the device to:

present a virtual keyboard corresponding to a first alphabet in a first region of the touch-sensitive display of the device;
receive an input on the virtual keyboard representing a phonetic string;
present the phonetic string in a second region of the touch-sensitive display;
identify one or more candidate representations based on the phonetic string, wherein the candidate representations correspond to a second alphabet different from the first alphabet;
identify one or more additional candidate representations based on the phonetic string in a third alphabet different from the first and second alphabets;
present a candidate tray including at least a subset of the candidate representations and a subset of the additional candidate representations;
receive an input of a selected candidate representation of the candidate representations and the additional candidate representations; and
remove the presented phonetic string and present the selected candidate representation.

106. A method comprising:

presenting a virtual keyboard corresponding to a first alphabet in a first region of a touch-sensitive display of a device;
receiving input representing a phonetic string corresponding to the first alphabet;
presenting the phonetic string in a second region of the touch-sensitive display;
identifying one or more candidate representations based on the phonetic string, wherein the candidate representations correspond to a second alphabet different from the first alphabet;
identifying one or more additional candidate representations based on the phonetic string in a third alphabet different from the first and second alphabets;
presenting a candidate tray including at least a subset of the candidate representations and a subset of the additional candidate representations;
receiving an input of a selected candidate representation of the candidate representations and the additional candidate representations; and
removing the presented phonetic string and presenting the selected candidate representation.

107. A method comprising:

presenting a virtual keyboard in a first region of a touch-sensitive display of a device;
receiving an input on the virtual keyboard representing a phonetic string;
presenting the phonetic string using a first alphabet in a second region of the touch-sensitive display;
identifying one or more candidate representations based on the phonetic string, wherein the one or more candidate representations correspond to a second alphabet different from the first alphabet;
identifying one or more additional candidate representations based on the phonetic string in a third alphabet different from the first and second alphabets;
presenting a candidate tray including at least a subset of the candidate representations and a subset of the additional candidate representations;
receiving an input of a selected candidate representation of the candidate representations and the additional candidate representations; and
removing the presented phonetic string and presenting the selected candidate representation.

108. A method comprising:

presenting one or more virtual keys corresponding to a first alphabet in a first region of a touch-sensitive display of a device;
receiving an input on the one or more virtual keys, wherein the input represents a phonetic string;
presenting the phonetic string in a second region of the touch-sensitive display;
identifying one or more candidate representations based on the phonetic string, wherein the one or more candidate representations correspond to a second alphabet different from the first alphabet;
identifying one or more additional candidate representations based on the phonetic string in a third alphabet different from the first and second alphabets;
presenting a candidate tray including at least a subset of the candidate representations and a subset of the additional candidate representations;
receiving an input of a selected candidate representation of the candidate representations and the additional candidate representations; and
removing the presented phonetic string and presenting the selected candidate representation.
References Cited
U.S. Patent Documents
4974191 November 27, 1990 Amirghodsi et al.
5128672 July 7, 1992 Kaehler
5282265 January 25, 1994 Rohra Suda et al.
5379057 January 3, 1995 Clough et al.
5386556 January 31, 1995 Hedin et al.
5434777 July 18, 1995 Luciw
5479488 December 26, 1995 Lennig et al.
5526259 June 11, 1996 Kaji
5535119 July 9, 1996 Ito et al.
5577241 November 19, 1996 Spencer
5608624 March 4, 1997 Luciw
5675362 October 7, 1997 Clough et al.
5682539 October 28, 1997 Conrad et al.
5727950 March 17, 1998 Cook et al.
5748974 May 5, 1998 Johnson
5794050 August 11, 1998 Dahlgren et al.
5826261 October 20, 1998 Spencer
5895466 April 20, 1999 Goldberg et al.
5899972 May 4, 1999 Miyazawa et al.
5915249 June 22, 1999 Spencer
5959629 September 28, 1999 Masui
5987404 November 16, 1999 Della Pietra et al.
6052656 April 18, 2000 Suda et al.
6081750 June 27, 2000 Hoffberg et al.
6088731 July 11, 2000 Kiraly et al.
6115053 September 5, 2000 Perlin
6144938 November 7, 2000 Surace et al.
6188999 February 13, 2001 Moody
6233559 May 15, 2001 Balakrishnan
6278968 August 21, 2001 Franz et al.
6286064 September 4, 2001 King
6323846 November 27, 2001 Westerman et al.
6334103 December 25, 2001 Surace et al.
6356866 March 12, 2002 Pratley
6421672 July 16, 2002 McAllister et al.
6434524 August 13, 2002 Weber
6446076 September 3, 2002 Burkey et al.
6453292 September 17, 2002 Ramaswamy et al.
6490563 December 3, 2002 Hon
6499013 December 24, 2002 Weber
6501937 December 31, 2002 Ho et al.
6513063 January 28, 2003 Julia et al.
6523061 February 18, 2003 Halverson et al.
6526395 February 25, 2003 Morris
6532444 March 11, 2003 Weber
6567549 May 20, 2003 Marianetti et al.
6570557 May 27, 2003 Westerman et al.
6598039 July 22, 2003 Livowsky
6601026 July 29, 2003 Appelt et al.
6615172 September 2, 2003 Bennett et al.
6633846 October 14, 2003 Bennett et al.
6650735 November 18, 2003 Burton et al.
6661409 December 9, 2003 Demartines et al.
6665640 December 16, 2003 Bennett et al.
6677932 January 13, 2004 Westerman
6691111 February 10, 2004 Lazaridis et al.
6691151 February 10, 2004 Cheyer et al.
6735632 May 11, 2004 Kiraly et al.
6742021 May 25, 2004 Halverson et al.
6757362 June 29, 2004 Cooper et al.
6757718 June 29, 2004 Halverson et al.
6766179 July 20, 2004 Shiau et al.
6778951 August 17, 2004 Contractor
6789231 September 7, 2004 Reynar
6792082 September 14, 2004 Levine
6807574 October 19, 2004 Partovi et al.
6810272 October 26, 2004 Kraft
6822585 November 23, 2004 Ni
6842767 January 11, 2005 Partovi et al.
6848080 January 25, 2005 Lee
6851115 February 1, 2005 Cheyer et al.
6859931 February 22, 2005 Cheyer et al.
6888536 May 3, 2005 Westerman et al.
6895380 May 17, 2005 Sepe, Jr.
6895558 May 17, 2005 Loveland
6928614 August 9, 2005 Everhart
6937975 August 30, 2005 Elworthy
6964023 November 8, 2005 Maes et al.
6980949 December 27, 2005 Ford
6996531 February 7, 2006 Korall et al.
7027974 April 11, 2006 Busch et al.
7030863 April 18, 2006 Longe et al.
7036128 April 25, 2006 Julia et al.
7050977 May 23, 2006 Bennett
7062428 June 13, 2006 Hogenhout et al.
7069560 June 27, 2006 Cheyer et al.
7092928 August 15, 2006 Elad et al.
7096432 August 22, 2006 Huapaya et al.
7107204 September 12, 2006 Liu
7119794 October 10, 2006 Kong
7127046 October 24, 2006 Smith et al.
7136710 November 14, 2006 Hoffberg et al.
7137076 November 14, 2006 Iwema et al.
7137126 November 14, 2006 Coffman et al.
7139714 November 21, 2006 Bennett et al.
7147562 December 12, 2006 Ohara et al.
7177798 February 13, 2007 Hsu et al.
7200559 April 3, 2007 Wang
7203646 April 10, 2007 Bennett
7216073 May 8, 2007 Lavi et al.
7216080 May 8, 2007 Tsiao et al.
7225125 May 29, 2007 Bennett et al.
7233904 June 19, 2007 Luisi
7277854 October 2, 2007 Bennett et al.
7290039 October 30, 2007 Lisitsa et al.
7324947 January 29, 2008 Jordan et al.
7349953 March 25, 2008 Lisitsa et al.
7376556 May 20, 2008 Bennett
7376645 May 20, 2008 Bernard
7376648 May 20, 2008 Johnson
7379874 May 27, 2008 Schmid et al.
7386449 June 10, 2008 Sun et al.
7392185 June 24, 2008 Bennett
7398209 July 8, 2008 Kennewick et al.
7403888 July 22, 2008 Wang
7403938 July 22, 2008 Harrison et al.
7409337 August 5, 2008 Potter et al.
7415100 August 19, 2008 Cooper et al.
7426467 September 16, 2008 Nashida et al.
7447635 November 4, 2008 Konopka et al.
7454351 November 18, 2008 Jeschke et al.
7475010 January 6, 2009 Chao
7483894 January 27, 2009 Cao
7502632 March 10, 2009 Ciminelli
7502738 March 10, 2009 Kennewick et al.
7526466 April 28, 2009 Au
7539656 May 26, 2009 Fratkina et al.
7546382 June 9, 2009 Healey et al.
7548895 June 16, 2009 Pulsipher
7555431 June 30, 2009 Bennett
7599918 October 6, 2009 Shen et al.
7616190 November 10, 2009 Li
7620549 November 17, 2009 Di Cristo et al.
7624007 November 24, 2009 Bennett
7634409 December 15, 2009 Kennewick et al.
7640160 December 29, 2009 Di Cristo et al.
7647225 January 12, 2010 Bennett et al.
7657424 February 2, 2010 Bennett
7672841 March 2, 2010 Bennett
7676026 March 9, 2010 Baxter, Jr.
7684985 March 23, 2010 Dominach et al.
7693552 April 6, 2010 Ciminelli
7693720 April 6, 2010 Kennewick et al.
7698131 April 13, 2010 Bennett
7702500 April 20, 2010 Blaedow
7702508 April 20, 2010 Bennett
7707027 April 27, 2010 Balchandran et al.
7707267 April 27, 2010 Lisitsa et al.
7711672 May 4, 2010 Au
7716056 May 11, 2010 Weng et al.
7720674 May 18, 2010 Kaiser et al.
7725307 May 25, 2010 Bennett
7725320 May 25, 2010 Bennett
7725321 May 25, 2010 Bennett
7729904 June 1, 2010 Bennett
7729916 June 1, 2010 Coffman et al.
7734461 June 8, 2010 Kwak et al.
7752152 July 6, 2010 Paek et al.
7783486 August 24, 2010 Rosser et al.
7801722 September 21, 2010 Kotipalli
7809570 October 5, 2010 Kennewick et al.
7818176 October 19, 2010 Freeman et al.
7822608 October 26, 2010 Cross, Jr. et al.
7831426 November 9, 2010 Bennett
7840400 November 23, 2010 Lavi et al.
7873519 January 18, 2011 Bennett
7873654 January 18, 2011 Bernard
7881936 February 1, 2011 Longé et al.
7912702 March 22, 2011 Bennett
7917367 March 29, 2011 Di Cristo et al.
7917497 March 29, 2011 Harrison et al.
7920678 April 5, 2011 Cooper et al.
7930168 April 19, 2011 Weng et al.
7949529 May 24, 2011 Weider et al.
7974844 July 5, 2011 Sumita
7983917 July 19, 2011 Kennewick et al.
7983997 July 19, 2011 Allen et al.
7987151 July 26, 2011 Schott et al.
8000453 August 16, 2011 Cooper et al.
8005679 August 23, 2011 Jordan et al.
8015006 September 6, 2011 Kennewick et al.
8041570 October 18, 2011 Mirkovic et al.
8055708 November 8, 2011 Chitsaz et al.
8069046 November 29, 2011 Kennewick et al.
8073681 December 6, 2011 Baldwin et al.
8082153 December 20, 2011 Coffman et al.
8095364 January 10, 2012 Longé et al.
8107401 January 31, 2012 John et al.
8112275 February 7, 2012 Kennewick et al.
8165886 April 24, 2012 Gagnon et al.
8289283 October 16, 2012 Kida
8564541 October 22, 2013 Chou
20020167545 November 14, 2002 Kang et al.
20020168107 November 14, 2002 Tang et al.
20030160817 August 28, 2003 Ishida et al.
20030216913 November 20, 2003 Keely
20040006458 January 8, 2004 Fux
20040140956 July 22, 2004 Kushler et al.
20040230912 November 18, 2004 Clow et al.
20050024341 February 3, 2005 Gillespie et al.
20050071332 March 31, 2005 Ortega et al.
20050080625 April 14, 2005 Bennett et al.
20050099408 May 12, 2005 Seto et al.
20050119897 June 2, 2005 Bennett et al.
20050152600 July 14, 2005 Chen et al.
20050174333 August 11, 2005 Robinson et al.
20060053387 March 9, 2006 Ording
20060085757 April 20, 2006 Andre et al.
20060117067 June 1, 2006 Wright et al.
20060122834 June 8, 2006 Bennett
20060143007 June 29, 2006 Koh et al.
20060144211 July 6, 2006 Yoshimoto
20060265668 November 23, 2006 Rainisto
20060274051 December 7, 2006 Longe et al.
20070055529 March 8, 2007 Kanevsky et al.
20070088556 April 19, 2007 Andrew
20070100790 May 3, 2007 Cheyer et al.
20070120822 May 31, 2007 Iso
20070139382 June 21, 2007 Kotipalli
20070185917 August 9, 2007 Prahlad et al.
20070282595 December 6, 2007 Tunning et al.
20080015864 January 17, 2008 Ross et al.
20080021708 January 24, 2008 Bennett et al.
20080030481 February 7, 2008 Gunn et al.
20080034032 February 7, 2008 Healey et al.
20080052063 February 28, 2008 Bennett et al.
20080094356 April 24, 2008 Ording et al.
20080120112 May 22, 2008 Jordan et al.
20080140657 June 12, 2008 Azvine et al.
20080221903 September 11, 2008 Kanevsky et al.
20080228496 September 18, 2008 Yu et al.
20080247519 October 9, 2008 Abella et al.
20080300878 December 4, 2008 Bennett
20090006343 January 1, 2009 Platt et al.
20090030800 January 29, 2009 Grois
20090037837 February 5, 2009 Raghunath et al.
20090058823 March 5, 2009 Kocienda
20090076796 March 19, 2009 Daraselia
20090077464 March 19, 2009 Goldsmith et al.
20090100049 April 16, 2009 Cao
20090150156 June 11, 2009 Kennewick et al.
20090157401 June 18, 2009 Bennett
20090164441 June 25, 2009 Cheyer
20090171664 July 2, 2009 Kennewick et al.
20090226091 September 10, 2009 Goldsmith et al.
20090265669 October 22, 2009 Kida et al.
20090295737 December 3, 2009 Goldsmith
20090299745 December 3, 2009 Kennewick et al.
20100005081 January 7, 2010 Bennett
20100023320 January 28, 2010 Di Cristo et al.
20100036660 February 11, 2010 Bennett
20100042400 February 18, 2010 Block et al.
20100145700 June 10, 2010 Kennewick et al.
20100204986 August 12, 2010 Kennewick et al.
20100217604 August 26, 2010 Baldwin et al.
20100228540 September 9, 2010 Bennett
20100235341 September 16, 2010 Bennett
20100277579 November 4, 2010 Cho et al.
20100280983 November 4, 2010 Cho et al.
20100286985 November 11, 2010 Kennewick et al.
20100299142 November 25, 2010 Freeman et al.
20100312547 December 9, 2010 van Os et al.
20100318576 December 16, 2010 Kim
20100332235 December 30, 2010 David
20110082688 April 7, 2011 Kim et al.
20110112827 May 12, 2011 Kennewick et al.
20110112921 May 12, 2011 Kennewick et al.
20110119049 May 19, 2011 Ylonen
20110125540 May 26, 2011 Jang et al.
20110131036 June 2, 2011 Dicristo et al.
20110131045 June 2, 2011 Cristo et al.
20110144999 June 16, 2011 Jang et al.
20110161076 June 30, 2011 Davis et al.
20110175810 July 21, 2011 Markovic et al.
20110184730 July 28, 2011 Lebeau et al.
20110231182 September 22, 2011 Weider et al.
20110231188 September 22, 2011 Kennewick et al.
20110279368 November 17, 2011 Klein et al.
20110306426 December 15, 2011 Novak et al.
20120002820 January 5, 2012 Leichter
20120016678 January 19, 2012 Gruber et al.
20120020490 January 26, 2012 Leichter
20120022787 January 26, 2012 Lebeau et al.
20120022857 January 26, 2012 Baldwin et al.
20120022860 January 26, 2012 Lloyd et al.
20120022868 January 26, 2012 Lebeau et al.
20120022869 January 26, 2012 Lloyd et al.
20120022870 January 26, 2012 Kristjansson et al.
20120022874 January 26, 2012 Lloyd et al.
20120022876 January 26, 2012 Lebeau et al.
20120023088 January 26, 2012 Cheng et al.
20120034904 February 9, 2012 Lebeau et al.
20120035908 February 9, 2012 Lebeau et al.
20120035924 February 9, 2012 Jitkoff et al.
20120035931 February 9, 2012 Lebeau et al.
20120035932 February 9, 2012 Jitkoff et al.
20120042343 February 16, 2012 Laligand et al.
Foreign Patent Documents
1949158 April 2007 CN
101021763 August 2007 CN
0930760 July 1999 EP
1245023 October 2002 EP
1698982 September 2006 EP
1949158 July 2008 EP
6019965 January 1994 JP
08-272787 October 1996 JP
10-049272 February 1998 JP
2000-112636 April 2000 JP
2001-125896 May 2001 JP
2002-024212 January 2002 JP
2002-108543 April 2002 JP
3314276 August 2002 JP
2002-325965 November 2002 JP
2003-517158 May 2003 JP
2005-092441 April 2005 JP
2009-036999 February 2009 JP
10-0776800 November 2007 KR
10-0810500 March 2008 KR
10 2008 109322 December 2008 KR
10 2009 086805 August 2009 KR
10-0920267 October 2009 KR
10 2011 0113414 October 2011 KR
WO-00/74240 December 2000 WO
WO-2005/008899 January 2005 WO
WO-2005/064587 July 2005 WO
WO-2007/037809 April 2007 WO
WO-2007/047188 April 2007 WO
WO-2007/070223 June 2007 WO
WO-2009/032483 March 2009 WO
WO-2009/111138 September 2009 WO
WO-2011/088053 July 2011 WO
Other references
  • “iPhone is Big in Japan,” Wall Street Journal, May 17, 2000.
  • “Japan belongs to Apple: iPhone claims top 10 spots in total smartphone sales in November,” Rocket News 24, Nov. 13, 2013.
  • Alfred App, 2011, http://www.alfredapp.com/, 5 pages.
  • Ambite, JL., et al., “Design and Implementation of the CALO Query Manager,” Copyright © 2006, American Association for Artificial Intelligence, (www.aaai.org), 8 pages.
  • Ambite, JL., et al., “Integration of Heterogeneous Knowledge Sources in the CALO Query Manager,” 2005, The 4th International Conference on Ontologies, DataBases, and Applications of Semantics (ODBASE), Agia Napa, Cyprus, http://www.isi.edu/people/ambite/publications/integration_heterogeneous_knowledge_sources_calo_query_manager, 18 pages.
  • Belvin, R. et al., “Development of the HRL Route Navigation Dialogue System,” 2001, in Proceedings of the First International Conference on Human Language Technology Research, Paper, Copyright © 2001 HRL Laboratories, LLC, http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.10.6538, 5 pages.
  • Cheyer, A. et al., “Spoken Language and Multimodal Applications for Electronic Realties,” © Springer-Verlag London Ltd, Virtual Reality 1999, 3:1-15, 15 pages.
  • Elio, R. et al., (1999) “On Abstract Task Models and Conversation Policies,” http://webdocs.cs.ualberta.ca/~ree/publications/papers2/ATS.AA99.pdf, 10 pages.
  • Entry for “replace” from The American Heritage Dictionary 1048 (2nd ed. 1985).
  • Ericsson, S. et al., “Software illustrating a unified approach to multimodality and multilinguality in the in-home domain,” Dec. 22, 2006, Talk and Look: Tools for Ambient Linguistic Knowledge, http://www.talk-project.eurice.eu/fileadmin/talk/publications_public/deliverables_public/D1_6.pdf, 127 pages.
  • Excerpt from Microsoft Computer Dictionary Release date: Mar. 22, 2002 ISBN: 0735614954.
  • Excerpt from Webster's New World Computer Dictionary, 10th Edition release date: Apr. 18, 2003 ISBN: 076452478X.
  • Excerpt of Inside Macintosh, vol. 1, Addison-Wesley, Copyright 1985, ISBN 0-201-17731-5.
  • Excerpt of Running Microsoft Word 97 for Windows 95 and Windows 4.0, by Russell Borland, Microsoft Press, Copyright 1997.
  • Excerpt of Smalltalk-80: The Interactive Programming Environment, by Adele Goldberg, Addison-Wesley, Copyright 1984, ISBN 0-201-17732-4.
  • Gervasio, M. T., et al., Active Preference Learning for Personalized Calendar Scheduling Assistance, Copyright © 2005, http://www.ai.sri.com/~gervasio/pubs/gervasio-iui05.pdf, 8 pages.
  • Gruber, T. R., “It Is What It Does: The Pragmatics of Ontology for Knowledge Sharing,” (c) 2000, 2003, http://www.cidoc-crm.org/docs/symposium_presentations/gruber_cidoc-ontology-2003.pdf, 21 pages.
  • Gruber, T. R., “Siri, a Virtual Personal Assistant—Bringing Intelligence to the Interface,” Jun. 16, 2009, Keynote presentation at Semantic Technologies conference, Jun. 2009. http://tomgruber.org/writing/semtech09.htm, 22 pages.
  • Gruber, T. R., et al.,“Nike: A National Infrastructure for Knowledge Exchange,” Oct. 1994, http://www.eit.com/papers/nike/nike.html and nike.ps, 10 pages.
  • Guzzoni, D., et al., “Active, A Platform for Building Intelligent Operating Rooms,” Surgetica 2007 Computer-Aided Medical Interventions: tools and applications, pp. 191-198, Paris, 2007, Sauramps Medical, http://lsro.epfl.ch/page-68384-en.html, 8 pages.
  • Guzzoni, D., et al., “Active, A Tool for Building Intelligent User Interfaces,” ASC 2007, Palma de Mallorca, http://lsro.epfl.ch/page-34241.html, 6 pages.
  • Guzzoni, D., et al., “Modeling Human-Agent Interaction with Active Ontologies,” 2007, AAAI Spring Symposium, Interaction Challenges for Intelligent Assistants, Stanford University, Palo Alto, California, 8 pages.
  • Hardawar, D., “Driving app Waze builds its own Siri for hands-free voice control,” Feb. 9, 2012, http://venturebeat.com/2012/02/09/driving-app-waze-builds-its-own-siri-fo-r-hands-free-voice-control/, 4 pages.
  • International Preliminary Report on Patentability in International Application No. PCT/US2008/072803 mailed Mar. 18, 2010.
  • International Preliminary Report on Patentability in PCT/US2009/033696 mailed Sep. 16, 2010, 6 pages.
  • International Search Report and Written Opinion dated Nov. 29, 2011, received in International Application No. PCT/US2011/20861, which corresponds to U.S. Appl. No. 12/987,982, 15 pages. (Thomas Robert Gruber).
  • International Search Report and Written Opinion, dated Apr. 29, 2009, issued in International Application No. PCT/US2009/033696.
  • International Search Report and Written Opinion, dated Feb. 18, 2009, issued in International Application No. PCT/US2008/072803.
  • Invitation to Pay Fees and Partial International Search Report, dated Nov. 11, 2008, issued in International Application No. PCT/US2008/072803.
  • Julia, L., et al., Un editeur interactif de tableaux dessines a main levee (An Interactive Editor for Hand-Sketched Tables), Traitement du Signal 1995, vol. 12, No. 6, 8 pages. No English Translation Available.
  • Karp, P. D., “A Generic Knowledge-Base Access Protocol,” May 12, 1994, http://lecture.cs.buu.ac.th/~f50353/Document/gfp.pdf, 66 pages.
  • Liu et al., “Online Recognition of Chinese Characters: The State-of-the-Art,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 26, No. 2, Feb. 2004, pp. 198-213.
  • Masui, “An Efficient Text Input Method for Pen-based Computers,” Proceedings of the ACM Conference on Human Factors in Computing System (CHI '98), Apr. 1998, ACM press, pp. 328-335.
  • Masui, “POBox: An Efficient Text Input Method for Handheld and Ubiquitous Computers,” Proceedings of the International Symposium on Handheld and Ubiquitous Computing (HUC '99), Sep. 1999, pp. 289-300.
  • McGuire, J., et al., “Shade: Technology for Knowledge-Based Collaborative Engineering,” 1993, Journal of Concurrent Engineering: Applications and Research (CERA), 18 pages.
  • Milward, D., et al., “D2.2: Dynamic Multimodal Interface Reconfiguration,” Talk and Look: Tools for Ambient Linguistic Knowledge, Aug. 8, 2006, http://www.ihmc.us/users/nblaylock/Pubs/Files/talk_d2.2.pdf, 69 pages.
  • Muhlhauser, M., “Context Aware Voice User Interfaces for Workflow Support,” Darmstadt 2007, http://tuprints.ulb.tu-darmstadt.de/876/1/PhD.pdf, 254 pages.
  • Naone, E., “TR10: Intelligent Software Assistant,” Mar.-Apr. 2009, Technology Review, http://www.technologyreview.com/printer_friendly_article.aspx?id=22117, 2 pages.
  • Noth, E., et al., “Verbmobil: The Use of Prosody in the Linguistic Components of a Speech Understanding System,” IEEE Transactions on Speech and Audio Processing, vol. 8, No. 5, Sep. 2000, 14 pages.
  • Rice, J., et al., (1995) “Monthly Program: Nov. 14, 1995,” The San Francisco Bay Area Chapter of ACM SIGCHI, http://www.baychi.org/calendar/19951114/, 2 pages.
  • Rice, J., et al., (1996) “Using the Web Instead of a Window System,” Knowledge Systems Laboratory, Stanford University, http://tomgruber.org/writing/ksl-95-69.pdf, 14 pages.
  • Rivlin, Z., et al., “Maestro: Conductor of Multimedia Analysis Technologies,” 1999 SRI International, Communications of the Association for Computing Machinery (CACM), 7 pages.
  • Sacher, “Interactions in Chinese: Designing Interfaces for Asian Languages,” Interactions Magazine, vol. 5, Issue 5, Sep.-Oct. 1998, pp. 28-38.
  • Seneff, S., et al., “A New Restaurant Guide Conversational System: Issues in Rapid Prototyping for Specialized Domains,” Oct. 1996, citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.16...rep . . . , 4 pages.
  • Simonite, T., “One Easy Way to Make Siri Smarter,” Oct. 18, 2011, Technology Review, http://www.technologyreview.com/printer_friendly_article.aspx?id=38915, 2 pages.
  • Tofel, K., et al., “Speaktoit: A personal assistant for older iPhones, iPads,” Feb. 9, 2012, http://gigaom.com/apple/speaktoit-siri-for-older-iphones-ipads/, 7 pages.
  • Tucker, J., “Too lazy to grab your TV remote? Use Siri instead,” Nov. 30, 2011, http://www.engadget.com/2011/11/30/too-lazy-to-grab-your-tv-remote--use-siri-instead/, 8 pages.
  • Tur, G., et al., “The-CALO-Meeting-Assistant System,” IEEE Transactions on Audio, Speech, and Language Processing, vol. 18, No. 6, Aug. 2010, 11 pages.
  • Vlingo, “Vlingo Launches Voice Enablement Application on Apple App Store,” Vlingo press release dated Dec. 3, 2008, 2 pages.
  • YouTube, “Knowledge Navigator,” 5:34 minute video uploaded to YouTube by Knownav on Apr. 29, 2008, http://www.youtube.com/watch?v=QRH8eimU_20 on Aug. 3, 2006, 1 page.
  • YouTube, “Send Text, Listen to and Send E-Mail ‘By Voice’ www.voiceassist.com,” 2:11 minute video uploaded to YouTube by VoiceAssist on Jul. 30, 2009, http://www.youtube.com/watch?v=0tEU61nHHA4, 1 page.
  • YouTube, “Text'nDrive App Demo—Listen and Reply to your Messages by Voice while Driving!,” 1:57 minute video uploaded to YouTube by TextnDrive on Apr. 27, 2010, http://www.youtube.com/watch?v=WaGfzoHsAMw, 1 page.
  • Zue, V. W., “Toward Systems that Understand Spoken Language,” Feb. 1994, ARPA Strategic Computing Institute, © 1994 IEEE, 9 pages.
  • Zue, V., “Conversational Interfaces: Advances and Challenges,” Sep. 1997, http://www.cs.cmu.edu/~dod/papers/zue97.pdf, 10 pages.
  • Translated First Office Action dated Jul. 29, 2010, issued in Chinese Patent Application No. 200910118235.X, 9 pages.
  • Kida et al., “Language Input Interface on a Device”, U.S. Appl. No. 12/107,711, filed Apr. 22, 2008.
  • Goldsmith et al., “Identification of Candidate Characters for Text Input”, U.S. Appl. No. 12/167,044, filed Jul. 2, 2008.
  • Chou, “Zhuyin Input Interface on a Device”, U.S. Appl. No. 12/476,121, filed Jun. 1, 2009.
  • Authorized officer Philippe Becamel, International Preliminary Report on Patentability in PCT/US2009/033696 mailed Sep. 16, 2010, 7 pages.
  • Berry, P. M., et al., “PTIME: Personalized Assistance for Calendaring,” ACM Transactions on Intelligent Systems and Technology, vol. 2, No. 4, Article 40, Publication date: Jul. 2011, 40:1-22, 22 pages.
  • Butcher, M., “EVI arrives in town to go toe-to-toe with Siri,” Jan. 23, 2012, http://techcrunch.com/2012/01/23/evi-arrives-in-town-to-go-toe-to-toe-with-siri/, 2 pages.
  • Chen, Y., “Multimedia Siri Finds and Plays Whatever You Ask for,” Feb. 9, 2012, http://www.psfk.com/2012/02/multimedia-siri.html, 9 pages.
  • Cutkosky, M. R. et al., “PACT: An Experiment in Integrating Concurrent Engineering Systems,” Journal, Computer, vol. 26 Issue 1, Jan. 1993, IEEE Computer Society Press Los Alamitos, CA, USA, http://dl.acm.org/citation.cfm?id=165320, 14 pages.
  • Evi, “Meet Evi: the one mobile app that provides solutions for your everyday problems,” Feb. 8, 2012, http://www.evi.com/, 3 pages.
  • Feigenbaum, E., et al., “Computer-assisted Semantic Annotation of Scientific Life Works,” 2007, http://tomgruber.org/writing/stanford-cs300.pdf, 22 pages.
  • Gannes, L., “Alfred App Gives Personalized Restaurant Recommendations,” allthingsd.com, Jul. 18, 2011, http://allthingsd.com/20110718/alfred-app-gives-personalized-restaurant-recommendations/, 3 pages.
  • Gautier, P. O., et al. “Generating Explanations of Device Behavior Using Compositional Modeling and Causal Ordering,” 1993, http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.42.8394, 9 pages.
  • Glass, A., “Explaining Preference Learning,” 2006, http://cs229.stanford.edu/proj2006/GlassExplainingPreferenceLearning.pdf, 5 pages.
  • Gruber, T. R., et al., “An Ontology for Engineering Mathematics,” in Jon Doyle, Piero Torasso, & Erik Sandewall, Eds., Fourth International Conference on Principles of Knowledge Representation and Reasoning, Gustav Stresemann Institut, Bonn, Germany, Morgan Kaufmann, 1994, http://www-ksl.stanford.edu/knowledge-sharing/papers/engmath.html, 22 pages.
  • Gruber, T. R., “A Translation Approach to Portable Ontology Specifications,” Knowledge Systems Laboratory, Stanford University, Sep. 1992, Technical Report KSL 92-71, Revised Apr. 1993, 27 pages.
  • Gruber, T. R., “Automated Knowledge Acquisition for Strategic Knowledge,” Knowledge Systems Laboratory, Machine Learning, 4, 293-336 (1989), 44 pages.
  • Gruber, T. R., “(Avoiding) the Travesty of the Commons,” Presentation at NPUC 2006, New Paradigms for User Computing, IBM Almaden Research Center, Jul. 24, 2006. http://tomgruber.org/writing/avoiding-travestry.htm, 52 pages.
  • Gruber, T. R., “Big Think Small Screen: How semantic computing in the cloud will revolutionize the consumer experience on the phone,” Keynote presentation at Web 3.0 conference, Jan. 27, 2010, http://tomgruber.org/writing/web30jan2010.htm, 41 pages.
  • Gruber, T. R., “Collaborating around Shared Content on the WWW,” W3C Workshop on WWW and Collaboration, Cambridge, MA, Sep. 11, 1995, http://www.w3.org/Collaboration/Workshop/Proceedings/P9.html, 1 page.
  • Gruber, T. R., “Collective Knowledge Systems: Where the Social Web meets the Semantic Web,” Web Semantics: Science, Services and Agents on the World Wide Web (2007), doi:10.1016/j.websem.2007.11.011, keynote presentation given at the 5th International Semantic Web Conference, Nov. 7, 2006, 19 pages.
  • Gruber, T. R., “Where the Social Web meets the Semantic Web,” Presentation at the 5th International Semantic Web Conference, Nov. 7, 2006, 38 pages.
  • Gruber, T. R., “Despite our Best Efforts, Ontologies are not the Problem,” AAAI Spring Symposium, Mar. 2008, http://tomgruber.org/writing/aaai-ss08.htm, 40 pages.
  • Gruber, T. R., “Enterprise Collaboration Management with Intraspect,” Intraspect Software, Inc., Intraspect Technical White Paper Jul. 2001, 24 pages.
  • Gruber, T. R., “Every ontology is a treaty—a social agreement—among people with some common motive in sharing,” Interview by Dr. Miltiadis D. Lytras, Official Quarterly Bulletin of AIS Special Interest Group on Semantic Web and Information Systems, vol. 1, Issue 3, 2004, http://www.sigsemis.org 1, 5 pages.
  • Gruber, T. R., et al., “Generative Design Rationale: Beyond the Record and Replay Paradigm,” Knowledge Systems Laboratory, Stanford University, Dec. 1991, Technical Report KSL 92-59, Updated Feb. 1993, 24 pages.
  • Gruber, T. R., “Helping Organizations Collaborate, Communicate, and Learn,” Presentation to NASA Ames Research, Mountain View, CA, Mar. 2003, http://tomgruber.org/writing/organizational-intelligence-talk.htm, 30 pages.
  • Gruber, T. R., “Intelligence at the Interface: Semantic Technology and the Consumer Internet Experience,” Presentation at Semantic Technologies conference (SemTech08), May 20, 2008, http://tomgruber.org/writing.htm, 40 pages.
  • Gruber, T. R., Interactive Acquisition of Justifications: Learning “Why” by Being Told “What,” Knowledge Systems Laboratory, Stanford University, Oct. 1990, Technical Report KSL 91-17, Revised Feb. 1991, 24 pages.
  • Gruber, T. R., et al., “Machine-generated Explanations of Engineering Models: A Compositional Modeling Approach,” (1993) in Proc. International Joint Conference on Artificial Intelligence, http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.34.930, 7 pages.
  • Gruber, T. R., “2021: Mass Collaboration and the Really New Economy,” TNTY Futures, the newsletter of the Next Twenty Years series, vol. 1, Issue 6, Aug. 2001, http://www.tnty.com/newsletter/futures/archive/v01-05business.html, 5 pages.
  • Gruber, T. R., “Ontologies, Web 2.0 and Beyond,” Apr. 24, 2007, Ontology Summit 2007, http://tomgruber.org/writing/ontolog-social-web-keynote.pdf, 17 pages.
  • Gruber, T. R., “Ontology of Folksonomy: A Mash-up of Apples and Oranges,” Originally published to the web in 2005, Int'l Journal on Semantic Web & Information Systems, 3(2), 2007, 7 pages.
  • Gruber, T. R., “TagOntology,” Presentation to Tag Camp, www.tagcamp.org, Oct. 29, 2005, 20 pages.
  • Gruber, T. R., et al., “Toward a Knowledge Medium for Collaborative Product Development,” in Artificial Intelligence in Design 1992, from Proceedings of the Second International Conference on Artificial Intelligence in Design, Pittsburgh, USA, Jun. 22-25, 1992, 19 pages.
  • Gruber, T. R., “Toward Principles for the Design of Ontologies Used for Knowledge Sharing,” in International Journal Human-Computer Studies 43, p. 907-928, substantial revision of paper presented at the International Workshop on Formal Ontology, Mar. 1993, Padova, Italy, available as Technical Report KSL 93-04, Knowledge Systems Laboratory, Stanford University, further revised Aug. 23, 1993, 23 pages.
  • Intraspect Software, “The Intraspect Knowledge Management Solution: Technical Overview,” http://tomgruber.org/writing/intraspect-whitepaper-1998.pdf, 18 pages.
  • Lemon, O., et al., “Multithreaded Context for Robust Conversational Interfaces: Context-Sensitive Speech Recognition and Interpretation of Corrective Fragments,” Sep. 2004, ACM Transactions on Computer-Human Interaction, vol. 11, No. 3, 27 pages.
  • Leong, L., et al., “CASIS: A Context-Aware Speech Interface System,” IUI'05, Jan. 9-12, 2005, Proceedings of the 10th international conference on Intelligent user interfaces, San Diego, California, USA, 8 pages.
  • Lieberman, H., et al., “Out of context: Computer systems that adapt to, and learn from, context,” 2000, IBM Systems Journal, vol. 39, Nos. 3/4, 2000, 16 pages.
  • Lin, B., et al., “A Distributed Architecture for Cooperative Spoken Dialogue Agents with Coherent Dialogue State and History,” 1999, http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.42.272, 4 pages.
  • Mitra, P., et al., “A Graph-Oriented Model for Articulation of Ontology Interdependencies,” 2000, http://ilpubs.stanford.edu:8090/442/1/2000-20.pdf, 15 pages.
  • Moran, D. B., et al., “Multimodal User Interfaces in the Open Agent Architecture,” Proc. of the 1997 International Conference on Intelligent User Interfaces (IUI97), 8 pages.
  • Mozer, M., “An Intelligent Environment Must be Adaptive,” Mar./Apr. 1999, IEEE Intelligent Systems, 3 pages.
  • Neches, R., “Enabling Technology for Knowledge Sharing,” Fall 1991, AI Magazine, pp. 37-56, (21 pages).
  • Sheth, A., et al., “Relationships at the Heart of Semantic Web: Modeling, Discovering, and Exploiting Complex Semantic Relationships,” Oct. 13, 2002, Enhancing the Power of the Internet: Studies in Fuzziness and Soft Computing, SpringerVerlag, 38 pages.
  • Stent, A., et al., “The CommandTalk Spoken Dialogue System,” 1999, http://acl.ldc.upenn.edu/P/P99/P99-1024.pdf, 8 pages.
  • Tur, G., et al., “The CALO Meeting Speech Recognition and Understanding System,” 2008, Proc. IEEE Spoken Language Technology Workshop, 4 pages.
  • YouTube, “Voice on the Go (BlackBerry),” 2:51 minute video uploaded to YouTube by VoiceOnTheGo on Jul. 27, 2009, http://www.youtube.com/watch?v=pJqpWgQS98w, 1 page.
  • Glass, J., et al., “Multilingual Spoken-Language Understanding in the MIT Voyager System,” Aug. 1995, http://groups.csail.mit.edu/sls/publications/1995/speechcomm95-voyager.pdf, 29 pages.
  • Goddeau, D., et al., “A Form-Based Dialogue Manager for Spoken Language Applications,” Oct. 1996, http://phasedance.com/pdf/icslp96.pdf, 4 pages.
  • Goddeau, D., et al., “Galaxy: A Human-Language Interface to On-Line Travel Information,” 1994 International Conference on Spoken Language Processing, Sep. 18-22, 1994, Pacific Convention Plaza Yokohama, Japan, 6 pages.
  • Meng, H., et al., “Wheels: A Conversational System in the Automobile Classified Domain,” Oct. 1996, http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.16.3022, 4 pages.
  • Phoenix Solutions, Inc. v. West Interactive Corp., Document 40, Declaration of Christopher Schmandt Regarding the MIT Galaxy System dated Jul. 2, 2010, 162 pages.
  • Vlingo InCar, “Distracted Driving Solution with Vlingo InCar,” 2:38 minute video uploaded to YouTube by Vlingo Voice on Oct. 6, 2010, http://www.youtube.com/watch?v=Vqs8XfXxgz4, 2 pages.
Patent History
Patent number: RE46139
Type: Grant
Filed: Oct 16, 2014
Date of Patent: Sep 6, 2016
Assignee: APPLE INC. (Cupertino, CA)
Inventors: Yasuo Kida (Palo Alto, CA), Kenneth Kocienda (San Jose, CA), Elizabeth Caroline Furches Cranfill (San Francisco, CA)
Primary Examiner: Kenneth J Whittington
Application Number: 14/516,533
Classifications
Current U.S. Class: Pop-up Control (715/808)
International Classification: G06F 3/041 (20060101); G06F 3/023 (20060101); G06F 3/0488 (20130101);