Speech Controlled System Patents (Class 704/275)
  • Publication number: 20030212562
    Abstract: The present invention provides a method for controlling wireless data transmission from a telematics service center to a voice recognition system in a mobile vehicle. The telematics service call center transmits data to the mobile vehicle over a connection, and monitors said connection for a predetermined barge-in tone. When the telematics service call center receives the user-initiated barge-in tone, it stops data transmission and may request additional commands from the user.
    Type: Application
    Filed: May 13, 2002
    Publication date: November 13, 2003
    Applicant: General Motors Corporation
    Inventors: Kaushik A. Patel, Timothy J. Morse
  • Patent number: 6647368
    Abstract: A pair of sensors are used for detecting an air pressure change signal within an ear of a person caused by the person's initiating action (thought, movement, biological function and/or speech). One of the microphones is placed at least partially within an ear of the person and the other is placed adjacent to and external to the ear, to produce two electrical signals, respectively corresponding to internally detected and to externally detected changes in air pressure. Comparison of the unmodified signal strength difference between these two signals is used to distinguish an initiating action component of each signal from an external source component of each signal. The electrical signals are processed to produce an output signal corresponding to the initiating action, which signal is then recognized by a neural network or speech recognizer, and used for control or communication.
    Type: Grant
    Filed: July 2, 2001
    Date of Patent: November 11, 2003
    Assignee: Think-A-Move, Ltd.
    Inventor: Guerman G. Nemirovski
  • Patent number: 6647363
    Abstract: A system is presented for automatically responding to a user inquiry comprising a dialog manager and a presentation manager. The dialog manager executes a machine-controlled human/machine dialog to determine a set of query items, and in response thereto, retrieves information items from memory. The presentation manager determines the inquiring user's intentions motivating and associated with the query, and in response thereto selects a preferred manner of presenting the retrieved information items, or presentation scenario. In so doing, at least one natural language phrase is generated to match the selected presentation scenario, and a speech generator verbally presents the generated phrasing to the inquiring user.
    Type: Grant
    Filed: October 7, 1999
    Date of Patent: November 11, 2003
    Assignee: ScanSoft, Inc.
    Inventor: Antonius M. W. Claassen
  • Patent number: 6643622
    Abstract: A method and apparatus utilizes a personal speech recognition system to enable data retrieval assistance operators to retrieve customer requested information from an information database based upon a voiced inquiry by the operator. Instead of requiring an operator to interpret a calling customer's request and communicate the same to a computer via a number of keyboard strokes, the live operator may submit the same or equivalent inquiry via a voice utterance that is recognized by the searching computer as a searchable inquiry by a speech recognition system, which is preferably trained to recognize the particular voice of the live operator. Only the voice of the live operator needs to be identified by the voice recognition unit, and the incorporation of the voice recognition technology into the system is thus transparent to the requesting customer.
    Type: Grant
    Filed: June 4, 2001
    Date of Patent: November 4, 2003
    Inventors: Robert O. Stuart, Scott P. Stuart, Don Hornback
  • Patent number: 6642836
    Abstract: A control system for selecting and operating one of a plurality of operating room devices from a single input source, the system includes a master controller that has a voice control interface and can route control signals. The system additionally may include a plurality of slave controllers to provide expandability of the system. Also, the system may generate messages to the user relating to the status of the control system in general and to the status of devices connected thereto.
    Type: Grant
    Filed: October 28, 1997
    Date of Patent: November 4, 2003
    Assignee: Computer Motion, Inc.
    Inventors: Yulun Wang, Charles S. Jordan, Darrin R. Uecker, Charles C. Wooters
  • Patent number: 6643621
    Abstract: Mechanisms and techniques are provided which allow a server computer system, such as a web server, to generate information, such as a web page, which includes an audio resource locator (ARL) configured in accordance with the invention. The ARL includes a reference to audio data, an audio command,and an audio server reference that identifies an audio server computer system that can process the reference to audio data within the ARL according to the audio command within the ARL to producing output, which may be audio or another type of output. The server computer system can serve the information including the ARL to an originator of a request for such information, such as a browser on a client computer system. A client computer system configured with a browser can obtain the information containing the ARL and can reference the ARL which causes the client computer system to send a request to process audio data to the audio server specified in the ARL.
    Type: Grant
    Filed: September 14, 2000
    Date of Patent: November 4, 2003
    Assignee: Cisco Technology, Inc.
    Inventors: Lewis D. Dodrill, Ryan A. Danner, Steven J. Martin
  • Patent number: 6640210
    Abstract: A customer service center includes a server connected to a computer network. The server contains a speech recognition system. An operator station is capable of connecting to the server. A microphone is connected to the operator station. A codec is connected to the microphone and the operator station.
    Type: Grant
    Filed: June 19, 2000
    Date of Patent: October 28, 2003
    Inventors: Frederick Anthony Schaefer, Paul Michael Brashear
  • Publication number: 20030197590
    Abstract: The present invention pertains to control systems and provides a run time configurable control system for selecting and operating one of a plurality of operating room devices from a single input source, the system comprising a master controller having a voice control interface and means for routing control signals. The system additionally may include a plurality of slave controllers to provide expandability of the system. Also, the system includes output means for generating messages to the user relating to the status of the control system in general and to the status of devices connected thereto.
    Type: Application
    Filed: December 9, 2002
    Publication date: October 23, 2003
    Inventors: Yulun Wang, Charles S. Jordan, Darrin R. Uecker, Charles C. Wooters
  • Publication number: 20030200096
    Abstract: A communication device includes a voice command input unit for accepting a voice command spoken by a user, a voice data storage unit for storing the voice command accepted by the voice command input unit as voice data, an electronic mail creating unit for creating an electronic mail including the voice data stored in the voice data storage unit, and an output unit for transmitting the electronic mail created by the electronic mail creating unit.
    Type: Application
    Filed: March 13, 2003
    Publication date: October 23, 2003
    Inventor: Masafumi Asai
  • Publication number: 20030200095
    Abstract: A method for presenting text information with speech according to the invention applies an information-processing apparatus in which a speech database is included. The text information is transformed into speech information according to the speech database, and turned into speech and played. The text information can also be downloaded from a network so that the text information can be online accessed and played with speech.
    Type: Application
    Filed: April 23, 2002
    Publication date: October 23, 2003
    Inventor: Shen Yu Wu
  • Patent number: 6636590
    Abstract: The present invention overcomes the problems in the existing art described above by providing a method and apparatus for specifying and obtaining services through voice commands, via a voice portal, resulting in a live conversation between a user and a selected service provider. The present invention is a system through which seekers of a wide array of services can select, contact, converse, and pay for a service provider using a simple voice-transmission medium such as the telephone. The invention enables the seeker to locate a service provider by speaking the name of a profession, such as “psychiatrist,” which is recognized by the system's voice-recognition software. In a similar fashion, the seeker can then specify by speaking aloud the price range, quality rating, language, and keyword descriptors of the service provider. Within the desired parameters, the system offers service providers who have made themselves available to render services at the present time.
    Type: Grant
    Filed: October 30, 2000
    Date of Patent: October 21, 2003
    Assignee: Ingenio, Inc.
    Inventors: Karl Jacob, Scott Faber, Sean Van Der Linden
  • Patent number: 6636831
    Abstract: A system and process for voice-controlled information retrieval. A conversation template is executed. The conversation template includes a script of tagged instructions including voice prompts and information content. A voice command identifying information content to be retrieved is processed. A remote method invocation is sent requesting the identified information content to an applet process associated with a Web browser. The information content is retrieved on the Web browser responsive to the remote method invocation.
    Type: Grant
    Filed: April 9, 1999
    Date of Patent: October 21, 2003
    Assignee: Inroad, Inc.
    Inventors: Jack H. Profit, Jr., N. Gregg Brown, Peter S. Mezey, Lianne M. Colombo
  • Patent number: 6633844
    Abstract: The combination of audio and video speech recognition in a manner to improve the robustness of speech recognition systems in noisy environments. Contemplated are methods and apparatus in which a video signal associated with a video source and an audio signal associated with the video signal are processed, the most likely viseme associated with the audio signal and video signal is determined and, thereafter, the most likely phoneme associated with the audio signal and video signal is determined.
    Type: Grant
    Filed: December 2, 1999
    Date of Patent: October 14, 2003
    Assignee: International Business Machines Corporation
    Inventors: Ashish Verma, Sankar Basu, Chalapathy Neti
  • Publication number: 20030191648
    Abstract: A method and system for error prevention and recovery of voice activated navigation through a menu having plural nodes provides situation dependent utterance verification by relating confirmation to utterance determination confidence levels. In one embodiment, a high confidence level results in implicit confirmation, a medium confidence level results in explicit confirmation and a low confidence level results in a concise interrogative prompt of a single word that requests the user to repeat the utterance. In situations where voice recognition is difficult, dual modality with DTMF navigation is provided as an option for menu selections.
    Type: Application
    Filed: April 8, 2002
    Publication date: October 9, 2003
    Inventors: Benjamin Anthony Knott, John Mills Martin, Robert Randal Bushey, Tracy Leigh Smart
  • Publication number: 20030191649
    Abstract: A system and method are described for processing transaction instructions without human intervention. In one embodiment, a voice interpreter receives transaction information in the form of voice utterances, processes that information and transmits it to a business application server, which compiles the processed information and generates transaction instructions based on the compiled information. The business application server transmits the transaction instructions to an enterprise system via a connector manager that integrates the enterprise system with the business application server. At least one housing encloses the voice interpreter, the business application server and the hardware platform that supports the connector manager.
    Type: Application
    Filed: April 3, 2003
    Publication date: October 9, 2003
    Inventors: Trevor Stout, Mark Wallin, Marius Seritan
  • Patent number: 6631350
    Abstract: A device-independent speech audio system for linking a speech driven application to specific audio input and output devices can include a media framework for transporting digitized speech audio between speech driven applications and a plurality of audio input and output devices. The media framework can include selectable device-dependent parameters which can enable the transportation of the digitized speech to and from the plurality of audio input and output devices. The device-independent speech audio system also can include an audio abstractor configurable to provide specific ones of the selectable device-dependent parameters according to the specific audio input and output devices. Hence, the audio abstractor can provide a device-independent interface to the speech driven application for linking the speech driven application to the specific audio input and output devices.
    Type: Grant
    Filed: August 28, 2000
    Date of Patent: October 7, 2003
    Assignee: International Business Machines Corporation
    Inventors: Joseph Celi, Jr., Brett Gavagni, Leo Leontiades, Bruce D. Lucas
  • Patent number: 6631346
    Abstract: A computer-implemented speech parsing method and apparatus for processing an input phrase. The method and apparatus include providing a plurality of grammars that are indicative of predetermined topics. A plurality of parse forests are generated using the grammars. Tags are associated with words preferably according to a scoring scheme utilizing the generated parse forests while parsing the input phrase. The tags that are associated with the words are used as a parsed representation of the input phrase.
    Type: Grant
    Filed: April 7, 1999
    Date of Patent: October 7, 2003
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: Murat Karaorman, Jean-Claude Junqua
  • Publication number: 20030187659
    Abstract: A method and apparatus for controlling home electronic devices connected to a home network are provided. The method for controlling home electronic devices connected to a home network includes receiving a user voice command and converting the user voice command into a character command; extracting actions and objects from the character command and converting the character command into a logical command; extracting an action list containing a series of actions from the logical command by referring to an action library storing action data for controlling home electronic devices connected to the home network; and converting the series of actions included in the action list into a control signal and controlling the home electronic devices connected to the home network. According to the method and apparatus, user commands to home electronic devices connected to a complicated home network can be simplified such that home electronic devices are controlled conveniently and efficiently.
    Type: Application
    Filed: March 17, 2003
    Publication date: October 2, 2003
    Applicant: Samsung Electronics Co., Ltd.
    Inventors: Jeong-mi Cho, Jay-woo Kim, Young-jin Hong, Jun-ho Park
  • Patent number: 6629069
    Abstract: A speech recogniser is provided for identifying entries in a database. Results from the recognition of a user's speech are combined with each other and optionally with reference to data in the database in order to maximise the accuracy of an identified entry. An output is also provided which gives an indication of the likely accuracy of the identified entry.
    Type: Grant
    Filed: January 3, 2001
    Date of Patent: September 30, 2003
    Assignee: British Telecommunications a public limited company
    Inventors: David John Attwater, Hilary Richard William Greenhow, Peter John Durston
  • Patent number: 6629075
    Abstract: A speech recognition system includes a user interface configured to provide signals indicative of a user's speech. A speech recognizer of the system includes a processor configured to use the signals from the user interface to perform speech recognition operations to attempt to recognize speech indicated by the signals. A control mechanism is coupled to the voice recognizer and is configured to affect processor usage for speech recognition operations in accordance with a loading of the processor.
    Type: Grant
    Filed: June 9, 2000
    Date of Patent: September 30, 2003
    Assignee: SpeechWorks International, Inc.
    Inventor: Johan Schalkwyk
  • Patent number: 6629077
    Abstract: A universal remote control adapted to receive a voice input. The voice input is received by the remote control and compared to a plurality of voice command templates that are stored in the memory of the remote control. If the voice input matches one or more of the plurality of voice command templates, a valid voice input has been received by the remote control. Valid voice input may be a remote control command or keystroke data, input as an entire word or as individual characters. In response to a valid voice input, the remote control may transmit an operational command code and/or alphanumeric symbol code corresponding to keystroke data to a consumer electronic device.
    Type: Grant
    Filed: November 22, 2000
    Date of Patent: September 30, 2003
    Assignee: Universal Electronics Inc.
    Inventors: Paul D. Arling, Patrick H. Hayes
  • Publication number: 20030182131
    Abstract: An apparatus and a concomitant method for speech recognition. In one embodiment, a distributed speech recognition system provides speech-driven control and remote service access. The distributed speech recognition system comprises a client device and a central server, where the client device is equipped with two speech recognition modules: a foreground speech recognizer and a background speech recognizer. The foreground speech recognizer is implementing a particular spoken language application (SLA) to handle a particular task, whereas the background speech recognizer is monitoring a change in the topic and/or a change in the intent of the user. Upon detection of a change in topic or intent of the user, the background speech recognizer will effect the routing to a new SLA to address the new topic or intent.
    Type: Application
    Filed: March 25, 2002
    Publication date: September 25, 2003
    Inventors: James F. Arnold, Horacio E. Franco, David J. Israel
  • Publication number: 20030182130
    Abstract: An apparatus for recording a meeting held in a room comprises a signal receiving unit, a CODEC unit, a data processing unit, an input control unit, a storage unit, and a power source. A process for recording the meeting comprises the steps of basic input control, state control, digital data processing, voice searching, voice processing, and file system management. With this, voice of an individual who is making a speech during the meeting is located, amplified, and compressed prior to being recorded in a file of minutes. The minutes is stored in the storage unit in digital format so as to improve recording quality and facilitate reading and inquiry of the minutes.
    Type: Application
    Filed: March 22, 2002
    Publication date: September 25, 2003
    Applicant: Multisuns Corp.
    Inventors: Bruce W.H. Sun, Chih Long Lin
  • Publication number: 20030182132
    Abstract: The invention relates to a voice-controlled arrangement (1) comprising a plurality of devices to be controlled (3 to 9) and a mobile voice data entry unit (11) which is connected to said devices by a wireless communication link. At least some of the devices each have a device vocabulary memory (3a to 9a) and a vocabulary transmission unit (3b to 9b), and the voice data entry unit has selection means for selecting the vocabularies to he loaded according to the route destination.
    Type: Application
    Filed: February 28, 2003
    Publication date: September 25, 2003
    Inventor: Meinrad Niemoeller
  • Publication number: 20030177013
    Abstract: Systems and methods are described for a speech system that includes one or more speech controls incorporated into one or more speech-enabled applications that run on the speech system. The controls allow applications to be developed with minimal programming effort to incorporate common speech-enabled application functions. A question control provides a customizable template for requesting information from a user. An announcer control allows a speech-enabled application to provide a user with information without having to re-create an entire announcer process each time it is used. A command control provides a simple way to attach command and control functions to speech-enabled applications. A word trainer control provides a way to associate user-selected voice tags with certain information. Providing the controls for use with speech-enabled applications ensures standardized user interfaces across multiple speech-enabled applications.
    Type: Application
    Filed: February 4, 2002
    Publication date: September 18, 2003
    Inventors: Stephen Russell Falcon, Clement Chun Pong Yip, Dan Banay, David Michael Miller
  • Publication number: 20030177012
    Abstract: A thermostat according to the present invention is capable of capturing audio commands from a user for controlling HVAC systems. The thermostat can operate in training mode and control mode. In training mode, the user can tailor a list of predefined keywords with his voice and, in control mode, the user can change HVAC system settings. Optionally, the thermostat is equipped to interface with a remote unit. The user can speak audio commands into the remote unit, and the remote unit will transmit his commands to the thermostat.
    Type: Application
    Filed: March 13, 2002
    Publication date: September 18, 2003
    Inventor: Brett Drennan
  • Patent number: 6622122
    Abstract: In a document retrieving apparatus, an audio input section converts a sound into a character pattern. A language model storing section stores likelihood information. A word choosing section obtains a word selection result based on the likelihood information. A retrieval condition producing section produces retrieval conditions based on the word selection result. A document storing section stores document to be retrieved. And, a document retrieving section retrieves the documents based on the retrieval conditions. An effective document search can be performed regardless of the sentence recognition accuracy without requiring higher cost in collecting the required language data.
    Type: Grant
    Filed: February 24, 2000
    Date of Patent: September 16, 2003
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: Yoshio Fukushige, Hiroyuki Suzuki, Naohiko Noguchi, Hayashi Ito, Mitsuhiro Sato, Masaki Kiyono, Hideki Yasukawa
  • Patent number: 6622119
    Abstract: A command prediction system for natural language understanding systems, in accordance with the present invention, includes a user interface for receiving commands from a user. A command predictor receives the commands from the user interface and predicts at least one next command which is likely to be presented by the user based on a command history. A probability calculator is included in the command predictor for determining a probability for each of the at least one next command based on the command history such that a list of predicted commands and their likelihood of being a next command are provided.
    Type: Grant
    Filed: October 30, 1999
    Date of Patent: September 16, 2003
    Assignee: International Business Machines Corporation
    Inventors: Ganesh N. Ramaswamy, Jan Kleindienst
  • Publication number: 20030171928
    Abstract: Systems and methods are described for speech systems that utilize an interaction manager to manage interactions—also known as dialogues—from one or more applications. The interactions are managed properly even if multiple applications use different grammars. The interaction manager maintains an interaction list. An application wishing to utilize the speech system submits one or more interactions to the interaction manager. Interactions are normally processed in the order in which they are received. An exception to this rule is an interaction that is configured by an application to be processed immediately, which causes the interaction manager to place the interaction at the front of the interaction list of interactions. If an application has designated an interaction to interrupt a currently processing interaction, then the newly submitted application will interrupt any interaction currently being processed and, therefore, it will be processed immediately.
    Type: Application
    Filed: February 4, 2002
    Publication date: September 11, 2003
    Inventors: Stephen Russel Falcon, Clement Chun Pong Yip, Dan Banay, David Michael Miller
  • Publication number: 20030171931
    Abstract: The present invention trains a user recognition model for a user. A user enrollment input is received and one or more cohort models are identified from a set of possible cohort models. The cohort models are identified based on a similarity measure between the set of possible cohort models and the user enrollment input. Once the cohort models have been identified, a user model is generated based on data associated with the identified cohort models.
    Type: Application
    Filed: March 11, 2002
    Publication date: September 11, 2003
    Inventor: Eric I-Chao Chang
  • Publication number: 20030171930
    Abstract: User interaction with a secure resource is controlled or mediated by the security server that includes a telephony interface by which the server is either coupled to the telephone system or provides messages to the telephone system directly or through an intermediate component. A biometric data store stores biometric data, such as speech data or visual recognition data. If desired the biometric data may also be stored in association with the extension identifiers of the telephone system. A biometric verification/.identification system accesses this data store and evaluates provided user biometric data vis-à-vis the stored biometric data to determine if the user may control or interact with the secure resource. If interaction is permitted, the security server sends control signals to the secure resource. The telephone system provides an interface through which the user trains the system to store the biometric verification/.identification data of that user.
    Type: Application
    Filed: March 7, 2002
    Publication date: September 11, 2003
    Inventor: Jean-Claude Junqua
  • Publication number: 20030171929
    Abstract: Systems and methods are described for a speech system that manages multiple grammars from one or more speech-enabled applications. The speech system includes a speech server that supports different grammars and different types of grammars by exposing several methods to the speech-enabled applications. The speech server supports static grammars that do not change and dynamic grammars that may change after a commit. The speech server provides persistence by supporting persistent grammars that enable a user to issue a command to an application even when the application is not loaded. In such a circumstance, the application is automatically launched and the command is processed. The speech server may enable or disable a grammar in order to limit confusion between grammars. Global and yielding grammars are also supported by the speech server. Global grammars are always active (e.g., “call 9-1-1”) while yielding grammars may be deactivated when an interaction whose grammar requires priority is active.
    Type: Application
    Filed: February 4, 2002
    Publication date: September 11, 2003
    Inventors: Steve Russel Falcon, Clement Chun Pong Yip, David Michael Miller, Dan Banay
  • Publication number: 20030167174
    Abstract: An audio recorder-player includes M tuners that generate N audio signals transmitted by N audio sources, an analyzer that extracts R×N audio signal characteristics from the N audio signals, a memory that stores the R×N audio signal characteristics, and output circuitry that reproduces an audio signal corresponding to one of the N audio signals responsive to selection of at least one of the R×N audio signal characteristics, where R is a positive integer and M and N are positive integers greater than 1. If desired, the audio recorder-player advantageously can be included in one of a radio, a computer, or a set-top box. Methods for operating the audio recorder-player are also described.
    Type: Application
    Filed: March 1, 2002
    Publication date: September 4, 2003
    Applicant: KONINLIJKE PHILIPS ELECTRONICS N.V.
    Inventors: Serhan Dagtas, Nevenka Dimitrova
  • Publication number: 20030167171
    Abstract: A method and apparatus is disclosed for remotely processing voice commands for controlling a television. A voice command is uttered by a user into a microphone contained in a remote control. The voice command is digitized, modulated, compressed, and wirelessly transmitted to a wireless receiver connected to a set-top box. The voice command is then transmitted to a cable head-end unit for voice and word recognition processing. Once the command function is determined, the function is transmitted back to the set-top box where the set-top box performs the command. The microphone is activated and deactivated by pressing and releasing a push-to-talk (PTT) switch. The PTT activates other functions by being turned, double-clicked and toggled up and down, left and right.
    Type: Application
    Filed: January 7, 2003
    Publication date: September 4, 2003
    Inventors: Theodore Calderone, Mark J. Foster, Harry William Printz, James Jay Kistler
  • Patent number: 6615172
    Abstract: An intelligent query system for processing voiced-based queries is disclosed. This distributed client-server system, typically implemented on an intranet or over the Internet accepts a user's queries at his/her computer, PDA or workstation using a speech input interface. After converting the user's query from speech to text, a 2-step algorithm employing a natural language engine, a database processor and a full-text SQL database is implemented to find a single answer that best matches the user's query. The system, as implemented, accepts environmental variables selected by the user and is scalable to provide answers to a variety and quantity of user-initiated queries.
    Type: Grant
    Filed: November 12, 1999
    Date of Patent: September 2, 2003
    Assignee: Phoenix Solutions, Inc.
    Inventors: Ian M. Bennett, Bandi Ramesh Babu, Kishor Morkhandikar, Pallaki Gururaj
  • Patent number: 6615175
    Abstract: An information and control system for personnel transport devices. In one embodiment, the information and control system is coupled to the elevator system of a building, and includes a touch panel input device, a flat panel display having a touch sensitive screen, and speech recognition and synthesis systems serving each elevator car. The speech recognition and synthesis systems and input device(s) are operatively coupled to a processor and storage devices having a plurality of different types of data stored thereon. Each elevator car is also a client connected to a LAN, WAN, intranet, or Internet, and capable of exchanging data with and retrieving data therefrom. Functions performed by the information and control system include a voice-actuated building directory, download of selected data to personal electronic devices (PEDs), monitoring of areas adjacent to the elevator car on destination floors, and control of lighting and security monitoring in selectable areas of destination floors.
    Type: Grant
    Filed: June 10, 1999
    Date of Patent: September 2, 2003
    Inventor: Robert F. Gazdzinski
  • Patent number: 6615176
    Abstract: A method for speech enabling labeless controls in an existing graphical user interface can comprise the steps of: identifying controls in a window contained in the graphical user interface; testing each identified control for an associated label; for each identified control having an associated label, adding the associated label to an active grammar of a speech recognition system; for each identified control not having an associated label, creating a label based upon an object property of a contextually relevant user interface object; and, further adding each created label to the active grammar. In testing each identified control for an associated label, an accessibility interface query can be applied to each identified control in the window. In addition, in creating the label, each contextually relevant object can be searched for an object property descriptive of the identified control not having an associated label.
    Type: Grant
    Filed: July 13, 1999
    Date of Patent: September 2, 2003
    Assignee: International Business Machines Corporation
    Inventors: James R. Lewis, Linda M. Boyer, Ji Whee Tan
  • Publication number: 20030163326
    Abstract: A ventilator hood includes a voice operating unit with a microphone. The microphone is a spatially selective sound pickup.
    Type: Application
    Filed: February 27, 2003
    Publication date: August 28, 2003
    Inventor: Jens Maase
  • Publication number: 20030163325
    Abstract: An electrical household appliance, in particular, a ventilator hood, includes a voice operating unit having a microphone and a voice recognition unit disposed downstream of the latter. The voice operating unit is characterized by a data memory in which voice reference data stored as electronic data are present. The voice operating unit can be calibrated using the evaluation result based upon a comparison between voice data picked up by the microphone and the voice reference data. Also provided are methods for testing a voice operating unit and for initializing a voice operating unit in the appliance.
    Type: Application
    Filed: February 27, 2003
    Publication date: August 28, 2003
    Inventor: Jens Maase
  • Publication number: 20030163324
    Abstract: Computer based voice commands recognition and controlling pre-existing devices (Fan, Lamp etc.) in homes, factories, offices etc. wirelessly using protocol based communication between single transmitter attached with the serial port of the computer and receiver module attached with each device to be controlled.
    Type: Application
    Filed: February 27, 2002
    Publication date: August 28, 2003
    Inventor: Asim Hussain Abbasi
  • Publication number: 20030163323
    Abstract: A system and method for enabling audio comments to be used when writing and executing code, during design time and run time. A code writer is hereby enabled to simultaneously write code and compose voice comments. These comments, divided into help comments, test items and variable comments, are subsequently recorded, stored, analyzed, prescribed and displayed using text to speech and voice recognition software.
    Type: Application
    Filed: February 22, 2002
    Publication date: August 28, 2003
    Applicant: Maxima Blue Ltd.
    Inventor: Max Bluvband
  • Publication number: 20030158739
    Abstract: A speech navigation system provides for speech navigation of a voice mail system. Upon establishment of a communication link between the speech navigation system and the voice mail system, the speech navigation system may receive a voice command. Upon receiving a voice command, the speech navigation system associates the voice command to at least one keypad character. The speech navigation system then provides a telephone dialing tone, which corresponds to the at least one associated keypad character, to the voice mail system.
    Type: Application
    Filed: February 15, 2002
    Publication date: August 21, 2003
    Inventors: Peter A. Moody, Mark A. Cummings-Hill
  • Publication number: 20030158738
    Abstract: A method and apparatus for processing travel-related speech input is presented. A travel server receives a speech input corresponding to a travel-related task. The travel server then converts the speech input into data reflecting the travel-related task and accesses a database for stored information corresponding to the travel-related task. This stored information is returned to the source of the speech input.
    Type: Application
    Filed: November 1, 1999
    Publication date: August 21, 2003
    Inventors: CAROLYN CROSBY, KEVIN BOMAR
  • Publication number: 20030154085
    Abstract: Systems and methods for automatically recognizing and processing verbal data associated with medical treatment diagnosis and related billing, compliance and reporting procedures in the healthcare industry are provided. A method for completing and generating a claim form comprises providing an interactive voice interface for receiving and analyzing one or more verbal inputs to a computing system. The system interprets the one or more verbal inputs to fill one or more corresponding fields in an electronic form. The system continues to receive and analyze verbal inputs until the electronic form is completed. The system associates the content of the electronic form with one or more industry codes. The industry codes identify at least the nature of one or more services associated with the one or more verbal inputs. The system then arranges the codes in a predetermined manner to generate a claim for reimbursement that can be processed by a medical claims processing facility.
    Type: Application
    Filed: February 8, 2002
    Publication date: August 14, 2003
    Applicant: ONEVOICE MEDICAL CORPORATION
    Inventor: Fredrick M. Kelley
  • Patent number: 6606373
    Abstract: There is disclosed a non-realtime messaging system that delivers a subscriber message index to a subscriber's pager device. The subscriber message index is a condensed summary of one or more of the messages directed to the subscriber. The messaging system comprises: 1) a messaging controller for receiving oral messages directed to a subscriber and for transmitting text messages to the subscriber's pager; 2) a translating controller for generating translated text messages, wherein each of the translated text messages corresponds to one of the received oral messages; 3) a data repository capable of storing the translated text messages; and 4) a summary index controller for generating from the translated text messages the subscriber message index.
    Type: Grant
    Filed: June 14, 1999
    Date of Patent: August 12, 2003
    Assignee: WebLink Wireless, Inc.
    Inventor: Larry J. Martin
  • Patent number: 6606599
    Abstract: According to the present invention, a method for integrating processes with a multi-faceted human centered interface is provided. The interface is facilitated to implement a hands free, voice driven environment to control processes and applications. A natural language model is used to parse voice initiated commands and data, and to route those voice initiated inputs to the required applications or processes. The use of an intelligent context based parser allows the system to intelligently determine what processes are required to complete a task which is initiated using natural language. A single window environment provides an interface which is comfortable to the user by preventing the occurrence of distracting windows from appearing. The single window has a plurality of facets which allow distinct viewing areas. Each facet has an independent process routing its outputs thereto. As other processes are activated, each facet can reshape itself to bring a new process into one of the viewing areas.
    Type: Grant
    Filed: March 12, 2001
    Date of Patent: August 12, 2003
    Assignee: Interactive Speech Technologies, LLC
    Inventors: Richard Grant, Peter McGregor
  • Patent number: 6606597
    Abstract: A language model comprising a plurality of augmented-word n-grams and probabilities corresponding to such n-grams. Each n-gram is comprised of a sequence of augmented words. Each augmented word is comprised of the orthographic representation of the word together with a tag representing lexical information regarding the word, such as syntactic or semantic information. Also disclosed are a method of building such a language model, a method of automatically recognizing speech using the language model and a speech recognition system that employs the language model.
    Type: Grant
    Filed: September 8, 2000
    Date of Patent: August 12, 2003
    Assignee: Microsoft Corporation
    Inventors: Eric K. Ringger, Lucian Galescu
  • Patent number: 6606598
    Abstract: A method and apparatus are disclosed for computing and reporting statistical information that describes the performance of an interactive speech application. The interactive speech application is developed and deployed for use by one or more callers. During execution, the interactive speech application stores, in a log, event information that describes each task carried out by the interactive speech application in response to interaction with the one or more callers. After the log is established, an analytical report is displayed. The report describes selective actions taken by the interactive speech application while executing, and selective actions taken by one or more callers while interacting with the interactive speech application. Information in the analytical report is selected so as to identify one or more potential performance problems in the interactive speech application. The analytical reports are generated based on the information stored in the event logs.
    Type: Grant
    Filed: September 21, 1999
    Date of Patent: August 12, 2003
    Assignee: SpeechWorks International, Inc.
    Inventors: Mark A. Holthouse, Matthew T. Marx, John N. Nguyen
  • Publication number: 20030149569
    Abstract: The present invention provides a method and apparatus for generating an animated character representation. This is achieved by using marked-up data including both content data and presentation data. The system then uses this information to generate phoneme and viseme data representing the speech to be presented by the character. By providing the presentation data this ensures that at least some variation in character appearance will automatically occur beyond that of the visemes required to make the character appear to speak. This contributes to the character having a far more lifelike appearance.
    Type: Application
    Filed: February 11, 2003
    Publication date: August 7, 2003
    Inventors: Jonathan Simon Jowitt, William James Cooper, Andrew Robert Burgess
  • Publication number: 20030149568
    Abstract: A system and terminal for facilitating a “virtual presence” allows users on a communication network to simply begin speaking through other users. A system immediately detects the destination party's name, and begins routing the audio signal to a particular destination without any noticeable call set-up. Additionally, the system performs pitch corrected speed control in order to allow the detection and processing of a speech pattern without causing delay to an end user.
    Type: Application
    Filed: January 6, 2003
    Publication date: August 7, 2003
    Inventor: Howard Bubb