Speech Controlled System Patents (Class 704/275)
-
Publication number: 20030212562
Abstract: The present invention provides a method for controlling wireless data transmission from a telematics service center to a voice recognition system in a mobile vehicle. The telematics service call center transmits data to the mobile vehicle over a connection, and monitors said connection for a predetermined barge-in tone. When the telematics service call center receives the user-initiated barge-in tone, it stops data transmission and may request additional commands from the user.
Type: Application
Filed: May 13, 2002
Publication date: November 13, 2003
Applicant: General Motors Corporation
Inventors: Kaushik A. Patel, Timothy J. Morse
-
Patent number: 6647368
Abstract: A pair of sensors are used for detecting an air pressure change signal within an ear of a person caused by the person's initiating action (thought, movement, biological function and/or speech). One of the microphones is placed at least partially within an ear of the person and the other is placed adjacent to and external to the ear, to produce two electrical signals, respectively corresponding to internally detected and to externally detected changes in air pressure. Comparison of the unmodified signal strength difference between these two signals is used to distinguish an initiating action component of each signal from an external source component of each signal. The electrical signals are processed to produce an output signal corresponding to the initiating action, which signal is then recognized by a neural network or speech recognizer, and used for control or communication.
Type: Grant
Filed: July 2, 2001
Date of Patent: November 11, 2003
Assignee: Think-A-Move, Ltd.
Inventor: Guerman G. Nemirovski
-
Patent number: 6647363
Abstract: A system is presented for automatically responding to a user inquiry comprising a dialog manager and a presentation manager. The dialog manager executes a machine-controlled human/machine dialog to determine a set of query items, and in response thereto, retrieves information items from memory. The presentation manager determines the inquiring user's intentions motivating and associated with the query, and in response thereto selects a preferred manner of presenting the retrieved information items, or presentation scenario. In so doing, at least one natural language phrase is generated to match the selected presentation scenario, and a speech generator verbally presents the generated phrasing to the inquiring user.
Type: Grant
Filed: October 7, 1999
Date of Patent: November 11, 2003
Assignee: ScanSoft, Inc.
Inventor: Antonius M. W. Claassen
-
Patent number: 6643622
Abstract: A method and apparatus utilizes a personal speech recognition system to enable data retrieval assistance operators to retrieve customer requested information from an information database based upon a voiced inquiry by the operator. Instead of requiring an operator to interpret a calling customer's request and communicate the same to a computer via a number of keyboard strokes, the live operator may submit the same or equivalent inquiry via a voice utterance that is recognized by the searching computer as a searchable inquiry by a speech recognition system, which is preferably trained to recognize the particular voice of the live operator. Only the voice of the live operator needs to be identified by the voice recognition unit, and the incorporation of the voice recognition technology into the system is thus transparent to the requesting customer.
Type: Grant
Filed: June 4, 2001
Date of Patent: November 4, 2003
Inventors: Robert O. Stuart, Scott P. Stuart, Don Hornback
-
Patent number: 6642836
Abstract: A control system for selecting and operating one of a plurality of operating room devices from a single input source, the system includes a master controller that has a voice control interface and can route control signals. The system additionally may include a plurality of slave controllers to provide expandability of the system. Also, the system may generate messages to the user relating to the status of the control system in general and to the status of devices connected thereto.
Type: Grant
Filed: October 28, 1997
Date of Patent: November 4, 2003
Assignee: Computer Motion, Inc.
Inventors: Yulun Wang, Charles S. Jordan, Darrin R. Uecker, Charles C. Wooters
-
Patent number: 6643621
Abstract: Mechanisms and techniques are provided which allow a server computer system, such as a web server, to generate information, such as a web page, which includes an audio resource locator (ARL) configured in accordance with the invention. The ARL includes a reference to audio data, an audio command, and an audio server reference that identifies an audio server computer system that can process the reference to audio data within the ARL according to the audio command within the ARL to produce output, which may be audio or another type of output. The server computer system can serve the information including the ARL to an originator of a request for such information, such as a browser on a client computer system. A client computer system configured with a browser can obtain the information containing the ARL and can reference the ARL which causes the client computer system to send a request to process audio data to the audio server specified in the ARL.
Type: Grant
Filed: September 14, 2000
Date of Patent: November 4, 2003
Assignee: Cisco Technology, Inc.
Inventors: Lewis D. Dodrill, Ryan A. Danner, Steven J. Martin
-
Patent number: 6640210
Abstract: A customer service center includes a server connected to a computer network. The server contains a speech recognition system. An operator station is capable of connecting to the server. A microphone is connected to the operator station. A codec is connected to the microphone and the operator station.
Type: Grant
Filed: June 19, 2000
Date of Patent: October 28, 2003
Inventors: Frederick Anthony Schaefer, Paul Michael Brashear
-
Publication number: 20030197590
Abstract: The present invention pertains to control systems and provides a run time configurable control system for selecting and operating one of a plurality of operating room devices from a single input source, the system comprising a master controller having a voice control interface and means for routing control signals. The system additionally may include a plurality of slave controllers to provide expandability of the system. Also, the system includes output means for generating messages to the user relating to the status of the control system in general and to the status of devices connected thereto.
Type: Application
Filed: December 9, 2002
Publication date: October 23, 2003
Inventors: Yulun Wang, Charles S. Jordan, Darrin R. Uecker, Charles C. Wooters
-
Publication number: 20030200096
Abstract: A communication device includes a voice command input unit for accepting a voice command spoken by a user, a voice data storage unit for storing the voice command accepted by the voice command input unit as voice data, an electronic mail creating unit for creating an electronic mail including the voice data stored in the voice data storage unit, and an output unit for transmitting the electronic mail created by the electronic mail creating unit.
Type: Application
Filed: March 13, 2003
Publication date: October 23, 2003
Inventor: Masafumi Asai
-
Publication number: 20030200095
Abstract: A method for presenting text information with speech according to the invention applies an information-processing apparatus in which a speech database is included. The text information is transformed into speech information according to the speech database, and turned into speech and played. The text information can also be downloaded from a network so that the text information can be online accessed and played with speech.
Type: Application
Filed: April 23, 2002
Publication date: October 23, 2003
Inventor: Shen Yu Wu
-
Patent number: 6636590
Abstract: The present invention overcomes the problems in the existing art described above by providing a method and apparatus for specifying and obtaining services through voice commands, via a voice portal, resulting in a live conversation between a user and a selected service provider. The present invention is a system through which seekers of a wide array of services can select, contact, converse, and pay for a service provider using a simple voice-transmission medium such as the telephone. The invention enables the seeker to locate a service provider by speaking the name of a profession, such as “psychiatrist,” which is recognized by the system's voice-recognition software. In a similar fashion, the seeker can then specify by speaking aloud the price range, quality rating, language, and keyword descriptors of the service provider. Within the desired parameters, the system offers service providers who have made themselves available to render services at the present time.
Type: Grant
Filed: October 30, 2000
Date of Patent: October 21, 2003
Assignee: Ingenio, Inc.
Inventors: Karl Jacob, Scott Faber, Sean Van Der Linden
-
Patent number: 6636831
Abstract: A system and process for voice-controlled information retrieval. A conversation template is executed. The conversation template includes a script of tagged instructions including voice prompts and information content. A voice command identifying information content to be retrieved is processed. A remote method invocation is sent requesting the identified information content to an applet process associated with a Web browser. The information content is retrieved on the Web browser responsive to the remote method invocation.
Type: Grant
Filed: April 9, 1999
Date of Patent: October 21, 2003
Assignee: Inroad, Inc.
Inventors: Jack H. Profit, Jr., N. Gregg Brown, Peter S. Mezey, Lianne M. Colombo
-
Patent number: 6633844
Abstract: The combination of audio and video speech recognition in a manner to improve the robustness of speech recognition systems in noisy environments. Contemplated are methods and apparatus in which a video signal associated with a video source and an audio signal associated with the video signal are processed, the most likely viseme associated with the audio signal and video signal is determined and, thereafter, the most likely phoneme associated with the audio signal and video signal is determined.
Type: Grant
Filed: December 2, 1999
Date of Patent: October 14, 2003
Assignee: International Business Machines Corporation
Inventors: Ashish Verma, Sankar Basu, Chalapathy Neti
-
Publication number: 20030191648
Abstract: A method and system for error prevention and recovery of voice activated navigation through a menu having plural nodes provides situation dependent utterance verification by relating confirmation to utterance determination confidence levels. In one embodiment, a high confidence level results in implicit confirmation, a medium confidence level results in explicit confirmation and a low confidence level results in a concise interrogative prompt of a single word that requests the user to repeat the utterance. In situations where voice recognition is difficult, dual modality with DTMF navigation is provided as an option for menu selections.
Type: Application
Filed: April 8, 2002
Publication date: October 9, 2003
Inventors: Benjamin Anthony Knott, John Mills Martin, Robert Randal Bushey, Tracy Leigh Smart
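
Illustrative note: the confidence-tiered confirmation described in the abstract of publication 20030191648 might be sketched as follows. This is a minimal sketch, not the method claimed in the application; the threshold values, the names `Confirmation`, `choose_confirmation` and `build_prompt`, and the prompt wording are all assumptions made for illustration.

```python
from enum import Enum


class Confirmation(Enum):
    IMPLICIT = "implicit"   # high confidence: proceed, mentioning the choice
    EXPLICIT = "explicit"   # medium confidence: ask a yes/no question
    REPROMPT = "reprompt"   # low confidence: one-word request to repeat


# Assumed thresholds; the abstract only names high, medium and low levels.
HIGH_THRESHOLD = 0.8
LOW_THRESHOLD = 0.5


def choose_confirmation(confidence: float) -> Confirmation:
    """Map a recognizer confidence score in [0, 1] to a confirmation style."""
    if not 0.0 <= confidence <= 1.0:
        raise ValueError("confidence must be between 0 and 1")
    if confidence >= HIGH_THRESHOLD:
        return Confirmation.IMPLICIT
    if confidence >= LOW_THRESHOLD:
        return Confirmation.EXPLICIT
    return Confirmation.REPROMPT


def build_prompt(recognized: str, confidence: float) -> str:
    """Build the system's next prompt for a recognized menu selection."""
    style = choose_confirmation(confidence)
    if style is Confirmation.IMPLICIT:
        return f"Going to {recognized}."
    if style is Confirmation.EXPLICIT:
        return f"Did you say {recognized}?"
    return "Pardon?"  # concise single-word interrogative prompt
```

For example, `build_prompt("billing", 0.6)` falls in the assumed medium band and asks for explicit confirmation, while a score below 0.5 produces only the one-word reprompt.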
-
Publication number: 20030191649
Abstract: A system and method are described for processing transaction instructions without human intervention. In one embodiment, a voice interpreter receives transaction information in the form of voice utterances, processes that information and transmits it to a business application server, which compiles the processed information and generates transaction instructions based on the compiled information. The business application server transmits the transaction instructions to an enterprise system via a connector manager that integrates the enterprise system with the business application server. At least one housing encloses the voice interpreter, the business application server and the hardware platform that supports the connector manager.
Type: Application
Filed: April 3, 2003
Publication date: October 9, 2003
Inventors: Trevor Stout, Mark Wallin, Marius Seritan
-
Patent number: 6631350
Abstract: A device-independent speech audio system for linking a speech driven application to specific audio input and output devices can include a media framework for transporting digitized speech audio between speech driven applications and a plurality of audio input and output devices. The media framework can include selectable device-dependent parameters which can enable the transportation of the digitized speech to and from the plurality of audio input and output devices. The device-independent speech audio system also can include an audio abstractor configurable to provide specific ones of the selectable device-dependent parameters according to the specific audio input and output devices. Hence, the audio abstractor can provide a device-independent interface to the speech driven application for linking the speech driven application to the specific audio input and output devices.
Type: Grant
Filed: August 28, 2000
Date of Patent: October 7, 2003
Assignee: International Business Machines Corporation
Inventors: Joseph Celi, Jr., Brett Gavagni, Leo Leontiades, Bruce D. Lucas
-
Patent number: 6631346
Abstract: A computer-implemented speech parsing method and apparatus for processing an input phrase. The method and apparatus include providing a plurality of grammars that are indicative of predetermined topics. A plurality of parse forests are generated using the grammars. Tags are associated with words preferably according to a scoring scheme utilizing the generated parse forests while parsing the input phrase. The tags that are associated with the words are used as a parsed representation of the input phrase.
Type: Grant
Filed: April 7, 1999
Date of Patent: October 7, 2003
Assignee: Matsushita Electric Industrial Co., Ltd.
Inventors: Murat Karaorman, Jean-Claude Junqua
-
Publication number: 20030187659
Abstract: A method and apparatus for controlling home electronic devices connected to a home network are provided. The method for controlling home electronic devices connected to a home network includes receiving a user voice command and converting the user voice command into a character command; extracting actions and objects from the character command and converting the character command into a logical command; extracting an action list containing a series of actions from the logical command by referring to an action library storing action data for controlling home electronic devices connected to the home network; and converting the series of actions included in the action list into a control signal and controlling the home electronic devices connected to the home network. According to the method and apparatus, user commands to home electronic devices connected to a complicated home network can be simplified such that home electronic devices are controlled conveniently and efficiently.
Type: Application
Filed: March 17, 2003
Publication date: October 2, 2003
Applicant: Samsung Electronics Co., Ltd.
Inventors: Jeong-mi Cho, Jay-woo Kim, Young-jin Hong, Jun-ho Park
-
Patent number: 6629069
Abstract: A speech recogniser is provided for identifying entries in a database. Results from the recognition of a user's speech are combined with each other and optionally with reference to data in the database in order to maximise the accuracy of an identified entry. An output is also provided which gives an indication of the likely accuracy of the identified entry.
Type: Grant
Filed: January 3, 2001
Date of Patent: September 30, 2003
Assignee: British Telecommunications a public limited company
Inventors: David John Attwater, Hilary Richard William Greenhow, Peter John Durston
-
Patent number: 6629075
Abstract: A speech recognition system includes a user interface configured to provide signals indicative of a user's speech. A speech recognizer of the system includes a processor configured to use the signals from the user interface to perform speech recognition operations to attempt to recognize speech indicated by the signals. A control mechanism is coupled to the voice recognizer and is configured to affect processor usage for speech recognition operations in accordance with a loading of the processor.
Type: Grant
Filed: June 9, 2000
Date of Patent: September 30, 2003
Assignee: SpeechWorks International, Inc.
Inventor: Johan Schalkwyk
-
Patent number: 6629077
Abstract: A universal remote control adapted to receive a voice input. The voice input is received by the remote control and compared to a plurality of voice command templates that are stored in the memory of the remote control. If the voice input matches one or more of the plurality of voice command templates, a valid voice input has been received by the remote control. Valid voice input may be a remote control command or keystroke data, input as an entire word or as individual characters. In response to a valid voice input, the remote control may transmit an operational command code and/or alphanumeric symbol code corresponding to keystroke data to a consumer electronic device.
Type: Grant
Filed: November 22, 2000
Date of Patent: September 30, 2003
Assignee: Universal Electronics Inc.
Inventors: Paul D. Arling, Patrick H. Hayes
-
Publication number: 20030182131
Abstract: An apparatus and a concomitant method for speech recognition. In one embodiment, a distributed speech recognition system provides speech-driven control and remote service access. The distributed speech recognition system comprises a client device and a central server, where the client device is equipped with two speech recognition modules: a foreground speech recognizer and a background speech recognizer. The foreground speech recognizer is implementing a particular spoken language application (SLA) to handle a particular task, whereas the background speech recognizer is monitoring a change in the topic and/or a change in the intent of the user. Upon detection of a change in topic or intent of the user, the background speech recognizer will effect the routing to a new SLA to address the new topic or intent.
Type: Application
Filed: March 25, 2002
Publication date: September 25, 2003
Inventors: James F. Arnold, Horacio E. Franco, David J. Israel
-
Publication number: 20030182130
Abstract: An apparatus for recording a meeting held in a room comprises a signal receiving unit, a CODEC unit, a data processing unit, an input control unit, a storage unit, and a power source. A process for recording the meeting comprises the steps of basic input control, state control, digital data processing, voice searching, voice processing, and file system management. With this, the voice of an individual who is making a speech during the meeting is located, amplified, and compressed prior to being recorded in a file of minutes. The minutes are stored in the storage unit in digital format so as to improve recording quality and facilitate reading and inquiry of the minutes.
Type: Application
Filed: March 22, 2002
Publication date: September 25, 2003
Applicant: Multisuns Corp.
Inventors: Bruce W.H. Sun, Chih Long Lin
-
Publication number: 20030182132
Abstract: The invention relates to a voice-controlled arrangement (1) comprising a plurality of devices to be controlled (3 to 9) and a mobile voice data entry unit (11) which is connected to said devices by a wireless communication link. At least some of the devices each have a device vocabulary memory (3a to 9a) and a vocabulary transmission unit (3b to 9b), and the voice data entry unit has selection means for selecting the vocabularies to be loaded according to the route destination.
Type: Application
Filed: February 28, 2003
Publication date: September 25, 2003
Inventor: Meinrad Niemoeller
-
Publication number: 20030177013
Abstract: Systems and methods are described for a speech system that includes one or more speech controls incorporated into one or more speech-enabled applications that run on the speech system. The controls allow applications to be developed with minimal programming effort to incorporate common speech-enabled application functions. A question control provides a customizable template for requesting information from a user. An announcer control allows a speech-enabled application to provide a user with information without having to re-create an entire announcer process each time it is used. A command control provides a simple way to attach command and control functions to speech-enabled applications. A word trainer control provides a way to associate user-selected voice tags with certain information. Providing the controls for use with speech-enabled applications ensures standardized user interfaces across multiple speech-enabled applications.
Type: Application
Filed: February 4, 2002
Publication date: September 18, 2003
Inventors: Stephen Russell Falcon, Clement Chun Pong Yip, Dan Banay, David Michael Miller
-
Publication number: 20030177012
Abstract: A thermostat according to the present invention is capable of capturing audio commands from a user for controlling HVAC systems. The thermostat can operate in training mode and control mode. In training mode, the user can tailor a list of predefined keywords with his voice and, in control mode, the user can change HVAC system settings. Optionally, the thermostat is equipped to interface with a remote unit. The user can speak audio commands into the remote unit, and the remote unit will transmit his commands to the thermostat.
Type: Application
Filed: March 13, 2002
Publication date: September 18, 2003
Inventor: Brett Drennan
-
Patent number: 6622122
Abstract: In a document retrieving apparatus, an audio input section converts a sound into a character pattern. A language model storing section stores likelihood information. A word choosing section obtains a word selection result based on the likelihood information. A retrieval condition producing section produces retrieval conditions based on the word selection result. A document storing section stores documents to be retrieved. And, a document retrieving section retrieves the documents based on the retrieval conditions. An effective document search can be performed regardless of the sentence recognition accuracy without requiring higher cost in collecting the required language data.
Type: Grant
Filed: February 24, 2000
Date of Patent: September 16, 2003
Assignee: Matsushita Electric Industrial Co., Ltd.
Inventors: Yoshio Fukushige, Hiroyuki Suzuki, Naohiko Noguchi, Hayashi Ito, Mitsuhiro Sato, Masaki Kiyono, Hideki Yasukawa
-
Patent number: 6622119
Abstract: A command prediction system for natural language understanding systems, in accordance with the present invention, includes a user interface for receiving commands from a user. A command predictor receives the commands from the user interface and predicts at least one next command which is likely to be presented by the user based on a command history. A probability calculator is included in the command predictor for determining a probability for each of the at least one next command based on the command history such that a list of predicted commands and their likelihood of being a next command are provided.
Type: Grant
Filed: October 30, 1999
Date of Patent: September 16, 2003
Assignee: International Business Machines Corporation
Inventors: Ganesh N. Ramaswamy, Jan Kleindienst
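
Illustrative note: the idea in the abstract of patent 6622119, predicting likely next commands with probabilities derived from a command history, can be approximated with simple successor counts. This is only a sketch under a strong assumption: it conditions on the single most recent command (a bigram model), which the abstract does not specify. The class name `CommandPredictor` and its methods are hypothetical.

```python
from collections import Counter, defaultdict


class CommandPredictor:
    """Toy next-command predictor based on counts of observed successors."""

    def __init__(self):
        # Maps a command to a Counter of the commands that followed it.
        self._next_counts = defaultdict(Counter)
        self._last = None

    def observe(self, command: str) -> None:
        """Record a command issued by the user, updating the history counts."""
        if self._last is not None:
            self._next_counts[self._last][command] += 1
        self._last = command

    def predict(self, top_n: int = 3):
        """Return up to top_n (command, probability) pairs, most likely first."""
        counts = self._next_counts.get(self._last)
        if not counts:
            return []
        total = sum(counts.values())
        return [(cmd, n / total) for cmd, n in counts.most_common(top_n)]
```

After observing "open mail" followed twice by "read next" and once by "delete", a call to `predict()` made right after another "open mail" ranks "read next" first with an estimated probability of 2/3.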
-
Publication number: 20030171928
Abstract: Systems and methods are described for speech systems that utilize an interaction manager to manage interactions (also known as dialogues) from one or more applications. The interactions are managed properly even if multiple applications use different grammars. The interaction manager maintains an interaction list. An application wishing to utilize the speech system submits one or more interactions to the interaction manager. Interactions are normally processed in the order in which they are received. An exception to this rule is an interaction that is configured by an application to be processed immediately, which causes the interaction manager to place the interaction at the front of the interaction list of interactions. If an application has designated an interaction to interrupt a currently processing interaction, then the newly submitted application will interrupt any interaction currently being processed and, therefore, it will be processed immediately.
Type: Application
Filed: February 4, 2002
Publication date: September 11, 2003
Inventors: Stephen Russel Falcon, Clement Chun Pong Yip, Dan Banay, David Michael Miller
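
Illustrative note: the ordering rules in the abstract of publication 20030171928 (first-in first-out by default, front-of-list for immediate interactions, preemption for interrupting ones) can be sketched with a double-ended queue. The names `Interaction` and `InteractionManager` are hypothetical, and resuming an interrupted interaction afterward is an assumption; the abstract does not say what happens to it.

```python
from collections import deque
from dataclasses import dataclass


@dataclass
class Interaction:
    name: str
    immediate: bool = False   # jump to the front of the interaction list
    interrupt: bool = False   # preempt whatever is currently processing


class InteractionManager:
    """Toy interaction list: FIFO by default, with two priority exceptions."""

    def __init__(self):
        self._pending = deque()
        self.current = None

    def submit(self, interaction: Interaction) -> None:
        if interaction.interrupt and self.current is not None:
            # Assumption: the interrupted interaction is requeued at the
            # front so that it resumes once the interrupter finishes.
            self._pending.appendleft(self.current)
            self.current = interaction
        elif interaction.immediate or interaction.interrupt:
            self._pending.appendleft(interaction)
        else:
            self._pending.append(interaction)

    def next(self):
        """Finish the current interaction and start the next one, if any."""
        self.current = self._pending.popleft() if self._pending else None
        return self.current
```

Submitting two ordinary interactions and then one marked `immediate=True` causes `next()` to return the immediate one first, followed by the others in arrival order.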
-
Publication number: 20030171931
Abstract: The present invention trains a user recognition model for a user. A user enrollment input is received and one or more cohort models are identified from a set of possible cohort models. The cohort models are identified based on a similarity measure between the set of possible cohort models and the user enrollment input. Once the cohort models have been identified, a user model is generated based on data associated with the identified cohort models.
Type: Application
Filed: March 11, 2002
Publication date: September 11, 2003
Inventor: Eric I-Chao Chang
-
Publication number: 20030171930
Abstract: User interaction with a secure resource is controlled or mediated by the security server that includes a telephony interface by which the server is either coupled to the telephone system or provides messages to the telephone system directly or through an intermediate component. A biometric data store stores biometric data, such as speech data or visual recognition data. If desired the biometric data may also be stored in association with the extension identifiers of the telephone system. A biometric verification/identification system accesses this data store and evaluates provided user biometric data vis-à-vis the stored biometric data to determine if the user may control or interact with the secure resource. If interaction is permitted, the security server sends control signals to the secure resource. The telephone system provides an interface through which the user trains the system to store the biometric verification/identification data of that user.
Type: Application
Filed: March 7, 2002
Publication date: September 11, 2003
Inventor: Jean-Claude Junqua
-
Publication number: 20030171929
Abstract: Systems and methods are described for a speech system that manages multiple grammars from one or more speech-enabled applications. The speech system includes a speech server that supports different grammars and different types of grammars by exposing several methods to the speech-enabled applications. The speech server supports static grammars that do not change and dynamic grammars that may change after a commit. The speech server provides persistence by supporting persistent grammars that enable a user to issue a command to an application even when the application is not loaded. In such a circumstance, the application is automatically launched and the command is processed. The speech server may enable or disable a grammar in order to limit confusion between grammars. Global and yielding grammars are also supported by the speech server. Global grammars are always active (e.g., “call 9-1-1”) while yielding grammars may be deactivated when an interaction whose grammar requires priority is active.
Type: Application
Filed: February 4, 2002
Publication date: September 11, 2003
Inventors: Steve Russel Falcon, Clement Chun Pong Yip, David Michael Miller, Dan Banay
-
Publication number: 20030167174
Abstract: An audio recorder-player includes M tuners that generate N audio signals transmitted by N audio sources, an analyzer that extracts R×N audio signal characteristics from the N audio signals, a memory that stores the R×N audio signal characteristics, and output circuitry that reproduces an audio signal corresponding to one of the N audio signals responsive to selection of at least one of the R×N audio signal characteristics, where R is a positive integer and M and N are positive integers greater than 1. If desired, the audio recorder-player advantageously can be included in one of a radio, a computer, or a set-top box. Methods for operating the audio recorder-player are also described.
Type: Application
Filed: March 1, 2002
Publication date: September 4, 2003
Applicant: KONINLIJKE PHILIPS ELECTRONICS N.V.
Inventors: Serhan Dagtas, Nevenka Dimitrova
-
Publication number: 20030167171
Abstract: A method and apparatus is disclosed for remotely processing voice commands for controlling a television. A voice command is uttered by a user into a microphone contained in a remote control. The voice command is digitized, modulated, compressed, and wirelessly transmitted to a wireless receiver connected to a set-top box. The voice command is then transmitted to a cable head-end unit for voice and word recognition processing. Once the command function is determined, the function is transmitted back to the set-top box where the set-top box performs the command. The microphone is activated and deactivated by pressing and releasing a push-to-talk (PTT) switch. The PTT activates other functions by being turned, double-clicked and toggled up and down, left and right.
Type: Application
Filed: January 7, 2003
Publication date: September 4, 2003
Inventors: Theodore Calderone, Mark J. Foster, Harry William Printz, James Jay Kistler
-
Patent number: 6615172
Abstract: An intelligent query system for processing voice-based queries is disclosed. This distributed client-server system, typically implemented on an intranet or over the Internet, accepts a user's queries at his/her computer, PDA or workstation using a speech input interface. After converting the user's query from speech to text, a 2-step algorithm employing a natural language engine, a database processor and a full-text SQL database is implemented to find a single answer that best matches the user's query. The system, as implemented, accepts environmental variables selected by the user and is scalable to provide answers to a variety and quantity of user-initiated queries.
Type: Grant
Filed: November 12, 1999
Date of Patent: September 2, 2003
Assignee: Phoenix Solutions, Inc.
Inventors: Ian M. Bennett, Bandi Ramesh Babu, Kishor Morkhandikar, Pallaki Gururaj
-
Patent number: 6615175
Abstract: An information and control system for personnel transport devices. In one embodiment, the information and control system is coupled to the elevator system of a building, and includes a touch panel input device, a flat panel display having a touch sensitive screen, and speech recognition and synthesis systems serving each elevator car. The speech recognition and synthesis systems and input device(s) are operatively coupled to a processor and storage devices having a plurality of different types of data stored thereon. Each elevator car is also a client connected to a LAN, WAN, intranet, or Internet, and capable of exchanging data with and retrieving data therefrom. Functions performed by the information and control system include a voice-actuated building directory, download of selected data to personal electronic devices (PEDs), monitoring of areas adjacent to the elevator car on destination floors, and control of lighting and security monitoring in selectable areas of destination floors.
Type: Grant
Filed: June 10, 1999
Date of Patent: September 2, 2003
Inventor: Robert F. Gazdzinski
-
Patent number: 6615176
Abstract: A method for speech enabling labeless controls in an existing graphical user interface can comprise the steps of: identifying controls in a window contained in the graphical user interface; testing each identified control for an associated label; for each identified control having an associated label, adding the associated label to an active grammar of a speech recognition system; for each identified control not having an associated label, creating a label based upon an object property of a contextually relevant user interface object; and, further adding each created label to the active grammar. In testing each identified control for an associated label, an accessibility interface query can be applied to each identified control in the window. In addition, in creating the label, each contextually relevant object can be searched for an object property descriptive of the identified control not having an associated label.
Type: Grant
Filed: July 13, 1999
Date of Patent: September 2, 2003
Assignee: International Business Machines Corporation
Inventors: James R. Lewis, Linda M. Boyer, Ji Whee Tan
-
Publication number: 20030163326
Abstract: A ventilator hood includes a voice operating unit with a microphone. The microphone is a spatially selective sound pickup.
Type: Application
Filed: February 27, 2003
Publication date: August 28, 2003
Inventor: Jens Maase
-
Publication number: 20030163325
Abstract: An electrical household appliance, in particular, a ventilator hood, includes a voice operating unit having a microphone and a voice recognition unit disposed downstream of the latter. The voice operating unit is characterized by a data memory in which voice reference data stored as electronic data are present. The voice operating unit can be calibrated using the evaluation result based upon a comparison between voice data picked up by the microphone and the voice reference data. Also provided are methods for testing a voice operating unit and for initializing a voice operating unit in the appliance.
Type: Application
Filed: February 27, 2003
Publication date: August 28, 2003
Inventor: Jens Maase
-
Publication number: 20030163324
Abstract: Computer-based voice command recognition for wirelessly controlling pre-existing devices (fans, lamps, etc.) in homes, factories, offices, etc., using protocol-based communication between a single transmitter attached to the serial port of the computer and a receiver module attached to each device to be controlled.
Type: Application
Filed: February 27, 2002
Publication date: August 28, 2003
Inventor: Asim Hussain Abbasi
-
Publication number: 20030163323
Abstract: A system and method for enabling audio comments to be used when writing and executing code, during design time and run time. A code writer is hereby enabled to simultaneously write code and compose voice comments. These comments, divided into help comments, test items and variable comments, are subsequently recorded, stored, analyzed, prescribed and displayed using text to speech and voice recognition software.
Type: Application
Filed: February 22, 2002
Publication date: August 28, 2003
Applicant: Maxima Blue Ltd.
Inventor: Max Bluvband
-
Publication number: 20030158739
Abstract: A speech navigation system provides for speech navigation of a voice mail system. Upon establishment of a communication link between the speech navigation system and the voice mail system, the speech navigation system may receive a voice command. Upon receiving a voice command, the speech navigation system associates the voice command to at least one keypad character. The speech navigation system then provides a telephone dialing tone, which corresponds to the at least one associated keypad character, to the voice mail system.
Type: Application
Filed: February 15, 2002
Publication date: August 21, 2003
Inventors: Peter A. Moody, Mark A. Cummings-Hill
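The mapping from a recognized command to keypad characters and then to dialing tones might be sketched as follows. The command vocabulary in COMMAND_TO_KEYS is hypothetical (the abstract does not say which commands map to which keys), and the sketch assumes the dialing tones are standard DTMF, which the abstract does not state. The row and column frequencies are the standard DTMF values.

```python
from typing import List, Tuple

# Standard DTMF layout: each key is a (row Hz, column Hz) frequency pair.
KEYPAD_ROWS = ("123", "456", "789", "*0#")
ROW_HZ = (697, 770, 852, 941)
COL_HZ = (1209, 1336, 1477)

# Hypothetical voice mail vocabulary; a real system would define its own.
COMMAND_TO_KEYS = {"play": "1", "delete": "7", "save": "9", "main menu": "*"}


def dtmf_pair(key: str) -> Tuple[int, int]:
    """Return the (low, high) tone frequencies in Hz for one keypad character."""
    if len(key) != 1:
        raise ValueError("expected a single keypad character")
    for row_index, row in enumerate(KEYPAD_ROWS):
        col_index = row.find(key)
        if col_index != -1:
            return (ROW_HZ[row_index], COL_HZ[col_index])
    raise ValueError(f"not a keypad character: {key!r}")


def tones_for_command(command: str) -> List[Tuple[int, int]]:
    """Associate a recognized command with keypad characters, then with tones."""
    keys = COMMAND_TO_KEYS[command.strip().lower()]
    return [dtmf_pair(key) for key in keys]


delete_tones = tones_for_command("Delete")
```

An unrecognized command raises KeyError in this sketch; a deployed system would more likely re-prompt the caller.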
-
Publication number: 20030158738
Abstract: A method and apparatus for processing travel-related speech input is presented. A travel server receives a speech input corresponding to a travel-related task. The travel server then converts the speech input into data reflecting the travel-related task and accesses a database for stored information corresponding to the travel-related task. This stored information is returned to the source of the speech input.
Type: Application
Filed: November 1, 1999
Publication date: August 21, 2003
Inventors: CAROLYN CROSBY, KEVIN BOMAR
-
Publication number: 20030154085
Abstract: Systems and methods for automatically recognizing and processing verbal data associated with medical treatment diagnosis and related billing, compliance and reporting procedures in the healthcare industry are provided. A method for completing and generating a claim form comprises providing an interactive voice interface for receiving and analyzing one or more verbal inputs to a computing system. The system interprets the one or more verbal inputs to fill one or more corresponding fields in an electronic form. The system continues to receive and analyze verbal inputs until the electronic form is completed. The system associates the content of the electronic form with one or more industry codes. The industry codes identify at least the nature of one or more services associated with the one or more verbal inputs. The system then arranges the codes in a predetermined manner to generate a claim for reimbursement that can be processed by a medical claims processing facility.
Type: Application
Filed: February 8, 2002
Publication date: August 14, 2003
Applicant: ONEVOICE MEDICAL CORPORATION
Inventor: Fredrick M. Kelley
-
Patent number: 6606373
Abstract: There is disclosed a non-realtime messaging system that delivers a subscriber message index to a subscriber's pager device. The subscriber message index is a condensed summary of one or more of the messages directed to the subscriber. The messaging system comprises: 1) a messaging controller for receiving oral messages directed to a subscriber and for transmitting text messages to the subscriber's pager; 2) a translating controller for generating translated text messages, wherein each of the translated text messages corresponds to one of the received oral messages; 3) a data repository capable of storing the translated text messages; and 4) a summary index controller for generating from the translated text messages the subscriber message index.
Type: Grant
Filed: June 14, 1999
Date of Patent: August 12, 2003
Assignee: WebLink Wireless, Inc.
Inventor: Larry J. Martin
-
Patent number: 6606599
Abstract: According to the present invention, a method for integrating processes with a multi-faceted human centered interface is provided. The interface is facilitated to implement a hands free, voice driven environment to control processes and applications. A natural language model is used to parse voice initiated commands and data, and to route those voice initiated inputs to the required applications or processes. The use of an intelligent context based parser allows the system to intelligently determine what processes are required to complete a task which is initiated using natural language. A single window environment provides an interface which is comfortable to the user by preventing distracting windows from appearing. The single window has a plurality of facets which allow distinct viewing areas. Each facet has an independent process routing its outputs thereto. As other processes are activated, each facet can reshape itself to bring a new process into one of the viewing areas.
Type: Grant
Filed: March 12, 2001
Date of Patent: August 12, 2003
Assignee: Interactive Speech Technologies, LLC
Inventors: Richard Grant, Peter McGregor
-
Patent number: 6606597
Abstract: A language model comprising a plurality of augmented-word n-grams and probabilities corresponding to such n-grams. Each n-gram is comprised of a sequence of augmented words. Each augmented word is comprised of the orthographic representation of the word together with a tag representing lexical information regarding the word, such as syntactic or semantic information. Also disclosed are a method of building such a language model, a method of automatically recognizing speech using the language model and a speech recognition system that employs the language model.
Type: Grant
Filed: September 8, 2000
Date of Patent: August 12, 2003
Assignee: Microsoft Corporation
Inventors: Eric K. Ringger, Lucian Galescu
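As a rough illustration of an augmented-word n-gram model, the sketch below trains a bigram model over (word, tag) pairs using plain maximum-likelihood estimates. The tag set, the toy corpus, the start-of-sentence token, and the lack of smoothing are all assumptions made for the example; the abstract does not specify how the patented model is built.

```python
from collections import Counter
from typing import Dict, List, Tuple

AugmentedWord = Tuple[str, str]  # (orthographic form, lexical tag)
Bigram = Tuple[AugmentedWord, AugmentedWord]

START: AugmentedWord = ("<s>", "BOS")  # assumed sentence-start marker


def train_bigram_model(sentences: List[List[AugmentedWord]]) -> Dict[Bigram, float]:
    """Estimate P(current augmented word | previous augmented word) by counting."""
    bigram_counts: Counter = Counter()
    history_counts: Counter = Counter()
    for sentence in sentences:
        tokens = [START] + list(sentence)
        for previous, current in zip(tokens, tokens[1:]):
            bigram_counts[(previous, current)] += 1
            history_counts[previous] += 1
    return {
        bigram: count / history_counts[bigram[0]]
        for bigram, count in bigram_counts.items()
    }


# Toy corpus: "record" appears once as a noun and once as a verb.
corpus = [
    [("set", "VERB"), ("a", "DET"), ("record", "NOUN")],
    [("record", "VERB"), ("a", "DET"), ("message", "NOUN")],
]
model = train_bigram_model(corpus)
```

Because the tag is part of each token, ("record", "NOUN") and ("record", "VERB") receive separate statistics, which is the distinction a word-only n-gram model cannot make.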
-
Patent number: 6606598
Abstract: A method and apparatus are disclosed for computing and reporting statistical information that describes the performance of an interactive speech application. The interactive speech application is developed and deployed for use by one or more callers. During execution, the interactive speech application stores, in a log, event information that describes each task carried out by the interactive speech application in response to interaction with the one or more callers. After the log is established, an analytical report is displayed. The report describes selective actions taken by the interactive speech application while executing, and selective actions taken by one or more callers while interacting with the interactive speech application. Information in the analytical report is selected so as to identify one or more potential performance problems in the interactive speech application. The analytical reports are generated based on the information stored in the event logs.
Type: Grant
Filed: September 21, 1999
Date of Patent: August 12, 2003
Assignee: SpeechWorks International, Inc.
Inventors: Mark A. Holthouse, Matthew T. Marx, John N. Nguyen
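One way to picture the log-to-report step is a small aggregation over logged events. The event format (task name plus outcome string), the outcome values, and the 0.5 failure-rate threshold are hypothetical choices for this sketch; the abstract does not describe the actual log schema or the statistics reported.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

Event = Tuple[str, str]  # (task name, outcome), an assumed log record shape


def failure_rates(events: Iterable[Event]) -> Dict[str, float]:
    """Compute, per task, the fraction of logged attempts that did not succeed."""
    attempts: Dict[str, int] = defaultdict(int)
    failures: Dict[str, int] = defaultdict(int)
    for task, outcome in events:
        attempts[task] += 1
        if outcome != "ok":
            failures[task] += 1
    return {task: failures[task] / attempts[task] for task in attempts}


def problem_tasks(events: Iterable[Event], threshold: float = 0.5) -> List[str]:
    """List tasks whose failure rate meets the threshold, as report candidates."""
    rates = failure_rates(events)
    return sorted(task for task, rate in rates.items() if rate >= threshold)


log = [
    ("get_city", "ok"),
    ("get_city", "no_match"),
    ("get_city", "timeout"),
    ("confirm", "ok"),
]
flagged = problem_tasks(log)
```

In this toy log, "get_city" fails two times out of three and is flagged as a potential performance problem, while "confirm" is not.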
-
Publication number: 20030149569
Abstract: The present invention provides a method and apparatus for generating an animated character representation. This is achieved by using marked-up data including both content data and presentation data. The system then uses this information to generate phoneme and viseme data representing the speech to be presented by the character. Providing the presentation data ensures that at least some variation in character appearance will automatically occur beyond that of the visemes required to make the character appear to speak. This contributes to the character having a far more lifelike appearance.
Type: Application
Filed: February 11, 2003
Publication date: August 7, 2003
Inventors: Jonathan Simon Jowitt, William James Cooper, Andrew Robert Burgess
-
Publication number: 20030149568
Abstract: A system and terminal for facilitating a “virtual presence” allows users on a communication network to simply begin speaking through other users. A system immediately detects the destination party's name, and begins routing the audio signal to a particular destination without any noticeable call set-up. Additionally, the system performs pitch corrected speed control in order to allow the detection and processing of a speech pattern without causing delay to an end user.
Type: Application
Filed: January 6, 2003
Publication date: August 7, 2003
Inventor: Howard Bubb