Patents by Inventor Kerry Ortega

Kerry Ortega has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Method and apparatus for transcribing multiple files into a single document

Patent number: 6535848

Abstract: A transcription system (100, 200) includes multiple recording devices (110, 210) that individually record and store (516), into multiple files, digital data representing speech uttered by multiple speakers. In a preferred embodiment, time stamps are stored (514) along with the speech. A transcription computer (120, 230) enables a user to select (602) which of multiple files the user would like to have transcribed, and to associate (604) a speaker ID to each of the multiple files. The transcription computer then transcribes (1006) phrases within the multiple files, and stores (1008) those phrases in a sequential order, based on the time stamps. The user may also cause an offset time for each file to be adjusted (606, 916), thus affecting the ultimate sequential order of the transcribed phrases. After transcription, the user may edit (1104) the time stamps, speaker IDs, and/or phrases.

Type: Grant

Filed: June 8, 1999

Date of Patent: March 18, 2003

Assignee: International Business Machines Corporation

Inventors: Kerry A. Ortega, James R. Lewis, Ronald Vanbuskirk, Huifang Wang
Method for hands-free operation of a pointer

Patent number: 6519566

Abstract: The method of the invention involves a plurality of steps including, defining a set of user voice commands for hands-free control of a pointer and, in response to receiving a first audio input recognized as one of the set of user voice commands, initiating motion of the pointer in a direction indicated by the user voice command. Subsequently, in response to receiving a second audio input, the pointer motion can be discontinued. Finally, in response to receiving one or more subsequent audio inputs not recognized as being among the set of user voice commands, the pointer can be incrementally moved responsive to the subsequent audio inputs.

Type: Grant

Filed: March 1, 2000

Date of Patent: February 11, 2003

Assignee: International Business Machines Corporation

Inventors: Linda M. Boyer, James R. Lewis, Kerry A. Ortega, Ji Wee Tan
Method and apparatus for evaluating the accuracy of a speech recognition system

Patent number: 6507816

Abstract: A method and system for evaluating the accuracy of a computer speech recognition system counts and indexes the total number of words dictated and the number of words corrected. The corrections are tallied after being made in a correction window and include words contained in an alternative list as well as words input by the user and within a stored word database. A processor calculates the approximate accuracy of the speech recognition system as the ratio of the number of correct words to the total number of words dictated. An accuracy ratio is calculated for each dictation session and an overall ratio is calculated for all sessions combined. The system also keeps individual and overall indexes of the number of times the corrected words were in alternate lists or not within the word database and uses these indexes to calculate additional accuracy values.

Type: Grant

Filed: May 4, 1999

Date of Patent: January 14, 2003

Assignee: International Business Machines Corporation

Inventor: Kerry A. Ortega
Method and system for automatically adjusting prompt feedback based on predicted recognition accuracy

Patent number: 6505155

Abstract: In a computer speech user interface, a method and computer apparatus for automatically adjusting the content of feedback in a responsive prompt based upon predicted recognition accuracy by a speech recognizer. The method includes the steps of receiving a user voice command from the speech recognizer; calculating present speech recognition accuracy based upon the received user voice command; predicting future recognition accuracy based upon the calculated present speech recognition accuracy; and, generating feedback in a responsive prompt responsive to the predicted recognition accuracy. For predicting future poor recognition accuracy based upon poor present recognition accuracy, the calculating step can include monitoring the received user voice command; detecting a reduced accuracy condition in the monitored user voice command; and, determining poor present recognition accuracy if the reduced accuracy condition is detected in the detecting step.

Type: Grant

Filed: May 6, 1999

Date of Patent: January 7, 2003

Assignee: International Business Machines Corporation

Inventors: Ronald Vanbuskirk, Huifang Wang, Kerry A. Ortega, Catherine G. Wolf
Providing a location and item identification data to visually impaired shoppers in a site having barcode labels

Patent number: 6497367

Abstract: A portable unit assists a visually impaired user within a store by providing an output, using speech synthesis, of his location based on reading various barcode labels. The location of each barcode label is determined from data stored within the portable unit. The portable unit also determines a path between the user's location and an item he selects to find, describing the path using speech synthesis. The user can select, by speech or by depressing a button, items for a target list. Preferably, some barcode labels identify an end of an aisle, which cause the portable unit to describe, using speech synthesis, items on the aisle and items in the target list on the aisle.

Type: Grant

Filed: April 26, 2001

Date of Patent: December 24, 2002

Assignee: International Business Machines Corporation

Inventors: Vincent Charles Conzola, Aaron Roger Cox, Kerry A Ortega, Thomas John Sluchak
METHOD AND APPARATUS FOR EVALUATING THE ACCURACY OF A SPEECH RECOGNITION SYSTEM

Publication number: 20020177999

Abstract: A method and system for evaluating the accuracy of a computer speech recognition system counts and indexes the total number of words dictated and the number of words corrected. The corrections are tallied after being made in a correction window and include words contained in an alternative list as well as words input by the user and within a stored word database. A processor calculates the approximate accuracy of the speech recognition system as the ratio of the number of correct words to the total number of words dictated. An accuracy ratio is calculated for each dictation session and an overall ratio is calculated for all sessions combined. The system also keeps individual and overall indexes of the number of times the corrected words were in alternate lists or not within the word database and uses these indexes to calculate additional accuracy values.

Type: Application

Filed: May 4, 1999

Publication date: November 28, 2002

Inventor: KERRY A. ORTEGA
Off site voice enrollment on a transcription device for speech recognition

Patent number: 6477493

Abstract: A method and system for use with a computer recognition system to enroll a user. The method involves a series of steps. The invention provides a user with an enrollment script. The invention then receives a recording made with a transcription device of a dictation session in which the user has dictated at least a portion of the enrollment script. Additionally, the invention can enroll the user in the speech recognition system by decoding the recording and training the speech recognition system.

Type: Grant

Filed: July 15, 1999

Date of Patent: November 5, 2002

Assignee: International Business Machines Corporation

Inventors: Brian S. Brooks, Waltraud Brunner, Carmi Gazit, Arthur Keller, Antonio R. Lee, Thomas Netousek, Kerry A. Ortega
METHOD AND SYSTEM FOR DETERMINING AVAILABLE AND ALTERNATIVE SPEECH COMMANDS

Publication number: 20020161584

Abstract: A method and system for use with a computer speech recognition system to efficiently identify valid system commands to users. The method involves a series of steps including: receiving data representative of a speech recognition system user input; comparing the data to a grammar defined for the speech recognition system to determine whether the data is representative of a user input which is a valid system command; and notifying the user as to whether the data is representative of a valid system command. The process can also involve the additional steps of determining a functional expression for the data; and comparing the functional expression to a set of all functional expressions permitted in the grammar to identify any alternate user inputs for producing the functional expression.

Type: Application

Filed: April 13, 1999

Publication date: October 31, 2002

Inventors: JAMES R. LEWIS, KERRY ORTEGA
PROVIDING A LOCATION AND ITEM IDENTIFICATION DATA TO VISUALLY IMPAIRED SHOPPERS IN A SITE HAVING BARCODE LABELS

Publication number: 20020158133

Abstract: A portable unit assists a visually impaired user within a store by providing an output, using speech synthesis, of his location based on reading various barcode labels. The location of each barcode label is determined from data stored within the portable unit. The portable unit also determines a path between the user's location and an item he selects to find, describing the path using speech synthesis. The user can select, by speech or by depressing a button, items for a target list. Preferably, some barcode labels identify an end of an aisle, which cause the portable unit to describe, using speech synthesis, items on the aisle and items in the target list on the aisle.

Type: Application

Filed: April 26, 2001

Publication date: October 31, 2002

Applicant: International Business Machines Corporation

Inventors: Vincent Charles Conzola, Aaron Roger Cox, Kerry A. Ortega, Thomas John Sluchak
Method to aid in sizing graphical user interface touch controls

Publication number: 20020130847

Abstract: A software tool acquires data on the position of touches relative to controls on a particular window or dialog box using touch screen controls. Subjects or users log on and the time the bring a dialog box into focus is time stamped. The coordinates of each touch is recorded along with a time stamp of the touches. The test sessions are saved in a data file identifying the subject. The software tool is used by a User Interface designer to design controls for a touch screen application. The acquired data is played back graphically to the designer where each touch appears as a dot or like indication over laid over a representation of the dialog box to which it relates. The acquired data may be displayed in various ways including composite, realtime or in a single dialog box or a single touch. Previous touch data may be kept or discarded giving the UI designer many options of how to analyze the dat to determine optimum size and placement of controls.

Type: Application

Filed: March 14, 2001

Publication date: September 19, 2002

Applicant: International Business Machines Corporation

Inventors: Vincent Charles Conzola, Kerry A. Ortega
Method and system for the visually impaired to navigate a route through a facility

Publication number: 20020128765

Abstract: A system and method of the type for aiding a user in navigating a route through a facility so as too efficiently locate specific items within a facility is provided. The system includes a facility processor having a database and software stored thereon for mapping an interactive route from selected location to selected location within a facility, a label located proximate individual items, the label electronically communicating information specific to the item it is associated with, and a digital device having the interactive route electronically stored thereon, the digital device electronically communicating with the facility processor and the labels for tracking movement of the digital device along the route via communication with the labels and communicating a direction to move to follow the route.

Type: Application

Filed: March 9, 2001

Publication date: September 12, 2002

Applicant: International Business Machines Corporation

Inventors: Robert Thomas Cato, Kerry A. Ortega, Thomas John Sluchak
Method for preserving contextual accuracy in an extendible speech recognition language model

Publication number: 20020116194

Abstract: A method of generating language model statistics for a new word added to a language model incorporating at least one class file containing contextually related words. The method can include the following steps: First, language model statistics can be computed based on references to at least one incorporated class file. Second, a new word can be substituted for each reference to a selected class file. Additionally, the language model statistics can be re-computed based on the new word having been substituted for the reference. Third, the re-computed language model statistics can be displayed in a user interface and modifications can be accepted to the re-computed language model statistics through the user interface. Fourth, the language model statistics can be further re-computed based on the modifications. In consequence, the language model statistics are re-computed for the new word without introducing contextual inaccuracies in the language model.

Type: Application

Filed: February 21, 2001

Publication date: August 22, 2002

Applicant: International Business Machines Corporation

Inventors: James R. Lewis, Kerry A. Ortega, C. Thomas Rutherfoord, Maria E. Smith
Speech recognition enrollment for non-readers and displayless devices

Publication number: 20020091519

Abstract: A method for enrolling a user in a speech recognition system, without requiring reading, comprises the steps of: generating an audio user interface having an audible output and an audio input; audibly playing a text phrase; audibly prompting the user to speak the played phrase; repeating the steps of audibly prompting the user not to speak, audibly playing the phrase and audibly prompting the user to speak, for a plurality of further phrases; and, processing enrollment of the user based on the audibly prompted and subsequently spoken phrases. A graphical user interface can also be generated for: displaying text corresponding to the phrases and to the audible prompts; displaying a plurality of icons for user activation; and, selectively distinguishing different ones of the icons at different times by at least one of: color; shape; and, animation.

Type: Application

Filed: July 2, 2001

Publication date: July 11, 2002

Applicant: International Business Machines Corporation

Inventors: James R. Lewis, Huifang Wang, Ron Van Buskirk, Kerry A. Ortega
Smart correction of dictated speech

Patent number: 6418410

Abstract: In a speech recognition system, a method and system for updating a language model during a correction session can include automatically comparing dictated text to replacement text, determining if the replacement text is on an alternative word list if the comparison is close enough to indicate that the replacement text represents correction of a mis-recognition error rather than an edit, and updating the language model without user interaction if the replacement text is on the alternative word list. If the replacement text is not on the alternative word list, a comparison is made between dictated word digital information and replacement word digital information, and the language model is updated if the digital comparison is close enough to indicate that the replacement text represents correction of a mis-recognition error rather than an edit.

Type: Grant

Filed: September 27, 1999

Date of Patent: July 9, 2002

Assignee: International Business Machines Corporation

Inventors: Amado Nassiff, Kerry A. Ortega
METHOD AND APPARATUS FOR RECOGNIZING FROM HERE TO HERE VOICE COMMAND STRUCTURES IN A FINITE GRAMMAR SPEECH RECOGNITION SYSTEM

Publication number: 20020059071

Abstract: A method and system uses a finite state command grammar coordinated with application scripting to recognize voice command structures for performing an event from an initial location to a new location. The method involves a series of steps, including: recognizing an enabling voice command specifying the event to be performed from the initial location; determining a functional expression for the enabling voice command defined by one or more actions and objects; storing the action and object in a memory location; receiving input specifying the new location; recognizing an activating voice command for performing the event up to the new location; retrieving the stored action and object from the memory location; and performing the event from the initial location to the new location according to the retrieved action and object. Preferably, the enabling-activating command is phrased as “from here . . . to here”.

Type: Application

Filed: June 16, 1999

Publication date: May 16, 2002

Inventors: JAMES R. LEWIS, KERRY A. ORTEGA, MARIA E. SMITH, THOMAS A. KIST, LINDA M. BOYER
Method and apparatus for improving speech recognition accuracy

Patent number: 6370503

Abstract: A transcription system (100) includes a computer (102), a monitor (104), and a microphone (110). Via the microphone, a user of the system provides input speech that is received and transcribed (204) by the system. The system monitors (205) the accuracy of the transcribed speech during transcription. The system also determines (210) whether the accuracy of the transcribed speech is sufficient and, if not, automatically activates (214) a speech recognition improvement tool and alerts (212) the user that the tool has been activated. This tool could also be manually activated (206) by the user. The type of recognition problem is identified (216) by the user or automatically by the system, and the system provides (218) possible solution steps for enabling the user to adjust (219) system parameters or modify user behavior in order to alleviate the recognition problem.

Type: Grant

Filed: June 30, 1999

Date of Patent: April 9, 2002

Assignee: International Business Machines Corp.

Inventors: Kerry A. Ortega, Hans Egger, Arthur Keller, Ronald E. Vanbuskirk, Huifang Wang, James R. Lewis
Method and apparatus for activating and deactivating auxiliary topic libraries in a speech dictation system

Patent number: 6360201

Abstract: A dictation system (100) performs a method of dictating speech which automatically activates (502) and deactivates (306, 408, 510) auxiliary topic libraries based on the input speech. After receiving (206) input speech, the method searches (208, 214) a general library and topic libraries that are currently active, if any. The method also searches (220, 226) all or portions of inactive topic libraries. If the spoken word is recognized in a particular inactive topic library, the method automatically activates (502) that topic library. In a preferred embodiment, the method maintains an adjustable “score” for each active topic library. An active library's score is increased (402) each time a word is recognized in the library, and decreased (302, 404, 506) when a word is recognized in another library. If the score falls below a certain threshold, the active topic library is automatically deactivated (306, 408, 510).

Type: Grant

Filed: June 8, 1999

Date of Patent: March 19, 2002

Assignee: International Business Machines Corp.

Inventors: James R. Lewis, Kerry A. Ortega, Ronald Vanbuskirk, Huifang Wang
Automatic analysis of a speech dictated document

Patent number: 6345249

Abstract: A method for automatically analyzing a document in a speech recognition system having a vocabulary and language model can include the steps of: determining whether the document has undergone previous analysis; undoing the previous analysis; and, analyzing the document. More specifically, the determining step comprises the steps of: comparing trigrams in the document with trigrams in the language model; and, setting a reference point containing document data for undoing a previous analysis in the undoing step if the compared language model contains all the document trigrams. Moreover, the undoing step comprises the step of removing from the language model each trigram contained in the document data in the reference point.

Type: Grant

Filed: July 7, 1999

Date of Patent: February 5, 2002

Assignee: International Business Machines Corp.

Inventors: Kerry A. Ortega, Kris A. Coe, Steven J. Friedland, Burn L. Lewis, Maria E. Smith
Method and apparatus for improving speech command recognition accuracy using event-based constraints

Patent number: 6345254

Abstract: A method and system for improving the speech command recognition accuracy of a computer speech recognition system uses event-based constraints to recognize a spoken command. The constraints are system states and events, which include system activities, active applications, prior commands and an event queue. The method and system is performed by monitoring events and states of the computer system and receiving a processed command corresponding to the spoken command. The processed command is statistically analyzed in light of the system events and states as well as according to an acoustic model. The system then identifies a recognized command corresponding to the spoken command.

Type: Grant

Filed: May 29, 1999

Date of Patent: February 5, 2002

Assignee: International Business Machines Corp.

Inventors: James R. Lewis, Kerry A. Ortega, Ronald E. Van Buskirk, Huifang Wang, Amado Nassiff, Barbara E. Ballard
Method and apparatus for improving speech recognition accuracy

Publication number: 20020013709

Abstract: A transcription system (100) includes a computer (102), a monitor (104), and a microphone (110). Via the microphone, a user of the system provides input speech that is received and transcribed (204) by the system. The system monitors (205) the accuracy of the transcribed speech during transcription. The system also determines (210) whether the accuracy of the transcribed speech is sufficient and, if not, automatically activates (214) a speech recognition improvement tool and alerts (212) the user that the tool has been activated.

Type: Application

Filed: September 21, 2001

Publication date: January 31, 2002

Applicant: International Business Machines Corporation

Inventors: Kerry A. Ortega, Hans Egger, Arthur Keller, Ronald E. Van Buskirk, Huifang Wang, James R. Lewis

prev 1 2 3 4 5 next