Patents by Inventor Kerry Ortega
Kerry Ortega has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 6535848
Abstract: A transcription system (100, 200) includes multiple recording devices (110, 210) that individually record and store (516), into multiple files, digital data representing speech uttered by multiple speakers. In a preferred embodiment, time stamps are stored (514) along with the speech. A transcription computer (120, 230) enables a user to select (602) which of multiple files the user would like to have transcribed, and to associate (604) a speaker ID to each of the multiple files. The transcription computer then transcribes (1006) phrases within the multiple files, and stores (1008) those phrases in a sequential order, based on the time stamps. The user may also cause an offset time for each file to be adjusted (606, 916), thus affecting the ultimate sequential order of the transcribed phrases. After transcription, the user may edit (1104) the time stamps, speaker IDs, and/or phrases.
Type: Grant
Filed: June 8, 1999
Date of Patent: March 18, 2003
Assignee: International Business Machines Corporation
Inventors: Kerry A. Ortega, James R. Lewis, Ronald Vanbuskirk, Huifang Wang
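The time-stamp ordering described in the abstract above can be illustrated with a minimal sketch. It is not the patented implementation: the `Phrase` class, the `merge_transcripts` function, and the dictionary-based input layout are hypothetical names chosen for illustration, and the per-file offset is assumed to be a simple number of seconds added to every time stamp in that file.

```python
from dataclasses import dataclass


@dataclass
class Phrase:
    speaker_id: str   # speaker ID the user associated with the source file
    timestamp: float  # offset-adjusted time in seconds
    text: str         # transcribed phrase


def merge_transcripts(files, offsets=None):
    """Merge per-speaker phrase lists into one sequence ordered by time.

    files:   dict mapping speaker_id -> list of (timestamp, text) tuples
    offsets: dict mapping speaker_id -> seconds to add to that file's
             time stamps (assumed representation of the offset adjustment)
    """
    offsets = offsets or {}
    merged = []
    for speaker_id, phrases in files.items():
        shift = offsets.get(speaker_id, 0.0)
        for timestamp, text in phrases:
            merged.append(Phrase(speaker_id, timestamp + shift, text))
    # sorted() is stable, so phrases with equal adjusted times keep input order.
    return sorted(merged, key=lambda p: p.timestamp)


files = {
    "alice": [(0.0, "Shall we start?"), (6.0, "Good.")],
    "bob": [(1.0, "Yes, ready.")],
}
transcript = merge_transcripts(files, offsets={"bob": 2.0})
```

Changing the offset for one file changes where its phrases land in the merged sequence, which is the effect the abstract attributes to the offset adjustment.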
-
Patent number: 6519566
Abstract: The method of the invention involves a plurality of steps including, defining a set of user voice commands for hands-free control of a pointer and, in response to receiving a first audio input recognized as one of the set of user voice commands, initiating motion of the pointer in a direction indicated by the user voice command. Subsequently, in response to receiving a second audio input, the pointer motion can be discontinued. Finally, in response to receiving one or more subsequent audio inputs not recognized as being among the set of user voice commands, the pointer can be incrementally moved responsive to the subsequent audio inputs.
Type: Grant
Filed: March 1, 2000
Date of Patent: February 11, 2003
Assignee: International Business Machines Corporation
Inventors: Linda M. Boyer, James R. Lewis, Kerry A. Ortega, Ji Wee Tan
-
Patent number: 6507816
Abstract: A method and system for evaluating the accuracy of a computer speech recognition system counts and indexes the total number of words dictated and the number of words corrected. The corrections are tallied after being made in a correction window and include words contained in an alternative list as well as words input by the user and within a stored word database. A processor calculates the approximate accuracy of the speech recognition system as the ratio of the number of correct words to the total number of words dictated. An accuracy ratio is calculated for each dictation session and an overall ratio is calculated for all sessions combined. The system also keeps individual and overall indexes of the number of times the corrected words were in alternate lists or not within the word database and uses these indexes to calculate additional accuracy values.
Type: Grant
Filed: May 4, 1999
Date of Patent: January 14, 2003
Assignee: International Business Machines Corporation
Inventor: Kerry A. Ortega
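As a rough illustration of the accuracy ratio in the abstract above, the sketch below computes a per-session and an overall value. It assumes that "correct words" means words dictated minus words corrected; the `Session` class and function names are hypothetical, and the additional accuracy values based on alternate-list and word-database indexes are not modeled because the abstract does not define them.

```python
from dataclasses import dataclass


@dataclass
class Session:
    words_dictated: int   # total words dictated in the session
    words_corrected: int  # words later fixed in the correction window


def accuracy(dictated, corrected):
    """Approximate accuracy: correct words / total words dictated."""
    if dictated == 0:
        return None  # nothing dictated yet, so the ratio is undefined
    return (dictated - corrected) / dictated


def overall_accuracy(sessions):
    """Accuracy over all sessions combined (pooled counts, not a mean of ratios)."""
    total = sum(s.words_dictated for s in sessions)
    corrected = sum(s.words_corrected for s in sessions)
    return accuracy(total, corrected)


sessions = [Session(200, 10), Session(300, 40)]
per_session = [accuracy(s.words_dictated, s.words_corrected) for s in sessions]
overall = overall_accuracy(sessions)
```

Pooling the counts weights each session by its length; averaging the per-session ratios would be a different design choice and would generally give a different overall figure.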
-
Patent number: 6505155
Abstract: In a computer speech user interface, a method and computer apparatus for automatically adjusting the content of feedback in a responsive prompt based upon predicted recognition accuracy by a speech recognizer. The method includes the steps of receiving a user voice command from the speech recognizer; calculating present speech recognition accuracy based upon the received user voice command; predicting future recognition accuracy based upon the calculated present speech recognition accuracy; and, generating feedback in a responsive prompt responsive to the predicted recognition accuracy. For predicting future poor recognition accuracy based upon poor present recognition accuracy, the calculating step can include monitoring the received user voice command; detecting a reduced accuracy condition in the monitored user voice command; and, determining poor present recognition accuracy if the reduced accuracy condition is detected in the detecting step.
Type: Grant
Filed: May 6, 1999
Date of Patent: January 7, 2003
Assignee: International Business Machines Corporation
Inventors: Ronald Vanbuskirk, Huifang Wang, Kerry A. Ortega, Catherine G. Wolf
-
Patent number: 6497367
Abstract: A portable unit assists a visually impaired user within a store by providing an output, using speech synthesis, of his location based on reading various barcode labels. The location of each barcode label is determined from data stored within the portable unit. The portable unit also determines a path between the user's location and an item he selects to find, describing the path using speech synthesis. The user can select, by speech or by depressing a button, items for a target list. Preferably, some barcode labels identify an end of an aisle, which cause the portable unit to describe, using speech synthesis, items on the aisle and items in the target list on the aisle.
Type: Grant
Filed: April 26, 2001
Date of Patent: December 24, 2002
Assignee: International Business Machines Corporation
Inventors: Vincent Charles Conzola, Aaron Roger Cox, Kerry A Ortega, Thomas John Sluchak
-
Publication number: 20020177999
Abstract: A method and system for evaluating the accuracy of a computer speech recognition system counts and indexes the total number of words dictated and the number of words corrected. The corrections are tallied after being made in a correction window and include words contained in an alternative list as well as words input by the user and within a stored word database. A processor calculates the approximate accuracy of the speech recognition system as the ratio of the number of correct words to the total number of words dictated. An accuracy ratio is calculated for each dictation session and an overall ratio is calculated for all sessions combined. The system also keeps individual and overall indexes of the number of times the corrected words were in alternate lists or not within the word database and uses these indexes to calculate additional accuracy values.
Type: Application
Filed: May 4, 1999
Publication date: November 28, 2002
Inventor: KERRY A. ORTEGA
-
Patent number: 6477493
Abstract: A method and system for use with a computer recognition system to enroll a user. The method involves a series of steps. The invention provides a user with an enrollment script. The invention then receives a recording made with a transcription device of a dictation session in which the user has dictated at least a portion of the enrollment script. Additionally, the invention can enroll the user in the speech recognition system by decoding the recording and training the speech recognition system.
Type: Grant
Filed: July 15, 1999
Date of Patent: November 5, 2002
Assignee: International Business Machines Corporation
Inventors: Brian S. Brooks, Waltraud Brunner, Carmi Gazit, Arthur Keller, Antonio R. Lee, Thomas Netousek, Kerry A. Ortega
-
Publication number: 20020161584
Abstract: A method and system for use with a computer speech recognition system to efficiently identify valid system commands to users. The method involves a series of steps including: receiving data representative of a speech recognition system user input; comparing the data to a grammar defined for the speech recognition system to determine whether the data is representative of a user input which is a valid system command; and notifying the user as to whether the data is representative of a valid system command. The process can also involve the additional steps of determining a functional expression for the data; and comparing the functional expression to a set of all functional expressions permitted in the grammar to identify any alternate user inputs for producing the functional expression.
Type: Application
Filed: April 13, 1999
Publication date: October 31, 2002
Inventors: JAMES R. LEWIS, KERRY ORTEGA
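One way to picture the validity check and the search for alternate inputs in the abstract above is a lookup table from phrases to functional expressions. This is only a sketch under strong assumptions: a real command grammar is not a flat dictionary, and the `GRAMMAR` contents, the (action, object) tuple form of a functional expression, and the `check_command` function are all hypothetical.

```python
# Hypothetical grammar: spoken phrase -> functional expression (action, object).
GRAMMAR = {
    "open file": ("open", "file"),
    "load file": ("open", "file"),
    "close window": ("close", "window"),
    "shut window": ("close", "window"),
    "print document": ("print", "document"),
}


def check_command(phrase, grammar=GRAMMAR):
    """Return (is_valid, alternates) for a recognized phrase.

    alternates lists other phrases in the grammar that produce the same
    functional expression as the input.
    """
    key = " ".join(phrase.lower().split())  # normalize case and spacing
    expression = grammar.get(key)
    if expression is None:
        return False, []
    alternates = sorted(
        p for p, e in grammar.items() if e == expression and p != key
    )
    return True, alternates
```

For example, `check_command("Open File")` reports a valid command with `"load file"` as an alternate, while an unknown phrase is reported as invalid with no alternates.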
-
Publication number: 20020158133
Abstract: A portable unit assists a visually impaired user within a store by providing an output, using speech synthesis, of his location based on reading various barcode labels. The location of each barcode label is determined from data stored within the portable unit. The portable unit also determines a path between the user's location and an item he selects to find, describing the path using speech synthesis. The user can select, by speech or by depressing a button, items for a target list. Preferably, some barcode labels identify an end of an aisle, which cause the portable unit to describe, using speech synthesis, items on the aisle and items in the target list on the aisle.
Type: Application
Filed: April 26, 2001
Publication date: October 31, 2002
Applicant: International Business Machines Corporation
Inventors: Vincent Charles Conzola, Aaron Roger Cox, Kerry A. Ortega, Thomas John Sluchak
-
Publication number: 20020130847
Abstract: A software tool acquires data on the position of touches relative to controls on a particular window or dialog box using touch screen controls. Subjects or users log on and the time they bring a dialog box into focus is time stamped. The coordinates of each touch are recorded along with a time stamp of the touches. The test sessions are saved in a data file identifying the subject. The software tool is used by a User Interface designer to design controls for a touch screen application. The acquired data is played back graphically to the designer where each touch appears as a dot or like indication overlaid over a representation of the dialog box to which it relates. The acquired data may be displayed in various ways including composite, realtime or in a single dialog box or a single touch. Previous touch data may be kept or discarded giving the UI designer many options of how to analyze the data to determine optimum size and placement of controls.
Type: Application
Filed: March 14, 2001
Publication date: September 19, 2002
Applicant: International Business Machines Corporation
Inventors: Vincent Charles Conzola, Kerry A. Ortega
-
Publication number: 20020128765
Abstract: A system and method of the type for aiding a user in navigating a route through a facility so as to efficiently locate specific items within a facility is provided. The system includes a facility processor having a database and software stored thereon for mapping an interactive route from selected location to selected location within a facility, a label located proximate individual items, the label electronically communicating information specific to the item it is associated with, and a digital device having the interactive route electronically stored thereon, the digital device electronically communicating with the facility processor and the labels for tracking movement of the digital device along the route via communication with the labels and communicating a direction to move to follow the route.
Type: Application
Filed: March 9, 2001
Publication date: September 12, 2002
Applicant: International Business Machines Corporation
Inventors: Robert Thomas Cato, Kerry A. Ortega, Thomas John Sluchak
-
Publication number: 20020116194
Abstract: A method of generating language model statistics for a new word added to a language model incorporating at least one class file containing contextually related words. The method can include the following steps: First, language model statistics can be computed based on references to at least one incorporated class file. Second, a new word can be substituted for each reference to a selected class file. Additionally, the language model statistics can be re-computed based on the new word having been substituted for the reference. Third, the re-computed language model statistics can be displayed in a user interface and modifications can be accepted to the re-computed language model statistics through the user interface. Fourth, the language model statistics can be further re-computed based on the modifications. In consequence, the language model statistics are re-computed for the new word without introducing contextual inaccuracies in the language model.
Type: Application
Filed: February 21, 2001
Publication date: August 22, 2002
Applicant: International Business Machines Corporation
Inventors: James R. Lewis, Kerry A. Ortega, C. Thomas Rutherfoord, Maria E. Smith
-
Publication number: 20020091519
Abstract: A method for enrolling a user in a speech recognition system, without requiring reading, comprises the steps of: generating an audio user interface having an audible output and an audio input; audibly playing a text phrase; audibly prompting the user to speak the played phrase; repeating the steps of audibly prompting the user not to speak, audibly playing the phrase and audibly prompting the user to speak, for a plurality of further phrases; and, processing enrollment of the user based on the audibly prompted and subsequently spoken phrases. A graphical user interface can also be generated for: displaying text corresponding to the phrases and to the audible prompts; displaying a plurality of icons for user activation; and, selectively distinguishing different ones of the icons at different times by at least one of: color; shape; and, animation.
Type: Application
Filed: July 2, 2001
Publication date: July 11, 2002
Applicant: International Business Machines Corporation
Inventors: James R. Lewis, Huifang Wang, Ron Van Buskirk, Kerry A. Ortega
-
Patent number: 6418410
Abstract: In a speech recognition system, a method and system for updating a language model during a correction session can include automatically comparing dictated text to replacement text, determining if the replacement text is on an alternative word list if the comparison is close enough to indicate that the replacement text represents correction of a mis-recognition error rather than an edit, and updating the language model without user interaction if the replacement text is on the alternative word list. If the replacement text is not on the alternative word list, a comparison is made between dictated word digital information and replacement word digital information, and the language model is updated if the digital comparison is close enough to indicate that the replacement text represents correction of a mis-recognition error rather than an edit.
Type: Grant
Filed: September 27, 1999
Date of Patent: July 9, 2002
Assignee: International Business Machines Corporation
Inventors: Amado Nassiff, Kerry A. Ortega
-
Publication number: 20020059071
Abstract: A method and system uses a finite state command grammar coordinated with application scripting to recognize voice command structures for performing an event from an initial location to a new location. The method involves a series of steps, including: recognizing an enabling voice command specifying the event to be performed from the initial location; determining a functional expression for the enabling voice command defined by one or more actions and objects; storing the action and object in a memory location; receiving input specifying the new location; recognizing an activating voice command for performing the event up to the new location; retrieving the stored action and object from the memory location; and performing the event from the initial location to the new location according to the retrieved action and object. Preferably, the enabling-activating command is phrased as “from here . . . to here”.
Type: Application
Filed: June 16, 1999
Publication date: May 16, 2002
Inventors: JAMES R. LEWIS, KERRY A. ORTEGA, MARIA E. SMITH, THOMAS A. KIST, LINDA M. BOYER
-
Patent number: 6370503
Abstract: A transcription system (100) includes a computer (102), a monitor (104), and a microphone (110). Via the microphone, a user of the system provides input speech that is received and transcribed (204) by the system. The system monitors (205) the accuracy of the transcribed speech during transcription. The system also determines (210) whether the accuracy of the transcribed speech is sufficient and, if not, automatically activates (214) a speech recognition improvement tool and alerts (212) the user that the tool has been activated. This tool could also be manually activated (206) by the user. The type of recognition problem is identified (216) by the user or automatically by the system, and the system provides (218) possible solution steps for enabling the user to adjust (219) system parameters or modify user behavior in order to alleviate the recognition problem.
Type: Grant
Filed: June 30, 1999
Date of Patent: April 9, 2002
Assignee: International Business Machines Corp.
Inventors: Kerry A. Ortega, Hans Egger, Arthur Keller, Ronald E. Vanbuskirk, Huifang Wang, James R. Lewis
-
Patent number: 6360201
Abstract: A dictation system (100) performs a method of dictating speech which automatically activates (502) and deactivates (306, 408, 510) auxiliary topic libraries based on the input speech. After receiving (206) input speech, the method searches (208, 214) a general library and topic libraries that are currently active, if any. The method also searches (220, 226) all or portions of inactive topic libraries. If the spoken word is recognized in a particular inactive topic library, the method automatically activates (502) that topic library. In a preferred embodiment, the method maintains an adjustable “score” for each active topic library. An active library's score is increased (402) each time a word is recognized in the library, and decreased (302, 404, 506) when a word is recognized in another library. If the score falls below a certain threshold, the active topic library is automatically deactivated (306, 408, 510).
Type: Grant
Filed: June 8, 1999
Date of Patent: March 19, 2002
Assignee: International Business Machines Corp.
Inventors: James R. Lewis, Kerry A. Ortega, Ronald Vanbuskirk, Huifang Wang
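The score-based activation and deactivation described in the abstract above can be sketched as a small state machine. The following is a toy model, not the patented method: libraries are plain word sets, and the `TopicLibraries` class, the starting score of 2, the threshold of 1, the step size of 1, and the decision to count a hit in the general library as a hit "in another library" are all assumptions made for illustration.

```python
class TopicLibraries:
    """Toy model of score-based activation of topic vocabularies."""

    def __init__(self, general, topics, start_score=2, threshold=1):
        self.general = set(general)
        self.topics = {name: set(words) for name, words in topics.items()}
        self.scores = {}  # active topic name -> current score
        self.start_score = start_score
        self.threshold = threshold

    def recognize(self, word):
        """Return the name of the library that recognized the word, or None."""
        hit = None
        if word in self.general:
            hit = "general"
        else:
            # Search active topic libraries first, then inactive ones.
            inactive = [n for n in self.topics if n not in self.scores]
            for name in list(self.scores) + inactive:
                if word in self.topics[name]:
                    hit = name
                    break
        if hit is None:
            return None
        # Raise the score of the library that matched, lower all other active ones.
        for name in list(self.scores):
            if name == hit:
                self.scores[name] += 1
            else:
                self.scores[name] -= 1
                if self.scores[name] < self.threshold:
                    del self.scores[name]  # deactivate the topic library
        # A hit in an inactive topic library activates it.
        if hit != "general" and hit not in self.scores:
            self.scores[hit] = self.start_score
        return hit
```

With a "medical" and a "legal" topic library, a run of legal terms after a few medical ones gradually lowers the medical score until that library drops out of the active set.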
-
Patent number: 6345249
Abstract: A method for automatically analyzing a document in a speech recognition system having a vocabulary and language model can include the steps of: determining whether the document has undergone previous analysis; undoing the previous analysis; and, analyzing the document. More specifically, the determining step comprises the steps of: comparing trigrams in the document with trigrams in the language model; and, setting a reference point containing document data for undoing a previous analysis in the undoing step if the compared language model contains all the document trigrams. Moreover, the undoing step comprises the step of removing from the language model each trigram contained in the document data in the reference point.
Type: Grant
Filed: July 7, 1999
Date of Patent: February 5, 2002
Assignee: International Business Machines Corp.
Inventors: Kerry A. Ortega, Kris A. Coe, Steven J. Friedland, Burn L. Lewis, Maria E. Smith
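The determine, undo, and analyze steps in the abstract above can be illustrated with a very small sketch. It assumes the language model can be stood in for by a `collections.Counter` of word trigrams and that "removing" a trigram means subtracting the document's counts; the `trigrams` and `analyze` functions are hypothetical names, and a real language model stores far more than raw counts.

```python
from collections import Counter


def trigrams(text):
    """Split text on whitespace and return its word trigrams as tuples."""
    words = text.lower().split()
    return [tuple(words[i:i + 3]) for i in range(len(words) - 2)]


def analyze(document, model):
    """Add a document's trigram counts to the model without double counting.

    model: Counter mapping trigram -> count (stand-in for the language model).
    Returns True if the document looked previously analyzed.
    """
    doc_counts = Counter(trigrams(document))
    # Treat the document as previously analyzed if the model already
    # contains every one of its trigrams.
    previously_analyzed = bool(doc_counts) and all(t in model for t in doc_counts)
    if previously_analyzed:
        reference_point = doc_counts  # data needed to undo the earlier analysis
        model.subtract(reference_point)
        for t in reference_point:
            if model[t] <= 0:
                del model[t]  # drop trigrams that no longer have any count
    model.update(doc_counts)  # (re)analyze the document
    return previously_analyzed
```

Running `analyze` twice on the same text leaves the counts as they were after the first run, which is the point of undoing the earlier analysis before repeating it.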
-
Method and apparatus for improving speech command recognition accuracy using event-based constraints
Patent number: 6345254
Abstract: A method and system for improving the speech command recognition accuracy of a computer speech recognition system uses event-based constraints to recognize a spoken command. The constraints are system states and events, which include system activities, active applications, prior commands and an event queue. The method and system is performed by monitoring events and states of the computer system and receiving a processed command corresponding to the spoken command. The processed command is statistically analyzed in light of the system events and states as well as according to an acoustic model. The system then identifies a recognized command corresponding to the spoken command.
Type: Grant
Filed: May 29, 1999
Date of Patent: February 5, 2002
Assignee: International Business Machines Corp.
Inventors: James R. Lewis, Kerry A. Ortega, Ronald E. Van Buskirk, Huifang Wang, Amado Nassiff, Barbara E. Ballard
-
Publication number: 20020013709
Abstract: A transcription system (100) includes a computer (102), a monitor (104), and a microphone (110). Via the microphone, a user of the system provides input speech that is received and transcribed (204) by the system. The system monitors (205) the accuracy of the transcribed speech during transcription. The system also determines (210) whether the accuracy of the transcribed speech is sufficient and, if not, automatically activates (214) a speech recognition improvement tool and alerts (212) the user that the tool has been activated.
Type: Application
Filed: September 21, 2001
Publication date: January 31, 2002
Applicant: International Business Machines Corporation
Inventors: Kerry A. Ortega, Hans Egger, Arthur Keller, Ronald E. Van Buskirk, Huifang Wang, James R. Lewis