Patents by Inventor Kerry Ortega

Kerry Ortega has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 6535848
    Abstract: A transcription system (100, 200) includes multiple recording devices (110, 210) that individually record and store (516), into multiple files, digital data representing speech uttered by multiple speakers. In a preferred embodiment, time stamps are stored (514) along with the speech. A transcription computer (120, 230) enables a user to select (602) which of multiple files the user would like to have transcribed, and to associate (604) a speaker ID to each of the multiple files. The transcription computer then transcribes (1006) phrases within the multiple files, and stores (1008) those phrases in a sequential order, based on the time stamps. The user may also cause an offset time for each file to be adjusted (606, 916), thus affecting the ultimate sequential order of the transcribed phrases. After transcription, the user may edit (1104) the time stamps, speaker IDs, and/or phrases.
    Type: Grant
    Filed: June 8, 1999
    Date of Patent: March 18, 2003
    Assignee: International Business Machines Corporation
    Inventors: Kerry A. Ortega, James R. Lewis, Ronald Vanbuskirk, Huifang Wang
  • Patent number: 6519566
    Abstract: The method of the invention involves a plurality of steps including, defining a set of user voice commands for hands-free control of a pointer and, in response to receiving a first audio input recognized as one of the set of user voice commands, initiating motion of the pointer in a direction indicated by the user voice command. Subsequently, in response to receiving a second audio input, the pointer motion can be discontinued. Finally, in response to receiving one or more subsequent audio inputs not recognized as being among the set of user voice commands, the pointer can be incrementally moved responsive to the subsequent audio inputs.
    Type: Grant
    Filed: March 1, 2000
    Date of Patent: February 11, 2003
    Assignee: International Business Machines Corporation
    Inventors: Linda M. Boyer, James R. Lewis, Kerry A. Ortega, Ji Wee Tan
  • Patent number: 6507816
    Abstract: A method and system for evaluating the accuracy of a computer speech recognition system counts and indexes the total number of words dictated and the number of words corrected. The corrections are tallied after being made in a correction window and include words contained in an alternative list as well as words input by the user and within a stored word database. A processor calculates the approximate accuracy of the speech recognition system as the ratio of the number of correct words to the total number of words dictated. An accuracy ratio is calculated for each dictation session and an overall ratio is calculated for all sessions combined. The system also keeps individual and overall indexes of the number of times the corrected words were in alternate lists or not within the word database and uses these indexes to calculate additional accuracy values.
    Type: Grant
    Filed: May 4, 1999
    Date of Patent: January 14, 2003
    Assignee: International Business Machines Corporation
    Inventor: Kerry A. Ortega
  • Patent number: 6505155
    Abstract: In a computer speech user interface, a method and computer apparatus for automatically adjusting the content of feedback in a responsive prompt based upon predicted recognition accuracy by a speech recognizer. The method includes the steps of receiving a user voice command from the speech recognizer; calculating present speech recognition accuracy based upon the received user voice command; predicting future recognition accuracy based upon the calculated present speech recognition accuracy; and, generating feedback in a responsive prompt responsive to the predicted recognition accuracy. For predicting future poor recognition accuracy based upon poor present recognition accuracy, the calculating step can include monitoring the received user voice command; detecting a reduced accuracy condition in the monitored user voice command; and, determining poor present recognition accuracy if the reduced accuracy condition is detected in the detecting step.
    Type: Grant
    Filed: May 6, 1999
    Date of Patent: January 7, 2003
    Assignee: International Business Machines Corporation
    Inventors: Ronald Vanbuskirk, Huifang Wang, Kerry A. Ortega, Catherine G. Wolf
  • Patent number: 6497367
    Abstract: A portable unit assists a visually impaired user within a store by providing an output, using speech synthesis, of his location based on reading various barcode labels. The location of each barcode label is determined from data stored within the portable unit. The portable unit also determines a path between the user's location and an item he selects to find, describing the path using speech synthesis. The user can select, by speech or by depressing a button, items for a target list. Preferably, some barcode labels identify an end of an aisle, which cause the portable unit to describe, using speech synthesis, items on the aisle and items in the target list on the aisle.
    Type: Grant
    Filed: April 26, 2001
    Date of Patent: December 24, 2002
    Assignee: International Business Machines Corporation
    Inventors: Vincent Charles Conzola, Aaron Roger Cox, Kerry A Ortega, Thomas John Sluchak
  • Publication number: 20020177999
    Abstract: A method and system for evaluating the accuracy of a computer speech recognition system counts and indexes the total number of words dictated and the number of words corrected. The corrections are tallied after being made in a correction window and include words contained in an alternative list as well as words input by the user and within a stored word database. A processor calculates the approximate accuracy of the speech recognition system as the ratio of the number of correct words to the total number of words dictated. An accuracy ratio is calculated for each dictation session and an overall ratio is calculated for all sessions combined. The system also keeps individual and overall indexes of the number of times the corrected words were in alternate lists or not within the word database and uses these indexes to calculate additional accuracy values.
    Type: Application
    Filed: May 4, 1999
    Publication date: November 28, 2002
    Inventor: KERRY A. ORTEGA
  • Patent number: 6477493
    Abstract: A method and system for use with a computer recognition system to enroll a user. The method involves a series of steps. The invention provides a user with an enrollment script. The invention then receives a recording made with a transcription device of a dictation session in which the user has dictated at least a portion of the enrollment script. Additionally, the invention can enroll the user in the speech recognition system by decoding the recording and training the speech recognition system.
    Type: Grant
    Filed: July 15, 1999
    Date of Patent: November 5, 2002
    Assignee: International Business Machines Corporation
    Inventors: Brian S. Brooks, Waltraud Brunner, Carmi Gazit, Arthur Keller, Antonio R. Lee, Thomas Netousek, Kerry A. Ortega
  • Publication number: 20020161584
    Abstract: A method and system for use with a computer speech recognition system to efficiently identify valid system commands to users. The method involves a series of steps including: receiving data representative of a speech recognition system user input; comparing the data to a grammar defined for the speech recognition system to determine whether the data is representative of a user input which is a valid system command; and notifying the user as to whether the data is representative of a valid system command. The process can also involve the additional steps of determining a functional expression for the data; and comparing the functional expression to a set of all functional expressions permitted in the grammar to identify any alternate user inputs for producing the functional expression.
    Type: Application
    Filed: April 13, 1999
    Publication date: October 31, 2002
    Inventors: JAMES R. LEWIS, KERRY ORTEGA
  • Publication number: 20020158133
    Abstract: A portable unit assists a visually impaired user within a store by providing an output, using speech synthesis, of his location based on reading various barcode labels. The location of each barcode label is determined from data stored within the portable unit. The portable unit also determines a path between the user's location and an item he selects to find, describing the path using speech synthesis. The user can select, by speech or by depressing a button, items for a target list. Preferably, some barcode labels identify an end of an aisle, which cause the portable unit to describe, using speech synthesis, items on the aisle and items in the target list on the aisle.
    Type: Application
    Filed: April 26, 2001
    Publication date: October 31, 2002
    Applicant: International Business Machines Corporation
    Inventors: Vincent Charles Conzola, Aaron Roger Cox, Kerry A. Ortega, Thomas John Sluchak
  • Publication number: 20020130847
    Abstract: A software tool acquires data on the position of touches relative to controls on a particular window or dialog box using touch screen controls. Subjects or users log on and the time the bring a dialog box into focus is time stamped. The coordinates of each touch is recorded along with a time stamp of the touches. The test sessions are saved in a data file identifying the subject. The software tool is used by a User Interface designer to design controls for a touch screen application. The acquired data is played back graphically to the designer where each touch appears as a dot or like indication over laid over a representation of the dialog box to which it relates. The acquired data may be displayed in various ways including composite, realtime or in a single dialog box or a single touch. Previous touch data may be kept or discarded giving the UI designer many options of how to analyze the dat to determine optimum size and placement of controls.
    Type: Application
    Filed: March 14, 2001
    Publication date: September 19, 2002
    Applicant: International Business Machines Corporation
    Inventors: Vincent Charles Conzola, Kerry A. Ortega
  • Publication number: 20020128765
    Abstract: A system and method of the type for aiding a user in navigating a route through a facility so as too efficiently locate specific items within a facility is provided. The system includes a facility processor having a database and software stored thereon for mapping an interactive route from selected location to selected location within a facility, a label located proximate individual items, the label electronically communicating information specific to the item it is associated with, and a digital device having the interactive route electronically stored thereon, the digital device electronically communicating with the facility processor and the labels for tracking movement of the digital device along the route via communication with the labels and communicating a direction to move to follow the route.
    Type: Application
    Filed: March 9, 2001
    Publication date: September 12, 2002
    Applicant: International Business Machines Corporation
    Inventors: Robert Thomas Cato, Kerry A. Ortega, Thomas John Sluchak
  • Publication number: 20020116194
    Abstract: A method of generating language model statistics for a new word added to a language model incorporating at least one class file containing contextually related words. The method can include the following steps: First, language model statistics can be computed based on references to at least one incorporated class file. Second, a new word can be substituted for each reference to a selected class file. Additionally, the language model statistics can be re-computed based on the new word having been substituted for the reference. Third, the re-computed language model statistics can be displayed in a user interface and modifications can be accepted to the re-computed language model statistics through the user interface. Fourth, the language model statistics can be further re-computed based on the modifications. In consequence, the language model statistics are re-computed for the new word without introducing contextual inaccuracies in the language model.
    Type: Application
    Filed: February 21, 2001
    Publication date: August 22, 2002
    Applicant: International Business Machines Corporation
    Inventors: James R. Lewis, Kerry A. Ortega, C. Thomas Rutherfoord, Maria E. Smith
  • Publication number: 20020091519
    Abstract: A method for enrolling a user in a speech recognition system, without requiring reading, comprises the steps of: generating an audio user interface having an audible output and an audio input; audibly playing a text phrase; audibly prompting the user to speak the played phrase; repeating the steps of audibly prompting the user not to speak, audibly playing the phrase and audibly prompting the user to speak, for a plurality of further phrases; and, processing enrollment of the user based on the audibly prompted and subsequently spoken phrases. A graphical user interface can also be generated for: displaying text corresponding to the phrases and to the audible prompts; displaying a plurality of icons for user activation; and, selectively distinguishing different ones of the icons at different times by at least one of: color; shape; and, animation.
    Type: Application
    Filed: July 2, 2001
    Publication date: July 11, 2002
    Applicant: International Business Machines Corporation
    Inventors: James R. Lewis, Huifang Wang, Ron Van Buskirk, Kerry A. Ortega
  • Patent number: 6418410
    Abstract: In a speech recognition system, a method and system for updating a language model during a correction session can include automatically comparing dictated text to replacement text, determining if the replacement text is on an alternative word list if the comparison is close enough to indicate that the replacement text represents correction of a mis-recognition error rather than an edit, and updating the language model without user interaction if the replacement text is on the alternative word list. If the replacement text is not on the alternative word list, a comparison is made between dictated word digital information and replacement word digital information, and the language model is updated if the digital comparison is close enough to indicate that the replacement text represents correction of a mis-recognition error rather than an edit.
    Type: Grant
    Filed: September 27, 1999
    Date of Patent: July 9, 2002
    Assignee: International Business Machines Corporation
    Inventors: Amado Nassiff, Kerry A. Ortega
  • Publication number: 20020059071
    Abstract: A method and system uses a finite state command grammar coordinated with application scripting to recognize voice command structures for performing an event from an initial location to a new location. The method involves a series of steps, including: recognizing an enabling voice command specifying the event to be performed from the initial location; determining a functional expression for the enabling voice command defined by one or more actions and objects; storing the action and object in a memory location; receiving input specifying the new location; recognizing an activating voice command for performing the event up to the new location; retrieving the stored action and object from the memory location; and performing the event from the initial location to the new location according to the retrieved action and object. Preferably, the enabling-activating command is phrased as “from here . . . to here”.
    Type: Application
    Filed: June 16, 1999
    Publication date: May 16, 2002
    Inventors: JAMES R. LEWIS, KERRY A. ORTEGA, MARIA E. SMITH, THOMAS A. KIST, LINDA M. BOYER
  • Patent number: 6370503
    Abstract: A transcription system (100) includes a computer (102), a monitor (104), and a microphone (110). Via the microphone, a user of the system provides input speech that is received and transcribed (204) by the system. The system monitors (205) the accuracy of the transcribed speech during transcription. The system also determines (210) whether the accuracy of the transcribed speech is sufficient and, if not, automatically activates (214) a speech recognition improvement tool and alerts (212) the user that the tool has been activated. This tool could also be manually activated (206) by the user. The type of recognition problem is identified (216) by the user or automatically by the system, and the system provides (218) possible solution steps for enabling the user to adjust (219) system parameters or modify user behavior in order to alleviate the recognition problem.
    Type: Grant
    Filed: June 30, 1999
    Date of Patent: April 9, 2002
    Assignee: International Business Machines Corp.
    Inventors: Kerry A. Ortega, Hans Egger, Arthur Keller, Ronald E. Vanbuskirk, Huifang Wang, James R. Lewis
  • Patent number: 6360201
    Abstract: A dictation system (100) performs a method of dictating speech which automatically activates (502) and deactivates (306, 408, 510) auxiliary topic libraries based on the input speech. After receiving (206) input speech, the method searches (208, 214) a general library and topic libraries that are currently active, if any. The method also searches (220, 226) all or portions of inactive topic libraries. If the spoken word is recognized in a particular inactive topic library, the method automatically activates (502) that topic library. In a preferred embodiment, the method maintains an adjustable “score” for each active topic library. An active library's score is increased (402) each time a word is recognized in the library, and decreased (302, 404, 506) when a word is recognized in another library. If the score falls below a certain threshold, the active topic library is automatically deactivated (306, 408, 510).
    Type: Grant
    Filed: June 8, 1999
    Date of Patent: March 19, 2002
    Assignee: International Business Machines Corp.
    Inventors: James R. Lewis, Kerry A. Ortega, Ronald Vanbuskirk, Huifang Wang
  • Patent number: 6345249
    Abstract: A method for automatically analyzing a document in a speech recognition system having a vocabulary and language model can include the steps of: determining whether the document has undergone previous analysis; undoing the previous analysis; and, analyzing the document. More specifically, the determining step comprises the steps of: comparing trigrams in the document with trigrams in the language model; and, setting a reference point containing document data for undoing a previous analysis in the undoing step if the compared language model contains all the document trigrams. Moreover, the undoing step comprises the step of removing from the language model each trigram contained in the document data in the reference point.
    Type: Grant
    Filed: July 7, 1999
    Date of Patent: February 5, 2002
    Assignee: International Business Machines Corp.
    Inventors: Kerry A. Ortega, Kris A. Coe, Steven J. Friedland, Burn L. Lewis, Maria E. Smith
  • Patent number: 6345254
    Abstract: A method and system for improving the speech command recognition accuracy of a computer speech recognition system uses event-based constraints to recognize a spoken command. The constraints are system states and events, which include system activities, active applications, prior commands and an event queue. The method and system is performed by monitoring events and states of the computer system and receiving a processed command corresponding to the spoken command. The processed command is statistically analyzed in light of the system events and states as well as according to an acoustic model. The system then identifies a recognized command corresponding to the spoken command.
    Type: Grant
    Filed: May 29, 1999
    Date of Patent: February 5, 2002
    Assignee: International Business Machines Corp.
    Inventors: James R. Lewis, Kerry A. Ortega, Ronald E. Van Buskirk, Huifang Wang, Amado Nassiff, Barbara E. Ballard
  • Publication number: 20020013709
    Abstract: A transcription system (100) includes a computer (102), a monitor (104), and a microphone (110). Via the microphone, a user of the system provides input speech that is received and transcribed (204) by the system. The system monitors (205) the accuracy of the transcribed speech during transcription. The system also determines (210) whether the accuracy of the transcribed speech is sufficient and, if not, automatically activates (214) a speech recognition improvement tool and alerts (212) the user that the tool has been activated.
    Type: Application
    Filed: September 21, 2001
    Publication date: January 31, 2002
    Applicant: International Business Machines Corporation
    Inventors: Kerry A. Ortega, Hans Egger, Arthur Keller, Ronald E. Van Buskirk, Huifang Wang, James R. Lewis