Patents by Inventor Francis G. Kubala

Francis G. Kubala has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

SYSTEMS AND METHODS FOR FACILITATING PLAYBACK OF MEDIA

Publication number: 20140289596

Abstract: A system facilitates the browsing of information of interest. The system obtains a transcription of the information and provides the transcription to a user. The system also retrieves the information in its original format and presents the information to the user in the original format.

Type: Application

Filed: May 6, 2014

Publication date: September 25, 2014

Applicants: Verizon Corporate Services Group Inc., RAYTHEON BBN TECHNOLOGIES CORP.

Inventors: Scott SHEPARD, Sean COLBATH, Francis G. KUBALA
Systems and methods for improving recognition results via user-augmentation of a database

Patent number: 8001066

Abstract: A system improves recognition results. The system receives multimedia data and recognizes the multimedia data based on training data to generate documents. The system receives user augmentation relating to one of the documents or new documents from a user. The system supplements the training data with the user augmentation or new documents and retrains based on the supplemented training data.

Type: Grant

Filed: August 13, 2010

Date of Patent: August 16, 2011

Inventors: Sean Colbath, Scott Shepard, Francis G. Kubala
Systems & methods for improving recognition results via user-augmentation of a database

Publication number: 20110004576

Abstract: A system improves recognition results. The system receives multimedia data and recognizes the multimedia data based on training data to generate documents. The system receives user augmentation relating to one of the documents or new documents from a user. The system supplements the training data with the user augmentation or new documents and retrains based on the supplemented training data.

Type: Application

Filed: August 13, 2010

Publication date: January 6, 2011

Inventors: Sean Colbath, Scott Shepard, Francis G. Kubala
Multimedia recognition system comprising a plurality of indexers configured to receive and analyze multimedia data based on training data and user augmentation relating to one or more of a plurality of generated documents

Patent number: 7801838

Abstract: A system improves recognition results. The system receives multimedia data and recognizes the multimedia data based on training data to generate documents. The system receives user augmentation relating to one of the documents or new documents from a user. The system supplements the training data with the user augmentation or new documents and retrains based on the supplemented training data.

Type: Grant

Filed: July 2, 2003

Date of Patent: September 21, 2010

Assignee: Ramp Holdings, Inc.

Inventors: Sean Colbath, Scott Shepard, Francis G. Kubala
Systems and methods for classifying audio into broad phoneme classes

Patent number: 7424427

Abstract: An audio classification system classifies sounds in an audio stream as belonging to one of a relatively small number of classes. The audio classification system includes a signal analysis component [301] and a decoder [302]. The decoder [302] includes a number of models [310-316] for performing the audio classifications. In one implementation, the possible classifications include: vowels, fricatives, narrowband, wideband, coughing, gender, and silence. The classified audio may be used to enhance speech recognition of the audio stream.

Type: Grant

Filed: October 16, 2003

Date of Patent: September 9, 2008

Assignees: Verizon Corporate Services Group Inc., BBN Technologies Corp.

Inventors: Daben Liu, Francis G. Kubala
Unified clustering tree

Patent number: 7389229

Abstract: A unified clustering tree (500) generates phoneme clusters based on an input sequence of phonemes. The number of possible clusters is significantly less than the number of possible combinations of input phonemes. Nodes (510, 511) in the unified clustering tree are arranged into levels such that the clustering tree generates clusters for multiple speech recognition models. Models that correspond to higher levels in the unified clustering tree are coarse models relative to more fine-grain models at lower levels of the clustering tree.

Type: Grant

Filed: October 16, 2003

Date of Patent: June 17, 2008

Assignee: BBN Technologies Corp.

Inventors: Jayadev Billa, Daniel Kiecza, Francis G. Kubala
Systems and methods for providing acoustic classification

Patent number: 7337115

Abstract: A speech recognition system receives an audio signal and detects various features of the audio signal. For example, the system classifies the audio signal into speech and non-speech portions, genders of speakers corresponding to the speech portions, and channel bandwidths used by the speakers. The system detects speaker turns based on changes in the speakers and assigns labels to the speaker turns. The system verifies the genders of the speakers and the channel bandwidths used by the speakers and identifies one or more languages associated with the audio signal. The system recognizes the speech portions of the audio signal based on the various features of the audio signal.

Type: Grant

Filed: July 2, 2003

Date of Patent: February 26, 2008

Assignees: Verizon Corporate Services Group Inc., BBN Technologies Corp.

Inventors: Dabien Liu, Francis G. Kubala
Systems and methods for providing multimedia information management

Patent number: 7290207

Abstract: A system facilitates the searching and retrieval of multimedia data items. The system receives data items from different types of media sources and identifies regions in the data items. The regions include document regions, section regions, and passage regions. Each of the section regions corresponds to one of the document regions and each of the passage regions corresponds to one of the section regions and one of the document regions. The system stores document identifiers that relate to the document regions in separate document records in a document table, section identifiers that relate to the section regions in separate section records in a section table, and passage identifiers that relate to the passage regions in separate passage records in a passage table.

Type: Grant

Filed: July 2, 2003

Date of Patent: October 30, 2007

Assignee: BBN Technologies Corp.

Inventors: Sean Colbath, Scott Shepard, Francis G. Kubala
Systems and methods for classifying audio into broad phoneme classes

Publication number: 20040230432

Abstract: An audio classification system classifies sounds in an audio stream as belonging to one of a relatively small number of classes. The audio classification system includes a signal analysis component [301] and a decoder [302]. The decoder [302] includes a number of models [310-316] for performing the audio classifications. In one implementation, the possible classifications include: vowels, fricatives, narrowband, wideband, coughing, gender, and silence. The classified audio may be used to enhance speech recognition of the audio stream.

Type: Application

Filed: October 16, 2003

Publication date: November 18, 2004

Inventors: Daben Liu, Francis G. Kubala
Systems and methods for speaker change detection

Publication number: 20040204939

Abstract: A speaker change detection system performs speaker change detection on an input audio stream. The speaker change detection system includes a segmentation component [401], a phone classification decode component [402], and a speaker change detection component [403]. The segmentation component [401] segments the audio stream into segments [501-504] of predetermined length intervals. The segments may overlap one another. The phone classification decode component decodes the intervals to produce a set of phone classes corresponding to each of the intervals. The speaker change detection component detects locations of speaker changes in the audio stream based on a similarity value calculated at phone class boundaries.

Type: Application

Filed: October 16, 2003

Publication date: October 14, 2004

Inventors: Daben Liu, Francis G. Kubala
Name browsing systems and methods

Publication number: 20040199495

Abstract: A system provides document browsing by proper name. The system identifies a subset of documents from a set of documents (310). The documents in the subset of documents include proper names. The system receives a selection of at least one of the proper names from the subset of documents (330) and searches the subset of documents to identify one or more of the documents in the subset of documents that include at least one occurrence of the selected proper name(s) (360). The system then presents the one or more of the documents as a result of the search (370).

Type: Application

Filed: July 2, 2003

Publication date: October 7, 2004

Inventors: Sean Colbath, Sean Boisen, Scott Shepard, Susan S. Nielsen, Andrew Wilson, Francis G. Kubala
Pronunciation symbols based on the orthographic lexicon of a language

Publication number: 20040176946

Abstract: A dictionary creation component [316] converts the normal orthographic written representation of a word into a sequence of symbols that relate to the pronunciation of the word. The symbols may be used to train conventional models for speech recognition. The symbols are not phonemes and do not need to be defined by a speech expert. The symbols are created automatically by the dictionary creation component based on the written representation of the word.

Type: Application

Filed: October 16, 2003

Publication date: September 9, 2004

Inventors: Jayadev Billa, Francis G. Kubala
Systems and methods for labeling clusters of documents

Publication number: 20040163034

Abstract: A system (520) generates labels for clusters of documents. The system (520) identifies topics associated with the documents in the clusters and determines whether the topics are associated with approximately half or more of the documents in the clusters. The system (520) then generates labels for the clusters using the topics that are associated with approximately half or more of the documents in the clusters.

Type: Application

Filed: October 16, 2003

Publication date: August 19, 2004

Inventors: Sean Colbath, Francis G. Kubala
Speech transcription tool for efficient speech transcription

Publication number: 20040138894

Abstract: A transcription tool [115] includes a graphical user interface [209] that displays the waveform of an input audio signal to a user. The user may define speaker turn segments using the displayed waveform. The graphical user interface further displays a transcription section [302] that includes a textual representation of speech that was transcribed by the user and a graphical representation of annotation information [314] relating to the transcribed text. The user may enter the annotation information on-the-fly while transcribing the text using predefined keyboard shortcut commands or other mechanisms. The graphical user interface may further display a structured representation section [303] that may present the transcribed text as a hierarchical tree structure.

Type: Application

Filed: October 16, 2003

Publication date: July 15, 2004

Inventors: Daniel Kiecza, Francis G. Kubala
Systems and methods for providing interactive speaker identification training

Publication number: 20040083104

Abstract: A system (100) provides speaker identification training. The system (100) generates speaker models and receives audio segments. The system (100) identifies speakers corresponding to the audio segments based on the speaker models. At least one of the audio segments has an unidentified or misidentified speaker (i.e., an audio segment whose speaker cannot be accurately identified). The system (100) presents, to a user, audio segments that include an audio segment whose speaker is unidentified or misidentified and receives, from the user, the name of the unidentified or misidentified speaker. The system (100) may use this information to subsequently identify the unidentified or misidentified speaker by name for future audio segments.

Type: Application

Filed: October 16, 2003

Publication date: April 29, 2004

Inventors: Daben Liu, Francis G. Kubala
Manager for integrating language technology components

Publication number: 20040083090

Abstract: A speech system includes a Language Component Manager (LCM) [116] that functions as middleware between language and technology components [117] and a high-level application [115]. In aggregate, the LCM, the language and technology component, and the high-level application form a speech system. The components of the speech system may be distributed. All data and messages in the speech system pass through the LCM. A configuration file defines relationships, such as message and data paths, for the speech system.

Type: Application

Filed: October 16, 2003

Publication date: April 29, 2004

Inventors: Daniel Kiecza, Francis G. Kubala, Peter G. Allen
Systems and methods for providing acoustic classification

Publication number: 20040030550

Abstract: A speech recognition system receives an audio signal and detects various features of the audio signal. For example, the system classifies the audio signal into speech and non-speech portions, genders of speakers corresponding to the speech portions, and channel bandwidths used by the speakers. The system detects speaker turns based on changes in the speakers and assigns labels to the speaker turns. The system verifies the genders of the speakers and the channel bandwidths used by the speakers and identifies one or more languages associated with the audio signal. The system recognizes the speech portions of the audio signal based on the various features of the audio signal.

Type: Application

Filed: July 2, 2003

Publication date: February 12, 2004

Inventors: Dabien Liu, Francis G. Kubala
Systems and methods for facilitating playback of media

Publication number: 20040004599

Abstract: A system facilitates the browsing of information of interest. The system obtains a transcription of the information and provides the transcription to a user. The system also retrieves the information in its original format and presents the information to the user in the original format. The system visually synchronizes the presentation of the information in the original format with the transcription of the information.

Type: Application

Filed: July 2, 2003

Publication date: January 8, 2004

Inventors: Scott Shepard, Sean Colbath, Francis G. Kubala
Systems and methods for improving recognition results via user-augmentation of a database

Publication number: 20040006737

Abstract: A system improves recognition results. The system receives multimedia data and recognizes the multimedia data based on training data to generate documents. The system receives user augmentation relating to one of the documents or new documents from a user. The system supplements the training data with the user augmentation or new documents and retrains based on the supplemented training data.

Type: Application

Filed: July 2, 2003

Publication date: January 8, 2004

Inventors: Sean Colbath, Scott Shepard, Francis G. Kubala
Systems and methods for providing online event tracking

Publication number: 20040006748

Abstract: A system notifies a user of the detection of data that is relevant to an event of interest. The system obtains a user profile that includes one or more example documents that define the event. The system receives data that corresponds to multimedia information and determines the relevance of the data to the event based on the one or more example documents. The system notifies the user when the data is determined to be relevant.

Type: Application

Filed: July 2, 2003

Publication date: January 8, 2004

Inventors: Amit Srivastava, Scott Shepard, Francis G. Kubala

1 2 next