Patents by Inventor Jason Peter Andrew Charlesworth
Jason Peter Andrew Charlesworth has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 7337116
Abstract: A system is provided for allowing a user to add word models to a speech recognition system. In particular, the system allows a user to input a number of renditions of the new word and generates from these a sequence of phonemes representative of the new word. This representative sequence of phonemes is stored in a word to phoneme dictionary together with the typed version of the word for subsequent use by the speech recognition system.
Type: Grant
Filed: November 5, 2001
Date of Patent: February 26, 2008
Assignee: Canon Kabushiki Kaisha
Inventors: Jason Peter Andrew Charlesworth, Jebu Jacob Rajan
-
Patent number: 7310600
Abstract: A dynamic programming technique is provided for matching two sequences of phonemes both of which may be generated from text or speech. The scoring of the dynamic programming matching technique uses phoneme confusion scores, phoneme insertion scores and phoneme deletion scores which are obtained in advance in a training session and, if appropriate, confidence data generated by a recognition system if the sequences are generated from speech.
Type: Grant
Filed: October 25, 2000
Date of Patent: December 18, 2007
Assignee: Canon Kabushiki Kaisha
Inventors: Philip Neil Garner, Jason Peter Andrew Charlesworth, Asako Higuchi
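
The matching idea in this abstract can be illustrated with a small sketch. It is not the patented method: the function name `align_score`, the log-probability-style score tables and the default penalties are all assumptions made for illustration, and the confidence data mentioned in the abstract is left out. It shows only how confusion, insertion and deletion scores could weight a standard dynamic programming alignment of two phoneme sequences.

```python
from typing import Dict, Sequence, Tuple


def align_score(
    a: Sequence[str],
    b: Sequence[str],
    confusion: Dict[Tuple[str, str], float],
    insertion: Dict[str, float],
    deletion: Dict[str, float],
    default_confusion: float = -5.0,  # hypothetical fallback penalty
    default_gap: float = -3.0,        # hypothetical fallback penalty
) -> float:
    """Return the best total score for aligning phoneme sequence a with b.

    Scores are treated as log-probability-like values (higher is better).
    An exact phoneme match scores 0.0 in this sketch.
    """
    n, m = len(a), len(b)
    neg_inf = float("-inf")
    best = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    best[0][0] = 0.0
    for i in range(n + 1):
        for j in range(m + 1):
            if i > 0 and j > 0:
                # Substitution or match: consume one phoneme from each sequence.
                pair = (a[i - 1], b[j - 1])
                step = 0.0 if pair[0] == pair[1] else confusion.get(pair, default_confusion)
                best[i][j] = max(best[i][j], best[i - 1][j - 1] + step)
            if i > 0:
                # Deletion: a phoneme of a has no counterpart in b.
                step = deletion.get(a[i - 1], default_gap)
                best[i][j] = max(best[i][j], best[i - 1][j] + step)
            if j > 0:
                # Insertion: a phoneme of b has no counterpart in a.
                step = insertion.get(b[j - 1], default_gap)
                best[i][j] = max(best[i][j], best[i][j - 1] + step)
    return best[n][m]
```

In a real system the three tables would come from the training session the abstract describes; here they are plain dictionaries so the sketch stays self-contained.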
-
Patent number: 7295980
Abstract: A system is provided for matching two or more sequences of phonemes both or all of which may be generated from text or speech. A dynamic programming matching technique is preferably used having constraints which depend upon whether or not the two sequences are generated from text or speech and in which the scoring of the dynamic programming paths is weighted by phoneme confusion scores, phoneme insertion scores and phoneme deletion scores where appropriate.
Type: Grant
Filed: August 31, 2006
Date of Patent: November 13, 2007
Assignee: Canon Kabushiki Kaisha
Inventors: Philip Neil Garner, Jason Peter Andrew Charlesworth, Asako Higuchi
-
Patent number: 7257533
Abstract: A data structure is provided for annotating data files within a database. The annotation data comprises a phoneme and word lattice which allows the quick and efficient searching of data files within the database in response to a user's input query. The structure of the annotation data is such that it allows the input query to be made by voice and can be used for annotating various kinds of data files, such as audio data files, video data files, multimedia data files etc. The annotation data may be generated from the data files themselves or may be input by the user either from a voiced input or from a typed input.
Type: Grant
Filed: September 22, 2005
Date of Patent: August 14, 2007
Assignee: Canon Kabushiki Kaisha
Inventors: Jason Peter Andrew Charlesworth, Philip Neil Garner, Jebu Jacob Rajan
-
Patent number: 7240003
Abstract: A data structure is provided for annotating data files within a database. The annotation data comprises a phoneme and word lattice which allows the quick and efficient searching of data files within the database, in response to a user's input query for desired information. The phoneme and word lattice comprises a plurality of time-ordered nodes, and a plurality of links extending between the nodes. Each link has a phoneme or word associated with it. The nodes are arranged in a sequence of time-ordered blocks such that further data can be conveniently added to the lattice.
Type: Grant
Filed: September 28, 2001
Date of Patent: July 3, 2007
Assignee: Canon Kabushiki Kaisha
Inventors: Jason Peter Andrew Charlesworth, Philip Neil Garner
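
A rough picture of the lattice described in this abstract, as a minimal sketch. The class names `Link`, `Node` and `Lattice`, the fixed number of nodes per block and the use of global node indices are all assumptions for illustration; the abstract does not specify them.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Link:
    label: str   # the phoneme or word carried by this link
    kind: str    # "phoneme" or "word"
    target: int  # global index of the node where the link ends


@dataclass
class Node:
    time: float  # point in time represented by the node, in seconds
    links: List[Link] = field(default_factory=list)


@dataclass
class Lattice:
    block_size: int = 3  # hypothetical number of nodes per block
    blocks: List[List[Node]] = field(default_factory=list)

    def node_count(self) -> int:
        return sum(len(block) for block in self.blocks)

    def node(self, index: int) -> Node:
        # Every block except the last is full, so simple arithmetic finds the node.
        return self.blocks[index // self.block_size][index % self.block_size]

    def add_node(self, time: float) -> int:
        """Append a node, opening a new block when the last one is full."""
        if self.blocks and time < self.blocks[-1][-1].time:
            raise ValueError("nodes must be added in time order")
        if not self.blocks or len(self.blocks[-1]) >= self.block_size:
            self.blocks.append([])
        self.blocks[-1].append(Node(time))
        return self.node_count() - 1

    def add_link(self, source: int, target: int, label: str, kind: str) -> None:
        if target <= source:
            raise ValueError("links must point forward in time")
        self.node(source).links.append(Link(label, kind, target))
```

Because new nodes only ever touch the final block, extending the annotation leaves earlier blocks unchanged, which is one way to read the abstract's remark that further data can be conveniently added.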
-
Patent number: 7212968
Abstract: A dynamic programming technique is provided for matching two sequences of phonemes both of which may be generated from text or speech. The scoring of the dynamic programming matching technique uses phoneme confusion scores, phoneme insertion scores and phoneme deletion scores which are obtained in advance in a training session and, if appropriate, confidence data generated by a recognition system if the sequences are generated from speech.
Type: Grant
Filed: October 25, 2000
Date of Patent: May 1, 2007
Assignee: Canon Kabushiki Kaisha
Inventors: Philip Neil Garner, Jason Peter Andrew Charlesworth, Asako Higuchi
-
Patent number: 7054812
Abstract: A system is provided for determining a sequence of sub-word units representative of at least two words output by a word recognition unit in response to an input word to be recognized. In a preferred embodiment, the word alternatives output by the recognition unit are converted into sequences of phonemes. An optimum alignment between these sequences is then determined using a dynamic programming alignment technique. The sequence of phonemes representative of the input sequences is then determined using this optimum alignment.
Type: Grant
Filed: April 25, 2001
Date of Patent: May 30, 2006
Assignee: Canon Kabushiki Kaisha
Inventors: Jason Peter Andrew Charlesworth, Philip Neil Garner
-
Patent number: 6990448
Abstract: The data structure is used in accessing a plurality of data files. The data structure comprises a plurality of annotation storage areas adapted to correspond with the data files, each annotation storage area containing an annotation representing a time sequential signal and each annotation storage area comprising a plurality of block storage areas each containing phoneme and word data forming a respective temporal block of the annotation and each block having an associated time index identifying a timing of the block within the corresponding annotation. Each block storage area includes a plurality of node storage areas, each associated with a node which represents a point in time at which a word and/or phoneme begins or ends within the corresponding annotation, and each node storage area having a time offset storage area containing a time offset defining the point in time represented by the node relative to the time index associated with the corresponding block.
Type: Grant
Filed: August 23, 2001
Date of Patent: January 24, 2006
Assignee: Canon Kabushiki Kaisha
Inventors: Jason Peter Andrew Charlesworth, Philip Neil Garner, Jebu Jacob Rajan
-
Patent number: 6882970
Abstract: A system is provided for comparing an input query with a number of stored annotations to identify information to be retrieved from a database. The comparison technique divides the input query into a number of fixed-size fragments and identifies how many times each of the fragments occurs within each annotation using a dynamic programming matching technique. The frequencies of occurrence of the fragments in both the query and the annotation are then compared to provide a measure of the similarity between the query and the annotation. The information to be retrieved is then determined from the similarity measures obtained for all the annotations.
Type: Grant
Filed: October 25, 2000
Date of Patent: April 19, 2005
Assignee: Canon Kabushiki Kaisha
Inventors: Philip Neil Garner, Jason Peter Andrew Charlesworth, Asako Higuchi
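
The fragment-frequency comparison in this abstract can be sketched in a simplified form. Two substitutions are made here and should be read as assumptions: fragments are counted by exact matching rather than by the dynamic programming matching the abstract names, and the frequencies are compared with a cosine measure, which the abstract does not specify. Overlapping fragments and the names `fragments` and `similarity` are also illustrative choices.

```python
import math
from collections import Counter
from typing import List, Sequence, Tuple


def fragments(seq: Sequence[str], n: int) -> List[Tuple[str, ...]]:
    """Split a phoneme sequence into overlapping fragments of n phonemes."""
    return [tuple(seq[i:i + n]) for i in range(len(seq) - n + 1)]


def similarity(query: Sequence[str], annotation: Sequence[str], n: int = 2) -> float:
    """Cosine similarity between fragment counts in the query and the annotation.

    Only fragments that occur in the query are counted in the annotation.
    """
    q_counts = Counter(fragments(query, n))
    all_a_counts = Counter(fragments(annotation, n))
    a_counts = {frag: all_a_counts[frag] for frag in q_counts}
    dot = sum(q_counts[f] * a_counts[f] for f in q_counts)
    norm = math.sqrt(sum(c * c for c in q_counts.values())) * math.sqrt(
        sum(c * c for c in a_counts.values())
    )
    return dot / norm if norm else 0.0
```

Retrieval would then score every stored annotation against the query and return the data files whose annotations score highest.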
-
Patent number: 6873993
Abstract: An indexing apparatus and method are described for use in identifying portions of data in a database for comparison with a query. In an embodiment, the index includes a key which comprises a sequence of phoneme classifications derived from the input query by classifying each of the phonemes in the input query with a number of phoneme classes, with the phonemes in each class being defined as those that are confusable with the other phonemes in the same class.
Type: Grant
Filed: May 24, 2001
Date of Patent: March 29, 2005
Assignee: Canon Kabushiki Kaisha
Inventors: Jason Peter Andrew Charlesworth, Philip Neil Garner
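
The class-based key described in this abstract might look something like the following sketch. The class table `PHONEME_CLASS` is invented for illustration (the patent's actual confusability classes are not given in the abstract), as are the function names and the choice of three-phoneme keys.

```python
from collections import defaultdict
from typing import Dict, List, Sequence, Tuple

# Hypothetical grouping of mutually confusable phonemes; not taken from the patent.
PHONEME_CLASS = {
    "p": "STOP", "b": "STOP", "t": "STOP", "d": "STOP", "k": "STOP", "g": "STOP",
    "m": "NASAL", "n": "NASAL",
    "ae": "VOWEL", "eh": "VOWEL", "ih": "VOWEL",
}


def class_key(phonemes: Sequence[str]) -> Tuple[str, ...]:
    """Replace each phoneme with its class to form an index key."""
    return tuple(PHONEME_CLASS[p] for p in phonemes)


def build_index(annotation: Sequence[str], n: int = 3) -> Dict[Tuple[str, ...], List[int]]:
    """Map each class key of length n to the positions where it starts."""
    index: Dict[Tuple[str, ...], List[int]] = defaultdict(list)
    for start in range(len(annotation) - n + 1):
        index[class_key(annotation[start:start + n])].append(start)
    return index


def candidates(index: Dict[Tuple[str, ...], List[int]], query: Sequence[str]) -> List[int]:
    """Return the positions in the annotation worth comparing against the query."""
    return index.get(class_key(query), [])
```

Because confusable phonemes share a key, a query misrecognized as "g eh d" still lands on the stored "k ae t" in this toy table, and only those positions need a detailed comparison.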
-
Patent number: 6801891
Abstract: A system is provided for decoding one or more sequences of sub-word units output by a speech recognition system into one or more representative words. The system uses a dynamic programming technique to align the sequence of sub-word units output by the recognition system with a number of dictionary sub-word unit sequences representative of dictionary words to identify the most likely word or words corresponding to the spoken input.
Type: Grant
Filed: November 13, 2001
Date of Patent: October 5, 2004
Assignee: Canon Kabushiki Kaisha
Inventors: Philip Neil Garner, Jason Peter Andrew Charlesworth
-
Publication number: 20030177108
Abstract: A data structure is provided for annotating data files within a database. The annotation data comprises a phoneme and word lattice which allows the quick and efficient searching of data files within the database, in response to a user's input query for desired information. The phoneme and word lattice comprises a plurality of time-ordered nodes, and a plurality of links extending between the nodes. Each link has a phoneme or word associated with it. The nodes are arranged in a sequence of time-ordered blocks such that further data can be conveniently added to the lattice.
Type: Application
Filed: March 7, 2003
Publication date: September 18, 2003
Inventors: Jason Peter Andrew Charlesworth, Philip Neil Garner
-
Publication number: 20020198704
Abstract: A speech detection system is described which uses a time series noise model to represent audio signals corresponding to noise. The system compares incoming audio signals with the noise model and determines the beginning or end of speech in the audio signal depending on how well the input audio compares to the noise model.
Type: Application
Filed: May 31, 2002
Publication date: December 26, 2002
Applicant: CANON KABUSHIKI KAISHA
Inventors: Jebu Jacob Rajan, Jason Peter Andrew Charlesworth
-
Publication number: 20020120447
Abstract: A system is provided for allowing a user to add word models to a speech recognition system. In particular, the system allows a user to input a number of renditions of the new word and generates from these a sequence of phonemes representative of the new word. This representative sequence of phonemes is stored in a word to phoneme dictionary together with the typed version of the word for subsequent use by the speech recognition system.
Type: Application
Filed: November 5, 2001
Publication date: August 29, 2002
Inventors: Jason Peter Andrew Charlesworth, Jebu Jacob Rajan
-
Publication number: 20020120448
Abstract: A system is provided for decoding one or more sequences of sub-word units output by a speech recognition system into one or more representative words. The system uses a dynamic programming technique to align the sequence of sub-word units output by the recognition system with a number of dictionary sub-word unit sequences representative of dictionary words to identify the most likely word or words corresponding to the spoken input.
Type: Application
Filed: November 13, 2001
Publication date: August 29, 2002
Inventors: Philip Neil Garner, Jason Peter Andrew Charlesworth
-
Publication number: 20020052740
Abstract: A data structure is provided for annotating data files within a database. The annotation data comprises a phoneme and word lattice which allows the quick and efficient searching of data files within the database in response to a user's input query. The structure of the annotation data is such that it allows the input query to be made by voice and can be used for annotating various kinds of data files, such as audio data files, video data files, multimedia data files etc. The annotation data may be generated from the data files themselves or may be input by the user either from a voiced input or from a typed input.
Type: Application
Filed: August 23, 2001
Publication date: May 2, 2002
Inventors: Jason Peter Andrew Charlesworth, Philip Neil Garner, Jebu Jacob Rajan
-
Publication number: 20020052870
Abstract: An indexing apparatus and method are described for use in identifying portions of data in a database for comparison with a query. In an embodiment, the index includes a key which comprises a sequence of phoneme classifications derived from the input query by classifying each of the phonemes in the input query with a number of phoneme classes, with the phonemes in each class being defined as those that are confusable with the other phonemes in the same class.
Type: Application
Filed: May 24, 2001
Publication date: May 2, 2002
Inventors: Jason Peter Andrew Charlesworth, Philip Neil Garner
-
Publication number: 20020022960
Abstract: A system is provided for determining a sequence of sub-word units representative of at least two words output by a word recognition unit in response to an input word to be recognized. In a preferred embodiment, the word alternatives output by the recognition unit are converted into sequences of phonemes. An optimum alignment between these sequences is then determined using a dynamic programming alignment technique. The sequence of phonemes representative of the input sequences is then determined using this optimum alignment.
Type: Application
Filed: April 25, 2001
Publication date: February 21, 2002
Inventors: Jason Peter Andrew Charlesworth, Philip Neil Garner