Patents by Inventor Philip Neil Garner
Philip Neil Garner has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 10460043Abstract: An apparatus and a method for constructing a multilingual acoustic model, and a computer readable recording medium are provided. The method for constructing a multilingual acoustic model includes dividing an input feature into a common language portion and a distinctive language portion, acquiring a tandem feature by training the divided common language portion and distinctive language portion using a neural network to estimate and remove correlation between phonemes, dividing parameters of an initial acoustic model constructed using the tandem feature into common language parameters and distinctive language parameters, adapting the common language parameters using data of a training language, adapting the distinctive language parameters using data of a target language, and constructing an acoustic model for the target language using the adapted common language parameters and the adapted distinctive language parameters.Type: GrantFiled: November 22, 2013Date of Patent: October 29, 2019Assignees: SAMSUNG ELECTRONICS CO., LTD., IDIAP RESEARCH INSTITUTEInventors: Nam-Hoon Kim, Petr Motlicek, Philip Neil Garner, David Imseng, Jae-won Lee, Jeong-Mi Cho
-
Publication number: 20140149104Abstract: An apparatus and a method for constructing a multilingual acoustic model, and a computer readable recording medium are provided. The method for constructing a multilingual acoustic model includes dividing an input feature into a common language portion and a distinctive language portion, acquiring a tandem feature by training the divided common language portion and distinctive language portion using a neural network to estimate and remove correlation between phonemes, dividing parameters of an initial acoustic model constructed using the tandem feature into common language parameters and distinctive language parameters, adapting the common language parameters using data of a training language, adapting the distinctive language parameters using data of a target language, and constructing an acoustic model for the target language using the adapted common language parameters and the adapted distinctive language parameters.Type: ApplicationFiled: November 22, 2013Publication date: May 29, 2014Applicants: IDIAP RESEARCH INSTITUTE, SAMSUNG ELECTRONICS CO., LTD.Inventors: Nam-Hoon KIM, Petr MOTLICEK, Philip Neil GARNER, David IMSENG, Jae-won LEE, Jeong-Mi CHO
-
Patent number: 7310600Abstract: A dynamic programming technique is provided for matching two sequences of phonemes both of which may be generated from text or speech. The scoring of the dynamic programming matching technique uses phoneme confusion scores, phoneme insertion scores and phoneme deletion scores which are obtained in advance in a training session and, if appropriate, confidence data generated by a recognition system if the sequences are generated from speech.Type: GrantFiled: October 25, 2000Date of Patent: December 18, 2007Assignee: Canon Kabushiki KaishaInventors: Philip Neil Garner, Jason Peter Andrew Charlesworth, Asako Higuchi
-
Patent number: 7295980Abstract: A system is provided for matching two or more sequences of phonemes both or all of which may be generated from text or speech. A dynamic programming matching technique is preferably used having constraints which depend upon whether or not the two sequences are generated from text or speech and in which the scoring of the dynamic programming paths is weighted by phoneme confusion scores, phoneme insertion scores and phoneme deletion scores where appropriate.Type: GrantFiled: August 31, 2006Date of Patent: November 13, 2007Assignee: Canon Kabushiki KaishaInventors: Philip Neil Garner, Jason Peter Andrew Charlesworth, Asako Higuchi
-
Patent number: 7257533Abstract: A data structure is provided for annotating data files within a database. The annotation data comprises a phoneme and word lattice which allows the quick and efficient searching of data files within the database in response to a user's input query. The structure of the annotation data is such that it allows the input query to be made by voice and can be used for annotating various kinds of data files, such as audio data files, video data files, multimedia data files etc. The annotation data may be generated from the data files themselves or may be input by the user either from a voiced input or from a typed input.Type: GrantFiled: September 22, 2005Date of Patent: August 14, 2007Assignee: Canon Kabushiki KaishaInventors: Jason Peter Andrew Charlesworth, Philip Neil Garner, Jebu Jacob Rajan
-
Patent number: 7240003Abstract: A data structure is provided for annotating data files within a database. The annotation data comprises a phoneme and word lattice which allows the quick and efficient searching of data files within the database, in response to a user's input query for desired information. The phoneme and word lattice comprises a plurality of time-ordered nodes, and a plurality of links extending between the nodes. Each link has a phoneme or word associated with it. The nodes are arranged in a sequence of time-ordered blocks such that further data can be conveniently added to the lattice.Type: GrantFiled: September 28, 2001Date of Patent: July 3, 2007Assignee: Canon Kabushiki KaishaInventors: Jason Peter Andrew Charlesworth, Philip Neil Garner
-
Patent number: 7212968Abstract: A dynamic programming technique is provided for matching two sequences of phonemes both of which may be generated from text or speech. The scoring of the dynamic programming matching technique uses phoneme confusion scores, phoneme insertion scores and phoneme deletion scores which are obtained in advance in a training session and, if appropriate, confidence data generated by a recognition system if the sequences are generated from speech.Type: GrantFiled: October 25, 2000Date of Patent: May 1, 2007Assignee: Canon Kabushiki KaishaInventors: Philip Neil Garner, Jason Peter Andrew Charlesworth, Asako Higuchi
-
Patent number: 7054812Abstract: A system is provided for determining a sequence of sub-word units representative of at least two words output by a word recognition unit in response to an input word to be recognized. In a preferred embodiment, the word alternatives output by the recognition unit are converted into sequences of phonemes. An optimum alignment between these sequences is then determined using a dynamic programming alignment technique. The sequence of phonemes representative of the input sequences is then determined using this optimum alignment.Type: GrantFiled: April 25, 2001Date of Patent: May 30, 2006Assignee: Canon Kabushiki KaishaInventors: Jason Peter Andrew Charlesworth, Philip Neil Garner
-
Patent number: 6990448Abstract: The data structure is used in accessing a plurality of data files. The data stucture comprises a plurality of annotation storage areas adapted to correspond with the data files, each annotation storage area containing an annotation an annotation representing a time sequential signal and each annotation storage area comprising a plurality of block storage areas each containing phoneme and word data forming a respective temporal block of the annotation and each block having an associated time index identifying a timing of the block within the corresponding annotation. Each block storage area includes a plurality of node storage areas, each asociated with a node which represents a point in time at which a word and/or phoneme begins or ends within the corresponding annotation, and each node storage area having a time offset storage area containing a time offset defining the point in time represented by the node relative to the time index associated with the corresponding block.Type: GrantFiled: August 23, 2001Date of Patent: January 24, 2006Assignee: Canon Kabushiki KaishaInventors: Jason Peter Andrew Charlesworth, Philip Neil Garner, Jebu Jacob Rajan
-
Patent number: 6882970Abstract: A system is provided for comparing an input query with a number of stored annotations to identify information to be retrieved from a database. The comparison technique divides the input query into a number of fixed-size fragments and identifies how many times each of the fragments occurs within each annotation using a dynamic programming matching technique. The frequencies of occurrence of the fragments in both the query and the annotation are then compared to provide a measure of the similarity between the query and the annotation. The information to be retrieved is then determined from the similarity measures obtained for all the annotations.Type: GrantFiled: October 25, 2000Date of Patent: April 19, 2005Assignee: Canon Kabushiki KaishaInventors: Philip Neil Garner, Jason Peter Andrew Charlesworth, Asako Higuchi
-
Patent number: 6873993Abstract: An indexing apparatus and method are described for use in identifying portions of data in a database for comparison with a query. In an embodiment, the index includes a key which comprises a sequence of phoneme classifications derived from the input query by classifying each of the phonemes in the input query with a number of phoneme classes, with the phonemes in each class being defined as those that are confusable with the other phonemes in the same class.Type: GrantFiled: May 24, 2001Date of Patent: March 29, 2005Assignee: Canon Kabushiki KaishaInventors: Jason Peter Andrew Charlesworth, Philip Neil Garner
-
Patent number: 6801891Abstract: A system is provided for decoding one or more sequences of sub-word units output by a speech recognition system into one or more representative words. The system uses a dynamic programming technique to align the sequence of sub-word units output by the recognition system with a number of dictionary sub-word unit sequences representative of dictionary words to identify the most likely word or words corresponding to the spoken input.Type: GrantFiled: November 13, 2001Date of Patent: October 5, 2004Assignee: Canon Kabushiki KaishaInventors: Philip Neil Garner, Jason Peter Andrew Charlesworth
-
Publication number: 20030177108Abstract: A data structure is provided for annotating data files within a database. The annotation data comprises a phoneme and word lattice which allows the quick and efficient searching of data files within the database, in response to a user's input query for desired information. The phoneme and word lattice comprises a plurality of time-ordered nodes, and a plurality of links extending between the nodes. Each link has a phoneme or word associated with it. The nodes are arranged in a sequence of time-ordered blocks such that further data can be conveniently added to the lattice.Type: ApplicationFiled: March 7, 2003Publication date: September 18, 2003Inventors: Jason Peter Andrew Charlesworth, Philip Neil Garner
-
Publication number: 20020120448Abstract: A system is provided for decoding one or more sequences of sub-word units output by a speech recognition system into one or more representative words. The system uses a dynamic programming technique to align the sequence of sub-word units output by the recognition system with a number of dictionary sub-word unit sequences representative of dictionary words to identify the most likely word or words corresponding to the spoken input.Type: ApplicationFiled: November 13, 2001Publication date: August 29, 2002Inventors: Philip Neil Garner, Jason Peter Andrew Charlesworth
-
Publication number: 20020052740Abstract: A data structure is provided for annotating data files within a database. The annotation data comprises a phoneme and word lattice which allows the quick and efficient searching of data files within the database in response to a user's input query. The structure of the annotation data is such that it allows the input query to be made by voice and can be used for annotating various kinds of data files, such as audio data files, video data files, multimedia data files etc. The annotation data may be generated from the data files themselves or may be input by the user either from a voiced input or from a typed input.Type: ApplicationFiled: August 23, 2001Publication date: May 2, 2002Inventors: Jason Peter Andrew Charlesworth, Philip Neil Garner, Jebu Jacob Rajan
-
Publication number: 20020052870Abstract: An indexing apparatus and method are described for use in identifying portions of data in a database for comparison with a query. In an embodiment, the index includes a key which comprises a sequence of phoneme classifications derived from the input query by classifying each of the phonemes in the input query with a number of phoneme classes, with the phonemes in each class being defined as those that are confusable with the other phonemes in the same class.Type: ApplicationFiled: May 24, 2001Publication date: May 2, 2002Inventors: Jason Peter Andrew Charlesworth, Philip Neil Garner
-
Publication number: 20020022960Abstract: A system is provided for determining a sequence of sub-word units representative of at least two words output by a word recognition unit in response to an input word to be recognized. In a preferred embodiment, the word alternatives output by the recognition unit are converted into sequences of phonemes. An optimum alignment between these sequences is then determined using a dynamic programming alignment technique. The sequence of phonemes representative of the input sequences is then determined using this optimum alignment.Type: ApplicationFiled: April 25, 2001Publication date: February 21, 2002Inventors: Jason Peter Andrew Charlesworth, Philip Neil Garner