Patents by Inventor Philip Neil Garner

Philip Neil Garner has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Apparatus and method for constructing multilingual acoustic model and computer readable recording medium for storing program for performing the method

Patent number: 10460043

Abstract: An apparatus and a method for constructing a multilingual acoustic model, and a computer readable recording medium are provided. The method for constructing a multilingual acoustic model includes dividing an input feature into a common language portion and a distinctive language portion, acquiring a tandem feature by training the divided common language portion and distinctive language portion using a neural network to estimate and remove correlation between phonemes, dividing parameters of an initial acoustic model constructed using the tandem feature into common language parameters and distinctive language parameters, adapting the common language parameters using data of a training language, adapting the distinctive language parameters using data of a target language, and constructing an acoustic model for the target language using the adapted common language parameters and the adapted distinctive language parameters.

Type: Grant

Filed: November 22, 2013

Date of Patent: October 29, 2019

Assignees: SAMSUNG ELECTRONICS CO., LTD., IDIAP RESEARCH INSTITUTE

Inventors: Nam-Hoon Kim, Petr Motlicek, Philip Neil Garner, David Imseng, Jae-won Lee, Jeong-Mi Cho
APPARATUS AND METHOD FOR CONSTRUCTING MULTILINGUAL ACOUSTIC MODEL AND COMPUTER READABLE RECORDING MEDIUM FOR STORING PROGRAM FOR PERFORMING THE METHOD

Publication number: 20140149104

Abstract: An apparatus and a method for constructing a multilingual acoustic model, and a computer readable recording medium are provided. The method for constructing a multilingual acoustic model includes dividing an input feature into a common language portion and a distinctive language portion, acquiring a tandem feature by training the divided common language portion and distinctive language portion using a neural network to estimate and remove correlation between phonemes, dividing parameters of an initial acoustic model constructed using the tandem feature into common language parameters and distinctive language parameters, adapting the common language parameters using data of a training language, adapting the distinctive language parameters using data of a target language, and constructing an acoustic model for the target language using the adapted common language parameters and the adapted distinctive language parameters.

Type: Application

Filed: November 22, 2013

Publication date: May 29, 2014

Applicants: IDIAP RESEARCH INSTITUTE, SAMSUNG ELECTRONICS CO., LTD.

Inventors: Nam-Hoon KIM, Petr MOTLICEK, Philip Neil GARNER, David IMSENG, Jae-won LEE, Jeong-Mi CHO
Language recognition using a similarity measure

Patent number: 7310600

Abstract: A dynamic programming technique is provided for matching two sequences of phonemes both of which may be generated from text or speech. The scoring of the dynamic programming matching technique uses phoneme confusion scores, phoneme insertion scores and phoneme deletion scores which are obtained in advance in a training session and, if appropriate, confidence data generated by a recognition system if the sequences are generated from speech.

Type: Grant

Filed: October 25, 2000

Date of Patent: December 18, 2007

Assignee: Canon Kabushiki Kaisha

Inventors: Philip Neil Garner, Jason Peter Andrew Charlesworth, Asako Higuchi
Pattern matching method and apparatus

Patent number: 7295980

Abstract: A system is provided for matching two or more sequences of phonemes both or all of which may be generated from text or speech. A dynamic programming matching technique is preferably used having constraints which depend upon whether or not the two sequences are generated from text or speech and in which the scoring of the dynamic programming paths is weighted by phoneme confusion scores, phoneme insertion scores and phoneme deletion scores where appropriate.

Type: Grant

Filed: August 31, 2006

Date of Patent: November 13, 2007

Assignee: Canon Kabushiki Kaisha

Inventors: Philip Neil Garner, Jason Peter Andrew Charlesworth, Asako Higuchi
Database searching and retrieval using phoneme and word lattice

Patent number: 7257533

Abstract: A data structure is provided for annotating data files within a database. The annotation data comprises a phoneme and word lattice which allows the quick and efficient searching of data files within the database in response to a user's input query. The structure of the annotation data is such that it allows the input query to be made by voice and can be used for annotating various kinds of data files, such as audio data files, video data files, multimedia data files etc. The annotation data may be generated from the data files themselves or may be input by the user either from a voiced input or from a typed input.

Type: Grant

Filed: September 22, 2005

Date of Patent: August 14, 2007

Assignee: Canon Kabushiki Kaisha

Inventors: Jason Peter Andrew Charlesworth, Philip Neil Garner, Jebu Jacob Rajan
Database annotation and retrieval

Patent number: 7240003

Abstract: A data structure is provided for annotating data files within a database. The annotation data comprises a phoneme and word lattice which allows the quick and efficient searching of data files within the database, in response to a user's input query for desired information. The phoneme and word lattice comprises a plurality of time-ordered nodes, and a plurality of links extending between the nodes. Each link has a phoneme or word associated with it. The nodes are arranged in a sequence of time-ordered blocks such that further data can be conveniently added to the lattice.

Type: Grant

Filed: September 28, 2001

Date of Patent: July 3, 2007

Assignee: Canon Kabushiki Kaisha

Inventors: Jason Peter Andrew Charlesworth, Philip Neil Garner
Pattern matching method and apparatus

Patent number: 7212968

Abstract: A dynamic programming technique is provided for matching two sequences of phonemes both of which may be generated from text or speech. The scoring of the dynamic programming matching technique uses phoneme confusion scores, phoneme insertion scores and phoneme deletion scores which are obtained in advance in a training session and, if appropriate, confidence data generated by a recognition system if the sequences are generated from speech.

Type: Grant

Filed: October 25, 2000

Date of Patent: May 1, 2007

Assignee: Canon Kabushiki Kaisha

Inventors: Philip Neil Garner, Jason Peter Andrew Charlesworth, Asako Higuchi
Database annotation and retrieval

Patent number: 7054812

Abstract: A system is provided for determining a sequence of sub-word units representative of at least two words output by a word recognition unit in response to an input word to be recognized. In a preferred embodiment, the word alternatives output by the recognition unit are converted into sequences of phonemes. An optimum alignment between these sequences is then determined using a dynamic programming alignment technique. The sequence of phonemes representative of the input sequences is then determined using this optimum alignment.

Type: Grant

Filed: April 25, 2001

Date of Patent: May 30, 2006

Assignee: Canon Kabushiki Kaisha

Inventors: Jason Peter Andrew Charlesworth, Philip Neil Garner
Database annotation and retrieval including phoneme data

Patent number: 6990448

Abstract: The data structure is used in accessing a plurality of data files. The data stucture comprises a plurality of annotation storage areas adapted to correspond with the data files, each annotation storage area containing an annotation an annotation representing a time sequential signal and each annotation storage area comprising a plurality of block storage areas each containing phoneme and word data forming a respective temporal block of the annotation and each block having an associated time index identifying a timing of the block within the corresponding annotation. Each block storage area includes a plurality of node storage areas, each asociated with a node which represents a point in time at which a word and/or phoneme begins or ends within the corresponding annotation, and each node storage area having a time offset storage area containing a time offset defining the point in time represented by the node relative to the time index associated with the corresponding block.

Type: Grant

Filed: August 23, 2001

Date of Patent: January 24, 2006

Assignee: Canon Kabushiki Kaisha

Inventors: Jason Peter Andrew Charlesworth, Philip Neil Garner, Jebu Jacob Rajan
Language recognition using sequence frequency

Patent number: 6882970

Abstract: A system is provided for comparing an input query with a number of stored annotations to identify information to be retrieved from a database. The comparison technique divides the input query into a number of fixed-size fragments and identifies how many times each of the fragments occurs within each annotation using a dynamic programming matching technique. The frequencies of occurrence of the fragments in both the query and the annotation are then compared to provide a measure of the similarity between the query and the annotation. The information to be retrieved is then determined from the similarity measures obtained for all the annotations.

Type: Grant

Filed: October 25, 2000

Date of Patent: April 19, 2005

Assignee: Canon Kabushiki Kaisha

Inventors: Philip Neil Garner, Jason Peter Andrew Charlesworth, Asako Higuchi
Indexing method and apparatus

Patent number: 6873993

Abstract: An indexing apparatus and method are described for use in identifying portions of data in a database for comparison with a query. In an embodiment, the index includes a key which comprises a sequence of phoneme classifications derived from the input query by classifying each of the phonemes in the input query with a number of phoneme classes, with the phonemes in each class being defined as those that are confusable with the other phonemes in the same class.

Type: Grant

Filed: May 24, 2001

Date of Patent: March 29, 2005

Assignee: Canon Kabushiki Kaisha

Inventors: Jason Peter Andrew Charlesworth, Philip Neil Garner
Speech processing system

Patent number: 6801891

Abstract: A system is provided for decoding one or more sequences of sub-word units output by a speech recognition system into one or more representative words. The system uses a dynamic programming technique to align the sequence of sub-word units output by the recognition system with a number of dictionary sub-word unit sequences representative of dictionary words to identify the most likely word or words corresponding to the spoken input.

Type: Grant

Filed: November 13, 2001

Date of Patent: October 5, 2004

Assignee: Canon Kabushiki Kaisha

Inventors: Philip Neil Garner, Jason Peter Andrew Charlesworth
Database annotation and retrieval

Publication number: 20030177108

Abstract: A data structure is provided for annotating data files within a database. The annotation data comprises a phoneme and word lattice which allows the quick and efficient searching of data files within the database, in response to a user's input query for desired information. The phoneme and word lattice comprises a plurality of time-ordered nodes, and a plurality of links extending between the nodes. Each link has a phoneme or word associated with it. The nodes are arranged in a sequence of time-ordered blocks such that further data can be conveniently added to the lattice.

Type: Application

Filed: March 7, 2003

Publication date: September 18, 2003

Inventors: Jason Peter Andrew Charlesworth, Philip Neil Garner
Speech processing system

Publication number: 20020120448

Abstract: A system is provided for decoding one or more sequences of sub-word units output by a speech recognition system into one or more representative words. The system uses a dynamic programming technique to align the sequence of sub-word units output by the recognition system with a number of dictionary sub-word unit sequences representative of dictionary words to identify the most likely word or words corresponding to the spoken input.

Type: Application

Filed: November 13, 2001

Publication date: August 29, 2002

Inventors: Philip Neil Garner, Jason Peter Andrew Charlesworth
Indexing method and apparatus

Publication number: 20020052870

Abstract: An indexing apparatus and method are described for use in identifying portions of data in a database for comparison with a query. In an embodiment, the index includes a key which comprises a sequence of phoneme classifications derived from the input query by classifying each of the phonemes in the input query with a number of phoneme classes, with the phonemes in each class being defined as those that are confusable with the other phonemes in the same class.

Type: Application

Filed: May 24, 2001

Publication date: May 2, 2002

Inventors: Jason Peter Andrew Charlesworth, Philip Neil Garner
Database annotation and retrieval

Publication number: 20020052740

Abstract: A data structure is provided for annotating data files within a database. The annotation data comprises a phoneme and word lattice which allows the quick and efficient searching of data files within the database in response to a user's input query. The structure of the annotation data is such that it allows the input query to be made by voice and can be used for annotating various kinds of data files, such as audio data files, video data files, multimedia data files etc. The annotation data may be generated from the data files themselves or may be input by the user either from a voiced input or from a typed input.

Type: Application

Filed: August 23, 2001

Publication date: May 2, 2002

Inventors: Jason Peter Andrew Charlesworth, Philip Neil Garner, Jebu Jacob Rajan
Database annotation and retrieval

Publication number: 20020022960

Abstract: A system is provided for determining a sequence of sub-word units representative of at least two words output by a word recognition unit in response to an input word to be recognized. In a preferred embodiment, the word alternatives output by the recognition unit are converted into sequences of phonemes. An optimum alignment between these sequences is then determined using a dynamic programming alignment technique. The sequence of phonemes representative of the input sequences is then determined using this optimum alignment.

Type: Application

Filed: April 25, 2001

Publication date: February 21, 2002

Inventors: Jason Peter Andrew Charlesworth, Philip Neil Garner