Patents by Inventor Roberto Gemello

Roberto Gemello has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Methods and apparatus for training an artificial neural network for use in speech recognition

Patent number: 9627532

Abstract: Methods and apparatus for training a multi-layer artificial neural network for use in speech recognition. The method comprises determining for a first speech pattern of the plurality of speech patterns, using a first processing pipeline, network activations for a plurality of nodes of the artificial neural network in response to providing the first speech pattern as input to the artificial neural network, determining based, at least in part, on the network activations and a selection criterion, whether the artificial neural network should be trained on the first speech pattern, and updating, using a second processing pipeline, network weights between nodes of the artificial neural network based, at least in part, on the network activations when it is determined that the artificial neural network should be trained on the first speech pattern.

Type: Grant

Filed: June 18, 2014

Date of Patent: April 18, 2017

Assignee: Nuance Communications, Inc.

Inventors: Roberto Gemello, Franco Mana, Dario Albesano
Method and System for Training a Neural Network

Publication number: 20160267380

Abstract: Training a neural network is a time consuming and computationally expensive task. Embodiments provide efficient methods and systems for neural network training One example embodiment is implemented by a plurality of agents, where each agent performs a pipelined gradient analysis to update respective local models of the neural network using respective subsets of data from a common pool of training data. In turn, a common global model of the neural network is updated based upon the local models.

Type: Application

Filed: March 13, 2015

Publication date: September 15, 2016

Inventors: Roberto Gemello, Dario Albesano, Franco Mana
METHODS AND APPARATUS FOR TRAINING AN ARTIFICIAL NEURAL NETWORK FOR USE IN SPEECH RECOGNITION

Publication number: 20150371132

Abstract: Methods and apparatus for training a multi-layer artificial neural network for use in speech recognition. The method comprises determining for a first speech pattern of the plurality of speech patterns, using a first processing pipeline, network activations for a plurality of nodes of the artificial neural network in response to providing the first speech pattern as input to the artificial neural network, determining based, at least in part, on the network activations and a selection criterion, whether the artificial neural network should be trained on the first speech pattern, and updating, using a second processing pipeline, network weights between nodes of the artificial neural network based, at least in part, on the network activations when it is determined that the artificial neural network should be trained on the first speech pattern.

Type: Application

Filed: June 18, 2014

Publication date: December 24, 2015

Inventors: Roberto Gemello, Franco Mana, Dario Albesano
Conservative training method for adapting a neural network of an automatic speech recognition device

Patent number: 8126710

Abstract: A method of adapting a neural network of an automatic speech recognition device, includes the steps of: providing a neural network including an input stage, an intermediate stage and an output stage, the output stage outputting phoneme probabilities; providing a linear stage in the neural network; and training the linear stage by means of an adaptation set; wherein the step of providing the linear stage includes the step of providing the linear stage after the intermediate stage.

Type: Grant

Filed: June 1, 2005

Date of Patent: February 28, 2012

Assignee: Loquendo S.p.A.

Inventors: Roberto Gemello, Franco Mana
Method for accelerating the execution of speech recognition neural networks and the related speech recognition device

Patent number: 7827031

Abstract: A neural network in a speech-recognition system has computing units organized in levels including at least one hidden level and one output level. The computing units of the hidden level are connected to the computing units of the output level via weighted connections, and the computing units of the output level correspond to acoustic-phonetic units of the general vocabulary. This network executes the following steps: determining a subset of acoustic-phonetic units necessary for recognizing all the words contained in the general vocabulary subset; eliminating from the neural network all the weighted connections afferent to computing units of the output level that correspond to acoustic-phonetic units not contained in the previously determined subset of acoustic-phonetic units, thus obtaining a compacted neural network optimized for recognition of the words contained in the general vocabulary subset; and executing, at each moment in time, only the compacted neural network.

Type: Grant

Filed: February 12, 2003

Date of Patent: November 2, 2010

Assignee: Loquendo S.p.A.

Inventors: Dario Albesano, Roberto Gemello
Method of optimising the execution of a neural network in a speech recognition system through conditionally skipping a variable number of frames

Patent number: 7769580

Abstract: A method of optimizing the execution of a neural network in a speech recognition system provides for conditionally skipping a variable number of frames, depending on a distance computed between output probabilities, or likelihoods, of a neural network. The distance is initially evaluated between two frames at times 1 and 1+k, where k is a predetermined maximum distance between frames, and if such distance is sufficiently small, the frames between times 1 and 1+k are calculated by interpolation, avoiding further executions of the neural network. If, on the contrary, such distance is not small enough, it means that the outputs of the network are changing quickly, and it is not possible to skip too many frames. In that case, the method attempts to skip remaining frames, calculating and evaluating a new distance.

Type: Grant

Filed: December 23, 2002

Date of Patent: August 3, 2010

Assignee: Loquendo S.p.A.

Inventors: Roberto Gemello, Dario Albesano
Method of adapting a neural network of an automatic speech recognition device

Publication number: 20090216528

Abstract: A method of adapting a neural network of an automatic speech recognition device, includes the steps of: providing a neural network including an input stage, an intermediate stage and an output stage, the output stage outputting phoneme probabilities; providing a linear stage in the neural network; and training the linear stage by means of an adaptation set; wherein the step of providing the linear stage includes the step of providing the linear stage after the intermediate stage.

Type: Application

Filed: June 1, 2005

Publication date: August 27, 2009

Inventors: Roberto Gemello, Franco Mana
Noise reduction for automatic speech recognition

Patent number: 7376558

Abstract: Disclosed herein is a noise reduction method for automatic speech recognitionl.

Type: Grant

Filed: November 14, 2006

Date of Patent: May 20, 2008

Assignee: Loquendo S.p.A.

Inventors: Roberto Gemello, Franco Mana
Noise reduction for automatic speech recognition

Publication number: 20070260454

Abstract: Disclosed herein is a noise reduction method for automatic speech recognitionl.

Type: Application

Filed: November 14, 2006

Publication date: November 8, 2007

Inventors: Roberto Gemello, Franco Mana
Method of optimising the execution of a neural network in a speech recognition system through conditionally skipping a variable number of frames

Publication number: 20060111897

Abstract: A method of optimizing the execution of a neural network in a speech recognition system provides for conditionally skipping a variable number of frames, depending on a distance computed between output probabilities, or likelihoods, of a neural network. The distance is initially evaluated between two frames at times 1 and 1+k, where k is a predetermined maximum distance between frames, and if such distance is sufficiently small, the frames between times 1 and 1+k are calculated by interpolation, avoiding further executions of the neural network. If, on the contrary, such distance is not small enough, it means that the outputs of the network are changing quickly, and it is not possible to skip too many frames. In that case, the method attempts to skip remaining frames, calculating and evaluating a new distance.

Type: Application

Filed: December 23, 2002

Publication date: May 25, 2006

Inventors: Roberto Gemello, Dario Albesano
Method for accelerating the execution of speech recognition neural networks and the related speech recognition device

Publication number: 20050171766

Abstract: A method for accelerating neural network execution (4) in a speech recognition system, specifically for recognition of words contained in one or more subsets of a general vocabulary, involves the following steps.—at the recognition system initialisation phase, calculating the union of vocabulary subsets and determining the acoustic-phonetic units required for recognising the words contained in that union; re-compacting the neural network eliminating all the weighted connections afferent to computation output units corresponding to unnecessary acoustic-phonetic units;—executing unnecessary acoustic-phonetic units;—executing only the re-compacted network at each instant of time.

Type: Application

Filed: February 12, 2003

Publication date: August 4, 2005

Inventors: Dario Albesano, Roberto Gemello
Method for extracting voice signal features and related voice recognition system

Publication number: 20030191640

Abstract: A method for extracting sampled voice signal features for an automatic voice recognition system essentially comprises the following steps:

Type: Application

Filed: April 1, 2003

Publication date: October 9, 2003

Applicant: LOQUENDO S.p.A.

Inventors: Roberto Gemello, Franco Mana
Method of and a device for speech recognition employing neural network and markov model recognition techniques

Patent number: 6185528

Abstract: A method and a device for recognition of isolated words in large vocabularies are described, wherein recognition is performed through two sequential steps using neural networks and Markov models techniques, respectively, and the results of both techniques are adequately combined so as to improve recognition accuracy. The devices performing the combination also provide an evaluation of recognition reliability.

Type: Grant

Filed: April 29, 1999

Date of Patent: February 6, 2001

Assignee: CSELT - Centro Studi e Laboratori Telecomunicazioni S.p.A.

Inventors: Luciano Fissore, Roberto Gemello, Franco Ravera
Method of speeding up the execution of neural networks for correlated signal processing

Patent number: 5742739

Abstract: A method of speeding up the execution of a wide class of neural networks for processing input signals evolving slowly through time, such as, for instance, voice, radar, sonar, video signals, and which requires no specialized, costly or hard-to-find hardware. The method requires storing, for the neurons in at least one level of the network, the activation value at a certain instant and comparing it with the one computed at the subsequent instant. If the activation is equal, the neuron carries out no activity, otherwise it propagates the difference in activation, multiplied by the interconnection weights, to the neurons it is connected to.

Type: Grant

Filed: February 28, 1996

Date of Patent: April 21, 1998

Assignee: Cselt - Centro Studi e Laboratori Telecomunicazioni S.P.A.

Inventors: Dario Albesano, Roberto Gemello, Franco Mana
Speaker independent isolated word recognition system using neural networks

Patent number: 5566270

Abstract: A speech recognition apparatus in which the speech signal is digitalized and subjected to special analysis, word end detection is effected by energy analysis of the speech signal and the recognition system utilizes a Markov model in combination with a neural network learning by specific training steps.

Type: Grant

Filed: May 5, 1994

Date of Patent: October 15, 1996

Assignee: CSELT-Centro Studi E Laboratori Telecomunicazioni S.p.A.

Inventors: Dario Albesano, Roberto Gemello, Franco Mana
Method of and device for moving image contour recognition

Patent number: 5103488

Abstract: The recognition method is applied to visual telephony image coding. Matrices of digital samples relevant to the individual frames of the video transmission are submitted to a first processing whereby the foreground region containing the figure is identified. The information concerning the elements of such a region is then processed by edge recognition algorithms to detect a group of elements possibly belonging to the contour. The group of elements is analyzed to select a sequence of elements distributed on the average along a line. The sequency of elements is processed by a neural network to build up the continuous contour which is then coded.

Type: Grant

Filed: June 7, 1990

Date of Patent: April 7, 1992

Assignee: CSELT Centro Studi e Laboratori Telecommunicazioni SpA

Inventors: Roberto Gemello, Cataldo Lettera, Lorenzo Masera