Patents by Inventor Roberto Gemello

Roberto Gemello has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 9627532
    Abstract: Methods and apparatus for training a multi-layer artificial neural network for use in speech recognition. The method comprises determining for a first speech pattern of the plurality of speech patterns, using a first processing pipeline, network activations for a plurality of nodes of the artificial neural network in response to providing the first speech pattern as input to the artificial neural network, determining based, at least in part, on the network activations and a selection criterion, whether the artificial neural network should be trained on the first speech pattern, and updating, using a second processing pipeline, network weights between nodes of the artificial neural network based, at least in part, on the network activations when it is determined that the artificial neural network should be trained on the first speech pattern.
    Type: Grant
    Filed: June 18, 2014
    Date of Patent: April 18, 2017
    Assignee: Nuance Communications, Inc.
    Inventors: Roberto Gemello, Franco Mana, Dario Albesano
  • Publication number: 20160267380
    Abstract: Training a neural network is a time consuming and computationally expensive task. Embodiments provide efficient methods and systems for neural network training One example embodiment is implemented by a plurality of agents, where each agent performs a pipelined gradient analysis to update respective local models of the neural network using respective subsets of data from a common pool of training data. In turn, a common global model of the neural network is updated based upon the local models.
    Type: Application
    Filed: March 13, 2015
    Publication date: September 15, 2016
    Inventors: Roberto Gemello, Dario Albesano, Franco Mana
  • Publication number: 20150371132
    Abstract: Methods and apparatus for training a multi-layer artificial neural network for use in speech recognition. The method comprises determining for a first speech pattern of the plurality of speech patterns, using a first processing pipeline, network activations for a plurality of nodes of the artificial neural network in response to providing the first speech pattern as input to the artificial neural network, determining based, at least in part, on the network activations and a selection criterion, whether the artificial neural network should be trained on the first speech pattern, and updating, using a second processing pipeline, network weights between nodes of the artificial neural network based, at least in part, on the network activations when it is determined that the artificial neural network should be trained on the first speech pattern.
    Type: Application
    Filed: June 18, 2014
    Publication date: December 24, 2015
    Inventors: Roberto Gemello, Franco Mana, Dario Albesano
  • Patent number: 8126710
    Abstract: A method of adapting a neural network of an automatic speech recognition device, includes the steps of: providing a neural network including an input stage, an intermediate stage and an output stage, the output stage outputting phoneme probabilities; providing a linear stage in the neural network; and training the linear stage by means of an adaptation set; wherein the step of providing the linear stage includes the step of providing the linear stage after the intermediate stage.
    Type: Grant
    Filed: June 1, 2005
    Date of Patent: February 28, 2012
    Assignee: Loquendo S.p.A.
    Inventors: Roberto Gemello, Franco Mana
  • Patent number: 7827031
    Abstract: A neural network in a speech-recognition system has computing units organized in levels including at least one hidden level and one output level. The computing units of the hidden level are connected to the computing units of the output level via weighted connections, and the computing units of the output level correspond to acoustic-phonetic units of the general vocabulary. This network executes the following steps: determining a subset of acoustic-phonetic units necessary for recognizing all the words contained in the general vocabulary subset; eliminating from the neural network all the weighted connections afferent to computing units of the output level that correspond to acoustic-phonetic units not contained in the previously determined subset of acoustic-phonetic units, thus obtaining a compacted neural network optimized for recognition of the words contained in the general vocabulary subset; and executing, at each moment in time, only the compacted neural network.
    Type: Grant
    Filed: February 12, 2003
    Date of Patent: November 2, 2010
    Assignee: Loquendo S.p.A.
    Inventors: Dario Albesano, Roberto Gemello
  • Patent number: 7769580
    Abstract: A method of optimizing the execution of a neural network in a speech recognition system provides for conditionally skipping a variable number of frames, depending on a distance computed between output probabilities, or likelihoods, of a neural network. The distance is initially evaluated between two frames at times 1 and 1+k, where k is a predetermined maximum distance between frames, and if such distance is sufficiently small, the frames between times 1 and 1+k are calculated by interpolation, avoiding further executions of the neural network. If, on the contrary, such distance is not small enough, it means that the outputs of the network are changing quickly, and it is not possible to skip too many frames. In that case, the method attempts to skip remaining frames, calculating and evaluating a new distance.
    Type: Grant
    Filed: December 23, 2002
    Date of Patent: August 3, 2010
    Assignee: Loquendo S.p.A.
    Inventors: Roberto Gemello, Dario Albesano
  • Publication number: 20090216528
    Abstract: A method of adapting a neural network of an automatic speech recognition device, includes the steps of: providing a neural network including an input stage, an intermediate stage and an output stage, the output stage outputting phoneme probabilities; providing a linear stage in the neural network; and training the linear stage by means of an adaptation set; wherein the step of providing the linear stage includes the step of providing the linear stage after the intermediate stage.
    Type: Application
    Filed: June 1, 2005
    Publication date: August 27, 2009
    Inventors: Roberto Gemello, Franco Mana
  • Patent number: 7376558
    Abstract: Disclosed herein is a noise reduction method for automatic speech recognitionl.
    Type: Grant
    Filed: November 14, 2006
    Date of Patent: May 20, 2008
    Assignee: Loquendo S.p.A.
    Inventors: Roberto Gemello, Franco Mana
  • Publication number: 20070260454
    Abstract: Disclosed herein is a noise reduction method for automatic speech recognitionl.
    Type: Application
    Filed: November 14, 2006
    Publication date: November 8, 2007
    Inventors: Roberto Gemello, Franco Mana
  • Publication number: 20060111897
    Abstract: A method of optimizing the execution of a neural network in a speech recognition system provides for conditionally skipping a variable number of frames, depending on a distance computed between output probabilities, or likelihoods, of a neural network. The distance is initially evaluated between two frames at times 1 and 1+k, where k is a predetermined maximum distance between frames, and if such distance is sufficiently small, the frames between times 1 and 1+k are calculated by interpolation, avoiding further executions of the neural network. If, on the contrary, such distance is not small enough, it means that the outputs of the network are changing quickly, and it is not possible to skip too many frames. In that case, the method attempts to skip remaining frames, calculating and evaluating a new distance.
    Type: Application
    Filed: December 23, 2002
    Publication date: May 25, 2006
    Inventors: Roberto Gemello, Dario Albesano
  • Publication number: 20050171766
    Abstract: A method for accelerating neural network execution (4) in a speech recognition system, specifically for recognition of words contained in one or more subsets of a general vocabulary, involves the following steps.—at the recognition system initialisation phase, calculating the union of vocabulary subsets and determining the acoustic-phonetic units required for recognising the words contained in that union; re-compacting the neural network eliminating all the weighted connections afferent to computation output units corresponding to unnecessary acoustic-phonetic units;—executing unnecessary acoustic-phonetic units;—executing only the re-compacted network at each instant of time.
    Type: Application
    Filed: February 12, 2003
    Publication date: August 4, 2005
    Inventors: Dario Albesano, Roberto Gemello
  • Publication number: 20030191640
    Abstract: A method for extracting sampled voice signal features for an automatic voice recognition system essentially comprises the following steps:
    Type: Application
    Filed: April 1, 2003
    Publication date: October 9, 2003
    Applicant: LOQUENDO S.p.A.
    Inventors: Roberto Gemello, Franco Mana
  • Patent number: 6185528
    Abstract: A method and a device for recognition of isolated words in large vocabularies are described, wherein recognition is performed through two sequential steps using neural networks and Markov models techniques, respectively, and the results of both techniques are adequately combined so as to improve recognition accuracy. The devices performing the combination also provide an evaluation of recognition reliability.
    Type: Grant
    Filed: April 29, 1999
    Date of Patent: February 6, 2001
    Assignee: CSELT - Centro Studi e Laboratori Telecomunicazioni S.p.A.
    Inventors: Luciano Fissore, Roberto Gemello, Franco Ravera
  • Patent number: 5742739
    Abstract: A method of speeding up the execution of a wide class of neural networks for processing input signals evolving slowly through time, such as, for instance, voice, radar, sonar, video signals, and which requires no specialized, costly or hard-to-find hardware. The method requires storing, for the neurons in at least one level of the network, the activation value at a certain instant and comparing it with the one computed at the subsequent instant. If the activation is equal, the neuron carries out no activity, otherwise it propagates the difference in activation, multiplied by the interconnection weights, to the neurons it is connected to.
    Type: Grant
    Filed: February 28, 1996
    Date of Patent: April 21, 1998
    Assignee: Cselt - Centro Studi e Laboratori Telecomunicazioni S.P.A.
    Inventors: Dario Albesano, Roberto Gemello, Franco Mana
  • Patent number: 5566270
    Abstract: A speech recognition apparatus in which the speech signal is digitalized and subjected to special analysis, word end detection is effected by energy analysis of the speech signal and the recognition system utilizes a Markov model in combination with a neural network learning by specific training steps.
    Type: Grant
    Filed: May 5, 1994
    Date of Patent: October 15, 1996
    Assignee: CSELT-Centro Studi E Laboratori Telecomunicazioni S.p.A.
    Inventors: Dario Albesano, Roberto Gemello, Franco Mana
  • Patent number: 5103488
    Abstract: The recognition method is applied to visual telephony image coding. Matrices of digital samples relevant to the individual frames of the video transmission are submitted to a first processing whereby the foreground region containing the figure is identified. The information concerning the elements of such a region is then processed by edge recognition algorithms to detect a group of elements possibly belonging to the contour. The group of elements is analyzed to select a sequence of elements distributed on the average along a line. The sequency of elements is processed by a neural network to build up the continuous contour which is then coded.
    Type: Grant
    Filed: June 7, 1990
    Date of Patent: April 7, 1992
    Assignee: CSELT Centro Studi e Laboratori Telecommunicazioni SpA
    Inventors: Roberto Gemello, Cataldo Lettera, Lorenzo Masera