Patents by Inventor Claudio Vair

Claudio Vair has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20230267936
    Abstract: There is provided a method that includes (a) obtaining a first voice vector that was derived from a signal of a voice that was sampled at a first sampling frequency, (b) obtaining a second voice vector that was derived from a signal of a voice that was sampled at a second sampling frequency, (c) mapping the second voice vector into a mapped voice vector in accordance with a machine learning model, and (d) comparing the first voice vector to the mapped voice vector to yield a score that indicates a probability that the first voice vector and the second voice vector originated from a same person.
    Type: Application
    Filed: February 23, 2022
    Publication date: August 24, 2023
    Applicant: NUANCE COMMUNICATIONS, INC.
    Inventors: Claudio VAIR, Haydar TALIB, Kevin Robert FARRELL, Daniele Ernesto COLIBRO
  • Patent number: 11698953
    Abstract: A method, computer program product, and computing system for defining a correction factor for a biometric profile of a plurality of biometric profiles based upon, at least in part, a detection performance metric associated with the biometric profile. The biometric profile may be adjusted based upon, at least in part, the detection policy for the biometric profile.
    Type: Grant
    Filed: January 13, 2021
    Date of Patent: July 11, 2023
    Assignee: Nuance Communications, Inc.
    Inventors: Haydar Talib, Daniele Ernesto Colibro, Claudio Vair
  • Publication number: 20230131359
    Abstract: A method, computer program product, and computing system for generating a generative model representative of a plurality of natural biometric profiles. A plurality of random samples are generated from the generative model. A plurality of synthetic biometric profiles are generated based upon, at least in part, the plurality of random samples.
    Type: Application
    Filed: October 7, 2022
    Publication date: April 27, 2023
    Inventors: Haydar Talib, Claudio Vair, Kevin Robert Farrell, Daniele Ernesto Colibro
  • Publication number: 20220222328
    Abstract: A method, computer program product, and computing system for defining a correction factor for a biometric profile of a plurality of biometric profiles based upon, at least in part, a detection performance metric associated with the biometric profile. The biometric profile may be adjusted based upon, at least in part, the detection policy for the biometric profile.
    Type: Application
    Filed: January 13, 2021
    Publication date: July 14, 2022
    Inventors: Haydar Talib, Daniele Ernesto Colibro, Claudio Vair
  • Patent number: 9865266
    Abstract: Typical speaker verification systems usually employ speakers' audio data collected during an enrollment phase when users enroll with the system and provide respective voice samples. Due to technical, business, or other constraints, the enrollment data may not be large enough or rich enough to encompass different inter-speaker and intra-speaker variations. According to at least one embodiment, a method and apparatus employing classifier adaptation based on field data in a deployed voice-based interactive system comprise: collecting representations of voice characteristics, in association with corresponding speakers, the representations being generated by the deployed voice-based interactive system; updating parameters of the classifier, used in speaker recognition, based on the representations collected; and employing the classifier, with the corresponding parameters updated, in performing speaker recognition.
    Type: Grant
    Filed: February 25, 2013
    Date of Patent: January 9, 2018
    Assignee: Nuance Communications, Inc.
    Inventors: Daniele Ernesto Colibro, Claudio Vair, Kevin R. Farrell
  • Patent number: 9728191
    Abstract: Techniques for automatically identifying a speaker in a conversation as a known person based on processing of audio of the speaker's voice to extract characteristics of that voice and on an automated comparison of those characteristics to known characteristics of the known person's voice. A speaker segmentation process may be performed on audio of the conversation to produce, for each speaker in the conversation, a segment that includes the audio of that speaker. Audio of each of the segments may then be processed to extract characteristics of that speaker's voice. The characteristics derived from each segment (and thus for multiple speakers) may then be compared to characteristics of the known person's voice to determine whether the speaker for that segment is the known person. For each segment, a degree of match between the voice characteristics of the speaker and the voice characteristics of the known person may be calculated.
    Type: Grant
    Filed: August 27, 2015
    Date of Patent: August 8, 2017
    Assignee: Nuance Communications, Inc.
    Inventors: Emanuele Dalmasso, Daniele Colibro, Claudio Vair, Kevin R. Farrell
  • Publication number: 20170061968
    Abstract: Techniques for automatically identifying a speaker in a conversation as a known person based on processing of audio of the speaker's voice to extract characteristics of that voice and on an automated comparison of those characteristics to known characteristics of the known person's voice. A speaker segmentation process may be performed on audio of the conversation to produce, for each speaker in the conversation, a segment that includes the audio of that speaker. Audio of each of the segments may then be processed to extract characteristics of that speaker's voice. The characteristics derived from each segment (and thus for multiple speakers) may then be compared to characteristics of the known person's voice to determine whether the speaker for that segment is the known person. For each segment, a degree of match between the voice characteristics of the speaker and the voice characteristics of the known person may be calculated.
    Type: Application
    Filed: August 27, 2015
    Publication date: March 2, 2017
    Applicant: Nuance Communications, Inc.
    Inventors: Emanuele Dalmasso, Daniele Colibro, Claudio Vair, Kevin R. Farrell
  • Patent number: 9373330
    Abstract: A method for performing speaker recognition comprises: estimating respective uncertainties of acoustic coverage of respective speech utterance(s) by first and second speakers, the acoustic coverage representing respective sounds used by the speakers when speaking; representing the respective uncertainties of acoustic coverage in a manner that allows for efficient memory usage by discarding dependencies between uncertainties of different sounds for the speakers; representing the respective uncertainties of acoustic coverage in a manner that allows for efficient computation by representing an inverse of the respective uncertainties of acoustic coverage and then discarding the dependencies between the uncertainties of different sounds for the speakers; and computing a score between the speech utterance(s) by the speakers in a manner that leverages the respective uncertainties of the acoustic coverage during the comparison, the score being indicative of a likelihood that the speakers are the same speaker.
    Type: Grant
    Filed: August 7, 2014
    Date of Patent: June 21, 2016
    Assignee: Nuance Communications, Inc.
    Inventors: Sandro Cumani, Claudio Vair, Daniele Ernesto Colibro, Pietro Laface, Kevin R. Farrell
  • Patent number: 9368109
    Abstract: Reliable speaker-based clustering of speech utterances allows improved speaker recognition and speaker-based speech segmentation. According to at least one example embodiment, an iterative bottom-up speaker-based clustering approach employs voiceprints of speech utterances, such as i-vectors. At each iteration, a clustering confidence score in terms of Silhouette Width Criterion (SWC) values is evaluated, and a pair of nearest clusters is merged into a single cluster. The pair of nearest clusters merged is determined based on a similarity score indicative of similarity between voiceprints associated with different clusters. A final clustering pattern is then determined as a set of clusters associated with an iteration corresponding to the highest clustering confidence score evaluated. The SWC used may further be a modified SWC enabling detection of an early stop of the iterative approach.
    Type: Grant
    Filed: May 31, 2013
    Date of Patent: June 14, 2016
    Assignee: Nuance Communications, Inc.
    Inventors: Daniele Ernesto Colibro, Claudio Vair, Kevin R. Farrell
  • Publication number: 20160042739
    Abstract: A method for performing speaker recognition comprises: estimating respective uncertainties of acoustic coverage of respective speech utterance(s) by first and second speakers, the acoustic coverage representing respective sounds used by the speakers when speaking; representing the respective uncertainties of acoustic coverage in a manner that allows for efficient memory usage by discarding dependencies between uncertainties of different sounds for the speakers; representing the respective uncertainties of acoustic coverage in a manner that allows for efficient computation by representing an inverse of the respective uncertainties of acoustic coverage and then discarding the dependencies between the uncertainties of different sounds for the speakers; and computing a score between the speech utterance(s) by the speakers in a manner that leverages the respective uncertainties of the acoustic coverage during the comparison, the score being indicative of a likelihood that the speakers are the same speaker.
    Type: Application
    Filed: August 7, 2014
    Publication date: February 11, 2016
    Inventors: Sandro Cumani, Claudio Vair, Daniele Ernesto Colibro, Pietro Laface, Kevin R. Farrell
  • Patent number: 9224391
    Abstract: A method for automatically providing a hypothesis of a linguistic formulation that is uttered by users of a voice service based on an automatic speech recognition system and that is outside a recognition domain of the automatic speech recognition system. The method includes providing a constrained and an unconstrained speech recognition from an input speech signal, identifying a part of the constrained speech recognition outside the recognition domain, identifying a part of the unconstrained speech recognition corresponding to the identified part of the constrained speech recognition, and providing the linguistic formulation hypothesis based on the identified part of the unconstrained speech recognition.
    Type: Grant
    Filed: February 17, 2005
    Date of Patent: December 29, 2015
    Assignee: Nuance Communications, Inc.
    Inventors: Daniele Colibro, Claudio Vair, Luciano Fissore, Cosmin Popovici
  • Publication number: 20140358541
    Abstract: Reliable speaker-based clustering of speech utterances allows improved speaker recognition and speaker-based speech segmentation. According to at least one example embodiment, an iterative bottom-up speaker-based clustering approach employs voiceprints of speech utterances, such as i-vectors. At each iteration, a clustering confidence score in terms of Silhouette Width Criterion (SWC) values is evaluated, and a pair of nearest clusters is merged into a single cluster. The pair of nearest clusters merged is determined based on a similarity score indicative of similarity between voiceprints associated with different clusters. A final clustering pattern is then determined as a set of clusters associated with an iteration corresponding to the highest clustering confidence score evaluated. The SWC used may further be a modified SWC enabling detection of an early stop of the iterative approach.
    Type: Application
    Filed: May 31, 2013
    Publication date: December 4, 2014
    Inventors: Daniele Ernesto Colibro, Claudio Vair, Kevin R. Farrell
  • Publication number: 20140244257
    Abstract: Typical speaker verification systems usually employ speakers' audio data collected during an enrollment phase when users enroll with the system and provide respective voice samples. Due to technical, business, or other constraints, the enrollment data may not be large enough or rich enough to encompass different inter-speaker and intra-speaker variations. According to at least one embodiment, a method and apparatus employing classifier adaptation based on field data in a deployed voice-based interactive system comprise: collecting representations of voice characteristics, in association with corresponding speakers, the representations being generated by the deployed voice-based interactive system; updating parameters of the classifier, used in speaker recognition, based on the representations collected; and employing the classifier, with the corresponding parameters updated, in performing speaker recognition.
    Type: Application
    Filed: February 25, 2013
    Publication date: August 28, 2014
    Applicant: NUANCE COMMUNICATIONS, INC.
    Inventors: Daniele Ernesto Colibro, Claudio Vair, Kevin R. Farrell
  • Patent number: 8566093
    Abstract: A method for compensating inter-session variability for automatic extraction of information from an input voice signal representing an utterance of a speaker, includes: processing the input voice signal to provide feature vectors each formed by acoustic features extracted from the input voice signal at a time frame; computing an intersession variability compensation feature vector; and computing compensated feature vectors based on the extracted feature vectors and the intersession variability compensation feature vector.
    Type: Grant
    Filed: May 16, 2006
    Date of Patent: October 22, 2013
    Assignee: Loquendo S.p.A.
    Inventors: Claudio Vair, Daniele Colibro, Pietro Laface
  • Patent number: 7912713
    Abstract: An automatic speech recognition method for identifying words from an input speech signal includes providing at least one hypothesis recognition based on the input speech signal, the hypothesis recognition being an individual hypothesis word or a sequence of individual hypothesis words, and computing a confidence measure for the hypothesis recognition, based on the input speech signal, wherein computing a confidence measure includes computing differential contributions to the confidence measure, each as a difference between a constrained acoustic score and an unconstrained acoustic score, weighting each differential contribution by applying thereto a cumulative distribution function of the differential contribution, so as to make the distributions of the confidence measures homogeneous in terms of rejection capability, as the language, vocabulary and grammar vary, and computing the confidence measure by averaging the weighted differential contributions.
    Type: Grant
    Filed: December 28, 2004
    Date of Patent: March 22, 2011
    Assignee: Loquendo S.p.A.
    Inventors: Claudio Vair, Daniele Colibro
  • Publication number: 20110040561
    Abstract: A method for compensating inter-session variability for automatic extraction of information from an input voice signal representing an utterance of a speaker, includes: processing the input voice signal to provide feature vectors each formed by acoustic features extracted from the input voice signal at a time frame; computing an intersession variability compensation feature vector; and computing compensated feature vectors based on the extracted feature vectors and the intersession variability compensation feature vector.
    Type: Application
    Filed: May 16, 2006
    Publication date: February 17, 2011
    Inventors: Claudio Vair, Daniele Colibro, Pietro Laface
  • Publication number: 20080312926
    Abstract: An automatic dual-step, text independent, language-independent speaker voice-print creation and speaker recognition method, wherein a neural network-based technique is used in a first step and a Markov model-based technique is used in a second step. In particular, the first step uses a neural network-based technique for decoding the content of what is uttered by the speaker in terms of language independent acoustic-phonetic classes, wherein the second step uses the sequence of language-independent acoustic-phonetic classes from the first step and employs a Markov model-based technique for creating the speaker voice-print and for recognizing the speaker. The combination of the two steps enables improvement in the accuracy and efficiency of the speaker voice-print creation and of the speaker recognition, without setting any constraints on the lexical content of the speaker utterance and on the language thereof.
    Type: Application
    Filed: May 24, 2005
    Publication date: December 18, 2008
    Inventors: Claudio Vair, Daniele Colibro, Luciano Fissore
  • Publication number: 20080270129
    Abstract: A method for automatically providing a hypothesis of a linguistic formulation that is uttered by users of a voice service based on an automatic speech recognition system and that is outside a recognition domain of the automatic speech recognition system. The method includes providing a constrained and an unconstrained speech recognition from an input speech signal, identifying a part of the constrained speech recognition outside the recognition domain, identifying a part of the unconstrained speech recognition corresponding to the identified part of the constrained speech recognition, and providing the linguistic formulation hypothesis based on the identified part of the unconstrained speech recognition.
    Type: Application
    Filed: February 17, 2005
    Publication date: October 30, 2008
    Applicant: Loquendo S.p.A.
    Inventors: Daniele Colibro, Claudio Vair, Luciano Fissore, Cosmin Popovici
  • Publication number: 20080114595
    Abstract: An automatic speech recognition method for identifying words from an input speech signal includes providing at least one hypothesis recognition based on the input speech signal, the hypothesis recognition being an individual hypothesis word or a sequence of individual hypothesis words, and computing a confidence measure for the hypothesis recognition, based on the input speech signal, wherein computing a confidence measure includes computing differential contributions to the confidence measure, each as a difference between a constrained acoustic score and an unconstrained acoustic score, weighting each differential contribution by applying thereto a cumulative distribution function of the differential contribution, so as to make the distributions of the confidence measures homogeneous in terms of rejection capability, as the language, vocabulary and grammar vary, and computing the confidence measure by averaging the weighted differential contributions.
    Type: Application
    Filed: December 28, 2004
    Publication date: May 15, 2008
    Inventors: Claudio Vair, Daniele Colibro