Patents by Inventor Jason W. Pelecanos

Jason W. Pelecanos has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 10614797
    Abstract: A diarization embodiment may include a system that clusters data up to a current point in time and consolidates it with the past decisions, and then returns the result that minimizes the difference with past decisions. The consolidation may be achieved by performing a permutation of the different possible labels and comparing the distance. For speaker diarization, a distance may be determined based on a minimum edit or hamming distance. The distance may alternatively be a measure other than the minimum edit or hamming distance. The clustering may have a finite time window over which the analysis is performed.
    Type: Grant
    Filed: November 30, 2017
    Date of Patent: April 7, 2020
    Assignee: International Business Machines Corporation
    Inventors: Kenneth W. Church, Dimitrios B. Dimitriadis, Petr Fousek, Jason W. Pelecanos, Weizhong Zhu
  • Patent number: 10305765
    Abstract: Embodiments of the present invention provide a computer-implemented method for communicating a reference code for a transaction. The method monitors a communication session conducted between a user and an agent via a communication channel, extracts user and channel properties from the monitored communication session, selects a reference code from a set of references codes stored on a database, in which the selection is based at least in part on the extracted communication channel properties and the extracted user properties, and then communicates the selected reference code to the user.
    Type: Grant
    Filed: July 21, 2017
    Date of Patent: May 28, 2019
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Kenneth W. Church, Martin Franz, Nicholas S. Kersting, Jeffrey S. McCarley, Jason W. Pelecanos, Weizhong Zhu
  • Publication number: 20190028370
    Abstract: Embodiments of the present invention provide a computer-implemented method for communicating a reference code for a transaction. The method monitors a communication session conducted between a user and an agent via a communication channel, extracts user and channel properties from the monitored communication session, selects a reference code from a set of references codes stored on a database, in which the selection is based at least in part on the extracted communication channel properties and the extracted user properties, and then communicates the selected reference code to the user.
    Type: Application
    Filed: July 21, 2017
    Publication date: January 24, 2019
    Inventors: Kenneth W. Church, Martin Franz, Nicholas S. Kersting, Jeffrey S. McCarley, Jason W. Pelecanos, Weizhong Zhu
  • Patent number: 10147438
    Abstract: Embodiments of the invention include method, systems and computer program products for role modeling. Aspects of the invention include receiving, by a processor, audio data, wherein the audio data includes a plurality of audio conversation for one or more speakers. The one or more segments for each of the plurality of audio conversations are partitioned. A speaker is associated with each of the one or more segments. The one or more segments for each of the plurality of audio conversations are labeled with roles utilizing a speaker recognition engine. Speakers are clustered based at least in part on a number of times the speakers are present in an audio conversation.
    Type: Grant
    Filed: March 2, 2017
    Date of Patent: December 4, 2018
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Kenneth W. Church, Jason W. Pelecanos, Josef Vopicka, Weizhong Zhu
  • Publication number: 20180254051
    Abstract: Embodiments of the invention include method, systems and computer program products for role modeling. Aspects of the invention include receiving, by a processor, audio data, wherein the audio data includes a plurality of audio conversation for one or more speakers. The one or more segments for each of the plurality of audio conversations are partitioned. A speaker is associated with each of the one or more segments. The one or more segments for each of the plurality of audio conversations are labeled with roles utilizing a speaker recognition engine. Speakers are clustered based at least in part on a number of times the speakers are present in an audio conversation.
    Type: Application
    Filed: March 2, 2017
    Publication date: September 6, 2018
    Inventors: Kenneth W. Church, Jason W. Pelecanos, Josef Vopicka, Weizhong Zhu
  • Publication number: 20180158451
    Abstract: A diarization embodiment may include a system that clusters data up to a current point in time and consolidates it with the past decisions, and then returns the result that minimizes the difference with past decisions. The consolidation may be achieved by performing a permutation of the different possible labels and comparing the distance. For speaker diarization, a distance may be determined based on a minimum edit or hamming distance. The distance may alternatively be a measure other than the minimum edit or hamming distance. The clustering may have a finite time window over which the analysis is performed.
    Type: Application
    Filed: November 30, 2017
    Publication date: June 7, 2018
    Inventors: Kenneth W. CHURCH, Dimitrios B. DIMITRIADIS, Petr FOUSEK, Jason W. PELECANOS, Weizhong ZHU
  • Patent number: 9837080
    Abstract: Systems and methods for maintaining speaker recognition performance are provided. A method for maintaining speaker recognition performance, comprises training a plurality of models respectively corresponding to speaker recognition scores from a plurality of speakers over a plurality of sessions, and using the plurality of models to conclude whether a speaker seeking access to an environment is a non-ideal target speaker or a non-ideal non-target speaker. Using the plurality of models to conclude comprises calculating a first probability that the speaker seeking access is the non-ideal target speaker, calculating a second probability that the speaker seeking access is the non-ideal non-target speaker, and determining whether the first probability, the second probability or a sum of the first probability and the second probability is above a probability threshold.
    Type: Grant
    Filed: August 21, 2014
    Date of Patent: December 5, 2017
    Assignee: International Business Machines Corporation
    Inventors: Hagai Aronowitz, Shay Ben-David, David Nahamoo, Jason W. Pelecanos, Orith Toledo-Ronen
  • Publication number: 20160055844
    Abstract: Systems and methods for maintaining speaker recognition performance are provided. A method for maintaining speaker recognition performance, comprises training a plurality of models respectively corresponding to speaker recognition scores from a plurality of speakers over a plurality of sessions, and using the plurality of models to conclude whether a speaker seeking access to an environment is a non-ideal target speaker or a non-ideal non-target speaker. Using the plurality of models to conclude comprises calculating a first probability that the speaker seeking access is the non-ideal target speaker, calculating a second probability that the speaker seeking access is the non-ideal non-target speaker, and determining whether the first probability, the second probability or a sum of the first probability and the second probability is above a probability threshold.
    Type: Application
    Filed: August 21, 2014
    Publication date: February 25, 2016
    Inventors: Hagai Aronowitz, Shay Ben-David, David Nahamoo, Jason W. Pelecanos, Orith Toledo-Ronen
  • Patent number: 8930709
    Abstract: Methods and apparatus are provided for sequential authentication of a user that employ one or more error rates characterizing each security challenge. According to one aspect of the invention, a user is challenged with at least one knowledge challenge to obtain an intermediate authentication result; and the user challenges continue until a cumulative authentication result satisfies one or more criteria. The intermediate authentication result is based, for example, on one or more of false accept and false reject error probabilities for each knowledge challenge. A false accept error probability describes a probability of a different user answering the knowledge challenge correctly. A false reject error probability describes a probability of a genuine user not answering the knowledge challenge correctly. The false accept and false reject error probabilities can be adapted based on field data or known information about a given challenge.
    Type: Grant
    Filed: March 28, 2008
    Date of Patent: January 6, 2015
    Assignee: International Business Machines Corporation
    Inventors: Jiri Navratil, Ryan L. Osborn, Jason W. Pelecanos, Ganesh N. Ramaswamy, Ran D. Zilca
  • Patent number: 8589167
    Abstract: A signal representative of an unpredictable audio stimulus is provided to a putative live speaker within a putative live recording environment. A second signal purportedly emanating from the putative live speaker and/or the environment is received. This second signal is examined for influence of the unpredictable audio stimulus on the putative live speaker and/or the putative live recording environment. The examining includes at least one of audio feedback analysis, Lombard analysis, and evoked otoacoustic response analysis. Based on the examining, a determination is made as to whether the putative live speaker is an actual live speaker and/or whether the putative live recording environment is an actual live recording environment.
    Type: Grant
    Filed: May 11, 2011
    Date of Patent: November 19, 2013
    Assignee: Nuance Communications, Inc.
    Inventors: Aaron K. Baughman, Jason W. Pelecanos
  • Publication number: 20120290297
    Abstract: A signal representative of an unpredictable audio stimulus is provided to a putative live speaker within a putative live recording environment. A second signal purportedly emanating from the putative live speaker and/or the environment is received. This second signal is examined for influence of the unpredictable audio stimulus on the putative live speaker and/or the putative live recording environment. The examining includes at least one of audio feedback analysis, Lombard analysis, and evoked otoacoustic response analysis. Based on the examining, a determination is made as to whether the putative live speaker is an actual live speaker and/or whether the putative live recording environment is an actual live recording environment.
    Type: Application
    Filed: May 11, 2011
    Publication date: November 15, 2012
    Applicant: International Business Machines Corporation
    Inventors: Aaron K. Baughman, Jason W. Pelecanos
  • Patent number: 8224649
    Abstract: A method and apparatus for remote access to a target application is disclosed where a system administrator may establish telephonic contact with an interactive voice response system and obtain access to the target application by speech communication. The interactive response system may authenticate the system administrator by implementing various measures including biometric measures. Once access is granted, the interactive response system may broker a communication between the target application using text/data and the system administrator using natural language.
    Type: Grant
    Filed: June 2, 2004
    Date of Patent: July 17, 2012
    Assignee: International Business Machines Corporation
    Inventors: Upendra V. Chaudhari, Ryan L. Osborn, Jason W. Pelecanos, Ganesh N. Ramaswamy, Ran D. Zilca
  • Patent number: 8041571
    Abstract: A method and apparatus detect and localize electric faults in electrical power grids and circuit. High impedance faults are detected by analyzing data from remote sensor units deployed over the network using the algorithms of speech and speaker analysis software. This is accomplished by converting the voltage and/or current waveform readouts from the sensors into a digital form which is then transmitted to a computer located either near the sensors or at an operations center. The digitized data is converted by a dedicated software or software/hardware interface to a format accepted by a reliable and stable software solution, such as speech or speaker recognition software. The speech or speaker recognition software must be “trained” to recognize various signal patterns that either indicate or not the occurrence of a fault. The readout of the speech or speaker recognition software, if indicating a fault, is transmitted to a central processor and displayed to provide information on the most likely type of fault.
    Type: Grant
    Filed: January 5, 2007
    Date of Patent: October 18, 2011
    Assignee: International Business Machines Corporation
    Inventors: Sarah C. McAllister, Tomasz J. Nowicki, Jason W. Pelecanos, Grzegorz M. Swirszcz
  • Patent number: 7970614
    Abstract: The present invention provides a system and method for treating distortion propagated though a detection system. The system includes a compensation module that compensates for untreated distortions propagating through the detection compensation system, a user model pool that comprises of a plurality of model sets, and a model selector that selects at least one model set from plurality of model sets in the user model pool. The compensation is accomplished by continually producing scores distributed according to a prescribed distribution for the at least one model set and mitigating the adverse effects of the scores being distorted and lying off a pre-set operating point. The method for treating distortion propagated though a detection system includes receiving a signal from a remote device, and compensating the signal for untreated distortions.
    Type: Grant
    Filed: May 8, 2007
    Date of Patent: June 28, 2011
    Assignee: Nuance Communications, Inc.
    Inventors: Janice J. Kim, Jiri Navratil, Jason W. Pelecanos, Ganesh N. Ramaswamy
  • Patent number: 7813927
    Abstract: There is provided an apparatus for providing a Text Independent (TI) speaker recognition mode in a Text Dependent (TD) Hidden Markov Model (HMM) speaker recognition system and/or a Text Constrained (TC) HMM speaker recognition system. The apparatus includes a Gaussian Mixture Model (GMM) generator and a Gaussian weight normalizer. The GMM generator is for creating a GMM by pooling Gaussians from a plurality of HMM states. The Gaussian weight normalizer is for normalizing Gaussian weights with respect to the plurality of HMM states.
    Type: Grant
    Filed: June 4, 2008
    Date of Patent: October 12, 2010
    Assignee: Nuance Communications, Inc.
    Inventors: Jiri Navratil, James H. Nealand, Jason W. Pelecanos, Ganesh N. Ramaswamy, Ran D. Zilca
  • Publication number: 20080281596
    Abstract: The present invention provides a system and method for treating distortion propagated though a detection system. The system includes a compensation module that compensates for untreated distortions propagating through the detection compensation system, a user model pool that comprises of a plurality of model sets, and a model selector that selects at least one model set from plurality of model sets in the user model pool. The compensation is accomplished by continually producing scores distributed according to a prescribed distribution for the at least one model set and mitigating the adverse effects of the scores being distorted and lying off a pre-set operating point. The method for treating distortion propagated though a detection system includes receiving a signal from a remote device, and compensating the signal for untreated distortions.
    Type: Application
    Filed: May 8, 2007
    Publication date: November 13, 2008
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Janice J. Kim, Jiri Navretil, Jason W. Pelecanos, Ganesh N. Ramaswamy
  • Patent number: 7447633
    Abstract: There is provided an apparatus for providing a Text Independent (TI) speaker recognition mode in a Text Dependent (TD) Hidden Markov Model (HMM) speaker recognition system and/or a Text Constrained (TC) HMM speaker recognition system. The apparatus includes a Gaussian Mixture Model (GMM) generator and a Gaussian weight normalizer. The GMM generator is for creating a GMM by pooling Gaussians from a plurality of HMM states. The Gaussian weight normalizer is for normalizing Gaussian weights with respect to the plurality of HMM states.
    Type: Grant
    Filed: November 22, 2004
    Date of Patent: November 4, 2008
    Assignee: International Business Machines Corporation
    Inventors: Jiri Navratil, James H. Nealand, Jason W. Pelecanos, Ganesh N. Ramaswamy, Ran D. Zilca
  • Publication number: 20080235020
    Abstract: There is provided an apparatus for providing a Text Independent (TI) speaker recognition mode in a Text Dependent (TD) Hidden Markov Model (HMM) speaker recognition system and/or a Text Constrained (TC) HMM speaker recognition system. The apparatus includes a Gaussian Mixture Model (GMM) generator and a Gaussian weight normalizer. The GMM generator is for creating a GMM by pooling Gaussians from a plurality of HMM states. The Gaussian weight normalizer is for normalizing Gaussian weights with respect to the plurality of HMM states.
    Type: Application
    Filed: June 4, 2008
    Publication date: September 25, 2008
    Inventors: Jiri Navratil, James H. Nealand, Jason W. Pelecanos, Ganesh N. Ramaswamy, Ran D. Zilca
  • Publication number: 20080222722
    Abstract: Methods and apparatus are provided for sequential authentication of a user that employ one or mole error rates characterizing each security challenge. According to one aspect of the invention, a user is challenged with at least one knowledge challenge to obtain an intermediate authentication result; and the user challenges continue until a cumulative authentication result satisfies one or more criteria. The intermediate authentication result is based, for example, on one or more of false accept and false reject error probabilities for each knowledge challenge. A false accept error probability describes a probability of a different user answering the knowledge challenge correctly. A false reject error probability describes a probability of a genuine user not answering the knowledge challenge correctly. The false accept and false reject error probabilities can be adapted based on field data or known information about a given challenge.
    Type: Application
    Filed: March 28, 2008
    Publication date: September 11, 2008
    Applicant: International Business Machines Corporation
    Inventors: Jiri Navratil, Ryan L. Osborn, Jason W. Pelecanos, Ganesh N. Ramaswamy, Ran D. Zilca
  • Publication number: 20080167877
    Abstract: A method and apparatus detect and localize electric faults in electrical power grids and circuit. High impedance faults are detected by analyzing data from remote sensor units deployed over the network using the algorithms of speech and speaker analysis software. This is accomplished by converting the voltage and/or current waveform readouts from the sensors into a digital form which is then transmitted to a computer located either near the sensors or at an operations center. The digitized data is converted by a dedicated software or software/hardware interface to a format accepted by a reliable and stable software solution, such as speech or speaker recognition software. The speech or speaker recognition software must be “trained” to recognize various signal patterns that either indicate or not the occurrence of a fault. The readout of the speech or speaker recognition software, if indicating a fault, is transmitted to a central processor and displayed to provide information on the most likely type of fault.
    Type: Application
    Filed: January 5, 2007
    Publication date: July 10, 2008
    Inventors: Sarah C. McAllister, Tomasz J. Nowicki, Jason W. Pelecanos, Grzegorz M. Swirszcz