Patents by Inventor Jason W. Pelecanos
Jason W. Pelecanos has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 10614797Abstract: A diarization embodiment may include a system that clusters data up to a current point in time and consolidates it with the past decisions, and then returns the result that minimizes the difference with past decisions. The consolidation may be achieved by performing a permutation of the different possible labels and comparing the distance. For speaker diarization, a distance may be determined based on a minimum edit or hamming distance. The distance may alternatively be a measure other than the minimum edit or hamming distance. The clustering may have a finite time window over which the analysis is performed.Type: GrantFiled: November 30, 2017Date of Patent: April 7, 2020Assignee: International Business Machines CorporationInventors: Kenneth W. Church, Dimitrios B. Dimitriadis, Petr Fousek, Jason W. Pelecanos, Weizhong Zhu
-
Adaptive selection of message data properties for improving communication throughput and reliability
Patent number: 10305765Abstract: Embodiments of the present invention provide a computer-implemented method for communicating a reference code for a transaction. The method monitors a communication session conducted between a user and an agent via a communication channel, extracts user and channel properties from the monitored communication session, selects a reference code from a set of references codes stored on a database, in which the selection is based at least in part on the extracted communication channel properties and the extracted user properties, and then communicates the selected reference code to the user.Type: GrantFiled: July 21, 2017Date of Patent: May 28, 2019Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Kenneth W. Church, Martin Franz, Nicholas S. Kersting, Jeffrey S. McCarley, Jason W. Pelecanos, Weizhong Zhu -
ADAPTIVE SELECTION OF MESSAGE DATA PROPERTIES FOR IMPROVING COMMUNICATION THROUGHPUT AND RELIABILITY
Publication number: 20190028370Abstract: Embodiments of the present invention provide a computer-implemented method for communicating a reference code for a transaction. The method monitors a communication session conducted between a user and an agent via a communication channel, extracts user and channel properties from the monitored communication session, selects a reference code from a set of references codes stored on a database, in which the selection is based at least in part on the extracted communication channel properties and the extracted user properties, and then communicates the selected reference code to the user.Type: ApplicationFiled: July 21, 2017Publication date: January 24, 2019Inventors: Kenneth W. Church, Martin Franz, Nicholas S. Kersting, Jeffrey S. McCarley, Jason W. Pelecanos, Weizhong Zhu -
Patent number: 10147438Abstract: Embodiments of the invention include method, systems and computer program products for role modeling. Aspects of the invention include receiving, by a processor, audio data, wherein the audio data includes a plurality of audio conversation for one or more speakers. The one or more segments for each of the plurality of audio conversations are partitioned. A speaker is associated with each of the one or more segments. The one or more segments for each of the plurality of audio conversations are labeled with roles utilizing a speaker recognition engine. Speakers are clustered based at least in part on a number of times the speakers are present in an audio conversation.Type: GrantFiled: March 2, 2017Date of Patent: December 4, 2018Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Kenneth W. Church, Jason W. Pelecanos, Josef Vopicka, Weizhong Zhu
-
Publication number: 20180254051Abstract: Embodiments of the invention include method, systems and computer program products for role modeling. Aspects of the invention include receiving, by a processor, audio data, wherein the audio data includes a plurality of audio conversation for one or more speakers. The one or more segments for each of the plurality of audio conversations are partitioned. A speaker is associated with each of the one or more segments. The one or more segments for each of the plurality of audio conversations are labeled with roles utilizing a speaker recognition engine. Speakers are clustered based at least in part on a number of times the speakers are present in an audio conversation.Type: ApplicationFiled: March 2, 2017Publication date: September 6, 2018Inventors: Kenneth W. Church, Jason W. Pelecanos, Josef Vopicka, Weizhong Zhu
-
Publication number: 20180158451Abstract: A diarization embodiment may include a system that clusters data up to a current point in time and consolidates it with the past decisions, and then returns the result that minimizes the difference with past decisions. The consolidation may be achieved by performing a permutation of the different possible labels and comparing the distance. For speaker diarization, a distance may be determined based on a minimum edit or hamming distance. The distance may alternatively be a measure other than the minimum edit or hamming distance. The clustering may have a finite time window over which the analysis is performed.Type: ApplicationFiled: November 30, 2017Publication date: June 7, 2018Inventors: Kenneth W. CHURCH, Dimitrios B. DIMITRIADIS, Petr FOUSEK, Jason W. PELECANOS, Weizhong ZHU
-
Patent number: 9837080Abstract: Systems and methods for maintaining speaker recognition performance are provided. A method for maintaining speaker recognition performance, comprises training a plurality of models respectively corresponding to speaker recognition scores from a plurality of speakers over a plurality of sessions, and using the plurality of models to conclude whether a speaker seeking access to an environment is a non-ideal target speaker or a non-ideal non-target speaker. Using the plurality of models to conclude comprises calculating a first probability that the speaker seeking access is the non-ideal target speaker, calculating a second probability that the speaker seeking access is the non-ideal non-target speaker, and determining whether the first probability, the second probability or a sum of the first probability and the second probability is above a probability threshold.Type: GrantFiled: August 21, 2014Date of Patent: December 5, 2017Assignee: International Business Machines CorporationInventors: Hagai Aronowitz, Shay Ben-David, David Nahamoo, Jason W. Pelecanos, Orith Toledo-Ronen
-
Publication number: 20160055844Abstract: Systems and methods for maintaining speaker recognition performance are provided. A method for maintaining speaker recognition performance, comprises training a plurality of models respectively corresponding to speaker recognition scores from a plurality of speakers over a plurality of sessions, and using the plurality of models to conclude whether a speaker seeking access to an environment is a non-ideal target speaker or a non-ideal non-target speaker. Using the plurality of models to conclude comprises calculating a first probability that the speaker seeking access is the non-ideal target speaker, calculating a second probability that the speaker seeking access is the non-ideal non-target speaker, and determining whether the first probability, the second probability or a sum of the first probability and the second probability is above a probability threshold.Type: ApplicationFiled: August 21, 2014Publication date: February 25, 2016Inventors: Hagai Aronowitz, Shay Ben-David, David Nahamoo, Jason W. Pelecanos, Orith Toledo-Ronen
-
Patent number: 8930709Abstract: Methods and apparatus are provided for sequential authentication of a user that employ one or more error rates characterizing each security challenge. According to one aspect of the invention, a user is challenged with at least one knowledge challenge to obtain an intermediate authentication result; and the user challenges continue until a cumulative authentication result satisfies one or more criteria. The intermediate authentication result is based, for example, on one or more of false accept and false reject error probabilities for each knowledge challenge. A false accept error probability describes a probability of a different user answering the knowledge challenge correctly. A false reject error probability describes a probability of a genuine user not answering the knowledge challenge correctly. The false accept and false reject error probabilities can be adapted based on field data or known information about a given challenge.Type: GrantFiled: March 28, 2008Date of Patent: January 6, 2015Assignee: International Business Machines CorporationInventors: Jiri Navratil, Ryan L. Osborn, Jason W. Pelecanos, Ganesh N. Ramaswamy, Ran D. Zilca
-
Patent number: 8589167Abstract: A signal representative of an unpredictable audio stimulus is provided to a putative live speaker within a putative live recording environment. A second signal purportedly emanating from the putative live speaker and/or the environment is received. This second signal is examined for influence of the unpredictable audio stimulus on the putative live speaker and/or the putative live recording environment. The examining includes at least one of audio feedback analysis, Lombard analysis, and evoked otoacoustic response analysis. Based on the examining, a determination is made as to whether the putative live speaker is an actual live speaker and/or whether the putative live recording environment is an actual live recording environment.Type: GrantFiled: May 11, 2011Date of Patent: November 19, 2013Assignee: Nuance Communications, Inc.Inventors: Aaron K. Baughman, Jason W. Pelecanos
-
Publication number: 20120290297Abstract: A signal representative of an unpredictable audio stimulus is provided to a putative live speaker within a putative live recording environment. A second signal purportedly emanating from the putative live speaker and/or the environment is received. This second signal is examined for influence of the unpredictable audio stimulus on the putative live speaker and/or the putative live recording environment. The examining includes at least one of audio feedback analysis, Lombard analysis, and evoked otoacoustic response analysis. Based on the examining, a determination is made as to whether the putative live speaker is an actual live speaker and/or whether the putative live recording environment is an actual live recording environment.Type: ApplicationFiled: May 11, 2011Publication date: November 15, 2012Applicant: International Business Machines CorporationInventors: Aaron K. Baughman, Jason W. Pelecanos
-
Patent number: 8224649Abstract: A method and apparatus for remote access to a target application is disclosed where a system administrator may establish telephonic contact with an interactive voice response system and obtain access to the target application by speech communication. The interactive response system may authenticate the system administrator by implementing various measures including biometric measures. Once access is granted, the interactive response system may broker a communication between the target application using text/data and the system administrator using natural language.Type: GrantFiled: June 2, 2004Date of Patent: July 17, 2012Assignee: International Business Machines CorporationInventors: Upendra V. Chaudhari, Ryan L. Osborn, Jason W. Pelecanos, Ganesh N. Ramaswamy, Ran D. Zilca
-
Patent number: 8041571Abstract: A method and apparatus detect and localize electric faults in electrical power grids and circuit. High impedance faults are detected by analyzing data from remote sensor units deployed over the network using the algorithms of speech and speaker analysis software. This is accomplished by converting the voltage and/or current waveform readouts from the sensors into a digital form which is then transmitted to a computer located either near the sensors or at an operations center. The digitized data is converted by a dedicated software or software/hardware interface to a format accepted by a reliable and stable software solution, such as speech or speaker recognition software. The speech or speaker recognition software must be “trained” to recognize various signal patterns that either indicate or not the occurrence of a fault. The readout of the speech or speaker recognition software, if indicating a fault, is transmitted to a central processor and displayed to provide information on the most likely type of fault.Type: GrantFiled: January 5, 2007Date of Patent: October 18, 2011Assignee: International Business Machines CorporationInventors: Sarah C. McAllister, Tomasz J. Nowicki, Jason W. Pelecanos, Grzegorz M. Swirszcz
-
Patent number: 7970614Abstract: The present invention provides a system and method for treating distortion propagated though a detection system. The system includes a compensation module that compensates for untreated distortions propagating through the detection compensation system, a user model pool that comprises of a plurality of model sets, and a model selector that selects at least one model set from plurality of model sets in the user model pool. The compensation is accomplished by continually producing scores distributed according to a prescribed distribution for the at least one model set and mitigating the adverse effects of the scores being distorted and lying off a pre-set operating point. The method for treating distortion propagated though a detection system includes receiving a signal from a remote device, and compensating the signal for untreated distortions.Type: GrantFiled: May 8, 2007Date of Patent: June 28, 2011Assignee: Nuance Communications, Inc.Inventors: Janice J. Kim, Jiri Navratil, Jason W. Pelecanos, Ganesh N. Ramaswamy
-
Patent number: 7813927Abstract: There is provided an apparatus for providing a Text Independent (TI) speaker recognition mode in a Text Dependent (TD) Hidden Markov Model (HMM) speaker recognition system and/or a Text Constrained (TC) HMM speaker recognition system. The apparatus includes a Gaussian Mixture Model (GMM) generator and a Gaussian weight normalizer. The GMM generator is for creating a GMM by pooling Gaussians from a plurality of HMM states. The Gaussian weight normalizer is for normalizing Gaussian weights with respect to the plurality of HMM states.Type: GrantFiled: June 4, 2008Date of Patent: October 12, 2010Assignee: Nuance Communications, Inc.Inventors: Jiri Navratil, James H. Nealand, Jason W. Pelecanos, Ganesh N. Ramaswamy, Ran D. Zilca
-
Publication number: 20080281596Abstract: The present invention provides a system and method for treating distortion propagated though a detection system. The system includes a compensation module that compensates for untreated distortions propagating through the detection compensation system, a user model pool that comprises of a plurality of model sets, and a model selector that selects at least one model set from plurality of model sets in the user model pool. The compensation is accomplished by continually producing scores distributed according to a prescribed distribution for the at least one model set and mitigating the adverse effects of the scores being distorted and lying off a pre-set operating point. The method for treating distortion propagated though a detection system includes receiving a signal from a remote device, and compensating the signal for untreated distortions.Type: ApplicationFiled: May 8, 2007Publication date: November 13, 2008Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Janice J. Kim, Jiri Navretil, Jason W. Pelecanos, Ganesh N. Ramaswamy
-
Patent number: 7447633Abstract: There is provided an apparatus for providing a Text Independent (TI) speaker recognition mode in a Text Dependent (TD) Hidden Markov Model (HMM) speaker recognition system and/or a Text Constrained (TC) HMM speaker recognition system. The apparatus includes a Gaussian Mixture Model (GMM) generator and a Gaussian weight normalizer. The GMM generator is for creating a GMM by pooling Gaussians from a plurality of HMM states. The Gaussian weight normalizer is for normalizing Gaussian weights with respect to the plurality of HMM states.Type: GrantFiled: November 22, 2004Date of Patent: November 4, 2008Assignee: International Business Machines CorporationInventors: Jiri Navratil, James H. Nealand, Jason W. Pelecanos, Ganesh N. Ramaswamy, Ran D. Zilca
-
Publication number: 20080235020Abstract: There is provided an apparatus for providing a Text Independent (TI) speaker recognition mode in a Text Dependent (TD) Hidden Markov Model (HMM) speaker recognition system and/or a Text Constrained (TC) HMM speaker recognition system. The apparatus includes a Gaussian Mixture Model (GMM) generator and a Gaussian weight normalizer. The GMM generator is for creating a GMM by pooling Gaussians from a plurality of HMM states. The Gaussian weight normalizer is for normalizing Gaussian weights with respect to the plurality of HMM states.Type: ApplicationFiled: June 4, 2008Publication date: September 25, 2008Inventors: Jiri Navratil, James H. Nealand, Jason W. Pelecanos, Ganesh N. Ramaswamy, Ran D. Zilca
-
Publication number: 20080222722Abstract: Methods and apparatus are provided for sequential authentication of a user that employ one or mole error rates characterizing each security challenge. According to one aspect of the invention, a user is challenged with at least one knowledge challenge to obtain an intermediate authentication result; and the user challenges continue until a cumulative authentication result satisfies one or more criteria. The intermediate authentication result is based, for example, on one or more of false accept and false reject error probabilities for each knowledge challenge. A false accept error probability describes a probability of a different user answering the knowledge challenge correctly. A false reject error probability describes a probability of a genuine user not answering the knowledge challenge correctly. The false accept and false reject error probabilities can be adapted based on field data or known information about a given challenge.Type: ApplicationFiled: March 28, 2008Publication date: September 11, 2008Applicant: International Business Machines CorporationInventors: Jiri Navratil, Ryan L. Osborn, Jason W. Pelecanos, Ganesh N. Ramaswamy, Ran D. Zilca
-
Publication number: 20080167877Abstract: A method and apparatus detect and localize electric faults in electrical power grids and circuit. High impedance faults are detected by analyzing data from remote sensor units deployed over the network using the algorithms of speech and speaker analysis software. This is accomplished by converting the voltage and/or current waveform readouts from the sensors into a digital form which is then transmitted to a computer located either near the sensors or at an operations center. The digitized data is converted by a dedicated software or software/hardware interface to a format accepted by a reliable and stable software solution, such as speech or speaker recognition software. The speech or speaker recognition software must be “trained” to recognize various signal patterns that either indicate or not the occurrence of a fault. The readout of the speech or speaker recognition software, if indicating a fault, is transmitted to a central processor and displayed to provide information on the most likely type of fault.Type: ApplicationFiled: January 5, 2007Publication date: July 10, 2008Inventors: Sarah C. McAllister, Tomasz J. Nowicki, Jason W. Pelecanos, Grzegorz M. Swirszcz