Patents by Inventor Ron Hoory
Ron Hoory has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11637927
Abstract: A method comprising: receiving an interactive voice response (IVR) tree configured to implement one or more tasks, each associated with one or more IVR node paths comprising a plurality of IVR nodes arranged in a hierarchical relationship; analyzing the IVR tree to identify one or more intent IVR nodes, each associated with one of the tasks; with respect to each of the intent IVR nodes, identifying a plurality of corresponding entity IVR nodes included within the IVR node path associated with the intent IVR node; assembling one or more task-specific chatbot skills, each comprising (i) one of the intent IVR nodes, and (ii) at least some of the plurality of corresponding entity IVR nodes, wherein each of the task-specific chatbot skills is configured to perform one of the tasks by conducting a dialog with a user; and generating a chatbot comprising at least one of the task-specific chatbot skills.
Type: Grant
Filed: July 15, 2021
Date of Patent: April 25, 2023
Assignee: International Business Machines Corporation
Inventors: Tal Drory, Aya Soffer, Ron Hoory, Aharon Satt
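The claim walks an IVR tree, marks intent nodes, gathers the entity nodes on each intent's path, and bundles them into per-task chatbot skills. Below is a minimal sketch of that assembly step; the dataclass tree layout, node kinds, and the toy "check balance" task are illustrative assumptions, not structures taken from the patent.

```python
from dataclasses import dataclass, field

@dataclass
class IVRNode:
    name: str
    kind: str                      # "intent", "entity", or "menu" (hypothetical labels)
    children: list = field(default_factory=list)

@dataclass
class ChatbotSkill:
    intent: str
    entities: list

def assemble_skills(root: IVRNode) -> list:
    """Collect one skill per intent node: the intent plus the entity nodes
    found on the IVR path beneath it."""
    skills = []

    def collect_entities(node):
        found = []
        for child in node.children:
            if child.kind == "entity":
                found.append(child.name)
            found.extend(collect_entities(child))
        return found

    def walk(node):
        if node.kind == "intent":
            skills.append(ChatbotSkill(intent=node.name,
                                       entities=collect_entities(node)))
        for child in node.children:
            walk(child)

    walk(root)
    return skills

# Toy IVR tree: the "check balance" task asks for an account number and a PIN.
tree = IVRNode("main menu", "menu", [
    IVRNode("check balance", "intent", [
        IVRNode("account number", "entity"),
        IVRNode("PIN", "entity"),
    ]),
])
print(assemble_skills(tree))
```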
-
Publication number: 20230020613
Abstract: A method comprising: receiving an interactive voice response (IVR) tree configured to implement one or more tasks, each associated with one or more IVR node paths comprising a plurality of IVR nodes arranged in a hierarchical relationship; analyzing the IVR tree to identify one or more intent IVR nodes, each associated with one of the tasks; with respect to each of the intent IVR nodes, identifying a plurality of corresponding entity IVR nodes included within the IVR node path associated with the intent IVR node; assembling one or more task-specific chatbot skills, each comprising (i) one of the intent IVR nodes, and (ii) at least some of the plurality of corresponding entity IVR nodes, wherein each of the task-specific chatbot skills is configured to perform one of the tasks by conducting a dialog with a user; and generating a chatbot comprising at least one of the task-specific chatbot skills.
Type: Application
Filed: July 15, 2021
Publication date: January 19, 2023
Inventors: Tal Drory, Aya Soffer, Ron Hoory, Aharon Satt
-
Patent number: 10509895
Abstract: A method comprising using at least one hardware processor for: providing a set of development supervectors representing features of biometric samples of multiple subjects, the biometric samples being of at least a first and a second different biometric modalities; providing at least a first and a second enrollment supervectors representing features of at least a first and a second enrollment biometric samples of a target subject correspondingly, wherein the at least first and second enrollment samples are of the at least first and the second different biometric modalities correspondingly; providing at least a first and a second verification supervectors representing features of at least a first and a second verification biometric samples of the target subject correspondingly, wherein the at least first and second verification samples are of the at least first and second different biometric modalities correspondingly; concatenating the development supervectors to a set of development generic supervector, the a
Type: Grant
Filed: March 9, 2016
Date of Patent: December 17, 2019
Assignee: International Business Machines Corporation
Inventors: Hagai Aronowitz, Amir Geva, Ron Hoory, David Nahamoo, Jason William Pelecanos, Orith Toledo-Ronen
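The claim (cut off in this listing) builds per-modality supervectors, concatenates them into generic supervectors, and compares enrollment against verification data for a target subject. A rough sketch of the concatenation step follows; the dimensions, random data, and the cosine scoring at the end are placeholders, since the abstract is truncated before it describes how the concatenated vectors are actually modeled and scored.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: modality 1 (say, voice) and modality 2 (say, face).
D1, D2, N_DEV = 64, 32, 100

# Development supervectors per modality for a pool of background subjects.
dev_m1 = rng.standard_normal((N_DEV, D1))
dev_m2 = rng.standard_normal((N_DEV, D2))

# Concatenate the per-subject supervectors into "generic" supervectors.
dev_generic = np.concatenate([dev_m1, dev_m2], axis=1)
print("development generic supervectors:", dev_generic.shape)   # (100, 96)

# Enrollment and verification supervectors for the target subject,
# built the same way by concatenating the two modalities.
enroll = np.concatenate([rng.standard_normal(D1), rng.standard_normal(D2)])
verify = np.concatenate([rng.standard_normal(D1), rng.standard_normal(D2)])

def cosine(a, b):
    """Placeholder scoring: the truncated abstract does not say how the
    concatenated vectors are modeled, so cosine similarity stands in here."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

print(f"verification score: {cosine(enroll, verify):.3f}")
```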
-
Patent number: 10276166
Abstract: A method of detecting an occurrence of splicing in a speech signal includes comparing one or more discontinuities in the test speech signal to one or more reference speech signals corresponding to the test speech signal. The method may further include calculating a frame-based spectral-like representation S_T of the speech signal, and calculating a frame-based spectral-like representation S_E of a reference speech signal corresponding to the speech signal. The method further includes aligning S_T and S_E in time and frequency, calculating a distance function associated with aligned S_T and S_E, and evaluating the distance function to determine a score. The method also includes comparing the score to a threshold to detect if splicing occurs in the speech signal.
Type: Grant
Filed: July 22, 2014
Date of Patent: April 30, 2019
Assignee: Nuance Communications, Inc.
Inventors: Zvi Kons, Ron Hoory, Hagai Aronowitz
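A short sketch of the scoring step the abstract describes: compute frame-based spectral representations of the test and reference signals, take a distance between them, and compare the result to a threshold. The alignment in time and frequency is assumed to have already happened here, and the frame size, distance measure, and threshold value are illustrative choices rather than the patent's.

```python
import numpy as np

def frame_spectra(x, frame_len=512, hop=256):
    """Frame-based log-magnitude spectra, a stand-in for the S_T / S_E
    representations named in the abstract."""
    frames = [x[i:i + frame_len] for i in range(0, len(x) - frame_len + 1, hop)]
    spectra = np.abs(np.fft.rfft(np.stack(frames) * np.hanning(frame_len), axis=1))
    return np.log(spectra + 1e-8)

def splicing_score(test, reference):
    """Mean per-frame spectral distance between test and reference.
    Assumes the signals are already time-aligned; the patent additionally
    aligns in time and frequency before taking the distance."""
    s_t, s_e = frame_spectra(test), frame_spectra(reference)
    n = min(len(s_t), len(s_e))
    return float(np.mean(np.linalg.norm(s_t[:n] - s_e[:n], axis=1)))

# Toy example: splice a chunk of a 330 Hz tone into a 220 Hz reference.
t = np.arange(16000) / 8000.0                 # 2 seconds at 8 kHz
reference = np.sin(2 * np.pi * 220 * t)
spliced = reference.copy()
spliced[4000:8000] = np.sin(2 * np.pi * 330 * t[4000:8000])

THRESHOLD = 1.0                               # illustrative value only
score = splicing_score(spliced, reference)
print(f"score={score:.2f}:", "splicing suspected" if score > THRESHOLD else "clean")
```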
-
Patent number: 10226702
Abstract: A computer-implemented method, computerized apparatus and computer program product. The method comprises capturing one or more images of a scene in which a driver is driving a vehicle; analyzing the images to retrieve an event or detail; conveying to the driver a question or a challenge related to the event or detail; receiving a response from the driver; analyzing the response; and determining a score related to the driver.
Type: Grant
Filed: May 25, 2015
Date of Patent: March 12, 2019
Assignee: International Business Machines Corporation
Inventors: Ron Hoory, Mattias Marder, Slava Shechtman
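The method is a question-and-score loop: detect an event in the scene, challenge the driver about it, and score the response. A toy sketch of that loop, with the vision and speech components stubbed out and a keyword-overlap score standing in for whatever analysis the patent actually uses:

```python
import random

def detect_event(image):
    """Stub for the image-analysis step; a real system would run a vision
    model over the captured frame."""
    return random.choice(["a red truck on the right",
                          "a pedestrian at the crossing",
                          "a speed-limit sign reading 50"])

def score_response(event, response):
    """Placeholder scoring: credit the driver for mentioning words from the
    detected event. The patent does not specify this particular measure."""
    keywords = set(event.lower().split())
    return len(keywords & set(response.lower().split())) / len(keywords)

captured_frame = "frame_0421"                      # stand-in for a camera image
event = detect_event(captured_frame)
question = "Can you describe what you just passed?"
driver_answer = "I think it was a red truck"       # would come from the driver
print(question)
print("event:", event)
print("alertness score:", round(score_response(event, driver_answer), 2))
```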
-
Publication number: 20180330713
Abstract: Text-to-speech synthesis performed by deriving from a voice dataset a sequence of speech frames corresponding to a text, wherein any of the speech frames is represented in the voice dataset by a parameterized vocal tract component, glottal pulse parameters, and an aspiration noise level, transforming the speech frames in the sequence by applying a voice transformation to any of the parameterized vocal tract component, glottal pulse parameters, and aspiration noise level representing the speech frames, wherein the voice transformation is applied in accordance with a virtual voice specification that includes at least one voice control parameter indicating a value for at least one of timbre, glottal tension and breathiness, and producing a digital audio signal of synthesized speech from the transformed sequence of speech frames.
Type: Application
Filed: May 14, 2017
Publication date: November 15, 2018
Inventors: Ron Hoory, Maria E. Smith, Alexander Sorin
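Each speech frame carries a parameterized vocal tract component, glottal pulse parameters, and an aspiration noise level, and a virtual voice specification (timbre, glottal tension, breathiness) drives a per-frame transformation. The sketch below mirrors that data layout; the simple scalings applied to each component are invented for illustration and are not the transformation defined in the publication.

```python
from dataclasses import dataclass
from typing import List

@dataclass
class SpeechFrame:
    vocal_tract: List[float]      # parameterized vocal tract component
    glottal_pulse: List[float]    # glottal pulse parameters
    aspiration_level: float       # aspiration noise level

@dataclass
class VirtualVoiceSpec:
    timbre: float = 1.0
    glottal_tension: float = 1.0
    breathiness: float = 1.0

def transform(frames, spec):
    """Apply the voice controls frame by frame. The simple scalings below
    are invented for illustration; the publication does not define the
    transformation this way."""
    return [SpeechFrame(
                vocal_tract=[v * spec.timbre for v in f.vocal_tract],
                glottal_pulse=[g * spec.glottal_tension for g in f.glottal_pulse],
                aspiration_level=f.aspiration_level * spec.breathiness)
            for f in frames]

frames = [SpeechFrame([0.12, 0.31, 0.55], [0.8, 0.4], 0.05) for _ in range(3)]
breathy = transform(frames, VirtualVoiceSpec(breathiness=2.0))
print(breathy[0])   # aspiration level doubled, other components untouched
```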
-
Publication number: 20160346695
Abstract: A computer-implemented method, computerized apparatus and computer program product. The method comprises capturing one or more images of a scene in which a driver is driving a vehicle; analyzing the images to retrieve an event or detail; conveying to the driver a question or a challenge related to the event or detail; receiving a response from the driver; analyzing the response; and determining a score related to the driver.
Type: Application
Filed: May 25, 2015
Publication date: December 1, 2016
Inventors: Ron Hoory, Mattias Marder, Slava Shechtman
-
Patent number: 9405893
Abstract: A method comprising using at least one hardware processor for: providing a set of development supervectors representing features of biometric samples of multiple subjects, the biometric samples being of at least a first and a second different biometric modalities; providing at least a first and a second enrollment supervectors representing features of at least a first and a second enrollment biometric samples of a target subject correspondingly, wherein the at least first and second enrollment samples are of the at least first and the second different biometric modalities correspondingly; providing at least a first and a second verification supervectors representing features of at least a first and a second verification biometric samples of the target subject correspondingly, wherein the at least first and second verification samples are of the at least first and second different biometric modalities correspondingly; concatenating the development supervectors to a set of development generic supervector, the a
Type: Grant
Filed: February 5, 2014
Date of Patent: August 2, 2016
Assignee: International Business Machines Corporation
Inventors: Hagai Aronowitz, Amir Geva, Ron Hoory, David Nahamoo, Jason William Pelecanos, Orith Toledo-Ronen
-
Publication number: 20160188863
Abstract: A method comprising using at least one hardware processor for: providing a set of development supervectors representing features of biometric samples of multiple subjects, the biometric samples being of at least a first and a second different biometric modalities; providing at least a first and a second enrollment supervectors representing features of at least a first and a second enrollment biometric samples of a target subject correspondingly, wherein the at least first and second enrollment samples are of the at least first and the second different biometric modalities correspondingly; providing at least a first and a second verification supervectors representing features of at least a first and a second verification biometric samples of the target subject correspondingly, wherein the at least first and second verification samples are of the at least first and second different biometric modalities correspondingly; concatenating the development supervectors to a set of development generic supervector, the a
Type: Application
Filed: March 9, 2016
Publication date: June 30, 2016
Inventors: Hagai Aronowitz, Amir Geva, Ron Hoory, David Nahamoo, Jason William Pelecanos, Orith Toledo-Ronen
-
Patent number: 9368102
Abstract: A method and system are provided for text-to-speech synthesis with personalized voice. The method includes receiving an incidental audio input (403) of speech in the form of an audio communication from an input speaker (401) and generating a voice dataset (404) for the input speaker (401). The method includes receiving a text input (411) at the same device as the audio input (403) and synthesizing (312) the text from the text input (411) to synthesized speech including using the voice dataset (404) to personalize the synthesized speech to sound like the input speaker (401). In addition, the method includes analyzing (316) the text for expression and adding the expression (315) to the synthesized speech. The audio communication may be part of a video communication (453) and the audio input (403) may have an associated visual input (455) of an image of the input speaker.
Type: Grant
Filed: October 10, 2014
Date of Patent: June 14, 2016
Assignee: Nuance Communications, Inc.
Inventors: Itzhack Goldberg, Ron Hoory, Boaz Mizrachi, Zvi Kons
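The pipeline here is: build a voice dataset from incidental audio, detect expression in the incoming text, and synthesize the text in the input speaker's voice with that expression added. A skeletal sketch with every component stubbed out; the function names and return values are hypothetical, not the patent's interfaces.

```python
def build_voice_dataset(audio_chunks):
    """Stub: accumulate speaker characteristics from incidental audio
    (e.g. phone or video calls) into a voice dataset."""
    return {"speaker": "caller_01", "n_chunks": len(audio_chunks)}

def detect_expression(text):
    """Stub: very rough expression tagging from punctuation."""
    if text.endswith("!"):
        return "excited"
    if text.endswith("?"):
        return "questioning"
    return "neutral"

def synthesize(text, voice_dataset, expression):
    """Stub standing in for the actual TTS back end."""
    return (f"[audio: '{text}' in the voice of {voice_dataset['speaker']}, "
            f"style={expression}]")

incidental_audio = ["call_2024_01.wav", "call_2024_02.wav"]
voice = build_voice_dataset(incidental_audio)
message = "Your package has arrived!"
print(synthesize(message, voice, detect_expression(message)))
```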
-
Patent number: 9330661
Abstract: Techniques disclosed herein include systems and methods for voice-enabled searching. Techniques include a co-occurrence based approach to improve accuracy of the 1-best hypothesis for non-phrase voice queries, as well as for phrased voice queries. A co-occurrence model is used in addition to a statistical natural language model and acoustic model to recognize spoken queries, such as spoken queries for searching a search engine. Given an utterance and an associated list of automated speech recognition n-best hypotheses, the system rescores the different hypotheses using co-occurrence information. For each hypothesis, the system estimates a frequency of co-occurrence within web documents. Combined scores from a speech recognizer and a co-occurrence engine can be combined to select a best hypothesis with a lower word error rate.
Type: Grant
Filed: January 16, 2014
Date of Patent: May 3, 2016
Assignee: Nuance Communications, Inc.
Inventors: Jonathan Mamou, Abhinav Sethy, Bhuvana Ramabhadran, Ron Hoory, Paul Joseph Vozila, Nathan Bodenstab
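The rescoring idea is to combine the recognizer's score for each n-best hypothesis with a co-occurrence score estimated from web documents and pick the best combined hypothesis. A toy sketch follows; the tiny co-occurrence table and the log-linear interpolation weight are stand-ins for the web-frequency estimates and the combination actually used.

```python
import math
from itertools import combinations

# Toy stand-in for "frequency of co-occurrence within web documents".
CO_OCCURRENCE = {
    frozenset(["boston", "weather"]): 120_000,
    frozenset(["bostons", "whether"]): 40,
}

def cooccurrence_score(hypothesis):
    words = hypothesis.lower().split()
    return sum(CO_OCCURRENCE.get(frozenset(pair), 1)
               for pair in combinations(words, 2))

def rescore(nbest, weight=0.7):
    """Combine the recognizer score with a co-occurrence score and return the
    best hypothesis. The log-linear interpolation and its weight are
    illustrative choices, not the combination used in the patent."""
    rescored = [(text, weight * asr + (1 - weight) * math.log(cooccurrence_score(text)))
                for text, asr in nbest]
    return max(rescored, key=lambda pair: pair[1])

nbest = [("bostons whether today", -4.1),   # recognizer's 1-best
         ("boston weather today", -4.3)]
print("selected:", rescore(nbest))
```

In this toy run the co-occurrence evidence flips the decision toward the lower-ranked but better-formed hypothesis, which is the effect the abstract describes.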
-
Publication number: 20160027444
Abstract: A method of detecting an occurrence of splicing in a speech signal includes comparing one or more discontinuities in the test speech signal to one or more reference speech signals corresponding to the test speech signal. The method may further include calculating a frame-based spectral-like representation S_T of the speech signal, and calculating a frame-based spectral-like representation S_E of a reference speech signal corresponding to the speech signal. The method further includes aligning S_T and S_E in time and frequency, calculating a distance function associated with aligned S_T and S_E, and evaluating the distance function to determine a score. The method also includes comparing the score to a threshold to detect if splicing occurs in the speech signal.
Type: Application
Filed: July 22, 2014
Publication date: January 28, 2016
Inventors: Zvi Kons, Ron Hoory, Hagai Aronowitz
-
Patent number: 9105272
Abstract: Methods, apparatus and computer program products implement embodiments of the present invention that include receiving a time domain voice signal, and extracting a single pitch cycle from the received signal. The extracted single pitch cycle is transformed to a frequency domain, and the misclassified roots of the frequency domain are identified and corrected. Using the corrected roots, an indication of a maximum phase of the frequency domain is generated.
Type: Grant
Filed: June 4, 2012
Date of Patent: August 11, 2015
Assignees: The Lithuanian University of Health Sciences, International Business Machines Corporation
Inventors: Aharon Satt, Zvi Kons, Ron Hoory, Virgilijus Ulozas
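The processing chain is: extract a single pitch cycle, move to the frequency domain, correct misclassified roots, and report the maximum-phase part. The sketch below shows only an uncorrected baseline, splitting the polynomial roots of one pitch cycle by their magnitude relative to the unit circle; the root-correction step that is the subject of the patent is not reproduced.

```python
import numpy as np

def extract_pitch_cycle(signal, period):
    """Naive single-cycle extraction: slice one period from the start.
    A real system would first locate glottal closure instants."""
    return signal[:period]

def split_roots(cycle):
    """Treat the pitch cycle samples as polynomial coefficients and split the
    roots by magnitude: roots outside the unit circle give the maximum-phase
    (glottal) part, roots inside give the minimum-phase part. The correction
    of misclassified roots is not reproduced here."""
    roots = np.roots(cycle)
    return roots[np.abs(roots) <= 1.0], roots[np.abs(roots) > 1.0]

t = np.arange(80) / 8000.0                          # one 10 ms cycle at 8 kHz
cycle = extract_pitch_cycle(np.sin(2 * np.pi * 100 * t) * np.hanning(80), 80)
inside, outside = split_roots(cycle)
print(f"{len(inside)} minimum-phase roots, {len(outside)} maximum-phase roots")
```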
-
Publication number: 20150220716
Abstract: A method comprising using at least one hardware processor for: providing a set of development supervectors representing features of biometric samples of multiple subjects, the biometric samples being of at least a first and a second different biometric modalities; providing at least a first and a second enrollment supervectors representing features of at least a first and a second enrollment biometric samples of a target subject correspondingly, wherein the at least first and second enrollment samples are of the at least first and the second different biometric modalities correspondingly; providing at least a first and a second verification supervectors representing features of at least a first and a second verification biometric samples of the target subject correspondingly, wherein the at least first and second verification samples are of the at least first and second different biometric modalities correspondingly; concatenating the development supervectors to a set of development generic supervector, the a
Type: Application
Filed: February 5, 2014
Publication date: August 6, 2015
Applicant: International Business Machines Corporation
Inventors: Hagai Aronowitz, Amir Geva, Ron Hoory, David Nahamoo, Jason William Pelecanos, Orith Toledo-Ronen
-
Publication number: 20150025891
Abstract: A method and system are provided for text-to-speech synthesis with personalized voice. The method includes receiving an incidental audio input (403) of speech in the form of an audio communication from an input speaker (401) and generating a voice dataset (404) for the input speaker (401). The method includes receiving a text input (411) at the same device as the audio input (403) and synthesizing (312) the text from the text input (411) to synthesized speech including using the voice dataset (404) to personalize the synthesized speech to sound like the input speaker (401). In addition, the method includes analyzing (316) the text for expression and adding the expression (315) to the synthesized speech. The audio communication may be part of a video communication (453) and the audio input (403) may have an associated visual input (455) of an image of the input speaker.
Type: Application
Filed: October 10, 2014
Publication date: January 22, 2015
Applicant: Nuance Communications, Inc.
Inventors: Itzhack Goldberg, Ron Hoory, Boaz Mizrachi, Zvi Kons
-
Patent number: 8930182
Abstract: Method, system, and computer program product for voice transformation are provided. The method includes transforming a source speech using transformation parameters, and encoding information on the transformation parameters in an output speech using steganography, wherein the source speech can be reconstructed using the output speech and the information on the transformation parameters. A method for reconstructing voice transformation is also provided including: receiving an output speech of a voice transformation system wherein the output speech is transformed speech which has encoded information on the transformation parameters using steganography; extracting the information on the transformation parameters; and carrying out an inverse transformation of the output speech to obtain an approximation of an original source speech.
Type: Grant
Filed: March 17, 2011
Date of Patent: January 6, 2015
Assignee: International Business Machines Corporation
Inventors: Shay Ben-David, Ron Hoory, Zvi Kons, David Nahamoo
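The idea is that the transformation parameters themselves ride inside the transformed speech, so a receiver can extract them and approximately undo the transformation. In the sketch below, LSB embedding of a JSON parameter block stands in for "steganography", and a simple gain change stands in for the voice transformation; both are illustrative assumptions rather than the patent's techniques.

```python
import json
import numpy as np

def embed_params(samples, params):
    """Hide a JSON-serialized parameter block in the least-significant bits of
    the output speech; LSB embedding is an illustrative stand-in for the
    steganography named in the claim."""
    payload = np.frombuffer(json.dumps(params).encode(), dtype=np.uint8)
    bits = np.unpackbits(payload)
    out = samples.copy()
    out[:len(bits)] = (out[:len(bits)] & ~1) | bits
    return out

def extract_params(samples, n_bytes):
    bits = (samples[:n_bytes * 8] & 1).astype(np.uint8)
    return json.loads(np.packbits(bits).tobytes().decode())

params = {"gain": 1.2}                              # hypothetical transformation parameters
speech = np.random.default_rng(0).integers(-2000, 2000, 16000).astype(np.int16)
transformed = (speech * params["gain"]).astype(np.int16)   # crude stand-in transform

carrier = embed_params(transformed, params)                  # output speech + hidden params
recovered = extract_params(carrier, len(json.dumps(params)))
restored = (carrier / recovered["gain"]).astype(np.int16)    # approximate inverse
print(recovered, "max restore error:", int(np.abs(restored - speech).max()))
```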
-
Patent number: 8886537
Abstract: A method and system are provided for text-to-speech synthesis with personalized voice. The method includes receiving an incidental audio input (403) of speech in the form of an audio communication from an input speaker (401) and generating a voice dataset (404) for the input speaker (401). The method includes receiving a text input (411) at the same device as the audio input (403) and synthesizing (312) the text from the text input (411) to synthesized speech including using the voice dataset (404) to personalize the synthesized speech to sound like the input speaker (401). In addition, the method includes analyzing (316) the text for expression and adding the expression (315) to the synthesized speech. The audio communication may be part of a video communication (453) and the audio input (403) may have an associated visual input (455) of an image of the input speaker.
Type: Grant
Filed: March 20, 2007
Date of Patent: November 11, 2014
Assignee: Nuance Communications, Inc.
Inventors: Itzhack Goldberg, Ron Hoory, Boaz Mizrachi, Zvi Kons
-
Patent number: 8786659
Abstract: A method for responding to media conference deficiencies, the method includes: monitoring, by at least one receiver, a quality of media conference signals being received by at least one receiver during the media conference; sending, in response to the monitoring, to at least an end user transmitter that transmitted the media conference signals, a quality indication representative of a quality of the received media conference signals; recording inadequately received media conference signals that were inadequately received by a certain end user receiver and participating in an activity related to a transmission, to the certain end user receiver, of the inadequately received media conference signals or of a representation of the inadequately received media conference signals.
Type: Grant
Filed: May 29, 2012
Date of Patent: July 22, 2014
Assignee: International Business Machines Corporation
Inventors: Ron Hoory, Michael Rodeh, Slava Shechtman
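The flow is receiver-side: monitor the quality of incoming conference signals, report it back to the transmitter, and record inadequately received segments so they can later be retransmitted or otherwise recovered. A schematic sketch with made-up class and method names, not an interface from the patent:

```python
from collections import deque

QUALITY_THRESHOLD = 0.6                  # illustrative threshold

class Receiver:
    def __init__(self, name):
        self.name = name
        self.inadequate = deque()        # segments recorded for later recovery

    def on_segment(self, segment_id, quality, transmitter):
        """Monitor the received quality, report it back to the transmitter,
        and record segments that arrived inadequately."""
        transmitter.quality_report(self.name, segment_id, quality)
        if quality < QUALITY_THRESHOLD:
            self.inadequate.append(segment_id)

class Transmitter:
    def quality_report(self, receiver, segment_id, quality):
        print(f"{receiver}: segment {segment_id} quality={quality:.2f}")

    def retransmit(self, segment_ids):
        print("retransmitting segments:", list(segment_ids))

tx, rx = Transmitter(), Receiver("receiver-A")
for seg, quality in [(1, 0.9), (2, 0.4), (3, 0.8), (4, 0.3)]:
    rx.on_segment(seg, quality, tx)
tx.retransmit(rx.inadequate)             # recover the inadequately received segments
```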
-
Publication number: 20140136197
Abstract: Techniques disclosed herein include systems and methods for voice-enabled searching. Techniques include a co-occurrence based approach to improve accuracy of the 1-best hypothesis for non-phrase voice queries, as well as for phrased voice queries. A co-occurrence model is used in addition to a statistical natural language model and acoustic model to recognize spoken queries, such as spoken queries for searching a search engine. Given an utterance and an associated list of automated speech recognition n-best hypotheses, the system rescores the different hypotheses using co-occurrence information. For each hypothesis, the system estimates a frequency of co-occurrence within web documents. Combined scores from a speech recognizer and a co-occurrence engine can be combined to select a best hypothesis with a lower word error rate.
Type: Application
Filed: January 16, 2014
Publication date: May 15, 2014
Inventors: Jonathan Mamou, Abhinav Sethy, Bhuvana Ramabhadran, Ron Hoory, Paul Joseph Vozila, Nathan Bodenstab
-
Patent number: 8650031
Abstract: Techniques disclosed herein include systems and methods for voice-enabled searching. Techniques include a co-occurrence based approach to improve accuracy of the 1-best hypothesis for non-phrase voice queries, as well as for phrased voice queries. A co-occurrence model is used in addition to a statistical natural language model and acoustic model to recognize spoken queries, such as spoken queries for searching a search engine. Given an utterance and an associated list of automated speech recognition n-best hypotheses, the system rescores the different hypotheses using co-occurrence information. For each hypothesis, the system estimates a frequency of co-occurrence within web documents. Combined scores from a speech recognizer and a co-occurrence engine can be combined to select a best hypothesis with a lower word error rate.
Type: Grant
Filed: July 31, 2011
Date of Patent: February 11, 2014
Assignee: Nuance Communications, Inc.
Inventors: Jonathan Mamou, Abhinav Sethy, Bhuvana Ramabhadran, Ron Hoory, Paul Joseph Vozila, Nathan Bodenstab