Patents by Inventor Ron Hoory

Ron Hoory has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Accuracy improvement of spoken queries transcription using co-occurrence information

Patent number: 8650031

Abstract: Techniques disclosed herein include systems and methods for voice-enabled searching. Techniques include a co-occurrence based approach to improve accuracy of the 1-best hypothesis for non-phrase voice queries, as well as for phrased voice queries. A co-occurrence model is used in addition to a statistical natural language model and acoustic model to recognize spoken queries, such as spoken queries for searching a search engine. Given an utterance and an associated list of automated speech recognition n-best hypotheses, the system rescores the different hypotheses using co-occurrence information. For each hypothesis, the system estimates a frequency of co-occurrence within web documents. Combined scores from a speech recognizer and a co-occurrence engine can be combined to select a best hypothesis with a lower word error rate.

Type: Grant

Filed: July 31, 2011

Date of Patent: February 11, 2014

Assignee: Nuance Communications, Inc.

Inventors: Jonathan Mamou, Abhinav Sethy, Bhuvana Ramabhadran, Ron Hoory, Paul Joseph Vozila, Nathan Bodenstab
VOCAL SOURCE EXTRACTION BY MAXIMUM PHASE DETECTION

Publication number: 20130325455

Abstract: Methods, apparatus and computer program products implement embodiments of the present invention that include receiving a time domain voice signal, and extracting a single pitch cycle from the received signal. The extracted single pitch cycle is transformed to a frequency domain, and the misclassified roots of the frequency domain are identified and corrected. Using the corrected roots, an indication of a maximum phase of the frequency domain is generated.

Type: Application

Filed: June 4, 2012

Publication date: December 5, 2013

Applicants: INTERNATIONAL BUSINESS MACHINES CORPORATION, UZDAROJI AKCINÊ BENDROVÊ LIETUVOS TYRIMU CENTRAS

Inventors: Aharon Satt, Zvi Kons, Ron Hoory
Skipping radio/television program segments

Patent number: 8473294

Abstract: Techniques for notifying at least one entity of an occurrence of an event in an audio signal are provided. At least one preference is obtained from the at least one entity. An occurrence of an event in the audio signal is determined. The event is related to at least one of at least one speaker and at least one topic. The at least one entity is notified of the occurrence of the event in the audio signal, in accordance with the at least one preference.

Type: Grant

Filed: March 30, 2012

Date of Patent: June 25, 2013

Assignee: International Business Machines Corporation

Inventors: Hagai Aronowitz, Itzhack Goldberg, Ron Hoory
Distributed off-line voice services

Patent number: 8451823

Abstract: A voice processing system includes a real-time voice server, which is arranged to process real-time voice processing tasks for clients of the system. A gateway processor is arranged to accept from a client a request to perform an off-line voice processing task, to convert the off-line voice processing task into an equivalent real-time voice processing task, to invoke the voice server to process the equivalent real-time voice processing task, and to output a result of the equivalent real-time voice processing task.

Type: Grant

Filed: December 13, 2005

Date of Patent: May 28, 2013

Assignee: Nuance Communications, Inc.

Inventors: Shay Ben-David, Ron Hoory, Alexey Roytman, Zohar Sivan, James Jude Sliwa
Rating speech naturalness of speech utterances based on a plurality of human testers

Patent number: 8447603

Abstract: A method that includes: generating an utterance-specific scoring model for each one of a plurality of obtained speech utterances, each scoring model usable to estimate a level of speech naturalness for a respective one of the obtained speech utterances; presenting a plurality of human-testers with some of the obtained speech utterances; receiving, for each presented speech utterance, a plurality of human tester generated speech utterances being at least one human repetition of the presented speech utterance; updating the scoring model for each presented speech utterance, based on respective human-tester generated speech utterances; and obtaining a speech naturalness score for each presented speech utterance by respectively applying the updated utterance-specific scoring model to each presented speech utterance.

Type: Grant

Filed: December 16, 2009

Date of Patent: May 21, 2013

Assignee: International Business Machines Corporation

Inventors: Ron Hoory, Slava Shechtman
Speech synthesis using complex spectral modeling

Patent number: 8280724

Abstract: A method for processing a speech signal includes dividing the speech signal into a succession of frames, identifying one or more of the frames as click frames, and extracting phase information from the click frames. The speech signal is encoded using the phase information. Methods are also provided for modeling phase spectra of voiced frames and click frames.

Type: Grant

Filed: January 31, 2005

Date of Patent: October 2, 2012

Assignee: Nuance Communications, Inc.

Inventors: Dan Chazan, Ron Hoory, Zvi Kons, Slava Shechtman, Alexander Sorin
Device, Method and Computer Program Product for Responding to Media Conference Deficiencies

Publication number: 20120239746

Abstract: A method for responding to media conference deficiencies, the method includes: monitoring, by at least one receiver, a quality of media conference signals being received by at least one receiver during the media conference; sending, in response to the monitoring, to at least an end user transmitter that transmitted the media conference signals, a quality indication representative of a quality of the received media conference signals; recording inadequately received media conference signals that were inadequately received by a certain end user receiver and participating in an activity related to a transmission, to the certain end user receiver, of the inadequately received media conference signals or of a representation of the inadequately received media conference signals.

Type: Application

Filed: May 29, 2012

Publication date: September 20, 2012

Applicant: International Business Machines Corporation

Inventors: Ron Hoory, Michael Rodeh, Slava Shechtman
VOICE TRANSFORMATION WITH ENCODED INFORMATION

Publication number: 20120239387

Abstract: Method, system, and computer program product for voice transformation are provided. The method includes transforming a source speech using transformation parameters, and encoding information on the transformation parameters in an output speech using steganography, wherein the source speech can be reconstructed using the output speech and the information on the transformation parameters. A method for reconstructing voice transformation is also provided including: receiving an output speech of a voice transformation system wherein the output speech is transformed speech which has encoded information on the transformation parameters using steganography; extracting the information on the transformation parameters; and carrying out an inverse transformation of the output speech to obtain an approximation of an original source speech.

Type: Application

Filed: March 17, 2011

Publication date: September 20, 2012

Applicant: International Business Corporation

Inventors: Shay Ben-David, Ron Hoory, Zvi Kons, David Nahamoo
Skipping radio/television program segments

Patent number: 8249872

Abstract: Techniques for notifying at least one entity of an occurrence of an event in an audio signal are provided. At least one preference is obtained from the at least one entity. An occurrence of an event in the audio signal is determined. The event is related to at least one of at least one speaker and at least one topic. The at least one entity is notified of the occurrence of the event in the audio signal, in accordance with the at least one preference.

Type: Grant

Filed: August 18, 2008

Date of Patent: August 21, 2012

Assignee: International Business Machines Corporation

Inventors: Hagai Aronowitz, Itzhack Goldberg, Ron Hoory
SKIPPING RADIO/TELEVISION PROGRAM SEGMENTS

Publication number: 20120191459

Abstract: Techniques for notifying at least one entity of an occurrence of an event in an audio signal are provided. At least one preference is obtained from the at least one entity. An occurrence of an event in the audio signal is determined. The event is related to at least one of at least one speaker and at least one topic. The at least one entity is notified of the occurrence of the event in the audio signal, in accordance with the at least one preference.

Type: Application

Filed: March 30, 2012

Publication date: July 26, 2012

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Hagai Aronowitz, Itzhack Goldberg, Ron Hoory
Device, method and computer program product for responding to media conference deficiencies

Patent number: 8228359

Abstract: A method for responding to media conference deficiencies, the method includes: monitoring, by at least one receiver, a quality of media conference signals being received by at least one receiver during the media conference; sending, in response to the monitoring, to at least an end user transmitter that transmitted the media conference signals, a quality indication representative of a quality of the received media conference signals; recording inadequately received media conference signals that were inadequately received by a certain end user receiver and participating in an activity related to a transmission, to the certain end user receiver, of the inadequately received media conference signals or of a representation of the inadequately received media conference signals.

Type: Grant

Filed: January 8, 2008

Date of Patent: July 24, 2012

Assignee: International Business Machines Corporation

Inventors: Ron Hoory, Michael Rodeh, Slava Shechtman
Parallel visual radio station selection

Patent number: 8196046

Abstract: A computer implemented method in a data processing system and a computer program product enable visual selection of a media signal. A set of media signals is received from a set of media providers. A subject matter and a performer of the subject matter are then identified for at least one of the set of media signals. A set of icons is then identified. Each of the set of icons corresponds to at least one of media signals. The set of icons and the set of media providers are then forwarded to a client media player.

Type: Grant

Filed: August 1, 2008

Date of Patent: June 5, 2012

Assignee: International Business Machines Corporation

Inventors: Barbara Finkelstein, Itzhack Goldberg, Ron Hoory, Boaz Mizrachi
SPEECH OUTPUT WITH CONFIDENCE INDICATION

Publication number: 20110313762

Abstract: A method, system, and computer program product are provided for speech output with confidence indication. The method includes receiving a confidence score for segments of speech or text to be synthesized to speech. The method includes modifying a speech segment by altering one or more parameters of the speech proportionally to the confidence score.

Type: Application

Filed: June 20, 2010

Publication date: December 22, 2011

Applicant: International Business Machines Corporation

Inventors: Shay Ben-David, Ron Hoory
RATING SPEECH NATURALNESS OF SPEECH UTTERANCES BASED ON A PLURALITY OF HUMAN TESTERS

Publication number: 20110144990

Abstract: A method that includes: generating an utterance-specific scoring model for each one of a plurality of obtained speech utterances, each scoring model usable to estimate a level of speech naturalness for a respective one of the obtained speech utterances; presenting a plurality of human-testers with some of the obtained speech utterances; receiving, for each presented speech utterance, a plurality of human tester generated speech utterances being at least one human repetition of the presented speech utterance; updating the scoring model for each presented speech utterance, based on respective human-tester generated speech utterances; and obtaining a speech naturalness score for each presented speech utterance by respectively applying the updated utterance-specific scoring model to each presented speech utterance.

Type: Application

Filed: December 16, 2009

Publication date: June 16, 2011

Applicant: International Business Machines Corporation

Inventors: Ron Hoory, Slava Shechtman
SKIPPING RADIO/TELEVISION PROGRAM SEGMENTS

Publication number: 20100042412

Abstract: Techniques for notifying at least one entity of an occurrence of an event in an audio signal are provided. At least one preference is obtained from the at least one entity. An occurrence of an event in the audio signal is determined. The event is related to at least one of at least one speaker and at least one topic. The at least one entity is notified of the occurrence of the event in the audio signal, in accordance with the at least one preference.

Type: Application

Filed: August 18, 2008

Publication date: February 18, 2010

Inventors: Hagai Aronowitz, Itzhack Goldberg, Ron Hoory
Parallel Visual Radio Station Selection

Publication number: 20100031146

Abstract: A computer implemented method in a data processing system and a computer program product enable visual selection of a media signal. A set of media signals is received from a set of media providers. A subject matter and a performer of the subject matter are then identified for at least one of the set of media signals. A set of icons is then identified. Each of the set of icons corresponds to at least one of media signals. The set of icons and the set of media providers are then forwarded to a client media player.

Type: Application

Filed: August 1, 2008

Publication date: February 4, 2010

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Barbara Finkelstein, Itzhack Goldberg, Ron Hoory, Boaz Mizrachi
Seamless hybrid computer human call service

Patent number: 7565293

Abstract: A Voice User Interface is provided for interactively responding in a synthesized voice to a call from a human caller, a Text to Speech system by which text entered by an agent and interactive data are converted to synthesized speech, a morphing transformation library containing pre-computed voice transformation parameters unique to each agent affiliated with the VUI, and a switching system for transferring handling of the call between the VUI and the agent. The human agent's verbal interaction with the caller is performed in the agent's natural voice. Text transmitted by an agent to a caller and interactive data is in a synthesized voice created using the pre-computed transformation parameters corresponding to the agent's ID selected from the morphing transformation library. All speech presented to a caller is presented in approximately the same unique voice as initially presented when the call is established, thereby permitting an aurally seamless phone call, as perceived by the caller.

Type: Grant

Filed: May 7, 2008

Date of Patent: July 21, 2009

Assignee: International Business Machines Corporation

Inventors: Oded Fuhrmann, Ron Hoory, Dan Pelleg
Device, Method and Computer Program Product for Responding to Media Conference Deficiencies

Publication number: 20090174761

Abstract: A method for responding to media conference deficiencies, the method includes: monitoring, by at least one receiver, a quality of media conference signals being received by at least one receiver during the media conference; sending, in response to the monitoring, to at least an end user transmitter that transmitted the media conference signals, a quality indication representative of a quality of the received media conference signals; recording inadequately received media conference signals that were inadequately received by a certain end user receiver and participating in an activity related to a transmission, to the certain end user receiver, of the inadequately received media conference signals or of a representation of the inadequately received media conference signals.

Type: Application

Filed: January 8, 2008

Publication date: July 9, 2009

Inventors: Ron HOORY, Michael RODEH, Slava SHECHTMAN
METHOD AND SYSTEM FOR TEXT-TO-SPEECH SYNTHESIS WITH PERSONALIZED VOICE

Publication number: 20080235024

Abstract: A method and system are provided for text-to-speech synthesis with personalized voice. The method includes receiving an incidental audio input (403) of speech in the form of an audio communication from an input speaker (401) and generating a voice dataset (404) for the input speaker (401). The method includes receiving a text input (411) at the same device as the audio input (403) and synthesizing (312) the text from the text input (411) to synthesized speech including using the voice dataset (404) to personalize the synthesized speech to sound like the input speaker (401). In addition, the method includes analyzing (316) the text for expression and adding the expression (315) to the synthesized speech. The audio communication may be part of a video communication (453) and the audio input (403) may have an associated visual input (455) of an image of the input speaker.

Type: Application

Filed: March 20, 2007

Publication date: September 25, 2008

Inventors: Itzhack Goldberg, Ron Hoory, Boaz Mizrachi, Zvi Kons
Distributed off-line voice services

Publication number: 20070133518

Abstract: A voice processing system includes a real-time voice server, which is arranged to process real-time voice processing tasks for clients of the system. A gateway processor is arranged to accept from a client a request to perform an off-line voice processing task, to convert the off-line voice processing task into an equivalent real-time voice processing task, to invoke the voice server to process the equivalent real-time voice processing task, and to output a result of the equivalent real-time voice processing task.

Type: Application

Filed: December 13, 2005

Publication date: June 14, 2007

Applicant: International Business Machines Corporation

Inventors: Shay Ben-David, Ron Hoory, Alexey Roytman, Zohar Sivan, James Sliwa

prev 1 2 3 next