Patents by Inventor David Nahamoo

David Nahamoo has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Voice transformation with encoded information

Patent number: 8930182

Abstract: Method, system, and computer program product for voice transformation are provided. The method includes transforming a source speech using transformation parameters, and encoding information on the transformation parameters in an output speech using steganography, wherein the source speech can be reconstructed using the output speech and the information on the transformation parameters. A method for reconstructing voice transformation is also provided including: receiving an output speech of a voice transformation system wherein the output speech is transformed speech which has encoded information on the transformation parameters using steganography; extracting the information on the transformation parameters; and carrying out an inverse transformation of the output speech to obtain an approximation of an original source speech.

Type: Grant

Filed: March 17, 2011

Date of Patent: January 6, 2015

Assignee: International Business Machines Corporation

Inventors: Shay Ben-David, Ron Hoory, Zvi Kons, David Nahamoo
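The encode-and-reconstruct idea in the abstract above can be illustrated with a minimal sketch. It assumes 16-bit PCM samples held as Python integers, a toy "transformation" that multiplies each sample by an integer gain, and least-significant-bit embedding as the steganographic channel. None of these choices come from the patent; the function names (`embed_bits`, `extract_bits`, `transform`, `encode`, `reconstruct`) are hypothetical.

```python
def embed_bits(samples, payload):
    """Hide payload bytes in the least significant bits of the first samples."""
    bits = [(byte >> i) & 1 for byte in payload for i in range(8)]
    if len(bits) > len(samples):
        raise ValueError("payload too large for this signal")
    out = list(samples)
    for i, bit in enumerate(bits):
        out[i] = (out[i] & ~1) | bit  # clear the LSB, then set it to the payload bit
    return out


def extract_bits(samples, n_bytes):
    """Read back n_bytes that were embedded by embed_bits."""
    payload = bytearray()
    for b in range(n_bytes):
        byte = 0
        for i in range(8):
            byte |= (samples[b * 8 + i] & 1) << i
        payload.append(byte)
    return bytes(payload)


def transform(source, gain):
    """Toy stand-in for a voice transformation: scale by an integer gain."""
    return [s * gain for s in source]


def encode(source, gain):
    """Transform the source and hide the gain (assumed 1..255) in the output."""
    return embed_bits(transform(source, gain), bytes([gain]))


def reconstruct(output):
    """Extract the hidden gain and invert the toy transformation."""
    gain = extract_bits(output, 1)[0]
    return [round(s / gain) for s in output]


source = [100, -200, 300, -400, 500, -600, 700, -800, 900]
output = encode(source, 3)
approximation = reconstruct(output)
```

Because embedding perturbs the carrier samples, the inverse generally yields an approximation of the source rather than an exact copy, which matches the wording of the abstract. A real system would use a far more robust hiding scheme than raw LSB substitution.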
Text processing using natural language understanding

Patent number: 8924210

Abstract: Techniques for converting spoken speech into written speech are provided. The techniques include transcribing input speech via speech recognition, mapping each spoken utterance from input speech into a corresponding formal utterance, and mapping each formal utterance into a stylistically formatted written utterance.

Type: Grant

Filed: May 28, 2014

Date of Patent: December 30, 2014

Assignee: Nuance Communications, Inc.

Inventors: Sara H. Basson, Rick Hamilton, Dan Ning Jiang, Dimitri Kanevsky, David Nahamoo, Michael Picheny, Bhuvana Ramabhadran, Tara N. Sainath
Voice print tagging of interactive voice response sessions

Patent number: 8903052

Abstract: Embodiments of the invention provide a method, system and computer program product for voice print tagging for interactive voice response (IVR) session management. In an embodiment of the invention, a method of voiceprint tagging for IVR session management is provided. The method includes establishing an IVR session for a caller from over a network and presenting a portion of the IVR session to the caller over the network. The method also includes storing a voiceprint tag in memory associating a voiceprint of the caller with a portion of the IVR session. Finally, the method includes responding to a premature termination of the IVR session by re-establishing the prematurely terminated IVR session with the caller at the portion of the IVR session indicated by the voiceprint tag of the caller.

Type: Grant

Filed: March 15, 2013

Date of Patent: December 2, 2014

Assignee: International Business Machines Corporation

Inventors: Victor S. Moore, David Nahamoo, Wendi L. Nusbickel, Christopher J. Vavra
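The resume-after-disconnect behavior described in the abstract above can be sketched as follows. This is an illustrative sketch only: it assumes some upstream speaker-recognition component has already reduced the caller's voiceprint to a stable identifier string, and the class and method names (`IvrSessionManager`, `establish`, `present`, `finish`) are hypothetical rather than taken from the patent.

```python
class IvrSessionManager:
    """Tracks, per voiceprint, the last portion of the IVR session presented."""

    def __init__(self, steps):
        self.steps = list(steps)
        self.tags = {}  # voiceprint id -> index of the last presented step

    def establish(self, voiceprint_id):
        """Return the step index to start from: the tagged step, or 0 if none."""
        return self.tags.get(voiceprint_id, 0)

    def present(self, voiceprint_id, index):
        """Present a step to the caller and store the voiceprint tag for it."""
        self.tags[voiceprint_id] = index
        return self.steps[index]

    def finish(self, voiceprint_id):
        """Discard the tag once the session completes normally."""
        self.tags.pop(voiceprint_id, None)


manager = IvrSessionManager(["greeting", "account", "payment", "confirm"])
start = manager.establish("vp-1")      # new caller, starts at step 0
for index in range(start, 3):          # caller reaches "payment", then the call drops
    manager.present("vp-1", index)
resume_at = manager.establish("vp-1")  # same voiceprint calls back
```

In this sketch the returning caller is placed back at the "payment" step instead of the greeting. Tag expiry and voiceprint matching thresholds are left out.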
Automatically updating meeting information

Patent number: 8867707

Abstract: Techniques for automatically providing updated meeting information are provided. The techniques include facilitating receipt of a message pertaining to a meeting, automatically interpreting the message to determine if the message requires that meeting information be changed, automatically updating the meeting information if a change is required from the message, and automatically sending a message to each meeting participant informing each participant of the updated meeting information.

Type: Grant

Filed: March 23, 2011

Date of Patent: October 21, 2014

Assignee: International Business Machines Corporation

Inventors: Lior Horesh, Dimitri Kanevsky, David Nahamoo, Tara N. Sainath
Distance metrics for universal pattern processing tasks

Patent number: 8856002

Abstract: A universal pattern processing system receives input data and produces output patterns that are best associated with said data. The system uses input means receiving and processing input data, a universal pattern decoder means transforming models using the input data and associating output patterns with original models that are changed least during transforming, and output means outputting best associated patterns chosen by a pattern decoder means.

Type: Grant

Filed: April 11, 2008

Date of Patent: October 7, 2014

Assignee: International Business Machines Corporation

Inventors: Dimitri Kanevsky, David Nahamoo, Tara N. Sainath
Text processing using natural language understanding

Patent number: 8856004

Abstract: Techniques for converting spoken speech into written speech are provided. The techniques include transcribing input speech via speech recognition, mapping each spoken utterance from input speech into a corresponding formal utterance, and mapping each formal utterance into a stylistically formatted written utterance.

Type: Grant

Filed: May 13, 2011

Date of Patent: October 7, 2014

Assignee: Nuance Communications, Inc.

Inventors: Sara H. Basson, Rick Hamilton, Dan Ning Jiang, Dimitri Kanevsky, David Nahamoo, Michael Picheny, Bhuvana Ramabhadran, Tara N. Sainath
VOICE PRINT TAGGING OF INTERACTIVE VOICE RESPONSE SESSIONS

Publication number: 20140270112

Abstract: Embodiments of the invention provide a method, system and computer program product for voice print tagging for interactive voice response (IVR) session management. In an embodiment of the invention, a method of voiceprint tagging for IVR session management is provided. The method includes establishing an IVR session for a caller from over a network and presenting a portion of the IVR session to the caller over the network. The method also includes storing a voiceprint tag in memory associating a voiceprint of the caller with a portion of the IVR session. Finally, the method includes responding to a premature termination of the IVR session by re-establishing the prematurely terminated IVR session with the caller at the portion of the IVR session indicated by the voiceprint tag of the caller.

Type: Application

Filed: March 15, 2013

Publication date: September 18, 2014

Applicant: International Business Machines Corporation

Inventors: Victor S. Moore, David Nahamoo, Wendi L. Nusbickel, Christopher J. Vavra
VOICE PRINT TAGGING OF INTERACTIVE VOICE RESPONSE SESSIONS

Publication number: 20140270113

Abstract: Embodiments of the invention provide a method, system and computer program product for voice print tagging for interactive voice response (IVR) session management. In an embodiment of the invention, a method of voiceprint tagging for IVR session management is provided. The method includes establishing an IVR session for a caller from over a network and presenting a portion of the IVR session to the caller over the network. The method also includes storing a voiceprint tag in memory associating a voiceprint of the caller with a portion of the IVR session. Finally, the method includes responding to a premature termination of the IVR session by re-establishing the prematurely terminated IVR session with the caller at the portion of the IVR session indicated by the voiceprint tag of the caller.

Type: Application

Filed: October 22, 2013

Publication date: September 18, 2014

Applicant: International Business Machines Corporation

Inventors: Victor S. Moore, David Nahamoo, Wendi L. Nusbickel, Christopher J. Vavra
TEXT PROCESSING USING NATURAL LANGUAGE UNDERSTANDING

Publication number: 20140278410

Abstract: Techniques for converting spoken speech into written speech are provided. The techniques include transcribing input speech via speech recognition, mapping each spoken utterance from input speech into a corresponding formal utterance, and mapping each formal utterance into a stylistically formatted written utterance.

Type: Application

Filed: May 28, 2014

Publication date: September 18, 2014

Applicant: Nuance Communications, Inc.

Inventors: Sara H. Basson, Rick Hamilton, Dan Ning Jiang, Dimitri Kanevsky, David Nahamoo, Michael Picheny, Bhuvana Ramabhadran, Tara N. Sainath
HAPTIC ACCESSORY AND METHODS FOR USING SAME

Publication number: 20140167939

Abstract: A method for providing tactile feedback comprises displaying a visual representation of a physical object having at least one haptic property, generating time-varying data associated with the at least one haptic property from the visual representation, sending the time-varying data to a computing device including a feedback apparatus electrically connected to the computing device, and generating the tactile feedback via the feedback apparatus in response to a pressure on the feedback apparatus applied by a user.

Type: Application

Filed: August 13, 2013

Publication date: June 19, 2014

Applicant: International Business Machines Corporation

Inventors: Siddique Mohammed, David Nahamoo, Dhandapani Shanmugam
HAPTIC ACCESSORY AND METHODS FOR USING SAME

Publication number: 20140167938

Abstract: A method for providing tactile feedback comprises displaying a visual representation of a physical object having at least one haptic property, generating time-varying data associated with the at least one haptic property from the visual representation, sending the time-varying data to a computing device including a feedback apparatus electrically connected to the computing device, and generating the tactile feedback via the feedback apparatus in response to a pressure on the feedback apparatus applied by a user.

Type: Application

Filed: April 17, 2013

Publication date: June 19, 2014

Applicant: International Business Machines Corporation

Inventors: Siddique Mohammed, David Nahamoo, Dhandapani Shanmugam
Optimization of natural language processing system based on conditional output quality at risk

Patent number: 8660836

Abstract: Techniques are disclosed for optimizing results output by a natural language processing system. For example, a method comprises optimizing one or more parameters of a natural language processing system so as to improve a measure of quality of an output of the natural language processing system for a first type of data processed by the natural language processing system while maintaining a given measure of quality of an output of the natural language processing system for a second type of data processed by the natural language processing system. For example, the first type of data may have a substantive complexity that is greater than that of the second type of data.

Type: Grant

Filed: March 28, 2011

Date of Patent: February 25, 2014

Assignee: International Business Machines Corporation

Inventors: Vittorio Castelli, David Nahamoo, Bing Zhao
System for preventing handheld device use while operating a vehicle

Patent number: 8547214

Abstract: Techniques for preventing a driver of a moving vehicle from using a handheld device while driving. An example system of the invention includes a plurality of biometric sensors configured to receive biometric data from the driver and a user of the handheld device. Contemporaneously with operation of the vehicle and the handheld device, the biometric data is analyzed in order to determine a match between the identity of the vehicle driver and the user of the handheld device. A controller is configured to selectively interrupt operation of the vehicle or handheld device upon detecting the match.

Type: Grant

Filed: June 11, 2010

Date of Patent: October 1, 2013

Assignee: International Business Machines Corporation

Inventors: Sara H. Basson, Dimitri Kanevsky, David Nahamoo, Tara N. Sainath
Directional optimization via EBW

Patent number: 8527566

Abstract: An optimization system and method includes determining a best gradient as a sparse direction in a function having a plurality of parameters. The sparse direction includes a direction that maximizes change of the function. This maximum change of the function is determined by performing an optimization process that gives maximum growth subject to a sparsity regularized constraint. An extended Baum Welch (EBW) method can be used to identify the sparse direction. A best step size is determined along the sparse direction by finding magnitudes of entries of direction that maximizes the function restricted to the sparse direction. A solution is recursively refined for the function optimization using a processor and storage media.

Type: Grant

Filed: May 11, 2010

Date of Patent: September 3, 2013

Assignee: International Business Machines Corporation

Inventors: Dimitri Kanevsky, David Nahamoo, Bhuvana Ramabhadran, Tara N. Sainath
Sparse representation features for speech recognition

Patent number: 8484023

Abstract: Techniques are disclosed for generating and using sparse representation features to improve speech recognition performance. In particular, principles of the invention provide sparse representation exemplar-based recognition techniques. For example, a method comprises the following steps. A test vector and a training data set associated with a speech recognition system are obtained. A subset of the training data set is selected. The test vector is mapped with the selected subset of the training data set as a linear combination that is weighted by a sparseness constraint such that a new test feature set is formed wherein the training data set is moved more closely to the test vector subject to the sparseness constraint. An acoustic model is trained on the new test feature set. The acoustic model trained on the new test feature set may be used to decode user speech input to the speech recognition system.

Type: Grant

Filed: September 24, 2010

Date of Patent: July 9, 2013

Assignee: Nuance Communications, Inc.

Inventors: Dimitri Kanevsky, David Nahamoo, Bhuvana Ramabhadran, Tara N. Sainath
Phonetic features for speech recognition

Patent number: 8484024

Abstract: Techniques are disclosed for using phonetic features for speech recognition. For example, a method comprises the steps of obtaining a first dictionary and a training data set associated with a speech recognition system, computing one or more support parameters from the training data set, transforming the first dictionary into a second dictionary, wherein the second dictionary is a function of one or more phonetic labels of the first dictionary, and using the one or more support parameters to select one or more samples from the second dictionary to create a set of one or more exemplar-based class identification features for a pattern recognition task.

Type: Grant

Filed: February 24, 2011

Date of Patent: July 9, 2013

Assignee: Nuance Communications, Inc.

Inventors: Dimitri Kanevsky, David Nahamoo, Bhuvana Ramabhadran, Tara N. Sainath
MT Based Spoken Dialog Systems Customer/Machine Dialog

Publication number: 20130073276

Abstract: Operation of an automated dialog system is described using a source language to conduct a real time human machine dialog process with a human user using a target language. A user query in the target language is received and automatically machine translated into the source language. An automated reply of the dialog process is then delivered to the user in the target language. If the dialog process reaches an initial assistance state, a first human agent using the source language is provided to interact in real time with the user in the target language by machine translation to continue the dialog process. Then if the dialog process reaches a further assistance state, a second human agent using the target language is provided to interact in real time with the user in the target language to continue the dialog process.

Type: Application

Filed: September 19, 2011

Publication date: March 21, 2013

Applicant: NUANCE COMMUNICATIONS, INC.

Inventors: Ruhi Sarikaya, Vaibhava Goel, David Nahamoo, Real Tremblay, Bhuvana Ramabhadran, Osamuyimen Stewart
Aggregating multimodal inputs based on overlapping temporal life cycles

Patent number: 8370162

Abstract: In a voice processing system, a multimodal request is received from a plurality of modality input devices, and the requested application is run to provide a user with the feedback of the multimodal request. In the voice processing system, a multimodal aggregating unit is provided which receives a multimodal input from a plurality of modality input devices, and provides an aggregated result to an application control based on the interpretation of the interaction ergonomics of the multimodal input within the temporal constraints of the multimodal input. Thus, the multimodal input from the user is recognized within a temporal window. Interpretation of the interaction ergonomics of the multimodal input include interpretation of interaction biometrics and interaction mechani-metrics, wherein the interaction input of at least one modality may be used to bring meaning to at least one other input of another modality.

Type: Grant

Filed: September 23, 2011

Date of Patent: February 5, 2013

Assignee: Nuance Communications, Inc.

Inventors: Alexander Faisman, Dimitri Kanevsky, David Nahamoo, Roberto Sicconi, Mahesh Viswanathan
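One simple reading of "recognized within a temporal window" in the abstract above is that inputs from different modalities are aggregated when their time intervals overlap. The sketch below illustrates only that reading. The `ModalInput` type, its fields, and the `aggregate` function are hypothetical names, and the interpretation of interaction ergonomics that the abstract describes is not modeled.

```python
from dataclasses import dataclass


@dataclass
class ModalInput:
    modality: str   # e.g. "speech" or "touch"
    payload: str    # recognized content of the input
    start: float    # seconds
    end: float      # seconds


def aggregate(inputs):
    """Group inputs whose [start, end] intervals overlap, in start-time order."""
    groups = []
    current = []
    current_end = None
    for item in sorted(inputs, key=lambda x: x.start):
        if current and item.start <= current_end:
            # Overlaps the running group: merge it and extend the window.
            current.append(item)
            current_end = max(current_end, item.end)
        else:
            if current:
                groups.append(current)
            current = [item]
            current_end = item.end
    if current:
        groups.append(current)
    return groups


events = [
    ModalInput("speech", "move this there", 0.0, 2.0),
    ModalInput("touch", "icon_7", 0.5, 0.6),
    ModalInput("touch", "folder_2", 1.5, 1.6),
    ModalInput("speech", "open", 5.0, 5.5),
]
groups = aggregate(events)
```

Here the two touches fall inside the lifetime of the spoken command, so all three land in one group and the touches can give meaning to "this" and "there". The later "open" utterance forms its own group.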
Processing user input in accordance with input types accepted by an application

Patent number: 8370163

Abstract: In a voice processing system, a multimodal request is received from a plurality of modality input devices, and the requested application is run to provide a user with the feedback of the multimodal request. In the voice processing system, a multimodal aggregating unit is provided which receives a multimodal input from a plurality of modality input devices, and provides an aggregated result to an application control based on the interpretation of the interaction ergonomics of the multimodal input within the temporal constraints of the multimodal input. Thus, the multimodal input from the user is recognized within a temporal window. Interpretation of the interaction ergonomics of the multimodal input include interpretation of interaction biometrics and interaction mechani-metrics, wherein the interaction input of at least one modality may be used to bring meaning to at least one other input of another modality.

Type: Grant

Filed: September 23, 2011

Date of Patent: February 5, 2013

Assignee: Nuance Communications, Inc.

Inventors: Alexander Faisman, Dimitri Kanevsky, David Nahamoo, Roberto Sicconi, Mahesh Viswanathan
MULTIMODAL AGGREGATING UNIT

Publication number: 20130013320

Abstract: In a voice processing system, a multimodal request is received from a plurality of modality input devices, and the requested application is run to provide a user with the feedback of the multimodal request. In the voice processing system, a multimodal aggregating unit is provided which receives a multimodal input from a plurality of modality input devices, and provides an aggregated result to an application control based on the interpretation of the interaction ergonomics of the multimodal input within the temporal constraints of the multimodal input. Thus, the multimodal input from the user is recognized within a temporal window. Interpretation of the interaction ergonomics of the multimodal input include interpretation of interaction biometrics and interaction mechani-metrics, wherein the interaction input of at least one modality may be used to bring meaning to at least one other input of another modality.

Type: Application

Filed: September 14, 2012

Publication date: January 10, 2013

Applicant: Nuance Communications, Inc.

Inventors: Alexander Faisman, Dimitri Kanevsky, David Nahamoo, Roberto Sicconi, Mahesh Viswanathan
