Patents by Inventor David Nahamoo
David Nahamoo has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 8930182
Abstract: Method, system, and computer program product for voice transformation are provided. The method includes transforming a source speech using transformation parameters, and encoding information on the transformation parameters in an output speech using steganography, wherein the source speech can be reconstructed using the output speech and the information on the transformation parameters. A method for reconstructing voice transformation is also provided including: receiving an output speech of a voice transformation system wherein the output speech is transformed speech which has encoded information on the transformation parameters using steganography; extracting the information on the transformation parameters; and carrying out an inverse transformation of the output speech to obtain an approximation of an original source speech.
Type: Grant
Filed: March 17, 2011
Date of Patent: January 6, 2015
Assignee: International Business Machines Corporation
Inventors: Shay Ben-David, Ron Hoory, Zvi Kons, David Nahamoo
-
Patent number: 8924210
Abstract: Techniques for converting spoken speech into written speech are provided. The techniques include transcribing input speech via speech recognition, mapping each spoken utterance from input speech into a corresponding formal utterance, and mapping each formal utterance into a stylistically formatted written utterance.
Type: Grant
Filed: May 28, 2014
Date of Patent: December 30, 2014
Assignee: Nuance Communications, Inc.
Inventors: Sara H. Basson, Rick Hamilton, Dan Ning Jiang, Dimitri Kanevsky, David Nahamoo, Michael Picheny, Bhuvana Ramabhadran, Tara N. Sainath
-
Patent number: 8903052
Abstract: Embodiments of the invention provide a method, system and computer program product for voice print tagging for interactive voice response (IVR) session management. In an embodiment of the invention, a method of voiceprint tagging for IVR session management is provided. The method includes establishing an IVR session for a caller from over a network and presenting a portion of the IVR session to the caller over the network. The method also includes storing a voiceprint tag in memory associating a voiceprint of the caller with a portion of the IVR session. Finally, the method includes responding to a premature termination of the IVR session by re-establishing the prematurely terminated IVR session with the caller at the portion of the IVR session indicated by the voiceprint tag of the caller.
Type: Grant
Filed: March 15, 2013
Date of Patent: December 2, 2014
Assignee: International Business Machines Corporation
Inventors: Victor S. Moore, David Nahamoo, Wendi L. Nusbickel, Christopher J. Vavra
-
Patent number: 8867707
Abstract: Techniques for automatically providing updated meeting information are provided. The techniques include facilitating receipt of a message pertaining to a meeting, automatically interpreting the message to determine if the message requires that meeting information be changed, automatically updating the meeting information if a change is required from the message, and automatically sending a message to each meeting participant informing each participant of the updated meeting information.
Type: Grant
Filed: March 23, 2011
Date of Patent: October 21, 2014
Assignee: International Business Machines Corporation
Inventors: Lior Horesh, Dimitri Kanevsky, David Nahamoo, Tara N. Sainath
-
Patent number: 8856002
Abstract: A universal pattern processing system receives input data and produces output patterns that are best associated with said data. The system uses input means receiving and processing input data, a universal pattern decoder means transforming models using the input data and associating output patterns with original models that are changed least during transforming, and output means outputting best associated patterns chosen by a pattern decoder means.
Type: Grant
Filed: April 11, 2008
Date of Patent: October 7, 2014
Assignee: International Business Machines Corporation
Inventors: Dimitri Kanevsky, David Nahamoo, Tara N Sainath
-
Patent number: 8856004
Abstract: Techniques for converting spoken speech into written speech are provided. The techniques include transcribing input speech via speech recognition, mapping each spoken utterance from input speech into a corresponding formal utterance, and mapping each formal utterance into a stylistically formatted written utterance.
Type: Grant
Filed: May 13, 2011
Date of Patent: October 7, 2014
Assignee: Nuance Communications, Inc.
Inventors: Sara H. Basson, Rick Hamilton, Dan Ning Jiang, Dimitri Kanevsky, David Nahamoo, Michael Picheny, Bhuvana Ramabhadran, Tara N. Sainath
-
Publication number: 20140270112
Abstract: Embodiments of the invention provide a method, system and computer program product for voice print tagging for interactive voice response (IVR) session management. In an embodiment of the invention, a method of voiceprint tagging for IVR session management is provided. The method includes establishing an IVR session for a caller from over a network and presenting a portion of the IVR session to the caller over the network. The method also includes storing a voiceprint tag in memory associating a voiceprint of the caller with a portion of the IVR session. Finally, the method includes responding to a premature termination of the IVR session by re-establishing the prematurely terminated IVR session with the caller at the portion of the IVR session indicated by the voiceprint tag of the caller.
Type: Application
Filed: March 15, 2013
Publication date: September 18, 2014
Applicant: International Business Machines Corporation
Inventors: Victor S. Moore, David Nahamoo, Wendi L. Nusbickel, Christopher J. Vavra
-
Publication number: 20140270113
Abstract: Embodiments of the invention provide a method, system and computer program product for voice print tagging for interactive voice response (IVR) session management. In an embodiment of the invention, a method of voiceprint tagging for IVR session management is provided. The method includes establishing an IVR session for a caller from over a network and presenting a portion of the IVR session to the caller over the network. The method also includes storing a voiceprint tag in memory associating a voiceprint of the caller with a portion of the IVR session. Finally, the method includes responding to a premature termination of the IVR session by re-establishing the prematurely terminated IVR session with the caller at the portion of the IVR session indicated by the voiceprint tag of the caller.
Type: Application
Filed: October 22, 2013
Publication date: September 18, 2014
Applicant: International Business Machines Corporation
Inventors: Victor S. Moore, David Nahamoo, Wendi L. Nusbickel, Christopher J. Vavra
-
Publication number: 20140278410
Abstract: Techniques for converting spoken speech into written speech are provided. The techniques include transcribing input speech via speech recognition, mapping each spoken utterance from input speech into a corresponding formal utterance, and mapping each formal utterance into a stylistically formatted written utterance.
Type: Application
Filed: May 28, 2014
Publication date: September 18, 2014
Applicant: Nuance Communications, Inc.
Inventors: Sara H. Basson, Rick Hamilton, Dan Ning Jiang, Dimitri Kanevsky, David Nahamoo, Michael Picheny, Bhuvana Ramabhadran, Tara N. Sainath
-
Publication number: 20140167939
Abstract: A method for providing tactile feedback comprises displaying a visual representation of a physical object having at least one haptic property, generating time-varying data associated with the at least one haptic property from the visual representation, sending the time-varying data to a computing device including a feedback apparatus electrically connected to the computing device, and generating the tactile feedback via the feedback apparatus in response to a pressure on the feedback apparatus applied by a user.
Type: Application
Filed: August 13, 2013
Publication date: June 19, 2014
Applicant: International Business Machines Corporation
Inventors: Siddique Mohammed, David Nahamoo, Dhandapani Shanmugam
-
Publication number: 20140167938
Abstract: A method for providing tactile feedback comprises displaying a visual representation of a physical object having at least one haptic property, generating time-varying data associated with the at least one haptic property from the visual representation, sending the time-varying data to a computing device including a feedback apparatus electrically connected to the computing device, and generating the tactile feedback via the feedback apparatus in response to a pressure on the feedback apparatus applied by a user.
Type: Application
Filed: April 17, 2013
Publication date: June 19, 2014
Applicant: International Business Machines Corporation
Inventors: Siddique Mohammed, David Nahamoo, Dhandapani Shanmugam
-
Patent number: 8660836
Abstract: Techniques are disclosed for optimizing results output by a natural language processing system. For example, a method comprises optimizing one or more parameters of a natural language processing system so as to improve a measure of quality of an output of the natural language processing system for a first type of data processed by the natural language processing system while maintaining a given measure of quality of an output of the natural language processing system for a second type of data processed by the natural language processing system. For example, the first type of data may have a substantive complexity that is greater than that of the second type of data.
Type: Grant
Filed: March 28, 2011
Date of Patent: February 25, 2014
Assignee: International Business Machines Corporation
Inventors: Vittorio Castelli, David Nahamoo, Bing Zhao
-
Patent number: 8547214
Abstract: Techniques for preventing a driver of a moving vehicle from using a handheld device while driving. An example system of the invention includes a plurality of biometric sensors configured to receive biometric data from the driver and a user of the handheld device. Contemporaneously with operation of the vehicle and the handheld device, the biometric data is analyzed in order to determine a match between the identity of the vehicle driver and the user of the handheld device. A controller is configured to selectively interrupt operation of the vehicle or handheld device upon detecting the match.
Type: Grant
Filed: June 11, 2010
Date of Patent: October 1, 2013
Assignee: International Business Machines Corporation
Inventors: Sara H. Basson, Dimitri Kanevsky, David Nahamoo, Tara N. Sainath
-
Patent number: 8527566
Abstract: An optimization system and method includes determining a best gradient as a sparse direction in a function having a plurality of parameters. The sparse direction includes a direction that maximizes change of the function. This maximum change of the function is determined by performing an optimization process that gives maximum growth subject to a sparsity regularized constraint. An extended Baum Welch (EBW) method can be used to identify the sparse direction. A best step size is determined along the sparse direction by finding magnitudes of entries of direction that maximizes the function restricted to the sparse direction. A solution is recursively refined for the function optimization using a processor and storage media.
Type: Grant
Filed: May 11, 2010
Date of Patent: September 3, 2013
Assignee: International Business Machines Corporation
Inventors: Dimitri Kanevsky, David Nahamoo, Bhuvana Ramabhadran, Tara N. Sainath
-
Patent number: 8484023
Abstract: Techniques are disclosed for generating and using sparse representation features to improve speech recognition performance. In particular, principles of the invention provide sparse representation exemplar-based recognition techniques. For example, a method comprises the following steps. A test vector and a training data set associated with a speech recognition system are obtained. A subset of the training data set is selected. The test vector is mapped with the selected subset of the training data set as a linear combination that is weighted by a sparseness constraint such that a new test feature set is formed wherein the training data set is moved more closely to the test vector subject to the sparseness constraint. An acoustic model is trained on the new test feature set. The acoustic model trained on the new test feature set may be used to decode user speech input to the speech recognition system.
Type: Grant
Filed: September 24, 2010
Date of Patent: July 9, 2013
Assignee: Nuance Communications, Inc.
Inventors: Dimitri Kanevsky, David Nahamoo, Bhuvana Ramabhadran, Tara N. Sainath
-
Patent number: 8484024
Abstract: Techniques are disclosed for using phonetic features for speech recognition. For example, a method comprises the steps of obtaining a first dictionary and a training data set associated with a speech recognition system, computing one or more support parameters from the training data set, transforming the first dictionary into a second dictionary, wherein the second dictionary is a function of one or more phonetic labels of the first dictionary, and using the one or more support parameters to select one or more samples from the second dictionary to create a set of one or more exemplar-based class identification features for a pattern recognition task.
Type: Grant
Filed: February 24, 2011
Date of Patent: July 9, 2013
Assignee: Nuance Communications, Inc.
Inventors: Dimitri Kanevsky, David Nahamoo, Bhuvana Ramabhadran, Tara N. Sainath
-
Publication number: 20130073276
Abstract: Operation of an automated dialog system is described using a source language to conduct a real time human machine dialog process with a human user using a target language. A user query in the target language is received and automatically machine translated into the source language. An automated reply of the dialog process is then delivered to the user in the target language. If the dialog process reaches an initial assistance state, a first human agent using the source language is provided to interact in real time with the user in the target language by machine translation to continue the dialog process. Then if the dialog process reaches a further assistance state, a second human agent using the target language is provided to interact in real time with the user in the target language to continue the dialog process.
Type: Application
Filed: September 19, 2011
Publication date: March 21, 2013
Applicant: NUANCE COMMUNICATIONS, INC.
Inventors: Ruhi Sarikaya, Vaibhava Goel, David Nahamoo, Real Tremblay, Bhuvana Ramabhadran, Osamuyimen Stewart
-
Patent number: 8370162
Abstract: In a voice processing system, a multimodal request is received from a plurality of modality input devices, and the requested application is run to provide a user with the feedback of the multimodal request. In the voice processing system, a multimodal aggregating unit is provided which receives a multimodal input from a plurality of modality input devices, and provides an aggregated result to an application control based on the interpretation of the interaction ergonomics of the multimodal input within the temporal constraints of the multimodal input. Thus, the multimodal input from the user is recognized within a temporal window. Interpretation of the interaction ergonomics of the multimodal input include interpretation of interaction biometrics and interaction mechani-metrics, wherein the interaction input of at least one modality may be used to bring meaning to at least one other input of another modality.
Type: Grant
Filed: September 23, 2011
Date of Patent: February 5, 2013
Assignee: Nuance Communications, Inc.
Inventors: Alexander Faisman, Dimitri Kanevsky, David Nahamoo, Roberto Sicconi, Mahesh Viswanathan
-
Patent number: 8370163
Abstract: In a voice processing system, a multimodal request is received from a plurality of modality input devices, and the requested application is run to provide a user with the feedback of the multimodal request. In the voice processing system, a multimodal aggregating unit is provided which receives a multimodal input from a plurality of modality input devices, and provides an aggregated result to an application control based on the interpretation of the interaction ergonomics of the multimodal input within the temporal constraints of the multimodal input. Thus, the multimodal input from the user is recognized within a temporal window. Interpretation of the interaction ergonomics of the multimodal input include interpretation of interaction biometrics and interaction mechani-metrics, wherein the interaction input of at least one modality may be used to bring meaning to at least one other input of another modality.
Type: Grant
Filed: September 23, 2011
Date of Patent: February 5, 2013
Assignee: Nuance Communications, Inc.
Inventors: Alexander Faisman, Dimitri Kanevsky, David Nahamoo, Roberto Sicconi, Mahesh Viswanathan
-
Publication number: 20130013320
Abstract: In a voice processing system, a multimodal request is received from a plurality of modality input devices, and the requested application is run to provide a user with the feedback of the multimodal request. In the voice processing system, a multimodal aggregating unit is provided which receives a multimodal input from a plurality of modality input devices, and provides an aggregated result to an application control based on the interpretation of the interaction ergonomics of the multimodal input within the temporal constraints of the multimodal input. Thus, the multimodal input from the user is recognized within a temporal window. Interpretation of the interaction ergonomics of the multimodal input include interpretation of interaction biometrics and interaction mechani-metrics, wherein the interaction input of at least one modality may be used to bring meaning to at least one other input of another modality.
Type: Application
Filed: September 14, 2012
Publication date: January 10, 2013
Applicant: Nuance Communications, Inc.
Inventors: Alexander Faisman, Dimitri Kanevsky, David Nahamoo, Roberto Sicconi, Mahesh Viswanathan