Patents by Inventor David Nahamoo

David Nahamoo has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 8856004
    Abstract: Techniques for converting spoken speech into written speech are provided. The techniques include transcribing input speech via speech recognition, mapping each spoken utterance from input speech into a corresponding formal utterance, and mapping each formal utterance into a stylistically formatted written utterance.
    Type: Grant
    Filed: May 13, 2011
    Date of Patent: October 7, 2014
    Assignee: Nuance Communications, Inc.
    Inventors: Sara H. Basson, Rick Hamilton, Dan Ning Jiang, Dimitri Kanevsky, David Nahamoo, Michael Picheny, Bhuvana Ramabhadran, Tara N. Sainath
  • Publication number: 20140270113
    Abstract: Embodiments of the invention provide a method, system and computer program product for voice print tagging for interactive voice response (IVR) session management. In an embodiment of the invention, a method of voiceprint tagging for IVR session management is provided. The method includes establishing an IVR session for a caller from over a network and presenting a portion of the IVR session to the caller over the network. The method also includes storing a voiceprint tag in memory associating a voiceprint of the caller with a portion of the IVR session. Finally, the method includes responding to a premature termination of the IVR session by re-establishing the prematurely terminated IVR session with the caller at the portion of the IVR session indicated by the voiceprint tag of the caller.
    Type: Application
    Filed: October 22, 2013
    Publication date: September 18, 2014
    Applicant: International Business Machines Corporation
    Inventors: Victor S. Moore, David Nahamoo, Wendi L. Nusbickel, Christopher J. Vavra
  • Publication number: 20140270112
    Abstract: Embodiments of the invention provide a method, system and computer program product for voice print tagging for interactive voice response (IVR) session management. In an embodiment of the invention, a method of voiceprint tagging for IVR session management is provided. The method includes establishing an IVR session for a caller from over a network and presenting a portion of the IVR session to the caller over the network. The method also includes storing a voiceprint tag in memory associating a voiceprint of the caller with a portion of the IVR session. Finally, the method includes responding to a premature termination of the IVR session by re-establishing the prematurely terminated IVR session with the caller at the portion of the IVR session indicated by the voiceprint tag of the caller.
    Type: Application
    Filed: March 15, 2013
    Publication date: September 18, 2014
    Applicant: International Business Machines Corporation
    Inventors: Victor S. Moore, David Nahamoo, Wendi L. Nusbickel, Christopher J. Vavra
  • Publication number: 20140278410
    Abstract: Techniques for converting spoken speech into written speech are provided. The techniques include transcribing input speech via speech recognition, mapping each spoken utterance from input speech into a corresponding formal utterance, and mapping each formal utterance into a stylistically formatted written utterance.
    Type: Application
    Filed: May 28, 2014
    Publication date: September 18, 2014
    Applicant: Nuance Communications, Inc.
    Inventors: Sara H. Basson, Rick Hamilton, Dan Ning Jiang, Dimitri Kanevsky, David Nahamoo, Michael Picheny, Bhuvana Ramabhadran, Tara N. Sainath
  • Publication number: 20140167938
    Abstract: A method for providing tactile feedback comprises displaying a visual representation of a physical object having at least one haptic property, generating time-varying data associated with the at least one haptic property from the visual representation, sending the time-varying data to a computing device including a feedback apparatus electrically connected to the computing device, and generating the tactile feedback via the feedback apparatus in response to a pressure on the feedback apparatus applied by a user.
    Type: Application
    Filed: April 17, 2013
    Publication date: June 19, 2014
    Applicant: International Business Machines Corporation
    Inventors: Siddique Mohammed, David Nahamoo, Dhandapani Shanmugam
  • Publication number: 20140167939
    Abstract: A method for providing tactile feedback comprises displaying a visual representation of a physical object having at least one haptic property, generating time-varying data associated with the at least one haptic property from the visual representation, sending the time-varying data to a computing device including a feedback apparatus electrically connected to the computing device, and generating the tactile feedback via the feedback apparatus in response to a pressure on the feedback apparatus applied by a user.
    Type: Application
    Filed: August 13, 2013
    Publication date: June 19, 2014
    Applicant: International Business Machines Corporation
    Inventors: Siddique Mohammed, David Nahamoo, Dhandapani Shanmugam
  • Patent number: 8660836
    Abstract: Techniques are disclosed for optimizing results output by a natural language processing system. For example, a method comprises optimizing one or more parameters of a natural language processing system so as to improve a measure of quality of an output of the natural language processing system for a first type of data processed by the natural language processing system while maintaining a given measure of quality of an output of the natural language processing system for a second type of data processed by the natural language processing system. For example, the first type of data may have a substantive complexity that is greater than that of the second type of data.
    Type: Grant
    Filed: March 28, 2011
    Date of Patent: February 25, 2014
    Assignee: International Business Machines Corporation
    Inventors: Vittorio Castelli, David Nahamoo, Bing Zhao
  • Patent number: 8547214
    Abstract: Techniques for preventing a driver of a moving vehicle from using a handheld device while driving. An example system of the invention includes a plurality of biometric sensors configured to receive biometric data from the driver and a user of the handheld device. Contemporaneously with operation of the vehicle and the handheld device, the biometric data is analyzed in order to determine a match between the identity of the vehicle driver and the user of the handheld device. A controller is configured to selectively interrupt operation of the vehicle or handheld device upon detecting the match.
    Type: Grant
    Filed: June 11, 2010
    Date of Patent: October 1, 2013
    Assignee: International Business Machines Corporation
    Inventors: Sara H. Basson, Dimitri Kanevsky, David Nahamoo, Tara N. Sainath
  • Patent number: 8527566
    Abstract: An optimization system and method includes determining a best gradient as a sparse direction in a function having a plurality of parameters. The sparse direction includes a direction that maximizes change of the function. This maximum change of the function is determined by performing an optimization process that gives maximum growth subject to a sparsity regularized constraint. An extended Baum Welch (EBW) method can be used to identify the sparse direction. A best step size is determined along the sparse direction by finding magnitudes of entries of direction that maximizes the function restricted to the sparse direction. A solution is recursively refined for the function optimization using a processor and storage media.
    Type: Grant
    Filed: May 11, 2010
    Date of Patent: September 3, 2013
    Assignee: International Business Machines Corporation
    Inventors: Dimitri Kanevsky, David Nahamoo, Bhuvana Ramabhadran, Tara N. Sainath
  • Patent number: 8484023
    Abstract: Techniques are disclosed for generating and using sparse representation features to improve speech recognition performance. In particular, principles of the invention provide sparse representation exemplar-based recognition techniques. For example, a method comprises the following steps. A test vector and a training data set associated with a speech recognition system are obtained. A subset of the training data set is selected. The test vector is mapped with the selected subset of the training data set as a linear combination that is weighted by a sparseness constraint such that a new test feature set is formed wherein the training data set is moved more closely to the test vector subject to the sparseness constraint. An acoustic model is trained on the new test feature set. The acoustic model trained on the new test feature set may be used to decode user speech input to the speech recognition system.
    Type: Grant
    Filed: September 24, 2010
    Date of Patent: July 9, 2013
    Assignee: Nuance Communications, Inc.
    Inventors: Dimitri Kanevsky, David Nahamoo, Bhuvana Ramabhadran, Tara N. Sainath
  • Patent number: 8484024
    Abstract: Techniques are disclosed for using phonetic features for speech recognition. For example, a method comprises the steps of obtaining a first dictionary and a training data set associated with a speech recognition system, computing one or more support parameters from the training data set, transforming the first dictionary into a second dictionary, wherein the second dictionary is a function of one or more phonetic labels of the first dictionary, and using the one or more support parameters to select one or more samples from the second dictionary to create a set of one or more exemplar-based class identification features for a pattern recognition task.
    Type: Grant
    Filed: February 24, 2011
    Date of Patent: July 9, 2013
    Assignee: Nuance Communications, Inc.
    Inventors: Dimitri Kanevsky, David Nahamoo, Bhuvana Ramabhadran, Tara N. Sainath
  • Publication number: 20130073276
    Abstract: Operation of an automated dialog system is described using a source language to conduct a real time human machine dialog process with a human user using a target language. A user query in the target language is received and automatically machine translated into the source language. An automated reply of the dialog process is then delivered to the user in the target language. If the dialog process reaches an initial assistance state, a first human agent using the source language is provided to interact in real time with the user in the target language by machine translation to continue the dialog process. Then if the dialog process reaches a further assistance state, a second human agent using the target language is provided to interact in real time with the user in the target language to continue the dialog process.
    Type: Application
    Filed: September 19, 2011
    Publication date: March 21, 2013
    Applicant: NUANCE COMMUNICATIONS, INC.
    Inventors: Ruhi Sarikaya, Vaibhava Goel, David Nahamoo, Real Tremblay, Bhuvana Ramabhadran, Osamuyimen Stewart
  • Patent number: 8370163
    Abstract: In a voice processing system, a multimodal request is received from a plurality of modality input devices, and the requested application is run to provide a user with the feedback of the multimodal request. In the voice processing system, a multimodal aggregating unit is provided which receives a multimodal input from a plurality of modality input devices, and provides an aggregated result to an application control based on the interpretation of the interaction ergonomics of the multimodal input within the temporal constraints of the multimodal input. Thus, the multimodal input from the user is recognized within a temporal window. Interpretation of the interaction ergonomics of the multimodal input include interpretation of interaction biometrics and interaction mechani-metrics, wherein the interaction input of at least one modality may be used to bring meaning to at least one other input of another modality.
    Type: Grant
    Filed: September 23, 2011
    Date of Patent: February 5, 2013
    Assignee: Nuance Communications, Inc.
    Inventors: Alexander Faisman, Dimitri Kanevsky, David Nahamoo, Roberto Sicconi, Mahesh Viswanathan
  • Patent number: 8370162
    Abstract: In a voice processing system, a multimodal request is received from a plurality of modality input devices, and the requested application is run to provide a user with the feedback of the multimodal request. In the voice processing system, a multimodal aggregating unit is provided which receives a multimodal input from a plurality of modality input devices, and provides an aggregated result to an application control based on the interpretation of the interaction ergonomics of the multimodal input within the temporal constraints of the multimodal input. Thus, the multimodal input from the user is recognized within a temporal window. Interpretation of the interaction ergonomics of the multimodal input include interpretation of interaction biometrics and interaction mechani-metrics, wherein the interaction input of at least one modality may be used to bring meaning to at least one other input of another modality.
    Type: Grant
    Filed: September 23, 2011
    Date of Patent: February 5, 2013
    Assignee: Nuance Communications, Inc.
    Inventors: Alexander Faisman, Dimitri Kanevsky, David Nahamoo, Roberto Sicconi, Mahesh Viswanathan
  • Publication number: 20130013320
    Abstract: In a voice processing system, a multimodal request is received from a plurality of modality input devices, and the requested application is run to provide a user with the feedback of the multimodal request. In the voice processing system, a multimodal aggregating unit is provided which receives a multimodal input from a plurality of modality input devices, and provides an aggregated result to an application control based on the interpretation of the interaction ergonomics of the multimodal input within the temporal constraints of the multimodal input. Thus, the multimodal input from the user is recognized within a temporal window. Interpretation of the interaction ergonomics of the multimodal input include interpretation of interaction biometrics and interaction mechani-metrics, wherein the interaction input of at least one modality may be used to bring meaning to at least one other input of another modality.
    Type: Application
    Filed: September 14, 2012
    Publication date: January 10, 2013
    Applicant: Nuance Communications, Inc.
    Inventors: Alexander Faisman, Dimitri Kanevsky, David Nahamoo, Roberto Sicconi, Mahesh Viswanathan
  • Publication number: 20120310643
    Abstract: Techniques for presenting data input as a plurality of data chunks including a first data chunk and a second data chunk. The techniques include converting the plurality of data chunks to a textual representation comprising a plurality of text chunks including a first text chunk corresponding to the first data chunk and a second text chunk corresponding to the second data chunk, respectively, and providing a presentation of at least part of the textual representation such that the first text chunk is presented differently than the second text chunk to, when presented, assist a user in proofing the textual representation.
    Type: Application
    Filed: May 23, 2012
    Publication date: December 6, 2012
    Applicant: Nuance Communications, Inc.
    Inventors: Martin Labsky, Jan Kleindienst, Tomas Macek, David Nahamoo, Jan Curin, Lars Koenig, Holger Quast
  • Publication number: 20120304057
    Abstract: Techniques for error correction using a history list comprising at least one misrecognition and correction information associated with each of the at least one misrecognitions indicating how a user corrected the associated misrecognition. The techniques include converting data input from a user to generate a text segment, determining whether at least a portion of the text segment appears in the history list as one of the at least one misrecognitions, if the at least a portion of the text segment appears in the history list as one of the at least one misrecognitions, obtaining the correction information associated with the at least one misrecognition, and correcting the at least a portion of the text segment based, at least in part, on the correction information.
    Type: Application
    Filed: May 23, 2012
    Publication date: November 29, 2012
    Applicant: NUANCE COMMUNICATIONS, INC.
    Inventors: Martin Labsky, Jan Kleindienst, Tomas Macek, David Nahamoo, Jan Curin, William F. Ganong, III
  • Publication number: 20120303371
    Abstract: Techniques for disambiguating at least one text segment from at least one acoustically similar word and/or phrase. The techniques include identifying at least one text segment, in a textual representation having a plurality of text segments, having at least one acoustically similar word and/or phrase, annotating the textual representation with disambiguating information to help disambiguate the at least one text segment from the at least one acoustically similar word and/or phrase, and synthesizing a speech signal, at least in part, by performing text-to-speech synthesis on at least a portion of the textual representation that includes the at least one text segment, wherein the speech signal includes speech corresponding to the disambiguating information located proximate the portion of the speech signal corresponding to the at least one text segment.
    Type: Application
    Filed: May 23, 2012
    Publication date: November 29, 2012
    Applicant: Nuance Communications, Inc.
    Inventors: Martin Labsky, Jan Kleindienst, Tomas Macek, David Nahamoo, Jan Curin, William F. Ganong, III
  • Publication number: 20120290299
    Abstract: Techniques for converting spoken speech into written speech are provided. The techniques include transcribing input speech via speech recognition, mapping each spoken utterance from input speech into a corresponding formal utterance, and mapping each formal utterance into a stylistically formatted written utterance.
    Type: Application
    Filed: May 13, 2011
    Publication date: November 15, 2012
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Sara H. Basson, Rick Hamilton, Dan Ning Jiang, Dimitri Kanevsky, David Nahamoo, Michael Picheny, Bhuvana Ramabhadran, Tara N. Sainath
  • Publication number: 20120253783
    Abstract: Techniques are disclosed for optimizing results output by a natural language processing system. For example, a method comprises optimizing one or more parameters of a natural language processing system so as to improve a measure of quality of an output of the natural language processing system for a first type of data processed by the natural language processing system while maintaining a given measure of quality of an output of the natural language processing system for a second type of data processed by the natural language processing system. For example, the first type of data may have a substantive complexity that is greater than that of the second type of data.
    Type: Application
    Filed: March 28, 2011
    Publication date: October 4, 2012
    Applicant: International Business Machines Corporation
    Inventors: Vittorio Castelli, David Nahamoo, Bing Zhao