Patents by Inventor Roberto Pieraccini

Roberto Pieraccini has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 8065148
    Abstract: A one-step correction mechanism for voice interaction is provided. Correction of a previous state is enabled simultaneously with recognition in a current or subsequent state. An application is decomposed into a set of tasks. Each task is associated with the collection of one piece of information. Each task may be in a different state. At any point during the interaction, while a task/state pair is active, the dialog manager may enable multiple other task/state pairs to be active in latent fashion. The application developer may then use those facilities or resources to the active task/state and the latent task/state pairs depending on contextual condition of the interaction state of the application.
    Type: Grant
    Filed: March 25, 2010
    Date of Patent: November 22, 2011
    Assignee: Nuance Communications, Inc.
    Inventors: Juan Manuel Huerta, Roberto Pieraccini
  • Patent number: 8041019
    Abstract: An interactive voice response (IVR) system which assists in identifying repeat callers, understanding whether they are calling for the same reason as one of their previous calls, and properly disposing of the call. If the repeat caller is calling for the same reason, information from the previous call (or the previous calls) is retrieved and an action based on a defined business logic for repeat callers may then be executed for the current call.
    Type: Grant
    Filed: January 10, 2007
    Date of Patent: October 18, 2011
    Assignee: SpeechCycle, Inc.
    Inventors: Roberto Pieraccini, Zor Gorelov, Alan Pan
  • Publication number: 20110208526
    Abstract: A method for variable resolution and error control in spoken language understanding (SLU) allows arranging the categories of the SLU into a hierarchy of different levels of specificity. The pre-determined hierarchy is used to identify different types of errors, such as high-cost errors and low-cost errors, and to trade, if necessary, high-cost errors for low-cost errors.
    Type: Application
    Filed: May 6, 2011
    Publication date: August 25, 2011
    Inventors: Roberto Pieraccini, Krishna Dayanidhi
  • Patent number: 7962339
    Abstract: A method for variable resolution and error control in spoken language understanding (SLU) allows arranging the categories of the SLU into a hierarchy of different levels of specificity. The pre-determined hierarchy is used to identify different types of errors, such as high-cost errors and low-cost errors, and to trade, if necessary, high-cost errors for low-cost errors.
    Type: Grant
    Filed: March 12, 2008
    Date of Patent: June 14, 2011
    Assignee: SpeechCycle, Inc.
    Inventors: Roberto Pieraccini, Krishna Dayanidhi
  • Publication number: 20110072466
    Abstract: A method includes steps of indexing a media collection, searching an indexed library and browsing a set of candidate program segments. The step of indexing a media collection creates the indexed library based on the content of the media collection. The step of searching the indexed library identifies the set of candidate program segments based on search criteria. The step of browsing the set of candidate program segments selects a segment for viewing.
    Type: Application
    Filed: November 30, 2010
    Publication date: March 24, 2011
    Applicant: AT&T Intellectual Property II, L.P. via transfer from AT&T Corp.
    Inventors: Andrea Basso, Mehmet Reha Civanlar, David Crawford Gibbon, Qian Huang, Esther Levin, Roberto Pieraccini, Behzad Shahraray
  • Publication number: 20110046951
    Abstract: A system and a method to generate statistical utterance classifiers optimized for the individual states of a spoken dialog system are disclosed. The system and method make use of large databases of transcribed and annotated utterances from calls collected in a dialog system in production and log data reporting the association between the state of the system at the moment when the utterances were recorded and the utterance. From the system state, being a vector of multiple system variables, subsets of these variables, certain variable ranges, quantized variable values, etc. can be extracted to produce a multitude of distinct utterance subsets matching every possible system state. For each of these subset and variable combinations, statistical classifiers can be trained, tuned, and tested, and the classifiers can be stored together with the performance results and the state subset and variable combination.
    Type: Application
    Filed: August 21, 2009
    Publication date: February 24, 2011
    Inventors: David Suendermann, Jackson Liscombe, Krishna Dayanidhi, Roberto Pieraccini
  • Patent number: 7877774
    Abstract: A method includes steps of indexing a media collection, searching an indexed library and browsing a set of candidate program segments. The step of indexing a media collection creates the indexed library based on the content of the media collection. The step of searching the indexed library identifies the set of candidate program segments based on search criteria. The step of browsing the set of candidate program segments selects a segment for viewing.
    Type: Grant
    Filed: April 19, 2000
    Date of Patent: January 25, 2011
    Assignee: AT&T Intellectual Property II, L.P.
    Inventors: Andrea Basso, Mehmet Reha Civanlar, David Crawford Gibbon, Qian Huang, Esther Levin, Roberto Pieraccini, Behzad Shahraray
  • Publication number: 20100268536
    Abstract: A method and apparatus for continuously improving the performance of semantic classifiers in the scope of spoken dialog systems are disclosed. Rule-based or statistical classifiers are replaced with better performing rule-based or statistical classifiers and/or certain parameters of existing classifiers are modified. The replacement classifiers or new parameters are trained and tested on a collection of transcriptions and annotations of utterances which are generated manually or in a partially automated fashion. Automated quality assurance leads to more accurate training and testing data, higher classification performance, and feedback into the design of the spoken dialog system by suggesting changes to improve system behavior.
    Type: Application
    Filed: April 17, 2009
    Publication date: October 21, 2010
    Inventors: David Suendermann, Keelan Evanini, Jackson Liscombe, Krishna Dayanidhi, Roberto Pieraccini
  • Publication number: 20100179805
    Abstract: A one-step correction mechanism for voice interaction is provided. Correction of a previous state is enabled simultaneously with recognition in a current or subsequent state. An application is decomposed into a set of tasks. Each task is associated with the collection of one piece of information. Each task may be in a different state. At any point during the interaction, while a task/state pair is active, the dialog manager may enable multiple other task/state pairs to be active in latent fashion. The application developer may then use those facilities or resources to the active task/state and the latent task/state pairs depending on contextual condition of the interaction state of the application.
    Type: Application
    Filed: March 25, 2010
    Publication date: July 15, 2010
    Applicant: Nuance Communications, Inc.
    Inventors: Juan Manuel Huerta, Roberto Pieraccini
  • Patent number: 7720684
    Abstract: A one-step correction mechanism for voice interaction is provided. Correction of a previous state is enabled simultaneously with recognition in a current or subsequent state. An application is decomposed into a set of tasks. Each task is associated with the collection of one piece of information. Each task may be in a different state. At any point during the interaction, while a task/state pair is active, the dialog manager may enable multiple other task/state pairs to be active in latent fashion. The application developer may then use those facilities or resources to the active task/state and the latent task/state pairs depending on contextual condition of the interaction state of the application.
    Type: Grant
    Filed: April 29, 2005
    Date of Patent: May 18, 2010
    Assignee: Nuance Communications, Inc.
    Inventors: Juan Manuel Huerta, Roberto Pieraccini
  • Publication number: 20100091954
    Abstract: A single, subjective numerical rating to evaluate the performance of a telephone-based spoken dialog system is disclosed. This CE rating is provided by expert human listeners who have knowledge of the design of the dialog system. Different human raters can be trained to achieve a satisfactory level of agreement. Furthermore, a classifier trained on ratings by human experts can reproduce the human ratings with the same degree of consistency. More calls can be given a CE rating than would be possible with limited human resources. More information can be provided about individual calls, e.g., to help decide between two disparate ratings by different human experts.
    Type: Application
    Filed: October 8, 2009
    Publication date: April 15, 2010
    Inventors: Krishna Dayanidhi, Keelan Evanini, Phillip Hunter, Jackson Liscombe, Roberto Pieraccini, David Suendermann, Zor Gorelov
  • Patent number: 7684990
    Abstract: A method for multiple value confirmation and correction in spoken dialog systems. A user is allowed to correct errors in values captured by the spoken dialog system, such that the interaction necessary for error correction between the system and the user is reduced. When the spoken dialog system collects a set of values from a user, the system provides a spoken confirmation of the set of values to the user. The spoken confirmation comprises the set of values and possibly a pause associated with each value. Upon hearing an incorrect value, the user may react, barge in on the spoken confirmation, and provide a corrected value. Responsive to detecting the user interruption during the pause or after the system speaks a value, the system halts the spoken confirmation and collects the corrected value. The system then provides a new spoken confirmation to the user, wherein the new spoken confirmation includes the corrected value.
    Type: Grant
    Filed: April 29, 2005
    Date of Patent: March 23, 2010
    Assignee: Nuance Communications, Inc.
    Inventors: Sasha Porto Caskey, Juan Manuel Huerta, Roberto Pieraccini
  • Publication number: 20080243505
    Abstract: A method for variable resolution and error control in spoken language understanding (SLU) allows arranging the categories of the SLU into a hierarchy of different levels of specificity. The pre-determined hierarchy is used to identify different types of errors, such as high-cost errors and low-cost errors, and to trade, if necessary, high-cost errors for low-cost errors.
    Type: Application
    Filed: March 12, 2008
    Publication date: October 2, 2008
    Inventors: Victor Barinov, Robert Dabrowski, Kalle Levon, Roberto Pieraccini, Krishna Dayanidhi
  • Publication number: 20080183470
    Abstract: A method for multiple value confirmation and correction in spoken dialog systems. A user is allowed to correct errors in values captured by the spoken dialog system, such that the interaction necessary for error correction between the system and the user is reduced. When the spoken dialog system collects a set of values from a user, the system provides a spoken confirmation of the set of values to the user. The spoken confirmation comprises the set of values and possibly a pause associated with each value. Upon hearing an incorrect value, the user may react, barge in on the spoken confirmation, and provide a corrected value. Responsive to detecting the user interruption during the pause or after the system speaks a value, the system halts the spoken confirmation and collects the corrected value. The system then provides a new spoken confirmation to the user, wherein the new spoken confirmation includes the corrected value.
    Type: Application
    Filed: April 2, 2008
    Publication date: July 31, 2008
    Inventors: Sasha Porto Caskey, Juan Manuel Huerta, Roberto Pieraccini
  • Publication number: 20080167875
    Abstract: An embodiment of the invention is a software tool used to convert text, speech synthesis markup language (SSML), and/or extended SSML to synthesized audio. Provisions are made to create, view, play, and edit the synthesized speech, including editing pitch and duration targets, speaking type, paralinguistic events, and prosody. Prosody can be provided by way of a sample recording. Users can interact with the software tool by way of a graphical user interface (GUI). The software tool can produce synthesized audio file output in many file formats.
    Type: Application
    Filed: January 9, 2007
    Publication date: July 10, 2008
    Applicant: International Business Machines Corporation
    Inventors: Raimo Bakis, Ellen M. Eide, Roberto Pieraccini, Maria E. Smith, Jie Zeng
  • Publication number: 20070165808
    Abstract: An interactive voice response (IVR) system which assists in identifying repeat callers, understanding whether they are calling for the same reason as one of their previous calls, and properly disposing of the call. If the repeat caller is calling for the same reason, information from the previous call (or the previous calls) is retrieved and an action based on a defined business logic for repeat callers may then be executed for the current call.
    Type: Application
    Filed: January 10, 2007
    Publication date: July 19, 2007
    Inventors: Roberto Pieraccini, Zor Gorelov, Alan Pan
  • Publication number: 20070165794
    Abstract: An interactive voice response (IVR) system which allows a caller to barge in on all prompts, yet still allows the caller to receive important information contained in the prompts. When a caller barges in on a prompt that contains important information, the IVR system interrupts the prompt to play a short announcement with the goal of reinforcing the importance of listening to the entire prompt. The IVR system then resumes playback of the prompt, either from the beginning or in a modified version.
    Type: Application
    Filed: January 10, 2007
    Publication date: July 19, 2007
    Inventors: Roberto Pieraccini, Eric Woudenberg, Ilija Zeljkovic
  • Publication number: 20070016399
    Abstract: Techniques for detecting data anomalies in a natural language understanding (NLU) system are provided. A number of categorized sentences, categorized into a number of categories, are obtained. Sentences within a given one of the categories are clustered into a number of sub clusters, and the sub clusters are analyzed to identify data anomalies. The clustering can be based on surface forms of the sentences. The anomalies can be, for example, ambiguities or inconsistencies. The clustering can be performed, for example, with a K-means clustering algorithm.
    Type: Application
    Filed: July 12, 2005
    Publication date: January 18, 2007
    Applicant: International Business Machines Corporation
    Inventors: Yuqing Gao, Hong-Kwang Kuo, Roberto Pieraccini, Jerome Quinn, Cheng Wu
  • Publication number: 20060247913
    Abstract: A one-step correction mechanism for voice interaction is provided. Correction of a previous state is enabled simultaneously with recognition in a current or subsequent state. An application is decomposed into a set of tasks. Each task is associated with the collection of one piece of information. Each task may be in a different state. At any point during the interaction, while a task/state pair is active, the dialog manager may enable multiple other task/state pairs to be active in latent fashion. The application developer may then use those facilities or resources to the active task/state and the latent task/state pairs depending on contextual condition of the interaction state of the application.
    Type: Application
    Filed: April 29, 2005
    Publication date: November 2, 2006
    Applicant: International Business Machines Corporation
    Inventors: Juan Huerta, Roberto Pieraccini
  • Publication number: 20060247931
    Abstract: A method for multiple value confirmation and correction in spoken dialog systems. A user is allowed to correct errors in values captured by the spoken dialog system, such that the interaction necessary for error correction between the system and the user is reduced. When the spoken dialog system collects a set of values from a user, the system provides a spoken confirmation of the set of values to the user. The spoken confirmation comprises the set of values and possibly a pause associated with each value. Upon hearing an incorrect value, the user may react, barge in on the spoken confirmation, and provide a corrected value. Responsive to detecting the user interruption during the pause or after the system speaks a value, the system halts the spoken confirmation and collects the corrected value. The system then provides a new spoken confirmation to the user, wherein the new spoken confirmation includes the corrected value.
    Type: Application
    Filed: April 29, 2005
    Publication date: November 2, 2006
    Applicant: International Business Machines Corporation
    Inventors: Sasha Caskey, Juan Huerta, Roberto Pieraccini
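
Illustrative note on the variable-resolution SLU abstract listed above (patent 7962339 and its related applications): the abstract describes arranging categories in a hierarchy and trading high-cost errors for low-cost ones. The sketch below is one possible reading of that idea, not the patented method. It assumes that a wrong specific category is the high-cost error and that a correct but coarser category is the low-cost one. The category names, the `PARENT` table, the score aggregation, and the `threshold` parameter are all hypothetical.

```python
from typing import Dict, Optional

# Hypothetical category hierarchy: child -> parent (None marks the root).
PARENT: Dict[str, Optional[str]] = {
    "billing.dispute": "billing",
    "billing.payment": "billing",
    "billing": "root",
    "tech.internet": "tech",
    "tech": "root",
    "root": None,
}


def aggregate_scores(leaf_scores: Dict[str, float]) -> Dict[str, float]:
    """Add each leaf score to the leaf itself and to all of its ancestors."""
    totals: Dict[str, float] = {}
    for leaf, score in leaf_scores.items():
        node: Optional[str] = leaf
        while node is not None:
            totals[node] = totals.get(node, 0.0) + score
            node = PARENT[node]
    return totals


def classify(leaf_scores: Dict[str, float], threshold: float = 0.7) -> str:
    """Start at the best-scoring leaf and back off to coarser categories
    until the aggregated score reaches the (assumed) confidence threshold."""
    totals = aggregate_scores(leaf_scores)
    node = max(leaf_scores, key=leaf_scores.get)
    while totals[node] < threshold and PARENT[node] is not None:
        node = PARENT[node]
    return node


# The classifier is unsure which billing topic was meant, so it settles
# for the coarser "billing" category instead of guessing a specific one.
uncertain = {"billing.dispute": 0.45, "billing.payment": 0.40, "tech.internet": 0.15}
print(classify(uncertain))
```

A dialog system built this way could then ask a follow-up question within the coarser category rather than route the caller on a possibly wrong specific guess.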