Patents by Inventor Roberto Pieraccini

Roberto Pieraccini has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20230343324
    Abstract: Implementations relate to dynamically adapting a given assistant output based on a given persona, from among a plurality of disparate personas, assigned to an automated assistant. In some implementations, the given assistant output can be generated and subsequently adapted based on the given persona assigned to the automated assistant. In other implementations, the given assistant output can be generated specific to the given persona and without having to subsequently adapt the given assistant output to the given persona. Notably, the given assistant output can include a stream of textual content to be synthesized for audible presentation to the user, and a stream of visual cues utilized in controlling a display of a client device and/or in controlling a visualized representation of the automated assistant. Various implementations utilize large language models (LLMs), or output previously generated utilizing LLMs, to reflect the given persona in the given assistant output.
    Type: Application
    Filed: May 13, 2022
    Publication date: October 26, 2023
    Inventors: Martin Baeuml, Thushan Amarasiriwardena, Roberto Pieraccini, Gianluca Martini
  • Publication number: 20230343323
    Abstract: Implementations relate to dynamically adapting a given assistant output based on a given persona, from among a plurality of disparate personas, assigned to an automated assistant. In some implementations, the given assistant output can be generated and subsequently adapted based on the given persona assigned to the automated assistant. In other implementations, the given assistant output can be generated specific to the given persona and without having to subsequently adapt the given assistant output to the given persona. Notably, the given assistant output can include a stream of textual content to be synthesized for audible presentation to the user, and a stream of visual cues utilized in controlling a display of a client device and/or in controlling a visualized representation of the automated assistant. Various implementations utilize large language models (LLMs), or output previously generated utilizing LLMs, to reflect the given persona in the given assistant output.
    Type: Application
    Filed: April 21, 2022
    Publication date: October 26, 2023
    Inventors: Martin Baeuml, Thushan Amarasiriwardena, Roberto Pieraccini, Gianluca Martini
  • Publication number: 20230074406
    Abstract: As part of a dialog session between a user and an automated assistant, implementations can receive a stream of audio data that captures a spoken utterance including an assistant query, determine, based on processing the stream of audio data, a set of assistant outputs that are each predicted to be responsive to the assistant query, process, using large language model (LLM) output(s), the assistant outputs and context of the dialog session to generate a set of modified assistant outputs, and cause given modified assistant output, from among the set of modified assistant outputs, to be provided for presentation to the user in response to the spoken utterance. In some implementations, the LLM output(s) can be generated in an offline manner for subsequent use in an online manner. In additional or alternative implementations, the LLM output(s) can be generated in an online manner when the spoken utterance is received.
    Type: Application
    Filed: November 22, 2021
    Publication date: March 9, 2023
    Inventors: Martin Baeuml, Thushan Amarasiriwardena, Roberto Pieraccini, Vikram Sridar, Daniel De Freitas Adiwardana, Noam M. Shazeer, Quoc Le
  • Patent number: 9558183
    Abstract: A system and method for localizing a spoken dialog system is disclosed. Source data from a source language spoken dialog system is accessed, including semantic annotations and transcriptions of a plurality of utterances. The transcriptions are machine-translated into a target language. Semantic classifiers are trained on the machine translated transcriptions and the source language semantic annotations.
    Type: Grant
    Filed: September 3, 2010
    Date of Patent: January 31, 2017
    Assignee: Synchronoss Technologies, Inc.
    Inventors: David Suendermann, Jackson Liscombe, Krishna Dayanidhi, Roberto Pieraccini
  • Publication number: 20160193732
    Abstract: A persistent companion robot supports both one-on-one interaction with a human and group interaction with more than one human. The interaction can be directed to a human in detectable proximity, such as a human that is near to the robot, one that is further away from the robot, or any combination of near and far humans. The interaction incorporates multi-modal human input detection (e.g., seeing, hearing, tactile) with multi-modal expression (e.g., movement, speech, non-speech sound, lighting, electronic imagery, and the like.
    Type: Application
    Filed: March 15, 2016
    Publication date: July 7, 2016
    Inventors: Cynthia Breazeal, Robert Todd Pack, Seppo Andrew Rapo, Roberto Pieraccini, Maxim Makachev
  • Publication number: 20160171979
    Abstract: Tiled grammar-based phrase spotting includes orienting at least one segment of a multi-segment robot to facilitate capturing speech of a user arriving at the robot, and configuring a plurality of processing threads of a multi-threaded processing environment into distinct tiles, wherein at least a portion of the plurality of processing threads operate simultaneously on the captured speech to recognize a phrase type using a speech recognition grammar that is associated with the corresponding tile, wherein at least two of the tiles employ different speech recognition grammars to recognize different content in the captured speech.
    Type: Application
    Filed: February 12, 2016
    Publication date: June 16, 2016
    Inventors: Cynthia Breazeal, Roberto Pieraccini, Maxim Makachev
  • Publication number: 20160042065
    Abstract: A method includes steps of indexing a media collection, searching an indexed library and browsing a set of candidate program segments. The step of indexing a media collection creates the indexed library based on a content of the media collection. The step of searching the indexed library identifies the set of candidate program segments based on a search criteria. The step of browsing the set of candidate program segments selects a segment for viewing.
    Type: Application
    Filed: October 26, 2015
    Publication date: February 11, 2016
    Inventors: Andrea Basso, Mehmet Reha Civanlar, David Crawford Gibbon, Qian Huang, Esther Levin, Roberto Pieraccini, Behzad Shahraray
  • Patent number: 9171545
    Abstract: A method includes steps of indexing a media collection, searching an indexed library and browsing a set of candidate program segments. The step of indexing a media collection creates the indexed library based on a content of the media collection. The step of searching the indexed library identifies the set of candidate program segments based on a search criteria. The step of browsing the set of candidate program segments selects a segment for viewing.
    Type: Grant
    Filed: November 30, 2010
    Date of Patent: October 27, 2015
    Assignee: AT&T Intellectual Property II, L.P.
    Inventors: Andrea Basso, Mehmet Reha Civanlar, David Crawford Gibbon, Qian Huang, Esther Levin, Roberto Pieraccini, Behzad Shahraray
  • Patent number: 8849669
    Abstract: An embodiment of the invention is a software tool used to convert text, speech synthesis markup language (SSML), and/or extended SSML to synthesized audio. Provisions are provided to create, view, play, and edit the synthesized speech, including editing pitch and duration targets, speaking type, paralinguistic events, and prosody. Prosody can be provided by way of a sample recording. Users can interact with the software tool by way of a graphical user interface (GUI). The software tool can produce synthesized audio file output in many file formats.
    Type: Grant
    Filed: April 3, 2013
    Date of Patent: September 30, 2014
    Assignee: Nuance Communications, Inc.
    Inventors: Raimo Bakis, Ellen Marie Eide, Roberto Pieraccini, Maria E. Smith, Jie Z. Zeng
  • Patent number: 8831208
    Abstract: A dialog manager for a spoken dialog system. A decision module selects a path from a plurality of alternative paths for a given call, wherein each path implements one of a plurality of strategies for a call flow. A weighting module weights the path selection decision and is connected to a probability estimator for estimating the probability value that a given one of the plurality of paths is the best-performing path.
    Type: Grant
    Filed: September 23, 2011
    Date of Patent: September 9, 2014
    Assignee: Synchronoss Technologies, Inc.
    Inventors: David Suendermann, Jackson Liscombe, Jonathan Bloom, Grace Li, Roberto Pieraccini
  • Patent number: 8682669
    Abstract: A system and a method to generate statistical utterance classifiers optimized for the individual states of a spoken dialog system is disclosed. The system and method make use of large databases of transcribed and annotated utterances from calls collected in a dialog system in production and log data reporting the association between the state of the system at the moment when the utterances were recorded and the utterance. From the system state, being a vector of multiple system variables, subsets of these variables, certain variable ranges, quantized variable values, etc. can be extracted to produce a multitude of distinct utterance subsets matching every possible system state. For each of these subset and variable combinations, statistical classifiers can be trained, tuned, and tested, and the classifiers can be stored together with the performance results and the state subset and variable combination.
    Type: Grant
    Filed: August 21, 2009
    Date of Patent: March 25, 2014
    Assignee: Synchronoss Technologies, Inc.
    Inventors: David Suendermann, Jackson Liscombe, Krishna Dayanidhi, Roberto Pieraccini
  • Publication number: 20140058734
    Abstract: An embodiment of the invention is a software tool used to convert text, speech synthesis markup language (SSML), and/or extended SSML to synthesized audio. Provisions are provided to create, view, play, and edit the synthesized speech, including editing pitch and duration targets, speaking type, paralinguistic events, and prosody. Prosody can be provided by way of a sample recording. Users can interact with the software tool by way of a graphical user interface (GUI). The software tool can produce synthesized audio file output in many file formats.
    Type: Application
    Filed: April 3, 2013
    Publication date: February 27, 2014
    Inventors: Raimo Bakis, Ellen Marie Eide, Roberto Pieraccini, Maria E. Smith, Jie Z. Zeng
  • Patent number: 8543401
    Abstract: A method and apparatus for continuously improving the performance of semantic classifiers in the scope of spoken dialog systems are disclosed. Rule-based or statistical classifiers are replaced with better performing rule-based or statistical classifiers and/or certain parameters of existing classifiers are modified. The replacement classifiers or new parameters are trained and tested on a collection of transcriptions and annotations of utterances which are generated manually or in a partially automated fashion. Automated quality assurance leads to more accurate training and testing data, higher classification performance, and feedback into the design of the spoken dialog system by suggesting changes to improve system behavior.
    Type: Grant
    Filed: April 17, 2009
    Date of Patent: September 24, 2013
    Assignee: Synchronoss Technologies
    Inventors: David Suendermann, Keelan Evanini, Jackson Liscombe, Krishna Dayanidhi, Roberto Pieraccini
  • Patent number: 8520808
    Abstract: A single, subjective numerical rating to evaluate the performance of a telephone-based spoken dialog system. This CE rating is provided by expert human listeners who have knowledge of the design of the dialog system. Different human raters can be trained to achieve a satisfactory level of agreement. Furthermore, a classifier trained on ratings by human experts can reproduce the human ratings with the same degree of consistency. More calls can be given a CE rating than would be possible with limited human resources. More information can be provided about individual calls, e.g., to help decide between two disparate ratings by different human experts.
    Type: Grant
    Filed: October 8, 2009
    Date of Patent: August 27, 2013
    Assignee: Synchronoss Technologies
    Inventors: Krishna Dayanidhi, Keelan Evanini, Phillip Hunter, Jackson Liscombe, Roberto Pieraccini, David Suendermann, Zor Gorelov
  • Patent number: 8438032
    Abstract: An embodiment of the invention is a software tool used to convert text, speech synthesis markup language (SSML), and or extended SSML to synthesized audio. Provisions are provided to create, view, play, and edit the synthesized speech including editing pitch and duration targets, speaking type, paralinguistic events, and prosody. Prosody can be provided by way of a sample recording. Users can interact with the software tool by way of a graphical user interface (GUI). The software tool can produce synthesized audio file output in many file formats.
    Type: Grant
    Filed: January 9, 2007
    Date of Patent: May 7, 2013
    Assignee: Nuance Communications, Inc.
    Inventors: Raimo Bakis, Ellen M. Eide, Roberto Pieraccini, Maria E. Smith, Jie Zeng
  • Patent number: 8433572
    Abstract: A method for multiple value confirmation and correction in spoken dialog systems. A user is allowed to correct errors in values captured by the spoken dialog system, such that the interaction necessary for error correction between the system and the user is reduced. When the spoken dialog system collects a set of values from a user, the system provides a spoken confirmation of the set of values to the user. The spoken confirmation comprises the set of values and possibly pause associated with each value. Upon hearing an incorrect value, the user may react and barge-in the spoken confirmation and provide a corrected value. Responsive to detecting the user interruption during the pause or after the system speaking of a value, the system halts the spoken confirmation and collects the corrected value. The system then provides a new spoken confirmation to the user, wherein the new spoken confirmation includes the corrected value.
    Type: Grant
    Filed: April 2, 2008
    Date of Patent: April 30, 2013
    Assignee: Nuance Communications, Inc.
    Inventors: Sasha Porto Caskey, Juan Manuel Huerta, Roberto Pieraccini
  • Publication number: 20130077767
    Abstract: A dialog manager for a spoken dialog system. A decision module selects a path from a plurality of alternative paths for a given call, wherein each path implements one of a plurality of strategies for a call flow. A weighting module weights the path selection decision and is connected to a probability estimator for estimating the probability value that a given one of the plurality of paths is the best-performing path.
    Type: Application
    Filed: September 23, 2011
    Publication date: March 28, 2013
    Inventors: David SUENDERMANN, Jackson Liscombe, Jonathan Bloom, Grace Li, Roberto Pieraccini
  • Publication number: 20120166183
    Abstract: A system and method for localizing a spoken dialog system is disclosed. Source data from a source language spoken dialog system is accessed, including semantic annotations and transcriptions of a plurality of utterances. The transcriptions are machine-translated into a target language. Semantic classifiers are trained on the machine translated transcriptions and the source language semantic annotations.
    Type: Application
    Filed: September 3, 2010
    Publication date: June 28, 2012
    Inventors: David Suendermann, Jackson Liscombe, Krishna Dayanidhi, Roberto Pieraccini
  • Patent number: 8180025
    Abstract: An interactive voice response (IVR) system which allows a caller to barge-in on all prompts, yet still allows the caller to receive important information contained in the prompts. When a caller barges-in on a prompt that contains important information, the IVR system interrupts the prompt to play a short announcement with the goal of re-enforcing the importance of listening to the entire prompt. The IVR system then resumes playback of the prompt, from the beginning or of a modified version.
    Type: Grant
    Filed: January 10, 2007
    Date of Patent: May 15, 2012
    Assignee: SpeechCycle, Inc.
    Inventors: Roberto Pieraccini, Eric Woudenberg, Ilija Zeljkovic
  • Patent number: 8180639
    Abstract: A method for variable resolution and error control in spoken language understanding (SLU) allows arranging the categories of the SLU into a hierarchy of different levels of specificity. The pre-determined hierarchy is used to identify different types of errors such as high-cost errors and low-cost errors and trade, if necessary, high cost errors for low cost errors.
    Type: Grant
    Filed: May 6, 2011
    Date of Patent: May 15, 2012
    Assignee: SpeechCycle, Inc.
    Inventors: Roberto Pieraccini, Krishna Dayanidhi