Patents by Inventor Roberto Pieraccini

Roberto Pieraccini has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

DYNAMICALLY ADAPTING GIVEN ASSISTANT OUTPUT BASED ON A GIVEN PERSONA ASSIGNED TO AN AUTOMATED ASSISTANT

Publication number: 20230343324

Abstract: Implementations relate to dynamically adapting a given assistant output based on a given persona, from among a plurality of disparate personas, assigned to an automated assistant. In some implementations, the given assistant output can be generated and subsequently adapted based on the given persona assigned to the automated assistant. In other implementations, the given assistant output can be generated specific to the given persona and without having to subsequently adapt the given assistant output to the given persona. Notably, the given assistant output can include a stream of textual content to be synthesized for audible presentation to the user, and a stream of visual cues utilized in controlling a display of a client device and/or in controlling a visualized representation of the automated assistant. Various implementations utilize large language models (LLMs), or output previously generated utilizing LLMs, to reflect the given persona in the given assistant output.

Type: Application

Filed: May 13, 2022

Publication date: October 26, 2023

Inventors: Martin Baeuml, Thushan Amarasiriwardena, Roberto Pieraccini, Gianluca Martini
DYNAMICALLY ADAPTING GIVEN ASSISTANT OUTPUT BASED ON A GIVEN PERSONA ASSIGNED TO AN AUTOMATED ASSISTANT

Publication number: 20230343323

Abstract: Implementations relate to dynamically adapting a given assistant output based on a given persona, from among a plurality of disparate personas, assigned to an automated assistant. In some implementations, the given assistant output can be generated and subsequently adapted based on the given persona assigned to the automated assistant. In other implementations, the given assistant output can be generated specific to the given persona and without having to subsequently adapt the given assistant output to the given persona. Notably, the given assistant output can include a stream of textual content to be synthesized for audible presentation to the user, and a stream of visual cues utilized in controlling a display of a client device and/or in controlling a visualized representation of the automated assistant. Various implementations utilize large language models (LLMs), or output previously generated utilizing LLMs, to reflect the given persona in the given assistant output.

Type: Application

Filed: April 21, 2022

Publication date: October 26, 2023

Inventors: Martin Baeuml, Thushan Amarasiriwardena, Roberto Pieraccini, Gianluca Martini
USING LARGE LANGUAGE MODEL(S) IN GENERATING AUTOMATED ASSISTANT RESPONSE(S)

Publication number: 20230074406

Abstract: As part of a dialog session between a user and an automated assistant, implementations can receive a stream of audio data that captures a spoken utterance including an assistant query, determine, based on processing the stream of audio data, a set of assistant outputs that are each predicted to be responsive to the assistant query, process, using large language model (LLM) output(s), the assistant outputs and context of the dialog session to generate a set of modified assistant outputs, and cause given modified assistant output, from among the set of modified assistant outputs, to be provided for presentation to the user in response to the spoken utterance. In some implementations, the LLM output(s) can be generated in an offline manner for subsequent use in an online manner. In additional or alternative implementations, the LLM output(s) can be generated in an online manner when the spoken utterance is received.

Type: Application

Filed: November 22, 2021

Publication date: March 9, 2023

Inventors: Martin Baeuml, Thushan Amarasiriwardena, Roberto Pieraccini, Vikram Sridar, Daniel De Freitas Adiwardana, Noam M. Shazeer, Quoc Le
System and method for the localization of statistical classifiers based on machine translation

Patent number: 9558183

Abstract: A system and method for localizing a spoken dialog system is disclosed. Source data from a source language spoken dialog system is accessed, including semantic annotations and transcriptions of a plurality of utterances. The transcriptions are machine-translated into a target language. Semantic classifiers are trained on the machine translated transcriptions and the source language semantic annotations.

Type: Grant

Filed: September 3, 2010

Date of Patent: January 31, 2017

Assignee: Synchronoss Technologies, Inc.

Inventors: David Suendermann, Jackson Liscombe, Krishna Dayanidhi, Roberto Pieraccini
ENGAGING IN HUMAN-BASED SOCIAL INTERACTION WITH MEMBERS OF A GROUP USING A PERSISTENT COMPANION DEVICE

Publication number: 20160193732

Abstract: A persistent companion robot supports both one-on-one interaction with a human and group interaction with more than one human. The interaction can be directed to a human in detectable proximity, such as a human that is near to the robot, one that is further away from the robot, or any combination of near and far humans. The interaction incorporates multi-modal human input detection (e.g., seeing, hearing, tactile) with multi-modal expression (e.g., movement, speech, non-speech sound, lighting, electronic imagery, and the like).

Type: Application

Filed: March 15, 2016

Publication date: July 7, 2016

Inventors: Cynthia Breazeal, Robert Todd Pack, Seppo Andrew Rapo, Roberto Pieraccini, Maxim Makachev
TILED GRAMMAR FOR PHRASE SPOTTING WITH A PERSISTENT COMPANION DEVICE

Publication number: 20160171979

Abstract: Tiled grammar-based phrase spotting includes orienting at least one segment of a multi-segment robot to facilitate capturing speech of a user arriving at the robot, and configuring a plurality of processing threads of a multi-threaded processing environment into distinct tiles, wherein at least a portion of the plurality of processing threads operate simultaneously on the captured speech to recognize a phrase type using a speech recognition grammar that is associated with the corresponding tile, wherein at least two of the tiles employ different speech recognition grammars to recognize different content in the captured speech.

Type: Application

Filed: February 12, 2016

Publication date: June 16, 2016

Inventors: Cynthia Breazeal, Roberto Pieraccini, Maxim Makachev
Browsing and Retrieval of Full Broadcast-Quality Video

Publication number: 20160042065

Abstract: A method includes steps of indexing a media collection, searching an indexed library and browsing a set of candidate program segments. The step of indexing a media collection creates the indexed library based on a content of the media collection. The step of searching the indexed library identifies the set of candidate program segments based on a search criteria. The step of browsing the set of candidate program segments selects a segment for viewing.

Type: Application

Filed: October 26, 2015

Publication date: February 11, 2016

Inventors: Andrea Basso, Mehmet Reha Civanlar, David Crawford Gibbon, Qian Huang, Esther Levin, Roberto Pieraccini, Behzad Shahraray
Browsing and retrieval of full broadcast-quality video

Patent number: 9171545

Abstract: A method includes steps of indexing a media collection, searching an indexed library and browsing a set of candidate program segments. The step of indexing a media collection creates the indexed library based on a content of the media collection. The step of searching the indexed library identifies the set of candidate program segments based on a search criteria. The step of browsing the set of candidate program segments selects a segment for viewing.

Type: Grant

Filed: November 30, 2010

Date of Patent: October 27, 2015

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Andrea Basso, Mehmet Reha Civanlar, David Crawford Gibbon, Qian Huang, Esther Levin, Roberto Pieraccini, Behzad Shahraray
System for tuning synthesized speech

Patent number: 8849669

Abstract: An embodiment of the invention is a software tool used to convert text, speech synthesis markup language (SSML), and/or extended SSML to synthesized audio. Provisions are provided to create, view, play, and edit the synthesized speech, including editing pitch and duration targets, speaking type, paralinguistic events, and prosody. Prosody can be provided by way of a sample recording. Users can interact with the software tool by way of a graphical user interface (GUI). The software tool can produce synthesized audio file output in many file formats.

Type: Grant

Filed: April 3, 2013

Date of Patent: September 30, 2014

Assignee: Nuance Communications, Inc.

Inventors: Raimo Bakis, Ellen Marie Eide, Roberto Pieraccini, Maria E. Smith, Jie Z. Zeng
System and method for optimizing call flows of a spoken dialog system

Patent number: 8831208

Abstract: A dialog manager for a spoken dialog system. A decision module selects a path from a plurality of alternative paths for a given call, wherein each path implements one of a plurality of strategies for a call flow. A weighting module weights the path selection decision and is connected to a probability estimator for estimating the probability value that a given one of the plurality of paths is the best-performing path.

Type: Grant

Filed: September 23, 2011

Date of Patent: September 9, 2014

Assignee: Synchronoss Technologies, Inc.

Inventors: David Suendermann, Jackson Liscombe, Jonathan Bloom, Grace Li, Roberto Pieraccini
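
The abstract above describes a decision module whose path choice is weighted by an estimate of the probability that each path is the best-performing one. The following is only a minimal sketch of that idea, not the patented method: it assumes each path's performance is summarized by hypothetical success and failure counts, and it swaps in a Beta-posterior Monte Carlo estimate for the unspecified probability estimator. The function names and the count-based representation are illustrative assumptions.

```python
import random


def estimate_best_probabilities(stats, draws=2000, rng=None):
    """Estimate, for each path, the probability that it is the best performer.

    `stats` maps a path name to a (successes, failures) pair. This count-based
    representation is an assumption made for illustration only.
    """
    rng = rng or random.Random(0)
    wins = {name: 0 for name in stats}
    for _ in range(draws):
        # Draw one plausible success rate per path from a Beta posterior
        # (uniform prior), then record which path came out on top.
        samples = {
            name: rng.betavariate(successes + 1, failures + 1)
            for name, (successes, failures) in stats.items()
        }
        wins[max(samples, key=samples.get)] += 1
    return {name: count / draws for name, count in wins.items()}


def select_path(stats, rng=None):
    """Pick a path for the next call, weighted by its probability of being best."""
    rng = rng or random.Random(0)
    probabilities = estimate_best_probabilities(stats, rng=rng)
    names = list(probabilities)
    weights = [probabilities[name] for name in names]
    return rng.choices(names, weights=weights, k=1)[0]


if __name__ == "__main__":
    # Hypothetical call-flow strategies with made-up outcome counts.
    observed = {"open_prompt": (90, 10), "menu_prompt": (50, 50)}
    print(estimate_best_probabilities(observed))
    print(select_path(observed))
```

Weighting the selection rather than always taking the current leader keeps some traffic on the alternatives, so their estimates can still be revised as more calls are observed.
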
System and method for building optimal state-dependent statistical utterance classifiers in spoken dialog systems

Patent number: 8682669

Abstract: A system and a method to generate statistical utterance classifiers optimized for the individual states of a spoken dialog system is disclosed. The system and method make use of large databases of transcribed and annotated utterances from calls collected in a dialog system in production and log data reporting the association between the state of the system at the moment when the utterances were recorded and the utterance. From the system state, being a vector of multiple system variables, subsets of these variables, certain variable ranges, quantized variable values, etc. can be extracted to produce a multitude of distinct utterance subsets matching every possible system state. For each of these subset and variable combinations, statistical classifiers can be trained, tuned, and tested, and the classifiers can be stored together with the performance results and the state subset and variable combination.

Type: Grant

Filed: August 21, 2009

Date of Patent: March 25, 2014

Assignee: Synchronoss Technologies, Inc.

Inventors: David Suendermann, Jackson Liscombe, Krishna Dayanidhi, Roberto Pieraccini
SYSTEM FOR TUNING SYNTHESIZED SPEECH

Publication number: 20140058734

Abstract: An embodiment of the invention is a software tool used to convert text, speech synthesis markup language (SSML), and/or extended SSML to synthesized audio. Provisions are provided to create, view, play, and edit the synthesized speech, including editing pitch and duration targets, speaking type, paralinguistic events, and prosody. Prosody can be provided by way of a sample recording. Users can interact with the software tool by way of a graphical user interface (GUI). The software tool can produce synthesized audio file output in many file formats.

Type: Application

Filed: April 3, 2013

Publication date: February 27, 2014

Inventors: Raimo Bakis, Ellen Marie Eide, Roberto Pieraccini, Maria E. Smith, Jie Z. Zeng
System and method for improving performance of semantic classifiers in spoken dialog systems

Patent number: 8543401

Abstract: A method and apparatus for continuously improving the performance of semantic classifiers in the scope of spoken dialog systems are disclosed. Rule-based or statistical classifiers are replaced with better performing rule-based or statistical classifiers and/or certain parameters of existing classifiers are modified. The replacement classifiers or new parameters are trained and tested on a collection of transcriptions and annotations of utterances which are generated manually or in a partially automated fashion. Automated quality assurance leads to more accurate training and testing data, higher classification performance, and feedback into the design of the spoken dialog system by suggesting changes to improve system behavior.

Type: Grant

Filed: April 17, 2009

Date of Patent: September 24, 2013

Assignee: Synchronoss Technologies

Inventors: David Suendermann, Keelan Evanini, Jackson Liscombe, Krishna Dayanidhi, Roberto Pieraccini
System and method for robust evaluation of the user experience in automated spoken dialog systems

Patent number: 8520808

Abstract: A single, subjective numerical rating to evaluate the performance of a telephone-based spoken dialog system. This CE rating is provided by expert human listeners who have knowledge of the design of the dialog system. Different human raters can be trained to achieve a satisfactory level of agreement. Furthermore, a classifier trained on ratings by human experts can reproduce the human ratings with the same degree of consistency. More calls can be given a CE rating than would be possible with limited human resources. More information can be provided about individual calls, e.g., to help decide between two disparate ratings by different human experts.

Type: Grant

Filed: October 8, 2009

Date of Patent: August 27, 2013

Assignee: Synchronoss Technologies

Inventors: Krishna Dayanidhi, Keelan Evanini, Phillip Hunter, Jackson Liscombe, Roberto Pieraccini, David Suendermann, Zor Gorelov
System for tuning synthesized speech

Patent number: 8438032

Abstract: An embodiment of the invention is a software tool used to convert text, speech synthesis markup language (SSML), and/or extended SSML to synthesized audio. Provisions are provided to create, view, play, and edit the synthesized speech, including editing pitch and duration targets, speaking type, paralinguistic events, and prosody. Prosody can be provided by way of a sample recording. Users can interact with the software tool by way of a graphical user interface (GUI). The software tool can produce synthesized audio file output in many file formats.

Type: Grant

Filed: January 9, 2007

Date of Patent: May 7, 2013

Assignee: Nuance Communications, Inc.

Inventors: Raimo Bakis, Ellen M. Eide, Roberto Pieraccini, Maria E. Smith, Jie Zeng
Method and apparatus for multiple value confirmation and correction in spoken dialog system

Patent number: 8433572

Abstract: A method for multiple value confirmation and correction in spoken dialog systems. A user is allowed to correct errors in values captured by the spoken dialog system, such that the interaction necessary for error correction between the system and the user is reduced. When the spoken dialog system collects a set of values from a user, the system provides a spoken confirmation of the set of values to the user. The spoken confirmation comprises the set of values and possibly a pause associated with each value. Upon hearing an incorrect value, the user may react, barge in on the spoken confirmation, and provide a corrected value. Responsive to detecting the user interruption during the pause or after the system speaks a value, the system halts the spoken confirmation and collects the corrected value. The system then provides a new spoken confirmation to the user, wherein the new spoken confirmation includes the corrected value.

Type: Grant

Filed: April 2, 2008

Date of Patent: April 30, 2013

Assignee: Nuance Communications, Inc.

Inventors: Sasha Porto Caskey, Juan Manuel Huerta, Roberto Pieraccini
SYSTEM AND METHOD FOR OPTIMIZING CALL FLOWS OF A SPOKEN DIALOG SYSTEM

Publication number: 20130077767

Abstract: A dialog manager for a spoken dialog system. A decision module selects a path from a plurality of alternative paths for a given call, wherein each path implements one of a plurality of strategies for a call flow. A weighting module weights the path selection decision and is connected to a probability estimator for estimating the probability value that a given one of the plurality of paths is the best-performing path.

Type: Application

Filed: September 23, 2011

Publication date: March 28, 2013

Inventors: David Suendermann, Jackson Liscombe, Jonathan Bloom, Grace Li, Roberto Pieraccini
SYSTEM AND METHOD FOR THE LOCALIZATION OF STATISTICAL CLASSIFIERS BASED ON MACHINE TRANSLATION

Publication number: 20120166183

Abstract: A system and method for localizing a spoken dialog system is disclosed. Source data from a source language spoken dialog system is accessed, including semantic annotations and transcriptions of a plurality of utterances. The transcriptions are machine-translated into a target language. Semantic classifiers are trained on the machine translated transcriptions and the source language semantic annotations.

Type: Application

Filed: September 3, 2010

Publication date: June 28, 2012

Inventors: David Suendermann, Jackson Liscombe, Krishna Dayanidhi, Roberto Pieraccini
Methods and apparatus for enforcing caller listening behavior on interactive voice response applications

Patent number: 8180025

Abstract: An interactive voice response (IVR) system which allows a caller to barge-in on all prompts, yet still allows the caller to receive important information contained in the prompts. When a caller barges-in on a prompt that contains important information, the IVR system interrupts the prompt to play a short announcement with the goal of re-enforcing the importance of listening to the entire prompt. The IVR system then resumes playback of the prompt, from the beginning or of a modified version.

Type: Grant

Filed: January 10, 2007

Date of Patent: May 15, 2012

Assignee: SpeechCycle, Inc.

Inventors: Roberto Pieraccini, Eric Woudenberg, Ilija Zeljkovic
Method for variable resolution and error control in spoken language understanding

Patent number: 8180639

Abstract: A method for variable resolution and error control in spoken language understanding (SLU) allows arranging the categories of the SLU into a hierarchy of different levels of specificity. The pre-determined hierarchy is used to identify different types of errors, such as high-cost errors and low-cost errors, and to trade, if necessary, high-cost errors for low-cost errors.

Type: Grant

Filed: May 6, 2011

Date of Patent: May 15, 2012

Assignee: SpeechCycle, Inc.

Inventors: Roberto Pieraccini, Krishna Dayanidhi
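
As a rough illustration of the hierarchy described in the abstract above, the sketch below backs off from a specific category to a more general one when the classifier is not confident enough. It is not the patented method: the category names, the `PARENT` table, the score-summing rule, and the threshold are all hypothetical choices made for this example. The idea it shows is that answering with a broader category (a low-cost error at worst) can be preferable to committing to a wrong specific category (a high-cost error).

```python
# Hypothetical category hierarchy: each node maps to its parent.
PARENT = {
    "billing.dispute": "billing",
    "billing.payment": "billing",
    "tech.internet": "tech",
    "tech.tv": "tech",
    "billing": "root",
    "tech": "root",
}


def resolve(leaf_scores, threshold=0.6):
    """Return the most specific category whose score reaches `threshold`.

    `leaf_scores` maps leaf categories to classifier scores. A parent's score
    is taken to be the sum of its descendants' scores (an assumption).
    """
    totals = dict(leaf_scores)
    for leaf, score in leaf_scores.items():
        # Propagate each leaf score to every ancestor in the hierarchy.
        node = PARENT.get(leaf)
        while node is not None:
            totals[node] = totals.get(node, 0.0) + score
            node = PARENT.get(node)

    # Start at the best leaf and climb until the score is high enough
    # or the top of the hierarchy is reached.
    node = max(leaf_scores, key=leaf_scores.get)
    while totals[node] < threshold and node in PARENT:
        node = PARENT[node]
    return node


if __name__ == "__main__":
    ambiguous = {
        "billing.dispute": 0.45,
        "billing.payment": 0.40,
        "tech.internet": 0.10,
        "tech.tv": 0.05,
    }
    print(resolve(ambiguous))
```

With the scores shown, no single leaf reaches the threshold, so the sketch settles on the broader `billing` category instead of guessing between the two billing leaves.
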
