Patents by Inventor Salvator D. Lombardo
Salvator D. Lombardo has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11544606Abstract: Systems and methods for compressing target content are disclosed. In one embodiment, a system may include non-transient electronic storage and one or more physical computer processors. The one or more physical computer processors may be configured by machine-readable instructions to obtain the target content comprising one or more frames, wherein a given frame comprises one or more features. The one or more physical computer processors may be configured by machine-readable instructions to obtain a conditioned network. The one or more physical computer processors may be configured by machine-readable instructions to generate decoded target content by applying the conditioned network to the target content.Type: GrantFiled: January 22, 2019Date of Patent: January 3, 2023Assignee: Disney Enterprises, Inc.Inventors: Stephan Marcel Mandt, Christopher Schoers, Jun Han, Salvator D. Lombardo
-
Patent number: 11195533Abstract: A system for incremental natural language understanding includes a media module, a memory storing a software code, and a hardware processor communicatively coupled to the media module. The hardware processor is configured to execute the software code to receive an audio stream including a first utterance, and generate a first and second incremental speech recognition outputs based on first and second portions of the first utterance. In addition, the hardware processor is configured to execute the software code to determine, prior to generating the second incremental speech recognition output, a first intent of the first utterance based on the first incremental speech recognition output. The hardware processor is further configured to execute the software code to retrieve a first resource based on the determined first intent, and incorporate the first resource in the media content to be played by the media module.Type: GrantFiled: March 25, 2020Date of Patent: December 7, 2021Assignee: Disney Enterprises, Inc.Inventors: Komath Naveen Kumar, James R. Kennedy, Salvator D. Lombardo, Prashanth Gurunath Shivakumar
-
Publication number: 20210304773Abstract: A system for incremental natural language understanding includes a media module, a memory storing a software code, and a hardware processor communicatively coupled to the media module. The hardware processor is configured to execute the software code to receive an audio stream including a first utterance, and generate a first and second incremental speech recognition outputs based on first and second portions of the first utterance. In addition, the hardware processor is configured to execute the software code to determine, prior to generating the second incremental speech recognition output, a first intent of the first utterance based on the first incremental speech recognition output. The hardware processor is further configured to execute the software code to retrieve a first resource based on the determined first intent, and incorporate the first resource in the media content to be played by the media module.Type: ApplicationFiled: March 25, 2020Publication date: September 30, 2021Inventors: Komath Naveen Kumar, James R. Kennedy, Salvator D. Lombardo, Prashanth Gurunath Shivakumar
-
Patent number: 11062692Abstract: An audio processing system for generating audio including emotionally expressive synthesized content includes a computing platform having a hardware processor and a memory storing a software code including a trained neural network. The hardware processor is configured to execute the software code to receive an audio sequence template including one or more audio segment(s) and an audio gap, and to receive data describing one or more words for insertion into the audio gap. The hardware processor is configured to further execute the software code to use the trained neural network to generate an integrated audio sequence using the audio sequence template and the data, the integrated audio sequence including the one or more audio segment(s) and at least one synthesized word corresponding to the one or more words described by the data.Type: GrantFiled: September 23, 2019Date of Patent: July 13, 2021Assignee: Disney Enterprises, Inc.Inventors: Salvator D. Lombardo, Komath Naveen Kumar, Douglas A. Fidaleo
-
Patent number: 10997476Abstract: There are provided systems and methods for performing automated content evaluation. In one implementation, the system includes a hardware processor and a system memory storing a software code including a predictive model trained based on an audience response to training content. The hardware processor executes the software code to receive images, each image including facial landmarks of an audience member viewing the content during its duration, and for each image, transforms the facial landmarks to a lower dimensional facial representation, resulting in multiple lower dimensional facial representations of each audience member. For each of a subset of the lower dimensional facial representations of each audience member, the software code utilizes the predictive model to predict one or more responses to the content, resulting in multiple predictions for each audience member, and classifies one or more time segment(s) in the duration of the content based on an aggregate of the predictions.Type: GrantFiled: May 8, 2019Date of Patent: May 4, 2021Assignee: Disney Enterprises, Inc.Inventors: Salvator D. Lombardo, Cristina Segalin, Lei Chen, Rajitha D. Navarathna, Stephan Marcel Mandt
-
Publication number: 20210090549Abstract: An audio processing system for generating audio including emotionally expressive synthesized content includes a computing platform having a hardware processor and a memory storing a software code including a trained neural network. The hardware processor is configured to execute the software code to receive an audio sequence template including one or more audio segment(s) and an audio gap, and to receive data describing one or more words for insertion into the audio gap. The hardware processor is configured to further execute the software code to use the trained neural network to generate an integrated audio sequence using the audio sequence template and the data, the integrated audio sequence including the one or more audio segment(s) and at least one synthesized word corresponding to the one or more words described by the data.Type: ApplicationFiled: September 23, 2019Publication date: March 25, 2021Inventors: Salvator D. Lombardo, Komath Naveen Kumar, Douglas A. Fidaleo
-
Publication number: 20200151524Abstract: There are provided systems and methods for performing automated content evaluation. In one implementation, the system includes a hardware processor and a system memory storing a software code including a predictive model trained based on an audience response to training content. The hardware processor executes the software code to receive images, each image including facial landmarks of an audience member viewing the content during its duration, and for each image, transforms the facial landmarks to a lower dimensional facial representation, resulting in multiple lower dimensional facial representations of each audience member. For each of a subset of the lower dimensional facial representations of each audience member, the software code utilizes the predictive model to predict one or more responses to the content, resulting in multiple predictions for each audience member, and classifies one or more time segment(s) in the duration of the content based on an aggregate of the predictions.Type: ApplicationFiled: May 8, 2019Publication date: May 14, 2020Inventors: Salvator D. Lombardo, Cristina Segalin, Lei Chen, Rajitha D. Navarathna, Stephan Marcel Mandt
-
Publication number: 20200090069Abstract: Systems and methods for compressing target content are disclosed. In one embodiment, a system may include non-transient electronic storage and one or more physical computer processors. The one or more physical computer processors may be configured by machine-readable instructions to obtain the target content comprising one or more frames, wherein a given frame comprises one or more features. The one or more physical computer processors may be configured by machine-readable instructions to obtain a conditioned network. The one or more physical computer processors may be configured by machine-readable instructions to generate decoded target content by applying the conditioned network to the target content.Type: ApplicationFiled: January 22, 2019Publication date: March 19, 2020Applicant: Disney Enterprises, Inc.Inventors: Stephan Marcel Mandt, Christopher Schoers, Jun Han, Salvator D. Lombardo