Patents by Inventor Daniela Braga

Daniela Braga has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Identifying workers in a crowdsourcing or microtasking platform who perform low-quality work and/or are really automated bots

Patent number: 11436548

Abstract: A facility for identifying workers in a crowdsourcing or micro-tasking platform who perform low-quality work and/or are really automated bots is described. To identify users who perform low-quality work and/or are really bots, the facility (1) measures the quality of at least a portion of the work done by each user, and (2) tracks the pattern of behavior performed by each user on the platform—such as which work projects they select, the content of the responses, and the timing of each user interface interaction. The facility uses this information to build and maintain a model, such as a statistical model, that uses the pattern of a user's behavior to predict the level of quality of the user's work. Users for which the model predicts a low level of quality are flagged for manual review, or automatically suspended from working or from receiving payment.

Type: Grant

Filed: November 17, 2017

Date of Patent: September 6, 2022

Assignee: DefinedCrowd Corporation

Inventors: Joao Freitas, Daniela Braga, Andre Pontes
Workflow for defining a multimodal crowdsourced or microtasking project

Patent number: 11315051

Abstract: A facility for providing a workflow tailored to defining a project for collecting multimodal data from each of a set of crowdsourcing or microtasking platform workers is described. The facility enables customers of a crowdsourcing or microtasking platform to easily define multimodal data collection projects. The facility enables customers to define any of the following types of information associated with multimodal data collection projects: worker requirements, project environment parameters, video data, audio data, physiological data, and/or location-related data. Some of this data is collected using different kinds of sensors in one or more devices (e.g., smart phones, fitness wearables, etc.) associated with the crowdsourcing or micro-tasking platforms' workers. Prior to computing data results generated by executing a multimodal data collection project, the facility an align at least a first portion of the collected data with a second portion of the second data.

Type: Grant

Filed: May 11, 2018

Date of Patent: April 26, 2022

Assignee: DefinedCrowd Corporation

Inventors: Daniela Braga, Joao Freitas, Sara Oliveira
System and method for validating natural language content using crowdsourced validation jobs

Patent number: 11069361

Abstract: A method includes causing a first crowdsourced validation job to be provided to one or more first validation devices, the first crowdsourced validation job comprising first instructions for a crowd user to provide an indication of an accuracy of a transcription of natural language content, receiving a plurality of responses from the one or more first validation devices, wherein the plurality of responses include at least a first response from at least a first validation device from among the one or more first validation devices, the first response including a first indication of an accuracy of the transcription of the natural language content, and determining a first confidence score of the first validation device based, at least in part, on the plurality of responses received from the one or more first validation devices and the first response received from the first validation device.

Type: Grant

Filed: November 14, 2019

Date of Patent: July 20, 2021

Inventors: Spencer John Rothwell, Daniela Braga, Ahmad Khamis Elshenawy, Stephen Steele Carter
Crowdsourced training of textual natural language understanding systems

Patent number: 10769187

Abstract: A facility to crowdsource training of virtual assistants and other textual natural language understanding systems is described. The facility first specifies a set of possible user intents (e.g., a kind of question asked by users). As part of specifying an intent, entities, that represent salient items of information associated with the intent are identified. Then, for each of the intents, the facility directs users of a crowdsourcing platform to input a number of different textual queries they might use to express this intent. Then, additional crowdsourcing platform users are asked to perform semantic annotation of the cleaned queries, for each selecting its intent and entities from predefined lists. Next, still other crowdsourcing platform users are asked whether the selection of intents and entities during semantic annotation was correct for each query. Once validated, the annotated queries are used to train the assistant.

Type: Grant

Filed: November 25, 2019

Date of Patent: September 8, 2020

Assignee: DefinedCrowd Corporation

Inventors: Daniela Braga, Joao Freitas, Daan Baldewijns
SYSTEM AND METHOD FOR VALIDATING NATURAL LANGUAGE CONTENT USING CROWDSOURCED VALIDATION JOBS

Publication number: 20200126562

Abstract: Systems and methods of validating transcriptions of natural language content using crowdsourced validation jobs are provided herein. In various implementations, a transcription pair comprising natural language content and text corresponding to a transcription of the natural language content may be gathered. A group of validation devices may be selected for reviewing the transcription pair. A crowdsourced validation job may be created for the group of validation devices. The crowdsourced validation job may be provided to the group of validation devices. One or more votes representing whether or not the text accurately represents the natural language content may be received from the group of validation devices. Based on the one or more votes received, the transcription pair may be stored in a validated transcription library, which may be used to process end-user voice data.

Type: Application

Filed: November 14, 2019

Publication date: April 23, 2020

Applicant: CERENCE OPERATING COMPANY

Inventors: Spencer John ROTHWELL, Daniela BRAGA, Ahmad Khamis ELSHENAWY, Stephen Steele CARTER
CROWDSOURCED TRAINING OF TEXTUAL NATURAL LANGUAGE UNDERSTANDING SYSTEMS

Publication number: 20200089698

Abstract: A facility to crowdsource training of virtual assistants and other textual natural language understanding systems is described. The facility first specifies a set of possible user intents (e.g., a kind of question asked by users). As part of specifying an intent, entities, that represent salient items of information associated with the intent are identified. Then, for each of the intents, the facility directs users of a crowdsourcing platform to input a number of different textual queries they might use to express this intent. Then, additional crowdsourcing platform users are asked to perform semantic annotation of the cleaned queries, for each selecting its intent and entities from predefined lists. Next, still other crowdsourcing platform users are asked whether the selection of intents and entities during semantic annotation was correct for each query. Once validated, the annotated queries are used to train the assistant.

Type: Application

Filed: November 25, 2019

Publication date: March 19, 2020

Inventors: Daniela Braga, Joao Freitas, Daan Baldewijns
Crowdsourced training of textual natural language understanding systems

Patent number: 10528605

Abstract: A facility to crowdsource training of virtual assistants and other textual natural language understanding systems is described. The facility first specifies a set of possible user intents (e.g., a kind of question asked by users). As part of specifying an intent, entities, that represent salient items of information associated with the intent are identified. Then, for each of the intents, the facility directs users of a crowdsourcing platform to input a number of different textual queries they might use to express this intent. Then, additional crowdsourcing platform users are asked to perform semantic annotation of the cleaned queries, for each selecting its intent and entities from predefined lists. Next, still other crowdsourcing platform users are asked whether the selection of intents and entities during semantic annotation was correct for each query. Once validated, the annotated queries are used to train the assistant.

Type: Grant

Filed: November 16, 2017

Date of Patent: January 7, 2020

Assignee: DefinedCrowd Corporation

Inventors: Daniela Braga, Joao Freitas, Daan Baldewijns
System and method for validating natural language content using crowdsourced validation jobs

Patent number: 10504522

Abstract: Systems and methods of validating transcriptions of natural language content using crowdsourced validation jobs are provided herein. In various implementations, a transcription pair comprising natural language content and text corresponding to a transcription of the natural language content may be gathered. A group of validation devices may be selected for reviewing the transcription pair. A crowdsourced validation job may be created for the group of validation devices. The crowdsourced validation job may be provided to the group of validation devices. One or more votes representing whether or not the text accurately represents the natural language content may be received from the group of validation devices. Based on the one or more votes received, the transcription pair may be stored in a validated transcription library, which may be used to process end-user voice data.

Type: Grant

Filed: March 19, 2018

Date of Patent: December 10, 2019

Assignee: Voicebox Technologies Corporation

Inventors: Spencer John Rothwell, Daniela Braga, Ahmad Khamis Elshenawy, Stephen Steele Carter
System and method of annotating utterances based on tags assigned by unmanaged crowds

Patent number: 10394944

Abstract: A system and method of tagging utterances with Named Entity Recognition (“NER”) labels using unmanaged crowds is provided. The system may generate various annotation jobs in which a user, among a crowd, is asked to tag which parts of an utterance, if any, relate to various entities associated with a domain. For a given domain that is associated with a number of entities that exceeds a threshold N value, multiple batches of jobs (each batch having jobs that have a limited number of entities for tagging) may be used to tag a given utterance from that domain. This reduces the cognitive load imposed on a user, and prevents the user from having to tag more than N entities. As such, a domain with a large number of entities may be tagged efficiently by crowd participants without overloading each crowd participant with too many entities to tag.

Type: Grant

Filed: August 14, 2017

Date of Patent: August 27, 2019

Assignee: VoiceBox Technologies Corporation

Inventors: Spencer John Rothwell, Daniela Braga, Ahmad Khamis Elshenawy, Stephen Steele Carter
WORKFLOW FOR DEFINING A MULTIMODAL CROWDSOURCED OR MICROTASKING PROJECT

Publication number: 20180330311

Abstract: A facility for providing a workflow tailored to defining a project for collecting multimodal data from each of a set of crowdsourcing or microtasking platform workers is described. The facility enables customers of a crowdsourcing or microtasking platform to easily define multimodal data collection projects. The facility enables customers to define any of the following types of information associated with multimodal data collection projects: worker requirements, project environment parameters, video data, audio data, physiological data, and/or location-related data. Some of this data is collected using different kinds of sensors in one or more devices (e.g., smart phones, fitness wearables, etc.) associated with the crowdsourcing or micro-tasking platforms' workers. Prior to computing data results generated by executing a multimodal data collection project, the facility an align at least a first portion of the collected data with a second portion of the second data.

Type: Application

Filed: May 11, 2018

Publication date: November 15, 2018

Inventors: Daniela Braga, Joao Freitas, Sara Oliveira
SYSTEM AND METHOD FOR VALIDATING NATURAL LANGUAGE CONTENT USING CROWDSOURCED VALIDATION JOBS

Publication number: 20180277118

Abstract: Systems and methods of validating transcriptions of natural language content using crowdsourced validation jobs are provided herein. In various implementations, a transcription pair comprising natural language content and text corresponding to a transcription of the natural language content may be gathered. A group of validation devices may be selected for reviewing the transcription pair. A crowdsourced validation job may be created for the group of validation devices. The crowdsourced validation job may be provided to the group of validation devices. One or more votes representing whether or not the text accurately represents the natural language content may be received from the group of validation devices. Based on the one or more votes received, the transcription pair may be stored in a validated transcription library, which may be used to process end-user voice data.

Type: Application

Filed: March 19, 2018

Publication date: September 27, 2018

Applicant: VOICEBOX TECHNOLOGIES CORPORATION

Inventors: Spencer John ROTHWELL, Daniela BRAGA, Ahmad Khamis ELSHENAWY, Stephen Steele CARTER
IDENTIFYING WORKERS IN A CROWDSOURCING OR MICROTASKING PLATFORM WHO PERFORM LOW-QUALITY WORK AND/OR ARE REALLY AUTOMATED BOTS

Publication number: 20180144283

Abstract: A facility for identifying workers in a crowdsourcing or micro-tasking platform who perform low-quality work and/or are really automated bots is described. To identify users who perform low-quality work and/or are really bots, the facility (1) measures the quality of at least a portion of the work done by each user, and (2) tracks the pattern of behavior performed by each user on the platform—such as which work projects they select, the content of the responses, and the timing of each user interface interaction. The facility uses this information to build and maintain a model, such as a statistical model, that uses the pattern of a user's behavior to predict the level of quality of the user's work. Users for which the model predicts a low level of quality are flagged for manual review, or automatically suspended from working or from receiving payment.

Type: Application

Filed: November 17, 2017

Publication date: May 24, 2018

Inventors: Joao Freitas, Daniela Braga, Andre Pontes
CROWDSOURCED TRAINING OF TEXTUAL NATURAL LANGUAGE UNDERSTANDING SYSTEMS

Publication number: 20180144046

Abstract: A facility to crowdsource training of virtual assistants and other textual natural language understanding systems is described. The facility first specifies a set of possible user intents (e.g., a kind of question asked by users). As part of specifying an intent, entities, that represent salient items of information associated with the intent are identified. Then, for each of the intents, the facility directs users of a crowdsourcing platform to input a number of different textual queries they might use to express this intent. Then, additional crowdsourcing platform users are asked to perform semantic annotation of the cleaned queries, for each selecting its intent and entities from predefined lists. Next, still other crowdsourcing platform users are asked whether the selection of intents and entities during semantic annotation was correct for each query. Once validated, the annotated queries are used to train the assistant.

Type: Application

Filed: November 16, 2017

Publication date: May 24, 2018

Inventors: Daniela Braga, Joao Freitas, Daan Baldewijns
SYSTEM AND METHOD OF ANNOTATING UTTERANCES BASED ON TAGS ASSIGNED BY UNMANAGED CROWDS

Publication number: 20180121405

Abstract: A system and method of tagging utterances with Named Entity Recognition (“NER”) labels using unmanaged crowds is provided. The system may generate various annotation jobs in which a user, among a crowd, is asked to tag which parts of an utterance, if any, relate to various entities associated with a domain. For a given domain that is associated with a number of entities that exceeds a threshold N value, multiple batches of jobs (each batch having jobs that have a limited number of entities for tagging) may be used to tag a given utterance from that domain. This reduces the cognitive load imposed on a user, and prevents the user from having to tag more than N entities. As such, a domain with a large number of entities may be tagged efficiently by crowd participants without overloading each crowd participant with too many entities to tag.

Type: Application

Filed: August 14, 2017

Publication date: May 3, 2018

Applicant: VoiceBox Technologies Corporation

Inventors: Spencer John ROTHWELL, Daniela BRAGA, Ahmad Khamis ELSHENAWY, Stephen Steele CARTER
System and method for validating natural language content using crowdsourced validation jobs

Patent number: 9922653

Abstract: Systems and methods of validating transcriptions of natural language content using crowdsourced validation jobs are provided herein. In various implementations, a transcription pair comprising natural language content and text corresponding to a transcription of the natural language content may be gathered. A group of validation devices may be selected for reviewing the transcription pair. A crowdsourced validation job may be created for the group of validation devices. The crowdsourced validation job may be provided to the group of validation devices. One or more votes representing whether or not the text accurately represents the natural language content may be received from the group of validation devices. Based on the one or more votes received, the transcription pair may be stored in a validated transcription library, which may be used to process end-user voice data.

Type: Grant

Filed: July 25, 2016

Date of Patent: March 20, 2018

Assignee: VoiceBox Technologies Corporation

Inventors: Spencer John Rothwell, Daniela Braga, Ahmad Khamis Elshenawy, Stephen Steele Carter
SYSTEM AND METHOD FOR ELICITING OPEN-ENDED NATURAL LANGUAGE RESPONSES TO QUESTIONS TO TRAIN NATURAL LANGUAGE PROCESSORS

Publication number: 20180033434

Abstract: Systems and methods gathering text commands in response to a command context using a first crowdsourced are discussed herein. A command context for a natural language processing system may be identified, where the command context is associated with a command context condition to provide commands to the natural language processing system. One or more command creators associated with one or more command creation devices may be selected. A first application one the one or more command creation devices may be configured to display command creation instructions for each of the one or more command creators to provide text commands that satisfy the command context, and to display a field for capturing a user-generated text entry to satisfy the command creation condition in accordance with the command creation instructions. Systems and methods for reviewing the text commands using second and crowdsourced jobs are also presented herein.

Type: Application

Filed: October 9, 2017

Publication date: February 1, 2018

Applicant: VoiceBox Technologies Corporation

Inventors: Spencer John ROTHWELL, Daniela BRAGA, Ahmad Khamis ELSHENAWY, Stephen Steele CARTER
System and method for eliciting open-ended natural language responses to questions to train natural language processors

Patent number: 9786277

Abstract: Systems and methods gathering text commands in response to a command context using a first crowdsourced are discussed herein. A command context for a natural language processing system may be identified, where the command context is associated with a command context condition to provide commands to the natural language processing system. One or more command creators associated with one or more command creation devices may be selected. A first application one the one or more command creation devices may be configured to display command creation instructions for each of the one or more command creators to provide text commands that satisfy the command context, and to display a field for capturing a user-generated text entry to satisfy the command creation condition in accordance with the command creation instructions. Systems and methods for reviewing the text commands using second and crowdsourced jobs are also presented herein.

Type: Grant

Filed: September 6, 2016

Date of Patent: October 10, 2017

Assignee: VoiceBox Technologies Corporation

Inventors: Spencer John Rothwell, Daniela Braga, Ahmad Khamis Elshenawy, Stephen Steele Carter
System and method of recording utterances using unmanaged crowds for natural language processing

Patent number: 9772993

Abstract: A system and method of recording utterances for building Named Entity Recognition (“NER”) models, which are used to build dialog systems in which a computer listens and responds to human voice dialog. Utterances to be uttered may be provided to users through their mobile devices, which may record the user uttering (e.g., verbalizing, speaking, etc.) the utterances and upload the recording to a computer for processing. The use of the user's mobile device, which is programmed with an utterance collection application (e.g., configured as a mobile app), facilitates the use of crowd-sourcing human intelligence tasking for widespread collection of utterances from a population of users. As such, obtaining large datasets for building NER models may be facilitated by the system and method disclosed herein.

Type: Grant

Filed: July 20, 2016

Date of Patent: September 26, 2017

Assignee: VoiceBox Technologies Corporation

Inventors: Daniela Braga, Spencer John Rothwell, Faraz Romani, Ahmad Khamis Elshenawy, Stephen Steele Carter, Michael Kennewick
System and method of annotating utterances based on tags assigned by unmanaged crowds

Patent number: 9734138

Abstract: A system and method of tagging utterances with Named Entity Recognition (“NER”) labels using unmanaged crowds is provided. The system may generate various annotation jobs in which a user, among a crowd, is asked to tag which parts of an utterance, if any, relate to various entities associated with a domain. For a given domain that is associated with a number of entities that exceeds a threshold N value, multiple batches of jobs (each batch having jobs that have a limited number of entities for tagging) may be used to tag a given utterance from that domain. This reduces the cognitive load imposed on a user, and prevents the user from having to tag more than N entities. As such, a domain with a large number of entities may be tagged efficiently by crowd participants without overloading each crowd participant with too many entities to tag.

Type: Grant

Filed: September 6, 2016

Date of Patent: August 15, 2017

Assignee: VoiceBox Technologies Corporation

Inventors: Spencer John Rothwell, Daniela Braga, Ahmad Khamis Elshenawy, Stephen Steele Carter
SYSTEM AND METHOD FOR PROVIDING WORDS OR PHRASES TO BE UTTERED BY MEMBERS OF A CROWD AND PROCESSING THE UTTERANCES IN CROWD-SOURCED CAMPAIGNS TO FACILITATE SPEECH ANALYSIS

Publication number: 20170069325

Abstract: Systems and methods of providing text related to utterances, and gathering voice data in response to the text are provide herein. In various implementations, an identification token that identifies a first file for a voice data collection campaign, and a second file for a session script may be received from a natural language processing training device. The first file and the second file may be used to configure the mobile application to display a sequence of screens, each of the sequence of screens containing text of at least one utterance specified in the voice data collection campaign. Voice data may be received from the natural language processing training device in response to user interaction with the text of the at least one utterance. The voice data and the text may be stored in a transcription library.

Type: Application

Filed: March 28, 2016

Publication date: March 9, 2017

Applicant: VOICEBOX TECHNOLOGIES CORPORATION

Inventors: Daniela BRAGA, Faraz ROMANI, Ahmad Khamis ELSHENAWY, Michael KENNEWICK

1 2 next