Patents by Inventor Michael Flor
Michael Flor has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11861310Abstract: A computer-implemented technique for characterizing lexical concreteness in narrative includes receiving data encapsulating narrative text having a plurality of words. Thereafter, the function words can be removed from the narrative text to result in only content words. A concreteness score can then be assigned to each content word by polling a database to identify matching words and to use concreteness scores associated with such matching words as specified by the database. Data can then be provided which characterizes the assigned concreteness scores. Related apparatus, systems, techniques and articles are also described.Type: GrantFiled: April 24, 2020Date of Patent: January 2, 2024Assignee: Educational Testing ServiceInventors: Michael Flor, Swapna Somasundaran
-
Patent number: 11854432Abstract: Systems and methods are provided for processing a group of essays to develop a classifier that detects nonsensical computer-generated essays. A data structure associated with a group of essays is accessed, wherein the group of essays includes nonsensical computer-generated essays and good-faith essays. Both the nonsensical computer-generated essays and the good-faith essays are assigned feature values. The distribution of feature values between the nonsensical computer-generated essays and the good-faith essays is measured. A classifier that detects whether an essay is a nonsensical computer-generated essay is developed, wherein the classifier is developed using the distribution of feature values.Type: GrantFiled: July 1, 2019Date of Patent: December 26, 2023Assignee: Educational Testing ServiceInventors: Aoife Cahill, Martin Chodorow, Michael Flor
-
Patent number: 11790227Abstract: Systems and methods are disclosed for automatically scoring a constructed response using a neural network. In embodiments, a constructed response received by a processing system may be processed to divide the constructed response into multiple series of word tokens, wherein each word token includes a sequence of characters. The constructed response may be further processed to correct one or more spelling errors. The word tokens may be encoded to generate representation vectors for the constructed response. A set of nonlinear operations may be applied to the plurality of representation vectors in a neural network to generate a single vector output. A set of predetermined network weights may be applied to the vector output of the neural network to generate a scalar output for scoring the constructed response.Type: GrantFiled: January 14, 2021Date of Patent: October 17, 2023Assignee: Educational Testing ServiceInventors: Brian W. Riordan, Kenneth Steimel, Michael Flor, Robert A. Pugh
-
Patent number: 11023684Abstract: Computer-implemented systems and methods are described herein for automatically generating questions from text. Text including one or more sentences is received. A sentence, comprising a predicate and one or more arguments associated with the predicate, is parsed from the text. Semantic role labels are assigned to the one or more arguments associated with the predicate. One or more questions are automatically generated relating to the predicate based on the assigned semantic role labels. Each answer to the generated questions is one of the one or more arguments associated with the predicate.Type: GrantFiled: March 19, 2019Date of Patent: June 1, 2021Assignee: Educational Testing ServiceInventors: Michael Flor, Brian W. Riordan
-
Patent number: 10885274Abstract: Systems and methods are provided for processing a response to essay prompts that request a narrative response. A data structure associated with a narrative essay is accessed. The essay is analyzed to generate an organization subscore, where the organization subscore is generated using a graph metric by identifying content words in each sentence of the essay and populating a data structure with links between related content words in neighboring sentences, wherein the organization subscore is determined based on the links. The essay is analyzed to generate a development subscore, where the development subscore is generated using a transition metric by accessing a transition cue data store and identifying transition words in the essay, wherein the development subscore is based on a number of words in the essay that match words in the transition cue data store. A narrative quality metric is determined based on the organization subscore and the development subscore.Type: GrantFiled: June 21, 2018Date of Patent: January 5, 2021Assignee: Educational Testing ServiceInventors: Swapna Somasundaran, Michael Flor, Martin Chodorow, Binod Gyawali, Hillary Molloy, Laura McCulla
-
Patent number: 10585985Abstract: Methods and systems for scoring written text based on use of idiomatic expressions, including reading pre-selected idiomatic expressions in a canonical form into memory, expanding idiomatic expressions from the canonical form, reading a written response into the memory, pre-processing the written response, searching the pre-processed written response for idiomatic expressions, and assigning a score to the written response. The score may be based at least in part on the number of idiomatic expressions in the written response. Corresponding apparatuses, systems, and methods are also disclosed.Type: GrantFiled: December 14, 2017Date of Patent: March 10, 2020Assignee: Educational Testing ServiceInventors: Michael Flor, Beata Beigman Klebanov
-
Patent number: 10515314Abstract: Systems and methods are provided for a computer-implemented method for identifying pairs of cohesive words within a text. A supervised model is trained to detect cohesive words within a text to be scored. Training the supervised model includes identifying a plurality of pairs of candidate cohesive words in a training essay and an order associated with the pairs of candidate cohesive words based on an order of words in the training essay. The pairs of candidate cohesive words are filtered to form a set of evaluation pairs. The evaluation pairs are provided via a graphical user interface based on the order associated with the pairs of candidate cohesive words. An indication of cohesion or no cohesion is received for the evaluation pairs via the graphical user interface. The supervised model is trained based on the evaluation pairs and the received indications.Type: GrantFiled: December 3, 2015Date of Patent: December 24, 2019Assignee: Educational Testing ServiceInventors: Beata Beigman Klebanov, Michael Flor, Daniel Blanchard
-
Patent number: 10515153Abstract: A computer-implemented method of training an assessment model for assessing constructed texts expressing opinions on subjects includes accessing a plurality of training texts, which are constructed texts. The training texts are analyzed with the processing system to derive values of a plurality of linguistic features of an assessment model. At least one of the plurality of linguistic features relates to sentiment and at least one of the plurality of linguistic feature relates to specificity. The assessment model is trained with the processing system based on the values of the plurality of linguistic features. Based on the training, a weight for each of the plurality of linguistic features is determined. The assessment model is calibrated to include the weights for at least some of the plurality of linguistic features such that the assessment model is configured to generate assessment measures for constructed texts expressing opinions on subjects.Type: GrantFiled: May 16, 2014Date of Patent: December 24, 2019Assignee: Educational Testing ServiceInventors: Michael Heilman, F. Jay Breyer, Michael Flor
-
Patent number: 10262547Abstract: Systems and methods are provided for scoring a constructed response generated by a user and providing information on the user's writing behavior. A constructed response and associated electronic process log are received. The constructed response is processed to generate first feature values representative of aspects of the constructed response. The electronic process log is processed to generate second feature values related to the user's actions in generating the constructed response. A score for the constructed response is generated using the processing system by applying a computer scoring model to the first and second feature values. A rule of a rule engine that is satisfied is identified, the rule being satisfied when one or more feature values of the second feature values meet a condition associated with the rule. Information on the user's actions in generating the constructed response is provided based on the satisfied rule.Type: GrantFiled: November 10, 2015Date of Patent: April 16, 2019Assignee: Educational Testing ServiceInventors: Paul Deane, Gary Feng, Mo Zhang, Jiangang Hao, Yoav Bergner, Michael Flor, Michael E. Wagner, Nathan Lederer, Yigal Attali
-
Patent number: 10255820Abstract: Systems and methods are provided for determining a susceptibility of a computer-implemented automated scoring engine to gaming strategies. A plurality of responses to a prompt are provided to a computer-implemented automated scoring engine to receive a first set of scores. A first transformation is performed on each of the plurality of responses to generate a first set of transformed responses. The first set of transformed responses is provided to the computer-implemented automatic scoring engine to receive a second set of scores, and a gaming susceptibility metric is determined based on the first set of scores and the second set of scores.Type: GrantFiled: March 26, 2014Date of Patent: April 9, 2019Assignee: Educational Testing ServiceInventors: Derrick Higgins, Isaac Bejar, Michael Heilman, Yoko Futagi, Michael Flor
-
Patent number: 10134297Abstract: Computer-implemented systems and methods are provided for determining a document's complexity. For example, a computer performing the complexity analysis can receive a document. The computer can determine the content words within the document and determine an association measure for each group of content words. An association profile can be created for the document using the association measures. The computer can use the association profile to determine the complexity of the document. The complexity of the document may correspond to the document's suitable reading level or, if the document is an essay, an essay score.Type: GrantFiled: February 14, 2014Date of Patent: November 20, 2018Assignee: Educational Testing ServiceInventors: Beata Beigman Klebanov, Michael Flor
-
Patent number: 9852379Abstract: Systems and methods described herein utilize supervised machine learning to generate a figure-of-speech prediction model for classify content words in a running text as either being figurative (e.g., as a metaphor, simile, etc.) or non-figurative (i.e., literal). The prediction model may extract and analyze any number of features in making its prediction, including a topic model feature, unigram feature, part-of-speech feature, concreteness feature, concreteness difference feature, literal context feature, non-literal context feature, and off-topic feature, each of which are described in detail herein. Since uses of figure of speech in writings may signal content sophistication, the figure-of-speech prediction model allows scoring engines to further take into consideration a text's use of figure of speech when generating a score.Type: GrantFiled: March 6, 2015Date of Patent: December 26, 2017Assignee: Educational Testing ServiceInventors: Beata Beigman Klebanov, Chee Wee Leong, Michael Flor, Michael Heilman
-
Patent number: 9519634Abstract: Systems and methods are provided for identifying one or more target words of a corpus that have a lexical relationship to a plurality of provided cue words. The cue words and statistical lexical information derived from a corpus of documents are analyzed to determine candidate words that have a lexical association with the cue words. The statistical information includes numerical values indicative of probabilities of word pairs appearing together as adjacent words in a well-formed text or appearing together within a paragraph of a well-formed text. For each candidate word, a statistical association score between the candidate word and each of the cue words is determined. An aggregate score for each of the candidate words is determined based on the statistical association scores. One or more of the candidate words are selected to be the one or more target words based on the aggregate scores.Type: GrantFiled: June 1, 2015Date of Patent: December 13, 2016Assignee: Educational Testing ServiceInventors: Michael Flor, Beata Beigman Klebanov
-
Publication number: 20160162806Abstract: Systems and methods are provided for a computer-implemented method for identifying pairs of cohesive words within a text. A supervised model is trained to detect cohesive words within a text to be scored. Training the supervised model includes identifying a plurality of pairs of candidate cohesive words in a training essay and an order associated with the pairs of candidate cohesive words based on an order of words in the training essay. The pairs of candidate cohesive words are filtered to form a set of evaluation pairs. The evaluation pairs are provided via a graphical user interface based on the order associated with the pairs of candidate cohesive words. An indication of cohesion or no cohesion is received for the evaluation pairs via the graphical user interface. The supervised model is trained based on the evaluation pairs and the received indications.Type: ApplicationFiled: December 3, 2015Publication date: June 9, 2016Inventors: Beata Beigman Klebanov, Michael Flor, Daniel Blanchard
-
Publication number: 20160133147Abstract: Systems and methods are provided for scoring a constructed response generated by a user and providing information on the user's writing behavior. A constructed response and associated electronic process log are received. The constructed response is processed to generate first feature values representative of aspects of the constructed response. The electronic process log is processed to generate second feature values related to the user's actions in generating the constructed response. A score for the constructed response is generated using the processing system by applying a computer scoring model to the first and second feature values. A rule of a rule engine that is satisfied is identified, the rule being satisfied when one or more feature values of the second feature values meet a condition associated with the rule. Information on the user's actions in generating the constructed response is provided based on the satisfied rule.Type: ApplicationFiled: November 10, 2015Publication date: May 12, 2016Inventors: Paul Deane, Gary Feng, Mo Zhang, Jiangang Hao, Yoav Bergner, Michael Flor, Michael E. Wagner, Nathan Lederer
-
Publication number: 20150347385Abstract: Systems and methods are provided for identifying one or more target words of a corpus that have a lexical relationship to a plurality of provided cue words. The cue words and statistical lexical information derived from a corpus of documents are analyzed to determine candidate words that have a lexical association with the cue words. The statistical information includes numerical values indicative of probabilities of word pairs appearing together as adjacent words in a well-formed text or appearing together within a paragraph of a well-formed text. For each candidate word, a statistical association score between the candidate word and each of the cue words is determined. An aggregate score for each of the candidate words is determined based on the statistical association scores. One or more of the candidate words are selected to be the one or more target words based on the aggregate scores.Type: ApplicationFiled: June 1, 2015Publication date: December 3, 2015Inventors: Michael Flor, Beata Beigman Klebanov
-
Publication number: 20150254565Abstract: Systems and methods described herein utilize supervised machine learning to generate a figure-of-speech prediction model for classify content words in a running text as either being figurative (e.g., as a metaphor, simile, etc.) or non-figurative (i.e., literal). The prediction model may extract and analyze any number of features in making its prediction, including a topic model feature, unigram feature, part-of-speech feature, concreteness feature, concreteness difference feature, literal context feature, non-literal context feature, and off-topic feature, each of which are described in detail herein. Since uses of figure of speech in writings may signal content sophistication, the figure-of-speech prediction model allows scoring engines to further take into consideration a text's use of figure of speech when generating a score.Type: ApplicationFiled: March 6, 2015Publication date: September 10, 2015Inventors: Beata Beigman Klebanov, Chee Wee Leong, Michael Flor, Michael Heilman
-
Publication number: 20140343923Abstract: A computer-implemented method of training an assessment model for assessing constructed texts expressing opinions on subjects includes accessing a plurality of training texts, which are constructed texts. The training texts are analyzed with the processing system to derive values of a plurality of linguistic features of an assessment model. At least one of the plurality of linguistic features relates to sentiment and at least one of the plurality of linguistic feature relates to specificity. The assessment model is trained with the processing system based on the values of the plurality of linguistic features. Based on the training, a weight for each of the plurality of linguistic features is determined. The assessment model is calibrated to include the weights for at least some of the plurality of linguistic features such that the assessment model is configured to generate assessment measures for constructed texts expressing opinions on subjects.Type: ApplicationFiled: May 16, 2014Publication date: November 20, 2014Applicant: Educational Testing ServiceInventors: Michael Heilman, F. Jay Breyer, Michael Flor
-
Publication number: 20140295399Abstract: Systems and methods are provided for determining a susceptibility of a computer-implemented automated scoring engine to gaming strategies. A plurality of responses to a prompt are provided to a computer-implemented automated scoring engine to receive a first set of scores. A first transformation is performed on each of the plurality of responses to generate a first set of transformed responses. The first set of transformed responses is provided to the computer-implemented automatic scoring engine to receive a second set of scores, and a gaming susceptibility metric is determined based on the first set of scores and the second set of scores.Type: ApplicationFiled: March 26, 2014Publication date: October 2, 2014Applicant: Educational Testing ServiceInventors: Derrick Higgins, Isaac Bejar, Michael Heilman, Yoko Futagi, Michael Flor
-
Publication number: 20140234810Abstract: Computer-implemented systems and methods are provided for determining a document's complexity. For example, a computer performing the complexity analysis can receive a document. The computer can determine the content words within the document and determine an association measure for each group of content words. An association profile can be created for the document using the association measures. The computer can use the association profile to determine the complexity of the document. The complexity of the document may correspond to the document's suitable reading level or, if the document is an essay, an essay score.Type: ApplicationFiled: February 14, 2014Publication date: August 21, 2014Applicant: EDUCATIONAL TESTING SERVICEInventors: Michael Flor, Beata Beigman Klebanov