SYSTEM FOR CO-CLUSTERING OF STUDENT ASSESSMENT DATA
A system and method for making use of collected formative assessment data is disclosed. The system identifies clusters of students and concurrently determines the characteristics of the student clusters. A decomposition of the data is performed with spectral theories of graphs and fuzzy logic algorithms to identify the clusters of students, clusters of assessment data and relationships between them. An actionable output is presented to teachers for the evaluation of educational progress.
This application claims the priority benefit, as a divisional application, of U.S. application Ser. No. 13/214,358, filed Aug. 22, 2011, the disclosure of which is incorporated herein by reference.
FIELD
The present disclosure relates to multifunction device systems and methods for evaluating student assessment data, and in particular, for utilizing formative assessment data in order to identify co-clusters of students and metadata associated therewith that are displayed and/or printed on the multifunctional device system.
BACKGROUND
In recent years the basic office copier has evolved into what can be called a “multi-function device” or MFD. With digital technology, a machine with the basic outward appearance of a traditional copier can perform additional functions such as printing documents submitted in digital form over a network; sending and receiving messages via facsimile; recording hard-copy original images in digital form and sending the resulting data over a network, such as in electronic mail; or recording hard-copy original images in digital form on a compact disc or equivalent medium. These additional functions present further challenges to a designer of a convenient and intuitive user interface.
The present disclosure is directed to utilizing MFDs to create, grade, and analyze the results of tests administered to students, and in particular, to a computer-implemented educational assessment system and method for generating and administering student assessments, evaluating the results thereof to enable educators to identify strengths and weaknesses of students both individually and in the aggregate, and tailoring the learning experience of students in a classroom environment.
The present disclosure relates to the process of assessing the attributes of a student or group of students at selected times during their learning process and particularly relates to the assessment and evaluation of student attributes or progress in a structured classroom where a teacher is required to educate the students to a level of proficiency in various subject matters and at particular grade levels. Typically, in a grade level classroom, the teacher periodically gives the students printed formative assessments or tests, as they have previously been referred to, in order to obtain an indication of the student(s) level(s) of proficiency in the subject matter of immediate interest.
Where a teacher is responsible for a class having a relatively large number of students, the teacher typically passes out to all students a common assessment form. The assessments are distributed to the students who then mark their responses on the forms which are then gathered by the teacher and individually reviewed and evaluated or graded by the teacher. The process has required the teacher to then manually enter an overall metric of each student's performance on the assessment into a record book or database. Typically the metric is a single total score or percentage of possible points. This process has thus been time consuming and often requires the teacher to spend considerable time outside of the classroom performing these functions. Furthermore, no detailed record is kept regarding how each student performed on each item within the assessment. Given the low resolution metric recorded for each assessment, the recorded results do not fully or meaningfully determine the state of learning of individual students who may be experiencing learning difficulties or are insufficiently prepared for particular elements of the assessment.
Currently formative assessments are widely used in schools to close the gap between the current and the desired state of learning of a student. They are used by teachers as a feedback mechanism to assess the effectiveness of their instruction. Highly granular data on the performance of the students on a formative assessment can be captured. However, the data can be useful only if some actionable insights can be extracted from it and presented to the teachers, parents and students alike. One such view that can be extracted from the data is the information regarding clusters of students and the problem areas specific to each student cluster.
Thus, it has been desired to provide a way or means of automatically evaluating educational assessments in order to better address the educational needs of groups of students.
BRIEF DESCRIPTION
The following presents a simplified summary in order to provide a basic understanding of one or more aspects of the disclosure. This summary is not an extensive overview and is neither intended to identify key or critical elements, nor to delineate the scope thereof. Rather, the primary purpose of the summary is to present some concepts of the disclosure in a simplified form as a prelude to the more detailed description that is presented later.
In one embodiment, a system for educational assessment of student groups is provided. The system includes a processor having a memory, wherein the processor has a clustering engine that includes a student identification module that identifies student clusters and associated metadata having characteristics of each of the student clusters. The clustering engine also includes an assessment identification module that identifies assessment data clusters among assessment data for students belonging to the student clusters, concurrently with the student identification module identifying the student clusters. A display module compiles the metadata related to the student clusters and the assessment data clusters, and provides relationships between the student clusters and the assessment data clusters with the metadata in a visible medium.
In another embodiment, a system for educational assessment of students is disclosed that comprises a processor having a memory, wherein the processor has a clustering engine that includes a student identification module that identifies student clusters based on metadata having characteristics of students within each student cluster and is associated with a hierarchy of assessment data of one or more formative assessments. The hierarchy of assessment data is related to different levels of demonstrated knowledge by students based on the formative assessments. A display module compiles the metadata related to the student clusters and the assessment data clusters, and provides relationships between the student clusters and the assessment data clusters with the metadata to a visible medium in response to co-clustering of student data and assessment data. The student clusters are assigned to hard clustering so that each student is associated with only one student cluster and the assessment data is assigned to soft clustering so that the assessment data is not confined to being associated with only one assessment cluster. In addition, the assessment data includes data related to student responses to assessment evaluations of each formative assessment and each assessment evaluation includes a question for evaluation of each student that is associable with more than one assessment cluster.
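By way of a non-limiting illustration, the distinction between hard clustering of students and soft clustering of assessment data may be sketched as follows. The sketch assumes that distances from each student and each question to a set of cluster centers are already available. The function names, the fuzzifier parameter m, the fuzzy c-means style membership formula, and the sample distances are assumptions made for illustration only and are not specified by the disclosure.

```python
import numpy as np

def hard_assign(dist):
    """Hard clustering: each row (e.g., a student) receives exactly one cluster index."""
    return np.argmin(dist, axis=1)

def soft_assign(dist, m=2.0, eps=1e-12):
    """Soft clustering: each row (e.g., a question) receives a membership degree
    in every cluster, using the fuzzy c-means membership formula (an assumption)."""
    d = np.maximum(dist, eps)              # guard against division by zero
    inv = d ** (-2.0 / (m - 1.0))          # closer centers get larger weight
    return inv / inv.sum(axis=1, keepdims=True)

# Hypothetical distances to two cluster centers.
student_dist = np.array([[0.2, 0.9],       # 3 students x 2 centers
                         [0.8, 0.1],
                         [0.4, 0.5]])
question_dist = np.array([[0.3, 0.3],      # 2 questions x 2 centers
                          [0.1, 0.9]])

student_labels = hard_assign(student_dist)          # one cluster per student
question_membership = soft_assign(question_dist)    # rows sum to 1
```

In this sketch a question that is equidistant from two centers receives equal membership in both clusters, which is one way a single question may be associated with more than one assessment cluster.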
In yet another embodiment, a method for co-clustering student data and assessment data from formative assessments on a processor is disclosed. The processor has a memory storing executable instructions for the method. The method comprises transforming student data from formative assessments into one or more bipartite graphs and mapping adjacency relationships between the students and the assessment data by creating one or more adjacency matrices from the one or more bipartite graphs. The student data is then clustered into student clusters, and the assessment data from the formative assessments into assessment data clusters, while metadata pertaining to each student cluster is extracted. The metadata includes characteristics of students belonging to each student cluster as well as any additional data pertaining thereto.
The following description and annexed drawings set forth in detail certain illustrative aspects and implementations of this disclosure. These are indicative of only a few of the various ways in which the principles of the disclosure may be employed.
One or more implementations of the present disclosure will now be described with reference to the attached drawings, wherein like reference numerals are used to refer to like elements throughout.
Aspects of the exemplary embodiment relate to systems and methods to dynamically assess student data by automatically extracting actionable insights from assessment data as it is received and presenting the insights to the teachers, parents and students alike. One such view that can be extracted from the data is the information regarding clusters of students and the problem areas specific to each student cluster. Aspects of the systems and methods provide a co-clustering based approach to simultaneously discover student clusters as well as the defining characteristics of each cluster (e.g., questions that the students answered incorrectly, a specific concept which a set of students could not master, and/or an error mode common among the students). The assessment data can be accumulated in highly granular detail with a system described in more detail in a related U.S. application, namely U.S. Patent Publication Number 2010/0157345 by Robert M. Lofthus et al., which is incorporated herein by reference in its entirety and reproduced in sections herein.
Referring to the drawings,
A bus 224 permits communication among the components of the system 200. The processor 206 includes processing logic that may include a microprocessor or application specific integrated circuit (ASIC), a field programmable gate array (FPGA), or the like. The processor 206 may also include a graphical processor (not shown) for processing instructions, programs or data structures for assessment data from formative assessments provided to students.
The memory 204 may include a random access memory (RAM) or another type of dynamic storage device that may store information and instructions for execution by the processor 206, a read only memory (ROM) or another type of static storage device that may store static information and instructions for use by processing logic; a flash memory (e.g., an electrically erasable programmable read only memory (EEPROM)) device for storing information and instructions, and/or some other type of magnetic or optical recording medium and its corresponding drive.
The processor 206 drives the clustering engine 216, which includes the student identification module 218 and the assessment identification module 220. The student identification module 218 identifies student clusters and associated metadata that includes characteristics of students within each student cluster. For example, the characteristics may include data related to concepts, mastery of skills, problem areas, errors made, questions missed, and/or other characteristics related to assessment evaluations provided to each student within a student cluster. The assessment identification module 220 identifies assessment data clusters among assessment data for students belonging to the student cluster.
The transformation engine 222 compiles the assessment data received from formative assessments provided to students and, in turn, creates bipartite relationships in bipartite graphs of student data for each student and the assessment evaluations. The adjacency mapping engine 226 maps adjacency relationships between students and the assessment data by creating at least one adjacency matrix from the one or more bipartite graphs. The decomposition engine 228 performs a spectral decomposition on the adjacency matrices and establishes weighted distances for each relationship, which is further detailed infra.
Referring now to
An example of one output view 300 that is extracted from the data by the clustering engine is shown in
The co-clusters identified from the data can be used in several ways. They can help the teacher to easily identify clusters of students and the problem areas within these clusters, helping her or him to take remedial actions to address issues within a classroom. The co-clusters identified from a prior assessment could also help in automated generation of subsequent formative assessments.
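As a non-limiting sketch of how problem areas within each student cluster might be summarized for a teacher, the following example lists, for each cluster, the questions missed by at least a given fraction of its students. The function name, the threshold parameter, the question labels, and the sample data and cluster labels are hypothetical and serve only to illustrate the idea.

```python
import numpy as np

def cluster_problem_areas(missed, labels, question_ids, threshold=0.5):
    """For each student cluster, list questions missed by at least
    `threshold` of the students in that cluster."""
    report = {}
    for c in np.unique(labels):
        rate = missed[labels == c].mean(axis=0)   # miss rate per question
        report[int(c)] = [q for q, r in zip(question_ids, rate) if r >= threshold]
    return report

# Hypothetical data: rows are students S1..S5, columns are questions Q1..Q4;
# 1 means the student answered incorrectly or skipped the question.
missed = np.array([[1, 1, 0, 0],
                   [1, 1, 0, 0],
                   [0, 1, 1, 0],
                   [0, 0, 1, 1],
                   [0, 0, 0, 1]], dtype=float)
labels = np.array([0, 0, 1, 1, 1])   # hypothetical cluster label for each student
question_ids = ["Q1", "Q2", "Q3", "Q4"]

report = cluster_problem_areas(missed, labels, question_ids)
```

A report of this kind could, under the stated assumptions, also seed the selection of questions for a subsequent formative assessment directed at each cluster.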
An example methodology 400 for co-clustering student data and assessment data from formative assessments provided to students with a processor having a memory storing executable instructions for the method is illustrated in
The method 400 is described in conjunction with other figures described herein for ease of explanation and example. However, the method 400 is not limited to any one aspect of an embodiment provided in the other figures. The method 400 initiates at 402 with data preprocessing and transformation of student groups and assessment data compiled from formative assessments. The preprocessing and transformation could be performed, for example, by the transformation engine 222 of the educational assessment system 200 as described above. Preprocessing gathers the data from student assessments provided to the students. The student data is then formed into one or more bipartite graphs, which are illustrated further in
Referring to
In one example, bipartite graph 502 illustrates the student-item bipartite relationships formed by the transformation engine of the educational assessment system. The students (e.g., S1, S2, S3, S4 and S5) are represented as squares and the questions as circles. A line from a student to a question represents that the student gave an incorrect response to that question (or skipped that question).
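For illustration only, a student-item bipartite graph of this kind may be represented as a biadjacency matrix (students by questions) and as a full symmetric adjacency matrix over all nodes. The edge list, the question labels Q1 through Q4, and the function names below are hypothetical and are not taken from the figures.

```python
import numpy as np

students = ["S1", "S2", "S3", "S4", "S5"]
questions = ["Q1", "Q2", "Q3", "Q4"]      # hypothetical question labels

# Hypothetical edges: (student, question) pairs answered incorrectly or skipped.
edges = [("S1", "Q1"), ("S1", "Q2"), ("S2", "Q1"), ("S2", "Q2"),
         ("S3", "Q3"), ("S4", "Q3"), ("S4", "Q4"), ("S5", "Q4")]

def biadjacency(students, questions, edges):
    """Students-by-questions matrix with 1 where an edge exists."""
    s_idx = {s: i for i, s in enumerate(students)}
    q_idx = {q: j for j, q in enumerate(questions)}
    B = np.zeros((len(students), len(questions)))
    for s, q in edges:
        B[s_idx[s], q_idx[q]] = 1.0
    return B

def full_adjacency(B):
    """Symmetric adjacency matrix of the bipartite graph, with student
    nodes first and question nodes second."""
    n_s, n_q = B.shape
    A = np.zeros((n_s + n_q, n_s + n_q))
    A[:n_s, n_s:] = B        # student -> question edges
    A[n_s:, :n_s] = B.T      # question -> student edges
    return A

B = biadjacency(students, questions, edges)
A = full_adjacency(B)
```

Because edges only connect students to questions, the student-student and question-question blocks of the full adjacency matrix are zero.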
In another example, bipartite graph 504 illustrates the student-concept bipartite relationship. When a formative assessment measures the student performance on multiple concepts (e.g., addition, long division, etc.), the bipartite graphs may be formed in order to cluster students based on their performance in these concepts. Here, too, the students are represented as squares and the concepts as circles. The weight between a student and a concept is the measure of the student's mastery of that particular concept. In addition, the length or distance of the line may represent a weight provided to the missed or skipped question for a weighted distance. Various other means may also be envisioned for representing the weight of the relationships. In one embodiment, the weights may be varied, either manually or via user interface controls of the system. Therefore, certain concepts proving more difficult or taught with more stress could be evaluated differently in relation to other relationships shown in the graph.
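A minimal sketch of a weighted student-concept relationship follows. It assumes that mastery is measured as the fraction of questions answered correctly for each concept and that a user-adjustable emphasis factor scales each concept. Both of these choices, along with the function name and the sample data, are assumptions made for illustration and are not prescribed by the disclosure.

```python
import numpy as np

# Hypothetical data: rows are students, columns are questions;
# 1 = correct response, 0 = incorrect or skipped.
correct = np.array([[1, 1, 0, 0],
                    [1, 0, 1, 1],
                    [0, 0, 1, 1]], dtype=float)

# Hypothetical tagging of each question with the concept it tests.
question_concept = ["Addition", "Addition", "Long division", "Long division"]
concepts = ["Addition", "Long division"]

def concept_weights(correct, question_concept, concepts, emphasis=None):
    """Student-by-concept weight matrix: the fraction of each concept's
    questions answered correctly, optionally scaled per concept."""
    tags = np.array(question_concept)
    W = np.zeros((correct.shape[0], len(concepts)))
    for j, c in enumerate(concepts):
        W[:, j] = correct[:, tags == c].mean(axis=1)
    if emphasis is not None:
        W = W * np.asarray(emphasis, dtype=float)   # user-adjusted weights
    return W

W = concept_weights(correct, question_concept, concepts)
# Give long division twice the emphasis of addition (hypothetical setting).
W_emph = concept_weights(correct, question_concept, concepts, emphasis=[1.0, 2.0])
```

The emphasis vector stands in for the manual or user-interface weight adjustment mentioned above.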
Referring again to
At 404 a decomposition of the adjacency matrices formed at 402 is performed by a decomposition engine 228, which performs a spectral decomposition on the adjacency matrices and establishes weighted distances for each relationship. For example, a spectral decomposition is performed with student data from the adjacency matrices. The “spectrum” of a graph refers here to the eigenvectors of the graph ordered on the strength of their corresponding eigenvalues. The spectrum provides valuable insights into the connectivity of a graph. The objective of spectral decomposition of the adjacency data is two-fold. The first objective is to estimate three different types of distances: Student-Student, Question-Question and Student-Question. The Student-Student and Question-Question distances are between homogeneous entities only. By transforming the data into an adjacency representation and applying spectral decomposition, the system 200, for example, is able to compare not only the similarities between homogeneous entities but also the distances between dissimilar entities (e.g., students and questions). The weighted distances establish the degree of truth or probability for a connection between students and the student data, for example. Thus, the decomposition helps in co-clustering of students and questions. The second objective is to reduce dimensionality. The data has high dimensionality, and applying clustering on the raw data will suffer from the “curse of dimensionality”, i.e., it will result in imprecise clusters. By applying spectral decomposition on the adjacency data, the system is able to focus on a few dimensions with the highest variability, which results in more accurate clusters.
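One possible way to obtain the three types of distances is sketched below. The disclosure does not specify the normalization or the decomposition routine; this sketch assumes a degree-normalized biadjacency matrix decomposed by singular value decomposition, in the manner of known spectral co-clustering techniques for bipartite graphs, so that students and questions are embedded in the same low-dimensional space. The function names, the number of retained components, and the sample data are hypothetical.

```python
import numpy as np

def inv_sqrt(d):
    """Elementwise 1/sqrt(d), with 0 for nodes that have no edges."""
    out = np.zeros_like(d, dtype=float)
    out[d > 0] = 1.0 / np.sqrt(d[d > 0])
    return out

def co_embed(B, n_components=1):
    """Embed students (rows of B) and questions (columns of B) in a shared
    space using the SVD of the degree-normalized biadjacency matrix."""
    s_scale = inv_sqrt(B.sum(axis=1))      # student degrees
    q_scale = inv_sqrt(B.sum(axis=0))      # question degrees
    Bn = s_scale[:, None] * B * q_scale[None, :]
    U, sigma, Vt = np.linalg.svd(Bn, full_matrices=False)
    keep = slice(1, 1 + n_components)      # skip the trivial leading pair
    Z_s = s_scale[:, None] * U[:, keep]
    Z_q = q_scale[:, None] * Vt.T[:, keep]
    return Z_s, Z_q

def pairwise(X, Y):
    """Euclidean distances between every row of X and every row of Y."""
    return np.linalg.norm(X[:, None, :] - Y[None, :, :], axis=2)

# Hypothetical connected student-question graph (1 = incorrect or skipped).
B = np.array([[1, 1, 0, 0],
              [1, 1, 0, 0],
              [0, 1, 1, 0],
              [0, 0, 1, 1],
              [0, 0, 0, 1]], dtype=float)

Z_s, Z_q = co_embed(B, n_components=1)
student_student = pairwise(Z_s, Z_s)
question_question = pairwise(Z_q, Z_q)
student_question = pairwise(Z_s, Z_q)
```

Retaining only a few components is what allows the subsequent clustering to operate on a small number of dimensions rather than on the raw response data. In this sketch, two students with identical error patterns land at the same point, and a student lies closer to the questions that he or she missed than to questions missed only by dissimilar students.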
Referring back to
At 412 reporting of the co-clustering data is performed in a display and/or at a visible medium. Examples of the reporting of co-cluster information are illustrated at
Although not required, embodiments are described in the general context of “computer readable instructions” being executed by one or more computing devices. Computer readable instructions may be distributed via computer readable media (discussed below). Computer readable instructions may be implemented as program modules, such as functions, objects, Application Programming Interfaces (APIs), data structures, and the like, that perform particular tasks or implement particular abstract data types. Typically, the functionality of the computer readable instructions may be combined or distributed as desired in various environments.
The exemplary method may be implemented on one or more general purpose computers, special purpose computer(s), a programmed microprocessor or microcontroller and peripheral integrated circuit elements, an ASIC or other integrated circuit, a digital signal processor, a hardwired electronic or logic circuit such as a discrete element circuit, a programmable logic device such as a PLD, PLA, FPGA, or PAL, or the like. In general, any device capable of implementing a finite state machine that is, in turn, capable of implementing the flowchart shown herein can be used to implement the method.
The methods illustrated and related herein may be implemented in a computer program product that may be executed on a computer. The computer program product may be a tangible computer-readable recording medium on which a control program is recorded, such as a disk or hard drive, or may be a transmittable carrier wave in which the control program is embodied as a data signal. Common forms of computer-readable media include, for example, floppy disks, flexible disks, hard disks, magnetic tape, or any other magnetic storage medium, CD-ROM, DVD, or any other optical medium, a RAM, a PROM, an EPROM, a FLASH-EPROM, or other memory chip or cartridge, transmission media, such as acoustic or light waves, such as those generated during radio wave and infrared data communications, and the like, or any other medium from which a computer can read and use.
It will be appreciated that variants of the above-disclosed and other features and functions, or alternatives thereof, may be combined into many other different systems or applications. Various presently unforeseen or unanticipated alternatives, modifications, variations or improvements therein may be subsequently made by those skilled in the art which are also intended to be encompassed by the following claims.
Claims
1. A system for educational assessment of students, comprising:
- a processor having a memory, wherein the processor has a clustering engine that includes a student identification module that identifies student clusters based on metadata having characteristics of students within each student cluster and is associated with a hierarchy of assessment data of one or more formative assessments, wherein the hierarchy of assessment data is related to different levels of demonstrated knowledge by students based on the formative assessments;
- a display module that compiles the metadata related to the student clusters and the assessment data clusters, and provides relationships between the student clusters and the assessment data clusters with the metadata to a visible medium in response to co-clustering of student data and assessment data.
2. The system of claim 1, wherein the different levels of demonstrated knowledge are based on a certain response to a particular question, or assessment evaluation, of the formative assessments in relation to other responses to the particular question or assessment evaluation among students, and the student clusters are identified by the student identification module according to the responses of students to a plurality of questions of the formative assessments that have one or more response possibilities for one or more of the plurality of questions.
3. The system of claim 1, wherein the student clusters are assigned to hard clustering so that each student is associated with only one student cluster and the assessment data is assigned to soft clustering so that the assessment data is not confined to being associated with only one assessment cluster.
4. The system of claim 3, wherein the assessment data includes data related to student responses to assessment evaluations of each formative assessment and each assessment evaluation includes a question for evaluation of each student that is associable with more than one assessment cluster.
5. The system of claim 1, further comprising a transformation engine that compiles the assessment data from formative assessments provided to the plurality of students and creates bipartite graphs of student data for each student and assessment evaluations from the assessment data;
- an adjacency mapping engine that maps adjacency relationships between the students and the assessment data by creating at least one adjacency matrix from the bipartite graphs; and
- a decomposition engine that performs spectral decomposition on the adjacency matrix and establishes weighted distances for each relationship.
6. The system of claim 5, wherein the weighted distances are configured to be fixed according to a setting by a user.
Type: Application
Filed: Jan 28, 2014
Publication Date: Aug 21, 2014
Applicant: Xerox Corporation (Norwalk, CT)
Inventors: Sharath Srinivas (Webster, NY), Eric Scott Hamby (Webster, NY), Robert M. Lofthus (Webster, NY)
Application Number: 14/166,452
International Classification: G09B 7/00 (20060101);