Hi All,
I have just finished honours in computer science at Wits University in South Africa. My thesis was on the automated assessment of free-text paragraph responses using statistical natural language processing techniques - similar to what some of the search engines use. It compares textual submissions based on hidden semantic structures that span all the submissions. I've implemented and tested such a system and we've got very promising results. If the system is configured properly we get correlations of between 0.8 and 1.0. These are very high - humans that mark the same papers generally correlate at around 0.8.
It is very accurate if configured properly - which is difficult - the next step in our work will be to automatically generate configuration files based on features of the input. Once (if) that is done correctly, the system could prove to be a highly useful tool on Moodle.
The system is currently written in Python. We think that we'd like to keep it as a separate application that can be used anywhere as plugin. The system takes input from stdin, presents a small set of the inputted submissions to stdout for manual grading. When enough submissions have been manually marked to accurately represent the semantic structures that have been found, the system will automatically mark the remainder of the submissions and write a csv file with the awarded marks.
I administer Moodle in a number of schools in my area and would like to see this feature implemented.
Could someone please point me in the right direction to get started implementing an interface between the two systems and also if anyone is interested in being a part of such a project please let me know.
Regards,
Richard