Measuring semantic robustness in LLM-based CEFR essay scoring through systematic prompt paraphrasing. University of Exeter Year 3 Computer Science research project.
nlp research transformers automated-essay-scoring educational-technology cefr gpt-4 llm prompt-engineering semantic-robustness
Updated Mar 31, 2026 - Python