Marked Pedagogies: Examining Linguistic Biases in Personalized Automated Writing Feedback

Authors

Mei Tan,

Lena Phalen,

Dorottya Demszky

Date

01/1970

Publisher

arXiv

Link

https://dl.acm.org/doi/10.1145/3785022.3785113

Effective personalized feedback is critical to students' literacy development. Though LLM-powered tools now promise to automate such feedback at scale, LLMs are not language-neutral: they privilege standard academic English and reproduce social stereotypes, raising concerns about how "personalization" shapes the feedback students receive. We examine how four widely used LLMs (GPT-4o, GPT-3.5-turbo, Llama-3.3 70B, Llama-3.1 8B) adapt written feedback in response to student attributes. Using 600 eighth-grade persuasive essays from the PERSUADE dataset, we generated feedback under prompt conditions embedding gender, race/ethnicity, learning needs, achievement, and motivation. We analyze lexical shifts across model outputs by adapting the Marked Words framework. Our results reveal systematic, stereotype-aligned shifts in feedback conditioned on presumed student attributes, even when essay content was identical. Feedback for students marked by race, language, or disability often exhibited positive feedback bias and feedback withholding bias: overuse of praise, less substantive critique, and assumptions of limited ability. Across attributes, models tailored not only what content was emphasized but also how writing was judged and how students were addressed. We term these instructional orientations Marked Pedagogies and highlight the need for transparency and accountability in automated feedback tools.
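
The abstract does not spell out how lexical shifts are scored. As a rough illustration only, the sketch below assumes a common formulation of marked-word analysis: a weighted log-odds ratio with an informative Dirichlet prior, comparing feedback written for a "marked" prompt condition against feedback for an unmarked baseline. The function names, the tokenizer, the pooled-corpus prior, and the 1.96 threshold are all assumptions made for this example, and the authors' adapted version may differ.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase word tokenizer (a simplifying assumption for this sketch)."""
    return re.findall(r"[a-z']+", text.lower())


def log_odds_z_scores(marked_texts, unmarked_texts):
    """Return a z-score per word: positive means over-represented in marked_texts.

    Uses the pooled counts of both groups as the Dirichlet prior (an assumption).
    Requires at least two distinct word types across the two groups.
    """
    marked = Counter(t for doc in marked_texts for t in tokenize(doc))
    unmarked = Counter(t for doc in unmarked_texts for t in tokenize(doc))
    prior = marked + unmarked

    n_m = sum(marked.values())    # total tokens in the marked group
    n_u = sum(unmarked.values())  # total tokens in the unmarked group
    a0 = sum(prior.values())      # total prior mass

    scores = {}
    for word, a_w in prior.items():
        y_m, y_u = marked[word], unmarked[word]
        # Difference of smoothed log-odds between the two groups.
        delta = (math.log((y_m + a_w) / (n_m + a0 - y_m - a_w))
                 - math.log((y_u + a_w) / (n_u + a0 - y_u - a_w)))
        # Approximate variance of that difference.
        variance = 1.0 / (y_m + a_w) + 1.0 / (y_u + a_w)
        scores[word] = delta / math.sqrt(variance)
    return scores


def marked_words(marked_texts, unmarked_texts, threshold=1.96):
    """Words whose z-score exceeds the threshold (1.96 is a conventional choice)."""
    scores = log_odds_z_scores(marked_texts, unmarked_texts)
    return {w: z for w, z in scores.items() if z > threshold}
```

In use, `marked_texts` would hold feedback generated with a student attribute in the prompt and `unmarked_texts` the feedback for the same essays without it, so that any high-scoring words reflect the attribute rather than the essay content.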

What is the application?

Teaching – Assessment and Feedback

Who is the user?

Educator

What age?

Middle School (6-8)

Why use AI?

Outcomes – Literacy,

Outcomes – Differentiation

Study design

Quantitative – Others,

Systematic Review