Date
Publisher
arXiv
Early childhood science education is crucial for developing scientific
literacy, yet translating complex scientific concepts into age-appropriate
content remains challenging for educators. Our study evaluates four leading
Large Language Models (LLMs) - GPT-4, Claude, Gemini, and Llama - on their
ability to generate preschool-appropriate scientific explanations across
biology, chemistry, and physics. Through systematic evaluation by 30 nursery
teachers using established pedagogical criteria, we identify significant
differences in the models' capabilities to create engaging, accurate, and
developmentally appropriate content. Unexpectedly, Claude outperformed other
models, particularly in biological topics, while all LLMs struggled with
abstract chemical concepts. Our findings provide practical insights for
educators leveraging AI in early science education and offer guidance for
developers working to enhance LLMs' educational applications. The results
highlight the potential and current limitations of using LLMs to bridge the
early childhood science literacy gap.
What is the application?
Who is the user?
Who age?
Why use AI?
Study design
