Classroom observation — one of the most effective methods for teacher development — remains limited due to high costs and a shortage of expert coaches.
We present ClassMind, an AI-driven classroom observation system that integrates generative AI and multimodal learning to analyze classroom artifacts (e.g., class recordings) and deliver timely, personalized feedback aligned with pedagogical practices.
At its core is AVA-Align, an agent framework that analyzes long classroom video recordings to generate temporally precise, best-practice-aligned feedback to support teacher reflection and improvement.
Our three-phase study involved participatory co-design with educators, development of a full-stack system, and field testing with teachers at different stages of practice. Teachers highlighted the system’s usefulness, ease of use, and novelty, while also raising concerns about privacy and the role of human judgment, motivating deeper exploration of future human–AI coaching partnerships. This work illustrates how multimodal AI can scale expert coaching and advance teacher development.
Article: ClassMind: Scaling Classroom Observation and Instructional Feedback with Multimodal AI (arXiv, Sep 2025)
Key point: A multimodal AI system generates timely, personalized feedback aligned with pedagogical practices by analyzing classroom recordings, aiding teacher reflection and instructional improvement.
Quote: Teachers reported the system was useful, easy to use, and novel for professional reflection.
🔗 https://arxiv.org/abs/2509.18020