Items tagged with: mathed
* The Aftermath of DrawEduMath: Vision Language Models Underperform with Struggling Students and Misdiagnose Errors
arxiv.org/abs/2603.00925
* Benchmarking the Pedagogical Knowledge of Large Language Models
arxiv.org/abs/2506.18710v1
fab-ai.org/initiatives/ai-for-…
* AI‑generated lesson plans fall short on inspiring students and promoting critical thinking
theconversation.com/ai-generat…
#AIEd #mathed #teaching #education
Benchmarking the Pedagogical Knowledge of Large Language Models
Benchmarks like Massive Multitask Language Understanding (MMLU) have played a pivotal role in evaluating AI's knowledge and abilities across diverse domains.
arXiv.org
