Journal of Applied Science and Engineering

Published by Tamkang University Press

1.30

Impact Factor

2.10

CiteScore

Guangning QinThis email address is being protected from spambots. You need JavaScript enabled to view it.

School of Music and Dance, Zhengzhou University of Science and Technology, Zhengzhou 450064 China


 

 

Received: April 19, 2025
Accepted: June 6, 2025
Publication Date: June 28, 2025

 Copyright The Author(s). This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are cited.


Download Citation: ||https://doi.org/10.6180/jase.202603_29(3).0009  


Students behaviors can directly reflect the quality of the classroom. Analyzing and evaluating classroom behaviors through artificial intelligence and deep learning is conducive to improving teaching quality. The traditional methods for identifying students classroom behaviors involve that teachers directly observe students states or analyze them through surveillance videos after class. These methods are time-consuming, labor intensive, and have a low recognition rate, making it difficult to reflect the problems existing in the classroom and exams in real time. To solve this problem, this paper proposes a novel students classroom behaviors based on YOLOv8 deep learning model. Combining the channel attention mechanism with deep convolution, a dynamic channel attention convolution (DCAConv) is proposed, which can dynamically adjust the channel weights and capture key features more sensitively. It introduces multi-scale convolutional attention (MSCA) to maximize the ability of mining multi-scale convolutional features through element multiplication, and enhance the attention to spatial details. Meanwhile, a multi-scale context fusion (MSCF) module is constructed. Through convolution and self-attention mechanism, multi-scale feature fusion is enhanced. Adding a small target detection layer and extracting local features from larger-sized feature maps significantly improves the ability to recognize the behaviors of students in the back row. The experimental results show that the average recognition accuracy rate of the proposed behavior recognition method for various parts of the human body can reach up to 83.7% at most. The recognition rates for various behaviors in simple and crowded scenarios reach more than 92.1% and 86.3% respectively, and it can effectively recognize various behaviors in the classroom.


Keywords: Behavior analysis, YOLOv8, deep learning, channel attention mechanism, multi-scale context fusion


  1. [1] M. Imran and N. Almusharraf, (2024) “Google Gemini as a next generation AI educational tool: a review of emerging educational technology" Smart Learning Environments 11(1): 22. DOI: 10.1186/s40561-024 00310-z.
  2. [2] A. Wilson, R. Kask, and L. W. Ming,(2024) “Exploring circular digital economy strategies for sustainable environ mental, economic, and educational technology" International Transactions on Education Technology (ITEE) 2(2): 129–139. DOI: 10.33050/itee.v2i2.579.
  3. [3] M. Jaboob, M. Hazaimeh, and A. M. Al-Ansi, (2025) “Integration of generative AI techniques and applications in student behavior and cognitive achievement in Arab higher education" International journal of human computer interaction 41(1): 353–366. DOI: 10.1080/10447318.2023.2300016.
  4. [4] C.-Y. Wang, I.-H. Yeh, and H.-Y. Mark Liao. “Yolov9: Learning what you want to learn using programmable gradient information”. In: European conference on computer vision. Springer. 2024, 26286 26296. DOI: 10.1109/CVPR52733.2024.02484.
  5. [5] J. Yu, H. Li, S.-L. Yin, and S. Karim, (2020) “Dynamic gesture recognition based on deep learning in human-to computer interfaces" Journal of Applied Science and Engineering 23(1): 31–38. DOI: 10.6180/jase.202003_ 23(1).0004.
  6. [6] Z. Cao, G. Hidalgo, T. Simon, S. -E. Wei, and Y. Sheikh, (2019) “Openpose: Realtime multi-person 2d pose estimation using part affinity fields" IEEE transactions on pat tern analysis and machine intelligence 43(1): 172 186. DOI: 0.1109/TPAMI.2019.2929257.
  7. [7] Q. Li, L. Xu, and X. Yang, (2022) “2D multi-person pose estimation combined with face detection" International Journal of Pattern Recognition and Artificial Intelligence 36(02): 2256002. DOI: 10.1142/S021800142256002X.
  8. [8] C.-T. Lu, Y.-C. Liu, and Y.-C. Pan, (2024) “An intelligent playback control system adapted by body movements and facial expressions recognized by OpenPose and CNN" Multimedia Tools and Applications 83(10): 31139 31160. DOI: 10.1007/s11042-023-16880-y.
  9. [9] H.-S. Fang, S. Xie, Y.-W. Tai, and C. Lu. “ Rmpe: Regional multi-person pose estimation”. In: Proceedings of the IEEE international conference on computer vision. 2017, 2334–2343. DOI: 10.1109/ICCV.2017.256.
  10. [10] S. Yin, H. Li, A. A. Laghari, L. Teng, T. R. Gadekallu, and A. Almadhor, (2024) “FLSN-MVO: Edge Comput ing and Privacy Protection Based on Federated Learning Siamese Network With Multi-Verse Optimization Algorithm for Industry 5.0" IEEE Open Journal of the Communications Society: DOI: 10.1109/OJCOMS.2024.3520562.
  11. [11] C. Yongqing and L. Dan, (2024) “Optimization of Stu dent Behavior Detection Algorithm Based on Improved SSD Algorithm." International Journal of Advanced Computer Science & Applications 15(5): DOI: 10. 14569/ijacsa.2024.0150512.
  12. [12] C. Pabba, V. Bhardwaj, and P. Kumar, (2024) “A visual intelligent system for students’ behavior classification using body pose and facial features in a smart classroom" Multimedia Tools and Applications 83(12): 36975 37005. DOI: 10.1007/s11042-023-16388-5.
  13. [13] Q. Jia and J. He, (2024) “Student Behavior Recognition in Classroom Based on Deep Learning" Applied Sciences 14(17): 7981. DOI: 10.3390/app14177981.
  14. [14] F. Xie, B. Lin, and Y. Liu, (2022) “Research on the coordinate attention mechanism fuse in a YOLOv5 deep learning detector for the SAR ship detection task" Sensors 22(9): 3370. DOI: 10.3390/s22093370.
  15. [15] X. Wang, H. Gao, Z. Jia, and Z. Li, (2023) “BL YOLOv8: An improved road defect detection model based on YOLOv8" Sensors 23(20): 8361. DOI: 10.3390/s23208361.
  16. [16] J. Luo, (2024) “Dynamical behavior analysis and soli ton solutions of the generalized Whitham–Broer–Kaup Boussineq–Kupershmidt equations" Results in Physics 60: 107667. DOI: 10.1016/j.rinp.2024.107667.
  17. [17] M. Dang, G. Liu, Q. Xu, K. Li, D. Wang, and L. He, (2024) “Multi-object behavior recognition based on object detection for dense crowds" Expert Systems with Ap plications 248: 123397. DOI: 10.1016/j.eswa.2024. 123397.
  18. [18] J. Yu, G. Wang, X. Li, Z. Du, W. Xu, M. Akhter, and D. Li, (2025) “SDYOLO-Tracker: An efficient multi-fish hypoxic behavior recognition and tracking method" Computers and Electronics in Agriculture 232: 110079. DOI: 10.1016/j.compag.2025.110079.


    



 

2.1
2023CiteScore
 
 
69th percentile
Powered by  Scopus

SCImago Journal & Country Rank

Enter your name and email below to receive latest published articles in Journal of Applied Science and Engineering.