Umpire’s Signal Recognition in Cricket Using an Attention based DC-GRU Network

Document Type : Original Article

Authors

1 Department of Computer Science and Technology, Indian Institute of Engineering Science and Technology, Shibpur, Howrah, India

2 Computer Science Department, Prince Hussein Bin Abdullah Faculty for Information Technology, Al al-Bayt University, Mafra, Jordan

3 Department of Electrical and Computer Engineering, Lebanese American University, Byblos, Lebanon

4 Hourani Center for Applied Scientific Research, Al-Ahliyya Amman University, Amman, Jordan

5 MEU Research Unit, Middle East University, Amman, Jordan

Abstract

Computer vision has extensive applications in various sports domains, and cricket, a complex game with different event types, is no exception. Recognizing umpire signals during cricket matches is essential for fair and accurate decision-making in gameplay. This paper presents the Cricket Umpire Action Video dataset (CUAVd), a novel dataset designed for detecting umpire postures in cricket matches. As the umpire possesses the power to make crucial judgments concerning incidents that occur on the field, this dataset aims to contribute to the advancement of automated systems for umpire recognition in cricket. The proposed Attention-based Deep Convolutional GRU Network accurately detects and classifies various umpire signal actions in video sequences. The method achieved remarkable results on our prepared CUAVd dataset and publicly available datasets, namely HMDB51, Youtube Actions, and UCF101. The DC-GRU Attention model demonstrated its effectiveness in capturing temporal dependencies and accurately recognizing umpire signal actions. Compared to other advanced models like traditional CNN architectures, CNN-LSTM with Attention, and the 3DCNN+GRU model, the proposed model consistently outperformed them in recognizing umpire signal actions. It achieved a high validation accuracy of 94.38% in classifying umpire signal videos correctly. The paper also evaluated the models using performance metrics like F1-Measure and Confusion Matrix, confirming their effectiveness in recognizing umpire signal actions. The suggested model has practical applications in real-life situations such as sports analysis, referee training, and automated referee assistance systems where precise identification of umpire signals in videos is vital.

Graphical Abstract

Umpire’s Signal Recognition in Cricket Using an Attention based DC-GRU Network

Keywords

Main Subjects


  1. Oslear D. Wisden's The Laws Of Cricket: Random House; 2010.
  2. Naik BT, Hashmi MF, Bokde ND. A comprehensive review of computer vision in sports: Open issues, future trends and research directions. Applied Sciences. 2022;12(9):4429. 10.3390/app12094429
  3. Shi F, Laganiere R, Petriu E, editors. Gradient boundary histograms for action recognition. 2015 IEEE Winter Conference on Applications of Computer Vision; 2015: IEEE. 10.1109/WACV.2015.152
  4. Kong Y, Fu Y. Human action recognition and prediction: A survey. International Journal of Computer Vision. 2022;130(5):1366-401. 10.1007/s11263-022-01594-9
  5. Xin M, Zhang H, Wang H, Sun M, Yuan D. Arch: Adaptive recurrent-convolutional hybrid networks for long-term action recognition. Neurocomputing. 2016;178:87-102. 10.1016/j.neucom.2015.09.112
  6. Simonyan K, Zisserman A. Two-stream convolutional networks for action recognition in videos. Advances in neural information processing systems. 2014;27. 10.48550/arXiv.1406.2199
  7. Xiong Q, Zhang J, Wang P, Liu D, Gao RX. Transferable two-stream convolutional neural network for human action recognition. Journal of Manufacturing Systems. 2020;56:605-14. 10.1016/j.jmsy.2020.04.007
  8. Wang L, Xiong Y, Wang Z, Qiao Y, Lin D, Tang X, et al. Temporal segment networks for action recognition in videos. IEEE transactions on pattern analysis and machine intelligence. 2018;41(11):2740-55. 10.1109/TPAMI.2018.2868668
  9. Li Z, Gavrilyuk K, Gavves E, Jain M, Snoek CG. Videolstm convolves, attends and flows for action recognition. Computer Vision and Image Understanding. 2018;166:41-50. 10.1016/j.cviu.2017.10.011
  10. Ge H, Yan Z, Yu W, Sun L. An attention mechanism based convolutional LSTM network for video action recognition. Multimedia Tools and Applications. 2019;78:20533-56. 10.1007/s11042-019-7404-z
  11. Minhas RA, Javed A, Irtaza A, Mahmood MT, Joo YB. Shot classification of field sports videos using AlexNet Convolutional Neural Network. Applied Sciences. 2019;9(3):483. 10.3390/app9030483
  12. Rafiq M, Rafiq G, Agyeman R, Choi GS, Jin S-I. Scene classification for sports video summarization using transfer learning. Sensors. 2020;20(6):1702. 10.3390/s20061702
  13. Sanchez-Caballero A, de López-Diz S, Fuentes-Jimenez D, Losada-Gutiérrez C, Marrón-Romera M, Casillas-Perez D, et al. 3dfcnn: Real-time action recognition using 3d deep neural networks with raw depth information. Multimedia Tools and Applications. 2022;81(17):24119-43. 10.1007/s11042-022-12091-z
  14. Savadi Hosseini M, Ghaderi F. A hybrid deep learning architecture using 3d cnns and grus for human action recognition. International Journal of Engineering, Transactions B: Applications,. 2020;33(5):959-65. 10.5829/ije.2020.33.05B.29
  15. Kavimandan PS, Kapoor R, Yadav K. Human action recognition using prominent camera. International Journal of Engineering, Transactions B: Applications,. 2021;34(2):427-32. 10.5829/ije.2021.34.02b.14
  16. Foysal MFA, Islam MS, Karim A, Neehal N, editors. Shot-Net: A convolutional neural network for classifying different cricket shots. Recent Trends in Image Processing and Pattern Recognition: Second International Conference, RTIP2R 2018, Solapur, India, December 21–22, 2018, Revised Selected Papers, Part I 2; 2019: Springer. 10.1007/978-981-13-9181-1_10
  17. Dey A, Dutta A, Biswas S, editors. Workoutnet: A deep learning model for the recognition of workout actions from still images. 2023 3rd International Conference on Intelligent Technologies (CONIT); 2023: IEEE. 10.1109/CONIT59222.2023.10205926
  18. Dey A, Biswas S, Le D-N. Recognition of Human Interactions in Still Images using AdaptiveDRNet with Multi-level Attention. International Journal of Advanced Computer Science and Applications. 2023;14(10).
  19. Wu F, Wang Q, Bian J, Ding N, Lu F, Cheng J, et al. A survey on video action recognition in sports: Datasets, methods and applications. IEEE Transactions on Multimedia. 2022. 10.1109/TMM.2022.3232034
  20. Li W, Nie W, Su Y. Human action recognition based on selected spatio-temporal features via bidirectional LSTM. IEEE Access. 2018;6:44211-20. 10.1109/ACCESS.2018.2863943
  21. Hussain A, Hussain T, Ullah W, Baik SW. Vision transformer and deep sequence learning for human activity recognition in surveillance videos. Computational Intelligence and Neuroscience. 2022;2022. 10.1155/2022/3454167
  22. Ravi A, Venugopal H, Paul S, Tizhoosh HR, editors. A dataset and preliminary results for umpire pose detection using SVM classification of deep features. 2018 IEEE Symposium Series on Computational Intelligence (SSCI); 2018: IEEE. 10.1109/SSCI.2018.8628877
  23. Ahmad T, Wu J, Alwageed HS, Khan F, Khan J, Lee Y. Human Activity Recognition Based on Deep-Temporal Learning Using Convolution Neural Networks Features and Bidirectional Gated Recurrent Unit With Features Selection. IEEE Access. 2023;11:33148-59. 10.1109/ACCESS.2023.3263155
  24. Wickramasinghe I. Applications of machine learning in cricket: a systematic review. Machine Learning with Applications. 2022;10:100435. 10.1016/j.mlwa.2022.100435
  25. Reddy P S, Santhosh C. Multimodal Spatiotemporal Feature map for Dynamic Gesture Recognition from Real Time Video Sequences. International Journal of Engineering, Transactions B: Applications,. 2023;36(8):1440-8. 10.5829/ije.2023.36.08B.04
  26. Pan T-Y, Tsai W-L, Chang C-Y, Yeh C-W, Hu M-C. A hierarchical hand gesture recognition framework for sports referee training-based EMG and accelerometer sensors. IEEE Transactions on Cybernetics. 2020;52(5):3172-83. 10.1109/TCYB.2020.3007173
  27. Ahmad W, Munsif M, Ullah H, Ullah M, Alsuwailem AA, Saudagar AKJ, et al. Optimized deep learning-based cricket activity focused network and medium scale benchmark. Alexandria Engineering Journal. 2023;73:771-9. 10.1016/j.aej.2023.04.062
  28. Das S, Mahmud T, Islam D, Begum M, Barua A, Tarek Aziz M, et al. Deep Transfer Learning-Based Foot No-Ball Detection in Live Cricket Match. Computational Intelligence and Neuroscience. 2023;2023. 10.1155/2023/2398121
  29. Shingrakhia H, Patel H. SGRNN-AM and HRF-DBN: a hybrid machine learning model for cricket video summarization. The Visual Computer. 2022;38(7):2285-301. 10.1007/s00371-021-02111-8
  30. Raval KR, Goyani MM. A survey on event detection based video summarization for cricket. Multimedia Tools and Applications. 2022;81(20):29253-81. 10.1007/s11042-022-12834-y
  31. Nandyal S, Kattimani SL. Cricket Event Recognition and Classification from Umpire Action Gestures using Convolutional Neural Network. International Journal of Advanced Computer Science and Applications. 2022;13(6). 10.14569/IJACSA.2022.0130644
  32. Siddiqui HUR, Younas F, Rustam F, Flores ES, Ballester JB, Diez IdlT, et al. Enhancing Cricket Performance Analysis with Human Pose Estimation and Machine Learning. Sensors. 2023;23(15):6839. 10.3390/s23156839
  33. Zhang Q, Zhang Y, Shao Y, Liu M, Li J, Yuan J, et al. Boosting Adversarial Attacks with Nadam Optimizer. Electronics. 2023;12(6):1464. 10.3390/electronics12061464