This document discusses human behavior recognition using convolutional neural networks (CNNs). It first introduces the importance of human behavior recognition and some commonly used datasets. It then discusses related works that have used techniques like CNNs, LSTM networks, and R-CNN to recognize behaviors. The document proposes using the YOLOv3 algorithm to recognize behaviors in real-time video data. It describes the YOLOv3 algorithm and how it divides images into grids to predict boundary boxes and confidence scores for object detection. The goal is to automatically recognize human behaviors from video data using a CNN-based approach like YOLOv3 without manual annotation of training data.