AGV state monitoring is becoming increasingly important for production system safety in digital workshops. This paper develops a global and first-person view fusion to monitor the AGV (Automated Guided Vehicle) running state in the complex digital workshops. Based on the affine transformation model, the AGV running states, i.e., position, velocity and orientation are estimated by the global camera with the assistance of ArUco markers, which are pasted on the front side of the AGV. Besides, the first-person view camera is used to implement pedestrian detection and distance measurement based on the YOLOv3 model. It is noticeable that the pitch angle is considered for distance measurement, and an improved triangular model for distance measurement is proposed. The effectiveness of the proposed method has been verified in our built platform.