To implement high-speed wireless communication from the deep ocean to the sea surface leveraging multiple autonomous underwater vehicles (AUVs) with underwater wireless optical communication (UWOC) technique is an emerging and promising technology that enables real-time data collection for accurate underwater exploration and monitoring, e.g., coordinated moving targets tracking. However, multi-hop UWOC is more susceptible to beam misalignment and positional uncertainty caused by external interference in the harsh marine environments. To address these challenges, we design a cooperative movement scheme for multiple AUVs based on deep reinforcement learning (DRL) approach to realize robust and reliable optical communication under mobile target tracking scenarios. In this scheme, we first model the optical channel with optical noise and then analyze the link performance of multiple AUVs to meet the bit error rate (BER) requirements. Afterwards, we map the cooperative optical communication problem to a Markov decision process (MDP) by incorporating the extended Kalman filter (EKF) technique to enhance effective communication. Finally, we propose a cooperative control strategy for multiple mobile AUVs based on deep deterministic policy gradient (DDPG). Through extensive simulations, it is demonstrated that the proposed algorithm is effective in achieving reliable underwater optical communication under mobile scenarios.