Mendoza Barrionuevo, AlejandroYanes Luis, SamuelGutiƩrrez Reina, DanielToral, S. L.2025-10-172025-10-172025-05-28Mendoza Barrionuevo, A., Yanes Luis, S., GutiƩrrez Reina, D. y Toral, S.L. (2025). Optimizing Plastic Waste Collection in Water Bodies Using Heterogeneous Autonomous Surface Vehicles with Deep Reinforcement Learning. IEEE Robotics and Automation Letters, 10 (5), 4930-4937.https://doi.org/10.1109/LRA.2025.3555940.2377-37662377-3774https://hdl.handle.net/11441/177770This letter presents a model-free deep reinforcement learning framework for informative path planning with heterogeneous fleets of autonomous surface vehicles to locate and collect plastic waste. The system employs two teams of vehicles: scouts and cleaners. Coordination between these teams is achieved through a deep reinforcement approach, allowing agents to learn strategies to maximize cleaning efficiency. The primary objective is for the scout team to provide an up-to-date contamination model, while the cleaner team collects as much waste as possible following this model. This strategy leads to heterogeneous teams that optimize fleet efficiency through inter-team cooperation supported by a tailored reward function. Different trainings of the proposed algorithm are compared with other state-of-the-art algorithms in three distinct scenarios, one with moderate convexity, another with narrow corridors and challenging access, and the last one larger, more complex and with more difficult to access shape. According to the obtained results, it is demonstrated that deep reinforcement learning based algorithms outperform baselines, exhibiting superior adaptability. In addition, training with examples of actions from other algorithms further improves performance, especially in scenarios where the search space is larger.application/pdf8 p.engAttribution 4.0 Internationalhttp://creativecommons.org/licenses/by/4.0/Autonomous agentsReinforcement learningEnvironment monitoring and managementHeterogeneous multirobot systemsInformative path planningOptimizing Plastic Waste Collection in Water Bodies Using Heterogeneous Autonomous Surface Vehicles with Deep Reinforcement Learninginfo:eu-repo/semantics/articleinfo:eu-repo/semantics/openAccesshttps://doi.org/10.1109/LRA.2025.3555940