Marine Engineering

fa کنترل عمق عملیاتی ربات زیرسطحی AUV با روش یادگیری تقویتی (RL) Depth control of an AUV robot using reinforcement learning (RL) طراحي، هیدروديناميك و ساخت زيرسطحي Submarine Hydrodynamic & Design مقاله پژوهشي Research Paper امروزه استفاده از روشهای پیشرفته برای کنترل حرکات رباتهای زیرسطحی سبب بهبود راندمان و افزایش کیفیت عملیات آن ها شده است. در این پژوهش با استفاده از روش یادگیری تقویتی برای حرکت عمقی رباتهای AUV روشی توسعه داده شده است که براساس معیار پاداش الگوی حرکت ربات را یاد گرفته و براساس آن بهترین تصمیم را برای حرکت و کنترل سطوح کنترلی اتخاذ میکند. کنترل حرکت عمقی براساس یادگیری تقویتی سبب بهبود عملکرد ربات میگردد و بهینهترین سیگنال کنترلی را براساس شرایط لحظهای ربات و پاداشها اتخاذ میکند. در این پژوهش از مدل دینامیکی خطی حرکت پیچ برای توسعه مدل حرکت عمقی استفاده شده است. برای هر هدف مطلوب 500 مرتبه سناریو تکرار میشود تا در حین شبیهسازی ماتریس Q به روزرسانی شود. در ادامه با ارائه پاداش به هر سیگنال مقدار مطلوب مشخص میگردد. پس از پایان سناریو، با انتخاب مقدار بهینه از ماتریس Q، مقدار سیگنال کنترلی برای بالک مشخص میگردد. نتایج نشان داد که استفاده از روش یادگیری تقویتی کمک شایانی به کیفیت سیستم کنترل رباتهای AUV میکند تا جایی که مقدار فرارفت و نوسان کمی در عملکرد مشاهده شد. Nowadays, the use of advanced methods for controlling the movements of underwater robots has led to improved efficiency and enhanced operational quality. In this research, a method based on reinforcement learning has been developed for the depth control of AUV robots. This method learns the robot's movement pattern based on a reward criterion and makes the optimal decision for motion and control surface adjustments accordingly. Depth control using reinforcement learning improves the robot's performance and selects the most optimal control signal based on the robot's conditions and rewards. In this study, a linear dynamic model of pitch motion was used to develop the depth control model. For each desired state, the scenario is repeated 500 times to update the Q-matrix during simulation. Subsequently, by assigning rewards to each signal, the optimal value is determined. After completing the scenario, the optimal value from the Q-matrix is selected to determine the control signal for the fin. The results showed that the use of reinforcement learning significantly enhances the quality of the AUV robot's control system, resulting in minimal overshoot and oscillation in performance. یادگیری تقویتی,کنترل عمق,ربات AUV,Under actuated یادگیری تقویتی,کنترل عمق,ربات AUV,Under actuated 71 79 http://marine-eng.ir/browse.php?a_code=A-10-1369-6&slc_lang=fa&sid=1 ali hasanvand علی حسنوند ali.hassanvand@yahoo.com 10031947532846006137 10031947532846006137 Yes Postdoctoral پسا دکتری دانشگاه شریف Mohammad Saeed Seif محمد سعید سیف seif@sharif.edu 10031947532846006138 10031947532846006138 No professor, sharif university of technology استاد دانشگاه شریف