Autonomous navigation in unstructured outdoor environments is inherently challenging due to the presence of asymmetric traversal costs, such as varying energy expenditures for uphill versus downhill movement.
Traditional reinforcement learning methods often assume symmetric costs, which can lead to suboptimal navigation paths and increased safety risks in real-world scenarios.
In this paper, we introduce QuasiNav, a novel reinforcement learning framework that integrates quasimetric embeddings to explicitly model asymmetric costs and guide efficient, safe navigation. QuasiNav formulates the navigation problem as a constrained Markov decision process (CMDP) and employs quasimetric embeddings to capture directionally dependent costs, allowing for a more accurate representation of the terrain.
This approach is combined with adaptive constraint tightening within a constrained policy optimization framework to dynamically enforce safety constraints during learning.
We validate QuasiNav across three challenging navigation scenarios:
Experimental results show that QuasiNav significantly outperforms conventional methods, achieving higher success rates, improved energy efficiency, and better adherence to safety constraints.
@inproceedings{hossain2025quasinav,
author = {Hossain, Jumman and Faridee, Abu-Zaher and Asher, Derrik and Freeman, Jade and Trout, Theron and Gregory, Timothy and Roy, Nirmalya},
title = {QuasiNav: Learning Optimal Policies with Quasi-Potential Functions for Asymmetric Traversal},
booktitle = {Proceedings of the IEEE International Conference on Robotics and Automation (ICRA)},
year = {2025},
pages = {TBD},
publisher = {IEEE}
}