Escaping the State of Nature: A Hobbesian Approach to Cooperation in Multi-Agent Reinforcement Learning