Privileged training frameworks for partially observable reinforcement learning