Policy Optimization For Long-Term Fairness In Decision Systems