Explaining Overthinking In Multi-Scale Dense Networks