Advancing Packet-Level Traffic Predictions With Transformers