Abstract: This paper proposes a highly sustainable and scalable integrated AI-native architecture defining UNified archITecture for Open RAN-enabled Distributed, Scalable and SustainabilitY-enhanced ...
Change: We will modify the reward for touching the ball and the existential (per-step) reward/penalty so that both grow in magnitude as the game progresses. This is intended to make the agent more aggressive and strategic later in the episode.
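One way such a time-scaled reward could look is sketched below. This is a minimal illustration, not the actual implementation: the names `RewardConfig`, `progress_scale`, and `shaped_reward`, the linear schedule, and all numeric values are assumptions chosen for the example. It also assumes the episode has a known, positive maximum length.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RewardConfig:
    # All values here are illustrative assumptions, not taken from the change note.
    touch_reward: float = 1.0          # base reward for touching the ball
    existential_reward: float = -0.01  # base per-step reward (negative = penalty)
    max_scale: float = 3.0             # multiplier reached at the final step


def progress_scale(step: int, max_steps: int, max_scale: float) -> float:
    """Grow a multiplier linearly from 1.0 at step 0 to max_scale at max_steps."""
    # Clamp progress to [0, 1] so out-of-range steps cannot over- or under-scale.
    progress = min(max(step / max_steps, 0.0), 1.0)
    return 1.0 + (max_scale - 1.0) * progress


def shaped_reward(touched_ball: bool, step: int, max_steps: int,
                  cfg: RewardConfig = RewardConfig()) -> float:
    """Return the per-step reward with both terms scaled by episode progress."""
    scale = progress_scale(step, max_steps, cfg.max_scale)
    reward = cfg.existential_reward * scale   # applied on every step
    if touched_ball:
        reward += cfg.touch_reward * scale    # applied only on ball contact
    return reward
```

With these assumed defaults, a ball touch on the last step is worth three times as much as one on the first step, and the per-step penalty triples over the same span. A linear schedule is only one option; a stepwise or exponential schedule would fit the same description.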