arXiv: Safe Reinforcement Learning for Strategic Bidding of Virtual Power Plants in Day-Ahead Markets