A Power-Aware Reinforcement Learning Technique for Memory Allocation in Real-time Embedded Systems