Using reinforcement learning to combine Green Light Optimised Speed Advisory and responsive traffic control systems with non-autonomous vehicles: Effects of imperfect training and incomplete information