working to run the agent in the lane: - Fixed Q-learn algorithm with dictionary - fixed car params: v, w, pose - fixed board windows