comma does raw input to output behaviour cloning like you propose.
interestingly, tesla does not. via there last public ml update, theyre self driving architecture still involves a path generator and evaluator combo, and a lot of seperately designed sensor fusion steps. parking is done via ppo reinforcement learning i think.