If I'm going about my day and a voice in my ear gives me context about my surroundings, identifies people and objects, answers questions, tells me which direction I need to walk in, records and live streams my POV and more, that's AR. The experience doesn't always have to be visual.
Imo whether you overlay an image on your eyes or if they just do binocular camera passthrough, they could both count for VR. It's just most phones can't do binocular passthrough due to the lack of appropriately spaced cameras.
This is relevant for stuff like the f35 where you can look "through" the jet