I think LLMs are not synonymous with transformers and are quite a bit older. AFAIK, an LLM is just a model (any kind of large model) that predicts the next token based on the previous tokens.
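To make the "predicts the next token from previous tokens" part concrete, here is a toy sketch with no transformer involved. It is just a bigram counter, so obviously nowhere near "large", and the names `train_bigram` and `predict_next` plus the tiny corpus are made up for illustration:

```python
from collections import Counter, defaultdict

def train_bigram(tokens):
    """Count, for each token, which tokens follow it in the training text."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(tokens, tokens[1:]):
        counts[prev][nxt] += 1
    return counts

def predict_next(counts, prev):
    """Return the most frequent follower of `prev`, or None if it was never seen."""
    followers = counts.get(prev)
    if not followers:
        return None
    return followers.most_common(1)[0][0]

# Illustrative corpus; a real model would be trained on vastly more text.
corpus = "the cat sat on the mat and the cat slept".split()
model = train_bigram(corpus)
print(predict_next(model, "the"))  # prints "cat" (follows "the" twice, "mat" once)
```

Scale the context and the parameter count up far enough and you get the same kind of next-token predictor, whatever architecture sits underneath.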
Yes, it was called Watson, but I don't think the technology they used back then is related, so it probably wasn't an LLM at the time. "Watson" has since just become a brand name for IBM.