Considering that terminals can display images just fine you'll have a hard time convincing Unicode that you need arbitrary pixel sets because somehow images are plain text (emoji had prior usage in plain text and they have a lot more semantic meaning per character than just some pixels).
----
Also, I wonder if terminal lovers should maybe lobby for a set of Unicode symbols with more ‘pixels’ than the Braille 2×8.