I know that GPT-4o is fairly poor to recognize music sheets and notes.
Totally off the marks, more often than not, even the first note is not recognize on a first week solfège book.
So unless I missed something but as far as I am concerned, they are optimized for benchmarks.
So while I enjoy gen AI, image-to-text is highly subpart.
So unless I missed something but as far as I am concerned, they are optimized for benchmarks.
So while I enjoy gen AI, image-to-text is highly subpart.