I expect this kind of tool will be built into Google/Apple/Microsoft ecosystem within a year with at least GPT-4o level capabilities. So I'd say... procrastinate on building something like this?
Though I feel like it's not too hard to do something that searches a folder, uploads all the images to GPT-4o and returns the results. It won't just search text inside images, it can do things like identify which images have a product with sugar in them.
Though I feel like it's not too hard to do something that searches a folder, uploads all the images to GPT-4o and returns the results. It won't just search text inside images, it can do things like identify which images have a product with sugar in them.