
Designing the metric of success is the hard part. The magic of LLMs is that they can learn a lot from unlabeled data and a relatively simple cost function. They can be trained on the mountains of data we already have lying around, without a ton of additional labeling or clever domain-specific metrics.
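For reference, that "relatively simple cost function" is essentially next-token cross-entropy: the labels are just the input shifted by one position, so no human labeling is needed. A minimal sketch in PyTorch, assuming `model` is any network mapping token ids to vocabulary logits:

    import torch
    import torch.nn.functional as F

    def next_token_loss(model, token_ids):
        # token_ids: (batch, seq_len) integer tensor of raw text tokens.
        # Inputs are all tokens but the last; targets are the same
        # sequence shifted left by one -- the "labels" come for free.
        inputs, targets = token_ids[:, :-1], token_ids[:, 1:]
        logits = model(inputs)  # (batch, seq_len - 1, vocab_size)
        return F.cross_entropy(
            logits.reshape(-1, logits.size(-1)),  # flatten positions
            targets.reshape(-1),                  # next token at each position
        )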

If this approach doesn't generalize well, that negates a lot of the usefulness of these methods and suggests we are further from general capability than we might think.

I think the way out of this is training models on multimodal data where one of the modalities is reasoning graphs, knowledge graphs, or schematics.

We might gradually get better at constructing graphs from data like text, images, and speech. And we could build systems that correctly generate synthetic data from those graphs to serve as training data for a large multimodal model.
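To make that concrete, here's a toy sketch of the graph-to-synthetic-text direction: a knowledge graph as (subject, relation, object) triples, serialized into sentences that could be mixed into a training corpus. The triples and templates are hypothetical placeholders, not a real pipeline:

    # Each relation gets a sentence template for serialization.
    TEMPLATES = {
        "capital_of": "{s} is the capital of {o}.",
        "part_of":    "{s} is a part of {o}.",
    }

    # A tiny knowledge graph as (subject, relation, object) triples.
    graph = [
        ("Paris", "capital_of", "France"),
        ("France", "part_of", "Europe"),
    ]

    def graph_to_text(triples):
        # Emit one synthetic training sentence per edge in the graph.
        return [TEMPLATES[r].format(s=s, o=o) for s, r, o in triples]

    for sentence in graph_to_text(graph):
        print(sentence)

A real system would presumably go the other way too (extracting graphs from text, images, and speech) and use far richer verbalization than fixed templates, but the round trip is the core idea.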