ChatGPT o1: https://chatgpt.com/share/678feedb-0b2c-8001-bd77-4e574502e4fc > Tho...

scotty79 · 2025-01-21T19:34:24 1737488064

I think firmly marrying llms with symbolic math calculator/database, so they can check things they don't really know "by heart" would go a long way towards making them seem smart.

I really hope Wolfram is working on LLM that is trying to learn what it means to be WolframAlpha user.

bongodongobob · 2025-01-21T23:06:35 1737500795

Can we stop with the "haha llms can't do math" nonsense? You'll one shot it every time if you tell it to use Python. You're holding it wrong.

dchichkov · 2025-01-22T00:04:17 1737504257

Sorry, but this was ChatGPT/o1 with access to code execution (Python) and it used almost 4 minutes to do reasoning. It had done a few checks with smaller numbers, all of which had failed. And it proceeded to make a wrong conclusion (with high confidence).

bongodongobob · 2025-01-22T18:27:24 1737570444

Of course it failed. Tell it to write a program.