Exactly. Give us access the model and let independant researchers test it. OpenAI did this with GPT4, opening access publicly and deeper access to researchers within and outside of Microsoft.
I simply don't believe the model is that good. Otherwise, maybe try to compete with OpenAI directley?
Wonder why they're not just giving us access, if it's indeed so good? Seems it's just to generate some noise and hype around Gemini. Hardly believable after the previous faked demo, as someone already said.
Google faces a different calculus than Microsoft/OpenAI when throwing these things out. It's just like Google Cloud. They have huge, valuable first-party workloads that compete for the hardware resources that would be used by generally-available free AI toys.
For Microsoft it doesn't make a difference. They are taking their own cash, investing it in OpenAI, and then turning right around and booking it as revenue. As a bonus it makes Google look wrong-footed. But fundamentally Microsoft doesn't care how much money they torch doing this.
Even the demo now is careful to show curated but possible things now. they learned their lesson.
The code changes are the most common tutorials you can find on the web. Adding a speed slider, the terrain tutorials are literary called "height maps" and focus on making it taller or flatter.
To be fair, they mostly faked the near instantaneous, real-time flow of the conversations. The answers were, as far as I know, legit. But I still agree that we should be skeptical.
The prompts they used were also different than the ones given like “is this the right order” was “is this the right order, consider the distance from the sun” they put this in their post on Google dev blog.
This one seems to be super straightforward about timeliness and capabilities, but the examples might be a bit simpler than people think. This is pretty amazing but like someone else said you could achieve similar results from rag due to the lack of novelty in these questions and the fact that each dealt with pretty independent examples as opposed to using custom code developed elsewhere in the codebase.