There's definitely room to build specification builder agents, that have access to documentation and previous specifications.
The other day I was looking into adding Trusted Types in the Content-Security-Policy header, which was something new to me. In my chat with Claude I asked:
"Lets brainstorm 10 a list of ideas closely related to this so we can think of anything we might be missing on the topic to consider."
And that provided a good list of items to review to consider and expand out the sphere of thinking for the LLM.
It is an infuriatingly hard problem to have the LLM produce excellent results every single time, and have it just do everything and want it to read our mind and all the knowledge and context of a task. I think we'll make some good progress over the next few years as agentic workflows are built out to mimic out thought processes, and the cost/capability of the LLMs keeps improving.
The other day I was looking into adding Trusted Types in the Content-Security-Policy header, which was something new to me. In my chat with Claude I asked:
"Lets brainstorm 10 a list of ideas closely related to this so we can think of anything we might be missing on the topic to consider."
And that provided a good list of items to review to consider and expand out the sphere of thinking for the LLM.
It is an infuriatingly hard problem to have the LLM produce excellent results every single time, and have it just do everything and want it to read our mind and all the knowledge and context of a task. I think we'll make some good progress over the next few years as agentic workflows are built out to mimic out thought processes, and the cost/capability of the LLMs keeps improving.