Apparently OpenAI has some excellent developer relations and marketing people too. Is this guy even a programmer at all? His bio says "WSJ best selling novelist, Edgar & Thriller Award finalist, star of Shark Week, A&E Don’t Trust Andrew Mayne, creative applications and Science Communicator at OpenAI." so maybe not? This blog seems to have useful OpenAI related information, it's odd that it's on this guy's personal blog instead of the OpenAI website.
This morning I feel oddly compelled to play the fool so here are some near/medium term thoughts on where this may be going (worth less than what you paid for them):
1. The most important ChatGPT plugin is going to end up being the one that invokes itself recursively. The autoregressive approach seems to severely limit what these models can do by limiting their ability to think without speaking. A few months ago I thought the obvious fix was to train the model to emit special "bracket" tokens that the driver would delete once the inner thought completed, leaving only a "result" section, but GPT-as-a-GPT-plugin effectively does the same thing.
2. Whilst the first big win from the plugin will be "sub-thoughts", the next will be training it to dispatch multiple sub-thoughts in parallel. GPT already knows how to break a complex problem down into steps, but is still constrained by context window size and inference speed. Once it is taught how to split a problem up so that multiple independent inference sessions can work on it in parallel, it'll become feasible to make requests like "Build me a video game from scratch using Unreal Engine, set in the world of Harry Potter, about the adventures of a character named X", and it'll end up dispatching a massive tree of GPT sub-instances that work on the independent parts: character generation, writing the Unreal C++, prompting Midjourney and so on.
Parallel recursive LLMs are going to be much more awesome than current LLMs, and I mean that in both senses of the word (cool, awe-inspiring). In particular, this will allow us to pose questions like "How can we cure cancer?".
3. OpenAI needs a desktop app, pronto. Whilst the cloud model can take you some way, the most valuable data is locked behind authentication screens. The cloud approach faces difficult institutional barriers, because data access inside organizations is oriented around granting permissions to individuals, even when they work in teams. Giving a superbrain superuser access doesn't fit well with that, because there's no robust way to stop the AI immediately blabbing business secrets or PII to whoever tickles it in the right way. That's one reason the current wave of AI startups is focused on open source technical docs and the like. If ChatGPT is given tool access via a desktop app running on the end user's computer, it can access data using the same authentication tokens issued to individuals. This also neatly solves the question of who is accountable for mistakes: it's the user who runs the app.
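The "bracket token" driver from point 1 can be sketched in a few lines. This is a toy, not any real OpenAI API: the `<<think: ...>>` bracket syntax, the `run` driver, and the `toy_model` function are all invented stand-ins for a model that delegates sub-thoughts to itself.

```python
# Minimal sketch of recursive self-invocation: the driver expands each
# "bracket" the model emits into an inner inference session, then splices
# only the inner result back into the transcript.
import re

SUB_THOUGHT = re.compile(r"<<think:\s*(.*?)>>")

def run(prompt, model, depth=0, max_depth=3):
    """Call `model` on `prompt`, recursively expanding any sub-thought
    brackets it emits, so the final output contains only the results."""
    output = model(prompt)
    if depth >= max_depth:
        return SUB_THOUGHT.sub("", output)  # recursion limit: drop brackets

    def expand(match):
        # Each bracket becomes its own inner inference session.
        return run(match.group(1), model, depth + 1, max_depth)

    return SUB_THOUGHT.sub(expand, output)

# Toy "model" that delegates arithmetic to a sub-thought.
def toy_model(prompt):
    if prompt == "What is 2+3, doubled?":
        return "The answer is <<think: add 2 and 3>> doubled."
    if prompt == "add 2 and 3":
        return "5"
    return prompt

print(run("What is 2+3, doubled?", toy_model))
# prints: The answer is 5 doubled.
```

The key property is that the intermediate "thinking" never reaches the user: only the spliced-in results survive, which is exactly what the deleted-bracket scheme was meant to achieve.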
For some reason I've never seen the idea of auto-recursive prompting in any of the papers or discussions. It makes so much sense. It can also help with model and compute size: instead of using the large model to, say, count the primes less than 1000, it can prompt GPT-3 to do it, then send the result back to GPT-4. Sounds quite feasible to implement too!
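The routing idea above might look something like this. The dispatch table is hypothetical (in practice the cheap worker would be a call to a smaller model or a tool, not a local function), but the prime-counting subtask itself is real and cheap:

```python
# Sketch: a "big" model hands mechanical subtasks to cheap workers and only
# aggregates results, instead of burning expensive inference on them.

def primes_below(n):
    """Sieve of Eratosthenes - the kind of mechanical subtask worth
    offloading from an expensive model to a cheap worker."""
    sieve = [True] * n
    sieve[:2] = [False, False]
    for i in range(2, int(n ** 0.5) + 1):
        if sieve[i]:
            sieve[i * i::i] = [False] * len(sieve[i * i::i])
    return [i for i, is_prime in enumerate(sieve) if is_prime]

# Hypothetical dispatch table: subtask name -> cheap worker.
WORKERS = {"count_primes_below": lambda n: len(primes_below(n))}

def dispatch(task, arg):
    # The large model would emit (task, arg); the driver routes it here
    # and returns only the compact result for the big model's context.
    return WORKERS[task](arg)

print(dispatch("count_primes_below", 1000))  # prints 168
```

Note the big model's context window only ever sees "168", not the thousand-element intermediate list, which is the compute and context saving being described.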
Original author here. I'm a programmer. I started on the Applied team at OpenAI back in 2020 as a prompt engineer (I helped create many of the examples in the GPT-3 docs.) I became the Science Communicator for OpenAI in 2021.
My blog audience is very non-technical so I write very broadly. We've been super busy with the launch of GPT-4 and Plugins (I produced the video content, found examples, briefed media on technical details, etc.) so I was only able to grab a few hours to put these demos together.
As far as the ChatGPT prompts go, I included a few, but they're just simple instructions. Unlike GPT-3.5, where I'd spend an hour or more getting the right instruction to do zero-shot app creation, GPT-4 just gets it.
Wow, you learned programming specifically to work with AI? That is an inspiring level of flexibility in skills and self-identification. Perhaps many of us will need to learn how to do that sort of reinvention sooner, rather than later.
Not desktop at all. I'm focused on it operating its own computing resources using the recursive approach. I call it the multi-agent LLM approach. This way it can break down a complex task into components and attack each component in parallel or sequentially as it needs.
I'm not a researcher at all, but a practitioner with extensive quantitative development experience in an applied industry setting using ML tools.
I’ve been thinking that taking this up a level is more a systems architecture problem. The core LLM model is so incredibly flexible and powerful that what I’m working on is the meta application of that tool and giving it the ability to use itself to solve complex problems in layers.
Hopefully that makes sense. I already have a fairly extensive and detailed systems architecture design.
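The parallel-or-sequential breakdown described above could be sketched as a staged plan runner. Everything here is illustrative: the agents are plain functions standing in for LLM calls, and the stage/plan structure is one guess at such an architecture, not the poster's actual design.

```python
# Toy multi-agent plan runner: stages run sequentially, but the components
# inside a stage are independent and run in parallel.
from concurrent.futures import ThreadPoolExecutor

def run_plan(plan, agents):
    """plan: a list of stages; each stage lists component names that can run
    concurrently. Each stage sees the results gathered so far."""
    results = {}
    with ThreadPoolExecutor() as pool:
        for stage in plan:
            futures = {name: pool.submit(agents[name], dict(results))
                       for name in stage}
            for name, fut in futures.items():
                results[name] = fut.result()
    return results

# Hypothetical decomposition of a large task (names invented for illustration).
agents = {
    "design":     lambda ctx: "world outline",
    "characters": lambda ctx: f"cast based on {ctx['design']}",
    "code":       lambda ctx: f"engine code for {ctx['design']}",
    "assemble":   lambda ctx: f"game = {ctx['characters']} + {ctx['code']}",
}
plan = [["design"], ["characters", "code"], ["assemble"]]
print(run_plan(plan, agents)["assemble"])
```

The dependency structure is the interesting part: "characters" and "code" only depend on "design", so they run in parallel, while "assemble" waits for both.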
4. Sandbox engineering is the new black.