there is actually a paper by OpenAI themselves on summarizing long documents.
essentially, you break a longer text into smaller chunks and run a multi-stage sequential summarization: each chunk uses a trailing window of the previous chunk's summary as context, and this runs recursively.
https://arxiv.org/abs/2109.10862
did a rough implementation myself, and it works well even for articles of 20k tokens. but it's kind of slow because of all the additional overlapping runs required (and more costly).
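A minimal sketch of that recursive, trailing-window approach (not the paper's actual implementation). It assumes a hypothetical `summarize(prompt)` helper that wraps whatever completion API you use; chunk sizes are in characters here for simplicity, though in practice you'd count tokens:

```python
from typing import Callable, List


def chunk_text(text: str, chunk_size: int) -> List[str]:
    """Naive fixed-size chunking; a real version would split on token boundaries."""
    return [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]


def sequential_summarize(text: str, summarize: Callable[[str], str],
                         chunk_size: int = 4000, overlap: int = 500) -> str:
    """Summarize chunks in order, feeding a trailing window of the previous
    chunk's summary into each call; recurse on the joined summaries until
    everything fits in a single call."""
    summaries = []
    prev_tail = ""
    for chunk in chunk_text(text, chunk_size):
        prompt = (f"Context from earlier in the document:\n{prev_tail}\n\n"
                  f"Summarize the following section:\n{chunk}")
        summary = summarize(prompt)
        summaries.append(summary)
        prev_tail = summary[-overlap:]  # trailing window carried forward
    combined = "\n".join(summaries)
    if len(combined) <= chunk_size:
        return summarize(f"Combine into one summary:\n{combined}")
    return sequential_summarize(combined, summarize, chunk_size, overlap)
```

The overlapping context calls are exactly where the extra latency and cost come from: every chunk pays for its own summary plus the carried-over tail.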
A technique I have had success with is to do it in multiple passes.
Map-reduce it with overlapping sections, then propagate the results back down and repeat the process; on the second pass, each map-reduce node knows the context it's operating in and can summarize more salient details.
Concretely, on the first pass, your leaf nodes are given a prompt like "The following is lines X-Y of a Z length article. Output a 1 paragraph summary."
You then summarize those summaries, etc. But then you can propagate that info back down for a second pass, so in the second pass, your leaf nodes are given a prompt like "The following is lines X-Y of a Z length article. The article is about <topic>. The section before line X is about <subtopic>. The section after Y is about <subtopic>. Output a 1 paragraph summary that covers details most relevant to this article in the surrounding context."
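A rough sketch of that two-pass scheme (my reading of the comment, not the commenter's code), again assuming a hypothetical `summarize(prompt)` helper. Pass 1 is a plain map-reduce; pass 2 re-summarizes each leaf with the global summary and its neighbors' first-pass summaries as context:

```python
from typing import Callable, List


def two_pass_summarize(sections: List[str],
                       summarize: Callable[[str], str]) -> List[str]:
    """Pass 1: context-free leaf summaries reduced to a global summary.
    Pass 2: re-summarize each leaf, now informed by the surrounding context."""
    # Pass 1: map (one summary per section), then reduce (global summary).
    first = [summarize(f"Section {i + 1} of {len(sections)}:\n{s}\n"
                       "Output a 1 paragraph summary.")
             for i, s in enumerate(sections)]
    global_summary = summarize("Summarize these section summaries:\n"
                               + "\n".join(first))
    # Pass 2: propagate the reduced context back down to each leaf.
    second = []
    for i, s in enumerate(sections):
        before = first[i - 1] if i > 0 else "(start of article)"
        after = first[i + 1] if i < len(sections) - 1 else "(end of article)"
        prompt = (f"The article is about: {global_summary}\n"
                  f"The section before is about: {before}\n"
                  f"The section after is about: {after}\n"
                  f"Output a 1 paragraph summary of this section, covering "
                  f"details most relevant in the surrounding context:\n{s}")
        second.append(summarize(prompt))
    return second
```

For a deeper tree you'd recurse on the second-pass summaries, but one extra pass already gives each leaf the global topic it was missing.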
Could you expand on this? Is the idea to embed paragraphs (or some other arbitrary subsection) of text, and then semantic search for the most relevant paragraphs, and then only summarize them?
Yes, that's exactly right, but it presumes you know what to look for and what you want in your summary. Our use case is to pick out action items or next steps from meeting notes, so this can work. But not for all use cases, e.g. "summarize this paper."
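The retrieval step could look something like this: a sketch assuming a hypothetical `embed(text)` helper that returns an embedding vector from whatever embeddings endpoint you use. Rank paragraphs by similarity to a query like "action items and next steps", keep the top few in document order, and summarize only those:

```python
import math
from typing import Callable, List


def cosine(a: List[float], b: List[float]) -> float:
    """Cosine similarity between two vectors; 0.0 for a zero vector."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb) if na and nb else 0.0


def select_relevant(paragraphs: List[str], query: str,
                    embed: Callable[[str], List[float]],
                    top_k: int = 5) -> List[str]:
    """Rank paragraphs by embedding similarity to the query and keep
    the top_k, restored to their original document order."""
    qv = embed(query)
    ranked = sorted(range(len(paragraphs)),
                    key=lambda i: cosine(embed(paragraphs[i]), qv),
                    reverse=True)
    keep = sorted(ranked[:top_k])  # document order, not score order
    return [paragraphs[i] for i in keep]
```

Only the selected paragraphs then go to the summarization prompt, which is why this works for targeted extraction but not for open-ended "summarize this paper" requests.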
Agreed; you can try sending it in chunks, but then you lose context. Perhaps the ChatGPT-based API will help if they expose conversational memory as a feature.
Maybe OP has figured out a method with the current API?
I saw in another thread that people were working around this by asking for a summary of sections and then combining the summaries and asking for a joint summary.
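That workaround is a single map-reduce pass; a minimal sketch, again with a hypothetical `summarize(prompt)` helper standing in for the API call:

```python
from typing import Callable, List


def chunked_summary(sections: List[str],
                    summarize: Callable[[str], str]) -> str:
    """Summarize each section independently, then ask for one joint
    summary of the concatenated section summaries."""
    parts = [summarize(f"Summarize:\n{s}") for s in sections]
    return summarize("Combine these section summaries into one summary:\n"
                     + "\n\n".join(parts))
```

The known weakness is the one mentioned above: each section summary is produced without knowing what the rest of the document is about.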
This is an issue. I haven't experimented to see if there are workarounds, so the service currently checks the length of the article text and, if it's very long, sends only a portion; otherwise we'd exceed the token limit. There's a note on the front page about it: "Limitations: The OpenAI API does not allow submission of large texts, so summarization may only be based on a portion of the whole article."
I tried; they don't. It seems that when they were ranking #1 on HN yesterday, someone posted a summary (top comment) of what they're for that isn't quite correct.
Can't find it for some reason, can you provide a link? Did they summarize with GPTSimpleVectorIndex or GPTListIndex? GPTSimpleVectorIndex is in get-started examples and is cheaper, but it provides worse results.