Time is encoded in the weights of finetuned language models (arxiv.org)
124 points by convexstrictly 11 months ago | 55 comments



By time, they’re talking about the writing style of a specific time period.

Feels like a clickbait title. Of course language model weights encode different writing styles. The fact that you can lift out a vector to stylize writing is more interesting, but that's also nothing newly discovered here. It should be obvious that this is possible, given that you can prompt ChatGPT to change its writing style.


Besides what the sibling comment said, what's most interesting (imo) is that you can manipulate the vectors like that. The fact that you can average the vectors for January and March, and get better results for February, is pretty surprising to me.
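Concretely, the interpolation being described looks something like this in weight space (a rough sketch, assuming the month-specific finetunes are saved as PyTorch state dicts with identical architectures; the filenames are illustrative, not from the paper):

    # Rough sketch of interpolating two month-specific finetunes.
    # Assumes both checkpoints are state dicts of the same architecture;
    # filenames are illustrative.
    import torch

    jan = torch.load("model_jan.pt")   # finetuned on January data
    mar = torch.load("model_mar.pt")   # finetuned on March data

    # elementwise average of the weights behaves roughly like a February model
    feb_approx = {k: (jan[k] + mar[k]) / 2 for k in jan}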


This also generalises: https://arxiv.org/abs/2302.04863


Generalizing vectors in generative models seems like an incredibly useful thing to know about, if you want to use them more effectively. Blew my mind when I saw someone demonstrate doing vector math on a GAN a couple years back to move an "input image" around the space of outputs.

Maybe this could be useful for singling out post-LLM text and generating output that excludes it.


Why would it pertain only to writing style?


Interesting that writing style works, but other reflective actions don't.

Like, "only use the the 2000 most common words of the English language" or "the response should be 500 words long".


It does work on other reflective actions; the parent is just wrong. In the paper, they specifically run the experiment on a dataset of political affiliation over time.


From the title, I was thinking "of course the neural network of the LLM is a [cause-effect] sequence of words," and thus time is encoded in each connection.




Thanks!


The X version worked fine for me. I don't know what you want to achieve by posting a link to a third-party website.


I don’t have a twitter account, and many don’t. Twitter is slow, nitter is fast. Twitter has never reliably displayed threads for me. Nitter does.


>Twitter is slow

It’s faster than it’s ever been, and seemingly without 85% of its staff. Says a lot.


And yet nitter is still faster with only 47 contributors total. Says more.


... because it is essentially a caching proxy to twitter?


So is twitter.com. Unless they aren't using a CDN lol


I think the advertiser boycott is a major contributor. Advertisements are slow...


X's reduced user base might make X faster than ever.


Some data for me:

Nitter: I get all 8 posts in the thread in 18 requests, 207 kB, 169 kB transferred.

X: I only get the first post, 128 requests, 11.45 MB, 2.11 MB transferred.


No, no it is not.


Twitter requires JS to work though.


Twitter doesn’t show replies if you are not logged in. As others have said, I also don’t have an account. So this link provides the full context. The twitter link only shows the post and no replies.


Twitter doesn't even show most recent tweets from profiles unless you are logged in now. They show a summary of the profile's activity. Nitter is great if you don't have a Twitter account.


I’m not on Twitter, and found it valuable to see the replies!


Allowing people without twitter accounts to view it.

Allowing those who would otherwise avoid twitter to view the content.


x.com links require being logged in to even read the thread.


Twitter's not very usable these days.


He said he would sink the company...


All of the links were to a third party website.


I think I like time. Though spectral, indeterminate, presently a fixture, essential moments last forever but occur daily. Why would any network encode time if it were all just a crystal vase?


what are you on?


Crystal vase


Don't worry about the crystal vase.


I gotta get me some of that


Because people have to publish papers, that's why.


Beautiful. Thoughtful. Clever. Wise. In brightness like the face of Odin, in hearing like Moo, in spring and morning most goodly delight. Doing poetic justice to itself. Bringing up crystal vases! Per-bloody-fect.


Sooo… if I’m reading this right, it’s possible to force an AI into extrapolating into the future. As in, it’ll answer as-if its training was based on data from future years.

Obviously this isn’t time travel, but more of a zeitgeist extrapolation.

I would expect that if an AI was made to answer like it’s from December 2024 it would talk a lot about the US election but it wouldn’t know who won — just that a “race is on.”

This could have actual utility: predicting trends, fads, new market opportunities, etc…


Kind of. You still need some data from the "future" to extrapolate it: In the paper, they take an LLM finetuned on 2015 political affiliation data, and add to it the difference between 2020 and 2015 Twitter data, and show that the performance is better when the new model is asked about 2020 political affiliation.

So, the LLM still needs to know about 2020 from somewhere. In a way, you teach it about the task, then separately you teach it about 2020, and this method can combine that to make it solve the task for year 2020.
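In weight space, that arithmetic looks roughly like this (a sketch under the assumption that all checkpoints are state dicts of the same architecture; the helper names and filenames are mine, not the paper's):

    # Sketch of the "task model + time vector" arithmetic described above.
    # Assumes all checkpoints share one architecture; names are illustrative.
    import torch

    def diff(a, b):
        # elementwise difference of two state dicts: a - b
        return {k: a[k] - b[k] for k in a}

    def add_delta(a, d, alpha=1.0):
        # add a scaled delta onto a state dict
        return {k: a[k] + alpha * d[k] for k in a}

    task_2015 = torch.load("affiliation_2015.pt")  # finetuned on the 2015 task data
    lm_2015   = torch.load("twitter_lm_2015.pt")   # finetuned on 2015 Twitter text
    lm_2020   = torch.load("twitter_lm_2020.pt")   # finetuned on 2020 Twitter text

    time_vector = diff(lm_2020, lm_2015)             # "what changed between 2015 and 2020"
    task_2020   = add_delta(task_2015, time_vector)  # 2015 task model shifted toward 2020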


nah, this is not what they're talking about.


I don’t think it’d be nearly as accurate as purpose built future predictors.

LLMs aren’t a silver bullet for everything.


Ah, the bitter lesson rears its ugly head.


> LLMs aren’t a silver bullet for everything.

Please explain this to my Product org.


lolol


Maybe less zeitgeist, but it would be really interesting to see what extrapolated future writing styles would look like.


Here ya go: lorizzle.nl


Can someone ELI5 this?


A vector is a position in a dimensional space. In 2D space a vector is a point (x, y) like (1, 3) or (-2.5, 7.39). We can also do simple math on vectors like addition: (1, 3) + (2, -1) = (3, 2).

LLMs treat language as combinations of vectors of a very high dimension -- (x, y, z, a, b, c, d, ...). The neat thing is that we can combine these just like the 2D vectors and get meaningful results. If we take the vector for "King", subtract "Man", and add "Woman", we get a vector close to the one for "Queen"!

Once you know this, you can extrapolate and look for ways to categorize groups of vectors and combine them in new ways. As I read it, this research is about finding the vector weights for text from specific time periods -- e.g. January of 2021 -- and comparing them to the vectors for text from a different period -- e.g. March of 2021. It seems that all the operations are still meaningful; you can even do something like averaging the January and March vectors and getting ones that look like February's!
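A toy numpy version of that arithmetic, just to make the operations concrete (the vectors here are made up; in a real model they would come from the embedding or weight space):

    # Toy illustration only: the numbers are invented, the operations are the point.
    import numpy as np

    king  = np.array([0.9, 0.1])
    man   = np.array([0.7, 0.0])
    woman = np.array([0.1, 0.4])
    queen_ish = king - man + woman        # lands near wherever "queen" lives

    january = np.array([1.0, 3.0])
    march   = np.array([3.0, 1.0])
    february_ish = (january + march) / 2  # averaging interpolates between the two months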


Well, I think this could become one of the most underestimated ideas in LLM development.

To be honest, it is a relatively obvious idea to make vectors from timestamps and feed them to LLMs, but for some strange reason nobody did this before, and it looks like this has gone mostly unnoticed in the NN community.


I think a more general way to think about it would be to finetune on any data and take the weight difference. For example, if we want to create geography vectors, we would finetune on geography data and then subtract the base weights. Now add this difference to any other model with the same architecture, and you have a geography-capable LLM.
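Something like this, if I understand the recipe (a hypothetical sketch; the filenames and the "geography vector" itself are illustrative, and all models must share the base architecture):

    # Hypothetical "geography vector", same recipe as the time vectors.
    import torch

    base   = torch.load("base.pt")                  # the shared pretrained model
    geo    = torch.load("base_finetuned_geo.pt")    # base finetuned on geography data
    target = torch.load("some_other_finetune.pt")   # any other finetune of the same base

    geo_vector      = {k: geo[k] - base[k] for k in base}
    target_plus_geo = {k: target[k] + geo_vector[k] for k in target}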


I think the general case is far more interesting than time specifically. There are cool functor/analogy ideas here.


I thought it was encoded as a helix of semi-precious stones, but perhaps I am misremembering.


What about helixes of semi-precious stones?


Why is a short story from 1968 mentioned here? Was it popular? Is it good? Were there some recent adaptations or homages?


Similarity of titles is all.



