
Hi David, I posted an issue on GitHub with a few questions, but I have one better asked here: what’s with your license? I think I understand it generally in that it looks like you want the right to harass people who integrate grassland with social media; but what is the Deep Paranoia text generation thing you refer to? And can you talk about how your group theory link relates to the project?



> Any abuse of this software and/or algorithm(s) evinced by parties engaged in violation of this Agreement, if discovered will be taken as aknowledgement, consent and agreement by those parties to allow any member of the Grassland community to attempt to seek out and identify such parties in order to target and generate any digital news, social media content or other types of information that the household and/or family of the members of such parties may consume using the 'Deep Schizophrenia' narrative

It reads like, "if you break this license, we'll dox you and your family"


Except that their modification of MPL2 has a giant loophole. If you include MPL2 code as part of a larger work, you can license the larger work as GPL. At that point, their restrictions are scrubbed from the license of the GPL codebase.


It doesn't matter, though: you can't claim a right to harass in a copyright license agreement. The whole thing is bunk, but it goes to show how messed up the worldview of its creator is...


My choice of words could use some improvement. Here's the explanation. https://news.ycombinator.com/item?id=19537047


You are frighteningly naive about the world, and creating the tools of oppression as a result. Stop. And seek help.


At least he is making it open, governments won't. While this is worrying, it is inevitable (satellite surveillance, road cameras etc - everything is in place, it's just about time), so I'm happy we're getting it this way and start working on learning to live with it. I'd hate to have to learn how to live under an oppressive government again (I'm from a former eastern bloc country), so this is definitely welcomed by me - this way I can keep track of the secret police.


Thank you.

It's funny that the words they use, "dystopian", "doomsdayish" etc. ... are all from fiction. These are literally categories of genre fiction. They're trying to make straight-faced predictions about the future using what is literally in the bookstore under "Fiction". Something that is by definition not real, a fabrication, a lie. How can they possibly expect me to take them seriously or grasp what it is that I'm doing? They've taken stories they know are fake and mapped them to the real world. How is that any different from religion? I'm giving them mathematical arguments and they're giving me Dr. Seuss.

That's messed up. Well, Hitchens's razor says that "What can be asserted without evidence can be dismissed without evidence". Here's my razor: "What can be asserted with fiction can be dismissed with fiction". This is a bulwark against those who insist on living in a world of fantasy and superstition. But for the rest of us, those who want to live in reality, I'm giving you the real world.


Yes, frankly I see the current strong tendency to use metaphors and labeling as one of the greatest diseases of this age. IMHO it simplifies and degrades any topic it touches to the point of nonsense, which of course means that any subsequent discussion is nonsense. Metaphors and labels also carry emotions that are very complex AND totally individual, which makes them even worse for discussions.


The TL;DR of the linked comment/explanation above is:

* OP built a neural network (Deep Paranoia) similar to GPT-2 + DeepDream that can generate text for any narrative, and identified that anyone could use such a system to reinforce any (false or not) narrative they want

* OP then built a symmetric surveillance system (Grassland) focused on "truths" (derived from proof of work from IRL video feeds) to help protect humanity against a future in which auto-generated false narratives are widespread, intending to provide common people with something closer to omniscience such that they can see more (spatially and temporally) to be able to verify fact against potentially false narratives

It's doomsday-ish, but seems like a natural evolution of tech in this space. And understandable why building the former (Deep Paranoia) would prompt someone to build the latter (Grassland).

Obviously you can't have a "we have the right to harass you" clause in a copyright agreement, though. Threatening to do so is quite extreme.


The words people throw around here, "dystopian", "doomsday", etc. are all from fiction. These are literally categories of genre fiction. They're seriously trying to make straight-faced predictions about the future using what is literally in the bookstore under "Fiction". Something that is by definition not real, a fabrication, a lie. How can people possibly expect me to take them seriously or grasp what it is that I'm doing? You can't take stories you know are fake and map them to the real world. How is that any different from religion? I'm giving people mathematical arguments and they're giving me Dr. Seuss.

And here you've put words in my mouth to suit your narrative. The story you want to believe. A story that's fake by definition.

Don't you see how messed up that is? Well, Hitchens's razor says that "What can be asserted without evidence can be dismissed without evidence". Here's my razor: "What can be asserted with fiction can be dismissed with fiction". My software is a bulwark against those who insist on living in a world of fantasy and superstition. But for the rest of us, those who want to live in reality, I'm giving you the real world.

And it's not 'deep paranoia'. It's 'Deep Schizophrenia'


It's not 'Deep Paranoia', lol. It's 'Deep Schizophrenia'. The user 'vessenes' got the name wrong. Read my replies.


My choice of words could use some improvement. Here's the explanation.

https://news.ycombinator.com/item?id=19537047


Wow, wtf.


I'm getting some TempleOS vibes here.


I didn't mean to offend. My choice of words could use some improvement. Here's further explanation. https://news.ycombinator.com/item?id=19537047


I was thinking timecube, myself.


The words you guys use, "dystopian", "doomsday", etc. are all from fiction. These are literally categories of genre fiction. You're seriously trying to make straight-faced predictions about the future using what is literally in the bookstore under "Fiction". Something that is by definition not real, a fabrication, a lie. How can you possibly expect me to take you seriously or grasp what it is that I'm doing? You've taken stories you know are fake and mapped them to the real world. How is that any different from religion? I'm giving you guys mathematical arguments and you're giving me Dr. Seuss. And you've put words in my mouth to suit your narrative. The story you want to believe. A story that's fake by definition.

Don't you see how messed up that is? Well, Hitchens's razor says that "What can be asserted without evidence can be dismissed without evidence". Here's my razor: "What can be asserted with fiction can be dismissed with fiction". This is a bulwark against those who insist on living in a world of fantasy and superstition. But for the rest of us, those who want to live in reality, I'm giving you the real world.


You and me both.


As I stated to the parent comment, I didn't mean to offend. My choice of words could use some improvement. Here's further explanation. https://news.ycombinator.com/item?id=19537047


Your licensing terms are less offensive than they are invalid: they are incoherent, they encourage illegal actions, and they show a deep misunderstanding of what valid licensing and contract terms even are.



(I responded to your other question in a different reply)

> And can you talk about how your group theory link relates to the project?

You mean GLn(F)? That's a really, really cheesy kind of math joke that doesn't really work well. It's the symbol for 'The general linear group of degree n over any field F'.

But if you stay up too long writing code, well then you might realize it can also be used as a pun/pseudo-acronym on the word 'Grassland'. Take the first three letters, GLN and what do you get? ...GrassLaNd. I know, right. Keep your shirt on. This party ain't over yet.

And the 'F', well that's a 'field', right. like grass in a field. Ooohhh... Yeah... get it?... Oh Yeah, I bet you're impressed now.

And Bonus! The transformations of that group are 'symmetries'...

'Symmetry' as in the opposite of 'asymmetry'... You know... like how Grassland tries to promote symmetry... uh-huh... yep. That's right. I bet you're rolling on the floor with laughter now.

...So yeah, it's not at all a good joke. And yes, I am really fun at parties.


Deep Schizophrenia is a deep neural network model, which I'm going to either open source or make available through an API, that can be used to generate new narratives of any size. And it doesn't have the semantic and narrative "fall-off" you get after a few sentences with models like OpenAI's GPT-2 and other "attention" based or LSTM models. It's called "Deep Schizophrenia" because, like what Google Deep Dream, GANs and style transfer models do to create new images of people, landscapes etc., it sort of "warps" narratives to generate new ones. It's as if the model is having a hallucination (hence the nomenclature contrasted to 'Deep Dream') but instead of changing images it's changing the semantic and narrative embedding space.

You're talking about some kind of online trolling and harassment. That would be stupid and accomplish the exact opposite of the intended effect. But I'm talking about customized, generated digital content using the very same content channels that everyone else on here uses. Social media, blogs etc.

Let me explain how Deep Schizophrenia works.

I was able to construct a continuous, fractal, space-filling curve that satisfies Peano's/Cantor's definition (https://en.wikipedia.org/wiki/Space-filling_curve) but I was able to give it just enough additional "structure" so as to let it be treated as being differentiable 'everywhere' (if you're not clear on the definition of 'differentiable', for now, just know that it's very important for training machine learning models). This lets me normalize each document in my corpora, from narratives the size of a single, abstract sentence like "The quick brown fox jumped over the lazy dog." to the entire novel 'The Fox and the Hound', so the entire narrative, no matter the length, can as a whole be embedded into a common narrative unit 'space' by simply adjusting the iterations of the curve according to word count.

In Deep Schizophrenia, individual word tokens are neither decoded from nor treated like discrete values (as in CBOW, Skip-Gram, BERT etc.) but are treated as the localized, resultant values of wavelet functions 'passing through' the dimensions of the space. So this lets me use techniques similar to GANs or style transfer, but heavily modified to take advantage of my curve's structure, in order to generate new narratives by 'warping' them along these dimensions while still maintaining narrative, grammatical and semantic 'cohesion'. So no "fall-off".

To borrow a metaphor, think of these 'semantic' wavelets like drawstrings on some n-dimensional piece of fabric: when you pull on the string, the entire garment, from 'hem' to 'hem', gets pulled, bunched, stretched or twisted cohesively, as one garment, which is what you'd expect. How consistently it conforms to one's expectation of narrative, grammatical and semantic cohesion is largely dependent on how much memory one can afford to throw at it during training.
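To make "adjusting the iterations of the curve according to word count" a bit more concrete, here is a minimal sketch in Python. It is an illustration only, not the author's construction. It uses the classic 2D Hilbert curve, which is continuous but nowhere differentiable, so it does not have the extra "structure" described above. The function names (hilbert_d2xy, order_for_length, place_words) and the choice of two dimensions are assumptions made for the example.

```python
def hilbert_d2xy(order, d):
    """Map index d in [0, 4**order) to a cell (x, y) on a 2**order grid.

    Standard iterative Hilbert-curve index-to-coordinate conversion.
    """
    n = 1 << order
    x = y = 0
    t = d
    s = 1
    while s < n:
        rx = 1 & (t // 2)
        ry = 1 & (t ^ rx)
        # Rotate/flip the quadrant so sub-curves join end to end.
        if ry == 0:
            if rx == 1:
                x = s - 1 - x
                y = s - 1 - y
            x, y = y, x
        x += s * rx
        y += s * ry
        t //= 4
        s *= 2
    return x, y


def order_for_length(word_count):
    """Smallest number of curve iterations with at least word_count cells."""
    order = 0
    while 4 ** order < word_count:
        order += 1
    return order


def place_words(words):
    """Assumed scheme: put the i-th word at the i-th cell of the curve,
    then normalize coordinates into the unit square so that texts of
    different lengths share one common space."""
    order = order_for_length(len(words))
    side = 1 << order
    placed = []
    for i, word in enumerate(words):
        x, y = hilbert_d2xy(order, i)
        placed.append((word, (x + 0.5) / side, (y + 0.5) / side))
    return placed


if __name__ == "__main__":
    sentence = "The quick brown fox jumped over the lazy dog".split()
    for word, x, y in place_words(sentence):
        print(f"{word:>7}  ({x:.3f}, {y:.3f})")
```

A nine-word sentence needs two iterations (16 cells), while a novel-length text would need many more, yet both land in the same unit square. Words that are neighbours in the text stay neighbours on the grid, which is the locality property that makes space-filling curves attractive for this kind of normalization.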

And the training set is prodigiously annotated and tagged with themes, prominent characters/persons, archetype categories etc. So these inputs can be modified to get different predictions (stories). GPT-2 was trained on about 40 GB of largely unannotated data. And it's considered state of the art. But I have over 247 GB of annotated narratives, all of which could potentially be trained with (I estimate the GPU costs to train any significant portion of it would be around $250K and take months. But it would be worth it).

I chose a fractal structure because narrative expression has what appears to me to be a sort of infinite set-ness (https://en.wikipedia.org/wiki/Infinite_set). This allows Deep Schizophrenia to recursively fill up the story from the 'inside' with each pass, moving down from higher levels of abstraction as it increases the cardinality (detail) of the narrative set. So in theory, with some clever memory management, it could allow you to fill up a narrative indefinitely (a-la George R. R. Martin). Theoretically, it should also be possible to take a complete, encoded narrative, normalize it onto a fraction of the curve, say [0,0.5] and, with some software architecture I haven't quite figured out yet, generate a "sequel" on the interval (0.5,1].

Narratives, common stories, myths etc. inform and tell you more about people's beliefs and ideologies in ways mere factual data could never hope to tell you. Did it matter that the Boston Tea Partiers were actually dumping the tea in order to protest the new lower prices of the East India Company's tea (thanks to the British lowering the tariffs), which were negatively affecting the sales of their own speciously sourced tea? Not really, because what's the story that made America what it is today? The one that reaffirmed the heroes of the American Revolution. Is Turgenev's 'Sketches from a Hunter's Album', and the effect it had on people's understanding of the morality and brutality of Russian serfdom, any lessened by the fact it's a fictional narrative? No, I would argue the fact it's a fictional narrative loosely based in reality is what makes it that much the stronger.

And we can now tell people much better stories, with better customized content (targeting) faster and much more efficiently than any human could hope to accomplish.


How was the training data obtained?


(sigh) A lot of hard work and bootstrapping. My friend, don't waste this moment. There is a much more pertinent question you should be asking me.

There's a lot to unpack here. But you have to understand that I built Deep Schizophrenia (though I didn't call it that at the time) SEVERAL YEARS BEFORE I built Grassland. Partly because I realized what D.S. could do to people. Let me explain....

We'll take a few of the arguments some people have commented here as an example. I imagine most of them are atheists. But it's irrational to think 100 years of Nietzsche is going to make a dent in 3 million years of evolution. We're hardwired for silly beliefs (no offense). Every culture and social group has their own pantheon of gods. They all just have different names for them. You can talk about 'privacy' and you can talk about 'Privacy'. Encryption and closing your blinds will give you privacy. That's rational. But there's no Privacy god who's going to make a data scientist suddenly unknow your pilfered Equifax credit history. Privacy won't make the former employees of Cambridge Analytica suddenly unknow how to make your aunt vote for candidate X. And they'll never outright say they believe that. But they do by their actions.

Like 4chan with Nazism, they at one point merely cajoled one another with this mocking, ironic detachment to the idea of a Flying Spaghetti Monster. Because they thought they were too smart to believe in it. But then somewhere along the way, they actually did.

These are all different stories. Human beings are extremely susceptible to the power of a story. They're like those fungi that take over an ant's brain till the ant is controlled by the fungi. If you want a story to placate your fantasy, if you really want that, then I've built Deep Schizophrenia (well, I actually built it to generate romance novels. Romance is a big industry) and its ancillary software to figure that out for you and provide the story/rhetoric that reaffirms your fantasy back to you. A virtual, virtual reality.

But for the rest of us, those who want to be able to have data about the real world with a statistical guarantee of validity that we can calculate and create a clear separation between that and things that are mere stories, narratives and rhetoric told by humans and therefore subject to bias and interpretation (I enjoy the Lord of the Rings but I don't literally believe in Mordor), for those people there's Grassland.

And that's part of the reason why I built Grassland and why I built it in such a way as to make it extremely costly to get false data into the system. Because I knew eventually either I would release the Deep Schizophrenia software or, because simultaneous discovery is so common (https://en.wikipedia.org/wiki/List_of_multiple_discoveries), possibly someone of untoward quality would discover it, use it in secret and not tell people about it like I did. And there'd be no safeguard against it if I didn't build Grassland.

I'm not saying the things I build are perfect or they're going to fulfill your fantasy of a perfect world (again, Deep Schizophrenia can give you that fantasy if you're hell-bent on stupidity). But what I try to do is give mathematical arguments to support my conjectures.

Hence why I wanted a license to prevent people merging the software with things that would counteract Grassland's purpose. Yeah, maybe I wrote it wrong. It's a little difficult solving some of the world's oldest mathematics, AI, computer vision, cryptocurrency and surveillance problems in one's spare time. Adding a law degree to the mix must have slipped my mind. I'll fix the license. I'm only one guy...


So as I understand, the higher the wordcount, the deeper the "spaces get" by generating filler content matching the space and fit for the dimension of the space.


> ... the higher the wordcount, the deeper the "spaces get"...

During training, what I would say is that the "space" gets denser. Imagine you live on a cliff overlooking a lake/sea (some body of water with known boundaries). You notice on some days the wind produces long waves that are spaced far apart [1], while other days the waves are very short and choppy. If you wanted to encode this, it would take more memory to encode the latter than the former despite the lake being the same size.
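One rough way to put numbers on the wave analogy is the sampling rule of thumb that you need more than two samples per wavelength to capture a wave. The sketch below is an illustration of the analogy only, not how the model measures density. The function name min_samples and the lake and wave sizes are made-up assumptions.

```python
import math


def min_samples(lake_length_m, wavelength_m):
    """Smallest sample count strictly greater than two per wavelength
    across the whole lake (a Nyquist-style lower bound)."""
    cycles = lake_length_m / wavelength_m
    return math.floor(2 * cycles) + 1


if __name__ == "__main__":
    lake = 1000  # metres, hypothetical
    print("long swell (100 m waves):", min_samples(lake, 100))
    print("short chop (5 m waves):  ", min_samples(lake, 5))
```

The lake is the same size in both cases; only the wavelength changes, and the choppy surface needs roughly twenty times as many samples to encode.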

If you have more questions about the Deep Schizophrenia model, I'd be happy to discuss. You'll see my email at the bottom of the site.

[1] https://en.wikipedia.org/wiki/Swell_(ocean)



