
IMO the problem with the Unix model is not that it's been corrupted over time, but that it was fundamentally flawed from the start. I'm hardly the first to point this out, but sending around unstructured text makes some things inordinately hard (imagine writing a separate program to color ls's output, instead of having that as a flag), and going beyond pipes leads to the awful experience of programming in shell.
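To make the complaint concrete, here is a sketch of what such a "separate coloring program" would look like: a downstream sed stage that wraps each line in an ANSI color escape. The input is canned for illustration; in practice you would pipe ls into it, and the stage has no access to file metadata, only the text.

```shell
# Toy downstream colorizer: wrap every line of upstream output in
# ANSI blue. It can only color what survives as text in the pipe.
ESC=$(printf '\033')
printf 'docs\nnotes\n' | sed "s/^/${ESC}[34m/;s/\$/${ESC}[0m/"
```

The point of the complaint: by the time the text reaches this stage, the information you might want to color by (file type, permissions) is gone unless ls chose to serialize it.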



The Unix command line tools were designed to be usable in a lot of ad hoc situations. In an ad hoc situation, unstructured text is often what you have. So, to be usable for that, the command line tools had to be able to operate on unstructured text as input. That left two options - either also accept something structured as input, or accept only unstructured text as input.

"Usable in an ad hoc situation" gets us back to composability. If you can compose existing parts to meet a novel situation, that's better than having to carefully architect a solution. And the key to being able to do that is common interfaces. Unix did exactly that. And, for all the criticism, it works pretty well.
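A classic illustration of this kind of ad hoc composition (the input text here is just a stand-in for whatever unstructured data you happen to have):

```shell
# Word-frequency count composed from small general-purpose tools,
# with no purpose-built program: split into words, sort, count,
# rank, take the top two.
printf 'to be or not to be\n' \
  | tr ' ' '\n' \
  | sort \
  | uniq -c \
  | sort -rn \
  | head -2
```

Each stage knows nothing about the others; the common interface is just lines of bytes on stdin and stdout.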

Can a well-designed interface do better? Sure. Can a well-designed interface deal with something outside the design parameters as well? Probably not.


I mean, people still use `ls` every day, so it in fact did win. We have super-rich GUIs, REPLs, the Start menu, live programming environments and notebooks, file listing in web browsers, and the ability to build your own web app to list files visually any way you want.

And yet everyone still uses `ls`.


I think that most people who need ls's functionality (listing files in a directory) use a GUI application.


Depends on how you define "functionality". You could say that ls's functionality is "providing a text listing of files in a directory." You can't get that from a GUI, certainly not without extra steps.

Why would anyone care about that difference? If I'm reading the listing, I don't care. My eyeballs don't care if the photons came from a GUI or text in a shell. But if I'm trying to pipe the results to another program, then I care. GUIs don't pipe well.

And people like me pipe the results of ls all the time.
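For instance, a sketch of the everyday pattern (with well-behaved filenames; names containing newlines would call for find -print0 instead):

```shell
# Pipe ls into a filter: how many .txt files are here?
# /tmp/ls_pipe_demo and the filenames are made up for the demo.
mkdir -p /tmp/ls_pipe_demo
cd /tmp/ls_pipe_demo
touch alpha.txt beta.log gamma.txt
ls -1 | grep -c '\.txt$'   # -> 2
```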


`ls` seems like a bad example.

For anything you'd want to colour, the data should already be there.

Further, I'm not sure the Unix way inherently requires completely unstructured text.


Unstructured text is the Unix way. You can have pipelines sending other things between your programs (see PowerShell), but that's definitely not Unix.


The Unix way is streams of bytes.

How those bytes are interpreted is up to the endpoints of the streams. Sure, a lot of endpoints interpret byte streams as text. That's not Unix, that's a particular endpoint.

You will notice that Unix does not even offer 'text' files (like Windows does). Or ISAM files, or random access files, or Fortran carriage control. Just binary files. Because the Unix way is streams of bytes, and nothing more.
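You can see this directly: a pipe passes arbitrary bytes, NULs and all, untouched. The byte values below are arbitrary examples.

```shell
# Four raw bytes (NUL, 0x01, newline, 0xFF) through a pipe:
# nothing is translated, counted lines or otherwise "textified".
printf '\000\001\012\377' | wc -c   # -> 4
printf '\000\001\012\377' | od -An -tx1
```

It's wc and od that decide to treat the stream as countable bytes or hex dumps; the kernel's pipe has no opinion.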


I disagree; "unstructured text" is itself fuzzy.

Line-delimited CSV: is that unstructured?

Further, if you wanted to go with JSON or something, you can; there's nothing preventing it. Pipes are built on unstructured text, so you have that flexibility. If you want to add a JSON ls and filter that through a JSON sed into a JSON less, you can.
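A sketch of that idea: jq would be the usual "JSON sed" here, but since it may not be installed everywhere, this uses python3 as the JSON-aware stage. The directory, filenames, and field name `name` are all invented for the demo.

```shell
# "JSON ls | JSON filter" over ordinary pipes: wrap ls output as a
# JSON array, then filter it with a JSON-aware downstream stage.
mkdir -p /tmp/json_ls_demo
touch /tmp/json_ls_demo/a.txt /tmp/json_ls_demo/b.log
ls -1 /tmp/json_ls_demo \
  | python3 -c 'import sys, json; print(json.dumps([{"name": l.rstrip()} for l in sys.stdin]))' \
  | python3 -c 'import sys, json; [print(e["name"]) for e in json.load(sys.stdin) if e["name"].endswith(".txt")]'
# -> a.txt
```

Nothing in the plumbing cares that the intermediate stream is JSON; only the endpoints do.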

Finally, 'the Unix way' is a cultural thing. It isn't set in stone. It can evolve, and probably has. So while you could argue that it meant that to some people at some time, that doesn't make it true today or in the future. The downside is that we have these disagreements over what 'the Unix way' is. But any philosophy that's been around for 40 years, in tech no less, has to be flexible; that's why it's stayed relevant.


I disagree; the Unix way has structure: lines of text, each consisting of space-separated fields. The real issue is that sometimes your data has spaces, or even newlines, so that structure quickly isn't enough. And that "sometimes" happens way more often than it has to, and the escape hatch is a mess.
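That "sometimes" is easy to reproduce; the directory and filenames below are invented for the demo:

```shell
# One filename with a space: newline structure survives,
# whitespace splitting does not.
mkdir -p /tmp/space_demo
cd /tmp/space_demo
touch 'my file.txt' plain.txt
ls -1 | wc -l           # -> 2 (lines: correct)
echo $(ls -1) | wc -w   # -> 3 (words: one file became two fields)
```

The usual escape hatch is NUL-delimited records (`find -print0 | xargs -0`), which is exactly the mess referred to: a second, incompatible convention bolted on next to the first.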


> lines of text each consisting of space-separated fields

I wish! Back in the day, tools like iostat and vmstat printed tabular data using fixed-width fields with no spaces between them. This worked fine until computers got bigger and faster; then the larger numbers filled the fields, so they ran together, making the output unreadable. A surprising number of vendors had this problem.
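The failure mode is easy to simulate with made-up numbers and a 4-column field width:

```shell
# Two fixed-width (4-char) fields with no separator.
printf '%4d%4d\n' 123 987 | cut -c1-4     # -> " 123" (parses fine)
# A value that outgrows its field shifts everything after it:
printf '%4d%4d\n' 12345 987 | cut -c1-4   # -> "1234" (silently wrong)
```

Once a field overflows, column positions no longer mean anything, which is exactly what happened to those vendor tools.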


It's bizarre that this is the case, given that ASCII contains characters for this very purpose. There's record separator, group separator... Now if your data contains those, that's bad. But that's much rarer than containing ordinary whitespace.
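A sketch of what that would look like, using ASCII unit separator (0x1F, octal 037) between fields; the record contents are invented:

```shell
# Fields delimited by ASCII US rather than whitespace, so a field
# value may itself contain ordinary spaces.
US=$(printf '\037')
printf 'my file.txt%s1024\nnotes%s12\n' "$US" "$US" \
  | awk -F "$US" '{print $2}'
# -> 1024 and 12, even though the first field contains a space
```

Nothing in the tools prevents this; the convention just never caught on, partly because the separators are invisible when printed.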


But, you can always override IFS, so this isn’t as much of an issue.
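For example, prefixing `read` with an IFS assignment changes the split delimiter for just that one command (the passwd-style line below is a canned example):

```shell
# Split on colons instead of whitespace, only for this read.
line='root:x:0:0:root:/root:/bin/sh'
IFS=: read -r user pw uid rest <<EOF
$line
EOF
echo "$user has uid $uid"   # -> root has uid 0
```

This handles a chosen delimiter cleanly, though it doesn't rescue data whose fields can contain the delimiter itself.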


I disagree: text is the minimum structure. It's often sufficient too, but using something higher-level like JSON is also perfectly UNIX.


I don't think so; this idea of JSON is a more modern interpretation, and I'd say it is counter to the UNIX philosophy, not an extension of it (it just so happens to squeeze into existing primitives a little better than, e.g., a binary serialisation protocol).





