test, [, and [[ (2020) (jmmv.dev)
548 points by mattrighetti 11 months ago | 225 comments



Hey, original author here. Thanks for sharing this and making it rise to the front page! :) By the way, the title probably deserves a (2020) and it would be nice if "test" wasn't capitalized, because it actually refers to the command.

Here is something related from 2021 that also touches on bash's [[ operator and that I think you might enjoy in this context: https://jmmv.dev/2021/08/useless-use-of-gnu.html


[[ is not really a builtin; it's fundamentally syntactical (but presumably uses a mostly-inaccessible builtin internally). Fun fact: `]]` is also a reserved word, despite never being allowed in a context where reserved words matter.

In some non-bash shells, the `function` keyword is needed to declare certain types of function.

For make's `$(shell)`, if you're building a lot of targets the performance difference can be measurable. Still, it loses in the no-op case, so usually you should use `include` to trigger re-making instead.

GNU is completely right to ignore POSIX, since POSIX is not useful for solving most real problems.


> since POSIX is not useful for solving most real problems

POSIX, being a standard that can be tested against, is at least a specification or agreement that can be met by multiple products to enable them to interoperate.


> In some non-bash shells, the `function` keyword is needed to declare certain types of function.

In shells that aren't POSIX-compliant [0], maybe. In which case: yes, there are many wildly different scripting languages with REPLs.

[0] https://pubs.opengroup.org/onlinepubs/9699919799/utilities/V...


A lot of the GNUisms described in https://jmmv.dev/2021/08/useless-use-of-gnu.html are very useful in interactive use. Search the current directory without an explicit . - useful. Append an option to a command you just typed - heck yeah, always annoyed when commands don't support that.

For scripts though, sticking to POSIX sh often makes sense, yeah. You should at least be aware of it when you use Bash-isms.


This article complains that using the extended GNU features (--ignore-case, set -o pipefail etc) makes scripts less portable. Fair enough.

What it doesn't explain is why a Linux user should much care about portability. OpenBSD and FreeBSD are alive and well, but the number of users seems so small that they aren't a particular concern. Maybe you could argue that we "should" consider these OSes out of a sense of fairness, but where does that stop? Do I also need to consider something obscure like vxWorks?

BusyBox (Alpine) is more interesting, but the changes there are so significant that a port will almost always be needed anyway.

Are there other compelling reasons to care about the non-GNU ecosystem?


I find the obsession with shell-script portability in contexts where it doesn't matter to be bizarre, but this particular argument is amusing:

> the number of users seems so small that they aren't a particular concern. Maybe you could argue that we "should" consider these OSes out of a sense of fairness, but where does that stop?

This is the same argument that many companies and products have made over the years (and still do at times) for ignoring Linux. To have a Linux user use that same argument against OSes with even smaller userbases is kind of amusing to see.


Is your comment best read in Internet Explorer 6?

----

What I'm trying to say is that open standards (and the portability that comes with them) is not something that just happens on its own. It takes active maintenance, and part of that maintenance is opting to adhere to the standard even when it would be more convenient to use extensions available in the most popular systems.

Will you personally suffer from liberally using Bashisms? Not in the first order. But if we encourage that sort of thinking as a rule, the standards become meaningless. I believe that would be a net negative change for the world, but there are many intelligent people who would disagree.


I don't disagree with you in the context of public scripts, but the vast majority of what I do with bash is local-only (where local is ${DayJob} and my own infrastructure/projects) so the fact that extensions to the standard make my work in those areas quicker to string together and easier to maintain afterwards means I'm happy to lock myself into bash.

From my PoV it is generally portable anyway: almost everywhere I use a shell of that nature, bash is present. The exceptions to this are things that need to run in small environments, like anything which may end up in initrd (where busybox is generally providing shell/script support).

Though I do wish people would make sure they specify bash with #!/bin/bash in scripts unless they are trying to stick to the standard, as using #!/bin/sh causes problems when extensions are used and something lighter than bash is the default script runner (i.e. dash in Debian, and again busybox commonly in small environments). Only use #!/bin/sh if you know you are making an effort to be compliant.


> I don't disagree with you in the context of public [things], but [...] extensions to the standard make my work in [private] areas [...] happy to lock myself into [a non-standard].

This is the best argument I know in favour of not standardising, and the danger is not the direct consequences, but rather that our expertise in the standard atrophies when we do most of our work outside of it, which means our public work suffers too.

Also sometimes the private goes public and then who's gonna fund redesigning it in the standard? That rarely happens.

> From my PoV it is generally portable anyway: almost everywhere I use a shell of that nature bash is present.

Much like Internet Explorer 6, you mean.


> Much like Internet Explorer 6, you mean.

A key difference is that bash is not claiming to be standard where it is not. It implements the full standard as well as its own extensions. It does not do standard things wrong¹ in order for its extensions to work².

I see posix-sh/bash more like javascript/typescript than good-browsers/internet-explorer. Which makes me wonder: is there a gnu/bash->posix/sh converter out there…

--

[1] well, there are a few oddities, like the == behaviour in test, which can cause portability issues because the standard doesn't specify it and people might accidentally use it when trying to be as portable as possible; but this also affects sh, so it's a general environment issue rather than a bash-specific one. Also, it doesn't break standard portable behaviour.

[2] or in the case of IE, doing standard things wrong, and its own extension wrong between versions too!


> Is your comment best read in Internet Explorer 6?

I actually feel triggered.

That was such a nightmare period in my web-dev career (pretty much the entire 2000s) that it pretty much singlehandedly pushed me to find backend work as much as humanly possible.


You will notice that the parent article mentions "dash."

The dash shell is small and fast, but it does not allow any bash/korn language extensions beyond what was recorded in the POSIX.2 standard in the early 1990s.

Linux users should care because the Debian/Ubuntu family use dash as the system shell, so this problem is very real as many have learned.


I want to agree, but macOS is a big one. For example `sed -i` has bitten me before.
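For example, the in-place flag is spelled differently (GNU sed takes an optional backup suffix attached to -i, while BSD/macOS sed wants the suffix as a separate, possibly empty, argument):

  sed -i 's/foo/bar/' file.txt      # GNU sed
  sed -i '' 's/foo/bar/' file.txt   # BSD/macOS sed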


`-o pipefail` isn’t a GNU extension or even a difference in user space programs, but specifically a bash option. The portability is between shells, not operating systems. It will work on bash, regardless of the underlying OS, but may or may not work on csh, ksh, fish, etc.
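A quick sketch of what pipefail changes (bash):

  false | true; echo $?   # 0: a pipeline's status is normally that of its last command
  set -o pipefail
  false | true; echo $?   # 1: now any failing stage fails the whole pipeline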


Mac's command-line tools are the BSD versions. I always get confused about the command-line arguments for `date` because they differ so much between platforms.
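For example, parsing a date string into an epoch timestamp (the date and format here are just illustrative):

  date -d '2021-01-01' +%s                  # GNU date (Linux)
  date -j -f '%Y-%m-%d' '2021-01-01' +%s    # BSD date (macOS); -j avoids setting the clock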


    if [ a = b ] || grep -q ^hello$ /usr/share/dict/words; then
      echo "test failed and grep succeeded"
    fi
> “You pick whether to be amused or horrified. I don’t know how exactly my coworker reacted when I hinted at this during a recent code review I did for them.”

Isn't that normal everyday shell use?


The only thing that might 'horrify' me in code review for that is the assumption that it only evaluates true when test failed.

(And I might quote '^hello$' personally just on principle for having special chars, especially dollar, and to help syntax highlighters.)


> https://jmmv.dev/2021/08/useless-use-of-gnu.html

Tangential, but your blog software seems to have mangled the headings; it's serving eg:

  make $(shell …) expansion
instead of

  make $(shell ...) expansion
(note that (as is correctly written in the body) that's three periods, not one ellipsis, so mldr wouldn't be correct even on its own, meaning there are two probably-unrelated bugs affecting it).


Hmm… I’ll have to check later but right now I’m seeing the right thing in iOS. The blog is built by Hugo, so it’s all static files. But maybe something changed with the latest update. Thanks.


Just checked again with curl|grep, and now I'm getting:

  <h1 id=make-shell--expansion>make $(shell &mldr;) expansion</h1>
as the actual bytes-on-the-wire, which is still wrong, but only in one way rather than two.

(Note: iOS probably displays &mldr; as U+2026 "…", which often looks identical to "...", but is actually a single non-ASCII character.)


The rabbit hole of why "mldr" (em length leader) is a thing vs. "hellip" (horizontal ellipsis) was pretty disappointing. I only decoded "mldr" and didn't find much in the way of an explanation for why both entities exist.

I assume there's some semantic reason in which each is appropriate. I don't know what it is and am curious though!


These are the defaults for Hugo's Markdown parsing: https://gohugo.io/getting-started/configuration-markup/


Ok, I've lowercased the leading 't'. We never do that but for this, ok :)


As per the Zen of Python:

> Special cases aren't special enough to break the rules.

> Although practicality beats purity.

:)


Taking the last point one step further, we can also dispense with the if block entirely:

    if [ a = b ]; then
        echo "Oops!"
    else
        echo "Expected; phew!"
    fi
becomes

    [ a = b ] && echo "Oops!" || echo "Expected; phew!"
I'm not sure how often you should do this but sometimes it comes in handy for things like

    [ "$debug" ] && echo "what's going on" >&2
to conditionally print debug output to stderr.

----

And the fact that the if block tests a regular command means we can also do things like

    if grep -q 'debug' /var/log/nginx/access.log; then
        echo "Debug request found!"
    fi
----

Something I have not yet bothered to figure out is whether I should write

    [ $(expr 1 + 1) -eq 2 ] && [ $(expr 2 + 2) -eq 4 ]
or use the built in logical and of test:

    [ $(expr 1 + 1) -eq 2 -a $(expr 2 + 2) -eq 4 ]
As long as performance is not a concern, I can see roughly equal reasons in favour of either.


> [ a = b ] && echo "Oops!" || echo "Expected; phew!"

Not to be taken as a general rule though. I might be mistaken but I think that bash would parse the line as:

  ([ a = b ] && echo "Oops!") || echo "Expected; phew!"
so if the command sequence after `&&` fails, then the code sequence after `||` is executed anyway:

  illo@joe:~ $ [ "a" == "a" ] && >/dev/full echo "strings match" || echo "strings don't match"
  -bash: echo: write error: No space left on device
  strings don't match
  illo@joe:~ $
This is different from the semantics of the `if` block:

  illo@joe:~ $ if [ "a" == "a" ]; then >/dev/full echo "strings match"; else echo "strings don't match"; fi
  -bash: echo: write error: No space left on device
  illo@joe:~ $


I'd advise against that kind of shortening. If you use set -e, which you should, then

    if [ a = b ]; then
      echo "Oops!"
    fi
will do exactly what you imagined, but

    [ a = b ] && echo "Oops!"
will quit with an error if expression a does not equal expression b.


No, it won't. set -e is implicitly disabled for the first command in a && or || list. The same goes for a command after if/while/until and after !. It only matters if you implicitly return immediately afterwards.

  $ bash -ec 'if [ 1 = 2 ]; then echo true; fi; echo $?'
  0
  $ bash -ec '[ 1 = 2 ] && echo true; echo $?'
  1
In both cases it does not quit, and it executes the last echo.


> Something I have not yet bothered to figure out

According to POSIX, the -a and -o binary primaries and the '(' and ')' operators have been marked obsolescent. See https://pubs.opengroup.org/onlinepubs/9699919799/utilities/t... under "Application Usage".


Regardless of `-a` vs two tests and `&&`, there's no need to shell out to `expr` if bash's arithmetic evaluation is available:

    [ $((1+1)) -eq 2 ]


arithmetic evaluation is ((…)), $((…)) is arithmetic expansion.

There’s no need for test(1) / [(1) or conditional expressions ([[…]]) if you’re doing arithmetic:

if ((1+1 == 2)); then …; fi
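A small illustration of the difference:

  (( x = 5 ))      # arithmetic evaluation: a command; exit status 0 because the result is non-zero
  y=$(( x * 2 ))   # arithmetic expansion: substitutes the value, so y is now 10
  (( x > y )) || echo "x is not greater than y"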


Even better, and TIL. Thanks!


The fact that seasoned developers have to discuss how to write 1+1 correctly, says everything you need to know about the language.


I'm going to assume that if one is using [ rather than [[ then one will also want to use expr rather than $(()).


Arithmetic expansion is a feature of all POSIX compliant shells, which furthermore advises against using `expr`. The latter is only needed for non-compliant shells like some legacy implementations of the Bourne shell.


Oh, I did not know that! Thanks.


What's the basis for that assumption? I'm struggling to see a reason to not use the shell for arithmetic, regardless of `/usr/bin/[`, builtin `[` or `[[`.


I stopped using [ a few years ago because ‘test’ reinforces the idea that this is just a command like any other, not syntax. Also, “man test” is much more pleasant than sifting through “man bash”.


Your comment makes no sense. GNU Coreutils has a "man [" as well as "man test" man page.

Bash has "help test" for a quick cheatsheet.

The [ command is very old; it was already present in Version 7 Unix in 1979.


I think GP's point was that `[` feels like syntax, but - importantly - isn't.

Yes, `[` is a command, and has a man page and everything, but in a script it doesn't look like a command. It looks like something "special" is going on.

Whereas using `test` emphasises the point that you're just running another program. That all `if` does is check the output status of whatever program is being run, whether that's `test`/`[`, or `grep`, or anything else at all.

(Personally, I don't think that emphasis is necessary. But I've been using shell scripts long enough that I understand these nuances fairly well without having to think about them much any more. So I think that GP's point is a reasonable perspective to have, and worth considering.)


Parts of the comment make a LOT of sense actually, when you look at shell scripts written by the uninitiated. Oftentimes, I see constructs like

    while [ 1 ]; do ...; done
which are a pretty clear indication of the author's misconception about the perceived nature of [ ], I think.


The nice way to write that isn't any kind of test command but:

   while true; do ...; done
There is never any reason to use test, other than portability to some broken environment that is missing [ but not missing test.


> There is never any reason to use test

What's the reason for always using [ ?


I bet that most code uses [ or [[; if I am right, then the reasoning is: “it is the normal usage”.

(Making life easier for others and your self)


So my scripts look normal, like everyone else's.


> There is never any reason to use test

    test -d /nix && echo "$_ exists" || echo "$_ doesn’t exist"
You cannot do this with [ – Don’t Repeat Yourself, and your scripts become more maintainable.

(Not POSIX-compliant. Documented in Bash and Zsh.)


It's not working for me in a non-interactive Bash script on Bash 4.4.

$_ is not documented in the man page but is in the Info manual. It is not only set to the last argument of a command but also to the script name on script startup.

  $ bash --version
  GNU bash, version 4.4.20(1)-release (i686-pc-linux-gnu)
  [ ... ]
Contents of script:

  $ cat underscore.sh
  #!/bin/sh
  # ^^ mistake here, should be /bin/bash

  test -e foo && echo $_ exists

  echo 'value of $_' = $_
Test:

  $ touch foo
  $ ./underscore.sh
  ./underscore.sh exists
  value of $_ = ./underscore.sh
I'm seeing nothing but the behavior of $_ being set to the script, and not affected.

But at the interactive prompt:

  $ test -e foo && echo $_ exists
  foo exists
This doesn't look like something I can rely on in scripts.

In the first place, I code in POSIX, except in the rare situation of making something that is strictly geared toward Bash.

Magic global variables subject to hidden side effects are garbage; I already feel dirty enough when I have to use $?.

This piece of crap also breaks under the DEBUG trap:

  $ debug() {
  > :
  > }
  $ trap debug DEBUG
  $ test -e foo && echo $_ exists
  debug exists
  $ test -e bar && echo $_ exists
  !1!
(That !1! is how non-zero exit status is printed in my setup.)

Sorry, I'm not going back to a 1978 way of writing shell tests, in order to use some broken magic variable.


Wow, I wasn’t aware of all these shortcomings! I will stop using it in my scripts too.


> You cannot do this with [

Why not?

  [ -d /nix ] && echo ...


As '$_' is set to the last argument of the previously run command, in the test example it will be the directory name, whereas with [ it will always be ].


  D=/nix test -d $D && echo $D exists …


  $ V=nada echo "x${V}x"
  xx


V=nada; echo "x${V}x"


Ah yes, now I see it.


    while true; do ...; done
Is `true` a builtin?

I might want to avoid an external process call just to create an unconditional loop.


POSIX allows any command to be a builtin, as long as a real external command is also available which can be executed via execv and whatnot.

  $ echo $BASH_VERSION
  4.4.20(1)-release
  $ type true
  true is a shell builtin
Also:

  $ type test
  test is a shell builtin
  $ type [
  [ is a shell builtin


Note that bash implements [ as a builtin so the coreutils man page might not match exactly.


I'm with you; the [ (and the [[ bashism) introduces a lot of confusion about what is really happening, and I could never manage any real confidence with it.

That said, [[ being guaranteed to be a built-in certainly had its purpose back when shell script performance had any kind of relevance, and that was not so long ago.


[[ has more to do with trying to build an intuitive shell scripting environment than performance. [[ makes conditionals behave much more like you'd expect from other programming languages. I think it's a great idea, but then again, if I don't have to care for POSIX portability, I'd rather use something that's not a shell language for scripting.


While they were at it, just to be consistent, they should have added syntax for easy-to-use non-fucked-up less-arbitrarily-punctuated versions of control structures, too:

    ifif [[ x == 1 ]] thenthen
      echo "x is one"
    elseelse
      echo "x is not one"
    fifi

    forfor x in 1 2 3 dodo
        echo "x is $x"
    donedone

    whilewhile [[ x == 1 ]] dodo
      echo "x is one"
      x=$((x + 1))
    donedone

    untiluntil [[ x != 1 ]] dodo
      echo "x is not one"
    donedone
The whole [[ ]] $(( )) ifif fifi dodo thing just seems like they're just doubling down instead of admitting they made a mistake.

And if they really wanted [[ ]] to seem like syntax instead of a shell command, they could at least allow it to be used without spaces on each side of it like $(( )) or parens or brackets in any other language. And every other language lets you use as many redundant parens as you like for clarity, without running twice as many commands or producing a syntax error or weird unexpected behavior. But I don't think clarity was ever a design goal with Unix shell scripting languages.


> arbitrarily-punctuated versions of control structures

Ahem:

  if [ x = 1 ]
  then echo "x is one"
  else echo "x is not one"
       echo "namely, it's $x"
  fi
`if [ ] ; then` should only ever be used in one-liners, where there is not a newline after `then`; I'm not sure how that ended up being taught as a way to write multi-line commands (I'd tentatively blame Pascal, but that's probably unfair).


I’ll probably just use test after reading this. I write bash scripts infrequently enough that if statements always get me (whitespace). Now that I see why, it’s super obvious, and using test helps communicate that it’s just args.


The biggest footgun in `[` and `test` is the single argument behavior. For example, you might attempt to check if a variable is nonempty like so:

     [ -n $FOO ]
but if FOO is unset, it expands to nothing (as opposed to the empty string), so this is equivalent to:

     [ -n ]
and POSIX requires that the one-argument form of `[` succeed if that argument (here, "-n") is non-empty. So this will falsely report that $FOO is non-empty.
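A quick demonstration:

  $ unset FOO
  $ [ -n $FOO ] && echo "non-empty"     # unquoted: wrongly succeeds
  non-empty
  $ [ -n "$FOO" ] && echo "non-empty"   # quoted: correctly fails, prints nothing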

Remember to quote your variables!


I think your last sentence needs to go first. Quote your variables! There's no actual footgun in the specification of the test builtin -- the footgun is shell itself. The behaviour that you mention makes sense because

   [ "$FOO" ]
is always the non-empty check regardless what it contains (could be "-n").


ShellCheck your scripts!


Or use set -u to fail early.

Or use [ -n ${FOO-} ] which will replace the unset variable with an empty string.


That is false. There is no difference between ${FOO} and ${FOO-}. Both disappear if unquoted, and FOO is unset or blank:

  $ printf "<%s>\n" alpha ${beta} omega
  <alpha>
  <omega>
  $ printf "<%s>\n" alpha ${beta-} omega
  <alpha>
  <omega>
The ${var-} form is useful for safely evaluating a variable that might be unset, when "set -u" mode is in effect, and for whatever reason we cannot just fix the script so that the variable is set.


They meant "${FOO:-}" which should still be quoted.

The general form is "${FOO:-default}" where default can itself be another variable or whatever string you want.

I usually prefer to set default values at the top of a script though using this idiom:

   : "${FOO:=bar}"
And when creating a local variable I'll immediately set it to an empty string if there's a chance it won't be assigned later:

  local foo=""


That colon makes no difference if the replacement is blank.


Holy crap, 25 years of writing shell scripts and I just learned the difference:

With colon, tests variable for unset or empty. Without the colon tests only for unset. It's POSIX too:

https://pubs.opengroup.org/onlinepubs/9699919799/utilities/V...


So ${foo-bar} will not expand to bar if foo exists, but is empty. The empty value will prevail. ${foo:-bar} will expand to bar.

If we have nothing in place of bar, they are effectively same.


> and for whatever reason we cannot just fix the script so that the variable is set

I use this pattern all the time for variables that should be overridable by whoever is calling the script, ie. `FORCE="${FORCE-no}"`, overridable with `FORCE=yes foo.sh`. I'm not sure of any other way to do this.

(Yes, using getopt or other option parsing is better, but for my own scripts, env vars are just such a simpler way to pass options, and after 20 years of using bash/unix/linux I still can't write a getopt stanza from memory.)


    : ${FORCE=no}
Also, it's not so difficult

    while getopts :hab:c o; do
      case $o in
      a) opt_a=x ;;
      b) opt_b="$OPTARG" ;;
      c) opt_c=x ;;
      h) usage 0 ;;
      ?) usage 1 $o "$OPTARG" ;;
      esac
    done
    shift $((OPTIND - 1))


> Also, it's not so difficult

Heh, was that an attempt at sarcasm? That's super difficult to do from memory, at least for me. Maybe I'm not as good at memorizing things as you are.


> Heh, was that an attempt at sarcasm?

There was definitely a wink component, but at the same time, I meant it.

Half the code is just case / esac syntax, which you can (i do) find plenty use for outside of getopts. getopts itself is… manageable. In the end, the code is such a strong pattern, it's basically a snippet. The lack of variation is what makes it kinda easy to remember. Or you could put it in your notes or a gist and copy/paste.

> That's super difficult to do from memory, at least for me. Maybe I'm not as good at memorizing things as you are.

Nah, my memory is shot. It's a matter of practice, and not being shy with man pages. Use it often enough, it's in memory; after a long hiatus, the details are a "/^\s*getopts \[" away (assuming your man pager is less).


I find it's much better to have a copy-pasted piece of code which turns all command line options into variables in a certain namespace (provided the variables are defined beforehand)

I.e. user runs:

  script --foo --bar=yes --no-xyzzy --uiuez"
the script then sets these existing variables:

  opt_foo=y
  opt_foo_given=y

  opt_bar=y
  opt_bar_given=y

  opt_xyzzy=
  opt_xyzzy_given=y
However, the variable opt_uiuez doesn't exist and so it bails:

  script: no such option: --uiuez
With such a piece of code, all you do is define the options you support via assignments like "opt_foo=". You can give them default values this way. Include that piece of boiler-plate code. Done.

To add a new option, just define the variable. Done.

Check its value wherever needed, and possibly the _given, if the code needs to know whether it's working with the default value, or an explicitly given value. That's it.
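A rough sketch of what such boiler-plate might look like (illustrative only, not the actual code described above; option names are assumed to be plain identifiers, so hyphenated names would need mapping to underscores):

  # options must be pre-declared (defaults go here)
  opt_foo=
  opt_bar=
  opt_xyzzy=y

  for arg in "$@"; do
    case $arg in
      --no-*) name=${arg#--no-}; val= ;;
      --*=*)  name=${arg#--}; val=${name#*=}; name=${name%%=*} ;;
      --*)    name=${arg#--}; val=y ;;
      *)      continue ;;               # ignore non-option arguments in this sketch
    esac
    case $name in *[!A-Za-z0-9_]*)      # names feed an eval below, so sanitize them
      echo "$0: bad option name: $arg" >&2; exit 2 ;;
    esac
    # bail out unless the option variable was declared beforehand
    if eval "[ -z \"\${opt_$name+set}\" ]"; then
      echo "$0: no such option: $arg" >&2; exit 2
    fi
    eval "opt_$name=\$val; opt_${name}_given=y"
  done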


"set -x" is a life saver for debugging. Also "set -euo pipefail" so a script just exits on errors or unexpected behavior. If done well you can really reign in squirrely behavior.

Bash has some odd behaviors and footguns, but it can also be surprisingly reliable and versatile. I've caught some seriously messed up stuff that I couldn't figure out otherwise using set -x. Also, shellcheck will forcefully hammer good practice into your head and catch a ton of hangups that are hard to know about before making the mistakes.

Also, setting fallback variables is a good idea, like: pictures="${pics_dir:-$HOME/Pictures}"


That bottom one will also fail. The empty string is not treated as a shell word.

You need double quotes.


Preferably you'd use set -u to avoid certain problems, and also keep using shellcheck for all the other things it can catch.


set -euo pipefail

Effectively strict mode for shell scripts.


Also, if $FOO contains a space, it will expand into multiple arguments. Just quote your variables, always.


[ x"$FOO" != x"" ]


Please stop with tricks like this and just quote your variables. Everywhere.


whoa, lot of history in this trick

https://news.ycombinator.com/item?id=26776956


No. This hasn't been necessary for decades.


In this case I would use [ -n "${FOO?}" ] so that the script will immediately stop if $FOO is null or unset


Set -xufo pipefail

Or whatever the magic string is. Enable all the errors at the start of the script


chubot has written an interesting document¹ exploring more of the nuance with test/[/[[, and many of the other entries in that blog have intriguing explanations of the oddities of our shells(a random example²).

¹ https://www.oilshell.org/blog/2017/08/31.html

² https://www.oilshell.org/blog/2016/11/18.html


I had no idea [ was a program and the fact that it checks if the last argument is a closing bracket is kinda funny to me.

But at least it explains why you need spaces on both sides of the brackets.


[[ is bash only. If you know you’ll only use bash, use it. For details, the article is nice.



"bash only" typically refers to "bashisms", that is, bash features not present in the plain Bourne shell (or Bourne-compatible interpreters such as dash). The fact that other shells (such as zsh) may include those features ... is beside the point of writing universally compatible shell scripts.

Confirming my facts for this comment, #TIL that "dash" is the "Debian Almquist shell", that is, Debian's "ash" shell:

<https://en.wikibooks.org/wiki/Guide_to_Unix/Explanations/Cho...>


And ksh (ksh88 & later)


Always use [[

zsh and ksh have it; in fact I'm pretty sure it originated with ksh in 1988 or earlier.


> Always use [[

While zsh, bash, mksh, ksh93, probably others have it, sure. But many don't -- and not totally irrelevant ones either. Debian's default, dash, for example, does not support `[[`.

IMO, unless you're writing something like shell-specific dotfiles, avoid non-POSIX features.

It's usually pretty trivial to avoid them, especially if you're willing to call other mandated commands like awk, etc. But often, with a bit of creative thinking, most non-standard features can be replicated with some combination of `set`, separate functions and/or subshells.
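One such trick, for instance, is using `set --` as a stand-in for a bash array:

  # POSIX sh has no arrays, but the positional parameters can hold one list at a time
  set -- /etc/passwd /etc/group /etc/hosts
  for f in "$@"; do
      wc -l "$f"
  done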

Shell scripts, in general, have dozens of footguns, are pretty much impossible to statically analyze, difficult to make truly robust, and many of the shells themselves -- e.g., bash -- have huge, borderline inauditable codebases.

I can think of a dozen reasons not to write shell scripts. Yet still, there is incredible value in the fact that some form of POSIX-compliant/compliant-enough shell can usually be found on most systems.

All of that value goes out the window, though, the moment you start relying on non-standard features.


Depends what you're writing.

If you're writing an installer script or whatever that's going to be run on $many computers that you don't control: sure, it's probably best to go to the extra effort to stick with POSIX sh.

But that's not most scripts. Most scripts are things that you run on computers you control. I just write things as zsh scripts because it's so much easier than bash (never mind POSIX sh). That's fine, because I can just install zsh. And actually, this often makes scripts more portable because you need to rely on external utilities a lot less.


Don’t worship at the altar of portability. There is real cost to portability, as it forces you to cater to broken implementations and to not use features that may be very useful. Also, it can be difficult to ensure compatibility with systems that you don’t test on, so true portability requires having different systems to test on.

Sometimes all those costs are necessary for the task at hand. For example, the whole point of GNU Autoconf is that it runs on a wide variety of systems.

On the other hand, many programs are for in-house or personal use and will not run on obscure systems. The cost of writing for portability simply might not be worthwhile in these situations. And that’s ok, notwithstanding some conventional wisdom of “always try to be portable.”


It'd be easy to add [[ to dash as well. The only thing preventing it is the certain level of buttheaded-ness among the Debian crew.


That is, test and [ are specified by POSIX and usually are physical binaries (but might also be masked by shell builtins). Whereas [[ is not specified by POSIX and usually only exists as a shell builtin.


Why would you not use Bash (unless you are using completely different shell like Fish explicitly?).

It feels like some completely archaic concern to target the minimal common shell denominator.


Bash seems to be about as archaic as sh.

I write my shell scripts in sh, because it comes with every unix-like os by default.

I know that bash has more features, but I've never really missed them. I switch from sh to perl for more complicated tasks.

To each their own, right?


This to me seems like good practice, it's better to use a 'real' programming language when you're crossing the boundary from 'script' to 'program'

Shell script has way too many idiosyncrasies and weirdnesses that would have been beaten out of a proper programming language by now. (I know that talking about weirdnesses is amusing in relation to perl, which also has a whole armload of them.)


> This to me seems like good practice, it's better to use a 'real' programming language when you're crossing the boundary from 'script' to 'program'

PowerShell would like a word with you.


The PowerShell syntax, to me, is one of the most unbelievably cryptic things I've ever seen. I really can't get on with it at all.


PowerShell's problem is that it is a real programming language, which makes it less suitable for a shell.


No, PowerShell's problem is that it cannot decide whether it's a programming language or a shell REPL, and it ends up doing both badly. The syntax is too cryptic (and far too verbose) for a REPL, and too "shell-y" for a PL.

bash, for all its many many many flaws, is quite clearly a REPL first, PL second.


I was half-joking but the downvotes were to be expected in a conversation about Linux shells.


Not installed by default on BSDs. Sometimes not installed in minimal environments, e.g. where BusyBox is all you have available. Maybe not installed by default on Unixes? Not sure


Honestly, installing bash into environments seems like less work than avoiding it. It's not hard to do.


When your script will be running on customer machines, you often don't have the luxury (and sometimes it's not technically possible because it doesn't exist) of installing bash into the environment.


What prevents you from shipping bash with your solution?


These scripts are often used to install the application, so that introduces a chicken-and-egg situation. How can I install bash before running the installer?

Also, we'd have to build and maintain our own just for platforms missing it, which increases cost and complexity. It's easier, cheaper, and less error-prone just to use a minimum common denominator that runs on everything.

Additionally, when your customers have strict policies regarding how applications are approved for use and have to vet and test everything you give them, it's best practice to avoid installing anything that you can avoid installing. Doing that minimizes the effort and hassle for both the customer and the developer.


How can you install bash without using it? With sh as bootstrapping shell.

Build and maintain your own bash or install script? You don't have to maintain it; it's just building, which is already provided by the original source's build scripts. You can install it as "mysolutionsh"; nobody cares as long as you link it into PATH or reference it explicitly. Still easier than avoiding bash everywhere except in personal setups. You're already familiar with it; avoiding it is a waste!


Time to find better customers.


You don't always have a package manager, root access to even be able to install anything, network access, etc.

"Just install bash" is not always easy, and is not even necessary when posix shell can easily do what most people use bash-specific syntax for.


That's the sort of yak-shaving I will never do. I'd rather just not use such a primitive unix and/or be in a job where one of pdksh/ksh/zsh/bash is not available at all.


Well it’s good that you have the choice to; but it’s also not all that difficult to understand that not everyone who needs to use your script, has that choice.


That's cool. We all decide for ourselves what markets we're willing and unwilling to address. But the market that involves "primitive" unix is lucrative and that can be worth it.

For me as a developer, it's not a big deal. You can do all the same things, just in a slightly different way, and you have a script that can run almost everywhere.


It's not usually hard to do, no, but it is more than 130K lines of C alone, excluding whitespace, comments, tests, examples, etc. I don't want it on my system from a security perspective alone.

Add in the bootstrapping pain it imposes due to autoconf and other stuff, I think there are many valid reasons to avoid it and choose another shell that is more auditable yet still has just as many eyeballs on it (e.g., mksh - the default on Android; dash - default on Debian; BusyBox - every embedded system).


Writing Bourne-compliant scripts ensures maximum portability.

As many here have noted, bash isn't universally available, with another possible issue being OpenWRT devices. Stock/base images tend to use a Bourne-compatible shell, not full Bash. Though the latter is installable through opkg, for sufficiently small devices (typical of consumer kit) you simply won't have the space to install it.

There's also the slight PITA that Apple's OSX ships with a very old, pre-GPLv3 Bash, out of licensing concerns. (Apple is phenomenally averse to GPL-based code, much as some *BSDs are, such as OpenBSD.)

And if you're dealing with legacy systems (which tend to be extraordinarily and stubbornly persistently legacy), you'll often find that either bash isn't present or is quite dated.

I freely confess that I tend to write fairly recent-feature bash scripts myself by default, and appreciate many of the newer features. But if and when I am writing portable code, I'll go back to Bourne-compatibility.

But when writing system level code, an appreciation for standards and the very long tail of legacy standards and the limitations they impose is in fact a large component of professional maturity.


I've worked on several modern projects that needed to be able to run on a wide variety of Unix platforms, several of which didn't have bash. Writing for the common shell denominator was important, not archaic.


Still sounds archaic that it's a problem today that needs to even be addressed.


I wouldn't say it's an archaic problem, an example I can think of is writing scripts for infrastructure running minimal docker images where you want to keep the image size to a minimum, you would usually need to support both bash and other shells.

Embedded applications come to mind as well.


It's archaic if you're a dabbler, a hobbyist, an academic, or a junior dev who lacks the experience of shipping software into the wild.

For the rest of us, we have learned the hard way, sometimes repeatedly, that portability and adherence to published standards matter.


Well, I am a professional who administrates quite a lot of systems.

And they all run bash. They all understand `[[`, they all understand `=~`.

If any of them did not, then I'd ensure they do.

Yes, shells that do not support modern bash syntax ARE archaic, no matter who sits in front of the screen. And same as I consider it part of my job description to keep systems up to date with security patches, I consider it part of my job to ensure they use modern versions of contemporary and widely used scripting languages and shells. And that includes bash.

So no, relying on features that can be expected on a modern system is not a sign of inexperience. Same as a dev is within his rights to not write his python code under the assumption that it has to talk with a python 3.4 interpreter, he is within his rights to not expect a system that only knows sh.


That doesn't make it less archaic if anything today still has this problem.


IIRC Debian uses dash as /bin/sh because it's faster (execution speed) than bash.


That's like using a horse instead of a mule. Just not worth it. Move away from the whole equine world and use a real programming language.


Eh, there are things that shell is really good at, and there are things that other languages are good at. I will grant that shell is best for glue code; my personal heuristic is that I stick to POSIX sh, and if that hurts then I take that as a sign that I should be considering moving to Python or whatever. But avoiding it completely strikes me as a poor choice, because for the "glue together separate programs and manipulate files" tasks that it's meant for, it's really good and IMO everything else still falls short.


it’s contextual as most things. need to actually use and work on the machine? use whatever shell you want to make your life easier.

need a script to run on many different systems and/or need to write a script to be managed automatically by a service account? probably you want a shell with syntax that is guaranteed to be the same on all your systems.


Because Apple switched the default macOS shell from the GNU-licensed `bash` to the BSD-licensed `zsh`?


Apple switched when bash switched from GPL2 to GPL3 which they didn’t like. The older bash is still available.


> Apple switched when bash switched from GPL2 to GPL3

Apple just didn’t update. It took them years to finally switch to zsh.

> The older bash is still available.

Aside from it being bash (a great reason not to use it as far as I’m concerned), it’s now a 17-year-old version of bash.


> Aside from it being bash (a great reason not to use it as far as I’m concerned) it’s now a 17 years old version of bash.

I thought people liked macOs for its vintage feel? Remember a time when computers could only render a single menu bar in a fixed location, feel the experience of SYN floods, run a version of bash that is old enough to vote in the next presidential election.


Honestly this is such a waste of time: every time I have to argue with developers about installing up-to-date homebrew (or whatever) versions of the coreutils, I wish Apple would simply DELETE all these ancient versions of tools from MacOS. As a bonus, homebrew does not offer --with-default-names any more, and some tools (like make) are installed into different paths, so you need to add your own symlinks or add multiple paths to your PATH.


According to the interwebs, zsh became default with Catalina in 2019 [1]; ten years after bash 4 was released with gpl v3 or later.

Also, the interwebs suggest Apple used to use tcsh as the default shell[2]; I don't know when they changed that, but it may have been after bash 4 released? (Thanks, 10.3 was in 2003, so several years before the license changed)

[1] https://www.theverge.com/2019/6/4/18651872/apple-macos-catal...

[2] https://news.ycombinator.com/item?id=18853318


bash became the default shell in 10.3. tcsh was the default before then.


IIRC, /bin/sh on Ubuntu defaults to dash


That is from Debian and all child distros inherit it by default, not just Ubuntu.


If you want to nitpick, it was first done by Ubuntu and then later upstreamed into Debian (from which other child distros, as you correctly point out, inherit it).


In fact, only dash is supported as /bin/sh on Debian and Ubuntu.


I really didn't understand why the last if statement is confusing. Is it because, when starting out with shell scripting, one would usually assume that [ is part of the bash scripting language and not just another program? If it is, then I think I get it now. Otherwise please mention why it's surprising. Also, @author thanks for a nice article; it was a good read.


Even if you don't know about [ as a binary, I still don't see how it's confusing. Seems like very normal bash to me.


I have strong opinions about shell and they don't actually line up with the rest of the world...

I believe that [ should never be used, only "test" because [ gives the illusion that the mechanism is some part of the language syntax when it is just another "program" (I'm including built-ins and functions in "program").

(if / || / && look at exit status; a program can't see the exit status of other things other than looking at the magic variable $? which is just another string once expanded; case looks at strings but doesn't operate based on exit status and doesn't set an exit status as part of the case ... esac operation; "programs" set an exit status)

I also believe that [ / test should only ever be used for evaluating filesystem constructs -- test -f /dev/null and if you've got string evaluations use case.

Unsurprisingly, most scripts make me itchy, and scripts that I write people find weird.

[edited to add the explanation of "program" vs "syntax" opinion]


Joke's on me; that's the point of the article.

I prefer, when writing in shell, to do this:

    if 
      program
    then
      something
    else
      otherthing
    fi
To emphasize that the thing after the if is just "look at the exit status of the last command before the then."

    if
      ls /tmp/goober
      test -d /tmp/goober
      echo "I'll always execute the then clause because echo will always return a 0 return code"
    then
      echo "this always gets run"
    else
      echo "this never gets run"
    fi
because the "if" is just looking at the exit status of "echo".

[edited to do code blocks]


largely agree, but stylistically speaking I prefer the following:

  if   ls /tmp/goober
       test -d /tmp/goober
       echo "I'll always love you or whatever"
  then echo "the grasshopper always jumps higher"
  else "never gonna let you down"
  fi
i.e. all the commands line up at the 6th column, leaving the if/then/elif/else/fi stuff appearing on the 'margin', as it were.

This works particularly well, visually, when nested ifs are involved.


Hello fellow traveler. I've got 8 years of scripts with `test` in them to prove I agree with you. It's a lonely road out there... I blame the Google shell style guide.

I picked up the habit of preferring `test` when I was writing scripts that needed to run in both `sh` and `bash`, but I kept it because it makes more semantic sense to me than treating a character like `[` as a command. It's also weird that `]` is an argument to `[` rather than also being a binary. I mean, I understand the technical reasoning... but it feels like a hack.


Hello fellow traveler. I've got 20 years of scripts with `if test` in them. I only use `[[` when I need functionality it provides that `test` does not (pattern or regex matching, typically, and for pattern matching I'll generally use a case statement instead).


Yeah that's when I use it too. I figure if I've already given in to the temptations of Bash, I may as well go all the way. Sometimes if I'm feeling extra frisky I even do [[ .. ]] || { echo "error!" ; exit 1; }


I imagine most people have sets of macros they habitually add to any sufficiently complex script they're modifying...

  yelp(){
    es=$1
    shift
    echo "$@" >&2
    exit $es
  }
  
  test -d /tmp/goober || yelp 33 "couldn't find goober!"
(yes, I'm sure there are standards for exit status ranges and 33 is not such a thing)


The Google style guide doesn’t disagree with you, it says to code exclusively for bash where possible (ie mostly everywhere inside Google) and to prefer [[ over [ and test:

https://google.github.io/styleguide/shellguide.html#s6.3-tes...

In a scenario where you also need to support shells other than bash, Google’s shell style guide doesn’t say to use [ over test.


Reading the comments here it looks like there are somehow dozens of us using test only. Dozens !


I've learned to only use [[ when I want to do a regex match, e.g.

  if [[ "${foo}" =~ ^bar$ ]]; then echo Yes; fi
Otherwise, just stick with "test" or "[".

85,000 lines of bash and counting... Not saying bash is great, but it's still meeting my needs for a lot of stuff. Shrug.


If you want to do (relatively) simple pattern matching in a posix-compliant way, `expr`[1] can match BREs, and even return a capture group.

1: https://pubs.opengroup.org/onlinepubs/9699919799/utilities/e...
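For instance, matching a BRE against a string and printing a capture group:

  $ expr "foobar123" : 'foo\(bar[0-9]*\)'
  bar123
  $ expr "foobar123" : 'foo.*'    # without \( \) it prints the number of characters matched
  9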


As someone who enjoys shell scripting I don't think I have ever used the "expr" command. this astonishes and baffles me because I used to read the man pages for fun, I even had a little script to pick a random page. So thanks a bunch for bringing it to my attention.

My only guess as to why I am unfamiliar with the command is that perhaps younger me failed to figure out "expression" means regular expression. It only clicked this time because of your comment and the see also: re_format link at the bottom.

http://man.openbsd.org/expr


Yeah it's slightly obscure for a lot of people I think: most who do know of it seem to associate it purely with arithmetic, and rightly use `$((...))` instead.


The fact that the regex is written bare and unquoted got me a while ago. I'm someone who religiously quotes everything; it took me a while before I figured out why my stupidly simple regex didn't match.


That's a pointless example.

    [ "$foo" = bar ] && echo Yes
For prefix and substring matches, [[ with * globs is generally good enough (plain [ does not do pattern matching):

    [[ "$bar" == extra* ]] && echo '$bar began with extra'
Bash's regex dialect is primitive and rarely worth fussing over. For anything complicated, use another tool be it grep, awk, perl, or such. There are diminishing returns of obsessing over doing everything in bash when a complicated task demands more capabilities suitable to another tool with greater reusability, modularity, and intrinsic types.


I thought regex inside [[ was the same as egrep?


Bourne has said there were proposals from the other users at Bell Labs to use braces or brackets with the "test" form, but these suggestions were not adopted. He has also said he did not like having to worry about missing braces in C, that there were no IDEs with automatic syntax checking like today, and that this is why he used macros to allow him to write C in an Algol-like syntax. Maybe that's why he decided not to use braces or brackets with test.

Personally, I prefer the "test" form to the "[" form. It's easier for me to read. Personal preference. To me, "[" looks uglier.


One place where you learn not to use [ or [[, or be very careful about them is autoconf shell scripts (and that applies to more than `if`, like globs, sed, awk, etc.). Because square brackets are quote characters in autoconf. https://www.gnu.org/savannah-checkouts/gnu/autoconf/manual/a...



I actually thought [ and test were symlinks, but that might just be from Alpine Linux where they're both symlinks to busybox.


Hardlinks actually, but who's counting? Plus, in real-world usage you will hit the shell builtin anyway.

  ls -li /bin/\[ /bin/test
  26016 -r-xr-xr-x  2 root  bin  133256 Mar 25  2023 /bin/[
  26016 -r-xr-xr-x  2 root  bin  133256 Mar 25  2023 /bin/test
I have to admit I avoid "[" in my scripts, it is a weird hack trying to make a command look like syntax and this really bothers me for some reason.


No, Alpine uses symlinks:

    $ docker run --rm -ti alpine
    / # ls -l /usr/bin/test /usr/bin/[
    lrwxrwxrwx    1 root     root            12 Sep 28 11:18 /usr/bin/[ -> /bin/busybox
    lrwxrwxrwx    1 root     root            12 Sep 28 11:18 /usr/bin/test -> /bin/busybox
    / # ls -li /usr/bin/test /usr/bin/[
     495254 lrwxrwxrwx    1 root     root            12 Sep 28 11:18 /usr/bin/[ -> /bin/busybox
     495360 lrwxrwxrwx    1 root     root            12 Sep 28 11:18 /usr/bin/test -> /bin/busybox
    / #


Interesting... Doubly so because I thought busybox was a sort of Linux crunchgen, a way to pack many independent executables into one to save space. With crunchgen at least, each executable name is then linked to the packed binary (that is, hardlinks), and the filesystem name is used to pick the correct code to run. Why did they go with softlinks? My guess: they can cross filesystem boundaries. Perhaps interference from cgroups?

http://man.openbsd.org/crunchgen


Both just look at the first entry in argv. Whether you used a hard- or symlink isn't very important for that. Or at least, that's how FreeBSD crunchgen works. Maybe they changed that in OpenBSD?


This is different. Alpine uses busybox, where everything is implemented in the busybox binary, thus everything is symlinked to it.

In other distributions, test is a separate binary, and [ is a link.


What bothers me is that you can write equivalent code with `test` and `[`, but when using `[` you need to terminate your conditional expression with `]`. Why? Isn't it the same binary? Why is the shell adding this extraneous requirement, just to surface "syntax errors" rather than simply treating `]` as a no-op?


It is the test command that requires the terminating ] when it is invoked as [.

It's possible your shell pre-empts this requirement and presents it as a proper syntax error, but it would have failed anyway.
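For example (the exact error wording varies by shell and implementation):

  $ [ -d /tmp
  bash: [: missing `]'
  $ test -d /tmp; echo $?
  0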


They are different binaries on Linux Mint!

  # file /bin/test /bin/[
  /bin/test: ELF 64-bit LSB shared object, x86-64, version 1 (SYSV), dynamically linked, interpreter /lib64/ld-linux-x86-64.so.2, BuildID[sha1]=6fe552c80ab0b3d3e60de2ab09167329e222eb67, for GNU/Linux 3.2.0, stripped
  /bin/[:    ELF 64-bit LSB shared object, x86-64, version 1 (SYSV), dynamically linked, interpreter /lib64/ld-linux-x86-64.so.2, BuildID[sha1]=99cfd563b4850f124ca01f64a15ec24fd8277732, for GNU/Linux 3.2.0, stripped

  #/bin/test --version
  #/bin/[ --version
  [ (GNU coreutils) 8.30
  Copyright (C) 2018 Free Software Foundation, Inc.
  License GPLv3+: GNU GPL version 3 or later <https://gnu.org/licenses/gpl.html>.
  This is free software: you are free to change and redistribute it.
  There is NO WARRANTY, to the extent permitted by law.

  Written by Kevin Braunsdorf and Matthew Bradburn.
Built with LBRACKET or not ...

https://git.savannah.gnu.org/gitweb/?p=coreutils.git;a=blob;...


I have recently written

    if some_command 1>/dev/null 2>/dev/null; then
        : # A-OK
    else
        here_is_the_code_i_actually_needed
    fi
And it leaves me wondering if there is a way to “negate” the exit status of a command...

(The command was “docker volume inspect VOLUME”, and in one place in my script I had to do things when the volume existed, and in other places I had to do things when the volume did not exist...)


Yes, there is. To negate, just use ! as with many other languages.

    if ! command …; then
        do_needed_things
    fi


How does that work? It doesn't look like ! is a command or shell builtin, so I guess it's an argument to if.


No, it's part of the shell. Do "man bash" or "man sh", then search with "/\!". From man bash: "If the reserved word ! precedes a pipeline, the exit status of that pipeline is the logical negation of the exit status".


Amazing! I've been writing shell scripts since the 90s and now I still learn something new. Thank you!


Interesting, on Linux (Slackware, Debian), /bin/[ and /bin/test are not the same. But on NetBSD (and I assume others too), they are the same.


> And now for the final lolz. I’ve said above that these are the commands you use to evaluate expressions… but the shell also has expressions of its own via the !, &&, and || operators—all of which work on command exit statuses.

It works for simulating very basic boolean expressions, but it gets ugly quickly if you need some more complex combos of NOT, OR and AND. I wish Bash had proper support for boolean expressions.
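For instance, something like "(a or b) and not c" already needs brace grouping and careful semicolons:

  if { [ -e a ] || [ -e b ]; } && ! [ -e c ]; then
      echo "matched"
  fi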


It works fine, you can satisfy your boolean logic needs with bash tests and exit codes.

You'd be amazed at how much of the world runs (just fine!) on bash conditionals.


You'd be surprised how much of the world runs on cobol and Fortran.

Doesn't mean those are good or even okay or that people should choose to use them.


Modern post-2012 Fortran is excellent, and the compiler produces very fast code, GPU or otherwise. And it is very portable.


What do you mean? Bash does have (), {}, !, &&, ||

What else do you need for "proper Boolean expressions support"?


I don't think you can assign boolean expressions to boolean variables, because there is no boolean type.

Something like this won't work to become false:

    foo=true
    bar=false

    baz=$foo && $bar
    echo $baz
Or even simply:

    foo=! $foo
With numeric expressions, you can at least use $((...))


But you can do

    foo=0 # true
    bar=1 # false

    [ $foo = 0 ] && [ $bar = 0 ]
    baz=$?
    echo $baz
The syntax is a bit unusual coming from more modern languages, as is the use of 0 as true. But you can express whatever conditionals you want. Remember that C originally had no dedicated boolean type either.

You could also use numerical expressions although I don't recommend it due to being even less readable (even more so if you have expression complex enough that you need to protect against overflow):

    baz=$((foo + bar))  # foo && bar
    baz=$((foo * bar))  # foo || bar


That still doesn't make it look nice, since if you use false / true, you can use variables directly in if checks. For example:

    foo=true

    if $foo; then
      echo yes
    fi
But with 0 / 1 - that won't work. I.e. you can get some, but not all features of a normal boolean type in various ways.


Yeh you'd have to do something like

    foo=true
    bar=false

    $($foo) && $($bar); baz=$?
    case $baz in
    1) echo false;;
    0) echo true;;
    esac
It just ain't worth it.


Yeah, that's very convoluted. It would be useful to have some context for boolean expressions, similarly how $((...)) works for numeric ones.


Instead of trying to make a shell script portable across all different shells, I prefer to just use #!/bin/bash and not worry about it. (Of course bash version compatibility is a concern of its own.)


Syntactic Syrup of Ipecac.


I'll never understand how there's a whole class of developers who absolutely despise JavaScript for acting weird when adding arrays to objects, but at the same time gladly write bash scripts and send each other articles about how [ is totally a program but ] isn't and that's all fine and dandy and in no way objectionable.

EDIT: this comment is a bog-standard HN middlebrow dismissal and the blog post doesn't deserve for it to be the top comment. It was morning and I was grumpy. I can't delete it anymore so would appreciate some downvotes.


Well, shell scripts are small helper programs, where you don't have to implement all sorts of business rules and complex object interactions. I am pretty sure most developers (myself definitely included) would be horrified at the prospect of having to implement in shell what we are now doing in JS. Even if that were possible.

What people like about this is the design simplicity and uniformity (across processes), not how it looks or what it actually does.


Oh come on. Who are you to say developers don't have to implement business rules and complex interactions? You're just making an after-the-fact rationalization for a terrible, foolish, thoughtless, pointlessly complex design. And then trying to dictate how people should use their tools and what kinds of problems they should limit themselves to solving.

It's not like the shell script designers sat down at a meeting and said:

"OK, it's very important that we don't want people using this language to implement all sorts of business rules and complex object interactions, because decades from now there will be invented an Ousterhoutian dichotomy and government regulations enforcing that developers should use other kinds of languages for that, because of the essential definition of what it means to be a shell scripting language, so we've got to come up with some way of punishing people who attempt to do that, and introduce obscure hard to spot bugs in their programs as a consequence if they have the audacity to do that, or even if their initially simple scripts later get more requirements and have to become more complex. Now let's brainstorm about how we can do that, and make sure the syntactic syrup of ipecac we come up to solve this problem fits in well with the rest of the language design by being totally off-the-wall and unlike every other piece of syntax in any other programming language including itself."

Then again, maybe you have a point, and they did do it on purpose, judging by how terrible the rest of the language is!

"Language Design Is Not Just Solving Puzzles" -Guido van Rossum

https://news.ycombinator.com/item?id=20672739

https://www.artima.com/weblogs/viewpost.jsp?thread=147358

>Summary: An incident on python-dev today made me appreciate (again) that there's more to language design than puzzle-solving. A ramble on the nature of Pythonicity, culminating in a comparison of language design to user interface design.

>Some people seem to think that language design is just like solving a puzzle. Given a set of requirements they systematically search the solution space for a match, and when they find one, they claim to have the perfect language feature, as if they've solved a Sudoku puzzle. For example, today someone claimed to have solved the problem of the multi-statement lambda.

>But such solutions often lack "Pythonicity" -- that elusive trait of a good Python feature. It's impossible to express Pythonicity as a hard constraint. Even the Zen of Python doesn't translate into a simple test of Pythonicity. [...]

http://lambda-the-ultimate.org/node/1298

>Guido: Language Design Is Not Just Solving Puzzles

>And there's the rub: there's no way to make a Rube Goldberg language feature appear simple. Features of a programming language, whether syntactic or semantic, are all part of the language's user interface. And a user interface can handle only so much complexity or it becomes unusable.

>The discussion is about multi-statement lambdas, but I don't want to discuss this specific issue. What's more interesting is the discussion of language as a user interface (an interface to what, you might ask), the underlying assumption that languages have character (e.g., Pythonicity), and the integrated view of semantics and syntax of language constructs when thinking about language usability. [...]

Ousterhout's Dichotomy is a contrived descriptive not prescriptive fiction, to rationalize the design of TCL after the fact. And despite its flaws and limitations, TCL is orders of magnitude better and more thoughtfully designed and purposefully thought out and internally consistent than any Unix shell scripting language.

https://news.ycombinator.com/item?id=9970505

https://en.wikipedia.org/wiki/Ousterhout%27s_dichotomy

>Ousterhout's dichotomy is computer scientist John Ousterhout's categorization[1] that high-level programming languages tend to fall into two groups, each with distinct properties and uses: system programming languages and scripting languages – compare programming in the large and programming in the small. This distinction underlies the design of his language Tcl. [...]

>Criticism: Critics believe that the dichotomy is highly arbitrary, and refer to it as Ousterhout's fallacy or Ousterhout's false dichotomy.[4] While static-versus-dynamic typing, data structure complexity, and dependent versus stand-alone might be said to be unrelated features, the usual critique of Ousterhout's dichotomy is of its distinction of compiling versus interpreting. Neither semantics nor syntax depend significantly on whether a language implementation compiles into machine language, interprets, tokenizes, or byte-compiles at the start of each run, or any mix of these. In addition, basically no languages in widespread use are purely interpreted without a compiler; this makes compiling versus interpreting a dubious parameter in a taxonomy of programming languages.


Well, I agree with all of that and I think maybe my comment was misunderstood.

I was responding to the parent comment asking "why do people like this". Its minimalism is ok-ish for small programs, where you don't feel the pain so much, and there's something kind of beautiful in the solution of packing everything into separate executables.

And I absolutely stated that we have to implement the business rules, and that I would be horrified to implement them in such a language. So yes, the language comes from a different age and shows it, and yes, I would never consider it for anything complex.


I dislike JavaScript and shell scripts equally, but sometimes I have to add a feature to a web page and I have to use JS, so I do, and sometimes I need to automate some un*x system task in a portable way and without heavy deps and shell scripts are the obvious solution.

What annoys me is using JavaScript and shell scripts when there are clearly superior alternatives and no clear advantage for it besides the familiarity (which, admittedly, can be a strong argument).

Shell scripts being an arcane mess is no excuse for Javascript being as clunky as it is, and vice-versa.


One clear advantage that both JS and Bash have over nearly every other language is stability. Code you write today is very likely to still work in 20 years.


You can replace bash with JS. Scripting is their common application domain. Can you replace JS with bash, though?

If the answer is "no," that means JS has applications (e.g., servers and web clients) that bash can't be used for. And they're saying JavaScript is bad at them. Bash is arguably worse, but it's usually not an option in the first place so you don't get to complain.


> You can replace bash with JS

Noo-ot really except in a Turing-tarpit sort of way. Like it or not, once learned (!) Bourne shell together with the traditional tools is a well-designed user interface and works even better for that than, say, Tcl[1]. Not an automation or scripting language, a user interface for all daily interaction. I would absolutely hate to manually sort through my files in JS, while in shell I can often do it faster than in a GUI file manager.

And, of course, it’s also a fairly strong contender as a programming language in the paradigm of many pipelined imperative processes—probably because that paradigm remains largely unexplored. I can only maybe name Icon as a viable competitor, and Icon’s also very nice. (Python, no matter how “inspired” it is by Icon, has traded its command of streams for more mainstream ease of use.) By comparison, the old complement of parallelized JS build tools (Gulp? Grunt? I forgot, it’s been a while) always surprised me with how awkwardly it accomplished shell-script-equivalent tasks.

To be clear, it’s not that Bourne shell is good and JS is bad. JS is a passable Fortran[2], while shell is at best a marginally functional one—there’s a reason Awk exists. But shell competes in categories that most other languages don’t even try to qualify for.

[Yes, I know about rc. I happen to think rc’s focus on one-level lists of strings (incidentally shared by Jam, an attempt at a better make) is a mistake, and more consistent arbitrarily-nested quoting, giving a “stringy Lisp” in the vein of Tcl, would be the way to go.]

[1] http://yosefk.com/blog/i-cant-believe-im-praising-tcl.html

[2] http://conal.net/blog/posts/can-functional-programming-be-li...


> it’s also a fairly strong contender as a programming language in the paradigm of many pipelined imperative processes—probably because that paradigm remains largely unexplored.

The paradigm is called point-free programming[1], right? I don't know how "imperative" is relevant here--you can rewrite most of the coreutils in Haskell, for example.

[1]: https://en.m.wikipedia.org/wiki/Tacit_programming


> Noo-ot really except in a Turing-tarpit sort of way.

That means bash has applications (e.g., batch processing and file management) that JS can't be (easily) used for. And they never said bash is bad at them. That...proves my point?


(I am going to use the word "shell" to refer to posix-sh like shells specifically)

I think a lot of this is a (li)nix culture thing.

Not liking javascript is "cool" in some sense. It's this newfangled web language, not a "real" programming language in this culture. Shell scripting is not seen in the same light because it's older and associated with unix, I guess.

Not liking newer technologies in general is a thing that I have noticed, like with Rust or even C++ which isn't that new!

I do think shells have a better excuse for being bad programming languages than javascript though. Shells are primarily an interface to your computer and not a programming language. I use a shell in a terminal emulator to do most things on my system, like managing files and updating the system, you couldn't easily use javascript for this without writing a shell in javascript.

Shells are also much older than javascript, and "newer" shells like bash or zsh need to maintain some level of backwards compatibility.

Mostly shell scripts are used for very small tasks and not for writing large programs which is another factor.

You do bring up a very interesting point though. I think this idea applies to perl as well. It's interesting to see the difference in how these things are viewed.


one language being the only option available on browsers and the other being a (replaceable) glue layer between _other_ programs? I also see minimalistic beauty in the fact that `[` is not even part of the shell. "do one thing and do it well" at its finest.


Writing bash scripts seems like the ideal use case for Chat-GPT style programming.

The programs are fairly short, examples are common in the corpus, and the syntax and execution model are so inscrutable that only a machine can pretend to understand what’s going on.


I tried doing this and it made a bunch of mistakes with handling spaces and whatnot, the kind of thing that often makes shell scripts brittle. Of course, a human would probably make the same mistakes, because who actually knows how to do that correctly? But it doesn’t really make me comfortable using it for anything but the smallest of automation tasks.


> who actually knows how to do that correctly?

Many people. It's not exactly rocket science to quote your arguments which already gets you most of the way there.
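In practice it mostly comes down to this kind of thing ($src/$dst are placeholders):

    # broken: word-splits and glob-expands on spaces, *, ?, etc.
    cp $src $dst

    # fine: quotes keep each value as a single argument
    cp -- "$src" "$dst"

    # and for lists, iterate over "$@" instead of a space-joined string
    for f in "$@"; do
        printf '%s\n' "$f"
    done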


I already tried to customize my command line prompt with it. All the arcane colour escape codes and random character-escaping missions became a "put the git branch name in parentheses and make it cyan".


What’s so arcane about `echo "$(tput setaf 6)cyan$(tput sgr0)"`?


Uh we must be using different prompts. I'm talking about this: export PS1="\u@\h \[\033[32m\]\w\[\033[33m\]\$(parse_git_branch)\[\033[00m\] $ "


export PS1="\u@\h \[$(tput setaf 2)\]\w\[$(tput setaf 3)\]\$(parse_git_branch)\[$(tput sgr0)\] $ "
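Or, a bit more readable, name the pieces first (still assuming parse_git_branch is defined elsewhere, as above):

    green=$(tput setaf 2)
    yellow=$(tput setaf 3)
    reset=$(tput sgr0)

    # \[ ... \] tells readline the escape sequences are zero-width
    export PS1="\u@\h \[$green\]\w\[$yellow\]\$(parse_git_branch)\[$reset\] $ "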



yes, this is the best use case I have found for chat gpt. I've been automating away all of my little annoyances with powershell.


R programmers be like: I still don't know when to use [ or [[. I just put a browser() there and test out the options


> Some people insist that Linux distributions be called GNU/Linux, not just Linux. The reason is that, strictly speaking, Linux is just a kernel and, when you are using a Linux system, you are primarily interacting with the GNU userland. While I sympathize with that idea, I’m not the one to adopt the GNU/Linux name, in part because most Linux distributions today ship with software developed by many more vendors than just GNU. In this post, however, I use the GNU/Linux term because saying Linux alone would be unfair to Linux.

Hear, hear!


js frontend dev masses ripping on bash here. lulz


> But if you know your script is going to be Bash-specific anyway, you are probably better served by using [[ unconditionally and consistently

Honestly, I think writing a Bash script is just a terrible idea. Use a real language, and all of these nightmares go away.

For me, lately, that's been Zx (which uses Node), but there are other fine choices too.


Bash is "real language" if you treat it as such. Yes, there are plenty of spaghetti scripts out there, riddled with global variables and full of side-effects, but if you write shell in a principled manner, you can solve some pretty big problems with ease, and you can write maintainable code.

I wouldn't start new projects in shell, of course, but one area in which I think the shell shines is in writing integration tests for tools. More on this in a recent post I wrote: https://jmmv.dev/2023/10/unit-testing-with-shtk.html
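For what it's worth, "principled" here mostly means the boring stuff: fail fast, small functions, quote everything, one entry point. A rough sketch (not shtk itself; the docker bit is just an example):

    #!/usr/bin/env bash
    set -euo pipefail   # exit on errors, unset variables, and pipeline failures

    err() {
        echo "error: $*" >&2
        exit 1
    }

    ensure_volume() {
        local volume=$1
        docker volume inspect "$volume" >/dev/null 2>&1 \
            || docker volume create "$volume" >/dev/null \
            || err "cannot create volume $volume"
    }

    main() {
        [ $# -eq 1 ] || err "usage: $0 VOLUME"
        ensure_volume "$1"
    }

    main "$@"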


> Bash is "real language" if you treat it as such. Yes, there are plenty of spaghetti scripts out there, riddled with global variables and full of side-effects, but if you write shell in a principled manner, you can solve some pretty big problems with ease, and you can write maintainable code.

Even carefully written bash code tends to be much less maintainable than code in better languages. It's just badly designed on multiple levels, like the famous post about PHP. Yes, if you're really careful you can write decent code in Malbolge - but why would you?

> I wouldn't start new projects in shell, of course, but one area in which I think the shell shines is in writing integration tests for tools. More on this in a recent post I wrote: https://jmmv.dev/2023/10/unit-testing-with-shtk.html

The fact that you've written something that compiles to shell scripts rather than writing shell scripts rather undermines your claim that shell is a decent language. Integration tests of tools are important, but I'd still find e.g. TCL a much better way to write them.


> but if you write shell in a principled manner, you can solve some pretty big problems with ease, and you can write maintainable code.

So if you are extremely good at writing shell, and you write it extremely carefully, then you can achieve basically what you can do in other languages without those caveats?

It sounds like we both agree that shell is not a sensible choice for scripting, for most people, then.


> but if you write shell in a principled manner

The same can be said about QBasic as well, honestly. And here's the catch: people who are inclined to write anything in a principled manner are likely the ones who'd write it in a principled language instead of, e.g., shell. Or to put it another way, if someone chose to write in shell (or failed to consider an alternative, which also happens quite often), they're quite likely exactly the kind of person who won't write anything in a principled manner.


Oftentimes, the constraints around what you have to do dictate what language(s) you can use. So you do what you can with them and you try to get the best out of them. https://jmmv.dev/2023/11/why-do-i-know-shell-and-how-can-you...


I think one of Rust's creators (or was it Go's?) said that they wanted to get C++ developers to switch to their language, but that didn't happen: they unexpectedly got Python developers instead. In retrospect it makes sense: the people who wanted to switch away from C++ and could afford to had already done so, so the remaining C++ developers are the ones who either don't mind C++ or have to write it.

Which brings me back to my point: yes, technically you can write decent code even in a sloppy language but that misses the bigger picture which is that most of the code written in a sloppy language will inevitably be sloppy because most of the people who write it won't care, due to the dynamics described.

Heck, the Ops department in my org was forced by the security guys to globally enable ShellCheck on commit for their internal repos, and those security guys still have to drop by about every 2-3 months to rip away their "# shellcheck disable" pragmas and force them to actually fix their broken code because those people in the ops actively don't care. The Ruby scripts they write end up slightly less broken because Ruby itself is a slightly more principled language, not because they suddenly care more when they write Ruby.
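(For the curious, the dance usually looks something like this; a made-up but typical example:)

    # what gets committed:
    # shellcheck disable=SC2086
    rm -rf $BUILD_DIR/*

    # what the actual fix looks like: quote it, and guard against an empty variable
    rm -rf -- "${BUILD_DIR:?}"/*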

Not to mention myself: I've spent quite some time and energy on learning shell's semantics and tricks and quirks and dos and don'ts, but it's such an infinite, never-ending descent into the abyss, with rewards of dubious value, that nowadays I just don't care. Whenever I have to write a one-off script, I give up even before I start and write it however sloppily, with minimal quoting, to save my time; and only when it breaks, or when I have to reuse it, or when I have to share it (which happens quite rarely) do I re-write it in a proper language. On the whole, that has definitely saved me both my time and my sanity. As for the cases when I have to write a properly behaving shell script, well, those almost always arise in the course of tasks that can be delegated to our ops team, and now that's their problem.

P.S. I found that re-writing naive but broken shell scripts in a proper language is quite easy: the intended semantics are generally obvious, it's just that the shell requires quirkier syntax to properly express them. Re-writing the (mostly) non-broken shell scripts is much harder: you have to decipher the intended meaning from the quirky syntax, while keeping in mind that the original author could still have gotten it wrong by not knowing about a particular quirk you're aware of (or vice versa).


It was Go. Rust has many converts from C++. But from scripting languages too. It’s a pretty heterogenous programmer-base.


I agree. If a POSIX shell script serves your needs, use it. If it doesn't, you shouldn't be reaching for a more powerful shell, but an actual language. I'd reach for Perl before Bash, and I can't stand Perl.


People had exactly this take and told you to use Perl twenty years ago. Look where we are now. I'd rather have improved my shell scripting abilities right away.

It's really a three-step thing (each step optional):

- I want something I can use on multiple systems right away (shell)

- Ok, things are getting a bit serious, so I'll allow myself to add some prerequisite tooling on the system (other scripting languages)

- Prerequisites suck, so I'll invest in native tooling and produce a binary (e.g. Rust or Go, the choices of the moment)


Agreed. If your shell script has more than the most basic of functionality, you should probably write it in literally any other language. Like another poster said: I don't like perl, but I'll take it any day of the week over bash.



