Mastering Bash Scripting: From Basics to Advanced


Bash (Bourne Again Shell) is the default command-line shell on most Unix/Linux systems, and it provides a powerful scripting language for automating tasks and managing systems. This comprehensive guide will take you through Bash scripting from fundamental concepts to advanced techniques. It is structured to help beginners grasp the basics, guide intermediate users through essential scripting features, and offer advanced users deeper insights and best practices. We’ll cover everything from script structure and simple loops to error handling, signal traps, and performance tuning, with practical examples and real-world use cases throughout.

Getting Started: Basic Bash Scripting Concepts

Script Structure and Execution

Every Bash script is a plain text file containing shell commands. A script typically starts with a shebang line, which indicates which interpreter should execute the script. For Bash scripts, the shebang is usually #!/bin/bash (or #!/bin/sh for a more POSIX-sh compatible script). The shebang must be the first line of the file, and it’s critical for making the script executable directly. For example:

#!/bin/bash
# This is a simple bash script
echo "Hello from my script!"

In the above example, #!/bin/bash tells the system to use the Bash shell to run the script. Lines beginning with # (except the shebang) are comments; the shell ignores them, allowing you to document your code. After writing your script, give it execute permission (chmod +x scriptname.sh) and run it either by specifying the path (./scriptname.sh) or by calling Bash explicitly (bash scriptname.sh). By using the shebang and proper execution steps, you ensure the script runs in the correct shell environment.

Bash executes script commands sequentially, from top to bottom, unless flow control (conditionals or loops) dictates otherwise. It’s important to note that Bash scripts run in a non-interactive, line-by-line fashion, and each command returns an exit status (0 for success, non-zero for error). This exit code can be used to detect failures and make decisions in your script.
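
To make this concrete, here is a minimal sketch that inspects $? after running a command (grep -q searches quietly and reports success or failure only through its exit code):

grep -q "root" /etc/passwd
if [ $? -eq 0 ]; then
    echo "Found the pattern (exit status 0)"
else
    echo "Pattern not found (non-zero exit status)"
fi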

“The only thing worse than a poorly documented shell script is a well-documented shell script that doesn’t work.”
Anonymous

Variables and Shell Variables

Like other programming languages, Bash allows the use of variables to store data. A variable in Bash is essentially a named placeholder for a value (string, number, etc.). You create a variable by assigning a value to a name using the syntax name=value with no spaces around the = sign. For example:

#!/bin/bash
# Defining some variables
greeting="Hello"
name="Alice"
echo "$greeting, $name!"

In the above snippet, we set greeting to "Hello" and name to "Alice". We then use echo to print the greeting. Notice that to retrieve or use a variable’s value, you prefix its name with a $ (like $greeting). Also, we wrapped the variable expansions in quotes ("$name"). Always quote your variable expansions unless you intentionally need word splitting or globbing – unquoted variables can cause unexpected behavior if they contain spaces or special characters. Using quotes ensures the variable is treated as a single literal value (for example, if name was “Alice Smith”, using "$name" preserves the space in the value, whereas using $name without quotes would split it into two words). As a rule of thumb: Double-quote all variable references and command substitutions in scripts to avoid common bugs due to whitespace and glob characters.

Bash variables are untyped (everything is a string, by default). However, Bash can interpret variables in different ways depending on context (e.g., in arithmetic contexts). Some special variables in Bash include $0 (the script’s name), $1, $2, … (the first, second, etc. command-line arguments to the script), $# (the number of arguments), and $? (the exit status of the last command). Environment variables (like PATH or HOME) can be accessed and set via the export command to make them available to child processes.
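
As a quick sketch of these special variables in action (MY_SETTING is just an illustrative name, not a standard variable):

#!/bin/bash
echo "Script name: $0"
echo "First argument: ${1:-<none provided>}"
echo "Number of arguments: $#"
ls /tmp > /dev/null
echo "Exit status of the last command: $?"
export MY_SETTING="enabled"   # illustrative variable, now visible to child processes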

Variable scope: By default, variables in Bash are global to the script and any subprocesses it spawns. Within functions (discussed later), you can use the local keyword to limit a variable’s scope to that function. This is useful to avoid name collisions in larger scripts.

Tip: Bash provides various parameter expansion tricks for advanced variable handling (such as default values, string substitution, substring extraction, etc.). For example, ${VARIABLE:-default} will evaluate to a default value if the variable is unset. These are beyond the basics, but remember that the ${...} syntax is used for disambiguating variable names and for such expansions.
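
A few of these expansions in action, using throwaway variable names:

file="report.txt"
echo "${file%.txt}"               # strip a suffix -> report
echo "${file/.txt/.bak}"          # substitute -> report.bak
echo "${file:0:6}"                # substring (first 6 characters) -> report
echo "${missing_var:-fallback}"   # prints "fallback" because missing_var is unset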

Conditional Statements (if-then-else)

Conditionals allow your script to make decisions and execute certain parts of code only if a condition is true (or false). The primary conditional structure in Bash is the if statement, which can be used with else and elif (else-if) clauses to handle multiple cases. The basic syntax is:

if [ condition ]; then
    # commands if condition is true
elif [ other_condition ]; then
    # commands if the first condition was false, but this one is true
else
    # commands if all above conditions are false
fi

Here, the [ and ] (or the test command) are used to evaluate a condition. Important: there must be a space after [ and before ]. Conditions can include comparisons of numbers, strings, or file attributes. For example, -eq is numeric equality, while == (inside double brackets [[ ]]) or = (inside single bracket test) is string equality. Common operators include -lt (less than), -gt (greater than) for integers, -z (string is empty), -n (string is non-empty), etc. For file conditionals, Bash’s test offers operators like -f (file exists and is a regular file), -d (directory exists), -e (file or directory exists), -r (is readable), -w (writable), -x (executable), among others. For example:

if [ -f "/etc/passwd" ]; then
    echo "The file /etc/passwd exists."
fi

Using double brackets [[ ... ]] instead of single [ has some advantages: it is more flexible (allows regex matching with =~ and doesn’t do word splitting or glob expansion on unquoted variables). Many script writers prefer [[ ]] for conditionals for those reasons. For instance, if [[ "$name" == "Alice" ]]; then ... fi is a valid string comparison. In the example below, we use a string comparison and an else clause:

#!/bin/bash
user="admin"
if [[ "$user" == "admin" ]]; then
    echo "User is an administrator"
else
    echo "User is not an administrator"
fi

This will output “User is an administrator” because the condition is true. You can chain multiple conditions using elif. This is useful for checking multiple exclusive cases. For example, you might check if a number is positive, negative, or zero by using one if and two elif conditions, each with a different comparison.
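
That positive/negative/zero check could be sketched like this, assuming the number is passed as the script's first argument:

#!/bin/bash
num="$1"
if [ "$num" -gt 0 ]; then
    echo "$num is positive"
elif [ "$num" -lt 0 ]; then
    echo "$num is negative"
else
    echo "$num is zero"
fi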

Real-world use case: imagine a startup script that checks if a service is running; you could use if to detect a PID file or ping the service port and then either start the service or print “already running” based on the condition. Another common use is validating user input: if the provided parameter is valid, proceed; otherwise, print an error (possibly using else).

Loops (for, while, until)

Loops allow repetition of a block of commands. Bash supports for loops, while loops, and until loops.

  • For Loops: Ideal for iterating over a known list of items (like a list of files, words, or numbers). The syntax is:

    for variable in list; do
        # commands using $variable
    done
    

    For example, to loop through a fixed list of names:

    for name in Alice Bob Charlie; do
        echo "Hello $name"
    done
    

    This loop will greet each name in the list. You can also use globbing in the list. For instance, for f in *.txt; do echo "Processing $f"; done will iterate over all .txt files in the current directory. The for loop runs until it exhausts the list provided. If the list is omitted, a for loop will iterate over the script’s positional arguments ($@).

  • While Loops: A while loop repeats as long as a certain condition is true. Syntax:

    while [ condition ]; do
        # commands
    done
    

    The loop checks the condition before each iteration, and if it is true, it executes the block, then repeats. For example:

    count=1
    while [ $count -le 5 ]; do
        echo "Count is $count"
        count=$((count + 1))
    done
    

    This will print count from 1 to 5 and then stop (because when count becomes 6, the condition $count -le 5 is false). Such loops are useful when you need to wait for something or read through a file until end-of-file, and so on. Remember to update the condition variable inside the loop, or you may create an infinite loop.

  • Until Loops: An until loop is the opposite of while – it repeats until a condition becomes true (i.e., it runs as long as the condition is false). Syntax:

    until [ condition ]; do
        # commands
    done
    

    The loop body executes, then checks the condition; if the condition is still false, it iterates again. For instance, an equivalent to the above while-example using until could be:

    count=1
    until [ $count -gt 5 ]; do
        echo "Count is $count"
        count=$((count + 1))
    done
    

    Here the loop runs until $count -gt 5 becomes true (i.e., until count is greater than 5).

As an example combining a loop with a conditional, consider a simple script to print numbers 1 through 10:

counter=1
while [ $counter -le 10 ]; do
   echo $counter
   counter=$((counter + 1))
done

This uses a while loop to increment a counter and prints each value. Such a construct is common in scripts for tasks like retrying an operation until success, reading lines from a file (as we’ll see later), or just performing a repetitive calculation.

Real-world use cases for loops: You might use a for loop to batch-process a set of files (e.g., convert all .png images to .jpg in a directory by looping over *.png). A while loop can read from input until it’s exhausted – for example, reading a file line by line (demonstrated in a later section). until can wait for a condition, such as “until this service responds, keep checking every 5 seconds.”
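
As a sketch of that last idea, an until loop can poll until a service answers; this assumes curl is installed and uses a made-up health-check URL:

until curl -sf "http://localhost:8080/health" > /dev/null; do
    echo "Service not responding yet, retrying in 5 seconds..."
    sleep 5
done
echo "Service is up."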

Note on loop performance: Bash loops are powerful, but if you loop over a very large number of items (say thousands), it can be slow compared to using specialized commands. Sometimes it’s better to use commands like find or xargs (which we’ll discuss later) to handle large datasets rather than a huge Bash loop, since external tools often handle bulk operations more efficiently. But for moderate sizes or simple tasks, loops are perfectly fine.

Intermediate Bash Scripting Techniques

Moving beyond the basics, we’ll discuss features that help organize and manage more complex scripts: functions for reuse, arrays for structured data, methods for input/output and file handling, etc. These tools greatly expand what you can do in Bash and help in writing cleaner scripts.

Bash Functions

Functions in Bash allow you to group a set of commands into a reusable unit. This promotes modularity and avoids repetition by letting you call the same block of code from multiple places in your script. You can define a function in two ways in Bash:

  • With the function keyword: e.g. function name { commands; }
  • In a shell style without keyword: e.g. name () { commands; }

Both are accepted. For example, both definitions below are valid and equivalent:

# Definition without the function keyword:
hello() {
    echo "Hello $1"
}

# Definition using the function keyword:
function goodbye() {
    echo "Goodbye $1"
}

Here, hello and goodbye are function names. We can call hello "Alice" and it will print “Hello Alice”, using $1 as the first argument passed to the function. In Bash, function arguments are accessed just like script arguments: $1, $2, etc. (Inside a function, $0 still refers to the script name, not the function name.) You can also use return in a function to exit it early and provide an integer return status (0 for success, or non-zero error code). If a function doesn’t use an explicit return, the status is that of the last command executed in it.

Bash function definitions must appear before you call them (typically, you put all function definitions at the top of the script, or in a sourced file). Once defined, you invoke a function simply by writing its name, like a command. For instance:

hello "World"    # Calls the hello() function defined above
goodbye "World"  # Calls the goodbye() function

Functions help make scripts more readable and maintainable by encapsulating logic. For example, if you are writing a deployment script, you might have functions like start_server(), stop_server(), deploy_code(), etc., each containing the specific steps for that task. Then your main script logic simply calls these functions in the required order.

Local variables in functions: If you declare variables inside a function without local, they can affect or be seen by the rest of the script (global scope). Using local var_name=value confines var_name to that function (and its children), which is usually safer for avoiding side-effects.
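
A small sketch of the difference between a local and a global assignment inside a function:

set_values() {
    local inner="only visible inside the function"
    outer="visible to the rest of the script"
}
set_values
echo "outer = $outer"   # prints the value set inside the function
echo "inner = $inner"   # prints an empty value: inner was local and is gone after the call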

Return values: Bash functions don’t return data the way some languages do. A function can use return <n> to set an exit status (which can be checked via $?). To “return” actual data (like a string or number), you typically output it (echo or printf) and capture that output via command substitution when calling the function, or assign to a global variable. For example:

add() {
    local result=$(( $1 + $2 ))
    echo "$result"
}
sum=$(add 3 4)  # capture output of function
echo "Sum is $sum"

This prints “Sum is 7”. We could also have the function set a global variable instead of printing. Choose the method that fits your script’s design.

Arrays in Bash

An array is a variable that holds multiple values, accessed by index. Bash supports indexed arrays (numerical indices starting at 0) and associative arrays (key-value pairs, like dictionaries). Arrays are handy for managing lists of items (filenames, usernames, etc.) within one variable.

  • Indexed arrays: By default, Bash treats variables as indexed arrays when you use the appropriate syntax. You can create one by assigning multiple values in parentheses. For example:

    fruits=("apple" "banana" "cherry")
    echo "${fruits[0]}"   # prints "apple"
    fruits[3]="date"      # add a new element at index 3
    echo "We have ${#fruits[@]} fruits."   # ${#array[@]} gives length
    

    In this example, fruits[0] is “apple”, fruits[1] is “banana”, etc. Bash arrays are zero-indexed. You can also explicitly declare an indexed array with declare -a arrayname, but it’s usually not required to declare in advance unless you need an empty array to start with.

  • Associative arrays: These require Bash version 4 or later. They allow using string keys. For example:

    declare -A user_id
    user_id["alice"]=1001
    user_id["bob"]=1002
    echo "Alice's ID is ${user_id["alice"]}"
    

    We declare an associative array with declare -A. Then we assign values to keys like "alice" and "bob". Access is similar to indexed arrays but using the key: ${user_id["alice"]}. Associative arrays are great for look-up tables or grouping related data by names.

To iterate over an array, you can use a for loop:

for fruit in "${fruits[@]}"; do
    echo "I like $fruit"
done

${fruits[@]} expands to all elements. Likewise, for associative arrays, you might iterate over keys with ${!user_id[@]} (the ! expands to indices or keys):

for name in "${!user_id[@]}"; do
    echo "$name has ID ${user_id[$name]}"
done

This will list each name and its ID.

Arrays provide a convenient way to handle lists without having to manage separate variables for each item. For instance, a script could use an array of server names to SSH into each and run some command, or use arrays to collect results from a series of operations.

Array operations: You can get all values with ${array[@]} and all indices with ${!array[@]}. The length of an array is ${#array[@]}. Bash doesn’t have built-in multi-dimensional arrays, but you can simulate them with associative arrays (using composite keys). You can also slice arrays (${array[@]:start:count}) to get a subset of elements.
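
For example, a sketch of length, indices, and slicing with the fruits array from earlier:

fruits=("apple" "banana" "cherry" "date")
echo "${#fruits[@]}"        # 4  (number of elements)
echo "${!fruits[@]}"        # 0 1 2 3  (the indices)
echo "${fruits[@]:1:2}"     # banana cherry  (two elements starting at index 1)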

One thing to be careful with: when expanding array elements in loops or conditions, quote them as above ("${fruits[@]}") to avoid word-splitting issues, especially if elements might contain spaces.

Input and Output (I/O) in Bash

Scripts often need to accept input (from the user or from files) and produce output. Bash provides several mechanisms for this:

  • Reading user input: The read command is used to get input from standard input (keyboard or redirected file). For example:

    read -p "Enter your name: " name
    echo "Hi $name, welcome!"
    

    The -p option prompts the user with the given string (here "Enter your name: "). The user’s input (up to newline) is stored in the variable name. This is interactive input. If you use read without -p, it will just block waiting for input (which you can supply by typing and pressing Enter, or via redirection).

  • Command-line arguments: As briefly noted, your script can take arguments. If you run ./script.sh foo bar, inside the script $1 will be “foo” and $2 will be “bar”. You can use these as inputs to your script. A common practice is to check $# (number of args) and use an if to provide usage instructions if no arguments are given, for example:

    if [ $# -lt 2 ]; then
        echo "Usage: $0 <inputfile> <outputfile>"
        exit 1
    fi
    infile="$1"
    outfile="$2"
    

    Here $0 is the script name, so this prints a usage message and exits if fewer than two arguments are provided.

  • Echo and Print: Output can be sent to standard output using commands like echo or printf. echo "message" will print the message (and by default add a newline). printf works like the C printf, allowing formatted output (e.g., printf "Name: %s, ID: %d\n" "$name" "$id"). For simple messages, echo is usually sufficient, but be aware that echo has some portability quirks (options like -e for escape sequences, etc., vary). printf is more reliable for certain tasks or if you need formatting.

  • Redirection: One of the most powerful features in shell scripting is I/O redirection. You can redirect a command’s output or input using special operators:

    • > redirects standard output (stdout) to a file, replacing the file’s contents if it exists (or creating it if not). For example, ls > files.txt writes the output of ls into files.txt.
    • >> appends stdout to a file (keeps existing contents). E.g., echo "Done" >> log.txt will append “Done” to log.txt.
    • < redirects a file to a command’s standard input. For example, read var < file.txt would read the first line of file.txt into var, or wc -l < file.txt counts lines in file.txt by feeding it to wc.
    • 2> redirects standard error (stderr). You can capture errors separately from normal output. For instance, ls /nonexistent 2> errors.txt will send the error message to errors.txt instead of the screen.
    • You can combine descriptors: &> file redirects both stdout and stderr to a file (the portable equivalent is > file 2>&1). A common pattern is command > output.txt 2>&1, which puts all output (stdout and stderr) into one file.

    Redirection allows scripts to log their outputs or errors, or read input from files in place of user input. For example, rather than prompting the user within a script, you might use read to process lines from a file: while IFS= read -r line; do ... done < input.txt (the IFS= and -r are explained below).

  • Pipelines: Using the | operator, you can pipe the output of one command into another, which is a form of connecting input/output between processes. E.g., grep "ERROR" app.log | wc -l passes all lines containing “ERROR” into wc -l to count them. In scripts, pipelines are useful for filtering or transforming data on the fly. (Keep in mind each part of a pipeline may run in a subshell – we’ll discuss subshells later, but it means variables set in a pipeline segment might not persist after.)

  • Reading files line by line: A common requirement is to read a file line by line in a loop. The recommended approach is using a while loop with read. For example:

    # Process each line of a file
    while IFS= read -r line; do
        echo "Got line: $line"
    done < input.txt
    

    Here we use IFS= (which clears the Internal Field Separator, preventing leading/trailing whitespace from being trimmed) and read -r (the -r option tells read not to treat backslashes as escape characters). This combination while IFS= read -r line; do ... done < file is a safe idiom to read a file line by line without surprises (like dropping backslashes or trimming spaces). Each iteration, $line contains the current line’s text. This technique is often used for configuration files or any input where you need to handle one record at a time.

  • Here-documents and here-strings: Bash also has here-documents (using << to feed a block of text to a command’s stdin) and here-strings (<<< to feed a short string into a command). For example:

    cat <<EOF > example.txt
    This is line 1
    This is line 2
    EOF
    

    This would send the two lines into the cat command, which then writes them into example.txt (because of the > example.txt redirection). A here-string like grep "foo" <<< "$text" feeds the content of $text into grep. These can be useful for embedding multiline text or avoiding external echo calls.

Practical I/O examples: You might write a script that asks the user for some info (using read) and then writes it to a database or file. Or a script that reads from a template file line by line and does some substitution. Logging is another common scenario: you might redirect output and errors to log files for later review (exec > logfile 2>&1 at the top of a script will send all following output to that logfile). We’ll see more about error logging in the next section.
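
Here is a tiny sketch of that logging idea, with a placeholder log path; after the exec line, everything the script prints (including error messages) lands in the log file instead of on the terminal:

#!/bin/bash
LOGFILE="/tmp/myscript.log"     # placeholder path
exec > "$LOGFILE" 2>&1          # all subsequent stdout and stderr go to the log
echo "Run started at $(date)"
cp /etc/hosts /tmp/hosts.bak    # any error from cp also ends up in the log
echo "Run finished at $(date)"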

Basic File Handling

File handling in Bash typically means checking for file existence, reading/writing files, and perhaps manipulating filenames. While Bash is not a full-fledged programming language with built-in I/O libraries, it relies on Unix commands and shell facilities for file operations.

Testing files: As mentioned under conditionals, Bash’s [ ] (or test) provides many options to check files:

  • -e FILE – check if FILE exists (regardless of type).
  • -f FILE – check if FILE exists and is a regular file (not a directory or device).
  • -d FILE – check if FILE is a directory.
  • -s FILE – check if FILE exists and is not empty (size > 0).
  • -r FILE – readable by the script/user, -w FILE – writable, -x FILE – executable.
  • -O FILE – if the script’s user owns the file, etc.

Using these in an if statement allows scripts to make decisions about files. For example:

if [ -d "/backup" ]; then
    echo "Backup directory exists."
else
    echo "Creating backup directory."
    mkdir "/backup"
fi

This checks for a directory and creates it if not present. Another example:

if [ -e "$output_file" ]; then
    echo "Warning: $output_file already exists, it will be overwritten."
fi

It’s good practice to check for files or directories before trying to use them (to avoid errors or data loss).

Creating and removing files/directories: Bash can use commands like touch (to create an empty file), mkdir (make directory), rm (remove file) or rmdir (remove empty directory) for file system manipulation. For instance, touch logfile.txt creates an empty file logfile.txt (or updates its timestamp if it exists). These are external commands, but they’re standard and frequently used in scripts.

Reading and writing files: We saw how to read a file line by line with a loop. If you simply need the whole content of a file, you might use cat filename to output it, or use command substitution to get it into a variable (e.g., content=$(<filename) which is a Bash shortcut to read a whole file into a variable – be careful with large files though). For writing, redirection is the main method. Example:

echo "User $USER started the process at $(date)" >> process.log

This appends a log entry to process.log. If the file doesn’t exist, it will be created. If it does exist, the new line is added at the end.

For more controlled writing, you can use exec to assign file descriptors. For example:

exec 3> output.txt
echo "First line" >&3
echo "Second line" >&3
exec 3>&-   # close file descriptor 3

This opens output.txt for writing on file descriptor 3, writes two lines to it, then closes it. This is an advanced technique (using custom file descriptors) which is useful if you need to keep a file open for writing throughout the script (avoiding the overhead of opening/closing each time). But for most cases, simple > or >> redirection in each write is sufficient.

Example use case: Suppose you’re writing a script to process some data and produce a report. You might check if an output directory exists (using -d), create it if not, then redirect output to a file in that directory. Or consider a script that needs to clean up old log files – it could use file tests ([ -f "$file" ]) and perhaps find command integration to delete files older than a certain date.
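
A sketch of the clean-up idea, assuming logs live in a hypothetical /var/log/myapp directory, "older than 30 days" is the cut-off, and a find that supports -delete (GNU find does):

log_dir="/var/log/myapp"   # hypothetical location
if [ -d "$log_dir" ]; then
    find "$log_dir" -name "*.log" -type f -mtime +30 -delete
else
    echo "Log directory $log_dir not found, nothing to clean." >&2
fi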

File content manipulation: Often you will combine shell scripting with Unix text-processing tools for file content (like grep to search inside files, sed or awk to transform text). While those are outside pure Bash built-ins, learning them is part of effective shell scripting. For example, a script might use grep to find lines in a log and then process them in a loop, or use awk to calculate some statistics from a CSV file.
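
For instance, a sketch that feeds grep output into a read loop, plus an awk one-liner that averages a column (the file names and column are made up):

# Handle only the ERROR lines from a log file
grep "ERROR" app.log | while IFS= read -r line; do
    echo "Investigating: $line"
done

# Average the values in the second column of a comma-separated file
awk -F',' '{ sum += $2; count++ } END { if (count) print sum / count }' data.csv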

At this point, we’ve covered the intermediate essentials. Next, we move into more advanced territory, which will build on this knowledge and introduce techniques to make scripts more robust and efficient.

Advanced Bash Scripting Techniques

In this section, we’ll explore advanced topics that are crucial for writing robust and efficient Bash scripts. These include error handling strategies, signal trapping for cleanup, understanding subshells, using process substitution for complex piping, and optimizing script performance. Mastering these will elevate your scripting ability to handle real-world, complex scenarios.

Error Handling and Exit Status Management

Robust error handling ensures that your script doesn’t fail silently or proceed with incorrect assumptions. Bash has several mechanisms for handling errors and monitoring commands:

  • Exit codes: Every command in Bash returns an exit status (0 means success, any non-zero code indicates some form of error or “false” condition). In a script, you can access the last command’s exit code via the special $? variable. It’s common to do something like:

    cp "$source" "$dest"
    if [ $? -ne 0 ]; then
        echo "Error: copy failed!"
        exit 1
    fi
    

    However, typing if [ $? -ne 0 ] after every important command is tedious. Instead, Bash provides options to make error handling easier:

  • set -e (errexit): This shell option makes the script exit immediately if any command returns a non-zero status (with some exceptions for commands in conditional lists, etc.). By putting set -e at the top of your script, you can prevent the script from blindly continuing after a failure. This is useful for safety, especially in scripts that might cause harm if a step fails (e.g., backups, system changes). Do note, if you enable set -e, you might need to occasionally use constructs like || true for commands that you allow to fail. For example, grep "foo" file || true would ensure that even if grep doesn’t find anything (returns 1), the script doesn’t exit at that point. Many experienced Bash authors use set -e (often combined with other options below) at the start of scripts to enforce stricter error checking.

  • set -u (nounset): This option makes Bash treat use of an unset variable as an error (and exit immediately, similar to set -e). This helps catch typos and logic errors. For instance, if you mistyped a variable name, the script would stop rather than proceed with an empty value which could cause unexpected behavior. It’s often enabled alongside -e. In short: set -u ensures you don’t unknowingly use undefined variables.

  • set -o pipefail: By default, in a pipeline, the exit status of the pipeline is that of the last command. This can be problematic – e.g., in cmd1 | cmd2, if cmd1 fails but cmd2 succeeds, the pipeline as a whole returns 0, masking the failure. set -o pipefail changes this so that the pipeline’s exit status is non-zero if any command in the pipeline fails. This is important for catching errors in complex pipelines.

  • set -x (xtrace): Not for error handling per se, but for debugging: this option makes Bash print each command to stderr before executing it, prefixed with +. It’s useful to trace what the script is doing (you can turn it on at problematic sections or run the whole script with bash -x script.sh).

A common practice is to enable some or all of these options at once as a “strict mode” at the top of your script, for example:

set -euo pipefail

This turns on -e, -u, and pipefail together (pipefail has no single-letter form, so it is enabled through -o). This way, you catch many errors early. Just be aware of how these options interact; for instance, commands run in subshells or tested in if conditions are treated differently, so you might need to adjust.

  • Using || and && for error handling: You can chain commands with && (and) and || (or) to control flow based on success or failure. For example:

    mkdir "/backup" && cp important.db "/backup/" && echo "Backup succeeded." || echo "Backup failed!"
    

    This one-liner attempts to create a directory and copy a file; the && ensures each step runs only if the previous succeeded. If any step fails, the chain after the || executes. A clearer way to write this in multiple lines might be:

    mkdir "/backup" && cp important.db "/backup/" \
        && echo "Backup succeeded." \
        || { echo "Backup failed!"; exit 1; }
    

    Here, if any of the &&-chained commands fail, the || part will execute. This technique is effectively manual error handling. Another use is to provide custom error messages:

    command1 || { echo "command1 failed"; exit 1; }
    

    This will print an error and exit if command1 has a non-zero exit. It’s a compact alternative to a full if-statement for the same purpose.

  • Trap and ERR: Bash has a trap command which we will discuss more for signals, but there is a special pseudo-signal ERR that triggers when any command fails (with set -e or not). You can trap 'echo "Error occurred on line $LINENO"' ERR to run a snippet when an error happens. This can be used to do cleanup or logging on errors. For example, you could trap ERR and in the handler send yourself an email or record something to syslog when a script fails unexpectedly.

  • Custom error messages and logging: Beyond just exiting on errors, a robust script should inform the user (or admin) what went wrong. This can be as simple as echoing to stderr (you can use echo "msg" >&2 to send output to stderr), or writing to a log file. For instance, you might maintain a log:

    LOGFILE="/var/log/myscript.log"
    echo "[$(date)] Starting backup job..." >> "$LOGFILE"
    # ... run commands
    echo "[$(date)] ERROR: Backup failed at step X" >> "$LOGFILE"
    

    Logging every step can be verbose, but for critical scripts it’s often worth it. You can also redirect all output to a log (using exec > "$LOGFILE" 2>&1 at the start, which sends both stdout and stderr to the logfile). Another pattern is to duplicate output to both screen and log using tee. For example:

    exec > >(tee -i "$LOGFILE") 2>&1
    

    This uses process substitution (explained later) to tee output. After this line, everything the script prints will go both to the logfile and to original stdout. This is advanced usage, but very powerful for debugging and monitoring scripts in real time.

Example scenario for error handling: Imagine a deployment script that brings down a service, updates it, and brings it back up. If the update step fails, you must ensure the service still comes back up or at least notify someone. Using set -e would stop the script on failure, but you might trap the error to perform a rollback or restart. Additionally, you’d log the error and perhaps send an alert. Robust error handling means your script can deal with unexpected situations gracefully (or at least not make things worse).
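
A rough sketch of that pattern with made-up service and helper names; under set -e, the ERR trap fires when a step fails and attempts a rollback and restart before the script exits:

#!/bin/bash
set -euo pipefail

rollback() {
    echo "[$(date)] ERROR: deployment step failed, rolling back" >&2
    restore_previous_version        # hypothetical helper function
    systemctl start myservice       # make sure the service comes back up
}
trap rollback ERR

systemctl stop myservice
deploy_new_version                  # hypothetical helper that might fail
systemctl start myservice
echo "Deployment finished successfully."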

Signal Trapping (Cleaning Up on Exit)

Signals are asynchronous notifications sent to your process by the OS (or by user actions like pressing Ctrl+C which sends SIGINT). Bash scripts can trap signals to execute specific cleanup or handling routines. This is important, for example, if your script creates temporary files or locks that need to be removed even if the user aborts the script.

The Bash trap builtin allows you to specify commands to run when the shell receives specific signals. Common signals to handle in scripts include:

  • SIGINT (interrupt, typically from Ctrl+C)
  • SIGTERM (termination request)
  • EXIT (a pseudo-signal indicating the script is about to exit, whether normally or due to a signal or error)
  • SIGUSR1/SIGUSR2 (user-defined signals, occasionally used for custom control)
  • SIGHUP (hang-up, terminal closed – often used to reload configs in daemons, not always relevant to short scripts)

A typical use-case is cleaning up temp files. Example:

tempfile="/tmp/mytemp.$$"   # $$ is the script's PID
touch "$tempfile"

# Define a cleanup function
cleanup() {
    rm -f "$tempfile"
    echo "Temporary file removed"
}

# Set traps
trap cleanup EXIT        # on script exit, call cleanup
trap cleanup SIGINT SIGTERM  # on Ctrl+C or termination, call cleanup

# (script body)
echo "Doing work... (press Ctrl+C to interrupt)"
sleep 30   # simulate long task

In this script, no matter how it exits – whether the sleep finishes normally, or the user hits Ctrl+C, or a kill command is sent – the cleanup function will run and remove the temp file. The trap ... EXIT is particularly useful because it catches any exit (including those caused by set -e). It’s like a finally block in other languages, ensuring cleanup runs on exit. It’s recommended to set the trap early (right after creating the temp resource) so that even if an error happens shortly after, the trap is in place.

Another example: if you create a lock file (to ensure only one instance of your script runs at a time), you’d trap EXIT and signals to remove the lock on exit, so it doesn’t remain if the script crashes or is interrupted.
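
A sketch of that lock-file pattern (the lock path is arbitrary; for production scripts, mkdir or flock gives an atomic check, but this shows the trap idea):

lockfile="/tmp/myscript.lock"   # arbitrary lock location
if [ -e "$lockfile" ]; then
    echo "Another instance appears to be running, exiting." >&2
    exit 1
fi
touch "$lockfile"
trap 'rm -f "$lockfile"' EXIT SIGINT SIGTERM   # remove the lock however the script ends

# ... rest of the script ...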

Beyond cleanup, traps can be used for other interactions:

  • You can trap SIGINT to prevent the script from being stopped by Ctrl+C, or to ask the user “Really quit? (y/n)” before exiting.
  • Trap SIGHUP if you want to maybe reload configuration (though typically daemons do this, not short scripts).
  • Trap custom signals if you have background jobs communicating with your script.

Example of interactive trap:

trap 'echo "Interrupted! Cleaning up..."; cleanup; exit 1' SIGINT

This will intercept Ctrl+C, print a message, call cleanup (assuming that function has been defined), and exit with code 1.

Traps provide a way to make your script more robust and safe, especially for scripts that run for a long time or scripts that manage system state (mounting/unmounting devices, modifying configurations, etc.). Without traps, if a user interrupts the script at the wrong time, you might leave things half-done or resources locked.

Real-world use case: You have a script that mounts a disk, copies some files, then unmounts it. If the user presses Ctrl+C mid-way, you’d want to ensure the disk still gets unmounted. By trapping SIGINT and EXIT to run the unmount command, you handle that gracefully. Or consider a build script that spawns multiple background processes – you might trap SIGTERM to kill all spawned subprocesses before exiting, to avoid leaving orphan processes.

Subshells and Command Substitution

In Bash, a subshell is a separate instance of the shell process that gets spawned to execute commands. Understanding subshells is important for controlling scope and side effects in your script.

What creates a subshell?

  • Running a script itself is a new shell (subshell of your terminal’s shell).
  • Using the $(command) syntax for command substitution runs the command in a subshell and returns its output.
  • A pipeline (cmd1 | cmd2) runs each part in a separate subshell (with some exceptions in recent Bash versions for the last command in the pipeline).
  • Grouping commands in parentheses ( ... ) explicitly creates a subshell to execute that group.
  • Starting a background job with command & also effectively runs in a separate process (subshell).

Why subshells matter: A subshell has its own copy of variables and environment. Changes to variables inside a subshell do not affect the parent shell. For example:

X=5
(echo "Inside subshell, X=$X"; X=10)
echo "Back in parent shell, X=$X"

This will output “Inside subshell, X=5” and then “Back in parent shell, X=5”. The change to X inside the parentheses did not reflect outside, because it happened in a subshell.

Similarly, if you do output=$(somecommand), the somecommand runs in a subshell. If that command were to change a global variable or the current directory, those changes wouldn’t persist after the substitution. (Only the output is captured.)

Command substitution using backticks `cmd` or the modern syntax $(cmd) is extremely common to capture output of a command into a variable or to embed command output in a string. For example:

today=$(date +%Y-%m-%d)
echo "Backup for date $today"   # prints, e.g., Backup for date 2025-05-21

Here date +%Y-%m-%d runs in a subshell, and its output is substituted. If date failed (unlikely in this case), the script would get an empty string (and an exit code, which in context may or may not be checked depending on set -e or usage).

Grouping commands: Using ( ... ) to group commands runs them in a subshell, whereas using { ...; } groups commands in the current shell process without a subshell. The syntax differences: { } requires a space after { and a semicolon or newline before }. Example:

# Using a subshell
(cd /tmp && ls -l)
# After this, we are still in the original directory because ( ) didn't affect parent shell.

# Using braces (no subshell)
{ cd /tmp && ls -l; }
# After this group, we remain in /tmp in the current shell, because the cd was not in a subshell.

In the first case, we changed directory inside ( ), but when that subshell ended, our working directory in the parent shell didn’t change. In the second, we actually changed the directory in the current shell.

So, use a subshell when you want to isolate changes or run something concurrently without affecting the main script’s environment. Use braces when you want to group commands logically but keep effects (like variable changes or cd) in the current shell.

Example uses:

  • You want to capture the output of a sequence of commands: result=$( { cmd1; cmd2; } | grep "something" ). Here braces ensure cmd1; cmd2 runs in current shell feeding into grep. If you used parentheses instead of braces around cmd1; cmd2, it would have made a subshell – in this particular case it wouldn’t matter because we’re just piping output, but if cmd1 or cmd2 needed to set some variable for use after, braces would be necessary to keep that change.
  • Running a command in a different directory without affecting current directory: (cd /some/dir && do_something) runs in subshell, so after it, you’re still in the original directory.
  • Concurrency: launching things in parallel inherently uses subshells (each background job is a separate process).

Exit status of subshell/group: A subshell (in ( )) returns an exit status like any command, which is the status of the last command inside it. Same for a { } group. You can use this in conditions: e.g., if (grep -q "ERROR" logfile); then ... fi runs grep in a subshell. (Though here the subshell isn’t adding value; you could just do if grep -q "ERROR" logfile; then directly – just an illustration.)

Process substitution (discussed next) also creates subshells under the hood to produce its results.

Process Substitution <( ) and >( )

Process substitution is an advanced feature of Bash (and some other shells like Zsh and KornShell) that allows you to treat the output of a command or the input to a command as if it were a file. It uses a syntax <(command) for output and >(command) for input. This is particularly useful when a command expects file arguments, but you want to provide dynamic data from other commands.

For example, the diff command normally compares two files. If you want to diff the outputs of two commands, you could do something like:

diff <(sort file1) <(sort file2)

Here, <(sort file1) runs sort file1 in a subshell and provides its output as a temporary file (like /dev/fd/63 or a named pipe) to the diff command. Similarly, the second process substitution gives another file for the second sorted output. diff then reads those “files” and compares them. No temporary files on disk were needed in this process – the shell handles it via pipes or the /dev/fd mechanism.

In simpler terms, process substitution makes the output of a process appear as a file name that you can pass to another program. The program doesn’t know (or care) that it’s not reading from a real file. This trick enables powerful compositions:

  • You can use it with any command that expects file arguments. For example, comm <(command1) <(command2) to feed two sorted outputs into the comm tool (which compares two sorted files line by line).
  • paste <(cmdA) <(cmdB) will paste the outputs of cmdA and cmdB side by side.
  • Capturing input: >(command) is used less frequently, but it provides a way to send output into another command’s input. For example, some program might insist on writing output to a file; you could give it a >(cmd) path so that as it writes, the data is actually piped into another command. E.g., tar cf >(gzip > archive.tar.gz) some_folder. In this (more complex) example, tar is writing its archive output to the file given by >(gzip > archive.tar.gz), but that file is actually a pipe that feeds into gzip, which compresses the data and writes to archive.tar.gz. This achieves compressing on the fly without an intermediate file.

Process substitution is especially useful when you need to deal with multiple streams of data simultaneously. Without it, you might have to create temporary files as intermediates. For instance, without <(sort file1), you’d have to do sort file1 > /tmp/file1.sorted and sort file2 > /tmp/file2.sorted then diff /tmp/file1.sorted /tmp/file2.sorted and then clean up the temp files. Process substitution automates all those steps in one neat syntax.

Requirements: Process substitution is dependent on the system supporting either named pipes (FIFOs) or the /dev/fd special files. Most Linux systems do. If you ever run a Bash script in a minimal environment (or in /bin/sh on systems that don’t have it), it might not be available. Also note, process substitution is not POSIX, it’s a Bash (and KornShell) feature.

Real-world usage: You might use process substitution to compare command outputs (like the diff example, which is common). Another common use is with the tee command for logging: some_command | tee >(grep "ERROR" > errors.log) > full.log. Here, we pipe output into tee, which duplicates it: one duplicate goes into a process substitution >(grep "ERROR" > errors.log) meaning all output is filtered by grep for “ERROR” and those lines go to errors.log, and the other duplicate of output goes to full.log. This way you get a full log and a filtered log in one pass, without writing the full log first and grepping it later. This is an advanced pattern, but it shows how flexible Bash can be with process substitution.

Performance Optimization and Best Practices

Bash is a convenient and quick scripting tool, but it’s not the fastest for heavy computation. However, there are ways to write more efficient Bash scripts and avoid common performance pitfalls. Here are some tips and best practices for optimizing Bash scripts:

  • Use built-in shell commands and features whenever possible: External commands (like grep, awk, sed, etc.) fork a new process, which is relatively expensive if done in large loops. Bash built-in commands and operators execute within the shell and avoid spawning new processes, making them faster. For example, use Bash’s string manipulation or pattern matching instead of calling external tools for simple tasks. If you want to check if a string contains a substring, using Bash’s [[ "$string" == *sub* ]] is faster than running echo "$string" | grep sub. Arithmetic with $(( )) or let is much faster than calling the external expr.

  • Avoid unnecessary subprocesses: A classic example is UUOC (Useless Use of cat). Instead of cat file | grep "foo", do grep "foo" file. Each | and each command invokes a new process. Here cat is unnecessary overhead. Another example: instead of using backticks or $( ) to capture something trivial repeatedly, try to restructure the script to avoid it in a tight loop. If you need to call an external command, try to do it once and reuse the result rather than calling it in every iteration of a loop.

  • Optimize loops: If you have to loop, keep the loop’s content lean. Move any heavy work or external command outside the loop if possible. For instance, if you are reading 1000 lines and grepping each line inside the loop, it’s far better to just grep the whole file once outside the loop (if that achieves the same result). Or, use tools that can operate on all lines at once instead of a shell loop. Remember that shell loops themselves in Bash are not super fast (they’re okay for a few thousand iterations, but beyond that you might feel it). One often-cited approach is: do as much as possible with stream operations (like using grep, awk, sort, etc. which are in C and optimized) instead of shell loops for large data sets. For example, to sum numbers in a file, a Bash loop reading each number and adding might be fine for small input, but awk '{sum+=$1} END{print sum}' file will be magnitudes faster on large input.

  • Parallelize where possible: If you have independent tasks, you can run them in parallel to utilize multiple CPU cores. This can be done by simply putting & at the end of a command to run it in the background and later using wait to wait for all of them to finish (see the sketch after this list). Or use tools like GNU Parallel or xargs -P for parallel execution. For instance, if you need to compress 10 files, you could do them sequentially in a loop (taking, say, 10 minutes total if each takes 1 minute), or you could compress several in parallel in the background to finish faster (assuming you’re not heavily I/O-bound). Example with xargs: ls *.log | xargs -n 1 -P 4 gzip would compress log files using up to 4 parallel processes. Or using GNU Parallel: parallel -j4 gzip ::: *.log. Be mindful not to overload the system, though.

  • Minimize subshell usage: Forking a subshell has overhead. Sometimes you can avoid it. For example, instead of using a command substitution in a loop condition that runs every time, maybe you can restructure. Or instead of spawning a subshell solely to capture a small piece of info multiple times, run it once and store it. The MoldStud reference suggests leveraging built-ins or process substitution to avoid creating too many subshells in loops. A scenario: you might be tempted to do something like for f in $(ls *.txt); do ...; done. This spawns a subshell for ls and also has word-splitting issues. It’s better to use Bash’s globbing directly: for f in *.txt; do ...; done (no subshell, and safe with filenames).

  • Prefer [[ ... ]] and arithmetic expansion where applicable: The [[ ]] construct is a Bash built-in for conditionals and can be more efficient than calling an external test or [ (though [ is also typically a shell builtin in Bash). Arithmetic expansion with (( )) and $(( )) is much faster than the external expr, which forks a new process for every calculation. So for numeric computations, use shell arithmetic (or bc if high precision is needed), but avoid spawning a lot of external processes for math.

  • Reduce I/O operations: If your script writes to disk frequently (like writing in a loop), consider whether it can build data in memory and write once, or use buffering. Each I/O call can be slow. If you can combine multiple small writes into one big write, do it. For reading, if you need random access or multiple passes, maybe loading data into an array might be beneficial (depending on size). Also consider turning off debugging (set +x) or verbose logging in tight loops, as writing logs itself can slow down the loop.

  • Use appropriate data structures: Bash doesn’t have advanced data structures, but associative arrays can sometimes save time if you need to look up things by key, instead of grepping through a file or list repeatedly. For example, if you have a list of allowed users and you need to check 1000 input entries against this list, putting allowed users in an associative array (keys) and then checking each entry with a quick array lookup is faster than running a grep for each entry in a loop (which would be 1000 greps).

  • External tools vs Shell: Recognize when a task is better done with a specialized tool or a more powerful language. For intense text processing (large files, complex parsing), tools like awk or even switching to Python/Perl might be warranted. Bash is excellent as “glue” to connect system commands, but heavy data crunching in pure Bash can be slow. As one guideline notes, the shell should do as little as possible itself – primarily orchestrating other programs. Don’t try to reinvent functionality that already exists in optimized form.

  • Locale considerations: Interestingly, even locale can affect performance. If you don’t need unicode or locale-specific sorting/comparisons, setting LC_ALL=C can make certain operations faster by using the C locale (basically plain byte-wise operations). For example, pattern matching and sorting in C locale can be quicker than in, say, UTF-8, especially for large data sets. As an advanced optimization, sometimes scripts will set LC_ALL=C at the top (and maybe restore it at the end) to speed up character processing. Only do this if you are sure nothing in your script specifically needs locale-aware behavior.

  • Memory vs Performance trade-offs: Loading a large file into memory (e.g., read into an array) might speed up certain operations at the cost of memory. If you have the RAM and need to, it could be an option. For example, reading a config file once into an array and then processing it might be better than reading it line by line multiple times in different parts of the script.

  • Profile and test: If performance is critical, use time command to measure your script or sections of it. There are also tools like shellcheck (not for performance, but for correctness) and even bash profilers that can help pinpoint slow parts. Sometimes what you think is slow might not be the culprit, and profiling can reveal the actual bottleneck.
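
As referenced in the parallelization tip above, here is a minimal sketch of background jobs plus wait, assuming a handful of independent .log files to compress:

for f in *.log; do
    [ -e "$f" ] || continue   # skip if the glob matched nothing
    gzip "$f" &               # launch each compression in the background
done
wait                          # block until every background job has finished
echo "All log files compressed."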

To summarize some best practices for performance:

  • Prefer built-ins (shell built-ins execute faster and avoid extra processes).
  • Avoid needless subprocesses ($( ), pipelines, cat, etc., when not needed).
  • Use efficient algorithms (don’t brute force in Bash what can be done by a single grep or awk).
  • Leverage parallelism for independent tasks (especially on large batch jobs).
  • Write clear code first, then optimize the hotspots – premature optimization can make scripts hard to maintain. Use these tips when you notice a script becoming slow or when you anticipate scaling issues.

Conclusion

Bash scripting is a powerful skill that lets you automate and streamline tasks on Unix-like systems, bridging the gap between manual commands and full-fledged programming. We started with the basics: understanding script structure (with the crucial shebang), creating and using variables, making decisions with conditionals, and looping over tasks. These fundamentals enable beginners to write simple yet effective scripts – for example, a script to clean up files or to ping a list of servers.

Moving into intermediate territory, we expanded our toolbox with functions (for organizing code and avoiding repetition), arrays (for handling collections of data easily), and methods for handling input/output and files. These concepts allow more complex scripts – think of a backup script that iterates over directories (arrays), uses functions to perform repeated subtasks, and reads configuration files or user input to decide what to do. At this level, scripts become more user-friendly and flexible, using input prompts or arguments, and produce structured output or logs.

In the advanced topics, we addressed robustness and efficiency. Error handling ensures your script doesn’t fail silently and can recover or at least exit gracefully – using techniques like set -e, traps, and careful checking of command outcomes. Signal trapping adds resilience, cleaning up resources no matter how the script exits. We demystified subshells and process substitution, which open up elegant ways to compose commands and control scope. Finally, we discussed performance optimizations and best practices, which become important for large or long-running scripts and for writing efficient shell code. By using built-in capabilities of Bash and minimizing overhead, your scripts can run faster and handle bigger tasks, blurring the line where one might otherwise resort to another language.

Where to go from here: With these concepts in hand, the next step is practice. Try writing scripts for your daily tasks – even if it’s just a one-liner saved in a script file. Experiment with the examples given: modify them, break them, see what errors occur, and fix them. Use tools like ShellCheck (a static analysis tool for shell scripts) to catch common pitfalls and improve your coding style. Read others’ shell scripts (many open-source projects have shell script components) to learn new tricks. Over time, you’ll develop an intuition for when Bash is the right tool for a job and how to implement a solution elegantly in Bash.

Bash scripting, at its core, embraces the Unix philosophy of combining small tools to perform complex tasks. By mastering Bash, you gain the ability to glue these tools together, automate routine work, and leverage the full power of the command line. Whether you’re a system administrator automating server maintenance, a developer setting up build/test scripts, or just a power user managing your files, Bash scripting is an invaluable skill.
