Programming – Ryan Schulze

Using regex comparision in bash and BASH_REMATCH

September 8, 2017September 8, 2017Ryanbash

Bash supports regular expressions in comparisons via the =~ operator. But what is rarely used or documented is that you can use the ${BASH_REMATCH[n]} array to access successful matches (back-references to capture groups). So if you use parentheses for grouping () in your regex, you can access the content of that group.

Here is an example where I am parsing date placeholders in a text with an optional offset (e.g. |YYYY.MM.DD|+2 ). Storing the format and offset in separate groups:

while read -r line; do
	while [[ ${line} =~ \|([YMD\\/\ .-]+)\|(\+*[0-9]*) ]]; do
		dateformat=${BASH_REMATCH[1]}
		dateformat=${dateformat/YYYY/%Y}
		dateformat=${dateformat/MMMM/%B}
		dateformat=${dateformat/MM/%m}
		dateformat=${dateformat/DD/%d}
		offset='now'
		[[ ! -z ${BASH_REMATCH[2]} ]] && offset="${BASH_REMATCH[2]} days"
		line=${line/|${BASH_REMATCH[1]}|${BASH_REMATCH[2]}/$(date "+${dateformat}" --date="${offset}")}
	done
	echo "${line}"
done < input

while read -r line; do

while [[ ${line} =~ \|([YMD\\/\ .-]+)\|(\+*[0-9]*) ]]; do

dateformat=${BASH_REMATCH[1]}

dateformat=${dateformat/YYYY/%Y}

dateformat=${dateformat/MMMM/%B}

dateformat=${dateformat/MM/%m}

dateformat=${dateformat/DD/%d}

offset='now'

[[ ! -z ${BASH_REMATCH[2]} ]] && offset="${BASH_REMATCH[2]} days"

line=${line/|${BASH_REMATCH[1]}|${BASH_REMATCH[2]}/$(date "+${dateformat}" --date="${offset}")}

done

echo "${line}"

done < input

|YYYY.MM.DD|

|YYYY.MM.DD|+7

|YYYY-MM-DD|

|YYYY-MM-DD|+14

|MMMM YYYY|

|YYYY/MM|

|MM/YYYY|

This is a sentence containing a timestamp (|YYYY.MM.DD|+7) with an offset.

This is another sentence containing multiple timstamps between |YYYY.MM.DD| and |YYYY.MM.DD|+7.

2017.09.08

2017.09.15

2017-09-08

2017-09-22

September 2017

2017/09

09/2017

This is a sentence containing a timestamp (2017.09.15) with an offset.

This is another sentence containing multiple timstamps between 2017.09.08 and 2017.09.15.

Multiply floats by 10,100, … in bash

August 22, 2017September 8, 2017Ryanbash

A short one today. Bash can only handle integer numbers and not floats, so when someone searches the internet on how to use math on floats in bash the solution they find is usually “use bc” and looks something like this:

$ f=12.3456
$ bc -l <<< "${f} * 10"
123.4560

$ f=12.3456

$ bc -l <<< "${f} * 10"

123.4560

Or if they want the result to be an integer:

$ f=12.3456
$ bc -l <<< "scale=0; ${f} * 10 /1"
123

$ f=12.3456

$ bc -l <<< "scale=0; ${f} * 10 /1"

123

It’s a fine solution, and readable (which can mean a lot for people maintaining scripts). But if all you want to do is multiply by 10,100,1000, … you can achieve this faster with a bit of string manipulation:

$ f=12.3456
$ _sub="${f#*.}"
$ echo "${f%.*}${_sub:0:1}.${_sub:1}"

$ f=12.3456

$ _sub="${f#*.}"

$ echo "${f%.*}${_sub:0:1}.${_sub:1}"

It just splits the number into two strings, and assembles it again with the decimal shifted. Have a look at substring_removal and substring_expansion for more examples on how to modify strings in bash. I’d highly suggest either sticking this in a separate function, or commenting the code since it isn’t necessarily obvious what is going on

Since it is all pure bash and doesn’t need to spawn external commands, it quicker (not that bc is slow, but if you are doing a lot of calculations, it can add up). I know what you are thinking “if your goal is speed, you shouldn’t be using bash”, that doesn’t mean we can’t write efficient code.

How to fetch IP ranges/entries from SPF records in bash

March 3, 2017March 29, 2017Ryanbash, SPF

Recently I needed to fetch IP ranges from SPF records. After looking at different python/ruby/perl modules I came to the conclusion that a fancy module (sometimes with wonky dependencies) was overkill just to parse a simple SPF record. So I threw together a simple bash script that is mainly just fetching the SPF record with dig and grep:

dig txt "${fqdn}"|grep -oE 'v=spf[0-9] [^"]+'

1	dig txt "${fqdn}"\|grep -oE 'v=spf[0-9] [^"]+'

It iterates through the options (it currently recognizes a, mx, ip4, ip6, include, and redirect), and then sorts the output by ipv4, then ipv6.

fetch_spf.sh ryanschulze.net
138.201.86.179
138.201.86.184
192.249.58.230
2a01:4f8:172:2270:1::102
2604:180::ef4b:4638

fetch_spf.sh ryanschulze.net

138.201.86.179

138.201.86.184

192.249.58.230

2a01:4f8:172:2270:1::102

2604:180::ef4b:4638

Download URL: fetch_spf.sh

How to compare package version strings in bash

November 18, 2016November 18, 2016Ryanbash

This is a little function I use to compare package version strings. Sometimes they can get complex with multiple different delimiters or strings in them. I cheated a bit by using sort –version-sort for the actual comparison. If you are looking for a pure bash version to compare simpler strings (e.g. compare 1.2.4 with 1.10.2), I’d suggest this stackoverflow posting.

The function takes three parameters (the version strings and the comparison you want to apply) and uses the return code to signal if the result was valid or not. This gives the function a somewhat natural feel, for example compare_version 3.2.0-113.155 “<” 3.2.0-130.145 would return true. Aside from < and > you can also use a few words like bigger/smaller, older/newer or higher/lower for comparing the strings.

compare_version() {
  local versionOne="${1}"
  local comparision="${2}"
  local versionTwo="${3}"
  local result=
  local sortOpt=
  local returncode=1

  if [[ "${versionOne}" == "${versionTwo}" ]] ; then
    return 3
  fi

  case ${comparision} in
    lower|smaller|older|lt|"<" ) sortOpt= ;;
    higher|bigger|newer|bt|">" ) sortOpt='r' ;;
    * ) return 2 ;;
  esac

  result=($(printf "%s\n" "${versionOne}" "${versionTwo}" | sort -${sortOpt}V ))
  if [[ "${versionOne}" == "${result[0]}" ]] ; then
    returncode=0
  fi

  return ${returncode}
} # end of function compare_version

compare_version() {

local versionOne="${1}"

local comparision="${2}"

local versionTwo="${3}"

local result=

local sortOpt=

local returncode=1

if [[ "${versionOne}" == "${versionTwo}" ]] ; then

return 3

case ${comparision} in

lower|smaller|older|lt|"<" ) sortOpt= ;;

higher|bigger|newer|bt|">" ) sortOpt='r' ;;

* ) return 2 ;;

esac

result=($(printf "%s\n" "${versionOne}" "${versionTwo}" | sort -${sortOpt}V ))

if [[ "${versionOne}" == "${result[0]}" ]] ; then

returncode=0

return ${returncode}

} # end of function compare_version

List of return codes and meanings:

0: Comparison is true
1: Comparison is false
2: Did not recognize the comparison
3: Both version strings are identical

0: Comparison is true

1: Comparison is false

2: Did not recognize the comparison

3: Both version strings are identical

Convert configuration files to ansible templates

April 8, 2015June 11, 2015Ryanansible, bash, linux, scripting

I’ve been playing around with ansible a lot lately, and I noticed that while changing stuff from “installed and configured manually” to “installed and configured by ansible” I was running into quite a few configuration files that needed to be manually turned into templates. It can be quite tedious to replace values in a configuration file with placeholders and put all those placeholders in a .yml file with default values.
Automating this is something I would have typically done in perl, but since I wanted to learn more about using regex in bash I decided to have a go at it in bash using regex and ${BASH_REMATCH}

The script takes a configuration file and spits out an ansible template, as well as the variable definitions you will need to add to your defaults/main.yml or vars/main.yml

The whole script is a bit to long to post here, but the interesting part is:

if [[ ${line} =~ ^([^#][^\ ]+)[\ ]*[${Separator}][\ ]*([^\ ]+)$ ]] ; then
	VariableName="${Prefix}_${BASH_REMATCH[1]//-/_}" # create a name for this configuration variable
	VariableName="${VariableName,,}" # make lowercase
	sed -ri "s/^(${BASH_REMATCH[1]}[\ ]*[${Separator}][\ ]*).+$/\1{{ ${VariableName} }}/" "${Template}" # change the ansible template
	printf "%-40s %s\n" "${VariableName}:" "'${BASH_REMATCH[2]}'" # print variable info to stdout 
fi

if [[ ${line} =~ ^([^#][^\ ]+)[\ ]*[${Separator}][\ ]*([^\ ]+)$ ]] ; then

VariableName="${Prefix}_${BASH_REMATCH[1]//-/_}" # create a name for this configuration variable

VariableName="${VariableName,,}" # make lowercase

sed -ri "s/^(${BASH_REMATCH[1]}[\ ]*[${Separator}][\ ]*).+$/\1{{ ${VariableName} }}/" "${Template}" # change the ansible template

printf "%-40s %s\n" "${VariableName}:" "'${BASH_REMATCH[2]}'" # print variable info to stdout

(You can download the full script here ansible_template.sh).

You can use regular expressions in a [[ ]] with =~ (e.g. if [[ “boot” =~ ^b ]]), and you can access the result of the regular expression by using ( ) to mark what parts of the result to store and access them via $BASH_REMATCH (comparable to how you would do it for other languages). Here I am parsing out anything that looks like a key=value from the configfile (with multiple possible separators) and storing the results in BASH_REMATCH[1] and BASH_REMATCH[2]

Usage of the script is pretty straightforward. you give it a prefix for the variable names (so you don’t end up with multiple roles all using a common variable name like “port”), and either a local or remote file to work with, and it spits out something like this:

$ ./ansible_template.sh php webserver.somewhere.tld:/etc/php5/conf.d/xcache.ini

- name: Template

template: src={{ item.local }} dest={{ item.remote }} owner={{ item.owner }} group={{ item.group }} mode={{ item.mode }}

with_items:

- { local: 'xcache.ini.j2', remote: '/etc/php5/conf.d/xcache.ini', owner: 'root', group: 'root', mode: '0644' }

php_zend_extension: '/usr/lib/php5/20090626/xcache.so'

php_xcache.admin.enable_auth: 'On'

php_xcache.admin.user: 'admin'

php_xcache.admin.pass: 'ea6299af10b40ba80236a0f015ed627d'

php_xcache.shm_scheme: 'mmap'

php_xcache.size: '16M'

php_xcache.count: '1'

php_xcache.slots: '8K'

php_xcache.ttl: '0'

There a tons of different configuration file formats out there so this script won’t work perfectly 100% of the time, but it does do quite well and reduces the manually copy&pasting to a minimum.

$ cat xcache.ini.j2

; configuration for php Xcache module

[xcache-common]

zend_extension = {{ php_zend_extension }}

[xcache.admin]

xcache.admin.enable_auth = {{ php_xcache.admin.enable_auth }}

xcache.admin.user = "{{ php_xcache.admin.user }}"

xcache.admin.pass = "{{ php_xcache.admin.pass }}"

[xcache]

xcache.shm_scheme = "{{ php_xcache.shm_scheme }}"

xcache.size = {{ php_xcache.size }}

xcache.count = {{ php_xcache.count }}

xcache.slots = {{ php_xcache.slots }}

xcache.ttl = {{ php_xcache.ttl }}

...