Average of a skipping column of 5 using awk

Question

Average of a skipping column of 5 using awk

I want to calculate the average of the 1st column of a text file, skipping lines divisible by 5. As an example, consider the following dataset.

For the above data, I can calculate the average of the entire column using awk

like

awk '{ sum += $1 } END { if (NR > 0) print sum / NR }' file

which prints the result 5.5

.

How can I extend this code to exclude lines divisible by 5 from the middle? In the example above, this will exclude the numbers 5

and from the mean 10

, resulting in a new mean 5

.

+3

bash awk

daneel June 18 17 at 21:49

source to share

1 answer

RomanPerekhrest · Accepted Answer · 2017-06-18T22:00:55+0000

Short awk solution:

awk '{ NR%5? s+=$0 : c++ }END{ print s/(NR-c) }' file

Output:

NR%5? s+=$0 : c++

- triple condition: sums up all the values s+=$0

if the record number is NR

not divisible by 5

, else - counts the missing records (to subtract them from the average)

Average of a skipping column of 5 using awk

More articles: