Skip to content

Instantly share code, notes, and snippets.

Show Gist options
  • Save nepsilon/a30b652a9380e91580f28ce21eb0dec7 to your computer and use it in GitHub Desktop.
Save nepsilon/a30b652a9380e91580f28ce21eb0dec7 to your computer and use it in GitHub Desktop.
In a CSV file, how to count all lines matching a condition? — First published in fullweb.io issue #88

In a CSV file, how to count all lines matching a condition?

Following on our CSV file processing series, here is how to count the number of lines that match a condition.

We’ll keep using awk. Here is the command to run on the records.csv file:

awk -F, '{if ($5 == "foo") count+=1} END {print count}' records.csv

Easy enough, -F, tells each line use the comma to delimit each column. Then for each line in the file, check if the 5th column is equal to foo. If true then increment the count variable by 1.

When done processing all lines, just print count.

Voila, you’re ready to process GB of CSV files without killing your spreadsheet software or writing a single line of code!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment