
@behitek
Last active May 30, 2019 04:36
Useful Linux commands for data engineers

Remove duplicate lines from a text file (keeps the first occurrence, preserves order)

awk '!seen[$0]++' filename > output.txt
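
To see why this works, here is a small sketch on a throwaway file (sample.txt is a hypothetical name used only for illustration). The expression `seen[$0]++` evaluates to 0 the first time a line is encountered, so `!seen[$0]++` is true only for first occurrences, and awk prints those lines by default.

```shell
# Build a small sample file with repeated lines (sample.txt is a hypothetical name).
printf 'b\na\nb\nc\na\n' > sample.txt

# Keep only the first occurrence of each line, in original order.
awk '!seen[$0]++' sample.txt > output.txt

# Show the deduplicated result.
cat output.txt
```

Unlike `sort -u`, this does not reorder the lines, but it holds every distinct line in memory, so very large files with many unique lines may need a different approach.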

Merge files

cat file1 file2 > merge.txt

View a file page by page

less filename

Show the first 1000 lines of a file

head -n 1000 filename

Show the last 1000 lines of a file

tail -n 1000 filename
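
The two commands can also be combined to pull a block of lines out of the middle of a file. A minimal sketch, assuming a generated test file (numbers.txt and slice.txt are hypothetical names):

```shell
# Generate a test file with one number per line (numbers.txt is a hypothetical name).
seq 1 5000 > numbers.txt

# Lines 1001 to 2000: take the first 2000 lines, then the last 1000 of those.
head -n 2000 numbers.txt | tail -n 1000 > slice.txt

# Inspect the first and last line of the extracted range.
head -n 1 slice.txt
tail -n 1 slice.txt
```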

Check file encoding

file filename
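
When only the encoding is needed, for example inside a script, the output can be narrowed down. This sketch assumes a `file` implementation that supports the `--mime-encoding` and `-b` (brief) options, as the common Linux version does; utf8.txt is a hypothetical name.

```shell
# Write a short UTF-8 sample containing a non-ASCII character ("xin chào").
printf 'xin ch\303\240o\n' > utf8.txt

# Print only the detected character encoding, without the file name prefix.
file -b --mime-encoding utf8.txt
```

The result is a guess based on the file's contents, so it is worth double-checking for short files or files that mix encodings.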
