Skip to content

Instantly share code, notes, and snippets.

@ErykDarnowski
Last active April 30, 2023 11:38
Show Gist options
  • Save ErykDarnowski/da013b2b65de291730e95830da2ad447 to your computer and use it in GitHub Desktop.
Save ErykDarnowski/da013b2b65de291730e95830da2ad447 to your computer and use it in GitHub Desktop.
How to install `tesseract` OCR for any langauge on Linux

How to install tesseract OCR for any langauge on Linux

Simply replace <lang> with a valid language code!

apt (Ubuntu / Debian / Mint)
sudo apt update -y
sudo apt upgrade -y

sudo apt install tesseract-ocr-<lang> -y

(Installs both English and the choosen language)

See the list of all available langauges either by running sudo apt search tesseract or here

dnf (Fedora / CentOS / Red Hat Enterprise Linux)
sudo dnf check-update
sudo dnf upgrade -y

sudo dnf install tesseract-langpack-<lang> -y

(Installs both English and the choosen language)

See the list of all available langauges by running dnf search tesseract (you don't need to include .noarch)

pacman (Arch)
yes | sudo pacman -Syu tesseract-data-<lang>

(Installs only the choosen language)

See the list of all available langauges either by running pacman -Ss tesseract or here

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment