Skip to content

Instantly share code, notes, and snippets.

@malev
Created September 2, 2014 21:37
Show Gist options
  • Save malev/946f83cbb928e044ab63 to your computer and use it in GitHub Desktop.
Save malev/946f83cbb928e044ab63 to your computer and use it in GitHub Desktop.
Text Extraction

TextExtractor

Requirements

  • Works with doc, odt and pdf
  • Works through an API
  • Can handle multiple files at the same time
  • Uses queues (maybe distributed)
  • It's doable
  • Works fast!

What else?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment