@KevM
Created July 12, 2013 20:14
Using TikaOnDotNet for text extraction
// Extract the text and detected content type from a sample Word document
var textExtractor = new TextExtractor();
var result = textExtractor.Extract(@"c:\projects\tikaondotnet\src\TikaOnDotNet.Tests\files\Tika.doc");
// View the result on the console
Console.WriteLine("Content Type: " + result.ContentType);
Console.WriteLine("\n\n" + result.Text.Trim());