Tika on .NET

This project is a simple wrapper around Apache Tika, the robust Java library for text extraction. It produces two NuGet packages:

TikaOnDotNet - A straight IKVM-hosted port of the Java Tika project.

TikaOnDotNet.TextExtractor - Use Tika to extract text from rich documents.

Getting Started

The best way to get started is to:

Add a NuGet dependency on TikaOnDotNet.TextExtractor.
Instantiate a new TextExtractor object and call one of the Extract methods.

Usage

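// Requires these using directives at the top of the file:
// using System;                       // for Uri
// using TikaOnDotNet.TextExtractor;

var textExtractor = new TextExtractor();

// Extract from a file on disk.
var wordDocContents = textExtractor.Extract(@".\path\to\my favorite word.docx");
// Extract from a web page.
var webPageContents = textExtractor.Extract(new Uri("https://google.com"));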

Take a look at our tests for more usage examples.

How To Contribute

Have an idea to make this project better? Great! Start out by taking a look at our Contributing Guide.

Having A Problem?

Search the Issues first, as your problem may be a common one. If you don't find it there, please create an issue. Contributors will chime in when they can.

