A collection of tools and libraries to extract, annotate and analyze text and with layout information
Command Line Tools
TextWorks
: PDF -> Text
Scala Libraries
WatrMarks
: Core library for loading, analyzing, and manipulating text
Web Server
WatrColors
: Web-based annotation and document visualization
View the full documentation