Package: boilerpipeR
Version: 1.2.2
Date: 2014-08-20
Title: Interface to the boilerpipe Java library by Christian
        Kohlschutter (http://code.google.com/p/boilerpipe/)
Author: See AUTHORS file.
Maintainer: Mario Annau <mario.annau@gmail.com>
Imports: rJava
Suggests: RCurl
Description: Generic extraction of main text content from HTML files; removal
    of ads, sidebars and headers using the boilerpipe Java library. The
    extraction heuristics from boilerpipe perform robustly across a wide
    range of web site templates.
License: Apache License (== 2.0)
URL: https://github.com/mannau/boilerpipeR
BugReports: https://github.com/mannau/boilerpipeR/issues
Packaged: 2014-08-20 20:34:20 UTC; mario
NeedsCompilation: no
Repository: CRAN
Date/Publication: 2014-08-21 06:42:22
