libboilerpipe-java 1.2.0-ok1 (amd64 binary) in openkylin yangtze
The boilerpipe library provides algorithms to detect and remove the surplus
"clutter" (boilerplate, templates) around the main textual content of a web
page.
.
The library already provides specific strategies for common tasks (for example:
news article extraction) and may also be easily extended for individual problem
settings.
.
Extracting content is very fast (milliseconds), just needs the input document
(no global or site-level information required) and is usually quite accurate.
Details
- Package version:
- 1.2.0-ok1
- Status:
- Deleted
- Component:
- main
- Priority:
- Optional
Downloadable files
amd64 build of boilerpipe 1.2.0-ok1 in openkylin yangtze PROPOSED produced
these files:
- libboilerpipe-java_1.2.0-ok1_all.deb (97.1 KiB)
Package relationships
- Depends on: