name |
jusText |
short description |
'jusText is a tool for removing boilerplate content, such as navigation links, headers, and footers from HTML pages. It is designed to preserve mainly text containing full sentences and it is therefore well suited for creating linguistic resources such as Web corpora.' |
software category |
scraping websites |
developer |
Jan Pomikálek |
maintainer |
|
current version |
None |
last changed |
None |
programming lanuage(s) |
Python |
operating system(s) |
|
license |
BSD |
costs |
0 |
language |
|
architecture |
library |
web-links |
supported methods |
|
additional features |