name |
goose3 |
short description |
'Goose was originally an article extractor written in Java that has most recently (Aug2011) been converted to a scala project. This is a complete rewrite in Python. The aim of the software is to take any news article or article-type web page and not only extract what is the main body of the article but also all meta data and most probable image candidate.' |
software category |
scraping websites |
developer |
Xavier Grangier |
maintainer |
Xavier Grangier |
current version |
None |
last changed |
None |
programming lanuage(s) |
Python |
operating system(s) |
|
license |
Apache-2.0 |
costs |
0 |
language |
|
architecture |
library |
web-links |
supported methods |
|
additional features |