freebsd-ports/www/p5-HTML-ExtractMain/pkg-descr
Frederic Culot 5db6b69ce0
HTML::ExtractMain is a module which takes HTML content, and uses the
Readability algorithm to detect the main body of the page, usually
skipping headers, footers, navigation, etc.

WWW: http://search.cpan.org/dist/HTML-ExtractMain/

PR:		ports/163557
Submitted by:	Jui-Nan Lin <jnlin@csie.nctu.edu.tw>
2011-12-23 15:38:05 +00:00

HTML::ExtractMain is a Perl module that takes HTML content and uses the
Readability algorithm to detect the main body of the page, usually
skipping headers, footers, navigation, etc.

WWW: http://search.cpan.org/dist/HTML-ExtractMain/
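
To make "detect the main body of the page" concrete, here is a small Python sketch of the general idea: score each container by how much paragraph text it holds and keep the winner. It is not the module's code and not the full Readability algorithm, which also weighs things such as link density and class names. The names `MainBodyScorer`, `extract_main`, `SKIP_TAGS`, and `CONTAINER_TAGS` are hypothetical, and the tag lists are assumptions chosen for illustration.

```python
from html.parser import HTMLParser

# Assumption: text inside these tags is treated as boilerplate.
SKIP_TAGS = {"header", "footer", "nav", "aside", "script", "style"}
# Assumption: these tags are candidate containers for the main body.
CONTAINER_TAGS = {"div", "article", "section", "td", "body"}


class MainBodyScorer(HTMLParser):
    """Collect <p> text per innermost enclosing container element."""

    def __init__(self):
        super().__init__()
        self.stack = []       # ids of currently open containers
        self.counter = 0      # source of unique container ids
        self.skip_depth = 0   # > 0 while inside a boilerplate tag
        self.in_p = False     # True while inside a <p>
        self.current_p = []   # text fragments of the open <p>
        self.texts = {}       # container id -> list of paragraph strings

    def _flush(self):
        # Close the current paragraph and credit it to the innermost container.
        if self.in_p and self.stack:
            text = " ".join("".join(self.current_p).split())
            if text:
                self.texts.setdefault(self.stack[-1], []).append(text)
        self.current_p = []
        self.in_p = False

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self.skip_depth += 1
        elif tag in CONTAINER_TAGS:
            self.counter += 1
            self.stack.append(self.counter)
        elif tag == "p":
            self._flush()  # tolerate an unclosed previous <p>
            self.in_p = True

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS and self.skip_depth > 0:
            self.skip_depth -= 1
        elif tag in CONTAINER_TAGS and self.stack:
            self._flush()
            self.stack.pop()
        elif tag == "p":
            self._flush()

    def handle_data(self, data):
        # Only paragraph text outside boilerplate tags is counted.
        if self.in_p and self.skip_depth == 0:
            self.current_p.append(data)


def extract_main(html):
    """Return the paragraphs of the container with the most paragraph text."""
    parser = MainBodyScorer()
    parser.feed(html)
    parser.close()
    if not parser.texts:
        return ""
    best = max(parser.texts.values(), key=lambda ps: sum(len(p) for p in ps))
    return "\n\n".join(best)
```

Calling `extract_main(page_source)` on a page whose article sits in one `div` next to a short sidebar returns only the article paragraphs, while text under `header`, `nav`, and `footer` is never scored. The sketch assumes reasonably well-formed markup; it does not try to recover from mismatched closing tags.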