Scrapy is a fast high-level web crawling and web scraping framework, used to
crawl websites and extract structured data from their pages. It can be used for
a wide range of purposes, from data mining to monitoring and automated testing.
Parsel is a library to extract data from HTML and XML using XPath and CSS
selectors.
Features:
* Extract text using CSS or XPath selectors
* Regular expression helper methods
Since upstream still maintaines the 2-series it is kept in www/SOGo.
Version 3, introduced in early 2016, has a modern, fully responsive Web
frontend. Both versions share a common implementation of the communication
protocols supported in SOGo and SOPE: LDAP, IMAP, SQL, CardDAV, CalDAV, and
Microsoft Enterprise ActiveSync.
DESCR:
SOGo is fully supported and trusted groupware server with a focus
on scalability and open standards. SOGo is released under the GNU
GPL/LGPL v2 and above.
SOGo provides a rich AJAX-based Web interface and supports multiple
native clients through the use of standard protocols such as CalDAV,
CardDAV and GroupDAV.
SOGo is the missing component of your infrastructure; it sits in
the middle of your servers to offer your users an uniform and
complete interface to access their information. It has been deployed
in production environments where thousands of users are involved.
new packages. Most of which are the remaining modules of the Tryton
platform which weren't packaged. The others are dependencies of the new
modules. This was tested on FreeBSD and is based in large part on Richard
Palo's (richard@) work. This is the most recent release of the Tryton
platform, version 4.2. There's a very large list of changes from the 3.8
series we have in pkgsrc. If you're interested, those functional changes
can be found here:
http://www.tryton.org/posts/new-tryton-release-42.htmlhttp://www.tryton.org/posts/new-tryton-release-40.html
This module improves non-ascii filename interoperability of apache
(and mod_dav), by supporting various client encoding.
Current license is troublesome because of its clause 4:
> 4. Products derived from this software may not be called "mod_encoding"
> nor may "mod_encoding" appear in their names without prior written
> permission of Internet Initiative Japan Inc. For written permission,
> please contact tai%iij.ad.jp@localhost (Taisuke Yamada).
LICENSE is however left intionnaly blank, as Internet Initiative Japan, Inc.
may revise it in the near future. This is approved by wiz@ and soda@.
An parser for HTTP messages written in C, parsing both requests
and responses. It is designed to be used in performance oriented
HTTP applications. It does not make any syscalls nor allocations,
it does not buffer data, it can be interrupted at anytime. Depending
on your architecture, it only requires about 40 bytes of data per
message stream (in a web server that is per connection).