freebsd-ports/devel/py-pydoop/pkg-descr
Dmitry Sivachenko 579b191b5b New port: pydoop.
Pydoop is a Python interface to Hadoop that allows you to write MapReduce
applications in pure Python.

Pydoop offers several features not commonly found in other Python
libraries for Hadoop:

-- a rich HDFS API;
-- a MapReduce API that lets you write pure Python record readers / writers,
   partitioners and combiners;
-- transparent Avro (de)serialization;
-- easy installation-free usage.

WWW: http://crs4.github.io/pydoop/
2015-09-08 08:57:51 +00:00
