Pydoop is a Python interface to Hadoop that allows you to write MapReduce
applications in pure Python.

Pydoop offers several features not commonly found in other Python
libraries for Hadoop:

-- a rich HDFS API;
-- a MapReduce API that lets you write pure Python record readers / writers,
   partitioners and combiners;
-- transparent Avro (de)serialization;
-- easy installation-free usage.

WWW: http://crs4.github.io/pydoop/