pythonmongodbmapreducedisco

Specifying output uri for a Disco mapreduce job


I would like to have a completed Disco job write directly to mongodb. Is there an easy way to specify an output url for Disco to send its data to?


Solution

  • There's no direct compatibility layer between Disco and MongoDB, although early work on such a layer is in progress. In the meantime, I would have your Disco job write to CSV or JSON (1 object per line), which you can easily import to MongoDB using mongoimport. See the MongoDB wiki for more details.