jarairflowtalend

Does Airflow support jar file?


I am a newbie to Airflow. I have some .jar jobs generated with Talend Open Studio for Big Data, and I want to schedule and manage those with Airflow. My question is, does Airflow support .jar file or generated by TOS as DAG? And if it does, how? Or is there any alternative to run .jar on Airlow?

I'm using Airflow v1.10.3. The jobs are mainly to extract and process data from a MongoDB database, then update the database with the new processed data.

Thanks !


Solution

  • Airflow will happily run .jar files. There is a few examples kicking about for you to have a look at.

    Running a standard .jar file: run_jar.py

    Running a "built" Talend jobl loan_application_data.py

    Obviously with both these examples the .jar or Talend file(s) will need to be on the server Airflow is executing on (as well as Java).