pythonamazon-s3zipairflowunzip

Is it possible to unzip a zip file stored on S3 without downloading it?


I have a zip file on S3 and I need to unzip it by means of a DAG on Airflow. Is it possible to do it without downloading it?

I have no clue of how it should be done and had no idea about how to start.


Solution

  • AWS S3 is an object storage, not a file system. Unzipping is not supported directly.

    You need some computation for unzipping, which can be achieved by copying that file to some EC2 machine or use some Lambda.