when i try to run an azure ML pipeline that has inside it a connection to datalake storage using compute cluster (python sdk v2), i get an error that the compute cluster doesnt have access to the storage but when i run the pipeline using compute instance that is assigned to me i dont have any problem but im trying to do automation using synapse pipeline so i need to use a compute cluster not compute instance.
i tried to give the azure ml workspace owner access to data lake gen2 storage but that didnt fix the problem,i tried to use managed identity on the cluster and i got a principal id but when i try to add that given principal id to datalake storage in IAM access i cant enter a principal id in it i only can write a user name
Follow the steps below to grant access to your cluster using managed identity.
Microsoft Entra ID
.Give Storage Blob Data Contributor
or Storage Blob Data Reader
based on your requirement.