google-cloud-platform, google-bigquery, google-cloud-storage, google-cloud-dataflow, google-cloud-dlp

Is it possible to run a Dataflow DLP de-identification job on a group of files in GCS?


I have a large number of CSV files in a folder that I need to run a de-identification job on, and I was wondering whether there is any way to run that job on the folder / multiple files. At the moment I'm creating Dataflow jobs with DLP templates, and that has worked fine for single datasets. I know that in GCS you can run DLP scans on folders containing multiple files, but there you are only allowed to use inspection templates, not de-identification templates.

Moving them into their own bucket is also not an option, since the parent folder already lives in a bucket and buckets can't be nested.

Any help would be much appreciated, thanks.


Solution

  • Correct, running a de-identification template directly over a GCS folder is not yet supported; DLP inspection jobs can scan a bucket or folder, but de-identification of stored files is not offered as a managed job. The recommended solution is to use Dataflow, where the file source can read a wildcard pattern that matches every file in the folder.
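A minimal sketch of that Dataflow approach with the Apache Beam Python SDK: `ReadFromText` accepts a wildcard, so one pipeline can cover every CSV in the folder, batch the lines, and send each batch through the DLP `deidentify_content` API with an existing de-identification template. The bucket, folder, project, and template names here are placeholders, and batching/limit values are assumptions to illustrate staying under DLP's per-request content size limit.

```python
# Sketch: de-identify every CSV under a GCS "folder" with one Beam pipeline.
# All resource names below (bucket, folder, project, template) are placeholders.

def chunk_lines(lines, max_bytes=400_000):
    """Re-batch lines so each DLP request stays well under the API's
    per-request content-size limit (400 KB leaves headroom)."""
    batch, size = [], 0
    for line in lines:
        n = len(line.encode("utf-8")) + 1  # +1 for the joining newline
        if batch and size + n > max_bytes:
            yield batch
            batch, size = [], 0
        batch.append(line)
        size += n
    if batch:
        yield batch


def run(argv=None):
    # Heavy imports live inside run() so chunk_lines stays importable
    # without apache-beam / google-cloud-dlp installed locally.
    import apache_beam as beam
    from apache_beam.options.pipeline_options import PipelineOptions

    project = "my-project"  # placeholder
    template = (f"projects/{project}/deidentifyTemplates/"
                "my-deid-template")  # placeholder: existing de-id template

    def deidentify(batch):
        # Creating a client per batch is wasteful; a production DoFn would
        # build it once in setup(). Kept inline here for brevity.
        from google.cloud import dlp_v2
        client = dlp_v2.DlpServiceClient()
        resp = client.deidentify_content(request={
            "parent": f"projects/{project}/locations/global",
            "deidentify_template_name": template,
            "item": {"value": "\n".join(batch)},
        })
        return resp.item.value

    with beam.Pipeline(options=PipelineOptions(argv)) as p:
        (p
         # The wildcard is what makes this a multi-file job.
         | "ReadCsvs" >> beam.io.ReadFromText("gs://my-bucket/my-folder/*.csv")
         | "Batch" >> beam.BatchElements(min_batch_size=100,
                                         max_batch_size=1000)
         | "LimitBytes" >> beam.FlatMap(chunk_lines)
         | "Deidentify" >> beam.Map(deidentify)
         | "Write" >> beam.io.WriteToText("gs://my-bucket/deidentified/part"))
```

Launching `run()` with the usual Dataflow pipeline options (`--runner=DataflowRunner`, `--project`, `--region`, `--temp_location`) runs it as a Dataflow job; note that batching lines this way discards the per-file grouping, so if the output must mirror the input files one-to-one, a per-file read (e.g. matching files first and processing each whole file) would be needed instead.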