While browsing I just came across Dataflow SQL. Is it any different from beamSQL?
Apache Beam SQL
is a functionality of Apache Beam
that allows you to execute queries directly from your pipeline.
As you can see here, Beam SQL
has two options of SQL
syntax: Beam Calcite SQL and Zeta SQL. The advantage of using Zeta SQL
is that its very similar to BigQuery
's syntax hence its useful in pipelines that read from or write to BigQuery
.
Dataflow SQL
is a functionality of Dataflow
that allows you to create pipelines directly from a BigQuery
query. It's said in the documentation that it supports the Zeta SQL
syntax (BigQuery syntax).
To create a new Dataflow
job through the BigQuery
's console, to the following steps:
After that, you can click in Create Cloud Dataflow job and your query will become a job in Dataflow
.
I hope it helps