From the SELECT team, a dbt package to automatically tag dbt-issued queries with informative metadata. This package uses both query comments and query tagging.
An example query comment contains:
{
"dbt_query_tags_version": "3.1.0",
"app": "dbt",
"dbt_version": "1.4.0",
"project_name": "my_project",
"target_name": "dev",
"target_database": "dev",
"target_schema": "dev",
"invocation_id": "4ffa20a1-5d90-4a27-a58a-553bb6890f25",
"node_refs": [
"model_b",
"model_c"
],
"node_name": "model_a",
"node_alias": "model_a",
"node_package_name": "my_project",
"node_original_file_path": "models/model_a.sql",
"node_database": "dev",
"node_schema": "dev",
"node_id": "model.my_project.model_a",
"node_resource_type": "model",
"node_tags": ["tag_1", "tag_2"],
"node_meta": {"owner": "@alice", "model_maturity": "in dev"},
"materialized": "incremental",
"full_refresh": false,
"which": "run"
-- dbt Cloud only
"dbt_cloud_project_id": "146126",
"dbt_cloud_job_id": "184124",
"dbt_cloud_run_id": "107122910",
"dbt_cloud_run_reason_category": "other",
"dbt_cloud_run_reason": "Kicked off from UI by niall@select.dev",
}Query tags are used solely for attaching the is_incremental flag, as this isn't available to the query comment:
{
"dbt_query_tags_version": "3.1.0",
"app": "dbt",
"is_incremental": true
}- Add this package to your
packages.ymlfile, then install it withdbt deps.
packages:
- package: get-select/dbt_query_tags
version: [">=3.0.0", "<4.0.0"]- Add the query tags. The setup differs by adapter:
Configure the dispatch search order in dbt_project.yml. This lets the package override Snowflake's built-in set_query_tag hook, which is called automatically by the adapter for every model.
dispatch:
- macro_namespace: dbt
search_order:
- <YOUR_PROJECT_NAME>
- dbt_query_tags
- dbtdbt-databricks does not have a built-in set_query_tag hook, so you need to add per-model hooks to your dbt_project.yml:
models:
<YOUR_PROJECT_NAME>:
+pre-hook: "{{ dbt_query_tags.set_query_tag() }}"
+post-hook: "{{ dbt_query_tags.unset_query_tag(query_tag) }}"- To configure the query comments, add the following config to
dbt_project.yml.
query-comment:
comment: '{{ dbt_query_tags.get_query_comment(node) }}'
append: true # Note: append is only necessary on Snowflake, which strips prefixed comments. It is harmless on other adapters.That's it! All dbt-issued queries will now be tagged.
Both meta and tag configs are automatically added to the query comments.
To add arbitrary keys and values to the comments, you can use the extra kwarg when calling dbt_query_tags.get_query_comment. For example, to add a run_started_at key and value to the comment, we can do:
query-comment:
comment: '{{ dbt_query_tags.get_query_comment(node, extra={"run_started_at": builtins.run_started_at | string }) }}'
append: true # Snowflake removes prefixed comments.Here's an example of how you can add environment variables into the query comment:
query-comment:
comment: >
{{ dbt_query_tags.get_query_comment(
node,
extra={
'CI_PIPELINE_ID': env_var('CI_PIPELINE_ID'),
'CI_JOB_NAME': env_var('CI_JOB_NAME'),
'CI_COMMIT_REF_NAME': env_var('CI_COMMIT_REF_NAME'),
'CI_RUNNER_TAGS': env_var('CI_RUNNER_TAGS')
}
) }}
append: true # Snowflake removes prefixed comments.Note: The query tag extension options below (model config, profiles.yml, environment variables, extra kwarg) are Snowflake-only. On Databricks, the adapter already auto-appends rich per-query tags (
@@dbt_model_name,@@dbt_core_version,@@dbt_databricks_version,@@dbt_materialized), and the metadata tag set by this package containsapp,dbt_query_tags_version,thread_id, andis_incremental.
To extend the information added in the query tags, there are a few options:
Set the query_tag config value to a mapping type. Example:
Model (works for dbt Core)
{{ config(
query_tag = {'team': 'data'}
) }}
select ...or (works for dbt Core and Fusion)
{{ config(
query_tag = "{'team': 'data'}"
) }}
select ...Results in the following query tag. The additional information is added by this package.
'{"team": "data", "app": "dbt", "dbt_query_tags_version": "3.1.0", "is_incremental": true}'
Note that using a non-mapping type in the query_tag config will result in a warning, and the config being ignored.
Model
{{ config(
query_tag = 'data team'
) }}
select ...Warning:
dbt-query-tags warning: the query_tag config value of 'data team' is not a mapping type, so is being ignored. If you'd like to add additional query tag information, use a mapping type instead, or remove it to avoid this message.
Additionally, you can set the query_tag value in the profiles.yml. This must be a valid mapping, not a string.
profiles.yml
default:
outputs:
dev:
+query_tag:
team: data
...
target: devAnother option is to use the optional project variable env_vars_to_query_tag_list to provide a list of environment variables to pull query tag values from.
Example:
dbt_project.yml:
vars:
env_vars_to_query_tag_list: ['TEAM','JOB_NAME']Results in a final query tag of
'{"team": "data", "job_name": "daily", "app": "dbt", "dbt_query_tags_version": "3.1.0", "is_incremental": true}'
We use changie to generate CHANGELOG entries. Note: Do not edit the CHANGELOG.md directly. Your modifications will be lost.
Follow the steps to install changie for your system.
Once changie is installed and your PR is created, simply run these changie commands:
changie new: changie will walk you through the process of creating a changelog entry.changie batch x.x.xwhere x.x.x is the new version needed. You can chech the.changesfolder to know the next version number.changie merge
Next, update the version number in these places:
dbt_project.ymlmacros/query_comment.sqlmacros/query_tags.sql- The example output strings in
README.md
This package was originally published as dbt-snowflake-query-tags and renamed to dbt-query-tags to support multiple platforms. If you were using the old package, make the following updates:
In packages.yml:
# Before
packages:
- package: get-select/dbt_snowflake_query_tags
version: [">=2.0.0", "<3.0.0"]
# After
packages:
- package: get-select/dbt_query_tags
version: [">=3.0.0", "<4.0.0"]In dbt_project.yml:
# Before
dispatch:
- macro_namespace: dbt
search_order:
- <YOUR_PROJECT_NAME>
- dbt_snowflake_query_tags
- dbt
query-comment:
comment: '{{ dbt_snowflake_query_tags.get_query_comment(node) }}'
append: true
# After
dispatch:
- macro_namespace: dbt
search_order:
- <YOUR_PROJECT_NAME>
- dbt_query_tags
- dbt
query-comment:
comment: '{{ dbt_query_tags.get_query_comment(node) }}'
append: trueIf you're using dbt Cloud or a similar hosted service, no further action is needed after making these changes. For local development, run dbt deps to install the new package.