Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CT-2161] [Feature] Support altering tables dist/sort keys #333

Open
2 tasks done
alejandrofm opened this issue Feb 22, 2023 · 10 comments
Open
2 tasks done

[CT-2161] [Feature] Support altering tables dist/sort keys #333

alejandrofm opened this issue Feb 22, 2023 · 10 comments
Labels
help_wanted Extra attention is needed pkg:dbt-redshift type:enhancement New feature or request

Comments

@alejandrofm
Copy link

Is this a new bug in dbt-redshift?

  • I believe this is a new bug in dbt-redshift
  • I have searched the existing issues, and I could not find an existing issue for this bug

Current Behavior

Edited the size of a varchar field on Redshift, DBT makes an update to recreate the column, after the update it tries to drop de column but can't because is a SORTKEY.

Expected Behavior

The SORTKEY must be changed (or dropped) before trying to update the field, and then it have to be recreated

Steps To Reproduce

Change the length of a varchar field (must be sortkey) on the destination table on Redshift.
Run the dbt model

Relevant log output

[2023-02-22, 16:57:40 UTC] {pod_manager.py:228} INFO - �[0m16:57:40  �[33mDatabase Error in model xxxx.sql (models/ab_testing/xxxx.sql)�[0m
[2023-02-22, 16:57:40 UTC] {pod_manager.py:228} INFO - �[0m16:57:40    cannot drop sortkey column "column_2"

Environment

- Python: 3.9
- dbt-core: 1.3.1
- dbt-redshift: 1.3.1

Additional Context

No response

@alejandrofm alejandrofm added type:bug Something isn't working triage:product labels Feb 22, 2023
@github-actions github-actions bot changed the title [Bug] When field changes datatype and is SORTKEY, dbt fails [CT-2161] [Bug] When field changes datatype and is SORTKEY, dbt fails Feb 22, 2023
@dbeatty10
Copy link
Contributor

Thanks for reporting this @alejandrofm !

Could you share a simple example of a model (and its configuration) that could run into this problem? Specifically, it would be nice to see which materialization type you are using (table, incremental, etc) and also which config parameters you are using. e.g., could you share something like this?

-- Example with interleaved sort keys
{{ config(materialized='table',
          sort_type='interleaved'
          sort=['id', 'category'],
          dist='received_at')
}}

select ...

@alejandrofm
Copy link
Author

alejandrofm commented Feb 22, 2023

Sure! this is the config:

{{
    config(
        materialized='incremental',
        sql_where="TRUE",
        unique_key="events_date",
        incremental_strategy='delete+insert',
        sort=['column1', 'column2', 'column3'],
        dist='column1'
    )
}}

@dbeatty10
Copy link
Contributor

Thanks for providing this config @alejandrofm

I haven't tried it myself yet. Do you think it would be solved if #335 is implemented? (Thanks for opening both of these issues, by the way! Good stuff.)

@alejandrofm
Copy link
Author

Will solve it only if the ALTER COLUMN is done for increasing as well as decreasing field sizes. Increasing is "easy", decreasing needs one check to know if the old data fits on the new field size.
but if both get implemented on #335 then this issue has no point anymore.
Thanks!

@github-actions
Copy link
Contributor

This issue has been marked as Stale because it has been open for 180 days with no activity. If you would like the issue to remain open, please comment on the issue or else it will be closed in 7 days.

@github-actions github-actions bot added the Stale label Aug 24, 2023
@alejandrofm
Copy link
Author

I'm just bumping it so it doesn't get closed, it's a real issue for several use cases. Thanks

@github-actions github-actions bot removed the Stale label Aug 27, 2023
@rriley99
Copy link

Any updates on fixing this behavior? It seems intermittent in my production environment.

@nathaniel-may nathaniel-may added type:enhancement New feature or request and removed type:bug Something isn't working labels Jan 3, 2024
@colin-rogers-dbt
Copy link
Contributor

I think we should treat this as a new feature and fully support altering tables dist/sort keys in addition to columns similar to what we do now for materialized views.

@dbeatty10 dbeatty10 changed the title [CT-2161] [Bug] When field changes datatype and is SORTKEY, dbt fails [CT-2161] [Feature] When field changes datatype and is SORTKEY, dbt fails Jan 3, 2024
@dbeatty10
Copy link
Contributor

@colin-rogers-dbt This might be the same/similar to #335.

Do you think this could be the key part of the implementation (in addition to applicable test cases)?

{% macro redshift__alter_column_type(relation, column_name, new_column_type) -%}
  {% call statement('alter_column_type') %}
    alter table {{ relation }} alter column {{ adapter.quote(column_name) }} type {{ new_column_type }};
  {% endcall %}
{% endmacro %}

@dbeatty10 dbeatty10 changed the title [CT-2161] [Feature] When field changes datatype and is SORTKEY, dbt fails [CT-2161] [Feature] Support altering tables dist/sort keys Jan 3, 2024
@dbeatty10
Copy link
Contributor

Do you think this could be the key part of the implementation (in addition to applicable test cases)?

Per @rriley99 here, alter won't work if the column is part of a distkey or sortkey:

I believe that will throw an error as you cannot alter dist/sort columns directly, you will need to do something like this:

create table new_table
   distkey (dist_col)
   sortkey (sort_col)
as
select *
from old_table;

Then swap the name. At least that is the best I could come up with for our ops engineers.

@nathaniel-may nathaniel-may added help_wanted Extra attention is needed and removed refinement labels Feb 7, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
help_wanted Extra attention is needed pkg:dbt-redshift type:enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

6 participants