[CT-385] Limit catalog generation to specific relations #300
Comments
This issue has been marked as Stale because it has been open for 180 days with no activity. If you would like the issue to remain open, please remove the stale label or comment on the issue, or it will be closed in 7 days.
After reading #228 I propose to reopen the issue and implement |
Context: #228, #296
Currently, catalog generation uses the same SQL as cache population: `show table extended in <databasename> like '*'`. There are good reasons to want to change this query:
Proposal
I think the change here will look like:
- [...] (`list_relations_without_caching`)
- [...] `_get_catalog_schemas`
- [...] `like rel_1|rel_2|rel_3|...` predicate (rather than just `like '*'`)
dbt-spark/dbt/adapters/spark/impl.py, line 292 in d7f1d38
dbt-spark/dbt/adapters/spark/impl.py, line 323 in d7f1d38
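As a rough illustration of the `like rel_1|rel_2|rel_3|...` predicate in the proposal, here is a minimal sketch of how the statement could be assembled from a list of relation names. The helper name `build_show_tables_sql` and its parameters are hypothetical and not part of dbt-spark. It assumes relation names contain no characters that need escaping inside the pattern, and it falls back to `like '*'` when no names are supplied.

```python
from typing import Iterable


def build_show_tables_sql(schema: str, relation_names: Iterable[str]) -> str:
    """Build a `show table extended` statement limited to the given relations.

    Hypothetical helper for illustration only; not an existing dbt-spark function.
    Assumes names need no escaping inside the LIKE pattern.
    """
    # Drop duplicates while keeping first-seen order, so output is deterministic.
    names = list(dict.fromkeys(relation_names))
    # With no names, fall back to the catch-all pattern used today.
    pattern = "|".join(names) if names else "*"
    return f"show table extended in {schema} like '{pattern}'"


if __name__ == "__main__":
    print(build_show_tables_sql("analytics", ["rel_1", "rel_2", "rel_3"]))
```

Whether a very long pattern performs better than `like '*'` on a given Spark deployment is not something this sketch addresses; it only shows the shape of the query.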
Alternative
Revisit the change in #160. Before that change, we had to run `describe extended` for every single table. While generally much slower, that approach had some advantages:
- [...] `show table extended` [...] #295)
- [...] `describe` that one table than `show` many tables in the schema (hopefully solved by the refinement proposed in this issue)