Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Ability to set granularity of replication key #68

Open
nicholasvk opened this issue Sep 29, 2023 · 1 comment
Open

Ability to set granularity of replication key #68

nicholasvk opened this issue Sep 29, 2023 · 1 comment

Comments

@nicholasvk
Copy link

nicholasvk commented Sep 29, 2023

I am using this tap to extract s3 inventory reports from an s3 bucket and trying to fine tune the replication method. I have set the replication in the meltano.yml as follows:

metadata:
      inventory_reporting:
        replication-method: INCREMENTAL
        replication-key: last_modified_date

However it seems that because the s3 inventory reports are written daily and have their own timestamp at the file level, the tap is defaulting to using the overall file modified date vs. the column level modified date. My desired outcome is to only load records from the latest report that have a last_modified_date > the last ELT run, but right now its loading all records/rows from the latest report if the report modified date is > the last ELT run. Has anyone run into this? Am I missing a setting to override the file level and use the more granular row level modified date for

@nickolasclarke
Copy link

@nicholasvk from what I can tell from reading the source code, metadata configuration values are simply ignored by the tap. I am attempting to use a FULL_REFRESH replication method, but it is not honored.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants