Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Header row being skipped in CSV reading over FTPS but not with FILE #88

Open
cwilson6163 opened this issue Oct 15, 2024 · 1 comment
Open

Comments

@cwilson6163
Copy link

cwilson6163 commented Oct 15, 2024

I have been getting an error using tap-spreadsheets-anywhere in conjunction with target-oracle:
IndexError: string index out of range cmd_type=elb consumer=True job_name=prod:tap-spreadsheets-anywhere-to-target-oracle name=target-oracle producer=False run_id=7d04cedc-ba51-4a32-ad3d-fb5457809ed4 stdio=stderr string_id=target-oracle

What seems to be happening is that when the csv file is read from Box (over ftps) tap-spreadsheets-anywhere ignores the header row and uses the first data row as the header so the schema has properties with key values of the first data row. If I go into the Box folder and modify the file to repeat the header row, then everything functions properly. I do not have the skip_initial set (but have tried with explicitly setting it to 0).

The header row is not skipped over when I have the path set using a local file (FILE://...) only when I use FTP.

@cwilson6163
Copy link
Author

This seems to be fixed by just including field_names, but I am still wondering why the header row would be skipped and row 2 would be considered the header.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant