-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
dvc import-url
: can't pull data if using --no-download
#10594
Comments
DGrady
changed the title
Oct 18, 2024
dvc import-url
: Non-cached imported URLs won't pulldvc import-url
: can't pull data if using --no-download
I'm also facing this issue after running:
with the following DVC stack:
Error:
|
I tried some earlier versions of DVC. This appears to have worked as expected thru DVC v3.42.0, and started failing as described in this bug in v3.43.0. Between |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Bug Report
Description
The documentation for
import-url
explains that running this command:should create a DVC metadata file with the pointer and hash information for the source data file, and that it should not download the data immediately. That works as expected.
The documentation also states that if I later run
at that point, it will download the data and place it in my work tree. (I guess it's not clear whether the data will be added to the cache?) This doesn't work; instead
Reproduce
Expected
Based on the documentation, my expectation is that
should copy
data.csv
to my local work tree from S3, thatdata.csv
should not be added to the cache, and that any changes todata.csv
in S3 should cause local pipelines that usedata.csv
as a dependency to be flagged as out of date.This expected behavior is explained in a couple of places in the documentation:
Environment information
Output of
dvc doctor
:Additional Information (if any):
The text was updated successfully, but these errors were encountered: