-
-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Allow use of PV system data from PV Sites database #226
Conversation
Codecov Report
@@ Coverage Diff @@
## main #226 +/- ##
==========================================
+ Coverage 79.94% 80.11% +0.16%
==========================================
Files 126 127 +1
Lines 5551 5611 +60
==========================================
+ Hits 4438 4495 +57
- Misses 1113 1116 +3
📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more |
for more information, see https://pre-commit.ci
for more information, see https://pre-commit.ci
for more information, see https://pre-commit.ci
…atefix/ocf_datapipes into pv_inputs_from_database
for more information, see https://pre-commit.ci
for more information, see https://pre-commit.ci
for more information, see https://pre-commit.ci
for more information, see https://pre-commit.ci
for more information, see https://pre-commit.ci
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM! I would probably go with copying the GSP name conventions, just to keep it the same across those.
Co-authored-by: Jacob Bieker <[email protected]>
for more information, see https://pre-commit.ci
…atefix/ocf_datapipes into pv_inputs_from_database
for more information, see https://pre-commit.ci
Pull Request
Description
This pull request is intended to allow us to use PV data from the
pvsites
database.When loading training data, the capacity of the PV system was previously taken to be the maximum observed output in the timeseries. When loading from the database, the capacity was set to the metadata value. The maximum observed and the metadata value can be quite different, and if we normalise the PV system data by them this will cause us trouble in production.
This pull request adds new variables to the PV system DataArray. Previously we only had
capacity_watt_power
. Now we haveobserved_capacity_watt_power
andmetadata_capacity_watt_power
instead of this. This is more explicit and should help us avoid normalisation mistakes.nominal_capacity_mwp
andeffective_capacity_mwp
?Added
OpenPVFromPVSitesDBIterDataPipe
datapipe function (plus helper functions) to load frompvsites
database. The currentOpenPVFromPVDBIterDataPipe
datapipe function can only be used to load from thepv
database.I think we intend to move away from using the
pv
database, so I wasn't sure if we would want to replaceOpenPVFromPVDBIterDataPipe
to only load frompvsites
. Currently, I have left the two functions alongside each other.Added new tests for
OpenPVFromPVSitesDBIterDataPipe
Added
ApplyPVDropoutIterDataPipe
to apply dropout specific for PV systems. Independently dropout each system and set a latency for each system.Added tests for
ApplyPVDropoutIterDataPipe
Add to PVNet datapipe to include PV system inputs
Minor cleaning and refactoring
Checklist: