7 sdr data #8

lpicci96 · 2024-12-02T08:38:08Z

This pull request introduces a new module for reading Special Drawing Rights (SDR) data from the IMF, along with utility functions and tests. The most important changes include the creation of several new modules for fetching and processing SDR data, the addition of utility functions, and the restructuring of existing code to use these new utilities.

New SDR modules:

src/imf_reader/sdr/__init__.py: Added a new module that provides functions for fetching SDR allocations, holdings, interest rates, and exchange rates.
src/imf_reader/sdr/read_announcements.py: Added functions to fetch SDR allocations and holdings data, including utility functions for reading and cleaning data.
src/imf_reader/sdr/read_exchange_rate.py: Added functions to fetch and parse SDR exchange rate data, with support for different unit bases.
src/imf_reader/sdr/read_interest_rate.py: Added functions to fetch and clean SDR interest rate data from the IMF website.

Utility functions:

src/imf_reader/utils.py: Added a utility function make_request to handle HTTP requests and raise appropriate errors.

Code restructuring:

src/imf_reader/weo/scraper.py: Removed the local make_request function and replaced it with the new utility function from utils.py.

Tests:

tests/test_utils.py: Added tests for the new utility function make_request to ensure it handles various scenarios correctly.
tests/test_weo/test_scraper.py: Removed tests for the local make_request function since it has been replaced by the utility function.

Tests for SDR modules still need to be added

codecov-commenter · 2024-12-02T08:38:48Z

⚠️ Please install the to ensure uploads and comments are reliably processed by Codecov.

Codecov Report

Attention: Patch coverage is 97.93103% with 3 lines in your changes missing coverage. Please review.

Project coverage is 93.75%. Comparing base (2e536a4) to head (127056e).

Files with missing lines	Patch %	Lines
src/imf_reader/utils.py	66.66%	3 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main       #8      +/-   ##
==========================================
+ Coverage   90.71%   93.75%   +3.03%     
==========================================
  Files           6       11       +5     
  Lines         183      320     +137     
==========================================
+ Hits          166      300     +134     
- Misses         17       20       +3

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

jm-rivera

thanks Luca! A few comments and requests throughout, mainly seeking to improve readability.

jm-rivera · 2024-12-02T09:16:24Z

src/imf_reader/sdr/read_exchange_rate.py

+    if unit_basis == "USD":
+        col_val = "U.S.$1.00 = SDR"
+    elif unit_basis == "SDR":
+        col_val = "SDR1 = US$"
+    else:
+        raise ValueError("unit_basis must be either 'SDR' or 'USD'")
+
+    df = df.iloc[:, 0].str.split("\t", expand=True)
+    df.columns = df.iloc[0]
+    df = df.iloc[1:]
+
+    exchange_series = (
+        df.loc[lambda d: d["Report date"] == col_val].iloc[:, 1].reset_index(drop=True)
+    )
+
+    dates_series = (
+        df.dropna(subset=df.columns[3])
+        .iloc[:, 0]
+        .drop_duplicates()
+        .reset_index(drop=True)
+    )
+
+    return pd.DataFrame(
+        {"date": dates_series, "exchange_rate": exchange_series}
+    ).assign(
+        date=lambda d: pd.to_datetime(d.date),
+        exchange_rate=lambda d: pd.to_numeric(d.exchange_rate, errors="coerce"),
+    )


This does quite a bit without much documentation. It may be better to split some of this (to make it easier to test) and so the different steps are clearer.

jm-rivera · 2024-12-02T09:17:21Z

src/imf_reader/sdr/read_announcements.py

+
+
+@lru_cache
+def get_data(year: int, month: int):


Given that there are multiple functions that get data, it would be useful to give this a more meaningful name like get_holdings_and_allocations_data

jm-rivera · 2024-12-02T09:18:41Z

src/imf_reader/sdr/read_exchange_rate.py

+BASE_URL = "https://www.imf.org/external/np/fin/data/rms_sdrv.aspx"
+
+
+def read_data():


Given the different functions that get or read data, it would be good to give this a more meaningful name that hints at what data it gets (and in this case it gets rather than reads, so get_exchange_rates_data)

jm-rivera · 2024-12-02T09:19:19Z

src/imf_reader/sdr/read_interest_rate.py

+BASE_URL: str = "https://www.imf.org/external/np/fin/data/sdr_ir.aspx"
+
+
+def read_data():


Same as for other get/read data, a more meaningful name may help.

jm-rivera · 2024-12-02T09:21:16Z

src/imf_reader/sdr/read_interest_rate.py

+    )
+
+
+def _format_data(df: pd.DataFrame) -> pd.DataFrame:


this formats and filters. It would help to split in two. Also because clean data does quite a bit - and this one also deals with cleaning)

jm-rivera · 2024-12-02T09:23:04Z

tests/test_utils.py

+def test_make_request():
+    """Test make_request"""
+
+    # test successful request
+    with patch("requests.get") as mock_get:
+        mock_get.return_value.status_code = 200
+        response = utils.make_request(TEST_URL)
+        assert response == mock_get.return_value
+
+    # test failed request
+    with patch("requests.get") as mock_get:
+        mock_get.side_effect = requests.exceptions.RequestException
+        with pytest.raises(ConnectionError, match="Could not connect to"):
+            utils.make_request(TEST_URL)
+
+    # test when status code is not 200
+    with patch("requests.get") as mock_get:
+        mock_get.return_value.status_code = 404
+        with pytest.raises(ConnectionError, match="Could not connect to"):
+            utils.make_request(TEST_URL)


Not convinced of the value of this type of test. You are basically testing requests functionality, rather than anything about the logic of this package. I'd remove.

lpicci96 · 2024-12-02T09:38:36Z

@mharoruiz FYI

lpicci96 added 8 commits November 29, 2024 14:49

move make_request function to utils module

caee7df

refactor tests

bea601a

setup sdr module

31316d5

add functionality to get sdr announcements

97dbfdb

rename module

3dfbf8a

add interest rate reader

8cf43bf

add exchange rate reader

a51ff8b

update documentation

25b33eb

lpicci96 linked an issue Dec 2, 2024 that may be closed by this pull request

SDR data #7

Closed

black

7352554

lpicci96 requested a review from jm-rivera December 2, 2024 08:39

jm-rivera requested changes Dec 2, 2024

View reviewed changes

lpicci96 assigned mharoruiz Dec 2, 2024

mharoruiz added 4 commits December 3, 2024 10:31

rename, split functions

7ecc5f3

add sdr tests

6cc2220

add interest rate tests

15fd591

black

127056e

lpicci96 merged commit e060a8f into main Dec 12, 2024
6 checks passed

lpicci96 deleted the 7-sdr-data branch December 12, 2024 13:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

7 sdr data #8

7 sdr data #8

lpicci96 commented Dec 2, 2024

codecov-commenter commented Dec 2, 2024 •

edited

Loading

jm-rivera left a comment

jm-rivera Dec 2, 2024

jm-rivera Dec 2, 2024

jm-rivera Dec 2, 2024

jm-rivera Dec 2, 2024

jm-rivera Dec 2, 2024

jm-rivera Dec 2, 2024

lpicci96 commented Dec 2, 2024

		BASE_URL = "https://www.imf.org/external/np/fin/data/rms_sdrv.aspx"


		def read_data():

		BASE_URL: str = "https://www.imf.org/external/np/fin/data/sdr_ir.aspx"


		def read_data():

7 sdr data #8

7 sdr data #8

Conversation

lpicci96 commented Dec 2, 2024

codecov-commenter commented Dec 2, 2024 • edited Loading

Codecov Report

jm-rivera left a comment

Choose a reason for hiding this comment

jm-rivera Dec 2, 2024

Choose a reason for hiding this comment

jm-rivera Dec 2, 2024

Choose a reason for hiding this comment

jm-rivera Dec 2, 2024

Choose a reason for hiding this comment

jm-rivera Dec 2, 2024

Choose a reason for hiding this comment

jm-rivera Dec 2, 2024

Choose a reason for hiding this comment

jm-rivera Dec 2, 2024

Choose a reason for hiding this comment

lpicci96 commented Dec 2, 2024

codecov-commenter commented Dec 2, 2024 •

edited

Loading