Releases: sodadata/soda-core
Releases · sodadata/soda-core
v3.0.11
What's Changed
- fix: limit the bin size and handle zero division for continious DRO by @baturayo in #1624
- fix: handle DRO generation for columns with 0 rows by @baturayo in #1627
- chore: pin prophet to >=1.1 by @bastienboutonnet in #1629
- [pre-commit.ci] pre-commit autoupdate by @pre-commit-ci in #1630
- Parse cli variables correctly, fix cli tests to actually assert result. by @m1n0 in #1634
- refactor: add bins and weights doc link to DRO exception handling logs by @baturayo in #1633
- Verify data source connection command by @m1n0 in #1636
- Redshift: use SVV_COLUMNS to get table metadata by @m1n0 in #1635
- Snowflake: use upper case in table metadata query by @m1n0 in #1639
- [pre-commit.ci] pre-commit autoupdate by @pre-commit-ci in #1643
- Trino: fix py310 type hints by @m1n0 in #1641
- fix: variable substitution in schema check query by @ceyhunkerti in #1628
- fix(anomaly_check): only send outcomeReasons with severity "warn" or "error" by @tituskx in #1640
- fixing bq separate compute storage project by @thiagodeschamps in #1638
- Bigquery: fix distribution check by @m1n0 in #1647
- Change over time - add same day/month support by @m1n0 in #1645
New Contributors
- @ceyhunkerti made their first contribution in #1628
- @thiagodeschamps made their first contribution in #1638
Full Changelog: v3.0.10...v3.0.11
v3.0.10
What's Changed
- Failed rows configurable sample size by @m1n0 in #1608
- Skip change over time checks when historical measurements not available by @m1n0 in #1615
- Add Check source field for cloud by @m1n0 in #1614
- Use correct dicts when building scan result by @m1n0 in #1612
- Failed rows configurable sample size for duplicate_count check by @m1n0 in #1613
- feat: check historical metrics are not None or log helpful message by @bastienboutonnet in #1600
- Dremio support by @vijaykiran in #1618
- fix: handle very large bin sizes by filtering out outliers for dro generation by @baturayo in #1616
- Include psycopg2 requirement for redshift by @m1n0 in #1620
- fix: ensure PSI and SWD can deal with decimal.Decimal type by @tituskx in #1611
Full Changelog: v3.0.9...v3.0.10
v3.0.9
What's Changed
- Limit failed rows sample limit to 1000 by @m1n0 in #1599
- Add scan result getter by @m1n0 in #1602
- BQ separate project for compute and storage. by @m1n0 in #1598
- Scan results file argument by @vijaykiran in #1603
- Chore/move snowflake account by @jmarien in #1607
- Use filename in check identity by @m1n0 in #1606
Full Changelog: v3.0.8...v3.0.9
Please note that installing soda-core may fail with latest version of protobuf (protocolbuffers/protobuf#10571), if the bug isn't fixed yet, try installing 3.19.4 or 3.20.1
v3.0.8
What's Changed
- feat: raise more user-friendly log message when improting sci library fails by @bastienboutonnet in #1584
- Fix dbt table name by @vijaykiran in #1587
- SODA-914: add context authentication and impersonation for BigQuery by @tooobsias in #1588
- Basic Sqlserver regex support by @m1n0 in #1586
- Fix change over time with percentage with no extra config by @m1n0 in #1592
- Fix mysql/mariadb compatibility for regex by @vijaykiran in #1591
- Minor edits to cli help messages. by @janet-can in #1590
- fix: prevent empty message in outcomeReasons by @bastienboutonnet in #1596
- Scan results dict by @m1n0 in #1595
- Add variable resolution to query level by @vijaykiran in #1597
New Contributors
- @tooobsias made their first contribution in #1588
Full Changelog: v3.0.7...v3.0.8
v3.0.7
What's Changed
- Add
dbt:
prefix to dbt checks by @vijaykiran in #1574 - Update freshness value to be milliseconds and add measure by @vijaykiran in #1575
- Add configurable API URL for dbt cloud by @vijaykiran in #1576
- Resolve variables in user defined queries by @vijaykiran in #1577
- Fix dbt cloud ingest, improve logging. by @m1n0 in #1578
- Fixed port option and regexp by @ScottAtDisney in #1579
- Fix dbt checks not being sent properly by @vijaykiran in #1580
- Fix regex tests for mysql by @vijaykiran in #1583
Full Changelog: v3.0.6...v3.0.7
v3.0.6
What's Changed
- fix: add identityB to add datasource name in identity by @vijaykiran in #1556
- Databricks SQL Warehouse support by @vijaykiran in #1559
- SODA-239 Added application flag to snowflake connect by @tombaeyens in #1561
- Add identites by @vijaykiran in #1569
- Add support for custom sampler by @vijaykiran in #1570
- SAS-322 Handle numerical column/table names by @m1n0 in #1572
- dbt ingestion support by @m1n0 in #1552
Full Changelog: v3.0.5...v3.0.6
v3.0.5
What's Changed
- Update README to include package for mysql by @janet-can in #1532
- Fix duplicate query by @m1n0 in #1543
- Add comments explaining cloud payload by @m1n0 in #1545
- Data source contributing docs by @m1n0 in #1546
- refactor: turn no matching table error into a warning to avoid scan failing when all tables are excluded by @bastienboutonnet in #1533
- feat(profiling): add support for extra numeric and text datatypes by @bastienboutonnet in #1534
- Fix 'missing format' in numeric metrics by @m1n0 in #1549
- Trino data source by @ScottAtDisney in #1553
- Change spark installation to decouple deps by @vijaykiran in #1554
Full Changelog: v3.0.4...v3.0.5
v3.0.4
What's Changed
- testing switch to 22.04 for GA by @jmarien in #1521
- Log and trace Soda Cloud trace IDs by @m1n0 in #1520
- Update docker image for sqlserver support by @vijaykiran in #1522
- Add option to set scan datatime by @vijaykiran in #1531
- MySQL Support by @vijaykiran in #1526
Full Changelog: v3.0.3...v3.0.4
v3.0.3
New Features
- SQLServer support by @vijaykiran in #1515
Fixes
- fix: better logging messages for profiling and discover datasets by @baturayo in #1498
- Fix config file creation when first path is not writable by @m1n0 in #1504
- fix: Failed rows don't consider filter by @vijaykiran in #1505
- Fix log message by @m1n0 in #1507
- Fix reference check for null values in source column by @m1n0 in #1509
- Attach sample rows to reference check by @m1n0 in #1508
- Make sure results to sodacloud are sent when there is an exception by @vijaykiran in #1510
- Fix for regex on collated columns in Snowflake by @ScottAtDisney in #1516
Enhancements
- Check name refactor by @m1n0 in #1502
- Set basic telemetry scan data even in case of exceptions by @m1n0 in #1512
- Improve athena text fixture auth setup by @m1n0 in #1501
- Publish data source packages for python 3.7 by @m1n0 in #1514
- Inform about wrong check indentation in logs by @m1n0 in #1517
- feat: skip row count query during column profiling by @bastienboutonnet in #1518
- feat: support 'text' data type in column profiling by @bastienboutonnet in #1519
Full Changelog: v3.0.2...v3.0.3
v3.0.2
What's Changed
- fix: use
exception
rather thanexc_info
to render traceback in soda-core logger's call of prophet model by @bastienboutonnet in #1462 - feat: Add
samples limit
to failed rows checks by @vijaykiran in #1464 - fix: make name optional for failed rows by @vijaykiran in #1466
- Reduce the Docker image size to acceptable levels again by @jmarien in #1465
- fix: stored row count in cloud is wrong by @vijaykiran in #1467
- fix: handle exceptions from scientific library and log them instead or letting them raise by @tituskx in #1463
- BQ expose remaining client params and auth methods by @m1n0 in #1456
- Support cli --version to output core version by @m1n0 in #1476
- Spark DF: update example api usage by @m1n0 in #1474
- Change default scan definition name by @m1n0 in #1475
- BQ: remove schema, use dataset only by @m1n0 in #1470
- feat: warn users when quotes are present in include excludes identifiers by @bastienboutonnet in #1472
- fix: use default distribution comparison method when user has not provided one by @tituskx in #1469
- [pre-commit.ci] pre-commit autoupdate by @pre-commit-ci in #1478
- Async Docker image building through Actions and dispatch by @jmarien in #1479
- SODA-944 Enable Snowflake Tokens by @vijaykiran in #1480
- Fix utc timezone handling by @m1n0 in #1481
- test: Improve profiling test for all tables and all columns by @baturayo in #1445
- Fix utc timezone handling by @m1n0 in #1488
- Set redshift host before trying to fetch credentials by @bbakersmith in #1486
- fix: change unassigned min and max variables for profiling logs by @baturayo in #1492
- Use check name in Metric checks by @m1n0 in #1489
- fix: if anomaly detection fails other check results are not sent to cloud by @baturayo in #1490
- Treat zero missing or invalid rows as zero percent by @bbakersmith in #1487
- fix: prevent empty table list from running all tables by @baturayo in #1494
- fix: profile column parsing fails when user provides illegal column spec by @baturayo in #1491
- Join check text with newlines instead of /n by @bbakersmith in #1484
- IBM db2 support by @vijaykiran in #1473
- [pre-commit.ci] pre-commit autoupdate by @pre-commit-ci in #1495
New Contributors
- @bbakersmith made their first contribution in #1486
Full Changelog: v3.0.1...v3.0.2