Welcome to the DQW Structured data repository! 🏗️

This repo contains the code for the structured data DQW Streamlit app. For maintenance purposes, the DQW Streamlit apps have been split into 5 separate apps.

The packages used in the application are in the table below.

App section	Description	Visualisation	Selection	Package
Synthetic tabular	x	x		table-evaluator
Tabular	x	x		sweetviz
Tabular	x	x		pandas-profiling
Tabular, text			x	PyCaret

Structured (tabular) data

Key points addressed:

Quantitative measures – number of rows and columns.
Qualitative measures – column types.
Descriptive statistics with NumPy: for numeric columns, for example, count, mean, percentiles and standard deviation; for discrete columns, count, unique, top and frequency.
Explore missing data.
Examine outliers.
Mitigate class imbalance.
Compare datasets, such as train, test and evaluation data.
Evaluate synthetic datasets.
Create a quality report.
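
Some of the checks listed above can be sketched in a few lines of Python. This is a minimal, dependency-free illustration using only the standard library, not the code the app itself runs (the app relies on the packages in the table above). The toy `rows` table and the helper names `missing_counts`, `iqr_outliers` and `class_shares` are hypothetical, and the 1.5 × IQR rule is one common outlier convention assumed here for the example.

```python
import statistics
from collections import Counter

# Hypothetical toy table: each dict is one row, None marks a missing value.
rows = [
    {"age": 25, "income": 30000, "label": "a"},
    {"age": 31, "income": 42000, "label": "a"},
    {"age": None, "income": 39000, "label": "a"},
    {"age": 28, "income": 500000, "label": "b"},
    {"age": 35, "income": 41000, "label": "a"},
    {"age": 29, "income": 36000, "label": "a"},
]

# Quantitative measures: number of rows and columns.
n_rows, n_cols = len(rows), len(rows[0])


def missing_counts(table):
    """Count missing (None) values per column."""
    return {col: sum(1 for r in table if r[col] is None) for col in table[0]}


def iqr_outliers(values, k=1.5):
    """Return the values outside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


def class_shares(labels):
    """Return the share of rows per class, to spot class imbalance."""
    counts = Counter(labels)
    total = sum(counts.values())
    return {label: n / total for label, n in counts.items()}


incomes = [r["income"] for r in rows]
labels = [r["label"] for r in rows]

# Descriptive statistics for one numeric column.
income_mean = statistics.mean(incomes)
income_std = statistics.stdev(incomes)

missing = missing_counts(rows)
outliers = iqr_outliers(incomes)
shares = class_shares(labels)
```

With real data, the same questions are usually answered with a dataframe library, but the logic stays the same: count missing values per column, flag values far outside the interquartile range, and look at how unevenly the classes are distributed.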

To complete the key points, 4 subsections are created:
