Children's Services' Data Tool


This repository holds a set of tools and utilities for processing and cleaning Children's Services' data.

Most of the utilities are centred around three core datasets:

  • SSDA903
  • CIN Census
  • Annex A

Introduction to the LIIA project

The LIIA (London Innovation and Improvement Alliance) project brings together Children’s Services data from all the Local Authorities (LAs) in London with the aim of providing analytical insights that are uniquely possible using pan-London datasets.

Please see LIIA Child Level Data Project for more information about the project, its aims and partners.

Purpose of liia-tools-pipeline package

The package processes data deposited onto the data platform by local authorities so that it can be used for analysis.

This is a Dagster library that is set up to run as a code server (a Dagster code location).
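
For orientation, a Dagster code location module exposes its jobs (and any schedules or sensors) through a repository object that the Dagster webserver discovers when it loads the file. Below is a minimal sketch of such a module; the op and job names are purely hypothetical and are not the actual definitions in liiatools_pipeline/repository_la.py or repository_org.py:

```python
# Minimal sketch of a Dagster code location module.
# All op/job/repository names here are hypothetical illustrations,
# not the actual definitions shipped in liiatools_pipeline.
from dagster import job, op, repository


@op
def clean_deposited_files():
    """Placeholder op standing in for a data-cleaning step."""
    ...


@job
def clean_job():
    """Placeholder job wiring the ops together."""
    clean_deposited_files()


@repository
def example_repository():
    """The object Dagster discovers when this file is loaded as a code server."""
    return [clean_job]
```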

How to use:

Local Development

  1. Run poetry install
  2. Copy .env.sample to .env and fill in the variables there as needed
  3. Run one of the following commands (an in-process alternative is sketched after this list):
    • For LA-level pipeline work: poetry run dagster dev -f .\liiatools_pipeline\repository_la.py
    • For Region-level (Organisation) pipeline work: poetry run dagster dev -f .\liiatools_pipeline\repository_org.py
  4. Once running, navigate to http://localhost:3000/
  5. Add the pre-commit hook by running pre-commit install. This ensures your code is formatted before you commit.
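
While iterating locally, it can also be convenient to execute a job in-process from a script or test, without starting the web UI. A minimal sketch, assuming a job named clean_903_job is importable from liiatools_pipeline.repository_la (the real job names may differ):

```python
# Quick local check: run a Dagster job in-process and assert it succeeded.
# "clean_903_job" is a hypothetical name -- substitute a job actually defined
# in liiatools_pipeline.repository_la.
from liiatools_pipeline.repository_la import clean_903_job

if __name__ == "__main__":
    result = clean_903_job.execute_in_process()
    assert result.success
```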

Preparation for Production or Staging

In production, the library is built into a Docker container using the configuration in the file Dockerfile_user_code. Which code servers are used can be specified in the installation; see the SFDATA Platform's Workspace definition for details.

The idea is that each code server has its own setup, which is a copy of what is here.

Note: Multiple libraries, pipelines, etc. can exist in a single code server. Use separate servers when they have conflicting requirements (e.g. different Python versions).

Documentation

Take a look at the documentation to understand what this code is designed to do and how to replicate it for your own dataset transformations.
