This project provides a tool and web app that help investigators and researchers make sense of the large body of previous transport accident investigations.
This repository has grown out of my work applying data science to transport accident investigations.
The project has two main deliverables:
- engine: a Python program that gathers publicly available investigation reports and extracts or infers information from them to build datasets of safety issues, recommendations, etc.
- viewer: a Python Flask web app, deployed on Azure, that implements a RAG (retrieval-augmented generation) search engine over some of the datasets generated by the engine.
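
To give a rough idea of what the retrieval half of a RAG search involves, here is a minimal, self-contained sketch. It is not the viewer's actual implementation: the function names, the bag-of-words cosine scoring, and the sample records are all assumptions made up for illustration. A real RAG system would more likely use learned embeddings and would pass the assembled prompt to a language model, a step that is left out here.

```python
import math
import re
from collections import Counter


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query: str, documents: list[str], top_k: int = 2) -> list[str]:
    """Return up to top_k documents that share terms with the query, best first."""
    query_vec = Counter(tokenize(query))
    scored = [(cosine_similarity(query_vec, Counter(tokenize(d))), d) for d in documents]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [doc for score, doc in scored[:top_k] if score > 0]


def build_prompt(query: str, context: list[str]) -> str:
    """Combine retrieved passages and the question into one prompt string."""
    joined = "\n".join(f"- {passage}" for passage in context)
    return f"Context:\n{joined}\n\nQuestion: {query}"


# Made-up example records for illustration only, not real investigation data.
SAMPLE_SAFETY_ISSUES = [
    "Crew fatigue was not managed by the operator's rostering system.",
    "The level crossing lacked adequate warning signage for road users.",
    "Maintenance records for the engine were incomplete.",
]

if __name__ == "__main__":
    question = "fatigue in crew rostering"
    hits = retrieve(question, SAMPLE_SAFETY_ISSUES)
    print(build_prompt(question, hits))
```

The retrieved passages are placed ahead of the question so that whatever model consumes the prompt can ground its answer in them; swapping the scoring function for an embedding-based one would leave the rest of the sketch unchanged.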
There is also a collection of Jupyter notebooks that show the development process behind some of the more complex features in the engine and viewer.
This project started as a university project in my final semester. The university work ran from July to October 2023 and finished with a basic engine and viewer app. Since then, work has been carried out directly with TAIC to take the engine and viewer from POC -> Prototype -> Production. It is expected to be released internally sometime in October 2024.
If you want more information about the project, how it works, and how you can use it, check out the wiki.
If you are curious about the project's history or want non-technical documentation, check out the Project documents.