Copyright 2020-2023 Vishal Mahajan
Trajectory data collected from Drones over an urban area in Athens. For more info: https://open-traffic.epfl.ch/
This repository provides an automated way to treat the noise and anomalies (in the form of unrealistic peaks) in the acceleration values of a number of vehicles in the pneuma dataset. The algorithm uses a combination of low-pass filters (Savitzky-Golay filter, Gaussian filter) to remove the noise. A machine learning model (XGBoost) is used to reconstruct acceleration time-series and detect anomalies. If anomalies are detected, they are removed. The outputs are the processed time-series profiles of vehicle speeds and accelerations without high frequency noise and unrealistic acceleration peaks.
You should be in src folder when running the codes.
- Specify the path to the raw data from pNEUMA website in the data_formatter.py and run it to convert the files from wide to long format. A sample output is provided.
- Specify the path to the output file from step-1, other paths (to save output data and plots) in the processor.py. Run this file to get the treated speed and acceleration time series.
With the default paths, the plots are saved in plots and data is saved in data/derived.
A link to the processed speed and from the 1000-1030 hour recording is provided at an external link. This file contains only four columns: vehicle id
, frame number
, processed speed
and processed acceleration
. Using vehicle id and frame columns, this data can be merged with the original data.
If you use these codes in your work, kindly cite the following:
Mahajan, V., Barmpounakis, E., Alam, M. R., Geroliminis, N., & Antoniou, C. (2023). Treating Noise and Anomalies in Vehicle Trajectories from an Experiment with a Swarm of Drones. Article Link