This is the repository of the work "Enhancing Robot Learning through Learned Human-Attention Feature Maps".
We provide instructions for working with the experimental data recorded in driving tasks with F1TENTH miniature vehicles.
The paper has been accepted at the Workshop on effective Representations, Abstractions, and Priors for Robot Learning (RAP4Robots). A preprint is available here.
The recorded data are freely available on Zenodo.
Each `.zip` file corresponds to a run on the track we built for the experiments at TU Wien.
In each run, we asked a human driver to control the vehicle remotely
while wearing VPS glasses to track the eye gaze.
Each archive contains the following files:
- A `.bag` file that contains all the ROS messages recorded on the car while driving on the track.
- A `.cal` file that contains the calibration information of the user who was wearing the VPS glasses for the run.
- A `.tsv` file that contains the human attention gaze points for each frame (a loading sketch follows this list).
- Two video files (`.mkv` and `.mp4`) recorded via the webcam on the car while driving on the track.
- A `.yaml` file containing the information on the valid sections of the run (necessary for the processing pipeline described below).
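For a quick first look at the gaze data, the `.tsv` file can be loaded with pandas. This is a minimal sketch only: the run folder name and the column layout are placeholders, so check the actual header of the file in the archive you downloaded.

```python
from pathlib import Path
import pandas as pd

# Minimal sketch: inspect the raw gaze points of one run.
# NOTE: "run_01" and the column layout are placeholders; check the
# actual header of the .tsv file in the archive you downloaded.
run_dir = Path("run_01")                # hypothetical run folder
tsv_file = next(run_dir.glob("*.tsv"))  # the gaze-point file of the run

gaze = pd.read_csv(tsv_file, sep="\t")
print(gaze.columns.tolist())  # actual column names in the file
print(gaze.head())            # first gaze samples
```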
To process the raw data and map the focus points from the eye-tracking system into the view of the on-board camera, we perform a coordinate projection.
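For intuition, the sketch below shows one way such a projection can be expressed, namely as a planar homography applied with OpenCV. The homography matrix and the gaze points are dummy values for illustration only; the actual transformation is derived inside the pipeline from the per-user calibration data.

```python
import numpy as np
import cv2

# Illustrative sketch only: project 2D gaze points from the eye-tracker
# image plane into the on-board camera image with a planar homography.
# H is a dummy matrix; the real pipeline computes the transformation
# from the per-user calibration (.cal) data.
H = np.array([[1.02, 0.01, -15.0],
              [0.00, 0.98,   8.0],
              [0.00, 0.00,   1.0]], dtype=np.float64)

# A few gaze points (x, y) in eye-tracker coordinates, shape (N, 1, 2).
gaze_points = np.array([[[512.0, 300.0]],
                        [[640.0, 360.0]]], dtype=np.float64)

# cv2.perspectiveTransform applies the homography to each point.
camera_points = cv2.perspectiveTransform(gaze_points, H)
print(camera_points.reshape(-1, 2))
```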
To reproduce the preprocessing, we provide the Docker image `cyberwyrm/eye-tracking_pipeline`.
- Pull the docker image from Docker Hub: `docker pull cyberwyrm/eye-tracking_pipeline:latest`
- Run the container from the folder containing the dataset (see the instructions above to download the data): `docker run -v <path-to-dataset-folder>:/raw_data/dataset --name eye-tracking_pipeline cyberwyrm/eye-tracking_pipeline:latest`, where `<path-to-dataset-folder>` is the absolute path on the host file system where the raw data are located.
The output of the processing pipeline is produced and stored in the folder of the respective run. It consists of:
- `marked_frames/`: contains all frame images,
- `transformed_coords.csv`: contains the projected focus points for each frame.
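As a hedged example of how the output could be consumed, the sketch below overlays the projected focus points on the extracted frames. The column names (`frame`, `x`, `y`) and the frame file naming scheme are assumptions; adjust them to the actual content of `transformed_coords.csv` and `marked_frames/` in your processed run folder.

```python
from pathlib import Path
import pandas as pd
import cv2

# Sketch: draw the projected focus point onto each frame of one run.
# ASSUMPTIONS: the column names ("frame", "x", "y") and the frame file
# naming scheme are hypothetical; check the actual CSV header and the
# contents of marked_frames/ in your processed run folder.
run_dir = Path("run_01")
coords = pd.read_csv(run_dir / "transformed_coords.csv")

out_dir = run_dir / "overlay_frames"
out_dir.mkdir(exist_ok=True)

for _, row in coords.iterrows():
    frame_path = run_dir / "marked_frames" / f"{int(row['frame']):06d}.png"
    img = cv2.imread(str(frame_path))
    if img is None:
        continue  # skip entries without a matching image file
    cv2.circle(img, (int(row["x"]), int(row["y"])), 10, (0, 0, 255), 2)
    cv2.imwrite(str(out_dir / frame_path.name), img)
```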
If you find this work useful for your own ideas, please cite our paper:
@inproceedings{scheuchenstuhl2023enhancing,
  title     = {Enhancing Robot Learning through Learned Human-Attention Feature Maps},
  author    = {Daniel Scheuchenstuhl and Stefan Ulmer and Felix Resch and Luigi Berducci and Radu Grosu},
  booktitle = {Workshop on effective Representations, Abstractions, and Priors for Robot Learning (ICRA 2023)},
  year      = {2023}
}