Skip to content

Latest commit

 

History

History
73 lines (57 loc) · 6.67 KB

README.md

File metadata and controls

73 lines (57 loc) · 6.67 KB

Learning Transactions Representations for Information Management in Banks: Mastering Local, Global, and External Knowledge

The work is currently available as an arXiv preprint.

Abstract

Nowadays, banks use artificial intelligence to solve thousands of business problems that can improve customer’s experience. There exist two kinds of tasks: 1) local ones that focus on a client's current state, e.g., forecasting future actions, and 2) global ones, which consider the general customer behavior, e.g., predicting loan payment completion. Unfortunately, maintaining a large number of different models is challenging. To improve the quality of information management, we propose to examine the possibilities of learning a universal representation of a client given a sequence of transactional data. We introduce a novel approach that integrates external information into a client's representation, leveraging insights from other customers' activities. It is experimentally shown that our method outperforms single-task traditional models. For example, incorporating external information enhances accuracy by 20%. Contrastive self-supervised learning methods are demonstrated to excel in global problems, while generative techniques are superior at local tasks.

Data

Open transactional datasets are available online. Folder raw/ contains the datasets in their initial formats, and the final .parquet files we use are stored in the preprocessed/ folder. To reproduce the experiments, download the preprocessed/ folder and put it into the data/ directory.

Note that you can experiment with custom datasets. To do so, you should prepare a dataframe with transaction records containing columns 'user_id', 'amount', 'timestamp', 'mcc_code', 'global_target', and, optionally, 'local_target' and save it as a .parquet file.

Environment

To reproduce the experiments from the paper, please build and run the docker container using files from docker_scripts/. In particular, you should:

  1. Clone this repository locally;
  2. Fill in the credentials file with your user ID;
  3. Run the following commands in the terminal:
    foo@bar:~/transactions_gen_models/docker_scripts$ bash build_image
    foo@bar:~/transactions_gen_models/docker_scripts$ bash launch_container

Usage

To conduct a full experiment specified in config/master.yaml, including model training and validation, run

foo@bar:~/transactions_gen_models$ python main.py

Note that you can run the experiment in debug mode (single epoch, single batch, single run, turn off logging) with

foo@bar:~/transactions_gen_models$ FAST_DEV_RUN=1 python main.py

Configs structure

We use a hydra-based experiment configuration structure. To assemble the set-up you need, you can easily adjust the config/master.yaml.

  • - preprocessing: choose the dataset and the preprocessing you need from the config/preprocessing/ directory. Note that TPP models (COTIC, NHP, and A-NHP) require time normalization and, thus, have their own configs, tpp_*_nodup.yaml, while all the other models use configs *.yaml.
  • - backbone: choose the backbone representation learning model from the config/backbone/ directory. In the paper, we experiment with the following models: CoLES, AE, AR, MLM, TS2Vec, COTIC, NHP, and A-NHP. However, one could add custom models and the corresponding configuration files to the pipeline.
  • - validation: specify list of validation procedures for the model. By default, we use ['global_validation', 'event_type', 'local_target'] for the Churn, Default anf HSBC datasets. As mentioned in the paper, the Age dataset does not have binary local targets, so, in this case, we only validate the models with ['global_validation', 'event_type']. The parameters of the validation models are specified in the config/validation/ folder.
  • - logger: select wandb, comet, tensorboard, or any other logger you prefer.
  • Set pretrain: true if you want to train the representation learning model from scratch, or pretrain: false if you would like to load the existing encoder's weights and only validate it.
  • Select seed.

Models with external context

In the paper, we propose a representation learning model for transactions that accounts for the external context information of other users. It uses the pretrained CoLES model as a backbone. Please look to the notebooks/ directory to reproduce the experiments with these models. The config/pooling_*_validation.yaml files specify the corresponding configuration files.

Results summary

We compare representation learning approaches of 3 classes: contrastive SSL (CoLES, TS2Vec), generative SSL (AE, MLM, AR), and Temporal Point processing models (COTIC, NHP, A-NHP). Note that these generative SSL models have been adapted to the transactional data domain for the first time. The approaches are validated in terms of their embeddings' global and local properties. See the paper for details on validation procedures and their motivation. The key findings are highlighted here.

General benchmarking

The Figure below shows the trade-off between global validation (x-axis) and local validation (y-axis) results. Thus, the upper and righter the dot is, the better the model it represents. It is seen that generative SSL approaches (AE, MLM, and AR) come on top in terms of the local patterns while staying competitive with the contrastive methods regarding the global embedding features. Consequently, we recommend this class of models as the optimal choice for representation learning for transactional data.

Models with external information

The similar Figure below presents how existing representation learning approaches (e.g., CoLES) can be improved by using external context information from all the users in the dataset.

Citation

If you find this repository useful, please feel free to star it and cite our paper.

@article{bazarova2024universal,
  title={Universal representations for financial transactional data: embracing local, global, and external contexts},
  author={Bazarova, Alexandra and Kovaleva, Maria and Kuleshov, Ilya and Romanenkova, Evgenia and Stepikin, Alexander and Yugay, Alexandr and Mollaev, Dzhambulat and Kireev, Ivan and Savchenko, Andrey and Zaytsev, Alexey},
  journal={arXiv preprint arXiv:2404.02047},
  year={2024}
}