An (unofficial) PyTorch implementation of the SizeFitNet (SFNet) architecture proposed in the paper *A Deep Learning System for Predicting Size and Fit in Fashion E-Commerce* by Sheikh et al. (RecSys '19).
The original paper demonstrates experiments on two datasets:
- ModCloth
- RentTheRunWay
Both datasets are curated from fashion e-commerce websites and provide transaction records that contain customer attributes, article attributes and `fit` (the target variable). `fit` is a categorical variable with `small`, `fit` and `large` as the possible labels.
The model consists of two pathways: one that captures user embeddings + user features, the other that captures item embeddings + item features. See the following figure, taken from the paper:
The representations within each pathway are transformed using residual/skip connections. The authors compare this against an MLP baseline (without skip connections) and show better performance.
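As a rough sketch, a skip-connection block of the kind described might look like the following in PyTorch (the class name, layer sizes and activation here are illustrative assumptions, not taken from the paper or this repo):

```python
import torch
import torch.nn as nn

class SkipBlock(nn.Module):
    """Toy residual block: a small feed-forward transform whose
    output is added back to the input (illustrative only)."""

    def __init__(self, dim: int, hidden_dim: int):
        super().__init__()
        self.ff = nn.Sequential(
            nn.Linear(dim, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, dim),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Residual/skip connection: transform x, then add it back.
        return x + self.ff(x)
```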
Different from the paper, I combine the user representation (u) and item representation (v) into a new tensor as below:
[u, v, |u-v|, u*v]
- concatenation of the two representations
- absolute element-wise difference |u - v|
- element-wise product u * v
Based on: https://arxiv.org/pdf/1705.02364.pdf
This new representation is fed to the top layer skip connection block.
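In code, this combination is a single concatenation along the feature dimension. A minimal sketch (the function name is mine; `u` and `v` stand for same-sized user and item representations):

```python
import torch

def combine(u: torch.Tensor, v: torch.Tensor) -> torch.Tensor:
    # [u, v, |u - v|, u * v] concatenated along the feature dimension;
    # the result is 4x the width of u, which the input size of the top
    # skip-connection block must account for.
    return torch.cat([u, v, torch.abs(u - v), u * v], dim=-1)
```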
- Start by installing the necessary packages (preferably within a Python virtual environment): `pip install -r requirements.txt`
- Download the data from here and place it in the `data/` directory.
- The original data is quite messy, with a fair number of missing values (NaNs). As such, I dropped some attributes and performed some data imputation. Refer to the `data_exploration.ipynb` notebook to understand the data processing steps. The notebook also provides instructions on creating the train/validation/test splits.
- Set the data, model and training configurations by appropriately modifying the `jsonnet` files under the `configs/` directory.
- Train the SFNet Model: `python train.py`. The model checkpoints and run configuration will be saved under `runs/<experiment_name>`. Training also generates TensorBoard plots of the training loss and validation metrics, which are useful for monitoring progress (see the TensorBoard command after this list).
- Test the Model: `python test.py --saved_model_path runs/<experiment_name>`
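To view the TensorBoard plots mentioned above, run `tensorboard --logdir runs/` (assuming the default log location used by the training script) and open the printed URL in a browser.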
The model was quite sensitive to hyperparameters: for certain settings, the optimization process would diverge and the validation loss would shoot up. The configuration provided under `configs/model.jsonnet` is what worked best for me.
Below are some TensorBoard graphs for the validation metrics.
| Precision | Recall | F1-score | Accuracy | AUC |
|---|---|---|---|---|
| 0.682 | 0.397 | 0.378 | 0.691 | 0.728 |
Note: The metrics reported here are macro-averaged values.
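For reference, macro-averaged metrics like these can be computed with scikit-learn. A minimal sketch, assuming `y_true`/`y_pred` are label arrays and `y_score` holds per-class probabilities (all three names and the toy values are mine, not from this repo):

```python
import numpy as np
from sklearn.metrics import (accuracy_score,
                             precision_recall_fscore_support,
                             roc_auc_score)

# Toy data for the three fit classes (0 = small, 1 = fit, 2 = large).
y_true = np.array([0, 1, 2, 1, 1, 0])
y_pred = np.array([0, 1, 1, 1, 2, 0])
y_score = np.array([[0.8, 0.1, 0.1],
                    [0.1, 0.7, 0.2],
                    [0.2, 0.5, 0.3],
                    [0.1, 0.8, 0.1],
                    [0.2, 0.3, 0.5],
                    [0.6, 0.2, 0.2]])

# Macro-averaging computes each metric per class, then takes the
# unweighted mean over classes.
precision, recall, f1, _ = precision_recall_fscore_support(
    y_true, y_pred, average="macro", zero_division=0)
accuracy = accuracy_score(y_true, y_pred)
auc = roc_auc_score(y_true, y_score, average="macro", multi_class="ovr")
print(precision, recall, f1, accuracy, auc)
```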
Some future work for this repository, and some plausible ideas for improving the results:
- Experiments on RentTheRunWay Dataset
- L2-regularization on the embeddings
- Batch Normalization on the feed forward layers
- Early Stopping
- Learning Rate Decay
- Weighted Loss Function to account for the class imbalance (see the sketch after this list)
- Contrastive Learning (Siamese networks?) to encourage learning different sub-spaces for positive and negative size-fits
- Topic Modelling Approaches
- Modelling Users/Items as Distributions
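As an example of the weighted-loss idea, PyTorch's `nn.CrossEntropyLoss` accepts a per-class weight tensor. A minimal sketch with inverse-frequency weights (the class counts and the weighting scheme are my assumptions, not from the paper):

```python
import torch
import torch.nn as nn

# Hypothetical class counts for (small, fit, large); "fit" dominates.
class_counts = torch.tensor([3500.0, 25000.0, 4500.0])

# Inverse-frequency weights (each would equal 1.0 if the dataset
# were perfectly balanced); rarer classes get larger weights.
weights = class_counts.sum() / (len(class_counts) * class_counts)

criterion = nn.CrossEntropyLoss(weight=weights)

logits = torch.randn(8, 3)            # model outputs for a batch of 8
targets = torch.randint(0, 3, (8,))   # ground-truth fit labels
loss = criterion(logits, targets)
```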
Thanks to Rishab Mishra for making the datasets used here publicly available on Kaggle. Some ideas for pre-processing the data were borrowed from NeverInAsh.