
1st Place Solution Summary

You can find the code on GitHub here.

Multiome

Model Overview

Input Processing

Target Preprocessing

tSVD-based imputation method:

- Perform dimensionality reduction on the data with tSVD.
- Transform the reduced data back to the original space.
- Wherever the original data is 0, copy in the value from the transformed data.
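
The imputation steps above can be sketched roughly as follows. This is a minimal illustration, not the author's code: it assumes NumPy, uses `np.linalg.svd` truncated to the top components as a stand-in for a dedicated tSVD implementation, and the function name `tsvd_impute` and the value of `n_components` are hypothetical.

```python
import numpy as np


def tsvd_impute(x: np.ndarray, n_components: int) -> np.ndarray:
    """Fill zero entries of x with values from a rank-limited reconstruction."""
    # Truncated SVD: keep only the top n_components singular triplets.
    u, s, vt = np.linalg.svd(x, full_matrices=False)
    k = n_components
    reduced = u[:, :k] * s[:k]        # low-dimensional representation
    reconstructed = reduced @ vt[:k]  # transformed back to the original space
    # Keep non-zero values as they are; fill zeros from the reconstruction.
    return np.where(x == 0, reconstructed, x)


# Toy target matrix (cells x features) with two zero entries.
x = np.array([[1.0, 2.0, 0.0],
              [2.0, 4.0, 6.0],
              [3.0, 0.0, 9.0]])
imputed = tsvd_impute(x, n_components=1)
```

Only the zero entries change; every non-zero value of the original data is preserved exactly.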

Model

Output Postprocessing and Loss

In the inference phase, the model outputs the average of the five target predictions.

CITEseq

Model Overview

Input Preprocessing

To select important genes in CITEseq, the correlation coefficient is calculated for each batch, and only genes with high correlation in many batches are selected.

Genes were selected from those related to the target proteins and their pathways.

I used Reactome as the pathway database.
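
The per-batch correlation filter described above could look something like the sketch below. Several details are assumptions, since the summary does not specify them: the correlation is taken as Pearson's r between each gene and a single target vector, and the function name `select_genes` and the parameters `corr_threshold` and `min_batches` are hypothetical illustrative choices.

```python
import numpy as np


def select_genes(x, y, batches, corr_threshold=0.5, min_batches=2):
    """Return indices of genes whose |Pearson r| with y is high in enough batches.

    x: (cells, genes) expression, y: (cells,) target, batches: (cells,) labels.
    corr_threshold and min_batches are illustrative values, not from the source.
    """
    hits = np.zeros(x.shape[1], dtype=int)
    for b in np.unique(batches):
        mask = batches == b
        xb = x[mask] - x[mask].mean(axis=0)
        yb = y[mask] - y[mask].mean()
        denom = np.sqrt((xb ** 2).sum(axis=0) * (yb ** 2).sum())
        # Constant genes get correlation 0 instead of a division by zero.
        corr = np.divide(xb.T @ yb, denom,
                         out=np.zeros(x.shape[1]), where=denom > 0)
        hits += np.abs(corr) >= corr_threshold
    return np.flatnonzero(hits >= min_batches)


# Toy data: two batches of four cells, three genes.
batches = np.array(["a"] * 4 + ["b"] * 4)
y = np.array([1.0, 2.0, 3.0, 4.0, 1.0, 2.0, 3.0, 4.0])
gene_both = 2 * y                                            # correlated in both batches
gene_one = np.array([1.0, 2.0, 3.0, 4.0, 1.0, -1.0, -1.0, 1.0])  # only in batch "a"
gene_flat = np.full(8, 5.0)                                  # constant
x = np.column_stack([gene_both, gene_one, gene_flat])

selected = select_genes(x, y, batches)
```

Requiring a high correlation in several batches, rather than on the pooled data, favours genes whose relationship to the target is not specific to one batch.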

Target Preprocessing

Model

Output Postprocessing and Loss

In the inference phase, the model outputs the average of the five target predictions.

Local evaluation

I used two evaluation schemes.

Evaluation with cross validation:
- 5-fold cross validation grouped by donor and day
Evaluation for hyperparameter optimization with Optuna:
- The training data set is split into training (80%) and validation (20%) sets.
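
As an illustration of the grouped cross validation above, the following sketch keeps every (donor, day) group inside a single fold. It is a simplified pure-Python stand-in for a library group splitter, not the author's implementation; the function name `group_k_fold` and the round-robin assignment are assumptions.

```python
from collections import defaultdict


def group_k_fold(donors, days, n_splits=5):
    """Yield (train_idx, valid_idx) so each (donor, day) group stays in one fold."""
    groups = defaultdict(list)
    for i, key in enumerate(zip(donors, days)):
        groups[key].append(i)
    if len(groups) < n_splits:
        raise ValueError("need at least n_splits distinct (donor, day) groups")
    # Assign groups to folds round-robin, largest groups first for rough balance.
    folds = [[] for _ in range(n_splits)]
    ordered = sorted(groups.items(), key=lambda kv: (-len(kv[1]), kv[0]))
    for j, (_, idx) in enumerate(ordered):
        folds[j % n_splits].extend(idx)
    for k in range(n_splits):
        valid = sorted(folds[k])
        train = sorted(i for f, fold in enumerate(folds) if f != k for i in fold)
        yield train, valid


# Toy metadata: 12 cells from 3 donors on 2 days (6 groups).
donors = ["A", "A", "B", "B", "C", "C"] * 2
days = [2, 3, 2, 3, 2, 3] * 2
splits = list(group_k_fold(donors, days))
```

Because no (donor, day) combination appears in both the training and validation indices of a split, the validation score reflects performance on unseen batches.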

Ensemble

I used the weighted average of predictions of the following models.

Models trained with different seeds
Models fine-tuned on only some batches
- Batch combination pattern examples: males only, females only, Days 4 and 7 only, etc.
- A model trained on the full training data set is used as the pre-trained model
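
A weighted average over model predictions can be sketched as below. The weights, their normalisation to sum to 1, and the function name `weighted_average` are assumptions for illustration; the summary does not state how the actual weights were chosen.

```python
import numpy as np


def weighted_average(predictions, weights):
    """Combine per-model predictions (each of shape cells x targets) by weight."""
    preds = np.stack(predictions)          # (models, cells, targets)
    w = np.asarray(weights, dtype=float)
    if w.shape != (preds.shape[0],):
        raise ValueError("one weight per model is required")
    w = w / w.sum()                        # normalise so the weights sum to 1
    return np.tensordot(w, preds, axes=1)  # (cells, targets)


# Two hypothetical models predicting 2 targets for 2 cells.
p1 = np.array([[1.0, 2.0], [3.0, 4.0]])
p2 = np.array([[3.0, 4.0], [5.0, 6.0]])
ensemble = weighted_average([p1, p2], weights=[3, 1])
```

With equal weights this reduces to the plain mean of the model predictions.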

Development setup

Download resources

res_dir=src/shuji_suzuki/resources
mkdir -p "$res_dir"
wget https://ftp.ebi.ac.uk/pub/databases/genenames/hgnc/tsv/hgnc_complete_set.txt -O "$res_dir/hgnc_complete_set.txt"
wget https://reactome.org/download/current/ReactomePathways.gmt.zip -O "$res_dir/ReactomePathways.gmt.zip" &&
  unzip "$res_dir/ReactomePathways.gmt.zip" -d "$res_dir" && 
  rm "$res_dir/ReactomePathways.gmt.zip"

Clone repo

echo shu65_openproblems > src/shuji_suzuki/.gitignore
git clone https://github.com/shu65/open-problems-multimodal.git src/shuji_suzuki/shu65_openproblems

Executing the method

Run method

viash run src/shuji_suzuki/config.vsh.yaml -- \
  --input sample_data \
  --output output \
  ---memory 100GB \
  ---cpus 30
