Skip to content

Commit

Permalink
P-values changed in %Rank
Browse files Browse the repository at this point in the history
  • Loading branch information
DavidGfeller committed Feb 21, 2019
1 parent 878493d commit a4b35f7
Show file tree
Hide file tree
Showing 6 changed files with 25,847 additions and 25,850 deletions.
14 changes: 7 additions & 7 deletions MixMHCpred
Original file line number Diff line number Diff line change
@@ -1,21 +1,21 @@
#!/bin/bash

###############
# MixMHCpred2.0.1 is a predictor of HLA ligand displayed at the cell surface.
# MixMHCpred2.0.2 is a predictor of HLA ligand displayed at the cell surface.
#
# MixMHCpred2.0.1 can be used freely by academic groups for non-commercial purposes (see license).
# MixMHCpred2.0.2 can be used freely by academic groups for non-commercial purposes (see license).
# The product is provided free of charge, and, therefore, on an "as is"
# basis, without warranty of any kind.
#
# FOR-PROFIT USERS
# If you plan to use MixMHCpred (version 2.0.1) or any data provided with the script in any for-profit
# If you plan to use MixMHCpred (version 2.0.2) or any data provided with the script in any for-profit
# application, you are required to obtain a separate license.
# To do so, please contact [email protected] or [email protected] at the Ludwig Institute for Cancer Research Ltd.
#
# If you use MixMHCpred2.0.1 in a publication, please cite:
# Bassani-Sternberg M et al. Deciphering HLA motifs across HLA peptidomes improves neo-antigen predictions and identifies allostery regulating HLA specificity (2017).
# If you use MixMHCpred2.0.2 in a publication, please cite:
# Bassani-Sternberg M et al. Deciphering HLA motifs across HLA peptidomes improves neo-antigen predictions and identifies allostery regulating HLA specificity, PLoS Comp Bio (2017).
# and
# Gfeller et al. The length distribution and multiple specificity of naturally presented HLA-I ligands, BioRxiv (2018).
# Gfeller et al. The length distribution and multiple specificity of naturally presented HLA-I ligands, J Immunol (2018).
#
# For scientific questions, please contact David Gfeller ([email protected])
#
Expand Down Expand Up @@ -43,7 +43,7 @@ usage="Usage: MixMHCpred -i INPUT_FILE -o OUTPUT_FILE -h LIST_OF_ALLELES"

if [[ "$1" == "--help" || "$1" == "-h" ]]; then
cat <<END
MixMHCpred2.0.1
MixMHCpred2.0.2
$usage
Mandatory parameters:
Expand Down
41 changes: 19 additions & 22 deletions README
Original file line number Diff line number Diff line change
@@ -1,54 +1,49 @@

###############
# MixMHCpred2.0.1 is a predictor of HLA-I ligand displayed at the cell surface.
# MixMHCpred2.0.2 is a predictor of HLA-I ligand displayed at the cell surface.
#
# MixMHCpred2.0.1 can be used freely by academic groups for non-commercial purposes (see license).
# MixMHCpred2.0.2 can be used freely by academic groups for non-commercial purposes (see license).
# The product is provided free of charge, and, therefore, on an "as is"
# basis, without warranty of any kind.
#
# FOR-PROFIT USERS
# If you plan to use MixMHCpred (version 2.0.1) or any data provided with the script in any for-profit
# If you plan to use MixMHCpred (version 2.0.2) or any data provided with the script in any for-profit
# application, you are required to obtain a separate license.
# To do so, please contact [email protected] or [email protected] at the Ludwig Institute for Cancer Research Ltd.
#
# If you use MixMHCpred2.0.1 in a publication, please cite:
# Bassani-Sternberg M et al. Deciphering HLA motifs across HLA peptidomes improves neo-antigen predictions and identifies allostery regulating HLA specificity (2017).
# If you use MixMHCpred2.0.2 in a publication, please cite:
# Bassani-Sternberg M et al. Deciphering HLA motifs across HLA peptidomes improves neo-antigen predictions and identifies allostery regulating HLA specificity, PLoS Comp Bio (2017).
# and
# Gfeller et al. The length distribution and multiple specificity of naturally presented HLA-I ligands, BioRxiv (2018).
# Gfeller et al. The length distribution and multiple specificity of naturally presented HLA-I ligands, J Immunol (2018).
#
# For scientific questions, please contact David Gfeller ([email protected])
#
# Copyright (2018) David Gfeller
###############

########################
NEW FEATURES OF VERSION 2.0.1
NEW FEATURES OF VERSION 2.0.2
########################

- Modeling HLA-I specificity with multiple motifs.
- Larger allele coverage (122 HLA-I alleles) based on manual curation of HLA-I motifs in pooled HLA peptidomics datasets, and inclusion of IEDB data for alleles not present in recent HLA peptidomics datasets.
- Predictions for 8- to 14-mers.
- Explicitly modeling peptide length distribution
- Possibility to disregard cystein-containing peptides (option "-c 0", all peptides containing a Cystein are given a score of -100.0). Cysteine can create many issues in binding or immunogenicity assays. In addition it is poorly detected in many HLA peptidomics samples, and therefore is more difficult to accurately predict.
- Most of these features are already available in version 2.0, and only a few minor changes occured in version 2.0.1.
- Following common practices in NetMHCpan and MixMHC2pred, P-values have been replaced by %Rank (i.e., multiplied by 100). These values correspond to the fraction of random peptides of length 8 to 14 that have a score higher than the one of the peptide for predictions are done.

############
INSTALLATION
############

For Mac and Linux:

1) Dowload the MixMHCpred2.0.1.zip file and move it to a directory
1) Dowload the MixMHCpred2.0.2.zip file and move it to a directory
of your choice, where you have writing permissions.

2) Unzip the MixMHCpred2.0.1.zip package.
2) Unzip the MixMHCpred2.0.2.zip package.

3) Open the MixMHCpred file and replace lib_path by the
path to the MixMHCpred2.0.1/lib directory where you have installed the script
path to the MixMHCpred2.0.2/lib directory where you have installed the script

4) To run it from anywhere on your computer, add the MixMHCpred2.0.1. directory to your path.
4) To run it from anywhere on your computer, add the MixMHCpred2.0.2. directory to your path.

5) To test your installation, make sure you are in the MixMHCpred2.0.1/ directory and run the command (should not take more than a few seconds):
5) To test your installation, make sure you are in the MixMHCpred2.0.2/ directory and run the command (should not take more than a few seconds):

./MixMHCpred -i test/test.fa -o test/out.txt -a A0101,A2501,B0801,B1801

Expand All @@ -65,7 +60,7 @@ melanoma sample (from Bassani-Sternberg et al. Nat Comm 2016).
For Linux:

After step 3), you also need to compile the MixMHCpred.cc code. Go to
MixMHCpred2.0.1/lib and compile with your favorite C++ compiler
MixMHCpred2.0.2/lib and compile with your favorite C++ compiler
(e.g. g++ -O3 MixMHCpred.cc -o MixMHCpred.x)


Expand Down Expand Up @@ -113,10 +108,10 @@ OTHER INFORMATION
a random sampling of 700,000 peptides from the human proteome (8- to 14-mers).
This P-value is simply an indicator of how a given peptide scores compares to a universe of random peptides (similar to percentile rank used in other HLA-I ligand predictors). For a single allele, scores larger than 0 correspond to P-values smaller than 0.01.

- The list of alleles is provided in MixMHCpred2.0.1/lib/allele_list.txt.
- The list of alleles is provided in MixMHCpred2.0.2/lib/allele_list.txt.
The third column indicates the number of motifs used to describe the specificity of a given allele and a given peptide length

- The MixMHCpred2.0.1/tmp/ directory is used to store temporary files. Make
- The MixMHCpred2.0.2/tmp/ directory is used to store temporary files. Make
sure there is no restriction for writing and reading files in this directory.

- Predictions of Cystein containing peptides are less reliable due to the low detection rate of Cys in MS.
Expand All @@ -126,4 +121,6 @@ OTHER INFORMATION
- the lib/pwm/ folder contains the different PWMs for each allele and each length of ligands.
The 6th line indicates the weight (differs than one for alleles modelled with multiple motifs),
the number of unique ligands used to train the model and the origin of the data
(1: unfiltered and curated MS data; 2: IEDB_MS data; 3: IEDB_non_MS data; 0: extrapolation from other lengths).
(1: unfiltered and curated MS data; 2: IEDB_MS data; 3: IEDB_non_MS data; 0: extrapolation from other lengths).

- Predictions can be run for 8- to 14-mers. The length distribution obtained in MS data is explicitely incorporated in the scores of the peptides (see Gfeller et al., J Immunol 2018).
16 changes: 8 additions & 8 deletions lib/MixMHCpred.cc
Original file line number Diff line number Diff line change
Expand Up @@ -494,22 +494,22 @@ void make_pred(){


fprintf (pFile, "####################\n");
fprintf (pFile, "# Output from MixMHCpred (v2.0.1)\n");
fprintf (pFile, "# Output from MixMHCpred (v2.0.2)\n");
fprintf (pFile, "# Alleles: %s",alleles[0]); for(int h=1; h<nh; h++){fprintf (pFile, ", %s", alleles[h]);} fprintf (pFile, "\n");
fprintf (pFile, "# Input file: %s\n", input_file_original);
fprintf (pFile, "# MixMHCpred is freely available for academic users.\n");
fprintf (pFile, "# Private companies should contact [email protected] or [email protected] at the Ludwig Institute for Cancer Research Ltd for commercial licenses.\n");
fprintf (pFile, "#\n# To cite MixMHCpred2.0.1, please refer to:\n");
fprintf (pFile, "# Gfeller et al. The length distribution and multiple specificity of naturally presented HLA-I ligands, BioRxiv (2018).\n");
fprintf (pFile, "#\n");
fprintf (pFile, "#\n# To cite MixMHCpred2.0.2, please refer to:\n");
fprintf (pFile, "# Bassani-Sternberg et al. Deciphering HLA-I motifs across HLA peptidomes improves neo-antigen predictions and identifies allostery regulating HLA specificity, PLoS Comp Bio (2017).\n");
fprintf (pFile, "# Gfeller et al. The length distribution and multiple specificity of naturally presented HLA-I ligands, J Immunol (2018).\n");
fprintf (pFile, "####################\n");

fprintf (pFile, "Peptide\t");
//fprintf (pFile, "Max_score\tMax_allele\tRanking\tP_val");
fprintf (pFile, "Score_bestAllele\tBestAllele\tPval_bestAllele");
fprintf (pFile, "Score_bestAllele\tBestAllele\t%%Rank_bestAllele");
for(int h=0; h<nh; h++){
//fprintf (pFile, "\t%s\tRanking\tP_val", alleles[h]);
fprintf (pFile, "\tScore_%s\tPval_%s", alleles[h], alleles[h]);
fprintf (pFile, "\tScore_%s\t%%Rank_%s", alleles[h], alleles[h]);
}
fprintf (pFile, "\n");

Expand All @@ -533,7 +533,7 @@ void make_pred(){
pval=P_val_bin[j];
}
}
fprintf (pFile, "\t%g", pval);
fprintf (pFile, "\t%g", 100*pval);
} else {
fprintf (pFile, "\tNA\tNA\tNA\t");
}
Expand All @@ -549,7 +549,7 @@ void make_pred(){
pval=P_val_bin[j];
}
}
fprintf (pFile, "\t%g", pval);
fprintf (pFile, "\t%g", 100*pval);

} else {
fprintf (pFile, "\tNA\tNA");
Expand Down
Binary file modified lib/MixMHCpred.x
Binary file not shown.
Binary file modified license.pdf
Binary file not shown.
Loading

0 comments on commit a4b35f7

Please sign in to comment.