PON-Del is a predictor for short (1-10 amino acid) sequence retaining deletions. It was trained on an extensive set of variations and showed superior performance compared to other tools. After evaluating multiple frameworks, LightGBM was selected as the final model.
This page provides precalculated results for all possible single amino acid deletions in proteins coded by MANE transcripts.
Deletions starting at position 1 return Pathogenic because deletion of the first amino acid (usually methionine) may prevent normal protein expression.
You can search the data in several different ways.
The datasets used for training and testing the tool are available here: data_pondel.csv
A manuscript describing the predictor has been submitted. In the meantime use URL for citation.
If you have any problems, please contact Haoyang (haoyang.zhang@med.lu.se) or Mauno (mauno.vihinen@med.lu.se).