Evaluation of presence-only species distribution models

Experimental design

Also available here for better readability / zooming functionalities.

Figure X. Comprehensive overview of the experimental workflow for simulating and evaluating species distribution models using virtual species.

Evaluation workflow

Also available here for better readability / zooming functionalities.

Rescaling metrics

Feature scaling (Min-Max-Normalisierung):

x is an original value
x' is the normalized value
min(x) lower bound of target range (for AUC 0.5)
max(x) upper bound of target range (for AUC 1)

Metric	Baseline	Min	Max	Higher Better?
AUC	0.5	0	1	Yes
COR	-	-1	1	Yes
Spec	-	0	1	Yes
Sens	-	0	1	Yes
Kappa	-	-1	1	Yes
PCC	-	0	1	Yes
TSS	0	-1	1	Yes
PRG	0.5	0	1	Yes
MAE	-	0	1	No
BIAS	-	-1	1	No

Assessment on how good the evaluation metrics are:

We plot the results of the evaluation metrics against the Pearson correlation between the suitability raster and the prediction map of virtual species.
If the model (points) are plotted on the diagonal, then the metric is performing well.

Results

Figure: AUCROC, Pearson's correlation, AUCPRG, and Specificity. In each plot, one evaluation metric with rescaled values from 0 to 1 is shown on the x-axis, and Pearson's correlation between the true probability of occurrence and the artificial distribution maps (used as the reference for actual model performance) is shown on the y-axis.The dotted pink line depicts the bisector (slope = 1, intercept = 0). Each blue point represents one evaluation metric calculated on one of the 8,335 experimental test datasets. The left column shows results from presence–absence (PA) data, the middle column from presence–background (PBG) data, and the right column from presence-artificial-absence (PAA) data. Rows correspond to different evaluation metrics.

Figure: Sensitivity, true skill statistic (TSS), Cohen’s kappa, and percent correctly classified (PCC). In each plot, one evaluation metric with rescaled values from 0 to 1 is shown on the x-axis, and Pearson's correlation between the true probability of occurrence and the artificial distribution maps (used as the reference for actual model performance) is shown on the y-axis. The dotted pink line depicts the bisector (slope = 1, intercept = 0). Each blue point represents one evaluation metric calculated on one of the 8,335 experimental test datasets. The left column shows results from presence–absence (PA) data, the middle column from presence–background (PBG) data, and the right column from presence-artificial-absence (PAA) data. Rows correspond to different evaluation metrics.

Figure: Symmetric extremal dependence index (SEDI), Smoothed boyce index mean, and omission rate. In each plot, one evaluation metric with rescaled values from 0 to 1 is shown on the x-axis, and Pearson's correlation between the true probability of occurrence and the artificial distribution maps (used as the reference for actual model performance) is shown on the y-axis. SEDI and omission rate are shown on an inversed scale. The dotted pink line depicts the bisector (slope = 1, intercept = 0). Each blue point represents one evaluation metric calculated on one of the 8,335 experimental test datasets.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Evaluation of presence-only species distribution models

Experimental design

Evaluation workflow

Rescaling metrics

Assessment on how good the evaluation metrics are:

Results

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

Evaluation of presence-only species distribution models

Experimental design

Evaluation workflow

Rescaling metrics

Assessment on how good the evaluation metrics are:

Results