Skip to content

Latest commit

 

History

History
166 lines (94 loc) · 11 KB

File metadata and controls

166 lines (94 loc) · 11 KB

Evaluation of presence-only species distribution models

Experimental design

Also available here for better readability / zooming functionalities.

Figure X. Comprehensive overview of the experimental workflow for simulating and evaluating species distribution models using virtual species.

Evaluation workflow

Also available here for better readability / zooming functionalities.

Rescaling metrics

Feature scaling (Min-Max-Normalisierung):

https://wikimedia.org/api/rest_v1/media/math/render/svg/358923abc154221bb5022fc329061f6fc4dcc69f

  • x is an original value
  • x' is the normalized value
  • min(x) lower bound of target range (for AUC 0.5)
  • max(x) upper bound of target range (for AUC 1)
Metric Baseline Min Max Higher Better?
AUC 0.5 0 1 Yes
COR - -1 1 Yes
Spec - 0 1 Yes
Sens - 0 1 Yes
Kappa - -1 1 Yes
PCC - 0 1 Yes
TSS 0 -1 1 Yes
PRG 0.5 0 1 Yes
MAE - 0 1 No
BIAS - -1 1 No

Assessment on how good the evaluation metrics are:

We plot the results of the evaluation metrics against the Pearson correlation between the suitability raster and the prediction map of virtual species.
If the model (points) are plotted on the diagonal, then the metric is performing well.

Evaluation Metric Explained

Results

Figure: AUCROC, Pearson's correlation, AUCPRG, and Specificity. In each plot, one evaluation metric with rescaled values from 0 to 1 is shown on the x-axis, and Pearson's correlation between the true probability of occurrence and the artificial distribution maps (used as the reference for actual model performance) is shown on the y-axis.The dotted pink line depicts the bisector (slope = 1, intercept = 0). Each blue point represents one evaluation metric calculated on one of the 8,335 experimental test datasets. The left column shows results from presence–absence (PA) data, the middle column from presence–background (PBG) data, and the right column from presence-artificial-absence (PAA) data. Rows correspond to different evaluation metrics.

Figure: Sensitivity, true skill statistic (TSS), Cohen’s kappa, and percent correctly classified (PCC). In each plot, one evaluation metric with rescaled values from 0 to 1 is shown on the x-axis, and Pearson's correlation between the true probability of occurrence and the artificial distribution maps (used as the reference for actual model performance) is shown on the y-axis. The dotted pink line depicts the bisector (slope = 1, intercept = 0). Each blue point represents one evaluation metric calculated on one of the 8,335 experimental test datasets. The left column shows results from presence–absence (PA) data, the middle column from presence–background (PBG) data, and the right column from presence-artificial-absence (PAA) data. Rows correspond to different evaluation metrics.

Figure: Symmetric extremal dependence index (SEDI), Smoothed boyce index mean, and omission rate. In each plot, one evaluation metric with rescaled values from 0 to 1 is shown on the x-axis, and Pearson's correlation between the true probability of occurrence and the artificial distribution maps (used as the reference for actual model performance) is shown on the y-axis. SEDI and omission rate are shown on an inversed scale. The dotted pink line depicts the bisector (slope = 1, intercept = 0). Each blue point represents one evaluation metric calculated on one of the 8,335 experimental test datasets.