-
claire.teillet_ird.fr authoredclaire.teillet_ird.fr authored
Species Distribution Modeling applied to Aedes albopictus breeding sites in public spaces (Montpellier) and generating an environmental bias map
Description
This project contains the necessary scripts to perform :
- Species Distribution Modeling (SDM) using biomod2 package in R with environmental variables and breeding sites data of Aedes albopictus. This script allows the calculation of presence probabilities.
- Exploration of environmental biases between breeding sites data and environmental variables.
Installation
Before running the scripts, make sure you have the following R packages installed:
install.packages(c("biomod2", "raster", "rgdal", "sp", "ggplot2", "FactorMineR", "terra", "dplyr","sf"))
Usage
1. Species Distribution Modeling (SDM)
Script: biomod2_breeding_sites_public_spaces.R
This script processes breeding site presence data (.csv) and environmental variables (.tif raster files) to model the distribution of Aedes albopictus breeding sites in public spaces. The workflow includes:
-
Loading Data: Import presence data and raster layers of environmental variables.
-
Data Formatting: Prepare the data for biomod2.
-
Running Individual Models: Train different SDM algorithms and evaluate their performance.
-
Assessing Model Performance: Extract evaluation scores and variable importance scores.
-
Plotting Response Curves: Visualize the relationships between environmental variables and the species presence.
-
Ensemble Modeling: Combine multiple individual models to improve prediction accuracy.
-
Projection: Apply the final model to the entire study area to generate probability maps of breeding site presence.
2. Sampling Bias Analysis
Script: sampling_bias_mtp.R
This script analyzes potential sampling biases by comparing the spatial distribution of presence data with the underlying environmental variables. It includes:
-
Loading data
-
Extraction Environmental Characteristics : Extract environmental variables at breeding site locations from the environmental raster stack.
-
Performing Principal Component Analysis (PCA) : Identify how well the sampled sites represent the environmental variability of the study area
-
Calculating Environmental Distance to Sampled Sites: Compute the minimum environmental distance between each point in the study area and the closest sampled breeding site.
-
Generating an Environmental Bias Map: Convert the computed environmental distances into a raster layer for visualization and analysis.
Data Requirements
Presence Data: A .csv file with at least two columns: longitude, latitude (WGS84 projection recommended).
Environmental Data: Raster layers (.tif format) representing environmental variables (e.g., spectral indices, textural indices, percentage of land cover, temperature, humidity).
Output
-
Model performance metrics (AUC, TSS)
-
Variable importance scores
-
Response curves
-
Probability maps of breeding site presence
-
Uncertainty associated with the probabilities
-
Environmental Bias Map