← Back to papers

Paper deep dive

SpecTM: Spectral Targeted Masking for Trustworthy Foundation Models

Syed Usama Imtiaz, Mitra Nasr Azadani, Nasrin Alamdari

Year: 2026Venue: arXiv preprintArea: cs.AIType: PreprintEmbeddings: 25

Intelligence

Status: succeeded | Model: google/gemini-3.1-flash-lite-preview | Prompt: intel-v1 | Confidence: 98%

Last extracted: 3/26/2026, 2:37:00 AM

Summary

SpecTM (Spectral Targeted Masking) is a physics-informed self-supervised learning framework for Earth observation foundation models. By masking domain-specific diagnostic spectral bands (e.g., phycocyanin and chlorophyll-a absorption) rather than using stochastic masking, SpecTM forces the model to learn intrinsic spectral covariance. Evaluated on microcystin concentration regression using NASA PACE hyperspectral imagery over Lake Erie, SpecTM significantly outperforms baseline models in both current-week and 8-day-ahead predictions, demonstrating superior label efficiency and trustworthiness.

Entities (5)

Lake Erie · geographic-location · 100%Microcystin · chemical-compound · 100%NASA PACE · satellite-mission · 100%SpecTM · framework · 100%Vision Transformer · model-architecture · 95%

Relation Signals (3)

SpecTM appliedto Lake Erie

confidence 100% · using NASA PACE hyperspectral imagery over Lake Erie

SpecTM utilizesdatafrom NASA PACE

confidence 100% · evaluated it on a downstream microcystin concentration regression model using NASA PACE hyperspectral imagery

SpecTM improvespredictionof Microcystin

confidence 95% · SpecTM achieves R^2 = 0.695 (current week) and R^2 = 0.620 (8-day-ahead) predictions

Cypher Suggestions (2)

Identify the impact of SpecTM on specific target variables. · confidence 95% · unvalidated

MATCH (f:Framework {name: 'SpecTM'})-[:IMPROVES_PREDICTION_OF]->(t:TargetVariable) RETURN t.name

Find all models that utilize specific satellite missions for environmental monitoring. · confidence 90% · unvalidated

MATCH (m:Model)-[:UTILIZES_DATA_FROM]->(s:SatelliteMission) RETURN m.name, s.name

Abstract

Abstract:Foundation models are now increasingly being developed for Earth observation (EO), yet they often rely on stochastic masking that do not explicitly enforce physics constraints; a critical trustworthiness limitation, in particular for predictive models that guide public health decisions. In this work, we propose SpecTM (Spectral Targeted Masking), a physics-informed masking design that encourages the reconstruction of targeted bands from cross-spectral context during pretraining. To achieve this, we developed an adaptable multi-task (band reconstruction, bio-optical index inference, and 8-day-ahead temporal prediction) self-supervised learning (SSL) framework that encodes spectrally intrinsic representations via joint optimization, and evaluated it on a downstream microcystin concentration regression model using NASA PACE hyperspectral imagery over Lake Erie. SpecTM achieves R^2 = 0.695 (current week) and R^2 = 0.620 (8-day-ahead) predictions surpassing all baseline models by (+34% (0.51 Ridge) and +99% (SVR 0.31)) respectively. Our ablation experiments show targeted masking improves predictions by +0.037 R^2 over random masking. Furthermore, it outperforms strong baselines with 2.2x superior label efficiency under extreme scarcity. SpecTM enables physics-informed representation learning across EO domains and improves the interpretability of foundation models.

Tags

ai-safety (imported, 100%)csai (suggested, 92%)preprint (suggested, 88%)

Links

Your browser cannot display the PDF inline. Open PDF directly →

Full Text

24,950 characters extracted from source content.

Expand or collapse full text

SpecTM: Spectral Targeted Masking for Trustworthy Foundation Models Syed Usama Imtiaz, Mitra Nasr Azadani, and Nasrin Alamdari Copyright 2026 IEEE. Published in the 2026 IEEE International Geoscience and Remote Sensing Symposium (IGARSS 2026), scheduled for 9 - 14 August 2026 in Washington, D.C.. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works, must be obtained from the IEEE. Contact: Manager, Copyrights and Permissions / IEEE Service Center / 445 Hoes Lane / P.O. Box 1331 / Piscataway, NJ 08855-1331, USA. Telephone: + Intl. 908-562-3966. Abstract Foundation models are now increasingly being developed for Earth observation (EO), yet they often rely on stochastic masking that do not explicitly enforce physics constraints; a critical trustworthiness limitation, in particular for predictive models that guide public health decisions. In this work, we propose SpecTM (Spectral Targeted Masking), a physics-informed masking design that encourages the reconstruction of targeted bands from cross-spectral context during pretraining. To achieve this, we developed an adaptable multi-task (band reconstruction, bio-optical index inference, and 8-day-ahead temporal prediction) self-supervised learning (SSL) framework that encodes spectrally intrinsic representations via joint optimization, and evaluated it on a downstream microcystin concentration regression model using NASA PACE hyperspectral imagery over Lake Erie. SpecTM achieves R2=0.695R^2=0.695 (current week) and R2=0.620R^2=0.620 (8-day-ahead) predictions surpassing all baseline models by (++34% (0.51 Ridge) and ++99% (SVR 0.31)) respectively. Our ablation experiments show targeted masking improves predictions by +0.037+\!0.037 R2R^2 over random masking. Furthermore, it outperforms strong baselines with 2.2×2.2× superior label efficiency under extreme scarcity. SpecTM enables physics-informed representation learning across EO domains and improves the interpretability of foundation models. I Introduction Masked image modeling (MIM) based foundation models [1] learn representations by randomly masking a high proportion of the input image (processed as patch tokens) and reconstructing the masked regions from the visible context. Yet, random masking provides no explicit physical constraint on what the model learns, and can rely on spurious correlations present in the training distribution. In Earth observation (EO) models, particularly for applications that inform public health decisions, this creates a trustworthiness gap. Here, trustworthiness refers to developing models that are interpretable on established domain knowledge learning. Satellite remote sensing (SRS) utilizing hyperspectral imagery, such as by NASA’s recently launched Plankton, Aerosol, Cloud, Ocean Ecosystem (PACE) mission [2], provides unprecedented spectral observational details with high-fidelity that can help close this gap. While natural RGB images mainly learn from spatial patterns (shapes and textures) with limited spectral information, hyperspectral data encode physics(absorption/scattering) of constituents at specific wavelengths. For instance, phycocyanin absorption near λ​ 620​nmλ\ 620\,nm is a marker of cyanobacterial biomass and Chlorophyll-a absorption near λ​ 665​nmλ\ 665\,nm indicates phytoplankton (photosynthetic pigment) abundance [3, 4, 5]. The targeted masking of these pigment-sensitive bands would encourage the model to learn established bio-optical relationships rather than stochastic patterns. Current methods, such as SpectralGPT [6] apply 3D spatial spectral masking using a high masking ratio (90%), SatMAE [7] temporally masks spatial patches, and TerraMAE [8] groups bands using statistical reflectance correlation. Yet these methods share a critical limitation: spectral bands are masked either uniformly at random or via statistical groupings. These methods implicitly assume that all wavelengths are equally informative. We propose SpecTM (Spectral Targeted Masking), a targeted masking design that masks pigment-sensitive bands to learn cross-band spectral dependencies via reconstruction. We validate our approach on the downstream task of microcystin concentration prediction using NASA PACE hyperspectral imagery over Lake Erie, a region with annual economic impacts estimated at over $70 million. Our contributions: 1. We posit that embedding domain knowledge for masking strategy bridges the trustworthiness gap in spectral foundation models (Section I-A). 2. We introduced an adaptable self-supervised learning (SSL) framework for hyperspectral imagery that encodes spectrally intrinsic representations via multi-task joint optimization: band reconstruction, bio-optical index inference, and 8-day-ahead temporal prediction (Section I-C). 3. We demonstrate direct hyperspectral-to-microcystin regression using lab measurements as ground truth, without relying on proxies (e.g., chlorophyll-a) or threshold-based classification; to our knowledge, this is the first direct 8-day-Ahead microcystin prediction from hyperspectral imagery. (Section IV-B). I Related Work In the realm of SSL, MIM leverages unlabeled datasets through pretext tasks to learn improved representations. Masked Autoencoders (MAEs) [9] have catalyzed this paradigm shift in representation learning for Earth observation, such as SpectralGPT [6], a large (600M+) spectral remote-sensing foundation model trained on multispectral (Sentinel-2) data. However, architectural adaptations have recently emerged to enhance the efficacy of learning from satellite data. For instance, SatMAE [7]introduces temporal positional embeddings, while S-MAE [10], a dual-branch architecture that learns spatial and spectral representations in parallel. MIM was further refined in TerraMAE [8]by incorporating spectral-fidelity losses that better preserve inter-band correlations. Despite these architectural advancements, the masking strategy, particularly physics-informed spectral band masking, remains underexplored; such physically relevant band learning holds immense potential to improve both performance and the trustworthiness of foundation models. Algal bloom monitoring [11], now increasingly satellite-based, exemplifies the necessity for physics-informed spectral analysis for downstream prediction tasks that inform public health decisions. In this regard, the predictive strength of the Cyanobacteria Index, and the phycocyanin algorithms are derived precisely from targeting wavelengths with known biogeochemical significance that are directly encoded within the algorithm formulations [12]. In addition, artificial intelligence (AI) is increasingly being applied to environmental [14] and bacterial modeling [13]. Predictive modeling for bloom presence has achieved strong accuracy [15, 16], and our prior work using contrastive SSL improved detection from satellite imagery [17]. Bloom toxicity cannot be directly observed from satellite imagery, and continuous toxin concentration regression remains challenging. Microcystin cyanotoxin concentrations exhibit a nonlinear and temporally variable dependence on bloom biomass [18], and this challenge is further compounded by the scarcity of labeled observations, and inherent inland hydrology [19]. While SSL offers a potential solution to label scarcity, it doesn’t impose physics-based spectral constraints. These constraints underpin the trustworthiness of bio-optical algorithms. I Method I-A Targeted Masking Formalization Let ∈ℝBx ^B represent a B-band per-pixel hyperspectral reflectance spectrum and let ⊂1,…,BD⊂\1,…,B\ denote the set of diagnostic band indices identified from domain knowledge. Targeted masking deterministically defines the mask: mb=​[b∈],m_b=1[b ], (1) where mb∈0,1m_b∈\0,1\ indicates whether band b is masked. All diagnostic bands are masked while context bands remain fully visible. For cyanobacteria monitoring, we define D to include three bio-optically significant spectral regions: phycocyanin absorption (615–640 nm), chlorophyll-a red absorption (660–680 nm), and the red/NIR transition region (695–720 nm), spanning 28 of 122 PACE OCI bands. Masked bands are zeroed before spectral tokenization (rather than replaced with learnable mask vectors) so that masked-band values do not contribute to aggregated spectral tokens when multiple bands are combined. The selection of D follows established phytoplankton optics [20]. I-B Architecture We employ a Vision Transformer encoder with 6 layers, 8 attention heads, embedding dimension d=256d=256, and feedforward dimension 1024 (∼ 6M parameters). The 122 PACE OCI bands (∼ 345–2260 nm) are tokenized into 12 contiguous spectral tokens at predefined bio-optically meaningful boundaries (Figure 1). A 2-layer transformer decoder reconstructs band-level outputs from the encoded token representations. A learnable classification (CLS) token is prepended to the spectral token sequence; its final representation serves as the global encoding for downstream prediction heads. Meteorological features from the gridMET dataset (52 total: 10 base variables at t, t−8t-8, t−16t-16, t−32t-32 days, plus 12 derived features) are incorporated as a separate meteorology token, conditioning spectral reconstruction on environmental drivers. Figure 1: End-to-End Experimental Design for SpecTM Pretraining and Microcystin Quantification using NASA PACE Hyperspectral Imagery and limited ground truth labels I-C Multi-task Pretraining We jointly optimize three self-supervised objectives that learned complementary aspects of spectral-temporal structure. Spectral reconstruction (ℒreconL_recon) minimizes MSE between predicted and true reflectance at masked band positions only, forcing the encoder to infer diagnostic bands from spectral context. Physics index prediction (ℒphysL_phys) regresses six bio-optical indices (Figure 1) from the CLS token embedding; since defining bands are deterministically masked, success requires learning spectral covariance structure rather than direct computation. Temporal forecasting (ℒtempL_temp) predicts the full spectrum at t+8t+8 days (8-day composite cadence), encouraging the encoder to capture bloom temporal dynamics. The combined loss is: ℒSSL=λ1​ℒrecon+λ2​ℒphys+λ3​ℒtempL_SSL= _1L_recon+ _2L_phys+ _3L_temp (2) with weights λ1=1.0 _1=1.0, λ2=0.5 _2=0.5, λ3=0.3 _3=0.3 (selected via grid search on SSL validation loss). For SSL pretraining, we train for 100 epochs with AdamW (lr 10−410^-4, weight decay 0.01), batch size 256, and cosine schedule with 5-epoch warmup (∼ 4 hours on NVIDIA A100). Pretraining quality is measured by masked-position Pearson r and physics-index R2R^2. I-D Fine-Tuning and Evaluation For downstream microcystin prediction, we freeze the pretrained encoder to prevent overfitting on the small labeled dataset and train a 2-layer MLP head (hidden dimensions 128, 64; dropout 0.3) using MSE loss on the log-transformed target y=log⁡(1+MC)y= (1+MC).For fine-tuning, we use AdamW (lr 10−310^-3), up to 100 epochs, and early stopping (patience 20). Current week composite (n=147n=147) uses Leave-One-Group-Out CV by 8-day composite period; 8-day-ahead prediction (n=98n=98 consecutive pairs) 2024 → 2025 temporal split. We report R2R^2, RMSE, and Spearman ρ. IV Experiments We selected Western Lake Erie for its recurrent cyanobacterial blooms and intensive monitoring coverage, enabling systematic pairing of hyperspectral observations with in-situ toxin measurements. Figure 1 summarizes our experimental workflow. For self-supervised pretraining, we utilized NASA PACE OCI Level-3 mapped 8-day composites (2 km resolution) acquired between April 2024 and August 2025, yielding 71,320 spectral-meteorological pairs. Preprocessing follows our previous validation protocol [17]. We obtained microcystin concentrations from NOAA GLERL weekly sampling with strict alignment criteria (≤ 2 km, ± 4 days), returning 147 matched samples (0.10–10.70 μ /L) and 98 temporally paired observations for 8-day-ahead prediction. We benchmarked against 26,208 baseline configurations spanning seven algorithms and 78 feature combinations, using a strict temporal split (Train: 2024, n=112n=112; Test: 2025, n=35n=35). Ablation analyses isolate contributions of targeted masking and quantify label efficiency (5–100% training fractions). All baselines received identical input features: 122 spectral bands concatenated with 52 meteorological features, ensuring fair comparison. Figure 2: Experimental results. (a) SpecTM outperforms all baselines for both current-week (+34%) and 8-day-ahead (+99%) microcystin prediction. (b) Ablation analysis: targeted masking improves over random by +0.037 R2R^2; SSL pretraining provides +0.18 R2R^2 gain over random initialization. IV-A SSL Learns Physical Spectral Structure SpecTM reconstructs masked diagnostic bands with near-perfect accuracy (r=0.999r=0.999 on held-out validation split; Figure 3), despite the masking of diagnostic bands (620, 665, and 709 nm). This demonstrates that the model learned the spectral covariance structure rather than approximating direct arithmetic computation. To contextualize reconstruction quality, we benchmarked against baseline interpolation methods: linear (r=0.92r=0.92) and cubic spline (r=0.96r=0.96) exhibit significant residual error in comparison to our architecture that achieves near-perfect correlation (r=0.999r=0.999). This affirms that learned representations capture structural dependencies that exceed the capabilities of standard interpolation. IV-B SSL Improves Microcystin Prediction Our pretrained architecture achieved R2R^2 of 0.695 for current-week prediction and 0.620 for 8-day-ahead prediction, surpassing an exhaustive benchmark of 26,208 baseline configurations (Figure 2). The pronounced performance gain in 8-day-ahead prediction (+99% compared to +34% for current-week) demonstrates the efficacy of temporal SSL in learning complex bloom dynamics. This 8-day lead time provides actionable warning for water managers. Figure 3: SSL pretraining validation. (a) Single-sample spectral reconstruction showing visible context bands (blue), true masked values (red), and predicted values (green) across diagnostic wavelengths (615–720 nm). (b) Aggregate reconstruction accuracy across 4,096 samples (r=1.000r=1.000, RMSE=0.018=0.018). Shaded region: 95% CI. IV-C Targeted Masking Outperforms Random Masking Here, we isolate the effects of SSL pretraining via targeted ablations. We define ablation configurations as follows: SSL + context uses pretrained encoder with spectral-only input; SSL + all features adds meteorological auxiliary features; AUX-only uses auxiliary features without the encoder; Random-init replaces pretrained weights with random initialization. For random masking ablation, we matched the masking ratio (28 bands, 23%) and applied contiguous masking within spectral regions to ensure fair comparison. Under identical initialization, targeted masking outperforms random masking by 0.037 R2R^2 (Figure 2), validating that domain-informed band selection improves learned representations. The pretrained SSL features (without auxiliary) provide increase of R2R^2 0.180 relative to random initialization. This substantiates the hypothesis that SSL serves as a mechanism for distilling latent physical priors into the learned representations. The concatenation of physics-derived auxiliary features enhances performance across all configurations. The proximity of the AUX-only baseline (0.624) to the SSL + all features configuration (0.640) suggests that auxiliary features explicitly encode relationships that SSL internalizes implicitly. IV-D Label Efficiency Under Scarcity Figure 4: Label efficiency under data scarcity. SpecTM achieves 1.8×1.8× (current) and 2.2×2.2× (8-day-ahead) improvement over baseline at 5% labeled data. Shaded regions: ± 1 SD across five stratified subsamples. We evaluated label efficiency with uncertainty estimates from five stratified random subsamples per training fraction (Figure 4). At extreme scarcity (n=8n=8), SSL pretraining achieved 2.2×2.2× improvement over the AUX-only baseline despite high variance inherent to such a small environmental sample size. These results substantiate that pretrained representations provide domain-informed inductive bias that persists even under minimal supervision. With increasing labeled data, the performance disparity diminishes as both methodologies converge toward a common asymptotic limit; however, SSL maintains a persistent performance advantage across all data fractions. V Discussion Targeted masking bridges the trustworthiness gap in spectral foundation models. Our pretraining objective enforces physical consistency over stochastic noise or spurious correlations to encode spectral features of intrinsic interpretability. The ablation experiments substantiate that domain-informed masking outperforms random masking. From this, we glean that the model transcends dataset-specific patterns. We find that SpecTM learns similar spectral relationships as known physics features, but through different mechanisms. This is because spectral indices are derived from the same raw spectral bands learned by SpecTM, which makes additional feature information redundant. We posit that the efficacy of SpecTM is therefore paramount in domains where such derived features are absent, such as for applications (e.g., wildfire monitoring) or target variables (e.g., nutrient concentrations) that defy analytical formulation. Our framework extends beyond aquatic applications and establishes a principled approach to encoding domain knowledge for foundation models. Diverse spectroscopic disciplines with established spectral indices, such as vegetation indices (NDVI) in agriculture and mineral absorption indices (MAI) in geology, can tailor targeted masking strategies to encode intrinsic features. Limitation and Future Work: In this study, our experiments are confined to the water quality domain, with future work seeking to expand this scope to multi-domain applications. VI Conclusion We introduced Targeted Masking based on a principle of physics-informed self-supervised learning, designed to mask spectral bands with known sensitivities to the target variable. In this work, we presented a representative example of cyanotoxin (microcystin concentration) prediction as a proof of concept, where we masked phycocyanin, chlorophyll-a, and red-edge-sensitive bands to elucidate spectral covariance for bloom dynamics. We evaluated our approach with detailed experiments and our results demonstrate that this domain-informed learner strongly outperforms traditional baselines and maintains robust performance even with limited labels. As target-sensitive bands are domain-defined, we posit that this principle extends to hyperspectral applications beyond aquatic monitoring. References [1] X. Sun, P. Wang, W. Lu, Z. Zhu, X. Lu, Q. He, J. Li, X. Rong, Z. Yang, H. Chang, Q. He, G. Yang, R. Wang, J. Lu, and K. Fu, “RingMo: A Remote Sensing Foundation Model With Masked Image Modeling,” IEEE Trans. Geosci. Remote Sens., vol. 61, p. 1–22, 2023, doi: 10.1109/TGRS.2022.3194732. [2] P. J. Werdell, M. J. Behrenfeld, P. S. Bontempi, E. Boss, B. Cairns, G. T. Davis, B. A. Franz, U. B. Gliese, E. T. Gorman, O. Hasekamp, K. D. Knobelspiesse, A. Mannino, J. V. Martins, C. R. McClain, G. Meister, and L. A. Remer, “The Plankton, Aerosol, Cloud, Ocean Ecosystem Mission: Status, Science, Advances,” Bull. Amer. Meteorol. Soc., vol. 100, no. 9, p. 1775–1794, 2019, doi: 10.1175/BAMS-D-18-0056.1. [3] S. Mishra, D. R. Mishra, Z. Lee, and C. S. Tucker, “Quantifying cyanobacterial phycocyanin concentration in turbid productive waters: A quasi-analytical approach,” Remote Sens. Environ., vol. 133, p. 141–151, 2013, doi: 10.1016/j.rse.2013.02.004. [4] H. Lyu, Q. Wang, C. Wu, L. Zhu, B. Yin, Y. Li, and J. Huang, “Retrieval of phycocyanin concentration from remote-sensing reflectance using a semi-analytic model in eutrophic lakes,” Ecol. Informat., vol. 18, p. 178–187, 2013, doi: 10.1016/j.ecoinf.2013.09.002. [5] M.-Y. Ho, N. T. Soulier, D. P. Canniffe, G. Shen, and D. A. Bryant, “Light regulation of pigment and photosystem biosynthesis in cyanobacteria,” Curr. Opin. Plant Biol., vol. 37, p. 24–33, 2017, doi: 10.1016/j.pbi.2017.03.006. [6] D. Hong, B. Zhang, X. Li, Y. Li, C. Li, J. Yao, N. Yokoya, H. Li, P. Ghamisi, X. Jia, A. Plaza, P. Gamba, J. A. Benediktsson, and J. Chanussot, “SpectralGPT: Spectral Remote Sensing Foundation Model,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 46, no. 8, p. 5227–5244, 2024, doi: 10.1109/TPAMI.2024.3362475. [7] Y. Cong, S. Khanna, C. Meng, P. Liu, E. Rozi, Y. He, M. Burke, D. Lobell, and S. Ermon, “SatMAE: Pre-training transformers for temporal and multi-spectral satellite imagery,” Adv. Neural Inf. Process. Syst., vol. 35, p. 197–211, 2022. [8] T. B. Faruk, A. Matin, S. Pallickara, and S. L. Pallickara, “TerraMAE: Learning spatial–spectral representations from hyperspectral Earth observation data via adaptive masked autoencoders,” in Proc. 33rd ACM Int. Conf. Adv. Geogr. Inf. Syst., 2025, p. 565–568, doi: 10.1145/3748636.3762770. [9] K. He, X. Chen, S. Xie, Y. Li, P. Dollár, and R. Girshick, “Masked autoencoders are scalable vision learners,” in Proc. IEEE/CVF CVPR, 2022, p. 16000–16009. [10] J. Lin et al., “S-MAE: Spatial-spectral masked autoencoder for multi-source remote sensing image classification,” IEEE Trans. Geosci. Remote Sens., vol. 61, 2023. [11] N. Alamdari, Z. Yan, M. N. Azadani, and S. U. Imtiaz, “Chapter 11 — Algal blooms,” in Data-Driven Earth Observation for Disaster Management, X. Huang, S. Wang, K. Kalogeropoulos, and A. Tsatsaris, Eds. Elsevier, 2026, p. 183–205, doi: 10.1016/B978-0-443-33803-8.00004-4. [12] S. Mishra, R. P. Stumpf, B. Schaeffer, P. J. Werdell, K. A. Loftin, and A. Meredith, “Evaluation of a satellite-based cyanobacteria bloom detection algorithm using field-measured microcystin data,” Sci. Total Environ., vol. 774, 145462, 2021, doi: 10.1016/j.scitotenv.2021.145462. [13] A. Salou Moumouni, S. U. Imtiaz, M. Nasr Azadani, and N. Alamdari, “Near real-time and next-day prediction for *Escherichia coli* (E. coli) concentrations in highly urbanized watersheds,” Water Res., vol. 290, 125030, 2026, doi: 10.1016/j.watres.2025.125030. [14] S. H. Rabby, X. Sun, A. M. I. Hafiz, Z. Yan, S. U. Imtiaz, M. N. Azadani, M. Pakdehi, A. Salou Moumouni, E. Ahmadisharaf, and N. Alamdari, “Application of machine learning methods in water quality modeling,” in Machine Learning and Artificial Intelligence in Toxicology and Environmental Health, Z. Lin and W.-C. Chou, Eds. Academic Press, 2026, p. 271–309, doi: 10.1016/B978-0-443-30010-3.00016-7. [15] P. R. Hill, A. Kumar, M. Temimi, and D. R. Bull, “HABNet: Machine learning, remote sensing–based detection of harmful algal blooms,” IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens., vol. 13, p. 3229–3239, 2020, doi: 10.1109/JSTARS.2020.3001445. [16] C. Acuña-Alonso, D. Barba-Barragáns, E. Seoane-Martínez, and X. Álvarez, “Deep learning for the prediction of cyanobacterial harmful algal blooms in freshwater reservoirs,” Remote Sens. Appl.: Soc. Environ., vol. 40, 101792, 2025, doi: 10.1016/j.rsase.2025.101792. [17] S. U. Imtiaz, M. Nasr Azadani, and N. Alamdari, “SimCLR-enabled wide and deep learning for cyanobacterial bloom prediction from NASA’s PACE hyperspectral mission,” IEEE Geosci. Remote Sens. Lett., vol. 22, p. 1–5, 2025, Art. no. 1504905. [18] R. P. Stumpf, T. W. Davis, T. T. Wynne, J. L. Graham, K. A. Loftin, T. H. Johengen, D. Gossiaux, D. Palladino, and A. Burtner, “Challenges for mapping cyanotoxin patterns from remote sensing of cyanobacteria,” Harmful Algae, vol. 54, p. 160–173, 2016, doi: 10.1016/j.hal.2016.01.005. [19] M. Nasr Azadani, S. U. Imtiaz, and N. Alamdari, “Role of impoundment and irrigation in intensive agriculture watersheds,” J. Hydrol., vol. 662, pt. C, 134075, 2025, doi: 10.1016/j.jhydrol.2025.134075. [20] T. T. Wynne, R. P. Stumpf, M. C. Tomlinson, and J. Dyble, “Characterizing a cyanobacterial bloom in Western Lake Erie using satellite imagery and meteorological data,” Limnol. Oceanogr., vol. 55, no. 5, p. 2025–2036, 2010, doi: 10.4319/lo.2010.55.5.2025.