Integration of AI Models in Renewable Energy Forecasting Systems — Glossary · Professional Certificate in AI for Renewable Energy Forecasting (Thailand)

Adaptive Boosting (AdaBoost) #

A machine‑learning ensemble method that combines multiple weak learners to create a strong predictive model.

Related terms #

weak learner, ensemble.

Explanation #

AdaBoost sequentially trains classifiers, increasing the weight of mis‑classified instances so subsequent models focus on harder cases.

Example #

Using decision stumps to improve short‑term solar irradiance forecasts.

Practical application #

Enhances forecast accuracy for photovoltaic (PV) output during rapidly changing cloud conditions.

Challenges #

Sensitive to noisy data; over‑fitting can occur if many boosting rounds are used without proper regularisation.

Artificial Neural Network (ANN) #

A computational model inspired by biological neurons, consisting of interconnected layers that learn nonlinear relationships.

Related terms #

deep learning, backpropagation.

Explanation #

ANNs adjust weights through gradient descent to minimise forecast error between predicted and observed renewable generation.

Example #

A feed‑forward network predicts wind turbine power based on wind speed, direction, and temperature.

Practical application #

Real‑time forecasting for grid operators to schedule dispatch of wind farms.

Challenges #

Requires large training datasets; hyperparameter tuning is computationally intensive.

AutoRegressive Integrated Moving Average (ARIMA) #

A statistical time‑series model that captures autocorrelation, differencing, and moving‑average components.

Related terms #

stationarity, Box‑Jenkins.

Explanation #

ARIMA predicts future values by modelling past observations and error terms, often serving as a baseline for renewable forecasts.

Example #

Forecasting daily solar generation using historical irradiance data.

Practical application #

Provides quick, interpretable benchmarks for solar plant operators.

Challenges #

Struggles with non‑linear patterns and abrupt weather changes; requires manual order selection.

Bidirectional Long Short‑Term Memory (Bi‑LSTM) #

An extension of LSTM networks that processes sequences forward and backward, capturing past and future context.

Related terms #

sequence modelling, recurrent neural network.

Explanation #

Bi‑LSTM layers retain information from both directions, improving temporal dependencies in energy forecasts.

Example #

Predicting hourly wind speed using past 48 hours and future meteorological model outputs.

Practical application #

Enhances day‑ahead forecasts for wind farms integrated into market scheduling.

Challenges #

Higher computational cost; risk of over‑parameterisation with limited data.

Capacity Factor (CF) #

The ratio of actual energy produced by a renewable asset over a period to its theoretical maximum output.

Related terms #

capacity, utilization.

Explanation #

CF = (actual generation) / (maximum possible generation) and reflects asset performance.

Example #

A solar farm with a 20 % capacity factor generates 20 % of its name‑plate capacity over a year.

Practical application #

Used by investors to assess profitability and by planners to size storage.

Challenges #

Varies widely with location, technology, and weather; forecasting CF requires accurate climate data.

Convolutional Neural Network (CNN) #

A deep learning architecture that applies convolutional filters to extract spatial features from data.

Related terms #

kernel, pooling.

Explanation #

CNNs are adept at processing gridded data such as satellite images or numerical weather prediction (NWP) fields.

Example #

Using satellite cloud imagery to predict solar irradiance for PV farms.

Practical application #

Provides high‑resolution short‑term forecasts where traditional models lack spatial detail.

Challenges #

Requires large labelled datasets; may be less effective for purely temporal sequences without spatial dimensions.

Cross‑Validation (CV) #

A statistical technique for assessing model performance by partitioning data into training and testing subsets multiple times.

Related terms #

k‑fold, hold‑out.

Explanation #

CV mitigates over‑fitting by evaluating models on unseen data across several folds.

Example #

5‑fold CV for a gradient‑boosted tree predicting solar output.

Practical application #

Helps select hyper‑parameters for AI models used in renewable forecasting.

Challenges #

In time‑series data, random shuffling can break temporal dependencies; must use rolling‑origin CV.

Data Assimilation (DA) #

The process of integrating observational data with model outputs to improve forecast accuracy.

Related terms #

Kalman filter, ensemble.

Explanation #

DA updates NWP or AI model states using real‑time measurements, reducing uncertainty.

Example #

Incorporating ground‑based pyranometer readings into a solar forecast model.

Practical application #

Provides more reliable day‑ahead forecasts for solar farms.

Challenges #

Requires sophisticated algorithms and high‑frequency observations; computationally demanding.

Deep Learning (DL) #

A subset of machine learning that uses multi‑layer neural networks to learn hierarchical representations.

Related terms #

feature extraction, backpropagation.

Explanation #

DL models automatically discover complex patterns in large datasets, making them suitable for weather‑driven energy forecasts.

Example #

A stacked auto‑encoder compresses NWP variables before feeding them to a regression layer for wind power prediction.

Practical application #

Enables end‑to‑end forecasting pipelines from raw satellite data to power output.

Challenges #

High data and compute requirements; interpretability remains limited.

Ensemble Forecasting #

Combining multiple models or predictions to produce a single, more robust forecast.

Related terms #

bagging, model averaging.

Explanation #

Ensembles reduce variance and bias by aggregating diverse forecasts, often improving reliability.

Example #

Averaging outputs from a physics‑based NWP model and an AI‑based regression model for wind speed.

Practical application #

Provides probabilistic forecasts for grid operators to manage reserve margins.

Challenges #

Determining optimal weighting; increased computational load.

Feature Engineering #

The process of creating informative variables from raw data to improve model performance.

Related terms #

dimensionality reduction, transformation.

Explanation #

In renewable forecasting, features may include lagged power, sky‑clearness indices, or terrain attributes.

Example #

Deriving a “clear‑sky index” from satellite radiance to enhance PV output predictions.

Practical application #

Boosts accuracy of machine‑learning models where raw inputs are noisy or high‑dimensional.

Challenges #

Requires domain expertise; risk of leakage if future information is inadvertently used.

Forecast Horizon #

The future time interval over which predictions are made, e.g., minutes, hours, days.

Related terms #

lead time, temporal resolution.

Explanation #

Different horizons demand distinct modelling approaches; short‑term forecasts rely on high‑frequency data, while long‑term use climatology.

Example #

A 15‑minute horizon for inverter curtailment decisions versus a 24‑hour horizon for market bidding.

Practical application #

Guides selection of AI model architecture and input data frequency.

Challenges #

Maintaining accuracy across multiple horizons; balancing computational cost with forecast granularity.

Gaussian Process Regression (GPR) #

A non‑parametric Bayesian method that provides both predictions and uncertainty estimates.

Related terms #

kernel function, covariance.

Explanation #

GPR models the distribution over functions, yielding a mean forecast and confidence intervals.

Example #

Predicting solar PV output with associated variance to inform storage dispatch.

Practical application #

Offers probabilistic forecasts crucial for risk‑aware grid operation.

Challenges #

Scales poorly with large datasets (O(n³) complexity); requires careful kernel selection.

Generalized Additive Model (GAM) #

A statistical model where the response variable is expressed as a sum of smooth functions of predictors.

Related terms #

spline, link function.

Explanation #

GAMs capture non‑linear relationships while remaining interpretable.

Example #

Modelling solar irradiance as smooth functions of cloud cover, humidity, and time of day.

Practical application #

Provides transparent forecasts for regulatory reporting.

Challenges #

Limited ability to capture complex interactions; may under‑perform deep‑learning models on high‑dimensional data.

Gradient Boosting Machine (GBM) #

An ensemble technique that builds trees sequentially, each correcting errors of its predecessor.

Related terms #

learning rate, decision tree.

Explanation #

GBMs minimise a loss function by adding weak learners, yielding strong predictive performance.

Example #

XGBoost predicts wind farm power using meteorological and turbine‑specific features.

Practical application #

Widely adopted for day‑ahead renewable forecasts due to speed and accuracy.

Challenges #

Hyperparameter tuning is critical; can over‑fit if trees become too deep.

Hybrid Modelling #

Combining physics‑based and data‑driven approaches to leverage strengths of both.

Related terms #

physics‑informed neural network, model coupling.

Explanation #

Hybrid models embed physical constraints within AI architectures or fuse outputs from separate models.

Example #

Integrating a physical PV performance model with a neural network that learns residual errors.

Practical application #

Improves forecast reliability under extreme weather where pure AI models may extrapolate poorly.

Challenges #

Designing appropriate coupling strategies; increased model complexity.

Hyperparameter Tuning #

The process of selecting optimal configuration settings for a learning algorithm.

Related terms #

grid search, Bayesian optimisation.

Explanation #

Hyperparameters (e.g., learning rate, number of layers) are not learned from data and must be set before training.

Example #

Using random search to find the best dropout rate for a LSTM forecasting wind speed.

Practical application #

Directly impacts forecast accuracy and model stability for renewable energy applications.

Challenges #

Computationally expensive; risk of over‑optimising on validation data.

Imbalanced Data #

A situation where certain classes or value ranges dominate the dataset, common in rare event forecasting.

Related terms #

oversampling, minority class.

Explanation #

Models may become biased toward majority patterns, reducing performance on under‑represented events (e.g., sudden cloud breaks).

Example #

Using SMOTE to synthetically increase low‑irradiance samples for PV forecasting.

Practical application #

Improves detection of extreme under‑production events that affect grid reliability.

Challenges #

Synthetic data may not capture true physical dynamics; careful validation required.

Inference Engine #

The component that applies a trained AI model to new data to generate predictions.

Related terms #

deployment, latency.

Explanation #

Inference must be efficient to meet real‑time forecasting requirements.

Example #

Serving a trained CNN on edge devices at solar farms for on‑site power estimation.

Practical application #

Enables low‑latency forecasts for inverter control and demand‑response actions.

Challenges #

Model size and hardware constraints; ensuring consistency between training and inference environments.

Integration Window #

The time span over which observational data are merged with model outputs during data assimilation.

Related terms #

assimilation cycle, observation latency.

Explanation #

Selecting an appropriate window balances timeliness against data availability.

Example #

A 30‑minute integration window for assimilating wind profiler measurements into a short‑term wind forecast.

Practical application #

Improves accuracy of very‑short‑term forecasts used for turbine curtailment decisions.

Challenges #

Delayed observations can degrade forecast quality; optimal window varies with sensor network density.

Joint Forecasting #

Simultaneously predicting multiple correlated variables, such as wind speed and direction, or solar irradiance and temperature.

Related terms #

multivariate, co‑prediction.

Explanation #

Joint models exploit inter‑variable dependencies to improve overall accuracy.

Example #

A multi‑output LSTM that outputs both wind speed and power output for a turbine.

Practical application #

Provides comprehensive inputs for dispatch optimisation and market bidding.

Challenges #

Increased model complexity; need for consistent training data across all variables.

K #

Nearest Neighbors (KNN): A simple, instance‑based learning algorithm that predicts outcomes based on the most similar historical observations.

Related terms #

distance metric, lazy learning.

Explanation #

For a new input, KNN finds the k closest points in feature space and averages their target values.

Example #

Estimating solar PV output by averaging the power of past days with similar cloud cover patterns.

Practical application #

Offers an interpretable baseline for rapid prototyping of forecasts.

Challenges #

Sensitive to feature scaling; computationally intensive for large datasets.

Kalman Filter (KF) #

An algorithm that recursively estimates the state of a dynamic system by blending predictions with noisy measurements.

Related terms #

state‑space, covariance.

Explanation #

KF provides optimal linear estimation under Gaussian noise assumptions.

Example #

Updating wind speed forecasts using real‑time anemometer data on a turbine hub.

Practical application #

Enables continuous refinement of short‑term forecasts for turbine control.

Challenges #

Linear assumption limits performance for highly non‑linear atmospheric processes; extensions like EKF or UKF increase complexity.

Knowledge Distillation #

A technique where a large, complex “teacher” model transfers its learned representations to a smaller “student” model.

Related terms #

model compression, teacher‑student.

Explanation #

The student model learns to mimic the teacher’s outputs, retaining performance with reduced size.

Example #

Compressing a deep CNN into a lightweight network for on‑site solar forecasting on embedded hardware.

Practical application #

Facilitates deployment of AI models on edge devices with limited compute.

Challenges #

Maintaining accuracy after compression; selecting appropriate temperature and loss functions.

Latent Variable Model #

A statistical model that includes unobserved (latent) variables to capture hidden structure in data.

Related terms #

factor analysis, hidden Markov model.

Explanation #

Latent variables can represent underlying weather regimes influencing renewable generation.

Example #

Using a hidden Markov model to infer cloud cover states that affect solar PV output.

Practical application #

Improves forecasting during periods with sparse observations.

Challenges #

Model identification can be difficult; convergence may depend on initialisation.

Lead‑Time Bias #

Systematic error that varies with the forecast horizon, often caused by model drift or data latency.

Related terms #

systematic error, bias correction.

Explanation #

Forecasts may consistently over‑ or under‑predict as the lead time increases.

Example #

A wind forecast model that underestimates speeds beyond 6 hours due to outdated boundary conditions.

Practical application #

Identifying and correcting lead‑time bias improves market participation of renewable assets.

Challenges #

Requires continuous monitoring and dynamic bias‑adjustment mechanisms.

Loss Function #

A mathematical expression that quantifies the difference between predicted and actual values during model training.

Related terms #

mean squared error, cross‑entropy.

Explanation #

Minimising the loss drives the learning algorithm toward better forecasting performance.

Example #

Using mean absolute percentage error (MAPE) as the loss for PV power prediction to emphasise relative errors.

Practical application #

Choice of loss influences model sensitivity to outliers and extreme events.

Challenges #

Selecting a loss that aligns with business objectives (e.g., revenue vs. reliability) can be non‑trivial.

Meta‑Learning #

“Learning to learn” where an algorithm adapts quickly to new tasks using knowledge acquired from previous tasks.

Related terms #

few‑shot learning, model‑agnostic meta‑learning.

Explanation #

Meta‑learners can rapidly update forecasting models for a newly commissioned wind farm with limited data.

Example #

Applying MAML to adapt a generic wind power predictor to a specific turbine’s performance.

Practical application #

Reduces data‑collection time for new renewable projects.

Challenges #

Requires diverse meta‑training tasks; risk of negative transfer if tasks are dissimilar.

Monte Carlo Simulation #

A computational technique that uses random sampling to estimate the probability distribution of outcomes.

Related terms #

stochastic modelling, uncertainty quantification.

Explanation #

In renewable forecasting, Monte Carlo methods generate ensembles of possible generation profiles.

Example #

Simulating 10 000 possible solar irradiance trajectories to assess expected PV output variance.

Practical application #

Supports risk‑aware planning for battery storage sizing.

Challenges #

Computationally intensive; quality depends on the underlying probability models.

Multivariate Time Series #

A collection of interrelated temporal sequences, such as wind speed, direction, and temperature, recorded simultaneously.

Related terms #

vector autoregression, cross‑correlation.

Explanation #

Joint modelling captures dependencies that single‑variable approaches miss.

Example #

A vector‑autoregressive (VAR) model forecasts wind speed and direction together for turbine yaw control.

Practical application #

Improves coordination of multiple renewable assets in a microgrid.

Challenges #

High dimensionality can lead to over‑parameterisation; requires careful variable selection.

Neural Architecture Search (NAS) #

An automated process that discovers optimal neural network structures for a given task.

Related terms #

search space, reinforcement learning.

Explanation #

NAS algorithms evaluate many candidate architectures to find the best performing model for renewable forecasting.

Example #

Using NAS to design a lightweight CNN for on‑site solar irradiance prediction.

Practical application #

Reduces manual effort in model design while achieving state‑of‑the‑art accuracy.

Challenges #

Search is computationally expensive; risk of over‑fitting to training data.

Normalization #

Scaling input features to a common range or distribution to improve model convergence.

Related terms #

standardisation, min‑max scaling.

Explanation #

Normalisation mitigates the impact of differing units (e.g., wind speed vs. temperature) on learning.

Example #

Applying z‑score normalisation to meteorological variables before feeding them into a LSTM.

Practical application #

Accelerates training of deep models for renewable forecasts.

Challenges #

Must apply the same scaling to inference data; outliers can distort scaling parameters.

Oblique Decision Tree #

A type of decision tree that splits data using linear combinations of features rather than single variables.

Related terms #

hyperplane, CART.

Explanation #

Oblique splits can capture interactions between predictors, potentially improving forecast precision.

Example #

Splitting based on a weighted sum of wind speed and temperature to predict turbine power.

Practical application #

Provides interpretable models for regulatory reporting while handling feature interactions.

Challenges #

Training is more complex than axis‑aligned trees; may be prone to over‑fitting.

Out‑of‑Sample Testing #

Evaluating model performance on data that were not used during training or validation.

Related terms #

hold‑out set, generalisation.

Explanation #

Out‑of‑sample tests assess how well a forecasting model will perform in real operational conditions.

Example #

Testing a solar forecast model on a year of data from a different geographical region.

Practical application #

Ensures robustness before deploying AI models in production environments.

Challenges #

Selecting representative test sets; temporal shifts can make “out‑of‑sample” very different from training data.

Over‑fitting #

When a model learns noise and specific patterns in the training data, reducing its ability to generalise.

Related terms #

regularisation, validation loss.

Explanation #

Over‑fitted models show low training error but high error on unseen data.

Example #

A deep network that perfectly fits historical solar output but fails on a cloudy summer day.

Practical application #

Detecting over‑fitting is essential for reliable renewable forecasts.

Challenges #

Balancing model capacity with data volume; employing techniques such as dropout or early stopping.

Parameter Sharing #

Reusing the same set of weights across different parts of a neural network, common in convolutional and recurrent layers.

Related terms #

weight tying, translational invariance.

Explanation #

Sharing reduces the number of trainable parameters, improving efficiency and generalisation.

Example #

Convolutional filters applied across the entire satellite image for solar forecasting.

Practical application #

Enables compact models suitable for deployment on low‑power devices at renewable sites.

Challenges #

May limit model flexibility if the shared parameters cannot capture local variations.

Partial Autocorrelation Function (PACF) #

A statistical tool that measures the correlation between a time series and its lagged values after removing intermediate lags.

Related terms #

ACF, lag selection.

Explanation #

PACF assists in identifying the appropriate order of autoregressive components in ARIMA models.

Example #

Using PACF to decide on the number of lagged wind speed terms in a time‑series model.

Practical application #

Improves model specification for traditional statistical forecasting of renewable generation.

Challenges #

Interpretation can be ambiguous when data are noisy; requires stationarity.

Physics‑Informed Neural Network (PINN) #

A neural network that incorporates governing physical equations into its loss function.

Related terms #

regularisation, differential equation.

Explanation #

PINNs enforce physical consistency, reducing reliance on large datasets.

Example #

Embedding the solar cell efficiency equation into a network that learns residual errors from measured PV output.

Practical application #

Enhances forecast reliability under extrapolation beyond observed conditions.

Challenges #

Formulating appropriate physics constraints; increased training complexity.

Probabilistic Forecasting #

Generating predictions that include a probability distribution rather than a single point estimate.

Related terms #

prediction interval, quantile regression.

Explanation #

Probabilistic forecasts convey uncertainty, enabling risk‑aware decision making.

Example #

Producing 10 %, 50 %, and 90 % quantiles for hourly wind power.

Practical application #

Supports market participants in bidding strategies and reserve allocation.

Challenges #

Calibration of probabilities; computational cost of generating ensembles.

Quantile Regression #

A regression technique that estimates conditional quantiles of the response variable, useful for constructing prediction intervals.

Related terms #

pinball loss, asymmetric loss.

Explanation #

By modelling different quantiles, the approach yields a full predictive distribution.

Example #

Predicting the 5th and 95th percentiles of solar PV output to assess low‑generation risk.

Practical application #

Provides actionable uncertainty bounds for grid operators.

Challenges #

Requires sufficient data to estimate extreme quantiles; may produce crossing quantiles without constraints.

Recurrent Neural Network (RNN) #

A class of neural networks designed to handle sequential data by maintaining hidden states across time steps.

Related terms #

temporal dependence, vanishing gradient.

Explanation #

RNNs capture temporal dynamics essential for time‑series forecasting of renewable generation.

Example #

An RNN that predicts wind turbine power using past 24 hours of wind speed and turbine status.

Practical application #

Enables short‑term forecasts for turbine control and curtailment.

Challenges #

Standard RNNs suffer from gradient issues; LSTM or GRU units are often preferred.

Residual Learning #

Training a model to predict the difference (residual) between a baseline forecast and the actual observation.

Related terms #

error correction, stacking.

Explanation #

By focusing on residuals, the model learns to correct systematic errors of the baseline.

Example #

Using a shallow neural network to adjust the output of a physics‑based solar model.

Practical application #

Improves accuracy without discarding valuable physical insights.

Challenges #

Residuals may be noisy; requires careful handling to avoid amplifying errors.

Ridge Regression #

A linear regression technique that adds an L2 penalty to the loss function to shrink coefficients and reduce multicollinearity.

Related terms #

regularisation, shrinkage.

Explanation #

The penalty discourages large weights, stabilising the model in the presence of correlated predictors.

Example #

Forecasting wind power using correlated wind speed and turbulence intensity variables.

Practical application #

Provides a simple yet robust baseline for renewable energy forecasting.

Challenges #

Selecting the regularisation parameter λ; may not capture non‑linear relationships.

Scaling Laws #

Empirical relationships that describe how model performance changes with data size, compute, or architecture depth.

Related terms #

power law, efficiency.

Explanation #

Understanding scaling helps plan resource allocation for training large AI models in renewable forecasting.

Example #

Observing that forecast error decreases proportionally to the square root of the training data volume.

Practical application #

Guides investment decisions for data collection and compute infrastructure.

Challenges #

Scaling behaviour may differ across domains; extrapolation beyond observed ranges can be unreliable.

Seasonal Decomposition #

Breaking a time series into trend, seasonal, and residual components to isolate patterns.

Related terms #

STL, additive model.

Explanation #

Decomposition helps identify recurring seasonal effects in solar or wind generation.

Example #

Using STL to separate the daily solar cycle from stochastic cloud variations.

Practical application #

Improves model training by providing deseasonalised inputs.

Challenges #

Requires sufficient historical data; may misclassify irregular events as seasonal.

Spatial Interpolation #

Estimating values at unsampled locations using measurements from nearby sites.

Related terms #

kriging, inverse distance weighting.

Explanation #

Interpolation fills gaps in sensor networks, providing complete fields for AI models.

Example #

Interpolating wind speed from a sparse anemometer network to generate input grids for a turbine‑level forecast.

Practical application #

Enables high‑resolution forecasting where direct measurements are unavailable.

Challenges #

Accuracy depends on sensor density and terrain complexity; computational cost can be high for large domains.

Stochastic Gradient Descent (SGD) #

An optimisation algorithm that updates model parameters using a random subset (mini‑batch) of data at each iteration.

Related terms #

learning rate, momentum.

Explanation #

SGD reduces computational load compared to full‑batch gradient descent, facilitating training on large datasets.

Example #

Training a deep LSTM for wind power prediction with mini‑batches of 64 samples.

Practical application #

Enables rapid model updates as new weather observations become available.

Challenges #

Requires careful tuning of learning rate and batch size; may converge to sub‑optimal minima.

Support Vector Regression (SVR) #

A regression variant of support vector machines that finds a function within a tolerance margin (ε) while maximising flatness.

Related terms #

kernel trick, margin.

Explanation #

SVR handles non‑linear relationships by mapping inputs into high‑dimensional spaces.

Example #

Predicting solar PV output using a radial basis function kernel to capture complex irradiance patterns.

Practical application #

Offers a robust alternative to deep learning when data are limited.

Challenges #

Sensitive to hyperparameter selection; scaling to large datasets can be problematic.

Time‑Series Cross‑Validation #

A validation strategy that respects temporal ordering by training on past data and testing on future windows.

Related terms #

rolling origin, walk‑forward.

Explanation #

This approach prevents data leakage and provides realistic performance estimates for forecasting models.

Example #

Using a rolling 30‑day training window to evaluate a 24‑hour ahead solar forecast model.

Practical application #

Guides model selection for operational renewable forecasting pipelines.

Challenges #

Computationally intensive; choosing appropriate window sizes requires domain knowledge.

Transfer Learning #

Reusing a pre‑trained model on a related task, often with fine‑tuning on a smaller target dataset.

Related terms #

fine‑tuning, domain adaptation.

Explanation #

Transfer learning accelerates model development for new renewable sites with limited data.

Example #

Adapting a generic wind power predictor to a specific turbine type by retraining the final layers.

Practical application #

Reduces data collection time and improves early‑stage forecast accuracy.

Challenges #

Negative transfer can occur if source and target domains differ significantly.

Uncertainty Quantification (UQ) #

The process of characterising and propagating uncertainties through models to assess confidence in forecasts.

Related terms #

Monte Carlo, Bayesian inference.

Explanation #

UQ provides probabilistic information essential for risk‑aware decision making.

Example #

Quantifying the impact of sensor noise on wind speed forecasts using Bayesian neural networks.

Practical application #

Supports reserve planning and insurance calculations for renewable assets.

Challenges #

Computationally expensive; requires accurate representation of all uncertainty sources.

Validation Set #

A subset of data used to tune model hyperparameters and assess performance during development, distinct from training and test sets.

Related terms #

hold‑out, early stopping.

Explanation #

The validation set provides feedback on generalisation without contaminating the final test evaluation.

Example #

Reserving 15 % of a solar dataset for validation while training a deep learning model.

Practical application #

Prevents over‑fitting and guides model selection for deployment.

Challenges #

In time‑series contexts, random splits may violate temporal dependencies; careful temporal partitioning is needed.

Variational Autoencoder (VAE) #

A generative neural network that learns a probabilistic latent space, enabling synthesis of new data samples.

Related terms #

latent distribution, reconstruction loss.

Explanation #

VAEs can generate realistic weather fields that feed stochastic renewable forecasts.

Example #

Producing synthetic cloud cover maps to augment training data for solar forecasting.

Practical application #

Addresses data scarcity and improves model robustness to unseen conditions.

Challenges #

Generated samples may lack physical realism; balancing reconstruction and regularisation terms is delicate.

Wavelet Transform #

A signal‑processing technique that decomposes a time series into time‑frequency components using localized basis functions.

Related terms #

scalogram, multi‑resolution.

Explanation #

Wavelets capture both short‑term fluctuations and long‑term trends in renewable generation signals.

Example #

Decomposing wind speed into high‑frequency gusts and low‑frequency trends before feeding into a forecasting model.

Practical application #

Improves feature extraction for AI models handling non‑stationary data.

Challenges #

Choice of mother wavelet and decomposition level influences performance; may increase preprocessing complexity.

Weight Decay #

A regularisation technique that adds an L2 penalty on network weights during training to discourage large coefficients.

Related terms #

regularisation, λ (lambda).

Explanation #

Weight decay helps prevent over‑fitting by shrinking weight magnitudes.

Example #

Applying weight decay to a deep CNN that predicts solar irradiance from satellite images.

Practical application #

Produces smoother models that generalise better to unseen weather patterns.

Challenges #

Selecting the decay factor requires experimentation; excessive decay can under‑fit the data.

XGBoost #

An efficient, scalable implementation of gradient‑boosted decision trees with built‑in regularisation.

Related terms #

tree booster, learning rate.

Explanation #

XGBoost handles missing values, supports parallel processing, and often outperforms other ML methods in tabular data.

Example #

Forecasting wind farm power using turbine metadata, terrain elevation, and NWP variables.

Practical application #

Widely adopted for day‑ahead market bidding due to speed and accuracy.

Challenges #

Hyperparameter optimisation is crucial; model interpretability can be limited compared to simpler regressions.

Yield Curve #

In renewable energy, a representation of expected generation performance across different time horizons or operating conditions.

Related terms #

capacity factor, performance index.

Explanation #

The curve illustrates how output varies with factors such as solar irradiance intensity or wind speed.

Example #

Plotting expected PV output versus solar zenith angle to understand seasonal variations.

Practical application #

Assists investors in evaluating project economics and financing structures.

Challenges #

Accurate estimation requires comprehensive historical data and robust forecasting models.

Zero‑Shot Learning #

A paradigm where a model predicts outcomes for classes or scenarios it has never seen during training.

Related terms #

semantic embedding, attribute transfer.

Explanation #

In renewable forecasting, zero‑shot techniques enable predictions for a newly commissioned turbine with no historical data.

Example #

Using turbine specifications (blade length, rated power) as attributes to infer power curves for a brand‑new turbine model.

Practical application #

Accelerates commissioning of renewable assets by providing immediate forecasts.

Challenges #

Relies heavily on the quality of attribute representations; performance may be lower than models trained on actual data.