
Proper Batch Normalization with learnable gamma and beta #126

Open

Hitomamacs (Contributor) opened this issue Jan 5, 2025 · 0 comments

Feature Description

A fully-fledged Batch Normalization (BN) layer that follows the standard approach in deep learning. This layer should include learnable parameters (gamma, beta), running statistics (running mean, running variance), and the correct forward/backward passes with mean/variance computation.

Problem Statement

Currently, we only have a min-max normalization layer, which is insufficient for most neural network architectures that rely on BN’s ability to stabilize training by normalizing activations to zero mean and unit variance. Min-max normalization addresses neither per-batch statistics nor learnable scale/shift parameters, which are crucial in many deep-learning models.

Suggested Solution

Implement a Batch Normalization layer that:

1. Maintains Learnable Parameters:
   - Gamma ($\gamma$): a scale parameter.
   - Beta ($\beta$): a shift parameter.
2. Tracks Running Statistics:
   - Running Mean: an exponential moving average of each feature’s mean during training.
   - Running Variance: an exponential moving average of each feature’s variance.
3. Computes Forward Pass:
   - Training Mode: compute the current batch’s mean and variance, normalize the input, and update the running statistics.
   - Inference Mode: normalize using the running mean and variance.
4. Implements Backward Pass:
   - Correctly compute gradients w.r.t. the inputs and the learnable parameters ($\gamma$ and $\beta$), typically via the chain rule through the partial derivatives of the normalization formula (a minimal sketch follows this list).

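As a rough illustration of the points above (not the project’s existing API: the class name `BatchNorm`, the `momentum`/`eps` defaults, and the `(batch, num_features)` layout are assumptions), a minimal NumPy sketch of the forward and backward passes could look like:

```python
import numpy as np

class BatchNorm:
    """Illustrative BN layer; names and signature are assumptions, not the project's API."""

    def __init__(self, num_features, momentum=0.1, eps=1e-5):
        self.gamma = np.ones(num_features)          # learnable scale
        self.beta = np.zeros(num_features)          # learnable shift
        self.running_mean = np.zeros(num_features)  # EMA of batch means
        self.running_var = np.ones(num_features)    # EMA of batch variances
        self.momentum = momentum
        self.eps = eps
        self._cache = None

    def forward(self, x, is_training=True):
        # x is assumed to have shape (batch, num_features).
        if is_training:
            mean = x.mean(axis=0)
            var = x.var(axis=0)
            # Exponential moving average of the batch statistics.
            self.running_mean = (1 - self.momentum) * self.running_mean + self.momentum * mean
            self.running_var = (1 - self.momentum) * self.running_var + self.momentum * var
        else:
            mean, var = self.running_mean, self.running_var

        x_hat = (x - mean) / np.sqrt(var + self.eps)
        self._cache = (x_hat, var)
        return self.gamma * x_hat + self.beta

    def backward(self, dout):
        # Chain rule through y = gamma * x_hat + beta and
        # x_hat = (x - mean) / sqrt(var + eps), using the training-mode cache.
        x_hat, var = self._cache
        n = dout.shape[0]

        dgamma = np.sum(dout * x_hat, axis=0)
        dbeta = np.sum(dout, axis=0)

        dx_hat = dout * self.gamma
        # Standard simplified expression for dL/dx:
        # dx = (N * dx_hat - sum(dx_hat) - x_hat * sum(dx_hat * x_hat)) / (N * sqrt(var + eps))
        dx = (dx_hat * n - dx_hat.sum(axis=0) - x_hat * (dx_hat * x_hat).sum(axis=0)) / (
            n * np.sqrt(var + self.eps)
        )
        return dx, dgamma, dbeta
```

The simplified `dx` expression folds the mean and variance gradient terms into a single pass; an equivalent formulation computes the gradients w.r.t. the batch mean and variance explicitly and then combines them.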
Additional Context

- BN often accelerates convergence and improves the stability of deep networks.
- Proper dimension checks should be enforced, ensuring the last dimension (or a specified dimension) matches the number of features.
- We may use an additional flag (e.g., is_training) to switch between training and inference modes (see the short usage sketch below).
- The layer’s interface should be consistent with existing layers, supporting init, forward, backward, and possibly separate functions or arguments for updating $\gamma$ and $\beta$.

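A short usage sketch (again assuming the hypothetical names from the snippet above) of how the training/inference switch could behave:

```python
import numpy as np

rng = np.random.default_rng(0)
bn = BatchNorm(num_features=64)  # hypothetical layer from the sketch above

# Training step: batch statistics are used and the running stats are updated.
x_batch = rng.normal(size=(32, 64))
y = bn.forward(x_batch, is_training=True)
dx, dgamma, dbeta = bn.backward(np.ones_like(y))

# Inference: the stored running mean/variance are used instead of batch statistics.
x_eval = rng.normal(size=(8, 64))
y_eval = bn.forward(x_eval, is_training=False)
```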
Huly®: X_PI-158
