Edifice.Contrastive.VICReg (Edifice v0.2.0)

VICReg - Variance-Invariance-Covariance Regularization.

Implements VICReg from "VICReg: Variance-Invariance-Covariance Regularization for Self-Supervised Learning" (Bardes et al., ICLR 2022). VICReg prevents representation collapse through three explicit regularization terms applied directly to the embedding vectors, without requiring negative pairs, asymmetric networks, or momentum encoders.

Key Innovations

Explicit collapse prevention: Three distinct terms each prevent a different mode of collapse
No architectural tricks: Symmetric architecture, no stop-gradient, no momentum encoder, no negative mining
Interpretable loss: Each term has a clear geometric meaning

Loss Terms

Variance (v): Maintain variance of each embedding dimension above a threshold (prevents informational collapse where all embeddings become identical)
Invariance (i): MSE between embeddings of augmented views (ensures representations are view-invariant)
Covariance (c): Decorrelate embedding dimensions (prevents dimensional collapse where all dimensions are correlated)

L = lambda * invariance(Z, Z')
  + mu * [variance(Z) + variance(Z')]
  + nu * [covariance(Z) + covariance(Z')]

Architecture

Augmented View 1         Augmented View 2
      |                         |
      v                         v
+------------+           +------------+
|  Encoder   |           |  Encoder   |  (shared weights)
+------------+           +------------+
      |                         |
      v                         v
+------------+           +------------+
| Projector  |           | Projector  |  (shared weights)
+------------+           +------------+
      |                         |
      v                         v
     Z                         Z'
      |                         |
      +------> VICReg Loss <----+

Usage

model = VICReg.build(encoder_dim: 287, projection_dim: 256)

# Compute loss between two batches of projections
loss = VICReg.vicreg_loss(z, z_prime,
  lambda_inv: 25.0,
  mu_var: 25.0,
  nu_cov: 1.0
)

References

Paper: https://arxiv.org/abs/2105.04906

Summary

Types

build_opt()

Options for build/1.

Functions

build(opts \\ [])

Build a VICReg model (encoder + projector).

covariance_loss(z)

Covariance term: decorrelate embedding dimensions.

default_hidden_size()

Default encoder/projector hidden dimension

default_lambda_inv()

Default invariance loss coefficient

default_mu_var()

Default variance loss coefficient

default_nu_cov()

Default covariance loss coefficient

default_projection_dim()

Default projection head output dimension

default_variance_target()

Default variance target threshold

invariance_loss(z, z_prime)

Invariance term: MSE between the two views.

output_size(opts \\ [])

Get the output size of the VICReg model.

variance_loss(z, target \\ 1.0)

Variance term: hinge loss on standard deviation.

vicreg_loss(z, z_prime, opts \\ [])

Compute the full VICReg loss.