Physics and Algorithm Guide: VibeSpin

This document details the theoretical foundations, physical models, and algorithmic requirements of the VibeSpin framework.

1. Physical Models

VibeSpin implements three foundational 2D lattice spin models, each defined by its Hamiltonian and state space. They span the full spectrum from discrete to continuous on-site symmetry, and together they cover the major universality classes accessible in two dimensions.

Ising Model

For a comprehensive introduction to the Ising model and its exact solution in two dimensions, see Onsager [1]. See also the Ising model article on Wikipedia for a modern summary.

The Ising model assigns a scalar spin \(s_i \in \{+1, -1\}\) to every site of a square lattice. Nearest-neighbor pairs interact through the Hamiltonian

\[E = -J \sum_{\langle i,j \rangle} s_i s_j,\]

where \(\langle i,j \rangle\) runs over all distinct neighbor bonds. The competition between the ferromagnetic coupling \(J > 0\), which favors alignment, and thermal fluctuations drives a second-order phase transition at the Onsager critical point \(T_c \approx 2.269\,J/k_B\). Below \(T_c\) the system spontaneously magnetizes; above it, entropy dominates and the net magnetization vanishes.

XY Model

In the XY model each spin is a 2D unit vector \(\mathbf{s}_i = (\cos\theta_i,\,\sin\theta_i)\) free to point in any planar direction. The Hamiltonian takes the form

\[E = -J \sum_{\langle i,j \rangle} \cos(\theta_i - \theta_j).\]

Because continuous symmetry cannot break spontaneously in two dimensions (Mermin–Wagner theorem), the XY model does not develop true long-range order at any finite temperature. Instead it undergoes the Berezinskii–Kosterlitz–Thouless (BKT) transition: at low temperature, bound vortex–antivortex pairs maintain quasi-long-range order with algebraically decaying correlations, while above \(T_{\mathrm{BKT}}\) the pairs unbind and correlations decay exponentially. This topological mechanism makes the 2D XY model qualitatively distinct from conventional order–disorder transitions. For background, see Kosterlitz and Thouless [2], Mermin and Wagner [3], and Villain [11]. See also the XY model article on Wikipedia.

q-state Clock Model

The clock model interpolates between the Ising limit (\(q = 2\)) and the XY limit (\(q \to \infty\)) by restricting spins to \(q\) equally spaced angles \(\theta_k = 2\pi k / q\). VibeSpin provides two representations. The continuous form retains the XY interaction and adds an anisotropy potential that pins spins toward the discrete directions. For a review, see Lapilli et al. [4]. See also the Clock model (Vector Potts model) article on Wikipedia.

\[E = -J \sum_{\langle i,j \rangle} \cos(\theta_i - \theta_j) \;-\; A \sum_i \cos(q\,\theta_i).\]

The discrete form evaluates the same interaction directly on integer state indices \(k_i \in \{0,\dots,q-1\}\) using precomputed cosine lookup tables, eliminating per-site trigonometric calls. For large \(q\) the model exhibits two successive BKT-type crossovers — one for the onset of quasi-long-range order and a lower one for the discrete locking transition — while for small \(q\) the behavior collapses to an Ising-like single transition.

2. The Metropolis-Hastings Algorithm

All simulations in VibeSpin sample the Boltzmann distribution \(P(s) \propto \exp(-\beta E(s))\) via the Metropolis-Hastings algorithm. For a pedagogical introduction, see Hastings [5]. See also the Metropolis–Hastings algorithm article on Wikipedia.

Detailed Balance

Convergence to the target distribution requires that every pair of configurations \(A\) and \(B\) satisfies \(P(A)\,W(A \to B) = P(B)\,W(B \to A)\). The Metropolis acceptance rule

\[P_{\mathrm{acc}} = \min\!\bigl(1,\;\exp(-\beta\,\Delta E)\bigr)\]

fulfills this condition exactly, accepting all energy-lowering moves and accepting energy-raising moves with an exponentially suppressed probability.

Ergodicity

The proposal distribution must connect every configuration to every other in a finite number of steps. In the Ising model, single-spin flips are sufficient because any configuration can be reached one flip at a time. The discrete clock model draws each proposal uniformly from the full set of \(q\) states (randint(0, q)), so every site can reach any allowed orientation in a single move. For the XY model, uniform phase perturbations drawn from \([-\delta,\,\delta]\) ensure that successive proposals can accumulate to traverse the entire \([0, 2\pi)\) circle.

Update Schemes

VibeSpin enforces a strict separation between two update strategies, each valid only in its own physical regime. Checkerboard updates are used exclusively for equilibrium and thermodynamic measurements: the lattice is divided into two independent sublattices (analogous to the black and white squares of a chessboard), and each sublattice is swept in parallel. Because no two simultaneously updated sites share a neighbor, this scheme is both correct and highly vectorizable, maximizing SIMD and multi-core throughput. Random site selection is mandatory for kinetics and non-equilibrium dynamics: \(N^2\) sites are chosen uniformly at random per sweep, preserving the exact stochastic trajectory needed to study coarsening, domain growth, and aging phenomena. Mixing the two schemes across regimes would invalidate either the parallelism guarantee or the physical time evolution.

3. Physical Observables

Thermodynamic Averages

For a pedagogical introduction to thermodynamic observables in spin models, see Huang [6]. See also the Magnetization, Magnetic susceptibility, and Heat capacity articles on Wikipedia.

Temperature-sweep simulations also compute the entropy by integrating the specific-heat curve downward from a high-temperature reference:

\[S(T) = S_{\mathrm{ref}} - \int_T^{T_{\mathrm{ref}}} \frac{C_v(T')}{T'}\,dT'.\]

The highest simulated temperature serves as the reference point. For clock models the absolute high-temperature limit is \(S_{\mathrm{ref}} = \ln q\) per site (in units of \(k_B\)), corresponding to equipartition over all \(q\) orientations.

Finally, the integrated autocorrelation time \(\tau_{\mathrm{int}}\), extracted from the magnetization time series, quantifies how many sweeps separate statistically independent samples [12]. Near a critical point \(\tau_{\mathrm{int}}\) diverges — the hallmark of critical slowing down — and its magnitude directly governs the statistical efficiency of the Monte Carlo run. To ensure measurements are strictly taken from the stationary distribution, VibeSpin uses a Two-Start Convergence equilibration routine: at each \((T, \text{seed})\) point, two independent simulations are launched — one from a random (disordered) state and one from a fully aligned (ordered) state. The burn-in proceeds in chunks until both smoothed magnetization traces satisfy a mutual cross-band test: the random-start trace must lie within a band of \(\pm k\) standard deviations of the ordered-start tail mean, and the ordered-start trace must simultaneously lie within \(\pm k\) standard deviations of the random-start tail mean. A sigma floor prevents band collapse when either trace is nearly variance-free.

This protocol includes Quasi-Steady Stuck Detection to handle the physical reality of domain-wall trapping in the deep ordered phase. If the random-start trace becomes stranded in a metastable plateau while the ordered-start trace remains stable and cleanly ordered (magnetization \(|M| \approx 1\)), the system identifies the stuck state and declares convergence early. In such cases, the engine automatically falls back to the stable ordered-start simulation for all subsequent measurements. This non-equilibrium diagnostic guarantees that the system has reached the global minimum or a physically representative stationary state before any thermodynamic measurements are recorded. To maximize hardware utilization, these points are parallelized individually across the worker pool, ensuring full CPU throughput even for single-seed sweeps.

Spatial Diagnostics

For a review of spatial diagnostics and correlation functions, see Goldenfeld [7]. See also the Correlation function and Structure factor articles on Wikipedia.

Topological Diagnostics

For the BKT transition and topological diagnostics, see Kosterlitz and Thouless [2]. See also the BKT transition, Vortex, and Superfluid stiffness (Helicity modulus) articles on Wikipedia.

4. The Wolff Cluster Algorithm

Motivation: Critical Slowing Down

The Metropolis single-spin-flip algorithm becomes increasingly inefficient as a continuous phase transition is approached. Near the critical point the correlation length \(\xi\) diverges, and the spin configurations develop large coherent domains whose characteristic size \(\xi\) sets the natural unit of any proposed single-site change. Because a single flip disturbs only one spin at a time, the algorithm must perform \(O(\xi^z)\) sweeps to decorrelate the system from one independent sample to the next, where the dynamic critical exponent \(z \approx 2.17\) for the 2D Ising model. This quadratic growth of autocorrelation time with lattice size — the critical slowing down — makes precise equilibrium measurements near \(T_c\) computationally expensive with Metropolis alone.

The Wolff cluster algorithm drastically reduces critical slowing down by operating at the scale of the correlated domain rather than the individual spin. Rather than proposing a single flip, it grows an entire correlated cluster and flips it as a single collective move. The dynamic exponent in cluster-step units is \(z^{\mathrm{cs}} \approx 0.5\) asymptotically. However, a single cluster-step flips an \(O(L^{\gamma/\nu})\)-size cluster (where \(\gamma/\nu = 7/4\) for the 2D Ising model), so normalizing to equivalent-sweep units yields \(z^{\mathrm{sw}} = z^{\mathrm{cs}} - 1/4 \approx 0.25\) — more than an order-of-magnitude reduction compared to Metropolis. Measured values of \(z^{\mathrm{cs}}\) at finite lattice sizes (e.g., \(L = 16\)–\(128\)) can deviate from the asymptotic value due to finite-size effects.

Ising Wolff: Fortuin-Kasteleyn Construction

The theoretical basis for the Ising Wolff algorithm is the Fortuin-Kasteleyn (FK) random-cluster representation, which maps the Ising partition function onto a bond-percolation problem. For the FK construction, see Fortuin and Kasteleyn [8]. See also the Random cluster model (Fortuin–Kasteleyn representation) article on Wikipedia.

\[P_{\mathrm{add}} = 1 - e^{-2\beta J}.\]

After the cluster \(\mathcal{C}\) is fully grown, all spins in \(\mathcal{C}\) are flipped simultaneously: \(\sigma_i \to -\sigma_i\) for all \(i \in \mathcal{C}\). The bond probability is derived precisely so that this collective move satisfies detailed balance without any rejection step — the cluster flip is always accepted. This zero-rejection property, combined with the divergence of the mean cluster size \(\langle|\mathcal{C}|\rangle \sim \xi^{d_f}\) at \(T_c\) (where \(d_f\) is the fractal dimension of the FK cluster), is the mechanism behind the dramatic acceleration near criticality.

Unit Convention. Reported measurements of \(\tau_{\mathrm{int}}\) and the dynamic exponent \(z\) depend critically on how a “step” is defined. In cluster-step units (one cluster-flip as one step), \(\tau_{\mathrm{int}}\) and \(z\) are measured directly from the Wolff trajectory. To convert to sweep-equivalent units (matching Metropolis, where one step = \(N^2\) single-flip attempts), we normalize by the mean cluster size: \(\tau^{\mathrm{sw}}_{\mathrm{Wolff}} = \tau^{\mathrm{cs}}_{\mathrm{Wolff}} \times \langle C \rangle / L^2 \sim \tau^{\mathrm{cs}}_{\mathrm{Wolff}} \times L^{7/4} / L^2 = \tau^{\mathrm{cs}}_{\mathrm{Wolff}} \times L^{-1/4}\). This gives \(z^{\mathrm{sw}} = z^{\mathrm{cs}} - 1/4\), an exact relation for the 2D Ising model.

XY and Clock Wolff: The Reflection Trick

For the Wolff-Evertz generalization to \(O(2)\) models, see Wolff [9] and Newman and Barkema [10]. See also the Wolff algorithm article on Wikipedia.

\[P_{\mathrm{add}} = 1 - e^{-2\beta J\,\sigma_i \sigma_j}.\]

Once the cluster is formed, every cluster spin is reflected through the hyperplane perpendicular to \(\hat{r}\):

\[\mathbf{s}_i \to \mathbf{s}_i - 2(\mathbf{s}_i \cdot \hat{r})\,\hat{r}.\]

This reflection preserves the Euclidean norm \(|\mathbf{s}_i| = 1\) exactly, requires no renormalisation, and satisfies detailed balance for the pure Heisenberg exchange term \(-J \mathbf{s}_i \cdot \mathbf{s}_j\). For the continuous clock model, which includes an additional crystal-field anisotropy \(-A\cos(q\theta_i)\), the reflection symmetry required by the FK bond construction holds only for the exchange part of the Hamiltonian; the anisotropy term breaks this symmetry and falls outside the standard Wolff-Evertz derivation. The Wolff kernel therefore satisfies detailed balance exactly only when \(A = 0\), and converges to an approximation of the correct equilibrium distribution for \(A > 0\). Its use in the clock model is most appropriate when \(A \ll J\), or as a diagnostic in the XY-like regime.

Practical Semantics

One call to a Wolff step() constitutes one cluster sweep, meaning a single cluster is grown and flipped. This differs from the Metropolis convention, where one step() comprises \(N^2\) single-spin-flip attempts. The two schemes are therefore not directly comparable on a per-step basis near \(T_c\); the relevant comparison is per unit of computational time or per independent sample. Because the mean cluster size scales with the correlation length, the cost per cluster sweep also scales with \(\xi\), but the autocorrelation time in units of sweeps falls far faster than it rises in cost, resulting in a substantial net gain precisely where it is most needed.

The parallel=True flag is silently ignored when update='wolff'. Cluster growth is a sequential depth-first search whose frontier depends on each newly added site; it cannot be decomposed into independent sublattices and is therefore incompatible with the checkerboard parallelisation strategy. The Wolff algorithm is inherently a single-threaded traversal, and its performance advantage over Metropolis is algorithmic rather than hardware-parallel.

5. Statistical Uncertainty Estimation

Monte Carlo time series are not sequences of independent draws. Each configuration is generated by a Markov chain whose proposal depends on the current state, so successive measurements are correlated over a characteristic decorrelation scale set by \(\tau_{\mathrm{int}}\). Treating \(N\) correlated measurements as \(N\) independent samples underestimates the true statistical error by a factor of \(\sqrt{2\tau_{\mathrm{int}}}\). Correct uncertainty quantification requires accounting for this serial correlation explicitly. For the foundational treatment, see Sokal [12].

Blocking Analysis

The standard technique for estimating the error on an autocorrelated mean is the blocking (or “batch means”) method. Successive measurements are grouped into blocks of size \(b\), and the block means are treated as approximately independent if \(b \gg \tau_{\mathrm{int}}\). As \(b\) increases from 1 to \(N/2\), the block-mean variance initially rises and then plateaus once the blocks contain at least one full decorrelation interval. VibeSpin identifies this plateau by selecting the block size where the estimated standard error stabilizes. The plateau estimate is the autocorrelation-aware standard error on the time-series mean.

For a block containing \(\lfloor N/b \rfloor\) independent blocks of \(b\) measurements each, the standard error on the mean is

\[\sigma_{\bar{x}} = \sqrt{\frac{\mathrm{Var}(\bar{x}_{\text{block}})}{N/b}},\]

where \(\mathrm{Var}(\bar{x}_{\text{block}})\) is the variance of the block means. Under the plateau condition this reduces to \(\sigma_{\bar{x}} \approx \sigma_x \sqrt{2\tau_{\mathrm{int}} / N}\), recovering the exact asymptotic formula.

Effective Sample Size

The effective sample size \(N_\mathrm{eff}\) summarizes how many statistically independent observations the run is worth, accounting for serial correlation:

\[N_\mathrm{eff} = \frac{N}{2\,\tau_{\mathrm{int}}}.\]

\(N_\mathrm{eff}\) is bounded above by \(N\) and approaches it only when successive measurements are uncorrelated. Near a critical point, \(\tau_{\mathrm{int}}\) diverges as a power of \(L\) (critical slowing down), so \(N_\mathrm{eff} \ll N\) even for long runs. The ratio \(N_\mathrm{eff} / N\) is a direct diagnostic of simulation efficiency and guides decisions about run length and algorithm choice.

Confidence Intervals and Conventions

VibeSpin reports uncertainty intervals at a default confidence level of 68%, the one-sigma Gaussian equivalent. Under this convention the interval \([\bar{x} - \sigma_{\bar{x}},\, \bar{x} + \sigma_{\bar{x}}]\) contains the true mean with probability 0.68 for large samples from a normal distribution. This matches the standard physics convention of quoting a single symmetric error bar and is set by the constant DEFAULT_CONFIDENCE_LEVEL = 0.68 in utils/physics_helpers.py.

When a time series has zero variance (a fully ordered configuration, for example), \(\tau_{\mathrm{int}}\) is undefined. VibeSpin handles this explicitly: the ZeroVarianceAutocorrelationError exception is caught and the corresponding fields (err, tau_int, n_eff) are stored as NaN rather than silently defaulting to zero or propagating exceptions into the aggregation layer.

Single-Seed and Multi-Seed Aggregation

For a single seed at fixed temperature, VibeSpin estimates uncertainty directly from the measured trajectory via blocking. This gives a statistically valid error bar even with one seed, provided the time series is long enough to resolve a blocking plateau. For multi-seed sweeps, VibeSpin uses a hierarchical combination: the final uncertainty on the mean includes both the between-seed spread of point estimates and the average within-seed blocking uncertainty. This avoids underestimating errors in regimes where seed-to-seed variability is large and avoids overestimating them when between-seed spread is small but each trajectory remains autocorrelated.

Regime Caveats

In the deep ordered phase (low \(T\)), configurations can be nearly frozen and \(\tau_{\mathrm{int}}\) may approach zero from below as the estimator, in which case \(N_\mathrm{eff}\) is set to NaN to flag the ill-conditioned estimate rather than reporting an artificially inflated value. Near a continuous phase transition, \(\tau_{\mathrm{int}} \propto L^z\) grows rapidly with system size, and the Wolff cluster algorithm is strongly preferred to reduce \(z\) from roughly 2.2 to roughly 0.25 and restore statistical efficiency. For derived observables like susceptibility \(\chi\) and specific heat \(C_v\), which are nonlinear functions of the measured time series (and are therefore not simply the mean of an ergodic chain), VibeSpin supports both an autocorrelation-aware blocking estimator (the default) and an optional block-bootstrap propagation of the block-mean distribution.

Bibliography

[1] L. Onsager, “Crystal Statistics. I. A Two-Dimensional Model with an Order-Disorder Transition,” Physical Review, vol. 65, no. 3-4, pp. 117–149, 1944. APS Open Access

[2] J. M. Kosterlitz and D. J. Thouless, “Ordering, metastability and phase transitions in two-dimensional systems,” Journal of Physics C: Solid State Physics, vol. 6, no. 7, pp. 1181–1203, 1973. IOP Open Access

[3] N. D. Mermin and H. Wagner, “Absence of Ferromagnetism or Antiferromagnetism in One- or Two-Dimensional Isotropic Heisenberg Models,” Physical Review Letters, vol. 17, no. 22, pp. 1133–1136, 1966. APS Open Access

[4] J. Lapilli, P. Pfeifer, and C. Wexler, “Universality away from critical points in two-dimensional phase transitions,” Physical Review Letters, vol. 96, no. 14, 140603, 2006. APS Open Access

[5] W. K. Hastings, “Monte Carlo sampling methods using Markov chains and their applications,” Biometrika, vol. 57, no. 1, pp. 97–109, 1970. Oxford Academic

[6] K. Huang, “Statistical Mechanics,” 2nd Edition, Wiley, 1987. Statistical Mechanics lecture notes, John Cardy, Oxford (Archive)

[7] N. Goldenfeld, “Lectures on Phase Transitions and the Renormalization Group,” Addison-Wesley, 1992. Internet Archive (Open Access)

[8] C. M. Fortuin and P. W. Kasteleyn, “On the random-cluster model. I. Introduction and relation to other models,” Physica, vol. 57, no. 4, pp. 536–564, 1972. Elsevier Open Access

[9] U. Wolff, “Collective Monte Carlo Updating for Spin Systems,” Physical Review Letters, vol. 62, no. 4, pp. 361–364, 1989. APS Open Access

[10] M. E. J. Newman and G. T. Barkema, “Monte Carlo Methods in Statistical Physics,” Oxford University Press, 1999. Lecture Notes Summary (H. G. Katzgraber)

[11] J. Villain, “Theory of one- and two-dimensional magnets with an easy magnetization plane. II. The planar, classical, two-dimensional magnet,” J. Phys. France 36, 581-590 (1975). Open Access

[12] A. D. Sokal, “Monte Carlo Methods in Statistical Mechanics: Foundations and New Algorithms,” lecture notes (1989), published in Functional Integration: Basics and Applications (C. DeWitt-Morette, P. Cartier, A. Folacci, eds.), Springer, 1997, pp. 131–192. Springer Link