SlotCTM
Research artifact for: Per-Object Slot Decomposition for Scalable Neural World Modeling: When Does Attention Beat Mean-Field?
Archon, Jesse Caldwell, Aura · DuoNeural, April 2026
Overview
A systematic ablation of slot-based CTM world models on N-body bouncing ball physics. Tests when per-object attention (SlotCTM) outperforms mean-field interaction, identifies the capacity bottleneck at scale, and characterizes the collision density phase transition.
Central question: When does modeling object interactions via attention beat modeling them via mean-field (SlotGNN with pooled interaction)?
Key Findings
Temporal Specialization Arc (v21–v24)
| Version | Setting | Spec Score | Key Finding |
|---|---|---|---|
| v21 | Learned, no constraint | 0.0078 | No specialization. All slots generalists. |
| v22 | Hard delay (slot i ← t − i·τ) | 0.2777 | Forced specialization works (35× v21), but at a 2–7× performance cost. |
| v23 | Soft learned gates | 0.0876 | Freedom collapses to the present: delta-function gates. |
| v24 | Forced diversity loss | 0.2353 | Gates spread to [0–15] but performance is unchanged. |
Conclusion: Temporal gate diversity emerges only when the task requires it. Bouncing-ball state is Markovian: one frame suffices. The optimal temporal gate matches the task's predictability horizon.
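The v22 hard-delay scheme can be sketched in a few lines: slot i reads the observation from t − i·τ. This is an illustrative sketch, not the released code; the frame-buffer layout, the function name, and the clamping behavior at the start of a sequence are assumptions.

```python
import numpy as np

def hard_delay_inputs(frames, n_slots, tau=1):
    """v22-style hard temporal delay: slot i reads frame t - i*tau.

    frames: array of shape (T, obs_dim), oldest frame first.
    Returns (n_slots, obs_dim): one delayed observation per slot,
    clamped to the oldest available frame early in a sequence.
    """
    t = len(frames) - 1                       # current time index
    idx = [max(t - i * tau, 0) for i in range(n_slots)]
    return frames[idx]
```

Because the delay is fixed per slot rather than learned, specialization is guaranteed, which matches the finding that v22 specializes where the free gates of v21/v23 collapse to the present.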
N-Body Scaling (v10, v14)
The SlotCTM advantage inverts at N ≥ 5 without proportional hidden-dimension scaling. At N=8 with the standard HIDDEN_DIM=384, CTM is 2.8× worse than the MLP baseline. Scaling HIDDEN_DIM = N×128 recovers the advantage.
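The capacity rule above amounts to a one-line config helper; the function and parameter names here are hypothetical, and 128 is the per-object budget implied by the N×128 finding.

```python
def slot_hidden_dim(n_objects, base_per_object=128):
    """Scale HIDDEN_DIM with object count (hypothetical helper).

    With a fixed HIDDEN_DIM=384, per-object capacity shrinks as N grows
    and the SlotCTM advantage inverts at N >= 5; budgeting 128 hidden
    units per object restores the advantage at N=8.
    """
    return n_objects * base_per_object
```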
Phase Transition (v12)
Collision density r_critical ≈ 0.09–0.11 separates two regimes:
- Ballistic (r < 0.10): MLP fine, CTM overkill
- Collision-entangled (r > 0.10): CTM wins, advantage grows monotonically
At r=0.20, k=100: MLP MSE = 89,241, CTM = 0.352. Ratio: 253,000:1.
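Collision density can be measured empirically as the fraction of object pairs in contact, averaged over frames. A minimal numpy sketch, assuming circular balls of a common radius and a center-distance overlap criterion (both assumptions, since the artifact's exact definition of r is not given here):

```python
import numpy as np

def collision_density(positions, radius):
    """Fraction of object pairs in contact, averaged over frames.

    positions: (T, N, 2) ball centers over T frames; radius: common
    ball radius. Two balls are in contact when their center distance
    is below 2 * radius.
    """
    T, N, _ = positions.shape
    # pairwise center differences per frame: (T, N, N, 2)
    diff = positions[:, :, None, :] - positions[:, None, :, :]
    dist = np.linalg.norm(diff, axis=-1)      # (T, N, N)
    iu = np.triu_indices(N, k=1)              # each unordered pair once
    contact = dist[:, iu[0], iu[1]] < 2 * radius
    return contact.mean()
```

Under this definition, r > 0.10 would place a trajectory in the collision-entangled regime where CTM wins.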
Partial Observability (v13 extension of v7)
VarCTM with single-frame position-only observations outperforms MLP-with-velocity-estimation by >180× at k=100 (MLP MSE: 63.8 trillion; TempCTM: 0.347). The CTM hidden state IS the belief state.
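The "hidden state is the belief state" claim can be illustrated with a two-line recurrent filter: under position-only observations, a hidden state that merely remembers the previous position already determines velocity as a finite difference, which is exactly the information a feed-forward MLP on a single frame cannot have. This is a minimal sketch with hypothetical names, not the VarCTM update.

```python
import numpy as np

def belief_update(h, pos):
    """One recurrent tick of a minimal belief state.

    h: (prev_pos, vel_estimate) tuple, or None at t=0.
    pos: current observed position (positions only, no velocities).
    Carrying prev_pos across ticks makes velocity recoverable as a
    finite difference, so the recurrent state is a belief state.
    """
    if h is None:
        return (pos, np.zeros_like(pos))
    prev_pos, _ = h
    return (pos, pos - prev_pos)
```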
Architecture
SlotCTM processes each physical object as an independent slot:
- SlotGNN: Per-object encoders + multi-head attention message passing
- CTM dynamics: Shared-weight recurrent ticks per dynamics step
- VarCTM: Variable training horizon k~U(1,20) for best generalization
- TSSP: Thought-Space Self-Prediction auxiliary loss
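The per-slot pipeline above can be sketched in numpy. Everything here is illustrative: the dimensions, weight names, single-head attention, and tanh recurrence are assumptions standing in for the released architecture, which the README describes only at this level of detail.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

class SlotCTMSketch:
    """Per-object slots + attention message passing + shared recurrent ticks."""

    def __init__(self, n_slots, obs_dim, hidden):
        self.W_enc = rng.normal(0, 0.1, (obs_dim, hidden))     # shared per-object encoder
        self.W_q = rng.normal(0, 0.1, (hidden, hidden))        # attention projections
        self.W_k = rng.normal(0, 0.1, (hidden, hidden))
        self.W_v = rng.normal(0, 0.1, (hidden, hidden))
        self.W_rec = rng.normal(0, 0.1, (2 * hidden, hidden))  # shared recurrent tick

    def step(self, obs, h, n_ticks=3):
        """One dynamics step. obs: (N, obs_dim); h: (N, hidden) slot states."""
        x = obs @ self.W_enc
        for _ in range(n_ticks):               # CTM: several ticks per dynamics step
            q, k, v = h @ self.W_q, h @ self.W_k, h @ self.W_v
            att = softmax(q @ k.T / np.sqrt(k.shape[-1]))   # (N, N) pair weights
            msg = att @ v                      # attention message passing between slots
            h = np.tanh(np.concatenate([x + msg, h], axis=-1) @ self.W_rec)
        return h
```

A VarCTM-style training loop would sample the rollout horizon per batch, e.g. `k = rng.integers(1, 21)`, and apply `step` k times before computing the loss.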
Why Attention Beats Mean-Field
In dense collision regimes, pairwise object interactions are nonlinear and asymmetric. Mean-field pooling discards the directionality of collision impulses, whereas attention learns to weight the relevant pair interactions, which is critical at large N and high collision density.
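The directionality argument can be seen in a toy example: when two neighbors impart opposite impulses, mean pooling cancels them to zero, while attention lets a receiver up-weight the neighbor its own state says is relevant. The query vector and weights below are illustrative, not learned parameters from the artifact.

```python
import numpy as np

# Two neighbor impulses pointing in opposite directions.
neighbors = np.array([[ 1.0, 0.0],
                      [-1.0, 0.0]])

# Mean-field pooling: the opposite impulses cancel; the receiver sees nothing.
mean_msg = neighbors.mean(axis=0)             # [0., 0.]

# Attention: a receiver whose query aligns with one neighbor up-weights it,
# so the collision impulse survives aggregation.
query = np.array([1.0, 0.0])                  # receiver-dependent relevance
logits = neighbors @ query                    # per-pair scores
w = np.exp(logits) / np.exp(logits).sum()     # softmax weights
att_msg = w @ neighbors                       # weighted message, nonzero
```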
Citation
```bibtex
@article{archon2026slotctm,
  title     = {Per-Object Slot Decomposition for Scalable Neural World Modeling: When Does Attention Beat Mean-Field?},
  author    = {Archon and Caldwell, Jesse and Aura},
  year      = {2026},
  doi       = {10.5281/zenodo.19846804},
  url       = {https://doi.org/10.5281/zenodo.19846804},
  publisher = {Zenodo}
}
```
DuoNeural
DuoNeural is an open AI research lab: human + AI in collaboration.
| HuggingFace | huggingface.co/DuoNeural |
| GitHub | github.com/DuoNeural |
| X / Twitter | @DuoNeural |
| Email | duoneural@proton.me |
| Newsletter | duoneural.beehiiv.com |
| Support | buymeacoffee.com/duoneural |
| Site | duoneural.com |
Research Team
- Jesse – Vision, hardware, direction
- Archon – AI lab partner, post-training, abliteration, experiments
- Aura – Research AI, literature synthesis, novel proposals
DuoNeural Research Publications
Open access, CC BY 4.0. Authored by Archon, Jesse Caldwell, Aura · DuoNeural.