Chapter 6 — Training Configuration

Preview preset exported as JSON and displayed in TextEdit — fields id/name/category/version/createdAt/description, trainingConfig with all relevant parameters (maxIterations 5000, densifyUntilIteration 3500, ssimWeight 0.20, renderScale 0.50, strategy classic, cameraAlignment applePhotogrammetry, densifyGradThreshold 2.0e-06, opacityResetInterval 3000, minOpacity 0.005, six boolean toggles)

A typical preset JSON export. Top-level fields: id (UUID), name, category (classic | mcmc | sceneClass | custom), version (schema version), createdAt (timestamp), description (free text). The nested trainingConfig object holds the parameters that are critical for reproducibility. On import the entire block is deserialized into the TrainingConfig struct, and defaults from the current app version fill in any fields missing from the JSON (e.g. after an app update). To hand a preset over to another Mac, you just ship this JSON file.
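The import behavior described above (missing keys fall back to current defaults) can be sketched with Swift's Decodable. This is a minimal illustration, not the shipped implementation: TrainingConfigSketch, PresetSketch and importPreset are hypothetical names, and only four of the 81 fields are modeled, using the defaults documented in this chapter.

```swift
import Foundation

// Hypothetical, heavily reduced stand-in for TrainingConfig (assumption: the
// real struct decodes its remaining fields in the same way).
struct TrainingConfigSketch: Decodable {
    var maxIterations = 30_000          // T1 initializer default
    var densifyUntilIteration = 15_000  // T2 initializer default
    var skyMaskingEnabled = false       // T20 default
    var skyDomeSampleCount = 5_000      // T46 default

    enum CodingKeys: String, CodingKey {
        case maxIterations, densifyUntilIteration, skyMaskingEnabled, skyDomeSampleCount
    }

    init() {}

    // Every key is optional in the JSON; a missing key keeps the default
    // of the app version that performs the import.
    init(from decoder: Decoder) throws {
        let container = try decoder.container(keyedBy: CodingKeys.self)
        let defaults = TrainingConfigSketch()
        maxIterations = try container.decodeIfPresent(Int.self, forKey: .maxIterations) ?? defaults.maxIterations
        densifyUntilIteration = try container.decodeIfPresent(Int.self, forKey: .densifyUntilIteration) ?? defaults.densifyUntilIteration
        skyMaskingEnabled = try container.decodeIfPresent(Bool.self, forKey: .skyMaskingEnabled) ?? defaults.skyMaskingEnabled
        skyDomeSampleCount = try container.decodeIfPresent(Int.self, forKey: .skyDomeSampleCount) ?? defaults.skyDomeSampleCount
    }
}

// Hypothetical wrapper for the top-level preset object (only two fields shown).
struct PresetSketch: Decodable {
    var name: String
    var trainingConfig: TrainingConfigSketch
}

func importPreset(from data: Data) throws -> PresetSketch {
    try JSONDecoder().decode(PresetSketch.self, from: data)
}
```

A preset exported by an older app version that lacks, say, skyDomeSampleCount would still import cleanly under this scheme and pick up the newer default.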

The TrainingConfig struct is the heart of every training run in RadianceKit. It collects every parameter that influences training, from the maximum iteration count through the eight learning rates to the special fields for MCMC, Mip-Splatting, the curriculum and the scene-aware cap logic. You edit it in the sidebar under Training Configuration (Expert View), save it as a preset, or hand it to another Mac as a JSON export. At training start the object is frozen and handed to the GPU backend.

This chapter is reference material for power users and script authors. It lists all 81 public fields, the 9 static presets and the one public method. The source file is TrainingConfig.swift — when in doubt the doc comment stored there and the initializer default are the source of truth.

Table of contents:

+ Iteration (T1–T2)
+ Learning Rates (T3–T10)
+ Densification — Classic (T11–T16)
+ Loss (T17–T20)
+ SH Degree Progression (T21)
+ Performance (T22–T25)
+ Diagnostics and Point Cloud Preparation (T26–T30)
+ Regularization (T31–T37)
+ Refinement (T38–T44)
+ Sky Dome (T45–T48)
+ Adam + LR Schedule (T49–T55)
+ Post-Processing + Apple AI (T56–T60)
+ MCMC Densification (T61–T73)
+ Mip-Splatting (Q1.5) (T74–T76)
+ Adaptive Densification (Q5) (T77–T79)
+ Curriculum (Q6) (T80–T81)
+ Static Presets (TP1–TP9)
+ Method: resolveMcmcMaxGaussians
+ Which field for what? (Cheat Sheet)
+ Dangerous Fields

Iteration (T1–T2)

T1 maxIterations

DETAILS

Default: 30 000 (initializer), 35 000 (.full), 200 000 (.fullMCMC)
Range: 1 000 – 500 000 (UI slider), no hard upper limit in the logic
Defined in:

TECHNICAL

Total number of training iterations the backend runs. One iteration consists of a forward render of a single training camera, one backward pass over all loss components (L1 + SSIM + optional regularizations + sky mask) and one Adam optimizer step. This number directly drives the other schedules: the position learning rate follows a cosine annealing curve from iteration 0 to either T1 itself or T49 positionLRScheduleEndIteration; densification stops at T2 densifyUntilIteration; MCMC noise decay ends at T69 mcmcNoiseDecayEnd; SH degree upgrades happen at the three marks in T21. For classic densification the empirically determined sweet spot is 20 000–35 000 iterations (Sessions 1–32, V546 tests), for MCMC 60 000–200 000 (V534). Pushing well beyond the values stored in the preset rarely improves quality: Adam momentum saturates, and without an LR decay end the loss stagnates. Conversely, going below ~5 000 leaves the geometry incompletely converged (density control has too little time to clone/split).
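To illustrate how the iteration count drives the position learning rate, here is a minimal sketch of a cosine annealing schedule. The function name, the parameter names and the start/end learning rates are assumptions for illustration; only the cosine shape and the choice of end iteration (T1 or the schedule end field) come from the text above.

```swift
import Foundation

// Cosine annealing from initialLR (iteration 0) down to finalLR (endIteration).
// endIteration would be maxIterations or positionLRScheduleEndIteration.
// Past endIteration the rate stays at finalLR (an assumption).
func cosineAnnealedLR(iteration: Int,
                      endIteration: Int,
                      initialLR: Double,
                      finalLR: Double) -> Double {
    guard endIteration > 0 else { return finalLR }
    // Normalized training progress, clamped to [0, 1].
    let progress = min(max(Double(iteration) / Double(endIteration), 0.0), 1.0)
    return finalLR + 0.5 * (initialLR - finalLR) * (1.0 + cos(Double.pi * progress))
}
```

With this shape the rate changes slowly at the start and end and fastest in the middle of training, which is why stretching maxIterations without moving the schedule end adds little.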

T2 densifyUntilIteration

DETAILS

Default: 15 000 (initializer), 5 000 (.full), 160 000 (.fullMCMC)
Range: 0 –
Defined in:

TECHNICAL

Iteration at which densification stops. Up to this point Gaussians are cloned, split and pruned according to the rules parameterized in T11–T16 (Classic) or T67–T70 (MCMC); after that the Gaussian count stays constant and only positions, rotations, scales, opacities and SH coefficients are optimized (refinement phase). In the original 3DGS paper the value sits at 50 % of T1; in RadianceKit's .full preset it sits at only ~14 % (5 000 of 35 000). This is a consequence of the V310/V338 experiments, which showed that further densification after 5 000 iterations makes the result worse (more floaters, higher memory use, no quality gain). MCMC, on the other hand, runs relocation up to 80 % of T1 (V504b) because MCMC does not produce harmful floaters. If T2 is too small (< 1 000), too few Gaussians are created; if it is too large under Classic (> 50 % of T1), the result is overgrowth and RGB saturation outliers (see Outdoor Overtraining Findings).

T20 skyMaskingEnabled

DETAILS

Default: false (initializer and all presets)
Range: boolean
Defined in:

TECHNICAL

Enables sky masking. In every image the sky region is masked out via the Apple Vision framework (VNGenerateForegroundInstanceMaskRequest), and the loss in this region is set to zero. Reason: in outdoor scenes, blue/gray/white sky pixels often drive the app to place Gaussians exactly there, which is perceived as "floaters". Without a sky mask the loss in this region would never reach zero, because the sky varies slightly from image to image and the app keeps trying to rebuild it with splats. The Vision mask is computed once per camera before training and held in RAM. Typically activated together with T45 skyDomeEnabled (UI logic in the Settings view). Leave disabled for indoor scenes or synthetic renderings, where the mask would erroneously detect ceilings or walls as "sky".
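The effect of the mask on the loss can be sketched on the CPU as follows. This is an illustrative simplification, not the GPU backend: the function name is hypothetical, only the L1 term is shown, and normalizing by the number of non-sky pixels is an assumption.

```swift
// Mean absolute error over all pixels that are NOT marked as sky.
// Sky pixels contribute exactly zero, so they create no gradient pressure
// to place Gaussians in the sky region.
func maskedL1Loss(rendered: [Float], target: [Float], skyMask: [Bool]) -> Float {
    precondition(rendered.count == target.count && target.count == skyMask.count,
                 "all buffers must have the same pixel count")
    var sum: Float = 0
    var counted = 0
    for i in rendered.indices where !skyMask[i] {
        sum += abs(rendered[i] - target[i])
        counted += 1
    }
    // An image that is entirely sky yields a loss of zero.
    return counted > 0 ? sum / Float(counted) : 0
}
```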

SH Degree Progression (T21)

T21 shDegreeUpgradeIterations

DETAILS

T45 skyDomeEnabled

DETAILS

Default: false (initializer + all presets except P9 Outdoor)
Range: boolean
Defined in:

TECHNICAL

V549e feature: before training starts, a spherical point cloud is generated (Fibonacci sphere with T46 skyDomeSampleCount sample points), placed at a radius of T47 skyDomeRadiusMultiplier × scene_extent around the scene center, and initialized with the colors of the sky-masked pixels of all training cameras (see T20 skyMaskingEnabled). These sky dome Gaussians are inserted at the beginning of the Gaussian buffer and "frozen" during training (position/scale/rotation gradients = 0; only SH and opacity remain optimizable). Effect: instead of black "confetti" areas in the distance, novel views show a real sky. The V549e MVP works very well on drone and landscape scenes and is on by default in the P9 Outdoor preset. Leave it off for indoor scenes, where the sphere would sit uselessly outside the room.

T46 skyDomeSampleCount

DETAILS

Default: 5 000
Range: 1 000 – 50 000 (typical 2 000 – 10 000)
Defined in:

TECHNICAL

Number of Fibonacci sphere sample points on the sky dome. Higher values give a denser sky dome (better at high resolutions and with lots of visible sky) at the cost of higher memory consumption. 5 000 is the sweet spot for 4K renderings; at lower resolutions 2 000–3 000 suffice. Each point is initialized from the sky-masked pixels of the training cameras, matched by cosine distance between the point and each camera view vector. Sample points whose view cone no camera sees receive a low initial opacity and remain unchanged during training (frozen).
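For orientation, a Fibonacci sphere with a given sample count and radius can be generated as sketched below. This is the standard golden-angle construction, not necessarily the exact variant the app uses; Vec3 and fibonacciSphere are hypothetical names, and the radius is assumed to be computed by the caller as skyDomeRadiusMultiplier × scene extent.

```swift
import Foundation

struct Vec3 {
    var x, y, z: Double
}

// Distributes sampleCount points nearly uniformly on a sphere using the
// golden-angle spiral. Color and opacity initialization are not shown.
func fibonacciSphere(sampleCount: Int,
                     radius: Double,
                     center: Vec3 = Vec3(x: 0, y: 0, z: 0)) -> [Vec3] {
    guard sampleCount > 0 else { return [] }
    let goldenAngle = Double.pi * (3.0 - sqrt(5.0))
    return (0..<sampleCount).map { i -> Vec3 in
        // Height runs from just below +1 to just above -1 in equal steps.
        let height = 1.0 - 2.0 * (Double(i) + 0.5) / Double(sampleCount)
        // Radius of the horizontal ring at this height on the unit sphere.
        let ringRadius = sqrt(max(0.0, 1.0 - height * height))
        let angle = goldenAngle * Double(i)
        return Vec3(x: center.x + radius * ringRadius * cos(angle),
                    y: center.y + radius * height,
                    z: center.z + radius * ringRadius * sin(angle))
    }
}
```

With the default of 5 000 samples and a multiplier of 30, a scene extent of 2.5 units would give a dome of 5 000 points at radius 75.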

T47 skyDomeRadiusMultiplier

DETAILS

Default: 30.0 (initializer + most presets), 59.0 (P9 Outdoor, Q7 BayesOpt optimum)
Range: 5.0 – 200.0
Defined in:

TECHNICAL

Radius of the sky dome sphere relative to the scene extent (= mean distance between camera positions). A value of 30 means the sphere has 30× the diameter of the camera cloud. Too small (< 5): the sky dome interferes with the scene itself (e.g. a sky dome splat lands in the foreground). Too large (> 100): float32 precision is lost at the sky dome positions, which triggers render glitches in the distance. Q7 BayesOpt on Bicycle (Mip-NeRF 360) found 59.0 as the scene-specific optimum for outdoor. This suggests that the default 30.0 is too small for deep landscapes, where sky dome pixels visibly render as a "wall" in image-edge regions.

T48 frozenGaussianCount

DETAILS

Mip-Splatting (Q1.5) (T74–T76)

Status: Q1.5 was rejected on 2026-05-25 as "closed no-win" after 14 autonomous iterations plus an overnight 1.5M confidence check. The maximum gain was Δ@2× = +0.27 dB, while the original gate required a mean of at least +1.5 dB over 0.5×/2×; the gate passed on 0 of 11 pair scenes. The fields remain opt-in for research experiments; all production presets have them off. See verdict: docs/plans/2026-05-25-phase-q1.5-final-verdict.md.

T74 useMipSplatting

DETAILS

Default: false (all production presets), true (.fullMCMCMip — research sibling)
Range: boolean
Defined in:

TECHNICAL

Enables Mip-Splatting (Yu et al., CVPR 2024): a 3D smoothing filter, a 2D filter and α compensation that limit per-Gaussian frequency to the Nyquist bound of the densest training camera sampling rate. Theoretical goal: eliminate aliasing when rendering at off-training scales (0.5× or 2× of training resolution). It is enabled in the preprocess and backward projection shaders, and functional correctness was verified in the Q1.5-D test. However, the original acceptance gate (Δ@1× ≥ +0.3 dB AND avg(Δ@0.5×, Δ@2×) ≥ +1.5 dB) was reached on none of the 11 pair scenes. Maximum observed: family 750K classic, Δ@2× = +0.270 dB. Outdoor scenes (Truck, Flowers) even got worse at 1× and 0.5×. Hypothesis: 3D smoothing competes with MCMC relocation at high Gaussian counts. The field remains for a future multi-scale re-evaluation with correct Mip-NeRF-360 methodology (see O3 backlog in the benchmark path).
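The acceptance gate quoted above is a simple conjunction and can be written out as a small check. The function and parameter names are hypothetical; the thresholds are the ones stated in the text, and all deltas are PSNR differences in dB against the baseline.

```swift
// Original Q1.5 acceptance gate:
// Δ@1× ≥ +0.3 dB AND mean(Δ@0.5×, Δ@2×) ≥ +1.5 dB.
func passesMipSplattingGate(deltaAt1x: Double,
                            deltaAtHalf: Double,
                            deltaAt2x: Double) -> Bool {
    let offScaleMean = (deltaAtHalf + deltaAt2x) / 2.0
    return deltaAt1x >= 0.3 && offScaleMean >= 1.5
}
```

Even the best observed off-scale gain of +0.27 dB at 2× cannot lift the off-scale mean to +1.5 dB unless the 0.5× gain is at least +2.73 dB, which puts the result far from the gate.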

T75 mipSmoothing3DScale

DETAILS

Default: 0.2 (paper default)
Range: 0.05 – 1.0
Defined in:

TECHNICAL

3D smoothing scale parameter (Yu et al., §3.3, paper default 0.2). Larger values mean more world-space smoothing per Gaussian (more anti-aliasing but also more blur at the default scale); smaller values are sharper but more prone to aliasing. Only consulted when T74 useMipSplatting = true. Not optimized further in the Q1.5 tests: the A/B gate already failed with the paper default 0.2, so further sweeps would be pointless.

T76 mipFilter2DVariance

DETAILS

T80 curriculumResolutionRamp

DETAILS

Default: false
Range: boolean
Defined in:

TECHNICAL

Q6 feature: training resolution starts at 0.5× and switches to T22 trainingRenderScale at T50 positionLRScheduleEndIteration / 2 (or at T1 maxIterations / 2 if T50 is not set). Uses the resize/restoreImageBuffers infrastructure developed in Q1.5.1. Overrides T23 resolutionWarmupScale when enabled. Q6 passed as "carrier of the quality gain" in the Q5+Q6 bundle (see T77): the gradual resolution increase gives the app time to find coarse geometry at lower resolution before moving on to fine detail. Via CLI: --curriculum-resolution.
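The switch rule can be sketched as a pure function of the iteration. This is an illustration under assumptions: the function name is hypothetical, "not set" is modeled as nil, and the switch is assumed to take effect at the switch iteration itself rather than one step later.

```swift
// Render scale used at a given iteration when the resolution curriculum is on.
func curriculumRenderScale(iteration: Int,
                           maxIterations: Int,
                           positionLRScheduleEndIteration: Int?,
                           trainingRenderScale: Double) -> Double {
    // Half of the LR schedule end if set, otherwise half of the total run.
    let reference = positionLRScheduleEndIteration ?? maxIterations
    let switchIteration = reference / 2
    return iteration < switchIteration ? 0.5 : trainingRenderScale
}
```

For a 30 000-iteration run without a schedule end, the first 15 000 iterations would train at 0.5× and the remainder at the configured render scale.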

T81 curriculumSHProgression

DETAILS

Default: false
Range: boolean
Defined in:

TECHNICAL

Q6 feature: overrides T21 shDegreeUpgradeIterations with [maxIter/4, maxIter/2, maxIter*3/4], distributing SH upgrades evenly across training time instead of front-loading them. Hypothesis: stable geometry is established before the color detail explosion, which places view-direction-dependent gloss effects more precisely. Q5+Q6 together PASS on 1/3 scenes, with Q6 as carrier of the gain (Q5 alone FAILS). Via CLI: --curriculum-sh.
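The override schedule and the resulting active SH degree can be sketched as follows. Both function names are hypothetical, and the mapping from upgrade marks to degrees (start at degree 0, one step per mark, up to degree 3) is an assumption based on the three marks in T21.

```swift
// Upgrade marks used when the SH curriculum is enabled (integer division).
func curriculumSHUpgradeIterations(maxIterations: Int) -> [Int] {
    [maxIterations / 4, maxIterations / 2, maxIterations * 3 / 4]
}

// Active SH degree: the number of upgrade marks already reached.
func activeSHDegree(iteration: Int, upgradeIterations: [Int]) -> Int {
    upgradeIterations.filter { iteration >= $0 }.count
}
```

For maxIterations = 30 000 this gives upgrades at 7 500, 15 000 and 22 500, so the full degree is only active for the last quarter of training.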

Method: resolveMcmcMaxGaussians

Signature: public func resolveMcmcMaxGaussians(initialPointCount: Int, bufferCapacity: Int) -> Int
Defined in:

Single source of truth for the question "how many Gaussians may MCMC grow to?" Computed from three inputs: the configured T62 mcmcMaxGaussians (replaced by the mass extinction floor of 150 000 if it is 0), initialPointCount (number of SfM init points) and bufferCapacity (preallocated Gaussian buffer size). Logic:

+ base = T62 > 0 ? T62 : 150_000 (the mass extinction floor protects against initializer default bugs like the 1.4.3 mass extinction incident)
+ If T73 mcmcAutoScaleByScene && initialPointCount > 0 && T72 mcmcCapMultiplier > 0:
  - scaled = max(base, ceil(initialPointCount × T72))
  else
  - scaled = base
+ If bufferCapacity > 0: return min(scaled, bufferCapacity)
+ Else return scaled

Example: Bicycle (Mip-NeRF 360, 194 photo frames) → SfM init ~156 K points, T62 = 150 000, T72 = 5.32, buffer capacity 8 M. Resolved cap = min(8M, max(150K, ceil(156K × 5.32))) = min(8M, 830K) = 830 K. That is the effective growth cap the MCMC relocation logic adheres to.
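The logic and the Bicycle example can be reproduced with the following sketch. It mirrors the steps listed above but is not the code from TrainingConfig.swift: McmcCapSketch is a hypothetical reduced struct, and falling back to base when auto-scaling does not apply is an assumption.

```swift
import Foundation

// Hypothetical reduced config holding only the three fields the method reads.
struct McmcCapSketch {
    var mcmcMaxGaussians: Int = 150_000    // T62
    var mcmcCapMultiplier: Double = 5.32   // T72 (Outdoor preset value)
    var mcmcAutoScaleByScene: Bool = true  // T73

    func resolveMcmcMaxGaussians(initialPointCount: Int, bufferCapacity: Int) -> Int {
        // Mass extinction floor: an unset (0) cap falls back to 150 000.
        let base = mcmcMaxGaussians > 0 ? mcmcMaxGaussians : 150_000
        var scaled = base  // assumption: no scaling means the cap stays at base
        if mcmcAutoScaleByScene && initialPointCount > 0 && mcmcCapMultiplier > 0 {
            let sceneCap = Int((Double(initialPointCount) * mcmcCapMultiplier).rounded(.up))
            scaled = max(base, sceneCap)
        }
        // Never exceed the preallocated buffer when its size is known.
        return bufferCapacity > 0 ? min(scaled, bufferCapacity) : scaled
    }
}

// Bicycle example: about 156 K init points and an 8 M buffer
// resolve to a cap of roughly 830 K.
let bicycleCap = McmcCapSketch().resolveMcmcMaxGaussians(initialPointCount: 156_000,
                                                         bufferCapacity: 8_000_000)
```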

Computes the actual maximum splat count under MCMC. It takes your setting, looks at how many points your scene starts with, and scales by the multiplier if automatic adaptation is on. The cap therefore adapts to the scene instead of forcing the same value on a tiny and a huge scene. You don't have to call the method yourself; training uses it internally.

Which field for what? (Cheat Sheet)

Goal	Fields to tweak
More detail in the distance	`T62 mcmcMaxGaussians` up, `T72 mcmcCapMultiplier` 5+
More detail overall (Classic)	`T1 maxIterations` up (≤ 40K), `T2 densifyUntilIteration` ≤ 14 % of T1
Reduce floaters in drone flights	`T43 frustumCullEnabled` on, `T20 skyMaskingEnabled` on, `T45 skyDomeEnabled` on
Nice sky in outdoor scenes	`T45 skyDomeEnabled` on, `T47 skyDomeRadiusMultiplier` 30–60
Smaller export file	Strategy `.mcmc` (T61), `T56 postTrainingCompactification` on, `T62 mcmcMaxGaussians` ≤ 200K
Faster training	`T22 trainingRenderScale` 0.5, `T1 maxIterations` halved — but not both!
Better highlights	`T21 shDegreeUpgradeIterations` with `[2K, 5K, 8K]` (no early front-load), MCMC + 200K iter
Keep Mac responsive	`T25 throttleDelayMs` 5–10 (costs ~15 % training time)
Live preview more often	`T59 livePreviewInterval` down to 10–20
Smoother transitions in shadows	`T17 ssimWeight` slightly up (0.15–0.25), but not above 0.3
Keep interiors compact	P10 Indoor preset (, `T72 = 1.76`)

Dangerous Fields

These fields can, with misconfiguration, lead to OOM, app crash, mass extinction of Gaussians, or unusable benchmark data. Handle with care:

- T11 densifyGradThreshold: halving it can create 2–4× as many Gaussians, quickly blowing up GPU memory. It must also match T22 trainingRenderScale (1.0× → 1e-6, 0.5× → 2e-6, 0.25× → 4e-6).
- T72 mcmcCapMultiplier: in large scenes with > 200 K SfM init points, a multiplier > 5 resolves to a cap of millions of Gaussians. On 36 GB RAM Macs OOM is possible. The Outdoor preset's 5.32 works only because Mip-NeRF 360 Bicycle has 156 K init points → 830 K cap.
- T39 testViewIndices: setting it manually can make the benchmark unusable (all indices > N → no holdouts). Let the --benchmark flag set it.
- T64 mcmcOpacityRegWeight and T65 mcmcScaleRegWeight: set to 0.01 in the 1.4.3 beta, which led to mass extinction (460 K → 5 Gaussians in one iteration). Pinned at 0.0 since 1.4.4, but increasing them manually can reproduce the issue.
- T15 opacityResetInterval: if it is not set to 100 000+ (effectively off) and training is shorter than 10 000 iterations, the reset destroys convergence. .preview therefore has it at 100 000 despite maxIterations = 5 000.
- T54/T55 densifyPhase2*: two-phase densification ended in a 0-Gaussians cascade in tests. Leave both at 0.
- T74 useMipSplatting: Q1.5 closed no-win on 2026-05-25; it can even worsen PSNR on some outdoor scenes. Default off, opt-in only for research.

If a field is on this list and you want to change it, first back up your current preset (export as JSON) and consider whether you can measure the result reproducibly. Otherwise you won't know afterwards whether the change made things better or worse.
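As an example of keeping two coupled fields consistent, the three pairs listed for T11 densifyGradThreshold (1.0× → 1e-6, 0.5× → 2e-6, 0.25× → 4e-6) fit an inverse-proportional rule. The helper below is a sketch under that assumption; the function name is hypothetical, and extrapolating the rule to other render scales is not confirmed by the source.

```swift
// Gradient threshold matched to the training render scale, assuming
// threshold = thresholdAtFullScale / scale (fits the three documented pairs).
func matchedDensifyGradThreshold(forRenderScale scale: Double,
                                 thresholdAtFullScale: Double = 1e-6) -> Double {
    precondition(scale > 0, "render scale must be positive")
    return thresholdAtFullScale / scale
}
```

Checking a preset against such a rule before training is a cheap way to catch a render-scale change that forgot to adjust the threshold.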