Sun, Oct 5, 2025

Bayesian NAV Updating w/ Kalman Filters

Updating quarterly marks via

\hat{x}_{t\mid t} = \hat{x}_{t\mid t-1} + K_t(z_t - H\hat{x}_{t\mid t-1})

Tags: private equity, bayesian, nowcasting, valuation, kalman, hierarchical

Valuation is not a verdict; it is a belief with receipts. Bayesian updating turns scraps of quarter-to-date evidence into a living, probabilistic estimate of NAV. The math is compact; the philosophy is wide: confidence with a dial. Laplace called probability "common sense reduced to calculation." Tetlock reminded us that "beliefs are hypotheses to be tested, not treasures to be guarded."

This post builds a model step by step. Each step adds one idea and shows how it changes the posterior and the decisions you might take. Think "from toy to tool":

simple normal update,
many signals,
sequential updates,
better priors,
robust likelihoods,
a small state-space model,
cross-fund pooling,
decisions and diagnostics.

TL;DR

Model the quarter-end return $r_q$ as a hidden variable.
Combine a prior (manager, macro) with the likelihood of partial signals.
Interpret the posterior as a belief to act on, then test and recalibrate.

Ten-second intuition

Bayes updates beliefs:

p(r \mid y) \propto p(y \mid r)\,p(r).

The prior $p(r)$ encodes what you believed before you saw this quarter.
The likelihood $p(y \mid r)$ says how probable the data would be if the true return were $r$.
The posterior $p(r \mid y)$ blends the two, with the balance set by comparative uncertainty.

A tighter prior (more certain) moves less; a cleaner signal (less noisy) moves you more. This is the same engine behind weather nowcasting, sensor fusion, and Black-Litterman portfolio updates.

Interactive — Prior vs Posterior (one latent). Use the sliders to change the prior mean and standard deviation. Watch how the posterior curve shifts and tightens as the signal gets cleaner. Treat prior sd as your "confidence dial": narrow = dogmatic, wide = humble.

[Interactive widget: Play with prior confidence. Controls: prior standard deviation s0 (%, on quarterly r; slider from humble to dogmatic), prior mean m0 (%), base NAV V_{q-1}. Sample readout (Normal-Normal): E[r|y] ≈ 1.44%, sd ≈ 0.61%.]

Posterior combines the prior with active signals. Narrower priors move less; stronger data move more.

Setup and notation

Let $V_q$ be NAV at quarter end $q$. Define the quarterly log return

r_q = \log\!\left(\frac{V_q}{V_{q-1}}\right).

We use log returns because they add across time: if monthly logs are $r_1, r_2, r_3$, then $r_q = r_1 + r_2 + r_3$. That makes sequential updating algebraic instead of acrobatic.
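A quick numerical check of that additivity, as a sketch with made-up monthly simple returns (the numbers are assumptions for illustration only):

```python
import math

# Hypothetical monthly simple returns (illustrative, not from the post).
simple = [0.010, -0.005, 0.020]

# Monthly log returns: r_m = log(1 + R_m).
logs = [math.log1p(R) for R in simple]

# The quarterly log return is the plain sum of the monthly logs...
r_q = sum(logs)

# ...and it equals the log of the compounded growth factor.
growth = math.prod(1.0 + R for R in simple)
assert math.isclose(r_q, math.log(growth))

# Simple returns do not add exactly: their sum misses the compounding terms.
print(f"r_q = {r_q:.6f}, sum of simple returns = {sum(simple):.6f}")
```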

During the quarter, partial evidence trickles in:

exact or near-final month 1,
provisional month 2,
proxies for month 3 (sector ETFs, macro surprises, peer marks).

Our task: infer $r_q$ continuously and map it into a distribution for $V_q$.

NAV mapping. If $r_q \sim \mathcal{N}(m, s^2)$, then $V_q \mid \text{data}$ is lognormal with

\mathbb{E}[V_q \mid \text{data}] = V_{q-1}\,\exp\big(m + \tfrac{1}{2}s^2\big).

\text{95\% band} \;\approx\; \big[\,V_{q-1}\,\exp\big(m - 1.96\,s\big),\; V_{q-1}\,\exp\big(m + 1.96\,s\big)\,\big].
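The mapping above is small enough to sketch directly. The helper name `nav_summary` is hypothetical, the inputs reuse the posterior readout quoted in this post (m = 1.44%, s = 0.61%), and a base NAV of 100 is an assumption:

```python
import math

def nav_summary(v_prev, m, s, z=1.96):
    """Map r_q ~ N(m, s^2), a log return, to a lognormal summary of V_q."""
    mean = v_prev * math.exp(m + 0.5 * s**2)   # lognormal mean
    median = v_prev * math.exp(m)              # lognormal median
    lo = v_prev * math.exp(m - z * s)          # lower band edge
    hi = v_prev * math.exp(m + z * s)          # upper band edge
    return mean, median, (lo, hi)

# m and s are in log units (1.44% -> 0.0144); V_{q-1} = 100 is assumed.
mean, median, (lo, hi) = nav_summary(100.0, 0.0144, 0.0061)
print(f"E[V]={mean:.2f} median={median:.2f} 95% band=[{lo:.2f}, {hi:.2f}]")
```

Passing `z=1.645` instead would give a 5–95% band rather than the 95% band. The mean sits slightly above the median because of the $\tfrac{1}{2}s^2$ convexity term.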

The minimal model: one latent, one signal

Start with one belief, one clue.

Prior: $r \sim \mathcal{N}(m_0, s_0^2)$
Signal: $y \mid r \sim \mathcal{N}(r, s_y^2)$

Posterior:

s_1^{-2} = s_0^{-2} + s_y^{-2},\qquad m_1 = s_1^2 \left(s_0^{-2} m_0 + s_y^{-2} y\right).

This is a precision-weighted average. Intuition: if the signal is twice as precise (half the variance) as the prior, it gets twice the weight.
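A minimal sketch of this update in Python. The function name `normal_update` and the numbers are illustrative assumptions, not part of the post's widgets:

```python
def normal_update(m0, s0, y, sy):
    """Conjugate Normal-Normal update; returns the posterior (mean, sd)."""
    prec0, prec_y = 1.0 / s0**2, 1.0 / sy**2   # precision = 1 / variance
    s1_sq = 1.0 / (prec0 + prec_y)             # posterior variance
    m1 = s1_sq * (prec0 * m0 + prec_y * y)     # precision-weighted mean
    return m1, s1_sq**0.5

# Hypothetical numbers, in percent: prior 2.0 +/- 1.0, signal 1.0 +/- 0.5.
m1, s1 = normal_update(2.0, 1.0, 1.0, 0.5)
print(m1, s1)
```

Here the signal has four times the prior's precision (sd 0.5 against 1.0), so it gets four times the weight: the posterior mean is 0.2 × 2.0 + 0.8 × 1.0 = 1.2, and the posterior sd is below both input sds.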

Common pitfalls.

Do not double-count the same piece of information via multiple "signals".
Keep units straight: work in logs when you add over months, and convert to percents only for display.

Interactive — Signals and posterior (start simple). Set one signal equal to a clean month-1 return, vary its sd, and watch the posterior mean migrate toward $y$ while the band tightens.

[Interactive widget: Signals. Three signals (Month 1 exact, Month 2 exact, Month 3 proxy), each with an Active toggle and fields a (loading), b (offset), y (obs, %), s (sd, %), and Stage. Sample readout: NAV E[V] ≈ 101.45, median ≈ 101.45, 5–95% ≈ [100.43, 102.49].]

Multi-signal extension (conjugate Normal)

Now allow many clues that linearly load on $r$:

y_k \mid r \sim \mathcal{N}(a_k r + b_k, s_k^2),\quad k=1,\dots,K.

Stack in matrix form: $y = A r + b + \epsilon$ with $A = [a_1,\dots,a_K]^\top$, $\epsilon \sim \mathcal{N}(0, R)$, and $R=\mathrm{diag}(s_k^2)$. The posterior remains Normal:

s_N^2 = \left(s_0^{-2} + A^\top R^{-1} A\right)^{-1},\qquad m_N = s_N^2\left(s_0^{-2} m_0 + A^\top R^{-1}(y - b)\right).

Why this matters. Different proxies carry different loadings and noise levels. With diagonal $R$, signal $k$ adds precision $a_k^2/s_k^2$, so a sector ETF with $a_k \approx 1$ and small $s_k$ will move beliefs more than a noisy peer print with $a_k \approx 0.4$.
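Because $r$ is a scalar and $R$ is diagonal, the matrix formula collapses to running sums. The sketch below assumes that structure; `multi_signal_update` and the signal values are hypothetical:

```python
def multi_signal_update(m0, s0, signals):
    """signals: iterable of (a, b, y, s) with y | r ~ N(a*r + b, s^2)."""
    prec = 1.0 / s0**2              # running precision, starts at the prior
    num = m0 / s0**2                # running precision-weighted numerator
    for a, b, y, s in signals:
        prec += a * a / s**2        # one term of A^T R^-1 A (diagonal R)
        num += a * (y - b) / s**2   # one term of A^T R^-1 (y - b)
    s_n_sq = 1.0 / prec
    return s_n_sq * num, s_n_sq**0.5

# Hypothetical signals, in percent.
signals = [
    (1.0, 0.0, 1.5, 0.6),   # clean proxy: loading near 1, small sd
    (0.4, 0.0, 0.9, 1.0),   # noisy peer print: loading 0.4
]
m_n, s_n = multi_signal_update(2.0, 1.5, signals)
print(f"m_N = {m_n:.3f}%, s_N = {s_n:.3f}%")
```

The first signal contributes precision 1/0.36 ≈ 2.78 and the second only 0.16, which is why the posterior mean stays close to the clean proxy's 1.5.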

NAV expectation.

\mathbb{E}[V_q \mid \text{data}] = V_{q-1}\,\exp\!\left(m_N + \tfrac{1}{2}s_N^2\right).

Interactive — Multiple signals. Toggle $a_k$ , $b_k$ , $y_k$ , and $s_k$ to see how weights reallocate when you add or remove evidence. The implied NAV band updates in place.

[Interactive widget: Signals, with the same three-signal panel and NAV readout as above.]

Sequential updating through the quarter

Evidence arrives in order, but conjugate Normal updates commute: updating with $y_1$ then $y_2$ yields the same posterior as updating once with $(y_1,y_2)$ together. That lets you treat the quarter as a loop:

T1 (after month 1): prior $(m_0,s_0^2)$ updated by $y_1$ gives $(m_1,s_1^2)$
T2: update by $y_2$ to $(m_2,s_2^2)$
T3: add proxies for month 3
T4: final mark collapses uncertainty

Order-invariance proof sketch. With signals that are independent given $r$, the posterior is proportional to the prior times the product of the Gaussian likelihoods. A product of Gaussians in $r$ is Gaussian, and multiplication is commutative, so the update order does not matter.
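The claim is easy to check numerically. This sketch uses a hypothetical `update` helper and made-up signals, and compares both orderings:

```python
def update(m, s, a, b, y, sy):
    """One linear-Gaussian update of N(m, s^2) with y | r ~ N(a*r + b, sy^2)."""
    prec = 1.0 / s**2 + a * a / sy**2
    mean = (m / s**2 + a * (y - b) / sy**2) / prec
    return mean, prec**-0.5

prior = (2.0, 1.5)                  # hypothetical (m0, s0), in percent
y1 = (1.0, 0.0, 1.8, 1.0)           # (a, b, y, s) for signal 1
y2 = (1.0, 0.0, 1.2, 0.9)           # (a, b, y, s) for signal 2

m12, s12 = update(*update(*prior, *y1), *y2)   # y1 first, then y2
m21, s21 = update(*update(*prior, *y2), *y1)   # y2 first, then y1

# Both orders agree up to floating-point rounding.
assert abs(m12 - m21) < 1e-9 and abs(s12 - s21) < 1e-9
```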

Interactive — Timeline scrubber. Scrub T1 to T4 and see the band narrow. The badges report posterior mean and sd at each stage.

[Interactive widget: Sequential updates. Stages: T1 after month 1, T2 after month 2, T3 before close, T4 final print. Badges report the posterior mean and sd at each stage.]

As you move from T1 to T4, evidence accumulates and the distribution tightens. T4 can be modeled as an almost-exact observation.

Stage	Mean r (%)	SD r (%)
T1	1.79	0.89
T2	1.50	0.63
T3	1.44	0.61
T4	1.44	0.61

Constructing the prior: reputation and regime

A good prior is earned, not guessed.

Reputation as pseudo-data. Suppose a manager has track record mean $\bar r$ and variance $s_{\text{track}}^2$ over $n$ quarters. Use an effective sample size $n_{\text{eff}} \le n$ to discount for drift or luck. Then

m_0 = \lambda\,\bar r + (1-\lambda)\,m_{\text{peer}},\qquad s_0^2 = \frac{s_{\text{track}}^2}{n_{\text{eff}}},\quad \lambda \in [0,1].
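A sketch of that prior builder. The function name and all inputs are hypothetical, and $\lambda$ is treated here as a user-supplied weight, since the formula does not tie it to $n_{\text{eff}}$:

```python
import math

def build_prior(r_bar, s_track, n_eff, m_peer, lam):
    """Reputation prior: shrink the track-record mean toward a peer rate."""
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie in [0, 1]")
    m0 = lam * r_bar + (1.0 - lam) * m_peer   # blended prior mean
    s0 = s_track / math.sqrt(n_eff)           # sd of the mean over n_eff quarters
    return m0, s0

# Hypothetical inputs, in percent per quarter.
m0, s0 = build_prior(r_bar=2.0, s_track=5.5, n_eff=12, m_peer=1.3, lam=0.5)
print(f"m0 = {m0:.2f}%, s0 = {s0:.2f}%")
```

Halving `n_eff` widens `s0` by a factor of $\sqrt{2}$, which is the "discount for drift or luck" in numeric form.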

Regime priors. Add macro features $f$ with a ridge prior on coefficients:

r \mid f \sim \mathcal{N}(m_{\text{base}} + \beta^\top f,\ s_{\text{base}}^2),\qquad \beta \sim \mathcal{N}(0,\ \tau^2 I).

This is just regularized regression; Black-Litterman is a portfolio-size cousin.
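To make the "regularized regression" reading concrete, here is a sketch of the posterior mean of $\beta$ on synthetic data. It assumes NumPy is available, that $m_{\text{base}}$ and $s_{\text{base}}$ are known, and that the function name and data are purely illustrative:

```python
import numpy as np

def ridge_posterior_mean(F, r, m_base, s_base, tau):
    """Posterior mean of beta for r ~ N(m_base + F beta, s_base^2 I),
    with prior beta ~ N(0, tau^2 I)."""
    k = F.shape[1]
    penalty = (s_base / tau) ** 2            # ridge strength = noise var / prior var
    lhs = F.T @ F + penalty * np.eye(k)
    rhs = F.T @ (r - m_base)
    return np.linalg.solve(lhs, rhs)

rng = np.random.default_rng(0)               # synthetic data only
F = rng.normal(size=(40, 2))                 # 40 quarters, 2 macro features
beta_true = np.array([0.8, -0.3])
r = 1.5 + F @ beta_true + rng.normal(scale=1.0, size=40)

beta_tight = ridge_posterior_mean(F, r, 1.5, 1.0, tau=0.1)    # strong shrinkage
beta_loose = ridge_posterior_mean(F, r, 1.5, 1.0, tau=10.0)   # close to least squares
print(beta_tight, beta_loose)
```

A small $\tau$ pulls the coefficients toward zero, so macro features barely tilt the prior; a large $\tau$ lets the regression speak.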

Interactive — Prior builder. Feed in a track record and a peer rate, set $n_{\text{eff}}$ , and push $(m_0,s_0)$ into the rest of the post.

[Interactive widget: Prior builder: reputation and regime. Inputs: manager average quarterly return r̄ (%), track record sd (%), quarters in record (n), reputation slider (low rep to high rep), peer base rate m_peer (%). Sample output: m0 ← 1.65%, s0 ← 1.59%, n_eff = 12. Button: Apply to prior.]

We treat past performance as pseudo-observations and shrink toward a peer base rate. Smaller effective samples widen s0; tighter priors imply stronger conviction.

Robustness: when reality is not Gaussian

Marks can jump, proxies can be wrong, and volatility is not constant.

Unknown variance. A Normal-Inverse-Gamma prior on $(r,\sigma^2)$ yields a Student-t posterior on $r$. Fatter tails slow overreaction to outliers.
Outlier-resistant likelihood. Model $y_k \mid r \sim t_\nu(a_k r + b_k, s_k^2)$. Small $\nu$ down-weights wild points.
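With a Student-t likelihood the posterior is no longer conjugate, but for a scalar $r$ a grid is enough. The sketch below is one simple way to do it, not necessarily how the widget computes it; the function names, the grid range, and the signals (including one deliberately wild print) are assumptions:

```python
import math

def log_normal_pdf(x, mu, s):
    return -0.5 * ((x - mu) / s) ** 2 - math.log(s * math.sqrt(2 * math.pi))

def log_t_pdf(x, mu, s, nu):
    """Log density of a location-scale Student-t with nu degrees of freedom."""
    z = (x - mu) / s
    c = math.lgamma((nu + 1) / 2) - math.lgamma(nu / 2) - 0.5 * math.log(nu * math.pi)
    return c - math.log(s) - 0.5 * (nu + 1) * math.log1p(z * z / nu)

def grid_posterior(m0, s0, signals, loglik, lo=-10.0, hi=10.0, n=4001):
    """Posterior mean and sd of r on a uniform grid (non-conjugate case)."""
    grid = [lo + (hi - lo) * i / (n - 1) for i in range(n)]
    logp = [log_normal_pdf(r, m0, s0)
            + sum(loglik(y, a * r + b, s) for a, b, y, s in signals)
            for r in grid]
    top = max(logp)
    w = [math.exp(v - top) for v in logp]          # unnormalized weights
    total = sum(w)
    mean = sum(r * wi for r, wi in zip(grid, w)) / total
    var = sum((r - mean) ** 2 * wi for r, wi in zip(grid, w)) / total
    return mean, math.sqrt(var)

# Hypothetical signals (a, b, y, s) in percent: two agree, one is a wild print.
signals = [(1.0, 0.0, 1.4, 0.8), (1.0, 0.0, 1.6, 0.8), (1.0, 0.0, 8.0, 0.8)]
gauss = grid_posterior(1.5, 1.5, signals, log_normal_pdf)
robust = grid_posterior(1.5, 1.5, signals, lambda y, mu, s: log_t_pdf(y, mu, s, nu=3))
print("Gaussian:", gauss, "Student-t:", robust)
```

Under the Gaussian likelihood the outlier drags the posterior mean well above the two agreeing signals; with $\nu = 3$ the same print is largely discounted.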

Interactive — Likelihood chooser. Flip between Gaussian and t, vary $\nu$ , and note how the posterior spreads when you admit heavy tails.

[Interactive widget: Robustness to tails and noise. Options: Gaussian likelihood or Student-t likelihood. Controls: ν (df for Student-t), plot range r_min (%) and r_max (%). Sample readout: E[r|y] ≈ 1.44%, sd ≈ 0.61%.]

Lower ν thickens tails, putting more probability on outliers and slowing overreaction to noisy signals.

State-space step-up: monthly latent returns (Kalman)

We now model the monthly path, not just the quarterly sum. The latent state is

x = \begin{bmatrix} r_1 \\ r_2 \\ r_3 \end{bmatrix},\qquad r_q = \mathbf{1}^\top x.

Process (prior). Each month has baseline mean plus macro tilt and correlated uncertainty:

r_m \sim \mathcal{N}(\mu_m,\ s_{\text{proc}}^2),\quad \mathrm{Cov}(x) = s_{\text{proc}}^2\,C(\rho),

where $C(\rho)$ is a $3\times 3$ Toeplitz correlation matrix with $C_{ij}=\rho^{|i-j|}$. This makes months related but not identical.

Observations.

exact months ($m = 1, 2$): $y_m = r_m + \epsilon_m$, with small but nonzero sd $s_{\text{obs12}}$ (near-final, not perfect),
month-3 proxy: $y_3 = r_3 + e_3$, $e_3 \sim \mathcal{N}(0, s_{\text{obs3}}^2)$,
optional final sum: $y_{\Sigma} = r_1 + r_2 + r_3 + \eta$, with sd $s_{\Sigma}$.

Kalman update (scalar measurement). For a measurement $y = Hx + \varepsilon$, $\varepsilon \sim \mathcal{N}(0,R)$, the update is

K = P^- H^\top (H P^- H^\top + R)^{-1},\quad m^+ = m^- + K(y - H m^-),

P^+ = (I - K H)\,P^-\,(I - K H)^\top + K R K^\top\quad \text{(Joseph form, PSD safe)}.

What the gain $K$ does. $K$ increases when $P^-$ is large (you were uncertain) or $R$ is small (measurement is clean). That is, move fast when you were unsure and the new data are good.

Total-quarter uncertainty. The variance of the quarterly return is

\mathrm{Var}(r_q) = \mathbf{1}^\top P \mathbf{1}.
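The pieces above fit in a short NumPy sketch: a Toeplitz prior, a scalar-measurement update in Joseph form, and the quarter-sum variance after each stage. NumPy is assumed to be available; the parameter names mirror the widget labels (`s_proc`, `s_obs12`, `s_obs3`), while the numeric values and function names are hypothetical:

```python
import numpy as np

def toeplitz_corr(rho, n=3):
    """Correlation matrix with C_ij = rho**|i-j|."""
    idx = np.arange(n)
    return rho ** np.abs(idx[:, None] - idx[None, :])

def kalman_update(m, P, H, y, R):
    """Scalar-measurement Kalman update with the Joseph-form covariance."""
    H = np.atleast_2d(H).astype(float)          # shape (1, n)
    S = (H @ P @ H.T).item() + R                # innovation variance (scalar)
    K = (P @ H.T) / S                           # gain, shape (n, 1)
    m_new = m + K[:, 0] * (y - (H @ m).item())
    A = np.eye(len(m)) - K @ H
    P_new = A @ P @ A.T + R * (K @ K.T)         # Joseph form
    return m_new, P_new

# Hypothetical settings, all in percent.
mu, s_proc, rho = 0.5, 1.0, 0.3
s_obs12, s_obs3 = 0.1, 0.8
m = np.full(3, mu)
P = s_proc**2 * toeplitz_corr(rho)

steps = [([1, 0, 0], 0.9, s_obs12**2),          # T1: near-final month 1
         ([0, 1, 0], 0.2, s_obs12**2),          # T2: near-final month 2
         ([0, 0, 1], 0.4, s_obs3**2)]           # T3: month-3 proxy
ones = np.ones(3)
sd_path = [float(np.sqrt(ones @ P @ ones))]     # prior sd of r_q
for H, y, R in steps:
    m, P = kalman_update(m, P, H, y, R)
    sd_path.append(float(np.sqrt(ones @ P @ ones)))

print("E[r_q] =", float(ones @ m), "sd path =", sd_path)
```

A final-sum observation at T4 would use `H = [1, 1, 1]` with `R = s_sum**2`. Because the months are correlated, observing month 1 already tightens the beliefs about months 2 and 3.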

Interactive — Monthly state nowcast.

Move through stages T1 to T4; see how exact months collapse their bands and the sum variance shrinks.
Increase $\rho_{\text{proc}}$ to couple months more tightly.
Use nonzero $s_{\text{obs12}}$ so "exact" months are near-final rather than perfect; this prevents degeneracy and is more realistic.

[Interactive widget: Kalman nowcast (3-month state). Toggles: include final sum at T4, use macro tilts. Controls: base monthly mean mu, s_proc (monthly sd, %), rho_proc (prior corr), s_obs12 (M1/M2 sd, %), s_obs3 (M3 proxy sd, %), s_sum (final mark sd, %), y1 (M1, %), y2 (M2, %), y3 proxy (M3, %), quarter sum y_sum (%, T4). Sample readout: r_sum mean ≈ 1.27%, sd ≈ 0.92%, E[V] ≈ 101.29, 5–95% ≈ [99.46, 103.13].]

Non-zero prior correlation (rho_proc) and finite observation noise (s_obs12) prevent the geometry from degenerating to identity. The Joseph update keeps P PSD and numerically stable.

[Figure: Measurement geometry. Panels: H selects which r_m you observe; R sets measurement noise; P shown as correlation ρ for readability.]

We visualize correlation rather than raw covariance to center the color scale at 0 and emphasize structure over magnitude. H is a simple loading vector; R is the scalar noise level for the current measurement.

How to read the geometry.

The correlation heatmap shows structure across months; off-diagonals indicate shared shocks.
The $H$ bar shows which latent you are observing at this stage (e.g., month 2 or a sum).
$R$ is the observation noise; larger $R$ yields smaller $K$ and gentler moves.

Cross-fund pooling: hierarchical Bayes

You rarely follow one fund. To calibrate beliefs across many, you share statistical strength.

Model quarterly fund returns $r_{iq}$ with fund-specific means $\mu_i$ and idiosyncratic noise $\sigma_i$:

r_{iq} \sim \mathcal{N}(\mu_i + \beta^\top f_q,\ \sigma_i^2), \qquad \mu_i \sim \mathcal{N}(\mu_{\text{sector}},\ \tau^2).

Given a fund summary with sample mean $\bar r_i$, variance $s_i^2$, and sample size $n_i$, the posterior for $\mu_i$ is

\mu_i \mid \text{data} \sim \mathcal{N}\!\left( \underbrace{\omega_i \bar r_i + (1-\omega_i)\mu_{\text{sector}}}_{\text{shrunken mean}},\ \underbrace{\left(\frac{n_i}{s_i^2} + \frac{1}{\tau^2}\right)^{-1}}_{\text{posterior variance}} \right),

with shrinkage weight

\omega_i = \frac{n_i / s_i^2}{n_i / s_i^2 + 1/\tau^2}.

Interpretation: data-poor or high-volatility funds (small $n_i/s_i^2$) shrink more. This is James-Stein in fund clothing.
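The shrinkage formulas translate line for line. In this sketch the function name, the three funds, and the values of `mu_sector` and `tau` are hypothetical:

```python
def shrink(r_bar, s, n, mu_sector, tau):
    """Posterior mean and sd of mu_i under the Normal hierarchical model."""
    data_prec = n / s**2                 # precision of the fund's sample mean
    prior_prec = 1.0 / tau**2            # precision of the sector-level prior
    omega = data_prec / (data_prec + prior_prec)
    post_mean = omega * r_bar + (1.0 - omega) * mu_sector
    post_sd = (data_prec + prior_prec) ** -0.5
    return post_mean, post_sd, omega

# Hypothetical funds: (name, sample mean %, sample sd %, quarters).
funds = [("A", -7.0, 11.0, 12), ("B", -7.0, 11.0, 48), ("C", -7.0, 5.0, 12)]
for name, r_bar, s, n in funds:
    post, sd, w = shrink(r_bar, s, n, mu_sector=0.0, tau=4.0)
    print(f"{name}: omega={w:.2f} post={post:.2f}% sd={sd:.2f}%")
```

All three funds report the same raw mean, yet fund A (short record, high volatility) is pulled hardest toward the sector anchor, while B (longer record) and C (lower volatility) keep more of their own signal.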

Interactive — Shrinkage in action. Move $\tau$ to tune pooling strength. Long arrows mean heavy shrinkage. Try increasing volatility for a couple of funds to see how their posteriors retreat toward the sector anchor.

[Interactive widget: Cross-fund pooling. Controls: sector mean mu_sector (%), tau slider (tight to loose; controls pooling strength), number of funds (n). The view shrinks extremes toward mu_sector.]

Fund	n	sigma (%)	raw (%)	post (%)	|shrink| (pp)
Fund 2	12	10.85	-7.01	-4.69	2.31
Fund 7	8	6.86	7.55	6.34	1.21
Fund 8	14	7.64	-4.94	-4.06	0.88
Fund 3	27	11.87	-3.83	-2.96	0.87
Fund 9	22	8.97	-5.02	-4.23	0.79

Data-poor or high-vol funds shrink more. Pooling trades narrative for calibration.

From belief to action

A posterior is not an answer; it is a control surface for choices. Here are three operational views:

Tail odds. For a loss threshold $L$ in log terms, compute $P(r_q \le L)$ and decide if rebalancing or hedging triggers.
Beat-the-benchmark odds. If the benchmark quarterly log return is $r^*$, then $P(r_q > r^*)$ quantifies the chance of outperformance.
Risk-adjusted rule. Define risk-adjusted return

\mathrm{RA} = m - \lambda s,

with $m$ the posterior mean and $s$ the posterior sd. Choose thresholds $t_{\text{add}}$ and $t_{\text{trim}}$: Add if $\mathrm{RA} \ge t_{\text{add}}$, Trim if $\mathrm{RA} \le t_{\text{trim}}$, else Hold.

Contours. Lines of constant RA at level $t$ satisfy $s = (m - t)/\lambda$; a larger $\lambda$ penalizes uncertainty more, so the same $\mathrm{RA}$ then requires a higher mean for a given sd.
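The three views can be computed from a Normal posterior with the standard library alone. In this sketch the posterior (m = 1.44, s = 0.61, in percent) comes from the readout quoted in this post, while `decide`, the thresholds, the loss level, and the benchmark are hypothetical; everything is kept in percent units for consistency:

```python
from statistics import NormalDist

def decide(m, s, lam, t_add, t_trim, loss=None, bench=None):
    """Summarize a Normal posterior on r_q as odds plus an Add/Hold/Trim call."""
    post = NormalDist(mu=m, sigma=s)
    ra = m - lam * s                                  # risk-adjusted return
    action = "ADD" if ra >= t_add else "TRIM" if ra <= t_trim else "HOLD"
    out = {"RA": ra, "action": action}
    if loss is not None:
        out["P(r<=L)"] = post.cdf(loss)               # tail odds
    if bench is not None:
        out["P(r>r*)"] = 1.0 - post.cdf(bench)        # beat-the-benchmark odds
    return out

# lam, thresholds, loss level and benchmark below are assumptions.
result = decide(1.44, 0.61, lam=1.0, t_add=0.5, t_trim=-0.5, loss=0.0, bench=1.0)
print(result)
```

With $\lambda = 1$ the risk-adjusted return is 1.44 − 0.61 = 0.83 pp, which clears the assumed `t_add` of 0.5 and yields an Add call.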

Value of information (quick approximation). If the next signal reduces variance from $s^2$ to $s^2 - \Delta$ , the RA improves by about $(\lambda/2)\,\Delta/s$ for small changes. Expensive information should clear this expected improvement hurdle.
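The approximation follows from a first-order expansion. As a sketch that holds the posterior mean $m$ fixed (an assumption; a real signal also moves $m$), write the new sd as $s' = \sqrt{s^2 - \Delta}$:

s' = s\sqrt{1 - \Delta/s^2} \;\approx\; s - \frac{\Delta}{2s},\qquad \mathrm{RA}' - \mathrm{RA} = \lambda\,(s - s') \;\approx\; \frac{\lambda}{2}\,\frac{\Delta}{s}.

For example, with hypothetical values $\lambda = 1$, $s = 0.61$ and $\Delta = 0.05$, the approximation gives about $0.041$ against an exact $\lambda(s - s') \approx 0.042$.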

Interactive — Decision map. The heatmap shows Add/Hold/Trim regions in the nonnegative quadrant. The cross marks your current posterior. Tune $\lambda$ and thresholds to match mandate and risk appetite.

[Interactive widget: From beliefs to actions. Controls: lambda (pp cost per 1 sd), t_add (pp), t_trim (pp), x max (%), y max (%). Sample readout: Recommend ADD, RA = 0.83 pp.]

The map shows only non-negative means and standard deviations. Adjust the axis maxima to fit your fund’s posterior scale.

Diagnostics and epistemic humility

Beliefs need audit trails. Three checks help keep you honest.

Log score. The average log predictive density $\frac{1}{N}\sum_i \log p(r_i \mid \mu_i, s_i)$ rewards sharp and calibrated forecasts.
Brier score. For a binary event such as $r<0$, this is the mean squared error between predicted probabilities and 0/1 outcomes; lower is better.
Calibration curves and PIT. Probability integral transforms $u_i = F_i(r_i)$ should be uniform if calibrated; reliability curves should sit on the diagonal.
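All three checks are a few lines each for Normal forecasts. The sketch below runs them on simulated data, where forecasts are correct by construction and then deliberately biased; function names and simulation settings are assumptions:

```python
import math
import random
from statistics import NormalDist

def log_score(outcomes, mus, sds):
    """Average log predictive density under Normal forecasts."""
    return sum(math.log(NormalDist(m, s).pdf(r))
               for r, m, s in zip(outcomes, mus, sds)) / len(outcomes)

def brier_score(outcomes, mus, sds, threshold=0.0):
    """Mean squared error of P(r < threshold) against the 0/1 outcome."""
    return sum((NormalDist(m, s).cdf(threshold) - (r < threshold)) ** 2
               for r, m, s in zip(outcomes, mus, sds)) / len(outcomes)

def pit(outcomes, mus, sds):
    """Probability integral transforms u_i = F_i(r_i)."""
    return [NormalDist(m, s).cdf(r) for r, m, s in zip(outcomes, mus, sds)]

# Simulated data only, in percent.
random.seed(0)
n = 2000
mus = [random.gauss(1.0, 1.0) for _ in range(n)]
sds = [1.0] * n
outcomes = [random.gauss(m, s) for m, s in zip(mus, sds)]
biased = [m + 1.0 for m in mus]                 # inject a +1 pp mean bias

print("log score:", log_score(outcomes, mus, sds),
      "vs biased:", log_score(outcomes, biased, sds))
print("Brier:", brier_score(outcomes, mus, sds),
      "vs biased:", brier_score(outcomes, biased, sds))
print("mean PIT:", sum(pit(outcomes, mus, sds)) / n)
```

The calibrated forecasts should score better on both measures, and their PIT values should average close to 0.5; a histogram of `pit(...)` for the biased forecasts would pile up on one side.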

Interactive — Calibration dashboard. Simulate forecasts and outcomes. Inject mean bias or noise mis-specification and see how the PIT warps and scores deteriorate. Then fix your priors or likelihoods upstream.

[Interactive widget: Calibration and scoring. Controls: samples (n), bias on mu (pp), noise scale (x sd), sd floor (%). Sample readout: log score ≈ 1.799, Brier ≈ 0.217.]

A flat PIT and a reliability curve on the diagonal indicate well-calibrated beliefs. If not, widen priors or revisit likelihood assumptions.

Implementation notes

Treat each signal’s uncertainty $s_k$ as first-class metadata.
Prevent leakage: never let final marks influence interim updates.
Unit test the algebra: conjugate updates, Kalman recursions (including Joseph form), and posterior predictive checks.
Beware double counting: a peer print and an ETF move may express the same shock; correlate or down-weight accordingly.

The progression — simple average -> multi-signal -> sequential -> dynamic -> pooled — mirrors the evolution of belief itself: from a single guess to a system that learns, checks itself, and earns its confidence.