Knowledge · concepts

The learning ladder

The vocabulary the research rests on, grouped from foundations to data discipline. Each concept lists the working papers that reference it.

Stock-level risk (bottom-up)

Factor model2 papers
A linear model decomposing an asset's return into common factors (market, sector, …) plus a residual. The foundation of the bottom-up unit.
Beta (β)7 papers
The slope of an asset's return regressed on a factor; how much the asset moves per unit factor move.
Residual9 papers
The part of a stock's return left after the factor model is removed. RiskModels emphasizes the residual as where selection skill lives.
Idiosyncratic4 papers
Synonym for residual in classical finance literature. We standardize on *residual*.
Alpha (α)6 papers
Skill-attributable excess return at the position level.
Hedge ratio1 paper
The factor exposure you'd offset to neutralize a position's exposure to that factor.
Variance decomposition1 paper
Splitting a stock's variance into shares attributable to each factor and the residual. Sums to 1.0.
Return attribution2 papers
Splitting a stock's *realized return* into contributions from each factor. Distinct from variance: a name can be variance-dominated by market yet return-driven by residual.

From stocks to portfolios (aggregation)

Position weight
A holding's share of portfolio value; the weight that scales its risk and return contribution when positions aggregate.
Holdings look-through1 paper
Rolling individual holdings up to portfolio-level factor exposure by weight — the bridge from stock risk to portfolio risk.
Peer cohort
A cap-weighted set of similar securities (typically by sector and sub-sector) used as the comparison universe for relative analyses.
Covariance2 papers
How two positions move together; why portfolio risk is not just the weighted sum of position risks.
Diversification
The risk reduction from holding imperfectly-correlated positions; the gap between summed position risk and actual portfolio risk.

Portfolio performance & risk decomposition

Portfolio variance decomposition
Splitting total portfolio variance into factor and residual shares, accounting for cross-position covariance. The portfolio-level analog of the single-stock decomposition.
Risk contribution
Each position's (or factor's) share of *total portfolio risk*, accounting for correlation. Component contributions sum to the total — unlike standalone position risk.
Portfolio attribution
Splitting the portfolio's realized return into contributions from market, sector, subsector, and residual selection.
Active share5 papers
The fraction of a portfolio that differs from its benchmark; how much of the book is an active bet versus the index.
Tracking error
The volatility of portfolio-minus-benchmark return; the size of the active risk.
Benchmark fit
How well a portfolio's returns are explained by a benchmark (R², tracking error) — the quality of the comparison.
Sharpe ratio3 papers
Return per unit of volatility.
Drawdown1 paper
Peak-to-trough decline. *Max drawdown* is the worst observed.

Point-in-time data discipline

Point-in-time (PIT)6 papers
A dataset that respects when each fact was *knowable*, not just when it was *true*. Bloomberg and FactSet sell PIT databases as a premium product.
Look-ahead bias3 papers
Letting a backtest see information that wasn't yet public on the simulated date. The most common research bug.
Filing lag2 papers
The gap between when a fund's holdings reflected reality and when they became publicly knowable via SEC filings. ~45 days for 13F, ~60 days for N-PORT.
Survivorship bias1 paper
Backtesting on a universe that excludes delisted or failed entities. Produces optimistic results.

Regulatory data

13F
SEC filing (Form 13F-HR) listing institutional managers' US-listed equity holdings. Quarterly, 45-day lag.
N-PORT8 papers
SEC filing reporting registered fund (mutual fund / ETF) holdings monthly. Only every third month is publicly disclosed.
N-CEN8 papers
Annual SEC filing for registered funds; carries adviser relationships and share-class detail.
CIK2 papers
The SEC's identifier for any filer (manager, fund, adviser).
Security master
The canonical record of *what is this security and what was it called when?* — the foundation under any survivorship-corrected analysis.

What RiskModels publishes

RiskModels API1 paper
The public REST + MCP surface for decomposition, attribution, and exposure analyses. Documented at [riskmodels.app](https://riskmodels.app).
`riskmodels-py` (SDK)
The Python client for the API. `pip install riskmodels-py`.
Position snapshot
A one-page institutional analysis of a single equity: factor decomposition, attribution, residual quality, peer context.
Fund snapshot
A one-page tearsheet for a single fund: portfolio attribution, holdings, NAV, benchmark fit.
AOM
Analysis Object Model — our publicly-versioned protocol for describing analyses (subject + lens + view) before they hit any endpoint.

How we keep results trustworthy

Deterministic2 papers
Same input → byte-identical output on re-run. Required for cache correctness and auditability.
Provenance
The metadata recording how an artifact was produced: subject, parameters, as-of date, model version.
Render-once
Web, PDF, PNG, and agent surfaces all read the same canonical artifact. Drift across surfaces is impossible by construction.

The RiskModels engine (ERM3)

ERM38 papers
Equity Risk Model v3 — the engine. Decomposes any US equity into market → sector → subsector → residual via a hierarchical orthogonal cascade.
Hierarchical cascade (L1/L2/L3)2 papers
L1 = market (SPY); L2 = sector ETF (e.g. XLK) fit on the L1 residual; L3 = subsector ETF (e.g. SOXX) fit on the L2 residual. Each level explains what the prior one left over.
Link beta (λ)
The coefficient that strips a higher level's exposure from a lower one, so each β captures only the *incremental* effect of its own level — no double-counting.
Explained risk (ER)1 paper
The variance share attributable to each level. ER(L1) + ER(L2) + ER(L3) + residual = 1.0 by construction, verified at runtime to ±0.1%.
Replication equation
The raw-ETF identity that reproduces a stock's return from its hedge legs — only tradeable ETF returns, no orthogonalization at trade time. Verified to ±0.1% for every stock, every date.
Robust beta (Huber-M)
Betas estimated by 252-day rolling Huber-M regression, robust to the fat tails (earnings gaps, single-day reclassifications) that distort ordinary least squares.
Vasicek shrinkage2 papers
L2/L3 betas shrunk toward the log-mcap-weighted peer mean within a 4-digit industry cohort; short-history and small-caps are pulled more. Market beta (L1) is not shrunk.
L-star rule5 papers
The L\* rule: a cost-aware, learned selector that picks the shallowest level whose expected out-of-sample variance reduction outweighs the added hedging cost of going deeper — describing a name at the coarsest level that still captures its risk. Scored point-in-time, so the dispatch is backtest-safe.
Geometric attribution bridge
Telescoping compound differences (market → sector → subsector → residual) so multi-period factor contributions sum exactly to the gross compound return — not the naive daily sum, which overstates by volatility drag.
Residual mean-reversion signal
A 5-day cumulative L3 residual return, z-scored by the stock's own 60-day residual volatility. A combo-input factor for multi-signal stacks, not a standalone strategy.

Stock-level risk (bottom-up)

Factor model2 papers

A linear model decomposing an asset's return into common factors (market, sector, …) plus a residual. The foundation of the bottom-up unit.

Beta (β)7 papers

The slope of an asset's return regressed on a factor; how much the asset moves per unit factor move.

Residual9 papers

The part of a stock's return left after the factor model is removed. RiskModels emphasizes the residual as where selection skill lives.

Idiosyncratic4 papers

Synonym for residual in classical finance literature. We standardize on *residual*.

Alpha (α)6 papers

Skill-attributable excess return at the position level.

Hedge ratio1 paper

The factor exposure you'd offset to neutralize a position's exposure to that factor.

Variance decomposition1 paper

Splitting a stock's variance into shares attributable to each factor and the residual. Sums to 1.0.

Return attribution2 papers

Splitting a stock's *realized return* into contributions from each factor. Distinct from variance: a name can be variance-dominated by market yet return-driven by residual.

From stocks to portfolios (aggregation)

Position weight

A holding's share of portfolio value; the weight that scales its risk and return contribution when positions aggregate.

Holdings look-through1 paper

Rolling individual holdings up to portfolio-level factor exposure by weight — the bridge from stock risk to portfolio risk.

Peer cohort

A cap-weighted set of similar securities (typically by sector and sub-sector) used as the comparison universe for relative analyses.

Covariance2 papers

How two positions move together; why portfolio risk is not just the weighted sum of position risks.

Diversification

The risk reduction from holding imperfectly-correlated positions; the gap between summed position risk and actual portfolio risk.

Portfolio performance & risk decomposition

Portfolio variance decomposition

Splitting total portfolio variance into factor and residual shares, accounting for cross-position covariance. The portfolio-level analog of the single-stock decomposition.

Risk contribution

Each position's (or factor's) share of *total portfolio risk*, accounting for correlation. Component contributions sum to the total — unlike standalone position risk.

Portfolio attribution

Splitting the portfolio's realized return into contributions from market, sector, subsector, and residual selection.

Active share5 papers

The fraction of a portfolio that differs from its benchmark; how much of the book is an active bet versus the index.

Tracking error

The volatility of portfolio-minus-benchmark return; the size of the active risk.

Benchmark fit

How well a portfolio's returns are explained by a benchmark (R², tracking error) — the quality of the comparison.

Sharpe ratio3 papers

Return per unit of volatility.

Drawdown1 paper

Peak-to-trough decline. *Max drawdown* is the worst observed.

Point-in-time data discipline

Point-in-time (PIT)6 papers

A dataset that respects when each fact was *knowable*, not just when it was *true*. Bloomberg and FactSet sell PIT databases as a premium product.

Look-ahead bias3 papers

Letting a backtest see information that wasn't yet public on the simulated date. The most common research bug.

Filing lag2 papers

The gap between when a fund's holdings reflected reality and when they became publicly knowable via SEC filings. ~45 days for 13F, ~60 days for N-PORT.

Survivorship bias1 paper

Backtesting on a universe that excludes delisted or failed entities. Produces optimistic results.

Regulatory data

13F

SEC filing (Form 13F-HR) listing institutional managers' US-listed equity holdings. Quarterly, 45-day lag.

N-PORT8 papers

SEC filing reporting registered fund (mutual fund / ETF) holdings monthly. Only every third month is publicly disclosed.

N-CEN8 papers

Annual SEC filing for registered funds; carries adviser relationships and share-class detail.

CIK2 papers

The SEC's identifier for any filer (manager, fund, adviser).

Security master

The canonical record of *what is this security and what was it called when?* — the foundation under any survivorship-corrected analysis.

What RiskModels publishes

RiskModels API1 paper

The public REST + MCP surface for decomposition, attribution, and exposure analyses. Documented at [riskmodels.app](https://riskmodels.app).

`riskmodels-py` (SDK)

The Python client for the API. `pip install riskmodels-py`.

Position snapshot

A one-page institutional analysis of a single equity: factor decomposition, attribution, residual quality, peer context.

Fund snapshot

A one-page tearsheet for a single fund: portfolio attribution, holdings, NAV, benchmark fit.

AOM

Analysis Object Model — our publicly-versioned protocol for describing analyses (subject + lens + view) before they hit any endpoint.

How we keep results trustworthy

Deterministic2 papers

Same input → byte-identical output on re-run. Required for cache correctness and auditability.

Provenance

The metadata recording how an artifact was produced: subject, parameters, as-of date, model version.

Render-once

Web, PDF, PNG, and agent surfaces all read the same canonical artifact. Drift across surfaces is impossible by construction.

The RiskModels engine (ERM3)

ERM38 papers

Equity Risk Model v3 — the engine. Decomposes any US equity into market → sector → subsector → residual via a hierarchical orthogonal cascade.

Hierarchical cascade (L1/L2/L3)2 papers

L1 = market (SPY); L2 = sector ETF (e.g. XLK) fit on the L1 residual; L3 = subsector ETF (e.g. SOXX) fit on the L2 residual. Each level explains what the prior one left over.

Link beta (λ)

The coefficient that strips a higher level's exposure from a lower one, so each β captures only the *incremental* effect of its own level — no double-counting.

Explained risk (ER)1 paper

The variance share attributable to each level. ER(L1) + ER(L2) + ER(L3) + residual = 1.0 by construction, verified at runtime to ±0.1%.

Replication equation

The raw-ETF identity that reproduces a stock's return from its hedge legs — only tradeable ETF returns, no orthogonalization at trade time. Verified to ±0.1% for every stock, every date.

Robust beta (Huber-M)

Betas estimated by 252-day rolling Huber-M regression, robust to the fat tails (earnings gaps, single-day reclassifications) that distort ordinary least squares.

Vasicek shrinkage2 papers

L2/L3 betas shrunk toward the log-mcap-weighted peer mean within a 4-digit industry cohort; short-history and small-caps are pulled more. Market beta (L1) is not shrunk.

L-star rule5 papers

The L\* rule: a cost-aware, learned selector that picks the shallowest level whose expected out-of-sample variance reduction outweighs the added hedging cost of going deeper — describing a name at the coarsest level that still captures its risk. Scored point-in-time, so the dispatch is backtest-safe.

Geometric attribution bridge

Telescoping compound differences (market → sector → subsector → residual) so multi-period factor contributions sum exactly to the gross compound return — not the naive daily sum, which overstates by volatility drag.

Residual mean-reversion signal

A 5-day cumulative L3 residual return, z-scored by the stock's own 60-day residual volatility. A combo-input factor for multi-signal stacks, not a standalone strategy.