US IV Asset Pricing Lab | QuantifiedTrader

Research motivation

Standard two-stage asset pricing tests (Fama–MacBeth) estimate factor betas in a first-stage time-series regression, then use those noisy estimates as regressors in a second-stage cross-section. This errors-in-variables (EIV) problem biases OLS risk-premium estimates even when the time series is long.

Researchers historically formed portfolios to diversify away estimation error, but grouping sacrifices cross-sectional information and can mask stock-level effects. Instrumental variables (IV) methods retain individual stocks as test assets while delivering N-consistent premium estimates as the number of stocks grows (Jegadeesh et al., 2019).

A Turkish-market study (1990–2022) applied this framework and found that classic and modern factors (CAPM, Fama–French, liquidity-adjusted LCAPM) often fail to price risk once firm characteristics are controlled — highlighting time-varying dynamics in emerging markets. This project asks whether similar patterns appear in US equities under the same econometric discipline.

The EIV problem and portfolio trade-off

Definition: EIV bias arises when independent variables (factor betas) are measured with error. In Fama–MacBeth, first-stage β̂ᵢ are treated as known in the monthly cross-section, distorting γ̂ₖ (risk premia) and standard errors.

Portfolio grouping averages away idiosyncratic beta noise but reduces power, hides variation visible only at the stock level, and limits how many characteristics can be studied simultaneously.

IV innovation: Use β̂ estimated from even calendar months as regressors and β̂ from odd months as instruments (roles swap in odd months). Measurement errors in the two samples are plausibly independent, mitigating EIV without abandoning individual stocks.

Methodology (US reproduction)

Universe: Sector-balanced sample of US large caps. Full academic replication would use CRSP returns and Compustat characteristics; this lab uses Ken French factors with standard econometric adjustments.

Factors: Daily Ken French MKT, SMB, HML, RMW, CMA, and momentum. Models tested: CAPM, FF3, FF4 (+Mom), FF5.

Stage 1 — Split-sample betas: Rolling 36-month window of daily excess returns. Separate time-series regressions on even-month vs. odd-month days. Dimson (1979): include contemporaneous, one-day lagged, and one-day led factor returns; sum loadings to total β per factor.

Stage 2 — Cross-section: Each month $t$ , regress stock excess returns on lagged β̂ and optional controls: $r_{i, t} = γ_{0, t} + \sum_{k} γ_{k, t} \hat{β}_{i, k, t - 1} + \sum_{l} λ_{l, t} \overset{ˉ}{C}_{i, l, t - 1} + ξ_{i, t}$ . Even months: β̂_EV explanatory, β̂_IV instrument; odd months: roles reverse. Compare OLS and IV (2SLS).

Characteristics: Twelve-month rolling averages of size (log market cap), book-to-market proxy, profitability (ROE), and investment proxy — cross-sectionally z-scored. Many published “factor premia” weaken once these enter as direct controls.

Instrument strength and inference

IV validity requires sufficiently correlated instruments. Report ρ(β̂_EV, β̂_IV) each month; the Nelson–Startz (1990) rule of thumb requires ρ ≫ 1/√N (N = cross-section size) to avoid weak-instrument bias.

While true betas are unobserved, √(ρ(β̂_EV, β̂_IV)) proxies the correlation between estimated and true betas under independent measurement errors across splits.

Risk premia are time-averaged monthly γ̂ₖ with Fama–MacBeth t-statistics (mean over standard error of the monthly series). Compare IV vs. OLS and specifications with vs. without characteristics.

Instrument diagnostics (FF3)

Factor	ρ(β̂_EV, β̂_IV)	√ρ proxy	Nelson–Startz
MKT	0.725	0.851	Pass
SMB	0.555	0.745	Pass
HML	0.781	0.883	Pass

Avg. cross-section N ≈ 56. Weak instruments if ρ ≲ 1/√N.

How to interpret US results

Factor vs. characteristic: If SMB or HML premia are significant in OLS but not after characteristics or under IV, the “priced factor” may reflect size or value characteristics rather than incremental risk compensation.

Persistent alpha: A positive, significant intercept under IV with characteristic controls suggests residual pricing not captured by linear factor models — consistent with both US anomaly literature and emerging-market findings.

OLS vs. IV: With long daily windows OLS and IV may look similar; IV is theoretically preferred when EIV bias is material or samples are shorter. Large shifts under IV indicate beta measurement error mattered for inference.

US vs. Turkey: Developed US markets are more liquid; nonsynchronous trading adjustments matter less but remain standard. Divergence from Turkish results would support market-specific drivers; convergence would suggest universal limits of classical factor models at the stock level.

Literature and reference design

The split-sample IV approach follows Jegadeesh, Noh, Gao, and Ravi (2019), who show that Fama–MacBeth OLS can misstate risk premia when betas are estimated with error, and that IV using independent beta estimates from alternating subsamples yields consistent inference as the cross-section grows.

The motivating emerging-market application (Borsa Istanbul, 1990–2022) documents that textbook factor premia often lose significance once firm characteristics enter the second stage — a pattern this US lab tests on a liquid large-cap panel with the same econometric steps.

Dimson (1979) lag-lead adjustments remain standard when daily returns are used: nonsynchronous trading between stocks and factors can otherwise deflate measured betas and bias premium estimates.

What to look for in the results

Compare OLS vs IV premia factor by factor: large gaps for the market leg suggest beta measurement error materially affects inference; stability across both estimators supports robust pricing.

The characteristic-control tables show whether SMB/HML-style premia survive once size, book-to-market, profitability, and investment enter directly — if premia collapse, the factor may be proxying for characteristics rather than incremental risk.

Instrument diagnostics validate whether split-sample betas are strong enough for IV; factors that fail Nelson–Startz should be interpreted cautiously.

The monthly IV premium path reveals time variation (crisis spikes vs calm periods) that a single full-sample average would hide.

Cross-sectional IV study on 56 US large caps with split-sample betas and Dimson adjustment.
FF3 market premium: OLS 9.1% p.a. vs IV -24.8% (t_OLS=1.86, t_IV=-1.63).
Instrument diagnostics: 3/3 FF3 factors pass Nelson–Startz.
Panel covers 56 US large caps over 2010-01-31–2026-07-31 with split-sample betas (36-month rolling window) and monthly Fama–MacBeth passes.
FF3 market premium: OLS 9.1% p.a. (significant at 10%) vs IV -24.8% (insignificant); IV–OLS gap -33.9 pp — material EIV correction for the market factor.
SMB (FF3): OLS 8.8% (significant at 1%) → IV 37.9% (significant at 1%).
HML (FF3): OLS -6.5% (significant at 5%) → IV -11.3% (significant at 10%).
IV market premium rises from 12.4% (CAPM) to -40.6% (FF5), reflecting how additional factor dimensions absorb cross-sectional structure.
Instrument strength: 3/3 FF3 factors pass Nelson–Startz (ρ ≫ 1/√N). Weakest even–odd β correlation: SMB (ρ≈0.56).
Monthly IV MKT premium: mean -2.07% per month; 94 positive vs 103 negative months — time variation supports regime-aware interpretation rather than a single constant premium.

Reading the interactive dashboard

Premium tables compare OLS and IV average γ̂ₖ (annualized where noted) for CAPM through FF5, with and without characteristic controls. Large gaps between OLS and IV for the same factor suggest EIV bias mattered; gaps between specs with vs. without characteristics suggest the factor may proxy for size, value, or profitability rather than incremental risk.

Monthly paths plot time series of estimated market (and other) premia — useful for spotting crisis spikes (e.g. March 2020) versus calm periods.

Instrument diagnostics report ρ(β̂_EV, β̂_IV) by month; values below the Nelson–Startz threshold warn that IV inference may be unreliable for that date.

Research notes in the dashboard summarize headline findings for drafting and interpretation.

US IV reproduction lab. Research illustration using Ken French factors and a US large-cap equity panel. Not investment advice.Window: 2010-01-31 → 2026-07-31Stocks: 56 · β window: 36mUpdated: 2026-07-18T09:14:20Z

OLS vs. IV factor premia (annualized)

OLS–IV premium comparison

Factor	OLS (%)	IV (%)	IV − OLS	t (OLS)	t (IV)
MKT	9.09	-24.81	-33.90	1.86	-1.63
SMB	8.77	37.91	+29.15	3.03	2.96
HML	-6.54	-11.31	-4.76	-2.00	-1.94

Characteristic control attenuation

Term	OLS (factors only)	OLS (+ chars)	IV (+ chars)
MKT	9.09%t=1.86	—	—
SMB	8.77%t=3.03	—	—
HML	-6.54%t=-2.00	—	—

OLS — betas only

Term	Ann. premium (%)	FM t-stat
const	8.89	1.98
MKT	9.09	1.86
SMB	8.77	3.03
HML	-6.54	-2.00

IV — betas only

Term	Ann. premium (%)	FM t-stat
const	42.96	2.63
MKT	-24.81	-1.63
SMB	37.91	2.96
HML	-11.31	-1.94

No estimates available.

Monthly MKT premium (IV, %)

Errors-in-variables (EIV) bias

In standard two-stage Fama–MacBeth tests, factor betas are estimated with error in the first stage and treated as known in the second-stage cross-section. OLS risk-premium estimates are biased and inconsistent as the number of time periods grows. Portfolio grouping diversifies idiosyncratic noise but sacrifices cross-sectional granularity and may hide stock-level variation.

IV split-sample mechanism

Following Jegadeesh et al. (2019), betas estimated from even-calendar-month daily returns (β̂_EV) use odd-month betas (β̂_IV) as instruments, and roles reverse in odd months. Measurement errors in the two samples are plausibly independent, yielding an estimator that is N-consistent as the number of stocks grows.

Dimson nonsynchronous-trading adjustment

Infrequently traded stocks align poorly with same-day factor returns. Each beta regression includes contemporaneous, one-day lagged, and one-day led factor returns; loadings are summed to a total factor beta.

Factor premia vs. characteristics

Many factors that appear priced in portfolio sorts lose significance once firm characteristics (size, book-to-market, profitability, investment) enter as direct controls. Persistent positive intercepts (alpha) suggest residual pricing not captured by classical linear factor structures.

Instrument strength (Nelson–Startz)

Weak instruments invalidate IV inference. Reported even–odd beta correlations should exceed 1/√N (N = cross-section size). The square root of this correlation proxies correlation between estimated and true betas when errors are independent across splits.

US vs. emerging-market context

Turkish-market evidence shows time-varying dynamics and local factors dominating global benchmarks once characteristics are controlled. US reproduction tests whether factor premia survive IV correction and characteristic controls in a liquid developed market — differences highlight market-specific drivers.

β̂_EV: Factor loadings estimated from even-month daily returns in the rolling window.
β̂_IV: Factor loadings estimated from odd-month daily returns; used as instruments when month t is even.
Fama–MacBeth t: Mean premium divided by standard error of the monthly premium series (×√T).
N-consistent IV: Estimator converges to true risk premia as the number of stocks N → ∞, under standard IV assumptions.
LCAPM: Liquidity-adjusted CAPM with separate betas on market return and market illiquidity (not fully implemented here; noted for extension).

Limitations

Public market prices and issuer metadata are not CRSP/Compustat. Investment and book-to-market proxies are simplified.

LCAPM (liquidity-adjusted CAPM) and q-factor models are noted for extension but not fully estimated here.

Overlapping monthly windows and large cross-sections make t-statistics indicative; bootstrap or Shanken corrections are not applied here.

Research and education only — not investment advice.

Conclusion

Split-sample IV correction can materially shift Fama–MacBeth risk-premium estimates when beta measurement error is large. Characteristic controls often attenuate factor premia that appear significant under OLS alone.

Instrument strength diagnostics should be consulted before drawing inference from IV paths. This lab is research and education only — not investment advice.