US IV Asset Pricing Lab
Research motivation
Standard two-stage asset pricing tests (Fama–MacBeth) estimate factor betas in a first-stage time-series regression, then use those noisy estimates as regressors in a second-stage cross-section. This errors-in-variables (EIV) problem biases OLS risk-premium estimates even when the time series is long.
Researchers historically formed portfolios to diversify away estimation error, but grouping sacrifices cross-sectional information and can mask stock-level effects. Instrumental variables (IV) methods retain individual stocks as test assets while delivering N-consistent premium estimates as the number of stocks grows (Jegadeesh et al., 2019).
A Turkish-market study (1990–2022) applied this framework and found that classic and modern factors (CAPM, Fama–French, liquidity-adjusted LCAPM) often fail to price risk once firm characteristics are controlled — highlighting time-varying dynamics in emerging markets. This project asks whether similar patterns appear in US equities under the same econometric discipline.
The EIV problem and portfolio trade-off
Definition: EIV bias arises when independent variables (factor betas) are measured with error. In Fama–MacBeth, first-stage β̂ᵢ are treated as known in the monthly cross-section, distorting γ̂ₖ (risk premia) and standard errors.
Portfolio grouping averages away idiosyncratic beta noise but reduces power, hides variation visible only at the stock level, and limits how many characteristics can be studied simultaneously.
IV innovation: Use β̂ estimated from even calendar months as regressors and β̂ from odd months as instruments (roles swap in odd months). Measurement errors in the two samples are plausibly independent, mitigating EIV without abandoning individual stocks.
Methodology (US reproduction)
Universe: Sector-balanced sample of US large caps. Full academic replication would use CRSP returns and Compustat characteristics; this lab uses Ken French factors with standard econometric adjustments.
Factors: Daily Ken French MKT, SMB, HML, RMW, CMA, and momentum. Models tested: CAPM, FF3, FF4 (+Mom), FF5.
Stage 1 — Split-sample betas: Rolling 36-month window of daily excess returns. Separate time-series regressions on even-month vs. odd-month days. Dimson (1979): include contemporaneous, one-day lagged, and one-day led factor returns; sum loadings to total β per factor.
Stage 2 — Cross-section: Each month , regress stock excess returns on lagged β̂ and optional controls: . Even months: β̂_EV explanatory, β̂_IV instrument; odd months: roles reverse. Compare OLS and IV (2SLS).
Characteristics: Twelve-month rolling averages of size (log market cap), book-to-market proxy, profitability (ROE), and investment proxy — cross-sectionally z-scored. Many published “factor premia” weaken once these enter as direct controls.
Instrument strength and inference
IV validity requires sufficiently correlated instruments. Report ρ(β̂_EV, β̂_IV) each month; the Nelson–Startz (1990) rule of thumb requires ρ ≫ 1/√N (N = cross-section size) to avoid weak-instrument bias.
While true betas are unobserved, √(ρ(β̂_EV, β̂_IV)) proxies the correlation between estimated and true betas under independent measurement errors across splits.
Risk premia are time-averaged monthly γ̂ₖ with Fama–MacBeth t-statistics (mean over standard error of the monthly series). Compare IV vs. OLS and specifications with vs. without characteristics.
How to interpret US results
Factor vs. characteristic: If SMB or HML premia are significant in OLS but not after characteristics or under IV, the “priced factor” may reflect size or value characteristics rather than incremental risk compensation.
Persistent alpha: A positive, significant intercept under IV with characteristic controls suggests residual pricing not captured by linear factor models — consistent with both US anomaly literature and emerging-market findings.
OLS vs. IV: With long daily windows OLS and IV may look similar; IV is theoretically preferred when EIV bias is material or samples are shorter. Large shifts under IV indicate beta measurement error mattered for inference.
US vs. Turkey: Developed US markets are more liquid; nonsynchronous trading adjustments matter less but remain standard. Divergence from Turkish results would support market-specific drivers; convergence would suggest universal limits of classical factor models at the stock level.
Literature and reference design
The split-sample IV approach follows Jegadeesh, Noh, Gao, and Ravi (2019), who show that Fama–MacBeth OLS can misstate risk premia when betas are estimated with error, and that IV using independent beta estimates from alternating subsamples yields consistent inference as the cross-section grows.
The motivating emerging-market application (Borsa Istanbul, 1990–2022) documents that textbook factor premia often lose significance once firm characteristics enter the second stage — a pattern this US lab tests on a liquid large-cap panel with the same econometric steps.
Dimson (1979) lag-lead adjustments remain standard when daily returns are used: nonsynchronous trading between stocks and factors can otherwise deflate measured betas and bias premium estimates.
What to look for in the results
Compare OLS vs IV premia factor by factor: large gaps for the market leg suggest beta measurement error materially affects inference; stability across both estimators supports robust pricing.
The characteristic-control tables show whether SMB/HML-style premia survive once size, book-to-market, profitability, and investment enter directly — if premia collapse, the factor may be proxying for characteristics rather than incremental risk.
Instrument diagnostics validate whether split-sample betas are strong enough for IV; factors that fail Nelson–Startz should be interpreted cautiously.
The monthly IV premium path reveals time variation (crisis spikes vs calm periods) that a single full-sample average would hide.
Reading the interactive dashboard
Premium tables compare OLS and IV average γ̂ₖ (annualized where noted) for CAPM through FF5, with and without characteristic controls. Large gaps between OLS and IV for the same factor suggest EIV bias mattered; gaps between specs with vs. without characteristics suggest the factor may proxy for size, value, or profitability rather than incremental risk.
Monthly paths plot time series of estimated market (and other) premia — useful for spotting crisis spikes (e.g. March 2020) versus calm periods.
Instrument diagnostics report ρ(β̂_EV, β̂_IV) by month; values below the Nelson–Startz threshold warn that IV inference may be unreliable for that date.
Research notes below summarize headline findings for drafting and interpretation.
Limitations
Public market prices and issuer metadata are not CRSP/Compustat. Investment and book-to-market proxies are simplified.
LCAPM (liquidity-adjusted CAPM) and q-factor models are noted for extension but not fully estimated here.
Overlapping monthly windows and large cross-sections make t-statistics indicative; bootstrap or Shanken corrections are not applied here.
Research and education only — not investment advice.
Empirical results
Interactive tables and charts below report OLS vs. IV premia by model, specifications with and without characteristics, monthly market-premium paths, instrument diagnostics, and structured notes suitable for drafting a research paper.
Empirical results
Key analytical insights
- Panel covers 61 US large caps over 2010-01-31–2026-05-31 with split-sample betas (36-month rolling window) and monthly Fama–MacBeth passes.
- FF3 market premium: OLS 9.4% p.a. (significant at 5%) vs IV -1.2% (insignificant); IV–OLS gap -10.6 pp — material EIV correction for the market factor.
- SMB (FF3): OLS 9.0% (significant at 1%) → IV 20.0% (significant at 10%).
- HML (FF3): OLS -6.9% (significant at 5%) → IV -6.7% (insignificant).
- With characteristics, FF3 market premium attenuates from 5.4% (OLS, t=1.19) to -13.7% (IV, t=-0.88) — 19.1 pp of the OLS market premium may reflect size/value/profitability exposure rather than pure factor risk.
- SIZE characteristic remains significant at 1% under IV (8.6% p.a.), suggesting a direct size effect beyond SMB/HML/MKT betas alone.
- IV intercept with characteristics: 10.55% p.a. (t=0.59) — no strong evidence of average mispricing after controls.
- IV market premium rises from 13.4% (CAPM) to 19.4% (FF5), reflecting how additional factor dimensions absorb cross-sectional structure.
- Instrument strength: 3/3 FF3 factors pass Nelson–Startz (ρ ≫ 1/√N). Weakest even–odd β correlation: SMB (ρ≈0.55).
- Monthly IV MKT premium: mean -0.10% per month; 103 positive vs 92 negative months — time variation supports regime-aware interpretation rather than a single constant premium.
OLS vs. IV factor premia (annualized)
OLS–IV premium comparison
Annualized average premia and Fama–MacBeth t-stats. Large |Δ| suggests measurement-error bias in OLS.
| Factor | OLS (%) | IV (%) | IV − OLS | t (OLS) | t (IV) |
|---|---|---|---|---|---|
| MKT | 9.37 | -1.22 | -10.59 | 2.01 | -0.11 |
| SMB | 8.98 | 19.96 | +10.98 | 3.23 | 1.86 |
| HML | -6.93 | -6.72 | +0.21 | -2.22 | -1.37 |
Characteristic control attenuation
How premia change when size, book-to-market, profitability, and investment enter the cross-section alongside factor betas.
| Term | OLS (factors only) | OLS (+ chars) | IV (+ chars) |
|---|---|---|---|
| MKT | 9.37%t=2.01 | 5.42%t=1.19 | -13.66% |
| SMB | 8.98%t=3.23 | 11.18%t=3.84 | 28.89% |
| HML | -6.93%t=-2.22 | -3.01%t=-0.98 | 7.87% |
| SIZE | — | 4.44%t=3.36 | 8.65% |
| BM | — | 1.51%t=0.99 | 2.77% |
| OP | — | -64.86%t=-1.28 | -112.24% |
| INV | — | 2.89%t=1.46 | 3.42% |
OLS — betas only
| Term | Ann. premium (%) | FM t-stat |
|---|---|---|
| const | 7.77 | 1.94 |
| MKT | 9.37 | 2.01 |
| SMB | 8.98 | 3.23 |
| HML | -6.93 | -2.22 |
IV — betas only
| Term | Ann. premium (%) | FM t-stat |
|---|---|---|
| const | 17.25 | 1.54 |
| MKT | -1.22 | -0.11 |
| SMB | 19.96 | 1.86 |
| HML | -6.72 | -1.37 |
OLS — with characteristics
| Term | Ann. premium (%) | FM t-stat |
|---|---|---|
| const | 1.96 | 0.24 |
| MKT | 5.42 | 1.19 |
| SMB | 11.18 | 3.84 |
| HML | -3.01 | -0.98 |
| SIZE | 4.44 | 3.36 |
| BM | 1.51 | 0.99 |
| OP | -64.86 | -1.28 |
| INV | 2.89 | 1.46 |
IV — with characteristics
| Term | Ann. premium (%) | FM t-stat |
|---|---|---|
| const | 10.55 | 0.59 |
| MKT | -13.66 | -0.88 |
| SMB | 28.89 | 2.29 |
| HML | 7.87 | 1.23 |
| SIZE | 8.65 | 3.70 |
| BM | 2.77 | 1.19 |
| OP | -112.24 | -1.62 |
| INV | 3.42 | 1.17 |
IV intercept (α) with characteristics: 10.55% p.a. (t = 0.59)
Monthly MKT premium (IV, %)
Instrument diagnostics
| Factor | ρ(β̂_EV, β̂_IV) | √ρ proxy | Nelson–Startz |
|---|---|---|---|
| MKT | 0.744 | 0.863 | Pass |
| SMB | 0.545 | 0.738 | Pass |
| HML | 0.766 | 0.875 | Pass |
Avg. cross-section N ≈ 61. Weak instruments if ρ ≲ 1/√N.
Paper-ready research notes
Errors-in-variables (EIV) bias
In standard two-stage Fama–MacBeth tests, factor betas are estimated with error in the first stage and treated as known in the second-stage cross-section. OLS risk-premium estimates are biased and inconsistent as the number of time periods grows. Portfolio grouping diversifies idiosyncratic noise but sacrifices cross-sectional granularity and may hide stock-level variation.
IV split-sample mechanism
Following Jegadeesh et al. (2019), betas estimated from even-calendar-month daily returns (β̂_EV) use odd-month betas (β̂_IV) as instruments, and roles reverse in odd months. Measurement errors in the two samples are plausibly independent, yielding an estimator that is N-consistent as the number of stocks grows.
Dimson nonsynchronous-trading adjustment
Infrequently traded stocks align poorly with same-day factor returns. Each beta regression includes contemporaneous, one-day lagged, and one-day led factor returns; loadings are summed to a total factor beta.
Factor premia vs. characteristics
Many factors that appear priced in portfolio sorts lose significance once firm characteristics (size, book-to-market, profitability, investment) enter as direct controls. Persistent positive intercepts (alpha) suggest residual pricing not captured by classical linear factor structures.
Instrument strength (Nelson–Startz)
Weak instruments invalidate IV inference. Reported even–odd beta correlations should exceed 1/√N (N = cross-section size). The square root of this correlation proxies correlation between estimated and true betas when errors are independent across splits.
US vs. emerging-market context
Turkish-market evidence shows time-varying dynamics and local factors dominating global benchmarks once characteristics are controlled. US reproduction tests whether factor premia survive IV correction and characteristic controls in a liquid developed market — differences highlight market-specific drivers.
Glossary
- β̂_EV
- Factor loadings estimated from even-month daily returns in the rolling window.
- β̂_IV
- Factor loadings estimated from odd-month daily returns; used as instruments when month t is even.
- Fama–MacBeth t
- Mean premium divided by standard error of the monthly premium series (×√T).
- N-consistent IV
- Estimator converges to true risk premia as the number of stocks N → ∞, under standard IV assumptions.
- LCAPM
- Liquidity-adjusted CAPM with separate betas on market return and market illiquidity (not fully implemented here; noted for extension).