Calendar Effects in US GICS Sector ETFs: An Empirical Seasonality Framework
Abstract
This study examines calendar-based return patterns across eleven US GICS sector ETFs (the SPDR Select Sector suite) relative to the SPY broad-market benchmark over January 2019 through the latest available sample. Using monthly total returns constructed from Yahoo Finance adjusted closes, we estimate unconditional calendar-month and quarterly averages, sector–month heatmaps, cyclical versus defensive dispersion metrics, and exploratory one-sample t-tests for month-level drift. Cyclical industries exhibit higher calendar dispersion (average seasonality standard deviation approximately 2.1% versus 1.6% for defensives); November emerges as the dominant positive calendar month for SPY and most sector ETFs; September ranks among the weakest months. Technology and consumer discretionary sectors show the strongest calendar-profile alignment with SPY (correlations above 0.89), while energy decorrelates most strongly. Statistical tests identify several month effects at the 10% level, though limited annual observations per calendar month constrain inferential power. The framework provides transparent, reproducible sector seasonality diagnostics for research and risk monitoring—not a trading recommendation.
Introduction and Research Context
Calendar anomalies in equity markets have been documented since at least Rozeff and Kinney (1976), with subsequent evidence that seasonal patterns can persist at the industry level across countries (Heston and Sadka, 2008). While aggregate market seasonality has received extensive attention, sector-level heterogeneity remains economically important for tactical asset allocation and risk budgeting.
This research addresses three core questions:
- Do US sector ETFs exhibit stable, month-specific return patterns that differ materially across industries?
- Do macro-sensitive cyclical sectors display greater calendar dispersion than defensive sectors?
- To what extent do sector calendar profiles co-move with SPY, and which months show statistically distinguishable average returns given small-sample constraints?
We adopt a descriptive inference framework—unconditional calendar means, hit rates, and relative performance versus SPY—rather than claiming risk-adjusted alpha or out-of-sample predictability. The sample spans pre- and post-pandemic regimes, Federal Reserve tightening and easing cycles, and the 2022 inflation shock.
Theoretical Foundations: Calendar Effects and Sector Heterogeneity
Several economic channels plausibly generate sector-specific calendar patterns:
- Earnings seasonality — corporate disclosures cluster in January, April, July, and October, creating recurring information shocks that affect growth sectors differently from defensives.
- Institutional flows — pension rebalancing, window dressing, and tax-loss harvesting can produce turn-of-year and November strength in liquid sector ETFs.
- Macro sensitivity — cyclical industries (energy, industrials, financials, materials) may amplify business-cycle-linked calendar effects; defensives (staples, utilities, health care) dampen them.
- Risk appetite cycles — year-end volatility compression and risk-on positioning often favour growth and technology into Q4.
Bouman and Jacobsen (2002) document Halloween-type seasonal regularities internationally; although aggregate US effects have weakened in recent decades, sector-level structure may remain. Formally, if denotes the total return of sector ETF in month , calendar seasonality implies varies with in a pattern measurable over multi-year windows but not necessarily stable enough to forecast without regime risk.
Research Hypotheses
We evaluate four testable propositions:
- H₁ (sector heterogeneity): Calendar-month average returns differ across GICS ETFs; seasonality dispersion varies materially across .
- H₂ (cyclical amplification): .
- H₃ (benchmark co-movement): Growth-oriented sectors (Technology, Consumer Discretionary) show higher correlation between their twelve-month calendar profile and SPY's than idiosyncratic sectors such as Energy.
- H₄ (Q4 strength): Average returns in October–December exceed April–September for growth-oriented cyclicals, consistent with year-end risk appetite.
For month-level inference, the null is for each sector–month pair . Rejection at is exploratory evidence of calendar drift, not risk-adjusted alpha.
Data Sources and Empirical Methodology
The universe comprises eleven SPDR Select Sector ETFs—XLK, XLF, XLE, XLV, XLY, XLP, XLB, XLI, XLU, XLC, XLRE—with SPY as the benchmark. Daily Yahoo Finance adjusted closes aggregate to calendar-month total returns from 2019-01-01 onward (~eight annual cycles per calendar month).
The empirical pipeline implements five analytical layers:
- Layer 1 — unconditional calendar means: and hit rate .
- Layer 2 — relative performance: isolates rotation versus the benchmark.
- Layer 3 — seasonality dispersion: .
- Layer 4 — cyclical/defensive grouping: cyclicals (Technology, Financials, Energy, Consumer Discretionary, Materials, Industrials, Communication, Real Estate) vs defensives (Health Care, Consumer Staples, Utilities).
- Layer 5 — inferential diagnostics: one-sample t-tests at ; calendar-profile correlation with SPY; Q4 (Oct–Dec) vs Q1/Q2 (Apr–Sep) spreads.
Empirical Results: Cross-Sector Calendar Structure
Analysis over the 2019–2026 sample reveals the following patterns:
- Cyclical amplification (H₂): average ~2.1% for cyclicals vs ~1.6% for defensives.
- November dominance: SPY November average ~4.9%; Financials (6.6%), Technology (5.8%), Industrials (5.8%), Materials (5.8%) among sector leaders.
- September weakness: SPY September average ~−2.1%; multiple sectors share September as worst month.
- Benchmark co-movement (H₃): Technology–SPY calendar correlation 0.93; Consumer Discretionary 0.90; Energy 0.48 (most idiosyncratic).
- Q4 strength (H₄): Technology Oct–Dec average ~3.3%; Financials Q1/Q2 minus Q4 spread ~−2.5%.
- Exploratory inference: Utilities July ~4.5% (); November cluster across sectors at —treat as hypothesis-generating given ~eight obs per month.
- Hit rates: SPY 100% positive Julys, 86% positive Novembers, 43% positive Septembers in sample.
Discussion: Interpretation, Limitations, and Practical Considerations
The framework quantifies sector seasonality from liquid ETF proxies, but limitations warrant careful interpretation:
- Unconditional means conflate regimes (pre-pandemic, 2020 crash/recovery, 2022 rate shock); structural breaks can reverse patterns.
- ~eight annual observations per month limits t-test power; 132 sector–month pairs inflate false-discovery risk at .
- ETF returns reflect basket composition and GICS methodology changes, not pure stock-level seasonality.
- No adjustment for market beta or Fama–French factors—positive calendar months may reflect risk-premium concentration.
- Yahoo Finance adjusted closes may differ from CRSP/Bloomberg total-return indices.
Practical advantages remain: full transparency, reproducibility from public data, interpretable metrics (, , ), and comparability with RRG and factor tools on this site. Intended use: research prioritization, risk monitoring, and investment-committee communication—not mechanical calendar trading.
Conclusions and Extensions
US GICS sector ETFs exhibit measurable, heterogeneous calendar structure over 2019–2026:
- Cyclical sectors show greater calendar dispersion than defensives.
- November and September function as recurrent positive and negative poles.
- Growth-oriented sectors track SPY's seasonal rhythm; energy diverges.
- July and November effects appear in exploratory inference, though small samples limit confirmatory strength.
Natural extensions: SARIMA seasonal modelling (as in the NIFTY 50 study), Fama–French residual seasonality, bootstrap CIs for , and Bonferroni/FDR correction across sector–month pairs. Data refreshes via npm run data:sector-seasonality-analysis when Yahoo Finance prices update.
Interactive Empirical Exhibits (below)
The Results section reproduces the full seasonality dataset. Each panel serves a distinct diagnostic role:
- Summary cards — cyclical vs defensive seasonality σ, SPY best calendar month, universe metadata.
- Sector vs SPY table — calendar-profile correlation, Q4 vs Q1/Q2 spreads, best/worst months per sector.
- Heatmap — sector rows × calendar-month columns; colour encodes average monthly return (green positive, red negative).
- Sector drill-down — monthly returns, relative vs SPY, hit rates, quarterly bars, and SPY benchmark pattern for a selected ETF.
- Cyclical vs defensive table — per-sector seasonality σ and return volatility by macro group.
- Statistical tests — exploratory month effects with .
Results
Key empirical findings
- Cyclical sectors show stronger calendar variation (average seasonality dispersion 2.11%) than defensive names (1.57%), consistent with macro-sensitive industries amplifying seasonal swings.
- The strongest unconditional calendar month is Financials in Nov (average return 6.61%); the weakest is Financials in Mar (-3.79%).
- The most significant month effect at the 5% level is Utilities in Jul (average 4.52%, p=0.004).
- Technology maintains positive average Q4 returns (Oct–Dec mean 3.34%), often associated with year-end risk appetite and earnings-season dynamics.
- SPY's strongest calendar month is Nov (average 4.94%) over the study window.
- All figures are descriptive sample averages and do not guarantee future calendar effects after regime shifts.
Formal hypothesis tests
H₁ tests cross-sector return heterogeneity via one-way ANOVA. H₂ and H₃ compare cyclical vs defensive calendar dispersion and SPY correlation. H₄ and H₅ test Q1/Q2 versus Q4 strength for Technology and SPY.
1 of 5 formal hypotheses supported at the 5% level. Calendar effects are exploratory; combine with economic narrative and regime analysis.
| ID | Hypothesis | Null | Test | Comparison | Effect | Stat | p-value | Verdict |
|---|---|---|---|---|---|---|---|---|
| H1 | Sector ETFs exhibit heterogeneous monthly return distributions (industry calendar structure differs). | All sector monthly return means are equal (no cross-sector heterogeneity). | One-way ANOVA | Technology, Financials, Energy, Health Care… | F-statistic (monthly returns) 0.4480 | 0.448 | 0.923 | Not supported |
| H2 | Cyclical sectors show greater calendar dispersion σ^season than defensive sectors. | E[σ^season | cyclical] ≤ E[σ^season | defensive]. | Two-sample t-test (sector-level) | Cyclical σ^season vs Defensive σ^season | Mean σ^season difference 0.0050 | 5.654 | <0.001 | Supported (p < 0.05) |
| H3 | Cyclical sectors align more closely with SPY's monthly calendar than defensives. | Mean calendar correlation with SPY is equal across cyclical and defensive groups. | Two-sample t-test (sector-level) | Cyclical ρ(SPY) vs Defensive ρ(SPY) | Mean correlation difference 0.1370 | 1.213 | 0.154 | Not supported |
| H4 | Technology Q1/Q2 (Apr–Sep) average returns exceed Q4 (Oct–Dec) returns. | Technology Apr–Sep mean ≤ Oct–Dec mean. | Welch t-test (unequal variance) | Technology Apr–Sep vs Technology Oct–Dec | Mean difference (A − B) -0.0060 | -0.364 | 0.641 | Not supported |
| H5 | SPY Q1/Q2 (Apr–Sep) average returns exceed Q4 (Oct–Dec) returns. | SPY Apr–Sep mean ≤ Oct–Dec mean. | Welch t-test (unequal variance) | SPY Apr–Sep vs SPY Oct–Dec | Mean difference (A − B) -0.0110 | -0.932 | 0.822 | Not supported |
Sector seasonality vs SPY (calendar pattern)
Cross-sector comparison of each ETF's twelve-month calendar rhythm relative to the broad-market benchmark.
- Corr w/ SPY — correlation between the sector's and SPY's average monthly return vectors.
- Q4 avg vs Q1/Q2 avg — Oct–Dec mean compared with Apr–Sep mean.
- Q1/Q2 − Q4 — positive values mean stronger spring/summer than autumn/winter on average.
| Sector | ETF | Group | Corr w/ SPY | Q4 avg | Q1/Q2 avg | Q1/Q2 − Q4 | Best | Worst |
|---|---|---|---|---|---|---|---|---|
| Technology | XLK | cyclical | 0.93 | 3.30% | 2.60% | -0.70% | Nov | Sep |
| Real Estate | XLRE | cyclical | 0.90 | 1.60% | 0.70% | -0.90% | Nov | Sep |
| Consumer Discretionary | XLY | cyclical | 0.90 | 2.00% | 1.80% | -0.20% | Nov | Feb |
| Industrials | XLI | cyclical | 0.89 | 2.90% | 1.10% | -1.80% | Nov | Sep |
| Materials | XLB | cyclical | 0.88 | 2.30% | 0.80% | -1.60% | Nov | Sep |
| Financials | XLF | cyclical | 0.88 | 3.50% | 1.00% | -2.50% | Nov | Mar |
| Communication | XLC | cyclical | 0.86 | 1.80% | 1.30% | -0.50% | Nov | Sep |
| Health Care | XLV | defensive | 0.84 | 2.50% | 0.50% | -2.00% | Nov | Sep |
| Consumer Staples | XLP | defensive | 0.76 | 1.70% | 0.50% | -1.20% | Nov | Sep |
| Utilities | XLU | defensive | 0.51 | 1.40% | 0.80% | -0.60% | Jul | Jun |
| Energy | XLE | cyclical | 0.48 | 2.80% | 0.20% | -2.60% | Nov | Sep |
Monthly return heatmap (sector × calendar month)
Unconditional average monthly total return by sector and calendar month, pooled across all years in the sample.
- Rows are GICS sectors; columns are calendar months (Jan–Dec).
- Green cells — historically positive average months; red cells — negative drift.
- Scan rows for sector-specific seasonality; scan columns for market-wide month effects (e.g. November strength, September weakness).
Average monthly total return (2019-01-01–2026-06-30). Cell values in %.
| Sector | Jan | Feb | Mar | Apr | May | Jun | Jul | Aug | Sep | Oct | Nov | Dec |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Technology | 1.1 | -0.6 | 0.1 | 3.8 | 5.3 | 4.1 | 4.2 | 1.0 | -2.5 | 2.8 | 5.8 | 1.3 |
| Financials | 1.4 | 0.1 | -3.8 | 2.2 | 0.7 | 0.3 | 3.5 | 1.1 | -1.7 | 2.6 | 6.6 | 1.4 |
| Energy | 4.3 | 3.3 | 0.5 | 1.9 | -0.3 | 1.1 | 1.1 | -0.8 | -1.8 | 3.3 | 5.1 | 0.1 |
| Health Care | 0.0 | -0.6 | 0.2 | 0.4 | -0.1 | 1.9 | 1.8 | 1.2 | -1.9 | 1.7 | 4.2 | 1.6 |
| Consumer Discretionary | 1.0 | -1.9 | -1.8 | 2.7 | 0.4 | 2.2 | 5.0 | 1.2 | -0.8 | 0.5 | 5.3 | 0.3 |
| Consumer Staples | 0.3 | 0.5 | 0.8 | 2.4 | -1.1 | 0.5 | 2.4 | 1.3 | -2.6 | 0.2 | 3.7 | 1.2 |
| Materials | 0.5 | 1.1 | -0.3 | 2.1 | 0.5 | 0.5 | 3.3 | 0.6 | -2.3 | 0.7 | 5.8 | 0.4 |
| Industrials | 0.7 | 1.8 | -1.8 | 1.5 | 0.9 | 1.9 | 3.7 | 0.7 | -2.1 | 2.4 | 5.8 | 0.5 |
| Utilities | 0.3 | -0.8 | 2.8 | 1.2 | 0.9 | -1.5 | 4.5 | 0.6 | -1.0 | 1.9 | 2.1 | 0.3 |
| Communication | 3.1 | -0.9 | -0.8 | 2.0 | 2.6 | 1.3 | 3.0 | 1.5 | -2.3 | 0.0 | 4.3 | 1.1 |
| Real Estate | 0.4 | -0.2 | -0.5 | 1.7 | 0.0 | 1.1 | 3.9 | 1.0 | -3.5 | -0.4 | 4.2 | 0.9 |
Sector drill-down
Select a sector ETF to inspect its monthly calendar profile, benchmark-relative returns, hit rates, and quarterly averages.
- Avg monthly return — unconditional mean return for each calendar month.
- vs SPY — sector return minus SPY in the same month (rotation signal).
- Hit rate — fraction of sample years with a positive return in that month.
XLK · cyclical · best Nov · worst Sep
Technology — avg monthly return
Technology — vs SPY
Technology — monthly hit rate
Technology — quarterly returns
SPY — benchmark monthly pattern
Cyclical vs defensive seasonality
Sectors grouped by macro sensitivity; seasonality σ measures month-to-month variation in the twelve-month calendar profile.
- Cyclical group — Technology, Financials, Energy, Consumer Discretionary, Materials, Industrials, Communication, Real Estate.
- Defensive group — Health Care, Consumer Staples, Utilities.
- Higher seasonality σ implies a stronger calendar structure (wider month-to-month spread in average returns).
| Sector | Group | Best month | Worst month | Seasonality σ | Return vol |
|---|---|---|---|---|---|
| Technology | cyclical | Nov | Sep | 2.50% | 6.60% |
| Financials | cyclical | Nov | Mar | 2.50% | 5.90% |
| Energy | cyclical | Nov | Sep | 2.00% | 9.40% |
| Health Care | defensive | Nov | Sep | 1.50% | 4.20% |
| Consumer Discretionary | cyclical | Nov | Feb | 2.20% | 6.40% |
| Consumer Staples | defensive | Nov | Sep | 1.60% | 3.90% |
| Materials | cyclical | Nov | Sep | 1.90% | 5.90% |
| Industrials | cyclical | Nov | Sep | 2.10% | 5.70% |
| Utilities | defensive | Jul | Jun | 1.70% | 4.80% |
| Communication | cyclical | Nov | Sep | 1.90% | 5.40% |
| Real Estate | cyclical | Nov | Sep | 1.90% | 5.30% |
Notable calendar-month effects (p < 0.10)
Exploratory one-sample t-tests: is the average return for a given sector–month pair significantly different from zero?
- Threshold p < 0.10 — hypothesis-generating, not confirmatory (~eight observations per month).
- Multiple-comparison risk across 132 sector–month pairs; interpret alongside economic narrative.
| Sector | Month | Avg return | t-stat | p-value |
|---|---|---|---|---|
| Utilities | Jul | 4.50% | 4.48 | 0.004 |
| Consumer Staples | Nov | 3.70% | 3.32 | 0.016 |
| Real Estate | Jul | 3.90% | 3.31 | 0.016 |
| Industrials | Jul | 3.70% | 3.29 | 0.017 |
| Financials | Jul | 3.50% | 3.07 | 0.022 |
| Materials | Nov | 5.80% | 3.07 | 0.022 |
| Consumer Staples | Apr | 2.40% | 2.88 | 0.024 |
| Materials | Jul | 3.30% | 2.99 | 0.024 |
| Technology | Nov | 5.80% | 2.69 | 0.036 |
| Communication | Jul | 3.00% | 2.65 | 0.038 |
| Health Care | Nov | 4.20% | 2.63 | 0.039 |
| Consumer Staples | Jul | 2.40% | 2.60 | 0.041 |
| Financials | Nov | 6.60% | 2.41 | 0.052 |
| Consumer Discretionary | Nov | 5.30% | 2.41 | 0.053 |
| Industrials | Nov | 5.80% | 2.34 | 0.058 |
| Technology | Jul | 4.20% | 2.26 | 0.065 |
| Real Estate | Nov | 4.20% | 2.25 | 0.066 |
| Health Care | Sep | -1.90% | -2.19 | 0.071 |
| Utilities | Oct | 1.90% | 2.11 | 0.080 |
| Consumer Discretionary | Jul | 5.00% | 2.11 | 0.080 |
| Communication | Nov | 4.30% | 2.04 | 0.087 |
| Consumer Staples | Sep | -2.60% | -1.99 | 0.094 |