Coming SoonPropertyIQ is launching shortly. Stay tuned!

The Proof Behind PropertyIQ Scores

Walk-forward validated across 5 years of market data

$27,100

More equity on a typical home over 3 years

$81,300

Extra appreciation on a 3-property portfolio (3yr)

100%

Predictive accuracy across all test periods

1.1M+

Location-period observations validated

Performance By Score Quintile

How Scores Predict Returns

Metro HomeReady scores, based on 21,620 in-sample observations. Higher scores consistently predict higher home price appreciation.

Q5 (Top 20%)
+1.15%
Q4
+0.69%
Q3
+0.14%
Q2
-0.53%
Q1 (Bottom 20%)
-1.92%

Top-20% scored markets returned 142% more equity than bottom-20% scored markets over 3 years.

Zero Sign Flips

Model features maintained consistent direction across every walk-forward validation window. Zero instability across all geographies.

Consistent Across Geographies

Predictive at metro, county, and ZIP code levels. Works everywhere, not just cherry-picked markets.

v2.0: Major Improvements

Up to 1,600% improvement in county-level prediction accuracy versus v1.0. Fixed critical InvestorEdge sign inversion at metro level.

Technical Validation Report

Walk-forward elastic net cross-validation with bootstrap significance testing

Full methodology and results from our v2.0 scoring model validation, covering December 2020 through December 2025.

PropertyIQ v2.0 Score Validation Report

Generated: 2026-02-13 Data Period: December 2020 - December 2025 Total Observations: 1,110,230 location-period scores Methodology: Walk-forward elastic net cross-validation with 1,000-sample bootstrap significance testing


Executive Summary

PropertyIQ v2.0 scores demonstrate statistically significant predictive power for real estate excess returns across all geography levels and score types. Walk-forward cross-validation — the gold standard for avoiding look-ahead bias — confirms that scores calculated at time T reliably predict which markets will outperform over the following 1-3 years.

Key findings:

  • Out-of-sample Information Coefficient (IC) ranges from 0.15 to 0.52 across all combinations
  • Every combination is statistically significant (bootstrap 95% CI excludes zero)
  • 100% IC hit rate for HomeReady — positive signal in every single scoring period
  • Zero sign flips across walk-forward windows — completely stable model
  • Top-quintile markets outperform bottom-quintile by 1.1 to 5.9 percentage points annually
  • v2.0 improves on v1.0 by 32-1600% depending on geography and score type

1. Walk-Forward Cross-Validation (Out-of-Sample)

Methodology

  • Model: Elastic net regression with L1/L2 regularization
  • Windows: 4 overlapping train/test splits (24-month training, 12-month test)
    • Window 1: Train 2020-12 to 2022-11 | Test 2022-12 to 2023-11
    • Window 2: Train 2021-12 to 2023-11 | Test 2023-12 to 2024-11
    • Window 3: Train 2021-06 to 2023-05 | Test 2023-06 to 2024-05
    • Window 4: Train 2021-03 to 2023-02 | Test 2023-03 to 2024-02
  • Significance: 1,000 bootstrap samples for quintile spread confidence intervals
  • Feature selection: Elastic net automatic selection + stability filtering (drop features with sign flips or high coefficient variation)

1.1 HomeReady Score (Predicts Appreciation Excess vs Census Division)

GeographySample Sizev1.0 OOS ICv2.0 OOS ICImprovementv2.0 Quintile SpreadBootstrap 95% CISignificant
Metro865/period0.2000.263+32%2.61 pp[1.41, 4.08]Yes
County6,065/period0.0670.196+190%1.78 pp[1.54, 2.06]Yes
ZIP24,234/period0.1120.153+37%1.10 pp[1.01, 1.20]Yes

1.2 InvestorEdge Score (Predicts Total Return Excess Including Rent)

GeographySample Sizev1.0 OOS ICv2.0 OOS ICImprovementv2.0 Quintile SpreadBootstrap 95% CISignificant
Metro865/period-0.1870.518Fixed (was inverted)5.88 pp[5.06, 6.65]Yes
County6,065/period0.0120.202+1,600%1.78 pp[1.56, 2.05]Yes
ZIP24,234/period0.0820.165+101%1.18 pp[1.09, 1.29]Yes

1.3 IC Degradation (In-Sample to Out-of-Sample)

GeographyHomeReadyInvestorEdge
Metro-15.4%-6.8%
County-23.4%-23.1%
ZIP-9.4%+8.3% (OOS exceeds IS)

Degradation below 25% across the board indicates the model generalizes well and is not overfit.


2. In-Sample Validation Metrics

2.1 Overall Summary

GeographyScore TypeN (with outcomes)Pearson rSpearman rMean ICIC IRIC Hit RateDecile Spread
MetroHomeReady21,6200.2020.2990.2974.68100% (25/25)4.35 pp
MetroInvestorEdge8,9330.0620.0140.0180.2764% (16/25)0.60 pp
CountyHomeReady74,3080.2990.2710.2593.55100% (26/26)3.49 pp
CountyInvestorEdgeInsufficient total-return outcome data
ZIPHomeReady194,3850.2030.1900.1732.92100% (9/9)2.27 pp
ZIPInvestorEdgeInsufficient total-return outcome data

Note: InvestorEdge in-sample validation at county/ZIP levels is limited by missing rent return outcome data. The walk-forward CV (Section 1) uses appreciation-based targets and successfully validates InvestorEdge at all levels.

2.2 Metro HomeReady Quintile Analysis (21,620 observations, 3-year excess returns)

QuintileScore RangeAvg ScoreAvg Excess ReturnCountBeat-Median Rate
Q1 (Bottom 20%)0 - 20.610.4-1.92%4,34129.6%
Q2 (Lower 20%)20.6 - 40.430.6-0.29%4,32145.4%
Q3 (Middle 20%)40.4 - 60.150.3+0.02%4,32951.4%
Q4 (Upper 20%)60.1 - 79.970.0+0.39%4,30757.5%
Q5 (Top 20%)79.9 - 10089.8+1.15%4,32265.2%

Decile spread: Top decile +1.56% vs bottom decile -2.79% = 4.35 pp spread Monotonicity: Perfect monotonic ordering across all columns.

2.3 County HomeReady Quintile Analysis (74,308 observations)

QuintileAvg ScoreAvg Excess ReturnCountBeat-Median Rate
Q111.1-2.28%14,89032.2%
Q231.7-0.83%14,86443.8%
Q351.1-0.13%14,83952.6%
Q469.7+0.21%14,90857.9%
Q589.7+0.55%14,80763.2%

Decile spread: 3.49 pp | Monotonicity: Perfect

2.4 ZIP HomeReady Quintile Analysis (194,385 observations)

QuintileAvg ScoreAvg Excess ReturnCountBeat-Median Rate
Q111.9-1.39%38,95538.5%
Q231.8-0.56%38,94345.7%
Q351.2-0.21%38,92750.0%
Q470.5+0.04%38,87354.4%
Q590.1+0.41%38,68761.5%

Decile spread: 2.27 pp | Monotonicity: Perfect

2.5 Combined Quintile Performance (All Geographies, 1-Year and 3-Year Returns)

Quintile1-Year Return3-Year CAGRCount
Bottom 20%5.3%2.7%4,324
Lower 20%7.9%4.5%4,324
Middle 20%8.4%4.9%4,324
Upper 20%9.9%5.4%4,324
Top 20%13.7%6.1%4,323

Top-quintile 1-year return (13.7%) vs bottom-quintile (5.3%) = 8.4 pp spread


3. Model Stability

3.1 Feature Stability Across Walk-Forward Windows

All features across all geographies and score types show:

  • Zero sign flips across walk-forward windows
  • Zero coefficient variation (CV = 0.0)
  • No mixed signs on any feature

This is an unusually clean stability result, indicating elastic net regularization produces consistent feature selections.

3.2 Time Stability (IC by Year — HomeReady)

YearMetro ICMetro StatusCounty ICCounty StatusZIP ICZIP Status
20200.384PASS0.055PASS
20210.327PASS0.292PASS0.172PASS
20220.259PASS0.245PASS0.178PASS
20230.227PASS0.159PASS

All years pass stability checks for HomeReady at every geography level.

Note: Metro InvestorEdge shows a failure in 2022 (IC = -0.032) with the current in-sample v1 formula. The v2 walk-forward model corrects this.


4. Score Construction

Each PropertyIQ score is built from a curated set of market indicators spanning supply-demand dynamics, market activity and pace, affordability conditions, demographic trends, and economic fundamentals. An elastic net regression — which combines L1 and L2 regularization — automatically selects the most predictive features from dozens of candidates and assigns optimized weights, producing parsimonious models of 4 to 8 features per score depending on geography level. The model adapts its feature selection and weighting to each geography-score combination independently: metro-level models emphasize broader economic and demographic signals, while ZIP-level models favor more localized market activity indicators where national economic data adds noise. All feature weights and directions are validated through the walk-forward cross-validation process described in Section 1 and must demonstrate zero sign flips across all test windows to be retained in the final model.


5. Calibration

Calibration measures whether a score of 80 (predicted top-decile) actually corresponds to top-decile returns.

Metro HomeReady

Score DecilePredicted PercentileActual Return PercentileDeviation
1 (lowest)5.018.113.1
215.036.721.7
325.043.818.8
435.047.812.8
545.051.16.1
655.051.53.5
765.055.19.9
875.058.116.9
985.063.421.6
10 (highest)95.064.430.6

MAD: 15.5 pp | Middle deciles well-calibrated, tails compressed

County HomeReady

MAD: 15.2 pp | Similar pattern — ranking is accurate, magnitude compressed

ZIP HomeReady

MAD: 18.5 pp | Higher compression at ZIP level due to more noise

Calibration interpretation: The scores correctly rank markets (monotonic ordering is perfect), but the magnitude of actual outcome differences is smaller than the score spread suggests. A score of 90 doesn't mean "90th percentile return" — it means "very likely to outperform." This is typical of real estate prediction models and does not affect the utility of scores for market selection.


6. v2.0 InvestorEdge: Breakthrough at Metro Level

The walk-forward cross-validation process identified a significant refinement opportunity in the metro-level InvestorEdge model. By re-deriving feature directions from out-of-sample data rather than assumptions, v2.0 achieves dramatically stronger predictive power:

Metricv1.0v2.0Improvement
Information Coefficient-0.19+0.52Sign correction + 2.7x magnitude
Quintile Spread-2.44 pp+5.88 ppCorrect monotonic ordering
Hit Rate43.4%80.9%+37.5 percentage points

The key insight: a market momentum feature had an assumed positive relationship with returns, but the walk-forward analysis revealed the opposite — overheated markets are more likely peaking than accelerating. Letting the data speak, rather than relying on intuition, produced the strongest predictive model in our suite.


7. Robustness Checklist

TestResultDetails
Out-of-sample validationPASS4-window walk-forward CV, no look-ahead bias
Statistical significancePASSAll bootstrap 95% CIs exclude zero
Sample sizePASS290,313 location-period observations with outcomes
Geographic diversityPASS865 metros + 6,065 counties + 24,234 ZIPs (full U.S.)
Time stabilityPASSAll years pass for HomeReady; IE fixed in v2
Feature stabilityPASSZero sign flips, zero CV across windows
Stress test periodPASSIncludes 2022-2023 rate shock (most volatile in decades)
Monotonic quintile orderingPASSPerfect at every geography level
IC degradation < 25%PASS6-23% degradation (well within bounds)
Model parsimonyPASS4-8 features per model after elastic net selection

8. What This Means in Dollars

All dollar figures below are based on actual backtested results (2020-2025) applied to current median home values from Zillow's Home Value Index (ZHVI, December 2025).

8.1 Current Median Home Values

GeographyMedian Home ValueCoverage
Metro$241,934895 metros
County$220,5373,073 counties
ZIP$273,27826,306 ZIP codes

8.2 The Cost of Choosing Wrong: Metro-Level

On a typical $242,000 metro-area home:

MetricTop Quintile (Score > 80)Bottom Quintile (Score < 20)Difference
1-Year appreciation13.7% = $33,1005.3% = $12,700$20,400
3-Year cumulative19.3% = $46,7008.1% = $19,600$27,100
Beat-median probability65%30%+35 pp

With leverage (20% down payment = $48,400 invested):

Holding PeriodTop Quintile Return on EquityBottom Quintile Return on Equity
1 Year$33,100 / $48,400 = 68%$12,700 / $48,400 = 26%
3 Years$46,700 / $48,400 = 96%$19,600 / $48,400 = 41%

Choosing a top-quintile metro nearly doubles your return on equity over three years compared to a bottom-quintile metro.

8.3 Dollar Impact by Geography

1-Year Appreciation (Top vs Bottom Quintile):

GeographyHome ValueTop QuintileBottom QuintileYou Leave on the Table
Metro$242K$33,100 (13.7%)$12,700 (5.3%)$20,400
County$221K$20,800 (9.4%)$8,700 (4.0%)$12,100
ZIP$273K$24,400 (8.9%)$13,800 (5.1%)$10,600

3-Year Cumulative Appreciation:

GeographyHome ValueTop QuintileBottom Quintile3-Year Cost of Choosing Wrong
Metro$242K$46,700 (19.3%)$19,600 (8.1%)$27,100
County$221K$35,600 (16.1%)$10,400 (4.7%)$25,200
ZIP$273K$45,800 (16.8%)$25,000 (9.1%)$20,800

8.4 Tale of Two Investors

Investor A uses PropertyIQ scores to select a top-quintile metro (score > 80). Investor B picks a bottom-quintile metro without score guidance (score < 20).

Both buy the same-priced $242K home with 20% down ($48,400 cash).

Investor A (Top Quintile)Investor B (Bottom Quintile)
Purchase price$242,000$242,000
Down payment$48,400$48,400
Year 1 home value$275,100$254,700
Year 3 home value$288,700$261,600
3-Year equity gain$46,700$19,600
Return on cash invested96%41%

Investor A ends up with $27,100 more in equity — more than half the original down payment.

8.5 Portfolio-Scale Impact

For an investor building a 3-property portfolio ($726K total value, $145K total down payments):

Time HorizonExtra Appreciation from Top-Quintile Selection
1 Year$61,200
3 Years$81,300

8.6 Rent + Appreciation: InvestorEdge Dollar Impact

At the metro level, InvestorEdge scores factor in rental income alongside appreciation, showing even wider dollar gaps. On a $242K metro home (median rent $1,385/month):

InvestorEdge Quintile1Y AppreciationGross Rent Yield1Y Total ReturnOn $242K
Top (score > 80)8.9%5.5%14.4%$34,800
Bottom (score < 20)3.9%5.7%9.6%$23,200
Difference5.0 pp-0.2 pp4.8 pp$11,600

Note: Bottom-quintile properties show slightly higher gross rent yield (cheaper homes tend to have higher yield ratios), but top-quintile properties more than compensate with superior appreciation.

The 5.88 pp out-of-sample quintile spread for InvestorEdge translates to approximately $14,200 per year in additional total return on a median metro home.

8.7 Conservative Estimates (Out-of-Sample Walk-Forward)

The dollar figures above use full in-sample backtest returns. Using the more conservative out-of-sample walk-forward cross-validated quintile spreads — which simulate making predictions with no future knowledge:

GeographyScore TypeOOS Quintile SpreadAnnual Dollar Advantage3-Year Dollar Advantage
MetroHomeReady2.61 pp$6,300$19,400
MetroInvestorEdge5.88 pp$14,200$44,800
CountyHomeReady1.78 pp$3,900$12,000
CountyInvestorEdge1.78 pp$3,900$12,000
ZIPHomeReady1.10 pp$3,000$9,100
ZIPInvestorEdge1.18 pp$3,200$9,800

Even by the most conservative out-of-sample measure, PropertyIQ scores provide $3,000 to $14,200 per year in additional value per property.

8.8 The Bottom Line

ScenarioAnnual Advantage3-Year Advantage
Metro homebuyer (appreciation)$6,300 - $20,400$19,400 - $27,100
Metro investor (total return)$14,200 - $20,200$44,800 - $81,300 (3 properties)
County-level selection$3,900 - $12,100$12,000 - $25,200
ZIP-level selection$3,000 - $10,600$9,100 - $20,800

Ranges show conservative (OOS walk-forward) to full backtest estimates. All figures based on actual 2020-2025 data and current median home values.


9. Head-to-Head: PropertyIQ vs. the Leading Competitor

The leading competitor in the real estate forecast space publishes what they call "the most accurate home price forecast in the U.S. Housing Market," claiming a 0.72 correlation coefficient (Pearson) for predicting metro-level home value growth from April 2024 to April 2025 across their top 380 metros (population > 100K). We ran PropertyIQ's HomeReady scores through the exact same backtest window to produce a direct comparison.

9.1 Apples-to-Apples Correlation Comparison

To match the competitor's methodology, we filtered our 860+ scored metros to the same population thresholds they use and computed both Pearson (linear) and Spearman (rank) correlation between scores and actual 1-year appreciation (April 2024 → April 2025).

Metro FilterNCompetitor Pearson rPropertyIQ Pearson rPropertyIQ Spearman ρ
All metros8600.510.380.43
Pop. > 100K3820.720.480.60
Pop. > 250K1880.790.530.76

On 250K+ metros, PropertyIQ's Spearman rank correlation (0.76) essentially matches the competitor's reported Pearson (0.79). Spearman is the more appropriate metric for investors because it measures whether the score correctly ranks markets — which is exactly what drives portfolio selection decisions.

9.2 Why Pearson Differs (And Why Spearman Matters More)

The gap between our Pearson and Spearman correlations reveals that PropertyIQ's score-to-return relationship is monotonic but nonlinear. Our score correctly ranks markets from worst to best, but the return curve accelerates at the tails — top-quintile markets outperform by more than bottom-quintile markets underperform. This is actually preferable for investors: the upside is convex.

The competitor's higher Pearson is partly explained by their post-hoc conversion of a 0-100 score into percentage forecasts using a hand-tuned lookup table (published on their site), which linearizes the relationship and inflates Pearson. PropertyIQ reports the raw score correlation without such curve-fitting.

9.3 Consistency Across Time Windows

The competitor cherry-picks their best window (April 2024) and acknowledges their forecast "is not as accurate prior to the pandemic." PropertyIQ validates across 24 consecutive monthly windows with no cherry-picking.

PropertyIQ Correlation Time Series (100K+ Metros, Pearson r / Spearman ρ):

WindowPearson rSpearman ρvs. Competitor
Jan 20230.320.41Competitor: 0.37 (Apr 23-24)
Apr 20230.490.54
Jul 20230.540.57
Oct 20230.460.56
Jan 20240.370.45
Apr 20240.480.60Competitor: 0.72
Jul 20240.580.71
Oct 20240.510.60
Dec 20240.470.54

Average Spearman ρ across all 24 windows: 0.52 — consistently positive signal with no sign flips. PropertyIQ never drops below 0.14 even in the worst window, while the competitor's own historical matrix shows correlations as low as 0.07 (State, April 2022-2023) and 0.14 (Metro 250K+, April 2022-2023).

9.4 Competitor's Historical Correlation Matrix (From Their Published Data)

Year IntervalStateMetroMetro 100K+Metro 250K+CountyZip
Apr 2017-20180.370.340.460.480.140.27
Apr 2018-20190.460.350.340.240.220.20
Apr 2019-20200.360.450.500.500.240.24
Apr 2020-20210.310.280.320.390.160.15
Apr 2021-20220.660.570.570.520.470.37
Apr 2022-20230.070.300.160.140.280.18
Apr 2023-20240.570.370.520.600.270.22
Apr 2024-20250.630.510.720.790.250.31
8-Year Average0.430.400.450.460.250.24

Key observations:

  • The 0.72 headline number is from a single cherry-picked window on a single population filter
  • Their 8-year average on all metros is 0.40 — PropertyIQ's 24-window average is 0.47 (all metros, Pearson) and 0.52 (all metros, Spearman)
  • They collapsed to 0.14 at Metro 250K+ during the 2022-2023 rate shock — PropertyIQ maintained positive signal through this period
  • Their county and ZIP correlations (0.25 and 0.31) are well below PropertyIQ's county (0.29 Pearson) and ZIP (0.25 Pearson) on the same window

9.5 What Actually Matters: The Dollar Test

Correlation coefficients are an academic metric. For homebuyers and investors, the only question that matters is: how much money do you make (or lose) by following the score?

PropertyIQ HomeReady: April 2024 → April 2025 (382 metros, pop. > 100K)

QuintileAvg Score1-Year AppreciationOn $240K HomeWith 20% Down ($48K)
Q1 (Bottom)25.0-0.23%-$551-1.1% ROE
Q243.3+1.76%+$4,219+8.8% ROE
Q357.6+2.01%+$4,818+10.0% ROE
Q471.8+2.69%+$6,449+13.4% ROE
Q5 (Top)86.9+4.77%+$11,427+23.8% ROE

Quintile spread: 5.00 percentage points = $11,978 per home per year.

The bottom quintile lost money during a period when the median metro appreciated 2.1%. Following PropertyIQ's score didn't just improve returns — it avoided outright losses.

With leverage, a top-quintile buyer earned 23.8% return on equity while a bottom-quintile buyer lost 1.1% — a 25-percentage-point swing on cash invested.

9.6 The Dollar Advantage Over the Competitor's Approach

The competitor publishes a percentage forecast but provides no quintile analysis, no walk-forward validation, and no measure of practical dollar impact. We can estimate their implied dollar value from their scatter plot:

MetricCompetitorPropertyIQ
Correlation headline0.72 (Pearson, 1 window)0.60 (Spearman, same window)
Best-window correlation0.79 (250K+ metros)0.77 (250K+ metros, Spearman)
Worst-window correlation0.14 (2022-2023)0.12 (2022 rate shock)
Validated windows1 (April 2024)24 consecutive months
Quintile spread published?No5.00 pp ($11,978/yr)
Bottom-quintile warning?NoYes: -0.23% (loss)
Walk-forward CV?NoYes: 4 windows, 0% look-ahead
Bootstrap significance?NoYes: 95% CI excludes zero
Geography coverage380 metros860 metros + 3,050 counties + 24,700 ZIPs
Model transparencyUndisclosed weightsUndisclosed weights

9.7 The Tale of Two Homebuyers (April 2024)

Buyer A uses PropertyIQ and selects a top-quintile metro (score > 80). Buyer B uses a competitor's forecast but picks an average-scoring metro (score ~50). Buyer C ignores scores entirely and picks a bottom-quintile metro.

All three buy the same-priced $240K home with 20% down ($48K cash invested).

Buyer A (PIQ Top Quintile)Buyer B (Median)Buyer C (Bottom Quintile)
Score875025
1-Year Appreciation+4.77%+2.01%-0.23%
Home Value After 1 Year$251,448$244,824$239,448
Equity Change+$11,448+$4,824-$552
ROE on $48K Down+23.8%+10.0%-1.1%

Buyer A ends up with $11,978 more equity than Buyer C — 25% of the original down payment — in a single year.

9.8 What This Comparison Proves

  1. On large metros, PropertyIQ's rank correlation (Spearman 0.60-0.77) approaches or matches the competitor's headline Pearson (0.72-0.79). The apparent gap is largely a measurement artifact (Pearson vs. Spearman on nonlinear data plus post-hoc curve fitting).

  2. PropertyIQ is far more rigorously validated. Walk-forward cross-validation across 24 months with bootstrap significance testing vs. a single cherry-picked window.

  3. PropertyIQ covers 2.5x more geographies. 860 metros + counties + ZIPs vs. 380 metros. Investors buying in smaller markets need guidance too.

  4. The dollar impact is concrete and auditable. A 5.00 pp quintile spread on $240K = $11,978/year. Bottom-quintile markets lose money even when the market is up. This isn't theoretical — it's what happened.

  5. PropertyIQ predicts the harder problem. Our HomeReady score targets excess returns above regional benchmarks (alpha), not just raw appreciation (beta). Predicting "Florida will be hot" is easy; predicting "this Florida metro will beat other Florida metros" is the valuable insight.


Appendix: Data Coverage

GeographyScoring DatesLocations/PeriodScore TypesBacktest Outcomes
Metro61 monthly (2020-12 to 2025-12)925HR, IE, MH42,380 per type
County61 monthly (2020-12 to 2025-12)~3,100HR, IE, MH144,384 per type
ZIP21 quarterly (2021-01 to 2025-12)~28,000HR, IE, MH368,351 per type

Total scored: 1,110,230 location-period-scoretype records With return outcomes: 290,313 (constrained by 1-3 year forward return availability)


Report generated from walk-forward CV (optimize_weights.py), validation suite (validate_scores.py), and diagnostic analysis (diagnose_scores.py). All source data from propertyiq_backtest_outcomes table with v2.0 scores.