The worst part about quantitative trading isn’t having a bad strategy. It’s not knowing whether your strategy is good or bad. A strategy with 30% annualized returns sounds great, until you realize the max drawdown was 60% — you’d never have held through it. A Sharpe ratio of 2.0 looks impressive, but if it’s propped up by a few windfall trades in extreme market conditions, the Sortino ratio will tell a very different story.

Metrics aren’t decorations for backtesting reports. They’re the tools that help you decide whether a strategy is worth putting real money behind.

Return Metrics

Return metrics answer the fundamental question: how much money does this strategy make?

Annualized return is the most basic metric. It converts returns from any time period into a yearly figure for easy comparison. The calculation uses compound returns:

$$R_{annual} = (1 + R_{total})^{252/n} - 1$$

Here $n$ is the number of trading days, and 252 is the standard number of trading days per year. Using compound returns rather than simple division matters — simple division overstates returns for longer periods.

Alpha measures the portion of returns that beats the benchmark. More precisely, alpha is the excess return after stripping out market risk exposure (Beta):

$$\alpha = R_p - [R_f + \beta \times (R_m - R_f)]$$

$R_p$ is the strategy return, $R_f$ is the risk-free rate, $R_m$ is the market return. Positive alpha means the strategy generates genuine edge, not just riding the market.

Beta describes how sensitive the strategy is to market movements. Beta = 1 means the strategy moves in lockstep with the market. Beta = 0.5 means it captures half the market’s movement. Market-neutral strategies aim for beta near zero, while long-only strategies typically run 0.8-1.2. Beta isn’t inherently good or bad — you just need to know what risk you’re taking.

Risk Metrics

Return metrics tell you how much you earned. Risk metrics tell you what you endured to earn it.

Volatility is the most common risk measure, typically the annualized standard deviation of daily returns:

$$\sigma_{annual} = \sigma_{daily} \times \sqrt{252}$$

High volatility isn’t necessarily bad — it depends on your strategy type. But for most investors, annualized volatility above 25% means the account will regularly show drawdowns that are hard to stomach.

Maximum drawdown is the largest peak-to-trough decline. It’s the most intuitive “pain metric,” answering: what’s the worst it gets?

$$MDD = \max_{t} \left(\frac{Peak_t - Trough_t}{Peak_t}\right)$$

In practice, max drawdown under 20% is manageable, 20-40% is high risk, and above 40%, almost nobody runs the strategy live. Not because the strategy is broken, but because human psychology can’t handle it — deep drawdowns lead to panic exits at the worst possible time.

VaR (Value at Risk) answers: at a given confidence level, what’s the most I can lose? For example, “95% VaR = -2%” means there’s a 95% probability that the daily loss won’t exceed 2%. The catch is VaR doesn’t tell you what happens in that other 5%, which is why it’s often paired with CVaR (Conditional VaR, also called Expected Shortfall) — the average loss in scenarios that exceed the VaR threshold.

Risk-Adjusted Return Metrics

Looking at returns or risk alone isn’t enough. Risk-adjusted metrics combine both, answering: how much return per unit of risk?

Sharpe ratio is the classic:

$$Sharpe = \frac{R_p - R_f}{\sigma_p}$$

Numerator is excess return, denominator is volatility. Sharpe above 1.0 is decent, above 2.0 is excellent, above 3.0 is either genius or overfitting. The problem with Sharpe is that it penalizes upside and downside volatility equally — but for investors, upward volatility isn’t risk.

Sortino ratio fixes this by using only downside volatility:

$$Sortino = \frac{R_p - R_f}{\sigma_{downside}}$$

It only penalizes volatility in the loss direction. If a strategy has right-skewed returns (occasional big wins), Sortino will be significantly higher than Sharpe. That’s a good sign — it means the volatility is coming from the profitable side.

Calmar ratio uses maximum drawdown as the denominator:

$$Calmar = \frac{R_{annual}}{|MDD|}$$

Calmar directly relates to the worst pain you’ll experience. A Calmar above 1 means annual returns exceed the max drawdown, which makes the strategy psychologically easier to hold. This metric is particularly valuable for medium to long-term strategy evaluation.

Information ratio is similar to Sharpe, but benchmarked against an index rather than the risk-free rate:

$$IR = \frac{R_p - R_{benchmark}}{\sigma_{tracking}}$$

The denominator is tracking error — the volatility of the difference between strategy and benchmark returns. A high information ratio means the strategy is consistently beating the benchmark, not just getting lucky on a few days. Fund managers are typically evaluated on this metric.

Trading Metrics

The metrics above focus on outcomes. Trading metrics focus on the process.

Win rate is the proportion of profitable trades. Sounds simple, but high win rate doesn’t guarantee profits. A strategy with 90% win rate but where each loss is 10x the average win will lose money overall. Win rate must always be examined alongside the profit/loss ratio.

Profit/loss ratio is the average win divided by the average loss:

$$ProfitLossRatio = \frac{AvgWin}{|AvgLoss|}$$

Trend-following strategies typically have low win rates (30-40%) but high profit/loss ratios (3:1 or higher), relying on a few big wins to cover many small losses. Mean-reversion strategies work the opposite way — high win rate, low profit/loss ratio. Both approaches can be profitable. The trap is trying to optimize for both high win rate and high profit/loss ratio at the same time — that’s almost always overfitting.

The relationship between win rate and profit/loss ratio has a simple breakeven formula:

$$WinRate_{breakeven} = \frac{1}{1 + ProfitLossRatio}$$

A strategy with 2:1 profit/loss ratio only needs a win rate above 33.3% to be profitable. At 1:1, you need above 50%. This formula gives you a quick sanity check on whether a set of trading statistics makes sense.

Turnover measures trading frequency, typically defined as total traded value divided by average portfolio value over a period. High turnover means high transaction costs and greater slippage impact. Many strategies that look great in backtests see their returns collapse once realistic commissions and slippage are factored in. Always run cost sensitivity analysis before going live.

Putting It All Together

No single metric tells the full story. In practice, strategy evaluation uses them in combination.

Sharpe and Sortino reveal risk-adjusted performance. If Sharpe is decent but Sortino is significantly higher, the volatility is mostly on the upside — a good sign.

Maximum drawdown and Calmar expose tail risk. A strategy with Sharpe 2.0 but 50% max drawdown? You probably can’t hold it.

Win rate and profit/loss ratio together reveal the profit model. This determines your psychological state during losing streaks. Ten consecutive losses is normal for a 35% win-rate trend-following strategy. For an 80% win-rate mean-reversion strategy, it might signal the strategy is broken.

Alpha and Beta clarify where returns come from. If Alpha is near zero and Beta near 1, your strategy is essentially just going long on the market. You’d be better off buying an index fund and saving yourself the effort and transaction costs.

No single metric can definitively judge a strategy. But when all the metrics point to the same conclusion, that conclusion is probably right.