When I built a thyroid disease classification model that achieved 97.6% accuracy, I thought the hard part was done. It wasn't. The real challenge was convincing anyone in healthcare to trust a black-box prediction. SHAP (SHapley Additive exPlanations) fundamentally changed how I approach ML model development, and I believe every data scientist working in high-stakes domains needs to understand why.

The Challenge

For my research project — later published in Springer conference proceedings — I was building a machine learning system to classify thyroid diseases from patient hormone level data. The dataset contained 7,200 patient records with multiple features including TSH levels, T3 and T4 concentrations, age, sex, and various clinical indicators. The goal was to create a diagnostic support tool that could assist endocrinologists in making faster, more consistent diagnoses.

The technical challenge wasn't achieving high accuracy — XGBoost and Random Forest both crossed 95% relatively quickly with proper feature engineering and hyperparameter tuning. The real problem was that healthcare professionals don't trust models they can't understand. A doctor isn't going to change their diagnostic approach based on a system that simply outputs "thyroid disease: yes" with a confidence score. They need to understand why the model is making that prediction, which features are driving the decision, and whether those features align with known clinical indicators.

The Goal

My task evolved from "build an accurate classifier" to "build an accurate classifier that can explain its reasoning in terms clinicians understand." The model needed to not only predict correctly but also provide feature-level explanations for each individual prediction, show which patient attributes contributed most to the diagnosis, and demonstrate that the model's decision logic aligned with established medical knowledge about thyroid disease.

The Approach

Why SHAP Over Other Explainability Methods

I evaluated several explainability approaches before settling on SHAP. Feature importance scores from tree-based models give global rankings but don't explain individual predictions. LIME (Local Interpretable Model-agnostic Explanations) provides local explanations but can be unstable — running it twice on the same instance can produce different explanations. SHAP, grounded in cooperative game theory and Shapley values, provides both consistent local explanations and meaningful global feature importance — with mathematical guarantees of fairness and consistency.
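To make the game-theory grounding concrete, here is a minimal sketch of exact Shapley value computation by brute-force coalition enumeration. The "model" is a made-up two-feature risk score (the numbers are purely illustrative, not from the thyroid dataset); real SHAP implementations avoid this exponential enumeration, but the definition is the same.

```python
from itertools import combinations
from math import factorial

def shapley_values(features, value_fn):
    """Exact Shapley values by enumerating every coalition of features.

    features: list of feature names
    value_fn: maps a set of "present" features to the model's output
    """
    n = len(features)
    phi = {}
    for f in features:
        others = [g for g in features if g != f]
        total = 0.0
        for r in range(n):
            for subset in combinations(others, r):
                s = frozenset(subset)
                # Shapley weight for a coalition of size |s|
                weight = factorial(len(s)) * factorial(n - len(s) - 1) / factorial(n)
                # Marginal contribution of f when added to coalition s
                total += weight * (value_fn(s | {f}) - value_fn(s))
        phi[f] = total
    return phi

# Toy risk score: additive effects for TSH and T4 plus a small interaction.
def risk(present):
    score = 0.0
    if "TSH" in present:
        score += 0.6
    if "T4" in present:
        score += 0.3
    if {"TSH", "T4"} <= present:
        score += 0.1  # interaction term, credited fairly between the two
    return score

phi = shapley_values(["TSH", "T4"], risk)
# Efficiency property: attributions sum to risk(all) - risk(none)
assert abs(sum(phi.values()) - (risk({"TSH", "T4"}) - risk(set()))) < 1e-9
```

The efficiency check at the end is one of the axioms that gives SHAP its consistency guarantee: the per-feature attributions always sum exactly to the gap between the model's output and its baseline.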

Integrating SHAP with XGBoost

XGBoost was chosen as the base model because it consistently outperformed other algorithms on the dataset, and the shap library includes a specialized TreeExplainer optimized for tree-based models that computes exact Shapley values efficiently. After training the XGBoost model with optimized hyperparameters through cross-validated grid search, I generated SHAP values for every prediction in the test set.

The SHAP summary plots revealed that TSH (Thyroid Stimulating Hormone) levels were by far the most influential feature — exactly what endocrinologists would expect. High TSH values pushed predictions strongly toward hypothyroidism, while low values indicated hyperthyroidism. T3 and T4 levels were the second and third most important features, again aligning with clinical knowledge. This alignment between the model's learned patterns and established medical understanding was the key validation that the model was learning real clinical signals rather than spurious correlations.
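The global ranking a summary plot displays is just the mean absolute SHAP value per feature. The sketch below shows that computation on a tiny mock SHAP matrix — the numbers are invented purely to illustrate the arithmetic, though they are shaped to echo the TSH-first pattern described above; real values would come from the fitted explainer.

```python
import numpy as np

# Mock SHAP matrix (rows = patients, cols = features). Values are made up
# for illustration; a real matrix comes from TreeExplainer.shap_values(X).
feature_names = ["TSH", "T3", "T4", "age", "sex"]
shap_values = np.array([
    [ 1.8,  0.4, -0.9,  0.1,  0.0],
    [-2.1, -0.3,  0.7,  0.0,  0.1],
    [ 1.5,  0.5, -0.6, -0.2,  0.0],
])

# Global importance = mean absolute SHAP value per feature -- the same
# ordering shap.summary_plot uses for its beeswarm display.
importance = np.abs(shap_values).mean(axis=0)
order = np.argsort(importance)[::-1]
ranking = [feature_names[i] for i in order]
print(ranking)  # ['TSH', 'T4', 'T3', 'age', 'sex']
```

Calling `shap.summary_plot(shap_values, X, feature_names=feature_names)` on real values draws the full beeswarm, where each point's color encodes the feature's value — which is how the high-TSH-pushes-toward-hypothyroidism pattern becomes visible at a glance.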

Individual Patient Explanations

For each patient prediction, SHAP force plots showed exactly which features pushed the prediction toward or away from a thyroid disease diagnosis. A clinician could look at a specific patient's SHAP explanation and see that the model flagged thyroid disease primarily because of elevated TSH and suppressed T4, rather than relying on less clinically meaningful features like age or sex. This transparency transforms the model from a black box into a tool that augments clinical judgment.
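A force plot is, underneath, simple arithmetic: start at the explainer's base value and add each feature's contribution to reach the model's output for that patient. The sketch below reproduces that arithmetic in text form for a single made-up SHAP row (all numbers are illustrative, not real patient data).

```python
import numpy as np

# One patient's (invented) SHAP row in log-odds space: positive values
# push toward a disease diagnosis, negative values push away from it.
feature_names = ["TSH", "T3", "T4", "age", "sex"]
patient_shap = np.array([2.1, 0.3, -1.2, 0.05, -0.02])
base_value = -0.4  # explainer.expected_value for the cohort

# The same additivity a force plot draws: base value plus contributions
# yields the model's log-odds, which maps to a probability via sigmoid.
logit = base_value + patient_shap.sum()
prob = 1.0 / (1.0 + np.exp(-logit))

# Rank features by absolute contribution for a textual explanation.
for i in np.argsort(np.abs(patient_shap))[::-1]:
    direction = "toward" if patient_shap[i] > 0 else "away from"
    print(f"{feature_names[i]}: {patient_shap[i]:+.2f} ({direction} diagnosis)")
```

The graphical equivalent is `shap.force_plot(base_value, patient_shap, feature_names=feature_names)`, which renders the same push-and-pull as red and blue arrows — the view a clinician would actually read.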

The Impact

The final model achieved 97.6% classification accuracy with complete SHAP-based explainability for every prediction. The research was published in Springer conference proceedings; the explainability component was the key differentiator from prior work in this space. The SHAP analysis confirmed that the model's feature importance rankings matched established clinical knowledge about thyroid disease markers, validating both the model's accuracy and its decision logic.

More importantly for my development as an engineer, SHAP changed my default approach to any ML project. I now consider explainability from the start — not as an afterthought once the model is trained. In domains like healthcare, finance, and criminal justice, a model that's accurate but unexplainable is often less valuable than a slightly less accurate model that can justify its decisions.

Key Takeaways

Accuracy alone isn't sufficient for high-stakes ML applications — explainability determines whether models get adopted in practice. SHAP provides mathematically grounded, consistent explanations at both individual and global levels, making it superior to alternatives like LIME for production use. When SHAP feature importances align with domain expertise, it validates that the model is learning real signals. And building explainability into the ML pipeline from day one is far more effective than trying to retrofit it after training.

Questions about ML explainability? Reach me at [email protected]