SHAP and LIME: ML Model Explainability
An XGBoost model achieves AUC 0.91 on validation. In production, unexpected predictions appear — high scores for clearly irrelevant objects. Feature importance from the boosting itself shows top-10 features, but doesn't explain a specific prediction. Why did this specific object get a score of 0.87?
SHAP and LIME answer different versions of this question. It's important to understand when to apply each method and where they break down.
SHAP: Theory and Practice
SHAP (SHapley Additive exPlanations, Lundberg & Lee, 2017) is based on Shapley values from cooperative game theory. The idea: a feature's contribution to a prediction is its average marginal contribution across all possible feature coalitions.
Key property: additivity. The sum of SHAP values for all features + base value (average model prediction) = the specific prediction. This is a mathematically exact decomposition, not an approximation.
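This additivity can be checked by hand on a toy model. The sketch below (pure Python; the linear model, weights, and baseline are illustrative choices, not the SHAP library's internals) enumerates every coalition for a 3-feature model and verifies that base value + sum of Shapley values reproduces the prediction exactly:

```python
from itertools import combinations
from math import factorial

# Toy linear "model": f(x) = 2*x0 + 3*x1 - 1*x2 (weights are illustrative)
weights = [2.0, 3.0, -1.0]
baseline = [0.5, 0.5, 0.5]   # background means used for "missing" features
x = [1.0, 0.0, 2.0]          # the instance we explain

def f(values):
    return sum(w * v for w, v in zip(weights, values))

def value(coalition):
    # Features in the coalition keep their real value; the rest fall back to baseline
    return f([x[i] if i in coalition else baseline[i] for i in range(len(x))])

def shapley(i, n):
    # Average marginal contribution of feature i over all coalitions of the others
    phi = 0.0
    others = [j for j in range(n) if j != i]
    for size in range(n):
        for S in combinations(others, size):
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            phi += weight * (value(set(S) | {i}) - value(set(S)))
    return phi

n = len(x)
phis = [shapley(i, n) for i in range(n)]
base_value = f(baseline)

# Additivity: base value + sum of Shapley values equals the prediction exactly
assert abs(base_value + sum(phis) - f(x)) < 1e-9
```

For a linear model each Shapley value collapses to weight × (feature value − baseline mean), which is why the decomposition here is exact rather than approximate.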
TreeSHAP — Why Architectural Specialization Matters
For tree-based models (XGBoost, LightGBM, CatBoost, sklearn RandomForest), there's TreeSHAP — an algorithm with polynomial complexity O(TLD²), where T is the number of trees, L is the maximum number of leaves, D is depth. This is orders of magnitude faster than naive KernelSHAP.
```python
import shap
import xgboost as xgb

model = xgb.XGBClassifier()
model.fit(X_train, y_train)

explainer = shap.TreeExplainer(model)
shap_values = explainer.shap_values(X_test)

# Waterfall plot for a specific prediction
shap.plots.waterfall(explainer(X_test)[0])

# Summary plot — global importance
shap.summary_plot(shap_values, X_test)
```
In practice, TreeSHAP on a LightGBM model with 500 trees processes 10,000 examples in 2-3 seconds on CPU. Perfectly acceptable for batch inference.
DeepSHAP and GradientSHAP for Neural Networks
For neural networks, TreeSHAP is inapplicable. We use:
- DeepSHAP (DeepLIFT + SHAP): propagates contributions layer by layer using DeepLIFT's backpropagation rules. Fast, but limited to supported layer types.
- GradientSHAP: averages gradients along paths from randomly sampled baselines to the input (expected gradients). Faster than KernelSHAP but requires a differentiable model.
- KernelSHAP: model-agnostic, works for any black-box model. Slow — an exact explanation of one object requires 2^n - 2 model evaluations over feature coalitions. In practice, sampling is used (nsamples=100-1000).
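The sampling idea behind coalition-based attribution can be illustrated with a simplified Monte-Carlo permutation sketch. Note this is not the library's algorithm — KernelSHAP proper fits a weighted linear regression over sampled coalitions — and the toy model and baseline here are illustrative assumptions:

```python
import random

# Toy non-additive "black box": f(x) = x0*x1 + x2 (illustrative)
def f(v):
    return v[0] * v[1] + v[2]

baseline = [0.0, 0.0, 0.0]   # "missing" features fall back to this reference point
x = [1.0, 2.0, 3.0]          # the instance we explain
n = len(x)

def value(coalition):
    return f([x[i] if i in coalition else baseline[i] for i in range(n)])

random.seed(0)               # fix the seed: sampling-based attribution is stochastic
num_samples = 2000
phi = [0.0] * n
for _ in range(num_samples):
    order = random.sample(range(n), n)   # random feature ordering
    S = set()
    for i in order:
        before = value(S)
        S.add(i)
        phi[i] += value(S) - before      # marginal contribution of i given S
phi = [p / num_samples for p in phi]

# The telescoping sum guarantees attributions add up to f(x) - f(baseline)
assert abs(sum(phi) - (value(set(range(n))) - value(set()))) < 1e-9
```

Per-feature estimates converge to the exact Shapley values as the sample count grows, which is why production setups trade nsamples against latency.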
For BERT and transformers — a separate story. SHAP on transformers via partition explainer works, but latency for explaining one text can reach 30-60 seconds with 512 tokens. For production, a trade-off is usually needed: explanations are generated asynchronously on request.
LIME: Local Approximation
LIME (Local Interpretable Model-agnostic Explanations, Ribeiro et al., 2016) works differently: it generates a cloud of random perturbations around the object, queries the black-box model for a prediction on each, and fits a simple interpretable model (sparse linear regression or a shallow decision tree) to this local dataset, weighting samples by proximity to the original object.
When LIME is better than SHAP:
- Model isn't supported by TreeSHAP and is too slow for KernelSHAP
- Need explanations in terms of "super-pixels" for images (LIME for CV)
- Need text explainability with word highlighting (LIME for NLP)
Stability problem: LIME is a stochastic algorithm. With different random_state values, explanations for the same object can differ significantly. In production, we fix the seed and use a large number of perturbations (num_samples=5000+).
```python
from lime.lime_tabular import LimeTabularExplainer

explainer = LimeTabularExplainer(
    X_train.values,
    feature_names=feature_names,
    class_names=['negative', 'positive'],
    mode='classification',
    random_state=42  # fix the seed: LIME's perturbations are stochastic
)

explanation = explainer.explain_instance(
    X_test.values[0],
    model.predict_proba,
    num_features=10,
    num_samples=5000
)
explanation.show_in_notebook()
Method Comparison
| Characteristic | TreeSHAP | KernelSHAP | LIME |
|---|---|---|---|
| Applicability | Trees only | Any model | Any model |
| Mathematical accuracy | Exact | Exact only with full enumeration; approximation with sampling | Approximation |
| Stability | Deterministic | Stochastic when sampling | Stochastic |
| Speed (10k objects) | Seconds | Hours | Minutes |
| Text/image support | No | No natively | Yes |
Integration into Production ML Pipeline
Explanations are needed not only for auditing — they're part of the operational pipeline.
Real case: client is an insurance company, premium calculation model (LightGBM, 120 features). Requirement: agent must be able to explain to customer over the phone why the premium is high.
Solution: TreeSHAP in inference API. For each prediction, top-3 features with highest SHAP values are returned + automatic text template. "Your premium is above average for the following reasons: vehicle age (impact +12%), registration region (impact +8%), payment history (impact +6%)".
Latency overhead: 35ms for TreeSHAP with average inference of 18ms — acceptable.
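A minimal sketch of the top-3 extraction step. The helper name, feature names, and the mapping of SHAP values to "+12%"-style percentages are illustrative assumptions, not the client's actual code:

```python
# Hypothetical helper: turn one row of per-feature SHAP values into
# the agent-facing message. Treating a SHAP value of 0.12 as "+12%"
# is an assumption for illustration.
def explain_premium(shap_row, feature_names, top_k=3):
    contribs = sorted(zip(feature_names, shap_row),
                      key=lambda t: t[1], reverse=True)
    reasons = [f"{name} (impact {val:+.0%})"
               for name, val in contribs[:top_k] if val > 0]
    return ("Your premium is above average for the following reasons: "
            + ", ".join(reasons))

msg = explain_premium(
    shap_row=[0.12, -0.03, 0.08, 0.06],
    feature_names=["vehicle age", "no-claims history",
                   "registration region", "payment history"],
)
```

Only positive contributions are surfaced, since the message explains why the premium is above average; features pushing the premium down would need a separate template.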
Monitoring: SHAP values are logged to ClickHouse. Once a week we aggregate — drift in SHAP value distribution signals feature drift earlier than AUC drop.
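One way to quantify that drift is a population stability index (PSI) over each feature's SHAP-value distribution between a reference week and the current week. A stdlib sketch — the binning scheme is an assumption, and 0.2 is a common rule-of-thumb threshold rather than anything from the case above:

```python
from math import log

def psi(reference, current, n_bins=10):
    """Population stability index between two samples of one feature's SHAP values."""
    lo, hi = min(reference), max(reference)
    edges = [lo + (hi - lo) * k / n_bins for k in range(1, n_bins)]

    def hist(values):
        counts = [0] * n_bins
        for v in values:
            counts[sum(v > e for e in edges)] += 1  # out-of-range values land in edge bins
        return [max(c / len(values), 1e-6) for c in counts]  # avoid log(0)

    r, c = hist(reference), hist(current)
    return sum((ci - ri) * log(ci / ri) for ri, ci in zip(r, c))

last_week = [i / 100 for i in range(100)]   # illustrative SHAP values for one feature
this_week = [v + 0.5 for v in last_week]    # shifted distribution
# psi(last_week, last_week) is 0.0; psi(last_week, this_week) is well above 0.2
```

Running this per feature on the weekly aggregates gives a single drift score per feature, which is easy to alert on.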
Limitations to Know About
SHAP ≠ causality. High SHAP value for a feature means correlation with the prediction, not causation. "Feature X impacts prediction" ≠ "changing X will change the result in reality".
Multicollinearity breaks interpretation. If two features are strongly correlated (r > 0.8), SHAP splits their joint influence between them in a way determined by the model's internals, not by any causal structure. Run a correlation analysis before interpreting per-feature results.
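A quick pre-interpretation check is to flag highly correlated pairs before reading per-feature SHAP values. A stdlib sketch — the data and feature names are illustrative, and the 0.8 threshold mirrors the rule of thumb above:

```python
def pearson(a, b):
    # Pearson correlation coefficient between two equal-length samples
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    sd_a = sum((x - mean_a) ** 2 for x in a) ** 0.5
    sd_b = sum((y - mean_b) ** 2 for y in b) ** 0.5
    return cov / (sd_a * sd_b)

def correlated_pairs(columns, names, threshold=0.8):
    """Pairs whose |r| exceeds the threshold: interpret their SHAP values jointly."""
    flagged = []
    for i in range(len(columns)):
        for j in range(i + 1, len(columns)):
            r = pearson(columns[i], columns[j])
            if abs(r) > threshold:
                flagged.append((names[i], names[j], r))
    return flagged

cols = [[1, 2, 3, 4], [2, 4, 6, 8], [1, -1, 1, -1]]  # illustrative feature columns
pairs = correlated_pairs(cols, ["age", "mileage", "noise"])
```

For flagged pairs, it is safer to report the sum of their SHAP values as one combined factor than to rank them individually.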
For LLMs, both methods give only rough estimates. Attention weights can be more informative for generation tasks, but they are not a strict proxy for importance either.
Timeline: implementing SHAP/LIME in an existing pipeline — 1-2 weeks. Building a monitoring pipeline with SHAP-based drift detection — 3-4 weeks.