27 June 2024

In other words, I did not use the trained XGBoost model to make predictions on the test set and then use SHAP for interpretation. The reasons are as follows:

  • Even with the best and most comprehensive data available, we cannot achieve accurate predictions because the patterns are highly variable. (I constructed a 1,048-day dataset at 30-minute time resolution. Input features: soil temperature, soil moisture, air temperature, rain, barometric pressure, and relative humidity; target feature: soil CO2 (ppm).)
  • Cross-validation and test sets are primarily used to ensure the stability and generalizability of the model, while SHAP interpretation is for understanding and analyzing the model's decision-making process. These two aspects are not equivalent.
  • If we treat the test set as an exam, then only performing well on that exam proves the model's reliability and justifies SHAP interpretation. In my context, however, can cross-validation instead be viewed as reviewing and understanding the learned material, rather than just sitting an exam? From this perspective, even when the questions are very difficult or the patterns vary greatly, the process of reviewing and understanding is still valuable.
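The per-fold interpretation workflow described above (fit on each training split, interpret on that fold's held-out data, then aggregate across folds) can be sketched as follows. This is a minimal illustration with synthetic data standing in for the soil dataset; it uses scikit-learn's `GradientBoostingRegressor` as a lightweight stand-in for XGBoost and permutation importance as a stand-in for SHAP values, so the structure of the loop, not the specific explainer, is the point.

```python
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.inspection import permutation_importance
from sklearn.model_selection import KFold

rng = np.random.default_rng(0)

# Hypothetical stand-in for the 1,048-day / 30-minute soil dataset.
features = ["soil_temp", "soil_moisture", "air_temp", "rain", "pressure", "rh"]
n = 500
X = rng.normal(size=(n, len(features)))
# Synthetic target dominated by soil temperature, as a CO2 proxy.
y = 2.0 * X[:, 0] + 0.5 * X[:, 1] + rng.normal(scale=0.1, size=n)

fold_importances = []
for train_idx, val_idx in KFold(n_splits=5, shuffle=True, random_state=0).split(X):
    model = GradientBoostingRegressor(random_state=0).fit(X[train_idx], y[train_idx])
    # Interpret on the held-out fold rather than a separate test set:
    # this mirrors "reviewing the learned material" per fold.
    result = permutation_importance(
        model, X[val_idx], y[val_idx], n_repeats=5, random_state=0
    )
    fold_importances.append(result.importances_mean)

# Aggregate the per-fold attributions into one ranking.
mean_importance = np.mean(fold_importances, axis=0)
for name, imp in sorted(zip(features, mean_importance), key=lambda t: -t[1]):
    print(f"{name}: {imp:.3f}")
```

With real data, replacing the inner interpretation step with `shap.TreeExplainer(model).shap_values(X[val_idx])` and stacking the per-fold SHAP arrays gives the same fold-wise picture of the model's decision-making without relying on a single held-out test set.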