As far as I remember, regression trees are very robust to outliers and skewed distributions in EXPLANATORY variables. For the response/dependent variable, that's a different story... Transformations can make sense.
Issues with log-transforming the response/dependent variable can be avoided by using the "regression random forest" machine learning approach instead of a classical CART analysis.
Yes, I understand the advantages of random forests and boosted regression trees, but a single regression tree is enough in this case. I just found the answer in the De'ath & Fabricius 2000 paper in Ecology: because of nonconstant variance, transforming the response variable is "often desirable".
I have read the De'ath & Fabricius 2000 Ecology paper too, but I have a question: if I transform the response variable, then at the terminal nodes I get the mean of log(response) within each group. Can that be interpreted as the mean of the response variable I would have gotten without the transformation?
Yes, after you back-transform it. Note, though, that in regression analysis a log-transformation induces a retransformation bias that should be corrected: exponentiating the mean of log(y) gives the geometric mean, not the arithmetic mean. But I suppose that regression trees are immune to that.
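To make the back-transformation point concrete, here is a minimal sketch (with made-up, hypothetical leaf values) showing that naively exponentiating the mean of log(y) recovers the geometric mean of the group, which is always at or below the arithmetic mean for skewed data:

```python
import math

# Hypothetical right-skewed response values falling in one terminal node
y = [1.0, 2.0, 3.0, 10.0, 50.0]

# Arithmetic mean on the original scale (what an untransformed tree's leaf stores)
arithmetic_mean = sum(y) / len(y)

# Mean of log(y) (what a tree fitted to the log-transformed response stores)
log_mean = sum(math.log(v) for v in y) / len(y)

# Naive back-transform: exp(mean(log y)) equals the geometric mean of y
naive_backtransform = math.exp(log_mean)

print(arithmetic_mean)      # 13.2
print(naive_backtransform)  # ~4.96, noticeably below the arithmetic mean
```

So the back-transformed leaf value is a biased-low estimate of the group mean on the original scale, which is exactly the bias the correction addresses.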
Bijoy Dey Normally, we don't do that. As for log-transforming the response variable, as far as I know there should be no big difference between fitting on the original and the log scale. Be careful with the metrics, though: to make a fair comparison you need to compute R2 on the original target, not on the after-log target.
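The metric point above can be sketched as follows. The data and predictions here are hypothetical, purely for illustration: a model fitted to log(y) should be scored by back-transforming its predictions and computing R2 on the original scale, since R2 computed on the log scale is not comparable with models fitted to y directly:

```python
import math

def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1 - ss_res / ss_tot

# Hypothetical observations and predictions from a tree fitted to log(y)
y_true = [2.0, 5.0, 12.0, 30.0, 80.0]
log_pred = [0.8, 1.7, 2.4, 3.3, 4.5]   # predictions on the log scale

# R2 on the after-log target (not comparable with models fit on y directly)
r2_log = r2([math.log(v) for v in y_true], log_pred)

# R2 on the original target: back-transform the predictions first, then score
y_pred = [math.exp(p) for p in log_pred]
r2_orig = r2(y_true, y_pred)

print(round(r2_log, 3), round(r2_orig, 3))
```

In this toy example the log-scale R2 comes out higher than the original-scale R2, because the log transform shrinks the large residuals on the big observations; only the original-scale value describes fit to the actual target.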