I'm working on a Demand Forecasting project, I have a lot of 0 (75% of the database)
I got a highly right skewed target (5.5).
So I decided to log transform my target: target = log(target + 1)
When I train my models (linear regression or LGBM, RandomForest), performance (RMSE here) decrease with log transformation.
If I'm not wrong, Tree based algorithms doesn't care about skewed data. But even, I don't understand how decreasing skewness can decrease performance