pythonmachine-learningdata-science

How to achieve regression model without underfitting or overfitting


I have my university project and i'm given a dataset which almost all features have a very weak (only 1 feature has moderate correlation with the target) correlation with the target. It's distribution is not normal too. I already tried to apply simple model linear regression it caused underfitting, then i applied simple random forest regressor but it caused overfitting but when i applied random forest regressor with optimization with randomsearchcv it took time so long. Is there any way to get decent model with not-so-good dataset without underfitting or overfitting? or it's just not possible at all?


Solution

  • Well, to be blunt, if you could fit a model without underfitting or overfitting you would have solved AI completely.

    Some suggestions, though:

    Overfitting on random forests

    Underfitting on linear regression