While trying to prevent my Random Forest model from overfitting on the training dataset, I looked at the ccp_alpha parameter. I notice that it is possible to tune it with a hyperparameter search method (such as GridSearchCV).
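For reference, this is the kind of search I mean (a minimal sketch; the candidate alphas are picked by hand, and X_train/y_train are assumed to exist):

from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV

# hand-picked candidate alphas; there is no principled way to choose them here
param_grid = {"ccp_alpha": [0.0, 0.001, 0.01, 0.1]}
search = GridSearchCV(RandomForestClassifier(random_state=0), param_grid, cv=5)
search.fit(X_train, y_train)
print(search.best_params_)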

I discovered that there is a Scikit-Learn tutorial on tuning this ccp_alpha parameter for Decision Tree models. The methodology described uses the cost_complexity_pruning_path method of the Decision Tree model, and the tutorial explains well how it works: it seeks a sub-tree of the fitted model that reduces overfitting, using candidate ccp_alpha values determined by cost_complexity_pruning_path.

from sklearn.tree import DecisionTreeClassifier

clf = DecisionTreeClassifier()
# compute the effective alphas and the corresponding total leaf impurities
path = clf.cost_complexity_pruning_path(X_train, y_train)
ccp_alphas, impurities = path.ccp_alphas, path.impurities
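The tutorial then fits one tree per candidate alpha and compares train/test accuracy to pick a value; roughly like this (assuming a held-out X_test/y_test):

# fit one tree per candidate alpha, then compare accuracies
clfs = [DecisionTreeClassifier(ccp_alpha=a).fit(X_train, y_train) for a in ccp_alphas]
train_scores = [clf.score(X_train, y_train) for clf in clfs]
test_scores = [clf.score(X_test, y_test) for clf in clfs]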

However, I wonder why the Random Forest model in Scikit-Learn does not implement this ccp_alpha selection and pruning concept. Would it be possible to do it with a little tinkering, as in the sketch below? It seems more logical to me than searching for a good value with a hyperparameter search (whichever one you use).
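Something like the following is what I have in mind. It is a rough sketch, not a verified recipe: each fitted tree in the forest is a DecisionTreeClassifier, so its pruning path can be computed, but cost_complexity_pruning_path refits a clone of the tree on the data you pass it, not on the bootstrap sample the forest actually used, so the alphas are only approximate. X_val/y_val are assumed to be a held-out validation set.

import numpy as np
from sklearn.ensemble import RandomForestClassifier

forest = RandomForestClassifier(n_estimators=100, random_state=0)
forest.fit(X_train, y_train)

# collect candidate alphas from every tree's pruning path
alphas = np.unique(np.concatenate([
    tree.cost_complexity_pruning_path(X_train, y_train).ccp_alphas
    for tree in forest.estimators_
]))

# evaluate a thinned-out subset of candidates on the validation set
step = max(1, len(alphas) // 10)
for alpha in alphas[::step]:
    candidate = RandomForestClassifier(
        n_estimators=100, ccp_alpha=alpha, random_state=0
    ).fit(X_train, y_train)
    print(alpha, candidate.score(X_val, y_val))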

I’m voting to close this question because it probably belongs on datascience.stackexchange.com – rickhg12hs Nov 3, 2021 at 20:50
