6th Position winner solution for the : ICR - Identifying Age-Related Conditions

568. ICR - Identifying Age-Related Conditions | icr-identify-age-related-conditions

开始: 2023-05-11 结束: 2023-08-10 基因组学与生物信息数据算法赛

第6名获奖方案：ICR - 识别年龄相关疾病

第6名获奖方案：ICR - 识别年龄相关疾病

作者：Diego Silva França （专家）

竞赛排名：第6名

得票数：30票

发布时间：2023年8月11日

我的解决方案分为7个主要步骤：
（代码版本10）

使用pandas的interpolate实例进行线性插值填补缺失数据。
使用随机森林分类器通过gini-importance找出数据集中最重要的特征。
使用贝叶斯优化来寻找XGBoost分类器的最优参数。
重复步骤3多次，收集XGBoost分类器的多个最优参数。
使用最优参数构建XGBoost分类器集成。
使用GridSearchCV再次微调XGBoost分类器（因为贝叶斯优化只是参数估计）。
使用投票分类器（每个XGBoost概率的平均值）来分类测试集。

查看完整代码 https://www.kaggle.com/code/diegosilvadefrana/notebooke87ef51e7e/notebook

同比赛其他方案

How on Earth did I win this competetion?

Wow (and our solution)

3rd Place Solution for the "ICR - Identifying Age-Related Conditions" Competition

4rd Place Solution for the "ICR - Identifying Age-Related Conditions"

5. place solution