642. CIBMTR - Equity in post-HCT Survival Predictions | equity-post-HCT-survival-predictions
首先,我和我的队友 @siddhant1104 想要感谢组织者举办这次比赛,并祝贺所有获奖者。决定是专注于提高 LB 分数还是 CV 分数非常困难,因为当 CV 分数提高时,我们的 LB 分数大多会变差。我们的解决方案是两种不同方法的加权集成。
dri_score 和 conditioning_intensity 进行有序编码(使用 optuna 调优)donor_age 和 age_at_hct 特征以提取信息sin_year 和 cos_year)dri_score_mapping = {
"High": 0.6850752146154907,
"High - TED AML case <missing cytogenetics": 0.18589473149703015,
"Intermediate": 0.5683465067841215,
"Intermediate - TED AML case <missing cytogenetics": 0.7708720693082163,
"Low": 0.9586424711654987,
"Missing disease status": 0.6831791561653417,
"N/A - disease not classifiable": 0.7166435651957048,
"N/A - non-malignant indication": 0.8821201547093761,
"N/A - pediatric": 0.49866306284678735,
"TBD cytogenetics": 0.9411056819278409,
"Unknown": 0.1890854786067684,
"Very high": 0.5377767827330516
}
conditioning_intensity_mapping = {
"Unknown": 0.6026915942898587,
"MAC": 0.02153075067313332,
"RIC": 0.8995437792670338,
"NMA": 0.9211477712757186,
"TBD": 0.7388173559148422,
"No drugs reported": 0.22412092165882558,
"N/A, F(pre-TED) not submitted": 0.32745500022610163
}
sin_year 和 cos_year)