第30名解决方案
第30名解决方案
作者: Felipe Loque | 比赛排名: 第30名
1 - 特征工程
首先,我们使用了以下62个特征。你可以通过它们的名称理解其含义:
- 'Yards_to_touchdown' (达阵码数),
- 'YardLine_ref' (码线参考),
- 'X_defense_spread' (防守X方向散布),
- 'X_defense_std' (防守X方向标准差),
- 'Y_defense_spread' (防守Y方向散布),
- 'Y_defense_std' (防守Y方向标准差),
- 'X_offense_spread' (进攻X方向散布),
- 'X_offense_std' (进攻X方向标准差),
- 'X_offense_centroid' (进攻X方向质心),
- 'X_defense_centroid' (防守X方向质心),
- 'DistToBallCarrier_Offense_Mean' (进攻方距持球者平均距离),
- 'DistToBallCarrier_Defense_Mean' (防守方距持球者平均距离),
- 'DistToBallCarrier_Offense_std' (进攻方距持球者距离标准差),
- 'DistToBallCarrier_Defense_std' (防守方距持球者距离标准差),
- 'DistToBallCarrier0.5_Offense_Mean',
- 'DistToBallCarrier0.5_Defense_Mean',
- 'DistToBallCarrier0.5_Offense_std',
- 'DistToBallCarrier0.5_Defense_std',
- 'Y_offense_spread',
- 'Y_offense_std',
- 'Y_offense_centroid',
- 'Y_defense_centroid',
- 'Seconds_since_quarter' (本节已进行秒数),
- 'Distance_to_down' (距离首攻码数),
- 'DefendersInTheBox' (盒子内防守球员数),
- 'Average_tackle_time' (平均擒抱时间),
- 'Min_tackle_time' (最小擒抱时间),
- 'Season_cat_2017' (赛季分类2017),
- 'Season_cat_2019' (赛季分类2019),
- 'A1',
- 'S1',
- 'X_estimated1',
- 'Y_estimated1',
- 'Dir_cos1',
- 'Dir_sin1',
- 'S1_vs_S12',
- 'A12',
- 'S12',
- 'X_ref12',
- 'Y_ref12',
- 'X_estimated12',
- 'Y_estimated12',
- 'DistToBallCarrier12',
- 'DistToBallCarrier0.5_estimated12',
- 'Dir_cos12',
- 'Dir_sin12',
- 'Min_tackle_time12',
- 'S_horizontal12',
- 'S_vertical12',
- 'A13',
- 'S13',
- 'X_ref13',
- 'Y_ref13',
- 'X_estimated13',
- 'Y_estimated13',
- 'DistToBallCarrier13',
- 'DistToBallCarrier0.5_estimated13',
- 'Dir_cos13',
- 'Dir_sin13',
- 'Min_tackle_time13',
- 'S_horizontal13',
- 'S_vertical13'
提高我们分数的一个关键点是通过以下代码行对数据框进行排序:
train = train.sort_values(by= ["PlayId", "IsOnOffense", "IsBallCarrier", "DistToBallCarrier0.7_estimated"], ascending=[1, 0, 0, 1])
“DistToBallCarrier0.7_estimated”特征是球员在0.7秒