407. Mechanisms of Action (MoA) Prediction | lish-moa
两个三层神经网络:
a) (BatchNorm -> Dropout -> WeightNorm(Dense) -> ReLU) * 3
b) (BatchNorm -> Dropout -> WeightNorm(Dense) -> LeakyReLU) * 3
带有跳跃连接的双头神经网络,使用非评分数据进行迁移学习训练:
Head1 --> out1 = (BatchNorm -> Dropout -> Dense -> LeakyReLU) * 4
Head2 --> out2 = out1 + (BatchNorm -> Dropout -> Dense -> LeakyReLU) * 4
avg --> Average([out1, out2])
final head --> avg + (BatchNorm -> Dropout -> Reshape -> LSTM) * 1
(BatchNorm -> Dense -> LeakyReLU) * 3
带有跳跃连接的双头神经网络,使用非评分数据进行迁移学习训练:
Head1 --> out1 = (BatchNorm -> Dropout -> Dense -> ReLU) * 4
Head2 --> out2 = out1 + (BatchNorm -> Dropout -> Dense -> ReLU) * 4
avg --> Average([out1, out2])
final head --> avg + (BatchNorm -> Dropout -> Dense) * 1
(BatchNorm -> Dense -> ReLU) * 3
TabNet。
学习率调度器: ReduceLROnPlateau, OneCycleLR
优化器: Adam, AdamW
激活函数: ReLU, LeakyReLU
(所有模型均使用上述不同的特征和参数进行训练,共 16 个模型。)提交所有这些模型的平均值取得了:
CV: 0.01501
Public: 0.01824
Private: 0.01611
通过取所有预测的简单平均值,堆叠所有模型集成的预测结果。
使用原始特征 + 堆叠特征训练模型取得了:
CV: 0.01495
Public: 0.01821
Private: 0.01610
使用不同特征训练的所有模型(16 个模型)与使用堆叠特征训练的模型(10 个模型)的简单平均,总共 26 个模型的集成取得了:
CV: 0.01475
Public: 0.01819
Private: 0.01608
提交时对权重进行了少量混合