kaggle Open Problems – Single-Cell Perturbations 1st & 2nd place solution summary

发布时间 2023-12-23 00:00:39作者: ZephyrYin

Leaderboard: https://www.kaggle.com/competitions/open-problems-single-cell-perturbations/leaderboard

2nd

Solution: https://www.kaggle.com/competitions/open-problems-single-cell-perturbations/discussion/458738
Code: https://github.com/Eliorkalfon/single_cell_pb

  1. 使用transformer
  2. target encoding
  3. kmeans改变分布,获得balanced distribution。或者说利用kmeans进行类别均衡

1st

Post: https://www.kaggle.com/competitions/open-problems-single-cell-perturbations/discussion/459258
Code: https://github.com/Jean-KOUAGOU/1st-place-solution-single-cell-pbs

主要是2点:

feature augmentation

  1. 加入cell type text embedding
  2. 加入SMILES的 text embedding
  3. 老生常谈:target encode
  4. 仍然使用one hot
  5. 全部concate起来

ensamble learning

  1. 各个输入模型k-fold后加权平均
  2. 3中不同程度的输入feature,
  3. 寻找3种不同的model arch(最后取其中的1种)
  4. 加权平均(集成学习模型)