673. MAP - Charting Student Math Misunderstandings | map-charting-student-math-misunderstandings
首先,我要衷心感谢组织者和 Kaggle 举办了如此精彩的比赛。在整个比赛过程中,我个人学到了非常多的知识。在接下来的部分中,我将分享我的解决方案以及整个分数提升的过程。
train['candidate_targets'] = train['QuestionId'].map(train.groupby("QuestionId")['target'].apply(set).apply(list))
train['neg_candidate_targets'] = train.apply(lambda x : [y for y in x["candidate_targets"] if x["target"] != y], axis=1)
虽然在线测试集可能会出现不同的场景——例如每个问题可能出现新的误解标签——但我观察到标记为 "Correct" 和 "Neither" 的样本占大多数。因此,我打算让模型主要关注当前可用样本中观察到的模式。
我测试的实验模型主要来自 Qwen 系列。每个模型的实验结果总结如下(基于多折数据):
Qwen3-32B >= Qwen3-Reranker-8B >= QWQ-32B >= Qwen3-14B >= Qwen2.5-Math-7B-Instruct > Qwen2.5-14B-Instruct > Qwen3-8B > Qwen2.5-7B-Instruct
由于 Qwen3-Reranker-8B 和 Qwen2.5-Math-7B-Instruct 表现出色,我决定使用 Qwen3-Reranker-8B 进行迭代优化,同时使用更大的 Qwen2.5-Math-72B-Instruct 作为教师模型进行知识蒸馏。关于教师模型的选择,由于我的 GPU 资源有限,我没有实验许多其他大规模模型。
<|im_start|>system
You are an expert in detecting grade-school level math misconceptions. The task performs 2 steps:
1.Assesses whether the explanation contains a misconception. (Correct, Misconception, or Neither)
2.Identifies the specific misconception present, if any.<|im_end|>
<|im_start|>user
Question: What fraction of the shape is not shaded? Give your answer in its simplest form. [Image: A triangle split into 9 equal smaller triangles. 6 of them are shaded.]
Student's Answer: \( \frac{1}{3} \)
This answer is correct.
Student's Explanation: I think that 1/3 is the answer, as it's the simplest form of 3/9.
Now we judge the student's explanation is Correct, Does this diagnosis is correct? (Yes/No)<|im_end|>
<|im_start|>assistant
<|im_start|>system
You are an expert in detecting grade-school level math misconceptions. The task performs 2 steps:
1.Assesses whether the explanation contains a misconception. (Correct, Misconception, or Neither)
2.Identifies the specific misconception present, if any.<|im_end|>
<|im_start|>user
Question: What fraction of the shape is not shaded? Give your answer in its simplest form. [Image: A triangle split into 9 equal smaller triangles. 6 of them are shaded.]
Student's Answer: \( \frac{1}{3} \)
This answer is correct.
Correct's Explanation Samples:
- Because there are 9 triangles and 3 arent shaded it is 3/9 and they are both multiples of 3 so it's 1/3
- The answer is 3/9 which can be simplified to 1/3
- I think this because there are 9 triangles and 3 are not shaded and 3/9 is 1/3.
- there are 9 triangles shaded and 3 are not shaded=3/9=1/3
- I think this because the answer is three over nine then they can be simplified to one over three.
- There are 9 triangles and 3 is not shaded. 3/9 is a possibility but the question says to put it in its simplest form. 9 divided by 3 is 3 and 3 divided by 3 is one. You cannot simplify it anymore so the answer is 1/3.
--
Neither's Explanation Samples:
- i counted the in-shaded parts then the shaded part then i made them a fraction.
- there are 9 triangles in total and 3 are not shaded. this means that 3 times 9 is 3 and 1 is 9.
- I think this because it is hard of the number
- 1/3. 3/9 shaded, and to find simplest from we divide the top and the bottom by 3.
- Because I counted them and seen the awnser
- I think this is because the fraction of the shape shaded is 3/9 but simplified it is 1/3.
--
Misconception:Incomplete's Explanation Samples:
- there are 9 boxes altogether and 3 of them are not shaded.
- there are 9 triangles andd 3 are not shaded
- Because there is nine triangular spaces and three are unshaded
- There are 9 squares but as you can see theree are 3 not shaded.
- I think this because there are nine triangles and thee are not shaded so it's three over nine.
- i think this because 3 out of nine triangles are not shaded
--
Misconception:WNB's Explanation Samples:
- 6 is the total and 3 is blank
- because 3 arr white and 6 are blue.
- 6 triangles are shaded so 3 aren't shaded
- 3 of the triangles are not shaded and 6 are shaded so therefore it is 3/6
- i counted the white ones and the blue ones.
- because 3 are shaded and 6 are not
--
Student's Explanation: I think that 1/3 is the answer, as it's the simplest form of 3/9.
Now we judge the student's explanation exhibited the misconception of WNB, Does this diagnosis is correct? (Yes/No)<|im_end|>
<|im_start|>assistant
| 模型 | 任务 | 交叉验证 | 公榜 (lb) | 私榜 (pb) | 使用全量 | 推理时间 |
|---|---|---|---|---|---|---|
| prompt_v1 + Qwen3-Reranker-8B + 1 轮次 | CausalLM | 0.9473 | 0.94419 | 0.94270 | 40 分钟 | |
| prompt_v1 + Qwen3-Reranker-8B + 蒸馏 + 1 轮次 | CausalLM | 0.9480 | 0.94674 | 0.94502 | 40 分钟 | |
| prompt_v2 + Qwen3-Reranker-8B + 蒸馏 + 1 轮次 | CausalLM | 0.9493 | 0.94875 | 0.94617 | 4 小时 | |
| prompt_v2 + Qwen3-Reranker-8B + 蒸馏 + 1 轮次 | CausalLM | 0.9493 | 0.95014 | 0.94737 | ✅ | 4 小时 |
| prompt_v2 + Qwen3-Reranker-8B + 蒸馏 + 2 轮次 | CausalLM | 0.9499 | 0.95120 | 0.94873 | ✅ | 4 小时 |
| prompt_v1 + Qwen3-32B-AWQ + 蒸馏 + 1 轮次 | CausalLM | 0.9502 | 0.94955 | 0.94655 | ✅ | 2.5 小时 |
| 最终选定模型 (最佳 Qwen3-Reranker-8B + Qwen3-32B-AWQ) | CausalLM | 0.9518 | 0.95211 | 0.94835 | ✅ | 6.5 小时 |
<|im_start|>system
You are an expert in detecting grade-school level math misconceptions. The task performs 2 steps:
1.Assesses whether the explanation contains a misconception. (Correct, Misconception, or Neither)
2.Identifies the specific misconception present, if any.<|im_end|>
<|im_start|>user
Question: What fraction of the shape is not shaded? Give your answer in its simplest form. [Image: A triangle split into 9 equal smaller triangles. 6 of them are shaded.]
Student's Answer: \( \frac{1}{3} \)
This answer is correct.
Top relevant explanations with their tags:
- Because there are 9 triangles and 3 of them were not shaded (Correct)
- because there is 9 triangles and 3 of them were not shaded (Correct)
- there aree 9 triangles in total and 3 of them are not shaded. this gives you 3 / 9. (Correct)
--
Student's Explanation: I think that 1/3 is the answer, as it's the simplest form of 3/9.
Now we judge the student's explanation is Correct, Does this diagnosis is correct? (Yes/No)<|im_end|>
<|im_start|>assistant
生成更多解释用于数据增强