632. Eedi - Mining Misconceptions in Mathematics | eedi-mining-misconceptions-in-mathematics
首先,我要感谢主办方组织了如此精彩的比赛。还要感谢我的队友 charmq 和 kami 的合作。
Qwen2.5-72B-Instruct-AWQ 生成数据。Qwen2.5-32B-instruct-AWQ 从问题、答案等生成误解。vLLM 加速处理过程。QuestionId)Qwen2.5-72B-Instruct-AWQ 为训练数据集中未出现的每个误解生成新问题、答案选项和错误答案。数据生成的提示如下,最大 token count 约为 20000。
"""You are an expert in mathematics.
Refer to the examples below to create new problem with given misconception.
Misconception: {MisconceptionText}
The output format shoud be below.
ConstructName:
SubjectName:
Math problem:
Answer A text:
Answer B text:
Answer C text:
Answer D text:
Answer:
Incorrect answer:
The examples are below
Example 1:
ConstructName: {ConstructName_1}
SubjectName: {SubjectName_1}
Math problem: {QuestionText_1}
Answer A text: {AnswerAText_1}
Answer B text: {AnswerBText_1}
Answer C text: {AnswerCText_1}
Answer D text: {AnswerDText_1}
Answer: {CorrectAnswer_1}
Incorrect answer: {IncorrectAnswer_1}
Misconception: {MisconceptionText_1}
...
Example 100:
ConstructName: {ConstructName_100}
SubjectName: {SubjectName_100}
Math problem: {QuestionText_100}
Answer A text: {AnswerAText_100}
Answer B text: {AnswerBText_100}
Answer C text: {AnswerCText_100}
Answer D text: {AnswerDText_100}
Answer: {CorrectAnswer_100}
Incorrect answer: {IncorrectAnswer_100}
Misconception: {MisconceptionText_100}
"""
以下是生成问题的示例。72B 模型似乎具有很高的问题生成能力。
ConstructName: Calculate the circumference of a circle given the radius
SubjectName: Circles
Math problem: If the radius of a circle is \( 7 \) cm, what is the circumference of the circle?
Answer A text: \( 22 \) cm
Answer B text: \( 44 \) cm
Answer C text: \( 14 \) cm
Answer D text: \( 154 \) cm
Answer: B
Incorrect answer: A
Misconception: Thinks circumference is radius x pi
ConstructName: Simplify algebraic fractions by identifying and cancelling common factors
SubjectName: Simplifying Algebraic Fractions
Math problem: Simplify the following algebraic fraction:
\[
\frac{6x^2y}{9xy^2}
\]
Answer A text: \( \frac{2x}{3y} \)
Answer B text: \( \frac{6x}{9y} \)
Answer C text: \( \frac{2xy}{3y^2} \)
Answer D text: \( \frac{6x^2}{9y^2} \)
Answer: A
Incorrect answer: B
Misconception: Cannot identify a common factor when simplifying algebraic fractions
这些示例在数学上是有效的,并且在提示中包含大量示例(100 个案例)对于生成有效问题至关重要。
Qwen2.5-32B-instruct 生成误解。使用了以下提示:"""You are an expert in mathematics.
Refer to the examples below to identify and describe the misconception that led to the incorrect answer.
Example1
ConstructName: Recognise and use efficient methods for mental multiplication
SubjectName: Mental Multiplication and Division
Math problem: Tom and Katie are discussing ways to calculate\\( 21\\times 12\\) mentally. Tom does\\( 12\\times 7\\) and then multiplies his answer by\\( 3\\); Katie does\\( 21\\times 6\\) and then doubles her answer. Who would get the correct answer?
Incorrect answer: Only Katie
Misconception: Does not correctly apply the distributive property of multiplication
Example2
ConstructName: Multiply a decimal by an integer
SubjectName: Mental Multiplication and Division
Math problem:\\( 9.4\\times 50=\\)
Incorrect answer:\\( 4700\\)
Misconception: When multiplying a decimal by an integer, ignores decimal point and just multiplies the digits
ConstructName:{ConstructName}
SubjectName:{SubjectName}Math problem:{QuestionText}
Incorrect answer:{AnswerText}
Misconception:
"""
Qwen2.5-14B-instruct
LoRA_rank: 32LoRA_alpha: 64Qwen2.5-32B-instruct-GPTQ-Int4
vllm 加速嵌入计算。由于无法直接使用,对其实现进行了一些修改以适应我们的用例。这种方法在保持准确性的同时实现了高效的推理。Qwen2.5-32B-Instruct-GPTQ-Int4
Qwen2.5-32B-Instruct-GPTQ-Int4
通过集成使用 72B 模型生成的数据训练的重排序模型(模型 3),私有分数提高了 0.01。