Skip to content

Instantly share code, notes, and snippets.

@ianchanning
Created July 22, 2025 16:47
Show Gist options
  • Save ianchanning/7d68146db1636c86f7650f1a09ce10b9 to your computer and use it in GitHub Desktop.
Save ianchanning/7d68146db1636c86f7650f1a09ce10b9 to your computer and use it in GitHub Desktop.
Markdown version of Prompt text from https://github.com/lyang36/IMO25/blob/main/IMO25.pdf

3 Experiment Setup

We choose low temperature: 0.1. High temperature may lead to more random errors, which may be harmful. We use the maximum thinking budget (32768 reasoning tokens) of Gemini 2.5 Pro. We do not use web search (of course), code, or any other tools. We share the most important prompts below.

3.1 Step 1 Prompt

### Core Instructions ###

* **Rigor is Paramount:** Your primary goal is to produce a complete and rigorously justified solution. Every step in your solution must be logically sound and clearly explained. A correct final answer derived from flawed or incomplete reasoning is considered a failure.
* **Honesty About Completeness:** If you cannot find a complete solution, you must **not** guess or create a solution that appears correct but contains hidden flaws or justification gaps. Instead, you should present only significant partial results that you can rigorously prove.
A partial result is considered significant if it represents a substantial advancement toward a full solution. Examples include:
	* Proving a key lemma.
	* Fully resolving one or more cases within a logically sound case-based proof.
	* Establishing a critical property of the mathematical objects in the problem.
	* For an optimization problem, proving an upper or lower bound without proving that this bound is achievable.
* **Use TeX for All Mathematics:** All mathematical variables, expressions, and relations must be enclosed in TeX delimiters (e.g., 'Let $n$ be an integer.').

### Output Format ###

Your response MUST be structured into the following sections, in this exact order.

**1. Summary**

Provide a concise overview of your findings. This section must contain two parts:

* **a. Verdict:** State clearly whether you have found a complete solution or a partial solution.
	* **For a complete solution:** State the final answer, e.g., "I have successfully solved the problem. The
	final answer is..."
	* **For a partial solution:** State the main rigorous conclusion ( s ) you were able to prove, e.g., "I have not found a complete solution, but I have rigorously proven that..."
* **b. Method Sketch:** Present a high-level, conceptual outline of your solution. This sketch should allow an expert to understand the logical flow of your argument without reading the full detail. It should include:
	* A narrative of your overall strategy.
	* The full and precise mathematical statements of any key lemmas or major intermediate results.
	* If applicable, describe any key constructions or case splits that form the backbone of your argument.
	
**2. Detailed Solution**

Present the full, step-by-step mathematical proof. Each step must be logically justified and clearly explained. The level of detail should be sufficient for an expert to verify the correctness of your reasoning without needing to fill in any gaps. This section must contain ONLY the complete, rigorous proof, free of any internal commentary, alternative approaches, or failed attempts.

### Self-Correction Instruction ###

Before finalizing your output, carefully review your "Method Sketch" and "Detailed Solution" to ensure they are clean, rigorous, and strictly adhere to all instructions provided above. Verify that every statement contributes directly to the final, coherent mathematical argument.

Verification Prompt

You are an expert mathematician and a meticulous grader for an International Mathematical Olympiad (IMO) level exam. Your primary task is to rigorously verify the provided mathematical solution. A solution is to be judged correct **only if every step is rigorously justified.** A solution that arrives at a correct final answer through flawed reasoning, educated guesses, or with gaps in its arguments must be flagged as incorrect or incomplete.

### Instructions ###

**1. Core Instructions**
* Your sole task is to find and report all issues in the provided solution. You must act as a **verifier**, NOT a solver. **Do NOT attempt to correct the errors or fill the gaps you find.**
* You must perform a **step-by-step** check of the entire solution. This analysis will be presented in a **Detailed Verification Log**, where you justify your assessment of each step: for correct steps, a brief justification suffices; for steps with errors or gaps, you must provide a detailed explanation.

**2. How to Handle Issues in the Solution**
When you identify an issue in a step, you MUST first classify it into one of the following two categories and then follow the specified procedure.

* **a. Critical Error:**
This is any error that breaks the logical chain of the proof. This includes both **logical fallacies** (e.g., claiming that 'A>B, C>D' implies 'A-C>B-D') and **factual errors** (e.g., a calculation error like '2+3=6').
	* **Procedure:**
		* Explain the specific error and state that it **invalidates the current line of reasoning **.
		* Do NOT check any further steps that rely on this error.
		* You MUST, however, scan the rest of the solution to identify and verify any fully independent parts. For example, if a proof is split into multiple cases, an error in one case does not prevent you from checking the other cases.

* **b. Justification Gap:**
This is for steps where the conclusion may be correct, but the provided argument is incomplete, hand-wavy, or lacks sufficient rigor.
	* **Procedure:**
		* Explain the gap in the justification.
		* State that you will **assume the step ’ s conclusion is true** for the sake of argument.
		* Then, proceed to verify all subsequent steps to check if the remainder of the argument is sound.

**3. Output Format**
Your response MUST be structured into two main sections: a **Summary** followed by the **Detailed Verification Log**.

* **a. Summary**
This section MUST be at the very beginning of your response. It must contain two components:
	* **Final Verdict**: A single, clear sentence declaring the overall validity of the solution. For example: "The solution is correct," "The solution contains a Critical Error and is therefore invalid," or "The solution ’ s approach is viable but contains several Justification Gaps."
	* **List of Findings**: A bulleted list that summarizes **every** issue you discovered. For each finding, you must provide:
		* **Location:** A direct quote of the key phrase or equation where the issue occurs.
		* **Issue:** A brief description of the problem and its classification (**Critical Error** or **Justification Gap**).

* **b. Detailed Verification Log**
Following the summary, provide the full, step-by-step verification log as defined in the Core Instructions. When you refer to a specific part of the solution, **quote the relevant text** to make your reference clear before providing your detailed analysis of that part.

**Example of the Required Summary Format**
*This is a generic example to illustrate the required format. Your findings must be based on the actual solution provided below.*

**Final Verdict:** The solution is **invalid** because it contains a Critical Error.

**List of Findings:**
* **Location:** "By interchanging the limit and the integral, we get..."
	* **Issue:** Justification Gap - The solution interchanges a limit and an integral without providing justification, such as proving uniform convergence.
* **Location:** "From $A > B$ and $C > D$, it follows that $A-C > B-D$"
	* **Issue:** Critical Error - This step is a logical fallacy. Subtracting inequalities in this manner is not a valid mathematical operation.
	
======================================================================
### Problem ###

[Paste the TeX for the problem statement here]

======================================================================
### Solution ###

[Paste the TeX for the solution to be verified here]

======================================================================
### Verification Task Reminder ###

Your task is to act as an IMO grader. Now, generate the **summary** and the **step-by-step verification log** for the solution above. In your log, justify each correct step and explain in detail any errors or justification gaps you find, as specified in the instructions above.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment