Skip to content

Instantly share code, notes, and snippets.

@crystalattice
Created August 30, 2017 21:41
Show Gist options
  • Select an option

  • Save crystalattice/08251eeee3ddaaaf3203cb14b26ce5f5 to your computer and use it in GitHub Desktop.

Select an option

Save crystalattice/08251eeee3ddaaaf3203cb14b26ce5f5 to your computer and use it in GitHub Desktop.
To prevent cheating, a teacher writes three versions of a test. She stacks the three versions together, first all copies of Version A, then all copies of Version B, then all copies of Version C. As students arrive for the exam, each student takes a test. When grading the test, the teacher finds that students who took Version B scored higher than students who took either Version A or Version C. She concludes from this that Version B is easier, and discards it.
Problem: Which test questions are sufficient in meeting the goals of the course, ensuring students have adequately retained the information, while not being too easy or too difficult for students to answer?
Hypothesis: Providing a selection of tests, each with different questions, will determine which questions meet the criteria of ensuring students have adequately retained the material.
Experiment Design: Create a test bank of questions, then randomly print these questions on each test that will be handed out. The questions need to be evenly weighted in their deployment, to ensure fair calculation.
Analysis Plan: When the tests are graded, each question in the bank will be annotated with the pass/failure rate from the entire class.
Benchmarks: Any question that results in 90% or greater passing rate are identified as being too easy. Any questions with 20% or less passing rate are too difficult.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment