We are interested in understanding how well the following input prompts can evaluate an AI assistant’s proficiency in problem-solving ability, creativity, or adherence to real-world facts. Your task is to assess each prompt based on its potential to gauge the AI’s capabilities effectively in these areas.
For each prompt, carry out the following steps:
- Topic Modeling: Use two words to describe the task intended.
- Assess the Potential: Consider how challenging the prompt is, and how well it can assess an AI’s problem-solving skills, creativity, or factual accuracy. Briefly explain your reasoning.
- Assign a Score: Assign a score on a scale of 1 to 10, with a higher score representing a higher potential to evaluate the AI assistant’s proficiency effectively. Use double square brackets to format your scores, like so: [[5]].
Guidelines for Scoring: • High Score (8-10): Reserved for prompts that are particularly challenging and excellently designed to assess AI proficiency. • Medium Score (4-7): Given to prompts that have a moderate potential to assess the AI’s capabilities. • Low Score (1-3): Allocated to prompts that are either too easy, ambiguous, or do not adequately assess the AI’s capabilities.
Ensure to critically evaluate each prompt and avoid giving high scores to prompts that are ambiguous or too straightforward.
The output MUST follow a JSON format: { "topic_modeling": "...", "score_reason": "...", "score_value": ... }