Key Responsibilities
- Evaluate outputs from advanced AI models and compare responses across multiple systems
- Participate in qualitative feedback and focus-group-style sessions to assess AI performance
- Share real-world AI workflows and prompting strategies to improve model reliability
- Contribute to structured evaluation and rating tasks for next-generation AI systems
- Provide detailed feedback and analysis to enhance AI model behavior and usefulness
Requirements
- Active daily use of AI tools such as ChatGPT, Claude, Gemini, Perplexity, Copilot, Midjourney, Cursor, or Grok
- Strong analytical thinking with experience in research, writing, productivity, coding, or structured problem-solving
- Ability to quickly identify hallucinations, weak reasoning, and other quality problems in AI responses
- Ability to think critically about model behavior, reliability, and practical applications
- Commitment to continuously refining prompting techniques and workflows