LLM Assisted Automated Item Generation

  • Challenges
    • Difficulty
    • Variety
      • Chan, X., Wang, X., Yu, D., Mi, H., and Yu, D. Scaling synthetic data creation with 1,000,000,000 personas. arXiv preprint arXiv:2406.20094, 2024.
    • Evaluation is saturated — synthesized items can be used to elevate model performance, but may cause overfitting.
  • The current item generations are mostly used in fine-tuning rather than in evaluations.