Inference-Time-Decontamination

June 17, 2024

Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model Evaluation