publications
publications by categories in reversed chronological order. generated by jekyll-scholar.
2025
- EMNLPSYNC: A Synthetic Long-Context Understanding Benchmark for Controlled Comparisons of Model Capabilities2025
- LiveOIBench: Can Large Language Models Outperform Human Contestants in Informatics Olympiads?2025