Benchmark LLM reasoning using esoteric programming languages.
EsoLang-Bench evaluates the reasoning capabilities of Large Language Models (LLMs) by challenging them with tasks written in esoteric programming languages. It provides a framework to test if LLMs truly understand code or simply recognize patterns.
Unknown