Senior software engineer needed to evaluate how large language models (LLMs) interact with real code by analyzing and triaging GitHub issues, setting up and configuring code repositories, evaluating unit test coverage and quality, and modifying and running codebases locally to assess LLM performance in bug-fixing scenarios. Strong experience with programming languages like Python, JavaScript, Java, Go, Rust, C/C++, C#, or Ruby is required, along with proficiency in Git, Docker, and basic software pipeline setup.