Key Responsibilities
- Analyze and triage GitHub issues across trending open-source libraries to identify areas for improvement and contribute to LLM evaluation and training datasets.
- Set up and configure code repositories, including dockerization and environment setup, to ensure seamless development and testing.
- Evaluate test coverage and test quality in public GitHub repositories to confirm they meet a high quality bar and support project goals.
- Develop automation scripts to streamline development environment setup and issue triaging, improving overall efficiency.
- Collaborate with the team to expand dataset coverage across programming languages, difficulty levels, and other dimensions.
Requirements
- Strong proficiency in the C# programming language.
- Experience working with high-quality public GitHub repositories, including GitHub issues and repository management.
- Understanding of software engineering principles, including testing, debugging, and version control.
- Ability to work both independently and collaboratively in a distributed team, with excellent communication and problem-solving skills.
- Familiarity with LLM evaluation, repository validation, and machine learning/AI research is desirable.