About the Role
You will create reliable, reproducible task environments and testing workflows that ensure target repositories behave as expected. The work centers on containerized testbeds, unit testing, and documentation that enable consistent validation across SWE-Bench and Terminal-Bench workflows.



