3.0
@seventyfivepupmstr
7 days ago
“Should have set a very specific end goal and had every model do rework until it achieved the final goal. The time and cost to get to an end goal would be a much more beneficial test.”
+Can attempt complex multi-step coding tasks autonomously−Should be tested with specific end goals and rework cycles to measure true cost-effectiveness