GPT-4 Class Models Lead AgentBench at 44% Success Rate | Benchmark Signal | NeoSignal