Discussion about this post

User's avatar
Aleksey Noone's avatar

>It’s worth noting that GLM 4.6-attempted tool calling during the reasoning phase and failed on all our runs.

Is this not an issue with the environment (Kilo) rather than the model? Could it be that Kilo should not respond to tool calls during the model's reasoning phase?

Expand full comment
Dan Cucolea's avatar

Thanks a lot for running these tests, I planned on doing something like this in the near future woth a focus on creating and debugging Cloudflare Workers built with Hono with Queues and R2 integrations.

Expand full comment

No posts

Ready for more?