Back to changelog

Apr 1, 2025

Code Interpreter — Run Python in Chat

Code Interpreter — Run Python in Chat

Orange Flower
Orange Flower

Smart intent routing

This month we introduced smart intent routing and an updated policy wizard. The router now inspects each prompt’s intent, automatically steering Q&A traffic to fast models while sending creative tasks to richer LLMs. A new preview pane shows token cost and latency projections for every rule before you hit Save, so you can tune policies with confidence.

Patch Notes

  • Earlier in the year we focused on stability, observability, and fine‑grained control. See the highlights below.

  • Added intent classifier for real‑time model selection

  • New bulk‑edit mode for adjusting weights across many routes

  • Latency budget slider to cap 95th‑percentile response times

  • Per‑model retry limits to prevent runaway loops

  • CSV export of routing logs with token counts and costs

  • Alerting webhook for failover events and degraded endpoints

  • UI fix: dropdown no longer resets on browser back‑navigation

  • Security patch for rare race condition in auth middleware