Skip to content

Routing Modes

TokenRouter offers four routing modes to optimize for different priorities.

ModeUse CaseCost PriorityQuality PriorityLatency Priority
auto:balanceGeneral purpose40%40%20%
auto:costHigh volume80%10%10%
auto:qualityComplex tasks10%80%10%
auto:latencyReal-time10%10%80%
  • Chat applications
  • General Q&A
  • Content generation
  • Batch processing
  • Simple classification
  • High-volume operations
  • Code generation
  • Research
  • Complex reasoning
  • Real-time chat
  • Interactive apps
  • Quick lookups