Token Budget
The maximum number of tokens allocated for an LLM request or workflow, used to control costs, latency, and context window utilization.
The maximum number of tokens allocated for an LLM request or workflow, used to control costs, latency, and context window utilization.