The hard part is not the chat box. It is tool outputs you can trust, retries that stop, permissions that hold, evals that catch regressions, and cost controls that do not surprise finance.
If the job is an FAQ bot, use a commodity tool. If the agent touches real data, takes real actions, or affects someone's workday, we scope it like production software.