Modern language models are capable of performing a wide range of tasks, from debugging code to booking complex travel. However, despite their capabilities, they often struggle with something seemingly simple: waiting. Ask an agent to monitor your email for a colleague’s response or watch for a price drop over several days, and it will fail.










