Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The high TTFT (around 5-6 seconds) is what kills the excitement for this for me. Sure, when it starts outputting its crazy fast so it’s good for generating single file prototypes, but as soon as you try to use it in Cline or any other agentic loop you’ll be waiting for API requests constantly and it’s a real bottleneck.


TTFT == time to first token.

(I would've just said, "the throughput is fantastic, but the latency is about 3 times higher than other offerings".)




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: