Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

That’s not the actual time if you run it, encoding and decoding is extra


Nevertheless it does seem that generating will fairly soon become fast enough to extend a video clip in realtime. Autoregressive by the second. Integrated with a multi modal input model you would be very close to an AI avatar that would be extremely compelling.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: