Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

This will work very poorly when your data is changing because the centroids degrade and you'll have very poor recall but likely not know it unless you are also monitoring recall.

I didn't see this in the write-up, so adding it here as a common foot gun.



Good call-out. The article mentions using a representative sample, but I should definitely put it somewhere on top.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: