Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Random projections methods are cheaper alternatives to SVD. For example you can bin contexts with a hash function and count collocations between word and binned contexts the same way this article does. Then apply weighting and SVD if you really want the top n principal components.

What's nice with counting methods is that you can simply add matrices from different collections of documents.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: