Latent semantic indexing: Cosmos — All that is, or was, or ever will be

cosmos 4th November 2016 at 2:43pm

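Essentially an application of PCA to text data (for example, a term-document count matrix), where we usually skip the pre-processing step (mean normalization of the features). Skipping it keeps the sparse matrix sparse.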
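As a rough sketch of what this means in formulas: the symbols below ($X$, $m$, $n$, $k$, $U_k$, $\Sigma_k$, $V_k$, $d_j$) are notation assumed here for illustration and do not come from the note itself. Take $X$ to be an $m \times n$ term-document matrix and apply a rank-$k$ truncated SVD directly to it, without subtracting the mean first as PCA would.

```latex
% X: m x n term-document matrix (rows = terms, columns = documents), assumed notation
% Rank-k truncated SVD, applied to X itself (no mean-centering):
X \approx U_k \Sigma_k V_k^{\top},
\qquad U_k \in \mathbb{R}^{m \times k},\;
\Sigma_k \in \mathbb{R}^{k \times k},\;
V_k \in \mathbb{R}^{n \times k}

% Reduced k-dimensional representation of document j (e_j = j-th standard basis vector):
d_j = \Sigma_k V_k^{\top} e_j \in \mathbb{R}^{k}
```

PCA would compute the same kind of decomposition on the centered data; here the decomposition is taken of the raw matrix.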

We use it to measure document similarity: take the angle (in practice, usually its cosine) between the vectors representing two documents. A smaller angle means more similar documents.
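A minimal sketch of the angle-based similarity, in plain Python. The function names `cosine_similarity` and `angle_between` and the toy vectors are hypothetical, chosen only for illustration; in a real setting the vectors would be the reduced document representations rather than hand-written lists.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("the angle is undefined for a zero vector")
    return dot / (norm_u * norm_v)


def angle_between(u, v):
    """Angle in radians between two vectors (0 means same direction)."""
    # Clamp to [-1, 1] to guard against floating-point rounding.
    cos = max(-1.0, min(1.0, cosine_similarity(u, v)))
    return math.acos(cos)


# Toy document vectors (made up for illustration only).
doc_a = [1.0, 0.0]
doc_b = [2.0, 0.0]  # same direction as doc_a, different length
doc_c = [0.0, 3.0]  # orthogonal to doc_a

print(cosine_similarity(doc_a, doc_b))
print(angle_between(doc_a, doc_c))
```

Because only the direction matters, `doc_a` and `doc_b` count as maximally similar even though one is twice as long, which is why the angle is a convenient measure for documents of different lengths.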