reduction of complexity


  • reduce M*N matrix to K*K matrix using LSI
  • basic idea -- similar documents have similar word frequencies
  • reduce the size of the frequency table -- M*N to K*K, K << M,N