reduction
stop list -- irrelevant words
word stems -- reduce different words to relevant part