Feature selection is the process of deciding which parts of our documents are worth keeping and which can be ignored. For text, this typically means removing punctuation and stopwords, converting words to lower case, deciding how to handle typos and grammatical features, and deciding whether to apply stemming (reducing words to a common root form).
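
As a rough illustration of these steps, here is a minimal Python sketch using only the standard library. The stopword list, the `crude_stem` helper, and the `preprocess` function are all hypothetical names made up for this example. The stemmer in particular is a naive suffix stripper, not a real algorithm such as Porter's, so treat it as a placeholder for whatever stemmer you would actually use.

```python
import string

# Hypothetical, deliberately tiny stopword list; real projects usually use a larger one.
STOPWORDS = {"a", "an", "and", "is", "of", "the", "to", "in", "on"}


def crude_stem(word):
    """Very rough suffix stripping, for illustration only (not the Porter stemmer)."""
    for suffix in ("ing", "ed", "s"):
        # Only strip when at least three characters would remain.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def preprocess(text, stem=False):
    """Turn raw text into a list of cleaned tokens."""
    # Lower-case so that "The" and "the" count as the same word.
    text = text.lower()
    # Remove ASCII punctuation characters.
    text = text.translate(str.maketrans("", "", string.punctuation))
    # Split on whitespace and drop stopwords.
    tokens = [t for t in text.split() if t not in STOPWORDS]
    # Stemming is optional, matching the choice described above.
    if stem:
        tokens = [crude_stem(t) for t in tokens]
    return tokens
```

For example, `preprocess("The cats are sitting on the mat.")` gives `['cats', 'are', 'sitting', 'mat']`, and passing `stem=True` gives `['cat', 'are', 'sitt', 'mat']`. The odd-looking `sitt` shows why the choice of stemmer (or whether to stem at all) is a real decision rather than a formality.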
