Explain stop words and its impact on model accuracy.
Words like ‘a’ and ‘the’ appear so frequently that they don’t require tagging as thoroughly as nouns, verbs and modifiers. These are known as stop words which are common words like articles, determiners etc. which can be filtered from the text to be processed.
When too much of stop words are included in the final model building, then the model must face low accuracy due to the different proportions of stop words included in different text bodies. Mostly, the inaccuracies occur in text classification when term frequencies are used.
The snippets of implementation of stop words are given below.