The real word text is a combination of a high amount of noise and minimal insights.
So we need to apply different text preprocessing techniques before going to build models. If you think we can use a bunch of machine learning preprocessing techniques. Then we are on very very wrong track.
The natural language preprocessing techniques are entirely different from machine learning and deep learning data preprocessing techniques.
The below link explains the most popular 20+ text preprocessing techniques along with implementation in python.
If we miss any popular methods? Do let us know; we would love to know that.