A computer implemented method includes receiving text data, detecting auto-generated text in the received text data to identify tags in the received text to distinguish between the auto-generated text and user generated text, and providing the tagged text data to a machine learning language model.