AI Detector is a project developed for the Text Mining course. It uses Random Forest and SVM models to classify a text as either AI-generated or human-written, with the aim of providing reliable and efficient text classification for authenticity verification.
Dataset used: https://www.kaggle.com/datasets/starblasters8/human-vs-llm-text-corpus
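The snippet below is a minimal sketch of how a Random Forest and an SVM could each be trained to label a text as AI-generated or human-written. It is not the project's actual code. The use of scikit-learn, the TF-IDF features, the linear SVM kernel, the helper name `build_models`, and the tiny in-memory corpus are all assumptions made for illustration; the Kaggle dataset above is not loaded here.

```python
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import make_pipeline
from sklearn.svm import SVC

# Hypothetical toy corpus (NOT the Kaggle dataset).
# Label convention assumed here: 1 = AI-generated, 0 = human-written.
texts = [
    "As an AI language model, I can provide a summary of the topic.",
    "In conclusion, it is important to note the following key points.",
    "Overall, this approach offers several notable benefits and insights.",
    "ugh my train was late again, missed the first lecture",
    "honestly didn't expect the soup to turn out that good lol",
    "we got lost twice but the view at the top was worth it",
]
labels = [1, 1, 1, 0, 0, 0]


def build_models(seed=0):
    """Return one TF-IDF + classifier pipeline per model (hypothetical helper)."""
    return {
        "random_forest": make_pipeline(
            TfidfVectorizer(),
            RandomForestClassifier(n_estimators=100, random_state=seed),
        ),
        "svm": make_pipeline(
            TfidfVectorizer(),
            SVC(kernel="linear", random_state=seed),
        ),
    }


# Fit both pipelines on the same toy data.
models = build_models()
for model in models.values():
    model.fit(texts, labels)

# Classify one unseen text with each model.
sample = ["It is important to note that this summary offers key insights."]
predictions = {name: int(model.predict(sample)[0]) for name, model in models.items()}
print(predictions)
```

Wrapping the vectorizer and classifier in one pipeline keeps the feature extraction fitted only on the training texts, so the same transformation is applied when predicting. With a real corpus, a held-out split would be needed to compare the two models fairly.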
[1] Hua, H., & Yao, C.-J. (2024). Investigating generative AI models and detection techniques: Impacts of tokenization and dataset size on identification of AI-generated text. Frontiers in Artificial Intelligence, 7, Article 1469197. https://doi.org/10.3389/frai.2024.1469197. [Accessed: 13-Dec-2024]
[2] Prova, N. N. I. (2024). Detecting AI generated text based on NLP and machine learning approaches. arXiv preprint arXiv:2404.10032. https://arxiv.org/pdf/2404.10032. [Accessed: 13-Dec-2024]
[3] Li, Y., Li, Q., Cui, L., Bi, W., Wang, Z., Wang, L., Yang, L., Shi, S., & Zhang, Y. (2024). MAGE: Machine-generated text detection in the wild. arXiv preprint arXiv:2305.13242. https://arxiv.org/pdf/2305.13242. [Accessed: 13-Dec-2024]