شناسایی حساب‌های کاربری جعلی با استفاده از یادگیری ماشین: مطالعه‌ای روی شبکۀ اجتماعی X (توییتر سابق)

رجبیان, محمد; حسینی, منیره

doi:10.22034/aimj.2025.489949.1616

شناسایی حساب‌های کاربری جعلی با استفاده از یادگیری ماشین: مطالعه‌ای روی شبکۀ اجتماعی X (توییتر سابق)

نوع مقاله : مقاله پژوهشی

نویسندگان

محمد رجبیان ¹

منیره حسینی ²

¹ کارشناس ارشد، گروه فناوری اطلاعات، دانشکده صنایع، دانشگاه خواجه نصیرالدین طوسی، تهران، ایران

² دانشیار، گروه فناوری اطلاعات، دانشکده صنایع، دانشگاه خواجه نصیرالدین طوسی، تهران، ایران

10.22034/aimj.2025.489949.1616

چکیده

محبوبیت شبکه‌های اجتماعی آنلاین (OSNs) هر روز افزایش می‌یابد و تعداد کاربران و میزان استفاده از این شبکه‌های اجتماعی، روزبه‌روز بیشتر می‌شود. این موضوع باعث شده است که سودجویانی از این فضا سوءاستفاده کنند و به اقدامات مخرب و حتی غیرقانونی نظیر، کلاه‌برداری، انتشار اخبار جعلی، جعل هویت و فریب کاربران بپردازند. اغلب این افراد از حساب‌های کاربری جعلی، برای اقدامات مخربانه خود استفاده می‌کنند. در بیشتر پژوهش‌های گذشته، پژوهشگران ابتدا با استفاده از روش‌های کاهش ویژگی، انتخاب ویژگی یا استخراج ویژگی، تعداد ویژگی‌‌های مجموعه داده را کاهش می‌دادند؛ سپس با استفاده از روش‌های ترکیبی یادگیری ماشین، به ارائه روشی جهت افزایش صحت در شناسایی حساب‌های کاربری جعلی اقدام می‌کردند. در پژوهش جاری، 67 ویژگی برای شناسایی حساب‌های کاربری جعلی انتخاب شده‌اند که مقادیر عددی دارند و این امر باعث می‌شود که نیاز به نرمال‌سازی و تحلیل متن وجود نداشته باشد و مدل آموزش‌دیده توسط یادگیری ماشین، به زبان خاصی وابسته نباشد و بتواند واقعی و جعلی بودن حساب‌های کاربری‌ای را که با زبان‌های مختلف ایجاد شده‌اند، شناسایی کند. همچنین روشی ترکیبی جدیدی ارائه شده است که در آن، مقدار پارامتر C الگوریتم ماشین بردار پشتیبان (SVM) با الگوریتم‌های بهینه‌سازی ازدحام ذرات (PSO) و بهینه‌ساز گرگ خاکستری (GWO) و شبکۀ عصبی (NN) بهینه‌سازی می‌شود. در پژوهش جاری بهترین میانگین صحت به‌دست‌آمده، در صد مرتبه اجرای الگوریتم‌ها 28/98درصد بوده که هنگام ترکیب الگوریتم SVM با NN اتفاق افتاده است.

کلیدواژه‌ها

حساب کاربری جعلی

شبکۀ اجتماعی X (توییتر)

یادگیری ماشین

الگوریتم ماشین بردار پشتیبان

بهینه‌سازی

عنوان مقاله English

Detecting Fake Accounts with Machine Learning Techniques: A Case Study on the X Social Network (Formerly Twitter)

نویسندگان English

Mohammad Rajabian ¹

Monireh Hosseini ²

¹ MSc., Department of Information Technology, Faculty of Industry, K.N. Toosi Universityof Technology, Tehran, Iran

² Associate Prof., Department of Information Technology, Faculty of Industry, K.N. Toosi Universityof Technology, Tehran, Iran

چکیده English

The popularity of online social networks (OSNs) is increasing daily. This has led to malicious individuals exploiting these platforms. Often, these individuals use fake accounts to carry out their malicious actions. Many researchers have focused on identifying fake user accounts on various social networks using machine learning methods. Most previous studies have used feature reduction, selection, or extraction methods to reduce the number of features and increase the speed when using ensemble machine learning methods to accurately identify fake user accounts. In the current research, 67 numerical features were selected to identify fake user accounts, which ensures high classification speed and eliminates the need for feature reduction. Additionally, a new hybrid method has been proposed in which the Support Vector Machine (SVM) algorithm is optimized with Particle Swarm Optimization (PSO), Grey Wolf Optimizer (GWO), and Neural Networks (NN). In the current research, the best average accuracy achieved over one hundred runs of the algorithms was 98.28%, which occurred when combining the SVM algorithm with NN.

کلیدواژه‌ها English

Fake account

X social network

Machine learning

Support Vector Machines (SVM)

Optimization

رمضان‌زاده رستمی، رضا و کرباسی، سهیلا (1397). شناسایی حساب‌های جعلی در شبکۀ اجتماعی توییتر با استفاده از رویکرد انتخاب ویژگی ترکیبی چندهدفه. ششمین همایش ملی مدیران فناوری اطلاعات، تهران.

Alhassun, A. S. & Rassam, M. A. (2022). A Combined Text-Based and Metadata-Based Deep-Learning Framework for the Detection of Spam Accounts on the Social Media Platform Twitter. Processes, 10(3). https://doi.org/10.3390/pr10030439

Bala Anand, M., Karthikeyan, N., Karthik, S., Varatharajan, R., Manogaran, G. & Sivaparthipan, C. B. (2019). An enhanced graph-based semi-supervised learning algorithm to detect fake users on Twitter. The Journal of Supercomputing, 75(9), 6085-6105. https://doi.org/10.1007/s11227-019-02948-w

Ben Sassi, I. & Ben Yahia, S. (2021). Malicious accounts detection from online social networks: a systematic review of literature. International Journal of General Systems, 50(7), 741-814. https://doi.org/10.1080/03081079.2021.1976773

Bharti, K. K. & Pandey, S. (2021). Fake account detection in twitter using logistic regression with particle swarm optimization. Soft Computing, 25(16), 11333-11345. https://doi.org/10.1007/s00500-021-05930-y

Cea, C. (2021). Dataset for supervised bot detection on Twitter. https://doi.org/10.5281/zenodo.5574403

Cresci, S., Di Pietro, R., Petrocchi, M., Spognardi, A. & Tesconi, M. (2015). Fame for sale: Efficient detection of fake Twitter followers. Decision Support Systems, 80, 56-71. https://doi.org/https://doi.org/10.1016/j.dss.2015.09.003

Dixon, S.J. (2023). Global integrated drought monitoring and prediction system (GIDMaPS) data sets. Availavble: https://www.statista.com/statistics/303681/twitter-users-worldwide

Jabardi, M. & Hadi, A. (2021). Ontology Meter for Twitter Fake Accounts Detection. International Journal of Intelligent Engineering and Systems, 14, 410-419. https://doi.org/10.22266/ijies2021.0228.38

Jia, J., Wang, B. & Gong, N. Z. (2017, 26-29 June 2017). Random Walk Based Fake Account Detection in Online Social Networks. 2017 47th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN).

Khaled, S., El-Tazi, N. & Mokhtar, H. M. O. (2018, 10-13 Dec. 2018). Detecting Fake Accounts on Social Media. 2018 IEEE International Conference on Big Data (Big Data).

Kondeti, P., Yerramreddy, L. P., Pradhan, A. & Swain, G. (2021, 2021//). Fake Account Detection Using Machine Learning. Evolutionary Computing and Mobile Sustainable Networks, Singapore.

Kwak, H., Lee, C., Park, H. & Moon, S. (2010, April). What is Twitter, a social network or a news media? In Proceedings of the 19th international conference on World wide web (pp. 591-600).

Meligy, A., Ibrahim, H. & Torky, M. (2017). Identity Verification Mechanism for Detecting Fake Profiles in Online Social Networks. International Journal of Computer Network and Information Security, 9, 31-39. https://doi.org/10.5815/ijcnis.2017.01.04

Mujeeb, S. & Gupta, S. (2022, 2022//). Fake Account Detection in Social Media Using Big Data Analytics. Proceedings of Second International Conference on Advances in Computer Engineering and Communication Systems, Singapore.

Naga Praveena, T.A. M., Singh, R., Manogna, S. & Pragna, B. (2024). Fake profile identification in social network using machine learning and NLP. Journal of Engineering Sciences, 15(01), 1-11.

Pakaya, F. N., Ibrohim, M. O. & Budi, I. (2019, 16-17 Oct. 2019). Malicious Account Detection on Twitter Based on Tweet Account Features using Machine Learning. 2019 Fourth International Conference on Informatics and Computing (ICIC).

Prabhu Kavin, B., Karki, S., Hemalatha, S., Singh, D., Vijayalakshmi, R., Thangamani, M., Haleem, S. L. A., Jose, D., Tirth, V., Kshirsagar, P. R. & Adigo, A. G. (2022). Machine Learning-Based Secure Data Acquisition for Fake Accounts Detection in Future Mobile Communication Networks. Wireless Communications and Mobile Computing, 2022, 6356152. https://doi.org/10.1155/2022/6356152

Revathi, S. & Suriakala, D. M. (2018, 20-22 Dec. 2018). Profile Similarity Communication Matching Approaches for Detection of Duplicate Profiles in Online Social Network. 2018 3rd International Conference on Computational Systems and Information Technology for Sustainable Solutions (CSITSS).

Sahoo, S. R. & Gupta, B. B. (2019). Hybrid approach for detection of malicious profiles in twitter. Computers & Electrical Engineering, 76, 65-81. https://doi.org/10.1016/j.compeleceng.2019.03.003

Sai Raja, E. V., Aditya, B. L. & Mohanty, S. N. (2023). Fake Profile Detection Using Logistic Regression and Gradient Descent Algorithm on Online Social Networks. EAI Endorsed Transactions on Scalable Information Systems, 11(1). https://doi.org/10.4108/eetsis.4342

Shen, X., Lv, W., Qiu, J., Kaur, A., Xiao, F. & Xia, F. (2023). Trust-Aware Detection of Malicious Users in Dating Social Networks. IEEE Transactions on Computational Social Systems, 10(5), 2587-2598. https://doi.org/10.1109/TCSS.2022.3174011

Tang, Y., Zhang, D., Liang, W., Li, K. C., & Sukhija, N. (2021, December). Active malicious accounts detection with multimodal fusion machine learning algorithm. In Inernational conference on ubiquitous security (pp. 38-52). Singapore: Springer Singapore.

Wani, M. A., Agarwal, N., Jabin, S. & Hussain, S. Z. (2019, 7-11 Jan). Analyzing Real and Fake users in Facebook Network based on Emotions. 2019 11th International Conference on Communication Systems & Networks (COMSNETS).