Advanced Deep Fake Detection System

Authors

  • Kumpatla Chinna Department of Artificial Intelligence and Machine Learning, Bonam Venkata Chalamayya Engineering College, Andhra Pradesh India
  • Bolla Siva Naga kumar Department of Artificial Intelligence and Machine Learning, Bonam Venkata Chalamayya Engineering College, Andhra Pradesh India
  • Somisetti Satya Department of Artificial Intelligence and Machine Learning, Bonam Venkata Chalamayya Engineering College, Andhra Pradesh India
  • Polisetti S.S.C. Mahesh Department of Artificial Intelligence and Machine Learning, Bonam Venkata Chalamayya Engineering College, Andhra Pradesh India
  • Bokka Satish Department of Artificial Intelligence and Machine Learning, Bonam Venkata Chalamayya Engineering College, Andhra Pradesh India
  • Alapati Ramakrishna Department of Artificial Intelligence and Machine Learning, Bonam Venkata Chalamayya Engineering College, Andhra Pradesh India

DOI:

https://doi.org/10.63671/ijsesr.v2i1.70

Keywords:

Deepfake Detection, Multimodal Learning, Convolutional Neural Network (CNN), Random Forest Classifier, TF-IDF Vectorizer, Cosine Similarity, Vision Transformer, SIGLIP, RoBERTa, AI-Generated Content Detection, Digital Forensics, Content Authentication, Machine Learning

Abstract

The production and distribution of deepfake material in text, audio, video, and image modalities has greatly grown because to the quick development of generative artificial intelligence. Public safety, journalism, cybersecurity, and digital trust are all seriously threatened by such synthetic media. The efficiency of current detection techniques in handling intricate, multimodal manipulation techniques is limited by their primary focus on single-modality analysis. This work offers an integrated multimodal deepfake detection methodology intended to evaluate authenticity across diverse media sources inside a single architecture in order to get over this restriction. A Convolutional Neural Network (CNN) is used to extract spatial information from video frames and detect manipulation traces, visual artifacts, and face abnormalities in order to detect video deepfakes. In order to detect audio deepfakes, discriminative acoustic features are extracted and then classified using a Random Forest method, which offers resilience against attacks using speech synthesis and voice cloning. A TF-IDF Vectorizer in conjunction with Cosine Similarity is used to quantify the semantic similarity between reference materials and input text in order to detect textual plagiarism. The system incorporates a refined vision transformer model based on SIGLIP for image authenticity verification in order to differentiate between AI-generated images and content provided by humans. Furthermore, a RoBERTa-base transformer model that can categorize both machine-generated and human-written text utilizing contextual embeddings is used for AI-generated text identification. A full authenticity score is generated by combining the outputs from all modalities. The framework's usefulness for automated content moderation systems and real-world digital forensics is highlighted by experimental evaluation, which shows better robustness and scalability in comparison to isolated detection approaches.

Downloads

Published

2026-03-12

Issue

Section

Articles

How to Cite

Chinna, K., Naga kumar, B. S., Satya, S., Mahesh, P. S., Satish, B., & Ramakrishna, A. (2026). Advanced Deep Fake Detection System. International Journal of Science and Engineering Science Research, 2(1), 44-59. https://doi.org/10.63671/ijsesr.v2i1.70

Similar Articles

You may also start an advanced similarity search for this article.