Detecting the origin of a digital video within a social network is a critical task that aids law enforcement and intelligence agencies in identifying the creators of misleading visual content. In this research, we introduce an innovative method for identifying the original social network of a video, even when the video has been altered through actions like group of frames removal and file container reconstruction. The proposed method takes advantage of the video encoding’s temporal uniformity, leveraging motion vectors to characterize the specific features associated to various social media platforms. Each video is represented by a graph where nodes correspond to macroblocks. These macroblocks are interconnected by following the inter-prediction rules outlined in the H.264/AVC codec standard. Such a structure can be then classified using a graph neural network to predict the platform on which the video has been shared. Experimental results demonstrate that this approach outperforms both codec- and content-based approaches, underscoring the effectiveness of a structural approach in attributing the social media platform from which videos originated.
STRUCTURE MATTERS: ANALYZING VIDEOS VIA GRAPH NEURAL NETWORKS FOR SOCIAL MEDIA PLATFORM ATTRIBUTION / Gemelli A.; Shullani D.; Baracchi D.; Marinai S.; Piva A.. - ELETTRONICO. - (2024), pp. 4735-4739. (Intervento presentato al convegno 49th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 tenutosi a Seoul, Korea nel 2024) [10.1109/ICASSP48485.2024.10447089].
STRUCTURE MATTERS: ANALYZING VIDEOS VIA GRAPH NEURAL NETWORKS FOR SOCIAL MEDIA PLATFORM ATTRIBUTION
Gemelli A.;Shullani D.;Baracchi D.;Marinai S.;Piva A.
2024
Abstract
Detecting the origin of a digital video within a social network is a critical task that aids law enforcement and intelligence agencies in identifying the creators of misleading visual content. In this research, we introduce an innovative method for identifying the original social network of a video, even when the video has been altered through actions like group of frames removal and file container reconstruction. The proposed method takes advantage of the video encoding’s temporal uniformity, leveraging motion vectors to characterize the specific features associated to various social media platforms. Each video is represented by a graph where nodes correspond to macroblocks. These macroblocks are interconnected by following the inter-prediction rules outlined in the H.264/AVC codec standard. Such a structure can be then classified using a graph neural network to predict the platform on which the video has been shared. Experimental results demonstrate that this approach outperforms both codec- and content-based approaches, underscoring the effectiveness of a structural approach in attributing the social media platform from which videos originated.I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.