Comics Datasets Framework: Mix of Comics datasets for detection
  benchmarking

Vivoli, Emanuele; Campaioli, Irene; Nardoni, Mariateresa; Biondi, Niccolò; Bertini, Marco; Karatzas, Dimosthenis

Comics, as a medium, uniquely combine text and images in styles often distinct from real-world visuals. For the past three decades, computational research on comics has evolved from basic object detection to more sophisticated tasks. However, the field faces persistent challenges such as small datasets, inconsistent annotations, inaccessible model weights, and results that cannot be directly compared due to varying train/test splits and metrics. To address these issues, we aim to standardize annotations across datasets, introduce a variety of comic styles into the datasets, and establish benchmark results with clear, replicable settings. Our proposed Comics Datasets Framework standardizes dataset annotations into a common format and addresses the overrepresentation of manga by introducing Comics100, a curated collection of 100 books from the Digital Comics Museum, annotated for detection in our uniform format. We have benchmarked a variety of detection architectures using the Comics Datasets Framework. All related code, model weights, and detailed evaluation processes are available at https://github.com/emanuelevivoli/cdf, ensuring transparency and facilitating replication. This initiative is a significant advancement towards improving object detection in comics, laying the groundwork for more complex computational tasks dependent on precise object recognition.

Comics Datasets Framework: Mix of Comics datasets for detection benchmarking / Emanuele Vivoli; Irene Campaioli; Mariateresa Nardoni; Niccolò Biondi; Marco Bertini; Dimosthenis Karatzas. - ELETTRONICO. - (2024), pp. 154-167. ( International Conference on Document Analysis and Recognition Atene 2024).

Comics Datasets Framework: Mix of Comics datasets for detection benchmarking

Emanuele Vivoli;Irene Campaioli;Mariateresa Nardoni;Niccolò Biondi;Marco Bertini;Dimosthenis Karatzas

2024

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

	Data di pubblicazione
	
				2024
			
	Titolo del volume degli atti
	
				International Conference on Document Analysis and Recognition
			
	Titolo del congresso
	
				International Conference on Document Analysis and Recognition
			
	Luogo del congresso
	
				Atene
			
	Data del congresso
	
				2024
			
	Tutti gli autori
	
						Emanuele Vivoli; Irene Campaioli; Mariateresa Nardoni; Niccolò Biondi; Marco Bertini; Dimosthenis Karatzas
					
	Appare nelle tipologie:
	
				4a - Articolo in atti di congresso

File in questo prodotto:

File	Dimensione	Formato
2409.16159v1.pdf accesso aperto Tipologia: Pdf editoriale (Version of record) Licenza: Solo lettura Dimensione 6.34 MB Formato Adobe PDF	6.34 MB	Adobe PDF

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/1395433

Comics Datasets Framework: Mix of Comics datasets for detection benchmarking

Emanuele Vivoli;Irene Campaioli;Mariateresa Nardoni;Niccolò Biondi;Marco Bertini;Dimosthenis Karatzas

2024

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

Citazioni

social impact

Comics Datasets Framework: Mix of Comics datasets for detection benchmarking

Emanuele Vivoli;Irene Campaioli;Mariateresa Nardoni;Niccolò Biondi;Marco Bertini;Dimosthenis Karatzas

2024

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Informazioni

Citazioni

social impact

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)