Small Codes: a platform for digital resources and tools for minority languages and dialects / Carlo Zoli, Greta Mazzaggio, Neri Binazzi. - Electronic. - (2025), pp. 1-9. (Paper presented at the Digital Heritage 2025 conference).

Small Codes: a platform for digital resources and tools for minority languages and dialects.

Greta Mazzaggio; Neri Binazzi
2025

Abstract

Small Codes is an open digital infrastructure designed to support the preservation and revitalization of minority languages through scalable, interoperable, and user-friendly tools. The platform combines linguistic data management with web-based technologies, offering an integrated suite of software modules (including online dictionaries, spell-checkers, corpus alignment systems, linguistic maps, and multimedia archives) tailored for under-resourced and dialectally fragmented languages. Unlike standard language technology pipelines designed for dominant languages, Small Codes supports linguistically diverse input and community-led data models. It operates through a federated, semi-industrial development model, balancing long-term sustainability with flexibility for academic and institutional partners. This paper outlines the system architecture and core functionalities of Small Codes, presents selected implementation scenarios, and discusses its contribution to digital heritage and computational dialectology.
DIGITAL HERITAGE (2025)
Digital Heritage 2025
Carlo Zoli, Greta Mazzaggio, Neri Binazzi
Files in this item:
File: dh20253331.pdf
Access: open access
Type: Publisher's PDF (Version of record)
License: Creative Commons
Size: 979.58 kB
Format: Adobe PDF

Documents in FLORE are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this resource: https://hdl.handle.net/2158/1434635