Transkribus → eScriptorium

Archives nationales de France

The Archives nationales de France, working with Inria, launched the LECTAUREP project to automatically read Parisian notaries' repertoires, a corpus of over a million heterogeneous, heavily abbreviated handwritten pages. The project began with Transkribus but migrated to the open-source eScriptorium to keep control of its tooling and full ownership of its trained models. That meant exporting PAGE XML ground truth from Transkribus and painstakingly retraining models in the Kraken engine. The team produced specialised models for 19th-century French administrative hands and built an open pipeline that outputs TEI-compliant XML, greatly improving searchable access to the notarial record. It stands as a flagship handwritten text recognition (HTR) migration for a national archive.
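To make the export step more concrete, here is a minimal sketch of how PAGE XML ground truth can be read before retraining. It is not the LECTAUREP project's own code: the function names (`local`, `child`, `read_page_lines`) and the sample page are invented for illustration, and it assumes the usual PAGE layout in which each `TextLine` carries a `Baseline` and a `TextEquiv/Unicode` transcription. Tags are matched by local name so that the sketch does not depend on a particular PAGE schema version.

```python
import xml.etree.ElementTree as ET


def local(tag: str) -> str:
    """Strip the XML namespace so any PAGE schema version matches."""
    return tag.rsplit("}", 1)[-1]


def child(elem, name):
    """Return the first direct child with the given local name, or None."""
    for c in elem:
        if local(c.tag) == name:
            return c
    return None


def read_page_lines(xml_text: str):
    """Collect id, baseline points and transcription for each TextLine."""
    root = ET.fromstring(xml_text)
    lines = []
    for elem in root.iter():
        if local(elem.tag) != "TextLine":
            continue
        baseline = child(elem, "Baseline")
        # Only the line-level TextEquiv is read, not word-level ones.
        equiv = child(elem, "TextEquiv")
        unicode_el = child(equiv, "Unicode") if equiv is not None else None
        text = (unicode_el.text or "") if unicode_el is not None else ""
        lines.append({
            "id": elem.get("id"),
            "baseline": baseline.get("points") if baseline is not None else None,
            "text": text.strip(),
        })
    return lines


# Invented sample page, not taken from the notarial corpus.
SAMPLE = """
<PcGts xmlns="http://schema.primaresearch.org/PAGE/gts/pagecontent/2013-07-15">
  <Page imageFilename="page_0001.jpg" imageWidth="2000" imageHeight="3000">
    <TextRegion id="r1">
      <TextLine id="r1l1">
        <Coords points="100,100 900,100 900,160 100,160"/>
        <Baseline points="100,150 900,150"/>
        <TextEquiv><Unicode>Vente par M. Dupont</Unicode></TextEquiv>
      </TextLine>
      <TextLine id="r1l2">
        <Baseline points="100,230 900,230"/>
        <TextEquiv><Unicode></Unicode></TextEquiv>
      </TextLine>
    </TextRegion>
  </Page>
</PcGts>
"""

lines = read_page_lines(SAMPLE)
# Keep only transcribed lines; empty ones are of no use as ground truth.
usable = [line for line in lines if line["text"]]
for line in usable:
    print(line["id"], line["baseline"], line["text"])
```

A check like the `usable` filter is worth running over an export before retraining, since untranscribed lines add nothing to a recognition model's training set.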

Original source
LECTAUREP — la lecture automatique de répertoires de notaires
Archives nationales de France
The Archive Migration Review summarises this story in its own words and links to the original source for verification. We are editorially independent and not affiliated with the institution or software project named above. Summaries are compiled in good faith from publicly available accounts; corrections are welcome.