Schechter Institutes & National Library of Israel (MiDRASH)
Within the ERC-funded MiDRASH project, the Schechter Institutes and the National Library of Israel set out to extract text from around 30,000 medieval Hebrew, Aramaic and Judeo-Arabic manuscripts, drawing heavily on the Cairo Geniza. Replacing fragmented manual efforts, the team used eScriptorium to run global 'transcribe-a-thons': Kraken models generated rough baseline transcriptions, crowdsourced volunteers and scholars corrected them, and those corrections were fed back to recursively improve the models. The open-source approach ensured both the resulting data and the underlying algorithms remained permanently public. A leading Israeli example of combining HTR with large-scale scholarly crowdsourcing.