
Last update: 04/08/2021
Optimising open data from Luxembourg’s historical newspapers
The National Library of Luxembourg has developed an OCR tool that users of historical open data can find pre-trained on GitHub. The tool enhances the quality of existing XML schema or of the regular OCR engine, and it currently ships with a “training set”. Nautilus-OCR is an open source software tool provided by the Bibliothèque nationale de Luxembourg (BnL), the National Library of Luxembourg. BnL started digitising newspapers in 2006, using layout recognition and Optical Character Recognition (OCR). The repository for Nautilus-OCR was created by the reuse of…
Open Source Software