DSpace CSV metadata quality checker


Bibliographic Details
Main author: Orth, Alan S.
Format: Source Code
Language: English
Published: International Livestock Research Institute 2019
Online access: https://hdl.handle.net/10568/110997
Description
Summary: A simple but opinionated metadata quality checker and fixer designed to work with CSVs in the DSpace ecosystem (though it could theoretically work on any CSV that uses Dublin Core fields as columns). The implementation is essentially a pipeline of checks and fixes: it begins by splitting multi-value fields on the standard DSpace separator and trimming leading and trailing whitespace, then proceeds to more specialized cases such as ISSNs, ISBNs, languages, unnecessary Unicode, and AGROVOC terms.
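
The first stages of the pipeline described above might look roughly like the following. This is a minimal sketch and not the project's actual code: the function names (`clean_field`, `is_valid_issn`, `clean_csv`) and the sample column names are hypothetical, and it assumes `||` is the multi-value separator (the DSpace default). The ISSN check uses the standard ISSN check-digit rule as one example of a "more specialized" check.

```python
import csv
import io

# Assumption: "||" is the multi-value separator (the DSpace default).
SEPARATOR = "||"


def clean_field(value):
    """Split on the separator, trim each value, drop empties, and rejoin."""
    parts = [part.strip() for part in value.split(SEPARATOR)]
    return SEPARATOR.join(part for part in parts if part)


def is_valid_issn(issn):
    """Check an ISSN such as '0378-5955' against its check digit."""
    chars = issn.replace("-", "").upper()
    if len(chars) != 8 or not all(c in "0123456789" for c in chars[:7]):
        return False
    # Weights 8 down to 2 are applied to the first seven digits.
    total = sum(int(d) * w for d, w in zip(chars[:7], range(8, 1, -1)))
    check = (11 - total % 11) % 11
    expected = "X" if check == 10 else str(check)
    return chars[7] == expected


def clean_csv(text):
    """Apply clean_field to every cell of a CSV given as a string."""
    reader = csv.DictReader(io.StringIO(text))
    return [
        {column: clean_field(value or "") for column, value in row.items()}
        for row in reader
    ]


# Hypothetical sample data; the column names are illustrative only.
sample = (
    "dc.title,dc.subject,dc.identifier.issn\n"
    '"  A study of cattle ","  Kenya || Uganda ||",0378-5955\n'
)
rows = clean_csv(sample)
```

In this sketch each check is a small function applied per cell or per value, so further checks (ISBNs, language codes, and so on) could be added as additional functions without changing the CSV handling.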