I am asking around on different mailing lists to gain some insight into the archiving habits of linguists who use lexical databases. I am specifically interested in databases created by tools like
FLEx, ToolBox, Lexus, TshwaneLex, etc.
Background Story Continue reading
It looks like UNT might never adopt OLAC metadata Or even QDC. So is it possible to "upscale" these values via a utility?
I wonder if I can craft a CIDOC translator to OLAC.
Standards and guidelines
Three utilities I find helpful
extract images out of PDFs: https://askubuntu.com/questions/150100/extracting-embedded-images-from-a-pdf
Mark up images online: https://markuphero.com/try/annotate.html
some latex publishers require an eps instead of svg (even though there is an SVG module for Latex). https://cloudconvert.com/svg-to-eps
Content Reuse design,
challenges of the dark web (indexing language archives), challenges of multilingualism (Discovery), challenges of different theoretical terms of art within a multi-discipline area of activity
From time to time I need to reference Heidi Johnson's work published as part of the LREC workshops in 2002 and 2006 under the title: "International Workshop on Resources and Tools in Field Linguistics". The papers never got hosted on the official LREC website. Rather the papers were hosted on the MPI website.
Who do I talk to about getting these papers into the
https://aclanthology.org database of papers. They would get the attention they need in that paper repository.
An idea for R-90.
I should do a reading of Beatrix potter with my Mics... I think it would be great to see how one comes out.
The OLAC validator runs off of an unit of software which has the heartbleed security vulnerability. Thinking about implementing a validator the following software comes to mind.
https://github.com/zimeon/oaipmh-validator There was also an Online OAI-PMH validator from a former engineer on the Europeana project. I think he is based in Greece. His solution is not open source, but he mentioned that he would consider adding the OLAC profile. https://validator.oaipmh.com/
It would be good to see what other OAI-PMH validators look like and how submitters expect to interact with them.
https://rdamsc.bath.ac.uk/msc/t64; https://www.openaire.eu/validator-registration-guide ; https://github.com/EuroCRIS/openaire-cris-validator; https://www.fosteropenscience.eu/content/openaire-compatibility-validator-presentation
I read tonight about a bug in Xoai a foundational library for DSpace and Dataverse which uses lxml lib for parsing.
Since the OLAC XML implementation of OAI-PMH requires the use of an XSI element it seems that the bug defined here
https://github.com/DSpace/xoai/issues/67 and discussed here would apply https://github.com/gdcc/xoai/issues/141