Spatial Coverage on the OLAC network

The issues is that OLAC and these other uses of Dublin Core don't agree in the semantics of spatial coverage.

https://archive-intranet.ardc.edu.au/display/DOC/Spatial+coverage#:~:text=Spatial%20coverage%20refers%20to%20a,the%20focus%20of%20an%20activity.

Critical question here, is one where we ask: "what do English think geography is for language?"

Thinking deeply about:

OLAC spelling mistakes

I wonder how many spelling mistakes we can find in various records in OALC... This is a great reason for OLAC to retain the kind of language the record is in.

https://opensource.com/article/18/2/aspell

Spell Checking Your Programming from the Linux Command Line

Install with my blog workflow:
https://github.com/tbroadley/spellchecker-cli

other options: https://vi.stackexchange.com/questions/22220/how-to-make-spell-check-work-for-text-inside-a-xml-file
https://metacpan.org/dist/XML-Twig/view/tools/xml_spellcheck/xml_spellcheck

OLAC data quality investigator

On the flight back from Finland I found it challenging to use my laptop and pulled out my scratch pad to draw out some ideas I was having. One of those ideas was an idea for a record quality investigator. A tool which lets one investigator the presence or absence of features or sets of features in a record or set of records. The goal is to look for any patterns in the records which might be interesting and notable.

What follows are my written notes.

Page 1
Page 2
Page 3
Page 4
Page 5
Page 6
Page 7
Page 8

OLAC Validator Custom Messages

OLAC Validator custom messages can be created following these steps:
https://xerces.apache.org/xerces2-j/faq-xs.html#faq-4

This is the software it uses for its validator: https://xerces.apache.org/xerces-p/samples/validator.html Ideally this would also be containerized with the other parts.

One approach to get this containerized might be to use this script (which is older and linux oriented) https://github.com/dgricci/xmllint

Another option is to use: https://hub.docker.com/r/isaitb/xml-validator

If this service were implemented on a new server, with a web-interface we might expect to use a newer HTML front end.

here is what I found via gitub:

https://github.com/ebruchez/darius-xml.js
https://github.com/fulvio999/jxmlutil

Darius looks more promising but neither are "out of the box" tools.

Stack Exchange for Language Resource Archiving

I wonder if it would be productive to have language archive discussions via stack exchange. OLAC implementation, cataloging, and indexing discussions.

https://area51.stackexchange.com/faq

1000 questions in 6 months with 70% answers... that means 6 people asking one question a day for 6 months. Do we have that many questions? Do we have that big of a community?