






SD-LLOD’23
Notes on an interesting workshop.
Registration at NexusLinguarum still needed... from a USA perspective it seems like a cult... but maybe this is Scientific European Union culture around technology.
https://mail.google.com/mail/u/0/#search/LLOD/FMfcgzGsmDqHGCMDmtmPjvlZDnLqKTDq
5th Summer Datathon on Linguistic Linked Open Data (SD-LLOD-23)
https://vocbench.op.europa.eu/#/Home
https://docs.google.com/forms/d/1CnIbd_KT0racSheTyNcnwcrcocTUhWr0f1USAGc5tyM/viewform?edit_requested=true
Customizing Hugo Wowchemy
The following linked article shows how to make posts appear in isotope format. This can be useful in other contexts too:
https://github.com/wowchemy/wowchemy-hugo-themes/issues/808#issuecomment-751313461
And this blog post for some other very helpful tips:
https://nickballou.com/blog/custom-wowchemy/
Legal issues in Linguistics
I've been looking a the growing scholarship around legal issues linguists face in their research. The following is a list of links.
- https://www.clarin.eu/content/legal-information-platform
- http://www.lrec-conf.org/proceedings/lrec2016/workshops/LREC2016Workshop-Legal-Issues.pdf
- http://www.elra.info/en/dissemination/legal-issues-papers/legal-issues-webcrawling-report/
- http://www.elra.info/en/dissemination/legal-issues-papers/
- http://www.elra.info/en/tag/207/
- http://www.elra.info/en/elra-events/legal-issues-workshop-lrec2016/
OLAC: Multi-Audience Service Architecture


Original hand drawn source image.
Hugo Shortcode list
The list of Hugo Shortcodes on Github is endless... almost endless.
Here are some of the more interesting ones:
Maps with GPX Tracks
* https://github.com/altrdev/hugo-leaflet
* https://github.com/martinschneider/hugo-gpx-shortcode
Archives
* https://github.com/davidsneighbour/hugo-shortcodes
Gallery
- https://github.com/whimSYZ/hugo-gallery-shortcode
- https://github.com/Bishma/hugo-shortcodes/tree/main/image-squash
- https://matze.rocks/images/
- https://github.com/mfg92/hugo-shortcode-gallery
- Photos on a Map: https://tmuguet.gitlab.io/hugo-split-gallery/posts/grand-veymont/
- https://gitlab.com/tmuguet/hugo-split-gallery
Callout:
* https://github.com/mr-islam/hugo-callout
React App and Chart.js
* https://github.com/romankurnovskii/awesome-hugo-shortcodes
Plotly:
https://metalblueberry.github.io/post/howto/2019-11-23_add_plots_with_hugo_shortcodes/
EXIF:
* https://github.com/shombando/shom.dev/commit/7837de9a07fb6bebbd0ba1d4a689cf4cb020a4ad#diff-e09652119eab2b17c2119eefe3e1b9c388520440620609a8d6118cc099930635
* https://github.com/shombando/shom.dev/tree/7837de9a07fb6bebbd0ba1d4a689cf4cb020a4ad
* https://shom.dev/posts/20220128_hugo-photos-with-exif-data/
* https://github.com/Wivik/hugo-shortcodes
Google Contact Form:
* https://github.com/pkgstore-old-01/hugo-ui-shortcodes/blob/main/layouts/shortcodes/form-contact.google.html
Render PDF to the browser:
* https://github.com/sytranvn/hugo-pdf
Analytics:
* https://github.com/holehan/hugo-component-matomo
Podcast feed:
* https://github.com/valeriogalano/podcast-feed-hugo
atom feed:
* https://github.com/jhauraw/hugo-atom-xml-template
Convert to Hugo
* https://github.com/hzmangel/wp2hugo
Sections
* https://cloudcannon.com/blog/the-ultimate-guide-to-hugo-sections/
BibTex Materials in XML Markup
It seems that BibTeX materials can be written as XML.
Here are some resources I found on this:
- a python2 script: https://www.eecis.udel.edu/~sprenkle/bibtex2html/bibtex2xml.py
- an overview presentation: https://pdfs.semanticscholar.org/8c8e/44b18bacac15f14113af3d4b55f028e0a842.pdf
- Software Package: http://www.sfs.uni-tuebingen.de/hpsg/archive/projects/bibtex2html/
- Prior research: https://web.science.mq.edu.au/~rdale/resources/bibtex/index.html
- maybe some XSLT files: https://sourceforge.net/projects/bibtexml/
Hugo and XML
I did not know that Hugo could query XML...
I discovered this.... this is in fact really exciting news.
Three tools for study:
- https://gohugo.io/functions/transform.unmarshal/#xml-data
- https://github.com/gohugoio/hugoDocs/issues/1622
- https://discourse.gohugo.io/t/parsing-xml-in-data-with-hugos-new-xml-support-attributes/36654
Ski tips
Metadata Interoperability at OLAC
This week we had a lecture on metadata interoperability. Interoperability is a major theme of Gary Simons work on OLAC. It was the keyword or concept that he used to push the social behavior requirements related to the activities around, in, and at language archives.
I think that across the history of OLAC there have been various understandings on the kinds of metadata needed to describe language resources. That is, discovery is the architectural goal of OLAC, but other requirements also exist. In the beginning of OLAC many of the participants were looking at OLAC for a complete solution to the kinds of metadata they should be collecting and using. The other requirements upon resource stewards have always meant additional fields in diverse institutional contexts. The freedom to explore these other requirements has not always been explored or embraced by stewards. Some have seen OLAC as an all or nothing involvement. Maybe the fear has been that there will be divergence from a communal norm.
However, my perspective is that it is quite normal for each institution to have its own metadata schema or application profile some portion of which gets shared with OLAC.
With this as background then, with the assumption that different management practices will produce different metadata schemes it seems reasonable that each institution should update their schema from time to time. This implies that metadata quality in terms of coverage or "encoding" is a moving target. Another implication then, is that even in fields which are shared with the OLAC aggregator and are defined in the OLAC metadata application profile, that those fields may have different internal syntax at different providers or at different time depths of the records creation.
The ISO639-3 field is one evidence of evolutionary change. This standard has fields which split and merge from time to time. Associating a records time of creation with a version of an institutions metadata schema is a useful dynamic when evaluating a record's quality.
The question is how should a record and the version of its applicable metadata profile be associated in the OLAC context? How should this information be communicated to record viewers?
The answer is rather straightforward, but requires two parts. The first part requires a modification to the archive profile to have two information bits:
- The name of the native application profile at the data provider
- A link to the native metadata application profile documentation
The documentation should be in a publicly accessible place so that the provided metadata makes sense. There are several ways this could be accomplished one way is to create a manifestation record for each iteration of the application profile. These could be related into a collection or they could have a single relation.
which in the listSet
The OLAC OAI record should have in its source in the first harvest the name and version of the native metadata schema used for the generation of the record. The link to the native version of the providers metadata schema's documentation should be provided in the archive section of the OAI describer.
Some utilities in OAI can modify data, some can be servers only, some havesters only, some harvesters and servers.
Some OAI providers are
Using record sets:
OLAC could allow end-users to dynamically create sets of records for export using the setSpec part of OAI. Playing with this and audience interest might create some social interest.