Lexical Database Archiving Questionnaire

Featured

It's true!

I am asking around on different mailing lists to gain some insight into the archiving habits of linguists who use lexical databases. I am specifically interested in databases created by tools like FLEx, ToolBox, Lexus, TshwaneLex, etc.

Background Story Continue reading

Academic Heritage in MARC records

I have long wondered how I might encode information contained in the https://academictree.org in MARC Authority Records. Obviously via MARC relator roles the information might be encoded within the bibliographic record. However, looking at the MARC 510 field on the linked record https://lccn.loc.gov/no2004069960, it seems that the MARC 500 field on authority Records might be used with several of the sub-fields.

Converting DC Subjects to Schema.org

DC has the ability to specify subjects such as UDC, LCSH or DDC. How would these be converted to schema.org?

{
"@context": "https://schema.org/",
"@type": "Book",
"name": "British Isles - A History",
"about": {
"@type": "DefinedTerm",
"@id": "https://id.oclc.org/worldcat/ddc/E48m7BJXdrm3XY4Xv86d88WGt3",
"name": "British Isles",
"termCode": "941",
"inDefinedTermSet": "https://id.oclc.org/worldcat/ddc/"
}
}

Language Documentation Gear

I have been investigating some new video equipment for language documentation. Currently highly recommended is the Osmo Pocket 3 and the GoPro Hero 11 (as this has GPS embedded whereas the GP12 does not). It is not entirely clear if the GP13 has GPS or not.

Also my microphones need some attention. I need some endcaps and some windscreens.

I have been thinking about using updating my capabilities for recording and regularly recording Katja Playing Music.

Serials, MARC Records and RDA Core

So I have several questions:

  1. What does a high (over the top) MARC Record look like both for the ISSN(s) and the article?
  2. What is the overlap between RDA Core and Dublin Core?
  3. What user tasks are left out between RDA full and RDA Core?

Some resources:

For RDA tips:
https://web.library.yale.edu/book/export/html/1733
https://libguides.nmstatelibrary.org/rda

RDA Core
https://www.loc.gov/aba/rda/pdf/core_elements.pdf
https://www.librarianshipstudies.com/2016/03/rda-core-elements.html
https://www.loc.gov/aba/rda/core_elements.html

RDA Core for serials: https://www.youtube.com/watch?v=AF2DDYczBeA
https://socialsci.libretexts.org/Courses/Sacramento_City_College/LIBT331%3A_Library_Cataloging_Procedures_(Li)/03%3A_Copy_Cataloging/3.04%3A_CONSER_RDA_Core_Elements_for_Serials

OLAC CMS options via XML

One plan for pushing language resources to the web en-mass before a full django application is available is to use Hugo and its XML input process to parse OALC/OAI-PMH files (generated with an OAI-PMH bash script) and then convert bibtex entries to xml (using Jabref or commandline) or JSON. The content issue to overcome is that pages in Hugo have to have front matter. The XML and JSON are both content and front matter (or visible front matter). I think I need to generate an .md file for each entry meaning XML data is not really that useful. But there may be hope

OLAC Collection Description and Linked Data Terms

The DCTerms provides some fields for use in the description of Collections. While it may be possible to use terms from specific vocabularies in the CLD (see a list of works) or see the whole application profile here. However, It is interesting to note that RDA has some of these terms too and even though purls were minted for CLD that linked data URIs exist for these concepts as well.

Historical versions 1 and the collection provenance term .

Zotero Plugins

Zotero is having a bit of an explosion with plugins. Really cool to see. Some of the ones I benefit from:

https://github.com/bwiernik/zotero-shortdoi

https://github.com/MuiseDestiny/zotero-reference
https://github.com/volatile-static/Chartero

https://github.com/panhaoyu/zotero-categorial-tags
https://github.com/syt2/zotero-addons

https://github.com/windingwind/zotero-actions-tags
https://github.com/PubPeerFoundation/pubpeer_zotero_plugin

https://github.com/jlegewie/zotfile

https://github.com/RoadToDream/ZotMeta

https://github.com/franzbischoff/zotero-pdf-metadata

https://github.com/github-young/zotero-better-authors

https://github.com/fkguo/zotero-inspire

https://github.com/ChenglongMa/zoplicate

OLAC and User Tasks

What are the User Task in OLAC and for an OLAC 2.0?

What overlap does DCTerms have with the IFLA-LRM/RDA models? That is, if OLAC stays with DCTerms, what user tasks is it known to not support? On the flip side, what user tasks do archives need to support beyond OLAC's capabilities?

Supporting literature:

https://www.researchgate.net/publication/235869817_The_actual_importance_of_RDA_elements_in_supporting_key_user_tasks
https://www.tandfonline.com/doi/abs/10.1080/01639374.2013.825827