Lexical Database Archiving Questionnaire

Featured

It's true!

I am asking around on different mailing lists to gain some insight into the archiving habits of linguists who use lexical databases. I am specifically interested in databases created by tools like FLEx, ToolBox, Lexus, TshwaneLex, etc.

Background Story Continue reading

OLAC with other materials

One plan for pushing language resources to the web en-mass before a full django application is available is to use Hugo and its XML input process to parse OALC/OAI-PMH files (generated with an OAI-PMH bash script) and then convert bibtex entries to xml (using Jabref or commandline) or JSON. The content issue to overcome is that pages in Hugo have to have front matter. The XML and JSON are both content and front matter (or visible front matter). I think I need to generate an .md file for each entry meaning XML data is not really that useful. But there may be hope

OLAC Collection Description and Linked Data Terms

The DCTerms provides some fields for use in the description of Collections. While it may be possible to use terms from specific vocabularies in the CLD (see a list of works) or see the whole application profile here. However, It is interesting to note that RDA has some of these terms too and even though purls were minted for CLD that linked data URIs exist for these concepts as well.

Historical versions 1 and the collection provenance term .

Zotero Plugins

Zotero is having a bit of an explosion with plugins. Really cool to see. Some of the ones I benefit from:

https://github.com/bwiernik/zotero-shortdoi

https://github.com/MuiseDestiny/zotero-reference
https://github.com/volatile-static/Chartero

https://github.com/panhaoyu/zotero-categorial-tags
https://github.com/syt2/zotero-addons

https://github.com/windingwind/zotero-actions-tags
https://github.com/PubPeerFoundation/pubpeer_zotero_plugin

https://github.com/jlegewie/zotfile

https://github.com/RoadToDream/ZotMeta

https://github.com/franzbischoff/zotero-pdf-metadata

https://github.com/github-young/zotero-better-authors

https://github.com/fkguo/zotero-inspire

https://github.com/ChenglongMa/zoplicate

OLAC and User Tasks

What are the User Task in OLAC and for an OLAC 2.0?

What overlap does DCTerms have with the IFLA-LRM/RDA models? That is, if OLAC stays with DCTerms, what user tasks is it known to not support? On the flip side, what user tasks do archives need to support beyond OLAC's capabilities?

Supporting literature:

https://www.researchgate.net/publication/235869817_The_actual_importance_of_RDA_elements_in_supporting_key_user_tasks
https://www.tandfonline.com/doi/abs/10.1080/01639374.2013.825827

Customer Service Model in Libraries

The service desk needs a customer service Model.

The question is how can OLAC 2.0 implement the customer service Model/Customer interaction model and to what effect? this led me to ask how is the interview conducted via online tools, and then how does one show empathy via a digital interaction: https://www.interaction-design.org/literature/topics/empathize https://blog.resellerclub.com/empathy-in-web-design-the-what-the-how-and-the-why/

see the five stages here: https://learn.g2.com/customer-interaction some of the diagrams here.

read: https://www.freshworks.com/freshdesk/customer-interaction

Consider the applicability of: https://www.researchgate.net/figure/Framework-of-the-customer-service-platform-business-model_fig1_352044737 from: A Data-Enabled Business Model for a Smart Healthcare Information Service Platform in the Era of Digital Transformation

Venn diagrams

I found the following resources really helpful with boolean operators and Venn diagrams.

Snarky Math (Director). (2021, October 21). Can you draw a Venn diagram for 4 sets? | Why Venn diagrams are not easy [Streamed]. Snarky Math. https://youtu.be/IekSOZIF5uI

Student Contributors The University of Edinburgh School of Informatics. (n.d.). Better Informatics. Betterinformatics.com. Retrieved November 10, 2024, from https://betterinformatics.com/resources/inf1-cl/venn/

This one allows prime sets: https://statpowers.com/venn.html

The University of Edinburgh School of Informatics. (n.d.). Venn Diagrams. The University of Edinburgh School of Informatics Teaching Aids. Retrieved November 10, 2024, from https://www.inf.ed.ac.uk/teaching/courses/inf1/cl/tools/venn/

I haddn't really thought about what they represent or the appropriateness of their use. https://www.sciencedirect.com/science/article/abs/pii/B9780444529374500113 https://blog.jooq.org/say-no-to-venn-diagrams-when-explaining-joins/ https://github.com/tctianchi/pyvenn This python lib is interesting for generating visualizations if they are accurate. I used an inaccurate visualization in my presentation on OLAC roles. Maybe this could be added to django to update automatically. https://www.sciencedirect.com/topics/mathematics/venn-diagram https://www.dubberly.com/concept-maps/visualizing-venn-diagrams.html

Django Database options

I have been looking at Django for several projects. I've been looking at implementation details including hosting and database.

database options for Django and have settled on PostGIS as in PostgresSQL with GIS support.
https://stackoverflow.com/questions/3743632/gis-postgis-postgresql-vs-mysql-vs-sql-server

In general I have found reading through the following helpful
https://www.digitalocean.com/community/tutorials/how-to-create-a-django-app-and-connect-it-to-a-database
https://djangobuilder.io/#/
http://darrenoneill.co.uk/post/using-postgis-and-geodjango-find-nearest-neighbour/
https://medium.com/@amirm.lavasani/classic-machine-learning-in-python-k-nearest-neighbors-knn-a06fbfaaf80a
https://www.geeksforgeeks.org/find-the-nearest-node-to-a-point-using-osmnx-distance-module/
https://github.com/fabiocaccamo/django-treenode
https://github.com/peopledoc/django-ltree-demo


https://stackoverflow.com/questions/38921444/modelling-a-polymorphic-tree-with-django-treebeard
https://dev.to/pawnsapprentice/postgresql-in-geospatial-applications-unleashing-the-power-of-location-data-4jan
https://github.com/gboeing/osmnx
https://github.com/feincms/django-tree-queries
https://django-treebeard.readthedocs.io/en/stable/

Find adjacent language equals
Weight algorithm: Node edge in lang-tree = .5, overlapping GIS = .20, adjacent GIS edge = .25