BibTex Materials in XML Markup

It seems that BibTeX materials can be written as XML.

Here are some resources I found on this:

  • a python2 script: https://www.eecis.udel.edu/~sprenkle/bibtex2html/bibtex2xml.py
  • an overview presentation: https://pdfs.semanticscholar.org/8c8e/44b18bacac15f14113af3d4b55f028e0a842.pdf
  • Software Package: http://www.sfs.uni-tuebingen.de/hpsg/archive/projects/bibtex2html/
  • Prior research: https://web.science.mq.edu.au/~rdale/resources/bibtex/index.html
  • maybe some XSLT files: https://sourceforge.net/projects/bibtexml/

Hugo and XML

I did not know that Hugo could query XML...

I discovered this.... this is in fact really exciting news.

Three tools for study:

  • https://gohugo.io/functions/transform.unmarshal/#xml-data
  • https://github.com/gohugoio/hugoDocs/issues/1622
  • https://discourse.gohugo.io/t/parsing-xml-in-data-with-hugos-new-xml-support-attributes/36654

XML formats for publishing

  1. TEI based XML format at DHQ. Documentation.
  2. JATS based XML at de Gruyter and tons of other publishers.
  3. DocBook and Balisage-1.3 at Balisage. Example
  4. XLingPaper XML

https://relaxng.org/tutorial-20011203.html
https://www.w3.org/2000/04/schema_hack/

https://www.balisage.net/Proceedings/vol21/print/Tovey01/BalisageVol21-Tovey01.html

https://typeset.io/resources/typeset-evolving-into-scispace/
https://typeset.io/resources/jats-xml-everything-a-publisher-needs-to-know/

JATS in multilingual books:
https://www.ncbi.nlm.nih.gov/books/NBK579699/

JATS
https://www.escienceediting.org/upload/kcse-284.pdf
https://jats.nlm.nih.gov/jats-con/2013/presentations/graham2013.pdf
https://github.com/ncbi/JATSPreviewStylesheets/blob/master/xslt/main/jats-xslfo.xsl
https://assets.pubpub.org/z1qlpyk6/01566244075411.pdf

https://av.tib.eu/media/51339
https://github.com/pkp/texture
https://jats.niso.org/jatswiki/index.php/Tools
https://www.ncbi.nlm.nih.gov/books/NBK425544/

Interesting examples:
https://typeset.io/papers/defining-a-linguistic-area-south-asia-gilv2538jj
Pricing:
https://typeset.io/account/pricing/?source=plans-billing-page

OLAC Validator Custom Messages

OLAC Validator custom messages can be created following these steps:
https://xerces.apache.org/xerces2-j/faq-xs.html#faq-4

This is the software it uses for its validator: https://xerces.apache.org/xerces-p/samples/validator.html Ideally this would also be containerized with the other parts.

One approach to get this containerized might be to use this script (which is older and linux oriented) https://github.com/dgricci/xmllint

Another option is to use: https://hub.docker.com/r/isaitb/xml-validator

If this service were implemented on a new server, with a web-interface we might expect to use a newer HTML front end.

here is what I found via gitub:

https://github.com/ebruchez/darius-xml.js
https://github.com/fulvio999/jxmlutil

Darius looks more promising but neither are "out of the box" tools.

Excel, XML, and CSV

I never thought the day would come when I would say that I wished that I had a Windows version of MS Excel. I am simply aghast. But never-the-less I have been looking for an XML parsing solution for OS X and can not find one which is graphically oriented.

I want to move certain XML encoded content to my blog and the best way (that I can figure) to do this is to import CSV files (although there is a WordPress plugin for importing XML).

I want to be able to do this, but the Mac version of Excel does not do this:

I really want to drag and drop, but this tutorial makes it look easy-ish to do at the command line. http://blog.mclaughlinsoftware.com/oracle-sql-programming/how-to-convert-xml-to-csv-and-upload-into-oracle/


What am I Using this for? Well I would like to use it with itunes XML, Endnote XML, Bookpedia XML, BibTeXXML, SIL-OLAC data as XML, WorldCat Data as XML? Glotalogue data as XML.

Audio Dominant Texts and Text Dominant Audio

As linguistics and language documentation interface with digital humanities there has been a lot of effort to time-align texts and audio/video materials. At one level this is rather trivial to do and has the backing of comercial media processes like subtitles in movies. However, at another level this task is often done in XML for every project (digital corpus curation) slightly differently. At the macro-scale the argument is that if the annotation of the audio is in XML and someone wants to do something else with it, then they can just convert the XML to whatever schema they desire. This is true.

However, one antidotal point that I have not heard in discussion of time aligned texts is specifications for Audio Dominant Text vs. Text Dominant Audio. This may not initially seem very important, so let me explain what I mean.
Continue reading