From time to time we might wonder where archival materials might be located. There might be a way to discover these using OLAC and FOAF.
Tag Archives: in_Obsidian
Copyrights and types
I am trying to establish the kinds of materials defined as permitted for copyright, and some associated protections. For example, copyright circular 1 lists some types on the first page. Among them: Literary works, and sound recordings, which are two "broad" types of works outlined in US copyright law (Title 17 reproduced in Circular 92). The class Literary works further is divided down into several sub-types, one of which is software. My understanding of "literary" in this context is that one reads the text. So, I have some questions about these designated resource types.
- Is there a basis for understanding digital sound or motion pictures as software bundles? Or for copyright purposes are software and motion pictures very distinct classes of "things"?
My understanding is that the rights afforded to copyright holders are limited by type. For example, copyright allows the holder of an audio copyrighted resource to regulate the transmission via a digital audio transmission. Circular 1 states it as: "Perform the work publicly by means of a digital audio transmission if the work is a sound recording".
- Do I rightly conclude then that this protection for audio is "type specific"? It cannot apply to, for example, sculptures, or motion pictures.
My understanding is that these types have slight variations to the protections afforded to them. For example, my understanding is that a motion picture is only copyright-able in a fixed format (congruent with other "types"). However, my understanding is that the copyright term only starts once it is published... published is a hard term to find a definition for. Circular 45 says the following:
"Publication of a motion picture takes place when one or more copies are distributed to the public by sale, rental, lease, or lending or when an offering is made to distribute copies to a group (wholesalers, retailers, broadcasters, motion picture distributors, and the like) for purposes of further distribution or public performance. Offering to distribute a copy of a motion picture for exhibition during a film festival may be considered publication of that work. For an offering to constitute publication, copies must be made and be ready for distribution. The performance itself of a motion picture (for example, showing it in a theater, on television, or in a school room) does not constitute publication."
- Do Creative Commons licenses apply to video material I host on my own website? In order for Creative Commons licenses to be enforceable or valid the copyright must be valid. If the copyright term only starts at publication, and publication requires sale, rental, lease, or lending, where in that process is free (libre) distribution?
This is of some interest to me as I am looking at comparing protections in different national copyright frameworks for ethnographic resources produced in scholarly contexts.
App Build and Deploy
https://fly.io/docs/languages-and-frameworks/python/
Great example of HTML-DC metadata
https://gist.github.com/HughP/6f059c39eb256047200329701c58eb4f
from:
https://repositorio.unal.edu.co/handle/unal/81557
This makes Zotero recognize it as a thesis
OLAC: Multi-Audience Service Architecture
Original hand drawn source image.
Metadata Interoperability at OLAC
This week we had a lecture on metadata interoperability. Interoperability is a major theme of Gary Simons work on OLAC. It was the keyword or concept that he used to push the social behavior requirements related to the activities around, in, and at language archives.
I think that across the history of OLAC there have been various understandings on the kinds of metadata needed to describe language resources. That is, discovery is the architectural goal of OLAC, but other requirements also exist. In the beginning of OLAC many of the participants were looking at OLAC for a complete solution to the kinds of metadata they should be collecting and using. The other requirements upon resource stewards have always meant additional fields in diverse institutional contexts. The freedom to explore these other requirements has not always been explored or embraced by stewards. Some have seen OLAC as an all or nothing involvement. Maybe the fear has been that there will be divergence from a communal norm.
However, my perspective is that it is quite normal for each institution to have its own metadata schema or application profile some portion of which gets shared with OLAC.
With this as background then, with the assumption that different management practices will produce different metadata schemes it seems reasonable that each institution should update their schema from time to time. This implies that metadata quality in terms of coverage or "encoding" is a moving target. Another implication then, is that even in fields which are shared with the OLAC aggregator and are defined in the OLAC metadata application profile, that those fields may have different internal syntax at different providers or at different time depths of the records creation.
The ISO639-3 field is one evidence of evolutionary change. This standard has fields which split and merge from time to time. Associating a records time of creation with a version of an institutions metadata schema is a useful dynamic when evaluating a record's quality.
The question is how should a record and the version of its applicable metadata profile be associated in the OLAC context? How should this information be communicated to record viewers?
The answer is rather straightforward, but requires two parts. The first part requires a modification to the archive profile to have two information bits:
- The name of the native application profile at the data provider
- A link to the native metadata application profile documentation
The documentation should be in a publicly accessible place so that the provided metadata makes sense. There are several ways this could be accomplished one way is to create a manifestation record for each iteration of the application profile. These could be related into a collection or they could have a single relation.
which in the listSet
The OLAC OAI record should have in its source in the first harvest the name and version of the native metadata schema used for the generation of the record. The link to the native version of the providers metadata schema's documentation should be provided in the archive section of the OAI describer.
Some utilities in OAI can modify data, some can be servers only, some havesters only, some harvesters and servers.
Some OAI providers are
Using record sets:
OLAC could allow end-users to dynamically create sets of records for export using the setSpec part of OAI. Playing with this and audience interest might create some social interest.
Dublin Core Acronyms
DC = Dublin Core: This may refer to simple Dublin Core which, depending on the time of writing may refer to the original 15 elements. See Phelps (2012)
DCMI = Dublin Core Metadata Initiative as used by Cole (2002), later changed to Dublin Core Metadata Innovation; but the term innovation does not appear on the current-(2022/2023) Dublin Core website, or it's parent organization ASIS&T.
QDC = Qualified Dublin Core as used by Cole (2002).
DCMES = Dublin Core Metadata Element Set: Generally this means the 18 elements 15 of which are in the DC 1.1 namespace and the other three in the DCTERMS namespace. In prefered parlance elements are known as properties, however due to the historical practice of using Dublin Core within an XML context and seeing these properties used XML elements, the term elements was applied. In my opinion, choosing a term like "properties" from the parlance of RDF is just as jaded. Used for example by Ward (2004), Saadat Alijani & Jowkar (2009), Phelps (2012), Jackson et al (2008), and Nevile & Lissonnet (2004).
DCMS = Dublin Core Metadata Standard. See Eckert et al (2009) and Quam (2001).
DCMES 1.1 = Dublin Core Metadata Element Set; Simple Dublin Core. See also this (DC Website) and this (OLAC).
DCTERMS = Dublin Core Terms or Qualified Dublin Core.
Cole, Timothy W. 2002. “Qualified Dublin Core Metadata for Online Journal Articles.” OCLC Systems & Services: International Digital Library Perspectives 18 (2). MCB UP Ltd: 79–87. doi:10.1108/10650750210430141.
Eckert, K., Pfeffer, M., & Stuckenschmidt, H. (2009). A Unified Approach for Representing Metametadata. International Conference on Dublin Core and Metadata Applications, pp. 21–29. Retrieved from https://dcpapers.dublincore.org/pubs/article/view/973
Jackson, Amy S., Myung-Ja Han, Kurt Groetsch, Megan Mustafoff, and Timothy W. Cole. 2008. “Dublin Core Metadata Harvested Through OAI-PMH.” Journal of Library Metadata 8 (1). Routledge: 5–21. doi:10.1300/J517v08n01_02.
Phelps, Tyler Elisabeth. 2012. “An Evaluation of Metadata and Dublin Core Use in Web-Based Resources.” Libri 62 (4). doi:10.1515/libri-2012-0025.
Nevile, L., & Lissonnet, S. (2004). The Case for a Person/Agent Dublin Core Metadata Element Set. International Conference on Dublin Core and Metadata Applications, . Retrieved from https://dcpapers.dublincore.org/pubs/article/view/780
Quam, Eileen. 2001. “Informing and Evaluating a Metadata Initiative: Usability and Metadata Studies in Minnesota’s Foundations Project.” Government Information Quarterly 18 (3): 181–94. doi:10.1016/S0740-624X(01)00075-2.
Saadat Alijani, Alireza, and Abdolrasool Jowkar. 2009. “Dublin Core Metadata Element Set Usage in National Libraries’ Web Sites.” The Electronic Library 27 (3). Emerald Group Publishing Limited: 441–47. doi:10.1108/02640470910966880.
Ward, Jewel. 2004. “Unqualified Dublin Core Usage in OAI‐PMH Data Providers.” OCLC Systems & Services: International Digital Library Perspectives 20 (1): 40–47. doi:10.1108/10650750410527322.
Spatial Coverage on the OLAC network
The issues is that OLAC and these other uses of Dublin Core don't agree in the semantics of spatial coverage.
https://archive-intranet.ardc.edu.au/display/DOC/Spatial+coverage#:~:text=Spatial%20coverage%20refers%20to%20a,the%20focus%20of%20an%20activity.
Critical question here, is one where we ask: "what do English think geography is for language?"
Thinking deeply about:
https://twitter.com/elararchive/status/1637559068398157824?s=46&t=Zdt2jeAjeFQx6k372aS64A
OLAC spelling mistakes
I wonder how many spelling mistakes we can find in various records in OALC... This is a great reason for OLAC to retain the kind of language the record is in.
https://opensource.com/article/18/2/aspell
https://github.com/uribench/spell-check/blob/master/docs/XML%20Spell%20Checking%20Workaround.md
Install with my blog workflow:
https://github.com/tbroadley/spellchecker-cli
other options: https://vi.stackexchange.com/questions/22220/how-to-make-spell-check-work-for-text-inside-a-xml-file
https://metacpan.org/dist/XML-Twig/view/tools/xml_spellcheck/xml_spellcheck
Interesting Public DC schema use
This record is interesting in that they use a dot notation for Dublin Core.
https://www.repository.cam.ac.uk/handle/1810/316390?show=full
They also have 4 total schemas used in their application profile, in contrast to extending dublin core with custom serializations.