One plan for pushing language resources to the web en-mass before a full django application is available is to use Hugo and its XML input process to parse OALC/OAI-PMH files (generated with an OAI-PMH bash script) and then convert bibtex entries to xml (using Jabref or commandline) or JSON. The content issue to overcome is that pages in Hugo have to have front matter. The XML and JSON are both content and front matter (or visible front matter). I think I need to generate an .md file for each entry meaning XML data is not really that useful. But there may be hope
Tag Archives: Django
Venn diagrams
I found the following resources really helpful with boolean operators and Venn diagrams.
Snarky Math (Director). (2021, October 21). Can you draw a Venn diagram for 4 sets? | Why Venn diagrams are not easy [Streamed]. Snarky Math. https://youtu.be/IekSOZIF5uI
Student Contributors The University of Edinburgh School of Informatics. (n.d.). Better Informatics. Betterinformatics.com. Retrieved November 10, 2024, from https://betterinformatics.com/resources/inf1-cl/venn/
This one allows prime sets: https://statpowers.com/venn.htmlThe University of Edinburgh School of Informatics. (n.d.). Venn Diagrams. The University of Edinburgh School of Informatics Teaching Aids. Retrieved November 10, 2024, from https://www.inf.ed.ac.uk/teaching/courses/inf1/cl/tools/venn/
I haddn't really thought about what they represent or the appropriateness of their use. https://www.sciencedirect.com/science/article/abs/pii/B9780444529374500113 https://blog.jooq.org/say-no-to-venn-diagrams-when-explaining-joins/ https://github.com/tctianchi/pyvenn This python lib is interesting for generating visualizations if they are accurate. I used an inaccurate visualization in my presentation on OLAC roles. Maybe this could be added to django to update automatically. https://www.sciencedirect.com/topics/mathematics/venn-diagram https://www.dubberly.com/concept-maps/visualizing-venn-diagrams.htmlDjango Database options
I have been looking at Django for several projects. I've been looking at implementation details including hosting and database.
database options for Django and have settled on PostGIS as in PostgresSQL with GIS support.
https://stackoverflow.com/questions/3743632/gis-postgis-postgresql-vs-mysql-vs-sql-server
In general I have found reading through the following helpful
https://www.digitalocean.com/community/tutorials/how-to-create-a-django-app-and-connect-it-to-a-database
https://djangobuilder.io/#/
http://darrenoneill.co.uk/post/using-postgis-and-geodjango-find-nearest-neighbour/
https://medium.com/@amirm.lavasani/classic-machine-learning-in-python-k-nearest-neighbors-knn-a06fbfaaf80a
https://www.geeksforgeeks.org/find-the-nearest-node-to-a-point-using-osmnx-distance-module/
https://github.com/fabiocaccamo/django-treenode
https://github.com/peopledoc/django-ltree-demo
https://stackoverflow.com/questions/38921444/modelling-a-polymorphic-tree-with-django-treebeard
https://dev.to/pawnsapprentice/postgresql-in-geospatial-applications-unleashing-the-power-of-location-data-4jan
https://github.com/gboeing/osmnx
https://github.com/feincms/django-tree-queries
https://django-treebeard.readthedocs.io/en/stable/
Find adjacent language equals
Weight algorithm: Node edge in lang-tree = .5, overlapping GIS = .20, adjacent GIS edge = .25
Django Zotero options
Nice formatted example: https://dalme.org/project/bibliography/
See code: view-source:https://dalme.org/project/bibliography/
Python API wrapper: https://pypi.org/project/django-zotero/
A library-ish app that imports zotero things: https://gitlab.com/LARAsuite/lara-django-library
Look more at this: https://django-kdl-wagtail.readthedocs.io/en/latest/installation.html
and much more at this: https://gitlab.unimelb.edu.au/rturnbull/django-publications <-- this one is needs forked.
https://princeton-cdh.github.io/derrida-django/deploynotes.html
https://pypi.org/project/acdh-django-zotero/
Django and payment options
https://www.orientsoftware.com/blog/how-to-build-a-payment-gateway/
https://medium.com/geekculture/django-payment-processing-example-5d9f4f3678a5
https://django-payments.readthedocs.io/en/latest/
https://reintech.io/blog/integrating-payment-system-django
Some python tools for NER extraction tool chains
https://github.com/johnfraney/django-ner-trainer
https://johnfraney.github.io/django-ner-trainer/
https://github.com/doccano/doccano
https://prodi.gy/buy
https://github.com/GitTeaching/Django_NER_Crisis?tab=readme-ov-file
Django modules and links
Django application to collect submitted DOIs, acquire their API provided metadata (Bibliographic metadata and citation graph metadata), allow limited (specified) annotation, and then make those records harvestable via OAI-PMH. Language Resource tagger—Adding a layer of language related metadata to published resources.
Some Django modules for OAI-PMH
https://github.com/saw-leipzig/foaipmh
https://github.com/jnphilipp/django_oai_pmh
https://pypi.org/user/jnphilipp/ his topic extraction module looks interesting.
Also look at the xsd schema here https://github.com/saw-leipzig/foaipmh/blob/5b15d5cc4700a3cccf497c47218c2fba6b3421d5/entrypoint.prod.sh#L5
Metadata utility for OAI-PMH
https://combine.readthedocs.io/en/master/configuration.html
User Authentication
https://github.com/ubffm/django-orcid
https://django.fun/en/docs/social-docs/0.1/backends/orcid/
Crossref
https://github.com/fabiobatalha/crossrefapi
Database Versioning
This depends on how the DB is set up. If we only have one record per item or one record per state... This needs more definition.
https://djangopackages.org/grids/g/versioning/
https://www.wpbeginner.com/beginners-guide/complete-guide-to-wordpress-post-revisions/
Form Builders
https://djangopackages.org/grids/g/form-builder/
Some Javascript tools for creating the specific forms needed:
https://github.com/HughP/dublin-core-generator
https://nsteffel.github.io/dublin_core_generator/generator.html
Markdown for documentation
https://neutronx.github.io/django-markdownx/
Bibtex
https://bibtexparser.readthedocs.io/en/master/
https://github.com/sciunto-org/python-bibtexparser
https://github.com/jnphilipp/bibliothek
https://github.com/lucastheis/django-publications <-- also check the network as "improvements" are all over the place.
Other names include:
* Babybib
* Pybtex
* Pybibliographer
APIs
ORCID
https://github.com/ORCID/python-orcid
Crossref API doc
https://github.com/CrossRef/rest-api-doc/blob/master/demos/crossref-api-demo.ipynb
Crossref types: https://www.crossref.org/documentation/register-maintain-records/
https://api.crossref.org/swagger-ui/index.html#/Types/get_types__id__works
Others — Mostly citation and references
http://www.scholix.org/
https://scholexplorer.openaire.eu/#/query/page=5/q=language
https://crossref.gitlab.io/knowledge_base/products/event-data/
FatCat https://fatcat.wiki/
InternetArchive Scholar https://scholar.archive.org/
Thor project https://project-thor.readme.io/docs/introduction-for-integrators
Corsscite.org
Semantic Scholar API https://api.semanticscholar.org/api-docs/graph
https://core.ac.uk/
https://opencitations.net/
https://unpaywall.org/ --> see: http://musingsaboutlibrarianship.blogspot.com/2017/11/using-oadoi-crossref-event-data-api-to.html
https://openalex.org/
https://arxiv.org/help/api/index
https://www.aminer.org/citation
https://www.aminer.org/download
https://open.aminer.cn/
https://analytics.hathitrust.org/datasets#top
https://pro.dp.la/developers/api-codex
https://pro.europeana.eu/page/apis
LCSH
https://github.com/edsu/id
MARC
For generating an ingesting MARC records
https://pymarc.readthedocs.io/en/latest/
Zotero
https://github.com/urschrei/pyzotero
Overview see: https://researchguides.smu.edu.sg/api-list/scholarly-metadata-api
ISSNs
ISSN.org is supposed to have an API.. but not sure if they do.
https://portal.issn.org/resource/ISSN/1904-0008
Any request to the portal may be automated thanks to the use of REST protocol. The download of results is also automated. This service is restricted to subscribing users. Please contact sales [at] issn.org for more information.
https://portal.issn.org/node/170
https://portal.issn.org/resource/ISSN/2549-5089#
https://portal.issn.org/resource/ISSN/2549-5089?format=json
We could also slurp the HTML for the sameAs links to other DBs if needed.
JATS
https://pypi.org/project/jatsgenerator/
https://stackoverflow.com/questions/42084165/extracting-text-from-jats-xml-file-using-python
https://github.com/sibils/jats-parser
Pandas
https://pypi.org/project/django-pandas/
Beautiful Soup
There is the issue of how do we add to a Dublin Core OAI record how it was changed over time.... I need to architect this out.
Record Provenance:
[]Explore
https://www.w3.org/TR/prov-dc/
https://www.w3.org/2011/prov/track/issues/607?changelog
http://www.ukoln.ac.uk/metadata/dcmi/collection-provenance/
https://edoc.hu-berlin.de/bitstream/handle/18452/2727/332.pdf?sequence=1&isAllowed=y
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4177195/
https://www.loc.gov/standards/mods/userguide/recordinfo.html
https://tsl.access.preservica.com/tslac-digital-preservation-framework/qualified-dublin-core-schema/
https://dl.acm.org/doi/10.5555/2770897.2770924
https://blog.datacite.org/exposing-doi-metadata-provenance/
https://dgarijo.com/papers/dc2011.pdf
https://ceur-ws.org/Vol-670/paper_3.pdf
https://ecommons.cornell.edu/bitstream/handle/1813/55327/Encoding%20Provenance%20for%20Social%20Science%20Data-final.pdf?sequence=3&isAllowed=y
Views:
1. login with ORCID
2. query APIs (DOIs, ISBNs, ISSNs, ORCID, WikiData, etc.)
3. results display and annotation
4. submission
5. List of past submissions
6. update past submission screen (same as #3?)
If we ran a module like this:
https://pybliometrics.readthedocs.io/en/latest/classes/SerialTitle.html
Then we could take a reading on where the least spoken languages appear in the most highly ranked journals and determine if there was a bias or a loss to science.
Data Examples:
Have been moved to:
https://github.com/HughP/CrossRef-to-OLAC-data-examples
PDF Extraction:
https://levelup.gitconnected.com/scrap-data-from-website-and-pdf-document-for-django-app-fa8f37010085
https://towardsdatascience.com/how-to-extract-pdf-data-in-python-876e3d0c288
https://stackoverflow.com/questions/71850349/download-a-pdf-from-url-edit-it-an-render-it-in-django
https://stackoverflow.com/questions/48882768/django-reading-pdf-files-content
https://www.geeksforgeeks.org/working-with-pdf-files-in-python/
PDF Creation:
https://docs.djangoproject.com/en/4.1/howto/outputting-pdf/
https://jeltef.github.io/PyLaTeX/current/examples/header.html
NER:
https://johnfraney.github.io/django-ner-trainer/settings/
Other:
https://prodi.gy/
https://realpython.com/testing-in-django-part-1-best-practices-and-examples/
here is a django app for controlling URIs for linked data vocabularies.
https://github.com/unt-libraries/django-controlled-vocabularies
as seen here https://digital2.library.unt.edu/vocabularies/agent-qualifiers/
And here is a one for source authority records.
https://github.com/unt-libraries/django-name
as seen here: https://digital2.library.unt.edu/name/nm0000001/
Link Checker
https://github.com/Kaltsoon/dead-link-checker
https://pypi.org/project/django-linkcheck/
https://github.com/bartdag/pylinkvalidator
https://stackoverflow.com/questions/43264291/in-django-how-can-i-unit-test-all-links-recursively-every-view-check-for-200-o