Text Extraction
https://www.geeksforgeeks.org/extract-text-from-pdf-file-using-python/
https://towardsdatascience.com/how-to-extract-text-from-pdf-245482a96de7
https://betterprogramming.pub/how-to-convert-pdfs-into-searchable-key-words-with-python-85aab86c544f
What if my import pd data array was the OLAC metadata schema? https://www.semanticscholar.org/paper/A-Gentle-Introduction-to-Topic-Modeling-Using-Saxton/38742c56eadfdf11fb7218f7702c8fccfc78bd95 https://gist.github.com/umbertogriffo/5041b9e4ec6c3478cef99b8653530032 https://towardsdatascience.com/contextualized-topic-modeling-with-python-eacl2021-eacf6dfa576 https://www.analyticsvidhya.com/blog/2016/08/beginners-guide-to-topic-modeling-in-python/ https://www.holisticseo.digital/python-seo/topic-modeling/ https://melaniewalsh.github.io/Intro-Cultural-Analytics/05-Text-Analysis/08-Topic-Modeling-Text-Files.html https://asandeepc.bitbucket.io/courses/inls613_summer2019/lectures/08-lda_topic_modeling.pdf http://derekgreene.com/slides/topic-modelling-with-scikitlearn.pdf https://ourcodingclub.github.io/tutorials/topic-modelling-python/ https://stackabuse.com/python-for-nlp-topic-modeling/ https://www.toptal.com/python/topic-modeling-python https://towardsdatascience.com/end-to-end-topic-modeling-in-python-latent-dirichlet-allocation-lda-35ce4ed6b3e0