I was looking at the wikipedia article for Language Documentation. The only reference cited was a thesis by Debbie Chang. I happen to know Debbie. So I thought I would take a look at her thesis and see what she said. So I clicked the link and was delivered to a 404 error page on GIAL’s website. GIAL had recently renovated their website. I was able to locate thesis and fix the URL on wikipedia by digging through the GIAL website. The new URL is: http://www.gial.edu/images/theses/Chang_Debbie-thesis.pdf
But then I looked at the URL and asked: Why are PDFS in the images folder? What is the long term infrastructure for this school? It seems that when PDFs (thesis) are put into the images folder rather than into a digital repository that something is not quite right with the longterm planning for the school. Ironically, this is not too far from the main thrust of Debbie’s thesis.
It would seem that the long term solution for this kind of problem would be for a small school like GAIL to A. have its library develop an infrastructure for permanently housing these kinds of materials. Or B. contract with another organization or archive which could take care of these sorts of issues for them, provide handles or stable URLs, and then for GIAL to link to the permanent location of these items from GIAL’s website. It is interesting to note that on the same campus as GIAL is SIL International’s Language and Culture Archive, yet GIAL has not taken advantage of this opportunity.
For one of the web projects I am working in we have been throwing around the idea of having a world map as a navigation element. Each country would then be clickable. This kind of navigation has been done with hyperlinked bitmaps like the LL-Map project.
I have not seen any implementations in HTML5 canvas or in SVG. It occurs to me that these technologies could be used. I am not deeply familiar with either technology. So I did some googling.
I found some interesting articles on the matter.
Performance of SVG vs. Canvas[1] Boris Smus. 19 January 2009. Performance of Canvas versus SVG. http://smus.com/canvas-vs-svg-performance [Link] [Accessed: 4 March 2012]
How to Choose Between Canvas and SVG[2] Patrick Dengler. 28 September 2011. How to Choose Between Canvas and SVG. http://www.sitepoint.com/how-to-choose-between-canvas-and-svg/#fbid=6CJz-eeIXxl [Link] [Accessed: 4 March 2012]
SVG or Canvas? Сhoosing between the two[3] Mihai Sucan. 4 February 2010. SVG or Canvas? Сhoosing between the two. http://dev.opera.com/articles/view/svg-or-canvas-choosing-between-the-two/. [Link] [Accessed: 4 March 2012]
CanVG: Using Canvas to render SVG files[4] 29 March 2010. CanVG: Using Canvas to render SVG files. http://ajaxian.com/archives/canvg-using-canvas-to-render-svg-files [Link] [Accessed: 4 March 2012]
I am not sure that I have any answers but this is my thought towards the problem space.
There is one map of languages I have found which deserves to be mentioned. I am not sure of the technology used but it seems it would be either of these methods. It is the map of the Languages of California hosted at Berkeley.
Patrick Dengler. 28 September 2011. How to Choose Between Canvas and SVG. http://www.sitepoint.com/how-to-choose-between-canvas-and-svg/#fbid=6CJz-eeIXxl [Link] [Accessed: 4 March 2012]
Mihai Sucan. 4 February 2010. SVG or Canvas? Сhoosing between the two. http://dev.opera.com/articles/view/svg-or-canvas-choosing-between-the-two/. [Link] [Accessed: 4 March 2012]
29 March 2010. CanVG: Using Canvas to render SVG files. http://ajaxian.com/archives/canvg-using-canvas-to-render-svg-files [Link] [Accessed: 4 March 2012]
Umm frankly, I am not sure anything out there right now is going to work to bring OAI-PMH services to WordPressConsider these three resources for more info on OAI:
. If it does then is it going to be able to use WordPress to advertise things or is it going to use WordPress to aggregate things? if the former then nothing out there ever let the admin user choose which fields were matched to which attributes, dynamically. But if it is also the former then why would anyone actually want this functionality? What is the Use Case? If one is using WordPress as a bibliography reference system like some libraries do, then this makes a lot of sense. However, there is another use case I would like to present. That is, the website which is about several or a single language. There are potentially two ways to conceptualize this:
unAPI Server for WordPress. [2] Mike Giarlo. 19 May 2006. unAPI Server for WordPress. Technosophia. http://lackoftalent.org/michael/blog/unapi-wordpress-plug-in/ [Accessed: 5 March 2012]
WordPress, now with added unAPI![3] Peter Binkley. 18 February 2006. WordPress, now with added unAPI!. http://www.wallandbinkley.com/quaedam/2006/02_18_wordpress-now-with-added-unapi.html [Accessed: 5 March 2012]
I think there is a second question here too: why does one need OAI-PMH for wordpress… is it as a provider or as a consumer? If one needs a PHP app for OAI-PMH maybe they can use: https://github.com/caseyamcl/phpoaipmh
Peter Binkley. 9 December 2005. COinS-PMH (unAPI) WordPress Plugin. http://www.wallandbinkley.com/quaedam/2005/12_09_coins-pmh-unapi-wordpress-plugin.html [Accessed: 5 March 2012]
Mike Giarlo. 19 May 2006. unAPI Server for WordPress. Technosophia. http://lackoftalent.org/michael/blog/unapi-wordpress-plug-in/ [Accessed: 5 March 2012]
Peter Binkley. 18 February 2006. WordPress, now with added unAPI!. http://www.wallandbinkley.com/quaedam/2006/02_18_wordpress-now-with-added-unapi.html [Accessed: 5 March 2012]
I have been Looking at different ways to make SIL’s digital research content more interactive, findable, and usable. Today I found http://research.microsoft.com/en-us/. It is interesting how they approach the facets of Location, Projects, Publications, and People up in the right hand corner. I think they did a good job. The site feels like it is balanced.
This paper is motivated by an experience in collecting, analyzing, and then redeploying (sharing while making relevant to other corporate SIL functions) corporate intellectual assets. These assets are relevant to both products SIL products and services and corporate processes. This paper attempts to document some of the current challenges presented to the SIL staff person as well as present some items for consideration in overcoming these challenges. Continue reading →
The Ethnologue [1] M. Paul Lewis. (ed.), 2009. Ethnologue: Languages of the World, 16th Edn. Dallas, Tex.: SIL International. as an academic book, is somewhat of a straw man in linguistics. Many people who write grants for language documentation projects (generally on under described or endangered languages) will cite the Ethnologue and some other resources or lack of resources [2] Steven A. Marlett. 2011. Documenting the Me’phaa genus. DEH-NEH fellowship proposal. http://www.neh.gov/grants/guidelines/pdf/DEL_NEH_Marlett.pdf. [PDF] [DEL Awards] [Accessed: 15 February 2011] [3] Sadaf Munshi. 2011. Archive of Annotated Burushaski Texts. NSF grant proposal. http://www.neh.gov/grants/guidelines/pdf/DEL_NSF_Munshi.pdf. [PDF] [DEL Awards] [Accessed: 15 February 2011] [4]Monica A. Macaulay. 2011. Potawatomi Documentation, Lexical Database, and Dictionary. NEH grant proposal. http://www.neh.gov/grants/guidelines/pdf/DEL_NEH_Macaulay.pdf. [PDF] [DEL Awards] [Accessed: … Continue reading . These efforts seeking funding are usually an effort to get more language data. The rationale for this is two fold:
Because so little is known that we do not know if the Ethnologue is correct.
Because there is a conflict between other published sources and the Ethnologue [5]Roger Blench. n.d. Introduction to the Temein languages http://www.rogerblench.info/Language/Nilo-Saharan/Eastern%20Sudanic/Temein%20cluster/Blench%20Temein%20language%20NM%20proceedings.pdf [PDF] … Continue reading .
Steven A. Marlett. 2011. Documenting the Me’phaa genus. DEH-NEH fellowship proposal. http://www.neh.gov/grants/guidelines/pdf/DEL_NEH_Marlett.pdf. [PDF] [DEL Awards] [Accessed: 15 February 2011]
Roger Blench. n.d. Introduction to the Temein languages http://www.rogerblench.info/Language/Nilo-Saharan/Eastern%20Sudanic/Temein%20cluster/Blench%20Temein%20language%20NM%20proceedings.pdf [PDF] [Accessed: 15 February 2011]
Is what you say what you want really what you want?
I am involved in an operation which is tasked with digitizing content created by SIL staff in the Americas. All 80 or so years of history. The end goal is to make the items accessible and usable as widely as possible (there are a lot of factors which dictate how wide, wide truly is). Today I came across an item which was created at the end of 2008. It was "born digital" that is, it was created on a computer. As such it should not need to be scanned if the digital production file can be located. Unfortunately, this is not the only item in its class. There are quite a few items in the line up to be scanned which have been born digital in the last few years. It would help us to understand a little bit about the item in question to fully realize this scenario.
Here was the process for creating the item in Dec. 2008:
Item was created in a .txt / xml environment.
The text was flowed through a page layout process and put into a PDF.
The PDF was taken to a printer and printed.
A copy of the printout was presented to the Language and Culture Archive
So there should be a .txt/xml type file (valid archival format) for this item, and there should be a PDF for this item (also possibly an archival format). Neither of these files has been submitted to the archive at SIL International nor does the SIL Area archiving staff have a definitive recourse to acquire the file.
To understand some of the impact of this statement it is important to understand some of the corporate history and the corporate structure (with a hint of corporate culture).
SIL's history is as one organization, which started in Mexico. Through time the founders also started what might be best classified as sister organization with the same name in various countries. Again with the passage of time an organization was conceived which needed to support and in some ways "unify" the various sister organizations. This cover organization is known as SIL International. These management structures, or their vestiges still exist today. Though in recent times expatriate staff have been returning from working within host countries and overall staff counts have been in decline (particularly in the Americas). So as branches (these former sister companies) have folded, they have folded into a larger management structure called an Area. These branches retain a rather autonomous position (in management practice and in goal setting and policy), while being connected and dedicated at some level to the larger overarching stated goals of SIL International. Yet an individual might be underThis is not a universally understood concept. That is, the alternative perspectives Is an SIL staff person there for the needs of the company or is the company there to serve the individual? are still a disputed issue in the minds of many people serving with SIL. the administration of any of these administrative structures.
Administrative Structures of SIL
The levels of autonomy in the above diagram are illustrated by the solid line and the dotted line with more autonomous units further up the chart and separated by more solid lines. Aside from these basic structures there is the autonomy factor for the Areas. These areas operate on a semi-autonomous basis, from each other and from the organization known as SIL International.
This history has left the archiving practice in an interesting managerial arrangement. Former branches which have folded into the area are often called regions and are administered by a regional director. This might be illustrated by the following diagram.
Archive organization in Americas Area as of 2011-2012
An alternative organization method would be to organize around the content of the task. That is illustrated in the lower right of the above diagram by grouping all of the archivist together administratively and marketing their operations as a service. However, discussion of that sort of organizational change is beyond the scope of this post.
Current dilemma
As things stand currently though, the operational goal of this project is to make content accessible and usable to end users. More use cases are able to be solved if archivable formats are used and the objects collected are actually those same digitally created objects. However, managerial success on the project is measured by how many scans are made of products in the Americas Area's reach, rather than the quality of the items that the archive is able to put into the hands of end users. So for these items which were born digital, because we do not have a recourse to pursue the file we will scan the item. We will also then "clean up" the item and make it into .tiff files and a PDF (a sum of about 5 hours of work for every 100 pages). Now is the original digital item out of reach of our pursuit? Well, there is one more structure which is needed to be understood so that this can be fully realized.
Organizational structure of Manpower in SIL Americas Area
In this diagram the area director has the mandate to secure all property belonging to the SIL organizational/business unit including intellectual property. This part of his responsibility has been delegated to one of his subordinates, the Support Services Director. The Support Services Directer manages the staff providing services to the Language Program People. But in the Americas Area, Language Program personnel are trained not to respond to persons who are not in their direct chain of supervisors. This means that the area Archive Coordinator has to coordinate with the Language Programs Director to get a request to the appropriate field person. It also means that the person working in the field is not responsible to archive their work (because this part of the mandate is viewed to be fulfilled by the archive coordinator).This leads to some interesting problems in terms of managing intellectual property. Intellectual property accountability and human resource accountability are not as highly ranked as financial accountability. These can be inherently difficult aspects of any business to manage, let alone a Not-for-Profit organization. It would be interesting if IP and HR resources could be evaluated like finances are by the ECFA. It would seem that in the SIL family of organizations that there is a corporate value/culture to not value intellectual property. In terms of market economy, intellectual property is generally not viewed as being monetizable. Therefore, the products containing the IP are also not worth more than the moment's task. This is possibly in part because the organization is a relationally motivated organization and not a data driven organization. There are several ways that this disjunct can be viewed. One of them is that there should be a data planThis data plan would include archiving, backup, and distribution. as part of the project plan before funding for the plan is provided. Additionally, a separate but related plan should be implemented to cover IP issues, copyright issues, and the licensing and use of data, and products. By pushing this to the project planing level it puts the burden on the project doers to meet the requirements for funding. This model is often used in European Union financed research projects. In 2011 the National Science Foundation in the U.S. also required a data management plan to be submitted with grants being applied for. It is interesting that SIL International's funders do not require this to be part of the project planning.
However, having a data management plan does not cover the above use case completely. The project did submit a physical object to the archive at one point. The problem here is the continued access to an ongoing project by services being performed in one part of the company to individuals in another part of the company. This is a management and service integration issue. Because there is a perception that management is too busy or that this is not a high enough priority for them to act on in a timely manner, then it costs the archiving department 5-6 man hours when all that might be needed is 10-20 minutes of email time. But being efficient, or providing a higher quality product which is more usable and has a smaller digital foot print does not come in the the matrix for evaluating results. Seems to me to be a process design FAIL.
Over the last several months I have been looking for and comparing digitization services for audio, film, and for images (slides and more). I have been doing this as part of the ongoing work at the Language and Culture Archive to preserve the linguistic and cultural heritage of the people groups SIL International has encountered and served. I have not come to any hard and fast conclusions on “what is the best service provider”. This is partially because we are still looking at various out sourcing options and looking at multiple mediums is time consuming. Then there is also the issue of looking for archival standards and the creation of corporate policy for the digitization of these materials. I am presenting several names here as the results of several searches for digitization services providers.
Another option the Archive has been looking at is to determine if the the quantity of the work is cost prohibitive to have professional done. Meaning that, we would be better served by buying the equipment and doing the work in house. So in the process I have also been looking at people’s experience with various kinds of equipment and technology used in scanning.
I have been reading a lot of user stories like Dave Dyer’s reflection on Slide Transfer and MacIntouch Reader Reports from 26 April 2006 on Slide Digitization.
The last couple of weeks I have been working on applying the Zachman's framework for enterprise architecture to two projects. I have been struggling through the first row and then skipped around a bit. I think I have found the part of the project (any project) I am most passionate about.... Working with Human Interface Architecture and explaining it as a designer to the builder of the Presentation Architecture. In my mind this level needs to be closely related to the Business Process Model and to the List of Business Goals/Strategies. [1] John Zachman. 2008. Diagram of: A framework for enterprise architecture. http://zachmaninternational.com/2/Zachman_Framework.asp. [Accessed: 2 December 2011] [PDF] [Link]
Where do I see myself most helpful in the large project...
John Zachman. 2008. Diagram of: A framework for enterprise architecture. http://zachmaninternational.com/2/Zachman_Framework.asp. [Accessed: 2 December 2011] [PDF] [Link]