The following example points to the need for users to be able to sort collection by license, relationships, and extent.
I am looking for large spoken corpora of spontaneous speech in any
language (ideally > 100 hours) with a time-aligned transcription. I am
not committed to a specific genre as long as it is spontaneous speech.
It should be available as a download (for research, no commercial use),
ideally free but I may be able to pay for it as well.