I have been working on an addition to the IA Upload tool these last few days, and it’s ready for testing. Hopefully we’ll merge it tomorrow or the next day.
This is the first time I’ve done much work with the internal structure of DjVu files, and really it’s all been pretty straight-forward. A couple of odd bits about matching element and page names up between things, but once that was sorted it all seems to be working as it should.
It’s a shame that the Internet Archive has discontinued their production of DjVu files, but I guess they’ve got their reasons, and it’s not like anyone’s ever heard of DjVu anyway. I don’t suppose anyone other than Wikisource was using those files. Thankfully they’re still producing the DjVu XML that we need to make our own DjVus, and it sounds like they’re going to continue doing so (because they use the XML to produce the text versions of items).
I can’t believe I’m going to miss this by two days! I’m going to be in San Francisco for the first time since 1997 for the week before. What are the odds.
“For 20 years, the Internet Archive has been capturing the Web– that amazing universe of images, audio, text and software that forms our shared digital culture. Now it’s time to celebrate and we’re throwing a party! Please join us for our 20th Anniversary celebration on Wednesday, October 26th, 2016, from 5-9:30 pm.”
We offered unlimited storage, unlimited bandwidth, for ever, for free — to anybody who has something to share that belongs in a library.
—Brewster Kahle, Entertainment Gathering Conference 2007 (republished as a TED Talk). The above quote is at 14:19.
The crux of it is of course “something that belongs in a library”. If one has something that could conceivably be held in a library, then there should be a library in which it can be held; the Internet Archive is one possibility.