Wednesday, September 21st, 2005

Language support finished

LibraryThing now provides good support for the familiar—to many—non-English diacriticals such as é and ñ and gamely handles the exotic—to many—ones like č and å. The encoding is now “UTF-8″ so, in theory, is should be able to handle a much wider variety of scripts, but neither Amazon nor the Library of Congress transmit material in non-Latin scripts, so you’ll have to type it yourself for now.

LibraryThing is now ready to start adding foreign libraries, and that is high on my list of goals. I shall also see if I can get Amazon Japan working.

Everything should be good going forward. The trick was getting all the old material into shape. LibraryThing-ers had employed all manner of tricks to get diacriticals to work, and some of that editing was hard to detect and correct for. Again, French, Spanish and German books should be fine, but I’m not so sure about the Georgian books languagehat entered. (In this case, my Mac simply won’t show Georgian, so I couldn’t tell anyway.)

The other gap is material entered between 8pm and 1am. I shut down book editing and tried to process everything coming in, but at almost 1,000 books/hr, a delay of thirty seconds between processes may have left some books “half cooked.”

Labels: 1

0 Comments:

Leave a Reply