found in a project gutenberg file: "Updateŕs note: This file has been recoded to UTF8." (yes, that is LATIN SMALL LETTER R WITH ACUTE)

I can't tell if this is supposed to be a joke or not

@darius wouldn't that imply that the text "Updater's note: This file has been recoded to UTF8" was in the *physical* copy?

@aparrish My thought was:

physical -> some other encoding via OCR -> UTF8, and the error happened in first conversion


@darius @aparrish my guess is that it’s visible proof of the Unicode being correct

Sign in to participate in the conversation

Server run by the main developers of the project 🐘 It is not focused on any particular niche interest - everyone is welcome as long as you follow our code of conduct!