uchardet uchardet C library and command-line tool that detects character encoding in text files and streams. library text-processor encoding-detection