Python 3.x stuffing utf-8 into SQLite db

Skip Montanaro skip.montanaro at gmail.com
Mon Feb 9 12:30:41 EST 2015


On Sun, Feb 8, 2015 at 9:58 PM, Chris Angelico <rosuav at gmail.com> wrote:
> Those three characters are the CP-1252 decode of the bytes for U+2019
> in UTF-8 (E2 80 99). Not sure if that helps any, but given that it was
> an XLSX file, Windows codepages are reasonably likely to show up.
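
The quoted diagnosis can be checked in a few lines of Python 3. This is only a sketch of the round trip Chris describes, not the code from the original script; the variable names are illustrative. It encodes U+2019 as UTF-8, decodes those bytes as CP-1252 to reproduce the three stray characters, then reverses the mistake.

```python
# U+2019 (RIGHT SINGLE QUOTATION MARK) encoded as UTF-8 is E2 80 99.
raw = "\u2019".encode("utf-8")

# Decoding those bytes with the wrong codec (CP-1252) yields three characters:
# 0xE2 -> U+00E2, 0x80 -> U+20AC, 0x99 -> U+2122.
garbled = raw.decode("cp1252")

# If text has been garbled this way, re-encoding as CP-1252 and decoding
# as UTF-8 recovers the original character.
repaired = garbled.encode("cp1252").decode("utf-8")

print(raw.hex(), [hex(ord(c)) for c in garbled], hex(ord(repaired)))
```

If the bytes in the CSV file are E2 80 99, the file itself is valid UTF-8 and the garbling happened at a later decode step; if the file already contains the longer byte sequence for the three garbled characters, the damage was done before or during the export.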

Thanks, Chris. Are you telling me I should have defined the input file
encoding for my CSV file as CP-1252, or that something got hosed on
the export from XLSX to CSV? Or something else?

Skip


