    Michel> # -*- coding: cp1252 -*-
    Michel> aa = u"ä"
    Michel> aa.encode("ascii","ignore")

A somewhat less destructive solution might be to try my latscii codec:

    http://manatee.mojam.com/~skip/python/latscii.py

(assuming your input is encoded as latin-1).

Skip
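
To illustrate what "destructive" means here, a minimal sketch follows, written for Python 3, where encode() returns bytes. It does not use or reproduce the latscii codec. As a stand-in it relies on the standard unicodedata module, and the helper name strip_accents is hypothetical. With the "ignore" error handler the ä is dropped entirely, whereas decomposing the character first keeps the base letter.

```python
import unicodedata


def strip_accents(text):
    """Hypothetical helper (not part of latscii): replace accented
    letters with their unaccented base letters."""
    # NFKD splits "ä" into "a" followed by a combining diaeresis.
    decomposed = unicodedata.normalize("NFKD", text)
    # Keep only the characters that are not combining marks.
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


aa = u"\u00e4"  # the character "ä"

# The "ignore" handler silently discards anything ASCII cannot represent.
dropped = aa.encode("ascii", "ignore")

# Stripping the accent first leaves a plain "a" to encode.
kept = strip_accents(aa).encode("ascii", "ignore")

print(dropped)  # b''
print(kept)     # b'a'
```

Characters with no decomposition (for example "ß" or "ø") would still be dropped by this sketch, so a hand-written mapping table may be the better choice for latin-1 input.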