unicode direction control characters

Robin Becker robin at reportlab.com
Tue Jan 2 09:30:42 EST 2018


I'm seeing some strange characters in web responses eg

u'\u200e28\u200e/\u200e09\u200e/\u200e1962'

for a date of birth. The code \u200e is LEFT-TO-RIGHT MARK according to 
unicodedata.name.  I tried unicodedata.normalize, but it leaves those characters 
there. Is there any standard way to deal with these?

I assume that some browser+settings combination is putting these in eg perhaps 
the language is normally right to left but numbers are not.
-- 
Robin Becker




More information about the Python-list mailing list