[New-bugs-announce] [issue41622] Add support for emoji-data.txt and emoji-variation-sequences.txt to unicodedata

jack1142 report at bugs.python.org
Sun Aug 23 16:34:50 EDT 2020


New submission from jack1142 <kuba.kuczys at gmail.com>:

The `emoji-data.txt` and `emoji-variation-sequences.txt` files were formally pulled into the UCD as of Version 13.0 [1], so I think that unicodedata, as the module providing access to the UCD, could support them as well.
In particular:
- `emoji-data.txt` lists character properties for emoji characters [2] 
- `emoji-variation-sequences.txt` lists valid text and emoji presentation sequences [3]
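
To illustrate the kind of data `emoji-data.txt` carries, here is a minimal sketch of a parser for its `code point(s) ; property # comment` layout. The function name `parse_emoji_data` and the sample lines are hypothetical (hand-written in the file's layout, not a verbatim excerpt), and nothing here is an existing unicodedata API.

```python
from collections import defaultdict

# Hand-written lines in the emoji-data.txt layout (not a verbatim excerpt).
SAMPLE = """\
# code point(s) ; property # comment
0023          ; Emoji                # number sign
0030..0039    ; Emoji                # digits
1F600         ; Emoji_Presentation   # grinning face
"""


def parse_emoji_data(text):
    """Map each property name to the set of code points that carry it."""
    props = defaultdict(set)
    for line in text.splitlines():
        # Drop the trailing comment and skip blank/comment-only lines.
        line = line.split("#", 1)[0].strip()
        if not line:
            continue
        cp_field, prop = (field.strip() for field in line.split(";")[:2])
        # A field is either a single code point or a "first..last" range.
        first, _, last = cp_field.partition("..")
        start = int(first, 16)
        end = int(last, 16) if last else start
        props[prop].update(range(start, end + 1))
    return props


props = parse_emoji_data(SAMPLE)
print(ord("5") in props["Emoji"])
print(0x1F600 in props["Emoji_Presentation"])
```

A lookup such as `ord(ch) in props["Emoji"]` is roughly what a property query in unicodedata could expose, although the actual interface would be up to the maintainers.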

Data from `emoji-variation-sequences.txt` can be used to ensure consistent rendering of emoji characters across devices [4] (`StandardizedVariants.txt` has a similar purpose for non-emoji characters).
I'm not entirely sure of the use cases for `emoji-data.txt`, but because it's also newly added in UCD 13.0.0, I figured I at least shouldn't omit it when making this issue.
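
As a rough sketch of the rendering use case, the snippet below parses lines in the `emoji-variation-sequences.txt` layout and uses them to request a text or emoji presentation for a character. The names `parse_variation_sequences` and `with_presentation` are hypothetical helpers, not an existing or proposed unicodedata API, and the sample only covers two characters.

```python
# A few lines in the emoji-variation-sequences.txt layout.
SAMPLE = """\
0023 FE0E  ; text style;  # (1.1) NUMBER SIGN
0023 FE0F  ; emoji style; # (1.1) NUMBER SIGN
2764 FE0E  ; text style;  # (1.1) HEAVY BLACK HEART
2764 FE0F  ; emoji style; # (1.1) HEAVY BLACK HEART
"""


def parse_variation_sequences(text):
    """Map (base character, "text" | "emoji") to the full variation sequence."""
    sequences = {}
    for line in text.splitlines():
        # Drop the trailing comment and skip blank/comment-only lines.
        line = line.split("#", 1)[0].strip()
        if not line:
            continue
        fields = [field.strip() for field in line.split(";")]
        # First field: base code point followed by the variation selector.
        seq = "".join(chr(int(cp, 16)) for cp in fields[0].split())
        # Second field: "text style" or "emoji style"; keep the first word.
        style = fields[1].split()[0]
        sequences[(seq[0], style)] = seq
    return sequences


def with_presentation(char, style, sequences):
    """Return the listed sequence for char, or char unchanged if none exists."""
    return sequences.get((char, style), char)


sequences = parse_variation_sequences(SAMPLE)
heart = with_presentation("\u2764", "emoji", sequences)
print([hex(ord(c)) for c in heart])
```

Characters without an entry are returned unchanged, which avoids appending a variation selector to a base character for which no sequence is listed.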


[1] https://www.unicode.org/reports/tr44/#Change_History - Changes in Unicode 13.0.0, "Emoji Data" section
[2] https://www.unicode.org/reports/tr51/#Emoji_Properties_and_Data_Files
[3] https://www.unicode.org/reports/tr51/#Emoji_Variation_Sequences
[4] https://unicode.org/faq/vs.html#1

----------
components: Unicode
messages: 375826
nosy: ezio.melotti, jack1142, vstinner
priority: normal
severity: normal
status: open
title: Add support for emoji-data.txt and emoji-variation-sequences.txt to unicodedata
type: enhancement
versions: Python 3.10, Python 3.9

_______________________________________
Python tracker <report at bugs.python.org>
<https://bugs.python.org/issue41622>
_______________________________________

