Open
Description
Currently the confusable data doesn't fully include CJK ideographs related info.
Unihan database provides these:
kZVariant
which describes logically unifiable ideographs that are separately encoded for other reasonskSpoofingVariant
which describes confusable ideograph relationshipsEquivalentUnifiedIdeograph.txt
which describes CJK Radical/Stroke and their corresponding Ideograph.
Procedurally i'd recommend Unicode security standards adopt these as confusables upstream, either by data or by algorithm, and then this crate can support them automatically.
cc @Manishearth
Metadata
Metadata
Assignees
Labels
No labels