8000 Confusable checking for CJK Ideographs · Issue #32 · unicode-rs/unicode-security · GitHub
[go: up one dir, main page]

Skip to content
Confusable checking for CJK Ideographs #32
Open
@crlf0710

Description

@crlf0710

Currently the confusable data doesn't fully include CJK ideographs related info.
Unihan database provides these:

  • kZVariant which describes logically unifiable ideographs that are separately encoded for other reasons
  • kSpoofingVariant which describes confusable ideograph relationships
  • EquivalentUnifiedIdeograph.txt which describes CJK Radical/Stroke and their corresponding Ideograph.

Procedurally i'd recommend Unicode security standards adopt these as confusables upstream, either by data or by algorithm, and then this crate can support them automatically.

cc @Manishearth

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions

      0