Unicode
One-To-Many Case Mappings
11_ENTRIESMost of the below characters express their one-to-many case mappings when uppercased- while others should be lowercased. This list should be split up
- PhantomScript
Invisible JavaScript code execution & social engineering
- ESReverser
A Unicode-aware string reverser written in JavaScript.
- mimic
[ab]using Unicode to create tragedy
- python-ftfy
Given Unicode text, make its representation consistent and possibly less broken.
- vim-troll-stopper
Stop Unicode trolls from messing with your code.
- Emojipedia
Information about specific emoji, news blog.
- emojitracker
Realtime emoji use on Twitter.
- World Translation Foundation
A way to promote, explore, and translate the written word into the pictorial alphabet of Emoji.
- Can I Emoji?
Displays the current status of native Emoji support across iOS, Android and Windows.
Recursive HTML Tag Renaming Script
17_ENTRIESIf you want to rename all your HTML tags to what appears as nothing, the following script is just what your looking for.
Do note however that HTML does not support all unicode characters.
Here is what it does support:
And heres some basic results
A single TrueType / OpenType font format cannot cover all UTF-8 characters as there is a hard limit of 65535 glyphs in a font. Since there are over 1.1 million UTF-8 glphys, you will need to use a font-family to cover them all.
- Space Yourself
Smashing Magazine's Spacing Guide
- Shapecatcher
Draw the character you're looking for.
Unicode Blocks
11_ENTRIESThe Unicode standard arranges groups of characters together in blocks. This is the complete list of blocks across all 17 planes.
The Unicode Standard set forth the following fundamental principles:
-
Universal repertoire - Every writing system ever used shall be respected and represented in the standard
-
Logical order - In bidirectional text are the characters stored in logical order, not in a way that the representaion
-
Efficiency - The documentation must be efficient and complete.
-
Unification - Where different cultures or languages use the same character, it shall be only included once. This point is
-
Characters, not glyphs - Only characters, not glyphs shall be encoded. In a nutshell, glyphs are the actual graphical
-
Dynamic composition - New characters can be composed of other, already standardized characters. For example, the character βΓβ can be composed of an βAβ and a dieresis sign (β Β¨ β).
-
Semantics - Included charβ¦
- Version 9.0.0
(Latest Version, August 2016 - adds exactly 7,500 characters)
- Version 4.0.0
See the Awesome Unicode contribution guide for details on how to contribute.