by Jakub Marian

Vietnamese is an interesting language in that it uses more diacritical marks than any other language written in the Latin alphabet (it has 69 additional letters with diacritical marks). I’ve noticed that it uses them to such a degree that almost any text written with that many diacritical marks starts to look Vietnamese at first sight.

Just for fun, I decided to write a script that randomly adds Vietnamese diacritical marks to a text. To achieve greater authenticity, it adds them with the same probability distribution with which they occur in real written Vietnamese. The option “Use only characters with diacritics” causes the script to avoid letters without diacritical marks, which makes the words look even “more Vietnamese”.
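
The idea can be sketched in a few lines of Python. This is only a minimal illustration, not the actual script: the table `VARIANTS`, the function `vietnamize`, and the `only_diacritics` parameter are hypothetical names, and the weights are made-up placeholders rather than frequencies measured from real Vietnamese text.

```python
import random
import unicodedata

# Hypothetical table: base letter -> list of (variant, weight).
# The weights are illustrative placeholders, NOT real Vietnamese frequencies.
VARIANTS = {
    "a": [("a", 5), ("à", 3), ("á", 3), ("ả", 1), ("ã", 1), ("ạ", 2), ("ă", 1), ("â", 2)],
    "e": [("e", 4), ("ê", 3), ("é", 1), ("è", 1), ("ế", 2), ("ệ", 2)],
    "i": [("i", 5), ("í", 2), ("ì", 1), ("ị", 1)],
    "o": [("o", 4), ("ô", 3), ("ơ", 2), ("ó", 1), ("ờ", 1), ("ộ", 2)],
    "u": [("u", 4), ("ư", 3), ("ú", 1), ("ù", 1), ("ủ", 1), ("ụ", 1)],
    "y": [("y", 5), ("ý", 1), ("ỳ", 1)],
    "d": [("d", 3), ("đ", 2)],
}


def vietnamize(text, only_diacritics=False, rng=None):
    """Replace each letter found in VARIANTS with a weighted random variant."""
    rng = rng or random.Random()
    out = []
    for ch in text:
        options = VARIANTS.get(ch.lower())
        if options is None:
            # Letters without a variant list (and punctuation) stay unchanged.
            out.append(ch)
            continue
        if only_diacritics:
            # Drop the plain letter so a marked variant is always chosen.
            options = [(v, w) for v, w in options if v != ch.lower()]
        variants, weights = zip(*options)
        picked = rng.choices(variants, weights=weights, k=1)[0]
        # Preserve the capitalization of the original letter.
        out.append(picked.upper() if ch.isupper() else picked)
    # Normalize so that every marked letter is a single precomposed character.
    return unicodedata.normalize("NFC", "".join(out))


print(vietnamize("Just for fun, I decided to write a script."))
print(vietnamize("Just for fun, I decided to write a script.", only_diacritics=True))
```

With real frequencies in place of the placeholder weights, the output would follow the distribution of diacritics in written Vietnamese. Passing `only_diacritics=True` corresponds to the option described above.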

