Thoughts relating to language, words, proverbs, and the like.
A count of Unicode characters grouped by script
Tuesday, 14 October, 2008.We all know what Unicode is (if you don't, then read all about it and come back later). We all know that it's big. Hey, of course it's big: its aim is to allow for the representation of characters from every major language script in the world. That's gotta be a lot of characters, right? It's reasonably easy to find out how many unicode characters there are in total: e.g. the Wikipedia page (linked above) states that: "As of Unicode 5.1 there are 100,507 graphic [assigned] characters." I got a bit curious today, and — to my disappointment — after some searching, I was unable to find a nice summary of how many characters there are in each script that Unicode supports. And thus it is that I present to you my count of all assigned Unicode characters (as of v5.1), grouped by script and by category.
Legislation and programming: two peas in a pod
Wednesday, 27 August, 2008.The language of law and the language of computers hardly seem like the most obvious of best buddies. Legislation endeavours to be unambiguous, and yet it's infamous for being plagued with ambiguity problems, largely because it's ultimately interpreted by subjective and unpredictable humang beings. Computer code doesn't try to be unambiguous, it simply is unambiguous — by its very definition. A piece of code, when supplied with any given input, is quite literally incapable of returning inconsistent output. A few weeks ago, I finished an elective subject that I studied at university, called Legal Method and Research. The main topic of the subject was statutory interpretation: that is, the process of interpreting the meaning of a single unit of law, and applying a given set of facts to it. After having completed this subject, one lesson that I couldn't help but take away (being a geek 'n' all) was how strikingly similar the structure of legislation is to the structure of modern programming code. This is because at the end of the day, legislation — just like code — needs to be applied to a real case, and it needs to yield a Boolean outcome.
A novel style
Tuesday, 3 January, 2006.The novel is considered the most ubiquitous of all forms of literature. You can find novels by the truckload in any old bookstore. But what is the true 'novel style', and just how common are 'real novels'? Read on to find out why novels aren't quite so common as you might think.
Draft dodging: the first and final version
Saturday, 12 November, 2005.The concept of a 'draft version' has always seemed rather alien to me. Without ever even realising it, I always correct and analyse my writing as I go, making sure that my first version is as near to final as can be. It recently occurred to me that I am incapable of writing in 'rough form'. Is this a blessing or a curse?
What a mean word
Friday, 12 August, 2005.Some words are perfectly suited to their alternative definitions. The word 'mean' is one of these. 'Mean' refers to the average of a set of numbers. A mean is a cruel, unforgiving, and brutally honest number: in short, it really is a mean number. Read on for more about 'mean', and about other words that have multiple but related meanings (polysemes).
Room to swing a cat
Thursday, 28 July, 2005.There are plenty of weird proverbs in the English language, but this one would have to be among the weirdest. Seriously, who swings cats? How does someone know that there's no room to swing a cat? Have they tried? Join me as I seek the answer to these and other questions.



