svn commit: r196067 - user/edwin/locale/tools

From: Edwin Groothuis <edwin@FreeBSD.org>
Date: 5 Aug 2009 07:44:31
Subject: svn commit: r196067 - user/edwin/locale/tools
Message-ID: 200908050744.n757iVWv047701@svn.freebsd.org


Author: edwin
Date: Wed Aug 5 07:44:30 2009
New Revision: 196067
URL: http://svn.freebsd.org/changeset/base/196067

Log:
show what a certain unicode name can be

Added:
user/edwin/locale/tools/whatis.pl (contents, props changed)

Added: user/edwin/locale/tools/whatis.pl
==============================================================================
--- /dev/null 00:00:00 1970 (empty, because file is newly added)
+++ user/edwin/locale/tools/whatis.pl Wed Aug 5 07:44:30 2009 (r196067)
@@ -0,0 +1,29 @@
+#!/bin/sh
+
+UNIDATA=/home/edwin/unicode/UNIDATA/5.2.0/UnicodeData.txt
+CHARMAPS=/home/edwin/svn/edwin/locale/tools/charmaps
+UTF8=~/unicode/cldr/1.7.1/posix/UTF-8.cm
+
+if [ -z "$1" ]; then
+ echo "Usage: $0 <unicode string>"
+ exit
+fi
+
+UCS=$*
+UCS_=$(echo $* | sed -e 's/ /./g')
+echo UCS: ${UCS}
+
+echo UTF-8.cm:
+grep "${UCS_}" ${UTF8} | sed -e 's/  */ /g'
+
+echo UNIDATA:
+grep "${UCS_}" ${UNIDATA}
+UCC=$(grep "${UCS_}" ${UNIDATA} | awk -F\; '{ print $1 }')
+
+echo UCC:
+echo "${UCC}"
+
+
+echo CHARMAPS:
+grep "${UCS_}" ${CHARMAPS}/* | sed -e "s|${CHARMAPS}/||g"
+grep "${UCC}" ${CHARMAPS}/* | sed -e "s|${CHARMAPS}/||g"
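
For readers without the author's local data files, the name-to-code-point step of the script can be tried in isolation. The following is only a sketch and not part of the commit: the two sample records follow the semicolon-separated UnicodeData.txt layout (code point in field 1, character name in field 2), and the helper name lookup_ucc is hypothetical. It also swaps the script's regular-expression grep for an exact match on the name field, so a name that is a prefix of another name does not produce extra hits.

```sh
#!/bin/sh
# Two sample records in UnicodeData.txt format (assumption: stands in for
# the real file that the script reads from ${UNIDATA}).
SAMPLE='0041;LATIN CAPITAL LETTER A;Lu;0;L;;;;;N;;;;0061;
0061;LATIN SMALL LETTER A;Ll;0;L;;;;;N;;;0041;;0041'

# lookup_ucc NAME: read UnicodeData.txt records on stdin and print the
# code point (field 1) of every record whose name (field 2) equals NAME.
# Hypothetical helper, not present in whatis.pl.
lookup_ucc() {
    awk -F';' -v name="$1" '$2 == name { print $1 }'
}

UCC=$(printf '%s\n' "${SAMPLE}" | lookup_ucc "LATIN SMALL LETTER A")
echo "UCC: ${UCC}"    # prints "UCC: 0061"
```

Against a real copy of the data, the same helper could be fed with `lookup_ucc "LATIN SMALL LETTER A" < UnicodeData.txt`, assuming the file is in the current directory.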

