Monday, March 27, 2006

TextBreak has *moved* back to GNA.org

This morning I spent 2 hours trying to upload the TextBreak family to the web server by FTP, but it failed. I don't know exactly why. Was the network here being shaped? Was it the cheap web hosting? Anyway, I have moved the code back to Subversion at https://gna.org/projects/textbreak/ and put the release files there too. This release is not usable yet, but it might be useful for anyone who wants to see what it looks like.

Sunday, March 26, 2006

TextBreak development strategy


[Diagram: TextBreak development strategy. Originally uploaded by veetai.]
This diagram shows the development strategy of TextBreak. There are 3 sub-projects running simultaneously. Since implementing TextBreak in C is pretty difficult, a prototype in Python was built before the full implementation in C. However, some modules have already been written in C, for instance Dict, which is a dictionary in a trie structure. In order to integrate them, a Python binding is built. In the last phase, the prototype will be ported to C. :-)
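
For anyone wondering what a dictionary in a trie structure looks like, here is a minimal Python sketch. It is only an illustration and not the actual Dict module: the class name `TrieDict` and the methods `add` and `prefixes` are made up for this example, and the real C code surely stores its nodes differently.

```python
class TrieDict:
    """Hypothetical trie-based dictionary (not TextBreak's real Dict API)."""

    def __init__(self, words=()):
        self.root = {}
        for word in words:
            self.add(word)

    def add(self, word):
        # Walk down one level per character, creating nodes as needed.
        node = self.root
        for ch in word:
            node = node.setdefault(ch, {})
        node[None] = True  # the None key marks "a word ends here"

    def __contains__(self, word):
        node = self.root
        for ch in word:
            node = node.get(ch)
            if node is None:
                return False
        return None in node

    def prefixes(self, text, start=0):
        """Return every dictionary word that begins at text[start]."""
        found = []
        node = self.root
        for i in range(start, len(text)):
            node = node.get(text[i])
            if node is None:
                break  # no dictionary word continues with this character
            if None in node:
                found.append(text[start:i + 1])
        return found


d = TrieDict(["ตา", "ตาก", "กลม", "ลม"])
print(d.prefixes("ตากลม"))  # candidate words starting at position 0
```

The point of the trie for word breaking is that one walk over the text finds all candidate words starting at a given position, instead of looking up every possible substring separately.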

Friday, March 24, 2006

Qt 4 + Thai word breaking

OB proposes hacking Qt 4 in order to plug in a word-breaking module,
for instance LibThai. If anyone is doing that or is interested
in it, please reply to him.

http://linux.thai.net/phpbb2/viewtopic.php?t=28034

Machine translation: Bookmark

http://www.cs.cmu.edu/People/ref/mlim/chapter4.html