所属分类:
多国语言处理
开发工具:Visual C++
文件大小:170KB
下载次数:107
上传日期:2009-07-22 10:34:41
说明: 基于双数组trie的分词程序,分词速度20MB/S,能够支持GBK、UTF8编码
(Double array trie-based sub-word procedure word speed 20MB/S, can support GBK, UTF8 encoding)
文件列表:
mmseg
.....\document
.....\........\mmseg.doc
.....\mmseg
.....\.....\mmseg
.....\.....\.....\dict.trie
.....\.....\.....\dict.txt
.....\.....\.....\main.cpp
.....\.....\.....\mmseg.cpp
.....\.....\.....\mmseg.h
.....\.....\.....\mmseg.vcproj
.....\.....\.....\mmseg.vcproj.DUANPC.jgduan.user
.....\.....\.....\mmseg.vcproj.JGDUAN.duanjianguo.user
.....\.....\.....\mmseg.vcproj.LENOVO-10A4FF50.Owner.user
.....\.....\mmseg.ncb
.....\.....\mmseg.sln
.....\mmseg_utf8
.....\..........\mmseg
.....\..........\.....\dict.txt
.....\..........\.....\main.c
.....\..........\.....\mmseg.vcproj
.....\..........\.....\mmseg.vcproj.DUANPC.jgduan.user
.....\..........\.....\mmseg.vcproj.LENOVO-10A4FF50.Owner.user
.....\..........\.....\mmseg_utf8.c
.....\..........\.....\mmseg_utf8.h
.....\..........\.....\test.txt
.....\..........\mmseg.ncb
.....\..........\mmseg.sln