Category:
Multilingual processing
Development tool: Java
File size: 14KB
Downloads: 4
Upload date: 2014-04-15 22:34:46
Description: Tokenizes an English document, applies Porter stemming to each word, and outputs the number of times each stem occurs in the text.
File list:
程序
....\CalFreq.class,3772,2014-03-07
....\CalFreq.java,3451,2014-03-07
....\resultbig.txt,1674,2014-03-05
....\Stemmer.class,6314,2014-03-07
....\Stemmer.java,13587,2014-03-07
....\stopwords.txt,4399,2014-03-07
....\wordfreq.txt,964,2014-03-07
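
The pipeline named in the description (tokenize, stem, count) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the CalFreq.java shipped in the archive: the class name StemFreqSketch and the methods countStems and naiveStem are hypothetical, the stop-word filtering step is only inferred from the presence of stopwords.txt, and naiveStem is a deliberately trivial placeholder that is NOT the Porter algorithm. A real Porter stemmer (for example, one like the Stemmer.java listed above) would be passed in its place.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.function.UnaryOperator;

// Hypothetical sketch; all names here are assumptions, not the archive's code.
class StemFreqSketch {

    // Counts how often each stem occurs in `text`, skipping stop words.
    // The stemmer is a parameter so a real Porter stemmer can be swapped in.
    static Map<String, Integer> countStems(String text, Set<String> stopWords,
                                           UnaryOperator<String> stemmer) {
        Map<String, Integer> counts = new LinkedHashMap<>();
        // Lowercase, then split on runs of anything that is not a letter a-z.
        for (String token : text.toLowerCase(Locale.ROOT).split("[^a-z]+")) {
            if (token.isEmpty() || stopWords.contains(token)) {
                continue; // skip empty fragments and stop words
            }
            counts.merge(stemmer.apply(token), 1, Integer::sum);
        }
        return counts;
    }

    // Placeholder only, NOT Porter stemming: drops a single trailing "s"
    // from words longer than three letters.
    static String naiveStem(String word) {
        return (word.length() > 3 && word.endsWith("s"))
                ? word.substring(0, word.length() - 1)
                : word;
    }

    public static void main(String[] args) {
        Set<String> stopWords = new HashSet<>(Arrays.asList("the", "and", "a"));
        Map<String, Integer> counts = countStems(
                "The cats and the cat chase a dog. Dogs bark!",
                stopWords, StemFreqSketch::naiveStem);
        // Print one "stem<TAB>count" line per stem, in first-seen order.
        counts.forEach((stem, n) -> System.out.println(stem + "\t" + n));
    }
}
```

With the sample sentence in main, "cats" and "cat" fall under the same stem, as do "dog" and "Dogs", so each of those stems is counted twice. The placeholder will mis-stem many words (it turns "class" into "clas", for instance), which is why it should be replaced by a proper Porter implementation for real use.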