|
实验室发表的部分论文
☆ 2008 年论文
Huizhen Wang, Jingbo Zhu, Keh-Yih Su. Divergence-based feature selection for naive bayes text classification. Proceeding of international conference on natural language processing and knowledge engineering (IEEE NLPKE 2008). pp209-215. 2008.
Jingbo Zhu, Huizhen Wang and Benjamin Tsou. Active Learning with Sampling by Uncertainty and Density for Word Sense Disambiguation and Text Classification. In Proc. of Coling08
Jingbo Zhu, Huizhen Wang and Eduard Hovy. Multi-Criteria-based Strategy to Stop Active Learning for Data Annotation. In Proc. of Coling08
Zhenxing Wang, Changning Huang, Jingbo Zhu. The Character-based CRF Segmenter of MSRA & NEU for the 4th Bakeoff. Sighan4 workshop of IJCNLP2008. India. 2008.1.
Zhenxing Wang, Changning Huang, Jingbo Zhu. Which Performs Better on In-Vocabulary Word Segmentation Based on Word or Character? Sighan4 workshop of IJCNLP2008. India . 2008.1.
Feiliang Ren, Li Zhang, Minghan Hu, Tianshun Yao. EBMT Based on Finite Automata State Transfer Generation..TMI-07.2008.1
Zhenxing Wang, Changning Huang, Jingbo Zhu. The Character-based CRF Segmenter of MSRA & NEU for the 4th Bakeoff. Journal of Chinese Language and Computing.
Zhenxing Wang, Changning Huang, Jingbo Zhu. Which Performs Better on In-Vocabulary Word Segmentation Based on Word or Character? International Journal of Computer Processing of Oriental Languages.
Jingbo Zhu, Huizhen Wang, Eduard Hovy. Learning a Stopping Criterion for Active Learning for Word Sense Disambiguation and Text Classification. The Third International Joint Conference on Natural Language Processing (IJCNLP-08). Hyderabad , India. 2008.
张祝玉,任飞亮,朱靖波. 基于条件随机场的中文命名实体识别特征比较研究. 第四届全国信息检索与内容安全学术会议NCIRCS2008. pp111-117. 2008-11
李天宁,肖桐,朱靖波. 科技论文的IPC自动标注. 第四届全国信息检索与内容安全学术会议. pp346-354.2008-11.
王克,张春良,高晓兴,朱靖波. 基于三类训练两类判别框架的主客观性句子识别. 第四届全国学生计算语言学研讨会. pp83-89. 2008.
郑妍,肖桐,朱靖波. 基于Bootstrapping的领域多词串自动获取. 第四届全国学生计算语言学研讨会. Pp166-172. 2008.
杨旭,肖桐,张俐. 面向新闻领域的主谓关系识别. 第四届全国学生计算语言学研讨会. pp131-137. 2008.
陈如山,肖桐,朱靖波. 利用1-m词对齐信息改善统计机器翻译性能. 第四届全国学生计算语言学研讨会. pp330-335. 2008.
王辰 宋国龙 吴宏林 张俐 刘绍明. 基于序列相交的短语译文获取. 第四届全国学生计算语言学研讨会. pp323-329. 2008.
胡海鹏,闫永明,吴宏林,张俐,刘绍明. 基于组合线索和核心扩展方阵匹配的中日句对齐. 第四届全国学生计算语言学研讨会. pp.317-322. 2008.
☆ 2007 年论文(rar)
Jingbo Zhu, Eduard Hovy, Active Learning for Word Sense Disambiguation with Methods for Addressing the Class Imbalance Problem, EMNLP-CoNLL. Pp783-790.2007.
Chen Wenliang, Zhang Yujie, Isahara Hitoshi, A Two-stage Parser for Multilingual Dependency Parsing, EMNLP-CoNLL2007 (shared task), pp. 1129-1133, Prague, June 28-30, 2007
Ma, Matthew Y., Zhu, Jingbo, Guo, Jinhong K. A Recommender Framework for Electronic Programming Guide on a Mobile Device. 2007 IEEE International Conference on Multimedia and Expo. Pp332-335. 2007.7
Jingbo Zhu , Matthew Y. Ma, Jinhong K. Guo, Zhenxing Wang. Content Classification and Recommendation Techniques for viewing Electrinic Programming Guide on Portable Device. International Journal of Pattern Recognition and Artificial Intelligence.Vol.21,No.2 pp.375-395. 2007.3.
Na Ye, Jingbo Zhu, Huizhen Wang, Matthew Y. Ma, Bin Zhang, An Improved Model of Dotplotting for Text Segmentation, Journal of Chinese Language and Computing. (to appear)
Feiliang, Ren,Li Zhang, Minghan Hu, Tiaoshun Yao. EBMT Based on Finite State Automata Transfer Generation. The Eleventh Conference on Theoretical and Methodological Issues in Machine Translation (TMI2007).
Zhenxing Wang, Jingbo Zhu. Improving K-NN Text Categorization by Bootstrap Technique. International Conference on Chinese Computing 2007 (ICCC2007). Oct.12-15, Wuhan , China p493-499.
Qing Chen, Mu Li,Improving, Query Spelling Correction Using Web Search Results, EMNLP-CoNLL.pp181-189.2007( Poster )
Zhenxing Wang, Changning Huang, Jingbo Zhu. The Character-based CRF Segmenter of MSRA & NEU for the 4th Bakeoff. Sighan4 workshop of IJCNLP2008. India. 2008.1. (Accepted)
Zhenxing Wang, Changning Huang, Jingbo Zhu. Which Performs Better on In-Vocabulary Word Segmentation Based on Word or Character? Sighan4 workshop of IJCNLP2008. India . 2008.1. (Accepted)
Feiliang Ren, Jingbo Zhu, An Effective Hybrid Machine Learning Approach for Coreference Resolution. Sighan4 workshop of IJCNLP2008. India. 2008.1. (Accepted)
朱靖波,叶娜,罗海涛. 基于多元判别分析的文本分割模型. 软件学报. Vol.18, No.3. pp. 85-94.2007.3
朱靖波 , 王会珍 , 张希娟. 面向文本分类的混淆类判别技术. 软件学报. Vol19, No.3. pp.630-639. 2008
吴宏林,刘绍明. 基于二部图最大匹配的汉日词对齐. 中文信息学报 Vol.21. No.5. pp. 101-107. 2007.8
张希娟,王会珍,朱靖波. 面向文本分类的基于最小冗余原则的特征选取. 中文信息学报 Vol.21. No.5. pp56-60.
张海雷,曹菲菲,陈文亮,任飞亮,王会珍,朱靖波. 基于多层次特征集成的中文实体指代识别. 中文信息学报 Vol.21. No.5. pp.126-130. 2007.8
叶娜,郑妍,朱靖波,张斌. 基于二维动态规划的文本分割模型. 第三届全国信息检索与内容安全学术会议 . p209-215. 2007.11
张希娟, 朱靖波. 主动学习中后验概率尖锐现象的平滑处理. 第三届全国信息检索与内容安全学术会议 . p821-827. 2007.11
吴宏林,刘绍明. 基于二部图最大匹配的汉日词对齐. 全国第九届计算语言学学术会议 .pp 368-373. 2007.8
张海雷,曹菲菲,陈文亮,任飞亮,王会珍,朱靖波. 基于多层次特征集成的中文实体指代识别. 全国第九届计算语言学学术会议 .pp 485-490.2007.8
张希娟,王会珍,朱靖波. 面向文本分类的基于最小冗余原则的特征选取. 全国第九届计算语言学学术会议 .pp.612-617. 2007.8
张海雷,王会珍,王安慧,朱靖波. 基于朴素贝叶斯模型的垃圾邮件过滤技术比较分析. 全国网络与信息安全技术研讨会. pp.551-557. 2007.7
☆ 2006 年论文(rar)
Jingbo Zhu, Huizhen Wang, Xijuan Zhang. Discrimination-based Feature Selection for Multinomial Naive Bayes Text Classification. ICCPOL2006. 2006.11(Poster)
Honglin Wu and Shaoming Liu Word Alignment Between Chinese and Japanese Using Maximum Weight Matching on Bipartite Graph. ICCPOL2006. 2006.11
Chen Wenliang, Zhang Yujie, Isahara Hitoshi. Chinese Chunking with Tri-training Learning. 21st International Conference on the Computer Processing of Oriental Languages (ICCPOL2006). pp.466-473. 2006.(Poster)
Chen Wenliang, Zhang Yujie, Isahara Hitoshi. Chinese Named Entity Recognition with Conditional Random Fields. SIGHAN 2006 Bakeoff. pp.118-121. 2006.
Chen Wenliang, Zhang Yujie, Isahara Hitoshi. An Empirical Study of Chinese Chunking. Coling-ACL2006 (Poster Session). pp.97-104. 2006.(Poster)
Chen Wenliang, Zhang Yujie, Isahara Hitoshi. Chinese Chunking based on Conditional Random Fields. NLP2006, Yokohama, Japan. pp.149-152. 2006.
Feiliang Ren, Shaoming Liu. Build Translation Memory System by N-Gram. The 20th Paclic Asia Conference on Language, Information and Computation. pp.452-458. 2006.
Feiliang Ren, Tianshun Yao. Remove Redundancy Samples for SVM in A Chinese Word Segmentation Task. Journal of Communication and Computer. Vol.3, No.5. pp.103-107. 2006.
Feiliang Ren,Tianshun Yao. Make Word Sense Disambiguation in EBMT Practical. The 20th Paclic Asia Conference on Language. Information and Computation. pp.414-417. 2006.
王会珍,朱靖波,季铎,叶娜,张斌. 基于反馈学习自适应的中文话题追踪. 中文信息学报. Vol.20, No.3. pp.92-98. 2006.
王会珍,张希娟,朱靖波,张斌. 基于主动学习的自适应话题追踪. 中国中文信息学会二十五周年会议. pp.373-382. 2006.
陈晴,姚天顺,张俐,姜涛,石磊,李彦丹,肖桐. 基于谓词驱动模板的汉日机器翻译方法. 中国中文信息学会二十五周年学术年会. pp.439-446. 2006.
叶娜,罗海涛,郑妍,朱靖波,张斌. 基于改进型 Dotplotting 的文本分割模型. 中国中文信息学会二十五周年学术年会. pp.352-360. 2006.
季铎,朱靖波. 基于词分布的初始点选取方法. 中国中文信息学会二十五周年学术年会. pp.315-321. 2006.
王安慧,陈文亮,朱靖波. 面向文本分类的文本特征学习. 小型微型计算机系统. Vol.27, pp.360-362. 2006.
张希娟,王会珍,朱靖波. 基于朴素贝叶斯的文本分类. 小型微型计算机系统. Vol.27, pp.369-370. 2006.
曹菲菲,朱慕华,朱靖波. 基于抽样的两阶段支持向量机训练算法. 第三届学生计算语言学研讨会. pp.177-180. 2006.
王屹林,朱慕华,朱靖波. 针对 SVM 中文分词特性的个性化后处理设计. 第三届学生计算语言学研讨会. pp.33-37. 2006
罗海涛 叶娜 朱靖波. Dotplotting文本分割技术的设计与改进. 第三届学生计算语言学研讨会. pp.187-191. 2006.
☆ 2005 年论文集(rar)
Zhu Jingbo, Ye Na, Chang Xinzhi, Chen Wenliang, Using Multiple Discriminant Analysis Approach for Linear Text Segmentation, 2nd International Joint Conference on Natural Language Processing, IJCNLP-05, Jeju Island, Korea, October 11-13, 2005.
Yongmei Tan, Tianshun Yao, Qing Chen and Jingbo Zhu. Applying Conditional Random Fields to Chinese Shallow Parsing. The Sixth International Conference on Intelligent Text Processing and Computational Linguistics (CICLing-2005) . LNCS, Vol.3406, Springer, pp.167-176, Mexico City , Mexico , Feb. 2005.
Zhu Jingbo, Chen Wenliang . Improving Text Categorization Using Domain Knowledge, A. Montoyo et al. (Eds.): NLDB 2005, LNCS 3513, Springer-Verlag, pp.103–113, 2005.
Ye Na, Zhu Jingbo, Luo Haitao, Wang Huizhen, Zhang Bin. Improvement of the dotplotting method for linear text segmentation. IEEE International Conference on Natural Language Processing and Knowledge Engineering. pp.636-641. 2005.
Zhu Muhua, Zhu Jingbo, Chen Wenliang. Effect analysis of dimension reduction on support vector machines. IEEE International Conference on Natural Language Processing and Knowledge Engineering. pp.592-596. 2005.
Wang Huizhen, Zhu Jingbo, Ji Duo, Ye Na, Zhang Bin. Time Adaptive Boosting Model for Topic Tracking. IEEE International Conference on Natural Language Processing and Knowledge Engineering. pp.488-492. 2005.
Ren Feiliang, Shi Lei, Yao Tianshun. A Dynamic Weighted Method With Support Vector Machines for Chinese Word Segmentation. IEEE International Conference on Natural Language Processing and Knowledge Engineering. pp.366-370. 2005.
Zhu Jingbo, Chen Wenliang. Some Studies on Chinese Domain Knowledge Dictionary and Its Application to Text Classification. SIGHAN4. pp.110-115. 2005.
Chen Wenliang, Zhu Jingbo, Zhu Muhua, Zhang Li, Yao Tianshun. Improving Domain Dictionary-based Text Categorization using Self-Partition Model. International Journal of Computer Processing of Oriental Languages (IJCPOL). Vol.18, No.3. pp.197-210. 2005.
Zhang Yuejie, Zhang Tao, Zhu Jingbo, Yao Tianshun. Research on Dop-based Chinese Parsing. International Conference on Machine Learning and Cybernetics. pp.3840-3845. 2005.
李珩,朱靖波,姚天顺. 基于stacking算法的组合分类器及其应用于中文组块分析. 计算机研究与发展. Vol.42, No.5. pp.844-848. 2005.
任飞亮,石磊,姚天顺. 应用支持向量机进行中文分词. 全国第八届计算语言学联合学术会议. pp.46-52. 2005.
朱慕华,朱靖波,陈文亮. 面向文本分类的多类别SVM组合方式的比较. 全国第八届计算语言学联合学术会议. pp.435-441. 2005.
叶娜,罗海涛,朱靖波,张斌. 基于归纳逻辑编程的多槽信息抽取规则自动学习方法. 全国第八届计算语言学联合学术会议. pp.461-466. 2005.
王会珍,朱靖波,季铎,张斌. 基于多向量模型的中文话题追踪. 全国第八届计算语言学联合学术会议. pp.669-671. 2005.
陈文亮,朱慕华,朱靖波,姚天顺. 基于Bootstrapping的文本分类模型. 中文信息学报. Vol.19, No.2. pp.86-92. 2005华,
姚天顺. 基于领域词典的文本特征表示. 计算机研究与发展. Vol.42, No.12. pp.2155-2160. 2005.
朱慕华,朱靖波,陈文亮. 面向支持向量机的降维方法比较分析. 第二届全国信息检索与内容安全学术会议. pp.221-227. 2005.
薛永刚,朱靖波,魏刚. 基于核主成分分析的文本分类. 第二届全国信息检索与内容安全学术会议. pp.180-188. 2005.
王会珍,朱靖波,季铎,叶娜,张斌. 基于反馈学习自适应的中文话题追踪. 第二届全国信息检索与内容安全学术会议. pp.244-253. 2005.
朱慕华,朱靖波,陈文亮. 支持向量机在文本分类中的应用. 小型微型计算机系统. Vol.26, pp.244-245. 2005.
薛永刚,朱靖波,季铎. 面向文本分类的降维技术的研究. 小型微型计算机系统. Vol.26, pp.241-243. 2005.
朱靖波,陈文亮. 基于领域知识的文本分类. 东北大学学报(自然科学版). Vol.26, No.8. pp.733-735. 2005.
☆ 2004 年论文集(rar)
Le Zhang , Jingbo Zhu and Tianshun Yao. An Evaluation of Statistical Spam Filtering Techinques. ACM Transactions on Asian Language Information Processing (TALIP) Vol.3, No.4, pages 243-269, December 2004.
Zhu Jingbo , Chen Wenliang , and Yao Tianshun. Using Seed Words to Learn to Categorize Chinese Text. Advances in Natural Language Processing: 4th International Conference (EsTAL 2004), LNCS , Vol. 3230, Springer-Verlag, pp.464-473, 2004.
Chen Wenliang , Chang Xingzhi, Wang Huizhen, Zhu Jingbo, and Yao Tianshun. Automatic Word Clustering for Text Categorization Using Global Information. First Asia Information Retrieval Symposium (AIRS 2004), Beijing, pp.1-6, 2004.10.
Chen Wenliang, Zhu Jingbo, Wu Honglin, Yao Tianshun. Automatic Learning Features Using Bootstrapping for Text Categorization. Fourth International Conference on Intelligent Text Processing and Computational Linguistics (CICLing 2004), Seoul, South Korea.
Yong-mei Tan, Tian-shun Yao, Qing Chen, Jing-bo Zhu. Chinese Chunk Identification Using SVMs plus Sigmoid. The First International Joint Conference on Natural Language Processing (IJCNLP-04), March 22-24, 2004 Sanya City, Hainan Island, China.
Na Ye, Xuejun Wu, Jingbo Zhu, Wenliang Chen, Tianshun Yao. Web Information Extraction Based on Similar Patterns. The Fifth International Conference on Web-Age Information Management (WAIM 2004), Dalian, China.
Zhu Jingbo, Benjamin K Tsou, Wu Xuejun, Yao Tianshun. Using Co-Training for Chinese Organization NE Identification. NER workshop of IJCNLP-04. . 2004.
Zhang Yuejie, Zhang Tao, Zhu Jingbo, Yao Tianshun. The Application of Data-oriented Parsing Technique in English-Chinese Machine Translation. International Conference on Machine Learning and Cybernetics. pp.2979-2984. 2004.
李珩,朱靖波,姚天顺. 基于SVM 的中文组块分析. 中文信息学报. Vol.18, No.2. pp.1-7. 2004
李珩,谭咏梅,朱靖波,姚天顺. 汉语组块识别. 东北大学学报(自然科学版). Vol.25, No.2. 2004.
孙连恒,杨莹,姚天顺. OpenE:一种基于n-gram 共现的自动机器翻译评测方法. 中文信息学报. Vol.18, No.2. pp.15-22. 2004.
叶娜,吴雪军,朱靖波,陈文亮. 基于相似计算的信息抽取模板自动获取方法. 第二届全国计算语言学学生会议. pp.434-439. 2004.
于楠,朱靖波,陈文亮. 领域知识库的构建机制. 第二届全国计算语言学学生会议. pp.215-220. 2004.
朱慕华,陈文亮,朱靖波. 词聚类在文本分类中的应用. 第二届全国计算语言学学生会议. pp.399-405. 2004.
王会珍,朱靖波,陈文亮 季铎 张斌. 基于一元语法模型的中文话题追踪. 第二届全国计算语言学学生会议. pp.422-427. 2004.
陈晴,姚天顺. 基于双语句对语料库的词对齐模型. 第二届全国计算语言学学生会议. pp.354-356. 2004.
刘世岳,李珩,张俐,姚天顺. Co-training 机器学习方法在中文组块识别中的应用. 第二届全国计算语言学学生会议. pp.190-196. 2004.
任登君,李珩,张俐,姚天顺. 基于词对齐的双语组块对齐. 第二届全国计算语言学学生会议. pp.326-331. 2004.
谭咏梅,姚天顺, 陈晴,朱靖波. 基于SVM+Sigmoid 的汉语组块识别. 计算机科学. No.8. pp.142-146. 2004.
吴宏林,吕学强,任飞亮,赵英科,姚天顺. 基于语料库的最小求交词对齐. 小型微型计算机系统. Vol.25, pp.103-104. 2004.
陈文亮,朱慕华,朱靖波,姚天顺. 基于Bootstrapping 的文本分类模型. 第一届信息检索和内容安全学术会议. pp.196-203. 2004.
全德,陈文亮,薛永刚,朱靖波.基于潜在语义索引的文本分类. 小型微型计算机系统. Vol.25, pp.182-183. 2004.
吴宏林,王会珍,朱靖波.话题检测与追踪. 小型微型计算机系统. Vol.25, pp.103-104. 2004.
李珩,杨峰,朱靖波. 基于增益的隐马尔科夫模型的文本组块分析. 计算机科学. Vol.31, No.2. pp.152-154.
☆ 2003年以前(部分)
|