Brief Biography
I am a fourth-year Ph.D. student in Computer Science at Northeastern University (NEU). I am also a member of Natural Language Processing Lab at NEU, working under my supervisor Prof. Jingbo Zhu.
I graduated in March 2008 with my Master's Degree from Northeastern University, where I worked on Example-Based Machine Translation (EBMT) and Chinese chunking with my advisors Prof. Li Zhang and Prof. Tianshun Yao.
Northeastern University is located in Shenyang (ÉòÑô), China, where I grew up. Shenyang is really a wonderful place. I love it very much!
Interests
I am interested in NLP and machine learning, particularly in Statistical Machine Translation (SMT). My another specific research interest lies in patent processing. I developed a patent mining system which achieved the best performance in English patent mining task at NTCIR-7. Recently I work on syntax-based SMT, document-level translation and consensus-based ranking for patent mining.
Selected Publications (full list)
Document-level Consistency Verificaton in Machine Translation, Tong Xiao, Jingbo Zhu, Shujie Yao and Hao Zhang
In Proc. of MT summit XIII, 2011, Xiamen, China. [pdf]
Improving Decoding Generalization for Tree-to-String Translation, Jingbo Zhu and Tong Xiao
In Proc. of The 49th Annual Meeting of the Association for Computational Linguistics (ACL, short paper), 2011, Portland, USA. [pdf]
Automatic Treebank Conversion via Informed Decoding - A Case Study on Chinese Treebanks, Muhua Zhu, Jingbo Zhu and Tong Xiao
To appear in ACM Transactions on Asian Language Information Processing (TALIP), Speical Issue on Chinese Language Processing
Language Modeling for Syntax-based Machine Translation Using Tree Substitution Grammars: A Case Study on Chinese-English Translation, Tong Xiao, Jingbo Zhu and Muhua Zhu
To appear in ACM Transactions on Asian Language Information Processing (TALIP), Speical Issue on Chinese Language Processing
An Empirical Study of Translation Rule Extraction with Multiple Parsers, Tong Xiao, Jingbo Zhu, Hao Zhang and Muhua Zhu
In Proc. of The 23rd International Conference on Computational Linguistics (COLING, poster session), 2010, Beijing, China. [pdf]
Heterogeneous Parsing via Collaborative Decoding, Muhua Zhu, Jingbo Zhu and Tong Xiao
In Proc. of The 23rd International Conference on Computational Linguistics (COLING), 2010, Beijing, China. [pdf]
Boosting-based System Combination for Machine Tranlsation, Tong Xiao, Jingbo Zhu, Muhua Zhu and Huizhen Wang
In Proc. of The 48th Annual Meeting of the Association for Computational Linguistics (ACL), 2010, Uppsala, Sweden. [pdf, slides]
Better Synchronous Binarization for Machine Translation, Tong Xiao, Mu Li, Dongdong Zhang, Jingbo Zhu and Ming Zhou
In Proc. of Empirical Methods in Natural Language Processing (EMNLP), 2009, Singapore. [pdf]
The Feature Subspace Method for SMT System Combination, Nan Duan, Mu Li, Tong Xiao and Ming Zhou
In Proc. of Empirical Methods in Natural Language Processing (EMNLP), 2009, Singapore. [pdf]
Competitions
NTCIR-9 Chinese-English patent MT track - 2nd place (human evaluation)
The NiuTrans Machine Translation System for NTCIR-9, Tong Xiao, Qiang Li, Qi Lu, Hao Zhang, Haibo Ding, Shujie Yao, Xiaoming Xu, Xiaoxu Fei, Jingbo Zhu, Feiliang Ren and Huizhen Wang
To appear [pdf]
CWMT2011 English-Chinese and Chinese-English news tracks - 1st place and 4th place (BLEU)
The NiuTrans Machine Translation System for CWMT2011, Tong Xiao, Hao Zhang, Qiang Li, Qi Lu, Jingbo Zhu, Feiliang Ren and Huizhen Wang
In Proc. of The 6th China workshop on Machine Translation (CWMT), 2011, Xiamen, China. [pdf]
CWMT2009 Chinese-English Single System Track - 2nd place (BLEU)
NEUTrans: a Phrase-Based SMT System for CWMT2009, Tong Xiao, Rushan Chen, Tianning Li, Muhua Zhu, Jingbo Zhu, Huizhen Wang and Feiliang Ren
In Proc. of The 5th China workshop on Machine Translation (CWMT), 2009, Nanjing, China. [pdf]
NTCIR-7 English Patent Mining Track - 1st place (MAP)
KNN and Re-ranking Models for English Patent Mining at NTCIR-7, Tong Xiao, Feifei Cao, Tianning Li, Guolong Song, Ke Zhou, Jingbo Zhu and Huizhen Wang
In Proc. of NTCIR-7 Workshop Meeting, 2008, Tokyo, Japan. [pdf]
Software
NiuTrans: an open-source MT system. Many useful features can be found in NiuTrans. Check it out!
Professional Activities
Reviewer for IJCNLP2011, SWCL2010, SEWM2009, SWCL2008
Secondary reviewer for AAAI2011
Beyond Academics
I like playing basketball, hiking, climbing mountains.
Finding my sweetheart, Tongran Liu
, was the luckiest thing that ever happened to me.
~ Have fun!
Page last modified on Sept 29, 2011
